llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-11 13:37:07 +00:00

Author	SHA1	Message	Date
Matt Arsenault	c77e92f437	AMDGPU: Add intrinsics for sin/cos These provide direct access to the hardware instruction without the unit version required like llvm.sin/llvm.cos lowering requires. llvm-svn: 260782	2016-02-13 01:19:56 +00:00
Matt Arsenault	4ff4c396c1	AMDGPU: Rename intrinsic to better match instruction name Also fixes missing f32 test. llvm-svn: 260780	2016-02-13 01:03:00 +00:00
Tom Stellard	a308dba9ed	AMDGPU/SI: Add instruction defs for VOP1 DPP instructions Reviewers: nhaustov, cfang, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17159 llvm-svn: 260774	2016-02-13 00:51:31 +00:00
Matt Arsenault	4cdd9956f3	AMDGPU: Fix broken condition causing warning llvm-svn: 260773	2016-02-13 00:36:10 +00:00
Alexey Samsonov	cd76db6136	Fix Windows buildbot breakage. llvm-svn: 260766	2016-02-12 23:51:06 +00:00
Tom Stellard	9943755afb	AMDGPU/SI: Detect uniform branches and emit s_cbranch instructions Reviewers: arsenm Subscribers: mareko, MatzeB, qcolombet, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D16603 llvm-svn: 260765	2016-02-12 23:45:29 +00:00
Yunzhong Gao	9e56bc9706	Disable the vzeroupper insertion pass on PS4. Differential Revision: http://reviews.llvm.org/D16837 llvm-svn: 260764	2016-02-12 23:37:57 +00:00
Derek Schuff	eed189f4ef	[WebAssembly] Report more meaningful error messages for some unsupported ops. Computed gotos and RETURNADDR may never be supported; we can do FRAMEADDR in the future. llvm-svn: 260759	2016-02-12 22:56:03 +00:00
Krzysztof Parzyszek	702277f07f	[Hexagon] Optimize stack slot spills Replace spills to memory with spills to registers, if possible. This applies mostly to predicate registers (both scalar and vector), since they are very limited in number. A spill of a predicate register may happen even if there is a general-purpose register available. In cases like this the stack spill/reload may be eliminated completely. This optimization will consider all stack objects, regardless of where they came from and try to match the live range of the stack slot with a dead range of a register from an appropriate register class. llvm-svn: 260758	2016-02-12 22:53:35 +00:00
Krzysztof Parzyszek	882483351a	[Hexagon] Mark HVX registers as volatile llvm-svn: 260753	2016-02-12 22:26:44 +00:00
Derek Schuff	6f279569a2	[WebAssembly] Update test expectations after r260737 llvm-svn: 260750	2016-02-12 22:05:08 +00:00
Krzysztof Parzyszek	4004463702	[Hexagon] Recognize more cases in copyPhysReg and stack slot load/store llvm-svn: 260748	2016-02-12 21:56:41 +00:00
Dan Gohman	25721173a7	[WebAssembly] Fix byval for empty types. llvm-svn: 260740	2016-02-12 21:30:18 +00:00
Chad Rosier	9f8cc1098d	[AArch64] Enable post-RA MI scheduler for Kryo. This should have landed in r260686. llvm-svn: 260739	2016-02-12 21:27:33 +00:00
Dan Gohman	3aceffcf65	[WebAssembly] Fix insertion of a BLOCK in a loop header that also ends a BLOCK. llvm-svn: 260737	2016-02-12 21:19:25 +00:00
Krzysztof Parzyszek	31aecfe356	[Hexagon] Recognize more instructions in isLoadFromStackSlot/isStoreToStackSlot llvm-svn: 260725	2016-02-12 20:54:15 +00:00
Krzysztof Parzyszek	01c8f9e8e7	[Hexagon] Add utility functions to detect sign- and zero-extending loads llvm-svn: 260698	2016-02-12 18:37:23 +00:00
Krzysztof Parzyszek	8094113d3a	[Hexagon] Replace expansion of spill pseudo-instructions in frame lowering Rewrite the code to handle all pseudo-instructions in a single pass. This temporarily reverts spill slot optimization that used general- purpose registers to hold values of spilled predicate registers. llvm-svn: 260696	2016-02-12 18:19:53 +00:00
Tom Stellard	10d903c4f3	[AMDGPU] Assembler: Swap operands of flat_store instructions to match AMD assembler Historically, AMD internal sp3 assembler has flat_store* addr, data format. To match existing code and to enable reuse, change LLVM definitions to match. Also update MC and CodeGen tests. Differential Revision: http://reviews.llvm.org/D16927 Patch by: Nikolay Haustov llvm-svn: 260694	2016-02-12 17:57:54 +00:00
Changpeng Fang	7cf99f3396	AMDGPU/SI: Annotate Loops with Constant Condition in SIAnnotateControlFlow pass. Summary: It is possible that the loop condition can be a boolean constant (infinite loop, for example). So we should handle constant condition in annotating a loop. This patch adds this functionality to support annotating constant condition. Reviewers: tstellarAMD, arsenm Subscribers: llvm-commits, arsenm Differential Revision: http://reviews.llvm.org/D15093 llvm-svn: 260692	2016-02-12 17:11:04 +00:00
Krzysztof Parzyszek	c69fcdf24c	[Hexagon] Remove HexagonExpandPredSpillCode pass This code is dead. The expansion is now done in HexagonFrameLowering. llvm-svn: 260691	2016-02-12 17:09:58 +00:00
Krzysztof Parzyszek	782b19da54	[Hexagon] Eliminate pseudo instructions for circ/brev loads and stores We can generate the actual instructions from the intrinsics without the need for pseudo-instructions. Also, since the intrinsics have a side- effect in a form of a store, attempt to optimize away loads from the store location. llvm-svn: 260690	2016-02-12 17:01:51 +00:00
Geoff Berry	2d034feb0d	[AArch64] Reduce number of callee-save save/restores. Summary: Before this change, callee-save registers would be rounded up to even pairs of GPRs and FPRs. This change eliminates these extra padding load/stores, though it does keep the stack allocation the same size unless both the GPR and FPR sets have an odd size, in which case one full pair stack slot (16 bytes) is saved. This optimization cannot currently be done for MachO targets since they rely on a fast-path .debug_frame equivalent that can only encode callee-save registers as pairs. Reviewers: t.p.northover, rengolin, mcrosier, jmolloy Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D17000 llvm-svn: 260689	2016-02-12 16:31:41 +00:00
Krzysztof Parzyszek	e3cbbaf0f0	[Hexagon] Handle out-of-range offsets in eliminateFrameIndex Create a virtual register that will hold the actual address and use it with the offset of 0 in the place of the original FI. llvm-svn: 260688	2016-02-12 16:27:23 +00:00
Chad Rosier	81da1b9bcf	[AArch64] Add support for Qualcomm Kryo CPU. Machine model description by Dave Estes <cestes@codeaurora.org>. llvm-svn: 260686	2016-02-12 15:51:51 +00:00
Jun Bum Lim	2e0485683a	[AArch64] Merge two adjacent str WZR into str XZR Summary: This change merges adjacent 32 bit zero stores into a 64 bit zero store. e.g., str wzr, [x0] str wzr, [x0, #4] becomes str xzr, [x0] Therefore, four adjacent 32 bit zero stores will be a single stp. e.g., str wzr, [x0] str wzr, [x0, #4] str wzr, [x0, #8] str wzr, [x0, #12] becomes stp xzr, xzr, [x0] Reviewers: mcrosier, jmolloy, gberry, t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16933 llvm-svn: 260682	2016-02-12 15:25:39 +00:00
Krzysztof Parzyszek	642cb06cb0	[Hexagon] Specify vector alignment in DataLayout string The DataLayout can calculate alignment of vectors based on the alignment of the element type and the number of elements. In fact, it is the product of these two values. The problem is that for vectors of N x i1, this will return the alignment of N bytes, since the alignment of i1 is 8 bits. The vector types of vNi1 should be aligned to N bits instead. Provide explicit alignment for HVX vectors to avoid such complications. llvm-svn: 260678	2016-02-12 14:47:38 +00:00
Benjamin Kramer	0edb1aff00	Fix uninitialized memory read. Found by msan. llvm-svn: 260676	2016-02-12 12:37:21 +00:00
Matt Arsenault	37f2de7107	AMDGPU: Set flat_scratch from flat_scratch_init reg This was hardcoded to the static private size, but this would be missing the offset and additional size for someday when we have dynamic sizing. Also stops always initializing flat_scratch even when unused. In the future we should stop emitting this unless flat instructions are used to access private memory. For example this will initialize it almost always on VI because flat is used for global access. llvm-svn: 260658	2016-02-12 06:31:30 +00:00
Mehdi Amini	59261c912a	C API: Remove LLVMGetDataLayout that was deprecated in 3.7 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260657	2016-02-12 06:22:00 +00:00
Matt Arsenault	628b2818b6	AMDGPU: Set element_size in private resource descriptor Introduce a subtarget feature for this, and leave the default with the current behavior which assumes up to 16-byte loads/stores can be used. The field also seems to have the ability to be set to 2 bytes, but I'm not sure what that would be used for. llvm-svn: 260651	2016-02-12 02:40:47 +00:00
Matt Arsenault	3886ad30a4	AMDGPU: Fix mishandling alignment when scalarizing vector loads/stores I don't think this was causing any real problems, so I'm not sure how to test for this. llvm-svn: 260646	2016-02-12 02:22:21 +00:00
Matt Arsenault	a9b518a63b	AMDGPU: Initialize SILowerControlFlow llvm-svn: 260645	2016-02-12 02:16:10 +00:00
Matt Arsenault	2f802da399	AMDGPU: Remove trailing whitespace llvm-svn: 260644	2016-02-12 02:16:07 +00:00
Sanjay Patel	f95e368e4f	[x86] simplify getZeroVector() ; NFCI Let DAG.getConstant() handle the splatting; there's no need to repeat that logic here. See also: http://reviews.llvm.org/rL258833 http://reviews.llvm.org/rL260582 llvm-svn: 260609	2016-02-11 22:17:04 +00:00
Quentin Colombet	fbd668aea2	[AArch64] Implements the lowering of formal arguments for GlobalISel. This is just a trivial implementation: - Support only arguments passed in registers. - Support only "plain" arguments, i.e., no sext/zext attribute. At this point, it is possible to play with the IRTranslator on AArch64: llc -mtriple arm64-<vendor>-<os> -print-machineinstrs <input.ll> -o - -global-isel For now, we only support the translation of program with adds and returns. Follow-up patches are on their way to add a test case (the MIRParser is not ready as it is). llvm-svn: 260600	2016-02-11 21:45:08 +00:00
Tom Stellard	7b646abe2d	AMDGPU/SI: Make sure MIMG descriptors and samplers stay in SGPRs Summary: It's possible to have resource descriptors and samplers stored in VGPRs, either by a VMEM instruction or in the case of samplers, floating-point calculations. When this happens, we need to use v_readfirstlane to copy these values back to sgprs. Reviewers: mareko, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17102 llvm-svn: 260599	2016-02-11 21:45:07 +00:00
Tom Stellard	4c0cead548	AMDGPU/SI: When splitting SMRD instructions, add its users to VALU worklist Summary: When we split SMRD instructions into two MUBUFs we were adding the users of the newly created MUBUFs to the VALU worklist. However, the only users these instructions had was the REG_SEQUENCE that was inserted by splitSMRD when the original SMRD instruction was split. We need to make sure to add the users of the original SMRD to the VALU worklist before it is split. I have a test case, but it requires one other bug fix, so it will be added in a later commit. Reviewers: mareko, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17101 llvm-svn: 260588	2016-02-11 21:14:34 +00:00
Derek Schuff	c7a1a0c125	[WebAssembly] Reformat WebAssemblyFrameLowering and WebAssemblyISelLowering Reviewers: sunfish, jfb Subscribers: jfb, dschuff Differential Revision: http://reviews.llvm.org/D17156 llvm-svn: 260585	2016-02-11 20:57:09 +00:00
Quentin Colombet	2081fb8111	[AArch64] Trivial implementation of lower return for the IRTranslator. llvm-svn: 260574	2016-02-11 19:45:27 +00:00
Kevin B. Smith	14481efc9f	[X86] New pass to change byte and word instructions to zero-extending versions. Differential Revision: http://reviews.llvm.org/D17032 llvm-svn: 260572	2016-02-11 19:43:04 +00:00
Quentin Colombet	0ce1d525f1	[AArch64] Plug the beginning of the GlobalISel pipeline. llvm-svn: 260569	2016-02-11 19:35:06 +00:00
Tom Stellard	60df0370a2	[AMDGPU] Fix for "v_div_scale_f64 reg, vcc, ..." parsing Summary: Added support for "VOP3Only" attribute in VOP3bInst encoding. Set VOP3Only=1 for V_DIV_SCALE_F64/32 insns. Added support for multi-dest instructions in AMDGPUAs::cvt*(). Added lit test for "V_DIV_SCALE_F64\|F32 vreg,vcc\|sreg,vreg,vreg,vreg". Reviewers: tstellarAMD, arsenm Subscribers: arsenm, SamWot, nhaustov, vpykhtin Differential Revision: http://reviews.llvm.org/D16995 Patch By: Artem Tamazov llvm-svn: 260560	2016-02-11 18:25:26 +00:00
Artem Belevich	672363ee16	[NVPTX] emit .file directives for files referenced by subprograms. .. so .loc directives referring to those files work correctly. Differential Revision: http://reviews.llvm.org/D17086 llvm-svn: 260557	2016-02-11 18:21:47 +00:00
Hans Wennborg	3625a4f59b	Revert r260507: "[X86] Enable the LEA optimization pass by default." This caused PR26575. llvm-svn: 260538	2016-02-11 16:44:06 +00:00
Jun Bum Lim	a57736cc50	[AArch64] Refactoring findMatchingStore() in aarch64-ldst-opt; NFC Summary: This change makes findMatchingStore() follow the same coding style introduced in r260275. Reviewers: gberry, junbuml Subscribers: aemerson, rengolin, haicheng, bmakam, mssimpso Differential Revision: http://reviews.llvm.org/D17083 llvm-svn: 260534	2016-02-11 16:18:24 +00:00
Chad Rosier	0febca6df8	[AArch64] Improve load/store optimizer to handle LDUR + LDR. This patch allows the mixing of scaled and unscaled load/stores to form load/store pairs. This is a reapplication of r259812, which had an incorrect assert. The test_stur_str_no_assert() test is a reduced version of the issue hit in the AArch64 self-host. PR24465 llvm-svn: 260523	2016-02-11 14:25:08 +00:00
Andrey Turetskiy	50853c0d36	[X86] Enable the LEA optimization pass by default. Differential Revision: http://reviews.llvm.org/D16877 llvm-svn: 260507	2016-02-11 10:51:26 +00:00
Simon Atanasyan	b344cf2221	[MC][ELF] Handle MIPS specific .sdata and .sbss directives MIPS specific .sdata and .sbss directives create corresponding sections with proper initialized ELF flags including ELF::SHF_MIPS_GPREL. Differential Revision: http://reviews.llvm.org/D17001 llvm-svn: 260498	2016-02-11 06:45:54 +00:00
Matt Arsenault	36dc1c179e	AMDGPU: Fix constant bus use check with subregisters If the two operands to an instruction were both subregisters of the same super register, it would incorrectly think this counted as the same constant bus use. This fixes the verifier error in fmin_legacy.ll which was missing -verify-machineinstrs. llvm-svn: 260495	2016-02-11 06:15:39 +00:00
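
As a rough illustration of the vector-alignment arithmetic described in commit 642cb06cb0 ([Hexagon] Specify vector alignment in DataLayout string), the sketch below compares the product rule the message describes with the N-bit alignment it argues for. The function names are hypothetical and are not LLVM APIs; the 8-bit alignment of i1 is taken from the commit message, and N = 512 is an arbitrary value chosen for the example.

```python
# Hypothetical helpers illustrating the arithmetic in the commit message;
# these are NOT LLVM functions.

I1_ALIGN_BITS = 8  # alignment of i1, as stated in the commit message


def product_rule_align_bits(elem_align_bits: int, num_elems: int) -> int:
    """Alignment as the product of element alignment and element count."""
    return elem_align_bits * num_elems


def predicate_vector_align_bits(num_elems: int) -> int:
    """Alignment the message says an N x i1 vector should have: N bits."""
    return num_elems


n = 512  # assumed element count, for illustration only

# Product rule: 512 elements * 8 bits = 4096 bits, i.e. N bytes.
derived_bytes = product_rule_align_bits(I1_ALIGN_BITS, n) // 8

# Desired: N bits, i.e. N / 8 bytes.
desired_bytes = predicate_vector_align_bits(n) // 8

print(f"product rule: {derived_bytes} bytes, desired: {desired_bytes} bytes")
```

The eightfold gap between the two results is the mismatch the commit sidesteps by stating the HVX vector alignment explicitly in the DataLayout string.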


36139 Commits