llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-07 03:26:53 +00:00

Author	SHA1	Message	Date
Nikita Popov	fad5d2671a	[AArch64] Expand v8i8 cttz (PR39729) Fix for https://bugs.llvm.org/show_bug.cgi?id=39729. Rather than adding just a case for v8i8 I'm setting cttz to expand for all vector types. Differential Revision: https://reviews.llvm.org/D58008 llvm-svn: 353872	2019-02-12 18:55:53 +00:00
Jessica Paquette	bc25529f39	[GlobalISel][AArch64] Select llvm.bswap* for non-vector types This teaches the IRTranslator to emit G_BSWAP when it runs into Intrinsic::bswap. This allows us to select G_BSWAP for non-vector types in AArch64. Add a select-bswap.mir test, and add global isel checks to a couple existing tests in test/CodeGen/AArch64. This doesn't handle every bswap case, since some of these rely on known bits stuff. This just lets us handle the naive case. Differential Revision: https://reviews.llvm.org/D58081 llvm-svn: 353861	2019-02-12 17:28:17 +00:00
Daniel Sanders	f1063b8ec8	[globalisel] Correct string emitted by GISelChangeObserver::erasingInstr() The API indicates that the MI is about to be erased rather than it has been erased. llvm-svn: 353746	2019-02-11 20:45:19 +00:00
Jessica Paquette	c562e11af8	[AArch64][GlobalISel] Add isel support for a couple vector exts/truncs Add support for - v4s16 <-> v4s32 - v2s64 <-> v2s32 And update tests that use them to show that we generate the correct instructions. Differential Revision: https://reviews.llvm.org/D57832 llvm-svn: 353732	2019-02-11 18:56:39 +00:00
Jessica Paquette	93c1e7153b	[GlobalISel][AArch64] NFC: Remove unnecessary IR from select-fp-casts.mir The IR section in this test doesn't do anything, so there's no point in it being there. Since it's redundant, just remove it. llvm-svn: 353731	2019-02-11 18:41:22 +00:00
Jessica Paquette	5a40815c0d	[GlobalISel][AArch64] Select G_FFLOOR This teaches the legalizer about G_FFLOOR, and lets us select G_FFLOOR in AArch64. It updates the existing floating point tests, and adds a select-floor.mir test. Differential Revision: https://reviews.llvm.org/D57486 llvm-svn: 353722	2019-02-11 17:22:58 +00:00
Jessica Paquette	6d08ea2404	Recommit "[GlobalISel] Add IRTranslator support for G_FFLOOR" After the changes introduced in r353586, this instruction doesn't cause any issues for any backend. Original review: https://reviews.llvm.org/D57485 llvm-svn: 353720	2019-02-11 17:16:32 +00:00
Matt Arsenault	84e44687ee	GlobalISel: Add G_FCANONICALIZE instruction llvm-svn: 353719	2019-02-11 17:05:20 +00:00
Mandeep Singh Grang	e64659fe82	[GlobalISel] Regex the opcodes in unit test to fix non-deterministic ordering Differential Revision: https://reviews.llvm.org/D57988 llvm-svn: 353652	2019-02-10 19:53:43 +00:00
Simon Pilgrim	31e25b2295	[AArch64] Regenerate bswap tests llvm-svn: 353648	2019-02-10 18:27:37 +00:00
Jessica Paquette	cb5da3f61a	Recommit "[GlobalISel] Introduce a generic floating point floor opcode, G_FFLOOR" After r353586, we won't fail on the AMDGPU floor pattern that was killing the importer before. llvm-svn: 353589	2019-02-09 00:37:31 +00:00
Francis Visoiu Mistrih	aa89ecf9c8	Re-apply r353553 "[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder" With a fix after r353563 that adds some more opcodes. llvm-svn: 353579	2019-02-08 23:34:11 +00:00
Francis Visoiu Mistrih	4b7c8f212e	Revert r353553 "[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder" This reverts commit r353553. This breaks CodeGen/AArch64/GlobalISel/legalize-ext-csedebug-output.mir: http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/57963/console llvm-svn: 353575	2019-02-08 22:49:43 +00:00
Nemanja Ivanovic	fd45134124	[DAGCombine] Optimize pow(X, 0.75) to sqrt(X) * sqrt(sqrt(X)) The sqrt case is faster and we already do this for the case where the exponent is 0.25. This adds the 0.75 case which is also not sensitive to signed zeros. Patch by Whitney Tsang (Whitney) Differential revision: https://reviews.llvm.org/D57434 llvm-svn: 353557	2019-02-08 19:50:58 +00:00
Aditya Nandakumar	1cbeacb1ea	[GISel][NFC]: Add missing call to record CSE hits in the CSEMIRBuilder https://reviews.llvm.org/D57932 Add some logging + tests to make sure CSEInfo prints debug output. reviewed by: arsenm llvm-svn: 353553	2019-02-08 19:41:13 +00:00
Eli Friedman	b87d675297	[AArch64] Fix condition for "high-vector" DUP optimizations. AArch64 NEON has a bunch of instructions with a "2" suffix that extract the top half of the source vectors, instead of the bottom half. We have some DAGCombines to try to take advantage of that. However, they assumed that any EXTRACT_VECTOR was extracting the high half of the vector in question. This issue has apparently existed since the AArch64 backend was merged. Fixes https://bugs.llvm.org/show_bug.cgi?id=40632 . Differential Revision: https://reviews.llvm.org/D57862 llvm-svn: 353486	2019-02-08 00:23:35 +00:00
Nikita Popov	439d6e3c64	[CodeGen] Handle vector UADDO, SADDO, USUBO, SSUBO This is part of https://bugs.llvm.org/show_bug.cgi?id=40442. Vector legalization is implemented for the add/sub overflow opcodes. UMULO/SMULO are also handled as far as legalization is concerned, but they don't support vector expansion yet (so no tests for them). The vector result widening implementation is suboptimal, because it could result in a legalization loop. Differential Revision: https://reviews.llvm.org/D57639 llvm-svn: 353464	2019-02-07 21:02:22 +00:00
Matt Arsenault	e739762e43	GlobalISel: Implement narrowScalar for shift main type This is pretty much directly ported from SelectionDAG. Doesn't include the shift by non-constant but known bits version, since there isn't a globalisel version of computeKnownBits yet. This shows a disadvantage of targets not specifying which type should be used for the shift amount. If type 0 is legalized before type 1, the operations on the shift amount type use the wider type (which is also less likely to legalize). This can be avoided by targets specifying legalization actions on type 1 earlier than for type 0. llvm-svn: 353455	2019-02-07 19:37:44 +00:00
Tim Northover	77f59ae9f5	AArch64: implement copy for paired GPR registers. When doing 128-bit atomics using CASP we might need to copy a GPRPair to a different register, but that was unimplemented up to now. llvm-svn: 353383	2019-02-07 10:35:34 +00:00
Tim Northover	009fdea45c	AArch64: enforce even/odd register pairs for CASP instructions. ARMv8.1a CASP instructions need the first of the pair to be an even register (otherwise the encoding is unallocated). We enforced this during assembly, but not CodeGen before. llvm-svn: 353308	2019-02-06 15:26:35 +00:00
Oliver Stannard	a390364af8	[AArch64][Outliner] Don't outline BTI instructions We can't outline BTI instructions, because they need to be the very first instruction executed after an indirect call or branch. If we outline them, then an indirect call might go to the branch to the outlined function, which will fault. Differential revision: https://reviews.llvm.org/D57753 llvm-svn: 353190	2019-02-05 17:21:57 +00:00
Sanjay Patel	d8f3430e74	[AArch64][x86] add tests for unsigned subtract with overflow; NFC llvm-svn: 353178	2019-02-05 15:26:42 +00:00
Matt Arsenault	f2a56c8653	GlobalISel: Consolidate load/store legalization The fewerElementsVectors implementation for load/stores handles the scalar reduction case just as well, so drop the redundant code in narrowScalar. This also introduces support for narrowing irregular size breakdowns for scalars. llvm-svn: 353125	2019-02-05 00:26:12 +00:00
Matt Arsenault	8dd78349f8	GlobalISel: Combine g_extract with g_merge_values Try to use the underlying source registers. This enables legalization in more cases where some irregular operations are widened and others narrowed. This seems to make the test_combines_2 AArch64 test worse, since the MERGE_VALUES has multiple uses. Since this should be required for legalization, a hasOneUse check is probably inappropriate (or maybe should only be used if the merge is legal?). llvm-svn: 353121	2019-02-04 23:41:59 +00:00
Matt Arsenault	0514622b4b	GlobalISel: Enforce operand types for constants A number of tests were using imm operands, not cimm. Since CSE relies on the exact ConstantInt* pointer used, and implicit conversions are generally evil, also enforce the bitsize of the types. llvm-svn: 353113	2019-02-04 23:29:31 +00:00
Matt Arsenault	0714e5a58a	GlobalISel: Verify g_select Factor the common vector element consistency check many instructions need out, although this makes the error messages worse. llvm-svn: 353112	2019-02-04 23:29:16 +00:00
Jessica Paquette	b78928bcbd	Revert "[GlobalISel] Introduce a generic floating point floor opcode, G_FFLOOR" This reverts commit b05ecba6d687fcb3078509220c67458bf1d77a2e. Apparently adding floor breaks AMDGPU somehow, so I have to back this out while I look into it. llvm-svn: 353065	2019-02-04 17:32:47 +00:00
Jessica Paquette	e04ef6ecc0	Revert "[GlobalISel] Add IRTranslator support for G_FFLOOR" This reverts commit 8bbd570fd5205a04d88d2e5513a6e4adbd028039. Apparently adding ffloor breaks AMDGPU somehow, so I need to back this out while I look into it. llvm-svn: 353064	2019-02-04 17:32:43 +00:00
Jessica Paquette	cd735a0458	[GlobalISel] Add IRTranslator support for G_FFLOOR Follow-up to https://reviews.llvm.org/D57484 Adds G_FFLOOR to translateKnownIntrinsic and update arm64-irtranslator.ll. Differential Revision: https://reviews.llvm.org/D57485 llvm-svn: 353058	2019-02-04 17:15:34 +00:00
Jessica Paquette	b34b9ca867	[GlobalISel] Introduce a generic floating point floor opcode, G_FFLOOR This introduces a generic opcode for floating point floor, working towards selecting @llvm.floor. Differential Revision: https://reviews.llvm.org/D57484 llvm-svn: 353057	2019-02-04 17:10:55 +00:00
Simon Pilgrim	0a8e3e49df	[NFC] Make vector types legal in UREM test As discussed in D50222, this changes the vector types in tests required for that revision to ones legal for X86. Patch by @hermord (Dmytro Shynkevych) Differential Revision: https://reviews.llvm.org/D56372 llvm-svn: 353004	2019-02-03 19:38:15 +00:00
Matt Arsenault	eb50f27657	GlobalISel: Implement widenScalar for G_UNMERGE_VALUES For the scalar case only. Also move the similar G_MERGE_VALUES handling to a separate function and cleanup to make them look more similar. llvm-svn: 352979	2019-02-03 00:07:33 +00:00
Mandeep Singh Grang	1bfeff1fc4	[AutoUpgrade] Fix AutoUpgrade for x86.seh.recoverfp Summary: This fixes the bug in https://reviews.llvm.org/D56747#inline-502711. Reviewers: efriedma Reviewed By: efriedma Subscribers: javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57614 llvm-svn: 352945	2019-02-02 01:32:48 +00:00
Mandeep Singh Grang	9cf3ca95c2	[COFF, ARM64] Fix localaddress to handle stack realignment and variable size objects Summary: This fixes using the correct stack registers for SEH when stack realignment is needed or when variable size objects are present. Reviewers: rnk, efriedma, ssijaric, TomTan Reviewed By: rnk, efriedma Subscribers: javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D57183 llvm-svn: 352923	2019-02-01 21:41:33 +00:00
Adhemerval Zanella	36b7b3c0fa	[AArch64] Optimize floating point materialization This patch changes isFPImmLegal to also return true if the value can be encoded as the immediate operand of a logical instruction, in addition to checking the immediate field for fmov. This optimizes some floating point materialization, including values used in isinf lowering. Reviewed By: rengolin, efriedma, evandro Differential Revision: https://reviews.llvm.org/D57044 llvm-svn: 352866	2019-02-01 12:26:06 +00:00
Matt Arsenault	f42efedb28	GlobalISel: Fix MMO creation with non-power-of-2 mem size It should probably just be mandatory for getTgtMemIntrinsic to return the alignment. llvm-svn: 352817	2019-01-31 23:41:23 +00:00
Sjoerd Meijer	068d715728	[SelectionDAG] Codesize: don't expand SHIFT to SHIFT_PARTS And instead just generate a libcall. My motivating example on ARM was a simple: shl i64 %A, %B for which the code bloat is quite significant. For other targets that also accept __int128/i128 such as AArch64 and X86, it is also beneficial for these cases to generate a libcall when optimising for minsize. On these 64-bit targets, the 64-bit shifts are of course unaffected because the SHIFT/SHIFT_PARTS lowering operation action is not set to custom/expand. Differential Revision: https://reviews.llvm.org/D57386 llvm-svn: 352736	2019-01-31 08:07:30 +00:00
Matt Arsenault	51c3d3146e	GlobalISel: Allow bitcount ops to have different result type For AMDGPU the result is always 32-bit for 64-bit inputs. llvm-svn: 352717	2019-01-31 02:09:57 +00:00
Matt Arsenault	1067bf29d3	GlobalISel: Fix creating MMOs with align 0 llvm-svn: 352712	2019-01-31 01:38:47 +00:00
Jessica Paquette	156cec86e5	[GlobalISel][AArch64] Select G_FEXP This teaches the legalizer to handle G_FEXP in AArch64. As a result, it also allows us to select G_FEXP. It... - Updates the legalizer-info tests - Adds a test for legalizing exp - Updates the existing fp tests to show that we can now select G_FEXP https://reviews.llvm.org/D57483 llvm-svn: 352692	2019-01-30 23:46:15 +00:00
Jessica Paquette	d46e179ece	[GlobalISel][AArch64] Select G_FABS This adds instruction selection support for G_FABS in AArch64. It also updates the existing basic FP tests, adds a selection test for G_FABS. https://reviews.llvm.org/D57418 llvm-svn: 352684	2019-01-30 22:54:21 +00:00
Sanjay Patel	31c3bd6557	[DAGCombiner] sub X, 0/1 --> add X, 0/-1 This extends the existing transform for: add X, 0/1 --> sub X, 0/-1 ...to allow the sibling subtraction fold. This pattern could regress with the proposed change in D57401. llvm-svn: 352680	2019-01-30 22:41:35 +00:00
Sanjay Patel	3276e2f2d5	[AArch64][x86] add tests for add/sub signbits fold; NFC As discussed/shown in D57401, we are missing a fold for subtract of 0/1 --> add 0/-1. llvm-svn: 352678	2019-01-30 21:58:20 +00:00
Jessica Paquette	d5349f419b	[GlobalISel][AArch64] Add instruction selection support for @llvm.log2 This teaches GlobalISel to emit a RTLib call for @llvm.log2 when it encounters it. It updates the existing floating point tests to show that we don't fall back on the intrinsic, and select the correct instructions. It also adds a legalizer test for G_FLOG2. https://reviews.llvm.org/D57357 llvm-svn: 352673	2019-01-30 21:16:04 +00:00
Jessica Paquette	d03d1c2ace	[GlobalISel][AArch64] Add instruction selection support for @llvm.sqrt This teaches the legalizer about G_FSQRT in AArch64. Also adds a legalizer test for G_FSQRT, a selection test for it, and updates existing floating point tests. https://reviews.llvm.org/D57361 llvm-svn: 352671	2019-01-30 21:03:52 +00:00
Jessica Paquette	5e71e4e829	[GlobalISel] Add IRTranslator support for @llvm.sqrt -> G_FSQRT Follow-up commit to https://reviews.llvm.org/D57359. (r352668) This adds IRTranslator support for recognising a @llvm.sqrt intrinsic and translating it into a G_FSQRT. https://reviews.llvm.org/D57360 llvm-svn: 352670	2019-01-30 20:58:14 +00:00
Jessica Paquette	2b66bd8551	[GlobalISel] Introduce a G_FSQRT generic instruction This introduces a generic instruction for computing the floating point square root of a value. Right now, we can't select @llvm.sqrt, so this is working towards fixing that. llvm-svn: 352668	2019-01-30 20:49:50 +00:00
Matt Arsenault	491b2f2654	GlobalISel: Verify pointer casts Not sure if the old AArch64 tests should be just deleted or not. llvm-svn: 352562	2019-01-29 23:29:00 +00:00
Matt Arsenault	ab0a209220	GlobalISel: Partially implement widenScalar for MERGE_VALUES llvm-svn: 352560	2019-01-29 23:17:35 +00:00
Amara Emerson	478ae74dcd	[AArch64][GlobalISel] Unmerge into scalars from a vector should use FPR bank. This currently shows up as a selection fallback since the dest regs were given GPR banks but the source was a vector FPR reg. Differential Revision: https://reviews.llvm.org/D57408 llvm-svn: 352545	2019-01-29 21:19:33 +00:00


2502 Commits