llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-10 13:51:37 +00:00

Author	SHA1	Message	Date
Alex Bradbury	f2d0693efb	[X86] Fix test/MC/AsmParser/exprs-invalid.s after rL336104 This was my mistake for only running test/MC/X86 and test/CodeGen/X86. Arguably .word should be removed from this test, as it is not supported universally. llvm-svn: 336107	2018-07-02 14:13:27 +00:00
John Brawn	42883315e9	[llvm-exegesis] Change how the native architecture is determined Currently the llvm-exegesis native architecture is determined by comparing the llvm native architecture with X86, so to add a new target would mean adding a new check. Change this to building up a list of the targets llvm-exegesis supports then using that, as this means that when adding a new target you just add the target to the list of supported targets. Differential Revision: https://reviews.llvm.org/D48778 llvm-svn: 336105	2018-07-02 13:53:46 +00:00
Alex Bradbury	694c4ed3d7	[X86] Use addAliasForDirective to support the .word directive (reland) The X86 asm parser currently has custom parsing logic for .word. Rather than use this custom logic, we can just use addAliasForDirective to enable the reuse of AsmParser::parseDirectiveValue. See also similar changes to Sparc (rL333078), AArch64 (rL333077), and Hexagon (rL332607) backends. Differential Revision: https://reviews.llvm.org/D47004 This is a fixed reland of rL336100. This should have been caught in pre-commit testing so apologies for the noise. llvm-svn: 336104	2018-07-02 13:49:52 +00:00
Alex Bradbury	c4121502cd	Revert r336100 This was a bad change. .word == 2byte on x86. llvm-svn: 336103	2018-07-02 13:43:45 +00:00
Simon Pilgrim	bd73b34751	[SLPVectorizer] Remove nullptr early-outs from Instruction::ShuffleVector getEntryCost This code is only used by alternate opcodes so the InstructionsState has already confirmed that every Value is an Instruction, plus we use cast<Instruction> which will assert on failure. llvm-svn: 336102	2018-07-02 13:41:29 +00:00
Sanjay Patel	c7319c8a0f	[InstCombine] adjust shuffle tests with IR flags; NFC Due to current limitations in constant analysis, we need flags on add or mul to show propagation for the potential transform suggested in these tests (no other binops currently report identity constants). llvm-svn: 336101	2018-07-02 13:40:54 +00:00
Alex Bradbury	19171674be	[X86] Use addAliasForDirective to support the .word directive The X86 asm parser currently has custom parsing logic for .word. Rather than use this custom logic, we can just use addAliasForDirective to enable the reuse of AsmParser::parseDirectiveValue. See also similar changes to Sparc (rL333078), AArch64 (rL333077), and Hexagon (rL332607) backends. Differential Revision: https://reviews.llvm.org/D47004 llvm-svn: 336100	2018-07-02 13:37:15 +00:00
John Brawn	86bf3b3bb5	[llvm-exegesis] Delegate the decision of cycle counter name to the target Currently the cycle counter is taken from the subtarget schedule model, which isn't any use if the subtarget doesn't have one. Delegate the decision to the target benchmark runner, as it may know better what to do in that case, with the default being the current behaviour. Differential Revision: https://reviews.llvm.org/D48779 llvm-svn: 336099	2018-07-02 13:14:49 +00:00
Florian Hahn	0c6c82408b	Recommit r328307: [IPSCCP] Use constant range information for comparisons of parameters. This version contains a fix to add values for which the state in ParamState change to the worklist if the state in ValueState did not change. To avoid adding the same value multiple times, mergeInValue returns true, if it added the value to the worklist. The value is added to the worklist depending on its state in ValueState. Original message: For comparisons with parameters, we can use the ParamState lattice elements which also provide constant range information. This improves the code for PR33253 further and gets us closer to use ValueLatticeElement for all values. Also, as we are using the range information in the solver directly, we do not need tryToReplaceWithConstantRange afterwards anymore. Reviewers: dberlin, mssimpso, davide, efriedma Reviewed By: mssimpso Differential Revision: https://reviews.llvm.org/D43762 llvm-svn: 336098	2018-07-02 12:44:04 +00:00
Sanjay Patel	8af577661c	[InstCombine] add tests for shuffle-binop; NFC This is another pattern mentioned in PR37806. llvm-svn: 336096	2018-07-02 12:30:46 +00:00
Simon Pilgrim	ffa8d2ee7c	[SLPVectorizer] Fix alternate opcode + shuffle cost function to correct handle SK_Select patterns. We were always using the opcodes of the first 2 scalars for the costs of the alternate opcode + shuffle. This made sense when we used SK_Alternate and opcodes were guaranteed to be alternating, but this fails for the more general SK_Select case. This fix exposes an issue demonstrated by the fmul_fdiv_v4f32_const test - the SLM model has v4f32 fdiv costs which are more than twice those of the f32 scalar cost, meaning that the cost model determines that the vectorization is not performant. Unfortunately it completely ignores the fact that the fdiv by a constant will be changed into a fmul by InstCombine for a much lower cost vectorization. But at least we're seeing this now... llvm-svn: 336095	2018-07-02 11:28:01 +00:00
Simon Pilgrim	a0b4decfe6	[SLPVectorizer] Only Alternate opcodes use ShuffleVector cases for getEntryCost/vectorizeTree. NFCI. Add assertions - we're already assuming this in how we use the AltOpcode and treat everything as BinaryOperators. llvm-svn: 336092	2018-07-02 10:54:19 +00:00
Sander de Smalen	7edb4ddf5e	[AArch64][SVE] Asm: Support for (SQ)INCP/DECP (scalar, vector) Increments/decrements the result with the number of active bits from the predicate. The inc/dec variants added are: - incp x0, p0.h (scalar) - incp z0.h, p0 (vector) The unsigned saturating inc/dec variants added are: - uqincp x0, p0.h (scalar) - uqincp w0, p0.h (scalar, 32bit) - uqincp z0.h, p0 (vector) The signed saturating inc/dec variants added are: - sqincp x0, p0.h (scalar) - sqincp x0, p0.h, w0 (scalar, 32bit) - sqincp z0.h, p0 (vector) llvm-svn: 336091	2018-07-02 10:08:36 +00:00
Sander de Smalen	997a0cf82e	[AArch64][SVE] Asm: Support for (saturating) vector INC/DEC instructions. Increment/decrement vector by multiple of predicate constraint element count. The variants added by this patch are: - INCH, INCW, INC and (saturating): - SQINCH, SQINCW, SQINCD - UQINCH, UQINCW, UQINCW - SQDECH, SQINCW, SQINCD - UQDECH, UQINCW, UQINCW For example: incw z0.s, all, mul #4 llvm-svn: 336090	2018-07-02 09:31:11 +00:00
Simon Pilgrim	9c8fac17d3	[X86][BtVer2] Added Jaguar FPU Pipe0/1 uop counters to permit basic llvm-exegesis uop testing We don't have PMCs to cover many of the Jaguar resources but we can at least monitor the FPU issue pipes which give an indication of the fpu uop count, just not the execution resources. llvm-svn: 336089	2018-07-02 09:15:01 +00:00
Petar Jovanovic	9e8e4b9c31	[Mips][FastISel] Do not duplicate condition while lowering branches This change fixes the issue that arises when we duplicate condition from the predecessor block. If the condition's arguments are not considered alive across the blocks, fast regalloc gets confused and starts generating reloads from the slots that have never been spilled to. This change also leads to smaller code given that, unlike on architectures with condition codes, on Mips we can branch directly on register value, thus we gain nothing by duplication. Patch by Dragan Mladjenovic. Differential Revision: https://reviews.llvm.org/D48642 llvm-svn: 336084	2018-07-02 08:56:57 +00:00
Sander de Smalen	7d47585b61	[AArch64][SVE] Asm: Support for vector element compares (immediate). Compare vector elements with a signed/unsigned immediate, e.g. cmpgt p0.s, p0/z, z0.s, #-16 cmphi p0.s, p0/z, z0.s, #127 llvm-svn: 336081	2018-07-02 08:20:59 +00:00
Sander de Smalen	703f486b92	Reapply r334980 and r334983. These patches were previously reverted as they led to buildbot time-outs caused by large switch statement in printAliasInstr when using UBSan and O3. The issue has been addressed with a workaround (r335525). llvm-svn: 336079	2018-07-02 07:34:52 +00:00
Max Kazantsev	b1f4aecf8e	[NFC] Test that shows unprofitability of instcombine with bit ranges llvm-svn: 336078	2018-07-02 06:55:00 +00:00
Craig Topper	0bd5027433	[X86] Put some cases in switch statements back on one line to be more compact and make it easier to see the similarities. NFC It looks like someone ran clang-format over this entire file which reformatted these switches into a multiline form. But I think the single line form is more useful here. llvm-svn: 336077	2018-07-02 06:42:42 +00:00
Clement Courbet	c7da651e62	[llvm-exegesis][NFC] Cleanup useless braces. llvm-svn: 336076	2018-07-02 06:39:55 +00:00
Craig Topper	8838161306	[X86] Remove FMA3Info DenseMap. Break into sorted tables that we can binary search. I separated out the rounding and broadcast groups into their own tables because it made the ordering in the main table easier. Further splitting of the tables might make it possible to directly index using bits from the TSFlags, but its probably not worth it right now. llvm-svn: 336075	2018-07-02 06:23:39 +00:00
QingShan Zhang	032393eea9	[PowerPC] Don't make it as pre-inc candidate if displacement isn't 4's multiple for i64 pre-inc load/store For the below case, pre-inc prep think it's a good candidate to use pre-inc for the bucket, but 64bit integer load/store update (pre-inc) instruction on Power requires the displacement field should be DS-form (4's multiple). Since it can't satisfy the constraint, we have to do some fix ups later. As below, the original load/stores could be well-form, it makes things worse. unsigned long long result = 0; unsigned long long foo(char p, unsigned long long n) { for (unsigned long long i = 0; i < n; i++) { unsigned long long x1 = (unsigned long long )(p - 50000 + i); unsigned long long x2 = (unsigned long long )(p - 61024 + i); unsigned long long x3 = (unsigned long long )(p - 62048 + i); unsigned long long x4 = (unsigned long long )(p - 64096 + i); result = x1 * x2 * x3 * x4; } return result; } Patch by jedilyn(Kewen Lin). Differential Revision: https://reviews.llvm.org/D48813 --This line, and those below, will be ignored-- M lib/Target/PowerPC/PPCLoopPreIncPrep.cpp A test/CodeGen/PowerPC/preincprep-i64-check.ll llvm-svn: 336074	2018-07-02 05:46:09 +00:00
Piotr Padlewski	f85a84745e	Implement strip.invariant.group Summary: This patch introduce new intrinsic - strip.invariant.group that was described in the RFC: Devirtualization v2 Reviewers: rsmith, hfinkel, nlopes, sanjoy, amharc, kuhar Subscribers: arsenm, nhaehnle, JDevlieghere, hiraditya, xbolva00, llvm-commits Differential Revision: https://reviews.llvm.org/D47103 Co-authored-by: Krzysztof Pszeniczny <krzysztof.pszeniczny@gmail.com> llvm-svn: 336073	2018-07-02 04:49:30 +00:00
Eric Christopher	592986ec19	Add an entry for rodata constant merge sections to the default section flags in the ELF assembler. This matches the defaults given in the rest of MC. Fixes PR37997 where we couldn't assemble our own assembly output without warnings. llvm-svn: 336072	2018-07-02 00:16:39 +00:00
Craig Topper	448ddaf03d	[X86] Fix a few test names in avx512-intrinsics-fast-isel.ll to match their clang intrinsic names. I thought I fixed these yesterday, but I guess I missed a few. llvm-svn: 336071	2018-07-01 23:49:06 +00:00
Craig Topper	7e84d8b836	[X86] Remove the places that return nullptr from X86InstrInfo::commuteInstructionImpl. findCommutedOpIndices does the pre-checking for whether commuting is possible. There should be no reason left to fail in commuteInstructionImpl. There was a missing pre-check that I've added there and changed the check to an assert in commuteInstructionImpl. llvm-svn: 336070	2018-07-01 23:27:41 +00:00
Simon Pilgrim	978f4e7f3d	[SLPVectorizer] Call InstructionsState.isOpcodeOrAlt with Instruction instead of an opcode. NFCI. llvm-svn: 336069	2018-07-01 20:22:46 +00:00
Simon Pilgrim	4016f678d7	[SLPVectorizer] Replace sameOpcodeOrAlt with InstructionsState.isOpcodeOrAlt helper. NFCI. This is a basic step towards matching more general instructions types than just opcodes. llvm-svn: 336068	2018-07-01 20:07:30 +00:00
Craig Topper	c1e3200b4c	[X86][Disassembler] Remove TYPE_BNDR from translateImmediate. I've check the disassembler tables and this shouldn't be reachable. Which is good since if it was reachable there should have been a 'return' after the addOperand line. llvm-svn: 336066	2018-07-01 17:50:29 +00:00
Sanjay Patel	832f2c00d2	[InstCombine] add abs tests with undef elts; NFC llvm-svn: 336065	2018-07-01 17:14:37 +00:00
Sanjay Patel	d0b447b76b	[PatternMatch] allow undef elements in vectors with m_Neg This is similar to the m_Not change from D44076. llvm-svn: 336064	2018-07-01 13:42:57 +00:00
Simon Pilgrim	1b899b8559	[SLPVectorizer] Use InstructionsState Op/Alt opcodes directly. NFCI. llvm-svn: 336063	2018-07-01 13:41:58 +00:00
David Green	3248675f42	[UnrollAndJam] New Unroll and Jam pass This is a simple implementation of the unroll-and-jam classical loop optimisation. The basic idea is that we take an outer loop of the form: for i.. ForeBlocks(i) for j.. SubLoopBlocks(i, j) AftBlocks(i) Instead of doing normal inner or outer unrolling, we unroll as follows: for i... i+=2 ForeBlocks(i) ForeBlocks(i+1) for j.. SubLoopBlocks(i, j) SubLoopBlocks(i+1, j) AftBlocks(i) AftBlocks(i+1) Remainder Loop So we have unrolled the outer loop, then jammed the two inner loops into one. This can lead to a simpler inner loop if memory accesses can be shared between the now jammed loops. To do this we have to prove that this is all safe, both for the memory accesses (using dependence analysis) and that ForeBlocks(i+1) can move before AftBlocks(i) and SubLoopBlocks(i, j). Differential Revision: https://reviews.llvm.org/D41953 llvm-svn: 336062	2018-07-01 12:47:30 +00:00
Paul Semel	6837964f75	Revert "[llvm-readobj] Fix printing format" There is a problem with the formatting on windows build. I need to investigate on this. llvm-svn: 336061	2018-07-01 11:54:09 +00:00
Simon Pilgrim	efda73a303	[SLPVectorizer][X86] Add some alternate tests for cast operators Alternate opcode handling only supports binary operators, these tests demonstrate missed opportunities to vectorize some sitofp/uitofp and fptosi/fptoui style casts as well as some (successful) float bits manipulations llvm-svn: 336060	2018-07-01 11:29:46 +00:00
Eugene Leviant	0528ad6e6b	[Evaluator] Improve evaluation of call instruction Recommit of r335324 after buildbot failure fix llvm-svn: 336059	2018-07-01 11:02:07 +00:00
Paul Semel	406504e4b7	[llvm-readobj] Fix printing format We were printing every character, even those that weren't printable. It doesn't really make sense for this option. The string content was sticked to its address, added two spaces in between. Differential Revision: https://reviews.llvm.org/D48271 llvm-svn: 336058	2018-07-01 09:51:59 +00:00
Craig Topper	50dc237abc	[X86] Remove unnecessary include. NFC Leftover from when the pass contained a DenseMap before it switched to binary search. llvm-svn: 336057	2018-07-01 05:54:22 +00:00
Craig Topper	a21f95eb29	[X86] Move the memory unfolding table creation into its own class and make it a ManagedStatic. Also move the static folding tables, their search functions and the new class into new cpp/h files. The unfolding table is effectively static data. It's just a different ordering and a subset of the static folding tables. By putting it in a separate ManagedStatic we ensure we only have one copy instead of one per X86InstrInfo object. This way also makes it only get initialized when really needed. llvm-svn: 336056	2018-07-01 05:47:49 +00:00
Craig Topper	b3927f4cb4	[X86] Move the X86InstrFMA3Info class into the cpp file. Expose only a getFMA3Group free function. NFCI The class only exists to hold a DenseMap and is only created as a ManagedStatic. It used to expose a single static method that outside code was expected to use. This patch moves that static function out of the class and moves it implementation into the cpp file. It can now access the ManagedStatic directly by name without the need for the other static method that accessed the ManagedStatic. llvm-svn: 336055	2018-06-30 22:38:42 +00:00
Craig Topper	1f4ba95145	[X86] Remove the AsmName from the HAX,HDX,HCX,HBX,HSI,HDI,HBP,HSP,HIP artificial registers so they can't be parsed by the assembly parser. There are no instructions that use them so they weren't causing any bad matches. But they weren't being diagnosed as "invalid register name" if they were used and would instead trigger some form of invalid operand. llvm-svn: 336054	2018-06-30 22:38:41 +00:00
Craig Topper	8cc063c7f8	[X86] Use MVT::i8 for scalar shift amounts since that is what they ultimately need to legalize to. I believe all of these are constants so legalizing them should be pretty trivial, but this saves a step. In one case it looks like we may have been creating a shift amount larger than the shift input itself. llvm-svn: 336052	2018-06-30 18:30:31 +00:00
Craig Topper	3a323d3964	[X86] When combining load to BZHI, make sure we create the shift instruction with an i8 type. This combine runs pretty late and causes us to introduce a shift after the op legalization phase has run. We need to be sure we create the shift with the proper type for the shift amount. If we don't do this, we will still re-legalize the operation properly, but we won't get a chance to fully optimize the truncate that gets inserted. So this patch adds the necessary truncate when the shift is created. I've also narrowed the subtract that gets created to always be an i32 type. The truncate would have trigered SimplifyDemandedBits to optimize it anyway. But using a more appropriate VT here is free and saves an optimization step. llvm-svn: 336051	2018-06-30 17:49:42 +00:00
Sanjay Patel	2211408668	[InstCombine] add tests for negate vector with undef elts; NFC llvm-svn: 336050	2018-06-30 14:11:46 +00:00
Simon Pilgrim	59d88c83aa	Fix Wdocumentation compiler warning. NFCI. llvm-svn: 336049	2018-06-30 12:24:23 +00:00
Simon Pilgrim	ceb9146ef7	[DAGCombiner] Handle correctly non-splat power of 2 -1 divisor (PR37119) The combine added in commit 329525 overlooked the case where one, but not all, of the divisor elements is -1, -1 is the only power of two value for which the sdiv expansion recipe breaks. Thanks to @zvi for the original patch. Differential Revision: https://reviews.llvm.org/D45806 llvm-svn: 336048	2018-06-30 12:22:55 +00:00
Craig Topper	ad5cc3a074	[X86] Update some avx512 fast-isel tests to match their real clang IRgen. Especially of note was the test_mm_mask_set1_epi64 and other set1 tests that were truncating the element to be broadcasted to i8 and broadcasting that instead of a whole 64 bit value. Some of the others were just correcting mask sizes on parameters due to bugs in the clang test case they were generated from that have now been fixed. Some were converting i8 to <4 x i1>/<2 x i1> by truncating to i4/i2 and then bitcasting. But the clang codegen is bitcast to <8 x i1>, then extract to <4 x i1>/<2 x i1>. This is likely to incur less trouble from the integer type legalizer in the backend. llvm-svn: 336045	2018-06-30 07:25:29 +00:00
Craig Topper	026e66fc30	[X86] Change some chec-prefixes from X32 to X86 to match the FileCheck command line. I think this test changed and these test cases were created around the same time and missed the change. llvm-svn: 336044	2018-06-30 06:45:10 +00:00
Craig Topper	6ad8764b8f	[X86] Remove test cases from avx512vl-intrinsics-fast-isel.ll for intrinsics that don't really exist in clang. llvm-svn: 336043	2018-06-30 06:45:09 +00:00

1 2 3 4 5 ...

166082 Commits