llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-27 06:54:30 +00:00

Author	SHA1	Message	Date
Matt Arsenault	8fe94b1401	InstCombine: Preserve nuw when reassociating nuw ops [3/3] Alive says this is OK. llvm-svn: 364235	2019-06-24 21:37:03 +00:00
Matt Arsenault	e5079e363e	InstCombine: Preserve nuw when reassociating nuw ops [2/3] Alive says this is OK. llvm-svn: 364234	2019-06-24 21:37:02 +00:00
Matt Arsenault	e6a660f3ce	InstCombine: Preserve nuw when reassociating nuw ops [1/3] Alive says this is OK. llvm-svn: 364233	2019-06-24 21:36:59 +00:00
Cameron McInally	eb920dca48	[NFC][Reassociate] Add unary FNeg tests to fast-ReassociateVector.ll llvm-svn: 364232	2019-06-24 21:36:09 +00:00
David Blaikie	7d33255fe8	NFC: DataExtractor: use decodeULEB128 to implement getULEB128 llvm-svn: 364230	2019-06-24 20:43:36 +00:00
Nikita Popov	1e88e417e2	[CVP] Reenable nowrap flag inference Inference of nowrap flags in CVP has been disabled, because it triggered a bug in LFTR (https://bugs.llvm.org/show_bug.cgi?id=31181). This issue has been fixed in D60935, so we should be able to reenable nowrap flag inference now. Differential Revision: https://reviews.llvm.org/D62776 llvm-svn: 364228	2019-06-24 20:13:13 +00:00
Sanjay Patel	26444fab58	[InstCombine] add tests for more variants of isPowerOf2; NFC llvm-svn: 364227	2019-06-24 20:11:40 +00:00
Peter Collingbourne	b3fb585c0e	llvm-symbolizer: Add a FRAME command. This command prints a description of the referenced function's stack frame. For each formal parameter and local variable, the tool prints: - function name - variable name - file/line of declaration - FP-relative variable location (if available) - size in bytes - HWASAN tag offset This information will be used by the HWASAN runtime to identify local variables in UAR reports. Differential Revision: https://reviews.llvm.org/D63468 llvm-svn: 364225	2019-06-24 20:03:23 +00:00
Huihui Zhang	ca835cfb69	[InstCombine] Regenerate test pr17827. NFCI. Prep work for upcoming patch D63505. llvm-svn: 364224	2019-06-24 19:49:42 +00:00
Roland Froese	5ec080f8a9	[CodeGen] Add missing vector type legalization for ctlz_zero_undef Widen vector result type for ctlz_zero_undef and cttz_zero_undef the same as ctlz and cttz. Differential Revision: https://reviews.llvm.org/D63463 llvm-svn: 364221	2019-06-24 19:27:07 +00:00
Philip Reames	df1ddd5754	[Tests] Add cases where we're failing to discharge provably loop exits (tests for D63733) llvm-svn: 364220	2019-06-24 19:26:17 +00:00
Cameron McInally	1fe4ddd99c	[SLP] Support unary FNeg vectorization Differential Revision: https://reviews.llvm.org/D63609 llvm-svn: 364219	2019-06-24 19:24:23 +00:00
Nico Weber	81e340127a	Remove flag for no longer supported MSVC version llvm-svn: 364218	2019-06-24 18:47:53 +00:00
Matt Arsenault	255c995005	AMDGPU/GlobalISel: Select G_TRUNC llvm-svn: 364215	2019-06-24 18:02:18 +00:00
Matt Arsenault	87c7810f93	AMDGPU/GlobalISel: RegBankSelect for amdgcn.class llvm-svn: 364214	2019-06-24 18:00:47 +00:00
Jinsong Ji	17dfc77db2	[PowerPC][UpdateTestChecks] powerpc- triple support There are quite some old testcases with powerpc- triple, we should add this triple support so that we can update them with script. Differential Revision: https://reviews.llvm.org/D63723 llvm-svn: 364213	2019-06-24 18:00:34 +00:00
Matt Arsenault	3871a412cf	AMDGPU/GlobalISel: Split VALU s64 G_ZEXT/G_SEXT in RegBankSelect Scalar extends to s64 can use S_BFE_{I64\|U64}, but vector extends need to extend to the 32-bit half, and then to 64. I'm not sure what the line should be between what RegBankSelect handles, and what instruction select does, but for now I'm erring on the side of RegBankSelect for future post-RBS combines. llvm-svn: 364212	2019-06-24 17:54:12 +00:00
Yuanfang Chen	681dfd17a2	[llvm-objdump] Match GNU objdump on symbol types shown in disassembly output. STT_OBJECT and STT_COMMON are dumped as data, not disassembled. https://bugs.llvm.org/show_bug.cgi?id=41947 Differential Revision: https://reviews.llvm.org/D62964 llvm-svn: 364211	2019-06-24 17:47:56 +00:00
Tim Renouf	1fecf6fed0	[AMDGPU] Allow any value in unused src0 field in v_nop Summary: The LLVM disassembler assumes that the unused src0 operand of v_nop is zero. Other tools can put another value in that field, which is still valid. This commit fixes the LLVM disassembler to recognize such an encoding as v_nop, in the same way as we already do for s_getpc. Differential Revision: https://reviews.llvm.org/D63724 Change-Id: Iaf0363eae26ff92fc4ebc716216476adbff37a6f llvm-svn: 364208	2019-06-24 17:35:20 +00:00
Craig Topper	b7bb8e3759	[X86] Don't a vzext_movl in LowerBuildVectorv16i8/LowerBuildVectorv8i16 if there are no zeroes in the vector we're building. In LowerBuildVectorv16i8 we took care to use an any_extend if the first pair is in the lower 16-bits of the vector and no elements are 0. So bits [31:16] will be undefined. But we still emitted a vzext_movl to ensure that bits [127:32] are 0. If we don't need any zeroes we should be consistent and make all of 127:16 undefined. In LowerBuildVectorv8i16 we can just delete the vzext_movl code because we only use the scalar_to_vector when there are no zeroes. So the vzext_movl is always unnecessary. Found while investigating whether (vzext_movl (scalar_to_vector (loadi32)) patterns are necessary. At least one of the cases where they were necessary was where the loadi32 matched 32-bit aligned 16-bit extload. Seemed weird that we required vzext_movl for that case. Differential Revision: https://reviews.llvm.org/D63700 llvm-svn: 364207	2019-06-24 17:28:41 +00:00
Craig Topper	3821ac5bff	[X86] Cleanups and safety checks around the isFNEG This patch does a few things to start cleaning up the isFNEG function. -Remove the Op0/Op1 peekThroughBitcast calls that seem unnecessary. getTargetConstantBitsFromNode has its own peekThroughBitcast inside. And we have a separate peekThroughBitcast on the return value. -Add a check of the scalar size after the first peekThroughBitcast to ensure we haven't changed the element size and just did something like f32->i32 or f64->i64. -Remove an unnecessary check that Op1's type is floating point after the peekThroughBitcast. We're just going to look for a bit pattern from a constant. We don't care about its type. -Add VT checks on several places that consume the return value of isFNEG. Due to the peekThroughBitcasts inside, the type of the return value isn't guaranteed. So its not safe to use it to build other nodes without ensuring the type matches the type being used to build the node. We might be able to replace these checks with bitcasts instead, but I don't have a test case so a bail out check seemed better for now. Differential Revision: https://reviews.llvm.org/D63683 llvm-svn: 364206	2019-06-24 17:28:26 +00:00
Simon Pilgrim	fa740150ef	[AArch64] Regenerate vcvt tests. NFCI. Prep work for an upcoming patch llvm-svn: 364205	2019-06-24 17:18:20 +00:00
Simon Pilgrim	36c9aa37ee	[AArch64] Regenerate 2velem tests. NFCI. Prep work for an upcoming patch llvm-svn: 364204	2019-06-24 16:58:19 +00:00
Simon Pilgrim	10fdea101c	[AArch64] Regenerate merge-store tests. NFCI. Prep work for an upcoming patch llvm-svn: 364203	2019-06-24 16:57:12 +00:00
Simon Pilgrim	71e32f6180	[X86] Regenerate fast fadd reduction tests. NFCI Fix whitespace. llvm-svn: 364200	2019-06-24 16:25:30 +00:00
Matt Arsenault	fdfc544a87	AMDGPU/GlobalISel: Fix selecting G_IMPLICIT_DEF for s1 Try to fail for scc, since I don't think that should ever be produced. llvm-svn: 364199	2019-06-24 16:24:03 +00:00
Ayke van Laethem	4645f9cb65	[bindings/go] Add debug information accessors Add debug information accessors, as provided in the following patches: https://reviews.llvm.org/D46627 (DILocation) https://reviews.llvm.org/D52693 metadata kind https://reviews.llvm.org/D60481 get/set debug location on a Value https://reviews.llvm.org/D60489 (DIScope) The API as proposed in this patch is similar to the current Value API, with a single root type and methods that are only valid for certain subclasses. I have considered just implementing generic Line() calls (that are valid on all DINodes that have a line) but the implementation of that got a bit awkward without support from the C API. I've also considered creating generic getters like a Metadata.DebugLoc() that returns a DebugLoc, but there is a mismatch between the Go DI nodes in the LLVM API and the actual DINode class hierarchy, so that's also hard to get right (without being confusing or breaking the API). Differential Revision: https://reviews.llvm.org/D63056 llvm-svn: 364198	2019-06-24 16:23:17 +00:00
Matt Arsenault	9b09f68667	Hexagon: Rename another copy of Register class For some reason clang is happy with the conflict, but MSVC is not. llvm-svn: 364196	2019-06-24 16:16:19 +00:00
Matt Arsenault	e9d4d77d16	ARC: Fix -Wimplicit-fallthrough llvm-svn: 364195	2019-06-24 16:16:16 +00:00
Matt Arsenault	ae5999e106	GlobalISel: Remove unsigned variant of SrcOp Force using Register. One downside is the generated register enums require explicit conversion. llvm-svn: 364194	2019-06-24 16:16:12 +00:00
Matt Arsenault	0738f328d7	CodeGen: Introduce a class for registers Avoids using a plain unsigned for registers throughoug codegen. Doesn't attempt to change every register use, just something a little more than the set needed to build after changing the return type of MachineOperand::getReg(). llvm-svn: 364191	2019-06-24 15:50:29 +00:00
Bjorn Pettersson	c2783fba6f	[AMDGPU] Remove unused variable AllSGPRSpilledToVGPRs. NFC Summary: Removing the unused variable AllSGPRSpilledToVGPRs in SIFrameLowering::processFunctionBeforeFrameFinalized to avoid error: variable 'AllSGPRSpilledToVGPRs' set but not used [-Werror=unused-but-set-variable] Reviewers: arsenm, nhaehnle Reviewed By: nhaehnle Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63721 llvm-svn: 364190	2019-06-24 15:50:18 +00:00
Matt Arsenault	6c90661a7c	Hexagon: Rename Register class This avoids a naming conflict in a future patch. llvm-svn: 364188	2019-06-24 15:27:29 +00:00
Sanjay Patel	222c2734f3	[InstCombine] reduce funnel-shift i16 X, X, 8 to bswap X Prefer the more exact intrinsic to remove a use of the input value and possibly make further transforms easier (we will still need to match patterns with funnel-shift of wider types as pieces of bswap, especially if we want to canonicalize to funnel-shift with constant shift amount). Discussed in D46760. llvm-svn: 364187	2019-06-24 15:20:49 +00:00
Matt Arsenault	186dcc67f5	AMDGPU/GlobalISel: Fix RegBankSelect for s1 sext/zext/anyext This needs different handling if the source is known to be a valid condition or not. Handle turning it into shifts or a select during regbankselect. llvm-svn: 364186	2019-06-24 14:53:58 +00:00
Matt Arsenault	90cca807df	AMDGPU: Fold frame index into MUBUF This matters for byval uses outside of the entry block, which appear as copies. Previously, the only folding done was during selection, which could not see the underlying frame index. For any uses outside the entry block, the frame index was materialized in the entry block relative to the global scratch wave offset. This may produce worse code in cases where the offset ends up not fitting in the MUBUF offset field. A better heuristic would be helpfu for extreme frames. llvm-svn: 364185	2019-06-24 14:53:56 +00:00
Sanjay Patel	02be8b516f	[InstCombine] add tests for funnel-shift to bswap; NFC llvm-svn: 364184	2019-06-24 14:47:02 +00:00
Matt Arsenault	099a8c4919	AMDGPU: Cleanup checking when spills need emergency slots Address fixme, which should no longer be a problem since r363757. llvm-svn: 364182	2019-06-24 14:34:40 +00:00
Simon Pilgrim	8d6f027a2f	[InstCombine] SliceUpIllegalIntegerPHI - bail on out of range shifts trunc(lshr) handling - if the shift is out of range (undefined) then bail like we do for non-constant shifts. Fixes OSS Fuzz #15217 llvm-svn: 364181	2019-06-24 13:13:36 +00:00
Simon Pilgrim	fd320855ef	[DAGCombine] visitMUL - allow shift by zero in MulByConstant. This can occur under certain circumstances when undefs are created later on in the constant multipliers (e.g. in this case due to SimplifyDemandedVectorElts). Its better to let the shift by zero to occur and perform any cleanup afterward. Fixes OSS Fuzz #15429 llvm-svn: 364179	2019-06-24 12:47:17 +00:00
Bjorn Pettersson	fa8797df3d	[ConstantFolding] Use hasVectorInstrinsicScalarOpd. NFC Summary: Use the hasVectorInstrinsicScalarOpd helper function in ConstantFoldVectorCall. Reviewers: rengolin, RKSimon, dblaikie Reviewed By: rengolin, RKSimon Subscribers: tschuett, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63705 llvm-svn: 364178	2019-06-24 12:07:17 +00:00
Bjorn Pettersson	3ab529361d	[Scalarizer] Add scalarizer support for smul.fix.sat Summary: Handle smul.fix.sat in the scalarizer. This is done by adding smul.fix.sat to the set of "isTriviallyVectorizable" intrinsics. The addition of smul.fix.sat in isTriviallyVectorizable and hasVectorInstrinsicScalarOpd can also be seen as a preparation to be able to use hasVectorInstrinsicScalarOpd in ConstantFolding. Reviewers: rengolin, RKSimon, dblaikie Reviewed By: rengolin Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63704 llvm-svn: 364177	2019-06-24 12:07:11 +00:00
James Henderson	fc7e193281	[docs][llvm-nm] Add missing options to documentation There were several options missing from the documentation. This patch adds them as well as improving some wording and separating the Mach-O only options into a separate section. Fixes https://bugs.llvm.org/show_bug.cgi?id=42234. Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D63655 llvm-svn: 364176	2019-06-24 10:50:49 +00:00
Fangrui Song	4c33942c6e	[sancov] Avoid unnecessary unique_ptr llvm-svn: 364175	2019-06-24 10:23:47 +00:00
Simon Tatham	e7b30c99aa	[ARM] Add MVE interleaving load/store family. This adds the family of loads and stores with names like VLD20.8 and VST42.32, which load and store parts of multiple q-registers in such a way that executing both VLD20 and VLD21, or all four of VLD40..VLD43, will distribute 2 or 4 vectors' worth of memory data across the lanes of the same number of registers but in a transposed order. In addition to the Tablegen descriptions of the instructions themselves, this patch also adds encode and decode support for the QQPR and QQQQPR register classes (representing the range of loaded or stored vector registers), and tweaks to the parsing system for lists of vector registers to make it return the right format in this case (since, unlike NEON, MVE regards q-registers as primitive, and not just an alias for two d-registers). llvm-svn: 364172	2019-06-24 10:00:39 +00:00
James Henderson	3a45c77365	[docs][llvm-nm] Improve symbol code documentation The existing symbol code documentation was very incomplete. This patch adds the missing codes, and defines them based on the current code behaviour. Fixes https://bugs.llvm.org/show_bug.cgi?id=42231. Reviewed by: rupprecht, mtrent, MaskRay Differential Revision: https://reviews.llvm.org/D63327 llvm-svn: 364171	2019-06-24 09:53:02 +00:00
Pavel Labath	cc91deeda7	[Support] Fix error handling in DataExtractor::get[US]LEB128 Summary: These functions are documented as not modifying the offset argument if the extraction fails (just like other DataExtractor functions). However, while reviewing D63591 we discovered that this is not the case -- if the function reaches the end of the data buffer, it will just return the value parsed until that point and set offset to point to the end of the buffer. This fixes the functions to act as advertised, and adds a regression test. Reviewers: dblaikie, probinson, bkramer Subscribers: kristina, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63645 llvm-svn: 364169	2019-06-24 09:11:24 +00:00
Sjoerd Meijer	59638a429d	Follow up of rL363913. NFC. Minor reshuffle in AArch64 targetparser unittest, solving a potential problem with querying iterators too early. llvm-svn: 364168	2019-06-24 08:44:29 +00:00
George Rimar	1224f15847	[llvm-readobj/llvm-readelf] - Eliminate the elf-groups.x86_64 precompiled binary from the inputs. We do not need the elf-groups.x86_64. In one of the tests, it was used for no solid reason, and for the second test case we can use YAML input with SHT_GROUP sections. The patch performs a cleanup of one of the test cases, removes another one completely (since during the review was found out it actually duplicates one of the existent tests) and removes the precompiled binary. Differential revision: https://reviews.llvm.org/D63647 llvm-svn: 364167	2019-06-24 08:29:54 +00:00
Craig Topper	3433ead1f5	[X86] Turn v16i16->v16i8 truncate+store into a any_extend+truncstore if we avx512f, but not avx512bw. Ideally we'd be able to represent this truncate as a any_extend to v16i32 and a truncate, but SelectionDAG doens't know how to not fold those together. We have isel patterns to use a vpmovzxwd+vpdmovdb for the truncate, but we aren't able to simultaneously fold the load and the store from the isel pattern. By pulling the truncate into the store we can successfully hide it from the DAG combiner. Then we can isel pattern match the truncstore and load+any_extend separately. llvm-svn: 364163	2019-06-23 23:51:21 +00:00

1 2 3 4 5 ...

180852 Commits