llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-10 22:43:53 +00:00

Author	SHA1	Message	Date
Chris Matthews	c01e6f9ecb	Requirements can have & in them! Lets escape those so the XML is valid! llvm-svn: 332161	2018-05-12 00:13:54 +00:00
Chris Matthews	5e42577a9e	Add the message attribute to skipped JUnit xml allows for a message attribute to be displayed on skips. Lets populate that with an analysis of why we skipped the test. llvm-svn: 332156	2018-05-11 23:15:11 +00:00
Sanjay Patel	7858faaa20	[DAG] add convenience function to propagate FMF; NFC There's only one use of this currently, but that could change with D46563. Either way, we shouldn't have to update code outside of the flags struct when those flag definitions change. llvm-svn: 332155	2018-05-11 23:13:36 +00:00
Tom Stellard	7545a2a711	AMDGPU/GlobalISel: Implement select() for >32-bit G_STORE Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D46153 llvm-svn: 332154	2018-05-11 23:12:49 +00:00
Alina Sbirlea	7c5733f40d	[MemorySSA] getIncomingValueForBlock should return a MemoryAccess. Summary: getIncomingValueForBlock is just a wrapper API that should return a MemoryAccess, instead of a generic Value. Reviewers: george.burgess.iv Subscribers: sanjoy, jlebar, Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D46779 llvm-svn: 332153	2018-05-11 22:59:37 +00:00
Sergey Dmitriev	eea7082b7d	[CodeExtractor] Allow extracting blocks with exception handling This is a CodeExtractor improvement which adds support for extracting blocks which have exception handling constructs if that is legal to do. CodeExtractor performs validation checks to ensure that extraction is legal when it finds invoke instructions or EH pads (landingpad, catchswitch, or cleanuppad) in blocks to be extracted. I have also added an option to allow extraction of blocks with alloca instructions, but no validation is done for allocas. CodeExtractor caller has to validate it himself before allowing alloca instructions to be extracted. By default allocas are still not allowed in extraction blocks. Differential Revision: https://reviews.llvm.org/D45904 llvm-svn: 332151	2018-05-11 22:49:49 +00:00
Sanjay Patel	0f1681ef4d	[DAG] clean up flag propagation for binops; NFCI llvm-svn: 332150	2018-05-11 22:45:22 +00:00
Chris Matthews	267c8ad156	Overhaul unicode handling in xunit output I have seen a lot of errors where the xunit does not encode unicode test output correctly. Handle that explicitly now. llvm-svn: 332148	2018-05-11 22:18:22 +00:00
Changpeng Fang	bedba6b647	AMDGPU/SI: Don't promote alloca to vector for AddrSpaceCast instruction. Summary: We have no logic to promote alloca to vector for an AddrSpaceCast instruction. Reviewer: arsenm Differential Revision: https://reviews.llvm.org/D45993 llvm-svn: 332147	2018-05-11 22:17:57 +00:00
Craig Topper	c3968b5f43	[X86] Remove and autoupgrade a bunch of FMA instrinsics that are no longer used by clang. llvm-svn: 332146	2018-05-11 21:59:34 +00:00
Artem Belevich	3657df88ef	[Split GEP] handle trunc() in separate-const-offset-from-gep pass. Let separate-const-offset-from-gep pass handle trunc() when it calculates constant offset relative to base. The pass itself may insert trunc() instructions when it canonicalises array indices to pointer-size integers and needs to handle trunc() in order to evaluate the offset. Differential Revision: https://reviews.llvm.org/D46732 llvm-svn: 332142	2018-05-11 21:13:19 +00:00
Yaxun Liu	be274cfd1b	[AMDGPU] Fix compilation failure when IR contains comdat Remove a useless SwitchSection which also causes compilation failure when IR contains comdat. The SwitchSection is useless because the current section is already correct text section for the function therefore no need to switch. It causes compilation failure for comdat because functions with comdat has specific text section, not the default .text section. Since HIP uses comdat, this bug caused failures for HIP. Differential Revision: https://reviews.llvm.org/D46770 llvm-svn: 332137	2018-05-11 20:40:14 +00:00
Sanjay Patel	0a4f88e840	[DAG] reduce code duplication; NFCI llvm-svn: 332133	2018-05-11 20:08:23 +00:00
Daniel Neilson	6975bd40b3	[InstCombine] Handle atomic memset in the same way as regular memset Summary: This change adds handling of the atomic memset intrinsic to the code path that simplifies the regular memset. In practice this means that we will now also expand a small constant-length atomic memset into a single unordered atomic store. Reviewers: apilipenko, skatkov, mkazantsev, anna, reames Reviewed By: reames Subscribers: reames, llvm-commits Differential Revision: https://reviews.llvm.org/D46660 llvm-svn: 332132	2018-05-11 20:04:50 +00:00
David Blaikie	3c91607119	Move standard library inclusions to after internal inclusions. llvm-svn: 332124	2018-05-11 19:21:40 +00:00
Vedant Kumar	d39b740ee3	[DAGCombiner] Set the right SDLoc on extended SETCC uses (7/N) ExtendSetCCUses updates SETCC nodes which use a load (OriginalLoad) to reflect a simplification to the load (ExtLoad). Based on my reading, ExtendSetCCUses may create new nodes to extend a constant attached to a SETCC. It also creates fresh SETCC nodes which refer to any updated operands. ISTM that the location applied to the new constant and SETCC nodes should be the same as the location of the ExtLoad. This was suggested by Adrian in https://reviews.llvm.org/D45995. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46216 llvm-svn: 332119	2018-05-11 18:40:10 +00:00
Vedant Kumar	8a31d7c70d	[DAGCombiner] Set the right SDLoc on a newly-created sextload (6/N) This teaches tryToFoldExtOfLoad to set the right location on a newly-created extload. With that in place, the logic for performing a certain ([s\|z]ext (load ...)) combine becomes identical for sexts and zexts, and we can get rid of one copy of the logic. The test case churn is due to dependencies on IROrders inherited from the wrong SDLoc. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46158 llvm-svn: 332118	2018-05-11 18:40:08 +00:00
Vedant Kumar	95b78428ff	[DAGCombiner] Factor out duplicated logic for an extload combine, NFC (5/N) Part of the logic for combining (zext (load ...)) and (sext (load ...)) is duplicated. This creates problems because bugs in one version have to be fixed again in the other version. To address this, as a first step, I've extracted the duplicate logic into a helper. I'll fix the debug location bug in the helper and eliminate the copy of its logic in a followup. Part of: llvm.org/PR37262 Differential Revision: https://reviews.llvm.org/D46157 llvm-svn: 332117	2018-05-11 18:40:02 +00:00
Chris Matthews	c499ebbb98	[LIT] replace output escapes wit a cdata block CDATA blocks don't need to have XML stuff escaped. Makes sense to wrap output in them instead of escaping. llvm-svn: 332116	2018-05-11 18:38:02 +00:00
Nico Weber	a89e464fbd	make add_llvm_fuzzer calls slightly more consisten with other cmake llvm-svn: 332112	2018-05-11 17:58:52 +00:00
David Bolvansky	1c3c8a0fe0	[InstCombine] snprintf optimizations Reviewers: spatel, efriedma, majnemer, rja, bkramer Reviewed By: rja, bkramer Subscribers: mstorsjo, rja, llvm-commits Differential Revision: https://reviews.llvm.org/D46285 llvm-svn: 332110	2018-05-11 17:50:49 +00:00
Simon Pilgrim	de0ff1af6e	[X86][BtVer2] Model ymm move as double pumped instructions We still need to handle mmx/xmm moves as 'decode-only' no-pipe instructions llvm-svn: 332109	2018-05-11 17:38:36 +00:00
Alex Bradbury	ac7024c937	[RISCV] Support .option rvc and norvc assembler directives These directives allow the 'C' (compressed) extension to be enabled/disabled within a single file. Differential Revision: https://reviews.llvm.org/D45864 Patch by Kito Cheng llvm-svn: 332107	2018-05-11 17:30:28 +00:00
Martin Storsjo	f736dbd653	[Analysis] Validate the return type of s(n)printf like libcalls If the sprintf function is static (as on mingw-w64, where many stdio functions are static inline wrappers), earlier optimization passes could optimize out the return value altogether, and make it void, which could break optimizations of this libcall that touch the return value. This fixes the issue discussed in PR37408 for the sprintf function. Differential Revision: https://reviews.llvm.org/D46752 llvm-svn: 332106	2018-05-11 16:53:56 +00:00
Simon Pilgrim	ca2f944c23	[X86][MMX] Tag MMX Move/Load/Store as WriteVec schedule classes Fixes an issue on SLM/Btver2 where we had instructions were being treated as scalar loads/stores llvm-svn: 332104	2018-05-11 16:38:59 +00:00
Geoff Berry	84b578a23c	[AArch64] Fix performPostLD1Combine to check for constant lane index. Summary: performPostLD1Combine in AArch64ISelLowering looks for vector insert_vector_elt of a loaded value which it can optimize into a single LD1LANE instruction. The code checking for the pattern was not checking if the lane index was a constant which could cause two problems: - an assert when lowering the LD1LANE ISD node since it assumes an constant operand - an assert in isel if the lane index value depends on the post-incremented base register Both of these issues are avoided by simply checking that the lane index is a constant. Fixes bug 35822. Reviewers: t.p.northover, javed.absar Subscribers: rengolin, kristof.beyls, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D46591 llvm-svn: 332103	2018-05-11 16:25:06 +00:00
Simon Dardis	0ea4d8784a	[mips] Rename Filler to MipsDelaySlotFiller and initialize the pass llvm-svn: 332102	2018-05-11 16:13:53 +00:00
Sanjoy Das	42e300425d	Use iteration instead of recursion in CFIInserter Summary: This recursive step can overflow the stack. Reviewers: djokov, petarj Subscribers: mcrosier, jlebar, bixia, llvm-commits Differential Revision: https://reviews.llvm.org/D46671 llvm-svn: 332101	2018-05-11 15:54:46 +00:00
Davide Italiano	3999513966	[Reassociate] Prevent infinite loops when processing PHIs. Phi nodes can reside in live blocks but one of their incoming arguments can come from a dead block. Dead blocks and reassociate don't play nice together. In fact, reassociate performs an RPO as a first step to avoid processing dead blocks. The reason why Reassociate might not fixpoint when examining dead blocks is that the following: %xor0 = xor i16 %xor1, undef %xor1 = xor i16 %xor0, undef is perfectly valid LLVM IR (if it appears in a dead block), so the worklist algorithm keeps pushing the two instructions for reexamination. Note that this is not Reassociate fault, at least not entirely. It's llvm that has a weird definition of dominance. Fixes PR37390. llvm-svn: 332100	2018-05-11 15:45:36 +00:00
Nico Weber	38cc0a7bb2	Remove unused SyncExecutor and make it clearer that the whole file is only used if LLVM_ENABLE_THREADS llvm-svn: 332098	2018-05-11 15:25:38 +00:00
Simon Dardis	11f7996bf1	[mips] Enable disassembly of fused (negative) multiply add/sub instructions Reviewers: atanasyan, smaksimovic, abeserminji Differential Revision: https://reviews.llvm.org/D46392 llvm-svn: 332097	2018-05-11 15:21:40 +00:00
Simon Pilgrim	8437c54756	[X86][SLM] Vector stores only use the MEC port. Confirmed by both Agner and Intel's AOM - the IEC/FPC are not required for pure load/stores (even if its a partial update). Can't fix WriteStore until all RMW instructions are cleaned up though.... llvm-svn: 332096	2018-05-11 15:16:15 +00:00
Simon Pilgrim	915a53f86e	[X86] Split WriteF/WriteVec Move/Load/Store scheduler classes by vector width Fixes a SNB issue that was missing vlddqu/vmovntdqa ymm instructions llvm-svn: 332094	2018-05-11 14:30:54 +00:00
Daniel Neilson	d7e4cf1520	[InstCombine] Unify handling of atomic memtransfer with non-atomic memtransfer Summary: This change reworks the handling of atomic memcpy within the instcombine pass. Previously, a constant length atomic memcpy would be lowered into loads & stores as long as no more than 16 load/store pairs are created. This is quite different from the lowering done for a non-atomic memcpy; which only ever lowers into a single load/store pair of no more than 8 bytes. Larger constant-sized memcpy calls are expanded to load/stores in later passes, such as SelectionDAG lowering. In this change the behaviour for atomic memcpy is unified with non-atomic memcpy; atomic memcpy is now treated in the same was as non-atomic memcpy has always been. We leave it to later passes to lower longer-length atomic memcpy calls. Due to the structure of the pass's handling of memtransfer intrinsics, this change also gives us handling of atomic memmove that we did not previously have. Reviewers: apilipenko, skatkov, mkazantsev, anna, reames Reviewed By: reames Subscribers: reames, llvm-commits Differential Revision: https://reviews.llvm.org/D46658 llvm-svn: 332093	2018-05-11 14:30:02 +00:00
Simon Pilgrim	9ca321a77d	[X86] Added scheduler helper classes to split move/load/store by size Nothing uses this yet but this will allow us to specialize MMX/XMM/YMM/ZMM vector moves. llvm-svn: 332090	2018-05-11 12:46:54 +00:00
Sven van Haastregt	31fc47e039	[APFloat] Set losesInfo on no-op convert losesInfo would be left unset when no conversion needs to be done. A caller such as InstCombine's fitsInFPType would then branch on an uninitialized value. Caught using valgrind on an out-of-tree target. Differential Revision: https://reviews.llvm.org/D46645 llvm-svn: 332087	2018-05-11 09:45:42 +00:00
Jakub Kuderski	eaa14c9d93	[IRTests] Verify PDT instead of DT Summary: Fix two typos which result in verifying wrong data structures (DT) instead of PDT in DominatorTreeBatchUpdatesTest. Reviewers: davide, kuhar, grosser, dberlin Reviewed By: davide, kuhar, dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46696 llvm-svn: 332086	2018-05-11 09:30:29 +00:00
Tom Stellard	82f61e1f97	AMDGPU/GlobalISel: Implement select() for 32-bit G_FPTOUI Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D45883 llvm-svn: 332082	2018-05-11 05:44:16 +00:00
Alexander Shaposhnikov	099f0ecb8a	[llvm-strip] Add support for -remove-section This diff adds support for -remove-section to llvm-strip. Test plan: make check-all Differential revision: https://reviews.llvm.org/D46567 llvm-svn: 332081	2018-05-11 05:27:06 +00:00
Craig Topper	27b8020b8e	[X86] Remove and autoupgrade the avx512.mask.store.ss intrinsic. llvm-svn: 332079	2018-05-11 04:33:18 +00:00
Alexander Shaposhnikov	22e394ed73	[llvm-objcopy] Update remove-section.test Verify that the input binary is not getting modified and add an invocation which uses -remove-section instead of -R. Test plan: make check-all llvm-svn: 332078	2018-05-11 04:30:57 +00:00
Brian Gesiak	c57bdbb244	[Coroutines] PR34897: Fix incorrect elisions Summary: https://bugs.llvm.org/show_bug.cgi?id=34897 demonstrates an incorrect coroutine frame allocation elision in the coro-elide pass. The elision is performed on the basis that the SSA variables from all llvm.coro.begin are directly referenced in subsequent llvm.coro.destroy instructions. However, this ignores the fact that the function may exit through paths that do not run these destroy instructions. In the sample program from PR34897, for example, the llvm.coro.destroy instruction is only executed in exception handling code. When the coroutine function exits normally, llvm.coro.destroy is not called. Eliding the allocation in this case causes a subsequent reference to the coroutine handle from outside of the function to access freed memory. To fix the issue, when finding an llvm.coro.destroy for each llvm.coro.begin, only consider llvm.coro.destroy that are executed along non-exceptional paths. Test Plan: 1. Download the sample program from https://bugs.llvm.org/show_bug.cgi?id=34897, compile it with `clang++ -fcoroutines-ts -stdlib=libc++ -std=c++1z -O2`, and run it. It should print `"run1\ncheck1\nrun2\ncheck2"` and then exit successfully. 2. Compile https://godbolt.org/g/mCKfnr and confirm it is still optimized to a single instruction, 'return 1190'. 3. `check-llvm` Reviewers: rsmith, GorNishanov, eric_niebler Reviewed By: GorNishanov Subscribers: andrewrk, lewissbaker, EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D43242 llvm-svn: 332077	2018-05-11 03:12:28 +00:00
Brian Gesiak	3c09c00c23	[Support] Add docs for 'openFileFor{Write,Read}' Summary: Add documentation for the LLVM Support functions `openFileForWrite` and `openFileForRead`. The `openFileForRead` parameter `RealPath`, in particular, I think warranted some explanation. In addition, make the behavior of the functions more consistent across platforms. Prior to this patch, Windows would set or not set the result file descriptor based on the nature of the error, whereas Unix would consistently set it to `-1` if the open failed. Make Windows consistently set it to `-1` as well. Test Plan: 1. `ninja check-llvm` 2. `ninja docs-llvm-html` Reviewers: zturner, rnk, danielmartin, scanon Reviewed By: danielmartin, scanon Subscribers: scanon, danielmartin, llvm-commits Differential Revision: https://reviews.llvm.org/D46499 llvm-svn: 332075	2018-05-11 01:47:27 +00:00
Kostya Serebryany	c60a094f7b	[sanitizer-coverage] don't instrument a function if it's entry block ends with 'unreachable' llvm-svn: 332072	2018-05-11 01:09:39 +00:00
Kamil Rytarowski	6687acf7c5	Register NetBSD/i386 in AddressSanitizer.cpp Summary: Ship kNetBSD_ShadowOffset32 set to 1ULL << 30. This is prepared for the amd64 kernel runtime. Sponsored by <The NetBSD Foundation> Reviewers: vitalybuka, joerg, kcc Reviewed By: vitalybuka Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46724 llvm-svn: 332069	2018-05-11 00:58:01 +00:00
Craig Topper	80b54def63	[InstCombine] Add tests for cases where we don't recognize type promoted rotate idioms. These rotates take the form (x << (n & mask)) \| (x >> (-n & mask)) where mask is bitwidth - 1. If x has been promoted to a wider type than its original bit width due to type promotion we fail to narrower it and therefore don't recognize it as a rotate. llvm-svn: 332068	2018-05-11 00:46:09 +00:00
Chris Matthews	4826565393	Support Unsupported Tests in xunit output We were reporting "Unsupported" tests in xunit as passes, however since they are not run, it make more sense to mark them as skipped. The Junit xml standard has support for that, so lets use it. llvm-svn: 332065	2018-05-11 00:25:43 +00:00
Chris Matthews	f5b7f9eb76	Refactor xunit test case builder to not use as much str addition String concatenation in python is slow. Refactor to not concatenate the possibly large strings of test output and instead write them directly to the output file. llvm-svn: 332064	2018-05-11 00:25:42 +00:00
Wei Mi	31d6000df6	[SampleFDO] Don't treat warm callsite with inline instance in the profile as cold We found current sampleFDO had a performance issue when triaging a regression. For a callsite with inline instance in the profile, even if hot callsite inliner cannot inline it, it may still execute enough times and should not be treated as cold in regular inliner later. However, currently if such callsite is not inlined by hot callsite inliner, and the BB where the callsite locates doesn't get samples from other instructions inside of it, the callsite will have no profile metadata annotated. In regular inliner cost analysis, if the callsite has no profile annotated and its caller has profile information, it will be treated as cold. The fix changes the isCallsiteHot check and chooses to compare CallsiteTotalSamples with hot cutoff value computed by ProfileSummaryInfo. Differential Revision: https://reviews.llvm.org/D45377 llvm-svn: 332058	2018-05-10 23:02:27 +00:00
Vedant Kumar	1cb9bbbb8e	[STLExtras] Add distance() for ranges, pred_size(), and succ_size() This commit adds a wrapper for std::distance() which works with ranges. As it would be a common case to write `distance(predecessors(BB))`, this also introduces `pred_size()` and `succ_size()` helpers to make that easier to write. Differential Revision: https://reviews.llvm.org/D46668 llvm-svn: 332057	2018-05-10 23:01:54 +00:00

1 2 3 4 5 ...

163994 Commits