llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-11 07:18:44 +00:00

Author	SHA1	Message	Date
Jonas Devlieghere	18c6c56bf4	[ADT] Recognize ppc as valid architecture in target triple. Until this patch, only `powerpc` and `ppc32` were recognized as valid PowerPC 32-bit architectures in a target triple. This was incompatible with the triple `ppc-apple-darwin` as returned for libObject. I found out about this when working on a test case using a binary generated on an old PowerBook G4. We had the choice of either fix this in the Mach-O object parser or in the Triple implementation. I chose the latter because it feels like the most canonical place. Differential revision: https://reviews.llvm.org/D43760 llvm-svn: 326182	2018-02-27 10:09:58 +00:00
Florian Hahn	d8fb401b6d	[NewGVN] Update phi-of-ops def block when updating existing ValuePHI. In case we update a ValuePHI node created earlier, we could update it based on a different OpPHI which could be in a different block. We need to update the TempToBlock mapping reflecting the new block, otherwise we would end up placing the new phi node in a wrong block. This problem is exposed by the test case in https://bugs.llvm.org/show_bug.cgi?id=36504. This patch fixes a slightly simpler problem than in the bug report. In the bug's re-producer, the additional problem is that we are re-using a ValuePHI node with to few incoming values for the new OpPHI. If this patch makes sense, I will follow it up with a patch that creates a new PHI node if the existing PHI node has a different number of incoming values. Reviewers: davide, dberlin Reviewed By: dberlin Differential Revision: https://reviews.llvm.org/D43770 llvm-svn: 326181	2018-02-27 09:34:51 +00:00
Jonas Paulsson	4b941433c5	[SystemZ] Make sure SelectCode() is not called on a target opcode. Since getNode() might not always return the requsted opcode, for instance if called with (ISD::AND, -1) arguments, there should be a check so that SelectCode() is only called when appropriate. Review: Ulrich Weigand llvm-svn: 326178	2018-02-27 07:53:23 +00:00
George Burgess IV	e3a72f8453	[MemorySSA] Invalidate def caches on deletion The only cases I can come up with where this invalidation needs to happen is when there's a deletion somewhere. If we find more creative test-cases, we can probably go with another approach mentioned on PR36529. Fixes PR36529. llvm-svn: 326177	2018-02-27 07:20:49 +00:00
George Burgess IV	f0bdcb7452	[MemorySSA] Call the correct dtors It appears that there were many cases where we were directly (through templates) calling the dtor of MemoryAccess, which is conceptually an abstract class. This hasn't been a problem, since the data members of all of the subclasses of MemoryAccess have been POD. I'm planning on changing that. :) llvm-svn: 326175	2018-02-27 06:43:19 +00:00
Serguei Katkov	13c7f86c5f	[SCEV] Cleanup SCEVInitRewriter. NFC. Set default value for IgnoreOtherLoops of SCEVInitRewriter::rewrite to true to be consistent with SCEVPostIncRewriter which does not have this parameter but behaves as it would be true. This is follow up for rL326067. llvm-svn: 326174	2018-02-27 06:39:31 +00:00
Craig Topper	e35270bbad	[X86] Simplify if condition. NFC SSE2 implies SSE1 and we already covered f32 in the SSE1 check so we don't need to check f32 in the SSE2 check. llvm-svn: 326170	2018-02-27 06:00:38 +00:00
Adam Nemet	74608c9091	Make test agnostic to cost model This was causing bot failures on greendragon llvm-svn: 326169	2018-02-27 05:41:16 +00:00
Craig Topper	3a5307a95d	[X86] Replace an impossible if condition with an assert. llvm-svn: 326167	2018-02-27 03:50:00 +00:00
Evgeny Stupachenko	00368b6cf9	Fix r326154 buildbots test fail Summary: Add specific mtriples to tests added in r326154. From: Evgeny Stupachenko <evstupac@gmail.com> <evgeny.v.stupachenko@intel.com> llvm-svn: 326158	2018-02-27 01:33:11 +00:00
Evgeny Stupachenko	1be6bd5cdf	Fix PR36032, PR35432 Summary: The change fix an assert fail at ScalarEvolutionExpander.cpp: assert(ExitCount != SE.getCouldNotCompute() && "Invalid loop count"); Reviewers: sbaranga Differential Revision: http://reviews.llvm.org/D42604 From: Evgeny Stupachenko <evstupac@gmail.com> <evgeny.v.stupachenko@intel.com> llvm-svn: 326154	2018-02-27 00:17:31 +00:00
Craig Topper	bf5d3fdc29	[SelectionDAG] Remove code from PromoteIntRes_CONCAT_VECTORS that was added in r320674 to help X86. AVX512 used to promote v32i1 to v32i8 during legalization when BWI was disabled. So this code was added to improve legalization of v32i1 concat_vectors of v16i1 by extending the v16i1 to v16i8 to avoid scalarization. X86 has since switched to legalizing v32i1 by splitting to v16i1 instead. This has rendered this code unnecessary and its no longer exercised. llvm-svn: 326153	2018-02-27 00:07:24 +00:00
Sanjay Patel	42a0576ba2	[InstCombine, InstSimplify] add tests with undef elements in constant FP vectors; NFC llvm-svn: 326148	2018-02-26 23:23:02 +00:00
Evandro Menezes	226db1297f	[AArch64] Harden test cases NFC llvm-svn: 326147	2018-02-26 23:19:25 +00:00
Aditya Nandakumar	5814a979c0	[GISel]: Don't assert when constraining RegisterOperands which are uses. Currently we assert that only non target specific opcodes can have missing RegisterClass constraints in the MCDesc. The backend can have instructions with register operands but don't have RegisterClass constraints (say using unknown_class) in which case the instruction defining the register will constrain it. Change the assert to only fire if a def has no regclass. https://reviews.llvm.org/D43409 llvm-svn: 326142	2018-02-26 22:56:21 +00:00
Craig Topper	d323486433	[ValueTracking] Teach cannotBeOrderedLessThanZeroImpl to handle vector constants. Summary: This allows vector fabs to be removed in more cases. Reviewers: spatel, arsenm, RKSimon Reviewed By: spatel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D43739 llvm-svn: 326138	2018-02-26 22:33:17 +00:00
Simon Pilgrim	b411e57bfe	[X86][SSE] Reduce FADD/FSUB/FMUL costs on later targets (PR36280) Agner's tables indicate that for SSE42+ targets (Core2 and later) we can reduce the FADD/FSUB/FMUL costs down to 1, which should fix the Himeno benchmark. Note: the AVX512 FDIV costs look rather dodgy, but this isn't part of this patch. Differential Revision: https://reviews.llvm.org/D43733 llvm-svn: 326133	2018-02-26 22:10:17 +00:00
Scott Linder	48859b5145	[DebugInfo] Remove target-specific instructions in test This AsmParser test is target-agnostic, but contained some target-specific instructions, which broke on SystemZ. llvm-svn: 326129	2018-02-26 21:21:19 +00:00
Craig Topper	f7c00cdf86	[X86] Add constant folding to combineMOVMSK. There's still some shortcoming in our ability to combine binops of constants with different sizes separated by an extend. I'll try to look at that next. llvm-svn: 326128	2018-02-26 21:17:33 +00:00
Adam Nemet	2ac531973e	[opt-viewer] Kill parser processes before moving onto rendering The main benefit is that they release the memory they were holding onto. llvm-svn: 326127	2018-02-26 21:15:51 +00:00
Adam Nemet	b38d91df8a	opt-diff: Support splitting to multiple output files When reading the resulting files back with opt-viewer, they will be parsed in parallel. llvm-svn: 326126	2018-02-26 21:15:51 +00:00
Adam Nemet	4ac322d5ad	[opt-viewer] Set title for the source pages llvm-svn: 326125	2018-02-26 21:15:50 +00:00
Adam Nemet	fce17d346b	opt-viewer: also find thinlto opt.yaml files llvm-svn: 326124	2018-02-26 21:15:49 +00:00
Adam Nemet	9a50f35955	opt-viewer: output index first One can start looking at the index while the pages are still generating llvm-svn: 326123	2018-02-26 21:15:47 +00:00
Craig Topper	f4e4e91a6a	[X86] Add a custom legalization for (i16 (bitcast v16i1)) and (i32 (bitcast v32i1)) without AVX512 to prevent scalarization Summary: We have an early DAG combine to turn these patterns into MOVMSK, but that combine doesn't work if the vXi1 type has more elements than the widest legal vXi8 type. Type legalization will eventually split it down to v16i1 or v32i1 and then the bitcast gets legalized to a truncstore and a scalar load. The truncstore will get lowered to a series of extracts and bit math. This patch adds a custom legalization to use a sign extend and MOVMSK instead. This prevents the eventual scalarization. Reviewers: spatel, RKSimon, zvi Reviewed By: RKSimon Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D43593 llvm-svn: 326119	2018-02-26 20:32:27 +00:00
Alexey Bataev	19c404c562	[SLP] Added new test + fixed some checks, NFC. llvm-svn: 326117	2018-02-26 20:01:24 +00:00
Craig Topper	35a1a2ec29	[InstCombine] Add test cases with vector constants to fpextend.ll llvm-svn: 326115	2018-02-26 19:36:37 +00:00
Craig Topper	e2c3788d3e	[InstCombine] Switch to using FileCheck instead of grep. Auto-generate checks. NFC llvm-svn: 326114	2018-02-26 19:36:36 +00:00
David Zarzycki	cf1b5b5fd0	[ADT] Simplify and optimize StringSwitch This change improves incremental rebuild performance on dual Xeon 8168 machines by 54%. This change also improves run time code gen by not forcing the case values to be lvalues. llvm-svn: 326109	2018-02-26 18:41:26 +00:00
Adam Nemet	dd0f9242a1	[LTO] Support filtering by hotness threshold This wires up -pass-remarks-hotness-threshold to LTO and ThinLTO. Next is to change the clang driver to pass this with -fdiagnostics-hotness-threshold. Differential Revision: https://reviews.llvm.org/D41465 llvm-svn: 326107	2018-02-26 18:37:45 +00:00
Simon Pilgrim	c01265261f	[X86][AVX] createPSADBW - support 256-bit cases on AVX1 via SplitBinaryOpsAndApply llvm-svn: 326104	2018-02-26 18:17:25 +00:00
Matt Arsenault	bc0507a97a	AMDGPU/GlobalISel: Make f64 constants legal llvm-svn: 326101	2018-02-26 17:20:43 +00:00
Sanjay Patel	473137160c	[InstCombine] allow fdiv folds with less than fully 'fast' ops Note: gcc appears to allow this fold with -freciprocal-math alone, but clang/llvm require more than that with this patch. The wording in the definitions seems fuzzy enough that it could go either way, but we'll err on the conservative side of FMF interpretation. This patch also changes the newly created fmul to have FMF propagated by the last fdiv rather than intersecting the FMF of the fdivs. This matches the behavior of other folds near here. The new fmul is only used to produce an intermediate op for the final fdiv result, so it shouldn't be any stricter than that result. The previous behavior could result in dropping FMF via other folds in instcombine or CSE. Differential Revision: https://reviews.llvm.org/D43398 llvm-svn: 326098	2018-02-26 16:02:45 +00:00
Simon Pilgrim	9dcd90a981	[X86][AVX] Add AVX1 PSAD tests Cleanup check-prefixes to share more AVX/AVX512 codegen checks llvm-svn: 326097	2018-02-26 15:55:25 +00:00
Ilya Biryukov	469668c08f	Revert r326092: [gtest] Add PrintTo overload for StringRef. It seems to break the following buildbot: http://lab.llvm.org:8011/builders/sanitizer-windows/builds/24729 Will resubmit after investigating and fixing it. llvm-svn: 326096	2018-02-26 15:54:59 +00:00
Francis Visoiu Mistrih	0164936dcf	[CodeGen] Don't omit any redundant information in -debug output In r322867, we introduced IsStandalone when printing MIR in -debug output. The default behaviour for that was: 1) If any of MBB, MI, or MO are -debug-printed separately, don't omit any redundant information. 2) When -debug-printing a MF entirely, don't print any redundant information. 3) When printing MIR, don't print any redundant information. I'd like to change 2) to: 2) When -debug-printing a MF entirely, don't omit any redundant information. Differential Revision: https://reviews.llvm.org/D43337 llvm-svn: 326094	2018-02-26 15:23:42 +00:00
Simon Pilgrim	6d0f13ed44	[X86][SSE] Regenerate PSAD tests Fixes scary typo in a check that lost the end digit off a reg#... llvm-svn: 326093	2018-02-26 15:21:58 +00:00
Ilya Biryukov	15612d8d16	[gtest] Add PrintTo overload for StringRef. Summary: It was printed using code for generic containers before, resulting in unreadable output. Reviewers: sammccall, labath Reviewed By: sammccall, labath Subscribers: labath, zturner, llvm-commits Differential Revision: https://reviews.llvm.org/D43330 llvm-svn: 326092	2018-02-26 15:19:26 +00:00
Jonas Devlieghere	e115cbbdd0	Re-land: "[Support] Replace HashString with djbHash." This patch removes the HashString function from StringExtraces and replaces its uses with calls to djbHash from DJB.h. This change is almost NFC. While the algorithm is identical, the djbHash implementation in StringExtras used 0 as its default seed while the implementation in DJB uses 5381. The latter has been shown to result in less collisions and improved avalanching and is used by the DWARF accelerator tables. Because some test were implicitly relying on the hash order, I've reverted to using zero as a seed for the following two files: lld/include/lld/Core/SymbolTable.h llvm/lib/Support/StringMap.cpp Differential revision: https://reviews.llvm.org/D43615 llvm-svn: 326091	2018-02-26 15:16:42 +00:00
Tim Renouf	3506872ca3	[AMDGPU] Scratch setup fix on AMDPAL gfx9+ merge shader Summary: With OS type AMDPAL, the scratch descriptor is hardwired to be loaded from offset 0 of the global information table, whose low pointer is passed in s0. For a merge shader on gfx9+, it needs to be s8 instead, as the hardware reserves s0-s7. Reviewers: kzhuravl Subscribers: arsenm, nhaehnle, dstuttard, llvm-commits, t-tye, yaxunl, wdng, kzhuravl Differential Revision: https://reviews.llvm.org/D42203 llvm-svn: 326088	2018-02-26 14:46:43 +00:00
Tim Renouf	ef2523a622	[LiveIntervals] Handle moving up dead partial write Summary: In the test case, the machine scheduler moves a dead write to a subreg up into the middle of a segment of the overall reg's live range, where the segment had liveness only for other subregs in the reg. handleMoveUp created an invalid live range, causing an assert a bit later. This commit fixes it to handle that situation. The segment is split in two at the insertion point, and the part after the split, and any subsequent segments up to the old position, are changed to be defined by the moved def. V2: Better test. Subscribers: MatzeB, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D43478 Change-Id: Ibc42445ddca84e79ad1f616401015d22bc63832e llvm-svn: 326087	2018-02-26 14:42:13 +00:00
David Zarzycki	bea3786cf9	Test commit llvm-svn: 326085	2018-02-26 13:05:18 +00:00
Jonas Devlieghere	f3b584f7be	Revert "[Support] Replace HashString with djbHash." It looks like some of our tests depend on the ordering of hashed values. I'm reverting my changes while I try to reproduce and fix this locally. Failing builds: lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/18388 lab.llvm.org:8011/builders/clang-cmake-x86_64-sde-avx512-linux/builds/6743 lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast/builds/15607 llvm-svn: 326082	2018-02-26 12:05:18 +00:00
Jonas Devlieghere	e8b48cad10	[Support] Replace HashString with djbHash. This removes the HashString function from StringExtraces and replaces its uses with calls to djbHash from DJB.h This is almost NFC. While the algorithm is identical, the djbHash implementation in StringExtras used 0 as its seed while the implementation in DJB uses 5381. The latter has been shown to result in less collisions and improved avalanching. https://reviews.llvm.org/D43615 (cherry picked from commit 77f7f965bc9499a9ae768a296ca5a1f7347d1d2c) llvm-svn: 326081	2018-02-26 11:30:13 +00:00
Benjamin Kramer	c3fa426be7	[WebAssembly] Relax constexpr for old standard libraries. This will still be constexpr when the standard library supports it, but doesn't force constexpr. Old libraries will get a global constructor, which is not too bad. llvm-svn: 326080	2018-02-26 11:07:25 +00:00
Renato Golin	758dbd2bbb	[LV] Move isLegalMasked* functions from Legality to CostModel All SIMD architectures can emulate masked load/store/gather/scatter through element-wise condition check, scalar load/store, and insert/extract. Therefore, bailing out of vectorization as legality failure, when they return false, is incorrect. We should proceed to cost model and determine profitability. This patch is to address the vectorizer's architectural limitation described above. As such, I tried to keep the cost model and vectorize/don't-vectorize behavior nearly unchanged. Cost model tuning should be done separately. Please see http://lists.llvm.org/pipermail/llvm-dev/2018-January/120164.html for RFC and the discussions. Closes D43208. Patch by: Hideki Saito <hideki.saito@intel.com> llvm-svn: 326079	2018-02-26 11:06:36 +00:00
Florian Hahn	823dbe7c62	[LoopInterchange] Add test case for D43236. llvm-svn: 326078	2018-02-26 10:46:25 +00:00
Florian Hahn	32006dcf64	[LoopInterchange] Loops with empty dependency matrix are safe. The dependency matrix is only empty if no conflicting load/store instructions have been found. In that case, it is safe to interchange. For the LLVM test-suite, after this change around 1900 loops are interchanged, whereas it is 15 before this change. On cortex-a57, this gives an improvement of -0.57% on the geomean execution time of SPEC2006, SPEC2000 and the test-suite. There are a few small perf regressions, but I think we can improve on those by making the cost model better. Reviewers: karthikthecool, mcrosier Reviewed by: karthikthecool Differential Revision: https://reviews.llvm.org/D43236 llvm-svn: 326077	2018-02-26 10:45:25 +00:00
Andrew V. Tischenko	2e806a1d4c	The final step to close D41278 [MachineCombiner] Improve debug output (NFC). Differential Revision: https://reviews.llvm.org/D41278 llvm-svn: 326074	2018-02-26 09:43:21 +00:00
Serguei Katkov	910a58d557	[SCEV] Factor out getUsedLoops The patch introduces the new function in ScalarEvolution to get all loops used in specified SCEV. This is a preparation for re-writing isKnownPredicate utility as described in https://reviews.llvm.org/D42417. Reviewers: sanjoy, mkazantsev, reames Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D43504 llvm-svn: 326072	2018-02-26 09:26:41 +00:00

1 2 3 4 5 ...

160699 Commits