llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-30 23:00:36 +00:00

Author	SHA1	Message	Date
Martin Storsjö	7b4e9362ee	[llvm-rc] Update a comment. NFC. Fix a typo and mention one missing step.	2020-09-16 09:34:26 +03:00
Martin Storsjö	50a529e27d	[llvm-rc] Allow omitting components from VERSIONINFO versions MS rc.exe doesn't require specifying all 4 components. Differential Revision: https://reviews.llvm.org/D87570	2020-09-16 09:34:26 +03:00
Alina Sbirlea	3b2eea568d	[MemorySSA] Set MustDominate to true for PhiTranslation.	2020-09-15 23:29:57 -07:00
Craig Topper	acde6e65a0	[X86] Don't scalarize gather/scatters with non-power of 2 element counts. Widen instead. We can pad the mask with zeros in order to widen. We already do this for power 2 types that are smaller than a legal type.	2020-09-15 23:22:53 -07:00
Craig Topper	1eb43a5d55	[X86] Add test case for non-power of 2 scatter. NFC	2020-09-15 23:03:39 -07:00
Max Kazantsev	67567b8db5	[Test] Add signed version of a test	2020-09-16 11:30:21 +07:00
Serguei Katkov	4f6cfebd9e	[InstCombine] Add tests for statepoint simplification This tests increase coverage for change introduced in D85959 Reviewers: reames, reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D87224	2020-09-16 10:56:38 +07:00
Arthur Eubanks	8e54da3615	[NewPM] Fix opt-hot-cold-split.ll under NPM Pin to legacy PM, there are already NPM RUN lines.	2020-09-15 20:29:20 -07:00
Arthur Eubanks	623ad434a7	[NewPM][SCEV] Fix constant-fold-gep.ll under NPM	2020-09-15 20:25:35 -07:00
Arthur Eubanks	dd08b528d1	[NewPM] Fix 2003-02-19-LoopInfoNestingBug.ll under NPM Also move it to a more appropriate directory.	2020-09-15 20:21:45 -07:00
Craig Topper	16cd8d4cc5	[X86] Always use 16-bit displacement in 16-bit mode when there is no base or index register. Previously we only did this if the immediate fit in 16 bits, but the GNU assembler seems to just truncate. Fixes PR46952	2020-09-15 19:31:48 -07:00
Alina Sbirlea	29208e15c0	Fix test after D86156.	2020-09-15 19:13:39 -07:00
Krzysztof Parzyszek	718a375cea	[Hexagon] Replace incorrect pattern for vpackl HWI32 -> HVi8 V6_vdealb4w is not correct for pairs, use V6_vpackeh/V6_vpackeb instead.	2020-09-15 20:34:50 -05:00
Arthur Eubanks	5540a2de5d	[NewPM] Port strip* passes to NPM strip-nondebug and strip-debug-declare have no existing associated tests Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D87639	2020-09-15 18:25:12 -07:00
Arthur Eubanks	f33d2689ad	[LowerSwitch][NewPM] Port lowerswitch to NPM Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D87726	2020-09-15 18:18:31 -07:00
Wenlei He	a2451f2478	SVML support for log10, sqrt Although LLVM supports vectorization of loops containing log10/sqrt, it did not support using SVML implementation of it. Added support so that when clang is invoked with -fveclib=SVML now an appropriate SVML library log2 implementation will be invoked. Follow up on: https://reviews.llvm.org/D77114 Tests: Added unit tests to svml-calls.ll, svml-calls-finite.ll. Can be run with llvm-lint. Created a simple c++ file that tests log10/sqrt, and used clang+ to build it, and output final assembly. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D87169	2020-09-15 17:29:44 -07:00
Wenlei He	c7934c2798	[LICM] Make Loop ICM profile aware again D65060 was reverted because it introduced non-determinism by using BFI counts from already freed blocks. The parent of this revision fixes that by using a VH callback on blocks to prevent this from happening and makes sure BFI data is passed correctly in LoopStandardAnalysisResults. This re-introduces the previous optimization of using BFI data to prevent LICM from hoisting/sinking if the instruction will end up moving to a colder block. Internally at Facebook this change results in a ~7% win in a CPU related metric in one of our big services by preventing hoisting cold code into a hot pre-header like the added test case demonstrates. Testing: ninja check Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D87551	2020-09-15 17:21:58 -07:00
Jessica Paquette	5ffe1901d5	[AArch64][GlobalISel] Refactor + improve CMN, ADDS, and ADD emit functions These functions were extremely similar: - `emitADD` - `emitADDS` - `emitCMN` Refactor them a little, introducing a more generic `emitInstr` function to do most of the work. Also add support for the immediate + shifted register addressing modes in each of them. Update select-uaddo.mir to show that selecing ADDS now supports folding immediates + shifts. (I don't think this can impact CMN, because the CMN checks require a G_SUB with a non-constant on the RHS.) This is around a 0.02% code size improvement on CTMark at -O3. Differential Revision: https://reviews.llvm.org/D87529	2020-09-15 17:18:05 -07:00
Arthur Eubanks	d8616d8c76	[CGSCC][NewPM] Fix adding mutually recursive new functions When adding a new function via addNewFunctionIntoRefSCC(), it creates a new node and immediately populates the edges. Since populateSlow() calls G->get() on all referenced functions, it will create a node (but not populate it) for functions that haven't yet been added. If we add two mutually recursive functions, the assert that the node should never have been created will fire when the second function is added. So here we remove that assert since the node may have already been created (but not yet populated). createNode() is only called from addNewFunctionInto{,Ref}SCC(). https://bugs.llvm.org/show_bug.cgi?id=47502 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D87623	2020-09-15 16:44:08 -07:00
Volkan Keles	dfd3344cb4	GlobalISel: Fix a failing combiner test test/CodeGen/AArch64/GlobalISel/combine-trunc.mir was failing due to the different order for evaluating function arguments. This patch updates the related code to fix the issue.	2020-09-15 16:40:38 -07:00
Alexandre Ganea	f0d611207d	[llvm][cmake] Change LLVM_INTEGRATED_CRT_ALLOC to a path instead of a boolean Differential Revision: https://reviews.llvm.org/D87609	2020-09-15 19:18:52 -04:00
Wenlei He	5c1dccafc2	[BFI] Make BFI information available through loop passes inside LoopStandardAnalysisResults ~~D65060 uncovered that trying to use BFI in loop passes can lead to non-deterministic behavior when blocks are re-used while retaining old BFI data.~~ ~~To make sure BFI is preserved through loop passes a Value Handle (VH) callback is registered on blocks themselves. When a block is freed it now also wipes out the accompanying BFI entry such that stale BFI data can no longer persist resolving the determinism issue. ~~ ~~An optimistic approach would be to incrementally update BFI information throughout the loop passes rather than only invalidating them on removed blocks. The issues with that are:~~ ~~1. It is not clear how BFI information should be incrementally updated: If a block is duplicated does its BFI information come with? How about if it's split/modified/moved around? ~~ ~~2. Assuming we can address these problems the implementation here will be a massive undertaking. ~~ ~~There's a known need of BFI in LICM analysis which requires correct but not incrementally updated BFI data. A follow-up change can register BFI in all loop passes so this preserved but potentially lossy data is available to any loop pass that wants it.~~ See: D75341 for an identical implementation of preserving BFI via VH callbacks. The previous statements do still apply but this change no longer has to be in this diff because it's already upstream 😄 . This diff also moves BFI to be a part of LoopStandardAnalysisResults since the previous method using getCachedResults now (correctly!) statically asserts (D72893) that this data isn't static through the loop passes. Testing Ninja check Reviewed By: asbirlea, nikic Differential Revision: https://reviews.llvm.org/D86156	2020-09-15 16:16:24 -07:00
Aditya Nandakumar	b83e257aa9	[GISel] Add new GISel combiners for G_MUL https://reviews.llvm.org/D87668 Patch adds two new GICombinerRules, one for G_MUL(X, 1) and another for G_MUL(X, -1). G_MUL(X, 1) is an identity combine, and G_MUL(X, -1) gets replaced with G_SUB(0, X). Patch additionally adds new combiner tests for the AArch64 target to test these new combiner rules, as well as updates AMDGPU GISel tests. Patch by mkitzan	2020-09-15 16:08:47 -07:00
Mircea Trofin	2d0a6945c4	[ThinLTO] add post-thinlto-merge option to -lto-embed-bitcode This will embed bitcode after (Thin)LTO merge, but before optimizations. In the case the thinlto backend is called from clang, the .llvmcmd section is also produced. Doing so in the case where the caller is the linker doesn't yet have a motivation, and would require plumbing through command line args. Differential Revision: https://reviews.llvm.org/D87636	2020-09-15 15:56:11 -07:00
Volkan Keles	f54424411f	GlobalISel: Add combines for G_TRUNC https://reviews.llvm.org/D87050	2020-09-15 15:50:34 -07:00
Stanislav Mekhanoshin	7f0d01b1a0	[AMDGPU] Unify intrinsic ret/nortn interface We have a single noret intrinsic an a lot of special handling around it. Declare it just as any other but do not define rtn instructions itself instead. Differential Revision: https://reviews.llvm.org/D87719	2020-09-15 15:26:42 -07:00
Xun Li	7bfa6044db	[TSAN] Handle musttail call properly in EscapeEnumerator (and TSAN) Call instructions with musttail tag must be optimized as a tailcall, otherwise could lead to incorrect program behavior. When TSAN is instrumenting functions, it broke the contract by adding a call to the tsan exit function inbetween the musttail call and return instruction, and also inserted exception handling code. This happend throguh EscapeEnumerator, which adds exception handling code and returns ret instructions as the place to insert instrumentation calls. This becomes especially problematic for coroutines, because coroutines rely on tail calls to do symmetric transfers properly. To fix this, this patch moves the location to insert instrumentation calls prior to the musttail call for ret instructions that are following musttail calls, and also does not handle exception for musttail calls. Differential Revision: https://reviews.llvm.org/D87620	2020-09-15 15:20:05 -07:00
Huihui Zhang	32cbc304b2	[SLPVectorizer][SVE] Skip scalable-vector instructions before vectorizeSimpleInstructions. For scalable type, the aggregated size is unknown at compile-time. Skip instructions with scalable type to ensure the list of instructions for vectorizeSimpleInstructions does not contains any scalable-vector instructions. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D87550	2020-09-15 13:10:15 -07:00
Ta-Wei Tu	524ed02943	[TableGen] Fix invalid comparison function `SizeOrder` in `getMatchingSubClassWithSubRegs` Building LLVM with -DEXPENSIVE_CHECKS fails with the following error message with libstdc++ in debug mode: Error: comparison doesn't meet irreflexive requirements, assert(!(a < a)). The patch fixes the comparison function SizeOrder by returning false when comparing two equal items.	2020-09-15 15:48:43 -04:00
Matt Arsenault	7872d20b80	InferAddressSpaces: Fix assert with unreachable code Invalid IR in unreachable code is technically valid IR. In this case, the address space of the value was never inferred, and we tried to rewrite it with an invalid address space value which would assert.	2020-09-15 15:48:43 -04:00
Muhammad Asif Manzoor	b757c8c8cd	[AArch64][SVE] Add lowering for llvm fsqrt Add the functionality to lower fsqrt for passthru variant Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D87707	2020-09-15 15:26:17 -04:00
Albion Fung	ef3cfcdc3b	[PowerPC] Implement __int128 vector divide operations This patch implements __int128 vector divide operations for ISA3.1. Differential Revision: https://reviews.llvm.org/D85453	2020-09-15 15:19:35 -04:00
Arthur Eubanks	50c071e797	[Dominators][NewPM] Pin tests with -analyze to legacy PM -analyze isn't supported in NPM. All affected tests have corresponding NPM RUN line.	2020-09-15 11:59:00 -07:00
Arthur Eubanks	32327822f1	[DemandedBits][NewPM] Pin some tests to legacy PM All tests have corresponding NPM RUN lines. -analyze doesn't work under NPM.	2020-09-15 11:55:58 -07:00
LLVM GN Syncbot	6c32c3191f	[gn build] Port 3d42d549554	2020-09-15 18:32:17 +00:00
Florian Hahn	142a546861	[ConstraintElimination] Add constraint elimination pass. This patch is a first draft of a new pass that adds a more flexible way to eliminate compares based on more complex constraints collected from dominating conditions. In particular, it aims at simplifying conditions of the forms below using a forward propagation approach, rather than instcomine-style ad-hoc backwards walking of def-use chains. if (x < y) if (y < z) if (x < z) <- simplify or if (x + 2 < y) if (x + 1 < y) <- simplify assuming no wraps The general approach is to collect conditions and blocks, sort them by dominance and then iterate over the sorted list. Conditions are turned into a linear inequality and add it to a system containing the linear inequalities that hold on entry to the block. For blocks, we check each compare against the system and see if it is implied by the constraints in the system. We also keep a stack of processed conditions and remove conditions from the stack and the constraint system once they go out-of-scope (= do not dominate the current block any longer). Currently there still are the least the following areas for improvements * Currently large unsigned constants cannot be added to the system (coefficients must be represented as integers) * The way constraints are managed currently is not very optimized. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D84547	2020-09-15 19:31:11 +01:00
Arthur Eubanks	6ec58c2371	[Bugpoint][NewPM] Pin bugpoint to legacy PM Bugpoint has lots of assumptions and hacks around the legacy PM, put off migrating it to NPM until later. Fixes tests under BugPoint under NPM. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D87655	2020-09-15 11:29:10 -07:00
Craig Topper	b5fb888606	[X86] Use Align in reduceMaskedLoadToScalarLoad/reduceMaskedStoreToScalarStore. Correct pointer info. If we offset the pointer, we also need to offset the pointer info Differential Revision: https://reviews.llvm.org/D87593	2020-09-15 11:22:02 -07:00
Arthur Eubanks	ff74782c01	[PostDominators][NewPM] Fix tests to work under NPM Each test has a legacy PM pinned to legacy PM and a NPM RUN line. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D87660	2020-09-15 11:19:01 -07:00
Arthur Eubanks	0328e43463	[NewPM][opt] Fix -globals-aa not being recognized as alias analysis in NPM Was missing MODULE_ALIAS_ANALYSIS, previously only FUNCTION_ALIAS_ANALYSIS was taken into account. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D87664	2020-09-15 11:18:19 -07:00
Arthur Eubanks	6c10abc35d	[RegionInfo][NewPM] Fix RegionInfo tests to work under NPM Pin RUN lines with -analyze to legacy PM, add corresponding NPM RUN line if missing. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D87658	2020-09-15 11:12:14 -07:00
Arthur Eubanks	b3cd1dc241	[DependenceAnalysis][NewPM] Fix tests to work under NPM All tests had corresponding NPM lines, simply pin non-NPM lines to legacy PM. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D87665	2020-09-15 11:11:23 -07:00
Arthur Eubanks	53307c47b6	[LoopAccessAnalysis][NewPM] Fix tests to work under NPM Pin RUN lines with -analyze to legacy PM, add corresponding NPM RUN lines. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D87662	2020-09-15 11:06:47 -07:00
Fangrui Song	21a91af6c8	[MemoryBuffer] Revert unintended MemoryBuffer change from D86996 Fixes SupportsTest MemoryBufferTest.mmapVolatileNoNull	2020-09-15 10:21:05 -07:00
Florian Hahn	207fc3a3d8	Revert "[DSE] Switch to MemorySSA-backed DSE by default." This reverts commit fb109c42d91c30c8c7497ef1fd7aff6f2969c6e7. Temporarily revert due to a mis-compile pointed out at D87163.	2020-09-15 18:07:56 +01:00
Petr Hosek	90f3b709ad	Revert "[DebugInfo] Remove dots from getFilenameByIndex return value" This is failing on Windows bots due to path separator normalization. This reverts commit 042c23506869b4ae9a49d2c4bc5ea6e6baeabe78.	2020-09-15 10:06:47 -07:00
Fangrui Song	306ab784d5	[VectorCombine] Don't vectorize scalar load under asan/hwasan/memtag/tsan Similar to the tsan suppression in `Utils/VNCoercion.cpp:getLoadLoadClobberFullWidthSize` (rL175034; load widening used by GVN), the D81766 optimization should be suppressed under tsan due to potential spurious data race reports: struct A { int i; const short s; // the load cannot be vectorized because int modify; // it overlaps with bytes being concurrently modified long pad1, pad2; }; // __tsan_read16 does not know that some bytes are undef and accessing is safe Similarly, under asan, users can mark memory regions with `__asan_poison_memory_region`. A widened load can lead to a spurious use-after-poison error. hwasan/memtag should be similarly suppressed. `mustSuppressSpeculation` suppresses asan/hwasan/tsan but not memtag, so we need to exclude memtag in `vectorizeLoadInsert`. Note, memtag suppression can be relaxed if the load is aligned to the its granule (usually 16), but that is out of scope of this patch. Reviewed By: spatel, vitalybuka Differential Revision: https://reviews.llvm.org/D87538	2020-09-15 09:47:21 -07:00
Jonas Devlieghere	a4b944f181	[lldb] Add -l/--language option to script command Make it possible to run the script command with a different language than currently selected. $ ./bin/lldb -l python (lldb) script -l lua >>> io.stdout:write("Hello, World!\n") Hello, World! When passing the language option and a raw command, you need to separate the flag from the script code with --. $ ./bin/lldb -l python (lldb) script -l lua -- io.stdout:write("Hello, World!\n") Hello, World! Differential revision: https://reviews.llvm.org/D86996	2020-09-15 09:40:17 -07:00
Simon Pilgrim	ad3567aae0	[X86][AVX] lowerShuffleWithSHUFPS - handle missed canonicalization cases. PR47534 exposes a case where calling lowerShuffleWithSHUFPS directly from a derived repeated mask (found by is128BitLaneRepeatedShuffleMask) results in us using an non-canonicalized mask. The missed canonicalization in this case is trivial - just commute the mask so we have more (swapped) LHS than RHS references so lowerShuffleWithSHUFPS can handle it.	2020-09-15 17:31:08 +01:00
Guozhi Wei	93598bb042	[MachineBasicBlock] Fix a typo in function copySuccessor The condition used to decide if need to copy probability should be reversed. Differential Revision: https://reviews.llvm.org/D87417	2020-09-15 09:18:18 -07:00

1 2 3 4 5 ...

203569 Commits