llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-03 00:47:07 +00:00

Author	SHA1	Message	Date
Florian Hahn	006733d526	[DSE,MemorySSA] Remove some duplicated test functions. Some tests from multibuild-malloc-free.ll do not actually use malloc or free and where split out to multiblock-throwing.ll, but not removed from the original file. This patch cleans that up. It also moves @test22 to simple.ll, because it does not involve multiple blocks.	2020-09-04 17:52:59 +01:00
Wei Wang	b71380c837	[OpenMPOpt] Assume indirect call always changes ICV When checking call sites, give special handling to indirect call, as the callee may be unknown and can lead to nullptr dereference later. Assume conservatively that the ICV always changes in such case. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D87104	2020-09-04 09:05:32 -07:00
Bryan Chan	a99283db0a	[EarlyCSE] Verify hash code in regression tests As discussed in D86843, -earlycse-debug-hash should be used in more regression tests to catch inconsistency between the hashing and the equivalence check. Differential Revision: https://reviews.llvm.org/D86863	2020-09-04 10:40:35 -04:00
Florian Hahn	756735e559	[DSE] Move legacy tests to DeadStoreElimination/MemDepAnalysis. This patch moves the tests for the old MemDepAnalysis based DSE implementation to the MemDepAnalysis subdirectory and updates them to pass -enable-dse-memoryssa=false. This is in preparation for the switch to MemorySSA-backed DSE.	2020-09-04 14:38:03 +01:00
Florian Hahn	f632c6e1cd	[MemCpyOpt] Account for case that MemInsertPoint == BI. In that case, the new MemoryDef needs to be inserted before MemInsertPoint.	2020-09-04 14:04:08 +01:00
Max Kazantsev	9f2ecc6f20	[Test] Range fix in test test02_neg is not testing what it claims to test because its starting value -1 lies outside of specified range.	2020-09-04 19:28:58 +07:00
Florian Hahn	587c7dcbfd	[DSE] Remove some dead code from DSE tests. Some tests depend on DSE removing dead instructions unrelated to any memory optimization. That's not really DSE's job, remove it.	2020-09-04 09:39:40 +01:00
Florian Hahn	ce6e59900a	[MemCpyOpt] Preserve MemorySSA. This patch updates MemCpyOpt to preserve MemorySSA. It uses the MemoryDef at the insertion point of the builder and inserts the new def after that def. In some cases, we just modify a memory instruction. In that case, get the defining access, then remove the memory access and add a new one. If the defining access is in a different block, insert a new def at the beginning of the current block, otherwise after the defining access. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D86651	2020-09-04 09:05:33 +01:00
Bryan Chan	c65dfd8c46	Replace CRLF with LF; NFC	2020-09-03 15:30:08 -04:00
Wenlei He	9ba54f747b	SVML support for log2 Although LLVM supports vectorization of loops containing log2, it did not support using SVML implementation of it. Added support so that when clang is invoked with -fveclib=SVML now an appropriate SVML library log2 implementation will be invoked. Follow up on: https://reviews.llvm.org/D77114 Tests: Added unit tests to svml-calls.ll, svml-calls-finite.ll. Can be run with llvm-lint. Created a simple c++ file that tests log2, and used clang+ to build it, and output final assembly. Reviewed By: wenlei, craig.topper Differential Revision: https://reviews.llvm.org/D86730	2020-09-03 11:52:29 -07:00
Sanjay Patel	421259677f	[InstCombine] canonicalize all commutative intrinsics with constant arg	2020-09-03 12:42:04 -04:00
Sanjay Patel	f122d1d01f	[InstCombine] add tests for commutative intrinsics; NFC	2020-09-03 12:42:04 -04:00
Sanjay Patel	80dc8a6aaa	[IR][GVN] add/allow commutative intrinsics with >2 args Follow-up to D86798 and rGe25449f.	2020-09-03 10:14:53 -04:00
Sanjay Patel	97f39fdd17	[GVN] add tests for >2 arg commutable intrinsics; NFC	2020-09-03 10:14:52 -04:00
Sanjay Patel	63cee4c52d	[EarlyCSE] add tests for fma/fmuladd; NFC	2020-09-03 09:11:54 -04:00
Max Kazantsev	247b72ded4	[Test] Add test showing some simple cases that IndVarSimplify does not cover	2020-09-03 18:35:26 +07:00
Florian Hahn	dc272f692f	[GVN] Preserve MemorySSA if it is available. Preserve MemorySSA if it is available before running GVN. DSE with MemorySSA will run closely after GVN. If GVN and 2 other passes preserve MemorySSA, DSE can re-use MemorySSA used by LICM when doing LTO. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D86534	2020-09-03 12:28:13 +01:00
Florian Hahn	7273f76cf8	Revert "[SCCP] Do not replace deref'able ptr with un-deref'able one." This reverts commit 3542feeb2077f267bff1ab98fb4bf20099f44bb8. This seems to be causing issues with a sanitizer build http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap-msan/builds/21677	2020-09-03 10:28:42 +01:00
Florian Hahn	fc0ecc3cd2	[SCCP] Do not replace deref'able ptr with un-deref'able one. Currently IPSCCP (and others like CVP/GVN) blindly propagate pointer equalities. In certain cases, that leads to dereferenceable pointers being replaced, as in the example test case. I think this is not allowed, as it introduces an access of an un-dereferenceable pointer. Note that the pointer is inbounds, but one past the last element, so it is valid, but not dereferenceable. This patch is mostly to highlight the issue and start a discussion. Currently it only checks for specifically looking one-past-the-last-element pointers with array typed bases. This causes the mis-compile outlined in https://stackoverflow.com/questions/55754313/is-this-gcc-clang-past-one-pointer-comparison-behavior-conforming-or-non-standar In the test case, if we replace %p with the GEP for the store, we subsequently determine that the store and the load cannot alias, because they are to different underlying objects. Note that Alive2 seems to think that the replacement is valid: https://alive2.llvm.org/ce/z/2rorhk Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D85332	2020-09-03 10:22:21 +01:00
Florian Hahn	1ced77a5a7	[SCCP] Add test where dereferenceable ptr is replaced with un-dereferenceable one	2020-09-03 10:06:51 +01:00
Craig Topper	639e60808d	[CodeGenPrepare][X86] Teach optimizeGatherScatterInst to turn a splat pointer into GEP with scalar base and 0 index This helps SelectionDAGBuilder recognize the splat can be used as a uniform base. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D86371	2020-09-02 20:44:12 -07:00
Eli Friedman	1e3d2f90ba	[InstCombine] Fix a couple crashes with extractelement on a scalable vector. Differential Revision: https://reviews.llvm.org/D86989	2020-09-02 18:02:07 -07:00
Huihui Zhang	dc1e85f7b4	[VectorCombine][SVE] Do not fold bitcast shuffle for scalable type. First, shuffle cost for scalable type is not known for scalable type; Second, we cannot reason if the narrowed shuffle mask for scalable type is a splat or not. E.g., Bitcast splat vector from type <vscale x 4 x i32> to <vscale x 8 x i16> will involve narrowing shuffle mask <vscale x 4 x i32> zeroinitializer to <vscale x 8 x i32> with element sequence of <0, 1, 0, 1, ...>, which cannot be reasoned if it's a valid splat or not. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D86995	2020-09-02 15:02:16 -07:00
Congzhe Cao	654fb2b9d5	[IPSCCP] Fix a bug that the "returned" attribute is not cleared when function is optimized to return undef In IPSCCP when a function is optimized to return undef, it should clear the returned attribute for all its input arguments and its corresponding call sites. The bug is exposed when the value of an input argument of the function is assigned to a physical register and because of the argument having a returned attribute, the value of this physical register will continue to be used as the function return value right after the call instruction returns, even if the value that this register holds may be clobbered during the function call. This potentially results in incorrect values being used afterwards. Reviewed By: jdoerfert, fhahn Differential Revision: https://reviews.llvm.org/D84220	2020-09-02 11:21:48 -04:00
David Stenberg	44cdc22e09	[GlobalOpt] Fix an incorrect Modified status When marking a global variable constant, and simplifying users using CleanupConstantGlobalUsers(), the pass could incorrectly return false if there were still some uses left, and no further optimizations was done. This was caught using the check introduced by D80916. This fixes PR46749. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D85837	2020-09-02 15:00:45 +02:00
Venkataramanan Kumar	36b1d49191	[InstCombine] Transform 1.0/sqrt(X) * X to X/sqrt(X) These transforms will now be performed irrespective of the number of uses for the expression "1.0/sqrt(X)": 1.0/sqrt(X) * X => X/sqrt(X) X * 1.0/sqrt(X) => X/sqrt(X) We already handle more general cases, and we are intentionally not creating extra (and likely expensive) fdiv ops in IR. This pattern is the exception to the rule because we always expect the Backend to reduce X/sqrt(X) to sqrt(X), if it has the necessary (reassoc) fast-math-flags. Ref: DagCombiner optimizes the X/sqrt(X) to sqrt(X). Differential Revision: https://reviews.llvm.org/D86726	2020-09-02 08:23:48 -04:00
Sanjay Patel	4e9822e551	[VectorCombine] allow vector loads with mismatched insert type This is an enhancement to D81766 to allow loading the minimum target vector type into an IR vector with a different number of elements. In one of the motivating tests from PR16739, SLP creates <2 x float> load ops mixed with <4 x float> insert ops, so we want to handle that pattern in addition to potential oversized vectors created by the vectorizers. For now, we are assuming the insert/extract subvector with undef is free because there is no exact corresponding TTI modeling for that. Differential Revision: https://reviews.llvm.org/D86160	2020-09-02 08:11:36 -04:00
Max Kazantsev	260b1e427d	[Test] Simplify test by removing unneeded variable	2020-09-02 18:39:43 +07:00
Shinji Okumura	ad9adda18c	[Attributor] Make use of AANoUndef in AAUndefinedBehavior This patch makes it possible for AAUB to use information from AANoUndef. This is the next patch of D86983 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D86984	2020-09-02 16:08:03 +09:00
Aaron Liu	a410101daf	[LV] Interleave to expose ILP for small loops with scalar reductions. Interleave for small loops that have reductions inside, which breaks dependencies and expose. This gives very significant performance improvements for some benchmarks. Because small loops could be in very hot functions in real applications. Differential Revision: https://reviews.llvm.org/D81416	2020-09-01 19:47:32 +00:00
Anh Tuyen Tran	c3a3d1596a	[LoopIdiomRecognizePass] Options to disable part or the entire Loop Idiom Recognize Pass Loop Idiom Recognize Pass (LIRP) attempts to transform loops with subscripted arrays into memcpy/memset function calls. In some particular situation, this transformation introduces negative impacts. For example: https://bugs.llvm.org/show_bug.cgi?id=47300 This patch will enable users to disable a particular part of the transformation, while he/she can still enjoy the benefit brought about by the rest of LIRP. The default behavior stays unchanged: no part of LIRP is disabled by default. Reviewed By: etiotto (Ettore Tiotto) Differential Revision: https://reviews.llvm.org/D86262	2020-09-01 13:59:24 +00:00
Sanjay Patel	41c06ff50d	[InstCombine] adjust recip sqrt tests for better coverage; NFC Goes with D86726	2020-09-01 09:43:42 -04:00
Max Kazantsev	6748e7476e	[Test] Move IndVars test to a proper place	2020-09-01 12:17:31 +07:00
Hamilton Tobon Mosquera	b5757ca2c4	[OpenMPOpt][HideMemTransfersLatency] Get values stored in offload arrays getValuesInOffloadArrays goes through the offload arrays in __tgt_target_data_begin_mapper getting the values stored in them before the call is issued. call void @__tgt_target_data_begin_mapper(arg0, arg1, i8 %offload_baseptrs, i8 %offload_ptrs, i64* %offload_sizes, ...) Diferential Revision: https://reviews.llvm.org/D86300	2020-08-31 15:33:05 -05:00
Craig Topper	0ce780b084	[X86][CGP] Add gather test cases for D86371.	2020-08-31 13:12:53 -07:00
Sanjay Patel	251968146e	[IR][GVN] allow intrinsics in Instruction's isCommutative query (2nd try) The 1st try was reverted because I missed an assert that needed softening. As discussed in D86798 / rG09652721 , we were potentially returning a different result for whether an Instruction is commutable depending on if we call the base class or derived class method. This requires relaxing asserts in GVN, but that pass seems to be working otherwise. NewGVN requires more work because it uses different code paths for numbering binops and calls.	2020-08-31 16:01:19 -04:00
Roman Lebedev	4aba95741f	[NFC][InstCombine] visitPHINode(): cleanup PHI CSE instruction replacement As @nikic is pointing out in https://reviews.llvm.org/rGbf21ce7b908e#inline-4647 this must be sufficient otherwise `EliminateDuplicatePHINodes()` would have hit issues with it already.	2020-08-31 22:29:39 +03:00
Craig Topper	94e25784f5	[X86][CGP] Pre-commit test cases for D86371.	2020-08-31 10:48:56 -07:00
Fangrui Song	e148cbff94	[Sink][test] Add nounwind test and properly test convergent	2020-08-30 18:26:42 -07:00
Venkataramanan Kumar	3f6f7f6b7b	[InstCombine] add extra-use tests for fmul+sqrt; NFC	2020-08-30 18:41:01 -04:00
Sanjay Patel	7d3ad258b7	[GVN] add another commutable intrinsic test; NFC This is a reduced version of a test-suite crasher with rG25597f7	2020-08-30 18:36:28 -04:00
Sanjay Patel	56bc7f03f4	Revert "[IR][GVN] allow intrinsics in Instruction's isCommutative query" This reverts commit 25597f7783e7038b8a2ee88bb49ac605b211b564. It is causing crashing on bots such as: http://lab.llvm.org:8011/builders/fuchsia-x86_64-linux/builds/10523/steps/ninja-build/logs/stdio	2020-08-30 17:02:51 -04:00
Sanjay Patel	d58c2f282d	[IR][GVN] allow intrinsics in Instruction's isCommutative query As discussed in D86798 / rG09652721 , we were potentially returning a different result for whether an Instruction is commutable depending on if we call the base class or derived class method. This requires relaxing an assert in GVN, but that pass seems to be working otherwise. NewGVN requires more work because it uses different code paths for numbering binops and calls.	2020-08-30 16:49:22 -04:00
Sanjay Patel	c84adaaae2	[NewGVN] add test for commutative intrinsic; NFC	2020-08-30 16:33:45 -04:00
Sanjay Patel	6a90c8322d	[GVN] add test for commutative intrinsic; NFC	2020-08-30 16:33:45 -04:00
Florian Hahn	11bbe69ccc	[LV] Update CFG before adding runtime checks. addRuntimeChecks uses SCEVExpander, which relies on the DT/LoopInfo to be up-to-date. Changing the CFG afterwards may invalidate some inserted instructions, especially LCSSA phis. Reorder the code to first update the CFG and then create the runtime checks. This should not have any impact on the generated code, as we adjust the CFG and generate runtime checks together. Fixes PR47343.	2020-08-30 18:21:44 +01:00
Nikita Popov	8aaf0731a7	[CVP] Regenerate test checks (NFC)	2020-08-30 16:23:59 +02:00
sstefan1	3610c7de35	Reland [OpenMPOpt] ICV tracking for calls The problem with module slice has been addressed in D86319 Introduce two new AAs. AAICVTrackerFunctionReturned which checks if a function can have a unique ICV value after it is finished, and AAICVCallSiteReturned which checks AAICVTrackerFunctionReturned for a call site. This enables us to check the value of a call and if it changes the ICV. This also changes the approach in `getReplacementValues()` to a worklist-based approach so we can explore all relevant BBs. Differential Revision: https://reviews.llvm.org/D85544	2020-08-30 11:27:48 +02:00
sstefan1	34f7bfd3c0	[Attributor] Introduce module slice. Summary: The module slice describes which functions we can analyze and transform while working on an SCC as part of the Attributor-CGSCC pass. So far we simply restricted it to the SCC. Reviewers: jdoerfert Differential Revision: https://reviews.llvm.org/D86319	2020-08-30 10:30:44 +02:00
sstefan1	1270c76171	[OpenMPOpt][NFC] add reproducer for problem found in D85544	2020-08-30 09:59:25 +02:00

1 2 3 4 5 ...

15950 Commits