llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-17 02:38:16 +00:00

Author	SHA1	Message	Date
Gor Nishanov	0346aed75a	[Coroutines] Part 5: Add CGSCC restart trigger Summary: CoroSplit pass processes the coroutine twice. First, it lets it go through complete IPO optimization pipeline as a single function. It forces restart of the pipeline by inserting an indirect call to an empty function "coro.devirt.trigger" which is devirtualized by CoroElide pass that triggers a restart of the pipeline by CGPassManager. (In later patches, when CoroSplit pass sees the same coroutine the second time, it splits it up, adds coroutine subfunctions to the SCC to be processed by IPO pipeline.) Documentation and overview is here: http://llvm.org/docs/Coroutines.html. Upstreaming sequence (rough plan) 1.Add documentation. (https://reviews.llvm.org/D22603) 2.Add coroutine intrinsics. (https://reviews.llvm.org/D22659) 3.Add empty coroutine passes. (https://reviews.llvm.org/D22847) 4.Add coroutine devirtualization + tests. ab) Lower coro.resume and coro.destroy (https://reviews.llvm.org/D22998) c) Do devirtualization (https://reviews.llvm.org/D23229) 5.Add CGSCC restart trigger + tests. <= we are here 6.Add coroutine heap elision + tests. 7.Add the rest of the logic (split into more patches) Reviewers: mehdi_amini, majnemer Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D23234 llvm-svn: 277936	2016-08-06 20:44:39 +00:00
Craig Topper	73d1a08dfd	[AVX-512] Add SQRT/RCP14/RNDSCALE to hasUndefRegUpdate. llvm-svn: 277934	2016-08-06 19:31:52 +00:00
Craig Topper	d62f7e9da2	[AVX-512] Add AVX-512 scalar CVT instructions to hasUndefRegUpdate. llvm-svn: 277933	2016-08-06 19:31:50 +00:00
Craig Topper	f868922e18	[AVX-512] Add AVX512 run line to a test and re-generate the checks. Future commits will refine some of the sequences. llvm-svn: 277932	2016-08-06 19:31:47 +00:00
Craig Topper	600115d8de	[X86] Add VRCPSSr_Int, VRSQRTSSr_Int, VSQRTSSr_Int, and VSQRTSDr_Int to hasUndefRegUpdate. llvm-svn: 277931	2016-08-06 19:31:44 +00:00
Simon Pilgrim	78c8b55825	[X86][SSE] Enable commutation between MOVHLPS and UNPCKHPD Assuming SSE2 is available then we can safely commute between these, removing some unnecessary register moves and improving memory folding opportunities. VEX encoded versions don't benefit so I haven't added support to them. llvm-svn: 277930	2016-08-06 18:40:28 +00:00
Mike Aizatsky	4109691507	[libfuzzer] do not warn about missing pcbuffer functions: they are new. llvm-svn: 277927	2016-08-06 17:03:22 +00:00
Simon Pilgrim	4afd83c320	[X86][SSE] Regenerate SSE1 shuffle tests llvm-svn: 277925	2016-08-06 13:46:09 +00:00
Benjamin Kramer	4e1e12845a	[ARM] Don't copy MCInsts in loop. NFC. llvm-svn: 277924	2016-08-06 12:58:24 +00:00
Benjamin Kramer	90985d6d85	[Inliner] Use function_ref for functors which are never taken ownership of. llvm-svn: 277922	2016-08-06 12:33:46 +00:00
Benjamin Kramer	0c2ef5c636	[LoadCombine] Simplify code with a brace init. NFC. llvm-svn: 277921	2016-08-06 12:11:11 +00:00
Simon Pilgrim	e70f883bd5	[X86][SSE] Add 2 input shuffle support to matchBinaryVectorShuffle Not actually used yet... llvm-svn: 277919	2016-08-06 11:22:39 +00:00
Benjamin Kramer	a733725b3a	Move helpers into anonymous namespaces. NFC. llvm-svn: 277916	2016-08-06 11:13:10 +00:00
David Majnemer	be67decdd0	[CodeGen] Fix a -Wdocumentation warning A parameter was documented with the wrong name. No functionality change is intended. llvm-svn: 277915	2016-08-06 08:37:12 +00:00
David Majnemer	cbcaf6adec	[ValueTracking] Teach computeKnownBits about [su]min/max Reasoning about a select in terms of a min or max allows us to derive a tigher bound on the result. llvm-svn: 277914	2016-08-06 08:16:00 +00:00
David Majnemer	52836ff27d	[CallGraphSCCPass] Use an ArrayRef instead of a pair of iterators No functional change is intended. llvm-svn: 277913	2016-08-06 06:21:02 +00:00
Chandler Carruth	a0e0d32a56	[ADT] Make the triple test 1000x faster through more focused test cases. The current approach isn't a long-term viable pattern. Given the set of architectures A, vendors V, operating systems O, and environments E, it does \|A\| * \|V\| * \|O\| * \|E\| * 4! tests. As LLVM grows, this test keeps getting slower, despite my working very hard to make it get some "optimizations" even in -O0 builds in order to lower the constant factors. Fundamentally, we're doing an unreasonable amount of work.i Looking at the specific thing being tested -- the goal seems very clearly to be testing the permutations, not the combinations. The combinations are driving up the complexity much more than anything else. Instead, test every possible value for a given triple entry in every permutation of some triple. This really seems to cover the core goal of the test. Every single possible triple component is tested in every position. But because we keep the rest of the triple constant, it does so in a dramatically more scalable amount of time. With this model we do (\|A\| + \|V\| + \|O\| + \|E\|) * 4! tests. For me on a debug build, this goes from running for 19 seconds to 19 milliseconds, or a 1000x improvement. This makes a world of difference for the critical path of 'ninja check-llvm' and other extremely common workflows. Thanks to Renato, Dean, and David for the helpful review comments and helping me refine the explanation of the change. Differential Revision: https://reviews.llvm.org/D23156 llvm-svn: 277912	2016-08-06 06:00:50 +00:00
Sanjoy Das	aca3962ed4	[InstCombine] Don't coerce non-integral pointers to integers Reviewers: majnemer Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D23231 llvm-svn: 277910	2016-08-06 02:58:48 +00:00
Matthias Braun	61a94e86f0	Revert "(refs/bisect/bad) GVN-hoist: enable by default" GVN-Hoist appears to miscompile llvm-testsuite SingleSource/Benchmarks/Misc/fbench.c at the moment. I filed http://llvm.org/PR28880 This reverts commit r277786. llvm-svn: 277909	2016-08-06 02:23:15 +00:00
Gor Nishanov	d04999a10d	Part 4c: Coroutine Devirtualization: Devirtualize coro.resume and coro.destroy. Summary: This is the 4c patch of the coroutine series. CoroElide pass now checks if PostSplit coro.begin is referenced by coro.subfn.addr intrinsics. If so replace coro.subfn.addrs with an appropriate coroutine subfunction associated with that coro.begin. Documentation and overview is here: http://llvm.org/docs/Coroutines.html. Upstreaming sequence (rough plan) 1.Add documentation. (https://reviews.llvm.org/D22603) 2.Add coroutine intrinsics. (https://reviews.llvm.org/D22659) 3.Add empty coroutine passes. (https://reviews.llvm.org/D22847) 4.Add coroutine devirtualization + tests. ab) Lower coro.resume and coro.destroy (https://reviews.llvm.org/D22998) c) Do devirtualization <= we are here 5.Add CGSCC restart trigger + tests. 6.Add coroutine heap elision + tests. 7.Add the rest of the logic (split into more patches) Reviewers: majnemer Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D23229 llvm-svn: 277908	2016-08-06 02:16:35 +00:00
Nico Weber	187570f797	Revert r277896. It breaks ExecutionEngine/OrcLazy/weak-function.ll on most bots. Script: -- ... -- Exit Code: 1 Command Output (stderr): -- Could not find main function. llvm-svn: 277907	2016-08-06 02:00:45 +00:00
Kyle Butt	2437437aae	CodeGen: If Convert blocks that would form a diamond when tail-merged. The following function currently relies on tail-merging for if conversion to succeed. The common tail of cond_true and cond_false is extracted, and this then forms a diamond pattern that can be successfully if converted. If this block does not get extracted, either because tail-merging is disabled or the threshold is higher, we should still recognize this pattern and if-convert it. define i32 @t2(i32 %a, i32 %b) nounwind { entry: %tmp1434 = icmp eq i32 %a, %b ; <i1> [#uses=1] br i1 %tmp1434, label %bb17, label %bb.outer bb.outer: ; preds = %cond_false, %entry %b_addr.021.0.ph = phi i32 [ %b, %entry ], [ %tmp10, %cond_false ] %a_addr.026.0.ph = phi i32 [ %a, %entry ], [ %a_addr.026.0, %cond_false ] br label %bb bb: ; preds = %cond_true, %bb.outer %indvar = phi i32 [ 0, %bb.outer ], [ %indvar.next, %cond_true ] %tmp. = sub i32 0, %b_addr.021.0.ph %tmp.40 = mul i32 %indvar, %tmp. %a_addr.026.0 = add i32 %tmp.40, %a_addr.026.0.ph %tmp3 = icmp sgt i32 %a_addr.026.0, %b_addr.021.0.ph br i1 %tmp3, label %cond_true, label %cond_false cond_true: ; preds = %bb %tmp7 = sub i32 %a_addr.026.0, %b_addr.021.0.ph %tmp1437 = icmp eq i32 %tmp7, %b_addr.021.0.ph %indvar.next = add i32 %indvar, 1 br i1 %tmp1437, label %bb17, label %bb cond_false: ; preds = %bb %tmp10 = sub i32 %b_addr.021.0.ph, %a_addr.026.0 %tmp14 = icmp eq i32 %a_addr.026.0, %tmp10 br i1 %tmp14, label %bb17, label %bb.outer bb17: ; preds = %cond_false, %cond_true, %entry %a_addr.026.1 = phi i32 [ %a, %entry ], [ %tmp7, %cond_true ], [ %a_addr.026.0, %cond_false ] ret i32 %a_addr.026.1 } Without tail-merging or diamond-tail if conversion: LBB1_1: @ %bb @ =>This Inner Loop Header: Depth=1 cmp r0, r1 ble LBB1_3 @ BB#2: @ %cond_true @ in Loop: Header=BB1_1 Depth=1 subs r0, r0, r1 cmp r1, r0 it ne cmpne r0, r1 bgt LBB1_4 LBB1_3: @ %cond_false @ in Loop: Header=BB1_1 Depth=1 subs r1, r1, r0 cmp r1, r0 bne LBB1_1 LBB1_4: @ %bb17 bx lr With diamond-tail if conversion, but without tail-merging: @ BB#0: @ %entry cmp r0, r1 it eq bxeq lr LBB1_1: @ %bb @ =>This Inner Loop Header: Depth=1 cmp r0, r1 ite le suble r1, r1, r0 subgt r0, r0, r1 cmp r1, r0 bne LBB1_1 @ BB#2: @ %bb17 bx lr llvm-svn: 277905	2016-08-06 01:52:37 +00:00
Kyle Butt	5a3ac8d61b	IfConverter: Split ScanInstructions into 2 functions. ScanInstructions is now 2 functions: AnalyzeBranches and ScanInstructions. ScanInstructions also now takes a pair of arguments delimiting the instructions to be scanned. This will be used for forked diamond support to re-scan only a portion of the block. llvm-svn: 277904	2016-08-06 01:52:34 +00:00
Kyle Butt	f1dc880c22	IfConversion: Document countDuplicatedInstructions. NFC llvm-svn: 277903	2016-08-06 01:52:33 +00:00
Kyle Butt	c2f044ae3a	IfConversion: factor out 2 functions to skip debug instrs. NFC Skipping debug instructions occurrs repeatedly, factor it out. llvm-svn: 277902	2016-08-06 01:52:31 +00:00
Michael Zolotukhin	ca933d23c6	Revert "[LoopSimplify] Fix updating LCSSA after separating nested loops." This reverts commit r277877. Try to appease clang-x64-ninja-win7 buildbot. llvm-svn: 277901	2016-08-06 01:48:51 +00:00
Lang Hames	60cc270b53	[ORC] Add (partial) weak symbol support to the CompileOnDemand layer. This adds partial support for weak functions to the CompileOnDemandLayer by modifying the addLogicalModule method to check for existing stub definitions before building a new stub for a weak function. This scheme is sufficient to support ODR definitions, but fails for general weak definitions if strong definition is encountered after the first weak definition. (A more extensive refactor will be required to fully support weak symbols). This patch does not add weak symbol support to RuntimeDyld: I hope to add that in the near future. llvm-svn: 277896	2016-08-06 00:54:43 +00:00
Zachary Turner	acb9d58b0c	Fix a -Wunused-const-variable due to a bug in clang. llvm-svn: 277893	2016-08-06 00:13:32 +00:00
Sanjoy Das	58e4d175cb	[IRCE] Remove unused headers; NFC llvm-svn: 277892	2016-08-06 00:02:01 +00:00
Sanjoy Das	ac0f2a4d82	[IRCE] Preserve loop-simplify form Fixes PR28764. Right now there is no way to test this, but (as mentioned on the PR) with Michael Zolotukhin's yet to be checked in LoopSimplify verfier, 8 of the llvm-lit tests for IRCE crash. llvm-svn: 277891	2016-08-06 00:01:56 +00:00
Zachary Turner	f0e850c9cd	Resubmit "Make YAML support SmallVector" This resubmits a3770391c5fb64108d565e12f61dd77ce71b5b4f, which was reverted due to breakages on non-Windows machines. Due to differences in template instantiation rules on Microsoft and non-Microsoft platforms, a member access restriction was triggering on non-Microsoft compilers. Previously, a friend declaration for std::vector<> had been introduced into the DebugMap class to make the member access restriction pass, but the introduction of support for SmallVector<> meant that an additional friend declaration would need to be added. This didn't really make a lot of sense since the user of the macro is probably only using one type (SmallVector<>, vector<>, etc) and we could in theory add support for even more types to this macro in the future (e.g. std::deque), so rather than add another friend declaration, I just made the type being referenced a public nested typedef instead of a private nested typedef. llvm-svn: 277888	2016-08-05 23:12:31 +00:00
Sanjay Patel	cf40a88b7b	[InstCombine] refactor ctlz/cttz folds (NFCI) Note that this fold really belongs in InstSimplify. Refactoring here anyway as an intermediate step because there's a planned addition to this function in D23134. Differential Revision: https://reviews.llvm.org/D23223 llvm-svn: 277883	2016-08-05 22:42:46 +00:00
Justin Bogner	cf530c6c70	Revert "Make YAML support SmallVector" This breaks building dsymutil, causing my local build and many bots to fail. This reverts r277870. llvm-svn: 277881	2016-08-05 22:32:33 +00:00
Daniel Berlin	816ae4062c	[MSSA] Use depth first iterator instead of custom version. Summary: Originally the plan was to use the custom worklist to do some block popping, and because we don't actually need a visited set. The custom one we have here is slightly broken, and it's not worth fixing vs using depth_first_iterator since we aren't going to go the route we originally were. Fixes PR28874 Reviewers: george.burgess.iv Subscribers: llvm-commits, gberry Differential Revision: https://reviews.llvm.org/D23187 llvm-svn: 277880	2016-08-05 22:09:14 +00:00
Justin Bogner	049f0b1295	CodeView: Remove an unused variable It was breaking the -Werror build. llvm-svn: 277878	2016-08-05 21:57:10 +00:00
Michael Zolotukhin	fa30ea5db2	[LoopSimplify] Fix updating LCSSA after separating nested loops. This fixes PR28825. The problem was that we only checked if a value from a created inner loop is used in the outer loop, and fixed LCSSA for them. But we missed to fixup LCSSA for values used in exits of the outer loop. llvm-svn: 277877	2016-08-05 21:52:58 +00:00
Zachary Turner	d023c59def	Fix non portable include path. llvm-svn: 277876	2016-08-05 21:50:02 +00:00
Justin Bogner	ebdddbfed5	PowerPC: Add a triple to this test This is running opt without specifying a triple, which isn't correct. llvm-svn: 277875	2016-08-05 21:49:54 +00:00
Daniel Berlin	a948a2224e	[MSSA] Match assert vs llvm_unreachable style in verification functions. llvm-svn: 277873	2016-08-05 21:47:20 +00:00
Daniel Berlin	242310219e	Rewrite domination verifier to handle local domination as well. Summary: Rewrite domination verifier to handle local domination as well. This catches a bug Geoff Berry noticed. Reviewers: george.burgess.iv Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23184 llvm-svn: 277872	2016-08-05 21:46:52 +00:00
Zachary Turner	a3ce9cabee	[CodeView] Decouple record deserialization from visitor dispatch. Until now, our use case for the visitor has been to take a stream of bytes representing a type stream, deserialize the records in sequence, and do something with them, where "something" is determined by how the user implements a particular set of callbacks on an abstract class. For actually writing PDBs, however, we want to do the reverse. We have some kind of description of the list of records in their in-memory format, and we want to process each one. Perhaps by serializing them to a byte stream, or perhaps by converting them from one description format (Yaml) to another (in-memory representation). This was difficult in the current model because deserialization and invoking the callbacks were tightly coupled. With this patch we change this so that TypeDeserializer is itself an implementation of the particular set of callbacks. This decouples deserialization from the iteration over a list of records and invocation of the callbacks. TypeDeserializer is initialized with another implementation of the callback interface, so that upon deserialization it can pass the deserialized record through to the next set of callbacks. In a sense this is like an implementation of the Decorator design pattern, where the Deserializer is a decorator. This will be useful for writing Pdbs from yaml, where we have a description of the type records in Yaml format. In this case, the visitor implementation would have each visitation callback method implemented in such a way as to extract the proper set of fields from the Yaml, and it could maintain state that builds up a list of these records. Finally at the end we can pass this information through to another set of callbacks which serializes them into a byte stream. Reviewed By: majnemer, ruiu, rnk Differential Revision: https://reviews.llvm.org/D23177 llvm-svn: 277871	2016-08-05 21:45:34 +00:00
Zachary Turner	b8afc08016	Make YAML support SmallVector Currently YAML sequences require std::vectors. All of the methods that the YAML parser accesses though are present in SmallVector, so there's no reason we can't support SmallVector inherently. This patch does that. Reviewed By: majnemer Differential Revision: https://reviews.llvm.org/D23213 llvm-svn: 277870	2016-08-05 21:45:19 +00:00
Marek Olsak	e93050ba9d	AMDGPU/SI: Increase SGPR limit to 96 on Tonga/Iceland Summary: This is the setting of the Vulkan closed source driver. It decreases the max wave count from 10 to 8. 26010 shaders in 14650 tests Totals: VGPRS: 829593 -> 808440 (-2.55 %) Spilled SGPRs: 81878 -> 42226 (-48.43 %) Spilled VGPRs: 367 -> 358 (-2.45 %) Scratch VGPRs: 1764 -> 1748 (-0.91 %) dwords per thread Code Size: 36677864 -> 35923932 (-2.06 %) bytes There is a massive decrease in SGPR spilling in general and -7.4% spilled VGPRs for DiRT Showdown (= SGPRs spilled to scratch?) Reviewers: arsenm, tstellarAMD, nhaehnle Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: https://reviews.llvm.org/D23034 llvm-svn: 277867	2016-08-05 21:23:29 +00:00
Weiming Zhao	25d501fdb1	[ARM] Constant Materialize: imms with specific value can be encoded into mov.w Summary: Thumb2 supports encoding immediates with specific patterns into mov.w by splatting the low 8 bits into other bytes. I'm resubmitting this patch. The test case in the original commit r277610 does not specify triple, so builds with differnt default triple will have different output. This patch fixed trile as thumb-darwin-apple. Reviewers: john.brawn, jmolloy, bruno Subscribers: jmolloy, aemerson, rengolin, samparker, llvm-commits Differential Revision: https://reviews.llvm.org/D23090 llvm-svn: 277865	2016-08-05 20:58:29 +00:00
Davide Italiano	3bdeddf37c	[FlattenCFG] Simplify + remove unused variable. NFCI. llvm-svn: 277864	2016-08-05 20:53:35 +00:00
Dehao Chen	a21187d4ef	Remove cold callsite heuristic that is not necessary because of cold callee heuristic. llvm-svn: 277863	2016-08-05 20:49:04 +00:00
Sanjoy Das	4e9513d4ff	[SCEV] Un-grep'ify tests; NFC llvm-svn: 277861	2016-08-05 20:33:49 +00:00
Dehao Chen	3e893cb241	Replace hot-callsite based heuristic to use its own threshold parameter instead of share inline-hint parameter Summary: Hot callsites should have higher threshold than inline hints. This patch uses separate threshold parameter for hot callsites. Reviewers: davidxl, eraman Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D22368 llvm-svn: 277860	2016-08-05 20:28:41 +00:00
Mike Aizatsky	63896f700f	[sanitizers] trace buffer API to use user-allocated buffer. Differential Revision: https://reviews.llvm.org/D23185 llvm-svn: 277859	2016-08-05 20:09:53 +00:00
Mehdi Amini	76b7eb7e92	Update outdated comments in the new PM internals (NFC) The analysis manager was made not optional and turned into a reference instead of a pointer in r272978. Some comments were still refering to the previous behavior. llvm-svn: 277857	2016-08-05 19:51:00 +00:00

1 2 3 4 5 ...

136240 Commits