llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-06 19:31:13 +00:00

Author	SHA1	Message	Date
Nick Lewycky	9adbd36737	Make some intrinsics safe to speculatively execute. llvm-svn: 147036	2011-12-21 05:52:02 +00:00
Andrew Trick	a1c4f73f87	Unit test for r146950: LSR postinc expansion, PR11571. llvm-svn: 146951	2011-12-20 01:43:20 +00:00
Joerg Sonnenberger	8cf8d64d19	Allow inlining of functions with returns_twice calls, if they have the attribute themselve. llvm-svn: 146851	2011-12-18 20:35:43 +00:00
Kevin Enderby	42fffe915a	Revert r146822 at Pete Cooper's request as it broke clang self hosting. Hope I did this correctly :) llvm-svn: 146834	2011-12-17 19:48:52 +00:00
Pete Cooper	0ec73f6e98	SimplifyCFG now predicts some conditional branches to true or false depending on previous branch on same comparison operands. For example, if (a == b) { if (a > b) // this is false Fixes some of the issues on <rdar://problem/10554090> llvm-svn: 146822	2011-12-17 06:32:38 +00:00
Pete Cooper	550b96ab46	Added InstCombine for "select cond, ~cond, x" type patterns These can be reduced to "~cond & x" or "~cond \| x" llvm-svn: 146624	2011-12-15 00:56:45 +00:00
Eli Friedman	5dd57bb40a	Make loop preheader insertion in LoopSimplify handle the case where the loop header is a landing pad correctly (by splitting the landingpad out of the loop header). Make some adjustments to the rest of LoopSimplify to make it clear that the rest of LoopSimplify isn't making bad assumptions about the presence of landing pads. PR11575. llvm-svn: 146621	2011-12-15 00:50:34 +00:00
Dan Gohman	1add31cc93	Move Instruction::isSafeToSpeculativelyExecute out of VMCore and into Analysis as a standalone function, since there's no need for it to be in VMCore. Also, update it to use isKnownNonZero and other goodies available in Analysis, making it more precise, enabling more aggressive optimization. llvm-svn: 146610	2011-12-14 23:49:11 +00:00
Andrew Trick	9c88f32f94	LSR: Fold redundant bitcasts on-the-fly. llvm-svn: 146597	2011-12-14 22:07:19 +00:00
Stepan Dyatkovskiy	14cb78c6fb	Fix for bug #11429 : Wrong behaviour for switches. Small improvement for code size heuristics. llvm-svn: 146578	2011-12-14 19:19:17 +00:00
Dan Gohman	e9572aa680	It turns out that clang does use pointer-to-function types to point to ARC-managed pointers sometimes. This fixes rdar://10551239. llvm-svn: 146577	2011-12-14 19:10:53 +00:00
Joerg Sonnenberger	5b25b4d437	Only replace fwrite with fputc, if the return value is unused. llvm-svn: 146411	2011-12-12 20:18:31 +00:00
Chandler Carruth	2bedf185c9	Manually upgrade the test suite to specify the flag to cttz and ctlz. I followed three heuristics for deciding whether to set 'true' or 'false': - Everything target independent got 'true' as that is the expected common output of the GCC builtins. - If the target arch only has one way of implementing this operation, set the flag in the way that exercises the most of codegen. For most architectures this is also the likely path from a GCC builtin, with 'true' being set. It will (eventually) require lowering away that difference, and then lowering to the architecture's operation. - Otherwise, set the flag differently dependending on which target operation should be tested. Let me know if anyone has any issue with this pattern or would like specific tests of another form. This should allow the x86 codegen to just iteratively improve as I teach the backend how to differentiate between the two forms, and everything else should remain exactly the same. llvm-svn: 146370	2011-12-12 11:59:10 +00:00
Andrew Trick	4f0b3bb42b	Add -unroll-runtime for unrolling loops with run-time trip counts. Patch by Brendon Cahoon! This extends the existing LoopUnroll and LoopUnrollPass. Brendon measured no regressions in the llvm test suite with -unroll-runtime enabled. This implementation works by using the existing loop unrolling code to unroll the loop by a power-of-two (default 8). It generates an if-then-else sequence of code prior to the loop to execute the extra iterations before entering the unrolled loop. llvm-svn: 146245	2011-12-09 06:19:40 +00:00
Nick Lewycky	d2c1661e9f	Fix infinite loop in DSE when deleting a free in a reachable loop that's also trivially infinite. llvm-svn: 146197	2011-12-08 22:36:35 +00:00
Andrew Trick	04c98888bc	LSR: prune undesirable formulae early. It's always good to prune early, but formulae that are unsatisfactory in their own right need to be removed before running any other pruning heuristics. We easily avoid generating such formulae, but we need them as an intermediate basis for forming other good formulae. llvm-svn: 145906	2011-12-06 03:13:31 +00:00
Chad Rosier	7096fea51c	Probably not a good idea to convert a single vector load into a memcpy. We don't do this now, but add a test case to prevent this from happening in the future. Additional test for rdar://9892684 llvm-svn: 145879	2011-12-06 00:19:08 +00:00
Chad Rosier	c50cbc5a65	Make the MemCpyOptimizer a bit more aggressive. I can't think of a scenerio where this would be bad as the backend shouldn't have a problem inlining small memcpys. rdar://10510150 llvm-svn: 145865	2011-12-05 22:37:00 +00:00
Nadav Rotem	1a91e4381d	Add support for vectors of pointers. llvm-svn: 145801	2011-12-05 06:29:09 +00:00
Pete Cooper	32e376f7e1	Fixed deadstoreelimination bug where negative indices were incorrectly causing the optimisation to occur Turns out long long + unsigned long long is unsigned. Doh! Fixes http://llvm.org/bugs/show_bug.cgi?id=11455 llvm-svn: 145731	2011-12-03 00:04:30 +00:00
Chad Rosier	d830d783e2	Add support for constant folding the pow intrinsic. rdar://10514247 llvm-svn: 145730	2011-12-03 00:00:03 +00:00
Chad Rosier	4d25975a28	Prevent library calls from being folded if -fno-builtin has been specified. rdar://10500969 llvm-svn: 145639	2011-12-01 22:14:50 +00:00
Pete Cooper	c708e83499	Improved fix for abs(val) != 0 to check other similar case. Also fixed style issues and confusing comment llvm-svn: 145618	2011-12-01 19:13:26 +00:00
Pete Cooper	d4569610df	Removed use of grep from test and moved it to be with other icmp tests llvm-svn: 145570	2011-12-01 04:35:26 +00:00
Pete Cooper	7e03b7250d	Added instcombine pattern to spot comparing -val or val against 0. (val != 0) == (-val != 0) so "abs(val) != 0" becomes "val != 0" Fixes <rdar://problem/10482509> llvm-svn: 145563	2011-12-01 03:58:40 +00:00
Andrew Trick	8da55f9048	Better test case found in duplicate PR10570. llvm-svn: 145484	2011-11-30 06:26:42 +00:00
Andrew Trick	247f749767	LSR: handle the expansion of phi operands that use postinc forms of the IV. Fixes PR11431: SCEVExpander::expandAddRecExprLiterally(const llvm::SCEVAddRecExpr*): Assertion `(!isa<Instruction>(Result) \|\| SE.DT->dominates(cast<Instruction>(Result), Builder.GetInsertPoint())) && "postinc expansion does not dominate use"' failed. llvm-svn: 145482	2011-11-30 06:07:54 +00:00
Chad Rosier	c5fa9f413a	Add support for sqrt, sqrtl, and sqrtf in TargetLibraryInfo. Disable (fptrunc (sqrt (fpext x))) -> (sqrtf x) transformation if -fno-builtin is specified. rdar://10466410 llvm-svn: 145460	2011-11-29 23:57:10 +00:00
Duncan Sands	97cc6da56c	Fix a theoretical problem (not seen in the wild): if different instances of a weak variable are compiled by different compilers, such as GCC and LLVM, while LLVM may increase the alignment to the preferred alignment there is no reason to think that GCC will use anything more than the ABI alignment. Since it is the GCC version that might end up in the final program (as the linkage is weak), it is wrong to increase the alignment of loads from the global up to the preferred alignment as the alignment might only be the ABI alignment. Increasing alignment up to the ABI alignment might be OK, but I'm not totally convinced that it is. It seems better to just leave the alignment of weak globals alone. llvm-svn: 145413	2011-11-29 18:26:38 +00:00
Andrew Trick	4c2bb1ade3	Reenable this IndVars unit test. SCEV can't optimize undef in all cases, which is a separate issue from this test case. llvm-svn: 145343	2011-11-29 00:52:04 +00:00
Eli Friedman	bc47555417	Add a missing safety check to ProcessUGT_ADDCST_ADD. Fixes PR11438. llvm-svn: 145316	2011-11-28 23:32:19 +00:00
Eli Friedman	473a76a0df	Make SelectionDAG::InferPtrAlignment use llvm::ComputeMaskedBits instead of duplicating the logic for globals. Make llvm::ComputeMaskedBits handle GlobalVariables slightly more aggressively, to match what InferPtrAlignment knew how to do. llvm-svn: 145304	2011-11-28 22:48:22 +00:00
Chris Lattner	c050288d89	remove a test that is using old-style llvm.dbg intrinsics, apparently only fails on ppc and arm hosts. llvm-svn: 145188	2011-11-27 18:13:47 +00:00
Chris Lattner	84bf52737a	remove autoupgrade support for old forms of llvm.prefetch and the old trampoline forms. Both of these were correct in LLVM 3.0, and we don't need to support LLVM 2.9 and earlier in mainline. llvm-svn: 145174	2011-11-27 07:42:04 +00:00
Chris Lattner	9d1e8420ff	Upgrade syntax of tests using volatile instructions to use 'load volatile' instead of 'volatile load', which is archaic. llvm-svn: 145171	2011-11-27 06:54:59 +00:00
Chris Lattner	011a5bf0aa	remove autoupgrade support for really old-style debug info intrinsics. I think this is the last of autoupgrade that can be removed in 3.1. Can the atomic upgrade stuff also go? llvm-svn: 145169	2011-11-27 06:18:33 +00:00
Chandler Carruth	2bf0dccc04	FileCheck-ize this test and make it more precise. This is in preparation for adding other tests. llvm-svn: 145143	2011-11-26 08:24:25 +00:00
Richard Smith	d647537b9c	Correctly byte-swap APInts with bit-widths greater than 64. llvm-svn: 145111	2011-11-23 21:33:37 +00:00
Duncan Sands	3c1878ef53	Fix a crash in which a multiplication was being reported as being both negative and positive: positive, because it could be directly computed to be positive; negative, because the nsw flags means it is either negative or undefined (the multiplication always overflowed). llvm-svn: 145104	2011-11-23 16:26:47 +00:00
Nick Lewycky	566ea855fd	Fix crasher in GVN due to my recent capture tracking changes. llvm-svn: 145047	2011-11-21 19:42:56 +00:00
Benjamin Kramer	1967d4e20b	XFAIL this test until I figure out what indvars is doing here (or find someone who does) llvm-svn: 145008	2011-11-20 11:10:03 +00:00
Andrew Trick	fe5f7fc3b8	Fix a corner case in updating LoopInfo after fully unrolling an outer loop. The loop tree's inclusive block lists are painful and expensive to update. (I have no idea why they're inclusive). The design was supposed to handle this case but the implementation missed it and my unit tests weren't thorough enough. Fixes PR11335: loop unroll update. llvm-svn: 144970	2011-11-18 03:42:41 +00:00
Andrew Trick	7dc21d8c0e	Fix an overly general check in SimplifyIndvar to handle useless phi cycles. The right way to check for a binary operation is cast<BinaryOperator>. The original check: cast<Instruction> && numOperands() == 2 would match phi "instructions", leading to an infinite loop in extreme corner case: a useless phi with operands [self, constant] that prior optimization passes failed to remove, being used in the loop by another useless phi, in turn being used by an lshr or udiv. Fixes PR11350: runaway iteration assertion. llvm-svn: 144935	2011-11-17 23:36:35 +00:00
Eli Friedman	d02d82d355	Add support for custom names for library functions in TargetLibraryInfo. Add a custom name for fwrite and fputs on x86-32 OSX. Make SimplifyLibCalls honor the custom names for fwrite and fputs. Fixes <rdar://problem/9815881>. llvm-svn: 144876	2011-11-17 01:27:36 +00:00
Nick Lewycky	29efc8f15d	Fix typo in test. llvm-svn: 144774	2011-11-16 03:56:38 +00:00
Nick Lewycky	ff690249a9	Merge isObjectPointerWithTrustworthySize with getPointerSize. Use it when looking at the size of the pointee. Fixes PR11390! llvm-svn: 144773	2011-11-16 03:49:48 +00:00
Andrew Trick	fe618116fc	Fix SCEV overly optimistic back edge taken count for multi-exit loops. Fixes PR11375: Different results for 'clang++ huh.cpp'... llvm-svn: 144746	2011-11-16 00:52:40 +00:00
Nick Lewycky	a0b2f7ca1d	Refactor capture tracking (which already had a couple flags for whether returns and stores capture) to permit the caller to see each capture point and decide whether to continue looking. Use this inside memdep to do an analysis that basicaa won't do. This lets us solve another devirtualization case, fixing PR8908! llvm-svn: 144580	2011-11-14 22:49:42 +00:00
Nick Lewycky	772024a00d	Don't try to loop on iterators that are potentially invalidated inside the loop. Fixes PR11361! llvm-svn: 144454	2011-11-12 03:09:12 +00:00
Eli Friedman	a83fbaff5f	Make sure scalarrepl picks the correct alloca when it rewrites a bitcast. Fixes PR11353. llvm-svn: 144442	2011-11-12 02:07:50 +00:00
Eli Friedman	127d98ab35	Get rid of an optimization in SCCP which appears to have many issues. Specifically, it doesn't handle many cases involving undef correctly, and it is missing other checks which lead to it trying to re-mark a value marked as a constant with a different value. It also appears to trigger very rarely. Fixes PR11357. llvm-svn: 144352	2011-11-11 01:16:15 +00:00
Pete Cooper	38700a1201	DeadStoreElimination can now trim the size of a store if the end of the store is dead. Currently checks alignment and killing stores on a power of 2 boundary as this is likely to trim the size of the earlier store without breaking large vector stores into scalar ones. Fixes <rdar://problem/10140300> llvm-svn: 144239	2011-11-09 23:07:35 +00:00
Eli Friedman	6bda990650	Fix code to match comment. Fixes PR11340, a regression from r143209. llvm-svn: 144121	2011-11-08 21:08:02 +00:00
Pete Cooper	a85aa24d64	LICM pass now understands invariant load metadata. Nothing generates this yet so it will currently never get used in real tests llvm-svn: 144107	2011-11-08 19:30:00 +00:00
Bill Wendling	a855903bda	Convert to the new EH model. llvm-svn: 144050	2011-11-08 00:23:01 +00:00
Nick Lewycky	7ea3dd8ae5	Do simple cross-block DSE when we encounter a free statement. Fixes PR11240. llvm-svn: 143808	2011-11-05 10:48:42 +00:00
Dan Gohman	e689158987	Add tests for existing InstSimplify features. llvm-svn: 143721	2011-11-04 18:39:16 +00:00
Dan Gohman	19a8523a2f	Teach instsimplify to simplify calls to undef. llvm-svn: 143719	2011-11-04 18:32:42 +00:00
Daniel Dunbar	0193e03f99	Speculatively revert "DeadStoreElimination can now trim the size of a store if the end of it is dead.", which appears to break bootstrapping LLVM. llvm-svn: 143668	2011-11-04 00:48:26 +00:00
Pete Cooper	4902705b5f	DeadStoreElimination can now trim the size of a store if the end of it is dead. Only currently done if the later store is writing to a power of 2 address or has the same alignment as the earlier store as then its likely to not break up large stores into smaller ones Fixes <rdar://problem/10140300> llvm-svn: 143630	2011-11-03 18:01:56 +00:00
Andrew Trick	3c1e831108	Rewrite LinearFunctionTestReplace to handle pointer-type IVs. We've been hitting asserts in this code due to the many supported combintions of modes (iv-rewrite/no-iv-rewrite) and IV types. This second rewrite of the code attempts to deal with these cases systematically. llvm-svn: 143546	2011-11-02 17:19:57 +00:00
Andrew Trick	c9baf3a7a1	Broaden an assert to handle enable-iv-rewrite=true following r143183. Narrowest possible fix for PR11279. llvm-svn: 143522	2011-11-02 00:02:45 +00:00
Eli Friedman	676558ae92	Make sure we use the right insertion point when instcombine replaces a PHI with another instruction. (Specifically, don't insert an arbitrary instruction before a PHI.) Fixes PR11275. llvm-svn: 143437	2011-11-01 04:49:29 +00:00
Duncan Sands	1077c1fa88	Reapply commit 143214 with a fix: m_ICmp doesn't match conditions with the given predicate, it matches any condition and returns the predicate - d'oh! Original commit message: The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143318	2011-10-30 19:56:36 +00:00
Benjamin Kramer	d32c541fe4	SimplifyLibCalls: Use IRBuilder.CreateGlobalString when creating a string for printf->puts, which correctly sets the unnamed_addr bit on the resulting GlobalVariable. Fixes PR11264. llvm-svn: 143289	2011-10-29 19:43:31 +00:00
Eli Friedman	7c9bef9ba8	Revert r143214; it's breaking a bunch of stuff. llvm-svn: 143265	2011-10-29 00:56:07 +00:00
Duncan Sands	7791a854c3	The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143214	2011-10-28 19:01:20 +00:00
Duncan Sands	3483c23658	A shift of a power of two is a power of two or zero. For completeness - not spotted in the wild. llvm-svn: 143211	2011-10-28 18:30:05 +00:00
Duncan Sands	5730fe6a31	Fold icmp ugt (udiv X, Y), X to false. Spotted by my super-optimizer in 186.crafty. llvm-svn: 143209	2011-10-28 18:17:44 +00:00
Andrew Trick	77532be5e0	LFTR should avoid a type mismatch with null pointer IVs. Fixes rdar://10359193 Indvar LinearFunctionTestReplace assertion llvm-svn: 143183	2011-10-28 03:45:11 +00:00
Duncan Sands	ca325638c8	Reapply commit 143028 with a fix: the problem was casting a ConstantExpr Mul using BinaryOperator (which only works for instructions) when it should have been a cast to OverflowingBinaryOperator (which also works for constants). While there, correct a few other dubious looking uses of BinaryOperator. Thanks to Chad Rosier for the testcase. Original commit message: My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y \| 1). This occurs in 464.h264ref. llvm-svn: 143125	2011-10-27 19:16:21 +00:00
Bob Wilson	2ca603d9b7	Revert Duncan's r143028 expression folding which appears to be the culprit behind a compile failure on 483.xalancbmk. llvm-svn: 143102	2011-10-27 15:47:25 +00:00
Eli Friedman	e6918ac01a	It is not safe to sink an alloca into a stacksave/stackrestore pair, so don't do that. <rdar://problem/10352360> llvm-svn: 143093	2011-10-27 01:33:51 +00:00
Duncan Sands	5c8fa99c32	The maximum power of 2 dividing a power of 2 is itself. This occurs in 403.gcc and was spotted by my super-optimizer. llvm-svn: 143054	2011-10-26 20:55:21 +00:00
Duncan Sands	c463f54342	My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y \| 1). This occurs in 464.h264ref. llvm-svn: 143028	2011-10-26 15:31:51 +00:00
Nick Lewycky	4d47e224d7	A dead malloc, a free(NULL) and a free(undef) are all trivially dead instructions. This doesn't introduce any optimizations we weren't doing before (except potentially due to pass ordering issues), now passes will eliminate them sooner as part of their own cleanups. llvm-svn: 142787	2011-10-24 04:35:36 +00:00
Cameron Zwarich	2dd06afcf5	The element insertion code in scalar replacement doesn't handle incorrect element types, even though the element extraction code does. It is surprising that this bug has been here for so long. Fixes <rdar://problem/10318778>. llvm-svn: 142740	2011-10-23 07:02:10 +00:00
Nick Lewycky	1d759dcde7	Oops! Fix test I forgot to submit as part of r142735. llvm-svn: 142736	2011-10-22 22:07:31 +00:00
Nick Lewycky	25e5f6896b	A non-escaping malloc in the entry block is not unlike an alloca. Do dead-store elimination on them too. llvm-svn: 142735	2011-10-22 21:59:35 +00:00
Eli Friedman	5012ac7cc0	Remap blockaddress correctly when inlining a function. Fixes PR10162. llvm-svn: 142684	2011-10-21 20:45:19 +00:00
Eli Friedman	fb0b9216e1	Extend instcombine's shufflevector simplification to handle more cases where the input and output vectors have different sizes. Patch by Xiaoyi Guo. llvm-svn: 142671	2011-10-21 19:06:29 +00:00
Eli Friedman	e8f8cf1f33	Refactor code from inlining and globalopt that checks whether a function definition is unused, and enhance it so it can tell that functions which are only used by a blockaddress are in fact dead. This probably doesn't happen much on most code, but the Linux kernel's _THIS_IP_ can trigger this issue with blockaddress. (GlobalDCE can also handle the given tescase, but we only run that at -O3.) Found while looking at PR11180. llvm-svn: 142572	2011-10-20 05:23:42 +00:00
Nick Lewycky	21a67a1454	"@string = constant i8 0" is a value i8* string of length zero. Analyze that correctly in GetStringLength, fixing PR11181! llvm-svn: 142558	2011-10-20 00:34:35 +00:00
Dan Gohman	5e2d8538d7	Teach the ARC optimizer about the !clang.arc.copy_on_escape metadata tag on objc_retainBlock calls, which indicates that they may be optimized away. rdar://10211286. llvm-svn: 142298	2011-10-17 22:53:25 +00:00
Lang Hames	5ef0a146b9	Fixed quoting on default data layout option. llvm-svn: 142286	2011-10-17 21:54:43 +00:00
Bill Wendling	2c5486d770	Add support for the Objective-C personality function to the instruction combining of the landingpad instruction. The ObjC personality function acts almost identically to the C++ personality function. In particular, it uses "null" as a "catch-all" value. llvm-svn: 142256	2011-10-17 21:20:24 +00:00
Dan Gohman	13624a6c83	Suppress partial retain+release elimination when there's a possibility that it will span multiple CFG diamonds/triangles which could have different controlling predicates. rdar://10282956 llvm-svn: 142222	2011-10-17 18:48:25 +00:00
Bill Wendling	584c5f9c62	Correct over-zealous removal of hack. Some code want to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	c5372de48f	Temporarily XFAIL waiting for a fix. llvm-svn: 142215	2011-10-17 18:25:32 +00:00
Chandler Carruth	9c33ff8a8b	Add a routine to swap branch instruction operands, and update any profile metadata at the same time. Use it to preserve metadata attached to a branch when re-writing it in InstCombine. Add metadata to the canonicalize_branch InstCombine test, and check that it is tranformed correctly. Reviewed by Nick Lewycky! llvm-svn: 142168	2011-10-17 01:11:57 +00:00
Nick Lewycky	f590cdf15e	Oops! Fix testcase. llvm-svn: 142151	2011-10-16 20:20:15 +00:00
Nick Lewycky	c8b7f776e6	When looking for dependencies on the src pointer, scan the src pointer. Scanning on the memcpy call will pull up other unrelated stuff. Fixes PR11142. llvm-svn: 142150	2011-10-16 20:13:32 +00:00
Andrew Trick	0ef2965563	Fix SCEVExpander assert during LSR: "argument of incompatible type". Just because we're dealing with a GEP doesn't mean we can assert the SCEV has a pointer type. The fix is simply to ignore the SCEV pointer type, which we really didn't need. Fixes PR11138 webkit crash. llvm-svn: 142058	2011-10-15 06:19:55 +00:00
Andrew Trick	923129b028	Reapply r141870, SCEV expansion of post-inc. Speculatively reapply to see if this test case still crashes on linux. I may have fixed it in my last checkin. llvm-svn: 141895	2011-10-13 21:55:29 +00:00
Andrew Trick	109f7dbd1e	Revert r141870. The test case crashes on linux with data corruption. A deeper issue was exposed. llvm-svn: 141873	2011-10-13 17:58:24 +00:00
Andrew Trick	05d7cb17d5	LSR: Reuse the post-inc expansion of expressions. This avoids unnecessary expansion of expressions and allows the SCEV expander to work on expression DAGs, not just trees. Fixes PR11090. llvm-svn: 141870	2011-10-13 17:31:47 +00:00
Lang Hames	069669eb13	Removed colons from some target datalayout strings in test, since they don't match the required format. llvm-svn: 141825	2011-10-12 22:24:17 +00:00
Cameron Zwarich	fac176ac51	Fix PR11106 by correcting a typo that has been in the code for over a year. This would have never worked, since the element type of a vector type is never a vector type. Also fix the conditional to be more direct in checking whether EltTy is a vector type. llvm-svn: 141713	2011-10-11 21:26:40 +00:00
Cameron Zwarich	211901eb9f	Add a test for PR10565. llvm-svn: 141647	2011-10-11 06:10:37 +00:00
Cameron Zwarich	a34d748f83	Remove a lot of the fancy scalar replacement code for dealing with llvm-gcc's lowering of NEON code. It provides little-to-no benefit now and only introduces additional complexity. llvm-svn: 141646	2011-10-11 06:10:30 +00:00
Andrew Trick	d36852e6b1	Move replaceCongruentIVs into SCEVExapander and bias toward "expanded" IVs. Indvars previously chose randomly between congruent IVs. Now it will bias the decision toward IVs that SCEVExpander likes to create. This was not done to fix any problem, it's just a welcome side effect of factoring code. llvm-svn: 141633	2011-10-11 02:28:51 +00:00
Lang Hames	386b01379a	Added a testcase for r141599, rdar://problem/10063881. llvm-svn: 141628	2011-10-11 01:32:10 +00:00
Andrew Trick	9d4d1281ad	Unit test for LSR phi reuse in r141442. llvm-svn: 141472	2011-10-08 02:34:51 +00:00
Duncan Sands	559ef2f491	Teach GVN to also propagate switch cases. For example, in this code switch (n) { case 27: do_something(x); ... } the call do_something(x) will be replaced with do_something(27). In gcc-as-one-big-file this results in the removal of about 500 lines of bitcode (about 0.02%), so has about 1/10 of the effect of propagating branch conditions. llvm-svn: 141360	2011-10-07 08:29:06 +00:00
Eli Friedman	dd48bb30de	PR11061: Make simplifylibcalls fold strcmp("", x) correctly. While I'm here, fix the related issue with strncmp, add some actual tests for strcmp and strncmp, and start using StringRef::compare for constant folding instead of using strcmp/strncmp so that the optimized IR isn't dependent on the host's implementation of strcmp. llvm-svn: 141227	2011-10-05 22:27:16 +00:00
Jim Grosbach	254b9ed208	Revert 141203. InstCombine is looping on unit tests. llvm-svn: 141209	2011-10-05 20:44:29 +00:00
Rafael Espindola	8247f7a5dd	Check for the returns_twice attribute in callsFunctionThatReturnsTwice. This fixes PR11038, but there are still some cleanups to be done. llvm-svn: 141204	2011-10-05 20:05:13 +00:00
Jim Grosbach	a03dd9189f	Update InstCombine worklist after instruction transform is complete. When updating the worklist for InstCombine, the Add/AddUsersToWorklist functions may access the instruction(s) being added, for debug output for example. If the instructions aren't yet added to the basic block, this can result in a crash. Finish the instruction transformation before adjusting the worklist instead. rdar://10238555 llvm-svn: 141203	2011-10-05 20:05:00 +00:00
Duncan Sands	f7df28c1f5	GVN does simple propagation of conditions: when it sees a conditional branch "br i1 %x, label %if_true, label %if_false" then it replaces "%x" with "true" in places only reachable via the %if_true arm, and with "false" in places only reachable via the %if_false arm. Except that actually it doesn't: if value numbering shows that %y is equal to %x then, yes, %y will be turned into true/false in this way, but any occurrences of %x itself are not transformed. Fix this. What's more, it's often the case that %x is an equality comparison such as "%x = icmp eq %A, 0", in which case every occurrence of %A that is only reachable via the %if_true arm can be replaced with 0. Implement this and a few other variations on this theme. This reduces the number of lines of LLVM IR in "GCC as one big file" by 0.2%. It has a bigger impact on Ada code, typically reducing the number of lines of bitcode by around 0.4% by removing repeated compiler generated checks. Passes the LLVM nightly testsuite and the Ada ACATS testsuite. llvm-svn: 141177	2011-10-05 14:28:49 +00:00
Duncan Sands	348e8c285a	Generalize GVN's conditional propagation logic slightly: it's OK for the false/true destination to have multiple predecessors as long as the extra ones are dominated by the branch destination. llvm-svn: 141176	2011-10-05 14:17:01 +00:00
Andrew Trick	c60e2addd9	LSR should avoid redundant edge splitting. This handles the case in which LSR rewrites an IV user that is a phi and splits critical edges originating from a switch. Fixes <rdar://problem/6453893> LSR is not splitting edges "nicely" llvm-svn: 141059	2011-10-04 03:50:44 +00:00
Andrew Trick	f9b98a3c3e	Unit test for r140919, loop unroll heuristics. llvm-svn: 141049	2011-10-04 00:07:02 +00:00
Rafael Espindola	4700f53cee	Add the returns_twice attribute to LLVM. llvm-svn: 141001	2011-10-03 14:45:37 +00:00
Nick Lewycky	7cd1bfb89d	Add a new icmp+select optz'n. Also shows off the load(cst) folding added in r140966. llvm-svn: 140969	2011-10-02 10:37:37 +00:00
Jim Grosbach	d35eaaeb6e	float comparison to double 'zero' constant can just be a float 'zero.' InstCombine was incorrectly considering the conversion of the constant zero to be unsafe. We want to transform: define float @bar(float %x) nounwind readnone optsize ssp { %conv = fpext float %x to double %cmp = fcmp olt double %conv, 0.000000e+00 %conv1 = zext i1 %cmp to i32 %conv2 = sitofp i32 %conv1 to float ret float %conv2 } Into: define float @bar(float %x) nounwind readnone optsize ssp { %cmp = fcmp olt float %x, 0.000000e+00 ; <---- This %conv1 = zext i1 %cmp to i32 %conv2 = sitofp i32 %conv1 to float ret float %conv2 } rdar://10215914 llvm-svn: 140869	2011-09-30 18:45:50 +00:00
Duncan Sands	14b9c222c6	Add forgotten tests that the cleanup flag is cleared if there is a catch-all landingpad clause. llvm-svn: 140858	2011-09-30 17:00:34 +00:00
Duncan Sands	b4c8b2d9fa	Inlining often produces landingpad instructions with repeated catch or repeated filter clauses. Teach instcombine a bunch of tricks for simplifying landingpad clauses. Currently the code only recognizes the GNU C++ and Ada personality functions, but that doesn't stop it doing a bunch of "generic" transforms which are hopefully fine for any real-world personality function. If these "generic" transforms turn out not to be generic, they can always be conditioned on the personality function. Probably someone should add the ObjC++ personality function. I didn't as I don't know anything about it. llvm-svn: 140852	2011-09-30 13:12:16 +00:00
Dan Gohman	7dca165e93	When eliminating unnecessary retain+autorelease on return values, handle the case where the retain is in a different basic block. rdar://10210274. llvm-svn: 140815	2011-09-29 22:27:34 +00:00
Dan Gohman	2df048a94b	Don't eliminate objc_retainBlock calls on stack objects if the objc_retainBlock call is potentially responsible for copying the block to the heap to extend its lifetime. rdar://10209613. llvm-svn: 140814	2011-09-29 22:25:23 +00:00
Andrew Trick	5167b59724	Test case for r140670: indvars should hoist sext. llvm-svn: 140671	2011-09-28 02:13:32 +00:00
Benjamin Kramer	355b353595	Stop emitting instructions with the name "tmp" they eat up memory and have to be uniqued, without any benefit. If someone prefers %tmp42 to %42, run instnamer. llvm-svn: 140634	2011-09-27 20:39:19 +00:00
Bill Wendling	589336d854	Split the landing pad basic block with the correct function. Also merge the split landingpad instructions into a PHI node. PR11016 llvm-svn: 140592	2011-09-27 00:59:31 +00:00
Eli Friedman	56e68f7271	Convert more tests over to the new atomic instructions. llvm-svn: 140559	2011-09-26 20:27:49 +00:00
Eli Friedman	ed450a3064	Upgrade a couple more tests to the new atomic instructions. llvm-svn: 140558	2011-09-26 20:15:56 +00:00
Eli Friedman	6e15091fc6	PR10987: add a missed safety check to isSafePHIToSpeculate in scalarrepl. llvm-svn: 140327	2011-09-22 18:56:30 +00:00
Eli Friedman	2599a202e2	Make sure IPSCCP never marks a tracked call as overdefined in SCCPSolver::ResolvedUndefsIn. If we do, we can end up in a situation where a function is resolved to return a constant, but the caller is marked overdefined, which confuses the code later. <rdar://problem/9956541> (again). llvm-svn: 140210	2011-09-20 23:28:51 +00:00
Bill Wendling	296896eb2d	Update this test to the new EH model. Though I think it may be obsolete with the loop extract changes. And I couldn't get the old version of LLVM to compile so that I could reduce this testcase. llvm-svn: 140197	2011-09-20 22:29:43 +00:00
Bill Wendling	19257face9	Update to new EH model. llvm-svn: 140177	2011-09-20 20:21:16 +00:00
Bill Wendling	58db2d7708	Update to the new EH syntax. llvm-svn: 140103	2011-09-19 23:50:34 +00:00
Bill Wendling	86ea5a2df4	Dramatically reduce this testcase. llvm-svn: 140101	2011-09-19 23:47:06 +00:00
Bill Wendling	a5db9cbefd	Revert r140083 and r140084 until buildbots can be fixed. llvm-svn: 140094	2011-09-19 23:30:41 +00:00
Bill Wendling	e54729ae4f	Update test to remove the 'unwind' instruction. llvm-svn: 140084	2011-09-19 23:01:11 +00:00
Bill Wendling	f1c054eace	Replace uses of unwind with unreachable for the same effect. llvm-svn: 140077	2011-09-19 22:16:15 +00:00
Bill Wendling	47fa03b39d	Replace more uses of 'unwind' in the tests with calls to landingpad and resume. Note that some of these tests were basically dead. llvm-svn: 140076	2011-09-19 22:11:35 +00:00
Eli Friedman	9ed4ecaf4b	Fix an infinite loop where a transform in InstCombiner::visitAnd claims a construct is changed when it is not. (See included testcase.) Patch by Xiaoyi Guo. llvm-svn: 140072	2011-09-19 21:58:15 +00:00
Bill Wendling	e821eb23cd	This testcase is dead. It doesn't inline even if I add the 'alwaysinline' attribute to the @foo function. llvm-svn: 140067	2011-09-19 21:14:33 +00:00
Andrew Trick	856edc0ef5	[indvars] Fix PR10946: SCEV cannot handle Vector IVs. llvm-svn: 140026	2011-09-19 17:54:39 +00:00
Andrew Trick	1dfb51c692	Reapply r139759. Disable IV rewriting by default. See PR10916. llvm-svn: 139842	2011-09-15 20:58:37 +00:00
Eli Friedman	2109f34467	Make demanded-elt simplification for shufflevector slightly stronger. Spotted by inspection. llvm-svn: 139768	2011-09-15 01:14:29 +00:00
Dan Gohman	223fdfc56f	objc_retainBlock is not NoModRef because it can update forwarding pointers in memory relevant to the optimizer. rdar://10050579. llvm-svn: 139708	2011-09-14 18:13:00 +00:00
Andrew Trick	2e22ddc364	[indvars] Revert r139579 until 401.bzip -arch i386 miscompilation is fixed. PR10920. llvm-svn: 139583	2011-09-13 05:23:49 +00:00
Andrew Trick	2bfa2824c6	Disable IV rewriting by default. See PR10916. llvm-svn: 139579	2011-09-13 03:23:21 +00:00
Andrew Trick	3faad5fc36	Generalize test case to handle multiple indvars modes. llvm-svn: 139578	2011-09-13 03:17:25 +00:00
Andrew Trick	2cc8637af2	[indvars] Fix bugs in floating point IV range checks noticed by inspection. llvm-svn: 139574	2011-09-13 01:59:32 +00:00
Andrew Trick	cc7f5a609a	Conditionalize indvars test that relies on SCEV expansion of geps, which is only relevant with canonical IVs llvm-svn: 139556	2011-09-12 23:13:57 +00:00
Andrew Trick	a3c28142a5	indvars test only relevant for -enable-iv-rewrite. Otherwise this case is now covered by no-iv-rewrite.ll. llvm-svn: 139552	2011-09-12 22:59:00 +00:00
Andrew Trick	bf1bc9a077	Conditionalize indvars tests that rely on SCEV expansion of geps, which is relevant with canonical IVs. Anything else being checked by these tests is already covered by early CSE. llvm-svn: 139535	2011-09-12 20:26:34 +00:00
Andrew Trick	efe8cdcf20	Removing indvars tests that directly test canonical IVs and nothing else. llvm-svn: 139518	2011-09-12 18:33:08 +00:00
Andrew Trick	09cf4287c2	Rename -disable-iv-rewrite to -enable-iv-rewrite=false in preparation for default change. llvm-svn: 139517	2011-09-12 18:28:44 +00:00
Andrew Trick	19f9e653cd	Test case for r139453, WidenIV::GetExtendedOperandRecurrence. llvm-svn: 139504	2011-09-12 17:20:57 +00:00

1 2 3 4 5 ...

2774 Commits