llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-06 11:19:09 +00:00

Author	SHA1	Message	Date
Duncan Sands	7791a854c3	The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143214	2011-10-28 19:01:20 +00:00
Duncan Sands	3483c23658	A shift of a power of two is a power of two or zero. For completeness - not spotted in the wild. llvm-svn: 143211	2011-10-28 18:30:05 +00:00
Duncan Sands	5730fe6a31	Fold icmp ugt (udiv X, Y), X to false. Spotted by my super-optimizer in 186.crafty. llvm-svn: 143209	2011-10-28 18:17:44 +00:00
Duncan Sands	ca325638c8	Reapply commit 143028 with a fix: the problem was casting a ConstantExpr Mul using BinaryOperator (which only works for instructions) when it should have been a cast to OverflowingBinaryOperator (which also works for constants). While there, correct a few other dubious looking uses of BinaryOperator. Thanks to Chad Rosier for the testcase. Original commit message: My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y \| 1). This occurs in 464.h264ref. llvm-svn: 143125	2011-10-27 19:16:21 +00:00
Bob Wilson	2ca603d9b7	Revert Duncan's r143028 expression folding which appears to be the culprit behind a compile failure on 483.xalancbmk. llvm-svn: 143102	2011-10-27 15:47:25 +00:00
Duncan Sands	5c8fa99c32	The maximum power of 2 dividing a power of 2 is itself. This occurs in 403.gcc and was spotted by my super-optimizer. llvm-svn: 143054	2011-10-26 20:55:21 +00:00
Duncan Sands	c463f54342	My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y \| 1). This occurs in 464.h264ref. llvm-svn: 143028	2011-10-26 15:31:51 +00:00
Duncan Sands	be9c2e6e13	Restore commits 142790 and 142843 - they weren't breaking the build bots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142919	2011-10-25 12:28:52 +00:00
Chandler Carruth	3cbbc35715	Fix the API usage in loop probability heuristics. It was incorrectly classifying many edges as exiting which were in fact not. These mainly formed edges into sub-loops. It was also not correctly classifying all returning edges out of loops as leaving the loop. With this match most of the loop heuristics are more rational. Several serious regressions on loop-intesive benchmarks like perlbench's loop tests when built with -enable-block-placement are fixed by these updated heuristics. Unfortunately they in turn uncover some other regressions. There are still several improvemenst that should be made to loop heuristics including trip-count, and early back-edge management. llvm-svn: 142917	2011-10-25 09:47:41 +00:00
Duncan Sands	da835efa2a	Speculatively revert commits 142790 and 142843 to see if it fixes the dragonegg and llvm-gcc self-host buildbots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142916	2011-10-25 09:26:43 +00:00
Nick Lewycky	289c30130a	Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142843	2011-10-24 21:02:38 +00:00
Chandler Carruth	d04f838629	Remove return heuristics from the static branch probabilities, and introduce no-return or unreachable heuristics. The return heuristics from the Ball and Larus paper don't work well in practice as they pessimize early return paths. The only good hitrate return heuristics are those for: - NULL return - Constant return - negative integer return Only the last of these three can possibly require significant code for the returning block, and even the last is fairly rare and usually also a constant. As a consequence, even for the cold return paths, there is little code on that return path, and so little code density to be gained by sinking it. The places where sinking these blocks is valuable (inner loops) will already be weighted appropriately as the edge is a loop-exit branch. All of this aside, early returns are nearly as common as all three of these return categories, and should actually be predicted as taken! Rather than muddy the waters of the static predictions, just remain silent on returns and let the CFG itself dictate any layout or other issues. However, the return heuristic was flagging one very important case: unreachable. Unfortunately it still gave a 1/4 chance of the branch-to-unreachable occuring. It also didn't do a rigorous job of finding those blocks which post-dominate an unreachable block. This patch builds a more powerful analysis that should flag all branches to blocks known to then reach unreachable. It also has better worst-case runtime complexity by not looping through successors for each block. The previous code would perform an N^2 walk in the event of a single entry block branching to N successors with a switch where each successor falls through to the next and they finally fall through to a return. Test case added for noreturn heuristics. Also doxygen comments improved along the way. llvm-svn: 142793	2011-10-24 12:01:08 +00:00
Nick Lewycky	64d4e26aec	Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142790	2011-10-24 06:57:05 +00:00
Nick Lewycky	341bf1548e	PHI nodes not in the loop header aren't part of the loop iteration initial state. Furthermore, they might not have two operands. This fixes the underlying issue behind the crashes introduced in r142781. llvm-svn: 142788	2011-10-24 05:51:01 +00:00
Nick Lewycky	d72de74587	Speculatively revert r142781. Bots are showing Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed. coming out of indvars. llvm-svn: 142786	2011-10-24 04:00:25 +00:00
Chandler Carruth	c5722dbec0	Simplify the design of BranchProbabilityInfo by collapsing it into a single class. Previously it was split between two classes, one internal and one external. The concern seemed to center around exposing the weights used, but those can remain confined to the implementation file. Having a single class to maintain the state and analyses in use will also simplify several of the enhancements I want to make to our static heuristics. llvm-svn: 142783	2011-10-24 01:40:45 +00:00
Nick Lewycky	5ab7948d71	Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142781	2011-10-23 23:43:14 +00:00
Chandler Carruth	93ee01158c	Tidy up a loop to be more idiomatic for LLVM's codebase, and remove some extraneous whitespace. Trying to clean-up this pass as much as I can before I start making functional changes. llvm-svn: 142780	2011-10-23 22:40:13 +00:00
Chandler Carruth	151d4fc273	Teach the BranchProbabilityInfo pass to print its results, and use that to bring it under direct test instead of merely indirectly testing it in the BlockFrequencyInfo pass. The next step is to start adding tests for the various heuristics employed, and to start fixing those heuristics once they're under test. llvm-svn: 142778	2011-10-23 21:21:50 +00:00
Benjamin Kramer	9adc582e35	Add compare operators to BranchProbability and use it to determine if an edge is hot. llvm-svn: 142751	2011-10-23 11:19:14 +00:00
Nick Lewycky	ce8bfeadff	Make SCEV's brute force analysis stronger in two ways. Firstly, we should be able to constant fold load instructions where the argument is a constant. Second, we should be able to watch multiple PHI nodes through the loop; this patch only supports PHIs in loop headers, more can be done here. With this patch, we now constant evaluate: static const int arr[] = {1, 2, 3, 4, 5}; int test() { int sum = 0; for (int i = 0; i < 5; ++i) sum += arr[i]; return sum; } llvm-svn: 142731	2011-10-22 19:58:20 +00:00
Benjamin Kramer	917737037d	Extend the floating point heuristic to consider NaN checks unlikely. llvm-svn: 142687	2011-10-21 21:13:47 +00:00
Benjamin Kramer	66a199d42f	BranchProbabilityInfo: floating point equality is unlikely. This is from the same paper from Ball and Larus as the rest of the currently implemented heuristics. llvm-svn: 142677	2011-10-21 20:12:47 +00:00
Eli Friedman	0c3b1df94e	A FIXME about block addresses and indirectbr. llvm-svn: 142569	2011-10-20 04:05:33 +00:00
Eli Friedman	6ff41ea07a	Simplify; no intended functional change. llvm-svn: 142567	2011-10-20 03:23:14 +00:00
Nick Lewycky	21a67a1454	"@string = constant i8 0" is a value i8* string of length zero. Analyze that correctly in GetStringLength, fixing PR11181! llvm-svn: 142558	2011-10-20 00:34:35 +00:00
Chandler Carruth	12a645d6f6	Generalize the reading of probability metadata to work for both branches and switches, with arbitrary numbers of successors. Still optimized for the common case of 2 successors for a conditional branch. Add a test case for switch metadata showing up in the BlockFrequencyInfo pass. llvm-svn: 142493	2011-10-19 10:32:19 +00:00
Chandler Carruth	18a382b4b6	Teach the BranchProbabilityInfo analysis pass to read any metadata encoding of probabilities. In the absense of metadata, it continues to fall back on static heuristics. This allows __builtin_expect, after lowering through llvm.expect a branch instruction's metadata, to actually enter the branch probability model. This is one component of resolving PR2577. llvm-svn: 142492	2011-10-19 10:30:30 +00:00
Chandler Carruth	13b475d4f6	Add pass printing support to BlockFrequencyInfo pass. The implementation layer already had support for printing the results of this analysis, but the wiring was missing. Now that printing the analysis works, actually bring some of this analysis, and the BranchProbabilityInfo analysis that it wraps, under test! I'm planning on fixing some bugs and doing other work here, so having a nice place to add regression tests and a way to observe the results is really useful. llvm-svn: 142491	2011-10-19 10:12:41 +00:00
Devang Patel	fb32a2ca1b	Update DebugInfoFinder to match recent debug info encoding changes. llvm-svn: 142295	2011-10-17 22:30:34 +00:00
Bill Wendling	584c5f9c62	Correct over-zealous removal of hack. Some code want to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	2ee7de36b8	Now that we have the ReturnsTwice function attribute, this method is obsolete. Check the attribute instead. <rdar://problem/8031714> llvm-svn: 142212	2011-10-17 18:22:52 +00:00
Chandler Carruth	0334d27ec8	Delete a dead member. Dunno if this was ever used, but the current code directly manipulates the weights inside of the BranchProbabilityInfo that is passed in. llvm-svn: 142163	2011-10-16 22:27:54 +00:00
Andrew Trick	0ef2965563	Fix SCEVExpander assert during LSR: "argument of incompatible type". Just because we're dealing with a GEP doesn't mean we can assert the SCEV has a pointer type. The fix is simply to ignore the SCEV pointer type, which we really didn't need. Fixes PR11138 webkit crash. llvm-svn: 142058	2011-10-15 06:19:55 +00:00
Nick Lewycky	610fc64d78	An instruction's operands aren't necessarily instructions or constants. They could be arguments, for example. No testcase because this is a bug-fix broken out of a larger optimization patch. llvm-svn: 141951	2011-10-14 09:38:46 +00:00
Eli Friedman	e702d68556	Enhance the memdep interface so that users can tell the difference between a dependency which cannot be calculated and a path reaching the entry point of the function. This patch introduces isNonFuncLocal, which replaces isUnknown in some cases. Patch by Xiaoyi Guo. llvm-svn: 141896	2011-10-13 22:14:57 +00:00
Andrew Trick	923129b028	Reapply r141870, SCEV expansion of post-inc. Speculatively reapply to see if this test case still crashes on linux. I may have fixed it in my last checkin. llvm-svn: 141895	2011-10-13 21:55:29 +00:00
Andrew Trick	03cd1b6a5d	Fix memory corruption I introduced a few checkins ago. Self-review easily caught this obvious bug. llvm-svn: 141880	2011-10-13 18:49:23 +00:00
Andrew Trick	109f7dbd1e	Revert r141870. The test case crashes on linux with data corruption. A deeper issue was exposed. llvm-svn: 141873	2011-10-13 17:58:24 +00:00
Andrew Trick	05d7cb17d5	LSR: Reuse the post-inc expansion of expressions. This avoids unnecessary expansion of expressions and allows the SCEV expander to work on expression DAGs, not just trees. Fixes PR11090. llvm-svn: 141870	2011-10-13 17:31:47 +00:00
Andrew Trick	38d4b87695	SCEV: Rewrite TrandformForPostIncUse to handle expression DAGs, not just expression trees. Partially fixes PR11090. Test case will be with the full fix. llvm-svn: 141868	2011-10-13 17:21:09 +00:00
Andrew Trick	5286636188	Slightly more useful tracing. llvm-svn: 141867	2011-10-13 17:06:38 +00:00
Eric Christopher	57c57a3260	Add a new wrapper node for a DILexicalBlock that encapsulates it and a file. Since it should only be used when necessary propagate it through the backend code generation and tweak testcases accordingly. This helps with code like in clang's test/CodeGen/debug-info-line.c where we have multiple #line directives within a single lexical block and want to generate only a single block that contains each file change. Part of rdar://10246360 llvm-svn: 141729	2011-10-11 22:59:11 +00:00
Andrew Trick	d36852e6b1	Move replaceCongruentIVs into SCEVExapander and bias toward "expanded" IVs. Indvars previously chose randomly between congruent IVs. Now it will bias the decision toward IVs that SCEVExpander likes to create. This was not done to fix any problem, it's just a welcome side effect of factoring code. llvm-svn: 141633	2011-10-11 02:28:51 +00:00
Andrew Trick	430029d79a	Add an extra safety check in front of the optimization in r141442. llvm-svn: 141470	2011-10-08 02:16:39 +00:00
Andrew Trick	75743b069e	LSR should only reuse phis that match its formula. Fixes rdar://problem/5064068 llvm-svn: 141442	2011-10-07 23:46:21 +00:00
Eli Friedman	4d63ca106a	Remove the old atomic instrinsics. autoupgrade functionality is included with this patch. llvm-svn: 141333	2011-10-06 23:20:49 +00:00
Andrew Trick	671f73b951	Fixes PR11070 - assert in SCEV getConstantEvolvingPHIOperands. llvm-svn: 141219	2011-10-05 22:06:53 +00:00
Andrew Trick	94a7b27585	Typo. Thanks Bob. llvm-svn: 141188	2011-10-05 16:52:28 +00:00
Chandler Carruth	9fc5856d54	Fix a broken assert found by -Wparentheses. llvm-svn: 141168	2011-10-05 07:02:23 +00:00
Andrew Trick	79a14ca34e	Fix disabled SCEV analysis caused r141161 and add unit test. I noticed during self-review that my previous checkin disabled some analysis. Even with the reenabled analysis the test case runs in about 5ms. Without the fix, it will take several minutes at least. llvm-svn: 141164	2011-10-05 05:58:49 +00:00
Andrew Trick	5789485111	Avoid exponential recursion in SCEV getConstantEvolvingPHI and EvaluateExpression. Note to compiler writers: never recurse on multiple instruction operands without memoization. Fixes rdar://10187945. Was taking 45s, now taking 5ms. llvm-svn: 141161	2011-10-05 03:25:31 +00:00
Nick Lewycky	6833d72fc0	The product of two chrec's can always be represented as a chrec. llvm-svn: 141066	2011-10-04 06:51:26 +00:00
Nick Lewycky	4898eef762	Reapply r140979 with fix! We never did get a testcase, but careful review of the logic by David Meyer revealed this bug. llvm-svn: 140992	2011-10-03 07:10:45 +00:00
Nick Lewycky	79fec8116f	Revert r140979 due to reports of bootstrap failure. llvm-svn: 140980	2011-10-03 05:14:59 +00:00
Nick Lewycky	a760a29395	Add one more case we compute a max trip count. llvm-svn: 140979	2011-10-03 01:03:57 +00:00
Andrew Trick	0489c5410d	Inlining and unrolling heuristics should be aware of free truncs. We want heuristics to be based on accurate data, but more importantly we don't want llvm to behave randomly. A benign trunc inserted by an upstream pass should not cause a wild swings in optimization level. See PR11034. It's a general problem with threshold-based heuristics, but we can make it less bad. llvm-svn: 140919	2011-10-01 01:39:05 +00:00
Andrew Trick	a1161d94f5	whitespace llvm-svn: 140916	2011-10-01 01:27:56 +00:00
Andrew Trick	50915b5136	indvars: generalize SCEV getPreStartForSignExtend. Handle general Add expressions to avoid leaving around redundant 32-bit IVs. llvm-svn: 140701	2011-09-28 17:02:54 +00:00
Eli Friedman	f4f4a75d2b	PR10628: Fix getModRefInfo so it queries the underlying alias() implementation correctly while checking nocapture calls. llvm-svn: 140666	2011-09-28 00:34:27 +00:00
Benjamin Kramer	355b353595	Stop emitting instructions with the name "tmp" they eat up memory and have to be uniqued, without any benefit. If someone prefers %tmp42 to %42, run instnamer. llvm-svn: 140634	2011-09-27 20:39:19 +00:00
Eli Friedman	9c1a430966	Enhance alias analysis for atomic instructions a bit. Upgrade a couple alias-analysis tests to the new atomic instructions. llvm-svn: 140557	2011-09-26 20:15:28 +00:00
Galina Kistanova	5d64adbe43	Fix for DbgInfoPrinter.cpp:174:12: warning: ‘LineNo’ may be used uninitialized in this function. llvm-svn: 140281	2011-09-21 23:34:23 +00:00
Devang Patel	e5dc3a1f1e	Add support to emit debug info for C++0x nullptr type. llvm-svn: 139751	2011-09-14 23:13:28 +00:00
Eric Christopher	665ace6bce	Fix typo. llvm-svn: 139530	2011-09-12 19:58:22 +00:00
Devang Patel	c7a1210467	Add asserts to keep front-ends honest while encoding debug info into LLVM IR using DIBuilder. llvm-svn: 139515	2011-09-12 18:26:08 +00:00
Andrew Trick	8af62b87e4	Set NSW/NUW flags on SCEVAddExpr when the operation is flagged as such. I'm doing this now for completeness because I can't think of/remember any reason that it was left out. I'm not sure it will help anything, but if we don't do it we need to explain why in comments. llvm-svn: 139450	2011-09-10 01:09:50 +00:00
Eli Friedman	c933295353	A couple minor corrections to r139276. llvm-svn: 139277	2011-09-08 02:37:07 +00:00
Eli Friedman	6e9cab83b0	Fix the logic in BasicAliasAnalysis::aliasGEP for comparing GEP's with variable differences so that it actually does something sane. Fixes PR10881. llvm-svn: 139276	2011-09-08 02:23:31 +00:00
Owen Anderson	9ae90800a2	memset_pattern16 uses a 16 BYTE pattern, not a 16 BIT pattern. Add comments to that effect. llvm-svn: 139205	2011-09-06 23:43:26 +00:00
Owen Anderson	483f94e8d1	Teach BasicAA about the aliasing properties of memset_pattern16. Fixes PR10872 and <rdar://problem/10065079>. llvm-svn: 139204	2011-09-06 23:33:25 +00:00
Nick Lewycky	8203bcfd03	This transform only handles two-operand AddRec's. Prevent it from trying to handle anything more complex. Fixes PR10383 again! llvm-svn: 139186	2011-09-06 21:42:18 +00:00
Devang Patel	2c2dd9114e	Now, named mdnode llvm.dbg.cu keeps track of all compile units in a module. Update DebugInfoFinder to collect compile units from llvm.dbg.cu. llvm-svn: 139147	2011-09-06 17:40:08 +00:00
Nick Lewycky	e1c0b41d41	Fix typo in comment again. llvm-svn: 139139	2011-09-06 07:02:40 +00:00
Nick Lewycky	700f71a0ac	Apparently we compile the code, not the comments. Thanks Eli! llvm-svn: 139138	2011-09-06 06:56:00 +00:00
Nick Lewycky	4add6eec38	Fix typo in comment. llvm-svn: 139137	2011-09-06 06:46:01 +00:00
Nick Lewycky	0c9df5d6c2	Nope! I had it right the first time. Revert the operative part of r139135 and add more showing of my work. llvm-svn: 139136	2011-09-06 06:39:54 +00:00
Nick Lewycky	39b165bb7d	Fix flipped sign. While there, show my math. llvm-svn: 139135	2011-09-06 05:33:18 +00:00
Nick Lewycky	fdc650ea7a	No no no, fix typo properly! llvm-svn: 139134	2011-09-06 05:08:09 +00:00
Nick Lewycky	3823432a57	The logic inside getMulExpr to simplify {a,+,b}*{c,+,d} was wrong, which was visible given a=b=c=d=1, on iteration #1 (the second iteration). Replace it with correct math. Fixes PR10383! llvm-svn: 139133	2011-09-06 05:05:14 +00:00
Nick Lewycky	18c0b01a56	Revert r139126 due to selfhost failures reported by buildbots. llvm-svn: 139130	2011-09-06 02:43:13 +00:00
Nick Lewycky	30dcc754df	Teach SCEV to report a max backedge count in one interesting case in HowFarToZero; the case for a canonical loop. llvm-svn: 139126	2011-09-05 23:25:16 +00:00
Benjamin Kramer	ec933b857e	InstSimplify: Don't try to replace an extractvalue/insertvalue pair with the original value if types don't match. Fixes clang selfhost. llvm-svn: 139120	2011-09-05 18:16:19 +00:00
Duncan Sands	a74d10bb60	Add some simple insertvalue simplifications, for the purpose of cleaning up do-nothing exception handling code produced by dragonegg. llvm-svn: 139113	2011-09-05 06:52:48 +00:00
Benjamin Kramer	0c1a5d2067	Use canonical forms for the branch probability zero heutistic. - Drop support for X >u 0, it's equivalent to X != 0 and should be canonicalized into the latter. - Add X < 1 -> unlikely, which is what instcombine canonicalizes X <= 0 into. - Add X > -1 -> likely, which is what instcombine canonicalizes X >= 0 into. llvm-svn: 139110	2011-09-04 23:53:04 +00:00
Andrew Trick	43d88c3879	Comment and clarifying assert. llvm-svn: 139036	2011-09-02 21:20:46 +00:00
Devang Patel	26f7aeed7c	After r138010, subroutine type does not have context info. Update type verifier accordingly. This fixes ptype.exp gdb testsuite regressions. llvm-svn: 138869	2011-08-31 18:04:31 +00:00
Nadav Rotem	43912ff374	Fixes following the CR by Chris and Duncan: Optimize chained bitcasts of the form A->B->A. Undo r138722 and change isEliminableCastPair to allow this case. llvm-svn: 138756	2011-08-29 19:58:36 +00:00
Andrew Trick	026fa4da72	Reapply r138695. Fix PassManager stack depths. Patch by Xiaoyi Guo! llvm-svn: 138737	2011-08-29 17:07:00 +00:00
Nadav Rotem	6280c8eecc	Bitcasts are transitive. Bitcast-Bitcast-X becomes Bitcast-X. llvm-svn: 138722	2011-08-28 11:51:08 +00:00
Andrew Trick	65754eb3f7	Reverting r138695 to see if it fixes clang self host. llvm-svn: 138701	2011-08-27 06:10:16 +00:00
Andrew Trick	e38568bc07	Fix PassManager stack depths. Patch by Xiaoyi Guo! llvm-svn: 138695	2011-08-27 02:11:03 +00:00
Eric Christopher	09fd6a961c	Whitespace and 80-col. llvm-svn: 138654	2011-08-26 21:02:40 +00:00
Andrew Trick	504187a152	LoopInfo::updateUnloop fix, and verify Block->Loop maps. Fixes an oversight, and adds verification to catch it in the unloop.ll tests. llvm-svn: 138622	2011-08-26 03:06:34 +00:00
Bill Wendling	15062f817a	Skip the landingpad instruction when determining the insertion point. llvm-svn: 138481	2011-08-24 21:06:46 +00:00
Nadav Rotem	43673d6d0b	Implement Constant::isAllOnesValue(). Fix ConstantFolding to use the new api. llvm-svn: 138469	2011-08-24 20:18:38 +00:00
Eric Christopher	aa4993b9aa	Revert "Address Duncan's CR request:" This reverts commit 20a05be15ea5271ab6185b83200fa88263362400. (svn rev 138340) Conflicts: test/Transforms/InstCombine/bitcast.ll llvm-svn: 138366	2011-08-23 20:11:10 +00:00
Nadav Rotem	d449bc9bff	Address Duncan's CR request: 1. Cleanup the tests in ConstantFolding.cpp 2. Implement isAllOnes for Constant, ConstantFP, ConstantVector llvm-svn: 138340	2011-08-23 17:48:43 +00:00
Nadav Rotem	1dc1eb1ae0	Add constant folding support for bitcasts of splat vectors to integers. llvm-svn: 138206	2011-08-20 14:02:29 +00:00
Devang Patel	e4127d626e	Do not use named md nodes to track variables that are completely optimized. This does not scale while doing LTO with debug info. New approach is to include list of variables in the subprogram info directly. llvm-svn: 138145	2011-08-19 23:28:12 +00:00
Benjamin Kramer	0d60a5573c	Make a bunch of symbols private. llvm-svn: 138025	2011-08-19 01:42:18 +00:00
Benjamin Kramer	c5a90f3683	C API functions must be able to see their extern "C" definitions, or it will be impossible to call them from C. llvm-svn: 138022	2011-08-19 01:36:54 +00:00
Devang Patel	9fa42d022b	There is no need to add file as context for subroutine type. The subroutine type does not need any context. llvm-svn: 138010	2011-08-18 23:50:57 +00:00
Bill Wendling	3566980062	Revert r137655. There is some question about whether the 'landingpad' instruction should be marked as potentially reading and/or writing memory. llvm-svn: 137863	2011-08-17 20:36:44 +00:00
Eli Friedman	6da9a85aaa	Revert r137781; I agree with Duncan's comment that the situation in question is clearly impossible given the current structure of the code. llvm-svn: 137853	2011-08-17 19:31:49 +00:00
Eli Friedman	fc821baad4	Extend the undef ^ undef idiom once more. No testcase: I can't figure out how to actually trigger the codepath in question at the moment, but it might get exposed in the future. llvm-svn: 137781	2011-08-16 22:38:34 +00:00
Devang Patel	630d8ce36b	Until now all debug info MDNodes referred to a root MDNode, a compile unit. This simplified handling of these needs in dwarf writer. However, one side effect of this is that during link time optimization all these MDNodes are _not_ uniqued. In other words there will be N number of MDNodes describing "int", "char" and all other types, which would suddenly grow when each object file starts using libraries like STL. MDNodes graph structure such that compiler unit keeps track of important MDNodes and update dwarf writer to process mdnodes top-down instead of bottom up. llvm-svn: 137778	2011-08-16 22:09:43 +00:00
Bill Wendling	3d7b8eaa78	Use the getFirstInsertionPt() method instead of getFirstNonPHI + an 'isa<>' check for a LandingPadInst. llvm-svn: 137745	2011-08-16 20:45:24 +00:00
Bill Wendling	3e159bd43d	A few places where we want to skip the landingpad instruction for insertion. llvm-svn: 137712	2011-08-16 04:52:55 +00:00
Devang Patel	f1b61abc8c	Add a finalize() hook, that'll let DIBuilder construct compile unit lazily. llvm-svn: 137673	2011-08-15 23:00:00 +00:00
Eli Friedman	a88908f4e7	Add some comments here because the lack of a check for volatile/atomic here is a bit unusual. llvm-svn: 137662	2011-08-15 21:56:39 +00:00
Bill Wendling	a75d2d0416	Duncan pointed out that the LandingPadInst might read memory. (It might also write to memory.) Marking it as such makes some checks for immobility go away. llvm-svn: 137655	2011-08-15 21:14:31 +00:00
Eli Friedman	5acfaeab2d	Misc analysis passes that need to be aware of atomic load/store. llvm-svn: 137650	2011-08-15 20:54:19 +00:00
Eli Friedman	e18709e5b2	Atomic load/store support in LICM. llvm-svn: 137648	2011-08-15 20:52:09 +00:00
Bill Wendling	8cc5168e87	The landingpad instruction isn't loop-invariant. llvm-svn: 137628	2011-08-15 18:22:49 +00:00
Devang Patel	1b73744344	Refactor. Global variables are part of compile unit so let CompileUnit create new global variable. llvm-svn: 137621	2011-08-15 17:57:41 +00:00
Duncan Sands	10a9e984bc	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. llvm-svn: 137460	2011-08-12 14:54:45 +00:00
Andrew Trick	5a5a5ebe68	Allow loop unrolling to get known trip counts from ScalarEvolution. SCEV unrolling can unroll loops with arbitrary induction variables. It is a prerequisite for -disable-iv-rewrite performance. It is also easily handles loops of arbitrary structure including multiple exits and is generally more robust. This is under a temporary option to avoid affecting default behavior for the next couple of weeks. It is needed so that I can checkin unit tests for updateUnloop. llvm-svn: 137384	2011-08-11 23:36:16 +00:00
Andrew Trick	b31724d9ff	Fix for LoopInfo::updateUnloop. Remove subloop blocks from former ancestor loops. I have a unit test that depends on scev-unroll, which unfortunately isn't checked in. But I will check it in when I can. llvm-svn: 137341	2011-08-11 20:27:32 +00:00
Andrew Trick	eb6a7aad9e	Cleanup. Another thorough review by Nick! llvm-svn: 137317	2011-08-11 17:54:58 +00:00
Andrew Trick	306e3d0508	Reapplying r136844. An algorithm for incrementally updating LoopInfo within a LoopPassManager. The incremental update should be extremely cheap in most cases and can be used in places where it's not feasible to regenerate the entire loop forest. - "Unloop" is a node in the loop tree whose last backedge has been removed. - Perform reverse dataflow on the block inside Unloop to propagate the nearest loop from the block's successors. - For reducible CFG, each block in unloop is visited exactly once. This is because unloop no longer has a backedge and blocks within subloops don't change parents. - Immediate subloops are summarized by the nearest loop reachable from their exits or exits within nested subloops. - At completion the unloop blocks each have a new parent loop, and each immediate subloop has a new parent. llvm-svn: 137276	2011-08-10 23:22:57 +00:00
Devang Patel	1541972a13	Distinguish between two copies of one inlined variable. Take 2. llvm-svn: 137253	2011-08-10 21:50:54 +00:00
Andrew Trick	d10d5f0609	Cleanup. Added LoopBlocksDFS::perform for simple clients. llvm-svn: 137195	2011-08-10 01:59:05 +00:00
Devang Patel	1d038e6e01	Provide method to print variable's extended name which includes inline location. llvm-svn: 137095	2011-08-09 01:03:14 +00:00
Andrew Trick	a7fc7983d4	Made SCEV's UDiv expressions more canonical. When dividing a recurrence, the initial values low bits can sometimes be ignored. To take advantage of this, added FoldIVUser to IndVarSimplify to fold an IV operand into a udiv/lshr if the operator doesn't affect the result. -indvars -disable-iv-rewrite now transforms i = phi i4 i1 = i0 + 1 idx = i1 >> (2 or more) i4 = i + 4 into i = phi i4 idx = i0 >> ... i4 = i + 4 llvm-svn: 137013	2011-08-06 07:00:37 +00:00
Chandler Carruth	e833bbd11e	Temporarily revert r135528 which distinguishes between two copies of one inlined variable, based on the discussion in PR10542. This explodes the runtime of several passes down the pipeline due to a large number of "copies" remaining live across a large function. This only shows up with both debug and opt, but when it does it creates a many-minute compile when self-hosting LLVM+Clang. There are several other cases that show these types of regressions. All of this is tracked in PR10542, and progress is being made on fixing the issue. Once its addressed, the re-instated, but until then this restores the performance for self-hosting and other opt+debug builds. Devang, let me know if this causes any trouble, or impedes fixing it in any way, and thanks for working on this! llvm-svn: 136953	2011-08-05 00:51:31 +00:00
Duncan Sands	be4435d95e	Fix what seems an obvious typo. Patch by Ivan Krasin. Problem reported at http://habrahabr.ru/blogs/compilers/125626/. llvm-svn: 136865	2011-08-04 10:02:21 +00:00
Andrew Trick	aebed187b7	Reverting r136884 updateUnloop, which crashed a linux builder. llvm-svn: 136857	2011-08-04 01:04:37 +00:00
Andrew Trick	cfb48e5133	An algorithm for incrementally updating LoopInfo within a LoopPassManager. The incremental update should be extremely cheap in most cases and can be used in places where it's not feasible to regenerate the entire loop forest. - "Unloop" is a node in the loop tree whose last backedge has been removed. - Perform reverse dataflow on the block inside Unloop to propagate the nearest loop from the block's successors. - For reducible CFG, each block in unloop is visited exactly once. This is because unloop no longer has a backedge and blocks within subloops don't change parents. - Immediate subloops are summarized by the nearest loop reachable from their exits or exits within nested subloops. - At completion the unloop blocks each have a new parent loop, and each immediate subloop has a new parent. llvm-svn: 136844	2011-08-03 23:50:25 +00:00
Andrew Trick	a700dedcd1	whitespace llvm-svn: 136843	2011-08-03 23:45:50 +00:00
Jakub Staszak	7b30ba0db8	Add more constantness in BlockFrequencyInfo. llvm-svn: 136816	2011-08-03 21:30:57 +00:00
Bill Wendling	752f75c8ce	Add this back in for now. There are still a few passes which create unwind instructions at the moment. llvm-svn: 136756	2011-08-03 01:07:57 +00:00
Bill Wendling	ad3c4a32ea	Replace the 'UnwindInst' check with a check for 'ResumeInst', which also exits the function, because the UnwindInst is going away. llvm-svn: 136751	2011-08-03 00:30:19 +00:00
Andrew Trick	86f6a3c645	Use consistent terminology for loop exit/exiting blocks. Name change only. llvm-svn: 136677	2011-08-02 04:23:35 +00:00
Jakub Staszak	5a47c78397	Change SmallVector to SmallPtrSet in BranchProbabilityInfo. Handle cases where one than one successor goes to the same block. llvm-svn: 136638	2011-08-01 19:16:26 +00:00
Jakub Staszak	8f686498df	Do not handle cases with >= and <= predicates. llvm-svn: 136588	2011-07-31 05:54:04 +00:00
Jakub Staszak	0fe2246aab	Remove untrue comment. llvm-svn: 136587	2011-07-31 04:51:14 +00:00
Jakub Staszak	6795c54cb7	Do not handle case where LHS is equal to zero, because InstCombiner always moves it to RHS anyway. llvm-svn: 136586	2011-07-31 04:47:20 +00:00
Jakub Staszak	384a405be9	Add Zero Heurestics to BranchProbabilityInfo. If we compare value to zero we decide whether condition is likely to be true this way: x == 0 -> false x < 0 -> false x <= 0 -> false x != 0 -> true x > 0 -> true x >= 0 -> true llvm-svn: 136583	2011-07-31 03:27:24 +00:00
Jakub Staszak	02fa079dc3	Add more constantness in BranchProbabilityInfo. llvm-svn: 136502	2011-07-29 19:30:00 +00:00
Jakub Staszak	ad93033183	Remove incEdgeWeight and decEdgeWeight. Set edge weight directly to avoid rounding errors. llvm-svn: 136456	2011-07-29 02:36:53 +00:00
Chandler Carruth	f7890e34b9	Rewrite the CMake build to use explicit dependencies between libraries, specified in the same file that the library itself is created. This is more idiomatic for CMake builds, and also allows us to correctly specify dependencies that are missed due to bugs in the GenLibDeps perl script, or change from compiler to compiler. On Linux, this returns CMake to a place where it can relably rebuild several targets of LLVM. I have tried not to change the dependencies from the ones in the current auto-generated file. The only places I've really diverged are in places where I was seeing link failures, and added a dependency. The goal of this patch is not to start changing the dependencies, merely to move them into the correct location, and an explicit form that we can control and change when necessary. This also removes a serialization point in the build because we don't have to scan all the libraries before we begin building various tools. We no longer have a step of the build that regenerates a file inside the source tree. A few other associated cleanups fall out of this. This isn't really finished yet though. After talking to dgregor he urged switching to a single CMake macro to construct libraries with both sources and dependencies in the arguments. Migrating from the two macros to that style will be a follow-up patch. Also, llvm-config is still generated with GenLibDeps.pl, which means it still has slightly buggy dependencies. The internal CMake 'llvm-config-like' macro uses the correct explicitly specified dependencies however. A future patch will switch llvm-config generation (when using CMake) to be based on these deps as well. This may well break Windows. I'm getting a machine set up now to dig into any failures there. If anyone can chime in with problems they see or ideas of how to solve them for Windows, much appreciated. llvm-svn: 136433	2011-07-29 00:14:25 +00:00
Jakub Staszak	d6d0b202f1	Change LBH_TAKEN_WEIGHT to 124 (from 128). Right now, sum of LBH_TAKEN_WEIGHT + LBH_NONTAKEN_WEIGHT = 128 which in _most_ cases reduce number of rounding errors. llvm-svn: 136428	2011-07-28 23:42:08 +00:00
Jakub Staszak	53da869f0c	Heuristics are in descending priority now. If we use one of them, skip the rest. llvm-svn: 136402	2011-07-28 21:45:07 +00:00
Jakub Staszak	c5d6bbc669	Add InEdges (edges from header to the loop) in Loop Branch Heuristics, so there is no frequency difference whether condition is in the header or in the latch. llvm-svn: 136398	2011-07-28 21:33:46 +00:00
Jakub Staszak	f5076015fc	Use BlockFrequency instead of uint32_t in BlockFrequencyInfo. llvm-svn: 136278	2011-07-27 22:05:51 +00:00
Jeffrey Yasskin	8a0f9f17a3	Explicitly cast narrowing conversions inside {}s that will become errors in C++0x. llvm-svn: 136211	2011-07-27 06:22:51 +00:00
Eli Friedman	8c5930556f	Minor simplification. llvm-svn: 136202	2011-07-27 01:02:25 +00:00
Eli Friedman	2da127e60a	Fix AliasSetTracker so that it doesn't make any assumptions about instructions it doesn't know about (like the atomic instructions I'm adding). llvm-svn: 136198	2011-07-27 00:46:46 +00:00
Andrew Trick	ae93f1bf46	SCEV: Added a data structure for storing not-taken info per loop exit. Added an interfaces for querying either the loop's exact/max backedge taken count or a specific loop exit's not-taken count. llvm-svn: 136100	2011-07-26 17:19:55 +00:00
Duncan Sands	52f6e018e2	Add helper function for getting true/false constants in a uniform way for i1 and vector of i1 types. Use these to make some code more self-documenting. llvm-svn: 136079	2011-07-26 15:03:53 +00:00
Jakub Staszak	5c309cbead	Rename BlockFrequency to BlockFrequencyInfo and MachineBlockFrequency to MachineBlockFrequencyInfo. llvm-svn: 135937	2011-07-25 19:25:40 +00:00
Frits van Bommel	775ac35cf1	Shorten some expressions by using ArrayRef::slice(). llvm-svn: 135910	2011-07-25 15:13:01 +00:00
Jay Foad	6513dac6e2	Convert GetElementPtrInst to use ArrayRef. llvm-svn: 135904	2011-07-25 09:48:08 +00:00
Jay Foad	42463ed852	Convert IRBuilder::CreateGEP and IRBuilder::CreateInBoundsGEP to use ArrayRef. llvm-svn: 135761	2011-07-22 08:16:57 +00:00
Jakub Staszak	cf5ebedf56	Allow getBlockFreq to return 0. llvm-svn: 135742	2011-07-22 02:24:57 +00:00
Jay Foad	d4458dd707	Convert ConstantExpr::getGetElementPtr and ConstantExpr::getInBoundsGetElementPtr to use ArrayRef. llvm-svn: 135673	2011-07-21 14:31:17 +00:00
Devang Patel	284b502be9	There are two ways to map a variable to its lexical scope. Lexical scope information is embedded in MDNode describing the variable. It is also available as a part of DebugLoc attached with DBG_VALUE instruction. DebugLoc attached with an instruction is less reliable in optimized code so use information embedded in the MDNode. llvm-svn: 135629	2011-07-20 22:18:50 +00:00
Devang Patel	65afdd27d0	Distinguish between two copies of one inlined variable. llvm-svn: 135528	2011-07-19 22:31:15 +00:00
Devang Patel	5a4bb57ec5	Reapply r135457. This needs llvm-gcc change, that I forgot to check-in yesterday. llvm-svn: 135504	2011-07-19 19:41:54 +00:00
Bob Wilson	7c9092c9aa	Revert "Make a provision to encode inline location in a variable. This will enable dwarf writer to easily distinguish between two instances of a inlined variable in one basic block." This reverts commit 9fec5e346efdf744b151ae6604f912908315fa7a. llvm-svn: 135486	2011-07-19 16:32:50 +00:00
Jay Foad	bbbf29aab7	Convert SimplifyGEPInst to use ArrayRef. llvm-svn: 135482	2011-07-19 15:07:52 +00:00
Jay Foad	0974b71f17	Convert TargetData::getIndexedOffset to use ArrayRef. llvm-svn: 135478	2011-07-19 14:01:37 +00:00
Jay Foad	ae5894c5cc	Use ArrayRef in ConstantFoldInstOperands and ConstantFoldCall. llvm-svn: 135477	2011-07-19 13:32:40 +00:00
Devang Patel	e50d45c876	Make a provision to encode inline location in a variable. This will enable dwarf writer to easily distinguish between two instances of a inlined variable in one basic block. llvm-svn: 135457	2011-07-19 01:03:32 +00:00
Frits van Bommel	6c24f9c277	Migrate LLVM and Clang to use the new makeArrayRef(...) functions where previously explicit non-default constructors were used. Mostly mechanical with some manual reformatting. llvm-svn: 135390	2011-07-18 12:00:32 +00:00
Chris Lattner	e1fe7061ce	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Benjamin Kramer	9de5b6c35b	Silence compiler warnings. llvm-svn: 135358	2011-07-16 22:26:27 +00:00
Jakub Staszak	7bd3be24d8	Remove "LoopInfo.h" include from BranchProbabilityInfo.h. llvm-svn: 135353	2011-07-16 20:31:15 +00:00
Andrew Trick	b2c2eec80d	Fix SCEVEXpander to handle arbitrary phi expansion. Includes two related bug fixes and corresponding assertions for uninitialized data and missing NULL check. Test cases will be included with the new LFTR. llvm-svn: 135333	2011-07-16 00:59:39 +00:00
Jakub Staszak	ada22d8fca	Fix pointer heuristic. Check whether predicator is ICMP_NE instead of if it is not isEquality(). llvm-svn: 135296	2011-07-15 20:51:06 +00:00
Jay Foad	c826df8fb7	Convert CallInst and InvokeInst APIs to use ArrayRef. llvm-svn: 135265	2011-07-15 08:37:34 +00:00
Jay Foad	88fb4f4597	Convert InsertValueInst and ExtractValueInst APIs to use ArrayRef. llvm-svn: 135040	2011-07-13 10:26:04 +00:00
Chris Lattner	f0f71ca28b	stop using WriteTypeSymbolic. llvm-svn: 134833	2011-07-09 18:02:13 +00:00
Devang Patel	bfd474b954	Preserve debug loc. llvm-svn: 134441	2011-07-05 21:48:22 +00:00
Dan Gohman	c093f48834	Teach IVUsers to stop at non-affine expressions unless they are both outside the loop and reducible. This more completely hides them from LSR, which isn't usually able to do anything meaningful with non-affine expressions anyway, and this consequently hides them from SCEVExpander, which is acutely unprepared for non-affine expressions. Replace test/CodeGen/X86/lsr-nonaffine.ll with a new test that tests the new behavior. This works around the bug in PR10117 / rdar://problem/9633149, and is generally an improvement besides. llvm-svn: 134268	2011-07-01 22:05:19 +00:00
Dan Gohman	ba1afd789e	Improve constant folding of undef for cmp and select operators. llvm-svn: 134223	2011-07-01 01:03:43 +00:00
Andrew Trick	1fcf6615c4	Cleanup. Fix a stupid variable name. llvm-svn: 133995	2011-06-28 05:41:52 +00:00
Andrew Trick	4ac5dd5154	SCEVExpander: give new insts a name that identifies the reponsible pass. llvm-svn: 133992	2011-06-28 05:07:32 +00:00
Andrew Trick	eb9ed2c82e	indvars --disable-iv-rewrite: sever ties with IVUsers. llvm-svn: 133988	2011-06-28 03:01:46 +00:00
Nick Lewycky	4a0f5218f2	Move onlyUsedByLifetimeMarkers to ValueTracking so that it can be used by other passes as well. llvm-svn: 133904	2011-06-27 04:20:45 +00:00
Devang Patel	10271bcdad	Fix struct member's scope. Patch by Xi Wang. llvm-svn: 133828	2011-06-24 22:00:39 +00:00
Jakub Staszak	4cd344d775	Calculate backedge probability correctly. llvm-svn: 133776	2011-06-23 23:52:11 +00:00
Jakub Staszak	d28a800e6b	Missing files for the BlockFrequency analysis added. llvm-svn: 133767	2011-06-23 21:56:59 +00:00
Jakub Staszak	d74cc36bcb	Introduce BlockFrequency analysis for BasicBlocks. llvm-svn: 133766	2011-06-23 21:45:20 +00:00
Rafael Espindola	6ac573231c	Revert "revert 133714" This reverts commit e8e00f5efb4a22238f2407bf813de4606f30c5aa. The cmake build on OS X is still broken. llvm-svn: 133718	2011-06-23 14:19:39 +00:00
Dylan Noblesmith	9b0ec7b92a	revert 133714 It broke the build worse. llvm-svn: 133716	2011-06-23 13:56:01 +00:00
Rafael Espindola	b6bbcca398	133713 broke the build, revert it. llvm-svn: 133714	2011-06-23 13:37:38 +00:00
Dylan Noblesmith	0adb1b1393	Support: make floating-exception header private It has only one user. This eliminates the last include of config.h from the public headers -- ideally, config.h shouldn't even be installed by `make install` anymore. llvm-svn: 133713	2011-06-23 12:45:54 +00:00
Devang Patel	5640b59656	New binops need debug loc. llvm-svn: 133642	2011-06-22 20:56:56 +00:00
Andrew Trick	04c3bb47ce	IVUsers no longer needs to record the phis. llvm-svn: 133518	2011-06-21 15:43:52 +00:00
Chris Lattner	d456ff35d1	Revamp the "ConstantStruct::get" methods. Previously, these were scattered all over the place in different styles and variants. Standardize on two preferred entrypoints: one that takes a StructType and ArrayRef, and one that takes StructType and varargs. In cases where there isn't a struct type convenient, we now add a ConstantStruct::getAnon method (whose name will make more sense after a few more patches land). It would be "really really nice" if the ConstantStruct::get and ConstantVector::get methods didn't make temporary std::vectors. llvm-svn: 133412	2011-06-20 04:01:31 +00:00
Chris Lattner	d8515bc2d6	simplify some code. llvm-svn: 133362	2011-06-18 21:46:23 +00:00
Benjamin Kramer	a16cba3e83	Simplify code. No functionality change. llvm-svn: 133351	2011-06-18 14:42:42 +00:00
Jakub Staszak	5c7b7d64ba	Introduce MachineBranchProbabilityInfo class, which has similar API to BranchProbabilityInfo (expect setEdgeWeight which is not available here). Branch Weights are kept in MachineBasicBlocks. To turn off this analysis set -use-mbpi=false. llvm-svn: 133184	2011-06-16 20:22:37 +00:00
Eli Friedman	9eef1c75a5	Add a limit to the number of instructions memdep will scan in a single block. This prevents (at least in some cases) O(N^2) runtime in passes like DSE. The limit in this patch is probably too high, but it is enough to stop DSE from going completely insane on a testcase I have (which has a single block with around 50,000 non-aliasing stores in it). rdar://9471075 llvm-svn: 133111	2011-06-15 23:59:25 +00:00
Eli Friedman	47b3cd9fff	Add "unknown" results for memdep, which mean "I don't know whether a dependence for the given instruction exists in the given block". This cleans up all the existing hacks in memdep which represent this concept by returning clobber with various unrelated instructions. llvm-svn: 133031	2011-06-15 00:47:34 +00:00
Benjamin Kramer	8f8ff3e797	Move class into an anonymous namespace. llvm-svn: 132925	2011-06-13 18:38:56 +00:00
Andrew Trick	4591f882c2	Branch profiling: floating-point avoidance. Patch by: Jakub Staszak! Introduces BranchProbability. Changes unsigned to uint32_t all over and uint64_t only when overflow is expected. llvm-svn: 132867	2011-06-11 01:05:22 +00:00
Dan Gohman	27a1b5f7c5	Initialize BasicAA's AliasCache to set it to use fewer buckets by default, since it usually has very few elements. This speeds up alias queries in many cases, because AliasCache.clear() doesn't have to visit as many buckets. llvm-svn: 132862	2011-06-10 22:30:30 +00:00

... 2 3 4 5 6 ...

4180 Commits