llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-09 13:21:30 +00:00

Author	SHA1	Message	Date
Michael Gottesman	0a6d89b0cc	[llvm] Updated the test fold-vector-select so that we test the vector selects exhaustively. llvm-svn: 162953	2012-08-30 23:11:49 +00:00
Benjamin Kramer	766538b9bb	Fix test case. llvm-svn: 162913	2012-08-30 15:42:45 +00:00
Benjamin Kramer	d473cfcfb4	LoopRotate: Also rotate loops with multiple exits. The old PHI updating code in loop-rotate was replaced with SSAUpdater a while ago, it has no problems with comples PHIs. What had to be fixed is detecting whether a loop was already rotated and updating dominators when multiple exits were present. This change increases overall code size a bit, mostly due to additional loop unrolling opportunities. Passes test-suite and selfhost with -verify-dom-info. Fixes PR7447. Thanks to Andy for the input on the domtree updating code. llvm-svn: 162912	2012-08-30 15:39:42 +00:00
Nadav Rotem	5848719c42	It is illegal to transform (sdiv (ashr X c1) c2) -> (sdiv x (2^c1 * c2)), because C always rounds towards zero. Thanks Dirk and Ben. llvm-svn: 162899	2012-08-30 11:23:20 +00:00
Benjamin Kramer	b92d13cc42	Make MemoryBuiltins aware of TargetLibraryInfo. This disables malloc-specific optimization when -fno-builtin (or -ffreestanding) is specified. This has been a problem for a long time but became more severe with the recent memory builtin improvements. Since the memory builtin functions are used everywhere, this required passing TLI in many places. This means that functions that now have an optional TLI argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead mallocs anymore if the TLI argument is missing. I've updated most passes to do the right thing. Fixes PR13694 and probably others. llvm-svn: 162841	2012-08-29 15:32:21 +00:00
Benjamin Kramer	bc139a63fc	InstCombine: Guard the transform introduced in r162743 against large ints and non-const shifts. llvm-svn: 162751	2012-08-28 13:08:13 +00:00
Nadav Rotem	d0cd39e7c9	Make sure that we don't call getZExtValue on values > 64 bits. Thanks Benjamin for noticing this. llvm-svn: 162749	2012-08-28 12:23:22 +00:00
Nadav Rotem	9582c96aef	Teach InstCombine to canonicalize [SU]div+[AL]shl patterns. For example: %1 = lshr i32 %x, 2 %2 = udiv i32 %1, 100 rdar://12182093 llvm-svn: 162743	2012-08-28 10:01:43 +00:00
Benjamin Kramer	d456ce1d6b	SimplifyLibCalls: Give all safely-shrinkable libcalls the same treatment. llvm-svn: 162383	2012-08-22 19:39:15 +00:00
Chad Rosier	de7cbacc5f	Whitespace. llvm-svn: 162370	2012-08-22 17:34:11 +00:00
Chad Rosier	2e46e833db	Add test case for r162368. llvm-svn: 162369	2012-08-22 17:31:04 +00:00
Chandler Carruth	b82f8d4af5	Port the global copy optimization from the SROA pass to InstCombine. This optimization is really just replacing allocas wholesale with globals, there is no scalarization. The underlying motivation for this patch is to simplify the SROA pass and focus it on splitting and promoting allocas. llvm-svn: 162271	2012-08-21 08:39:44 +00:00
Benjamin Kramer	54d9d1a993	InstCombine: Fix a crasher when encountering a function pointer. llvm-svn: 162180	2012-08-18 22:04:34 +00:00
Benjamin Kramer	1a05d12328	InstCombine: Add a couple of fabs identities for comparing with 0.0. llvm-svn: 162174	2012-08-18 20:06:47 +00:00
Benjamin Kramer	31074f526a	SimplifyLibcalls: Add fabs and trunc to the list of libcalls that are safe to shrink from double to float. llvm-svn: 162173	2012-08-18 19:27:32 +00:00
Benjamin Kramer	4e9e4d1818	MemoryBuiltins: Properly guard ObjectSizeOffsetVisitor against cycles in the IR. The previous fix only checked for simple cycles, use a set to catch longer cycles too. Drop the broken check from the ObjectSizeOffsetEvaluator. The BoundsChecking pass doesn't have to deal with invalid IR like InstCombine does. llvm-svn: 162120	2012-08-17 19:26:41 +00:00
Benjamin Kramer	d431f3a1f2	Guard MemoryBuiltins against self-looping GEPs, which can occur in unreachable code due to constant propagation. Fixes PR13621. llvm-svn: 162098	2012-08-17 14:16:37 +00:00
Benjamin Kramer	b42939c43b	Fix broken check lines. I really need to find a way to automate this, but I can't come up with a regex that has no false positives while handling tricky cases like custom check prefixes. llvm-svn: 162097	2012-08-17 12:28:26 +00:00
Rafael Espindola	d7ec990084	Teach GVN to reason about edges dominating uses. This allows it to handle cases where some fact lake a=b dominates a use in a phi, but doesn't dominate the basic block itself. This feature could also be implemented by splitting critical edges, but at least with the current algorithm reasoning about the dominance directly is faster. The time for running "opt -O2" in the testcase in pr10584 is 1.003 times slower and on gcc as a single file it is 1.0007 times faster. llvm-svn: 162023	2012-08-16 15:09:43 +00:00
Michael Liao	cd290ba4fd	fix infinite loop in instcombine with more than 4GB memcpy - memcpy size is wrongly truncated into 32-bit and treat 8GB memcpy is 0-sized memcpy - as 0-sized memcpy/memset is already removed before SimplifyMemTransfer and SimplifyMemSet in visitCallInst, replace 0 checking with assertions. - replace getZExtValue() with getLimitedValue() according to Eli Friedman llvm-svn: 161923	2012-08-15 03:49:59 +00:00
Craig Topper	6a2fe056ce	Change greater than to greater than or equal so that an identical sized store to the same offset is treated as completing overwriting. llvm-svn: 161857	2012-08-14 07:32:05 +00:00
Nadav Rotem	eb22b069bb	During the CodeGenPrepare we often lower intrinsics (such as objsize) and allow some optimizations to turn conditional branches into unconditional. This commit adds a simple control-flow optimization which merges two consecutive basic blocks which are connected by a single edge. This allows the codegen to operate on larger basic blocks. rdar://11973998 llvm-svn: 161852	2012-08-14 05:19:07 +00:00
Eli Friedman	449495cd62	The normal edge of an invoke is not allowed to branch to a block with a landingpad. Enforce it in the verifier, and fix the regression tests to match. llvm-svn: 161697	2012-08-10 20:55:20 +00:00
Pete Cooper	22f2513465	Fix crash when when do lto on Bullet. Dynamic GEPs in SROA were incorrectly being applied to all accesses to an alloca, not just the ones which read from the GEP. Thanks to Evan for reducing the test. rdar://11861001 llvm-svn: 161654	2012-08-10 03:26:36 +00:00
Eli Friedman	a64c4c130d	isAllocLikeFn is allowed to return true for functions which read memory; make sure we account for that correctly in DeadStoreElimination. Fixes a regression from r158919. PR13547. llvm-svn: 161468	2012-08-08 02:17:32 +00:00
Dan Gohman	df7f8afaf2	Avoid recomputing the unique exit blocks and their insert points when doing multiple scalar promotions on a single loop. This also has the effect of preserving the order of stores sunk out of loops, which is aesthetically pleasing, and it happens to fix the testcase in PR13542, though it doesn't fix the underlying problem. llvm-svn: 161459	2012-08-08 00:00:26 +00:00
Bob Wilson	51c50d44b7	Fix a serious typo in InstCombine's optimization of comparisons. An unsigned value converted to floating-point will always be greater than a negative constant. Unfortunately InstCombine reversed the check so that unsigned values were being optimized to always be greater than all positive floating-point constants. <rdar://problem/12029145> llvm-svn: 161452	2012-08-07 22:35:16 +00:00
Benjamin Kramer	b8389165be	PR13095: Give an inline cost bonus to functions using byval arguments. We give a bonus for every argument because the argument setup is not needed anymore when the function is inlined. With this patch we interpret byval arguments as a compact representation of many arguments. The byval argument setup is implemented in the backend as an inline memcpy, so to model the cost as accurately as possible we take the number of pointer-sized elements in the byval argument and give a bonus of 2 instructions for every one of those. The bonus is capped at 8 elements, which is the number of stores at which the x86 backend switches from an expanded inline memcpy to a real memcpy. It would be better to use the real memcpy threshold from the backend, but it's not available via TargetData. This change brings the performance of c-ray in line with gcc 4.7. The included test case tries to reproduce the c-ray problem to catch regressions for this benchmark early, its performance is dominated by the inline decision of a specific call. This only has a small impact on most code, more on x86 and arm than on x86_64 due to the way the ABI works. When building LLVM for x86 it gives a small inline cost boost to virtually any function using StringRef or STL allocators, but only a 0.01% increase in overall binary size. The size of gcc compiled by clang actually shrunk by a couple bytes with this patch applied, but not significantly. llvm-svn: 161413	2012-08-07 11:13:19 +00:00
Chandler Carruth	ca6b087618	Fix PR13412, a nasty miscompile due to the interleaved instsimplify+inline strategy. The crux of the problem is that instsimplify was reasonably relying on an invariant that is true within any single function, but is no longer true mid-inline the way we use it. This invariant is that an argument pointer != a local (alloca) pointer. The fix is really light weight though, and allows instsimplify to be resiliant to these situations: when checking the relation ships to function arguments, ensure that the argumets come from the same function. If they come from different functions, then none of these assumptions hold. All credit to Benjamin Kramer for coming up with this clever solution to the problem. llvm-svn: 161410	2012-08-07 10:59:59 +00:00
Chandler Carruth	49d4e3f282	Add a much more conservative strategy for aligning branch targets. Previously, MBP essentially aligned every branch target it could. This bloats code quite a bit, especially non-looping code which has no real reason to prefer aligned branch targets so heavily. As Andy said in review, it's still a bit odd to do this without a real cost model, but this at least has much more plausible heuristics. Fixes PR13265. llvm-svn: 161409	2012-08-07 09:45:24 +00:00
Nadav Rotem	1fbb339620	When constant folding GEP expressions, keep the address space information of pointers. Together with Ran Chachick <ran.chachick@intel.com> llvm-svn: 160954	2012-07-30 07:25:20 +00:00
Nick Lewycky	a1eb0b5f2e	Add testcases for GlobalOpt changes in r160693 and r160757. llvm-svn: 160925	2012-07-29 01:15:37 +00:00
Nuno Lopes	a4d7ce1441	fix PR13390: do not loop forever with self-referencing self instructions llvm-svn: 160876	2012-07-27 18:21:15 +00:00
Nuno Lopes	7ec9936cb2	fix infinite loop in instcombine in the presence of a (malformed) self-referencing select inst. This can happen as long as the instruction is not reachable. Instcombine does generate these unreachable malformed selects when doing RAUW llvm-svn: 160874	2012-07-27 18:03:57 +00:00
Pete Cooper	ddb89a91ca	Simplify demanded bits of select sources where the condition is a constant vector llvm-svn: 160835	2012-07-26 23:10:24 +00:00
Pete Cooper	8d971d19cb	Teach SimplifyDemandedBits how to look through fpext and fptrunc to simplify their operand llvm-svn: 160823	2012-07-26 22:37:04 +00:00
Duncan Sands	c785ace7fd	Stop reassociate from looking through expressions of arbitrary complexity. This is a temporary measure until my fix for PR13021 is ready. llvm-svn: 160778	2012-07-26 09:26:40 +00:00
Duncan Sands	9f6bce9180	Don't perform an overaligned load in this test, since that's undefined behaviour that might be exploited one day. llvm-svn: 160714	2012-07-25 09:45:37 +00:00
Duncan Sands	8080fe449f	When folding a load from a global constant, if the load started in the middle of an array element (rather than at the beginning of the element) and extended into the next element, then the load from the second element was being handled wrong due to incorrect updating of the notion of which byte to load next. This fixes PR13442. Thanks to Chris Smowton for reporting the problem, analyzing it and providing a fix. llvm-svn: 160711	2012-07-25 09:14:54 +00:00
Nuno Lopes	06ac861756	teach objectsize about strdup() and strndup() llvm-svn: 160676	2012-07-24 16:28:13 +00:00
Nick Lewycky	6644694650	Teach globalopt to not nuke all stores to globals. Keep them around of they might be deliberate "one time" leaks, so that leak checkers can find them. This is a reapply of r160602 with the fix that this time I'm committing the code I thought I was committing last time; the I->eraseFromParent() goes after the break out of the loop. llvm-svn: 160664	2012-07-24 07:21:08 +00:00
Dan Gohman	b9b982cd41	An objc_retain can serve as a may-use for a different pointer. rdar://11931823. llvm-svn: 160637	2012-07-23 19:27:31 +00:00
Nick Lewycky	f1a5d95995	Revert r160602. llvm-svn: 160603	2012-07-21 09:03:15 +00:00
Nick Lewycky	9d1d5bfd50	Teach globalopt to play nice with leak checkers. This is a reapplication of r160529 that was subsequently reverted. The fix was to not call GV->eraseFromParent() right before the caller does the same. The existing testcases already caught this bug if run under valgrind. llvm-svn: 160602	2012-07-21 08:29:45 +00:00
Nuno Lopes	66a3934c7a	move the bounds checking pass to the instrumentation folder, where it belongs. I dunno why in the world I dropped it in the Scalar folder in the first place. No functionality change. llvm-svn: 160587	2012-07-20 22:39:33 +00:00
Richard Osborne	f82086baa5	Fix assertion in jump threading (PR13405). GetBestDestForJumpOnUndef() assumes there is at least 1 successor, which isn't true if the block ends in an indirect branch with no successors. Fix this by bailing out earlier in this case. llvm-svn: 160546	2012-07-20 10:36:17 +00:00
Nick Lewycky	62064c6cc3	Revert r160529 due to crashes. llvm-svn: 160532	2012-07-19 23:59:21 +00:00
Nick Lewycky	8a31eaccbd	Don't wipe out global variables that are probably storing pointers to heap memory. This makes clang play nice with leak checkers. llvm-svn: 160529	2012-07-19 22:35:28 +00:00
Andrew Trick	db674bed44	Added unit test for PR13361: LSR + SCEV "hangs" on reasonably sized test. llvm-svn: 160439	2012-07-18 18:07:52 +00:00
Andrew Trick	612785f908	indvars: Linear function test replace should avoid reusing undef. Fixes PR13371: indvars pass incorrectly substitutes 'undef' values. I do not like this fix. It's needed until/unless the meaning of undef changes. It attempts to be complete according to the IR spec, but I don't have much confidence in the implementation given the difficulty testing undefined behavior. Worse, this invalidates some of my hard-fought work on indvars and LSR to optimize pointer induction variables. It results benchmark regressions, which I'll track internally. On x86_64 no LTO I see: -3% huffbench -3% 400.perlbench -8% fhourstones My only suggestion for recovering is to change the meaning of undef. If we could trust an arbitrary instruction to produce a some real value that can be manipulated (e.g. incremented) according to non-undef rules, then this case could be easily handled with SCEV. llvm-svn: 160421	2012-07-18 04:35:10 +00:00

1 2 3 4 5 ...

3037 Commits