llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-29 06:53:53 +00:00

Author	SHA1	Message	Date
NAKAMURA Takumi	8068191bb8	BBVectorize.cpp: Try to fix MSVC build. map::iterator and multimap::iterator are incompatible. llvm-svn: 149475	2012-02-01 06:11:58 +00:00
Hal Finkel	15dafd979c	A few of the changes suggested in code review (by Nick Lewycky) llvm-svn: 149472	2012-02-01 05:51:45 +00:00
Argyrios Kyrtzidis	492f34016f	Revert Chris' commits up to r149348 that started causing VMCoreTests unit test to fail. These are: r149348 r149351 r149352 r149354 r149356 r149357 r149361 r149362 r149364 r149365 llvm-svn: 149470	2012-02-01 04:51:17 +00:00
Hal Finkel	8cf5de5774	Add a basic-block autovectorization pass. This is the initial checkin of the basic-block autovectorization pass along with some supporting vectorization infrastructure. Special thanks to everyone who helped review this code over the last several months (especially Tobias Grosser). llvm-svn: 149468	2012-02-01 03:51:43 +00:00
Jim Grosbach	6186319c3f	Disable InstCombine unsafe folding bitcasts of calls w/ varargs. Changing arguments from being passed as fixed to varargs is unsafe, as the ABI may require they be handled differently (stack vs. register, for example). Remove two tests which rely on the bitcast being folded into the direct call, which is exactly the transformation that's unsafe. llvm-svn: 149457	2012-02-01 00:08:17 +00:00
Lenny Maiorani	e3f2427596	bz11794 : EarlyCSE stack overflow on long functions. Make the EarlyCSE optimizer not use recursion to do a depth first iteration. llvm-svn: 149445	2012-01-31 23:14:41 +00:00
Bill Wendling	0ebee7acc5	Increase the initial vector size to be equivalent to the size of the Deps vector. This potentially saves a resizing. llvm-svn: 149369	2012-01-31 07:04:52 +00:00
Bill Wendling	91826c63c8	Cache the size of the vector instead of calling .size() all over the place. llvm-svn: 149368	2012-01-31 06:57:53 +00:00
Chris Lattner	a9b3505e9a	eliminate the "string" form of ConstantArray::get, using ConstantDataArray::getString instead. llvm-svn: 149365	2012-01-31 06:18:43 +00:00
Chris Lattner	3ca194bce8	eliminate the last uses of GetConstantStringInfo from this file, I didn't realize I was that close... llvm-svn: 149354	2012-01-31 04:54:27 +00:00
Chris Lattner	96d5f62396	start moving SimplifyLibcalls over to getConstantStringInfo, which is dramatically more efficient than GetConstantStringInfo. llvm-svn: 149352	2012-01-31 04:43:11 +00:00
Chris Lattner	054e412291	enhance logic to support ConstantDataArray. llvm-svn: 149340	2012-01-31 02:55:06 +00:00
Bill Wendling	0e5559bc4d	s/getInnerUnwindDest/getInnerResumeDest/g llvm-svn: 149328	2012-01-31 01:48:40 +00:00
Bill Wendling	a2a758eff6	Remove ivar which is identical to another ivar. llvm-svn: 149323	2012-01-31 01:25:54 +00:00
Bill Wendling	cf0fa6b786	Remove unused ivars and s/getOuterUnwindDest/getOuterResumeDest/g. llvm-svn: 149322	2012-01-31 01:22:03 +00:00
Bill Wendling	1b3c45b027	Remove more dead functions. llvm-svn: 149318	2012-01-31 01:18:21 +00:00
Bill Wendling	1be31fd0b1	s/getInnerUnwindDestNewEH/getInnerUnwindDest/g llvm-svn: 149317	2012-01-31 01:15:59 +00:00
Bill Wendling	6a5ff9da71	Remove some unused, old-EH methods. llvm-svn: 149316	2012-01-31 01:14:49 +00:00
Bill Wendling	ab823c2ad4	Get rid of references to dead intrinsics. The eh.selector and eh.resume intrinsics aren't used anymore. Get rid of some calls to them. llvm-svn: 149314	2012-01-31 01:05:20 +00:00
Bill Wendling	d9f60f87d0	Formatting cleanups. No functionality change. llvm-svn: 149312	2012-01-31 01:01:16 +00:00
Bill Wendling	92e817c1cc	Remove no-longer-useful dyn_casts and pals. llvm-svn: 149307	2012-01-31 00:56:53 +00:00
Kostya Serebryany	e5567f0fe4	[asan] fix the ObjC support (asan Issue #33 ) llvm-svn: 149300	2012-01-30 23:50:10 +00:00
Chad Rosier	6e1866cd0a	Typo. llvm-svn: 149289	2012-01-30 22:44:13 +00:00
Chad Rosier	908361b6a5	Typo. llvm-svn: 149275	2012-01-30 21:13:22 +00:00
Alexander Potapenko	bc80f88393	Fix compilation of ASan tests on OS X Lion (see http://code.google.com/p/address-sanitizer/issues/detail?id=32 ) The redzones emitted by AddressSanitizer for CFString instances confuse the linker and are of little use, so we shouldn't add them. llvm-svn: 149243	2012-01-30 10:40:22 +00:00
Nick Lewycky	8fba23af8b	Fix typo. llvm-svn: 149185	2012-01-28 23:33:44 +00:00
Kostya Serebryany	0a789a0d8c	[asan] correctly use ConstantExpr::getGetElementPtr. Catch by NAKAMURA Takumi llvm-svn: 149172	2012-01-28 04:27:16 +00:00
Chris Lattner	466ff4b5f1	continue making the world safe for ConstantDataVector. At this point, we should (theoretically) optimize and codegen ConstantDataVector as well as ConstantVector. llvm-svn: 149116	2012-01-27 03:08:05 +00:00
Chris Lattner	fd273f7516	Continue improving support for ConstantDataAggregate, and use the new methods recently added to (sometimes greatly!) simplify code. llvm-svn: 149024	2012-01-26 02:32:04 +00:00
Chris Lattner	1c318c6440	some general cleanup, using new methods and tidying up old code. llvm-svn: 149006	2012-01-26 00:42:34 +00:00
Nick Lewycky	670c3c4fb5	Gracefully degrade precision in branch probability numbers. llvm-svn: 148946	2012-01-25 09:43:14 +00:00
Chris Lattner	4218b7524c	use Constant::getAggregateElement to simplify a bunch of code. llvm-svn: 148934	2012-01-25 06:48:06 +00:00
Chris Lattner	473bdbaabc	use ConstantVector::getSplat in a few places. llvm-svn: 148929	2012-01-25 06:02:56 +00:00
Kostya Serebryany	ca8b911b2d	[asan] enable asan only for the functions that have Attribute::AddressSafety llvm-svn: 148846	2012-01-24 19:34:43 +00:00
Chris Lattner	380a325520	basic instcombine support for CDS. llvm-svn: 148806	2012-01-24 14:31:22 +00:00
Alexander Potapenko	bd636a5fe8	Implemented AddressSanitizer::getPassName() llvm-svn: 148697	2012-01-23 11:22:43 +00:00
David Blaikie	06ecc99a56	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
Andrew Trick	207780ec8e	Handle a corner case with IV chain collection with bailout instead of assert. Fixes PR11783: bad cast to AddRecExpr. llvm-svn: 148572	2012-01-20 21:23:40 +00:00
Kostya Serebryany	b37a1263e1	Extend Attributes to 64 bits Problem: LLVM needs more function attributes than currently available (32 bits). One such proposed attribute is "address_safety", which shows that a function is being checked for address safety (by AddressSanitizer, SAFECode, etc). Solution: - extend the Attributes from 32 bits to 64-bits - wrap the object into a class so that unsigned is never erroneously used instead - change "unsigned" to "Attributes" throughout the code, including one place in clang. - the class has no "operator uint64 ()", but it has "uint64_t Raw() " to support packing/unpacking. - the class has "safe operator bool()" to support the common idiom: if (Attributes attr = getAttrs()) useAttrs(attr); - The CTOR from uint64_t is marked explicit, so I had to add a few explicit CTOR calls - Add the new attribute "address_safety". Doing it in the same commit to check that attributes beyond first 32 bits actually work. - Some of the functions from the Attribute namespace are worth moving inside the class, but I'd prefer to have it as a separate commit. Tested: "make check" on Linux (32-bit and 64-bit) and Mac (10.6) built/run spec CPU 2006 on Linux with clang -O2. This change will break clang build in lib/CodeGen/CGCall.cpp. The following patch will fix it. llvm-svn: 148553	2012-01-20 17:56:17 +00:00
Andrew Trick	be3e9530e1	SCEVExpander fixes. Affects LSR and indvars. LSR has gradually been improved to more aggressively reuse existing code, particularly existing phi cycles. This exposed problems with the SCEVExpander's sloppy treatment of its insertion point. I applied some rigor to the insertion point problem that will hopefully avoid an endless bug cycle in this area. Changes: - Always used properlyDominates to check safe code hoisting. - The insertion point provided to SCEV is now considered a lower bound. This is usually a block terminator or the use itself. Under no circumstance may SCEVExpander insert below this point. - LSR is responsible for finding a "canonical" insertion point across expansion of different expressions. - Robust logic to determine whether IV increments are in "expanded" form and/or can be safely hoisted above some insertion point. Fixes PR11783: SCEVExpander assert. llvm-svn: 148535	2012-01-20 07:41:13 +00:00
Dan Gohman	9bb84ffb6c	Set the "tail" flag on pattern-matched objc_storeStrong calls. rdar://10531041. llvm-svn: 148490	2012-01-19 19:14:36 +00:00
Nick Lewycky	522fc7d29f	Actually, this code handles wrapped sets just fine. Noticed by inspection. llvm-svn: 148487	2012-01-19 18:19:42 +00:00
Dan Gohman	7e17e84f9c	Add a depth limit to avoid runaway recursion. llvm-svn: 148419	2012-01-18 21:24:45 +00:00
Dan Gohman	48f4e5752e	Use llvm.global_ctors to locate global constructors instead of recognizing them by name. llvm-svn: 148416	2012-01-18 21:19:38 +00:00
Jakub Staszak	edd8e46c61	Remove trailing spaces and unneeded includes. llvm-svn: 148415	2012-01-18 21:16:33 +00:00
Dan Gohman	9b37a5592c	Add a new ObjC ARC optimization pass to eliminate unneeded autorelease push+pop pairs. llvm-svn: 148330	2012-01-17 20:52:24 +00:00
Dan Gohman	45976c3482	Add a new PassManagerBuilder customization point, EP_ModuleOptimizerEarly, to allow passes to be added before the main ModulePass optimizers. llvm-svn: 148329	2012-01-17 20:51:32 +00:00
Andrew Trick	f2988aa6f4	LSR fix: broaden the check for loop preheaders. It's becoming clear that LoopSimplify needs to unconditionally create loop preheaders. But that is a bigger fix. For now, continuing to hack LSR. Fixes rdar://10701050 "Cannot split an edge from an IndirectBrInst" assert. llvm-svn: 148288	2012-01-17 06:45:52 +00:00
David Blaikie	2526691971	Remove unreachable code. (replace with llvm_unreachable to help GCC where necessary) llvm-svn: 148284	2012-01-17 04:43:56 +00:00
Stepan Dyatkovskiy	38bc1a8899	Fixed comment in loop-unswitch. llvm-svn: 148252	2012-01-16 20:48:04 +00:00
Stepan Dyatkovskiy	2e727a1727	Cosmetic patch for r148215. llvm-svn: 148216	2012-01-15 09:45:11 +00:00
Stepan Dyatkovskiy	d7b16b0e44	Fixup for r148132. Type replacement for LoopsProperties: from DenseMap to std::map, since we need to keep a valid pointer to properties of current loop. Message for r148132: LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache). llvm-svn: 148215	2012-01-15 09:44:07 +00:00
Dan Gohman	4539e2a975	Fix an unused variable warning that Chad noticed. llvm-svn: 148164	2012-01-14 00:47:44 +00:00
Eli Friedman	a70048903b	Speculatively revert r148132+r148133 to try and fix a buildbot failure. llvm-svn: 148149	2012-01-13 22:34:39 +00:00
Stepan Dyatkovskiy	81514d2471	Cosmetic patch for r148132. llvm-svn: 148133	2012-01-13 19:27:22 +00:00
Stepan Dyatkovskiy	94682abb75	LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache). llvm-svn: 148132	2012-01-13 19:13:54 +00:00
Dan Gohman	922244c634	Implement proper ObjC ARC objc_retainBlock "escape" analysis, so that the optimizer doesn't eliminate objc_retainBlock calls which are needed for their side effect of copying blocks onto the heap. This implements rdar://10361249. llvm-svn: 148076	2012-01-13 00:39:07 +00:00
Eli Friedman	487774ddbe	Re-fix the issue Bill fixed in r147899 in a slightly different way, which doesn't abuse the semantics of linker_private. We don't really want to merge any string constant with a weak_odr global. llvm-svn: 147971	2012-01-11 22:06:46 +00:00
Kostya Serebryany	03444e57e8	[asan] extend the workaround for http://llvm.org/bugs/show_bug.cgi?id=11395 : don't instrument the function at all on x86_32 if it has a large asm blob llvm-svn: 147953	2012-01-11 18:15:23 +00:00
Stepan Dyatkovskiy	7ba274153a	Improved compile time: 1. Size heuristics changed. Now we calculate number of unswitching branches only once per loop. 2. Some checks were moved from UnswitchIfProfitable to processCurrentLoop, since what they check does not change during a processCurrentLoop iteration. This allows deciding to skip some loops at an early stage. Extended statistics: - Added total number of instructions analyzed. llvm-svn: 147935	2012-01-11 08:40:51 +00:00
Bill Wendling	2a03f15116	If the global variable is removed by the linker, then don't constant merge it with other symbols. An object in the __cfstring section is supposed to be filled with CFString objects, which have a pointer to ___CFConstantStringClassReference followed by a pointer to a __cstring. If we allow the object in the __cstring section to be merged with another global, then it could end up in any section. Because the linker is going to remove these symbols in the final executable, we shouldn't bother to merge them. <rdar://problem/10564621> llvm-svn: 147899	2012-01-11 00:13:08 +00:00
Andrew Trick	db66631fb3	Enable LSR IV Chains with sufficient heuristics. These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target specific optimization following LSR. The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet. As a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address plus stride addition in the addressing mode. GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826	2012-01-10 01:45:08 +00:00
Andrew Trick	09d73ea35b	Adding IV chain generation to LSR. After collecting chains, check if any should be materialized. If so, hide the chained IV users from the LSR solver. LSR will only solve for the head of the chain. GenerateIVChains will then materialize the chained IV users by computing the IV relative to its previous value in the chain. In theory, chained IV users could be exposed to LSR's solver. This would be considerably complicated to implement and I'm not aware of a case where we need it. In practice it's more important to intelligently prune the search space of nontrivial loops before running the solver, otherwise the solver is often forced to prune the most optimal solutions. Hiding the chained users does this well, so that LSR is more likely to find the best IV for the chain as a whole. llvm-svn: 147801	2012-01-09 21:18:52 +00:00
Andrew Trick	b6ee006eaf	Adding collection of IV chains to LSR. This collects a set of IV uses within the loop whose values can be computed relative to each other in a sequence. Following checkins will make use of this information. llvm-svn: 147797	2012-01-09 19:50:34 +00:00
Andrew Trick	b442611358	"Minor LSR debugging stuff" llvm-svn: 147785	2012-01-09 18:58:16 +00:00
Benjamin Kramer	48d318717f	Move assert to the right place. llvm-svn: 147779	2012-01-09 17:36:29 +00:00
Benjamin Kramer	f9cefbfed0	InstCombine: Teach foldLogOpOfMaskedICmpsHelper that sign bit tests are bit tests. This subsumes several other transforms while enabling us to catch more cases. llvm-svn: 147777	2012-01-09 17:23:27 +00:00
Benjamin Kramer	e1321329f4	Tweak my last commit to be less conservative about uses. We still save an instruction when just the "and" part is replaced. Also change the code to match comments more closely. llvm-svn: 147753	2012-01-08 21:12:51 +00:00
Benjamin Kramer	e94856c8c4	InstCombine: If we have a bit test and a sign test anded/ored together, merge the sign bit into the bit test. This is common in bit field code, e.g. checking if the first or the last bit of a bit field is set. llvm-svn: 147749	2012-01-08 18:32:24 +00:00
Andrew Trick	60dbff489b	Enable redundant phi elimination after LSR. This will be more important as we extend the LSR pass in ways that don't rely on the formula solver. In particular, we need it for constructing IV chains. llvm-svn: 147724	2012-01-07 07:08:17 +00:00
Andrew Trick	d9eb9c8780	LSR: Don't optimize loops if an outer loop has no preheader. LoopSimplify may not run on some outer loops, e.g. because of indirect branches. SCEVExpander simply cannot handle outer loops with no preheaders. Fixes rdar://10655343 SCEVExpander segfault. llvm-svn: 147718	2012-01-07 03:16:50 +00:00
Andrew Trick	5b49f3b782	LSR: run DeleteDeadPhis before replaceCongruentPhis. llvm-svn: 147711	2012-01-07 01:36:44 +00:00
Andrew Trick	8a5a1e603e	Extended replaceCongruentPhis to handle mixed phi types. llvm-svn: 147707	2012-01-07 01:12:09 +00:00
Kostya Serebryany	a7d1b796b9	[asan] cleanup: remove the SIGILL-related code (compiler part) llvm-svn: 147667	2012-01-06 18:09:21 +00:00
Dan Gohman	a4fde8485d	Fix SpeculativelyExecuteBB to either speculate all or none of the phis present in the bottom of the CFG triangle, as the transformation isn't ever valuable if the branch can't be eliminated. Also, unify some heuristics between SimplifyCFG's multiple if-converters, for consistency. This fixes rdar://10627242. llvm-svn: 147630	2012-01-05 23:58:56 +00:00
Eli Friedman	5af9c3cbbb	PR11705, part 2: globalopt shouldn't put inttoptr/ptrtoint operations into global initializers if there's an implied extension or truncation. llvm-svn: 147625	2012-01-05 23:03:32 +00:00
Dan Gohman	4fc691d9ef	Revert r56315. When the instruction to speculate is a load, this code can incorrectly move the load across a store. This never happens in practice today, but only because the current heuristics accidentally preclude it. llvm-svn: 147623	2012-01-05 22:54:35 +00:00
Nick Lewycky	b02dd59b35	SCCCaptured is trivially false on entry to this loop and not modified inside it. Eliminate the dead test for it on each loop iteration. No functionality change. llvm-svn: 147616	2012-01-05 22:21:45 +00:00
Nick Lewycky	44fbcb95ba	Remove pointless asserts. llvm-svn: 147529	2012-01-04 09:42:30 +00:00
Nick Lewycky	d6260dc3cb	Teach instcombine all sorts of great stuff about shifts that have exact, nuw or nsw bits on them. llvm-svn: 147528	2012-01-04 09:28:29 +00:00
Nick Lewycky	c7e12f7dbf	Make use of the exact bit when optimizing '(X >>exact 3) << 1' to eliminate the 'and' that would zero out the trailing bits, and to produce an exact shift ourselves. llvm-svn: 147391	2011-12-31 21:30:22 +00:00
Nick Lewycky	7425820374	Change CaptureTracking to pass a Use* instead of a Value* when a value is captured. This allows the tracker to look at the specific use, which may be especially interesting for function calls. Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does not iterate until a fixpoint and does not guarantee that it produces the same result regardless of iteration order. The new implementation builds up a graph of how arguments are passed from function to function, and uses a bottom-up walk on the argument-SCCs to assign nocapture. This gets us nocapture more often, and does so rather efficiently and independent of iteration order. llvm-svn: 147327	2011-12-28 23:24:21 +00:00
Nick Lewycky	17d4e8dae6	Demystify this comment. llvm-svn: 147307	2011-12-28 06:57:32 +00:00
Nick Lewycky	4c5662bae0	Use false not zero, as a bool. llvm-svn: 147292	2011-12-27 18:27:22 +00:00
Nick Lewycky	f4c21901a3	Turn cos(-x) into cos(x). Patch by Alexander Malyshev! llvm-svn: 147291	2011-12-27 18:25:50 +00:00
Nick Lewycky	295e397220	Teach simplifycfg to recompute branch weights when merging some branches, and to discard weights when appropriate. Still more to do (and a new TODO), but it's a start! llvm-svn: 147286	2011-12-27 04:31:52 +00:00
Rafael Espindola	d448dfaa25	Fix warning. llvm-svn: 147284	2011-12-26 23:12:42 +00:00
Nick Lewycky	56e04db381	Update the branch weight metadata when reversing the order of a branch. llvm-svn: 147280	2011-12-26 20:54:14 +00:00
Nick Lewycky	80b1ea7ea3	Sort includes, canonicalize whitespace, fix typos. No functionality change. llvm-svn: 147279	2011-12-26 20:37:40 +00:00
Benjamin Kramer	94f07f8c2c	InstCombine: Add a combine that turns (2^n)-1 ^ x back into (2^n)-1 - x iff x is smaller than 2^n and it fuses with a following add. This was intended to undo the sub canonicalization in cases where it's not profitable, but it also finds some cases on its own. llvm-svn: 147256	2011-12-24 17:31:53 +00:00
Benjamin Kramer	0b4d2e3d2a	InstCombine: Canonicalize (2^n)-1 - x into (2^n)-1 ^ x iff x is known to be smaller than 2^n. This has the obvious advantage of being commutable and is always a win on x86 because const - x wastes a register there. On less weird architectures this may lead to a regression because other arithmetic doesn't fuse with it anymore. I'll address that problem in a followup. llvm-svn: 147254	2011-12-24 17:31:38 +00:00
Nick Lewycky	a3bc09fec4	Fix typo "infinte". llvm-svn: 147226	2011-12-23 23:49:25 +00:00
Mon P Wang	e062e56cab	When not destroying the source, the linker is not remapping the types. Added support to CloneFunctionInto to allow remapping for this case. llvm-svn: 147217	2011-12-23 02:18:32 +00:00
Chad Rosier	0bfa96dd95	Add the actual code for r147175. llvm-svn: 147176	2011-12-22 21:10:46 +00:00
Chad Rosier	4ab165f664	Speculatively revert r146578 to determine if it is the cause of a number of performance regressions (both execution-time and compile-time) on our nightly testers. Original commit message: Fix for bug #11429: Wrong behaviour for switches. Small improvement for code size heuristics. llvm-svn: 147131	2011-12-22 02:40:57 +00:00
Dan Gohman	17bd9795e9	Fix a copy+pasto. No testcase, because the symptoms of dereferencing an invalid iterator aren't reproducible. rdar://10614085. llvm-svn: 147098	2011-12-21 21:43:50 +00:00
Nick Lewycky	9adbd36737	Make some intrinsics safe to speculatively execute. llvm-svn: 147036	2011-12-21 05:52:02 +00:00
David Blaikie	576aba04f1	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146960	2011-12-20 02:50:00 +00:00
Jakub Staszak	b62af984a1	- Use getExitingBlock instead of getExitingBlocks. - Remove trailing spaces. llvm-svn: 146854	2011-12-18 21:52:30 +00:00
Kevin Enderby	42fffe915a	Revert r146822 at Pete Cooper's request as it broke clang self hosting. Hope I did this correctly :) llvm-svn: 146834	2011-12-17 19:48:52 +00:00
Pete Cooper	0ec73f6e98	SimplifyCFG now predicts some conditional branches to true or false depending on previous branch on same comparison operands. For example, if (a == b) { if (a > b) // this is false Fixes some of the issues on <rdar://problem/10554090> llvm-svn: 146822	2011-12-17 06:32:38 +00:00
Pete Cooper	6627aaa416	Refactor code used in InstCombine::FoldAndOfICmps to new file. This will be used by SimplifyCfg in a later commit. llvm-svn: 146803	2011-12-17 01:20:32 +00:00
Dan Gohman	9c8c9a8f62	The powers that be have decided that LLVM IR should now support 16-bit "half precision" floating-point with a first-class type. This patch adds basic IR support (but not codegen support). llvm-svn: 146786	2011-12-17 00:04:22 +00:00
Andrew Trick	df958760c8	Avoid a confusing assert for silly options: -unroll-runtime -unroll-count=1. No need for an explicit test case for an unsupported combination of options. llvm-svn: 146721	2011-12-16 02:03:48 +00:00
Kostya Serebryany	847d2d2c25	[asan] add the name of the module to the description of a global variable. This improves the readability of global-buffer-overflow reports. llvm-svn: 146698	2011-12-15 22:55:55 +00:00
Kostya Serebryany	65849ee22a	[asan] fix a bug (issue 19) where dlclose and the following mmap caused a false positive. compiler part. llvm-svn: 146688	2011-12-15 21:59:03 +00:00
Pete Cooper	550b96ab46	Added InstCombine for "select cond, ~cond, x" type patterns. These can be reduced to "~cond & x" or "~cond | x" llvm-svn: 146624	2011-12-15 00:56:45 +00:00
Eli Friedman	5dd57bb40a	Make loop preheader insertion in LoopSimplify handle the case where the loop header is a landing pad correctly (by splitting the landingpad out of the loop header). Make some adjustments to the rest of LoopSimplify to make it clear that the rest of LoopSimplify isn't making bad assumptions about the presence of landing pads. PR11575. llvm-svn: 146621	2011-12-15 00:50:34 +00:00
Dan Gohman	1add31cc93	Move Instruction::isSafeToSpeculativelyExecute out of VMCore and into Analysis as a standalone function, since there's no need for it to be in VMCore. Also, update it to use isKnownNonZero and other goodies available in Analysis, making it more precise, enabling more aggressive optimization. llvm-svn: 146610	2011-12-14 23:49:11 +00:00
Stepan Dyatkovskiy	14cb78c6fb	Fix for bug #11429 : Wrong behaviour for switches. Small improvement for code size heuristics. llvm-svn: 146578	2011-12-14 19:19:17 +00:00
Dan Gohman	e9572aa680	It turns out that clang does use pointer-to-function types to point to ARC-managed pointers sometimes. This fixes rdar://10551239. llvm-svn: 146577	2011-12-14 19:10:53 +00:00
Kostya Serebryany	7d3cbe255e	[asan] remove .preinit_array from the compiler module (it breaks .so builds). This should be done in the run-time. llvm-svn: 146527	2011-12-14 00:01:51 +00:00
Kostya Serebryany	8c4be7b06e	[asan] report an error if blacklist file contains a malformed regex. fixes asan issue 17 llvm-svn: 146503	2011-12-13 19:34:53 +00:00
Andrew Trick	c86869b858	Cleanup. Clarify LSRInstance public methods. llvm-svn: 146459	2011-12-13 00:55:33 +00:00
Andrew Trick	67432b451b	Indvars: guard against exponential behavior in isHighCostExpansion. This should always be done as a matter of principle. I don't have a case that exposes the problem. I just noticed this recently while scanning the code and realized I meant to fix it long ago. llvm-svn: 146438	2011-12-12 22:46:16 +00:00
Daniel Dunbar	b72534060e	LLVMBuild: Introduce a common section which currently has a list of the subdirectories to traverse into. - Originally I wanted to avoid this and just autoscan, but this has one key flaw in that new subdirectories can not automatically trigger a rerun of the llvm-build tool. This is particularly a pain when switching back and forth between trees where one has added a subdirectory, as the dependencies will tend to be wrong. This will also eliminate a FIXME implicitly. llvm-svn: 146436	2011-12-12 22:45:54 +00:00
Joerg Sonnenberger	5b25b4d437	Only replace fwrite with fputc, if the return value is unused. llvm-svn: 146411	2011-12-12 20:18:31 +00:00
Daniel Dunbar	30d6a45140	LLVMBuild: Remove trailing newline, which irked me. llvm-svn: 146409	2011-12-12 19:48:00 +00:00
Dan Gohman	40acf5d720	When computing reverse-CFG reverse-post-order, skip backedges, as detected in the forward-CFG DFS. This prevents the reverse-CFG from visiting blocks inside loops after blocks that dominate them in the case where loops have multiple exits. No testcase, because this fixes a bug which in practice only shows up in a full optimizer run, due to the use-list order. This fixes rdar://10422791 and others. llvm-svn: 146408	2011-12-12 19:42:25 +00:00
Dan Gohman	73c245acaa	Add a TODO comment. llvm-svn: 146389	2011-12-12 18:30:26 +00:00
Dan Gohman	9144e6bb3e	Fix a copy+pasto in a comment. llvm-svn: 146385	2011-12-12 18:20:00 +00:00
Dan Gohman	ee8b344c67	Use getArgOperand instead of getOperand on a call. llvm-svn: 146384	2011-12-12 18:19:12 +00:00
Dan Gohman	61f78d27b0	Inline SetSeqToRelease into its only caller, since it's more clear that way. llvm-svn: 146383	2011-12-12 18:16:56 +00:00
Dan Gohman	3ebb9f4498	Fix omitted break statements in a switch. llvm-svn: 146380	2011-12-12 18:13:53 +00:00
Kostya Serebryany	da73ce34a1	[asan] use .preinit_array only on linux llvm-svn: 146379	2011-12-12 18:01:46 +00:00
Chandler Carruth	083a91fab1	Switch llvm.cttz and llvm.ctlz to accept a second i1 parameter which indicates whether the intrinsic has a defined result for a first argument equal to zero. This will eventually allow these intrinsics to accurately model the semantics of GCC's __builtin_ctz and __builtin_clz and the X86 instructions (prior to AVX) which implement them. This patch merely sets the stage by extending the signature of these intrinsics and establishing auto-upgrade logic so that the old spelling still works both in IR and in bitcode. The upgrade logic preserves the existing (inefficient) semantics. This patch should not change any behavior. CodeGen isn't updated because it can use the existing semantics regardless of the flag's value. Note that this will be followed by API updates to Clang and DragonEgg. Reviewed by Nick Lewycky! llvm-svn: 146357	2011-12-12 04:26:04 +00:00
Andrew Trick	65fca605c6	LSR: ignore strides in outer loops. Since we're not rewriting IVs in other loops, there's not much reason to consider their stride when generating formulae. This should reduce the number of useless formulas considered by LSR. llvm-svn: 146302	2011-12-10 00:25:00 +00:00
Kostya Serebryany	e3281eff91	[asan] call __asan_init from .preinit_array. This simplifies __asan_init vs malloc chicken-and-egg situation on Android and probably on other flavours of Linux. Patch by eugenis@google.com. llvm-svn: 146284	2011-12-09 22:09:32 +00:00
Jakub Staszak	4077c5b401	SplitBlockPredecessors uses ArrayRef instead of Data and Size. llvm-svn: 146277	2011-12-09 21:19:53 +00:00
Andrew Trick	4f0b3bb42b	Add -unroll-runtime for unrolling loops with run-time trip counts. Patch by Brendon Cahoon! This extends the existing LoopUnroll and LoopUnrollPass. Brendon measured no regressions in the llvm test suite with -unroll-runtime enabled. This implementation works by using the existing loop unrolling code to unroll the loop by a power-of-two (default 8). It generates an if-then-else sequence of code prior to the loop to execute the extra iterations before entering the unrolled loop. llvm-svn: 146245	2011-12-09 06:19:40 +00:00
Nick Lewycky	d2c1661e9f	Fix infinite loop in DSE when deleting a free in a reachable loop that's also trivially infinite. llvm-svn: 146197	2011-12-08 22:36:35 +00:00
Duncan Sands	8b1c4875c3	Remove unused include. llvm-svn: 146037	2011-12-07 17:18:31 +00:00
Benjamin Kramer	7df1659ad7	Simplify common predecessor finding. - Walking over pred_begin/pred_end is an expensive operation. - PHINodes contain a value for each predecessor anyway. - While it may look like we used to save a few iterations with the set, be aware that getIncomingValueForBlock does a linear search on the values of the phi node. - Another -5% on ARMDisassembler.cpp (Release build). This was the last entry in the profile that was obviously wasting time. llvm-svn: 145937	2011-12-06 16:14:29 +00:00
Benjamin Kramer	933fd2afff	Push StringRefs through the metadata interface. llvm-svn: 145934	2011-12-06 11:50:26 +00:00
Andrew Trick	04c98888bc	LSR: prune undesirable formulae early. It's always good to prune early, but formulae that are unsatisfactory in their own right need to be removed before running any other pruning heuristics. We easily avoid generating such formulae, but we need them as an intermediate basis for forming other good formulae. llvm-svn: 145906	2011-12-06 03:13:31 +00:00
Nick Lewycky	d59dcc5ddb	Expose a switch for the new gcov format. llvm-svn: 145880	2011-12-06 00:29:13 +00:00
Chad Rosier	4f8c6f6a9c	Update comment. llvm-svn: 145866	2011-12-05 22:53:09 +00:00
Chad Rosier	c50cbc5a65	Make the MemCpyOptimizer a bit more aggressive. I can't think of a scenario where this would be bad as the backend shouldn't have a problem inlining small memcpys. rdar://10510150 llvm-svn: 145865	2011-12-05 22:37:00 +00:00
Benjamin Kramer	f274d5feac	Add a little heuristic to Value::isUsedInBasicBlock to speed it up for small basic blocks. - Calling getUser in a loop is much more expensive than iterating over a few instructions. - Use it instead of the open-coded loop in AddrModeMatcher. - 5% speedup on ARMDisassembler.cpp Release builds. llvm-svn: 145810	2011-12-05 17:23:27 +00:00
Nadav Rotem	1a91e4381d	Add support for vectors of pointers. llvm-svn: 145801	2011-12-05 06:29:09 +00:00
Pete Cooper	32e376f7e1	Fixed deadstoreelimination bug where negative indices were incorrectly causing the optimisation to occur Turns out long long + unsigned long long is unsigned. Doh! Fixes http://llvm.org/bugs/show_bug.cgi?id=11455 llvm-svn: 145731	2011-12-03 00:04:30 +00:00
Benjamin Kramer	c477861998	Fix quadratic behavior in InlineFunction by fetching the personality function of the callee once and not for every invoke in the caller. The callee is usually smaller than the caller, too. This reduces the compile time of ARMDisassembler.cpp by 32% (Release build). It still takes ages to compile though. llvm-svn: 145690	2011-12-02 18:37:31 +00:00
Chad Rosier	fdca220a9e	Fix a few more places where TargetData/TargetLibraryInfo is not being passed. Add FIXMEs to places that are non-trivial to fix. llvm-svn: 145661	2011-12-02 01:26:24 +00:00
Chad Rosier	0b4bd4832a	Last bit of TargetLibraryInfo propagation. Also fixed a case for TargetData where it appeared beneficial to pass. More of rdar://10500969 llvm-svn: 145630	2011-12-01 21:29:16 +00:00
Pete Cooper	c708e83499	Improved fix for abs(val) != 0 to check other similar case. Also fixed style issues and confusing comment llvm-svn: 145618	2011-12-01 19:13:26 +00:00
Kostya Serebryany	78bd9e81b4	[asan] two minor fixes: use UnreachableInst after the neverreturn function call; use report_fatal_error when blacklist file can not be found llvm-svn: 145611	2011-12-01 18:54:53 +00:00
Pete Cooper	7e03b7250d	Added instcombine pattern to spot comparing -val or val against 0. (val != 0) == (-val != 0) so "abs(val) != 0" becomes "val != 0" Fixes <rdar://problem/10482509> llvm-svn: 145563	2011-12-01 03:58:40 +00:00
Chad Rosier	49a66381f7	Propagate TargetLibraryInfo throughout ConstantFolding.cpp and InstructionSimplify.cpp. Other fixups as needed. Part of rdar://10500969 llvm-svn: 145559	2011-12-01 03:08:23 +00:00
Kostya Serebryany	aa616ec887	make asan work at -O0, llvm part. Patch by glider@google.com llvm-svn: 145530	2011-11-30 22:19:26 +00:00
Eli Friedman	2d9e4aa665	Make GlobalMerge honor the preferred alignment on globals without an explicitly specified alignment. <rdar://problem/10497732>. llvm-svn: 145523	2011-11-30 21:54:15 +00:00
Chad Rosier	8f94cb4dd5	Whitespace. llvm-svn: 145470	2011-11-30 01:59:59 +00:00
Chad Rosier	c5fa9f413a	Add support for sqrt, sqrtl, and sqrtf in TargetLibraryInfo. Disable (fptrunc (sqrt (fpext x))) -> (sqrtf x) transformation if -fno-builtin is specified. rdar://10466410 llvm-svn: 145460	2011-11-29 23:57:10 +00:00
Stepan Dyatkovskiy	56753569db	Potential bug in RewriteLoopBodyWithConditionConstant: use iterator should not be changed inside the uses enumeration loop. llvm-svn: 145432	2011-11-29 20:34:39 +00:00
Daniel Dunbar	4e00f5f8fd	build/CMake: Finish removal of add_llvm_library_dependencies. llvm-svn: 145420	2011-11-29 19:25:30 +00:00
Duncan Sands	97cc6da56c	Fix a theoretical problem (not seen in the wild): if different instances of a weak variable are compiled by different compilers, such as GCC and LLVM, while LLVM may increase the alignment to the preferred alignment there is no reason to think that GCC will use anything more than the ABI alignment. Since it is the GCC version that might end up in the final program (as the linkage is weak), it is wrong to increase the alignment of loads from the global up to the preferred alignment as the alignment might only be the ABI alignment. Increasing alignment up to the ABI alignment might be OK, but I'm not totally convinced that it is. It seems better to just leave the alignment of weak globals alone. llvm-svn: 145413	2011-11-29 18:26:38 +00:00
Andrew Trick	63f81b112e	SCEV fix. In general, Add/Mul expressions should not inherit NSW/NUW. This reverts r139450, fixes r139453, and adds much needed comments and a unit test. llvm-svn: 145367	2011-11-29 02:16:38 +00:00
Eli Friedman	1d55ba306b	Zap some completely ridiculous code. There's probably a miscompile here, but I don't really want to try to write a testcase involving an invoke returning a pointer to a varargs function... llvm-svn: 145347	2011-11-29 01:18:23 +00:00
Eli Friedman	bc47555417	Add a missing safety check to ProcessUGT_ADDCST_ADD. Fixes PR11438. llvm-svn: 145316	2011-11-28 23:32:19 +00:00
Andrew Trick	8c051c1949	Remove the temporary flag -disable-unroll-scev and dead code. SCEV should now be used for trip count analysis, not LoopInfo. llvm-svn: 145262	2011-11-28 19:22:09 +00:00
Nick Lewycky	f8210714b8	Place the "cfg checksum" around a test. This was recently added in April 2011 to gcc, though I thought it was older (my gcc 4.4 has it as a local patch. Whoops!) This fixes PR10589. Also add some debugging statements. Remove GcnoFiles, the mapping from CompilationUnit to raw_ostream. Now that we start by iterating over each CU and descending into them, there's no need to maintain a mapping. llvm-svn: 145208	2011-11-27 23:22:20 +00:00
Benjamin Kramer	d861d825f2	Move code into anonymous namespaces. llvm-svn: 145154	2011-11-26 23:01:57 +00:00
Kostya Serebryany	908509d41a	[asan] do not instrument threadlocal globals, this is buggy llvm-svn: 145092	2011-11-23 02:10:54 +00:00
Nick Lewycky	39c6f0a5d5	Refactor code to use new attribute getters on CallSite for NoCapture and ByVal. Suggested in code review by Eli. That code in InstCombine looks kinda suspicious. llvm-svn: 145013	2011-11-20 19:09:04 +00:00
Kostya Serebryany	3a83736893	[asan] workaround for reg alloc bug 11395: don't instrument functions with large chunks of inline assembler llvm-svn: 144962	2011-11-18 01:41:06 +00:00
Kostya Serebryany	6081213d59	quick fix: remove GlobalVariable::GlobalVariable mistakenly committed at r144933. For some reason this compiles on linux llvm-svn: 144936	2011-11-17 23:37:53 +00:00
Andrew Trick	7dc21d8c0e	Fix an overly general check in SimplifyIndvar to handle useless phi cycles. The right way to check for a binary operation is cast<BinaryOperator>. The original check: cast<Instruction> && numOperands() == 2 would match phi "instructions", leading to an infinite loop in extreme corner case: a useless phi with operands [self, constant] that prior optimization passes failed to remove, being used in the loop by another useless phi, in turn being used by an lshr or udiv. Fixes PR11350: runaway iteration assertion. llvm-svn: 144935	2011-11-17 23:36:35 +00:00
Kostya Serebryany	3b8d362511	fall back to explicit list of allowed linkages when instrumenting globals in asan; add a test check that asan does not touch linkonce_odr llvm-svn: 144933	2011-11-17 23:14:59 +00:00
Eli Friedman	d02d82d355	Add support for custom names for library functions in TargetLibraryInfo. Add a custom name for fwrite and fputs on x86-32 OSX. Make SimplifyLibCalls honor the custom names for fwrite and fputs. Fixes <rdar://problem/9815881>. llvm-svn: 144876	2011-11-17 01:27:36 +00:00
Nick Lewycky	ff690249a9	Merge isObjectPointerWithTrustworthySize with getPointerSize. Use it when looking at the size of the pointee. Fixes PR11390! llvm-svn: 144773	2011-11-16 03:49:48 +00:00
Kostya Serebryany	4105068ea9	AddressSanitizer, first commit (compiler module only) llvm-svn: 144758	2011-11-16 01:35:23 +00:00
Kostya Serebryany	dedd750c82	test commit to verify that commit access works (added blank line) llvm-svn: 144748	2011-11-16 01:14:38 +00:00
Nadav Rotem	d8497a8354	Fix MSVC warnings by adding a cast. llvm-svn: 144721	2011-11-15 22:54:21 +00:00
Benjamin Kramer	4a8534a158	StringRefize and simplify. llvm-svn: 144675	2011-11-15 19:12:09 +00:00
Benjamin Kramer	a2f57dee6d	Remove all remaining uses of Value::getNameStr(). llvm-svn: 144648	2011-11-15 16:27:03 +00:00
Benjamin Kramer	6d85bbc486	Make headers standalone, move a virtual method out of line. llvm-svn: 144536	2011-11-14 17:22:45 +00:00
Daniel Dunbar	73d41b0f03	build: Attempt to rectify inconsistencies between CMake and LLVMBuild versions of explicit dependencies. - The hope is that we have a tool/test to verify these are accurate (and tight) soon. llvm-svn: 144444	2011-11-12 02:10:57 +00:00
Eli Friedman	a83fbaff5f	Make sure scalarrepl picks the correct alloca when it rewrites a bitcast. Fixes PR11353. llvm-svn: 144442	2011-11-12 02:07:50 +00:00
Daniel Dunbar	d6784c1269	LLVMBuild: Alphabetize required_libraries lists. llvm-svn: 144416	2011-11-11 22:59:23 +00:00
Eli Friedman	127d98ab35	Get rid of an optimization in SCCP which appears to have many issues. Specifically, it doesn't handle many cases involving undef correctly, and it is missing other checks which lead to it trying to re-mark a value marked as a constant with a different value. It also appears to trigger very rarely. Fixes PR11357. llvm-svn: 144352	2011-11-11 01:16:15 +00:00
Pete Cooper	93d3c835a0	Fixed bug in DeadStoreElimination commit r144239 Size of data being pointed to wasn't always being checked so some small writes were killing big writes Fixes <rdar://problem/10426753> llvm-svn: 144312	2011-11-10 20:22:08 +00:00
Pete Cooper	38700a1201	DeadStoreElimination can now trim the size of a store if the end of the store is dead. Currently checks alignment and killing stores on a power of 2 boundary as this is likely to trim the size of the earlier store without breaking large vector stores into scalar ones. Fixes <rdar://problem/10140300> llvm-svn: 144239	2011-11-09 23:07:35 +00:00
Pete Cooper	a85aa24d64	LICM pass now understands invariant load metadata. Nothing generates this yet so it will currently never get used in real tests llvm-svn: 144107	2011-11-08 19:30:00 +00:00
Pete Cooper	1d5d364e06	InstCombine now optimizes vector udiv by power of 2 to shifts Fixes r8429 llvm-svn: 144036	2011-11-07 23:04:49 +00:00
Bill Wendling	8b76d87b10	Make sure we don't insert instructions before a landingpad instruction. <rdar://problem/10405911> llvm-svn: 144000	2011-11-07 19:38:34 +00:00
Nick Lewycky	7ea3dd8ae5	Do simple cross-block DSE when we encounter a free statement. Fixes PR11240. llvm-svn: 143808	2011-11-05 10:48:42 +00:00
Daniel Dunbar	0193e03f99	Speculatively revert "DeadStoreElimination can now trim the size of a store if the end of it is dead.", which appears to break bootstrapping LLVM. llvm-svn: 143668	2011-11-04 00:48:26 +00:00
Daniel Dunbar	3760ebeebb	build: Add initial cut at LLVMBuild.txt files. llvm-svn: 143634	2011-11-03 18:53:17 +00:00
Pete Cooper	4902705b5f	DeadStoreElimination can now trim the size of a store if the end of it is dead. Only currently done if the later store is writing to a power of 2 address or has the same alignment as the earlier store as then its likely to not break up large stores into smaller ones Fixes <rdar://problem/10140300> llvm-svn: 143630	2011-11-03 18:01:56 +00:00
Andrew Trick	3c1e831108	Rewrite LinearFunctionTestReplace to handle pointer-type IVs. We've been hitting asserts in this code due to the many supported combinations of modes (iv-rewrite/no-iv-rewrite) and IV types. This second rewrite of the code attempts to deal with these cases systematically. llvm-svn: 143546	2011-11-02 17:19:57 +00:00
Chandler Carruth	06c2886572	Add parentheses to disambiguate the precedence of these operations and silence -Wparentheses. llvm-svn: 143534	2011-11-02 05:43:44 +00:00
Andrew Trick	c9baf3a7a1	Broaden an assert to handle enable-iv-rewrite=true following r143183. Narrowest possible fix for PR11279. llvm-svn: 143522	2011-11-02 00:02:45 +00:00
Eli Friedman	676558ae92	Make sure we use the right insertion point when instcombine replaces a PHI with another instruction. (Specifically, don't insert an arbitrary instruction before a PHI.) Fixes PR11275. llvm-svn: 143437	2011-11-01 04:49:29 +00:00
Devang Patel	85caca9190	Add utility to append a function to the list of global constructors. Patch by Kostya Serebryany. llvm-svn: 143405	2011-10-31 23:58:51 +00:00
Benjamin Kramer	d32c541fe4	SimplifyLibCalls: Use IRBuilder.CreateGlobalString when creating a string for printf->puts, which correctly sets the unnamed_addr bit on the resulting GlobalVariable. Fixes PR11264. llvm-svn: 143289	2011-10-29 19:43:31 +00:00
Andrew Trick	77532be5e0	LFTR should avoid a type mismatch with null pointer IVs. Fixes rdar://10359193 Indvar LinearFunctionTestReplace assertion llvm-svn: 143183	2011-10-28 03:45:11 +00:00
Eli Friedman	e6918ac01a	It is not safe to sink an alloca into a stacksave/stackrestore pair, so don't do that. <rdar://problem/10352360> llvm-svn: 143093	2011-10-27 01:33:51 +00:00
Nick Lewycky	4d47e224d7	A dead malloc, a free(NULL) and a free(undef) are all trivially dead instructions. This doesn't introduce any optimizations we weren't doing before (except potentially due to pass ordering issues), now passes will eliminate them sooner as part of their own cleanups. llvm-svn: 142787	2011-10-24 04:35:36 +00:00
Cameron Zwarich	2dd06afcf5	The element insertion code in scalar replacement doesn't handle incorrect element types, even though the element extraction code does. It is surprising that this bug has been here for so long. Fixes <rdar://problem/10318778>. llvm-svn: 142740	2011-10-23 07:02:10 +00:00
Nick Lewycky	25e5f6896b	A non-escaping malloc in the entry block is not unlike an alloca. Do dead-store elimination on them too. llvm-svn: 142735	2011-10-22 21:59:35 +00:00
Eli Friedman	5012ac7cc0	Remap blockaddress correctly when inlining a function. Fixes PR10162. llvm-svn: 142684	2011-10-21 20:45:19 +00:00
Eli Friedman	28f3ff0d3d	Minor simplification: use ShuffleVectorInst::getMaskValue instead of a more expensive helper. llvm-svn: 142672	2011-10-21 19:11:34 +00:00
Eli Friedman	fb0b9216e1	Extend instcombine's shufflevector simplification to handle more cases where the input and output vectors have different sizes. Patch by Xiaoyi Guo. llvm-svn: 142671	2011-10-21 19:06:29 +00:00
Eli Friedman	e8f8cf1f33	Refactor code from inlining and globalopt that checks whether a function definition is unused, and enhance it so it can tell that functions which are only used by a blockaddress are in fact dead. This probably doesn't happen much on most code, but the Linux kernel's _THIS_IP_ can trigger this issue with blockaddress. (GlobalDCE can also handle the given testcase, but we only run that at -O3.) Found while looking at PR11180. llvm-svn: 142572	2011-10-20 05:23:42 +00:00
Devang Patel	f46908afb4	Initialize ScalarEvolution dependency. Patch by Pranav Bhandarkar! llvm-svn: 142556	2011-10-19 23:56:07 +00:00
Dan Gohman	5e2d8538d7	Teach the ARC optimizer about the !clang.arc.copy_on_escape metadata tag on objc_retainBlock calls, which indicates that they may be optimized away. rdar://10211286. llvm-svn: 142298	2011-10-17 22:53:25 +00:00
Bill Wendling	2c5486d770	Add support for the Objective-C personality function to the instruction combining of the landingpad instruction. The ObjC personality function acts almost identically to the C++ personality function. In particular, it uses "null" as a "catch-all" value. llvm-svn: 142256	2011-10-17 21:20:24 +00:00
Dan Gohman	13624a6c83	Suppress partial retain+release elimination when there's a possibility that it will span multiple CFG diamonds/triangles which could have different controlling predicates. rdar://10282956 llvm-svn: 142222	2011-10-17 18:48:25 +00:00
Bill Wendling	584c5f9c62	Correct over-zealous removal of hack. Some code want to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	2ee7de36b8	Now that we have the ReturnsTwice function attribute, this method is obsolete. Check the attribute instead. <rdar://problem/8031714> llvm-svn: 142212	2011-10-17 18:22:52 +00:00
Michael J. Spencer	7ce30d63fd	Fix CMake build. llvm-svn: 142204	2011-10-17 17:50:39 +00:00
Devang Patel	8f9c569a13	svn mv Target/ARM/ARMGlobalMerge.cpp Transforms/Scalar/GlobalMerge.cpp There is no reason to have simple IR level pass in lib/Target. llvm-svn: 142200	2011-10-17 17:17:43 +00:00
Chandler Carruth	9c33ff8a8b	Add a routine to swap branch instruction operands, and update any profile metadata at the same time. Use it to preserve metadata attached to a branch when re-writing it in InstCombine. Add metadata to the canonicalize_branch InstCombine test, and check that it is transformed correctly. Reviewed by Nick Lewycky! llvm-svn: 142168	2011-10-17 01:11:57 +00:00
Chandler Carruth	aabf428f3c	Add a proper LLVM banner to this file. llvm-svn: 142162	2011-10-16 22:15:07 +00:00
Nick Lewycky	c8b7f776e6	When looking for dependencies on the src pointer, scan the src pointer. Scanning on the memcpy call will pull up other unrelated stuff. Fixes PR11142. llvm-svn: 142150	2011-10-16 20:13:32 +00:00
Duncan Sands	c4a6aaa716	Don't replace all dominated uses if there is only one use, since that use can't be dominated, saving one domtree lookup. llvm-svn: 142066	2011-10-15 11:13:42 +00:00
Andrew Trick	7cc89bde59	Fix indvars randomness by removing iteration over a map. I rewrote the algorithm a while back so it doesn't require map lookup, but neglected to change the data structure. This was caught by llvm-gcc self host, not because there's anything special about llvm-gcc, but because it is the only test for nondeterminism we currently have. Unit tests don't work well for everything; we should always try to have a nondeterminism stress test running. Fixes PR11133: llvm-gcc self host .o mismatch after enable-iv-rewrite=false llvm-svn: 142036	2011-10-15 01:38:14 +00:00
Eli Friedman	667bf19c57	Avoid undefined behavior in negation in LSR. Patch by Ahmed Charles. Someone more familiar with LSR should double-check that the extra cast is actually doing the right thing in the overflow cases; I'm not completely confident that's the case. llvm-svn: 141916	2011-10-13 23:48:33 +00:00
Eli Friedman	e702d68556	Enhance the memdep interface so that users can tell the difference between a dependency which cannot be calculated and a path reaching the entry point of the function. This patch introduces isNonFuncLocal, which replaces isUnknown in some cases. Patch by Xiaoyi Guo. llvm-svn: 141896	2011-10-13 22:14:57 +00:00
Eli Friedman	22c39edd2e	Fix a couple hash functions so that they do not depend on undefined shifts. Based on patch by Ahmed Charles. llvm-svn: 141820	2011-10-12 22:00:26 +00:00
Nick Lewycky	e7b3961fab	Add missing space. llvm-svn: 141750	2011-10-12 00:14:31 +00:00
Cameron Zwarich	fac176ac51	Fix PR11106 by correcting a typo that has been in the code for over a year. This would have never worked, since the element type of a vector type is never a vector type. Also fix the conditional to be more direct in checking whether EltTy is a vector type. llvm-svn: 141713	2011-10-11 21:26:40 +00:00
Cameron Zwarich	a34d748f83	Remove a lot of the fancy scalar replacement code for dealing with llvm-gcc's lowering of NEON code. It provides little-to-no benefit now and only introduces additional complexity. llvm-svn: 141646	2011-10-11 06:10:30 +00:00
Andrew Trick	23866a5e65	Add experimental -enable-lsr-phielim option. I'm not sure we will need it in the long run, but the option is currently useful for checking if the output of LSR is "clean". llvm-svn: 141634	2011-10-11 02:30:45 +00:00
Andrew Trick	d36852e6b1	Move replaceCongruentIVs into SCEVExpander and bias toward "expanded" IVs. Indvars previously chose randomly between congruent IVs. Now it will bias the decision toward IVs that SCEVExpander likes to create. This was not done to fix any problem, it's just a welcome side effect of factoring code. llvm-svn: 141633	2011-10-11 02:28:51 +00:00
Lang Hames	be4997db2f	Add a natural stack alignment field to TargetData, and prevent InstCombine from promoting allocas to preferred alignments that exceed the natural alignment. This avoids some potentially expensive dynamic stack realignments. The natural stack alignment is set in target data strings via the "S<size>" option. Size is in bits and must be a multiple of 8. The natural stack alignment defaults to "unspecified" (represented by a zero value), and the "unspecified" value does not prevent any alignment promotions. Target maintainers that care about avoiding promotions should explicitly add the "S<size>" option to their target data strings. llvm-svn: 141599	2011-10-10 23:42:08 +00:00
Andrew Trick	75743b069e	LSR should only reuse phis that match its formula. Fixes rdar://problem/5064068 llvm-svn: 141442	2011-10-07 23:46:21 +00:00
Duncan Sands	559ef2f491	Teach GVN to also propagate switch cases. For example, in this code switch (n) { case 27: do_something(x); ... } the call do_something(x) will be replaced with do_something(27). In gcc-as-one-big-file this results in the removal of about 500 lines of bitcode (about 0.02%), so has about 1/10 of the effect of propagating branch conditions. llvm-svn: 141360	2011-10-07 08:29:06 +00:00
Eli Friedman	4d63ca106a	Remove the old atomic intrinsics. autoupgrade functionality is included with this patch. llvm-svn: 141333	2011-10-06 23:20:49 +00:00
Eli Friedman	dd48bb30de	PR11061: Make simplifylibcalls fold strcmp("", x) correctly. While I'm here, fix the related issue with strncmp, add some actual tests for strcmp and strncmp, and start using StringRef::compare for constant folding instead of using strcmp/strncmp so that the optimized IR isn't dependent on the host's implementation of strcmp. llvm-svn: 141227	2011-10-05 22:27:16 +00:00
Jim Grosbach	a0e2c52a5c	Re-commit 141203, but much more conservative. Just pull the instruction name, but don't change the order of anything else. That keeps --debug happy and non-crashing, but doesn't change how the worklist gets built. llvm-svn: 141210	2011-10-05 20:53:43 +00:00
Jim Grosbach	254b9ed208	Revert 141203. InstCombine is looping on unit tests. llvm-svn: 141209	2011-10-05 20:44:29 +00:00
Jim Grosbach	a03dd9189f	Update InstCombine worklist after instruction transform is complete. When updating the worklist for InstCombine, the Add/AddUsersToWorklist functions may access the instruction(s) being added, for debug output for example. If the instructions aren't yet added to the basic block, this can result in a crash. Finish the instruction transformation before adjusting the worklist instead. rdar://10238555 llvm-svn: 141203	2011-10-05 20:05:00 +00:00
Duncan Sands	f7df28c1f5	GVN does simple propagation of conditions: when it sees a conditional branch "br i1 %x, label %if_true, label %if_false" then it replaces "%x" with "true" in places only reachable via the %if_true arm, and with "false" in places only reachable via the %if_false arm. Except that actually it doesn't: if value numbering shows that %y is equal to %x then, yes, %y will be turned into true/false in this way, but any occurrences of %x itself are not transformed. Fix this. What's more, it's often the case that %x is an equality comparison such as "%x = icmp eq %A, 0", in which case every occurrence of %A that is only reachable via the %if_true arm can be replaced with 0. Implement this and a few other variations on this theme. This reduces the number of lines of LLVM IR in "GCC as one big file" by 0.2%. It has a bigger impact on Ada code, typically reducing the number of lines of bitcode by around 0.4% by removing repeated compiler generated checks. Passes the LLVM nightly testsuite and the Ada ACATS testsuite. llvm-svn: 141177	2011-10-05 14:28:49 +00:00
Duncan Sands	348e8c285a	Generalize GVN's conditional propagation logic slightly: it's OK for the false/true destination to have multiple predecessors as long as the extra ones are dominated by the branch destination. llvm-svn: 141176	2011-10-05 14:17:01 +00:00
Andrew Trick	c60e2addd9	LSR should avoid redundant edge splitting. This handles the case in which LSR rewrites an IV user that is a phi and splits critical edges originating from a switch. Fixes <rdar://problem/6453893> LSR is not splitting edges "nicely" llvm-svn: 141059	2011-10-04 03:50:44 +00:00
Andrew Trick	94d203b172	whitespace llvm-svn: 141058	2011-10-04 03:34:49 +00:00
Nick Lewycky	7cd1bfb89d	Add a new icmp+select optz'n. Also shows off the load(cst) folding added in r140966. llvm-svn: 140969	2011-10-02 10:37:37 +00:00
Nick Lewycky	3282ef025d	Enhance a couple places where we were doing constant folding of instructions, but not load instructions. Noticed by inspection. llvm-svn: 140966	2011-10-02 09:12:55 +00:00
Andrew Trick	0489c5410d	Inlining and unrolling heuristics should be aware of free truncs. We want heuristics to be based on accurate data, but more importantly we don't want llvm to behave randomly. A benign trunc inserted by an upstream pass should not cause wild swings in optimization level. See PR11034. It's a general problem with threshold-based heuristics, but we can make it less bad. llvm-svn: 140919	2011-10-01 01:39:05 +00:00
Andrew Trick	a1161d94f5	whitespace llvm-svn: 140916	2011-10-01 01:27:56 +00:00
Jim Grosbach	96af96b83d	Don't modify constant in-place. llvm-svn: 140875	2011-09-30 19:58:46 +00:00
Jim Grosbach	d35eaaeb6e	float comparison to double 'zero' constant can just be a float 'zero.' InstCombine was incorrectly considering the conversion of the constant zero to be unsafe. We want to transform: define float @bar(float %x) nounwind readnone optsize ssp { %conv = fpext float %x to double %cmp = fcmp olt double %conv, 0.000000e+00 %conv1 = zext i1 %cmp to i32 %conv2 = sitofp i32 %conv1 to float ret float %conv2 } Into: define float @bar(float %x) nounwind readnone optsize ssp { %cmp = fcmp olt float %x, 0.000000e+00 ; <---- This %conv1 = zext i1 %cmp to i32 %conv2 = sitofp i32 %conv1 to float ret float %conv2 } rdar://10215914 llvm-svn: 140869	2011-09-30 18:45:50 +00:00
Jim Grosbach	651c847dc5	Tidy up. Trailing whitespace. llvm-svn: 140865	2011-09-30 18:09:53 +00:00
Duncan Sands	b4c8b2d9fa	Inlining often produces landingpad instructions with repeated catch or repeated filter clauses. Teach instcombine a bunch of tricks for simplifying landingpad clauses. Currently the code only recognizes the GNU C++ and Ada personality functions, but that doesn't stop it doing a bunch of "generic" transforms which are hopefully fine for any real-world personality function. If these "generic" transforms turn out not to be generic, they can always be conditioned on the personality function. Probably someone should add the ObjC++ personality function. I didn't as I don't know anything about it. llvm-svn: 140852	2011-09-30 13:12:16 +00:00
Nick Lewycky	fc476a3d3f	Fold two identical set lookups into one. No functionality change. llvm-svn: 140821	2011-09-29 23:40:12 +00:00
Dan Gohman	7dca165e93	When eliminating unnecessary retain+autorelease on return values, handle the case where the retain is in a different basic block. rdar://10210274. llvm-svn: 140815	2011-09-29 22:27:34 +00:00
Dan Gohman	2df048a94b	Don't eliminate objc_retainBlock calls on stack objects if the objc_retainBlock call is potentially responsible for copying the block to the heap to extend its lifetime. rdar://10209613. llvm-svn: 140814	2011-09-29 22:25:23 +00:00
Eli Friedman	ac33381aa1	Clean up uses of switch instructions so they are not dependent on the operand ordering. Patch by Stepan Dyatkovskiy. llvm-svn: 140803	2011-09-29 20:21:17 +00:00
Andrew Trick	fa2c108a22	typo + pasto llvm-svn: 140769	2011-09-29 01:53:08 +00:00
Andrew Trick	e1d5ae73ac	LSR: rewrite inner loops only. Rewriting the entire loop nest now requires -enable-lsr-nested. See PR11035 for some performance data. A few unit tests specifically test nested LSR, and are now under a flag. llvm-svn: 140762	2011-09-29 01:33:38 +00:00

8810 Commits