6075 Commits

Chris Lattner
0e03e40a76 Teach inst combine to merge GEPs through PHIs. This is really
important because it is sinking the loads using the GEPs, but
not the GEPs themselves.  This triggers 647 times on 403.gcc
and makes the .s file much much nicer.  For example before:

        je      LBB1_87 ## bb78
LBB1_62:        ## bb77
        leal    84(%esi), %eax
LBB1_63:        ## bb79
        movl    (%eax), %eax
...
LBB1_87:        ## bb78
        movl    $0, 4(%esp)
        movl    %esi, (%esp)
        call    L_make_decl_rtl$stub
        jmp     LBB1_62 ## bb77


after:

        jne     LBB1_63 ## bb79
LBB1_62:        ## bb78
        movl    $0, 4(%esp)
        movl    %esi, (%esp)
        call    L_make_decl_rtl$stub
LBB1_63:        ## bb79
        movl    84(%esi), %eax

The input code was the following (the GEPs are now merged and
the PHI eliminated by instcombine):

        br i1 %tmp233, label %bb78, label %bb77
bb77:           
        %tmp234 = getelementptr %struct.tree_node* %t_addr.3, i32 0, i32 0, i32 22              
        br label %bb79
bb78:           
        call void @make_decl_rtl(%struct.tree_node* %t_addr.3, i8* null) nounwind
        %tmp235 = getelementptr %struct.tree_node* %t_addr.3, i32 0, i32 0, i32 22              
        br label %bb79
bb79:           
        %iftmp.12.0.in = phi %struct.rtx_def** [ %tmp235, %bb78 ], [ %tmp234, %bb77 ]           
        %iftmp.12.0 = load %struct.rtx_def** %iftmp.12.0.in             

llvm-svn: 60322
2008-12-01 02:34:36 +00:00
Chris Lattner
01150dce74 testcase for my previous commit.
llvm-svn: 60315
2008-12-01 01:42:03 +00:00
Bill Wendling
23684a026c Implement ((A|B)&1)|(B&-2) -> (A&1) | B transformation. This also takes care of
permutations of this pattern.
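
For illustration only (a hypothetical check, not part of the commit), the identity can be verified exhaustively over 8-bit values; -2 is written as ~1u so the arithmetic stays unsigned:

    #include <cassert>
    #include <cstdint>

    int main() {
      // ((A|B)&1) | (B&-2)  ==  (A&1) | B, checked over all 8-bit pairs.
      for (uint32_t a = 0; a < 256; ++a)
        for (uint32_t b = 0; b < 256; ++b) {
          uint32_t lhs = ((a | b) & 1u) | (b & ~1u); // ~1u is -2 in unsigned form
          uint32_t rhs = (a & 1u) | b;
          assert(lhs == rhs);
        }
      return 0;
    }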

llvm-svn: 60312
2008-12-01 01:07:11 +00:00
Bill Wendling
66a7442059 Add instruction combining for ((A&~B)|(~A&B)) -> A^B and all permutations.
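
As a small illustrative check (not part of the commit), the identity holds bit by bit and can be verified exhaustively over 8-bit values:

    #include <cassert>
    #include <cstdint>

    int main() {
      // (A & ~B) | (~A & B)  ==  A ^ B, checked over all 8-bit pairs.
      for (uint32_t a = 0; a < 256; ++a)
        for (uint32_t b = 0; b < 256; ++b)
          assert((((a & ~b) | (~a & b)) & 0xFFu) == ((a ^ b) & 0xFFu));
      return 0;
    }
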
llvm-svn: 60291
2008-11-30 13:52:49 +00:00
Bill Wendling
3e27ac16a6 Implement (A&((~A)|B)) -> A&B transformation in the instruction combiner. This
takes care of all permutations of this pattern.
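
A quick illustrative check of the identity (not part of the commit), again exhaustive over 8-bit values; it follows from distributing the AND, since A & ~A is 0:

    #include <cassert>
    #include <cstdint>

    int main() {
      // A & (~A | B)  ==  A & B, checked over all 8-bit pairs.
      for (uint32_t a = 0; a < 256; ++a)
        for (uint32_t b = 0; b < 256; ++b)
          assert((a & (~a | b)) == (a & b));
      return 0;
    }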

llvm-svn: 60290
2008-11-30 13:08:13 +00:00
Bill Wendling
97ad688c1b getSExtValue() doesn't work for ConstantInts with bitwidth > 64 bits. Use APInt
calls throughout instead.

This fixes PR3144.

llvm-svn: 60288
2008-11-30 12:38:24 +00:00
Eli Friedman
2bc3921ce2 Optimize memmove and memset into the LLVM builtins. Note that these
only show up in code from front-ends besides llvm-gcc, like clang.

llvm-svn: 60287
2008-11-30 08:32:11 +00:00
Eli Friedman
ccdfdbfc99 Followup to r60283: optimize arbitrary width signed divisions as well
as unsigned divisions.  Same caveats as before.

llvm-svn: 60284
2008-11-30 06:35:39 +00:00
Eli Friedman
d7a261120f Fix for PR2164: allow transforming arbitrary-width unsigned divides into
multiplies.

Some more cleverness would be nice, though. It would be nice if we 
could do this transformation on illegal types.  Also, we would 
prefer a narrower constant when possible so that we can use a narrower
multiply, which can be cheaper.
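
A rough sketch of the idea (illustrative constant for one divisor only, not taken from the patch): an unsigned divide by a constant can be rewritten as a widening multiply by a precomputed reciprocal followed by a shift. For a 32-bit divide by 5, multiplying by ceil(2^34 / 5) = 0xCCCCCCCD and shifting right by 34 gives the exact quotient:

    #include <cassert>
    #include <cstdint>

    // n / 5 as a widening multiply plus shift.  The pass presumably computes
    // the equivalent "magic number" per divisor and bit width.
    static uint32_t div5(uint32_t n) {
      return (uint32_t)(((uint64_t)n * 0xCCCCCCCDull) >> 34);
    }

    int main() {
      const uint32_t samples[] = {0u, 1u, 4u, 5u, 6u, 12345u, 0x7FFFFFFFu,
                                  0x80000000u, 0xFFFFFFFEu, 0xFFFFFFFFu};
      for (uint32_t n : samples)
        assert(div5(n) == n / 5);
      for (uint32_t n = 0; n < 1000000u; ++n)  // spot-check a range as well
        assert(div5(n) == n / 5);
      return 0;
    }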

llvm-svn: 60283
2008-11-30 06:02:26 +00:00
Eli Friedman
0ef5e1dc82 APIntify a test which is potentially unsafe otherwise, and fix the
nearby FIXME.

I'm not sure what the right way to fix the Cell test was; if the 
approach I used isn't okay, please let me know.

llvm-svn: 60277
2008-11-30 04:59:26 +00:00
Bill Wendling
5020e916ef Strengthen check for div inst-combining.
llvm-svn: 60276
2008-11-30 04:33:53 +00:00
Bill Wendling
ac11f7d37e Instcombine was illegally transforming -X/C into X/-C when either X or C
overflowed on negation. This commit checks to make sure that neither C nor X
overflows. This requires that the RHS of X (a subtract instruction) be a
constant integer.
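
A worked example of the hazard, using 8-bit values for brevity (hypothetical numbers, not from the commit): when X is the most negative value, negating it wraps, so the original and transformed expressions disagree:

    #include <cassert>

    int main() {
      // 8-bit two's complement, X = -128 (the most negative value), C = 3.
      // The original IR computes (0 - X) sdiv C, and the subtraction wraps:
      // 0 - (-128) == -128 in 8 bits, so the original result is -128 / 3.
      int original = -128 / 3;      // -42
      // The (illegal) transform computes X sdiv (-C) instead:
      int transformed = -128 / -3;  //  42
      assert(original == -42 && transformed == 42);
      assert(original != transformed);  // the transform changed the result
      return 0;
    }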

llvm-svn: 60275
2008-11-30 03:42:12 +00:00
Chris Lattner
203a3299e9 don't require GVN to work on dead values, just make the
test return the loaded value.

llvm-svn: 60252
2008-11-29 21:21:48 +00:00
Chris Lattner
f3e49f038c Fix a thinko that manifested as a crash on clamav last night.
llvm-svn: 60251
2008-11-29 20:29:04 +00:00
Chris Lattner
494758e720 Fix PR3141 by ensuring that MemoryDependenceAnalysis::removeInstruction
properly updates the reverse dependency map when it installs updated 
dependencies for instructions that depend on the removed instruction.
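
A minimal sketch of the invariant involved (generic code, not the actual MemoryDependenceAnalysis data structures): a forward dependency cache and a reverse cache have to be updated as a pair, including when a removed instruction's dependents are repointed at a new dependency:

    #include <cassert>
    #include <map>
    #include <set>
    #include <string>

    int main() {
      // Forward cache: instruction -> what it depends on.
      std::map<std::string, std::string> DependsOn;
      // Reverse cache: instruction -> everything that depends on it.
      std::map<std::string, std::set<std::string>> RevDep;

      auto setDep = [&](const std::string &I, const std::string &D) {
        auto It = DependsOn.find(I);
        if (It != DependsOn.end())
          RevDep[It->second].erase(I);  // drop the stale reverse entry...
        DependsOn[I] = D;
        RevDep[D].insert(I);            // ...and record the new one
      };

      // Two loads depend on the same store.
      setDep("load2", "store1");
      setDep("load3", "store1");

      // Remove the store: its dependents get a recomputed dependency (here,
      // simply "entry").  Updating only the forward cache at this point is
      // the kind of inconsistency the fix addresses; routing the update
      // through setDep keeps both caches in sync.
      std::set<std::string> Users = RevDep["store1"];  // copy: setDep edits RevDep
      for (const std::string &U : Users)
        setDep(U, "entry");
      RevDep.erase("store1");

      assert(RevDep["entry"].count("load2") == 1);
      assert(RevDep["entry"].count("load3") == 1);
      assert(RevDep.count("store1") == 0);
      return 0;
    }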

llvm-svn: 60222
2008-11-28 22:51:08 +00:00
Chris Lattner
a854ab3760 don't call MergeBasicBlockIntoOnlyPred on a block whose only
predecessor is itself.  This doesn't make sense, and this is
a dead infinite loop anyway.

llvm-svn: 60210
2008-11-28 19:54:49 +00:00
Nick Lewycky
40db216722 Chris prefers icmp/select over udiv!
llvm-svn: 60187
2008-11-27 22:41:10 +00:00
Nick Lewycky
882443585d Add a couple of missed optimizations on integer vectors. Multiply and divide
by 1, as well as multiply by -1.

llvm-svn: 60182
2008-11-27 20:21:08 +00:00
Chris Lattner
73b251b3bf Fix PR3138: if we merge the entry block into another block, make sure to
move the other block back up into the entry position!

llvm-svn: 60179
2008-11-27 19:25:19 +00:00
Bill Wendling
7742719284 XFAIL test due to a reverted patch.
llvm-svn: 60161
2008-11-27 07:34:10 +00:00
Chris Lattner
532458b89f Make jump threading substantially more powerful, in the following ways:
1. Make it fold blocks separated by an unconditional branch.  This enables
   jump threading to see a broader scope.
2. Make jump threading able to eliminate locally redundant loads when they
   feed the branch condition of a block.  This frequently occurs due to
   reg2mem running.
3. Make jump threading able to eliminate *partially redundant* loads when
   they feed the branch condition of a block.  This is common in code with
   lots of loads and stores like C++ code and 255.vortex.
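
As a C-level sketch of the third case (hypothetical source, not from the commit): along the path through the store, the later load and compare are fully known, so jump threading can route that path straight to its destination; along the other path the load remains.

    #include <cassert>

    // A partially redundant load feeding a branch condition.
    static int f(int *p, bool c) {
      if (c)
        *p = 0;       // on this path the later "*p == 0" test is known true
      // ... other work that does not touch *p ...
      if (*p == 0)    // the load is redundant along the path through the store
        return 1;     // jump threading can send that path straight here
      return 2;
    }

    int main() {
      int a = 7, b = 7;
      assert(f(&a, true) == 1);   // threaded path: the store makes the test true
      assert(f(&b, false) == 2);  // other path: the load is still needed
      return 0;
    }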

This implements thread-loads.ll and rdar://6402033.

Per the FIXMEs, several pieces of this should be moved into Transforms/Utils.

llvm-svn: 60148
2008-11-27 05:07:53 +00:00
Evan Cheng
ee5e950c25 Avoid inserting noops in the middle of a loop.
llvm-svn: 60141
2008-11-27 01:16:00 +00:00
Evan Cheng
f18016728c On x86, favor folding a short immediate into some arithmetic operations (e.g. add, and, xor, etc.) because materializing an immediate in a register is expensive in terms of code size.
e.g.
movl 4(%esp), %eax
addl $4, %eax

is 2 bytes shorter than

movl $4, %eax
addl 4(%esp), %eax

llvm-svn: 60139
2008-11-27 00:49:46 +00:00
Evan Cheng
4da44412cf Add -march=x86.
llvm-svn: 60135
2008-11-27 00:37:06 +00:00
Bill Wendling
3376836463 Add x86-specific test for add-with-overflow intrinsics.
llvm-svn: 60125
2008-11-26 22:42:19 +00:00
Chris Lattner
d01522d33a Turn on my codegen prepare heuristic by default. It doesn't affect
performance in most cases on the Grawp tester, but does speed some 
things up (like shootout/hash by 15%).  This also doesn't impact 
compile time in a noticeable way on the Grawp tester.

It also, of course, gets the testcase it was designed for right :)

llvm-svn: 60120
2008-11-26 22:16:44 +00:00
Duncan Sands
f64dd4b09c Check that running the DAG combiner between type
and operation legalization does something useful.

llvm-svn: 60108
2008-11-26 16:44:30 +00:00
Bill Wendling
f069b62cd7 Add test for rdar://6394879.
llvm-svn: 60079
2008-11-26 02:21:12 +00:00
Chris Lattner
61c2a0fc8a This adds in some code (currently disabled unless you pass
-enable-smarter-addr-folding to llc) that gives CGP a better
cost model for when to sink computations into addressing modes.
The basic observation is that sinking increases register 
pressure when part of the addr computation has to be available
for other reasons, such as having a use that is a non-memory
operation.  In cases where it works, it can substantially reduce
register pressure.

This code is currently an overall win on 403.gcc and 255.vortex
(the two things I've been looking at), but there are several 
things I want to do before enabling it by default:

1. This isn't doing any caching of results, so it is much slower 
   than it could be.  It currently slows down release-asserts llc 
   by 1.7% on 176.gcc: 27.12s -> 27.60s.
2. This doesn't think about inline asm memory operands yet.
3. The cost model botches the case when the needed value is live
   across the computation for other reasons.

I'll continue poking at this, and eventually turn it on as llcbeta.

llvm-svn: 60074
2008-11-26 02:00:14 +00:00
Chris Lattner
8209f83091 Teach CodeGenPrepare to look through Bitcast instructions when attempting to
optimize addressing modes.  This allows us to optimize things like isel-sink2.ll
into:

	movl	4(%esp), %eax
	cmpb	$0, 4(%eax)
	jne	LBB1_2	## F
LBB1_1:	## TB
	movl	$4, %eax
	ret
LBB1_2:	## F
	movzbl	7(%eax), %eax
	ret

instead of:

_test:
	movl	4(%esp), %eax
	cmpb	$0, 4(%eax)
	leal	4(%eax), %eax
	jne	LBB1_2	## F
LBB1_1:	## TB
	movl	$4, %eax
	ret
LBB1_2:	## F
	movzbl	3(%eax), %eax
	ret

This shrinks (e.g.) 403.gcc from 1133510 to 1128345 lines of .s.

Note that the 2008-10-16-SpillerBug.ll testcase is dubious at best; I doubt
it is really testing what it thinks it is.

llvm-svn: 60068
2008-11-26 00:26:16 +00:00
Chris Lattner
017dde7e2b fix an over-reduced test.
llvm-svn: 60067
2008-11-26 00:12:08 +00:00
Chris Lattner
72db9f8bdd this doesn't need EH
llvm-svn: 60066
2008-11-26 00:03:26 +00:00
Mikhail Glushenkov
89bfeb825b Since the old llvmc was removed, rename llvmc2 to llvmc.
llvm-svn: 60048
2008-11-25 21:38:12 +00:00
Evan Cheng
c11d7e324f convertToSignExtendedInteger should return opInvalidOp instead of asserting if the semantics of the float do not allow arithmetic.
llvm-svn: 60042
2008-11-25 19:00:29 +00:00
Scott Michel
59013b297c CellSPU:
(a) Remove conditionally removed code in SelectXAddr. Basically, hope for the
    best that the A-form and D-form address predicates catch everything before
    the code decides to emit an X-form address.
(b) Expand vector store test cases to include the usual suspects.

llvm-svn: 60034
2008-11-25 17:29:43 +00:00
Scott Michel
bb575152bc CellSPU: test should use shlqby, not shlqbyi
llvm-svn: 60001
2008-11-25 01:30:37 +00:00
Bill Wendling
c9f3eec3f9 XFAIL this test. A recent CellSPU check-in broke it.
llvm-svn: 60000
2008-11-25 00:56:34 +00:00
Dan Gohman
92cedc8a95 Initial support for anti-dependence breaking. Currently this code does not
introduce any new spilling; it just uses unused registers.

Refactor the SUnit topological sort code out of the RRList scheduler and
make use of it to help with the post-pass scheduler.

llvm-svn: 59999
2008-11-25 00:52:40 +00:00
Bill Wendling
cb92038dbd Testcase for constant CFStrings.
llvm-svn: 59992
2008-11-24 23:28:09 +00:00
Chris Lattner
a07ad05059 reenable test
llvm-svn: 59986
2008-11-24 21:27:20 +00:00
Bill Wendling
36ee715e71 Temporarily XFAIL this test. r59976 and r59972 broke it.
llvm-svn: 59981
2008-11-24 20:43:33 +00:00
Chris Lattner
e5bf93e61f Fix PR3113: If we have a dead cyclic PHI, replace the whole thing
with an undef.

llvm-svn: 59972
2008-11-24 19:25:36 +00:00
Scott Michel
259a64c097 CellSPU:
(a) Slight rethink on i64 zero/sign/any extend code - use a shuffle to
    directly zero-extend i32 to i64, but use rotates and shifts for
    sign extension. Also ensure unified register consistency.
(b) Add new test harness for i64 operations: i64ops.ll

llvm-svn: 59970
2008-11-24 18:20:46 +00:00
Scott Michel
c3965308a4 CellSPU:
(a) Improve the extract element code: there's no need to do gymnastics with
    rotates into the preferred slot if a shuffle will do the same thing.
(b) Rename a couple of SPUISD pseudo-instructions for readability and better
    semantic correspondence.
(c) Fix i64 sign/any/zero extension lowering.

llvm-svn: 59965
2008-11-24 17:11:17 +00:00
Bill Wendling
855ac77084 Test add-with-overflow with fast ISel.
llvm-svn: 59945
2008-11-24 05:23:38 +00:00
Nick Lewycky
47fa9bd187 Extend the 'noalias' attribute to function return values. This is intended to
indicate functions that allocate, such as operator new, or list::insert. The
actual definition is slightly less strict (for now).

No changes to the bitcode reader/writer, asm printer or verifier were needed.

llvm-svn: 59934
2008-11-24 03:41:24 +00:00
Bill Wendling
4bb8a7a498 Add support for llvm.uadd.with.overflow.
llvm-svn: 59926
2008-11-24 01:38:29 +00:00
Scott Michel
50e49b28f0 CellSPU: Fix bug 3056. Variadic extract_element was not implemented (nor was it
ever conceived to occur).

llvm-svn: 59891
2008-11-22 23:50:42 +00:00
Nick Lewycky
2fbf26fe70 Optimize (x/y)*y into x-(x%y) in general. Div and rem are about the same, and
a subtract is cheaper than a multiply. This generalizes an existing transform.
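
A small illustrative check of the equivalence (not part of the commit), which follows from the truncating-division identity x == (x/y)*y + x%y:

    #include <cassert>

    int main() {
      // (x / y) * y  ==  x - (x % y) for truncating integer division.
      for (int x = -1000; x <= 1000; ++x)
        for (int y = -25; y <= 25; ++y) {
          if (y == 0)
            continue;  // skip division by zero
          assert((x / y) * y == x - (x % y));
        }
      return 0;
    }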

llvm-svn: 59800
2008-11-21 07:33:58 +00:00
Scott Michel
314d705baf CellSPU:
(a) Fix bugs 3052, 3057
(b) Incorporate Duncan's suggestions re: i1 promotion
(c) Indentation updates.

llvm-svn: 59790
2008-11-21 02:56:16 +00:00