llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-27 14:07:32 +00:00

Author	SHA1	Message	Date
Chris Lattner	b35e482ca3	add a comment, make save/restore logic more obvious. llvm-svn: 60076	2008-11-26 02:11:11 +00:00
Chris Lattner	61c2a0fc8a	This adds in some code (currently disabled unless you pass -enable-smarter-addr-folding to llc) that gives CGP a better cost model for when to sink computations into addressing modes. The basic observation is that sinking increases register pressure when part of the addr computation has to be available for other reasons, such as having a use that is a non-memory operation. In cases where it works, it can substantially reduce register pressure. This code is currently an overall win on 403.gcc and 255.vortex (the two things I've been looking at), but there are several things I want to do before enabling it by default: 1. This isn't doing any caching of results, so it is much slower than it could be. It currently slows down release-asserts llc by 1.7% on 176.gcc: 27.12s -> 27.60s. 2. This doesn't think about inline asm memory operands yet. 3. The cost model botches the case when the needed value is live across the computation for other reasons. I'll continue poking at this, and eventually turn it on as llcbeta. llvm-svn: 60074	2008-11-26 02:00:14 +00:00
Evan Cheng	383a9a9264	Revert r60042. IndVarSimplify should check if APFloat is PPCDoubleDouble first before trying to convert it to an integer. llvm-svn: 60072	2008-11-26 01:11:57 +00:00
Chris Lattner	8209f83091	Teach CodeGenPrepare to look through Bitcast instructions when attempting to optimize addressing modes. This allows us to optimize things like isel-sink2.ll into: movl 4(%esp), %eax cmpb $0, 4(%eax) jne LBB1_2 ## F LBB1_1: ## TB movl $4, %eax ret LBB1_2: ## F movzbl 7(%eax), %eax ret instead of: _test: movl 4(%esp), %eax cmpb $0, 4(%eax) leal 4(%eax), %eax jne LBB1_2 ## F LBB1_1: ## TB movl $4, %eax ret LBB1_2: ## F movzbl 3(%eax), %eax ret This shrinks (e.g.) 403.gcc from 1133510 to 1128345 lines of .s. Note that the 2008-10-16-SpillerBug.ll testcase is dubious at best, I doubt it is really testing what it thinks it is. llvm-svn: 60068	2008-11-26 00:26:16 +00:00
Chris Lattner	242f05e503	Teach MatchScaledValue to handle Scales by 1 with MatchAddr (which can recursively match things) and scales by 0 by ignoring them. This triggers once in 403.gcc, saving 1 (!!!!) instruction in the whole huge app. llvm-svn: 60013	2008-11-25 07:25:26 +00:00
Chris Lattner	5e0361dd74	significantly refactor all the addressing mode matching logic into a new AddressingModeMatcher class. This makes it easier to reason about and reduces passing around of stuff, but has no functionality change. llvm-svn: 60012	2008-11-25 07:09:13 +00:00
Chris Lattner	9ed8c8eec9	refactor all the constantexpr/instruction handling code out into a new FindMaximalLegalAddressingModeForOperation helper method. llvm-svn: 60011	2008-11-25 05:15:49 +00:00
Chris Lattner	587e7c0202	another minor tweak llvm-svn: 60010	2008-11-25 04:47:41 +00:00
Chris Lattner	85da435a15	minor cleanups no functionality change. llvm-svn: 60009	2008-11-25 04:42:10 +00:00
Chris Lattner	2f8129238b	rearrange and tidy some code, no functionality change. llvm-svn: 59990	2008-11-24 22:44:16 +00:00
Chris Lattner	9b5c9d8b0b	minor cleanups to debug code, no functionality change. llvm-svn: 59989	2008-11-24 22:40:05 +00:00
Chris Lattner	9f34dba0e4	reenable the right part of the code. llvm-svn: 59985	2008-11-24 21:26:21 +00:00
Chris Lattner	561b0adda9	revert an accidental commit, this fixes the regression on test/CodeGen/X86/isel-sink.ll llvm-svn: 59976	2008-11-24 19:40:34 +00:00
Chris Lattner	e5bf93e61f	Fix 3113: If we have a dead cyclic PHI, replace the whole thing with an undef. llvm-svn: 59972	2008-11-24 19:25:36 +00:00
Devang Patel	ce00a4dac7	Fix build failure. llvm-svn: 59844	2008-11-21 21:00:20 +00:00
Devang Patel	a2ccbea45a	Silence unused variable warnings. llvm-svn: 59841	2008-11-21 20:00:59 +00:00
Chris Lattner	08bdf9dfab	reapply Sanjiv's patch to genericize memcpy/memset/memmove to take an arbitrary integer width for the count. llvm-svn: 59823	2008-11-21 16:42:48 +00:00
Bill Wendling	4c5afef830	Revert r59802. It was breaking the build of llvm-gcc: g++ -m32 -c -g -DIN_GCC -W -Wall -Wwrite-strings -Wmissing-format-attribute -fno-common -mdynamic-no-pic -DHAVE_CONFIG_H -Wno-unused -DTARGET_NAME=\"i386-apple-darwin9.5.0\" -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include ../../llvm-gcc.src/gcc/llvm-types.cpp -o llvm-types.o ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemCpy(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemMove(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemSet(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i64' is not a member of 'llvm::Intrinsic' make[3]: [llvm-convert.o] Error 1 make[3]: * Waiting for unfinished jobs.... rm fsf-funding.pod gcov.pod gfdl.pod cpp.pod gpl.pod gcc.pod make[2]: * [all-stage1-gcc] Error 2 make[1]: * [stage1-bubble] Error 2 make: *** [all] Error 2 llvm-svn: 59809	2008-11-21 09:09:41 +00:00
Sanjiv Gupta	89a7e67578	Make mem[cpy,move,set] intrinsics overloaded. llvm-svn: 59802	2008-11-21 07:49:09 +00:00
Nick Lewycky	2fbf26fe70	Optimize (x/y)*y into x-(x%y) in general. Div and rem are about the same, and a subtract is cheaper than a multiply. This generalizes an existing transform. llvm-svn: 59800	2008-11-21 07:33:58 +00:00
Devang Patel	b6d277da6c	Fix unused variable warnings. llvm-svn: 59778	2008-11-21 01:52:59 +00:00
Bill Wendling	4020b2b907	Fix error where it wasn't getting the correct caller function. llvm-svn: 59758	2008-11-21 00:09:21 +00:00
Bill Wendling	f85b54db6c	If the function being inlined has a higher stack protection level than the inlining function, then increase the stack protection level on the inlining function. llvm-svn: 59757	2008-11-21 00:06:32 +00:00
Devang Patel	0c43c5a17f	Don't forget arguments! llvm-svn: 59745	2008-11-20 19:50:17 +00:00
Devang Patel	20dec363b1	Do not forget llvm.dbg.declare's first argument while removing debugging information. llvm-svn: 59688	2008-11-20 01:20:42 +00:00
Oscar Fuentes	212b29b9d9	CMake: Removed source file. llvm-svn: 59662	2008-11-19 19:32:19 +00:00
Devang Patel	e5a07d989e	Do not use separate utility to walk all instructions and remove dead dbg intrinsics. Let instcombiner do this job. llvm-svn: 59659	2008-11-19 19:01:37 +00:00
Devang Patel	79a342d7d3	Let instcombiner remove redundant dbg intrinsics. llvm-svn: 59658	2008-11-19 18:59:41 +00:00
Devang Patel	cd2e68c069	If there are two consecutive llvm.dbg.stoppoint calls then it is likely that the optimizer deleted code in between these two intrinsics. Keep only the last llvm.dbg.stoppoint in this case. llvm-svn: 59657	2008-11-19 18:56:50 +00:00
Devang Patel	dd2491943e	Remove unused variables. llvm-svn: 59570	2008-11-19 00:22:02 +00:00
Devang Patel	b49b40c2fb	Fix typo. llvm-svn: 59569	2008-11-19 00:19:18 +00:00
Devang Patel	233728e8c3	Add new helper pass that strips all symbol names except debugging information. This pass makes it easier to test wheter debugging info. influences optimization passes or not. llvm-svn: 59552	2008-11-18 21:34:39 +00:00
Devang Patel	0728a7d7e5	Remove even more llvm.dbg variables. Remove all dead globals from llvm.metadata. Ignore linkonce linkage for selected llvm.dbg values. llvm-svn: 59547	2008-11-18 21:13:41 +00:00
Devang Patel	d8981479c2	Initialize MallocFunc and FreeFunc properly. llvm-svn: 59538	2008-11-18 18:43:07 +00:00
Bill Wendling	5ba12c88c1	Cast to remove warning about comparing signed and unsigned. llvm-svn: 59518	2008-11-18 10:57:27 +00:00
Devang Patel	2a0aa9fa51	Give SIToFPInst preference over UIToFPInst because it is faster on platforms that are widely used. llvm-svn: 59476	2008-11-18 00:40:02 +00:00
Devang Patel	705f88d5b5	While handling floating point IVs lift restrictions on initial value and increment value. llvm-svn: 59471	2008-11-17 23:27:13 +00:00
Devang Patel	2d0c234523	Handle floating point ivs during doInitialization(). llvm-svn: 59466	2008-11-17 21:32:02 +00:00
Devang Patel	6c2dd85bf3	Let AnalyzeAlloca() remove debug intrinsics. llvm-svn: 59454	2008-11-17 18:37:53 +00:00
Torok Edwin	f67e9bb406	If SI->size() is 0, we are not allowed to dereference ->begin(). This fixed PR3078. llvm-svn: 59416	2008-11-16 17:21:25 +00:00
Chris Lattner	ba8e67c635	eliminate some std::set's. llvm-svn: 59409	2008-11-16 07:17:51 +00:00
Chris Lattner	d85de82f04	simplify loop llvm-svn: 59406	2008-11-16 06:35:18 +00:00
Chris Lattner	652917424d	simplify a bunch more instcombines to use m_Specific etc. llvm-svn: 59403	2008-11-16 05:38:51 +00:00
Chris Lattner	c487057a1e	factor the code for simplifying (icmp)\|(icmp) into its own function. llvm-svn: 59402	2008-11-16 05:20:07 +00:00
Chris Lattner	6b5b2c3606	do some computation with apints instead of ConstantInts. llvm-svn: 59401	2008-11-16 05:14:43 +00:00
Chris Lattner	f47d16afe3	merge a check into a place where it is simpler. llvm-svn: 59400	2008-11-16 05:10:52 +00:00
Chris Lattner	3b058783bc	factor a whole bunch of code out into a helper function. llvm-svn: 59398	2008-11-16 05:06:21 +00:00
Chris Lattner	f9dd858359	simplify the conditions on two gigantic if's, decreasing indentation a bit. Next step is to factor out into their own helper functions. llvm-svn: 59397	2008-11-16 04:55:20 +00:00
Chris Lattner	762c52d684	simplify some instcombine matches by using m_Specific llvm-svn: 59395	2008-11-16 04:46:19 +00:00
Chris Lattner	a5aee38775	Use new m_SelectCst template to eliminate macros. llvm-svn: 59392	2008-11-16 04:33:38 +00:00
Chris Lattner	cba75c1b7b	simplify code. llvm-svn: 59390	2008-11-16 04:26:55 +00:00
Chris Lattner	21f18c9760	Handle the case where there is no "not". It is possible it got folded into the select. llvm-svn: 59389	2008-11-16 04:25:26 +00:00
Chris Lattner	6afddeeed1	factor a bunch of copy/paste code out into a helper function. Eliminate the cases checking for cond?0:-1, since that is already handled by commutative checking. llvm-svn: 59388	2008-11-16 04:24:12 +00:00
Chris Lattner	9dd963a73a	rearrange some code, no functionality change. llvm-svn: 59381	2008-11-16 03:56:24 +00:00
Chris Lattner	0c0c68bab4	if we're going to use a macro, use it maximally. no functionality change. llvm-svn: 59380	2008-11-16 03:54:57 +00:00
Devang Patel	86c9f12e47	Refactor code. Strip debug information before stripping symbol names. llvm-svn: 59328	2008-11-14 22:49:37 +00:00
Devang Patel	2876af73db	Really remove all debug information. llvm-svn: 59208	2008-11-13 01:28:40 +00:00
Oscar Fuentes	444fd3cc2d	CMake: Remove removed source file. llvm-svn: 59098	2008-11-12 00:14:12 +00:00
Devang Patel	0a73799659	Remove llvm-svn: 59093	2008-11-11 23:58:15 +00:00
Devang Patel	0616706919	Undo previous check-in. llvm-svn: 59092	2008-11-11 23:57:33 +00:00
Oscar Fuentes	ac819d0394	CMake: Updated list of source files for lib/Transforms/Utils. llvm-svn: 59077	2008-11-11 19:51:36 +00:00
Devang Patel	c9fdfd2f0b	Add utility pass to remove dbg info. llvm-svn: 59068	2008-11-11 19:33:39 +00:00
Devang Patel	ef6a48b56f	Use actual function name in comments. llvm-svn: 59063	2008-11-11 19:16:41 +00:00
Cedric Venet	49a701ae17	Update CMakeLists.txt llvm-svn: 59039	2008-11-11 09:55:48 +00:00
Devang Patel	fbb29cdaee	Cleanup debug info. assocated with deleted instructions. llvm-svn: 59012	2008-11-11 00:54:10 +00:00
Devang Patel	ec4ae29ae4	Add utility routines to remove dead debug info. llvm-svn: 59011	2008-11-11 00:53:02 +00:00
Devang Patel	f0d6bd18d5	If the sign of exit condition and split condition does not match then do not split loop index. llvm-svn: 58995	2008-11-10 19:48:34 +00:00
Bill Wendling	b7d5ca543e	Third time's a charm. The previous patches didn't match correctly. Also, we need to make sure that the conditional is the same before doing the transformation. llvm-svn: 58978	2008-11-10 06:59:06 +00:00
Mon P Wang	911ee5bf8b	Added support for the following definition of shufflevector <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> llvm-svn: 58964	2008-11-10 04:46:22 +00:00
Bill Wendling	137550d34d	Correction for the last patch. Should match the conditional in the first part of the select match, not the select instruction itself. llvm-svn: 58947	2008-11-09 23:37:53 +00:00
Bill Wendling	3b91357ef0	The method of doing the matching with a 'select' instruction was wrong. The original code was matching like this: if (match(A, m_Not(m_Value(B)))) B was already matched as a 'select' instruction. However, this isn't matching what we think it's matching. It would match B as a 'Value', so basically anything would match to it. In this case, a Constant matched. B was replaced with a constant representation. And then the wrong value would be used in the SelectInst::Create statement, causing a crash. After thinking on this for a moment, and after Nick L. told me how the pattern matching stuff was supposed to work, the solution was to match NOT an m_Value, but an m_Select. llvm-svn: 58946	2008-11-09 23:17:42 +00:00
Nuno Lopes	6dfcc8cf45	fix leakage of ValueNumbering llvm-svn: 58933	2008-11-09 12:45:23 +00:00
Bill Wendling	436d4cce83	If the LHS of the FCMP is coming from a UIToFP instruction, then we don't want to generate signed ICMP instructions to replace the FCMP. This would violate the following: define i1 @test1(i32 %val) { %1 = uitofp i32 %val to double %2 = fcmp ole double %1, 0.000000e+00 ret i1 %2 } would be transformed into: define i1 @test1(i32 %val) { %1 = icmp slt i33 %val, 1 ret i1 %1 } which is obviously wrong. This patch modifes InstCombiner::FoldFCmp_IntToFP_Cst to handle when the LHS comes from UIToFP. llvm-svn: 58929	2008-11-09 04:26:50 +00:00
Daniel Dunbar	ce947e0039	Rework r58829, allowing removal of dbg info intrinsics during alloca promotion. - Eliminate uses after free and simplify tests. Devang: Please check that this is still doing what you intended. llvm-svn: 58887	2008-11-08 04:12:17 +00:00
Bill Wendling	a5eb92b137	BCUI + 1 doesn't work. Use next instead. llvm-svn: 58830	2008-11-07 01:59:41 +00:00
Devang Patel	0958cd6437	Handle (delete) dbg intrinsics while promoting alloca. llvm-svn: 58826	2008-11-07 01:30:07 +00:00
Mon P Wang	888f4e6fb0	Fixed scalarizing an extract subvector and prevent an infinite loop when simplify a vector. llvm-svn: 58820	2008-11-06 22:52:21 +00:00
Devang Patel	cea9dfa11f	InstructionNamer preserves everything. llvm-svn: 58787	2008-11-06 01:00:16 +00:00
Devang Patel	de9b95965a	Do now allow InlineAlways pass to remove dead functions. llvm-svn: 58744	2008-11-05 01:39:16 +00:00
Devang Patel	bc6ef4c191	Check Attribute::NoInline. llvm-svn: 58742	2008-11-05 01:37:05 +00:00
Oscar Fuentes	afb65a6c2a	CMake: updated list of source files. llvm-svn: 58736	2008-11-05 00:11:22 +00:00
Dan Gohman	c162a200ad	Add a new pass to simplify specific half_powr function calls. This is a specialized pass that it not likely to be generally useful. llvm-svn: 58732	2008-11-04 23:41:45 +00:00
Dale Johannesen	e51c5296a6	Allow SROA of vectors. Removing this caused a huge performance regression in something we care about. This may not be final fix. llvm-svn: 58718	2008-11-04 20:54:03 +00:00
Devang Patel	40cfcf7ce2	Fix unused variable warnings. llvm-svn: 58651	2008-11-03 23:14:09 +00:00
Devang Patel	c959b4dbd3	Ignore conditions that are outside the loop. llvm-svn: 58631	2008-11-03 19:38:07 +00:00
Andrew Lenharth	97343f0e8b	add a period at the end of the comment, ignoring the fact that the comment would be hard pressed to be considered a sentence, but if it makes Bill happy... llvm-svn: 58630	2008-11-03 19:29:29 +00:00
Devang Patel	78b7de25d1	Turn floating point IVs into integer IVs where possible. This allows SCEV users to effectively calculate trip count. LSR later on transforms back integer IVs to floating point IVs later on to avoid int-to-float casts inside the loop. llvm-svn: 58625	2008-11-03 18:32:19 +00:00
Andrew Lenharth	312c00cdd4	Ensure that we are checking only calls to the function we are interested in specializing llvm-svn: 58615	2008-11-03 16:05:35 +00:00
Nick Lewycky	f393d56364	Replace explicit loop with utility function. llvm-svn: 58593	2008-11-03 03:49:14 +00:00
Nick Lewycky	49abbde699	Changes from Duncan's review: * merge two weak functions by making them both alias a third non-weak fn * don't reimplement CallSite::hasArgument * whitelist the safe linkage types llvm-svn: 58568	2008-11-02 16:46:26 +00:00
Duncan Sands	613d060c2e	Get this building on 64 bit machines (error: cast from ‘const llvm::PointerType*’ to ‘unsigned int’ loses precision). llvm-svn: 58561	2008-11-02 09:00:33 +00:00
Oscar Fuentes	45b63793cf	CMake: added a source file. llvm-svn: 58559	2008-11-02 06:01:39 +00:00
Nick Lewycky	15a23d029c	Add a new MergeFunctions pass. It finds identical functions and merges them. This triggers only 60 times in llvm-test (look at .llvm.bc, not .linked.rbc) and so it probably wont be turned on by default. Also, may of those are likely to go away when PR2973 is fixed. llvm-svn: 58557	2008-11-02 05:52:50 +00:00
Nick Lewycky	bcadcbb1ec	Fix demanded bits analysis with srem by negative number. Based on a patch by Richard Osborne. llvm-svn: 58555	2008-11-02 02:41:50 +00:00
Dan Gohman	1f1ebc5389	Fix this recently moved code to use the correct type. CI is now a ConstantInt, and SI is the original cast instruction. This fixes PR2996. llvm-svn: 58549	2008-11-02 00:17:33 +00:00
Daniel Dunbar	8abdac1eb4	Fix warning. llvm-svn: 58486	2008-10-31 01:50:01 +00:00
Dan Gohman	50061675c5	Canonicalize sext(i1) to i1?-1:0, and update various instcombine optimizations accordingly. llvm-svn: 58457	2008-10-30 20:40:10 +00:00
Daniel Dunbar	097da598fb	Add InlineCost class for represent the estimated cost of inlining a function. - This explicitly models the costs for functions which should "always" or "never" be inlined. This fixes bugs where such costs were not previously respected. llvm-svn: 58450	2008-10-30 19:26:59 +00:00
Chris Lattner	4af91a146a	Fix PR2967 by not deleting volatile load/stores that occur before unreachable. I don't really see this as being needed, but there is little harm from doing it. llvm-svn: 58385	2008-10-29 17:46:26 +00:00
Daniel Dunbar	2265cf485e	Factor shouldInline method out of Inliner. - No functionality change. llvm-svn: 58355	2008-10-29 01:02:02 +00:00
Daniel Dunbar	e2fe2de3f4	Assorted comment/naming fixes, 80-col violations, and reindentation. - No functionality change. llvm-svn: 58352	2008-10-28 23:24:26 +00:00
Dan Gohman	3ceee36545	(A & sext(C)) \| (B & ~sext(C) -> C ? A : B llvm-svn: 58351	2008-10-28 22:38:57 +00:00
Torok Edwin	0c0acebd09	export an ID for the instructionNamer, allowing analysis/transformation passes that need it to require it by ID. llvm-svn: 58238	2008-10-27 10:16:27 +00:00
Chris Lattner	17afc211e7	Rewrite all the 'PromoteLocallyUsedAlloca[s]' logic. With the power of LargeBlockInfo, we can now dramatically simplify their implementation and speed them up at the same time. Now the code has time proportional to the number of uses of the alloca, not the size of the block. This also eliminates code that tried to batch up different allocas which are used in the same blocks, and eliminates the 'retry list' logic which was baroque and no unneccesary. In addition to being a speedup for crazy cases, this is also a nice cleanup: PromoteMemoryToRegister.cpp \| 270 +++++++++++++++----------------------------- 1 file changed, 96 insertions(+), 174 deletions(-) llvm-svn: 58229	2008-10-27 07:05:53 +00:00
Chris Lattner	39c853cf16	Add a new LargeBlockInfo helper, which is just a wrapper around a trivial dense map. Use this in RewriteSingleStoreAlloca to avoid aggressively rescanning blocks over and over again. This fixes PR2925, speeding up mem2reg on the testcase in that bug from 4.56s to 0.02s in a debug build on my machine. llvm-svn: 58227	2008-10-27 06:05:26 +00:00
Nick Lewycky	3d4f21006b	Add value range analyzing of Add and Sub. Understand that mul %x, 1 = %x. llvm-svn: 58069	2008-10-24 04:00:26 +00:00
Daniel Dunbar	5447f9679d	Change createPass factory functions to return Pass instead of LoopPass*. - Although less precise, this means they can be used in clients without RTTI (who would otherwise need to include LoopPass.h, which eventually includes things using dynamic_cast). This was the simplest solution that presented itself, but I am happy to use a better one if available. llvm-svn: 58010	2008-10-22 23:32:42 +00:00
Dan Gohman	a480933bbd	Use Function::getEntryBlock() instead of Function::front(), for clarity. llvm-svn: 57870	2008-10-21 03:10:28 +00:00
Dan Gohman	080dca2129	Fix a bug that prevented llvm-extract -delete from working. llvm-svn: 57864	2008-10-21 01:08:07 +00:00
Dan Gohman	133708df57	Use 0 instead of false to return a null pointer. llvm-svn: 57660	2008-10-17 00:56:52 +00:00
Dan Gohman	6f40163d83	Teach instcombine's visitLoad to scan back several instructions to find opportunities for store-to-load forwarding or load CSE, in the same way that visitStore scans back to do DSE. Also, define a new helper function for testing whether the addresses of two memory accesses are known to have the same value, and use it in both visitStore and visitLoad. These two changes allow instcombine to eliminate loads in code produced by front-ends that frequently emit obviously redundant addressing for memory references. llvm-svn: 57608	2008-10-15 23:19:35 +00:00
Evan Cheng	591baeed7c	Combine (fcmp cc0 x, y) \| (fcmp cc1 x, y) into a single fcmp when possible. llvm-svn: 57515	2008-10-14 18:44:08 +00:00
Evan Cheng	778b47e6c0	- Somehow I forgot about one / une. - Renumber fcmp predicates to match their icmp counterparts. - Try swapping operands to expose more optimization opportunities. llvm-svn: 57513	2008-10-14 18:13:38 +00:00
Evan Cheng	91528965e7	Optimize anding of two fcmp into a single fcmp if the operands are the same. e.g. uno && ueq -> ueq ord && olt -> olt ord && ueq -> oeq llvm-svn: 57507	2008-10-14 17:15:11 +00:00
Matthijs Kooijman	b86d48c717	Make InstructionCombining::getBitCastOperand() recognize GEP instructions and constant expression with all zero indices as being the same as a bitcast. llvm-svn: 57442	2008-10-13 15:17:01 +00:00
Chris Lattner	7a61ef92f5	Fix PR2697 by rewriting the '(X / pos) op neg' logic. This also changes a couple other cases for clarity, but shouldn't affect correctness. Patch by Eli Friedman! llvm-svn: 57387	2008-10-11 22:55:00 +00:00
Devang Patel	255fee3bce	Check loop exit predicate properly while eliminating one iteration loop. This patch fixes PR 2869 llvm-svn: 57369	2008-10-10 22:02:57 +00:00
Nuno Lopes	cc4f37aa68	fix memleak by cleaning the global sets on pass exit llvm-svn: 57353	2008-10-10 16:25:50 +00:00
Dale Johannesen	075a62519f	Add a "loses information" return value to APFloat::convert and APFloat::convertToInteger. Restore return value to IEEE754. Adjust all users accordingly. llvm-svn: 57329	2008-10-09 23:00:39 +00:00
Nick Lewycky	843c7dfe05	Don't drop alignment on globals when cloning. llvm-svn: 57320	2008-10-09 06:27:14 +00:00
Nuno Lopes	a4813e472c	dont specialize weak functions and the like llvm-svn: 57305	2008-10-08 18:45:59 +00:00
Duncan Sands	8f296a3788	Add <cstdio> include where needed by gcc-4.4. Patch by Samuel Tardieu. llvm-svn: 57291	2008-10-08 07:23:46 +00:00
Chris Lattner	0aa4423c8c	Add parentheses to avoid warnings in GCC 4.4.0, patch by Samuel Tardieu! llvm-svn: 57288	2008-10-08 06:42:28 +00:00
Andrew Lenharth	c83731e00a	Correctly set attributes when removing args during cloning. Fixes PR2765 llvm-svn: 57254	2008-10-07 18:08:38 +00:00
Devang Patel	378c8633e5	Fix typo, fix PR 2865. llvm-svn: 57221	2008-10-06 23:22:54 +00:00
Matthijs Kooijman	12cd5d041d	Allow scalarrepl to treat an all-zero GEP just as bitcast. This includes not marking a GEP involving a vector as unsafe, but only when it has all zero indices. This allows scalarrepl to work in a few more cases. llvm-svn: 57177	2008-10-06 16:23:31 +00:00
Chris Lattner	107e8f8b60	rewrite bswap matching to be more general, allowing arbitrary shifting and masking inside a bswap expr. This allows it to handle the cases from PR2842, which involve the intermediate 'or' expressions being shifted, not just the input value. llvm-svn: 57095	2008-10-05 02:13:19 +00:00
Chris Lattner	6fae76bbf3	fix a bug where the bswap matcher could match a case involving ashr. It should only apply to lshr. llvm-svn: 57089	2008-10-05 00:50:57 +00:00
Duncan Sands	e22385630e	Ignore loads from and stores to local memory (i.e. allocas) when deciding whether to mark a function readnone/readonly. Since the pass is currently run before SROA, this may be quite helpful. Requested by Chris on IRC. llvm-svn: 57050	2008-10-04 13:24:24 +00:00
Dan Gohman	700611dfc6	Clean up some multiple-return-value code that is no longer applicable. llvm-svn: 57033	2008-10-03 22:21:24 +00:00
Devang Patel	55401473f9	Nick Lewycky's patch. While hosting instruction check PHI node. llvm-svn: 57025	2008-10-03 18:57:37 +00:00
Duncan Sands	5edd9b2350	Teach internalize to preserve the callgraph. Why? Because it was there! llvm-svn: 56996	2008-10-03 07:36:09 +00:00
Owen Anderson	e98ca283b0	SplitBlock should only attempt to update LoopInfo if it is actually being used. llvm-svn: 56994	2008-10-03 06:55:35 +00:00
Duncan Sands	88d8323743	Factorize code: remove variants of "strip off pointer bitcasts and GEP's", and centralize the logic in Value::getUnderlyingObject. The difference with stripPointerCasts is that stripPointerCasts only strips GEPs if all indices are zero, while getUnderlyingObject strips GEPs no matter what the indices are. llvm-svn: 56922	2008-10-01 15:25:41 +00:00
Nuno Lopes	8bf32ebde6	revert the addition of Preverves(CallGraph), per Duncan's comments llvm-svn: 56917	2008-10-01 09:13:40 +00:00
Dan Gohman	b6d0fe7797	Call ScalarEvolution's deleteValueFromRecords before deleting an instruction, not after. This fixes some uses of free'd memory. llvm-svn: 56908	2008-10-01 02:02:03 +00:00
Nuno Lopes	36e7e0b190	add preserversCFG() + preservers(CallGraph) llvm-svn: 56887	2008-09-30 22:04:30 +00:00
Nuno Lopes	2219758606	add AU.setPreservesCFG() since this pass only adds and removes function attributes llvm-svn: 56868	2008-09-30 18:34:38 +00:00
Nick Lewycky	9e918179c8	Fix misoptimization of: xor i1 (icmp eq (X, C1), icmp s[lg]t (X, C2)) llvm-svn: 56834	2008-09-30 06:08:34 +00:00
Duncan Sands	187322048e	Speed up these passes when the callgraph has huge simply connected components. Suggested by Chris. llvm-svn: 56787	2008-09-29 14:59:04 +00:00
Nuno Lopes	a4d69747c1	remove redundant test (mayBeOverriden() includes hasLinkOnceLinkage) llvm-svn: 56786	2008-09-29 14:40:32 +00:00
Duncan Sands	60dc0b5cbe	Tweak some comments. llvm-svn: 56784	2008-09-29 13:35:31 +00:00
Duncan Sands	651eeb4be2	Rename isWeakForLinker to mayBeOverridden. Use it instead of hasWeakLinkage in a bunch of optimization passes. llvm-svn: 56782	2008-09-29 11:25:42 +00:00
Devang Patel	47a504c87c	Implement function notes as function attributes. llvm-svn: 56716	2008-09-26 23:51:19 +00:00
Devang Patel	e4e6ea0728	Now Attributes are divided in three groups - return attributes - inreg, zext and sext - parameter attributes - function attributes - nounwind, readonly, readnone, noreturn Return attributes use 0 as the index. Function attributes use ~0U as the index. This patch requires corresponding changes in llvm-gcc and clang. llvm-svn: 56704	2008-09-26 22:53:05 +00:00
Devang Patel	64dd7a2e89	Large mechanical patch. s/ParamAttr/Attribute/g s/PAList/AttrList/g s/FnAttributeWithIndex/AttributeWithIndex/g s/FnAttr/Attribute/g This sets the stage - to implement function notes as function attributes and - to distinguish between function attributes and return value attributes. This requires corresponding changes in llvm-gcc and clang. llvm-svn: 56622	2008-09-25 21:00:45 +00:00
Evan Cheng	99c7d558a5	Commit CodeGenPrepare.cpp changes which was accidentially left out of 56526. llvm-svn: 56549	2008-09-24 06:48:55 +00:00
Eric Christopher	463646c8aa	Fix fallout in CodeGenPrepare from 56526. Will likely need more work. llvm-svn: 56546	2008-09-24 05:32:41 +00:00
Devang Patel	f0bda74eca	s/ParamAttrsWithIndex/FnAttributeWithIndex/g llvm-svn: 56535	2008-09-24 00:55:02 +00:00
Devang Patel	25ecdc49d9	Put FN_NOTE_AlwaysInline and others in FnAttr namespace. llvm-svn: 56527	2008-09-24 00:06:15 +00:00
Devang Patel	b39defd763	Move FN_NOTE_AlwaysInline and other out of ParamAttrs namespace. Do not check isDeclaration() in hasNote(). It is clients' responsibility. llvm-svn: 56524	2008-09-23 23:52:03 +00:00
Devang Patel	a3e9bf1bca	s/ParameterAttributes/Attributes/g llvm-svn: 56513	2008-09-23 23:03:40 +00:00
Devang Patel	ad8ca34acd	Use parameter attribute store (soon to be renamed) for Function Notes also. Function notes are stored at index ~0. llvm-svn: 56511	2008-09-23 22:35:17 +00:00
Devang Patel	c7fc5ed65f	Add hasNote() to check note associated with a function. llvm-svn: 56477	2008-09-22 22:32:29 +00:00
Oscar Fuentes	0f25988689	Initial support for the CMake build system. llvm-svn: 56419	2008-09-22 01:08:49 +00:00
Duncan Sands	38b44d2a40	Implement review feedback from Devang: make use of mayReadFromMemory and mayWriteToMemory. llvm-svn: 56387	2008-09-20 16:45:58 +00:00
Duncan Sands	b2b5ddfaf6	Remove the MarkModRef pass (use AddReadAttrs instead). Unfortunately this means removing one regression test of GlobalsModRef because I couldn't work out how to perform it without MarkModRef. llvm-svn: 56342	2008-09-19 08:23:44 +00:00
Duncan Sands	c2ff9ca568	Add a new pass AddReadAttrs which works out which functions can get the readnone/readonly attributes, and gives them it. The plan is to remove markmodref (which did the same thing by querying GlobalsModRef) and delete the analogous functionality from GlobalsModRef. llvm-svn: 56341	2008-09-19 08:17:05 +00:00
Devang Patel	1e7ddf5d31	splitLoop does not handle split condition EQ. Fixes PR 2805 llvm-svn: 56321	2008-09-18 23:45:14 +00:00
Bill Wendling	aadf9b22a7	Decrementing the iterator here could be wrong if the worklist is empty after the "erase". Thanks to Ji Young Park for the patch! llvm-svn: 56316	2008-09-18 23:04:18 +00:00
Devang Patel	201223af46	Try to place hoisted instructions befoe icmp instruction. llvm-svn: 56315	2008-09-18 22:50:42 +00:00
Devang Patel	88efee2dcc	Do not hoist instruction above branch condition. The instruction may use branch condition. llvm-svn: 56286	2008-09-17 18:21:49 +00:00
Devang Patel	50c22bf69b	Do not ignore iv uses outside the loop. This one slipped through cracks very well. llvm-svn: 56284	2008-09-17 17:53:47 +00:00
Dan Gohman	c598e29a1c	Improve instcombine's handling of integer min and max in two ways: - Recognize expressions like "x > -1 ? x : 0" as min/max and turn them into expressions like "x < 0 ? 0 : x", which is easily recognizable as a min/max operation. - Refrain from folding expression like "y/2 < 1" to "y < 2" when the comparison is being used as part of a min or max idiom, like "y/2 < 1 ? 1 : y/2". In that case, the division has another use, so folding doesn't eliminate it, and obfuscates the min/max, making it harder to recognize as a min/max operation. These benefit ScalarEvolution, CodeGen, and anything else that wants to recognize integer min and max. llvm-svn: 56246	2008-09-16 18:46:06 +00:00
Dan Gohman	faa15214e0	Teach LSR to optimize away SMAX operations for tripcounts in common cases. See the comment above OptimizeSMax for the full story, and the testcase for an example. This cancels out a pessimization commonly attributed to indvars, and will allow us to lift some of the artificial throttles in indvars, rather than add new ones. llvm-svn: 56230	2008-09-15 21:22:06 +00:00
Dan Gohman	0b6d3a9a9b	On 64-bit targets, change 32-bit getelementptr indices to be 64-bit getelementptr indices, inserting an explicit cast if necessary. This helps expose the sign-extension operation to other optimizations. llvm-svn: 56133	2008-09-11 23:06:38 +00:00
Dan Gohman	5e154a591d	Fix a vectorshuffle instcombine bug introduced by r55995. Patch by Nicolas Capens! llvm-svn: 56129	2008-09-11 22:47:57 +00:00
Dan Gohman	4ae5fa57b4	Fix a copy+paste bug that Duncan spotted. For several cases it was still getting lucky and detecting overflow but it was clearly incorrect. llvm-svn: 56113	2008-09-11 18:53:02 +00:00
Dan Gohman	36e01356ec	In my analysis for r56076 I missed the case where the original multiplication overflows. llvm-svn: 56082	2008-09-11 00:25:00 +00:00
Dan Gohman	ebfb483309	Fix an icmp+sdiv optimization to check for and handle an overflow condition. This fixes PR2740. llvm-svn: 56076	2008-09-10 23:30:57 +00:00
Devang Patel	092285d4e4	fix white spaces. llvm-svn: 56056	2008-09-10 14:49:55 +00:00
Dan Gohman	3809753d3b	Fix a warning about comparing signed and unsigned values. llvm-svn: 56040	2008-09-10 01:09:32 +00:00
Devang Patel	7365754925	if loop induction variable is always sign or zero extended then extend the type of induction variable. llvm-svn: 56017	2008-09-09 21:41:07 +00:00
Devang Patel	36ad68421e	fix overflow check. llvm-svn: 56011	2008-09-09 20:54:34 +00:00
Anton Korobeynikov	6ad8b060d0	Make safer variant of alias resolution routine to be default llvm-svn: 56005	2008-09-09 20:05:04 +00:00
Anton Korobeynikov	4676c09dc8	Resolve aliases, when possible llvm-svn: 56001	2008-09-09 19:04:59 +00:00
Dan Gohman	28c911b79b	Make SimplifyDemandedVectorElts simplify vectors with multiple users, and teach it about shufflevector instructions. Also, fix a subtle bug in SimplifyDemandedVectorElts' insertelement code. This is a patch that was originally written by Eli Friedman, with some fixes and cleanup by me. llvm-svn: 55995	2008-09-09 18:11:14 +00:00
Devang Patel	b6380f607d	Fix simplifycfg crash in handing block merge. llvm-svn: 55971	2008-09-09 01:06:56 +00:00
Devang Patel	7df7366784	s/RemoveUnreachableBlocks/RemoveUnreachableBlocksFromFn/g llvm-svn: 55965	2008-09-08 22:14:17 +00:00
Devang Patel	f341d87755	Remove unused counter. llvm-svn: 55924	2008-09-08 17:14:54 +00:00
Devang Patel	071f99e1df	Remove OptimizeIVType() llvm-svn: 55913	2008-09-08 16:13:27 +00:00
Duncan Sands	0fcef7437c	Update the callgraph correctly. llvm-svn: 55896	2008-09-08 11:08:09 +00:00
Duncan Sands	62431a29a5	Update the callgraph correctly in ArgumentPromotion. llvm-svn: 55895	2008-09-08 11:07:35 +00:00
Duncan Sands	58d434d3e8	Reapply 55859. This doesn't change anything as long as the callgraph is correct. It checks for wrong callgraphs more strictly. llvm-svn: 55894	2008-09-08 11:05:51 +00:00
Duncan Sands	ae32389a41	Correct a comment and strip trailing whitespace. llvm-svn: 55883	2008-09-07 09:54:09 +00:00
Nuno Lopes	a03bc9ce10	fix crash when the malloc/free function is defined or is a declaration with 0 parameters. this pass doesnt seem to be used, but still it's now a little more correct llvm-svn: 55873	2008-09-06 17:44:06 +00:00
Duncan Sands	c4ec7871bf	When PruneEH turned an invoke into an ordinary call (thus changing the call site) it didn't inform the callgraph about this. But the call site does matter - as shown by the testcase, the callgraph become invalid after the inliner ran (with an edge between two functions simply missing), resulting in wrong deductions by GlobalsModRef. llvm-svn: 55872	2008-09-06 17:19:29 +00:00
Owen Anderson	453bcfcf8d	Revert r55859. This is breaking the build in the abscence of its companion commit. llvm-svn: 55865	2008-09-05 23:36:01 +00:00
Devang Patel	4148488e49	Remove unused map. llvm-svn: 55861	2008-09-05 21:55:33 +00:00
Duncan Sands	fdfa2d24fe	Delete the removeCallEdgeTo callgraph method, because it does not maintain a correct list of callsites. I discovered (see following commit) that the inliner will create a wrong callgraph if it is fed a callgraph with correct edges but incorrect callsites. These were created by Prune-EH, and while it wasn't done via removeCallEdgeTo, it could have been done via removeCallEdgeTo, which is an accident waiting to happen. Use removeCallEdgeFor instead. llvm-svn: 55859	2008-09-05 21:43:04 +00:00
Duncan Sands	b63fde1edb	Use removeAllCalledFunctions rather than removing edges one by one by hand. llvm-svn: 55836	2008-09-05 14:56:53 +00:00
Duncan Sands	c568792f0a	Remove trailing whitespace. llvm-svn: 55835	2008-09-05 12:37:12 +00:00
Duncan Sands	8de0cd8f10	Make this pass return that it made a change if it modifies a functions attributes. llvm-svn: 55831	2008-09-05 09:08:37 +00:00
Devang Patel	3d42ac68df	A loop may be unswitched multiple times. Reconstruct dom info. at the end. llvm-svn: 55806	2008-09-04 22:43:59 +00:00
Devang Patel	25b88b66e8	Initialize loop data first. llvm-svn: 55792	2008-09-04 20:36:36 +00:00
Devang Patel	c96f319a3c	Do not unswitch if the function notes say we're optimizing this function for size. llvm-svn: 55786	2008-09-04 18:55:13 +00:00
Andrew Lenharth	4523deb38b	try to seperate the mechanism into something others can use llvm-svn: 55785	2008-09-04 18:51:26 +00:00
Dale Johannesen	bbd799c544	Add intrinsic forms of pow and exp2. The non-intrinsic forms remain to handle older IR files, but will go away soon. llvm-svn: 55781	2008-09-04 18:30:46 +00:00
Dan Gohman	e1f9be27bc	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Andrew Lenharth	9e27a7b4b0	cleanup as per Duncan's review llvm-svn: 55766	2008-09-04 14:34:22 +00:00
Devang Patel	49483d797e	Update inline threshold for current function if the notes say, optimize for size. llvm-svn: 55745	2008-09-03 23:06:09 +00:00
Owen Anderson	94bd638e81	Fix a bug that prevented PRE from applying in some cases. llvm-svn: 55744	2008-09-03 23:06:07 +00:00
Andrew Lenharth	4a69775c64	Initial version of a Partial Specialization IPO pass. It triggers a couple hundred times on 176.gcc. I don't know the performance impact yet, the heuristic is quite simple still. llvm-svn: 55734	2008-09-03 21:00:28 +00:00
Devang Patel	41331065dd	Fix typo in a comment. llvm-svn: 55720	2008-09-03 20:25:40 +00:00
Devang Patel	b25dcdea78	Add parentheses to make code more readable. llvm-svn: 55717	2008-09-03 19:57:15 +00:00
Devang Patel	24618c5a6a	Fix comments. llvm-svn: 55716	2008-09-03 19:52:17 +00:00
Devang Patel	c70392bfcd	Add custom inliner that handles only functions that are marked as always_inline. llvm-svn: 55713	2008-09-03 18:50:53 +00:00
Devang Patel	42fe8ff6e8	Handle "always inline" note during inline cost analysis. llvm-svn: 55712	2008-09-03 18:47:45 +00:00
Devang Patel	dae7f0b9d7	Check noinline note and ignore other notes. llvm-svn: 55711	2008-09-03 18:46:35 +00:00
Devang Patel	653bcea20d	Handle "noinline" note inside the simple inliner. llvm-svn: 55708	2008-09-03 18:10:21 +00:00
Nick Lewycky	3b35dcc455	Don't apply this transform to vectors. Fixes PR2756. llvm-svn: 55690	2008-09-03 06:24:21 +00:00
Devang Patel	5fa8cc79ed	Add additional check to ensure that iv is canonicalized. llvm-svn: 55682	2008-09-03 00:29:13 +00:00
Devang Patel	4dfecae8fe	Check iteration count. llvm-svn: 55680	2008-09-03 00:10:56 +00:00
Devang Patel	df5dce4aa8	While removing PHI, use basicblock to identify incoming value. llvm-svn: 55678	2008-09-03 00:02:42 +00:00
Devang Patel	5659d2508e	s/FP_AlwaysInline/FN_NOTE_AlwaysInline/g llvm-svn: 55676	2008-09-02 22:43:57 +00:00
Devang Patel	a1e2066b1d	If all IV uses are extending integer IV then change the type of IV itself, if possible. llvm-svn: 55674	2008-09-02 22:18:08 +00:00
Devang Patel	cda9086d29	respect inline=never and inline=always notes. llvm-svn: 55673	2008-09-02 22:16:13 +00:00
Duncan Sands	0338e9f60d	Add a small pass that sets the readnone/readonly attributes on functions, based on the result of alias analysis. It's not hardwired to use GlobalsModRef even though this is the only (AFAIK) alias analysis that results in this pass actually doing something. Enable as follows: opt ... -globalsmodref-aa -markmodref ... Advantages of this pass: (1) records the result of globalsmodref in the bitcode, meaning it is available for use by later passes (currently the pass manager isn't smart enough to magically make an advanced alias analysis available to all later passes), which may expose more optimization opportunities; (2) hopefully speeds up compilation when code is optimized twice, for example when a file is compiled to bitcode, then later LTO is done on it: marking functions readonly/readnone when producing the initial bitcode should speed up alias analysis during LTO; (3) good for discovering that globalsmodref doesn't work very well :) Not currently turned on by default. llvm-svn: 55604	2008-09-01 11:40:11 +00:00
Devang Patel	7abe1944ae	Do not apply the transformation if the target does not support DestTy natively. llvm-svn: 55433	2008-08-27 20:55:23 +00:00
Devang Patel	23569e6a1f	Fix typos and whitespaces. Other cosmetic changes based on feedback. llvm-svn: 55424	2008-08-27 17:50:18 +00:00
Owen Anderson	14510f8fee	Put a heuristic in place to prevent GVN from falling into bad cases with massively complicated CFGs. This speeds up a particular testcase from 12+ hours to 5 seconds with little perceptible loss of quality. llvm-svn: 55391	2008-08-26 22:07:42 +00:00
Devang Patel	211c05f7a3	If IV is used in a int-to-float cast inside the loop then try to eliminate the cast operation. llvm-svn: 55374	2008-08-26 17:57:54 +00:00
Chris Lattner	9f10b57760	improve encapsulation of the BBExecutable set. llvm-svn: 55271	2008-08-23 23:39:31 +00:00
Chris Lattner	15f29cad83	Switch an assortment of maps, sets and vectors to more efficient versions, patch contributed by m-s! llvm-svn: 55270	2008-08-23 23:36:38 +00:00
Chris Lattner	fe3155fc62	Switch the asmprinter (.ll) and all the stuff it requires over to use raw_ostream instead of std::ostream. Among other goodness, this speeds up llvm-dis of kc++ with a release build from 0.85s to 0.49s (88% faster). Other interesting changes: 1) This makes Value::print be non-virtual. 2) AP[S]Int and ConstantRange can no longer print to ostream directly, use raw_ostream instead. 3) This fixes a bug in raw_os_ostream where it didn't flush itself when destroyed. 4) This adds a new SDNode::print method, instead of only allowing "dump". A lot of APIs have both std::ostream and raw_ostream versions, it would be useful to go through and systematically anihilate the std::ostream versions. This passes dejagnu, but there may be minor fallout, plz let me know if so and I'll fix it. llvm-svn: 55263	2008-08-23 22:23:09 +00:00
Chris Lattner	2f629c4b80	Add a new trivial -inst-namer pass which makes it possible to diff the before/after effects of a pass, crazy! llvm-svn: 55230	2008-08-23 06:07:02 +00:00
Chris Lattner	72ee6ebb0a	Fix PR2423 by checking all indices for out of range access, not only indices that start with an array subscript. x->field[10000] is just as bad as (*X)[14][10000]. llvm-svn: 55226	2008-08-23 05:21:06 +00:00
Chris Lattner	fd64cbf22d	consolidate DenseMapInfo implementations, and add one for std::pair. Patch contributed by m-s. llvm-svn: 55167	2008-08-22 05:08:25 +00:00
Nick Lewycky	7b87c4d8a4	Revert r54876 r54877 r54906 and r54907. Evan found that these caused a 20% slowdown in bzip2. llvm-svn: 55113	2008-08-21 05:56:10 +00:00
Evan Cheng	0a35c35153	Silence a compiler warning. llvm-svn: 55087	2008-08-20 23:36:48 +00:00
Mon P Wang	e5aefd3967	Fixed shuffle optimizations to handle non power of 2 vectors llvm-svn: 55035	2008-08-20 02:23:25 +00:00
Chris Lattner	b5e0d71a07	don't use the result of WriteAsOperand llvm-svn: 54979	2008-08-19 04:45:19 +00:00
Nick Lewycky	371935de4a	Make this comment clearer. Instead of using an ambiguous ~ (not) on an icmp predicate, swap the order of the operands. llvm-svn: 54907	2008-08-17 20:02:02 +00:00
Nick Lewycky	30a0ad8900	Consider the case where xor by -1 and xor by 128 have been combined already to produce an xor by 127. llvm-svn: 54906	2008-08-17 19:58:24 +00:00
Gordon Henriksen	2cc861a6c1	Rename some GC classes so that their roll will hopefully be clearer. In particular, Collector was confusing to implementors. Several thought that this compile-time class was the place to implement their runtime GC heap. Of course, it doesn't even exist at runtime. Specifically, the renames are: Collector -> GCStrategy CollectorMetadata -> GCFunctionInfo CollectorModuleMetadata -> GCModuleInfo CollectorRegistry -> GCRegistry Function::getCollector -> getGC (setGC, hasGC, clearGC) Several accessors and nested types have also been renamed to be consistent. These changes should be obvious. llvm-svn: 54899	2008-08-17 18:44:35 +00:00
Evan Cheng	a3c27d36bc	Revert 54821. It's miscompiling 252.eon and 447.dealII llvm-svn: 54878	2008-08-17 08:07:31 +00:00
Nick Lewycky	d797456fb4	I found a better place for this optz'n. llvm-svn: 54877	2008-08-17 07:54:14 +00:00
Nick Lewycky	205be593b8	Xor'ing both sides of icmp by sign-bit is equivalent to swapping signedness of the predicate. Also, make this optz'n apply in more cases where it's safe to do so. llvm-svn: 54876	2008-08-17 07:34:14 +00:00
Chris Lattner	c0610874cc	Rework the routines that convert AP[S]Int into a string. Now, instead of returning an std::string by value, it fills in a SmallString/SmallVector passed in. This significantly reduces string thrashing in some cases. More specifically, this: - Adds an operator<< and a print method for APInt that allows you to directly send them to an ostream. - Reimplements APInt::toString to be much simpler and more efficient algorithmically in addition to not thrashing strings quite as much. This speeds up llvm-dis on kc++ by 7%, and may also slightly speed up the asmprinter. This also fixes a bug I introduced into the asmwriter in a previous patch w.r.t. alias printing. llvm-svn: 54873	2008-08-17 07:19:36 +00:00
Owen Anderson	13498aa150	Remove GCSE, ValueNumbering, and LoadValueNumbering. These have been deprecated for almost a year; it's finally time for them to go away. llvm-svn: 54822	2008-08-15 21:31:02 +00:00
Devang Patel	a8e9ee0529	Reapply 54786. Add overflow and number of mantissa bits checks. llvm-svn: 54821	2008-08-15 21:21:34 +00:00
Evan Cheng	b2e1459e48	Revert 54786. It's not checking for overflows, etc. llvm-svn: 54813	2008-08-15 08:12:11 +00:00
Chris Lattner	28a74aa6ca	use smallvector instead of vector for a couple worklists. This speeds up instcombine by ~10% on some testcases. llvm-svn: 54811	2008-08-15 04:03:01 +00:00
Bill Wendling	bebfe7b7fc	Temporarily revert r54792. It's causing an ICE during bootstrapping. llvm-svn: 54804	2008-08-14 23:05:24 +00:00
Devang Patel	8aa62f7dfa	Use DenseMap. Patch by Pratik Solanki. llvm-svn: 54792	2008-08-14 21:31:10 +00:00
Devang Patel	67f9b08a91	If IV is used in a int-to-float cast inside the loop then try to eliminate the cast opeation. llvm-svn: 54786	2008-08-14 20:58:31 +00:00
Dan Gohman	027326ba3e	Use empty() instead of begin() == end(). llvm-svn: 54780	2008-08-14 18:13:49 +00:00
Matthijs Kooijman	3bbc2deb9c	Replace two for loops with while(!X->use_empty()) loops. This prevents invalidating the iterator by deleting the current use. This fixes a segfault on 64 bit linux reported in PR2675. Also remove an unneeded if. llvm-svn: 54778	2008-08-14 15:03:05 +00:00
Dan Gohman	db5b503d60	Fix a bogus srem rule - a negative value srem'd by a power-of-2 can have a non-negative result; for example, -16%16 is 0. Also, clarify the related comments. This fixes PR2670. llvm-svn: 54767	2008-08-13 23:12:35 +00:00
Dan Gohman	374d9328b7	Fix SCCP's handling of struct value loads and stores. SCCP doesn't track individual leaf values in such cases, so it needs to treat struct values as normal values in this case. llvm-svn: 54760	2008-08-13 21:22:48 +00:00

... 3 4 5 6 7 ...

4833 Commits