llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-28 06:27:16 +00:00

Author	SHA1	Message	Date
Dan Gohman	fb30c38893	Mark the x86 fp stack registers as "reserved". This tells LiveVariables and the RegisterScavenger not to expect that traditional liveness techniques are applicable to these registers, since we don't fully model the effects of push and pop after stackification. llvm-svn: 61179	2008-12-18 01:05:09 +00:00
Dale Johannesen	3e0c1f771b	Fix the time regression I introduced in 464.h264ref with my last patch to this file. The issue there was that all uses of an IV inside a loop are actually references to Base[IV2], and there was one use outside that was the same but LSR didn't see the base or the scaling because it didn't recurse into uses outside the loop; thus, it used base+IVscale mode inside the loop instead of pulling base out of the loop. This was extra bad because register pressure later forced both base and IV into memory. Doing that recursion, at least enough to figure out addressing modes, is a good idea in general; the change in AddUsersIfInteresting does this. However, there were side effects.... It is also possible for recursing outside the loop to introduce another IV where there was only 1 before (if the refs inside are not scaled and the ref outside is). I don't think this is a common case, but it's in the testsuite. It is right to be very aggressive about getting rid of such introduced IVs (CheckForIVReuse and the handling of nonzero RewriteFactor in StrengthReduceStridedIVUsers). In the testcase in question the new IV produced this way has both a nonconstant stride and a nonzero base, neither of which was handled before. (This patch does not handle all the cases where this can happen.) And when inserting new code that feeds into a PHI, it's right to put such code at the original location rather than in the PHI's immediate predecessor(s) when the original location is outside the loop (a case that couldn't happen before) (RewriteInstructionToUseNewBase); better to avoid making multiple copies of it in this case. Everything above is exercised in CodeGen/X86/lsr-negative-stride.ll (and ifcvt4 in ARM which is the same IR). llvm-svn: 61178	2008-12-18 00:57:22 +00:00
Chris Lattner	d159077cb9	reapply this hunk from Bill's reversion in r61169, it is conservative and safe and orthogonal to turning off load pre. llvm-svn: 61177	2008-12-18 00:51:32 +00:00
Chris Lattner	005d68a2a9	make instnamer name unnamed blocks as well as instructions and args. llvm-svn: 61175	2008-12-18 00:33:11 +00:00
Bill Wendling	3eb7c0254b	Temporarily revert r61027. It was causing a bootstrap failure in "release" mode with everyone's favorite error messages: Comparing stages 2 and 3 warning: ./cc1-checksum.o differs warning: ./cc1plus-checksum.o differs Bootstrap comparison failure! ./c-decl.o differs ./cp/decl.o differs ./df-core.o differs ./gcc.o differs ./i386.o differs ./stor-layout.o differs ./tree-pretty-print.o differs ./tree.o differs make[2]: *** [compare] Error 1 make[1]: *** [stage3-bubble] Error 2 See PR3227. llvm-svn: 61169	2008-12-17 23:31:20 +00:00
Devang Patel	ceeecba890	Today the front-ends (llvm-gcc and clang) generate multiple llvm.dbg.compile_units to identify source file for various debug entities. Each llvm.dbg.compile_unit matches one file on the disk. However, the backend only supports one DW_TAG_compile_unit per .o file. The backend selects first compile_unit from the vector to construct DW_TAG_compile_unit entry, which is not correct in all cases. First step to resolve this is, record file name and directory directly in debug info for various debug entities. llvm-svn: 61164	2008-12-17 22:39:29 +00:00
Owen Anderson	5f1bc95673	Revert r61158 for now, as it caused some test failures. llvm-svn: 61159	2008-12-17 22:17:27 +00:00
Owen Anderson	446162d848	Fix miscompilations caused by renumbering, and enable it as part of prealloc splitting. llvm-svn: 61158	2008-12-17 22:06:59 +00:00
Chris Lattner	a2aa680882	This adds some missing functions to the C binding: - ability to insert previously created instructions using a builder - creation of aliases - creation of inline asm constants Patch by Zoltan Varga! llvm-svn: 61153	2008-12-17 21:39:50 +00:00
Bill Wendling	d364440e53	Forgot to revert r61031 when I reverted r61019, r61030, and r61040. llvm-svn: 61150	2008-12-17 20:59:57 +00:00
Mon P Wang	bc3622287b	Fix expansion of vsetcc to set the high bit for true instead of 1. llvm-svn: 61129	2008-12-17 08:49:47 +00:00
Chris Lattner	c6134bffaf	insert some sequence points and preincrement an iterator to avoid iterator invalidation problems. llvm-svn: 61124	2008-12-17 05:42:08 +00:00
Chris Lattner	196c166a06	Enhance heap sra to be substantially more aggressive w.r.t PHI nodes. This allows it to do fairly general phi insertion if a load from a pointer global wants to be SRAd but the load is used by (recursive) phi nodes. This fixes a pessimization on ppc introduced by Load PRE. llvm-svn: 61123	2008-12-17 05:28:49 +00:00
Dan Gohman	a8796f4908	Double the amount of memory reserved for SUnits. This is a temporary workaround for an obscure bug. When node cloning is used, it is possible that more SUnits will be created, and if the SUnits std::vector has to reallocate, it will invalidate all the graph edges. llvm-svn: 61122	2008-12-17 04:30:46 +00:00
Dan Gohman	6ee60e3ac3	Use getDepth() and getHeight() instead of accessing the Depth and Height members directly, as they may not be current. llvm-svn: 61121	2008-12-17 04:25:52 +00:00
Eli Friedman	4aae828bf8	Fix for PR3225: disable a broken optimization in DAGTypeLegalizer::ExpandShiftWithKnownAmountBit. In terms of restoring the optimization, the best fix here isn't obvious... any ideas? llvm-svn: 61119	2008-12-17 03:35:17 +00:00
Dale Johannesen	7a81d1b0ab	Clarify that the scale factor from CheckForIVReuse can be negative. Keep track of whether all uses of an IV are outside the loop. Some cosmetics; no functional change. llvm-svn: 61109	2008-12-16 22:16:28 +00:00
Dale Johannesen	e348900657	A new dag combine; several permutations of this are there under ADD, this one was missing. llvm-svn: 61107	2008-12-16 22:13:49 +00:00
Owen Anderson	36aba82416	Add code to renumber split intervals into new vregs. This is disabled for now until I finish working out some iterator invalidation issues. llvm-svn: 61104	2008-12-16 21:35:08 +00:00
Chris Lattner	c4cc4a328f	Fix another crash found by inspection. If we have a PHI node merging the load multiple times, make sure to check the uses of the PHI to ensure they are transformable. llvm-svn: 61102	2008-12-16 21:24:51 +00:00
Chris Lattner	8b1f2f76d7	fix a crash found by inspection. llvm-svn: 61101	2008-12-16 21:04:51 +00:00
Eli Friedman	de614f9842	Add a helper to remove a branch and DCE the condition, and use it consistently for deleting branches. In addition to being slightly more readable, this makes SimplifyCFG a bit better about cleaning up after itself when it makes conditions unused. llvm-svn: 61100	2008-12-16 20:54:32 +00:00
Dan Gohman	38316f49bd	Eliminate the loop that walks the critical path. Instead, just track the position in the critical path during the main instruction walk. This eliminates the need for the CritialAntiDep DenseMap. llvm-svn: 61096	2008-12-16 19:27:52 +00:00
Bill Wendling	f807a68f2e	Temporarily revert r61019, r61030, and r61040. These were breaking LLVM Release builds. llvm-svn: 61094	2008-12-16 19:06:48 +00:00
Dan Gohman	f8f5625748	Preserve SourceValue information when lowering produces multiple loads from different offsets within the same stack slot. llvm-svn: 61093	2008-12-16 18:25:36 +00:00
Evan Cheng	96d87db03b	We have decided not to support inline asm where an output operand has a matching input operand of an incompatible type (i.e. either one is a floating point and the other is an integer, or the sizes of the types differ). SelectionDAGBuild will catch these and exit with an error. llvm-svn: 61092	2008-12-16 18:21:39 +00:00
Oscar Fuentes	9a300fe336	CMake: Added DbgInfoPrinter.cpp to lib/Analysis/CMakeLists.txt. llvm-svn: 61087	2008-12-16 12:25:04 +00:00
Torok Edwin	9562aa1742	Add -print-dbginfo pass that prints LLVM IR with comments inserted to show which source/line a certain BB/instruction comes from, original variable names, and original (unmangled) C++ name of functions. llvm-svn: 61085	2008-12-16 09:09:19 +00:00
Torok Edwin	fe974a7ca9	Add utility functions to search for DbgStopPointInst corresponding to an instruction or BasicBlock, and to search for DbgDeclareInst corresponding to a variable. llvm-svn: 61084	2008-12-16 09:07:36 +00:00
Torok Edwin	2e2c464771	use different name for parameter to make it clear that we set DIDescriptor::GV llvm-svn: 61083	2008-12-16 09:06:01 +00:00
Nick Lewycky	1b0fc83809	Generalize support for analyzing loops to include SLE/SGE loop exit conditions and support for non-unit strides with signed exit conditions. llvm-svn: 61082	2008-12-16 08:30:01 +00:00
Chris Lattner	e35c79577f	switch some std::set/std::map to SmallPtrSet/DenseMap. llvm-svn: 61081	2008-12-16 07:34:30 +00:00
Chris Lattner	b3becc5776	fix PR3217: fully cached queries need to be verified against the visited set before they are used. If used, their blocks need to be added to the visited set so that subsequent queries don't use conflicting pointer values in the cache result blocks. llvm-svn: 61080	2008-12-16 07:10:09 +00:00
Dan Gohman	10eb3ccaeb	Enable anti-dependence breaking by default when post-RA scheduling is enabled. llvm-svn: 61078	2008-12-16 06:21:45 +00:00
Dan Gohman	9f37a0296b	When breaking an anti-dependency, don't use a register which has seen one of its aliases defined. This is conservative, but tricky subreg corner cases are outside the primary aim of this pass. llvm-svn: 61077	2008-12-16 06:20:58 +00:00
Dan Gohman	c3e24d559b	Add initial support for back-scheduling address computations, especially in the case of addresses computed from loop induction variables. llvm-svn: 61075	2008-12-16 03:35:01 +00:00
Dan Gohman	e2cf452271	Remove some special-case logic in ScheduleDAGSDNodes's latency computation code that is no longer needed with the new method for handling latencies. llvm-svn: 61074	2008-12-16 03:31:11 +00:00
Dan Gohman	40a40dd7c1	Fix some register-alias-related bugs in the post-RA scheduler liveness computation code. Also, avoid adding output-dependency edges when both defs are dead, which frequently happens with EFLAGS defs. Compute Depth and Height lazily, and always in terms of edge latency values. For the schedulers that don't care about latency, edge latencies are set to 1. Eliminate Cycle and CycleBound, and LatencyPriorityQueue's Latencies array. These are all subsumed by the Depth and Height fields. llvm-svn: 61073	2008-12-16 03:25:46 +00:00
Dan Gohman	67e694b0ea	Add a simple target-independent heuristic to allow targets with no instruction itinerary data to back-schedule loads. llvm-svn: 61070	2008-12-16 02:38:22 +00:00
Dan Gohman	8ddcdef08a	Move addPred and removePred out-of-line. llvm-svn: 61067	2008-12-16 01:05:52 +00:00
Dan Gohman	23aae3bba9	Make addPred and removePred return void, since the return value is not currently used by anything. llvm-svn: 61066	2008-12-16 01:00:55 +00:00
Dan Gohman	d6ad3f6178	This getEdgeAttributes doesn't need a template argument. llvm-svn: 61065	2008-12-16 00:55:00 +00:00
Chris Lattner	9255745f90	enhance heap-sra to apply to fixed sized array allocations, not just variable sized array allocations. llvm-svn: 61051	2008-12-15 21:44:34 +00:00
Mon P Wang	bb3c2994f0	Added support for splitting and scalarizing vector shifts. llvm-svn: 61050	2008-12-15 21:44:00 +00:00
Chris Lattner	2356082b5e	Use stripPointerCasts. llvm-svn: 61047	2008-12-15 21:20:32 +00:00
Chris Lattner	15ac84e027	minor tweaks for formatting, allow bitcast in ValueIsOnlyUsedLocallyOrStoredToOneGlobal. llvm-svn: 61046	2008-12-15 21:08:54 +00:00
Chris Lattner	592852605f	refactor some code into a new TryToOptimizeStoreOfMallocToGlobal function. Use GetElementPtrInst::hasAllZeroIndices where possible. llvm-svn: 61045	2008-12-15 21:02:25 +00:00
Chris Lattner	0e79aa6595	Teach basicaa to use the nocapture attribute when possible. When the intrinsics are properly marked nocapture, the fixme should be addressed. llvm-svn: 61040	2008-12-15 18:59:22 +00:00
Dan Gohman	f3c46b3496	Fix printing of PseudoSourceValues in SDNode graphs. llvm-svn: 61036	2008-12-15 17:28:10 +00:00
Chris Lattner	f678691da6	add some more notes. llvm-svn: 61033	2008-12-15 08:32:28 +00:00
Chris Lattner	8119a1f70d	Add a testcase for GCC PR 23455, which lpre handles now. Add some comments about why we're not getting other cases. llvm-svn: 61032	2008-12-15 07:49:24 +00:00
Nick Lewycky	212b42c4c0	Update generated files after nocapture syntax change. llvm-svn: 61031	2008-12-15 07:31:07 +00:00
Nick Lewycky	504288e7af	It turns out that "align 1" and unaligned are different. Add a bias to the alignment attribute such that 0 means unaligned. This will probably require a rebuild of llvm-gcc because of the change to Attributes.h. If you see many test failures on "make check", please rebuild your llvm-gcc. llvm-svn: 61030	2008-12-15 07:29:55 +00:00
Mon P Wang	2f96113348	Added support to LegalizeType for expanding the operands of scalar to vector and insert vector element. Modified extract vector element to extend the result to match the expected promoted type. llvm-svn: 61029	2008-12-15 06:57:02 +00:00
Chris Lattner	30c1871282	gvn now hoists this load out of the hot non-call path. llvm-svn: 61028	2008-12-15 06:34:48 +00:00
Chris Lattner	b467a5b4a5	Enable Load PRE. This teaches GVN to push partially redundant loads up the CFG when there is exactly one predecessor where the load is not available. This is designed to not increase code size but still eliminate partially redundant loads. This fires 1765 times on 403.gcc even though it doesn't do critical edge splitting yet (the most common reason for it to fail). llvm-svn: 61027	2008-12-15 05:28:29 +00:00
Chris Lattner	be89ad1615	if we have a phi translation failure of the start block, return just a clobber of the start block, not other random stuff as well. llvm-svn: 61026	2008-12-15 04:58:29 +00:00
Owen Anderson	90af4c9640	Ifdef out some code that I didn't mean to enable by default yet. llvm-svn: 61024	2008-12-15 03:52:17 +00:00
Chris Lattner	22cfa14eed	make GVN try to rename inputs to the resultant replaced values, which cleans up the generated code a bit. This should have the added benefit of not randomly renaming functions/globals like my previous patch did. :) llvm-svn: 61023	2008-12-15 03:46:38 +00:00
Chris Lattner	c92b131639	Implement initial support for PHI translation in memdep. This means that memdep keeps track of how PHIs affect the pointer in dep queries, which allows it to eliminate the load in cases like rle-phi-translate.ll, which basically end up being: BB1: X = load P br BB3 BB2: Y = load Q br BB3 BB3: R = phi [P] [Q] load R turning "load R" into a phi of X/Y. In addition to additional exposed opportunities, this makes memdep safe in many cases that it wasn't before (which is required for load PRE) and also makes it substantially more efficient. For example, consider: bb1: // has many predecessors. P = some_operator() load P In this example, previously memdep would scan all the predecessors of BB1 to see if they had something that would mustalias P. In some cases (e.g. test/Transforms/GVN/rle-must-alias.ll) it would actually find them and end up eliminating something. In many other cases though, it would scan and not find anything useful. MemDep now stops at a block if the pointer is defined in that block and cannot be phi translated to predecessors. This causes it to miss the (rare) cases like rle-must-alias.ll, but makes it faster by not scanning tons of stuff that is unlikely to be useful. For example, this speeds up GVN as a whole from 3.928s to 2.448s (60%)!. IMO, scalar GVN should be enhanced to simplify the rle-must-alias pointer base anyway, which would allow the loads to be eliminated. In the future, this should be enhanced to phi translate through geps and bitcasts as well (as indicated by FIXMEs) making memdep even more powerful. llvm-svn: 61022	2008-12-15 03:35:32 +00:00
Owen Anderson	c2d2c0bdf3	Add support for slow-path GVN with full phi construction for scalars. This is disabled for now, as it actually pessimizes code in the absence of phi translation for load elimination. This slows down GVN a bit, by about 2% on 403.gcc. llvm-svn: 61021	2008-12-15 02:03:00 +00:00
Nick Lewycky	120e01b631	Fix whitespace in comment. Remove TODO; icmp isn't a binary operator, so this function will never deal with them. llvm-svn: 61020	2008-12-15 01:35:36 +00:00
Nick Lewycky	8bdae4db80	Introducing nocapture, a parameter attribute for pointers to indicate that the callee will not introduce any new aliases of that pointer. The attributes had all bits allocated already, so I decided to collapse alignment. Alignment was previously stored as a 16-bit integer from bits 16 to 32 of the attribute, but it was required to be a power of 2. Now it's stored in log2 encoded form in five bits from 16 to 21. That gives us 11 more bits of space. You may have already noticed that you only need four bits to encode a 16-bit power of two, so why five bits? Because the AsmParser accepted 32-bit alignments, even though we couldn't store them (they were silently discarded). Now we can store them in memory, but not in the bitcode. The bitcode format was already storing these as 64-bit VBR integers. So, the bitcode format stays the same, keeping the alignment values stored as 16 bit raw values. There's some hideous code in the reader and writer that deals with this, waiting to be ripped out the moment we run out of bits again and have to replace the parameter attributes table encoding. llvm-svn: 61019	2008-12-15 01:34:58 +00:00
Chris Lattner	10a0fb1e83	silence warning when asserts disabled. llvm-svn: 61014	2008-12-14 21:38:24 +00:00
Chris Lattner	05dda70cd4	silence warning when asserts disabled. llvm-svn: 61013	2008-12-14 21:37:33 +00:00
Chris Lattner	9458712db4	eliminate warning when asserts disabled. llvm-svn: 61012	2008-12-14 21:36:23 +00:00
Owen Anderson	47efff5b14	Generalize GVN's phi construction routine to work for things other than loads. llvm-svn: 61009	2008-12-14 19:10:35 +00:00
Duncan Sands	ef671b5627	Reapply r60997, this time without forgetting that target constants are allowed to have an illegal type. llvm-svn: 61006	2008-12-14 09:43:15 +00:00
Bill Wendling	380fbdc9f8	Temporarily revert r60997. It was causing this failure: Running /Users/void/llvm/llvm.src/test/CodeGen/Generic/dg.exp ... FAIL: /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll Failed with exit(1) at line 1 while running: llvm-as < /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll | llc | /usr/bin/grep 68719476738 Assertion failed: ((TypesNeedLegalizing || getTypeAction(VT) == Legal) && "Illegal type introduced after type legalization?"), function HandleOp, file /Users/void/llvm/llvm.src/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp, line 493. 0 llc 0x0085392e char const* std::find<char const, char>(char const, char const, char const&) + 98 1 llc 0x00853e63 llvm::sys::PrintStackTraceOnErrorSignal() + 593 2 libSystem.B.dylib 0x96cac09b _sigtramp + 43 3 libSystem.B.dylib 0xffffffff _sigtramp + 1765097359 4 libSystem.B.dylib 0x96d24ec2 raise + 26 5 libSystem.B.dylib 0x96d3447f abort + 73 6 libSystem.B.dylib 0x96d26063 __assert_rtn + 101 7 llc 0x004f9018 llvm::cast_retty<llvm::SubprogramDesc, llvm::DebugInfoDesc>::ret_type llvm::cast<llvm::Sub ... llvm-svn: 61001	2008-12-13 23:53:00 +00:00
Duncan Sands	7cddec2a2f	LegalizeDAG is not supposed to introduce illegal types into the DAG if they were not already there. Check this with an assertion. llvm-svn: 60997	2008-12-13 22:33:38 +00:00
Chris Lattner	0be74c4208	These messages should always be emitted when NDEBUG is unset, not when NDEBUG is unset and -debug is passed. llvm-svn: 60986	2008-12-13 18:37:58 +00:00
Bill Wendling	34182ae3ae	Temporarily revert r60973. It's inexplicably causing a failure when self-hosting LLVM: llvm[2]: Linking Release executable opt (without symbols) ... Undefined symbols: "llvm::APFloat::IEEEsingle", referenced from: __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) "llvm::APFloat::IEEEdouble", referenced from: __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) ld: symbol(s) not found This is in release mode. To replicate, compile llvm and llvm-gcc in optimized mode. Then build llvm, in optimized mode, with the newly created compiler. llvm-svn: 60977	2008-12-13 09:28:44 +00:00
Torok Edwin	8898288749	Fix getFieldAs() to use the parameter instead of 6. Add missing DIType constructor, needed by DIVariable::getType(). llvm-svn: 60976	2008-12-13 08:25:29 +00:00
Mon P Wang	2880dc8e8c	Remove assertion to allow promotion of a truncating store operand llvm-svn: 60975	2008-12-13 08:16:43 +00:00
Mon P Wang	da91e0e191	Added basic support for expanding VSETCC llvm-svn: 60974	2008-12-13 08:15:14 +00:00
Chris Lattner	8753175cd6	make RLE preserve the name of the load that it replaces. This is just a prettification of the IR. llvm-svn: 60973	2008-12-13 07:22:47 +00:00
Duncan Sands	1faa6258eb	On big-endian machines it is wrong to do a full width register load followed by a truncating store for the copy, since the load will not place the value in the lower bits. Probably partial loads/stores can never happen here, but fix it anyway. llvm-svn: 60972	2008-12-13 07:18:38 +00:00
Misha Brukman	5e6eec9337	Fix spelling. llvm-svn: 60971	2008-12-13 05:21:37 +00:00
Devang Patel	5b7938b1cc	Do not print empty DW_AT_comp_dir. llvm-svn: 60965	2008-12-12 21:57:54 +00:00
Duncan Sands	ddce2cb415	When expanding unaligned loads and stores do not make use of illegal integer types: instead, use a stack slot and copying via integer registers. The existing code is still used if the bitconvert is to a legal integer type. This fires on the PPC testcases 2007-09-08-unaligned.ll and vec_misaligned.ll. It looks like equivalent code is generated with these changes, just permuted, but it's hard to tell. With these changes, nothing in LegalizeDAG produces illegal integer types anymore. This is a prerequisite for removing the LegalizeDAG type legalization code. While there I noticed that the existing code doesn't handle trunc store of f64 to f32: it turns this into an i64 store, which represents a 4 byte stack smash. I added a FIXME about this. Hopefully someone more motivated than I am will take care of it. llvm-svn: 60964	2008-12-12 21:47:02 +00:00
Bill Wendling	13e4a3d0b0	- Use patterns instead of creating completely new instruction matching patterns, which are identical to the original patterns. - Change the multiply with overflow so that we distinguish between signed and unsigned multiplication. Currently, unsigned multiplication with overflow isn't working! llvm-svn: 60963	2008-12-12 21:15:41 +00:00
Evan Cheng	56d9fc70bd	Fix add/sub expansion: don't create ADD / SUB with two results (seems like everyone is doing this these days :-). Patch by Daniel M Gessel! llvm-svn: 60958	2008-12-12 18:49:09 +00:00
Nick Lewycky	51228d6707	Revert my re-instated reverted commit, fixes the bootstrap build on x86-64 linux. llvm-svn: 60951	2008-12-12 17:09:07 +00:00
Duncan Sands	06ecf57a87	When using a 4 byte jump table on a 64 bit machine, do an extending load of the 4 bytes rather than a potentially illegal (type) i32 load followed by a sign extend. llvm-svn: 60945	2008-12-12 08:13:38 +00:00
Duncan Sands	9f8a7550b6	Don't make use of an illegal type (i64) when lowering f64 function arguments. llvm-svn: 60944	2008-12-12 08:05:40 +00:00
Mon P Wang	53d0c96c6f	Added support for SELECT v8i8 v4i16 for X86 (MMX) Added support for TRUNC v8i16 to v8i8 for X86 (MMX) llvm-svn: 60916	2008-12-12 01:25:51 +00:00
Bill Wendling	5d026e47c1	Redo the arithmetic with overflow architecture. I was changing the semantics of ISD::ADD to emit an implicit EFLAGS. This was horribly broken. Instead, replace the intrinsic with an ISD::SADDO node. Then custom lower that into an X86ISD::ADD node with an associated SETCC that checks the correct condition code (overflow or carry). Then that gets lowered into the correct X86::ADDOvf instruction. Similar for SUB and MUL instructions. llvm-svn: 60915	2008-12-12 00:56:36 +00:00
Evan Cheng	dfa19a4009	Fix an 80 col. violation. llvm-svn: 60901	2008-12-11 22:02:02 +00:00
Nick Lewycky	312d95be37	Sneaky, sneaky: move the -1 to the outside of the SMax. Reinstate the optimization of SGE/SLE with unit stride, now that it works properly. llvm-svn: 60881	2008-12-11 17:40:14 +00:00
Torok Edwin	9d454874f3	fix grammar, thanks Duncan! llvm-svn: 60875	2008-12-11 11:44:49 +00:00
Torok Edwin	34056e3cc9	introduce BasicBlock::getUniquePredecessor() llvm-svn: 60872	2008-12-11 10:36:07 +00:00
Mon P Wang	f578029326	Avoid generating a convert_rndsat node when the src and dest type are the same. llvm-svn: 60869	2008-12-11 03:30:13 +00:00
Bill Wendling	060f17c854	Clarify FIXME. llvm-svn: 60867	2008-12-11 01:26:44 +00:00
Mon P Wang	80cfaeecfe	Whitespace clean up (tabs with spaces) llvm-svn: 60866	2008-12-11 00:44:22 +00:00
Mon P Wang	4448877ed7	Make fix for r60829 less conservative to allow the proper optimization for vec_extract-sse4.ll. llvm-svn: 60865	2008-12-11 00:26:16 +00:00
Bill Wendling	02555039a0	Add a newline after this debug output. llvm-svn: 60861	2008-12-10 23:24:43 +00:00
Bill Wendling	292263313b	If ADD, SUB, or MUL have an overflow bit that's used, don't do transformation on them. The DAG combiner expects that nodes that are transformed have one value result. llvm-svn: 60857	2008-12-10 22:36:00 +00:00
Evan Cheng	fc73640f83	Preliminary ARM debug support based on patch by Mikael of FlexyCore. llvm-svn: 60851	2008-12-10 21:54:21 +00:00
Evan Cheng	487c9ff802	Some code clean up. llvm-svn: 60850	2008-12-10 21:49:05 +00:00
Bill Wendling	417d88be16	Only perform SETO/SETC to JO/JC conversion if extractvalue is coming from an arithmetic with overflow instruction. llvm-svn: 60844	2008-12-10 19:44:24 +00:00
Duncan Sands	81499a8e1c	For amusement, implement SADDO, SSUBO, UADDO, USUBO for promoted integer types, eg: i16 on ppc-32, or i24 on any platform. Complete support for arbitrary precision integers would require handling expanded integer types, eg: i128, but I couldn't be bothered. llvm-svn: 60834	2008-12-10 12:30:42 +00:00
Duncan Sands	ecb1273c5b	Don't dereference the end() iterator. This was causing a bunch of failures when running "make ENABLE_EXPENSIVE_CHECKS=1 check". llvm-svn: 60832	2008-12-10 09:38:36 +00:00
Mon P Wang	308879dcfc	Fixed a bug when trying to optimize an extract vector element of a bit convert that changes the number of elements of a shuffle. llvm-svn: 60829	2008-12-10 03:59:02 +00:00
Evan Cheng	caa31a82fc	Fix MachineCodeEmitter to use uintptr_t instead of intptr_t. This avoids some overflow issues. Patch by Thomas Jablin. llvm-svn: 60828	2008-12-10 02:32:19 +00:00
Bill Wendling	d33b6dfd4f	Whitespace changes. llvm-svn: 60826	2008-12-10 02:01:32 +00:00
Evan Cheng	1264f4bc9c	Fix a bug introduced by r59265. If lazy compilation is disabled, return actual function ptr instead of ptr to stub if function is already compiled. llvm-svn: 60822	2008-12-10 01:33:59 +00:00
Chris Lattner	3987712b2d	move an entry, add some notes, remove a completed item (IMPLICIT_DEF) llvm-svn: 60821	2008-12-10 01:30:48 +00:00
Chris Lattner	e2b5854e41	Allow basicaa to walk through geps with identical indices in parallel, allowing it to decide that P/Q must alias if A/B must alias in things like: P = gep A, 0, i, 1 Q = gep B, 0, i, 1 This allows GVN to delete 62 more instructions out of 403.gcc. llvm-svn: 60820	2008-12-10 01:04:47 +00:00
Bill Wendling	a3b718a3c9	Whitespace fixes. llvm-svn: 60818	2008-12-10 00:28:22 +00:00
Dan Gohman	1967880025	Update CalcLatency to work in terms of edge latencies, rather than node latencies. Use CalcLatency instead of manual code in CalculatePriorities to keep it consistent. Previously it computed slightly different results. llvm-svn: 60817	2008-12-10 00:24:36 +00:00
Evan Cheng	9419dfe08a	Fix a couple of Dwarf bugs. - Emit DW_AT_byte_size for struct and union of size zero. - Emit DW_AT_declaration for forward type declaration. llvm-svn: 60812	2008-12-10 00:15:44 +00:00
Scott Michel	0b5c67e1e0	CellSPU: - Fix bug 3185, with misc other cleanups. - Needed to implement SPUInstrInfo::InsertBranch(). CAUTION: Not sure what gets or needs to get passed to InsertBranch() to insert a conditional branch. This will abort for now until a good test case shows up. llvm-svn: 60811	2008-12-10 00:15:19 +00:00
Bill Wendling	1c1dacdd42	Implement fast-isel conversion of a branch instruction that's branching on an overflow/carry from the "arithmetic with overflow" intrinsics. It searches the machine basic block from bottom to top to find the SETO/SETC instruction that is its conditional. If an instruction modifies EFLAGS before it reaches the SETO/SETC instruction, then it defaults to the normal instruction emission. llvm-svn: 60807	2008-12-09 23:19:12 +00:00
Dan Gohman	036cc300ad	Rewrite the SDep class, and simplify some of the related code. The Cost field is removed. It was only being used in a very limited way, to indicate when the scheduler should attempt to protect a live register, and it isn't really needed to do that. If we ever want the scheduler to start inserting copies in non-prohibitive situations, we'll have to rethink some things anyway. A Latency field is added. Instead of giving each node a single fixed latency, each edge can have its own latency. This will eventually be used to model various micro-architecture properties more accurately. The PointerIntPair class and an internal union are now used, which reduce the overall size. llvm-svn: 60806	2008-12-09 22:54:47 +00:00
Chris Lattner	2550938060	loosen up an assertion that isn't valid when called from invalidateCachedPointerInfo. Thanks to Bill for sending me a testcase. llvm-svn: 60805	2008-12-09 22:45:32 +00:00
Dan Gohman	a52e83a6ca	Minor code simplification. llvm-svn: 60804	2008-12-09 22:45:08 +00:00
Bill Wendling	4c8fb3a0cc	Add sub/mul overflow intrinsics. This currently doesn't have a target-independent way of determining overflow on multiplication. It's very tricky. Patch by Zoltan Varga! llvm-svn: 60800	2008-12-09 22:08:41 +00:00
Chris Lattner	23369496bc	Teach GVN to invalidate some memdep information when it does an RAUW of a pointer. This allows it to catch more equivalencies. For example, the type_lists_compatible_p function used to require two iterations of the gvn pass (!) to delete its 18 redundant loads because the first pass would CSE all the addressing computation cruft, which would unblock the second memdep/gvn passes from recognizing them. This change allows memdep/gvn to catch all 18 when run just once on the function (as is typical :) instead of just 3. On all of 403.gcc, this bumps up the # redundancies found from: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted to: 63 gvn - Number of instructions PRE'd 154137 gvn - Number of instructions deleted 50185 gvn - Number of loads deleted +116 loads deleted isn't bad. llvm-svn: 60799	2008-12-09 22:06:23 +00:00
Duncan Sands	88a2901801	Fix PR3117: not all nodes being legalized. The essential problem was that the DAG can contain random unused nodes which were never analyzed. When remapping a value of a node being processed, such a node may become used and need to be analyzed; however due to operands being transformed during analysis the node may morph into a different one. Users of the morphing node need to be updated, and this wasn't happening. While there I added a bunch of documentation and sanity checks, so I (or some other poor soul) won't have to scratch their head over this stuff so long trying to remember how it was all supposed to work next time some obscure problem pops up! The extra sanity checking exposed a few places where invariants weren't being preserved, so those are fixed too. Since some of the sanity checking is expensive, I added a flag to turn it on. It is also turned on when building with ENABLE_EXPENSIVE_CHECKS=1. llvm-svn: 60797	2008-12-09 21:33:20 +00:00
Chris Lattner	6a5e9eaa36	Teach BasicAA::getModRefInfo(CallSite, CallSite) some tricks based on readnone/readonly functions. Teach memdep to look past readonly calls when analyzing deps for a readonly call. This allows elimination of a few more calls from 403.gcc: before: 63 gvn - Number of instructions PRE'd 153986 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted after: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted 5 calls isn't much, but this adds plumbing for the next change. llvm-svn: 60794	2008-12-09 21:19:42 +00:00
Chris Lattner	e0f69b32e5	Fix a fixme: allow memdep to see past read-only calls when doing load dependence queries. This allows GVN to eliminate a few more instructions on 403.gcc: 152598 gvn - Number of instructions deleted 49240 gvn - Number of loads deleted after: 153986 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted llvm-svn: 60786	2008-12-09 19:47:40 +00:00
Chris Lattner	0dbe02a270	rename getNonLocalDependency -> getNonLocalCallDependency, and remove pointer stuff from it, simplifying the code a bit. llvm-svn: 60783	2008-12-09 19:38:05 +00:00
Chris Lattner	3768aeb290	Switch GVN::processNonLocalLoad to using the new MemDep::getNonLocalPointerDependency method. There are some open issues with this (missed optimizations) and plenty of future work, but this does allow GVN to eliminate slightly more loads (49246 vs 49033). Switching over now allows simplification of the other code path in memdep. llvm-svn: 60780	2008-12-09 19:25:07 +00:00
Chris Lattner	b0aa96c25d	random cleanups, no functionality change. llvm-svn: 60779	2008-12-09 19:21:47 +00:00
Evan Cheng	b81948d88b	Back out 60748 for now. It's breaking SPASS, 254.gap, and 464.h264ref. llvm-svn: 60776	2008-12-09 18:21:49 +00:00
Evan Cheng	92299e0722	Cosmetic changes. llvm-svn: 60771	2008-12-09 17:56:30 +00:00
Nuno Lopes	360aae23b1	remove unused var llvm-svn: 60770	2008-12-09 17:04:06 +00:00
Dan Gohman	275e6da839	Whitespace cleanups. llvm-svn: 60769	2008-12-09 16:37:48 +00:00
Duncan Sands	392ed50974	Handle a compiler warning. llvm-svn: 60755	2008-12-09 09:58:11 +00:00
Chris Lattner	ff8be5564e	fix typos gabor noticed llvm-svn: 60754	2008-12-09 08:38:36 +00:00
Bill Wendling	d034543c7e	Correct my English. llvm-svn: 60753	2008-12-09 07:55:31 +00:00
Chris Lattner	b7b4a2dbac	restructure the top level non-local ptr dep query to handle the first block of a query specially. This makes the "complete query caching" subsystem more effective, avoiding predecessor queries. This speeds up GVN another 4%. llvm-svn: 60752	2008-12-09 07:52:59 +00:00
Chris Lattner	f61cc24ef3	rename getNonLocalPointerDepInternal -> getNonLocalPointerDepFromBB and split its inner loop out into a new GetNonLocalInfoForBlock function. No functionality change. llvm-svn: 60751	2008-12-09 07:47:11 +00:00
Evan Cheng	3130873f30	DisableGVCompilation should not abort on internal GlobalValue's. llvm-svn: 60750	2008-12-09 07:31:49 +00:00
Mon P Wang	85d72dabc5	In LegalizeOp, don't change the result type of CONVERT_RNDSAT when promoting one of its operand. llvm-svn: 60749	2008-12-09 07:27:39 +00:00
Nick Lewycky	41060b1556	It's easy to handle SLE/SGE when the loop has a unit stride. llvm-svn: 60748	2008-12-09 07:25:04 +00:00
Chris Lattner	7358482dbc	if we have two elements, insert both, don't use std::sort. This speeds up the new GVN by another 3% llvm-svn: 60747	2008-12-09 07:05:45 +00:00
Chris Lattner	e428927eda	If we're only adding one new element to 'Cache', insert it into its known position instead of using a full sort. This speeds up GVN by ~4% with the new memdep stuff. llvm-svn: 60746	2008-12-09 06:58:04 +00:00
Chris Lattner	951d1310c9	convert a couple other places that use pred_iterator to use the caching pred iterator. llvm-svn: 60745	2008-12-09 06:44:17 +00:00
Chris Lattner	d7df2dce20	use the new pred cache to speed up the new non-local memdep queries. This speeds up GVN using the new queries (not yet checked in) by just over 10%. llvm-svn: 60743	2008-12-09 06:28:49 +00:00
Mon P Wang	0c011f8ba9	Fix getNode to allow a vector for the shift amount for shifts of vectors. Fix the shift amount when unrolling a vector shift into scalar shifts. Fix problem in getShuffleScalarElt where it assumes that the input of a bit convert must be a vector. llvm-svn: 60740	2008-12-09 05:46:39 +00:00
Chris Lattner	e32dbaddd2	Fix a really subtle off-by-one bug that Duncan noticed with valgrind on test/CodeGen/Generic/2007-06-06-CriticalEdgeLandingPad. llvm-svn: 60739	2008-12-09 04:47:21 +00:00
Scott Michel	cf7ec43939	CellSPU: - Change default scheduling preference to list-burr, which produces somewhat better code than the default. Could also use list-tdrr, but need to ask dev list about the appropriate handy mnemonic before committing. llvm-svn: 60738	2008-12-09 03:37:19 +00:00
Bill Wendling	7250a29def	Add initial support for fast-isel of the [SU]ADDO intrinsics. It isn't complete. For instance, it lowers the common case into this less-than-optimal code: addl %ecx, %eax seto %cl testb %cl, %cl jne LBB1_2 ## overflow instead of: addl %ecx, %eax jo LBB1_2 ## overflow That will come in a future commit. llvm-svn: 60737	2008-12-09 02:42:50 +00:00
Dan Gohman	e99b76aa24	Don't charge full latency for an anti-dependence, in this simplistic pipeline model. llvm-svn: 60733	2008-12-09 00:26:46 +00:00
Dan Gohman	9e5cc22129	Fix a couple of mistaken switch case fall-throughs. Thanks to Bill for spotting these! llvm-svn: 60728	2008-12-08 23:50:06 +00:00
Chris Lattner	23e2ac8894	remove DebugIterations option. Despite the accusations, jump threading has been shown to only expose problems not have bugs itself. I'm sure it's completely bug free! ;-) llvm-svn: 60725	2008-12-08 22:44:07 +00:00
Evan Cheng	3bb2ad8a0a	Re-apply 60689 now my head is screwed on right. llvm-svn: 60711	2008-12-08 19:29:03 +00:00
Dan Gohman	6f3258586e	Fix the top-level comments, and fix some 80-column violations. llvm-svn: 60707	2008-12-08 17:50:35 +00:00
Dan Gohman	5bca97fc4f	Revert 60689. It caused many regressions on Darwin targets. llvm-svn: 60705	2008-12-08 17:38:02 +00:00
Devang Patel	a8d0117253	Fix spelling. Thanks Duncan! llvm-svn: 60702	2008-12-08 17:07:24 +00:00
Devang Patel	82fb6bc606	Undo previous patch. llvm-svn: 60701	2008-12-08 17:02:37 +00:00
Duncan Sands	982c6ed1d9	Fix comment typo. llvm-svn: 60697	2008-12-08 14:01:59 +00:00
Dan Gohman	14d4094968	Factor out the code for sign-extending/truncating gep indices and use it in x86 address mode folding. Also, make getRegForValue return 0 for illegal types even if it has a ValueMap for them, because Argument values are put in the ValueMap. This fixes PR3181. llvm-svn: 60696	2008-12-08 07:57:47 +00:00
Chris Lattner	7307ef0ba3	add another level of caching for non-local pointer queries, keeping track of whether the CachedNonLocalPointerInfo for a block is specific to a block. If so, just return it without any pred scanning. This is good for a 6% speedup on GVN (when it uses this lookup method, which it doesn't right now). llvm-svn: 60695	2008-12-08 07:31:50 +00:00
Chris Lattner	9020891916	consistency llvm-svn: 60694	2008-12-08 07:21:39 +00:00
Chris Lattner	6ab4673c40	introduce a new RoundUpAlignment helper function, use it to remove some more 64-bit divs and rems from the StructLayout ctor. llvm-svn: 60692	2008-12-08 07:11:56 +00:00
Dan Gohman	f187690987	Make ConstantAggregateZero::get return a ConstantAggregateZero*, as suggested in PR3182. llvm-svn: 60691	2008-12-08 07:10:54 +00:00
Dan Gohman	84876ffe23	Update CPP backend for PrintModulePass API changes. llvm-svn: 60690	2008-12-08 07:07:24 +00:00
Evan Cheng	d668dd83c0	Perform cheap checks first. llvm-svn: 60689	2008-12-08 06:52:43 +00:00
Chris Lattner	494eb47570	Some minor optimizations for isObjectSmallerThan. llvm-svn: 60687	2008-12-08 06:28:54 +00:00
Dan Gohman	e4b1a93573	Move createVirtualRegister out-of-line. llvm-svn: 60684	2008-12-08 04:54:11 +00:00
Dan Gohman	7954facae5	Clarify some comments. llvm-svn: 60683	2008-12-08 04:53:23 +00:00
Nick Lewycky	69eb224a5e	Fixes for Visual Studio users. Patch by OvermindDL1 on llvm-dev! llvm-svn: 60679	2008-12-08 00:45:02 +00:00
Chris Lattner	1ff38d6629	add an assert. the cast<> below would catch this but a message is more useful. llvm-svn: 60674	2008-12-07 18:45:15 +00:00
Chris Lattner	8cd875dac6	factor some code better. llvm-svn: 60673	2008-12-07 18:42:51 +00:00
Chris Lattner	9ebcc276e4	factor some code, fixing some fixme's. llvm-svn: 60672	2008-12-07 18:39:13 +00:00
Chris Lattner	16ea827dfd	add support for caching pointer dependence queries. Nothing uses this yet so it "can't" break anything. That said, it does appear to work. llvm-svn: 60654	2008-12-07 08:50:20 +00:00
Chris Lattner	a79a341f1e	fix a bug I introduced in simplifycfg handling single entry phi nodes. FoldSingleEntryPHINodes deletes the PHI, so there is no need to delete it afterward. llvm-svn: 60653	2008-12-07 07:22:45 +00:00
Owen Anderson	a5f2ce1ee3	Switch to top-down mode and fix a crasher this exposed caused by an error in the live interval updating. llvm-svn: 60652	2008-12-07 05:33:18 +00:00
Nick Lewycky	e277f75880	Fix typo, psuedo -> pseudo. llvm-svn: 60651	2008-12-07 03:49:52 +00:00
Chris Lattner	35095d7722	Some internal refactoring to make it easier to cache results. llvm-svn: 60650	2008-12-07 02:56:57 +00:00
Chris Lattner	ad82419b46	Introduce a new MemDep::getNonLocalPointerDependency method. This will eventually take over load/store dep queries from getNonLocalDependency. For now it works fine, but is incredibly slow because it does no caching. Let's not switch GVN to use it until that is fixed :) llvm-svn: 60649	2008-12-07 02:15:47 +00:00
Chris Lattner	8e8a6b4ec3	push the "pointer case" up the analysis stack a bit. This causes duplication of logic (in 2 places) to determine what pointer a load/store touches. This will be addressed in a future commit. llvm-svn: 60648	2008-12-07 01:50:16 +00:00
Chris Lattner	d14e6778c6	make clients have to know how to call getCallSiteDependencyFrom instead of making getDependencyFrom do it. llvm-svn: 60647	2008-12-07 01:21:14 +00:00
Chris Lattner	ddfcaff37c	rename some variables for consistency llvm-svn: 60644	2008-12-07 00:39:19 +00:00
Chris Lattner	e4c5f66b3b	I love how using out of scope variables is not an error with GCC, no really I do. llvm-svn: 60643	2008-12-07 00:38:27 +00:00
Chris Lattner	20b7d9667d	Rename getCallSiteDependency -> getCallSiteDependencyFrom to emphasize the scanning and make it more similar to getDependencyFrom llvm-svn: 60642	2008-12-07 00:35:51 +00:00
Chris Lattner	dc8cf1fa91	a memdep query on a volatile load/store will always return clobber with the current implementation. Instead of returning a "precise clobber" just return a fuzzy one. This doesn't matter to any clients anyway and should speed up analysis time very very slightly. llvm-svn: 60641	2008-12-07 00:28:02 +00:00
Chris Lattner	135a48d48b	don't bother touching volatile stores, they will just return clobber on everything interesting anyway. llvm-svn: 60640	2008-12-07 00:25:15 +00:00
Chris Lattner	a04521164c	remove the ability to get memdep info for vaarg. I don't think the original impl was correct and no one actually makes the query anyway. llvm-svn: 60639	2008-12-07 00:21:18 +00:00
Chris Lattner	bd507e3e4d	improve a note. llvm-svn: 60636	2008-12-07 00:15:10 +00:00
Chris Lattner	1fa53e3e56	some more PRE/GVN/DSE related notes. llvm-svn: 60633	2008-12-06 22:52:12 +00:00
Chris Lattner	00104cf8f8	add a note llvm-svn: 60632	2008-12-06 22:49:05 +00:00
Chris Lattner	a87ff83a83	some random notes. llvm-svn: 60624	2008-12-06 19:28:22 +00:00
Nick Lewycky	d33c83b1af	Minor cleanup. Use dyn_cast, not isa/cast pairs. No functionality change. llvm-svn: 60623	2008-12-06 17:57:05 +00:00
Evan Cheng	5c92d425a9	Clean up some ARM GV asm printing out; minor fixes to match what gcc does. llvm-svn: 60621	2008-12-06 02:00:55 +00:00
Chris Lattner	022b15083b	Reimplement the inner loop of DSE. It now uniformly uses getDependence(), doesn't do its own local caching, and is slightly more aggressive about free/store dse (see testcase). This eliminates the last external client of MemDep::getDependenceFrom(). llvm-svn: 60619	2008-12-06 00:53:22 +00:00
Dan Gohman	e2ee41d1d1	Don't use plain %x to print pointer values. I had changed it from %p since %p isn't formatted consistently, but obviously plain %x is wrong. PRIxPTR with a cast to uintptr_t would work here, but that requires inconvenient build-system changes. %lu works on all current and foreseeable future hosts. llvm-svn: 60616	2008-12-05 23:39:24 +00:00
Dale Johannesen	c6404f98b2	Forgot a file. llvm-svn: 60609	2008-12-05 21:55:35 +00:00
Dale Johannesen	f5a072c388	Make LoopStrengthReduce smarter about hoisting things out of loops when they can be subsumed into addressing modes. Change X86 addressing mode check to realize that some PIC references need an extra register. (I believe this is correct for Linux, if not, I'm sure someone will tell me.) llvm-svn: 60608	2008-12-05 21:47:27 +00:00
Chris Lattner	2b5e1b5263	Make a few major changes to memdep and its clients: 1. Merge the 'None' result into 'Normal', making loads and stores return their dependencies on allocations as Normal. 2. Split the 'Normal' result into 'Clobber' and 'Def' to distinguish between the cases when memdep knows the value is produced from when we just know it may be changed. 3. Move some of the logic for determining whether readonly calls are CSEs into memdep instead of it being in GVN. This still leaves verification that the arguments are the same to GVN to let it know about value equivalences in different contexts. 4. Change memdep's call/call dependency analysis to use getModRefInfo(CallSite,CallSite) instead of doing something very weak. This only really matters for things like DSA, but someday maybe we'll have some other decent context sensitive analyses :) 5. This reimplements the guts of memdep to handle the new results. 6. This simplifies GVN significantly: a) readonly call CSE is slightly simpler b) I eliminated the "getDependencyFrom" chaining for load elimination and load CSE doesn't have to worry about volatile (they are always clobbers) anymore. c) GVN no longer does any 'lastLoad' caching, leaving it to memdep. 7. The logic in DSE is simplified a bit and sped up. A potentially unsafe case was eliminated. llvm-svn: 60607	2008-12-05 21:04:20 +00:00
Dan Gohman	5e30c5b83b	Demangle and pretty-print symbols in internal backtraces. Patch by Wesley Peck, with a few fixes by me. llvm-svn: 60605	2008-12-05 20:12:48 +00:00
Anton Korobeynikov	30085a6f51	Revert invalid r60393. It causes llvm-gcc bootstrap fails in release builds. See PR3160 for details llvm-svn: 60604	2008-12-05 19:38:49 +00:00
Chris Lattner	08ad59d631	Make it illegal to call getDependency* on non-memory instructions like binary operators. llvm-svn: 60600	2008-12-05 18:46:19 +00:00
Evan Cheng	03ef7cf749	Reason #3 from 60595 doesn't hold true. If we can fold a PIC load from constpool into a use, the rewrite happens at time of spill (not in VirtRegMap). Later on, if the GlobalBaseReg is spilled, the spiller can see the use uses GlobalBaseReg and do the right thing. llvm-svn: 60596	2008-12-05 17:41:31 +00:00
Evan Cheng	144447bfa0	Effectively undo 60461 in PIC mode which simply transform V_SET0 / V_SETALLONES into a load from constpool in order to fold into restores. This is not safe to do when PIC base is being used for a number of reasons: 1. GlobalBaseReg may have been spilled. 2. It may not be live at the use. 3. Spiller doesn't know this is happening so it won't prevent GlobalBaseReg from being spilled later (That by itself is a nasty hack. It's needed because we don't insert the reload until later). llvm-svn: 60595	2008-12-05 17:23:48 +00:00
Chris Lattner	211146e709	Fix test/Transforms/GVN/pre-load.ll llvm-svn: 60594	2008-12-05 17:04:12 +00:00
Evan Cheng	6879b66c9e	Fix comment. llvm-svn: 60592	2008-12-05 17:00:16 +00:00
Chris Lattner	35547ba5ca	Make IsValueFullyAvailableInBlock safe. llvm-svn: 60588	2008-12-05 07:49:08 +00:00
Dan Gohman	1e7dff35a6	Drop the reg argument to isRegReDefinedByTwoAddr, which was redundant. llvm-svn: 60586	2008-12-05 05:45:42 +00:00
Dan Gohman	c157324a23	Teach StackSlotColoring to update MachineMemOperands when changing the stack slots on an instruction, to keep them consistent with the actual memory addresses. llvm-svn: 60584	2008-12-05 05:31:14 +00:00
Dan Gohman	be3e0caacb	Ignore IMPLICIT_DEF instructions when computing physreg liveness. While they appear to provide a normal clobbering def, they don't in the case of the awkward IMPLICIT_DEF+INSERT_SUBREG idiom. It would be good to change INSERT_SUBREG; until then, this change allows post-regalloc scheduling to cope in a mildly conservative way. llvm-svn: 60583	2008-12-05 05:30:02 +00:00
Evan Cheng	1b795803dd	Re-did 60519. It turns out Darwin's handling of hidden visibility symbols is a bit more complicated than I expected. Both declarations and weak definitions still need a stub indirection. However, the stubs are in data section and they contain the addresses of the actual symbols. llvm-svn: 60571	2008-12-05 01:06:39 +00:00
Ted Kremenek	a851e459e1	Have raw_fd_ostream keep track of the position in the file to make tell() go faster by not requiring a flush(). llvm-svn: 60560	2008-12-04 22:51:11 +00:00
Devang Patel	4fcea36b8b	Rewrite code that 1) filters loops and 2) calculates new loop bounds. This fixes many bugs. I will add more test cases in a separate check-in. Some day, the code that manipulates CFG and updates dom. info could use refactoring help. llvm-svn: 60554	2008-12-04 21:38:42 +00:00
Owen Anderson	9e2293bda3	Factor out some common code. llvm-svn: 60553	2008-12-04 21:20:30 +00:00
Scott Michel	6e9747d2d6	CellSPU: Fix bug 3055 - Add v4f32, v2f64 to LowerVECTOR_SHUFFLE - Look for vector rotate in shuffle elements, generate a vector rotate instead of a full-blown shuffle when opportunity presents itself. - Generate larger test harness and fix a few interesting but obscure bugs. llvm-svn: 60552	2008-12-04 21:01:44 +00:00
Duncan Sands	658b461a3c	When allocating a stack temporary, use the correct number of bytes for types such as i1 which are not a multiple of 8 bits in length. llvm-svn: 60543	2008-12-04 18:08:40 +00:00
Scott Michel	26d15f31ac	Missing closing brace and reverse conditional condition on NDEBUG llvm-svn: 60541	2008-12-04 17:16:59 +00:00
Chris Lattner	7b3576824a	Start simplifying a switch that has a successor that is a switch. llvm-svn: 60534	2008-12-04 06:31:07 +00:00
Chris Lattner	3acb266d60	This code is apparently quite confused. In the meantime, get it building when NDEBUG is set. llvm-svn: 60532	2008-12-04 06:14:27 +00:00
Bill Wendling	a0466523bd	Temporarily revert r60519. It was causing a bootstrap failure: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/sys-include -DHAVE_CONFIG_H -I. -I../../../llvm-gcc.src/libgomp -I. -I../../../llvm-gcc.src/libgomp/config/posix -I../../../llvm-gcc.src/libgomp -Wall -pthread -Werror -O2 -g -O2 -MT barrier.lo -MD -MP -MF .deps/barrier.Tpo -c ../../../llvm-gcc.src/libgomp/barrier.c -fno-common -DPIC -o .libs/barrier.o checking for sys/file.h... /var/folders/zG/zGE-ZJOGFiGjv0B5cs5oYE+++TM/-Tmp-//cc34Jg5P.s:13:non-relocatable subtraction expression, "_gomp_tls_key" minus "L1$pb" /var/folders/zG/zGE-ZJOGFiGjv0B5cs5oYE+++TM/-Tmp-//cc34Jg5P.s:13:symbol: "_gomp_tls_key" can't be undefined in a subtraction expression make[4]: * [barrier.lo] Error 1 make[4]: * Waiting for unfinished jobs.... /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.5.0/sys-include -DHAVE_CONFIG_H -I. -I../../../llvm-gcc.src/libgomp -I. -I../../../llvm-gcc.src/libgomp/config/posix -I../../../llvm-gcc.src/libgomp -Wall -pthread -Werror -O2 -g -O2 -MT alloc.lo -MD -MP -MF .deps/alloc.Tpo -c ../../../llvm-gcc.src/libgomp/alloc.c -o alloc.o >/dev/null 2>&1 yes checking for sys/param.h... make[3]: * [all-recursive] Error 1 make[2]: * [all] Error 2 make[1]: * [all-target-libgomp] Error 2 make[1]: * Waiting for unfinished jobs.... llvm-svn: 60527	2008-12-04 04:07:00 +00:00
Scott Michel	1f907dd784	CellSPU: - First patch from Nehal Desai, a new contributor at Aerospace. Nehal's patch fixes sign/zero/any-extending loads for integers and floating point. Example code, compiled w/o debugging or optimization where he first noticed the bug: int main(void) { float a = 99.0; printf("%d\n", a); return 0; } Verified that this code actually works on a Cell SPU. Changes by Scott Michel: - Fix bug in the value type list constructed by SPUISD::LDRESULT to include both the load result's result and chain, not just the chain alone. - Simplify LowerLOAD and remove extraneous and unnecessary chains. - Remove unused SPUISD pseudo instructions. llvm-svn: 60526	2008-12-04 03:02:42 +00:00
Dan Gohman	6ff2c1234b	Use register names instead of numbers in debug output. llvm-svn: 60525	2008-12-04 02:15:26 +00:00
Dan Gohman	93e73ed7f2	Make debug output more informative. llvm-svn: 60524	2008-12-04 02:14:57 +00:00
Evan Cheng	d4b7459179	Visibility hidden GVs do not require extra load of symbol address from the GOT or non-lazy-ptr. llvm-svn: 60519	2008-12-04 01:56:50 +00:00
Dan Gohman	f8e215d4b1	Add minimal support for disambiguating memory references. Currently the main thing this covers is spills to distinct spill slots. llvm-svn: 60517	2008-12-04 01:35:46 +00:00
Chris Lattner	2677286c25	add a debugging option to help track down j-t problems. llvm-svn: 60514	2008-12-04 00:07:59 +00:00
Dan Gohman	3836431ec6	Rewrite the liveness bookkeeping code to fix a bunch of issues with subreg operands and tied operands. llvm-svn: 60510	2008-12-03 23:07:27 +00:00
Dale Johannesen	0a0e2b1033	Make the debugging dump be a full line. llvm-svn: 60509	2008-12-03 22:45:31 +00:00
Dale Johannesen	119036d435	Remove an unused field. llvm-svn: 60508	2008-12-03 22:43:56 +00:00
Dan Gohman	0edbed16c7	Have PseudoSourceValue override Value::dump, so that it works on PseudoSourceValue values. This also fixes a FIXME in lib/VMCode/AsmWriter.cpp. llvm-svn: 60507	2008-12-03 21:37:21 +00:00
Dale Johannesen	a0b1516bdc	Fix a misspelled function name. llvm-svn: 60506	2008-12-03 20:56:12 +00:00
Chris Lattner	420385f8c3	Factor some code into a new FoldSingleEntryPHINodes method. llvm-svn: 60501	2008-12-03 19:44:02 +00:00
Dan Gohman	19b43e462f	Fix an inconsistency in a comment. llvm-svn: 60500	2008-12-03 19:38:38 +00:00
Evan Cheng	05ded29738	Use mmx (punpckldq VR64, (mmx_v_set0)) to clear high 32-bits of a VR64 register. llvm-svn: 60499	2008-12-03 19:38:05 +00:00
Dan Gohman	af9b4a8a21	Don't charge the full latency for anti and output dependencies. This is an area where eventually it would be good to use target-dependent information. llvm-svn: 60498	2008-12-03 19:37:34 +00:00
Dale Johannesen	6322cd40c6	A step towards getting linux ppc to work (see PR 3099) llvm-svn: 60497	2008-12-03 19:33:10 +00:00
Dan Gohman	4f8709518d	When looking for anti-dependences on the critical path, don't bother examining non-anti-dependence edges. llvm-svn: 60496	2008-12-03 19:32:26 +00:00
Dan Gohman	1020320a05	Add a comment about callee-saved registers. llvm-svn: 60495	2008-12-03 19:30:13 +00:00
Dale Johannesen	a851280d26	Fix a really wrong comment. llvm-svn: 60494	2008-12-03 19:25:46 +00:00
Dan Gohman	74529a2226	Split foldMemoryOperand into public non-virtual and protected virtual parts, and add target-independent code to add/preserve MachineMemOperands. llvm-svn: 60488	2008-12-03 18:43:12 +00:00
Dan Gohman	5dad0993a9	Rename isSimpleLoad to canFoldAsLoad, to better reflect its meaning. llvm-svn: 60487	2008-12-03 18:15:48 +00:00
Dan Gohman	fc05cdda64	Extend X86's addFrameReference to add a MachineMemOperand for the frame reference. This will help post-RA scheduling determine that spills to distinct stack slots are independent. llvm-svn: 60486	2008-12-03 18:11:40 +00:00
Dan Gohman	6be47e9542	Update a comment. llvm-svn: 60484	2008-12-03 17:10:41 +00:00
Duncan Sands	fbc8da66d6	Only check that the result of the mapping was not a new node if the node was actually remapped. llvm-svn: 60482	2008-12-03 12:36:16 +00:00
Rafael Espindola	0c800cf35e	Fix bug 3140. Print a single parameter .file directive if we have an ELF target. llvm-svn: 60480	2008-12-03 11:01:37 +00:00
Richard Osborne	e74ae9dbb7	Add support for ISD::TRAP to the XCore backend llvm-svn: 60479	2008-12-03 10:59:16 +00:00
Evan Cheng	440e75e1d5	Refactor code. No functionality change. llvm-svn: 60478	2008-12-03 08:38:43 +00:00
Bill Wendling	d2208d570b	CC should only be a ConstantSDNode at this point. Just use 'cast' instead of 'dyn_cast'. llvm-svn: 60477	2008-12-03 08:32:02 +00:00
Chris Lattner	f00b2f3fb4	Teach jump threading some more simple tricks: 1) have it fold "br undef", which does occur with surprising frequency as jump threading iterates. 2) teach j-t to delete dead blocks. This removes the successor edges, reducing the in-edges of other blocks, allowing recursive simplification. 3) Fold things like: br COND, BBX, BBY BBX: br COND, BBZ, BBW which also happens because jump threading iterates. llvm-svn: 60470	2008-12-03 07:48:08 +00:00
Chris Lattner	29326a6d1f	third time is the charm. llvm-svn: 60469	2008-12-03 07:45:15 +00:00
Chris Lattner	d03c1b5440	fix assertion. llvm-svn: 60468	2008-12-03 07:43:05 +00:00
Chris Lattner	7a00825f57	Rename DeleteBlockIfDead to DeleteDeadBlock and make it unconditionally delete the block. All likely clients will do the checking anyway. llvm-svn: 60464	2008-12-03 06:40:52 +00:00
Chris Lattner	12c3938837	Factor some code out of SimplifyCFG, forming a new DeleteBlockIfDead method. llvm-svn: 60463	2008-12-03 06:37:44 +00:00
Dan Gohman	ac6561793c	Mark x86's V_SET0 and V_SETALLONES with isSimpleLoad, and teach X86's foldMemoryOperand how to "fold" them, by converting them into constant-pool loads. When they aren't folded, they use xorps/cmpeqd, but for example when register pressure is high, they may now be folded as memory operands, which reduces register pressure. Also, mark V_SET0 isAsCheapAsAMove so that two-address-elimination will remat it instead of copying zeros around (V_SETALLONES was already marked). llvm-svn: 60461	2008-12-03 05:21:24 +00:00
Dan Gohman	6333d48459	Add a sanity-check to tablegen to catch the case where isSimpleLoad is set but mayLoad is not set. Fix all the problems this turned up. Change code to not use isSimpleLoad instead of mayLoad unless it really wants isSimpleLoad. llvm-svn: 60459	2008-12-03 02:30:17 +00:00
Dan Gohman	18c4a4c9ea	Fix a missing #include. llvm-svn: 60458	2008-12-03 02:10:00 +00:00
Dan Gohman	86b0a220af	Fix this comment to reflect that it applies to types other than just i32. llvm-svn: 60455	2008-12-03 01:39:44 +00:00

26470 Commits