llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-07 03:40:35 +00:00

Author	SHA1	Message	Date
Benjamin Kramer	a58b69aa9d	Try to reuse the value when lowering memset. This allows us to compile: void test(char *s, int a) { __builtin_memset(s, a, 15); } into 1 mul + 3 stores instead of 3 muls + 3 stores. llvm-svn: 122710	2011-01-02 19:57:05 +00:00
Benjamin Kramer	38491f47ce	Lower the i8 extension in memset to a multiply instead of a potentially long series of shifts and ors. We could implement a DAGCombine to turn x * 0x0101 back into logic operations on targets that doesn't support the multiply or it is slow (p4) if someone cares enough. Example code: void test(char *s, int a) { __builtin_memset(s, a, 4); } before: _test: ## @test movzbl 8(%esp), %eax movl %eax, %ecx shll $8, %ecx orl %eax, %ecx movl %ecx, %eax shll $16, %eax orl %ecx, %eax movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret after: _test: ## @test movzbl 8(%esp), %eax imull $16843009, %eax, %eax ## imm = 0x1010101 movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret llvm-svn: 122707	2011-01-02 19:44:58 +00:00
Cameron Zwarich	ae468579bb	Use getVRegDef() instead of def_iterator. This leads to fewer defs being added with 2-address instructions, for about a 3.5% speedup of StrongPHIElimination on 403.gcc. llvm-svn: 122635	2010-12-30 00:42:23 +00:00
Cameron Zwarich	1e7124e6fa	None of the other pass names in CodeGen have terminating periods. llvm-svn: 122628	2010-12-29 11:49:10 +00:00
Cameron Zwarich	a7052a3c06	Instead of processing every instruction when splitting interferences, only process those instructions that define phi sources. This is a 47% speedup of StrongPHIElimination compile time on 403.gcc. llvm-svn: 122627	2010-12-29 11:00:09 +00:00
Cameron Zwarich	292870da06	Add a missing word to a comment. llvm-svn: 122625	2010-12-29 04:42:39 +00:00
Cameron Zwarich	6fc15ba38b	Add text explaining an assertion. llvm-svn: 122617	2010-12-29 03:52:51 +00:00
Cameron Zwarich	0fa638e27c	Simplify some code in MachineVerifier that was doing the correct thing, but not in the most obvious way. llvm-svn: 122610	2010-12-28 23:45:38 +00:00
Cameron Zwarich	3eacb7fff8	Revert the optimization in r122596. It is correct for all current targets, but it relies on assumptions that may not be true in the future. llvm-svn: 122608	2010-12-28 23:02:56 +00:00
Cameron Zwarich	c9c7488542	Avoid iterating every operand of an instruction in StrongPHIElimination, since we are only interested in the defs when discovering interferences. This is a 28% speedup running StrongPHIElimination on 403.gcc. llvm-svn: 122596	2010-12-28 10:49:33 +00:00
Duncan Sands	cc5a4497fd	Pacify the compiler. BestWeight cannot in fact be used uninitialized in this function, but the compiler was warning that it might be when doing a release build. llvm-svn: 122595	2010-12-28 10:07:15 +00:00
Cameron Zwarich	30f2239301	Change an assertion to assert what the code actually relies upon. llvm-svn: 122586	2010-12-27 22:08:42 +00:00
Cameron Zwarich	cfdb10a1bb	Land a first cut at StrongPHIElimination. There are only 5 new test failures when running without the verifier, and I have not yet checked them to see if the new results are still correct. There are more verifier failures, but they all seem to be additional occurrences of verifier failures that occur with the existing PHIElimination pass. There are a few obvious issues with the code: 1) It doesn't properly update the register equivalence classes during copy insertion, and instead recomputes them before merging live intervals and renaming registers. I wanted to keep this first patch simple for debugging purposes, but it shouldn't be very hard to do this. 2) It doesn't mix the renaming and live interval merging with the copy insertion process, which leads to a lot of virtual register churn. Virtual registers and live intervals are created, only to later be merged into others. The code should be smarter and only create a new virtual register if there is no existing register in the same congruence class. 3) In one place the code uses a DenseMap per basic block, which is unnecessary heap allocation. There should be an inline storage version of DenseMap. I did a quick compile-time test of running llc on 403.gcc with and without StrongPHIElimination. It is slightly slower with StrongPHIElimination, because the small decrease in the coalescer runtime can't beat the increase in phi elimination runtime. Perhaps fixing the above performance issues will narrow the gap. I also haven't yet run any tests of the quality of the generated code. llvm-svn: 122582	2010-12-27 10:08:19 +00:00
Cameron Zwarich	66289e34e1	Add knowledge of phi-def and phi-kill valnos to MachineVerifier's predecessor valno verification. The "Different value live out of predecessor" check is incorrect in the case of phi-def valnos, so just skip that check for phi-def valnos and instead check that all of the valnos for predecessors have phi-kill. Fixes PR8863. llvm-svn: 122581	2010-12-27 05:17:23 +00:00
Andrew Trick	dfa31b1cf9	Minor cleanup related to my latest scheduler changes. llvm-svn: 122545	2010-12-24 07:10:19 +00:00
Andrew Trick	c926e98fc7	Fix a few cases where the scheduler is not checking for phys reg copies. The scheduling node may have a NULL DAG node, yuck. llvm-svn: 122544	2010-12-24 06:46:50 +00:00
Andrew Trick	134b2a5907	Various bits of framework needed for precise machine-level selection DAG scheduling during isel. Most new functionality is currently guarded by -enable-sched-cycles and -enable-sched-hazard. Added InstrItineraryData::IssueWidth field, currently derived from ARM itineraries, but could be initialized differently on other targets. Added ScheduleHazardRecognizer::MaxLookAhead to indicate whether it is active, and if so how many cycles of state it holds. Added SchedulingPriorityQueue::HasReadyFilter to allowing gating entry into the scheduler's available queue. ScoreboardHazardRecognizer now accesses the ScheduleDAG in order to get information about it's SUnits, provides RecedeCycle for bottom-up scheduling, correctly computes scoreboard depth, tracks IssueCount, and considers potential stall cycles when checking for hazards. ScheduleDAGRRList now models machine cycles and hazards (under flags). It tracks MinAvailableCycle, drives the hazard recognizer and priority queue's ready filter, manages a new PendingQueue, properly accounts for stall cycles, etc. llvm-svn: 122541	2010-12-24 05:03:26 +00:00
Andrew Trick	53f4556c64	whitespace llvm-svn: 122539	2010-12-24 04:28:06 +00:00
Cameron Zwarich	a7ad357a13	Simplify a check for implicit defs and remove a FIXME. llvm-svn: 122537	2010-12-24 03:09:36 +00:00
Chris Lattner	b607e7deda	flags -> glue for selectiondag llvm-svn: 122509	2010-12-23 17:24:32 +00:00
Chris Lattner	fb9ff7a4ff	sdisel flag -> glue. llvm-svn: 122507	2010-12-23 17:13:18 +00:00
Andrew Trick	ca2e267ddc	Reorganize ListScheduleBottomUp in preparation for modeling machine cycles and instruction issue. llvm-svn: 122491	2010-12-23 05:42:20 +00:00
Andrew Trick	c046a115d4	Converted LiveRegCycles to LiveRegGens. It's easier to work with and allows multiple nodes per cycle. llvm-svn: 122474	2010-12-23 04:16:14 +00:00
Andrew Trick	e48d5d8395	In CheckForLiveRegDef use TRI->getOverlaps. llvm-svn: 122473	2010-12-23 03:43:21 +00:00
Andrew Trick	cc701bcfdc	Fixes PR8823: add-with-overflow-128.ll In the bottom-up selection DAG scheduling, handle two-address instructions that read/write unspillable registers. Treat the entire chain of two-address nodes as a single live range. llvm-svn: 122472	2010-12-23 03:15:51 +00:00
Jeffrey Yasskin	a199652a3e	Change all self assignments X=X to (void)X, so that we can turn on a new gcc warning that complains on self-assignments and self-initializations. llvm-svn: 122458	2010-12-23 00:58:24 +00:00
Benjamin Kramer	49942a90b7	DAGCombine add (sext i1), X into sub X, (zext i1) if sext from i1 is illegal. The latter usually compiles into smaller code. example code: unsigned foo(unsigned x, unsigned y) { if (x != 0) y--; return y; } before: _foo: ## @foo cmpl $1, 4(%esp) ## encoding: [0x83,0x7c,0x24,0x04,0x01] sbbl %eax, %eax ## encoding: [0x19,0xc0] notl %eax ## encoding: [0xf7,0xd0] addl 8(%esp), %eax ## encoding: [0x03,0x44,0x24,0x08] ret ## encoding: [0xc3] after: _foo: ## @foo cmpl $1, 4(%esp) ## encoding: [0x83,0x7c,0x24,0x04,0x01] movl 8(%esp), %eax ## encoding: [0x8b,0x44,0x24,0x08] adcl $-1, %eax ## encoding: [0x83,0xd0,0xff] ret ## encoding: [0xc3] llvm-svn: 122455	2010-12-22 23:17:45 +00:00
Jakob Stoklund Olesen	f761c75efb	When RegAllocGreedy decides to spill the interferences of the current register, pick the victim with the lowest total spill weight. llvm-svn: 122445	2010-12-22 22:01:30 +00:00
Jakob Stoklund Olesen	71e527ef4b	Include a shadow of the original CFG edges in the edge bundle graph. llvm-svn: 122444	2010-12-22 22:01:28 +00:00
Chris Lattner	04ef853e23	Fix a bug in ReduceLoadWidth that wasn't handling extending loads properly. We miscompiled the testcase into: _test: ## @test movl $128, (%rdi) movzbl 1(%rdi), %eax ret Now we get a proper: _test: ## @test movl $128, (%rdi) movsbl (%rdi), %eax movzbl %ah, %eax ret This fixes PR8757. llvm-svn: 122392	2010-12-22 08:02:57 +00:00
Chris Lattner	35fcc63498	more cleanups, move a check for "roundedness" earlier to reject unhanded cases faster and simplify code. llvm-svn: 122391	2010-12-22 08:01:44 +00:00
Chris Lattner	60dcb2b5c2	reduce indentation and improve comments, no functionality change. llvm-svn: 122389	2010-12-22 07:36:50 +00:00
Andrew Trick	afec190a28	In DelayForLiveRegsBottomUp, handle instructions that read and write the same physical register. Simplifies the fix from the previous checkin r122211. llvm-svn: 122370	2010-12-21 22:27:44 +00:00
Andrew Trick	1e3ad9f721	whitespace llvm-svn: 122368	2010-12-21 22:25:04 +00:00
Dale Johannesen	e0fb87c3d7	Reapply 122353-122355 with fixes. 122354 was wrong; the shift type was needed one place, the shift count type another. The transform in 123555 had the same problem. llvm-svn: 122366	2010-12-21 21:55:50 +00:00
Dale Johannesen	972aba543a	Revert 122353-122355 for the moment, they broke stuff. llvm-svn: 122360	2010-12-21 21:22:27 +00:00
Dale Johannesen	39186cfb0b	Add a new transform to DAGCombiner. llvm-svn: 122355	2010-12-21 20:10:51 +00:00
Dale Johannesen	5f3e7b08f6	Get the type of a shift from the shift, not from its shift count operand. These should be the same but apparently are not always, and this is cleaner anyway. This improves the code in an existing test. llvm-svn: 122354	2010-12-21 20:06:19 +00:00
Dale Johannesen	bad19334ee	Shift by the word size is invalid IR; don't create it. llvm-svn: 122353	2010-12-21 20:00:06 +00:00
Chris Lattner	8a3058137a	fix some typos llvm-svn: 122349	2010-12-21 18:05:22 +00:00
Stuart Hastings	fedc21e594	Fix indentation, add comment. llvm-svn: 122345	2010-12-21 17:16:58 +00:00
Stuart Hastings	a1f786efa9	Missing logic for nested CALLSEQ_START/END. llvm-svn: 122342	2010-12-21 17:07:24 +00:00
Cameron Zwarich	0243f1d21e	Incremental progress towards a new implementation of StrongPHIElimination. Most of the problems with my last attempt were in the updating of LiveIntervals rather than the coalescing itself. Therefore, I decided to get that right first by essentially reimplementing the existing PHIElimination using LiveIntervals. It works correctly, with only a few tests failing (which may not be legitimate failures) and no new verifier failures (at least as far as I can tell, I didn't count the number per file). llvm-svn: 122321	2010-12-21 06:54:43 +00:00
Chris Lattner	65c5243bd6	rename MVT::Flag to MVT::Glue. "Flag" is a terrible name for something that just glues two nodes together, even if it is sometimes used for flags. llvm-svn: 122310	2010-12-21 02:38:05 +00:00
Chris Lattner	b37e697277	improve "cannot yet select" errors a trivial amount: now they are just as useless, but at least a bit more gramatical llvm-svn: 122305	2010-12-21 02:07:03 +00:00
Jakob Stoklund Olesen	e9eb1be4dd	Add EdgeBundles to SplitKit. Edge bundles is an annotation on the CFG that turns it into a bipartite directed graph where each basic block is connected to an outgoing and an ingoing bundle. These bundles are useful for identifying regions of the CFG for live range splitting. llvm-svn: 122301	2010-12-21 01:50:21 +00:00
Jakob Stoklund Olesen	86786c46c2	Use IntEqClasses to compute connected components of live intervals. llvm-svn: 122296	2010-12-21 00:48:17 +00:00
Dale Johannesen	036c3da142	Cosmetic changes. llvm-svn: 122259	2010-12-20 20:10:50 +00:00
Cameron Zwarich	ad29bd5325	MachineVerifier should count landing pad successors as basic blocks rather than out-edges. Fixes PR8824. llvm-svn: 122228	2010-12-20 04:19:48 +00:00
Cameron Zwarich	6970ec515e	Teach MachineVerifier that early clobber defs begin at USE slots and other defs begin at DEF slots. Fixes the second half of PR8813. llvm-svn: 122225	2010-12-20 03:15:20 +00:00
Cameron Zwarich	31af86ef44	Add a missing check from r122218. llvm-svn: 122224	2010-12-20 02:59:51 +00:00
Chris Lattner	249e131f39	implement type legalization promotion support for SMULO and UMULO, giving ARM (and other 32-bit-only) targets support for i8 and i16 overflow multiplies. The generated code isn't great, but this at least fixes CodeGen/Generic/overflow.ll when running on ARM hosts. llvm-svn: 122221	2010-12-20 02:05:39 +00:00
Cameron Zwarich	bcd02fd9a4	Don't assume that an instruction ending a register's live range always reads the register; it may be a dead def instead. Fixes PR8820. llvm-svn: 122218	2010-12-20 01:22:37 +00:00
Chris Lattner	0f801998bf	Fix a bug in the scheduler's handling of "unspillable" vregs. Imagine we see: EFLAGS = inst1 EFLAGS = inst2 FLAGS gpr = inst3 EFLAGS Previously, we would refuse to schedule inst2 because it clobbers the EFLAGS of the predecessor. However, it also uses the EFLAGS of the predecessor, so it is safe to emit. SDep edges ensure that the right order happens already anyway. This fixes 2 testsuite crashes with the X86 patch I'm going to commit next. llvm-svn: 122211	2010-12-20 00:55:43 +00:00
Chris Lattner	85875bf06b	the result of CheckForLiveRegDef is dead, remove it. llvm-svn: 122209	2010-12-20 00:51:56 +00:00
Chris Lattner	ee7fa0d706	reduce indentation, no functionality change. llvm-svn: 122208	2010-12-20 00:50:16 +00:00
Cameron Zwarich	8c00d690f5	Ignore debug values when performing MachineVerifier liveness checks. Fixes PR8822. llvm-svn: 122207	2010-12-20 00:08:10 +00:00
Cameron Zwarich	c8dfbe7503	Early clobber operands are allowed to be defined at use indices. This fixes one half of PR8813. llvm-svn: 122205	2010-12-19 23:50:53 +00:00
Cameron Zwarich	6f5c1021ba	Fix PR8815 by checking for an explicit clobber def tied to a use operand in ConnectedVNInfoEqClasses::Classify(). llvm-svn: 122202	2010-12-19 22:12:45 +00:00
Cameron Zwarich	37aec9c35d	Fix PR8811 by teaching MachineVerifier about optional defs. llvm-svn: 122199	2010-12-19 21:37:23 +00:00
Cameron Zwarich	64fbc5e267	StrongPHIElimination will never run before TwoAddressInstructionPass. llvm-svn: 122197	2010-12-19 21:32:29 +00:00
Nick Lewycky	c85935836b	Add missing standard headers. Patch by Joerg Sonnenberger! llvm-svn: 122193	2010-12-19 20:43:38 +00:00
Chris Lattner	92dcd2af36	teach MaskedValueIsZero how to analyze ADDE. This is enough to teach it that ADDE(0,0) is known 0 except the low bit, for example. llvm-svn: 122191	2010-12-19 20:38:28 +00:00
Cameron Zwarich	163792fb1f	Remove some checks for StrongPHIElim. These checks make it impossible to use an alternative register allocator that does not require LiveIntervals by specifying it on the command-line for a target that has StrongPHIElimination enabled by default. These checks are pretty meaningless anyways, since StrongPHIElimination and PHIElimination are never used at the same time. llvm-svn: 122176	2010-12-19 18:03:27 +00:00
Chris Lattner	ac82ea26da	fix PR8642: if a critical edge has a PHI value that can trap, isel is required to split the edge. PHI values get evaluated on the edge, not in their predecessor block. llvm-svn: 122170	2010-12-19 04:58:57 +00:00
Jakob Stoklund Olesen	bdf06d6c7b	Apparently, operandices is not a word. llvm-svn: 122135	2010-12-18 03:28:32 +00:00
Jakob Stoklund Olesen	e06ded7533	Teach the inline spiller to attempt folding a load instruction into its single use before rematerializing the load. This allows us to produce: addps LCPI0_1(%rip), %xmm2 Instead of: movaps LCPI0_1(%rip), %xmm3 addps %xmm3, %xmm2 Saving a register and an instruction. The standard spiller already knows how to do this. llvm-svn: 122133	2010-12-18 03:04:14 +00:00
Jakob Stoklund Olesen	485b7965b3	Tweak debug spew. llvm-svn: 122132	2010-12-18 03:04:11 +00:00
Jakob Stoklund Olesen	a2f2eab8d4	Check that the register is live-in to the loop header before inserting copies in the loop predecessors. The register can be live-out from a predecessor without being live-in to the loop header if there is a critical edge from the predecessor. llvm-svn: 122123	2010-12-18 01:06:19 +00:00
Nick Lewycky	30eef45106	Fix GCC warning: lib/CodeGen/RegAllocGreedy.cpp:311: error: unused variable 'PhysReg' [-Wunused-variable] llvm-svn: 122122	2010-12-18 01:05:55 +00:00
Jakob Stoklund Olesen	2879da5e13	Pass a Banner argument to the machine code verifier both from createMachineVerifierPass and MachineFunction::verify. The banner is printed before the machine code dump, just like the printer pass. llvm-svn: 122113	2010-12-18 00:06:56 +00:00
Jakob Stoklund Olesen	6498db2c8c	Avoid dereferencing end() in collectInterferingVRegs() when there is no interference. llvm-svn: 122108	2010-12-17 23:16:38 +00:00
Jakob Stoklund Olesen	db4b62f32e	Make the -verify-regalloc command line option available to base classes as RegAllocBase::VerifyEnabled. Run the machine code verifier in a few interesting places during RegAllocGreedy. llvm-svn: 122107	2010-12-17 23:16:35 +00:00
Jakob Stoklund Olesen	df9e162423	Enable loop splitting in RegAllocGreedy. The heuristics split around the largest loop where the current register may be allocated without interference. llvm-svn: 122106	2010-12-17 23:16:32 +00:00
Bill Wendling	c16f9b1ccc	During local stack slot allocation, the materializeFrameBaseRegister function may be called. If the entry block is empty, the insertion point iterator will be the "end()" value. Calling ->getParent() on it (among others) causes problems. Modify materializeFrameBaseRegister to take the machine basic block and insert the frame base register at the beginning of that block. (It's very similar to what the code does all ready. The only difference is that it will always insert at the beginning of the entry block instead of after a previous materialization of the frame base register. I doubt that that matters here.) <rdar://problem/8782198> llvm-svn: 122104	2010-12-17 23:09:14 +00:00
Bob Wilson	c57c2d755b	Fix a DAGCombiner crash when folding binary vector operations with constant BUILD_VECTOR operands where the element type is not legal. I had previously changed this code to insert TRUNCATE operations, but that was just wrong. llvm-svn: 122102	2010-12-17 23:06:49 +00:00
Dale Johannesen	c2c6ebd82a	Add a transform to DAG Combiner. This improves the code for the case where 32-bit divide by constant is turned into 64-bit multiply by constant. 8771012. llvm-svn: 122090	2010-12-17 21:45:49 +00:00
Jakob Stoklund Olesen	0dc90e6b1b	Allow missing kill flags on an untied operand of a two-address instruction when the operand uses the same register as a tied operand: %r1 = add %r1, %r1 If add were a three-address instruction, kill flags would be required on at least one of the uses. Since it is a two-address instruction, the tied use operand must not have a kill flag. This change makes the kill flag on the untied use operand optional. llvm-svn: 122082	2010-12-17 19:18:41 +00:00
Jakob Stoklund Olesen	f4a0c81371	Add MachineLoopRange comparators for sorting loop lists by number and by area. llvm-svn: 122073	2010-12-17 18:13:52 +00:00
Jakob Stoklund Olesen	40f23cd5ca	Provide LiveIntervalUnion::Query::checkLoopInterference. This is a three-way interval list intersection between a virtual register, a live interval union, and a loop. It will be used to identify interference-free loops for live range splitting. llvm-svn: 122034	2010-12-17 04:09:47 +00:00
Bob Wilson	e06f6eabe7	Fix crash compiling a QQQQ REG_SEQUENCE for a Neon vld3_lane operation. Radar 8776599 llvm-svn: 122018	2010-12-17 01:21:12 +00:00
Bob Wilson	f6fe49f7d2	Fix a comment typo. llvm-svn: 122016	2010-12-17 01:21:05 +00:00
Daniel Dunbar	8ab9be2005	MC: Make TargetAsmBackend available to the AsmStreamer. - Treaty talks on the non-proliferation of MC objects broke down. llvm-svn: 121949	2010-12-16 03:05:59 +00:00
Jakob Stoklund Olesen	1811e4cb20	Start using SplitKit and MachineLoopRanges in RegAllocGreedy in preparation of live range splitting around loops guided by register pressure. So far, trySplit() simply prints a lot of debug output. llvm-svn: 121918	2010-12-15 23:46:13 +00:00
Jakob Stoklund Olesen	d40af5ffbd	Add MachineLoopRanges analysis. A MachineLoopRange contains the intervals of slot indexes covered by the blocks in a loop. This representation of the loop blocks is more efficient to compare against interfering registers during register coalescing. llvm-svn: 121917	2010-12-15 23:41:23 +00:00
Evan Cheng	68e1ed8752	Teach machine cse to commute instructions. llvm-svn: 121903	2010-12-15 22:16:21 +00:00
Dan Gohman	295ba3ab26	Move Value::getUnderlyingObject to be a standalone function so that it can live in Analysis instead of VMCore. llvm-svn: 121885	2010-12-15 20:02:24 +00:00
Jakob Stoklund Olesen	3cfea82733	Fix build. llvm-svn: 121872	2010-12-15 18:07:48 +00:00
Jakob Stoklund Olesen	308656b955	Detect and enumerate bypass loops. Bypass loops have the current live range live through, but contain no uses or defs. Splitting around a bypass loop can free registers for other uses inside the loop by spilling the split range. llvm-svn: 121871	2010-12-15 17:49:52 +00:00
Jakob Stoklund Olesen	849388944e	Separate SplitAnalysis::getSplitLoops(). This method returns the set of loops with uses that are candidates for splitting. llvm-svn: 121870	2010-12-15 17:41:19 +00:00
Chris Lattner	81815cd4db	take care of some todos, transforming [us]mul_lohi into a wider mul if the wider mul is legal. llvm-svn: 121848	2010-12-15 06:04:19 +00:00
Chris Lattner	746d6a1f60	when transforming a MULHS into a wider MUL, there is no need to SRA the result, the top bits are truncated off anyway, just use SRL. llvm-svn: 121846	2010-12-15 05:51:39 +00:00
Jakob Stoklund Olesen	48800c9689	Simplify RegAllocGreedy's use of register aliases. llvm-svn: 121807	2010-12-14 23:38:19 +00:00
Jakob Stoklund Olesen	7ee6f83da1	Simplify CCState's use of register aliases. llvm-svn: 121806	2010-12-14 23:28:01 +00:00
Jakob Stoklund Olesen	03856151db	Simplify AggressiveAntiDepBreaker's use of register aliases. llvm-svn: 121805	2010-12-14 23:23:15 +00:00
Jakob Stoklund Olesen	870fbdd686	Simplyfy RegAllocBasic by using getOverlaps instead of getAliasSet. llvm-svn: 121801	2010-12-14 23:10:48 +00:00
Evan Cheng	7e96e67d98	Fix a minor bug in two-address pass. It was missing a commute opportunity. regB = move RCX regA = op regB, regC RAX = move regA where both regB and regC are killed. If regB is constrainted to non-compatible physical registers but regC is not constrainted at all, then it's better to commute the instruction. movl %edi, %eax shlq $32, %rcx leaq (%rcx,%rax), %rax => movl %edi, %eax shlq $32, %rcx orq %rcx, %rax rdar://8762995 llvm-svn: 121793	2010-12-14 21:34:53 +00:00
Matt Beaumont-Gay	01264443a8	Move debugging code entirely within DEBUG(). Silences an unused variable warning in the opt build. llvm-svn: 121791	2010-12-14 21:14:55 +00:00
Jakob Stoklund Olesen	c13ce4748e	Add LiveIntervalUnion print methods, RegAllocGreedy::trySplit debug spew. llvm-svn: 121783	2010-12-14 19:38:49 +00:00
Jakob Stoklund Olesen	c5ad05ca30	Use TRI::printReg instead of AbstractRegisterDescription when printing LiveIntervalUnions. llvm-svn: 121781	2010-12-14 18:53:47 +00:00
Jakob Stoklund Olesen	74ba8b77e6	Q.seenAllInterferences() must be called after Q.collectInterferingVRegs(). llvm-svn: 121774	2010-12-14 17:47:36 +00:00
Jakob Stoklund Olesen	d4d3c5dd1e	Remove unused vector. llvm-svn: 121741	2010-12-14 00:58:47 +00:00
Jakob Stoklund Olesen	00d6ac22d0	Try reassigning all virtual register interferences, not just those with lower spill weight. Filter out fixed registers instead. Add support for reassigning an interference that was assigned to an alias. llvm-svn: 121737	2010-12-14 00:37:49 +00:00
Jakob Stoklund Olesen	ffefd5bd4e	Add stub for RAGreedy::trySplit. llvm-svn: 121736	2010-12-14 00:37:44 +00:00
Chris Lattner	14810c808b	Add a couple dag combines to transform mulhi/mullo into a wider multiply when the wider type is legal. This allows us to compile: define zeroext i16 @test1(i16 zeroext %x) nounwind { entry: %div = udiv i16 %x, 33 ret i16 %div } into: test1: # @test1 movzwl 4(%esp), %eax imull $63551, %eax, %eax # imm = 0xF83F shrl $21, %eax ret instead of: test1: # @test1 movw $-1985, %ax # imm = 0xFFFFFFFFFFFFF83F mulw 4(%esp) andl $65504, %edx # imm = 0xFFE0 movl %edx, %eax shrl $5, %eax ret Implementing rdar://8760399 and example #4 from: http://blog.regehr.org/archives/320 We should implement the same thing for [su]mul_hilo, but I don't have immediate plans to do this. llvm-svn: 121696	2010-12-13 08:39:01 +00:00
Chris Lattner	324f849088	remove the verbose-asm "constant pool double" comments that we were printing for each constant pool entry. Using WriteTypeSymbolic here takes time proportional to the size of the module, for each constant pool entry. This speeds up -verbose-asm llc on 252.eon (a random testcase at my disposal) from 4.4s to 2.137s. llc takes 2.11s with asm-verbose off, so this is now a pretty reasonable cost for verbose comments. llvm-svn: 121691	2010-12-13 07:35:47 +00:00
Chris Lattner	6df4d5d88e	reduce indentation by using continue, no functionality change. llvm-svn: 121662	2010-12-13 01:11:17 +00:00
Duncan Sands	47a4bbd31d	Catch attempts to remove a deleted node from the CSE maps. Better to catch this here rather than later after accessing uninitialized memory etc. Fires when compiling the testcase in PR8237. llvm-svn: 121635	2010-12-12 13:22:50 +00:00
Jakob Stoklund Olesen	a523d5f048	Add named timer groups for the different stages of register allocation. llvm-svn: 121604	2010-12-11 00:19:56 +00:00
Jakob Stoklund Olesen	ef80efea1d	Move MRI into RegAllocBase. Clean up debug output a bit. llvm-svn: 121599	2010-12-10 23:49:00 +00:00
Nick Lewycky	46a6ed1f0f	Remove extraneous close parenthesis. Fix build breakage. llvm-svn: 121596	2010-12-10 23:14:35 +00:00
Nick Lewycky	9afbedbc48	Move variable that's unused in an NDEBUG build inside the DEBUG() macro, fixing lib/CodeGen/RegAllocGreedy.cpp:233: error: unused variable 'TRC' [-Wunused-variable] llvm-svn: 121594	2010-12-10 23:05:10 +00:00
Jakob Stoklund Olesen	6cd6e644e7	Force the greedy register allocator to always use the inline spiller. Soon, RegAllocGreedy will start splitting live ranges, and then deferred spilling won't work anyway. llvm-svn: 121591	2010-12-10 22:54:44 +00:00
Jakob Stoklund Olesen	cbd4bac09d	Rip out live range splitting support from the inline spiller. The spiller should only spill. The register allocator will drive live range splitting, it has the needed information about register pressure and interferences. llvm-svn: 121590	2010-12-10 22:54:40 +00:00
Jakob Stoklund Olesen	5ab6552845	Use AllocationOrder in RegAllocGreedy, fix a bug in the hint calculation. llvm-svn: 121584	2010-12-10 22:21:05 +00:00
Jakob Stoklund Olesen	ea59381fc8	Fix miscompilation caused by trivial logic error in the reassignVReg() interference check. llvm-svn: 121519	2010-12-10 20:45:04 +00:00
Jakob Stoklund Olesen	e3924a3c85	Add an AllocationOrder class that can iterate over the allocatable physical registers for a given virtual register. Reserved registers are filtered from the allocation order, and any valid hint is returned as the first suggestion. For target dependent hints, a number of arcane target hooks are invoked. llvm-svn: 121497	2010-12-10 18:36:02 +00:00
Rafael Espindola	0e665e502d	Fixed version of 121434 with no new memory leaks. llvm-svn: 121471	2010-12-10 07:39:47 +00:00
Rafael Espindola	011e168728	Revert my previous patch to make the valgrind bots happy. llvm-svn: 121461	2010-12-10 04:01:09 +00:00
Rafael Espindola	03ad1e8f1f	Initial support for the cfi directives. This is just enough to get f: .cfi_startproc nop .cfi_endproc assembled (on ELF). llvm-svn: 121434	2010-12-09 23:48:29 +00:00
Stuart Hastings	f7bba0cfe3	Initial support for nested CALLSEQ_START/CALLSEQ_END constructs in LegalizeDAG. Necessary for byval support on ARM. Radar 7662569. llvm-svn: 121412	2010-12-09 21:25:20 +00:00
Jakob Stoklund Olesen	fe4b9ee934	Remember to filter out reserved rergisters from the allocation order. llvm-svn: 121411	2010-12-09 21:20:46 +00:00
Jakob Stoklund Olesen	5bb5c67227	Add a forgotten initializer for CheckedFirstInterference. llvm-svn: 121410	2010-12-09 21:20:44 +00:00
Andrew Trick	ec37b93b07	Added register reassignment prototype to RAGreedy. It's a simple heuristic to reshuffle register assignments when we can't find an available reg. llvm-svn: 121388	2010-12-09 18:15:21 +00:00
Eric Christopher	ebd7ab9857	80-col fixups. llvm-svn: 121356	2010-12-09 04:48:06 +00:00
Jakob Stoklund Olesen	17b2e8c293	IntervalMap iterators are heavyweight, so avoid copying them around and use references instead. Similarly, IntervalMap::begin() is almost as expensive as find(), so use find(x) instead of begin().advanceTo(x); This makes RegAllocBasic run another 5% faster. llvm-svn: 121344	2010-12-09 01:06:52 +00:00
Devang Patel	b3e0d80b1f	DW_FORM_data1 may not provide sufficient room for vtable index, use _udata instead. This fixes radar 8730409. llvm-svn: 121323	2010-12-09 00:10:40 +00:00
Jakob Stoklund Olesen	ffc0f6586a	Properly deal with empty intervals when checking for interference. llvm-svn: 121319	2010-12-08 23:51:35 +00:00
Jakob Stoklund Olesen	37df3f04c2	Implement very primitive hinting support in RegAllocGreedy. The hint is simply tried first and then forgotten if it couldn't be allocated immediately. llvm-svn: 121306	2010-12-08 22:57:16 +00:00
Jakob Stoklund Olesen	3c81b6a50b	Store (priority,regnum) pairs in the priority queue instead of providing an abstract priority queue interface in subclasses that want to override the priority calculations. Subclasses must provide a getPriority() implementation instead. This approach requires less code as long as priorities are expressable as simple floats, and it avoids the dangers of defining potentially expensive priority comparison functions. It also should speed up priority_queue operations since they no longer have to chase pointers when comparing registers. This is not measurable, though. Preferably, we shouldn't use floats to guide code generation. The use of floats here is derived from the use of floats for spill weights. Spill weights have a dynamic range that doesn't lend itself easily to a fixpoint implementation. When someone invents a stable spill weight representation, it can be reused for allocation priorities. llvm-svn: 121294	2010-12-08 22:22:41 +00:00
Eric Christopher	d492f798d1	Reword comment slightly. llvm-svn: 121293	2010-12-08 22:21:42 +00:00
Eric Christopher	77d3a7b3fb	Fix comment. llvm-svn: 121285	2010-12-08 21:35:09 +00:00
Jakob Stoklund Olesen	f04d283db1	Trim includes. llvm-svn: 121283	2010-12-08 21:12:00 +00:00
Andrew Trick	fb72ca2129	Generalize PostRAHazardRecognizer so it can be used in any pass for both forward and backward scheduling. Rename it to ScoreboardHazardRecognizer (Scoreboard is one word). Remove integer division from the scoreboard's critical path. llvm-svn: 121274	2010-12-08 20:04:29 +00:00
Jakob Stoklund Olesen	d638b989f2	Stub out RegAllocGreedy. This new register allocator is initially identical to RegAllocBasic, but it will receive all of the tricks that RegAllocBasic won't get. RegAllocGreedy will eventually replace linear scan. llvm-svn: 121234	2010-12-08 03:26:16 +00:00
Jakob Stoklund Olesen	77e7ad803a	Move RABasic::addMBBLiveIns to the base class, it is generally useful. Minor optimization to the use of IntervalMap iterators. They are fairly heavyweight, so prefer SI.valid() over SI != end(). llvm-svn: 121217	2010-12-08 01:06:06 +00:00
Jakob Stoklund Olesen	9d6472e894	Switch LiveIntervalUnion from std::set to IntervalMap. This speeds up RegAllocBasic by 20%, not counting releaseMemory which becomes way faster. llvm-svn: 121201	2010-12-07 23:18:47 +00:00
Jakob Stoklund Olesen	39e22e19bf	Simplify assertion. llvm-svn: 121162	2010-12-07 18:51:27 +00:00
Jay Foad	79e18ed269	PR5207: Change APInt methods trunc(), sext(), zext(), sextOrTrunc() and zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method trunc(), to be const and to return a new value instead of modifying the object in place. llvm-svn: 121120	2010-12-07 08:25:19 +00:00
Jakob Stoklund Olesen	48ba44334f	Remove unused member. llvm-svn: 121098	2010-12-07 01:32:45 +00:00
Devang Patel	12459bc442	Undefined value in reg 0 may need a marker to identify end of source range. This will be used to truncate live range of DBG_VALUE instruction by register allocator and friends. llvm-svn: 121061	2010-12-06 22:48:22 +00:00
Devang Patel	6fe7fe8dd4	If dbg_declare() or dbg_value() is not lowered by isel then emit DEBUG message instead of creating DBG_VALUE for undefined value in reg0. llvm-svn: 121059	2010-12-06 22:39:26 +00:00
Rafael Espindola	3e954d16f4	Second try at making direct object emission produce the same results as llc + llvm-mc. This time ELF is not changed and I tested that llvm-gcc bootstrap on darwin10 using darwin9's assembler and linker. llvm-svn: 121006	2010-12-06 17:27:56 +00:00
Rafael Espindola	4ec917db9b	Revert previous two patches while I try to find out how to make both linux and darwin assemblers happy :-( llvm-svn: 121004	2010-12-06 15:35:15 +00:00
Rafael Espindola	3dc2b4cba7	Add an EmitAbsValue helper method and use it in cases where we want to be sure that no relocations are used (on MochO). Fixes llc producing different output from llc + llvm-mc. llvm-svn: 121000	2010-12-06 14:53:14 +00:00
Cameron Zwarich	f56ba80bb2	Some cleanup before I start committing some incremental progress on StrongPHIElimination. llvm-svn: 120961	2010-12-05 22:34:08 +00:00
Cameron Zwarich	f64c26bb9e	Remove the PHIElimination.h header, as it is no longer needed. llvm-svn: 120959	2010-12-05 21:39:42 +00:00
Cameron Zwarich	fbe9e91d97	I forgot to actually remove the FindCopyInsertPoint() declaration from PHIElimination.h. llvm-svn: 120953	2010-12-05 19:58:57 +00:00
Cameron Zwarich	cb613dcf69	Remove the SplitCriticalEdge() method declaration from PHIElimination.h. At one time, this method existed, but now PHIElimination uses the method of the same name on MachineBasicBlock. llvm-svn: 120952	2010-12-05 19:54:23 +00:00
Cameron Zwarich	c680f44c1b	Move the FindCopyInsertPoint method of PHIElimination to a new standalone function so that it can be shared with StrongPHIElimination. llvm-svn: 120951	2010-12-05 19:51:05 +00:00
Cameron Zwarich	5e3c712e67	Remove PHIElimination's private copy of SkipPHIsAndLabels. llvm-svn: 120918	2010-12-04 20:40:15 +00:00
Benjamin Kramer	009451fddc	Remove unneeded zero arrays. llvm-svn: 120910	2010-12-04 15:28:22 +00:00
Jakob Stoklund Olesen	8ed86d7fd3	Rename virtRegMap to avoid confusion with the VirtRegMap that it isn't. llvm-svn: 120846	2010-12-03 22:25:09 +00:00
Jakob Stoklund Olesen	8b893e3575	Coalesce debug locations when possible, causing less DBG_VALUE instructions to be emitted. llvm-svn: 120845	2010-12-03 22:25:07 +00:00
Jakob Stoklund Olesen	c18ef29bc6	Emit DBG_VALUE instructions from LiveDebugVariables. llvm-svn: 120842	2010-12-03 21:47:10 +00:00
Jakob Stoklund Olesen	1d753a5f7f	Also update virtRegMap when renaming virtual registers. llvm-svn: 120841	2010-12-03 21:47:08 +00:00
Jakob Stoklund Olesen	1ec6a68038	Delete the StrongPHIElimination pass, leaving only a shell. The StrongPHIElimination pass did not work, and nobody has worked on it for two years. A rewrite is underway, so I am leaving this shell pass instead of deleting it completely. llvm-svn: 120830	2010-12-03 19:21:53 +00:00
Jakob Stoklund Olesen	08f52108b1	Update LiveDebugVariables during coalescing. llvm-svn: 120720	2010-12-02 18:15:44 +00:00
Jakob Stoklund Olesen	54b6cd6d38	Implement the first half of LiveDebugVariables. Scan the MachineFunction for DBG_VALUE instructions, and replace them with a data structure similar to LiveIntervals. The live range of a DBG_VALUE is determined by propagating it down the dominator tree until a new DBG_VALUE is found. When a DBG_VALUE lives in a register, its live range is confined to the live range of the register's value. LiveDebugVariables runs before coalescing, so DBG_VALUEs are not artificially extended when registers are joined. The missing half will recreate DBG_VALUE instructions from the intervals when register allocation is complete. The pass is disabled by default. It can be enabled with the temporary command line option -live-debug-variables. llvm-svn: 120636	2010-12-02 00:37:37 +00:00
Jay Foad	789e8ac4f7	PR5207: Rename overloaded APInt methods set(), clear(), flip() to setAllBits(), setBit(unsigned), etc. llvm-svn: 120564	2010-12-01 08:53:58 +00:00
Andrew Trick	a2cec41172	Comment typo. llvm-svn: 120504	2010-11-30 23:59:50 +00:00
Evan Cheng	f7e586d749	Enable sibling call optimization of libcalls which are expanded during legalization time. Since at legalization time there is no mapping from SDNode back to the corresponding LLVM instruction and the return SDNode is target specific, this requires a target hook to check for eligibility. Only x86 and ARM support this form of sibcall optimization right now. rdar://8707777 llvm-svn: 120501	2010-11-30 23:55:39 +00:00
Andrew Trick	89dd5f7eac	Coding style. No significant functionality. Abandon linear scan style in favor of the widespread llvm style. Capitalize variables and add newlines for visual parsing. Rename variables for readability. And other cleanup. llvm-svn: 120490	2010-11-30 23:18:47 +00:00
Chris Lattner	f541629573	add TLI support indicating that jumps are more expensive than logical operations and use this to disable a specific optimization. Patch by Micah Villmow! llvm-svn: 120435	2010-11-30 18:12:52 +00:00
Jay Foad	c663d7e4f6	PR5207: Make APInt::set(), APInt::clear() and APInt::flip() return void. llvm-svn: 120413	2010-11-30 09:02:01 +00:00
Jakob Stoklund Olesen	59dfcec5f9	Stub out a new LiveDebugVariables pass. This analysis is going to run immediately after LiveIntervals. It will stay alive during register allocation and keep track of user variables mentioned in DBG_VALUE instructions. When the register allocator is moving values between registers and the stack, it is very hard to keep track of DBG_VALUE instructions. We usually get it wrong. This analysis maintains a data structure that makes it easy to update DBG_VALUE instructions. llvm-svn: 120385	2010-11-30 02:17:10 +00:00
Michael J. Spencer	d5ec932c3a	Merge System into Support. llvm-svn: 120298	2010-11-29 18:16:10 +00:00
Bob Wilson	3f245c3b4f	Fix a comment typo. llvm-svn: 120235	2010-11-28 06:51:19 +00:00
Anton Korobeynikov	598465c605	Move more PEI-related hooks to TFI llvm-svn: 120229	2010-11-27 23:05:25 +00:00
Anton Korobeynikov	c87f68e32e	Move callee-saved regs spills / reloads to TFI llvm-svn: 120228	2010-11-27 23:05:03 +00:00
Benjamin Kramer	35b0f6e5a6	Namespacify. llvm-svn: 120146	2010-11-25 16:42:51 +00:00
Wesley Peck	d589353ad0	Renaming ISD::BIT_CONVERT to ISD::BITCAST to better reflect the LLVM IR concept. llvm-svn: 119990	2010-11-23 03:31:01 +00:00
Benjamin Kramer	632a91cba5	Implement the "if (X == 6 \|\| X == 4)" -> "if ((X\|2) == 6)" optimization. This currently only catches the most basic case, a two-case switch, but can be extended later. llvm-svn: 119964	2010-11-22 09:45:38 +00:00
Anton Korobeynikov	ff8c52bd51	Move some more hooks to TargetFrameInfo llvm-svn: 119904	2010-11-20 15:59:32 +00:00
Benjamin Kramer	c99bab43c9	Silence Release build warnings about unused functions. llvm-svn: 119903	2010-11-20 15:53:24 +00:00
Duncan Sands	028cf0619e	On X86, MEMBARRIER, MFENCE, SFENCE, LFENCE are not target memory intrinsics, so don't claim they are. They are allocated using DAG.getNode, so attempts to access MemSDNode fields results in reading off the end of the allocated memory. This fixes crashes with "llc -debug" due to debug code trying to print MemSDNode fields for these barrier nodes (since the crashes are not deterministic, use valgrind to see this). Add some nasty checking to try to catch this kind of thing in the future. llvm-svn: 119901	2010-11-20 11:25:00 +00:00
Andrew Trick	3166f72d7a	Removing the useless test that I added recently. It was meant as an example, but not complicated enough to merit another test. llvm-svn: 119898	2010-11-20 07:26:51 +00:00
Andrew Trick	189c10021d	RABasic fix. Regalloc is responsible for updating block live ins. llvm-svn: 119896	2010-11-20 02:57:05 +00:00
Andrew Trick	39d05a622e	Whitespace. llvm-svn: 119895	2010-11-20 02:43:55 +00:00
Bill Wendling	55c499847a	Check for _setjmp too, because it's also used. llvm-svn: 119875	2010-11-20 00:03:09 +00:00
Mon P Wang	4965983b22	Make isScalarToVector to return false if the node is a scalar. This will prevent DAGCombine from making an illegal transformation of bitcast of a scalar to a vector into a scalar_to_vector. llvm-svn: 119819	2010-11-19 19:08:12 +00:00
Jakob Stoklund Olesen	e3f7aad5c5	Don't attempt trivial coalescing for sub-register copies. Patch by Krister Wombell! llvm-svn: 119791	2010-11-19 05:45:24 +00:00
Rafael Espindola	b33e5e07e1	Change some methods in MCDwarf.cpp to be able to handle an arbitrary MCStreamer instead of just MCObjectStreamer. Address changes cannot be as efficient as we have to use DW_LNE_set_addres, but at least most of the logic is shared. This will be used so that, with CodeGen still using EmitDwarfLocDirective, llvm-gcc is able to produce debug_line sections without needing an assembler that supports .loc. llvm-svn: 119777	2010-11-19 02:26:16 +00:00
Anton Korobeynikov	ce676f96e1	Move getInitialFrameState() to TargetFrameInfo llvm-svn: 119754	2010-11-18 23:25:52 +00:00
Anton Korobeynikov	269e7d3be1	Move hasFP() and few related hooks to TargetFrameInfo. llvm-svn: 119740	2010-11-18 21:19:35 +00:00
Duncan Sands	9327aedbea	Fix thinko: we must turn select(anyext, sext) into sext(select) not anyext(select). Spotted by Frits van Bommel. llvm-svn: 119739	2010-11-18 21:16:28 +00:00
Duncan Sands	a61bc1a41a	The DAGCombiner was threading select over pairs of extending loads even if the extension types were not the same. The result was that if you fed a select with sext and zext loads, as in the testcase, then it would get turned into a zext (or sext) of the select, which is wrong in the cases when it should have been an sext (resp. zext). Reported and diagnosed by Sebastien Deldon. llvm-svn: 119728	2010-11-18 20:05:18 +00:00
Dan Gohman	3b19bfe496	Oops, missed this file when remaing ExpandPseudos to ExpandISelPseudos. llvm-svn: 119717	2010-11-18 18:48:28 +00:00
Dan Gohman	3998a0430f	Rename ExpandPseudos to ExpandISelPseudos to help clarify its role. llvm-svn: 119716	2010-11-18 18:45:06 +00:00
Dan Gohman	b0ffd6beca	Fix typos. llvm-svn: 119712	2010-11-18 17:44:17 +00:00
Dan Gohman	3a630f4051	ExpandPseudos doesn't have any dependencies, so it can use the simple form of INITIALIZE_PASS. llvm-svn: 119707	2010-11-18 17:14:05 +00:00
Rafael Espindola	93a07b464e	Change CodeGen to use .loc directives. This produces a lot more readable output and testing is easier. A good example is the unknown-location.ll test that now can just look for ".loc 1 0 0". We also don't use a DW_LNE_set_address for every address change anymore. llvm-svn: 119613	2010-11-18 02:04:25 +00:00
Dale Johannesen	06f479d543	Do not throw away alignment when generating the DAG for memset; we may need it to decide between MOVAPS and MOVUPS later. Adjust a test that was looking for wrong code. PR 3866 / 8675131. llvm-svn: 119605	2010-11-18 01:35:23 +00:00
John Thompson	b33f935bc3	Bug 8621 fix - pointer cast stripped from inline asm constraint argument. llvm-svn: 119590	2010-11-17 23:58:47 +00:00
Evan Cheng	ce610bd6b3	Remove ARM isel hacks that fold large immediates into a pair of add, sub, and, and xor. The 32-bit move immediates can be hoisted out of loops by machine LICM but the isel hacks were preventing them. Instead, let peephole optimization pass recognize registers that are defined by immediates and the ARM target hook will fold the immediates in. Other changes include 1) do not fold and / xor into cmp to isel TST / TEQ instructions if there are multiple uses. This happens when the 'and' is live out, machine sink would have sinked the computation and that ends up pessimizing code. The peephole pass would recognize situations where the 'and' can be toggled to define CPSR and eliminate the comparison anyway. 2) Move peephole pass to after machine LICM, sink, and CSE to avoid blocking important optimizations. rdar://8663787, rdar://8241368 llvm-svn: 119548	2010-11-17 20:13:28 +00:00
Chris Lattner	5faff99d84	With the newly simplified SourceMgr interfaces and the generalized SrcMgrDiagHandler, we can improve clang diagnostics for inline asm: instead of reporting them on a source line of the original line, we can report it on the correct line wherever the string literal came from. For something like this: void foo() { asm("push %rax\n" ".code32\n"); } we used to get this: (note that the line in t.c isn't helpful) t.c:4:7: error: warning: ignoring directive for now asm("push %rax\n" ^ <inline asm>:2:1: note: instantiated into assembly here .code32 ^ now we get: t.c:5:8: error: warning: ignoring directive for now ".code32\n" ^ <inline asm>:2:1: note: instantiated into assembly here .code32 ^ Note that we're pointing to line 5 properly now. llvm-svn: 119488	2010-11-17 08:20:42 +00:00
Chris Lattner	99b8654169	now that AsmPrinter::EmitInlineAsm is factored right, we can eliminate the cookie argument to the SourceMgr diagnostic stuff. This cleanly separates LLVMContext's inlineasm handler from the sourcemgr error handling definition, increasing type safety and cleaning things up. llvm-svn: 119486	2010-11-17 08:13:01 +00:00
Chris Lattner	2cc6515616	rearrange how the handler in SourceMgr is installed, eliminating the use of the cookie argument to setDiagHandler llvm-svn: 119483	2010-11-17 08:03:32 +00:00
Chris Lattner	003e3db609	refactor the interface to EmitInlineAsm a bit, no functionality change. llvm-svn: 119482	2010-11-17 07:53:40 +00:00
Eric Christopher	fb840882c6	Only avoid the check if we're the last operand before the variable operands in a variadic instruction. llvm-svn: 119446	2010-11-17 00:55:36 +00:00
Dan Gohman	eb3b1c3576	Fix grammaro. llvm-svn: 119386	2010-11-16 21:27:00 +00:00
Evan Cheng	5a62a6c2b6	Add ExpandPseudos.cpp. llvm-svn: 119385	2010-11-16 21:20:36 +00:00
Dan Gohman	52a761760d	Split pseudo-instruction expansion into a separate pass, to make it easier to debug, and to avoid complications when the CFG changes in the middle of the instruction selection process. llvm-svn: 119382	2010-11-16 21:02:37 +00:00
Jakob Stoklund Olesen	fb47e37338	Fix emergency spilling in LiveIntervals::spillPhysRegAroundRegDefsUses. Always spill the full representative register at any point where any subregister is live. This fixes PR8620 which caused the old logic to get confused and not spill anything at all. The fundamental problem here is that the coalescer is too aggressive about physical register coalescing. It sometimes makes it impossible to allocate registers without these emergency spills. llvm-svn: 119375	2010-11-16 19:55:14 +00:00
Jakob Stoklund Olesen	6673408896	Print out the register class of the current interval. llvm-svn: 119374	2010-11-16 19:55:12 +00:00
Eric Christopher	954db96e84	Make the verifier a little quieter on instructions that it's probably (and likely) wrong about anyhow. llvm-svn: 119320	2010-11-16 01:58:21 +00:00
Jakob Stoklund Olesen	5d9371952b	Remember to resize SpillSlotToUsesMap when allocating an emergency spill slot. Use amazing new function call technology instead of writing identical code in multiple places. This fixes PR8604. llvm-svn: 119306	2010-11-16 00:41:01 +00:00
Jakob Stoklund Olesen	ea79975a68	Fix PR8612 in the standard spiller, take two. The live range of a register defined by an early clobber starts at the use slot, not the def slot. Except when it is an early clobber tied to a use operand. Then it starts at the def slot like a standard def. llvm-svn: 119305	2010-11-16 00:40:59 +00:00
Jakob Stoklund Olesen	465c07cc66	Revert "Fix PR8612 in the standard spiller as well." This reverts r119183 which borke the buildbots. llvm-svn: 119270	2010-11-15 21:51:51 +00:00
Evan Cheng	4afa3a6b1f	Code clean up. The peephole pass should be the one updating the instruction iterator, not TII->OptimizeCompareInstr. llvm-svn: 119186	2010-11-15 21:20:45 +00:00
Jakob Stoklund Olesen	920856ab38	Fix PR8612 in the standard spiller as well. The live range of a register defined by an early clobber starts at the use slot, not the def slot. llvm-svn: 119183	2010-11-15 20:55:53 +00:00
Jakob Stoklund Olesen	86fba16eb1	When spilling a register defined by an early clobber, make sure that the new live ranges for the spill register are also defined at the use slot instead of the normal def slot. This fixes PR8612 for the inline spiller. A use was being allocated to the same register as a spilled early clobber def. This problem exists in all the spillers. A fix for the standard spiller is forthcoming. llvm-svn: 119182	2010-11-15 20:55:49 +00:00
Anton Korobeynikov	76c52dcf44	First step of huge frame-related refactoring: move emit{Prologue,Epilogue} out of TargetRegisterInfo to TargetFrameInfo, which is definitely much better suitable place llvm-svn: 119097	2010-11-15 00:06:54 +00:00
Chris Lattner	51168d6510	move the pic base symbol stuff up to MachineFunction since it is trivial and will be shared between ppc and x86. This substantially simplifies the X86 backend also. llvm-svn: 119089	2010-11-14 22:48:15 +00:00
Devang Patel	3dd764feda	Remove DW_AT_start_scope support. It is incomplete and superseeded by location entries support. llvm-svn: 118940	2010-11-12 23:20:42 +00:00
Andrew Trick	68d533cca4	typo (4th checkin for one fix) llvm-svn: 118913	2010-11-12 18:36:03 +00:00
Andrew Trick	d0c919dcad	Fixes PR8287: SD scheduling time. The fix is a failsafe that prevents catastrophic compilation time in the event of unreasonable LLVM IR. Code quality is a separate issue--someone upstream needs to do a better job of reducing to llvm.memcpy. If the situation can be reproduced with any supported frontend, then it will be a separate bug. llvm-svn: 118904	2010-11-12 17:50:46 +00:00
Chris Lattner	bcdb696f6c	tidy up. llvm-svn: 118896	2010-11-12 17:24:29 +00:00
Lang Hames	b10e4b99e3	Fix some style issues in PBQP. Patch by David Blaikie. llvm-svn: 118883	2010-11-12 05:47:21 +00:00
Dan Gohman	d88f1d63ac	Add a FIXME comment. llvm-svn: 118803	2010-11-11 18:08:43 +00:00
Andrew Trick	5e77065d78	Check TRI->getReservedRegs because other allocators do it. Even though it makes no sense for allocation_order iterators to visit reserved regs. The inline spiller depends on AliasAnalysis. Manage the Query state to avoid uninitialized or stale results. llvm-svn: 118800	2010-11-11 17:46:29 +00:00
Dan Gohman	8e986f9c2f	Remove the memmove->memcpy optimization from CodeGen. MemCpyOpt does this. llvm-svn: 118789	2010-11-11 16:24:49 +00:00
Jakob Stoklund Olesen	e7e1f14d50	Delete SplittingSpiller. It was not being used by anyone, and it is being superceded by SplitKit. llvm-svn: 118754	2010-11-11 00:52:44 +00:00
Jakob Stoklund Olesen	313b78d28e	Insert two blank SlotIndexes between basic blocks instead of just one. This is the first small step towards using closed intervals for liveness instead of the half-open intervals we're using now. We want to be able to distinguish between a SlotIndex that represents a variable being live-out of a basic block, and an index representing a variable live-in to its successor. That requires two separate indexes between blocks. One for live-outs and one for live-ins. With this change, getMBBEndIdx(MBB).getPrevSlot() becomes stable so it stays greater than any instructions inserted at the end of MBB. llvm-svn: 118747	2010-11-11 00:19:20 +00:00
Jakob Stoklund Olesen	2f72fda7ef	No need to add liveness that's already there. llvm-svn: 118742	2010-11-10 23:56:00 +00:00
Jakob Stoklund Olesen	6a63cbe06d	Hook up AliasAnalysis in InlineSpiller. This is used for rematerializing constant loads. llvm-svn: 118741	2010-11-10 23:55:56 +00:00
Devang Patel	5520a6fc97	Take care of special characters while creating named MDNode name to hold function specific local variable's info. This fixes radar 8653152. I am checking in testcase as a separate check-in. llvm-svn: 118726	2010-11-10 22:19:21 +00:00
Jakob Stoklund Olesen	00c4d94862	Basic rematerialization during splitting. Whenever splitting wants to insert a copy, it checks if the value can be rematerialized cheaply instead. Missing features: - Delete instructions when all uses have been rematerialized. - Truncate live ranges to the remaining uses after rematerialization. llvm-svn: 118702	2010-11-10 19:31:50 +00:00
Andrew Trick	9d60f59b55	RABasic is nearly functionally complete. There are a few remaining benchmarks hitting an assertion. Adds LiveIntervalUnion::collectInterferingVRegs. Fixes "late spilling" by checking for any unspillable live vregs among all physReg aliases. llvm-svn: 118701	2010-11-10 19:18:47 +00:00
Jakob Stoklund Olesen	e259b04730	Simplify the LiveRangeEdit::canRematerializeAt() interface a bit. llvm-svn: 118661	2010-11-10 01:05:12 +00:00
Rafael Espindola	3c0f11a265	Fixed version of 118639 with an extra assert to catch similar problems earlier. Implicit bool -> int conversions are evil! llvm-svn: 118651	2010-11-09 23:42:07 +00:00
Andrew Trick	374490bf92	Adds RABasic verification and tracing. (retry now that the windows build is green) llvm-svn: 118630	2010-11-09 21:04:34 +00:00
Matt Beaumont-Gay	0f262ff853	Add a trivial virtual dtor to AbstractRegisterDescription to appease -Wnon-virtual-dtor. llvm-svn: 118616	2010-11-09 19:56:25 +00:00
Andrew Trick	6b9d6df8c3	Reverting r118604. Windows build broke. llvm-svn: 118613	2010-11-09 19:47:51 +00:00
Andrew Trick	ce7b5df15e	Adds RABasic verification and tracing. llvm-svn: 118604	2010-11-09 19:01:17 +00:00
Dan Gohman	903935bf3e	Fix DAGCombiner to avoid folding a sext-in-reg or similar through a shl in order to fold it into a load. llvm-svn: 118471	2010-11-09 01:54:35 +00:00
Dale Johannesen	88f85df7f7	Fix an inline asm pasto from 117667; was preventing {i64, i64} from matching i128. llvm-svn: 118465	2010-11-09 01:15:07 +00:00
Andrew Trick	45ec210e3c	Adds support for spilling previously allocated live intervals to handle cases in which a register is unavailable for spill code. Adds LiveIntervalUnion::extract. While processing interferences on a live virtual register, reuses the same Query object for each physcial reg. llvm-svn: 118423	2010-11-08 18:02:08 +00:00
Che-Liang Chiou	4cc802839c	Add registry hook for assembly text output llvm-svn: 118394	2010-11-08 02:21:17 +00:00
Benjamin Kramer	96ac873014	Prune includes. llvm-svn: 118342	2010-11-06 11:45:59 +00:00
Duncan Sands	96b03ec2ce	When passing a parameter using the 'byval' mechanism, inline code needs to be used to perform the copy, which may be of lots of memory []. It would be good if the fall-back code generated something reasonable, i.e. did the copy in a loop, rather than vast numbers of loads and stores. Add a note about this. Currently target specific code seems to always kick in so this is more of a theoretical issue rather than a practical one now that X86 has been fixed. [] It's amazing how often people pass mega-byte long arrays by copy... llvm-svn: 118275	2010-11-05 15:20:29 +00:00
Rafael Espindola	3f4a79b243	Add 118023 back, but with proper spelling for .uleb128/.sleb128. llvm-svn: 118254	2010-11-04 18:17:08 +00:00
Rafael Espindola	618eefb925	Revert previous patch. Some targets don't support uleb and say they do :-( llvm-svn: 118250	2010-11-04 17:04:24 +00:00
Rafael Espindola	afb48cd73d	MCize. llvm-svn: 118249	2010-11-04 16:32:18 +00:00
Duncan Sands	3bf2a701a5	In the calling convention logic, ValVT is always a legal type, and as such can be represented by an MVT - the more complicated EVT is not needed. Use MVT for ValVT everywhere. llvm-svn: 118245	2010-11-04 10:49:57 +00:00
Jakob Stoklund Olesen	aa7acbe740	Disable fancy splitting during spilling unless -extra-spiller-splits is given. This way, InlineSpiller does the same amount of splitting as the standard spiller. Splitting should really be guided by the register allocator, and doesn't belong in the spiller at all. llvm-svn: 118216	2010-11-04 00:32:32 +00:00
Eric Christopher	3f1ac311ff	Just return undef for invalid masks or elts, and since we're doing that, just do it earlier too. llvm-svn: 118195	2010-11-03 20:44:42 +00:00
Jakob Stoklund Olesen	13bf5713f2	Let RegAllocBasic require MachineDominators - they are already available and splitting needs them. llvm-svn: 118194	2010-11-03 20:39:26 +00:00
Jakob Stoklund Olesen	7dc4b810ed	Tag debug output as regalloc llvm-svn: 118193	2010-11-03 20:39:23 +00:00
Duncan Sands	41edf30895	Simplify uses of MVT and EVT. An MVT can be compared directly with a SimpleValueType, while an EVT supports equality and inequality comparisons with SimpleValueType. llvm-svn: 118169	2010-11-03 12:17:33 +00:00
Duncan Sands	f6e5e02c9b	Inside the calling convention logic LocVT is always a simple value type, so there is no point in passing it around using an EVT. Use the simpler MVT everywhere. Rather than trying to propagate this information maximally in all the code that using the calling convention stuff, I chose to do a mainly low impact change instead. llvm-svn: 118167	2010-11-03 11:35:31 +00:00
Eric Christopher	719d9d324b	If we have an undef mask our Elt will be -1 for our access, handle this by using an undef as a pointer. Fixes rdar://8625016 llvm-svn: 118164	2010-11-03 09:36:40 +00:00
Dan Gohman	8071a75d31	Fix DAGCombiner to avoid going into an infinite loop when it encounters (and:i64 (shl:i64 (load:i64), 1), 0xffffffff). This fixes rdar://8606584. llvm-svn: 118143	2010-11-03 01:47:46 +00:00
Evan Cheng	67db408634	Two sets of changes. Sorry they are intermingled. 1. Fix pre-ra scheduler so it doesn't try to push instructions above calls to "optimize for latency". Call instructions don't have the right latency and this is more likely to use introduce spills. 2. Fix if-converter cost function. For ARM, it should use instruction latencies, not # of micro-ops since multi-latency instructions is completely executed even when the predicate is false. Also, some instruction will be "slower" when they are predicated due to the register def becoming implicit input. rdar://8598427 llvm-svn: 118135	2010-11-03 00:45:17 +00:00
Andrew Trick	f861447393	Fixes <rdar://problem/8612856>: During postRAsched, the antidependence breaker needs to check all definitions of the antidepenent register to avoid multiple defs of the same new register. llvm-svn: 118032	2010-11-02 18:16:45 +00:00
Devang Patel	37fa9bb66a	Simplify. llvm-svn: 118027	2010-11-02 17:37:00 +00:00
Devang Patel	e3575ef54f	If value map does not have register for an argument then try to find frame index before giving up. llvm-svn: 118022	2010-11-02 17:19:03 +00:00
Devang Patel	efd9ac540a	Use frameindex, if available, as a last resort to emit debug info for a parameter. llvm-svn: 118020	2010-11-02 17:01:30 +00:00
Jakob Stoklund Olesen	840357ca6a	Don't try to split weird critical edges that really aren't: BB#1: derived from LLVM BB %bb.nph28 Live Ins: %AL Predecessors according to CFG: BB#0 TEST8rr %reg16384<kill>, %reg16384, %EFLAGS<imp-def>; GR8:%reg16384 JNE_4 <BB#2>, %EFLAGS<imp-use,kill> JMP_4 <BB#2> Successors according to CFG: BB#2 BB#2 These double CFG edges only ever occur in bugpoint-generated code, so there is no need to attempt something clever. llvm-svn: 117992	2010-11-02 00:58:37 +00:00
Jakob Stoklund Olesen	56f67c7d02	MachineLICM should not claim to be preserving the CFG when it can split critical edges on demand. llvm-svn: 117982	2010-11-01 23:59:55 +00:00
Jakob Stoklund Olesen	b0f64b3d2b	Be more precise about verifying missing kill flags. It is legal for an instruction to have two operands using the same register, only one a kill. This is interpreted as a kill. llvm-svn: 117981	2010-11-01 23:59:53 +00:00
Jakob Stoklund Olesen	a052004a89	When inserting copies during splitting, always use the parent register as the source, and let rewrite() clean it up. This way, kill flags on the inserted copies are fixed as well during rewrite(). We can't just assume that all the copies we insert are going to be kills since critical edges into loop headers sometimes require both source and dest to be live out of a block. llvm-svn: 117980	2010-11-01 23:59:48 +00:00
Jakob Stoklund Olesen	0ed56c87e4	Add kill flag verification. At least X86FloatingPoint requires correct kill flags after register allocation, and targets using register scavenging benefit. Conservative kill flags are not enough. llvm-svn: 117960	2010-11-01 21:51:31 +00:00
Jakob Stoklund Olesen	78ba455b51	Update kill flags while rewriting instructions after splitting. llvm-svn: 117959	2010-11-01 21:51:29 +00:00
Bill Wendling	4340c9449a	When we look at instructions to convert to setting the 's' flag, we need to look at more than those which define CPSR. You can have this situation: (1) subs ... (2) sub r6, r5, r4 (3) movge ... (4) cmp r6, 0 (5) movge ... We cannot convert (2) to "subs" because (3) is using the CPSR set by (1). There's an analogous situation here: (1) sub r1, r2, r3 (2) sub r4, r5, r6 (3) cmp r4, ... (5) movge ... (6) cmp r1, ... (7) movge ... We cannot convert (1) to "subs" because of the intervening use of CPSR. llvm-svn: 117950	2010-11-01 20:41:43 +00:00
Jakob Stoklund Olesen	66012df062	Don't assign new registers created during a split to the same stack slot, but give them individual stack slots once the are actually spilled. llvm-svn: 117945	2010-11-01 19:49:57 +00:00
Jakob Stoklund Olesen	b4a55702b7	Add basic LiveStacks verification. When an instruction refers to a spill slot with a LiveStacks entry, check that the spill slot is live at the instruction. llvm-svn: 117944	2010-11-01 19:49:52 +00:00
Bill Wendling	70856991c5	The testcase is now XFAILed. Sorry about the breakage. llvm-svn: 117904	2010-11-01 05:50:55 +00:00
Eric Christopher	7295ed492c	Revert r117876 for now, it's causing more testsuite failures. llvm-svn: 117879	2010-10-31 22:42:55 +00:00
Bill Wendling	13936421e6	Disable the peephole optimizer until 186.crafty on armv6 is fixed. This is what looks like is happening: Without the peephole optimizer: (1) sub r6, r6, #32 orr r12, r12, lr, lsl r9 orr r2, r2, r3, lsl r10 (x) cmp r6, #0 ldr r9, LCPI2_10 ldr r10, LCPI2_11 (2) sub r8, r8, #32 (a) movge r12, lr, lsr r6 (y) cmp r8, #0 LPC2_10: ldr lr, [pc, r10] (b) movge r2, r3, lsr r8 With the peephole optimizer: ldr r9, LCPI2_10 ldr r10, LCPI2_11 (1) subs r6, r6, #32 (2) subs r8, r8, #32 (a) movge r12, lr, lsr r6 (b) movge r2, r3, lsr r8 (1) is used by (x) for the conditional move at (a). (2) is used by (y) for the conditional move at (b). After the peephole optimizer, these the flags resulting from (1) are ignored and only the flags from (2) are considered for both conditional moves. llvm-svn: 117876	2010-10-31 22:07:12 +00:00
Nicolas Geoffray	6889997474	Attach a GCModuleInfo to a MachineFunction. llvm-svn: 117867	2010-10-31 20:38:38 +00:00
Jakob Stoklund Olesen	b3b1db4e67	Include MachineBasicBlock numbers in viewCFG() output. llvm-svn: 117765	2010-10-30 01:26:19 +00:00
Jakob Stoklund Olesen	035667c0d6	Make sure copies are inserted after any exception handling labels at the top of a basic block. llvm-svn: 117764	2010-10-30 01:26:16 +00:00
Jakob Stoklund Olesen	0ab92619d0	Add SkipPHIsAndLabels from PHIElimination to MachineBasicBlock. It is needed elsewhere. llvm-svn: 117763	2010-10-30 01:26:14 +00:00
Jakob Stoklund Olesen	b751dffc9f	Disable more of physical register live intervals verification. llvm-svn: 117762	2010-10-30 01:26:11 +00:00
Jakob Stoklund Olesen	4425035bfd	Print out register class of spilled register. llvm-svn: 117761	2010-10-30 01:26:09 +00:00
Evan Cheng	d81c33d91e	Teach machine cse to eliminate instructions with multiple physreg uses and defs. rdar://8610857. llvm-svn: 117745	2010-10-29 23:36:03 +00:00
Bob Wilson	d67dddb134	Remove DAG combiner patch to fold vector splats. Instcombiner does it now. llvm-svn: 117720	2010-10-29 22:03:02 +00:00
Jakob Stoklund Olesen	469a9ef414	Fix sign error. llvm-svn: 117677	2010-10-29 18:21:18 +00:00
Evan Cheng	392d2cbdcc	Avoiding overly aggressive latency scheduling. If the two nodes share an operand and one of them has a single use that is a live out copy, favor the one that is live out. Otherwise it will be difficult to eliminate the copy if the instruction is a loop induction variable update. e.g. BB: sub r1, r3, #1 str r0, [r2, r3] mov r3, r1 cmp bne BB => BB: str r0, [r2, r3] sub r3, r3, #1 cmp bne BB This fixed the recent 256.bzip2 regression. llvm-svn: 117675	2010-10-29 18:09:28 +00:00
Jakob Stoklund Olesen	1f2fb9a020	Don't transfer unused values to the new intervals formed by splitting. llvm-svn: 117673	2010-10-29 17:47:49 +00:00
Benjamin Kramer	d84e7a7b91	Silence Release build warnings. llvm-svn: 117671	2010-10-29 17:40:05 +00:00
Jakob Stoklund Olesen	d56c4457a6	Teach ConnectedVNInfoEqClasses::Classify to deal with unused values. We don't want unused values forming their own equivalence classes, so we lump them all together in one class, and then merge them with the class of the last used value. llvm-svn: 117670	2010-10-29 17:37:29 +00:00
Jakob Stoklund Olesen	e577489ba5	Never propagate the idom value out of a block that defines its own value. llvm-svn: 117669	2010-10-29 17:37:25 +00:00
John Thompson	6115a7f1d4	Inline asm multiple alternative constraints development phase 2 - improved basic logic, added initial platform support. llvm-svn: 117667	2010-10-29 17:29:13 +00:00
Bill Wendling	d9412db2c8	This may be an ARM target, so check for _Unwind_SjLj_Resume. llvm-svn: 117643	2010-10-29 07:46:01 +00:00
Jakob Stoklund Olesen	b7a4b7a7b0	Fix broken equivalence class calculation. We could probably also use EquvivalenceClasses.h except it looks like overkill when elements are continuous integers. llvm-svn: 117631	2010-10-29 00:40:59 +00:00
Jakob Stoklund Olesen	1210a5145a	Print out the connected components in the verifier after complaining about their multiplicity. llvm-svn: 117630	2010-10-29 00:40:57 +00:00
Jakob Stoklund Olesen	e446520687	Run a verification pass before any splitting to better distribute blame. llvm-svn: 117629	2010-10-29 00:40:55 +00:00
Devang Patel	bf9c3c8947	Ignore empty blocks. llvm-svn: 117615	2010-10-28 22:11:59 +00:00
Jakob Stoklund Olesen	c570151dc2	One day, physical register live ranges will be sensible. llvm-svn: 117602	2010-10-28 20:44:22 +00:00
Jakob Stoklund Olesen	ccbfbefd93	Replace SplitKit SSA update with an iterative algorithm very similar to the one in SSAUpdaterImpl.h Verifying live intervals revealed that the old method was completely wrong, and we need an iterative approach to calculating PHI placemant. Fortunately, we have MachineDominators available, so we don't have to compute that over and over like SSAUpdaterImpl.h must. Live-out values are cached between calls to mapValue() and computed in a greedy way, so most calls will be working with very small block sets. Thanks to Bob for explaining how this should work. llvm-svn: 117599	2010-10-28 20:34:52 +00:00
Jakob Stoklund Olesen	6f4262448e	Make MachineDominators available for SplitEditor. We are going to need it for proper SSA updating. This doesn't cause MachineDominators to be recomputed since we are already requiring MachineLoopInfo which uses dominators as well. llvm-svn: 117598	2010-10-28 20:34:50 +00:00
Jakob Stoklund Olesen	35f9c58245	Add a temporary command line option to verify machine code after each spill or split. llvm-svn: 117597	2010-10-28 20:34:47 +00:00
Devang Patel	a496603faf	Do not work too hard to find type's file info. There is a special field to record file info. llvm-svn: 117588	2010-10-28 19:50:08 +00:00
Devang Patel	0e43a2d84f	Technically DIFile scope should also be handled here. llvm-svn: 117563	2010-10-28 17:30:52 +00:00
Bob Wilson	65124cd7c7	Teach the DAG combiner to fold a splat of a splat. Radar 8597790. Also do some minor refactoring to reduce indentation. llvm-svn: 117558	2010-10-28 17:06:14 +00:00
Evan Cheng	bc4588c439	Re-commit 117518 and 117519 now that ARM MC test failures are out of the way. llvm-svn: 117531	2010-10-28 06:47:08 +00:00
Evan Cheng	fdc80a0316	Revert 117518 and 117519 for now. They changed scheduling and cause MC tests to fail. Ugh. llvm-svn: 117520	2010-10-28 02:00:25 +00:00
Evan Cheng	a07cd58978	Fix a major bug in operand latency computation. The use index must be adjusted by the number of defs first for it to match the instruction itinerary. llvm-svn: 117518	2010-10-28 01:46:29 +00:00
Evan Cheng	77693156fb	Putting r117193 back except for the compile time cost. Rather than assuming fallthroughs uses all registers, just gather the union of all successor liveins. llvm-svn: 117506	2010-10-27 23:17:17 +00:00
Michael J. Spencer	bf82646290	COFF: Add IMAGE_SCN_MEM_READ to text sections. There are currently 100 references to COFF::IMAGE_SCN in 6 files and 11 different functions. Section to attribute mapping really needs to happen in one place to avoid problems like this. llvm-svn: 117473	2010-10-27 18:52:29 +00:00
Michael J. Spencer	fc69783598	Fix whitespace. llvm-svn: 117472	2010-10-27 18:52:20 +00:00
Jim Grosbach	ddd67f8e88	Formatting. llvm-svn: 117453	2010-10-27 16:30:18 +00:00
Jakob Stoklund Olesen	66180b2c06	Handle critical loop predecessors by making both inside and outside registers live out. This doesn't prevent us from inserting a loop preheader later on, if that is better. llvm-svn: 117424	2010-10-27 00:39:07 +00:00
Jakob Stoklund Olesen	514eb703b8	Compute critical loop predecessors in the same way as critical loop exits. Critical edges going into a loop are not as bad as critical exits. We can handle them by splitting the critical edge, or by having both inside and outside registers live out of the predecessor. llvm-svn: 117423	2010-10-27 00:39:05 +00:00
Jakob Stoklund Olesen	4701c56446	Physical registers trivially have multiple connected components all the time. Only virtuals should be requires to be connected. llvm-svn: 117422	2010-10-27 00:39:01 +00:00
Dale Johannesen	e7f07349e4	Use a MemIntrinsicSDNode for ISD::PREFETCH, which touches memory, so a MachineMemOperand is useful (not propagated into the MachineInstr yet). No functional change except for dump output. llvm-svn: 117413	2010-10-26 23:11:10 +00:00
Andrew Trick	6363e80ded	Remove the vector of live vregs. I thought we would need to track them, but hopefully we won't. And this is not the right data structure to do it anyway. llvm-svn: 117412	2010-10-26 22:58:24 +00:00
Jakob Stoklund Olesen	bb4b26c42b	After splitting, compute connected components of all new registers, not just for the remainder register. Example: bb0: x = 1 bb1: use(x) ... x = 2 jump bb1 When x is isolated in bb1, the inner part breaks into two components, x1 and x2: bb0: x0 = 1 bb1: x1 = x0 use(x1) ... x2 = 2 x0 = x2 jump bb1 llvm-svn: 117408	2010-10-26 22:36:09 +00:00
Jakob Stoklund Olesen	3bf3211458	Verify that live intervals are connected. If there are multiple connected components, each should get its own virtual register. llvm-svn: 117407	2010-10-26 22:36:07 +00:00
Jakob Stoklund Olesen	fa777d622d	Call RenumberValues for all new registers created during splitting. This is necessary to get correct hasPHIKill flags. llvm-svn: 117406	2010-10-26 22:36:05 +00:00
Jakob Stoklund Olesen	0007e9a345	Preserve PHIDef bits in cloned values during splitting. llvm-svn: 117405	2010-10-26 22:36:02 +00:00
Devang Patel	7796146304	Assign source ordering to nodes created for StoreInst. llvm-svn: 117404	2010-10-26 22:14:52 +00:00
Jakob Stoklund Olesen	3a4c0c13eb	Teach MachineBasicBlock::print() to annotate instructions and blocks with SlotIndexes when available. llvm-svn: 117392	2010-10-26 20:21:46 +00:00
Jakob Stoklund Olesen	195132633a	Remmeber to print full live interval on verification error. llvm-svn: 117391	2010-10-26 20:21:43 +00:00
Andrew Trick	06fb7ed472	Jakob's review of the basic register allocator. llvm-svn: 117384	2010-10-26 18:34:01 +00:00
Devang Patel	e544bb3842	s/beginScope/beginInstruction/g s/endScope/endInstruction/g llvm-svn: 117376	2010-10-26 17:49:02 +00:00
Jakob Stoklund Olesen	de018e2d8e	Don't verify physical registers going into landing pads. Magic is happening that we don't understand. llvm-svn: 117370	2010-10-26 16:49:23 +00:00
Evan Cheng	324e678bb7	Use instruction itinerary to determine what instructions are 'cheap'. llvm-svn: 117348	2010-10-26 02:08:50 +00:00
Nick Lewycky	f7dddd658f	For statistics that are only used in functions declared in !NDEBUG, wrap the declarations in !NDEBUG to avoid -Wunused-variable warnings. Patch by Matt Beaumont-Gay! llvm-svn: 117345	2010-10-26 00:51:57 +00:00
Jakob Stoklund Olesen	b0c10ed95a	InlineSpiller can also update LiveStacks. llvm-svn: 117338	2010-10-26 00:11:35 +00:00
Jakob Stoklund Olesen	3988c3fb55	Make the spiller responsible for updating the LiveStacks analysis. llvm-svn: 117337	2010-10-26 00:11:33 +00:00
Bob Wilson	309484bb46	When the "true" and "false" blocks of a diamond if-conversion are the same, do not double-count the duplicate instructions by counting once from the beginning and again from the end. Keep track of where the duplicates from the beginning ended and don't go past that point when counting duplicates at the end. Radar 8589805. This change causes one of the MC/ARM/simple-fp-encoding tests to produce different (better!) code without the vmovne instruction being tested. I changed the test to produce vmovne and vmoveq instructions but moving between register files in the opposite direction. That's not quite the same but predicated versions of those instructions weren't being tested before, so at least the test coverage is not any worse, just different. llvm-svn: 117333	2010-10-26 00:02:24 +00:00
Bob Wilson	685e143630	Change if-conversion to keep track of the extra cost due to microcoded instructions separately from the count of non-predicated instructions. The instruction count is used in places to determine how many instructions to copy, predicate, etc. and things get confused if that count includes the extra cost for microcoded ops. llvm-svn: 117332	2010-10-26 00:02:21 +00:00
Evan Cheng	2439e51388	Neuter r117193 as it causes significant post-ra scheduler compile time regression. llvm-svn: 117329	2010-10-25 23:56:21 +00:00
Devang Patel	fa145a94d1	Simplify. Do not count use of sdisel for single call instruction. llvm-svn: 117316	2010-10-25 21:31:46 +00:00
Devang Patel	3779ad5d59	Add counters to count basic blocks and machine basic blocks with out of order line number info. Add counters to count how many basic blocks are entirely selected by fastisel. llvm-svn: 117310	2010-10-25 20:55:43 +00:00
Devang Patel	24a6b791b6	Add simple counter to count no. of basic blocks without any line number information. At -O0, these basic block coule cause less than optimial debugging experience. llvm-svn: 117307	2010-10-25 20:45:32 +00:00
Jakob Stoklund Olesen	7aeb8acd65	In which I learn how to forward declare template classes. llvm-svn: 117272	2010-10-25 17:27:30 +00:00
Chandler Carruth	7dd652736f	Move the remaining attribute macros to systematic names based on the attribute name and prefixed with 'LLVM_'. llvm-svn: 117203	2010-10-23 08:40:19 +00:00
Chandler Carruth	25324e76ae	Fix a likely bug in an assertion by adding parentheses around '\|\|'. This bug was found by a GCC warning. ;] llvm-svn: 117199	2010-10-23 07:46:14 +00:00
Evan Cheng	030f55c496	Properly model the latency of register defs which are 1) function returns or 2) live-outs. Previously the post-RA schedulers completely ignore these dependencies since returns, branches, etc. are all scheduling barriers. This patch model the latencies between instructions being scheduled and the barriers. It also handle calls by marking their register uses. llvm-svn: 117193	2010-10-23 02:10:46 +00:00
Jakob Stoklund Olesen	964f6a6be6	Verify LiveIntervals against the CFG, ensuring that live-in values are live-out of all predecessors. llvm-svn: 117191	2010-10-23 00:49:09 +00:00
Andrew Trick	4043309a54	Nonvirtual dtor that was accessible enough to be bad. llvm-svn: 117180	2010-10-22 23:33:19 +00:00
Andrew Trick	7a1dadd47d	This is a prototype of an experimental register allocation framework. It's purpose is not to improve register allocation per se, but to make it easier to develop powerful live range splitting. I call it the basic allocator because it is as simple as a global allocator can be but provides the building blocks for sophisticated register allocation with live range splitting. A minimal implementation is provided that trivially spills whenever it runs out of registers. I'm checking in now to get high-level design and style feedback. I've only done minimal testing. The next step is implementing a "greedy" allocation algorithm that does some register reassignment and makes better splitting decisions. llvm-svn: 117174	2010-10-22 23:09:15 +00:00
Jakob Stoklund Olesen	c78f761faf	Add more verification of LiveIntervals. llvm-svn: 117170	2010-10-22 22:48:58 +00:00
Jakob Stoklund Olesen	68258f467a	Be more strict about detecting multi-use blocks for isolation. When a block has exactly two uses and the register is both live-in and live-out, don't isolate the block. We would be inserting two copies, so we haven't really made any progress. If the live-in and live-out values separate into disconnected components after splitting, we would be making progress. We can't detect that for now. llvm-svn: 117169	2010-10-22 22:48:56 +00:00
Evan Cheng	2c2dcc69d8	Unbreak build. llvm-svn: 117155	2010-10-22 21:49:09 +00:00
Evan Cheng	20b70697bb	Transfer implicit ops when forming load multiple and return instructions. llvm-svn: 117151	2010-10-22 21:29:58 +00:00
Jakob Stoklund Olesen	de1439ce2b	Be more strict when detecting critical edges before loop splitting. An exit block with a critical edge must only have predecessors in the loop, or just before the loop. This guarantees that the inserted copies in the loop predecessors dominate the exit block. llvm-svn: 117144	2010-10-22 20:28:23 +00:00
Jakob Stoklund Olesen	967bcbf445	Add print methods llvm-svn: 117143	2010-10-22 20:28:21 +00:00
Michael J. Spencer	08602f9803	X86: Base _fltused on the FunctionType of the called value instead of the potentially null "CalledFunction". Thanks Duncan! This is needed for indirect calls. llvm-svn: 117061	2010-10-21 20:49:23 +00:00
Jakob Stoklund Olesen	4794d28044	Don't include the destination interval in the union when computing Parent - union(Y, ...). Doh. llvm-svn: 117042	2010-10-21 18:47:08 +00:00
Jakob Stoklund Olesen	82da267156	Permit landing pad successor blocks when verifying basic blocks that end in an unconditional branch. llvm-svn: 117041	2010-10-21 18:47:06 +00:00
Duncan Sands	c40461600a	The variable liTRC is not used for anything useful, zap it (gcc-4.6 warning). llvm-svn: 117022	2010-10-21 16:04:43 +00:00
Duncan Sands	0c166168a0	Remove some variables that are never really used (gcc-4.6 warns about these). llvm-svn: 117021	2010-10-21 16:03:28 +00:00
Michael J. Spencer	b9cffadc06	CodeGen-Windows: Only emit _fltused if a VarArg function is called with floating point args. This should be the minimum set of functions that could possibly need it. llvm-svn: 116978	2010-10-21 00:08:21 +00:00
Jakob Stoklund Olesen	03fd8bf61c	Remember to keep track of rematted values. llvm-svn: 116962	2010-10-20 22:50:42 +00:00
Evan Cheng	9a06e4c7c8	More accurate estimate / tracking of register pressure. - Initial register pressure in the loop should be all the live defs into the loop. Not just those from loop preheader which is often empty. - When an instruction is hoisted, update register pressure from loop preheader to the original BB. - Treat only use of a virtual register as kill since the code is still SSA. llvm-svn: 116956	2010-10-20 22:03:58 +00:00

... 5 6 7 8 9 ...

11355 Commits