llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-28 06:27:16 +00:00

Author	SHA1	Message	Date
Jakob Stoklund Olesen	b2c7febf35	Rename CT -> MTM. MachineTraceMetrics is abbreviated MTM. llvm-svn: 161072	2012-07-31 20:25:13 +00:00
Manman Ren	3d7e85d5b8	MachineSink: Sort the successors before trying to find SuccToSinkTo. One motivating example is to sink an instruction from a basic block which has two successors: one outside the loop, the other inside the loop. We should try to sink the instruction outside the loop. rdar://11980766 llvm-svn: 161062	2012-07-31 18:10:39 +00:00
Micah Villmow	122d115419	Conform to LLVM coding style. llvm-svn: 161061	2012-07-31 18:07:43 +00:00
Micah Villmow	8b7fb5e605	Don't generate ordered or unordered comparison operations if it is not legal to do so. llvm-svn: 161053	2012-07-31 16:48:03 +00:00
Jakob Stoklund Olesen	ed1a4d695a	Clear kill flags in removeCopyByCommutingDef(). We are extending live ranges, so kill flags are not accurate. They aren't needed until they are recomputed after RA anyway. <rdar://problem/11950722> llvm-svn: 161023	2012-07-31 02:47:24 +00:00
Manman Ren	3769ac64a6	Reverse order of the two branches at end of a basic block if it is profitable. We branch to the successor with higher edge weight first. Convert from je LBB4_8 --> to outer loop jmp LBB4_14 --> to inner loop to jne LBB4_14 jmp LBB4_8 PR12750 rdar: 11393714 llvm-svn: 161018	2012-07-31 01:11:07 +00:00
Andrew Trick	2773dbddf3	Use the latest MachineRegisterInfo APIs. No functionality. llvm-svn: 161010	2012-07-30 23:48:17 +00:00
Andrew Trick	dc8c12f62b	Inline MachineRegisterInfo::hasOneUse llvm-svn: 161007	2012-07-30 23:48:12 +00:00
Jakob Stoklund Olesen	12a22d4f0f	Avoid looking at stale data in verifyAnalysis(). llvm-svn: 161004	2012-07-30 23:15:12 +00:00
Jakob Stoklund Olesen	a9e1543215	Allow traces to enter nested loops. This lets traces include the final iteration of a nested loop above the center block, and the first iteration of a nested loop below the center block. We still don't allow traces to contain backedges, and traces are truncated where they would leave a loop, as seen from the center block. llvm-svn: 161003	2012-07-30 23:15:10 +00:00
Jakob Stoklund Olesen	e9523d88c3	Clarify invalidation strategy in comment. llvm-svn: 160997	2012-07-30 21:16:22 +00:00
Jakob Stoklund Olesen	2a129fa92c	Assert that all trace candidate blocks have been visited by the PO. When computing a trace, all the candidates for pred/succ must have been visited. Filter out back-edges first, though. The PO traversal ignores them. Thanks to Andy for spotting this in review. llvm-svn: 160995	2012-07-30 21:10:27 +00:00
Jakob Stoklund Olesen	d91215215c	Hook into PassManager's analysis verification. By overriding Pass::verifyAnalysis(), the pass contents will be verified by the pass manager. llvm-svn: 160994	2012-07-30 20:57:50 +00:00
Pete Cooper	e45da564cf	Consider address spaces for hashing and CSEing DAG nodes. Otherwise two loads from different x86 segments but the same address would get CSEd llvm-svn: 160987	2012-07-30 20:23:19 +00:00
Jakob Stoklund Olesen	7bbe0b0328	Add MachineInstr::isTransient(). This is a cleaned up version of the isFree() function in MachineTraceMetrics.cpp. Transient instructions are very unlikely to produce any code in the final output. Either because they get eliminated by RegisterCoalescing, or because they are pseudo-instructions like labels and debug values. llvm-svn: 160977	2012-07-30 18:34:14 +00:00
Jakob Stoklund Olesen	c3b8765d57	Add MachineTraceMetrics::verify(). This function verifies the consistency of cached data in the MachineTraceMetrics analysis. llvm-svn: 160976	2012-07-30 18:34:11 +00:00
Jakob Stoklund Olesen	6653a31973	Verify that the CFG hasn't changed during invalidate(). The MachineTraceMetrics analysis must be invalidated before modifying the CFG. This will catch some of the violations of that rule. llvm-svn: 160969	2012-07-30 17:36:49 +00:00
Jakob Stoklund Olesen	4f3254f73c	Add MachineBasicBlock::isPredecessor(). A->isPredecessor(B) is the same as B->isSuccessor(A), but it can tolerate a B that is null or dangling. This shouldn't happen normally, but it it useful for verification code. llvm-svn: 160968	2012-07-30 17:36:47 +00:00
Manman Ren	ceef7c4d9b	Revert r160920 and r160919 due to dragonegg and clang selfhost failure llvm-svn: 160927	2012-07-29 02:44:09 +00:00
Manman Ren	ea77f9076b	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. rdar://10554090 and rdar://11873276 llvm-svn: 160919	2012-07-28 16:48:01 +00:00
Andrew Trick	0320969afa	Reenable a basic SSA DAG builder optimization. Jakob fixed ProcessImplicifDefs in r159149. llvm-svn: 160910	2012-07-28 01:48:15 +00:00
Jakob Stoklund Olesen	0eacb18967	Add more debug output to MachineTraceMetrics. llvm-svn: 160905	2012-07-27 23:58:38 +00:00
Jakob Stoklund Olesen	fefd43f7a9	Keep track of the head and tail of the trace through each block. This makes it possible to quickly detect blocks that are outside the trace. llvm-svn: 160904	2012-07-27 23:58:36 +00:00
Eric Christopher	acd91c534d	Add a DW_AT_high_pc for CUs that are a single address range. Update all tests accordingly. Fixes PR13351. Patch by shinichiro hamaji! llvm-svn: 160899	2012-07-27 22:00:05 +00:00
Jakob Stoklund Olesen	88319a3e66	Also compute register mask lists under -new-live-intervals. llvm-svn: 160898	2012-07-27 21:56:39 +00:00
Jakob Stoklund Olesen	8e957f3c0b	Eliminate the IS_PHI_DEF flag and VNInfo::setIsPHIDef(). A value number is a PHI def if and only if it begins at a block boundary. This can be derived from the def slot, a separate flag is not necessary. llvm-svn: 160893	2012-07-27 21:11:14 +00:00
Jakob Stoklund Olesen	d60f4942e6	Add a -new-live-intervals experimental option. This option replaces the existing live interval computation with one based on LiveRangeCalc.cpp. The new algorithm does not depend on LiveVariables, and it can be run at any time, before or after leaving SSA form. llvm-svn: 160892	2012-07-27 20:58:46 +00:00
Jakob Stoklund Olesen	03a59af504	Add <imp-def> of super-register when lowering SUBREG_TO_REG. Patch by Tyler Nowicki! llvm-svn: 160888	2012-07-27 20:19:49 +00:00
Jakob Stoklund Olesen	f953068467	Use an otherwise unused variable. llvm-svn: 160798	2012-07-26 19:42:56 +00:00
Jakob Stoklund Olesen	0d3c0a9aea	Start scaffolding for a MachineTraceMetrics analysis pass. This is still a work in progress. Out-of-order CPUs usually execute instructions from multiple basic blocks simultaneously, so it is necessary to look at longer traces when estimating the performance effects of code transformations. The MachineTraceMetrics analysis will pick a typical trace through a given basic block and provide performance metrics for the trace. Metrics will include: - Instruction count through the trace. - Issue count per functional unit. - Critical path length, and per-instruction 'slack'. These metrics can be used to determine the performance limiting factor when executing the trace, and how it will be affected by a code transformation. Initially, this will be used by the early if-conversion pass. llvm-svn: 160796	2012-07-26 18:38:11 +00:00
Dan Gohman	7ff5ef1757	Add a floor intrinsic. llvm-svn: 160791	2012-07-26 17:43:27 +00:00
Manman Ren	6b3550a998	Disable rematerialization in TwoAddressInstructionPass. It is redundant; RegisterCoalescer will do the remat if it can't eliminate the copy. Collected instruction counts before and after this. A few extra instructions are generated due to spilling but it is normal to see these kinds of changes with almost any small codegen change, according to Jakob. This also fixed rdar://11830760 where xor is expected instead of movi0. llvm-svn: 160749	2012-07-25 18:28:13 +00:00
Jakob Stoklund Olesen	e470961b23	Preserve 2-addr constraints in ConnectedVNInfoEqClasses. When a live range splits into multiple connected components, we would arbitrarily assign <undef> uses to component 0. This is wrong when the use is tied to a def that gets assigned to a different component: %vreg69<def> = ADD8ri %vreg68<undef>, 1 The use and def must get the same virtual register. Fix this by assigning <undef> uses to the same component as the value defined by the instruction, if any: %vreg69<def> = ADD8ri %vreg69<undef>, 1 This fixes PR13402. The PR has a test case which I am not including because it is unlikely to keep exposing this behavior in the future. llvm-svn: 160739	2012-07-25 17:15:15 +00:00
Jakob Stoklund Olesen	45755341ad	Verify two-address constraints more carefully. Include <undef> operands and virtual registers after leaving SSA form. llvm-svn: 160734	2012-07-25 16:49:11 +00:00
Craig Topper	227c1316f4	Change llvm_unreachable in SplitVectorOperand to report_fatal_error. Keeps release builds from crashing if code uses an intrinsic with an illegal type. llvm-svn: 160661	2012-07-24 04:11:21 +00:00
Sylvestre Ledru	bf8acb65ac	Fix a typo (the the => the) llvm-svn: 160621	2012-07-23 08:51:15 +00:00
Nadav Rotem	180a9e3758	Fixed DAGCombine optimizations which generate select_cc for targets that do not support it (X86 does not lower select_cc). PR: 13428 Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160619	2012-07-23 07:59:50 +00:00
Craig Topper	7bb456e013	Tidy up. Fix indentation and remove trailing whitespace. llvm-svn: 160617	2012-07-23 05:38:07 +00:00
Craig Topper	63f980aea7	Change llvm_unreachable in SplitVectorResult to report_fatal_error. Keeps release builds from crashing if code uses an intrinsic with an illegal type. For instance 256-bit AVX intrinsics without having AVX enabled. llvm-svn: 160616	2012-07-23 04:34:49 +00:00
Benjamin Kramer	87e459b047	Remove unused private member variables uncovered by the recent changes to clang's -Wunused-private-field. llvm-svn: 160583	2012-07-20 22:05:57 +00:00
Jakob Stoklund Olesen	e3c4840e77	Avoid folding loads that are unsafe to move. LiveRangeEdit::foldAsLoad() can eliminate a register by folding a load into its only use. Only do that when the load is safe to move, and it won't extend any live ranges. This fixes PR13414. llvm-svn: 160575	2012-07-20 21:29:31 +00:00
Jakob Stoklund Olesen	3eb650542a	Split loop exiting edges more aggressively. PHIElimination splits critical edges when it predicts it can resolve interference and eliminate copies. It doesn't split the edge if the interference wouldn't be resolved anyway because the phi-use register is live in the critical edge anyway. Teach PHIElimination to split loop exiting edges with interference, even if it wouldn't resolve the interference. This removes the necessary copies from the loop, which is still an improvement from injecting the copies into the loop. The test case demonstrates the improvement. Before: LBB0_1: cmpb $0, (%rdx) leaq 1(%rdx), %rdx movl %esi, %eax je LBB0_1 After: LBB0_1: cmpb $0, (%rdx) leaq 1(%rdx), %rdx je LBB0_1 movl %esi, %eax llvm-svn: 160571	2012-07-20 20:49:53 +00:00
Pete Cooper	4a544942c5	Fix crash in machine verifier when trying to print the def of a register which has no def llvm-svn: 160531	2012-07-19 23:40:38 +00:00
Benjamin Kramer	ec66856001	Replace some explicit compare loops with std::equal. No functionality change. llvm-svn: 160501	2012-07-19 10:46:05 +00:00
Galina Kistanova	c4a2a7cce5	Fixed few warnings. llvm-svn: 160493	2012-07-19 04:50:12 +00:00
Bill Wendling	343eebdbe4	Remove tabs. llvm-svn: 160475	2012-07-19 00:04:14 +00:00
Chandler Carruth	5d1c4f0605	Fix a somewhat nasty crasher in PR13378. This crashes inside of LiveIntervals due to the two-addr pass generating bogus MI code. The crux of the issue was a loop nesting problem. The intent of the code which attempts to transform instructions before converting them to two-addr form is to defer and reprocess any transformed instructions as the second processing is likely to have more opportunities to coalesce copies, etc. Unfortunately, there was one section of processing that was not deferred -- the INSERT_SUBREG rewriting. Due to quirks of how this rewriting proceeded, not only did it occur early, it removed the bits of information needed for the deferred processing to correctly generate the necessary two address form (specifically inserting a copy), but didn't trigger any immediate assertions and produced what appeared to be already valid two-address from code. Thus, the assertion only fired much later in the pipeline. The fix is to hoist the transformation logic up layer to where it can more firmly defer all further processing, and to teach the normal processing to handle an edge case previously handled as part of the transformation logic. This edge case (already matched tied register operands) needs to not defer any steps. As has been brought up repeatedly in the process: wow does this code need refactoring. I may squeeze in some time to at least bring sanity to this loop... but wow... =] Thanks to Jakob for helpful hints on the way here, and the review. llvm-svn: 160443	2012-07-18 18:58:22 +00:00
Nuno Lopes	99e140d517	ignore 'invoke @llvm.donothing', but still keep the edge to the continuation BB llvm-svn: 160411	2012-07-18 00:07:17 +00:00
Evan Cheng	5e82ad04d5	Back out r160101 and instead implement a dag combine to recover from instcombine transformation. llvm-svn: 160387	2012-07-17 18:54:11 +00:00
Jakob Stoklund Olesen	fb9eb735f6	Add some trace output to TwoAddressInstructionPass. llvm-svn: 160380	2012-07-17 17:57:23 +00:00

1 2 3 4 5 ...

13846 Commits