llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-04-01 23:31:52 +00:00

Author	SHA1	Message	Date
Andrew Trick	79c6fbc97e	Minor cleanup of defaultDefLatency API llvm-svn: 161470	2012-08-08 02:44:11 +00:00
Jakob Stoklund Olesen	e89d9ba157	Revert "Fix a quadratic algorithm in MachineBranchProbabilityInfo." It caused an assertion failure when compiling consumer-typeset. llvm-svn: 161463	2012-08-08 01:10:31 +00:00
Manman Ren	967804ad0a	X86: enable CSE between CMP and SUB We perform the following: 1> Use SUB instead of CMP for i8,i16,i32 and i64 in ISel lowering. 2> Modify MachineCSE to correctly handle implicit defs. 3> Convert SUB back to CMP if possible at peephole. Removed pattern matching of (a>b) ? (a-b):0 and like, since they are handled by peephole now. rdar://11873276 llvm-svn: 161462	2012-08-08 00:51:41 +00:00
Jakob Stoklund Olesen	10c3144381	Fix a quadratic algorithm in MachineBranchProbabilityInfo. The getSumForBlock function was quadratic in the number of successors because getSuccWeight would perform a linear search for an already known iterator. llvm-svn: 161460	2012-08-08 00:20:37 +00:00
Jakob Stoklund Olesen	08414c1860	Skip tied operand pairs that already have the same register. llvm-svn: 161454	2012-08-07 22:47:06 +00:00
Jakob Stoklund Olesen	3c221664e3	Add SelectionDAG::getTargetIndex. This adds support for TargetIndex operands during isel. The meaning of these (index, offset, flags) operands is entirely defined by the target. llvm-svn: 161453	2012-08-07 22:37:05 +00:00
Bill Wendling	06a580b5da	For non-Darwin platforms, we want to generate stack protectors only for character arrays. This is in line with what GCC does. <rdar://problem/10529227> llvm-svn: 161446	2012-08-07 20:59:05 +00:00
Jakob Stoklund Olesen	301af79343	Add a new kind of MachineOperand: MO_TargetIndex. A target index operand looks a lot like a constant pool reference, but it is completely target-defined. It contains the 8-bit TargetFlags, a 32-bit index, and a 64-bit offset. It is preserved by all code generator passes. TargetIndex operands can be used to carry target-specific information in cases where immediate operands won't suffice. llvm-svn: 161441	2012-08-07 18:56:39 +00:00
Jakob Stoklund Olesen	8836660866	Fix a couple of typos. llvm-svn: 161437	2012-08-07 18:32:57 +00:00
Jakob Stoklund Olesen	438bc30c3d	Add trace accessor methods, implement primitive if-conversion heuristic. Compare the critical paths of the two traces through an if-conversion candidate. If the difference is larger than the branch brediction penalty, reject the if-conversion. If would never pay. llvm-svn: 161433	2012-08-07 18:02:19 +00:00
Chandler Carruth	49d4e3f282	Add a much more conservative strategy for aligning branch targets. Previously, MBP essentially aligned every branch target it could. This bloats code quite a bit, especially non-looping code which has no real reason to prefer aligned branch targets so heavily. As Andy said in review, it's still a bit odd to do this without a real cost model, but this at least has much more plausible heuristics. Fixes PR13265. llvm-svn: 161409	2012-08-07 09:45:24 +00:00
Manman Ren	5d43c19d9e	MachineCSE: Update the heuristics for isProfitableToCSE. If the result of a common subexpression is used at all uses of the candidate expression, CSE should not increase the live range of the common subexpression. rdar://11393714 and rdar://11819721 llvm-svn: 161396	2012-08-07 06:16:46 +00:00
Jakob Stoklund Olesen	d5b3babd6f	Delete a dead variable. TwoAddressInstructionPass doesn't remat any more. llvm-svn: 161285	2012-08-04 00:04:03 +00:00
Jakob Stoklund Olesen	69611c470c	TwoAddressInstructionPass refactoring: Extract another method. llvm-svn: 161284	2012-08-03 23:57:58 +00:00
Bob Wilson	9f6e25017a	Refactor and check "onlyReadsMemory" before optimizing builtins. This patch is mostly just refactoring a bunch of copy-and-pasted code, but it also adds a check that the call instructions are readnone or readonly. That check was already present for sin, cos, sqrt, log2, and exp2 calls, but it was missing for the rest of the builtins being handled in this code. llvm-svn: 161282	2012-08-03 23:29:17 +00:00
Jakob Stoklund Olesen	7713568689	TwoAddressInstructionPass refactoring: Extract a method. No functional change intended, except replacing a DenseMap with a SmallDenseMap which should behave identically. llvm-svn: 161281	2012-08-03 23:25:45 +00:00
Jakob Stoklund Olesen	643fdb449e	Begin adding support for updating LiveIntervals in TwoAddressInstructionPass. This is far from complete, and only changes behavior when the -early-live-intervals flag is passed to llc. llvm-svn: 161273	2012-08-03 22:58:34 +00:00
Jakob Stoklund Olesen	14c88af5f2	Add an experimental -early-live-intervals option. This option runs LiveIntervals before TwoAddressInstructionPass which will eventually learn to exploit and update the analysis. Eventually, LiveIntervals will run before PHIElimination, and we can get rid of LiveVariables. llvm-svn: 161270	2012-08-03 22:12:54 +00:00
Jakob Stoklund Olesen	1e192f98dd	Delete merged physreg copies in joinReservedPhysReg(). Previously, the identity copy would survive through register allocation before it was removed by the rewriter. llvm-svn: 161269	2012-08-03 22:12:51 +00:00
Bob Wilson	7e2fb62620	Try to reduce the compile time impact of r161232. The previous change caused fast isel to not attempt handling any calls to builtin functions. That included things like "printf" and caused some noticable regressions in compile time. I wanted to avoid having fast isel keep a separate list of functions that had to be kept in sync with what the code in SelectionDAGBuilder.cpp was handling. I've resolved that here by moving the list into TargetLibraryInfo. This is somewhat redundant in SelectionDAGBuilder but it will ensure that we keep things consistent. llvm-svn: 161263	2012-08-03 21:26:24 +00:00
Bob Wilson	7d92f01d57	Fix memcmp code-gen to honor -fno-builtin. I noticed that SelectionDAGBuilder::visitCall was missing a check for memcmp in TargetLibraryInfo, so that it would use custom code for memcmp calls even with -fno-builtin. I also had to add a new -disable-simplify-libcalls option to llc so that I could write a test for this. llvm-svn: 161262	2012-08-03 21:26:18 +00:00
Jakob Stoklund Olesen	bf7a000191	Completely eliminate VNInfo flags. The 'unused' state of a value number can be represented as an invalid def SlotIndex. This also exposed code that shouldn't have been looking at unused value VNInfos. llvm-svn: 161258	2012-08-03 20:59:32 +00:00
Jakob Stoklund Olesen	9af03604de	Fix a couple of loops that were processing unused value numbers. Unused VNInfos should be left alone. Their def SlotIndex doesn't point to anything. llvm-svn: 161257	2012-08-03 20:59:29 +00:00
Matt Beaumont-Gay	acc7302f80	Silence unused variable warning in -asserts build llvm-svn: 161256	2012-08-03 20:54:11 +00:00
Jakob Stoklund Olesen	38aec46f18	Eliminate the VNInfo::hasPHIKill() flag. The only real user of the flag was removeCopyByCommutingDef(), and it has been switched to LiveIntervals::hasPHIKill(). All the code changed by this patch was only concerned with computing and propagating the flag. llvm-svn: 161255	2012-08-03 20:19:44 +00:00
Jakob Stoklund Olesen	74dfaec8d7	Make the hasPHIKills flag a computed property. The VNInfo::HAS_PHI_KILL is only half supported. We precompute it in LiveIntervalAnalysis, but it isn't properly updated by live range splitting and functions like shrinkToUses(). It is only used in one place: RegisterCoalescer::removeCopyByCommutingDef(). This patch changes that function to use a new LiveIntervals::hasPHIKill() function that computes the flag for a given value number. llvm-svn: 161254	2012-08-03 20:10:24 +00:00
Jakob Stoklund Olesen	df448a4b76	Delete dead function. llvm-svn: 161242	2012-08-03 15:21:21 +00:00
Jakob Stoklund Olesen	95dc0b7b93	Don't delete dead code in TwoAddressInstructionPass. This functionality was added before we started running DeadMachineInstructionElim on all targets. It serves no purpose now. llvm-svn: 161241	2012-08-03 15:11:57 +00:00
Bob Wilson	d1eefbeac2	Fall back to selection DAG isel for calls to builtin functions. Fast isel doesn't currently have support for translating builtin function calls to target instructions. For embedded environments where the library functions are not available, this is a matter of correctness and not just optimization. Most of this patch is just arranging to make the TargetLibraryInfo available in fast isel. <rdar://problem/12008746> llvm-svn: 161232	2012-08-03 04:06:28 +00:00
Manman Ren	2c236a30f6	X86 Peephole: fold loads to the source register operand if possible. Add more comments and use early returns to reduce nesting in isLoadFoldable. Also disable folding for V_SET0 to avoid introducing a const pool entry and a const pool load. rdar://10554090 and rdar://11873276 llvm-svn: 161207	2012-08-02 19:37:32 +00:00
Jakob Stoklund Olesen	a299894307	Compute the critical path length through a trace. Whenever both instruction depths and instruction heights are known in a block, it is possible to compute the length of the critical path as max(depth+height) over the instructions in the block. The stored live-in lists make it possible to accurately compute the length of a critical path that bypasses the current (small) block. llvm-svn: 161197	2012-08-02 18:45:54 +00:00
Jakob Stoklund Olesen	4be65d0d0c	Verify regunit intervals along with virtreg intervals. Don't cause regunit intervals to be computed just to verify them. Only check the already cached intervals. llvm-svn: 161183	2012-08-02 16:36:50 +00:00
Jakob Stoklund Olesen	ac79668cdf	Avoid creating dangling physreg live ranges during DCE. LiveRangeEdit::eliminateDeadDefs() can delete a dead instruction that reads unreserved physregs. This would leave the corresponding regunit live interval dangling because we don't have shrinkToUses() for physical registers. Fix this problem by turning the instruction into a KILL instead of deleting it. This happens in a landing pad in test/CodeGen/X86/2012-05-19-CoalescerCrash.ll: %vreg27<def,dead> = COPY %EDX<kill>; GR32:%vreg27 becomes: KILL %EDX<kill> An upcoming fix to the machine verifier will catch problems like this by verifying regunit live intervals. This fixes PR13498. I am not including the test case from the PR since we already have one exposing the problem once the verifier is fixed. llvm-svn: 161182	2012-08-02 16:36:47 +00:00
Jakob Stoklund Olesen	802cd7688f	Add report() functions that take a LiveInterval argument. llvm-svn: 161178	2012-08-02 14:31:49 +00:00
Manman Ren	78b8d454cc	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. This patch is a rework of r160919 and was tested on clang self-host on my local machine. rdar://10554090 and rdar://11873276 llvm-svn: 161152	2012-08-02 00:56:42 +00:00
Jakob Stoklund Olesen	5848ce22b0	Extract some methods from verifyLiveIntervals. No functional change. llvm-svn: 161149	2012-08-02 00:20:20 +00:00
Jakob Stoklund Olesen	85840cd877	Also verify RegUnit intervals at uses. llvm-svn: 161147	2012-08-01 23:52:40 +00:00
Jakob Stoklund Olesen	b223807a3d	Compute instruction heights through a trace. The height on an instruction is the minimum number of cycles from the instruction is issued to the end of the trace. Heights are computed for all instructions in and below the trace center block. The method for computing heights is different from the depth computation. As we visit instructions in the trace bottom-up, heights of used instructions are pushed upwards. This way, we avoid scanning long use lists, looking for uses in the current trace. At each basic block boundary, a list of live-in registers and their minimum heights is saved in the trace block info. These live-in lists are used when restarting depth computations on a trace that converges with an already computed trace. They will also be used to accurately compute the critical path length. llvm-svn: 161138	2012-08-01 22:36:00 +00:00
Eric Christopher	dde7784606	Temporarily revert c23b933d5f8be9b51a1d22e717c0311f65f87dcd. It's causing failures in the debug testsuite and possibly PR13486. llvm-svn: 161121	2012-08-01 18:19:01 +00:00
Jakob Stoklund Olesen	beae25c46c	Add DataDep constructors. Explicitly check SSA form. llvm-svn: 161115	2012-08-01 16:02:59 +00:00
Elena Demikhovsky	0fec7026d9	Added FMA functionality to X86 target. llvm-svn: 161110	2012-08-01 12:06:00 +00:00
Manman Ren	b9e9c55911	MachineSink: Sort the successors before trying to find SuccToSinkTo. Use stable_sort instead of sort. Follow-up to r161062. rdar://11980766 llvm-svn: 161075	2012-07-31 20:45:38 +00:00
Jakob Stoklund Olesen	3dc189f67b	Compute instruction depths through the current trace. Assuming infinite issue width, compute the earliest each instruction in the trace can issue, when considering the latency of data dependencies. The issue cycle is record as a 'depth' from the beginning of the trace. This is half the computation required to find the length of the critical path through the trace. Heights are next. llvm-svn: 161074	2012-07-31 20:44:38 +00:00
Jakob Stoklund Olesen	b2c7febf35	Rename CT -> MTM. MachineTraceMetrics is abbreviated MTM. llvm-svn: 161072	2012-07-31 20:25:13 +00:00
Manman Ren	3d7e85d5b8	MachineSink: Sort the successors before trying to find SuccToSinkTo. One motivating example is to sink an instruction from a basic block which has two successors: one outside the loop, the other inside the loop. We should try to sink the instruction outside the loop. rdar://11980766 llvm-svn: 161062	2012-07-31 18:10:39 +00:00
Micah Villmow	122d115419	Conform to LLVM coding style. llvm-svn: 161061	2012-07-31 18:07:43 +00:00
Micah Villmow	8b7fb5e605	Don't generate ordered or unordered comparison operations if it is not legal to do so. llvm-svn: 161053	2012-07-31 16:48:03 +00:00
Jakob Stoklund Olesen	ed1a4d695a	Clear kill flags in removeCopyByCommutingDef(). We are extending live ranges, so kill flags are not accurate. They aren't needed until they are recomputed after RA anyway. <rdar://problem/11950722> llvm-svn: 161023	2012-07-31 02:47:24 +00:00
Manman Ren	3769ac64a6	Reverse order of the two branches at end of a basic block if it is profitable. We branch to the successor with higher edge weight first. Convert from je LBB4_8 --> to outer loop jmp LBB4_14 --> to inner loop to jne LBB4_14 jmp LBB4_8 PR12750 rdar: 11393714 llvm-svn: 161018	2012-07-31 01:11:07 +00:00
Andrew Trick	2773dbddf3	Use the latest MachineRegisterInfo APIs. No functionality. llvm-svn: 161010	2012-07-30 23:48:17 +00:00

1 2 3 4 5 ...

13889 Commits