Commit Graph

40732 Commits

Author SHA1 Message Date
Dan Gohman
64c713c605 Micro-optimize SCEVConstant comparison.
llvm-svn: 111142
2010-08-16 16:25:35 +00:00
Dan Gohman
6d808db278 Move SCEVNAryExpr's virtual member functions out of line, and convert
them to iterators.

llvm-svn: 111140
2010-08-16 16:21:27 +00:00
Dan Gohman
d55de2457e Use iterators instead of indices in simple cases.
llvm-svn: 111138
2010-08-16 16:16:11 +00:00
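
A minimal illustration of the kind of rewrite described in the commit above, using a plain standard-library container rather than any particular LLVM data structure (the function names here are purely illustrative):

  #include <vector>

  static void process(int X) { (void)X; }   // stand-in for the real per-element work

  static void processAll(std::vector<int> &Ops) {
    // Index-based form being replaced:
    //   for (unsigned i = 0, e = Ops.size(); i != e; ++i) process(Ops[i]);
    // Iterator-based form, which also works for containers that lack
    // cheap random access:
    for (std::vector<int>::iterator I = Ops.begin(), E = Ops.end(); I != E; ++I)
      process(*I);
  }
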
Dan Gohman
daec40b96d Avoid gratuitous inefficiency in ifndef NDEBUG code.
llvm-svn: 111137
2010-08-16 16:13:54 +00:00
Dan Gohman
345dc8add7 Make one getAddExpr call when analyzing a+b+c+d+e+... instead of one
for each add instruction. Ditto for Mul.

llvm-svn: 111136
2010-08-16 16:03:49 +00:00
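
A rough sketch of the idea above, assuming the usual ScalarEvolution interface (SE is a ScalarEvolution reference, and collectAddOperands is a hypothetical helper that walks the add chain); illustrative only, not the actual code:

  // For a chain like ((((a+b)+c)+d)+e), gather all the leaf operands first...
  SmallVector<const SCEV *, 8> Ops;
  collectAddOperands(OutermostAdd, Ops);   // hypothetical helper

  // ...then build the SCEV with a single getAddExpr call, instead of one
  // getAddExpr call per add instruction in the chain. Ditto for Mul with
  // getMulExpr.
  const SCEV *Sum = SE.getAddExpr(Ops);
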
Dan Gohman
fec602e7c5 Delete an unused function.
llvm-svn: 111135
2010-08-16 15:57:14 +00:00
Dan Gohman
7900e1ace3 Instead of having CollectSubexprs categorize operands as interesting or
uninteresting, just put all the operands on one list and make
GenerateReassociations make the decision about what's interesting.
This is simpler, and it avoids an extra ScalarEvolution::getAddExpr call.

llvm-svn: 111133
2010-08-16 15:50:00 +00:00
Dan Gohman
38d11cdfe0 Put add operands in ScalarEvolution-canonical order, when convenient.
This isn't necessary, because ScalarEvolution sorts them anyway,
but it's tidier this way.

llvm-svn: 111132
2010-08-16 15:39:27 +00:00
Dan Gohman
06963ff7c4 Add hooks to FoldingSetTrait to allow specializations to provide
implementations of equality comparison and hash computation. This
can be used to optimize node lookup by avoiding creating lots of
temporary ID values just for hashing and comparison purposes.

llvm-svn: 111130
2010-08-16 15:30:39 +00:00
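
A hedged sketch of what such a FoldingSetTrait specialization might look like. The hook names Equals and ComputeHash, their signatures, and the helper functions are assumptions made for illustration, not a copy of the actual interface:

  // Specializing FoldingSetTrait lets lookups compare and hash against a
  // node's own fields directly, instead of building a temporary
  // FoldingSetNodeID for every probe.
  template <> struct FoldingSetTrait<MyNode> {       // MyNode is hypothetical
    static void Profile(const MyNode &N, FoldingSetNodeID &ID) {
      ID.AddInteger(N.Kind);
      ID.AddPointer(N.Ptr);
    }
    // Assumed hooks: compare/hash a node without materializing its ID.
    static bool Equals(const MyNode &N, const FoldingSetNodeID &ID,
                       FoldingSetNodeID &/*TempID*/) {
      return matchesFields(N, ID);                   // hypothetical comparison
    }
    static unsigned ComputeHash(const MyNode &N, FoldingSetNodeID &/*TempID*/) {
      return hashFields(N.Kind, N.Ptr);              // hypothetical hash
    }
  };
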
Dan Gohman
4446294688 Reverse the order of GetNodeProfile's arguments, for consistency
with FoldingSetTrait::Profile.

llvm-svn: 111127
2010-08-16 14:53:42 +00:00
Dan Gohman
80b2503100 Avoid #include <ScalarEvolution.h> in LoopSimplify.cpp, which doesn't
actually use ScalarEvolution.

llvm-svn: 111124
2010-08-16 14:44:03 +00:00
Dan Gohman
9178d0792f Instead, teach SimplifyCFG to trim non-address-taken blocks from
indirectbr destination lists.

llvm-svn: 111122
2010-08-16 14:41:14 +00:00
Dan Gohman
8ed4d1646e Revert r111058, the lint check for indirectbr successors that aren't
address-taken. This can occur normally, if the code which took the
address got DCEd.

llvm-svn: 111121
2010-08-16 14:39:19 +00:00
Argyrios Kyrtzidis
75b69c1de3 Revert r111082. No warnings for this common pattern.
llvm-svn: 111102
2010-08-15 10:27:23 +00:00
Mikhail Glushenkov
10302168ea llvmc: remove dynamic plugins.
llvm-svn: 111094
2010-08-15 07:07:12 +00:00
Eric Christopher
1470fe415c Rework how the non-sse2 memory barrier is lowered so that the
encoding is correct for the built-in assembler.

Based on a patch from Chris.

llvm-svn: 111083
2010-08-14 21:51:50 +00:00
Argyrios Kyrtzidis
70b248e3ac Add ATTRIBUTE_UNUSED to methods that are not supposed to be used.
llvm-svn: 111082
2010-08-14 21:35:10 +00:00
Chris Lattner
8426971169 improve indentation
llvm-svn: 111073
2010-08-14 17:26:09 +00:00
Bob Wilson
b1eb015fc8 T2I_rbin_irs rr variant is for disassembly only, so don't provide a pattern.
llvm-svn: 111068
2010-08-14 03:18:29 +00:00
Ted Kremenek
8091488511 Update CMake build.
llvm-svn: 111063
2010-08-14 01:55:09 +00:00
Dan Gohman
afb3db46d2 LoopSimplify shouldn't split loop backedges that use indirectbr. PR7867.
llvm-svn: 111061
2010-08-14 00:43:09 +00:00
Dan Gohman
d04a608a73 Teach SimplifyCFG how to simplify indirectbr instructions.
- Eliminate redundant successors.
- Convert an indirectbr with one successor into a direct branch.

Also, generalize SimplifyCFG to be able to be run on a function entry block.
It knows quite a few simplifications which are applicable to the entry
block, and it only needs a few checks to avoid trouble with the entry block.

llvm-svn: 111060
2010-08-14 00:29:42 +00:00
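
A simplified sketch of the two indirectbr simplifications described above; this is not the actual SimplifyCFG code, and it ignores address-taken bookkeeping and the zero-destination case:

  #include <set>

  static bool simplifyIndirectBr(IndirectBrInst *IBI) {
    bool Changed = false;

    // Eliminate redundant (duplicate) successors.
    std::set<BasicBlock *> Seen;
    for (unsigned i = 0; i != IBI->getNumDestinations(); /* in body */) {
      if (!Seen.insert(IBI->getDestination(i)).second) {
        IBI->removeDestination(i);   // duplicate destination: drop it
        Changed = true;
      } else {
        ++i;
      }
    }

    // An indirectbr with exactly one successor is just a direct branch.
    if (IBI->getNumDestinations() == 1) {
      BranchInst::Create(IBI->getDestination(0), IBI);
      IBI->eraseFromParent();
      Changed = true;
    }
    return Changed;
  }
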
Jim Grosbach
a4d3174cba Add a local stack object block allocation pass. This is still an
experimental pass that allocates locals relative to one another before
register allocation and then assigns them to actual stack slots as a block
later in PEI. This will eventually allow targets with limited index offset
range to allocate additional base registers (not just FP and SP) to
more efficiently reference locals, as well as handle situations where
locals cannot be referenced via SP or FP at all (dynamic stack realignment
together with variable sized objects, for example). It's currently
incomplete and almost certainly buggy. Work in progress.

Disabled by default and gated via the -enable-local-stack-alloc command
line option.

rdar://8277890

llvm-svn: 111059
2010-08-14 00:15:52 +00:00
Dan Gohman
422c164d8d Add a lint check for an indirectbr destination which has not
had its address taken.

llvm-svn: 111058
2010-08-13 23:56:28 +00:00
Bob Wilson
92bf5a7425 Add a Thumb2 t2RSBrr instruction for disassembly only.
This fixes another part of PR7792.

llvm-svn: 111057
2010-08-13 23:24:25 +00:00
Jakob Stoklund Olesen
44b77ea344 Clean up the Spiller.h interface.
The earliestStart argument is entirely specific to linear scan allocation, and
can be easily calculated by RegAllocLinearScan.

Replace std::vector with SmallVector.

llvm-svn: 111055
2010-08-13 22:56:53 +00:00
Bob Wilson
ca672ee828 Temporarily disable tail calls on ARM to work around some linker problems.
llvm-svn: 111050
2010-08-13 22:43:33 +00:00
Bob Wilson
0883c6aae3 Move the Thumb2 SSAT and USAT optional shift operator out of the
instruction opcode.  This fixes part of PR7792.

llvm-svn: 111047
2010-08-13 21:48:10 +00:00
Dan Gohman
076597a026 Various optimizations. Don't compare two loops' depths
when they are the same loop. Don't compare two instructions'
loop depths when they are in the same block.

llvm-svn: 111045
2010-08-13 21:24:58 +00:00
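
The early-outs being described amount to checks along these lines (a sketch with hypothetical helper names, not the actual LSR code):

  // Two loops only need their depths compared when they are different loops.
  static bool isDeeperLoop(const Loop *A, const Loop *B) {
    if (A == B)
      return false;                  // same loop: nothing to compare
    return A->getLoopDepth() > B->getLoopDepth();
  }

  // Two instructions in the same block trivially have the same loop depth,
  // so skip the LoopInfo lookups entirely in that case.
  static bool sameLoopDepth(const Instruction *X, const Instruction *Y,
                            LoopInfo &LI) {
    if (X->getParent() == Y->getParent())
      return true;
    return LI.getLoopDepth(X->getParent()) == LI.getLoopDepth(Y->getParent());
  }
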
Jakob Stoklund Olesen
70604cb116 Implement splitting inside a single block.
When a live range is contained in a single block, we can split it around
instruction clusters. The current approach is very primitive, splitting before
and after the largest gap between uses.

llvm-svn: 111043
2010-08-13 21:18:48 +00:00
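
A rough sketch of the "largest gap" heuristic, using plain integers in place of slot indexes (the real code works on SlotIndexes and live ranges):

  #include <utility>
  #include <vector>

  // Given the sorted positions of a register's uses inside one block, return
  // the pair of uses bounding the largest gap; the split goes between them.
  static std::pair<unsigned, unsigned>
  largestUseGap(const std::vector<unsigned> &Uses) {
    unsigned BestLo = 0, BestHi = 0, BestLen = 0;
    for (unsigned i = 1, e = Uses.size(); i < e; ++i) {
      unsigned Len = Uses[i] - Uses[i - 1];
      if (Len > BestLen) {
        BestLen = Len;
        BestLo = Uses[i - 1];   // split after this use...
        BestHi = Uses[i];       // ...and before this one
      }
    }
    return std::make_pair(BestLo, BestHi);
  }
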
Dan Gohman
3e7c2a2040 Fix LSR's ExtractImmediate and ExtractSymbol to avoid calling
ScalarEvolution::getAddExpr, which can be pretty expensive, when nothing
has changed, which is pretty common.

llvm-svn: 111042
2010-08-13 21:17:19 +00:00
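
A sketch of the shape of that optimization; Add, SE, and extractPart are stand-ins for the real objects and the real per-operand transformation:

  // Only rebuild the sum (an expensive getAddExpr call) if some operand
  // actually changed while extracting the immediate/symbol.
  bool Changed = false;
  SmallVector<const SCEV *, 8> NewOps;
  for (unsigned i = 0, e = Add->getNumOperands(); i != e; ++i) {
    const SCEV *Op = Add->getOperand(i);
    const SCEV *NewOp = extractPart(Op);   // hypothetical per-operand step
    Changed |= (NewOp != Op);
    NewOps.push_back(NewOp);
  }
  const SCEV *Result = Add;
  if (Changed)
    Result = SE.getAddExpr(NewOps);
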
Bruno Cardoso Lopes
1eaa601d84 Add comments to some pattern fragments in x86
llvm-svn: 111041
2010-08-13 20:39:01 +00:00
Dan Gohman
ecb279a1c2 When testing whether one loop contains another, test this directly
rather than testing whether the loop contains the other's header.

llvm-svn: 111039
2010-08-13 20:23:25 +00:00
Dan Gohman
02de74d4dc Add a const.
llvm-svn: 111038
2010-08-13 20:17:27 +00:00
Dan Gohman
3650d2058f When creating a symmetric SCEV with a constant operand, put
the constant operand on the left, as that's where ScalarEvolution
will end up canonicalizing to.

llvm-svn: 111037
2010-08-13 20:17:14 +00:00
Dan Gohman
de171ff360 An add recurrence is loop-invariant in any loop inside of its
associated loop. This avoids potentially expensive traversals
of the add recurrence's operands.

llvm-svn: 111034
2010-08-13 20:11:39 +00:00
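
The observation above makes for a cheap early-out roughly like this, inside a hypothetical isLoopInvariant-style query (S and QueryLoop stand for the expression and loop being asked about):

  // An add recurrence {start,+,step}<L> varies only with iterations of L,
  // so it is automatically invariant in any loop strictly inside L.
  if (const SCEVAddRecExpr *AR = dyn_cast<SCEVAddRecExpr>(S))
    if (QueryLoop != AR->getLoop() && AR->getLoop()->contains(QueryLoop))
      return true;   // no need to walk AR's operands
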
Bob Wilson
c044a43293 Refactor the code for disassembling Thumb2 saturate instructions along the
same lines as the change I made for ARM saturate instructions.

llvm-svn: 111029
2010-08-13 19:04:21 +00:00
Dale Johannesen
3f9c148d0e Revert 110491. While not wrong, it was based on a
misanalysis and is undesirable.

llvm-svn: 111028
2010-08-13 18:43:45 +00:00
Bruno Cardoso Lopes
8b07859f3a Fix comment to reflect code, and remove an unused argument
llvm-svn: 111022
2010-08-13 17:50:47 +00:00
Bruno Cardoso Lopes
de5f3f5cb6 Improve comment to make explicit why not to touch this code before the JIT goes MC
llvm-svn: 111021
2010-08-13 17:44:10 +00:00
Jim Grosbach
4f58c74300 tidy up whitespace a bit
llvm-svn: 111019
2010-08-13 16:55:08 +00:00
Eric Christopher
63c83f19a0 Revert last patch and r110954 as I meant to.
llvm-svn: 111001
2010-08-13 02:37:50 +00:00
Eric Christopher
e9a4223bc8 Revert r110954 for now, pseudo instructions can't make it through to the JIT.
llvm-svn: 111000
2010-08-13 02:30:00 +00:00
Jakob Stoklund Olesen
d802c303fd Let LiveInterval::addRange extend existing ranges, it will verify that value
numbers match. The old check could accidentally leave holes in openli.

Also let useIntv add all ranges for the phi-def value inserted by
enterIntvAtEnd. This works as long at the value mapping is established in
enterIntvAtEnd.

llvm-svn: 110995
2010-08-13 01:05:26 +00:00
Jakob Stoklund Olesen
7bac4bf66d Remember to actually update SplitAnalysis statistics now that we have a fancy
function to do it.

llvm-svn: 110994
2010-08-13 01:05:23 +00:00
Nate Begeman
e57074fc48 Reapply this transformation now that it is passing the external test which it previously failed.
llvm-svn: 110987
2010-08-13 00:17:53 +00:00
Nate Begeman
833a7cd504 Move some code from Verifier into SVI::isValidOperands. This allows us to catch bad shufflevector operations when they are created, rather than waiting for someone to notice later on.
llvm-svn: 110986
2010-08-13 00:16:46 +00:00
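
With the check available in ShuffleVectorInst::isValidOperands, a front end can reject a bad shuffle at construction time; a usage sketch (variable names are illustrative):

  // Validate the operands before building the instruction, instead of
  // waiting for the Verifier to flag it later.
  if (!ShuffleVectorInst::isValidOperands(V1, V2, Mask))
    report_fatal_error("invalid shufflevector operands");
  Value *Shuf = new ShuffleVectorInst(V1, V2, Mask, "shuf", InsertBefore);
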
Dan Gohman
88b65340ea Trim #includes.
llvm-svn: 110983
2010-08-12 23:56:03 +00:00
Dan Gohman
6fd1662d06 Tidy up whitespace.
llvm-svn: 110982
2010-08-12 23:50:08 +00:00
Dan Gohman
f6ffbfbbd1 Use .empty() instead of .size().
llvm-svn: 110981
2010-08-12 23:46:28 +00:00
Jakob Stoklund Olesen
2dc2440b42 Handle an empty dupli.
This can happen if the original interval has been broken into two disconnected
parts. Ideally, we should be able to detect when the graph is disconnected and
create separate intervals, but that code is not implemented yet.

Example:

Two basic blocks are both branching to a loop header. Our interval is defined in
both basic blocks, and live into the loop along both edges.

We decide to split the interval around the loop. The interval is split into an
inside part and an outside part. The outside part now has two disconnected
segments, one in each basic block.

If we later decide to split the outside interval into single blocks, we get one
interval per basic block and an empty dupli for the remainder.

llvm-svn: 110976
2010-08-12 23:02:57 +00:00
Jakob Stoklund Olesen
f8957964f7 Update the SplitAnalysis statistics as uses are moved from curli to the new
split intervals. This means the analysis can be used for multiple splits as long
as curli doesn't shrink.

llvm-svn: 110975
2010-08-12 23:02:55 +00:00
Chris Lattner
fd40059e71 fix PR7876: If ipsccp decides that a function's address is taken
before it rewrites the code, we need to use that in the post-rewrite pass.

llvm-svn: 110962
2010-08-12 22:25:23 +00:00
Bruno Cardoso Lopes
350d186d69 Some small clean-up: use of pseudo instructions
llvm-svn: 110954
2010-08-12 20:55:18 +00:00
Johnny Chen
78345b1dfe Cleaned up the for-disassembly-only entries in the ARM instruction table so that
the memory barrier variants (other than 'SY' full system domain read and write)
are treated as one instruction with an option operand.

llvm-svn: 110951
2010-08-12 20:46:17 +00:00
Jakob Stoklund Olesen
1337aa8e38 Also recompute HasPHIKill flags in LiveInterval::RenumberValues.
If a phi-def value were removed from the interval, the phi-kill flags are no
longer valid.

llvm-svn: 110949
2010-08-12 20:38:03 +00:00
Evan Cheng
362df591b6 Make sure ARM constant island pass does not break up an IT block. If the split point is in the middle of an IT block, it should move it up to just above the IT instruction. rdar://8302637
llvm-svn: 110947
2010-08-12 20:30:05 +00:00
Bruno Cardoso Lopes
7cb26cb8be - Teach SSEDomainFix to switch between different levels of AVX instructions. Here we guess that AVX will have domain issues, so we implement them for consistency now and can remove them later if that turns out to be unnecessary.
- Make foldMemoryOperandImpl aware of 256-bit zero vectors folding and support the 128-bit counterparts of AVX too.
- Make sure MOV[AU]PS instructions are only selected when SSE1 is enabled, and duplicate the patterns to match AVX.
- Add a testcase for a simple 128-bit zero vector creation.

llvm-svn: 110946
2010-08-12 20:20:53 +00:00
Jakob Stoklund Olesen
cbb21e8c0e Remove trailing whitespace.
llvm-svn: 110944
2010-08-12 20:01:23 +00:00
Jakob Stoklund Olesen
d3e41f910a Clean up debug output.
llvm-svn: 110940
2010-08-12 18:50:55 +00:00
Bruno Cardoso Lopes
99b5298854 Define AVX 128-bit pattern versions of SET0PS/PD.
llvm-svn: 110937
2010-08-12 18:20:59 +00:00
Jakob Stoklund Olesen
886eebfa6f Implement single block splitting.
Before spilling a live range, we split it into a separate range for each basic
block where it is used. That way we only get one reload per basic block if the
new smaller ranges can allocate to a register.

This type of splitting is already present in the standard spiller.

llvm-svn: 110934
2010-08-12 17:07:14 +00:00
Dan Gohman
8103471a35 Optimize ScalarEvolution::getAddExpr's operand factoring code by
having it finish processing all of the multiply operands before
starting the whole getAddExpr process over again, instead of
immediately after the first simplification.

llvm-svn: 110916
2010-08-12 15:00:23 +00:00
Dan Gohman
d7b79a35df Hoist some loop-invariant code out of a hot loop.
llvm-svn: 110915
2010-08-12 14:52:55 +00:00
Dan Gohman
cc5fb07653 Optimize ScalarEvolution::getAddExpr's duplicate operand detection
by having it finish processing the whole operand list before
starting the whole getAddExpr process over again, instead of
immediately after the first duplicate is found.

llvm-svn: 110914
2010-08-12 14:46:54 +00:00
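
The duplicate-detection idea, sketched with a counting map (illustrative only; it ignores the operand-ordering concerns the real code has to respect):

  // Count every operand once over the whole list, then rewrite duplicates as
  // multiplications in a single pass:  A + B + A  ==>  2*A + B.
  DenseMap<const SCEV *, unsigned> Count;
  for (unsigned i = 0, e = Ops.size(); i != e; ++i)
    ++Count[Ops[i]];

  SmallVector<const SCEV *, 8> NewOps;
  for (DenseMap<const SCEV *, unsigned>::iterator I = Count.begin(),
         E = Count.end(); I != E; ++I) {
    const SCEV *Op = I->first;
    if (I->second == 1)
      NewOps.push_back(Op);
    else
      NewOps.push_back(SE.getMulExpr(SE.getConstant(Op->getType(), I->second),
                                     Op));
  }
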
Duncan Sands
6f5776e1d2 Add a 'normalize' method to the Triple class, which takes a mucked up
target triple and straightens it out.  This does less than gcc's script
config.sub, for example it turns i386-mingw32 into i386--mingw32 not
i386-pc-mingw32, but it does a decent job of turning funky triples into
something that the rest of the Triple class can understand.  The plan
is to use this to canonicalize triples when they are first provided
by users, and have the rest of LLVM only deal with canonical triples.
Once this is done the special case workarounds in the Triple constructor
can be removed, making the class more regular and easier to use.  The
comments and unittests for the Triple class are already adjusted in this
patch appropriately for this brave new world of increased uniformity.

llvm-svn: 110909
2010-08-12 11:31:39 +00:00
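
A usage sketch matching the example given in the message; Triple::normalize takes the raw triple string and returns the straightened form:

  #include "llvm/ADT/Triple.h"
  #include <string>

  // "i386-mingw32" is missing its vendor component; normalize() inserts an
  // empty one rather than guessing "pc" the way config.sub does.
  std::string T = llvm::Triple::normalize("i386-mingw32");
  // T == "i386--mingw32"
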
Eric Christopher
34acdf57df Temporarily revert 110737 and 110734, they were causing failures
in an external testsuite.

llvm-svn: 110905
2010-08-12 07:01:22 +00:00
Bruno Cardoso Lopes
43a7ba2bbc Fix comment order
llvm-svn: 110898
2010-08-12 02:08:52 +00:00
Bruno Cardoso Lopes
bb491bd56c Begin to support some vector operations for AVX 256-bit instructions. The long
term goal here is to be able to match enough of vector_shuffle and build_vector
so all avx intrinsics which aren't mapped to their own built-ins but to
shufflevector calls can be codegen'd. This is the first (baby) step, support
building zeroed vectors.

llvm-svn: 110897
2010-08-12 02:06:36 +00:00
Johnny Chen
fef1367b50 The autogened decoder was confusing the ARM STRBT for ARM USAT, because the .td
entry for ARM STRBT is actually a super-instruction for A8.6.199 STRBT A1 & A2.
Recover by looking for ARM:USAT encoding pattern before delegating to the auto-
gened decoder.

Added a "usat" test case to arm-tests.txt.

llvm-svn: 110894
2010-08-12 01:40:54 +00:00
Daniel Dunbar
4f45de1b1e MC/X86/AsmParser: Give an explicit error message when we reject an instruction
because it could have an ambiguous suffix.

llvm-svn: 110890
2010-08-12 00:55:42 +00:00
Daniel Dunbar
f2b4982344 MC/AsmParser: Push the burden of emitting diagnostics about unmatched
instructions onto the target specific parser, which can do a better job.

llvm-svn: 110889
2010-08-12 00:55:38 +00:00
Daniel Dunbar
0a98bc5619 tblgen/AsmMatcher: Always emit the match function as 'MatchInstructionImpl',
target specific parsers can adapt the TargetAsmParser to this.

llvm-svn: 110888
2010-08-12 00:55:32 +00:00
Johnny Chen
9a37d16281 Changed the format of DMBsy, DSBsy, and friends from Pseudo to MiscFrm.
Added two test cases to arm-tests.txt.

llvm-svn: 110880
2010-08-11 23:35:12 +00:00
Devang Patel
d425ead3c8 Even if a variable has constant value all the time, it is still a variable in gdb's eyes.
Tested by scope.exp in gdb testsuite.

llvm-svn: 110876
2010-08-11 23:17:54 +00:00
Bob Wilson
3582107cf8 Move the ARM SSAT and USAT optional shift amount operand out of the
instruction opcode.  This also fixes part of PR7792.

llvm-svn: 110875
2010-08-11 23:10:46 +00:00
Jakob Stoklund Olesen
5a62f10abc Fix <rdar://problem/8282498> even if it doesn't reproduce on trunk.
When a register is defined by a partial load:

  %reg1234:sub_32 = MOV32mr <fi#-1>; GR64:%reg1234

That load cannot be folded into an instruction using the full 64-bit register.
It would become a 64-bit load.

This is related to the recent change to have isLoadFromStackSlot return false on
a sub-register load.

llvm-svn: 110874
2010-08-11 23:08:22 +00:00
Owen Anderson
97ce6f3223 Fix a subtle use-after-free issue.
llvm-svn: 110863
2010-08-11 22:36:04 +00:00
Nick Lewycky
90e835b327 Clean up ConstantRange a bit:
- remove ashr which never worked.
- fix lshr and shl and add tests.
- remove dead function "intersect1Wrapped".
- add a new sub method to subtract ranges, with test.

llvm-svn: 110861
2010-08-11 22:04:36 +00:00
Dan Gohman
2d7bf8c073 Make LoopPass::getContainedPass return a LoopPass* instead of a Pass*
and remove casts from all its callers.

llvm-svn: 110848
2010-08-11 20:34:43 +00:00
Dan Gohman
da120221cb Remove BasicBlockPass::runOnFunction, which was unused.
llvm-svn: 110847
2010-08-11 20:28:16 +00:00
Dan Gohman
0f3058d723 Delete FunctionPass::run, which is unused.
llvm-svn: 110843
2010-08-11 19:11:05 +00:00
Dan Gohman
18a8add4ab Delete FunctionPass::runOnModule, which is unused.
llvm-svn: 110842
2010-08-11 19:05:53 +00:00
Dan Gohman
54027cf446 Don't use unsigned char for alignments in TargetData. There aren't
that many of these things, so the memory savings isn't significant,
and there are now situations where there can be alignments greater
than 128.

llvm-svn: 110836
2010-08-11 18:15:01 +00:00
Dan Gohman
d91d51116b Use ISD::ADD instead of ISD::SUB with a negated constant. This
avoids trouble if the return type of TD->getPointerSize() is
changed to something which doesn't promote to a signed type,
and is simpler anyway.

Also, use getCopyFromReg instead of getRegister to read a
physical register's value.

llvm-svn: 110835
2010-08-11 18:14:00 +00:00
Jakob Stoklund Olesen
ccf528b792 Fix a FIXME. The SlotIndex::Slot enum should be private.
llvm-svn: 110826
2010-08-11 16:50:17 +00:00
Jim Grosbach
1128a47289 cortex m4 has floating point support, but only single precision.
llvm-svn: 110810
2010-08-11 15:44:15 +00:00
Bill Wendling
f10d5c00fc Consider this code snippet:
float t1(int argc) {
  return (argc == 1123) ? 1.234f : 2.38213f;
}

We would generate truly awful code on ARM (those with a weak stomach should look
away):

_t1:
  movw   r1, #1123
  movs   r2, #1
  movs   r3, #0
  cmp    r0, r1
  mov.w  r0, #0
  it     eq
  moveq  r0, r2
  movs   r1, #4
  cmp    r0, #0
  it     ne
  movne  r3, r1
  adr    r0, #LCPI1_0
  ldr    r0, [r0, r3]
  bx     lr

The problem was that legalization was creating a cascade of SELECT_CC nodes for
the comparison of "argc == 1123", which was fed into a SELECT node for the ?:
statement which was itself converted to a SELECT_CC node. This is because the
ARM back-end doesn't have custom lowering for SELECT nodes, so it used the
default "Expand".

I added a fairly simple "LowerSELECT" to the ARM back-end. It takes care of this
testcase, but can obviously be expanded to include more cases.

Now we generate this, which looks optimal to me:

_t1:
  movw   r1, #1123
  movs   r2, #0
  cmp    r0, r1
  adr    r0, #LCPI0_0
  it     eq
  moveq  r2, #4
  ldr    r0, [r0, r2]
  bx     lr
  .align  2
LCPI0_0:
  .long   1075344593  @ float 2.382130e+00
  .long   1067316150  @ float 1.234000e+00

llvm-svn: 110799
2010-08-11 08:43:16 +00:00
Evan Cheng
f8604b772e Report an error if codegen tries to instantiate an ARM target when the cpu does not support it, e.g. cortex-m* processors.
llvm-svn: 110798
2010-08-11 07:17:46 +00:00
Evan Cheng
4929ba9d20 ArchV7M implies HW division instructions.
llvm-svn: 110797
2010-08-11 07:00:16 +00:00
Evan Cheng
31e15214c6 ArchV6T2, V7A, and V7M imply Thumb2; ArchV7A implies NEON.
llvm-svn: 110796
2010-08-11 06:57:53 +00:00
Evan Cheng
273160895e Add ARM ArchV6M and let it imply FeatureDB (having dmb, etc.)
llvm-svn: 110795
2010-08-11 06:51:54 +00:00
Daniel Dunbar
bc7c0a60da MC/ARM: Add basic support for handling predication by parsing it out of the mnemonic into a separate operand form.
llvm-svn: 110794
2010-08-11 06:37:20 +00:00
Daniel Dunbar
63628f1443 MC/ARM: Split mnemonic on '.' characters.
llvm-svn: 110793
2010-08-11 06:37:16 +00:00
Daniel Dunbar
bbaa88a848 MC/ARM: Fill in ARMOperand::dump a bit.
llvm-svn: 110792
2010-08-11 06:37:12 +00:00
Daniel Dunbar
4882ffc373 llvm-mc: Add -show-inst-operands, for dumping the parsed instruction representation before matching.
llvm-svn: 110791
2010-08-11 06:37:09 +00:00
Daniel Dunbar
ee80a239ed MCAsmParser: Add dump() hook to MCParsedAsmOperand.
llvm-svn: 110790
2010-08-11 06:37:04 +00:00
Daniel Dunbar
74ed9321a3 MC/ARM: Add an ARMOperand class for condition codes.
llvm-svn: 110788
2010-08-11 06:36:53 +00:00
Evan Cheng
e67c4c3723 Really control isel of barrier instructions with cpu feature.
llvm-svn: 110787
2010-08-11 06:36:31 +00:00
Evan Cheng
e5bab36c75 Add Cortex-M0 support. It's an ARMv6m device (no ARM mode) with some 32-bit
instructions: dmb, dsb, isb, msr, and mrs.

llvm-svn: 110786
2010-08-11 06:30:38 +00:00
Evan Cheng
5fca4ca5f9 - Add subtarget feature -mattr=+db which determines whether an ARM cpu has the
memory and synchronization barrier dmb and dsb instructions.
- Change instruction names to something more sensible (matching name of actual
  instructions).
- Added tests for memory barrier codegen.

llvm-svn: 110785
2010-08-11 06:22:01 +00:00
Daniel Dunbar
89a64ee590 MC/ARM: Switch to using the generated match functions instead of stub implementations.
llvm-svn: 110783
2010-08-11 05:24:50 +00:00
Daniel Dunbar
0d725e0080 MC/ARM: Enable generation of the ARM asm matcher, not that it can do much.
llvm-svn: 110782
2010-08-11 05:09:20 +00:00
Daniel Dunbar
8311cf950b ARM: Mark some disassembler only instructions as not available for matching --
for some reason they have a very odd MCInst form where the operands overlap, but
I haven't dug in to find out why yet.

llvm-svn: 110781
2010-08-11 04:46:13 +00:00
Daniel Dunbar
a77e3fc8d8 ARM: Quote $p in an asm string.
llvm-svn: 110780
2010-08-11 04:46:10 +00:00
Owen Anderson
0692635e68 Improve indentation.
llvm-svn: 110778
2010-08-11 04:24:25 +00:00
Bill Wendling
615aad17f7 Handle ARM compares as well as converting for ARM adds, subs, and thumb2's adds.
llvm-svn: 110762
2010-08-11 00:23:00 +00:00
Bill Wendling
735305d4d8 Mark ARM compare instructions as isCompare.
llvm-svn: 110761
2010-08-11 00:22:27 +00:00
Dan Gohman
603e66618f When analyzing loop exit conditions combined with and and or, don't
make any assumptions about when the two conditions will agree on when
to permit the loop to exit. This fixes PR7845.

llvm-svn: 110758
2010-08-11 00:12:36 +00:00
Bob Wilson
0650cceb38 Add a separate ARM instruction format for Saturate instructions.
(I discovered 2 more copies of the ARM instruction format list, bringing the
total to 4!!  Two of them were already out of sync.  I haven't yet gotten into
the disassembler enough to know the best way to fix this, but something needs
to be done.)  Add support for encoding these instructions.

llvm-svn: 110754
2010-08-11 00:01:18 +00:00
Dan Gohman
7e49302e9a Rename and reorder the arguments to isImpliedCond, for consistency and clarity.
llvm-svn: 110750
2010-08-10 23:46:30 +00:00
Evan Cheng
966ed540a6 CBZ and CBNZ are implemented.
llvm-svn: 110745
2010-08-10 23:27:11 +00:00
Bruno Cardoso Lopes
6eb24fd744 Add AVX matching patterns to Packed Bit Test intrinsics.
Apply the same approach as the SSE4.1 ptest intrinsics, but
create a new x86 node "testp" since AVX introduces
vtest{ps}{pd} instructions which set ZF and CF depending
on sign bit AND and ANDN of packed floating-point sources.

This is slightly different from what the "ptest" does.
Tests are coming with the other 256-bit intrinsics tests.

llvm-svn: 110744
2010-08-10 23:25:42 +00:00
Owen Anderson
a095d3fe73 Now that we're using ConstantRange to represent potential values, make use of that representation to
create constraints from comparisons other than eq/neq.

llvm-svn: 110742
2010-08-10 23:20:01 +00:00
Nate Begeman
713062e756 Add the minimal amount of smarts necessary to instcombine of shufflevectors to recognize
patterns generated by clang for transpose of a matrix in generic vectors.  This is made
of two parts:

1) Propagating vector extracts of hi/lo half into their users
2) Recognizing an insertion of even elements followed by the odd elements as an unpack.

Testcase to come, but this shrinks the # of shuffle instructions generated on x86 from ~40 to the minimal 8.

llvm-svn: 110734
2010-08-10 21:38:12 +00:00
Bill Wendling
c8117e507d Turn optimize compares back on with fix. We needed to test that a machine op was
a register before checking if it was defined.

llvm-svn: 110733
2010-08-10 21:38:11 +00:00
Jakob Stoklund Olesen
e0262a6e24 Give up on register class recalculation when the register is used with subreg
operands. We don't currently have a hook to provide "the largest super class of
A where all registers' getSubReg(subidx) is valid and in B".

llvm-svn: 110730
2010-08-10 21:16:16 +00:00
Dan Gohman
9abea56ce4 Revert r110718; it broke clang-i386-darwin9.
llvm-svn: 110726
2010-08-10 20:49:33 +00:00
Jakob Stoklund Olesen
6036d58c40 Avoid editing the current live interval during remat.
The live interval may be used for a spill slot as well, and that spill slot
could be shared by split registers. We cannot shrink it, even if we know the
current register won't need the spill slot in that range.

llvm-svn: 110721
2010-08-10 20:45:07 +00:00
Jakob Stoklund Olesen
d3f939d7a2 More debug spew
llvm-svn: 110720
2010-08-10 20:45:01 +00:00
Bill Wendling
0acc8f8a02 Turn optimize cmps on by default so that we can get some testing by the nightly
ARM testers.

llvm-svn: 110718
2010-08-10 20:23:02 +00:00
Devang Patel
17606414a4 Add missing argument. CreateCompositeTypeEx() users, please verify.
llvm-svn: 110717
2010-08-10 20:22:49 +00:00
Owen Anderson
a3a2b79f01 Switch over to using ConstantRange to track integral values.
llvm-svn: 110714
2010-08-10 20:03:09 +00:00
Devang Patel
9b12559c4f Do not forget debug info for enums. Use named mdnode to keep track of these types.
llvm-svn: 110712
2010-08-10 20:01:20 +00:00
Evan Cheng
784a286b92 Delete some unused instructions.
llvm-svn: 110710
2010-08-10 19:36:22 +00:00
Evan Cheng
d9a1b0d046 Re-apply r110655 with fixes. Epilogue must restore sp from fp if the function stack frame has a var-sized object.
Also added a test case to check for the added benefit of this patch: it's optimizing away the unnecessary restore of sp from fp for some non-leaf functions.

llvm-svn: 110707
2010-08-10 19:30:19 +00:00
Jakob Stoklund Olesen
245a1faf76 Implement register class inflation.
When splitting a live range, the new registers have fewer uses and the
permissible register class may be less constrained. Recompute the register class
constraint from the uses of new registers created for a split. This may let them
be allocated from a larger set, possibly avoiding a spill.

llvm-svn: 110703
2010-08-10 18:37:40 +00:00
Daniel Dunbar
872e84afb5 Revert r110655, "Fix ARM hasFP() semantics. It should return true whenever FP
register is", it breaks a couple test-suite tests.

llvm-svn: 110701
2010-08-10 18:32:02 +00:00
Daniel Dunbar
f7233103ff MC/AsmParser: Fix a bug in macro argument parsing, which was dropping
parentheses from argument lists.

llvm-svn: 110692
2010-08-10 17:38:52 +00:00
Jakob Stoklund Olesen
e51a747336 Recalculate the spill weight and allocation hint for virtual registers created
during live range splitting.

llvm-svn: 110686
2010-08-10 17:07:22 +00:00
Michael J. Spencer
26a45fe57e Revert "MC/MachO: Fix possible null pointer dereference."
This reverts commit 110575.

Target.isAbsolute() is true if SD would be null.

llvm-svn: 110683
2010-08-10 16:00:49 +00:00
Chandler Carruth
69dd8d673c Mark this variable as used.
llvm-svn: 110667
2010-08-10 10:39:25 +00:00
Tobias Grosser
7b96737b7f RegionInfo: Do not assert if a BB is not part of the dominance tree.
llvm-svn: 110665
2010-08-10 09:54:35 +00:00
Devang Patel
84f48b5483 Handle TAG_constant for integers.
llvm-svn: 110656
2010-08-10 07:11:13 +00:00
Evan Cheng
3d47dbe761 Fix ARM hasFP() semantics. It should return true whenever FP register is
reserved, not available for general allocation. This eliminates all the
extra checks for Darwin.

This change also fixes the use of FP to access frame indices in leaf
functions and cleaned up some confusing code in epilogue emission.

llvm-svn: 110655
2010-08-10 06:26:49 +00:00
Bill Wendling
eb1b0564a7 Update CMake...sorry for the breakage.
llvm-svn: 110654
2010-08-10 05:16:06 +00:00
Devang Patel
8edc7575a4 Simplify.
llvm-svn: 110653
2010-08-10 04:12:17 +00:00
Devang Patel
191ed4c41d Drop "const". It does not add value here.
llvm-svn: 110652
2010-08-10 04:09:06 +00:00
Bruno Cardoso Lopes
f1928b60c0 Add AVX movnt{pd,ps,dq} 256-bit intrinsics
llvm-svn: 110650
2010-08-10 02:49:24 +00:00
Evan Cheng
7aaf83ba17 Add missing null check reported by Amaury Pouly.
llvm-svn: 110649
2010-08-10 02:39:45 +00:00
Bruno Cardoso Lopes
f5884c6791 Add AVX movmsk 256-bit intrinsics
llvm-svn: 110648
2010-08-10 02:34:56 +00:00
Bruno Cardoso Lopes
2a7ed4b5c9 Support AVX 256-bit load and store intrinsics
llvm-svn: 110645
2010-08-10 01:43:16 +00:00
Devang Patel
ef61383343 Do not include file static variable in pubnames list.
Refactor and simplify code to avoid redundant checks.

llvm-svn: 110642
2010-08-10 01:37:23 +00:00
Chris Lattner
838e42928b remove code setting rw locks to PTHREAD_PROCESS_PRIVATE, which
is the default.  Patch by NAKAMURA Takumi!

llvm-svn: 110636
2010-08-10 00:34:06 +00:00
Bruno Cardoso Lopes
1ea37cfa7b Patterns to match AVX cmp instructions
llvm-svn: 110633
2010-08-10 00:13:20 +00:00
Jakob Stoklund Olesen
1ab2fab3af Transpose the calculation of spill weights such that we are calculating one
register at a time. This turns out to be slightly faster than iterating over
instructions, but more importantly, it allows us to compute spill weights for
new registers created after the spill weight pass has run.

Also compute the allocation hint at the same time as the spill weight. This
allows us to use the spill weight as a cost metric for copies, and choose the
most profitable hint if there is more than one possibility.

The new hints provide a very small (< 0.1%) but universal code size improvement.

llvm-svn: 110631
2010-08-10 00:02:26 +00:00
Bruno Cardoso Lopes
4e8d77892c Add matching patterns for vblend AVX intrinsics
llvm-svn: 110630
2010-08-10 00:02:05 +00:00
Bill Wendling
8a7a43a1cb Merge the OptimizeExts and OptimizeCmps passes into one PeepholeOptimizer
pass. This pass should expand with all of the small, fine-grained optimization
passes to reduce compile time and increase happiness.

llvm-svn: 110627
2010-08-09 23:59:04 +00:00
Devang Patel
c5f885e228 Undo accidental commit.
llvm-svn: 110623
2010-08-09 23:28:52 +00:00
Devang Patel
4ccbd73f20 Simplify. Avoid redundant checks.
llvm-svn: 110621
2010-08-09 23:26:06 +00:00