llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-15 15:48:38 +00:00

Author	SHA1	Message	Date
Chris Lattner	1eeea0d262	add some more checks for vector compares. llvm-svn: 75024	2009-07-08 18:51:25 +00:00
Chris Lattner	182817004d	convert a test to "FileCheck" style. llvm-svn: 75023	2009-07-08 18:48:24 +00:00
Bob Wilson	3809b333de	Implement NEON vld1 instructions. llvm-svn: 75019	2009-07-08 18:11:30 +00:00
David Goodwin	d19a9aa207	Add rev16 test... xfail for now llvm-svn: 75012	2009-07-08 16:15:06 +00:00
David Goodwin	5bdef4b3f7	Checkpoint Thumb2 Instr info work. Generalized base code so that it can be shared between ARM and Thumb2. Not yet activated because register information must be generalized first. llvm-svn: 75010	2009-07-08 16:09:28 +00:00
Nick Lewycky	d46a7b2d22	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Chris Lattner	c153b998d9	eliminate the v[if]cmp versions of these tests, now that [if]cmp+sext works. llvm-svn: 74980	2009-07-08 00:49:35 +00:00
Chris Lattner	2939f0a318	Change these tests to use [fi]cmp+sext instead of v[fi]cmp. No functionality change. llvm-svn: 74979	2009-07-08 00:46:57 +00:00
Chris Lattner	ea7bd9b484	dag combine sext(setcc) -> vsetcc before legalize. To make this safe, VSETCC must define all bits, which is different than it was documented to before. Since all targets that implement VSETCC already have this behavior, and we don't optimize based on this, just change the documentation. We now get nice code for vec_compare.ll llvm-svn: 74978	2009-07-08 00:31:33 +00:00
Chris Lattner	e2435c4f6f	add support for legalizing an icmp where the result is illegal (4xi1) but the input is legal (4 x i32) llvm-svn: 74964	2009-07-07 23:03:54 +00:00
Chris Lattner	1de1b3155b	add a trivial test that vector compares work. llvm-svn: 74963	2009-07-07 22:51:09 +00:00
Chris Lattner	a754f344b2	implement support for splitting and scalarizing vector setcc's. This finishes off enough support for vector compares to get the icmp/fcmp version of 2008-07-23-VSetCC.ll passing. llvm-svn: 74961	2009-07-07 22:47:46 +00:00
Chris Lattner	573a3eeda2	verify that the fcmp version of this works just as well as the vfcmp version. We actually get better code for this silly testcase. llvm-svn: 74954	2009-07-07 22:07:47 +00:00
Evan Cheng	393e38e44b	Add Thumb2 movcc instructions. llvm-svn: 74946	2009-07-07 20:39:03 +00:00
Evan Cheng	37abb8fc28	Add missing tests. llvm-svn: 74945	2009-07-07 20:38:08 +00:00
Chris Lattner	2bea79b45b	Implement parsing support for the .comm directive. Patch by Kevin Enderby! llvm-svn: 74944	2009-07-07 20:30:46 +00:00
Evan Cheng	fa864ab886	Add Thumb2 pkhbt / pkhtb. llvm-svn: 74895	2009-07-07 05:35:52 +00:00
Evan Cheng	46b98516f6	Add some more Thumb2 multiplication instructions. llvm-svn: 74889	2009-07-07 01:17:28 +00:00
Evan Cheng	5a279bb4b2	Add bfc to armv6t2. llvm-svn: 74868	2009-07-06 22:23:46 +00:00
Evan Cheng	2570d8b541	Added ARM::mls for armv6t2. llvm-svn: 74866	2009-07-06 22:05:45 +00:00
Evan Cheng	29ce3bfbb8	Avoid adding a duplicate def. This fixes PR4478. llvm-svn: 74857	2009-07-06 21:34:05 +00:00
Stuart Hastings	7ab35df2d2	Mark this test as Darwin only. Patch by Bill Wendling. llvm-svn: 74829	2009-07-06 15:36:23 +00:00
Nick Lewycky	d9573d899d	When comparing constants, consider a less wide constant to be "less complex" than a wider one, before trying to compare their contents which will crash if their sizes are different. llvm-svn: 74792	2009-07-04 17:24:52 +00:00
Evan Cheng	f20e4fba49	Add thumb2 sign / zero extend with rotate instructions. llvm-svn: 74755	2009-07-03 01:43:10 +00:00
Evan Cheng	162bd9cead	Added indexed stores. llvm-svn: 74740	2009-07-03 00:06:39 +00:00
Evan Cheng	fcab8e743a	Sign extending pre/post indexed loads. llvm-svn: 74736	2009-07-02 23:16:11 +00:00
Chris Lattner	54c0359890	do not try to analyze bitcasts from i64 to <2 x i32> in ComputeMaskedBits. While we could do this, doing so requires adjusting the demanded mask and the code isn't doing that yet. This fixes PR4495 llvm-svn: 74699	2009-07-02 16:04:08 +00:00
Evan Cheng	dad6a41d14	Thumb2 pre/post indexed loads. llvm-svn: 74696	2009-07-02 07:28:31 +00:00
Chris Lattner	4cddab0f14	@GOTPCREL is also rip-relative. Fix fast-isel to do the right thing. This fixes an llvm-gcc bootstrap problem I introduced. llvm-svn: 74691	2009-07-02 04:22:01 +00:00
Chris Lattner	e703feb0ac	Fix yet-another bug I introduced into fastisel, this time handling constant pool references that weren't getting properly rip-relative. llvm-svn: 74689	2009-07-02 03:14:25 +00:00
Daniel Dunbar	298ac2d87c	llvm-mc/x86: Test case for x86 operand parsing. llvm-svn: 74688	2009-07-02 02:28:23 +00:00
Dan Gohman	e3b1f9e14b	Fix an instcombine abort on a scalar-to-vector bitcast. This fixes PR4487. llvm-svn: 74646	2009-07-01 21:38:46 +00:00
Devang Patel	86e8a3a0a8	new test case llvm-svn: 74633	2009-07-01 19:40:59 +00:00
Devang Patel	905d32435c	Support stand alone metadata syntax. !0 = constant metadata !{i32 21, i32 22} @llvm.blah = constant metadata !{i32 1000, i16 200, metadata !0} llvm-svn: 74630	2009-07-01 19:21:12 +00:00
Chris Lattner	b51c7950bf	Fix codegen for references to available_externally symbols. This fixes PR4482. llvm-svn: 74613	2009-07-01 16:53:44 +00:00
Stuart Hastings	4db03551ba	Ensure that __block_holder_tmp is allocated on the stack when a byref variable is present. llvm-svn: 74610	2009-07-01 15:40:10 +00:00
Evan Cheng	e6989735a6	CommuteChangesDestination() should check if to-be-commuted instruction defines any register. Also teaches the default commuteInstruction() to commute instruction without definitions (e.g. X86::test / ARM::tsp). llvm-svn: 74602	2009-07-01 08:29:08 +00:00
Evan Cheng	7d78cb531e	Remove special handling of implicit_def. Fix a couple more bugs in liveintervalanalysis and coalescer handling of implicit_def. Note, isUndef marker must be placed even on implicit_def def operand or else the scavenger will not ignore it. This is necessary because -O0 path does not use liveintervalanalysis, it treats implicit_def just like any other def. llvm-svn: 74601	2009-07-01 08:19:36 +00:00
Chris Lattner	2bbdc61f92	Fix some fast-isel problems selecting global variable addressing in pic mode. llvm-svn: 74582	2009-07-01 03:27:19 +00:00
Evan Cheng	37503e9671	Handle IMPLICIT_DEF with isUndef operand marker, part 2. This patch moves the code to annotate machineoperands to LiveIntervalAnalysis. It also adds markers for implicit_def that define physical registers. The rest is just a lot of details. llvm-svn: 74580	2009-07-01 01:59:31 +00:00
David Goodwin	19aa5c7d51	Add PIC load and store patterns for Thumb-2. llvm-svn: 74577	2009-07-01 00:01:13 +00:00
Daniel Dunbar	d0589c59b3	llvm-mc: Symbols in a relocatable expression of the (a - b + cst) form are allowed to be undefined when the expression is seen, we cannot enforce the same-section requirement until the entire assembly file has been seen. llvm-svn: 74565	2009-06-30 22:49:27 +00:00
David Goodwin	5805e9aef5	Add thumb-2 store word, halfword, and byte. llvm-svn: 74555	2009-06-30 22:11:34 +00:00
David Goodwin	aad223dd8a	Improve Thumb-2 jump table support. llvm-svn: 74549	2009-06-30 19:50:22 +00:00
Rafael Espindola	340632e814	Fix PR4485. Avoid unnecessary duplication of operand 0 of X86::FpSET_ST0_80. This duplication would cause one register to remain on the stack at the function return. llvm-svn: 74534	2009-06-30 16:40:03 +00:00
Rafael Espindola	33b0aa0274	Fix PR4484. This was caused by me confounding FP0 and ST(0). llvm-svn: 74523	2009-06-30 12:18:16 +00:00
Evan Cheng	28b9e77f19	Temporarily restore the scavenger implicit_def checking code. MachineOperand isUndef mark is not being put on implicit_def of physical registers (created for parameter passing, etc.). llvm-svn: 74519	2009-06-30 09:19:42 +00:00
Evan Cheng	c6c942b70f	Add a bit IsUndef to MachineOperand. This indicates the def / use register operand is defined by an implicit_def. That means it can def / use any register and passes (e.g. register scavenger) can feel free to ignore them. The register allocator, when it allocates a register to a virtual register defined by an implicit_def, can allocate any physical register without worrying about overlapping live ranges. It should mark all of the operands of the said virtual register so later passes will do the right thing. This is not the best solution. But it should be a lot less fragile than having the scavenger try to track what is defined by implicit_def. llvm-svn: 74518	2009-06-30 08:49:04 +00:00
Evan Cheng	2a527c3419	A few more load instructions. llvm-svn: 74500	2009-06-30 02:15:48 +00:00
Andreas Bolka	71ef361d1d	Array accesses are independent if the underlying arrays differ. llvm-svn: 74499	2009-06-30 02:12:10 +00:00
Daniel Dunbar	2d92d62fb2	llvm-mc: Accept relocatable expressions for .org, assignments, .byte, etc. llvm-svn: 74498	2009-06-30 02:10:03 +00:00
David Goodwin	6ed9f9c9c5	Enhance tests to include shifted-register operand testing. llvm-svn: 74490	2009-06-30 01:02:20 +00:00
Daniel Dunbar	e4fd18eda8	llvm-mc: Parse symbol attribute directives. llvm-svn: 74487	2009-06-30 00:33:19 +00:00
Daniel Dunbar	72c0434597	llvm-mc: Parse .{,b,p2}align{,w,l} directives. llvm-svn: 74478	2009-06-29 23:46:59 +00:00
David Goodwin	4f53387d26	Add Thumb-2 support for TEQ and TST. llvm-svn: 74468	2009-06-29 22:49:42 +00:00
David Goodwin	e7df96eedf	Thumb-2 tests llvm-svn: 74464	2009-06-29 22:25:22 +00:00
Daniel Dunbar	fca88cf26b	llvm-mc: Recognize C++ style comments. llvm-svn: 74463	2009-06-29 22:00:57 +00:00
Daniel Dunbar	ade4f03bd0	MC: Improve expression parsing and implement evaluation of absolute expressions. llvm-svn: 74448	2009-06-29 20:37:27 +00:00
Dan Gohman	d496b35af0	Don't cache PHI exit values from exhaustive evaluations, because an individual exhaustive evaluation reflects only the exit value implied by an individual exit, which may differ from the actual exit value of the loop if there are other exits. This fixes PR4477. llvm-svn: 74447	2009-06-29 20:34:13 +00:00
Rafael Espindola	a0fdda93be	FIX PR 4459. Not sure I understand how the temp register gets used, but this fixes a bug and introduces no regressions. llvm-svn: 74446	2009-06-29 20:29:59 +00:00
David Greene	f39b82b3b3	Add some tests of advanced TableGen list functionality. llvm-svn: 74445	2009-06-29 20:07:17 +00:00
David Goodwin	9e1280adf3	Rename ARMcmpNZ to ARMcmpZ and use it to represent comparisons that set only the Z flag (i.e. eq and ne). Make ARMcmpZ commutative. llvm-svn: 74423	2009-06-29 15:33:01 +00:00
Evan Cheng	093adf3ff9	Implement Thumb2 ldr. After much back and forth, I decided to deviate from ARM design and split LDR into 4 instructions (r + imm12, r + imm8, r + r << imm12, constantpool). The advantage of this is 1) it follows the latest ARM technical manual, and 2) makes it easier to reduce the width of the instruction later. The down side is this creates more inconsistency between the two sub-targets. We should split ARM LDR instruction in a similar fashion later. I've added a README entry for this. llvm-svn: 74420	2009-06-29 07:51:04 +00:00
Andreas Bolka	bed5622fb7	Print pairwise dependence results, add testcases. llvm-svn: 74402	2009-06-28 00:35:22 +00:00
Dan Gohman	28702fab4e	Don't try to split a loop when the controlling icmp instruction doesn't have an IV-based operand. This fixes PR4471. llvm-svn: 74399	2009-06-27 22:58:27 +00:00
Dan Gohman	8d2a45fadb	Teach LoopSimplify how to merge multiple loop exits into a single exit, when one of them can be converted to a trivial icmp and conditional branch. This addresses what is essentially a phase ordering problem. SimplifyCFG knows how to do this transformation, but it doesn't do so if the primary block has any instructions in it other than an icmp and a branch. In the given testcase, the block contains other instructions, however they are loop-invariant and can be hoisted. SimplifyCFG doesn't have LoopInfo though, so it can't hoist them. And, it's important that the blocks be merged before LoopRotation, as it doesn't support multiple-exit loops. llvm-svn: 74396	2009-06-27 21:30:38 +00:00
Dan Gohman	4acfd5098d	When a value is used multiple times within a single PHI, instructions inserted to replace that value must dominate all of the basic blocks associated with the uses of the value in the PHI, not just one of them. llvm-svn: 74376	2009-06-27 05:16:57 +00:00
Chris Lattner	e711b85035	factor some logic out into a helper function, allow remat of loads from constant globals. This implements remat-constant.ll even without aggressive-remat. llvm-svn: 74373	2009-06-27 04:38:55 +00:00
Chris Lattner	19eb0dad26	Reimplement rip-relative addressing in the X86-64 backend. The new implementation primarily differs from the former in that the asmprinter doesn't make a zillion decisions about whether or not something will be RIP relative or not. Instead, those decisions are made by isel lowering and propagated through to the asm printer. To achieve this, we: 1. Represent RIP relative addresses by setting the base of the X86 addr mode to X86::RIP. 2. When ISel Lowering decides that it is safe to use RIP, it lowers to X86ISD::WrapperRIP. When it is unsafe to use RIP, it lowers to X86ISD::Wrapper as before. 3. This removes isRIPRel from X86ISelAddressMode, representing it with a basereg of RIP instead. 4. The addressing mode matching logic in isel is greatly simplified. 5. The asmprinter is greatly simplified, notably the "NotRIPRel" predicate passed through various printoperand routines is gone now. 6. The various symbol printing routines in asmprinter now no longer infer when to emit (%rip), they just print the symbol. I think this is a big improvement over the previous situation. It does have two small caveats though: 1. I implemented a horrible "no-rip" modifier for the inline asm "P" constraint modifier. This is a short term hack, there is a much better, but more involved, solution. 2. I had to xfail an -aggressive-remat testcase because it isn't handling the use of RIP in the constant-pool reading instruction. This specific test is easy to fix without -aggressive-remat, which I intend to do next. llvm-svn: 74372	2009-06-27 04:16:01 +00:00
Chris Lattner	aef726f8b9	remove some unneeded eh info. llvm-svn: 74371	2009-06-27 04:07:31 +00:00
Chris Lattner	3e94ce2426	testcase for PR4466 llvm-svn: 74367	2009-06-27 01:33:35 +00:00
David Goodwin	90fc344e41	When possible, use "mvn ra, rb" instead of "eor ra, rb, -1" because mvn has a narrow version and eor(i) does not. llvm-svn: 74355	2009-06-26 23:13:13 +00:00
Dan Gohman	49b2ecafe7	Add some testcases for some of the recent ScalarEvolution bug fixes. llvm-svn: 74353	2009-06-26 22:54:11 +00:00
David Goodwin	4997a459c7	Thumb-2 tests llvm-svn: 74345	2009-06-26 22:37:07 +00:00
Chris Lattner	4384816259	remove unwind info, add test for asmprinting of jump table labels with (%rip) llvm-svn: 74337	2009-06-26 22:16:49 +00:00
Evan Cheng	016ed65455	Add x86 support for 'n' inline asm modifier. This will be handled target independently as part of MC work. llvm-svn: 74336	2009-06-26 22:00:19 +00:00
David Goodwin	921faa64cd	Thumb-2 has CLZ. llvm-svn: 74322	2009-06-26 20:47:43 +00:00
David Goodwin	9da977f216	Use "adcs/sbcs" only when the carry-out is live, otherwise use "adc/sbc". llvm-svn: 74321	2009-06-26 20:45:56 +00:00
Daniel Dunbar	23337e07ce	More spelling Count as count. llvm-svn: 74306	2009-06-26 18:35:07 +00:00
Daniel Dunbar	fa37f8bf95	Spell Count as count. llvm-svn: 74298	2009-06-26 18:21:54 +00:00
David Goodwin	4a98908300	Add Thumb-2 tests. llvm-svn: 74295	2009-06-26 18:10:30 +00:00
David Goodwin	46eb5a7a2d	ADC used to implement adde should use "adcs" opcode instead of "adc". llvm-svn: 74293	2009-06-26 18:07:25 +00:00
David Goodwin	b2c485c6bd	ORN and BIC tests. llvm-svn: 74289	2009-06-26 16:20:06 +00:00
David Goodwin	877790aa5f	Currently there is a pattern for the thumb-2 MOV 16-bit immediate instruction. That instruction cannot write the flags so it should use T2I instead of T2sI. Also, added a pattern for the thumb-2 MOV of shifted immediate since that can encode immediates not encodable by the 16-bit immediate. llvm-svn: 74288	2009-06-26 16:10:07 +00:00
Evan Cheng	7883ae3121	Fix tests: Count -> count. llvm-svn: 74282	2009-06-26 07:05:57 +00:00
Evan Cheng	da10be895c	Fix a CodeGenDAGPatterns bug. Check if top level predicates match when it's looking for duplicates. llvm-svn: 74276	2009-06-26 05:59:16 +00:00
Shantonu Sen	eb968301d7	Clarify how to configure llvm-gcc-4.2 for use with test suite. Remove documentation for --with-f2c, which is no longer supported. Remove information about obtaining tcl/expect, which ship with Mac OS X by default since 10.4. llvm-svn: 74271	2009-06-26 05:44:53 +00:00
Devang Patel	a7a5664fbb	Let's ignore MDStrings also! llvm-svn: 74255	2009-06-26 02:26:12 +00:00
Daniel Dunbar	be7f3311ad	Fix spelling of 'count' llvm-svn: 74249	2009-06-26 01:33:02 +00:00
Dan Gohman	ba8760719f	Fix LCSSA to avoid emitting a PHI node for the unwind destination of an invoke instruction, since the value isn't really live across that edge. llvm-svn: 74242	2009-06-26 00:31:13 +00:00
Evan Cheng	4ac765118d	Select ADC, SBC, and RSC instead of the ADCS, SBCS, and RSCS when the carry bit def is not used. llvm-svn: 74228	2009-06-25 23:34:10 +00:00
David Goodwin	74414108e9	Use MVN for ~t2_so_imm immediates. llvm-svn: 74223	2009-06-25 23:11:21 +00:00
Daniel Dunbar	bfc585a86e	MC: Parse .org directives. llvm-svn: 74218	2009-06-25 22:44:51 +00:00
Bill Wendling	f962a1df07	Don't grep the -debug output. This isn't the way to test changes. llvm-svn: 74211	2009-06-25 21:59:32 +00:00
Daniel Dunbar	d7a883b5e5	MC: Parse .set and assignments. llvm-svn: 74208	2009-06-25 21:56:11 +00:00
Chris Lattner	f035685176	down with unwind info :) llvm-svn: 74206	2009-06-25 21:48:17 +00:00
Evan Cheng	0cced3daa8	ISD::ADDE / ISD::SUBE updates the carry bit so they should isel to ADCS and SBCS / RSCS. llvm-svn: 74200	2009-06-25 20:59:23 +00:00
Devang Patel	d98b076409	No need to code gen MDNodes llvm-svn: 74150	2009-06-25 00:47:42 +00:00
Evan Cheng	e8c58ee743	Add Thumb2 pc relative add. llvm-svn: 74141	2009-06-24 23:47:58 +00:00
Daniel Dunbar	5cf59dd842	Basic .s parsing for .asci[iz], .fill, .space, {.byte, .short, ... } - Includes some DG tests in test/MC/AsmParser, which are rather primitive since we don't have a -verify mode yet. llvm-svn: 74139	2009-06-24 23:30:00 +00:00
Evan Cheng	f7814163db	We should run these tests as well. llvm-svn: 74121	2009-06-24 21:36:26 +00:00
Chris Lattner	6e06dc1168	unwind info not needed. llvm-svn: 74112	2009-06-24 19:48:04 +00:00
Dan Gohman	b4e1f166e1	Simplify [su]max(MAX, n) to MAX. This comes up in loop tripcount computations in loops with multiple exits. Adjust the testcase for PR4436 so that the relevant portion isn't optimized away. llvm-svn: 74073	2009-06-24 14:46:22 +00:00
Dan Gohman	c2c1e1ff38	When inserting code into a loop preheader, insert it before the terminator, instead of after the last phi. This fixes a bug exposed by ScalarEvolution analyzing more kinds of loops. This fixes PR4436. llvm-svn: 74072	2009-06-24 14:31:06 +00:00
Evan Cheng	b4139189b0	Move thumb and thumb2 tests into separate directories. llvm-svn: 74068	2009-06-24 06:36:07 +00:00
Evan Cheng	7292cadf06	Fix support for inline asm input / output operand tying when operand spans across multiple registers (e.g. two i64 operands in 32-bit mode). llvm-svn: 74053	2009-06-24 02:05:51 +00:00
Dan Gohman	60844e20f0	Add a testcase demoing some of ScalarEvolution's new trip count logic. llvm-svn: 74049	2009-06-24 01:22:30 +00:00
Dan Gohman	4f4bda36df	Extend ScalarEvolution's multiple-exit support to compute exact trip counts in more cases. Generalize ScalarEvolution's isLoopGuardedByCond code to recognize And and Or conditions, splitting the code out into an isNecessaryCond helper function so that it can evaluate Ands and Ors recursively, and make SCEVExpander be much more aggressive about hoisting instructions out of loops. test/CodeGen/X86/pr3495.ll has an additional instruction now, but it appears to be due to an arbitrary register allocation difference. llvm-svn: 74048	2009-06-24 01:18:18 +00:00
Devang Patel	13317bfc64	upgrade llvm-svn: 74016	2009-06-23 22:11:37 +00:00
Evan Cheng	eaad82627b	Proper patterns for thumb2 shift and rotate instructions. llvm-svn: 73987	2009-06-23 19:39:13 +00:00
Dale Johannesen	5b05728e00	The correct values here (as defined by gcc-4.2) are different for ppc; add another version of the test. llvm-svn: 73983	2009-06-23 18:42:26 +00:00
Dale Johannesen	9f516abaa5	This test only works on ppc32 if some optimization is done before generating LLVM IR; it is correct in the code as written to use 8-byte-aligned operations to copy Key in bar. Formerly the gcc inliner was run, now it isn't. I don't think it's possible to preserve this as a pure FE test. Adding -O2 lets the llvm optimizers get rid of the 8-byte-aligned stores, at least. llvm-svn: 73981	2009-06-23 18:34:37 +00:00
Dale Johannesen	3550840eb3	See test. Judging from PR 1278, at the time the test was committed, the generated code was apparently doing stores directly into the return value aggregate; now, it's doing a copy from a compiler-generated static object. That object is initialized using [4 x i8] which breaks the test. I believe this change preserves the original point of the test. Of course it would be better for the code to do stores directly into the return aggregate, but that is not what happens at -O0; the llvm optimizers seem to do that on x86 but not on ppc32, possibly because of the explicit padding (which is unavoidable). I think it must have been being done by a gcc optimizer pass before. llvm-svn: 73972	2009-06-23 18:03:49 +00:00
Dan Gohman	058143a236	Fix a bug in the trip-count computation with And/Or. If either of the sides is CouldNotCompute, the resulting exact count must be CouldNotCompute. llvm-svn: 73920	2009-06-22 23:28:56 +00:00
Bob Wilson	6db76aaf10	Add support for ARM's Advanced SIMD (NEON) instruction set. This is still a work in progress but most of the NEON instruction set is supported. llvm-svn: 73919	2009-06-22 23:27:02 +00:00
Dan Gohman	b85ac02513	Fix llvm::ComputeNumSignBits to handle pointer types conservatively correctly, instead of aborting. llvm-svn: 73908	2009-06-22 22:02:32 +00:00
Evan Cheng	2814371831	It's coalescer, not coaleser. llvm-svn: 73902	2009-06-22 21:09:17 +00:00
Bob Wilson	0c2c5f65e2	For Darwin on ARMv6 and newer, make register r9 available for use as a caller-saved register. llvm-svn: 73901	2009-06-22 21:01:46 +00:00
Evan Cheng	2410955c62	Fix another register coalescer crash: forgot to check if the instruction being updated has already been coalesced. llvm-svn: 73898	2009-06-22 20:49:32 +00:00
Evan Cheng	b37e7e24d0	hasFP should return true if frame address is taken. llvm-svn: 73893	2009-06-22 18:38:48 +00:00
Dale Johannesen	bb07cd38e3	Testcase for (llvm-gcc) 73873. Usually std::pair<double, float*> is 16 bytes on darwin-powerpc, but not always. See testcase for full weirdness. llvm-svn: 73874	2009-06-22 04:47:32 +00:00
Dan Gohman	db7860aa40	Teach ScalarEvolution how to analyze loops with multiple exit blocks, and also exit blocks with multiple conditions (combined with (bitwise) ands and ors). It's often infeasible to compute an exact trip count in such cases, but a useful upper bound can often be found. llvm-svn: 73866	2009-06-22 00:31:57 +00:00
Dan Gohman	eb0a67278b	Fix ScalarEvolution's backedge-taken count computations to check for overflow when computing an integer division to round up. Thanks to Nick Lewycky for noticing this! llvm-svn: 73862	2009-06-21 23:46:38 +00:00
Rafael Espindola	373c6bdbc5	Fix PR4185. Handle FpSET_ST0_80 being used when ST0 is still alive. llvm-svn: 73850	2009-06-21 12:02:51 +00:00
Nick Lewycky	4020821885	Expand this test to handle more cases (remainder and shifts) of zero. llvm-svn: 73839	2009-06-21 01:56:41 +00:00
Chris Lattner	affcc71da2	implement PR4424: 0/x is always 0 for integer division. llvm-svn: 73835	2009-06-21 01:15:55 +00:00
Chris Lattner	580eecebbd	change TLS_ADDR lowering to lower to a real mem operand, instead of matching as a global that gets printed with the :mem modifier. All operands to lea's should be handled with the lea32mem operand kind, and this allows the TLS stuff to do this. There are several better ways to do this, but I went for the minimal change since I can't really test this (beyond make check). This also makes the use of EBX explicit in the operand list in the 32-bit case, instead of implicit in the instruction. llvm-svn: 73834	2009-06-20 20:38:48 +00:00
Chris Lattner	965cc0e45b	no need for unwind info llvm-svn: 73832	2009-06-20 19:48:26 +00:00
Chris Lattner	33d1976328	no need for unwind info here. llvm-svn: 73831	2009-06-20 19:43:09 +00:00
Evan Cheng	b45918e5bb	Fix PR4419: handle defs of partial uses. llvm-svn: 73816	2009-06-20 04:34:51 +00:00
Dan Gohman	b60dedbf0a	Tweak this test to be a little less unusual. llvm-svn: 73808	2009-06-20 00:40:56 +00:00
Dan Gohman	29100270c0	Generalize isLoopGuardedByCond's checking to consider two SCEVUnknowns with identical Instructions to be equal. This allows it to analyze cases such as the attached testcase, where the front-end has cloned the loop controlling expression. Along with r73805, this lets IndVarSimplify eliminate all the sign-extend casts in the loop in the attached testcase. llvm-svn: 73807	2009-06-20 00:35:32 +00:00
Dan Gohman	651faa1905	Re-apply r73718, now that the fix in r73787 is in, and add a hand-crafted testcase which demonstrates the bug that was exposed in 254.gap. llvm-svn: 73793	2009-06-19 23:23:27 +00:00
Evan Cheng	f18de63563	Enable arm pre-allocation load / store multiple optimization pass. llvm-svn: 73791	2009-06-19 23:17:27 +00:00
Evan Cheng	b90241ac42	Revert 73718. It's breaking 254.gap. llvm-svn: 73783	2009-06-19 21:15:06 +00:00
Dan Gohman	d920fdb643	Don't (unconditionally) use getSCEVAtScope to simplify the step expression in IVUsers, because in the case of a use of a non-linear addrec outside of a loop, this causes the addrec to be evaluated as a linear addrec. llvm-svn: 73774	2009-06-19 17:33:15 +00:00
Chris Lattner	8f6f044afd	make jump threading handle lexically identical compare instructions as if they were multiple uses of the same instruction. This interacts well with the existing loadpre that j-t does to open up many new jump threads earlier. llvm-svn: 73768	2009-06-19 16:27:56 +00:00
Eli Friedman	5cccb60bad	Fix for PR2484: add an SSE1 pattern for a shuffle we normally prefer to handle with an SSE2 instruction. llvm-svn: 73760	2009-06-19 07:00:55 +00:00
Nick Lewycky	a5f89b09c6	Teach jump threading to look at comparisons between phi nodes and non-constants. llvm-svn: 73755	2009-06-19 04:56:29 +00:00
Chris Lattner	8ddc06469c	Improve tail call elim to move loads above readonly calls when it allows forming a tail call. Patch by Frits van Bommel. This implements PR4323. llvm-svn: 73752	2009-06-19 04:22:16 +00:00
Chris Lattner	3a683c551f	part of PR4405: disable a contentious optimization for strcmp -> memcmp when the lengths of the strings are unknown. Patch by Nick Lewycky! llvm-svn: 73751	2009-06-19 04:17:36 +00:00
Eli Friedman	003abaa60d	Mark a few Thumb instructions commutable; just happened to spot this while experimenting. I'm reasonably sure this is correct, but please tell me if these instructions have some strange property which makes this change unsafe. llvm-svn: 73746	2009-06-19 01:43:08 +00:00
Chris Lattner	cce5b90ede	new testcase for rdar://6983634 llvm-svn: 73729	2009-06-18 22:53:26 +00:00
Evan Cheng	6c1c55f942	On Darwin, asm printer should output a second label before a jump table so the linker knows it's a new atom. But this is only needed if the jump table is put in a separate section from the function body. llvm-svn: 73720	2009-06-18 20:37:15 +00:00
Dan Gohman	da82dc2ec1	Generalize LSR's OptimizeSMax to handle unsigned max tests as well as signed max tests. Along with r73717, this helps CodeGen avoid emitting code for a maximum operation for this class of loop. llvm-svn: 73718	2009-06-18 20:23:18 +00:00
Dan Gohman	fd857b0406	Remove the code from IVUsers that attempted to handle casted induction variables in cases where the cast isn't foldable. It ended up being a pessimization in many cases. This could be fixed, but it would require a bunch of complicated code in IVUsers' clients. The advantages of this approach aren't visible enough to justify it at this time. llvm-svn: 73706	2009-06-18 16:54:06 +00:00
Dan Gohman	dc884a7830	Generalize the zext(trunc(t) & C) instcombine to work even when C is not a low-bits mask, and add a similar instcombine for zext((trunc(t) & C) ^ C). llvm-svn: 73705	2009-06-18 16:30:21 +00:00
Dan Gohman	e72fd546a2	Teach ScalarEvolution how to recognize another xor(and(x, C), C) case. If C is a single bit and the and gets analyzed as a truncate and zero-extend, the xor can be represented as an add. llvm-svn: 73664	2009-06-18 00:00:20 +00:00
Dan Gohman	1530824138	Instcombine zext(trunc(x) & mask) to x&mask, even if the trunc has multiple users. llvm-svn: 73656	2009-06-17 23:17:05 +00:00
Dan Gohman	50b7d0d843	Add -disable-output to a bunch of tests that don't care about the output. llvm-svn: 73633	2009-06-17 20:56:26 +00:00
Dale Johannesen	26f0dd9021	This fixes a bug introduced in 72661, which can move loads back past a check that the load address is valid, see new testcase. The test that went in with 72661 has exactly this case, except that the conditional it's moving past is checking something else; I've settled for changing that test to reference a global, not a pointer. It may be possible to scan all the tests you pass and make sure none of them are checking any component of the address, but it's not trivial and I'm not trying to do that here. llvm-svn: 73632	2009-06-17 20:48:23 +00:00
Anton Korobeynikov	7fd29c57a8	Initial support for some Thumb2 instructions. Patch by Viktor Kutuzov and Anton Korzh from Access Softek, Inc. llvm-svn: 73622	2009-06-17 18:13:58 +00:00
Eli Friedman	36d7ca738e	Correct an accidental duplication of the test (patch doesn't handle creating new files very well). llvm-svn: 73599	2009-06-17 03:05:00 +00:00
Eli Friedman	b3947071ff	PR3439: Correct a silly mistake in the SimplifyDemandedUseBits code for SRem. llvm-svn: 73598	2009-06-17 02:57:36 +00:00
Dan Gohman	473789f75a	Fix ScalarEvolution's Xor handling to not assume that an And that gets recognized with a SCEVZeroExtendExpr must be an And with a low-bits mask. With r73540, this is no longer the case. llvm-svn: 73594	2009-06-17 01:22:39 +00:00
Dale Johannesen	64c7072138	Test for llvm-gcc patch 73564. llvm-svn: 73565	2009-06-16 22:18:33 +00:00
Anton Korobeynikov	d6004a164c	Make the test target-neutral llvm-svn: 73547	2009-06-16 20:25:25 +00:00
Dan Gohman	54bbef1525	Generalize a few more instcombines to be vector/scalar-independent. llvm-svn: 73541	2009-06-16 19:55:29 +00:00
Dan Gohman	56b5a88785	Instcombine's ShrinkDemandedConstant may strip bits out of constants, obscuring what would otherwise be a low-bits mask. Use ComputeMaskedBits to compute what ShrinkDemandedConstant knew about to reconstruct a low-bits mask value. llvm-svn: 73540	2009-06-16 19:52:01 +00:00
Anton Korobeynikov	a74b8323d0	GNU as refuses to assemble "pop {}" instruction. Do not emit such (this is the case when we have thumb vararg function with single callee-saved register, which is handled separately). llvm-svn: 73529	2009-06-16 18:49:08 +00:00
Chris Lattner	f54c97c579	Testcase for r73506 llvm-svn: 73508	2009-06-16 17:23:25 +00:00
Evan Cheng	a98ff05fca	If a val# is defined by an implicit_def and it is being removed, all of the copies off the val# were removed. This causes problems later since the scavenger will see uses of registers without defs. The proper solution is to change the copies into implicit_def's instead. TurnCopyIntoImpDef turns a copy into implicit_def and removes the val# defined by it. This causes a scavenger assertion later if the def reaches other blocks. Disable the transformation if the value live interval extends beyond its def block. llvm-svn: 73478	2009-06-16 07:12:58 +00:00
Eli Friedman	6a984089f4	Add some generic expansion logic for SMULO and UMULO. Fixes UMULO support for x86, and UMULO/SMULO for many architectures, including PPC (PR4201), ARM, and Cell. The resulting expansion isn't perfect, but it's not bad. llvm-svn: 73477	2009-06-16 06:58:29 +00:00
Devang Patel	5941941827	Use MainCU if it is available. llvm-svn: 73457	2009-06-16 02:09:30 +00:00
Dan Gohman	255bcad466	Update this test to use fmul instead of mul. llvm-svn: 73436	2009-06-15 22:49:34 +00:00
Dan Gohman	2e737ac21f	Support vector casts in more places, fixing a variety of assertion failures. To support this, add some utility functions to Type to help support vector/scalar-independent code. Change ConstantInt::get and ConstantFP::get to support vector types, and add an overload to ConstantInt::get that uses a static IntegerType type, for convenience. Introduce a new getConstant method for ScalarEvolution, to simplify common use cases. llvm-svn: 73431	2009-06-15 22:12:54 +00:00
Devang Patel	1fb2606b12	Gracefully handle imbalanced inline function begin and end markers. llvm-svn: 73426	2009-06-15 21:45:50 +00:00
Evan Cheng	4b77794613	ifcvt should ignore cfg where true and false successors are the same. llvm-svn: 73423	2009-06-15 21:24:34 +00:00
Dale Johannesen	2d0be306fb	Fix the crash in this test. This is basically the same problem addressed in 31284, but the patch there only addressed the case where an invoke is the first thing in a block. llvm-svn: 73416	2009-06-15 20:59:27 +00:00
Bill Wendling	a0a5984345	This test is failing. Revert for now. llvm-svn: 73404	2009-06-15 19:10:56 +00:00
Bill Wendling	1ea00229de	Add another testcase for r71478. llvm-svn: 73399	2009-06-15 18:36:34 +00:00
Arnold Schwaighofer	6b340f9247	CheckTailCallReturnConstraints is missing a check on the incoming chain of the RETURN node. The incoming chain must be the outgoing chain of the CALL node. This causes the backend to identify tail calls that are not tail calls. This patch fixes this. llvm-svn: 73387	2009-06-15 14:43:36 +00:00
Evan Cheng	3219c7fbe5	Part 1. - Change register allocation hint to a pair of unsigned integers. The hint type is zero (which means prefer the register specified as second part of the pair) or entirely target dependent. - Allow targets to specify alternative register allocation orders based on allocation hint. Part 2. - Use the register allocation hint system to implement more aggressive load / store multiple formation. - Aggressively form LDRD / STRD. These are formed before register allocation. It has to be done this way to shorten live interval of base and offset registers. e.g. v1025 = LDR v1024, 0 v1026 = LDR v1024, 0 => v1025,v1026 = LDRD v1024, 0 If this transformation isn't done before allocation, v1024 will overlap v1025, which means it is more difficult to allocate a register pair. - Even with the register allocation hint, it may not be possible to get the desired allocation. In that case, the post-allocation load / store multiple pass must fix the ldrd / strd instructions. They can either become ldm / stm instructions or go back to a pair of ldr / str instructions. This is work in progress, not yet enabled. llvm-svn: 73381	2009-06-15 08:28:29 +00:00
Chris Lattner	52510b0788	fix testcase to properly check for the patch in r73195. llvm-svn: 73380	2009-06-15 05:46:02 +00:00
Dan Gohman	d3a8d79c0d	Implement more aggressive folding of add operand lists when they contain multiplications of constants with add operations. This helps simplify several kinds of things; in particular it helps simplify expressions like ((-1 * (%a + %b)) + %a) to %b, as expressions like this often come up in loop trip count computations. llvm-svn: 73361	2009-06-14 22:58:51 +00:00
Duncan Sands	3a4ae072d0	Testcase for PR4332. llvm-svn: 73353	2009-06-14 22:22:42 +00:00
Dan Gohman	37fef35e88	Teach SCEVExpander's visitAddRecExpr to reuse an existing canonical induction variable when the addrec to be expanded does not require a wider type. This eliminates the need for IndVarSimplify to micro-manage SCEV expansions, because SCEVExpander now automatically expands them in the form that IndVarSimplify considers to be canonical. (LSR still micro-manages its SCEV expansions, because it's optimizing for the target, rather than for other optimizations.) Also, this uses the new getAnyExtendExpr, which has more clever expression simplification logic than the IndVarSimplify code it replaces, and this cleans up some ugly expansions in code such as the included masked-iv.ll testcase. llvm-svn: 73294	2009-06-13 16:25:49 +00:00
Evan Cheng	d0a66e438f	Add an ARM-specific pre-allocation pass that reschedules loads / stores from consecutive addresses together. This makes it easier for the post-allocation pass to form ldm / stm. This is step 1. We are still missing a lot of ldm / stm opportunities because register allocation is not done in the desired order. More enhancements coming. llvm-svn: 73291	2009-06-13 09:12:55 +00:00
Devang Patel	bcc1187643	llvm.dbg.region.end() intrinsic is not required to be in _last_ basic block in a function. If that happens then any basic block that follows (lexically) the block with region.end will not have scope info available. LexicalScopeStack relies on processing basic blocks in CFG order, but this processing order is not guaranteed. Things get complicated when the optimizer gets a chance to optimize IR with dbg intrinsics. Apply defensive patch to preserve at least one lexical scope till the end of function. llvm-svn: 73282	2009-06-13 02:16:18 +00:00
Dan Gohman	67ec24b541	Adjust this test's regex strings so that they work regardless of the target's pointer size. This avoids the need for -m32 on the llvm-gcc command-line, which some targets may not support. llvm-svn: 73270	2009-06-12 23:31:14 +00:00
Dan Gohman	e27a52f9b1	Add -m32 to llvm-gcc commands, so that this test behaves as expected on systems which default to a 64-bit target. llvm-svn: 73265	2009-06-12 23:02:02 +00:00
Evan Cheng	98216808fe	If killed register is defined by implicit_def, do not clear it since its live range may overlap another def of same register. llvm-svn: 73255	2009-06-12 21:34:26 +00:00
Evan Cheng	2f784781aa	Mark some pattern-less instructions as neverHasSideEffects. llvm-svn: 73252	2009-06-12 20:46:18 +00:00
Devang Patel	8d9aa4249a	Clear AbstractInstanceRootMap at the end of the function. llvm-svn: 73244	2009-06-12 19:24:05 +00:00
Dan Gohman	f9b0419cd8	Don't do (x - (y - z)) --> (x + (z - y)) on floating-point types, because it may round differently. This fixes PR4374. llvm-svn: 73243	2009-06-12 19:23:25 +00:00
Dale Johannesen	b5be21ef41	Testcase for llvm-gcc patch 73238. llvm-svn: 73239	2009-06-12 18:41:53 +00:00
Arnold Schwaighofer	780e3addf8	Fix Bug 4278: X86-64 with -tailcallopt calling convention out of sync with regular cc. The only difference between the tail call cc and the normal cc was that one parameter register - R9 - was reserved for calling functions through a function pointer. After time the tail call cc has gotten out of sync with the regular cc. We can use R11 which is also caller saved but not used as parameter register for potential function pointers and remove the special tail call cc on x86-64. llvm-svn: 73233	2009-06-12 16:26:57 +00:00
Nick Lewycky	1e36649f95	Given two identical weak functions, produce one internal function and two weak thunks. llvm-svn: 73230	2009-06-12 15:56:56 +00:00
Nick Lewycky	cc239d7680	This test is wrong. If you have two weak functions F and G you can't make either one call the other since either one can be replaced at link time, and they need to be independent. llvm-svn: 73225	2009-06-12 13:24:41 +00:00
Nick Lewycky	61f78a2674	Fix regular expression. llvm-svn: 73221	2009-06-12 05:39:02 +00:00
Nick Lewycky	bbce41f698	Don't remove aggregate-typed module level constants before encoding functions since functions may contain aggregate constants too. llvm-svn: 73220	2009-06-12 05:20:12 +00:00
Nick Lewycky	127b1cc900	In an XFAIL line, treat "XFAIL: foobar" as a regular expression to be matched against the target triple, instead of equivalent to "XFAIL: ". llvm-svn: 73219	2009-06-12 05:18:32 +00:00
Nick Lewycky	e3b5c81cb8	XFAIL this on PPC Linux. This keeps showing up in the buildbot and isn't easy to fix, and I'd like it to stop masking real failures. llvm-svn: 73211	2009-06-11 23:43:02 +00:00
Dale Johannesen	60e261db11	Test for rev 73205 (PR 4349) llvm-svn: 73206	2009-06-11 20:48:09 +00:00
Chris Lattner	e0360f8ae8	Fix 4366: store to null in non-default addr space should not be turned into unreachable. llvm-svn: 73195	2009-06-11 17:54:56 +00:00
Daniel Dunbar	06ef64d379	Remove empty test (my DejaGNU doesn't like this) llvm-svn: 73148	2009-06-09 21:24:39 +00:00
Bill Wendling	c34ea869f5	Remove empty file. llvm-svn: 73140	2009-06-09 18:55:39 +00:00
David Greene	a51f014e59	Revert 73074 and 73099 because Windows doesn't have POSIX regular expressions. We will add an OpenBSD implementation and re-apply ASAP. llvm-svn: 73138	2009-06-09 18:31:17 +00:00
David Greene	e3c4370a47	Add a !patsubst operator. Use on string types. llvm-svn: 73099	2009-06-08 23:05:37 +00:00
Anton Korobeynikov	c82243e658	Add testcase for register scavenger assertion fix in r72755 (double def due to livevars) llvm-svn: 73096	2009-06-08 22:54:15 +00:00
David Greene	1f88852460	Add a more robust !if test. llvm-svn: 73091	2009-06-08 22:34:57 +00:00
David Greene	5b0714ad86	Fix DejaGNU run line to escape special characters. llvm-svn: 73090	2009-06-08 22:20:58 +00:00
David Greene	62a2f2fb97	Make IntInits and ListInits typed. This helps deduce types of !if and other operators. For the rare cases where a list type cannot be deduced, provide a []<type> syntax, where <type> is the list element type. llvm-svn: 73078	2009-06-08 20:23:18 +00:00
David Greene	21ba6012b2	Add a !regmatch operator to do pattern matching in TableGen. llvm-svn: 73074	2009-06-08 17:00:34 +00:00
Eli Friedman	62028b7323	Fix the run-line for this test to work correctly outside of x86. llvm-svn: 73025	2009-06-07 09:44:19 +00:00
Eli Friedman	2964aa5a38	Tweak the expansion code for BIT_CONVERT to generate better code converting from an MMX vector to an i64. llvm-svn: 73024	2009-06-07 09:41:57 +00:00
Eli Friedman	d4b463b0dc	Slightly generalize the code that handles shuffles of consecutive loads on x86 to handle more cases. Fix a bug in said code that would cause it to read past the end of an object. Rewrite the code in SelectionDAGLegalize::ExpandBUILD_VECTOR to be a bit more general. Remove PerformBuildVectorCombine, which is no longer necessary with these changes. In addition to simplifying the code, with this change, we can now catch a few more cases of consecutive loads. llvm-svn: 73012	2009-06-07 06:52:44 +00:00
Eli Friedman	2b6cb1684f	PR3628: Add patterns to match SHL/SRL/SRA to the corresponding Altivec instructions. llvm-svn: 73009	2009-06-07 01:07:55 +00:00
Eli Friedman	770f633389	PR4340: Run SimplifyDemandedVectorElts on insertelement instructions; sometimes it can find simplifications that won't be found otherwise. llvm-svn: 73006	2009-06-06 20:08:03 +00:00
Eli Friedman	2dadbd05f9	Fix the expansion for CONCAT_VECTORS so that it doesn't create illegal types. llvm-svn: 72993	2009-06-06 07:08:26 +00:00
Eli Friedman	4395222136	Avoid crashing on a variable-index insertelement with element type i16. llvm-svn: 72991	2009-06-06 06:32:50 +00:00
Eli Friedman	e546f94ef5	Get rid of some bogus patterns for X86vzmovl. Don't create VZEXT_MOVL nodes for vectors with an i16 element type. Add an optimization for building a vector which is all zeros/undef except for the bottom element, where the bottom element is an i8 or i16. llvm-svn: 72988	2009-06-06 06:05:10 +00:00
Eli Friedman	539325c8e7	Fix an obvious typo. llvm-svn: 72987	2009-06-06 05:55:37 +00:00
Eli Friedman	1227d199be	Get rid of a bogus pattern that interferes with optimization. llvm-svn: 72985	2009-06-06 04:17:04 +00:00
Eli Friedman	05eef883e8	PR2598: make sure to expand illegal forms of integer/floating-point conversions for x86, like <2 x i32> -> <2 x float> and <4 x i16> -> <4 x float>. llvm-svn: 72983	2009-06-06 03:57:58 +00:00
Devang Patel	8d170194e8	Add new function attribute - noimplicitfloat Update code generator to use this attribute and remove NoImplicitFloat target option. Update llc to set this attribute when -no-implicit-float command line option is used. llvm-svn: 72959	2009-06-05 21:57:13 +00:00
Nate Begeman	058d4eeccf	Adapt the x86 build_vector dagcombine to the current state of the legalizer. build vectors with i64 elements will only appear on 32b x86 before legalize. Since vector widening occurs during legalize, and produces i64 build_vector elements, the dag combiner is never run on these before legalize splits them into 32b elements. Teach the build_vector dag combine in x86 back end to recognize consecutive loads producing the low part of the vector. Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes since that was required implicitly. Add a testcase for the transform. Old: subl $28, %esp movl 32(%esp), %eax movl 4(%eax), %ecx movl %ecx, 4(%esp) movl (%eax), %eax movl %eax, (%esp) movaps (%esp), %xmm0 pmovzxwd %xmm0, %xmm0 movl 36(%esp), %eax movaps %xmm0, (%eax) addl $28, %esp ret New: movl 4(%esp), %eax pmovzxwd (%eax), %xmm0 movl 8(%esp), %eax movaps %xmm0, (%eax) ret llvm-svn: 72957	2009-06-05 21:37:30 +00:00
Evan Cheng	ea31ec569b	Changing allocation ordering from r3 ... r0 back to r0 ... r3. The order change no longer makes sense after the coalescing changes we have made since then. llvm-svn: 72955	2009-06-05 19:08:58 +00:00
Dan Gohman	31fc8d27b1	Fix an erroneous check for isFNeg; the FNeg case is handled a few lines later on. llvm-svn: 72904	2009-06-04 23:43:29 +00:00
Bill Wendling	60f5c8184b	Fix these so that they work on non-x86 Darwin machines. llvm-svn: 72903	2009-06-04 23:37:19 +00:00
Bill Wendling	b7c990bc90	Specify that this works for Darwin. llvm-svn: 72899	2009-06-04 22:56:29 +00:00
Dan Gohman	5f6f8101d5	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatibility, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Devang Patel	9757e4f9f3	Add new function attribute - noredzone. Update code generator to use this attribute and remove DisableRedZone target option. Update llc to set this attribute when -disable-red-zone command line option is used. llvm-svn: 72894	2009-06-04 22:05:33 +00:00
Evan Cheng	dada49d18a	RALinScan::attemptTrivialCoalescing() was returning a virtual register instead of the physical register it is allocated to. This resulted in virtual register(s) being added to the live-in sets. llvm-svn: 72890	2009-06-04 20:53:36 +00:00
Evan Cheng	8a6c448ab0	A value defined by an implicit_def can be live-in to a use BB. This is unfortunate. But register allocator still has to add it to the live-in set of the use BB. llvm-svn: 72888	2009-06-04 20:25:48 +00:00
Dale Johannesen	a9a7e5d234	For XTARGET to work on targets not in the list, there must also be an XFAIL line. Fix a couple examples of this. llvm-svn: 72876	2009-06-04 18:27:43 +00:00
Dan Gohman	05fe1217c7	Check in test changes that I accidentally left out of r72872. llvm-svn: 72875	2009-06-04 18:22:31 +00:00
Eli Friedman	11070e275f	PR3739, part 2: Use an explicit store to spill XMM registers. (Previously, the code tried to use "push", which doesn't exist for XMM registers.) llvm-svn: 72836	2009-06-04 02:32:04 +00:00
Eli Friedman	fd27229206	PR3739, part 1: Disable the red zone on Win64. llvm-svn: 72830	2009-06-04 02:02:01 +00:00
Evan Cheng	e3a05e6690	Re-apply 72756 with fixes. One of those was introduced when we changed the MachineInstrBuilder::addReg() interface. llvm-svn: 72826	2009-06-04 01:15:28 +00:00
Eli Friedman	dbf32ddf16	PR4317: Handle splits where the new block is unreachable correctly in DominatorTreeBase::Split. llvm-svn: 72810	2009-06-03 21:42:06 +00:00
Evan Cheng	b71402d6ae	For Darwin / x86_64, override -relocation-model=static to pic if the output is assembly since Darwin assembler does not really support -static codegen. I view this as a temporary workaround until the assembler / linker changes. llvm-svn: 72806	2009-06-03 21:13:54 +00:00
Dan Gohman	6e9ad19ef7	Don't attempt to simplify a non-affine IV expression if it can't be simplified to a loop-invariant value. This fixes PR4315. llvm-svn: 72798	2009-06-03 19:11:31 +00:00
Evan Cheng	4e47a019ba	Fix for PR4225: When the rewriter reuses a value in a physical register, it clears the register kill operand marker and its kill ops information. However, the cleared operand may be a def of a super-register. Clear the kill ops info for the super-register's sub-registers as well. llvm-svn: 72758	2009-06-03 09:00:27 +00:00
Evan Cheng	82f8fa333e	Temporarily revert 72756 for now. llvm-svn: 72757	2009-06-03 07:40:47 +00:00
Evan Cheng	5afbef29fa	Fold preceding / trailing base inc / dec into the single load / store as well. llvm-svn: 72756	2009-06-03 06:14:58 +00:00
Dan Gohman	609f627ed7	Revert r72734. The Darwin assembler doesn't support the static relocation model on x86-64. Higher level logic should override the relocation model to PIC on x86_64-apple-darwin. llvm-svn: 72746	2009-06-03 00:37:20 +00:00
Dan Gohman	f6e6588203	Fix CodeGenPrepare's address-mode sinking to handle unusual addresses, involving Base values which do not have Pointer type. This fixes PR4297. llvm-svn: 72739	2009-06-02 21:29:13 +00:00
Evan Cheng	7e66d61bec	On Darwin x86_64 small code model doesn't guarantee code address fits in 32-bit. llvm-svn: 72734	2009-06-02 20:09:31 +00:00
Evan Cheng	7875093e82	Avoid infinite looping in AllGlobalLoadUsesSimpleEnoughForHeapSRA(). This can happen when PHI uses are recursively dependent on each other. llvm-svn: 72710	2009-06-02 00:56:07 +00:00
Eli Friedman	2b0edc3327	PR4286: Make RewriteLoadUserOfWholeAlloca and RewriteStoreUserOfWholeAlloca deal with tail padding because isSafeUseOfBitCastedAllocation expects them to. Otherwise, we crash trying to erase the bitcast. llvm-svn: 72688	2009-06-01 09:14:32 +00:00
Owen Anderson	928040c625	Be more aggressive in doing LoadPRE by tracing backwards when a block only has a single predecessor. Patch by Jakub Staszak. llvm-svn: 72661	2009-05-31 09:03:40 +00:00
Chris Lattner	8ac63163fe	fix PR4284, a bug in simplifylibcalls handling memcmp. Patch by Benjamin Kramer! llvm-svn: 72625	2009-05-30 18:43:04 +00:00
Duncan Sands	3d77d1fcfc	Adjust these tests now that "extern inline" functions are being output with bodies and available_externally linkage. llvm-svn: 72620	2009-05-30 13:57:05 +00:00
Evan Cheng	2d198e1bc2	(i64 (zext (srl GR32 8))) -> movzbl AH is not safe since srl 8 only clears the top 8 bits. llvm-svn: 72618	2009-05-30 08:43:27 +00:00
Nick Lewycky	a9de2f1c81	Give embedded metadata its own type instead of relying on EmptyStructTy. llvm-svn: 72610	2009-05-30 05:06:04 +00:00
Duncan Sands	f4fe76d46b	Dan noticed that the verifier wasn't thoroughly checking uses of invoke results (see the testcases). Tighten up the checking. llvm-svn: 72586	2009-05-29 19:39:36 +00:00
Evan Cheng	57f85a1529	Remove an accidental commit. llvm-svn: 72560	2009-05-29 05:28:52 +00:00
Evan Cheng	550fc9ba9f	More h-registers tricks: folding zext nodes. llvm-svn: 72558	2009-05-29 01:44:43 +00:00
Evan Cheng	a36a15ff66	Do not try to create a MVT type of width 0. llvm-svn: 72557	2009-05-28 23:52:18 +00:00

7404 Commits