llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-02 00:51:56 +00:00

Author	SHA1	Message	Date
Evan Cheng	d299f09bc5	Rematerialization logic was overly conservative when it comes to loads from fixed stack slots. llvm-svn: 47529	2008-02-23 03:38:34 +00:00
Evan Cheng	6782480bd1	Update test. llvm-svn: 47527	2008-02-23 02:57:25 +00:00
Evan Cheng	4e9d5f1ead	Remat of pic loads are now on by default. llvm-svn: 47525	2008-02-23 02:08:30 +00:00
Dan Gohman	8d536a33ff	Fix a bug that caused opt and other tools to silently ignore invalid command-line options. llvm-svn: 47523	2008-02-23 01:55:25 +00:00
Devang Patel	4995a89a8d	Properly read and write bitcodes for multiple return values. llvm-svn: 47521	2008-02-23 01:44:55 +00:00
Evan Cheng	7c3a8d0056	Really. Why doesn't every arch support MMX? llvm-svn: 47513	2008-02-23 00:56:14 +00:00
Evan Cheng	3b35d2a86c	Test case for PR2082. llvm-svn: 47501	2008-02-22 20:38:49 +00:00
Evan Cheng	1b417c4d84	Allow re-materialization of pic load (controlled by -remat-pic-load for now). llvm-svn: 47476	2008-02-22 09:25:47 +00:00
Chris Lattner	a64d4179d4	copy mmx values from/to memory with GPRs on x86-32 instead of with mmx registers. This horribleness is apparently done by gcc to avoid having to insert emms in places that really should have it. This is the second half of rdar://5741668. llvm-svn: 47474	2008-02-22 05:18:04 +00:00
Chris Lattner	e70bc39d74	Start using GPR's to copy around mmx value instead of mmx regs. GCC apparently does this, and code depends on not having to do emms when this happens. This is x86-64 only so far, second half should handle x86-32. rdar://5741668 llvm-svn: 47470	2008-02-22 02:09:43 +00:00
Chris Lattner	4f87f1c087	Treat clobber operands like early clobbers: if we have any, we force sdisel to do all regalloc for an asm. This leads to gross but correct codegen. This fixes the rest of PR2078. llvm-svn: 47454	2008-02-21 19:43:13 +00:00
Tanya Lattner	8116db05a6	Remove llvm-upgrade and update tests. llvm-svn: 47432	2008-02-21 07:42:26 +00:00
Chris Lattner	2f3bffc338	testcase for PR1133 llvm-svn: 47427	2008-02-21 05:27:08 +00:00
Chris Lattner	99b5a37d39	Fix a (harmless) but where vregs were added to the used reg lists for inline asms. Fix PR2078 by marking aliases of registers used when a register is marked used. This prevents EAX from being allocated when AX is listed in the clobber set for the asm. llvm-svn: 47426	2008-02-21 04:55:52 +00:00
Devang Patel	e60ebc7744	Let invoke return aggregate value. llvm-svn: 47425	2008-02-21 02:14:01 +00:00
Devang Patel	702c4a60a6	Let function call return aggregate. Now, we have very first multiple return value testcase! llvm-svn: 47424	2008-02-21 01:54:02 +00:00
Devang Patel	5c037ae5d5	XFAIL for now. llvm-svn: 47411	2008-02-20 23:10:42 +00:00
Devang Patel	6436033c71	Now functions can return aggregate values. llvm-svn: 47409	2008-02-20 22:51:28 +00:00
Nick Lewycky	0b6cdd1eff	Add 'umax' similar to 'smax' SCEV. Closes PR2003. Parse reversed smax and umax as smin and umin and express them with negative or binary-not SCEVs (which are really just subtract under the hood). Parse 'xor %x, -1' as (-1 - %x). Remove dead code (ConstantInt::get always returns a ConstantInt). Don't use getIntegerSCEV(-1, Ty). The first value is an int, then it gets passed into a uint64_t. Instead, create the -1 directly from ConstantInt::getAllOnesValue(). llvm-svn: 47360	2008-02-20 06:48:22 +00:00
Evan Cheng	33ee06fa48	XFAIL this for now. llvm-svn: 47355	2008-02-20 02:38:58 +00:00
Chris Lattner	aaafe47a55	this test requires sse2 llvm-svn: 47331	2008-02-19 18:07:46 +00:00
Chris Lattner	3a4ac3a69e	Don't fold and's into test instructions if they have multiple uses. This compiles test-nofold.ll into: _test: movl $15, %ecx andl 4(%esp), %ecx testl %ecx, %ecx movl $42, %eax cmove %ecx, %eax ret instead of: _test: movl 4(%esp), %eax movl %eax, %ecx andl $15, %ecx testl $15, %eax movl $42, %eax cmove %ecx, %eax ret llvm-svn: 47330	2008-02-19 17:37:35 +00:00
Chris Lattner	67f2a6c009	rename tests to avoid a test- prefix when they aren't related to the test instruction. llvm-svn: 47329	2008-02-19 17:33:52 +00:00
Tanya Lattner	230c714773	Remove llvm-upgrade and update tests. llvm-svn: 47325	2008-02-19 08:07:33 +00:00
Chris Lattner	8880d6ba5b	dead pass llvm-svn: 47324	2008-02-19 07:58:11 +00:00
Chris Lattner	5400cc3907	Fix PR2060 by rejecting invalid types for integer constants. llvm-svn: 47311	2008-02-19 04:36:07 +00:00
Nick Lewycky	69457748ab	Don't spew stats to stderr. llvm-svn: 47308	2008-02-19 03:11:47 +00:00
Nick Lewycky	0560401b2e	Fix up the run line for this new test. llc: for the -info-output-file option: requires a value! llvm-svn: 47306	2008-02-19 02:58:36 +00:00
Evan Cheng	de4579d0b3	New test. llvm-svn: 47302	2008-02-19 02:09:58 +00:00
Tanya Lattner	169eaef57b	Remove llvm-upgrade and update tests. llvm-svn: 47297	2008-02-19 01:44:26 +00:00
Tanya Lattner	aeb9bb8ba4	Remove llvm-upgrade and update tests. llvm-svn: 47296	2008-02-19 01:41:04 +00:00
Evan Cheng	bb577266bf	- When DAG combiner is folding a bit convert into a BUILD_VECTOR, it should check if it's essentially a SCALAR_TO_VECTOR. Avoid turning (v8i16) <10, u, u, u> to <10, 0, u, u, u, u, u, u>. Instead, simply convert it to a SCALAR_TO_VECTOR of the proper type. - X86 now normalize SCALAR_TO_VECTOR to (BIT_CONVERT (v4i32 SCALAR_TO_VECTOR)). Get rid of X86ISD::S2VEC. llvm-svn: 47290	2008-02-18 23:04:32 +00:00
Nick Lewycky	1f3c58df08	Correctly fold divide-by-constant, even when faced with overflow. llvm-svn: 47287	2008-02-18 22:48:05 +00:00
Dan Gohman	70b9b2f77f	Don't mark scalar integer multiplication as Expand on x86, since x86 has plain one-result scalar integer multiplication instructions. This avoids expanding such instructions into MUL_LOHI sequences that must be special-cased at isel time, and avoids the problem with that code that provented memory operands from being folded. This fixes PR1874, addressesing the most common case. The uncommon cases of optimizing multiply-high operations will require work in DAGCombiner. llvm-svn: 47277	2008-02-18 17:55:26 +00:00
Chris Lattner	a6dc8aaa3f	make this just a bit more strict. llvm-svn: 47274	2008-02-18 17:33:10 +00:00
Owen Anderson	7b092ea631	Add support to GVN for performing sret return slot optimization. This means that, if an sret function tail calls another sret function, it should pass its own sret parameter to the tail callee, allowing it to fill in the correct return value. llvm-gcc does not emit this by default. Instead, it allocates space in the caller for the sret of the tail call and then uses memcpy to copy the result into the caller's sret parameter. This optimization detects and optimizes that case. llvm-svn: 47265	2008-02-18 09:24:53 +00:00
Chris Lattner	9851db050b	optimize away stackrestore calls that have no intervening alloca or call. llvm-svn: 47258	2008-02-18 06:12:38 +00:00
Chris Lattner	dd489bf3eb	upgrade this test. llvm-svn: 47257	2008-02-18 06:11:00 +00:00
Chris Lattner	2fa904b3af	Fold (-x + -y) -> -(x+y) which promotes better association, fixing the second half of PR2047 llvm-svn: 47244	2008-02-17 21:03:36 +00:00
Chris Lattner	4a34461d64	Split up subtracts into add+negate if they have a reassociable use or operand that is also a subtract. This implements PR2047 and Transforms/Reassociate/subtest2.ll llvm-svn: 47241	2008-02-17 20:51:26 +00:00
Chris Lattner	7cd16f34b3	upgrade and simplify this test. llvm-svn: 47240	2008-02-17 20:48:43 +00:00
Tanya Lattner	03755061ae	Remove llvm-upgrade. llvm-svn: 47238	2008-02-17 20:02:20 +00:00
Chris Lattner	1ce2265908	fix this test. llvm-svn: 47232	2008-02-17 00:15:25 +00:00
Tanya Lattner	bcf113b2ca	Remove llvm-upgrade llvm-svn: 47231	2008-02-17 00:15:09 +00:00
Tanya Lattner	71db645ff2	Remove llvm-upgrade. llvm-svn: 47230	2008-02-17 00:13:09 +00:00
Chris Lattner	37c93d2a82	this test isn't useful since we added @ notation for globals. llvm-svn: 47229	2008-02-17 00:12:03 +00:00
Chris Lattner	52d5239f6d	this line was commented out. llvm-svn: 47228	2008-02-17 00:09:08 +00:00
Tanya Lattner	c08b5f0208	Remove llvm-upgrade and update tests. llvm-svn: 47227	2008-02-16 23:55:46 +00:00
Tanya Lattner	1ffe4f2e88	Removing llvm upgrade, so remove tests specific to llvm-upgrade and update the tests that used it. llvm-svn: 47225	2008-02-16 23:27:24 +00:00
Duncan Sands	cd0325cce1	Remove any 'nest' parameter attributes if the function is not passed as an argument to a trampoline intrinsic. llvm-svn: 47220	2008-02-16 20:56:04 +00:00
Andrew Lenharth	c178981b85	llvm.memory.barrier, and impl for x86 and alpha llvm-svn: 47204	2008-02-16 01:24:58 +00:00
Chris Lattner	f8ab0af3c0	upgrade this test, which wasn't testing the right thing since llvm-upgrade came around. llvm-svn: 47194	2008-02-15 23:58:25 +00:00
Chris Lattner	afae31f812	rename llx -> ll llvm-svn: 47192	2008-02-15 23:51:48 +00:00
Evan Cheng	94742fb5d4	This test is not interesting. llvm-svn: 47189	2008-02-15 23:06:21 +00:00
Dale Johannesen	15e608542e	Modify test to expect improved code. llvm-svn: 47182	2008-02-15 22:05:15 +00:00
Duncan Sands	e0330d632a	Fix this test on linux, which returns S242 using sret. llvm-svn: 47173	2008-02-15 19:42:13 +00:00
Devang Patel	01f9252f73	Fix PR2028 llvm-svn: 47150	2008-02-15 01:24:49 +00:00
Devang Patel	8af92b942c	If loop header is also loop exiting block then OrigPN is incoming value for B loop header. Fixes PR 2030. llvm-svn: 47141	2008-02-14 23:18:47 +00:00
Chris Lattner	b8bf200b77	Fix PR2029 llvm-svn: 47129	2008-02-14 19:18:13 +00:00
Chris Lattner	9c24f3ec37	Fix a miscompilation from Dan's recent apintification. llvm-svn: 47128	2008-02-14 18:48:56 +00:00
Tanya Lattner	df7da98f55	Remove llvm-upgrade llvm-svn: 47119	2008-02-14 07:57:12 +00:00
Nick Lewycky	12098ea401	Testcase for PR2032. llvm-svn: 47113	2008-02-14 07:15:11 +00:00
Tanya Lattner	f37f44f9ec	Remove llvm-upgrade. llvm-svn: 47110	2008-02-14 06:56:27 +00:00
Chris Lattner	d696c25db5	This readme entry is done, testcase here: CodeGen/X86/zero-remat.ll llvm-svn: 47106	2008-02-14 05:39:46 +00:00
Evan Cheng	cbbcb3144d	Fix test. llvm-svn: 47102	2008-02-14 01:32:53 +00:00
Devang Patel	38de2d11a5	A loop latch phi node may have uses inside loop, not just in loop header. llvm-svn: 47093	2008-02-13 22:23:07 +00:00
Devang Patel	1e71afe2df	While moving exit condition, do not drop loop latch on the floor. llvm-svn: 47089	2008-02-13 22:06:36 +00:00
Devang Patel	d48bbbf07b	Keep track of exit value operand number when operands are swapped. llvm-svn: 47082	2008-02-13 19:48:48 +00:00
Dale Johannesen	cc4f128930	New test, see comments. llvm-svn: 47078	2008-02-13 18:36:48 +00:00
Duncan Sands	2e9661573f	Teach LegalizeTypes how to expand and promote CTLZ, CTTZ and CTPOP. The expansion code differs from that in LegalizeDAG in that it chooses to take the CTLZ/CTTZ count from the Hi/Lo part depending on whether the Hi/Lo value is zero, not on whether CTLZ/CTTZ of Hi/Lo returned 32 (or whatever the width of the type is) for it. I made this change because the optimizers may well know that Hi/Lo is zero and exploit it. The promotion code for CTTZ also differs from that in LegalizeDAG: it uses an "or" to get the right result when the original value is zero, rather than using a compare and select. This also means the value doesn't need to be zero extended. llvm-svn: 47075	2008-02-13 18:01:53 +00:00
Eli Friedman	cf6273d015	Add a note pointing to PR1996. llvm-svn: 47055	2008-02-13 07:56:04 +00:00
Chris Lattner	a30946c576	In SDISel, for targets that support FORMAL_ARGUMENTS nodes, lower this node as soon as we create it in SDISel. Previously we would lower it in legalize. The problem with this is that it only exposes the argument loads implied by FORMAL_ARGUMENTs after legalize, so that only dag combine 2 can hack on them. This causes us to miss some optimizations because datatype expansion also happens here. Exposing the loads early allows us to do optimizations on them. For example we now compile arg-cast.ll to: _foo: movl $2147483647, %eax andl 8(%esp), %eax ret where we previously produced: _foo: subl $12, %esp movsd 16(%esp), %xmm0 movsd %xmm0, (%esp) movl $2147483647, %eax andl 4(%esp), %eax addl $12, %esp ret It might also make sense to do this for ISD::CALL nodes, which have implicit stores on many targets. llvm-svn: 47054	2008-02-13 07:39:09 +00:00
Eli Friedman	75220639b6	Add test for PR1996. (This is my first time adding a test for a transform, so please review.) llvm-svn: 47050	2008-02-13 06:55:57 +00:00
Nate Begeman	cfd9883301	Add testcase for recent legalizer change llvm-svn: 47049	2008-02-13 06:48:40 +00:00
Evan Cheng	68a88c1f52	New tests. llvm-svn: 47047	2008-02-13 03:23:53 +00:00
Owen Anderson	274aa2846e	Re-apply the patch to improve the optimizations of memcpy's, with several bugs fixed. This now passes PPC bootstrap. llvm-svn: 47026	2008-02-12 21:15:18 +00:00
Evan Cheng	0d2efb485d	Don't mask the isel bug. llvm-svn: 47018	2008-02-12 19:11:29 +00:00
Evan Cheng	6c7520f922	This test assumes no SSE4.1. llvm-svn: 47017	2008-02-12 19:11:08 +00:00
Wojciech Matyjewicz	6df5450bc4	Now that ScalarEvolution::print writes to the correct stream, there is no need to redirect stderr into stdout. llvm-svn: 47009	2008-02-12 15:12:40 +00:00
Wojciech Matyjewicz	ae01857e92	Change negative grep into positive one in my yesterday's testcase. llvm-svn: 47008	2008-02-12 15:10:35 +00:00
Wojciech Matyjewicz	b6e6dacc60	Fix PR2002. Suppose n is the initial value for the induction variable (with step 1) and m is its final value. Then, the correct trip count is SMAX(m,n)-n. Previously, we used SMAX(0,m-n), but m-n may overflow and can't in general be interpreted as signed. Patch by Nick Lewycky. llvm-svn: 47007	2008-02-12 15:09:36 +00:00
Tanya Lattner	cefcef7955	Test case for annotate builtin. llvm-svn: 46999	2008-02-12 07:46:33 +00:00
Evan Cheng	1ab096a313	Fix some test cases. llvm-svn: 46998	2008-02-12 07:22:46 +00:00
Wojciech Matyjewicz	2874a19254	If the LHS of the comparison is a loop-invariant we also want to move it to the RHS. This simple change allows to compute loop iteration count for loops with condition similar to the one in the testcase (which seems to be quite common). llvm-svn: 46959	2008-02-11 18:37:34 +00:00
Wojciech Matyjewicz	76c2b22ee4	Fix PR1798 - an error in the evaluation of SCEVAddRecExpr at an arbitrary iteration. The patch: 1) changes SCEVSDivExpr into SCEVUDivExpr, 2) replaces PartialFact() function with BinomialCoefficient(); the computations (essentially, the division) in BinomialCoefficient() are performed with the apprioprate bitwidth necessary to avoid overflow; unsigned division is used instead of the signed one. Computations in BinomialCoefficient() require support from the code generator for APInts. Currently, we use a hack rounding up the neccessary bitwidth to the nearest power of 2. The hack is easy to turn off in future. One remaining issue: we assume the divisor of the binomial coefficient formula can be computed accurately using 16 bits. It means we can handle AddRecs of length up to 9. In future, we should use APInts to evaluate the divisor. Thanks to Nicholas for cooperation! llvm-svn: 46955	2008-02-11 11:03:14 +00:00
Evan Cheng	19f684ed72	Determine whether a spill kills the register it's spilling before insertion rather than trying to undo the kill marker afterwards. llvm-svn: 46953	2008-02-11 08:30:52 +00:00
Dan Gohman	cabaec582f	Rename MRegisterInfo to TargetRegisterInfo. llvm-svn: 46930	2008-02-10 18:45:23 +00:00
Dale Johannesen	304406f01c	Alignment of struct containing vectors depends on whether SSE is present, on Darwin anyway. Make it explicit. llvm-svn: 46909	2008-02-09 19:04:25 +00:00
Devang Patel	9aba178666	Fix PR 1995. llvm-svn: 46898	2008-02-08 22:49:13 +00:00
Evan Cheng	90f03a0b88	It's not always safe to fold movsd into xorpd, etc. Check the alignment of the load address first to make sure it's 16 byte aligned. llvm-svn: 46893	2008-02-08 21:20:40 +00:00
Evan Cheng	b2bc19ee5b	Added missing entries in X86 load / store folding tables. llvm-svn: 46866	2008-02-08 00:12:56 +00:00
Evan Cheng	a377b2bbd1	Fix a x86-64 codegen deficiency. Allow gv + offset when using rip addressing mode. Before: _main: subq $8, %rsp leaq _X(%rip), %rax movsd 8(%rax), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Now: _main: subq $8, %rsp movsd _X+8(%rip), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Notice there is another idiotic codegen issue that needs to be fixed asap: xorl %ecx, %ecx movl %ecx, %eax llvm-svn: 46850	2008-02-07 08:53:49 +00:00
Evan Cheng	6b03a1aeb9	It's PR1925, not PR1609. llvm-svn: 46825	2008-02-06 22:07:17 +00:00
Bill Wendling	8a28ab4b1f	Temporarily reverting: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20080128/057882.html This is causing a miscompilation on PPC G5 and just now seeing it on iMac x86-64. llvm-svn: 46822	2008-02-06 20:03:07 +00:00
Evan Cheng	2091d9a2e8	Fix a number of local register allocator issues: PR1609. llvm-svn: 46821	2008-02-06 19:16:53 +00:00
Evan Cheng	851d353eb8	Fix PR1975: dag isel emitter produces patterns that isel wrong flag result. llvm-svn: 46776	2008-02-05 22:50:29 +00:00
Evan Cheng	69d5e0fc0f	If a vr is already marked alive in a bb, then it has PHI uses that are visited earlier, then it is not killed in the def block (i.e. not dead). llvm-svn: 46763	2008-02-05 20:04:18 +00:00
Chris Lattner	96deed5d4d	Fix a bug compiling PR1978 (perhaps not the only one though) which was incorrectly simplifying "x == (gep x, 1, i)" into false, even though i could be negative. As it turns out, all the code to handle this already existed, we just need to disable the incorrect optimization case and let the general case handle it. llvm-svn: 46739	2008-02-05 04:45:32 +00:00
Evan Cheng	f1ad13301a	This should also work on x86 now. llvm-svn: 46730	2008-02-05 00:25:31 +00:00
Duncan Sands	b65b2462c8	Crashes LegalizeTypes with "Do not know how to expand the result of this operator!" (node: ctlz). llvm-svn: 46713	2008-02-04 18:07:02 +00:00
Duncan Sands	123f86e781	Crashes LegalizeTypes with "Do not know how to split this operator's operand" (node: extract_subvector). llvm-svn: 46712	2008-02-04 18:05:42 +00:00
Chris Lattner	cad0478491	remove target triple to make this test more "generic" llvm-svn: 46711	2008-02-04 18:02:37 +00:00
Duncan Sands	36a938c4fb	Crashed the new type legalizer. Not likely to catch any bugs in the future since to get the crash you also need hacked in fake libcall support (which creates odd but legal trees), but since adding it doesn't hurt... Thanks to Chris for this ultimately reduced version. llvm-svn: 46706	2008-02-04 09:40:27 +00:00
Owen Anderson	e2150dfe24	Make this test more aggressive, to cover recent improvements. llvm-svn: 46695	2008-02-04 04:55:24 +00:00
Owen Anderson	aaba6f96da	Allow GVN to hack on memcpy's, making them open to further optimization. llvm-svn: 46693	2008-02-04 02:59:58 +00:00
Nick Lewycky	81cc718a25	Tag this test with the PR reference. llvm-svn: 46688	2008-02-03 16:35:19 +00:00
Nick Lewycky	febd3642ce	There are some cases where icmp(add) can be folded into a new icmp. Handle them. llvm-svn: 46687	2008-02-03 16:33:09 +00:00
Gordon Henriksen	b75d9e974d	Fixing a bug creating floating point constants of type other than double through the C bindings. Thanks to Tomas Lindquist Olsen for reporting it. llvm-svn: 46656	2008-02-02 01:07:50 +00:00
Lauro Ramos Venancio	563e0a3ea3	CBackend: Implement unaligned load/store. llvm-svn: 46646	2008-02-01 21:25:59 +00:00
Duncan Sands	454a8eaee9	Don't drop function/call return attributes like 'nounwind'. llvm-svn: 46645	2008-02-01 20:37:16 +00:00
Dale Johannesen	916037c01a	Accept getelementptr starting at GV with all 0 indices as a legitimate way of representing global variable GV in debug info. llvm-svn: 46565	2008-01-30 19:00:21 +00:00
Tanya Lattner	bfeb583bfb	Pointers change size depending upon the target. Remove them to make the test more stable. llvm-svn: 46548	2008-01-30 05:15:15 +00:00
Owen Anderson	ad9a347656	Make DSE much more aggressive by performing DCE earlier. Update a testcase to reflect this increased aggressiveness. llvm-svn: 46542	2008-01-30 01:24:47 +00:00
Chris Lattner	83227e350d	Fix a bug where scalarrepl would discard offset if type would match. In practice this can only happen on code with already undefined behavior, but this is still a good thing to handle correctly. llvm-svn: 46539	2008-01-30 00:39:15 +00:00
Evan Cheng	6a35d0f26e	Update this test case. llvm-svn: 46526	2008-01-29 19:30:05 +00:00
Chris Lattner	e08ec140e0	Don't let globalopt hack on volatile loads or stores. llvm-svn: 46523	2008-01-29 19:01:37 +00:00
Chris Lattner	83f411c586	eliminate additions of 0.0 when they are obviously dead. This has to be careful to avoid turning -0.0 + 0.0 -> -0.0 which is incorrect. llvm-svn: 46499	2008-01-29 06:52:45 +00:00
Owen Anderson	2688087c9c	Add a testcase for eliminating memcpy's at the end of functions. Forgot to commit this with my last commit. llvm-svn: 46497	2008-01-29 06:40:32 +00:00
Chris Lattner	35f063e37c	Add target triples to these so they don't fail on linux. llvm-svn: 46496	2008-01-29 06:26:07 +00:00
Duncan Sands	47bcca5cea	This would be better done as an executable test. llvm-svn: 46493	2008-01-29 06:04:54 +00:00
Duncan Sands	84bc852b52	After recent changes we fail to optimize this test sufficiently to have it pass. I'm removing it from the testsuite and adding it to PR452 instead. llvm-svn: 46492	2008-01-29 05:57:23 +00:00
Devang Patel	86ff705c22	Filter loops that subtract induction variables. These loops are not yet handled. Fix PR 1912. llvm-svn: 46484	2008-01-29 02:20:41 +00:00
Scott Michel	dc780aeb57	Overhaul Cell SPU's addressing mode internals so that there are now only two addressing mode nodes, SPUaform and SPUindirect (vice the three previous ones, SPUaform, SPUdform and SPUxform). This improves code somewhat because we now avoid using reg+reg addressing when it can be avoided. It also simplifies the address selection logic, which was the main point for doing this. Also, for various global variables that would be loaded using SPU's A-form addressing, prefer D-form offs[reg] addressing, keeping the base in a register if the variable is used more than once. llvm-svn: 46483	2008-01-29 02:16:57 +00:00
Devang Patel	51fde22367	New test. llvm-svn: 46479	2008-01-29 01:10:04 +00:00
Bill Wendling	839e21bce4	Add test to make sure that #pragma mark/error doesn't error if there are unbalanced quotes. llvm-svn: 46476	2008-01-29 00:41:29 +00:00
Duncan Sands	93f785a638	Pure/const functions with ByVal parameters cannot be marked readonly either. llvm-svn: 46456	2008-01-28 19:25:47 +00:00
Chris Lattner	20854cf4e7	this test is now compiled into the right thing. llvm-svn: 46454	2008-01-28 17:38:46 +00:00
Duncan Sands	ecab334ce0	Make this more likely to be passed byval. llvm-svn: 46451	2008-01-28 10:35:11 +00:00
Nick Lewycky	6b070b1b93	Handle some more combinations of extend and icmp. Fixes PR1940. llvm-svn: 46431	2008-01-28 03:48:02 +00:00
Chris Lattner	359756ea4b	Fix PR1932 by disabling an xform invalid for fdiv. llvm-svn: 46429	2008-01-28 00:58:18 +00:00
Chris Lattner	7250586ec9	Fix PR1938 by forcing the code that uses an undefined value to branch one way or the other. Rewriting the code itself prevents subsequent analysis passes from making contradictory conclusions about the code that could cause an infeasible path to be made feasible. llvm-svn: 46427	2008-01-28 00:32:30 +00:00
Chris Lattner	26a8116f49	Update this test. Due to dag combiner improvements, we now compile f7/f11 to: _f7: eor r0, r0, #2, 2 @ -2147483648 bx lr _f11: bic r0, r0, #2, 2 @ -2147483648 bx lr instead of: _f7: fmsr s0, r0 fnegs s0, s0 fmrs r0, s0 bx lr _f11: fmsr s0, r0 fabss s0, s0 fmrs r0, s0 bx lr llvm-svn: 46423	2008-01-27 23:26:37 +00:00
Nick Lewycky	cd28ef8950	Be more careful modifying the use_list while also iterating through it. llvm-svn: 46417	2008-01-27 18:35:00 +00:00
Duncan Sands	e77256b325	Revert r46393: readonly/readnone functions are no longer allowed to write through byval arguments. llvm-svn: 46416	2008-01-27 18:12:58 +00:00
Chris Lattner	2ab1fd3824	Implement some dag combines that allow doing fneg/fabs/fcopysign in integer registers if used by a bitconvert or using a bitconvert. This allows us to avoid constant pool loads and use cheaper integer instructions when the values come from or end up in integer regs anyway. For example, we now compile CodeGen/X86/fp-in-intregs.ll to: _test1: movl $2147483648, %eax xorl 4(%esp), %eax ret _test2: movl $1065353216, %eax orl 4(%esp), %eax andl $3212836864, %eax ret Instead of: _test1: movss 4(%esp), %xmm0 xorps LCPI2_0, %xmm0 movd %xmm0, %eax ret _test2: movss 4(%esp), %xmm0 andps LCPI3_0, %xmm0 movss LCPI3_1, %xmm1 andps LCPI3_2, %xmm1 orps %xmm0, %xmm1 movd %xmm1, %eax ret bitconverts can happen due to various calling conventions that require fp values to passed in integer regs in some cases, e.g. when returning a complex. llvm-svn: 46414	2008-01-27 17:42:27 +00:00
Bill Wendling	0e2b8c2c45	The CorrelatedExpressions pass is now no more. llvm-svn: 46409	2008-01-27 06:13:32 +00:00
Chris Lattner	aa553aa0c1	Fold fptrunc(add (fpextend x), (fpextend y)) -> add(x,y), as GCC does. llvm-svn: 46406	2008-01-27 05:29:54 +00:00
Chris Lattner	e66aea6532	New test to verify that "merging 4 loads into a vec load" continues to work and continues to infer alignment info. llvm-svn: 46403	2008-01-26 20:06:45 +00:00
Chris Lattner	682346a7b0	Infer alignment of loads and increase their alignment when we can tell they are from the stack. This allows us to compile stack-align.ll to: _test: movsd LCPI1_0, %xmm0 movapd %xmm0, %xmm1 * andpd 4(%esp), %xmm1 andpd _G, %xmm0 addsd %xmm1, %xmm0 movl 20(%esp), %eax movsd %xmm0, (%eax) ret instead of: _test: movsd LCPI1_0, %xmm0 movsd 4(%esp), %xmm1 ** andpd %xmm0, %xmm1 andpd _G, %xmm0 addsd %xmm1, %xmm0 movl 20(%esp), %eax movsd %xmm0, (%eax) ret llvm-svn: 46401	2008-01-26 19:45:50 +00:00
Chris Lattner	f0c3240135	remove a useless xfailed test. llvm-svn: 46400	2008-01-26 19:35:46 +00:00
Duncan Sands	9fae964ef7	Invert this test, because it is wrong if we allow readonly functions to use byval parameters as local storage (how much do we want this?). llvm-svn: 46399	2008-01-26 12:33:01 +00:00
Bill Wendling	7b83688c73	If there's no instructions being emitted on X86 for a function, emit a nop. Emit the nop directly for PPC. llvm-svn: 46398	2008-01-26 09:03:52 +00:00
Bill Wendling	26fb9335f5	Need to convert to LLVM code and not C. llvm-svn: 46397	2008-01-26 06:56:08 +00:00
Bill Wendling	3e622b88b6	Rename the .c to .ll llvm-svn: 46396	2008-01-26 06:53:40 +00:00
Bill Wendling	7151e8d92c	Move testcase to the code gen directory. llvm-svn: 46395	2008-01-26 06:53:06 +00:00
Duncan Sands	792234c366	Create an explicit copy for byval parameters even when inlining a readonly function. llvm-svn: 46393	2008-01-26 06:41:49 +00:00
Bill Wendling	1e56a2ffb6	If we have a function like this: void bork() { int address = 0; address = 0; } It's compiled into LLVM code that looks like this: define void @bork() noreturn nounwind { entry: unreachable } This is bad on some platforms (like PPC) because it will generate the label for the function but no body. The label could end up being associated with some non-code related stuff, like a section. This places a "trap" instruction if the SimplifyCFG pass removed all code from the function leaving only one "unreachable" instruction. llvm-svn: 46387	2008-01-26 01:43:44 +00:00
Devang Patel	c40820e322	Add another testcase. llvm-svn: 46385	2008-01-26 01:21:48 +00:00
Chris Lattner	53a98f46fd	Fix some bugs in SimplifyNodeWithTwoResults where it would call deletenode to delete a node even if it was not dead in some cases. Instead, just add it to the worklist. Also, make sure to use the CombineTo methods, as it was doing things that were unsafe: the top level combine loop could touch dangling memory. This fixes CodeGen/Generic/2008-01-25-dag-combine-mul.ll llvm-svn: 46384	2008-01-26 01:09:19 +00:00
Evan Cheng	e62e9a8d96	New test case. llvm-svn: 46382	2008-01-26 00:35:43 +00:00
Chris Lattner	c2df169459	add a testcase for a bug Duncan pointed out. llvm-svn: 46372	2008-01-25 22:36:24 +00:00
Duncan Sands	ced29554f7	Test for PR1942. llvm-svn: 46357	2008-01-25 17:36:44 +00:00
Owen Anderson	a4ff15c69f	DeadStoreElimination can treat byval parameters as if there were alloca's for the purpose of removing end-of-function stores. llvm-svn: 46351	2008-01-25 10:10:33 +00:00
Chris Lattner	79076fdf2a	Add target-specific dag combines for FAND(x,0) and FOR(x,0). This allows us to compile: double test(double X) { return copysign(0.0, X); } into: _test: andpd LCPI1_0(%rip), %xmm0 ret instead of: _test: pxor %xmm1, %xmm1 andpd LCPI1_0(%rip), %xmm1 movapd %xmm0, %xmm2 andpd LCPI1_1(%rip), %xmm2 movapd %xmm1, %xmm0 orpd %xmm2, %xmm0 ret llvm-svn: 46344	2008-01-25 05:46:26 +00:00
Devang Patel	587591ba1b	New test. llvm-svn: 46333	2008-01-24 23:55:34 +00:00
Chris Lattner	cd5013eb2f	Teach basicaa that 'byval' arguments define a new memory location that can't be aliased to other known objects. This allows us to know that byval pointer args don't alias globals, etc. llvm-svn: 46315	2008-01-24 18:00:32 +00:00
Chris Lattner	16a8f126d3	Significantly simplify and improve handling of FP function results on x86-32. This case returns the value in ST(0) and then has to convert it to an SSE register. This causes significant codegen ugliness in some cases. For example in the trivial fp-stack-direct-ret.ll testcase we used to generate: _bar: subl $28, %esp call L_foo$stub fstpl 16(%esp) movsd 16(%esp), %xmm0 movsd %xmm0, 8(%esp) fldl 8(%esp) addl $28, %esp ret because we move the result of foo() into an XMM register, then have to move it back for the return of bar. Instead of hacking ever-more special cases into the call result lowering code we take a much simpler approach: on x86-32, fp return is modeled as always returning into an f80 register which is then truncated to f32 or f64 as needed. Similarly for a result, we model it as an extension to f80 + return. This exposes the truncate and extensions to the dag combiner, allowing target independent code to hack on them, eliminating them in this case. This gives us this code for the example above: _bar: subl $12, %esp call L_foo$stub addl $12, %esp ret The nasty aspect of this is that these conversions are not legal, but we want the second pass of dag combiner (post-legalize) to be able to hack on them. To handle this, we lie to legalize and say they are legal, then custom expand them on entry to the isel pass (PreprocessForFPConvert). This is gross, but less gross than the code it is replacing :) This also allows us to generate better code in several other cases. For example on fp-stack-ret-conv.ll, we now generate: _test: subl $12, %esp call L_foo$stub fstps 8(%esp) movl 16(%esp), %eax cvtss2sd 8(%esp), %xmm0 movsd %xmm0, (%eax) addl $12, %esp ret where before we produced (incidentally, the old bad code is identical to what gcc produces): _test: subl $12, %esp call L_foo$stub fstpl (%esp) cvtsd2ss (%esp), %xmm0 cvtss2sd %xmm0, %xmm0 movl 16(%esp), %eax movsd %xmm0, (%eax) addl $12, %esp ret Note that we generate slightly worse code on pr1505b.ll due to a scheduling deficiency that is unrelated to this patch. llvm-svn: 46307	2008-01-24 08:07:48 +00:00
Chris Lattner	214c11ee6f	take these with a pr # llvm-svn: 46303	2008-01-24 06:35:44 +00:00
Evan Cheng	91089e6d66	Let each target decide byval alignment. For X86, it's 4-byte unless the aggregare contains SSE vector(s). For x86-64, it's max of 8 or alignment of the type. llvm-svn: 46286	2008-01-23 23:17:41 +00:00
Evan Cheng	d436c2e724	SSE varargs arguments are passed in memory. llvm-svn: 46262	2008-01-22 23:26:53 +00:00
Chris Lattner	973072dc77	update this test to pass with duncan's change. llvm-svn: 46246	2008-01-22 05:31:58 +00:00
Nick Lewycky	78780f175b	Multiply can be evaluated in a different type, so long as the target type has a smaller bitwidth. llvm-svn: 46244	2008-01-22 05:08:48 +00:00
Devang Patel	6fae526290	New test. llvm-svn: 46220	2008-01-21 22:15:58 +00:00
Devang Patel	6d3139addd	New test. llvm-svn: 46209	2008-01-21 19:28:13 +00:00
Dale Johannesen	7807e86260	Implement flt_rounds for PowerPC. llvm-svn: 46174	2008-01-18 19:55:37 +00:00
Chris Lattner	49fd213770	remove extraneous &&'s from tests, as Scott is apparently not going to. llvm-svn: 46173	2008-01-18 19:53:43 +00:00
Dale Johannesen	b2d9e41233	Test is correct again for the moment. llvm-svn: 46172	2008-01-18 19:53:31 +00:00
Chris Lattner	febc7ea9bf	Fix a latent bug exposed by my truncstore patch. We compiled stfiwx-2.ll to: _test: fctiwz f0, f1 stfiwx f0, 0, r4 blr instead of: _test: fctiwz f0, f1 stfd f0, -8(r1) nop nop lwz r2, -4(r1) stb r2, 0(r4) blr The former is not correct (stores 4 bytes, not 1). llvm-svn: 46161	2008-01-18 16:54:56 +00:00
Scott Michel	506e61bad1	Forward progress: crtbegin.c now compiles successfully! Fixed CellSPU's A-form (local store) address mode, so that all globals, externals, constant pool and jump table symbols are now wrapped within a SPUISD::AFormAddr pseudo-instruction. This now identifies all local store memory addresses, although it requires a bit of legerdemain during instruction selection to properly select loads to and stores from local store, properly generating "LQA" instructions. Also added mul_ops.ll test harness for exercising integer multiplication. llvm-svn: 46142	2008-01-17 20:38:41 +00:00
Chris Lattner	41717f6989	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	adb8aeaf6a	new testcase. llvm-svn: 46139	2008-01-17 19:47:23 +00:00
Evan Cheng	ca434b34ff	Test case for varargs parameter attribute issue I just fixed. llvm-svn: 46127	2008-01-17 07:26:31 +00:00
Chris Lattner	ee20bcd396	add testcase that has been sitting in my tree for awhile. llvm-svn: 46124	2008-01-17 06:54:09 +00:00
Evan Cheng	8633da0707	When a live virtual register is being clobbered by an implicit def, it is spilled and the spill is its kill. However, if the local allocator has determined the register has not been modified (possible when its value was reloaded), it would not issue a restore. In that case, mark the last use of the virtual register as kill. llvm-svn: 46111	2008-01-17 02:08:17 +00:00
Chris Lattner	c32be7e6b2	Fix arg promotion to propagate the correct attrs on the calls to promoted functions. This is important for varargs calls in particular. Thanks to duncan for providing a great testcase. llvm-svn: 46108	2008-01-17 01:17:03 +00:00
Evan Cheng	5be34d811c	Fixes a nasty dag combiner bug that causes a bunch of tests to fail at -O0. It's not safe to use the two value CombineTo variant to combine away a dead load. e.g. v1, chain2 = load chain1, loc v2, chain3 = load chain2, loc v3 = add v2, c Now we replace use of v1 with undef, use of chain2 with chain1. ReplaceAllUsesWith() will iterate through uses of the first load and update operands: v1, chain2 = load chain1, loc v2, chain3 = load chain1, loc v3 = add v2, c Now the second load is the same as the first load, SelectionDAG cse will ensure the use of second load is replaced with the first load. v1, chain2 = load chain1, loc v3 = add v1, c Then v1 is replaced with undef and bad things happen. llvm-svn: 46099	2008-01-16 23:11:54 +00:00
Duncan Sands	78e448d8b4	Trampoline support for x86-64. This looks like it should work, but I have no machine to test it on. Committed because it will at least cause no harm, and maybe someone can test it for me! llvm-svn: 46098	2008-01-16 22:55:25 +00:00
Chris Lattner	3c9f208ca8	add testcase for regression llvm-svn: 46073	2008-01-16 18:03:52 +00:00
Chris Lattner	109f0e56f5	make sure to use a cpu that has sse. llvm-svn: 46060	2008-01-16 06:32:02 +00:00
Chris Lattner	41e1fd13b2	My previous commit had an incomplete message, it should have been: make the 'fp return in ST(0)' optimization smart enough to look through token factor nodes. THis allows us to compile testcases like CodeGen/X86/fp-stack-retcopy.ll into: _carg: subl $12, %esp call L_foo$stub fstpl (%esp) fldl (%esp) addl $12, %esp ret instead of: _carg: subl $28, %esp call L_foo$stub fstpl 16(%esp) movsd 16(%esp), %xmm0 movsd %xmm0, 8(%esp) fldl 8(%esp) addl $28, %esp ret Still not optimal, but much better and this is a trivial patch. Fixing the rest requires invasive surgery that is is not llvm 2.2 material. llvm-svn: 46054	2008-01-16 05:56:59 +00:00
Devang Patel	a8d63b0b30	Do not strip llvm.used values. llvm-svn: 46045	2008-01-16 03:33:05 +00:00
Chris Lattner	f0dd2bb4e6	add a test to ensure that argpromote of one argument doesn't break the byval attr on some other argument. llvm-svn: 46025	2008-01-15 22:38:12 +00:00
Chris Lattner	afd4056065	verify x86 generates ud2 for llvm.trap llvm-svn: 46023	2008-01-15 22:22:02 +00:00
Chris Lattner	4d3944c554	new testcase for llvm.trap. llvm-svn: 46020	2008-01-15 22:17:26 +00:00
Duncan Sands	1095bdca79	Testcase for gimplify_expr crash caused by an unexpected placeholder_expr. llvm-svn: 46006	2008-01-15 19:55:41 +00:00
Duncan Sands	81e35b4d47	I noticed that the trampoline straightening transformation could drop attributes on varargs call arguments. Also, it could generate invalid IR if the transformed call already had the 'nest' attribute somewhere (this can never happen for code coming from llvm-gcc, but it's a theoretical possibility). Fix both problems. llvm-svn: 45973	2008-01-14 19:52:09 +00:00
Duncan Sands	0e785005ce	This test is now the same as byval-1.ll, so remove it. llvm-svn: 45960	2008-01-14 14:57:30 +00:00
Duncan Sands	c6e42dfda1	Test that byval cannot be used with pointers to types with no size. llvm-svn: 45959	2008-01-14 14:55:05 +00:00
Duncan Sands	5bd24c7449	We now allow byval on fairly general pointer types. llvm-svn: 45956	2008-01-14 06:53:45 +00:00
Chris Lattner	efb498eddb	Fix the miscompilation of MiBench/consumer-lame that was exposed by Evan's byval work. This miscompilation is due to the program indexing an array out of range and us doing a transformation that broke this. llvm-svn: 45949	2008-01-14 02:09:12 +00:00
Chris Lattner	d22a5f6314	Turn a memcpy from a double* into a load/store of double instead of a load/store of i64. The later prevents promotion/scalarrepl of the source and dest in many cases. This fixes the 300% performance regression of the byval stuff on stepanov_v1p2. llvm-svn: 45945	2008-01-14 00:28:35 +00:00
Chris Lattner	32eae5daa5	Fix PR1907, a nasty miscompilation because instcombine didn't realize that ne & sgt was a signed comparison (it was only looking at whether the left compare was signed). llvm-svn: 45937	2008-01-13 20:59:02 +00:00
Duncan Sands	1853262730	Check that nested functions don't get pointless static chains. llvm-svn: 45936	2008-01-13 18:44:12 +00:00
Duncan Sands	7414cc131b	When turning a call to a bitcast function into a direct call, if this becomes a varargs call then deal correctly with any parameter attributes on the newly vararg call arguments. llvm-svn: 45931	2008-01-13 08:02:44 +00:00
Chris Lattner	a20247cd7b	new testcase for rdar://5685492 llvm-svn: 45918	2008-01-13 00:19:57 +00:00
Chris Lattner	d90840eddc	we don't have to make an explicit copy of a byval argument when inlining a function if we know that the function does not write to any memory. This implements test/Transforms/Inline/byval2.ll llvm-svn: 45912	2008-01-12 18:54:29 +00:00
Duncan Sands	3b617cbe30	Be more liberal in what parameter attributes are allowed on the vararg arguments of a call. llvm-svn: 45909	2008-01-12 16:42:01 +00:00
Chris Lattner	c23b7b79fa	this actually does pass with 4.0 llvm-svn: 45899	2008-01-12 01:45:22 +00:00
Duncan Sands	6f49217a5e	When DAE drops the varargs part of a function, ensure any attributes on the vararg call arguments are also dropped. llvm-svn: 45892	2008-01-11 23:13:45 +00:00
Chris Lattner	3cc116abce	llvm-g++ 4.0 has completely different code for this warning, just xfail it. llvm-svn: 45890	2008-01-11 23:06:56 +00:00
Chris Lattner	e330f0fe21	new testcase llvm-svn: 45888	2008-01-11 23:04:14 +00:00
Duncan Sands	e55a5806ac	Do not allow attributes beyond a function's last parameter, even if it is a varargs function. Do allow attributes on the varargs part of a call, but not beyond the last argument. Only allow selected attributes to be on the varargs part of a call (currently only 'byval' is allowed). The reasoning here is that most attributes, eg inreg, simply make no sense here. llvm-svn: 45887	2008-01-11 22:36:48 +00:00
Chris Lattner	a1246ba8ed	Teach argpromote to ruthlessly hack small byval structs when it can get away with it, which exposes opportunities to eliminate the memory objects entirely. For example, we now compile byval.ll to: define internal void @f1(i32 %b.0, i64 %b.1) { entry: %tmp2 = add i32 %b.0, 1 ; <i32> [#uses=0] ret void } define i32 @main() nounwind { entry: call void @f1( i32 1, i64 2 ) ret i32 0 } This seems like it would trigger a lot for code that passes around small structs (e.g. SDOperand's or _Complex)... llvm-svn: 45886	2008-01-11 22:31:41 +00:00
Duncan Sands	41029353f9	Two occurrences on one line count as one... llvm-svn: 45885	2008-01-11 21:46:24 +00:00
Duncan Sands	d99973b822	If there are attributes on the varargs part of a call, don't discard them. llvm-svn: 45884	2008-01-11 21:23:39 +00:00
Scott Michel	5afa19350b	More CellSPU refinements: - struct_2.ll: Completely unaligned load/store testing - call_indirect.ll, struct_1.ll: Add test lines to exercise X-form [$reg($reg)] addressing At this point, loads and stores should be under control (he says in an optimistic tone of voice.) llvm-svn: 45882	2008-01-11 21:01:19 +00:00
Dale Johannesen	8ca78844b0	Disable for now. llvm-svn: 45881	2008-01-11 20:47:33 +00:00
Chris Lattner	bf51fecdc4	When inlining a functino with a byval argument, make an explicit copy of it in case the callee modifies the struct. llvm-svn: 45853	2008-01-11 06:09:30 +00:00
Scott Michel	1e9496e4d4	More CellSPU refinement and progress: - Cleaned up custom load/store logic, common code is now shared [see note below], cleaned up address modes - More test cases: various intrinsics, structure element access (load/store test), updated target data strings, indirect function calls. Note: This patch contains a refactoring of the LoadSDNode and StoreSDNode structures: they now share a common base class, LSBaseSDNode, that provides an interface to their common functionality. There is some hackery to access the proper operand depending on the derived class; otherwise, to do a proper job would require finding and rearranging the SDOperands sent to StoreSDNode's constructor. The current refactor errs on the side of being conservatively and backwardly compatible while providing functionality that reduces redundant code for targets where loads and stores are custom-lowered. llvm-svn: 45851	2008-01-11 02:53:15 +00:00
Duncan Sands	2c89976416	Output sinl for a long double FSIN node, not sin. Likewise fix up a bunch of other libcalls. While there I remove NEG_F32 and NEG_F64 since they are not used anywhere. This fixes 9 Ada ACATS failures. llvm-svn: 45833	2008-01-10 10:28:30 +00:00
Evan Cheng	0747381b13	Codegen improvement has reduced one spill. llvm-svn: 45814	2008-01-10 02:54:40 +00:00
Chris Lattner	cce1483bcf	new testcase for PR1845 llvm-svn: 45795	2008-01-10 00:30:38 +00:00
Evan Cheng	ba0214a6cb	Special copy SUnit's do not have SDNode's. llvm-svn: 45787	2008-01-09 23:01:55 +00:00
Evan Cheng	f91cfb435f	Fix sse2.psrl.w and sse2.psrl.q definitions. llvm-svn: 45772	2008-01-09 02:16:44 +00:00
Chris Lattner	c93ad7d569	Make load->store deletion a bit smarter. This allows us to compile this: void test(long long P) { P ^= 1; } into just: _test: movl 4(%esp), %eax xorl $1, (%eax) ret instead of code like this: _test: movl 4(%esp), %ecx xorl $1, (%ecx) movl 4(%ecx), %edx movl %edx, 4(%ecx) ret llvm-svn: 45762	2008-01-08 23:08:06 +00:00
Duncan Sands	b3b1ae18ab	Crashes llc when using Chris's new legalization logic. llvm-svn: 45758	2008-01-08 21:51:53 +00:00
Chris Lattner	67f581b344	Implement PR1795, an instcombine hack for forming GEPs with integer pointer arithmetic. llvm-svn: 45745	2008-01-08 07:23:51 +00:00
Chris Lattner	aeab9aefb3	remove darwin/i386 t-t llvm-svn: 45743	2008-01-08 06:52:51 +00:00
Chris Lattner	cafc567fb7	Finally implement correct ordered comparisons for PPC, even though the code generated is not wonderful. This turns a miscompilation into a code quality bug (noted in the ppc readme). This fixes PR642, which is over 2 years old (!). Nate, please review this. llvm-svn: 45742	2008-01-08 06:46:30 +00:00
Chris Lattner	7853f9d2df	Testcase for PR1721 llvm-svn: 45739	2008-01-08 05:16:29 +00:00
Nate Begeman	98dba4b0ce	Update test to catch recent x86 insert regression and improvements llvm-svn: 45705	2008-01-07 17:49:23 +00:00
Gordon Henriksen	edbfece273	Setting GlobalDirective in TargetAsmInfo by default rather than providing a misleading facility. It's used once in the MIPS backend and hardcoded as "\t.globl\t" everywhere else. llvm-svn: 45676	2008-01-07 02:31:11 +00:00
Gordon Henriksen	db4f51e1b9	With this patch, the LowerGC transformation becomes the ShadowStackCollector, which additionally has reduced overhead with no sacrifice in portability. Considering a function @fun with 8 loop-local roots, ShadowStackCollector introduces the following overhead (x86): ; shadowstack prologue movl L_llvm_gc_root_chain$non_lazy_ptr, %eax movl (%eax), %ecx movl $___gc_fun, 20(%esp) movl $0, 24(%esp) movl $0, 28(%esp) movl $0, 32(%esp) movl $0, 36(%esp) movl $0, 40(%esp) movl $0, 44(%esp) movl $0, 48(%esp) movl $0, 52(%esp) movl %ecx, 16(%esp) leal 16(%esp), %ecx movl %ecx, (%eax) ; shadowstack loop overhead (none) ; shadowstack epilogue movl 48(%esp), %edx movl %edx, (%ecx) ; shadowstack metadata .align 3 ___gc_fun: # __gc_fun .long 8 .space 4 In comparison to LowerGC: ; lowergc prologue movl L_llvm_gc_root_chain$non_lazy_ptr, %eax movl (%eax), %ecx movl %ecx, 48(%esp) movl $8, 52(%esp) movl $0, 60(%esp) movl $0, 56(%esp) movl $0, 68(%esp) movl $0, 64(%esp) movl $0, 76(%esp) movl $0, 72(%esp) movl $0, 84(%esp) movl $0, 80(%esp) movl $0, 92(%esp) movl $0, 88(%esp) movl $0, 100(%esp) movl $0, 96(%esp) movl $0, 108(%esp) movl $0, 104(%esp) movl $0, 116(%esp) movl $0, 112(%esp) ; lowergc loop overhead leal 44(%esp), %eax movl %eax, 56(%esp) leal 40(%esp), %eax movl %eax, 64(%esp) leal 36(%esp), %eax movl %eax, 72(%esp) leal 32(%esp), %eax movl %eax, 80(%esp) leal 28(%esp), %eax movl %eax, 88(%esp) leal 24(%esp), %eax movl %eax, 96(%esp) leal 20(%esp), %eax movl %eax, 104(%esp) leal 16(%esp), %eax movl %eax, 112(%esp) ; lowergc epilogue movl 48(%esp), %edx movl %edx, (%ecx) ; lowergc metadata (none) llvm-svn: 45670	2008-01-07 01:30:53 +00:00
Duncan Sands	fd975e4b3d	The transform that tries to turn calls to bitcast functions into direct calls bails out unless caller and callee have essentially equivalent parameter attributes. This is illogical - the callee's attributes should be of no relevance here. Rework the logic, which incidentally fixes a crash when removed arguments have attributes. llvm-svn: 45658	2008-01-06 18:27:01 +00:00
Duncan Sands	b8489f09a2	When transforming a call to a bitcast function into a direct call with cast parameters and cast return value (if any), instcombine was prepared to cast any non-void return value into any other, whether castable or not. Add a new predicate for testing whether casting is valid, and check it both for the return value and (as a cleanup) for the parameters. llvm-svn: 45657	2008-01-06 10:12:28 +00:00
Nick Lewycky	e05651c6d7	Accept both %y, %x and %x, %y as valid answers. llvm-svn: 45649	2008-01-06 03:12:44 +00:00
Chris Lattner	7e1c3aa702	remove a couple more unsafe xforms in the face of overflow. llvm-svn: 45613	2008-01-05 01:22:42 +00:00
Duncan Sands	8a7547d26f	Testcase with non-integer "bitfields" (in quotes, since they didn't actually need to be bitfields, though they are marked as such). llvm-svn: 45588	2008-01-04 17:31:56 +00:00
Duncan Sands	93c581f2a8	Testcase for PR1386. llvm-svn: 45583	2008-01-04 13:15:39 +00:00
Chris Lattner	d4c66656a1	Fix PR1896 llvm-svn: 45568	2008-01-04 05:04:53 +00:00
Duncan Sands	4f9b058e95	Test for handling of large bit offset from a variable field offset. llvm-svn: 45534	2008-01-03 11:53:52 +00:00
Chris Lattner	26b89fd30a	don't hoist FP additions into unconditional adds + selects. This could theoretically introduce a trap, but is also a performance issue. This speeds up ptrdist/ks by 8%. llvm-svn: 45533	2008-01-03 07:25:26 +00:00
Duncan Sands	cc3a49623f	An example for which the TYPE_SIZE was being set from the initial value, while the type fields were not (this is a qualified union type, so not all fields are always present). This resulted in the size of the corresponding LLVM type being larger than the gcc TYPE_SIZE. llvm-svn: 45522	2008-01-03 00:26:42 +00:00
Duncan Sands	fcb856ad1e	Test handling of records for which the fields are not ordered by offset. llvm-svn: 45520	2008-01-03 00:17:02 +00:00
Chris Lattner	7d567adef9	fix this to use a valid triple. llvm-svn: 45509	2008-01-02 22:21:45 +00:00
Chris Lattner	fbd8cc03c8	verify that aligned common support doesn't break. llvm-svn: 45495	2008-01-02 19:48:24 +00:00
Bill Wendling	e6898cbe7a	Update this testcase. The output needs to be disabled to pass. llvm-svn: 45478	2008-01-01 01:34:36 +00:00
Duncan Sands	8a4882564a	Fix PR1833 - eh.exception and eh.selector return two values, which means doing extra legalization work. It would be easier to get this kind of thing right if there was some documentation... llvm-svn: 45472	2007-12-31 18:35:50 +00:00
Gordon Henriksen	0f21fdb054	Another backwards compatibility fix. llvm-svn: 45463	2007-12-30 22:48:58 +00:00
Gordon Henriksen	81148ffd7e	Strengthening this test so it fails in release mode. llvm-svn: 45446	2007-12-30 05:45:49 +00:00
Gordon Henriksen	36da9ea958	Bindings for instruction calling conventions. llvm-svn: 45422	2007-12-29 20:45:00 +00:00
Chris Lattner	6aece93d5c	remove attributions from the rest of the llvm makefiles. llvm-svn: 45416	2007-12-29 20:11:13 +00:00
Chris Lattner	d55e743cfe	One readme entry is done, one is really easy (Evan, want to investigate eliminating the llvm.x86.sse2.loadl.pd intrinsic?), one shuffle optzn may be done (if shufps is better than pinsw, Evan, please review), and we already know about LICM of simple instructions. llvm-svn: 45407	2007-12-29 19:31:47 +00:00
Chris Lattner	ed55329cc9	upgrade this test llvm-svn: 45406	2007-12-29 19:24:06 +00:00
Chris Lattner	cd147e5596	Fold comparisons against a constant nan, and optimize ORD/UNORD comparisons with a constant. This allows us to compile isnan to: _foo: fcmpu cr7, f1, f1 mfcr r2 rlwinm r3, r2, 0, 31, 31 blr instead of: LCPI1_0: ; float .space 4 _foo: lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr7, f1, f0 mfcr r2 rlwinm r3, r2, 0, 31, 31 blr llvm-svn: 45405	2007-12-29 08:37:08 +00:00
Chris Lattner	b36a4a7a84	this xform is implemented. llvm-svn: 45404	2007-12-29 08:19:39 +00:00
Chris Lattner	f8e408b7b1	Codegen: as: _bar: pushl %esi subl $8, %esp movl 16(%esp), %esi call L_foo$stub fstps (%esi) addl $8, %esp popl %esi #FP_REG_KILL ret instead of: _bar: pushl %esi subl $8, %esp movl 16(%esp), %esi call L_foo$stub fstpl (%esi) cvtsd2ss (%esi), %xmm0 movss %xmm0, (%esi) addl $8, %esp popl %esi #FP_REG_KILL ret llvm-svn: 45401	2007-12-29 06:57:38 +00:00
Chris Lattner	e3515220d2	avoid going through a stack slot to convert from fpstack to xmm reg if we are just going to store it back anyway. This improves things like: double foo(); void bar(double P) { P = foo(); } llvm-svn: 45399	2007-12-29 06:41:28 +00:00
Chris Lattner	2369d2f4ab	dead calls to llvm.stacksave can be deleted, even though they have potential side-effects. llvm-svn: 45392	2007-12-29 00:59:12 +00:00
Chris Lattner	e26989231e	upgrade this test llvm-svn: 45391	2007-12-29 00:57:06 +00:00

... 3 4 5 6 7 ...

5033 Commits