llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-12-28 14:36:34 +00:00

Author	SHA1	Message	Date
Jakob Stoklund Olesen	c74b7b271e	Handle regmasks in MachineCSE. Don't attempt to extend physreg live ranges across calls. <rdar://problem/10942095> llvm-svn: 151610	2012-02-28 02:08:50 +00:00
Jakob Stoklund Olesen	c6377253f7	Enable ARM base pointer when calling functions with large arguments. When an outgoing call takes more than 2k of arguments on the stack, we don't allocate that call frame in the prolog, but adjust the stack pointer immediately before the call instead. This causes problems with the emergency spill slot because PEI can't track stack pointer adjustments on the second pass, and if the outgoing arguments are too big, SP can't be used to reach the emergency spill slot at all. Work around these problems by ensuring there is a base or frame pointer that can be used to access the emergency spill slot. <rdar://problem/10917166> llvm-svn: 151604	2012-02-28 01:15:01 +00:00
Preston Gurd	81931b8e3b	test commit. llvm-svn: 151588	2012-02-27 23:31:51 +00:00
Roman Divacky	588712f080	Test the section specification. llvm-svn: 151552	2012-02-27 20:42:19 +00:00
Roman Divacky	200acf8e6e	Reapply r151278 with fixes. MCize function entry label emission on PowerPC64 properly. llvm-svn: 151547	2012-02-27 20:20:47 +00:00
Hal Finkel	3aea686faa	Revert r151278, breaks static linking. Reverting this because it breaks static linking on ppc64. Specifically, it may be linkonce_odr functions that are the problem. With this patch, if you link statically, calls to some functions end up calling their descriptor addresses instead of calling to their entry points. This causes the execution to fail with SIGILL (b/c the descriptor address just has some pointers, not code). llvm-svn: 151433	2012-02-25 03:40:11 +00:00
NAKAMURA Takumi	17b6271b41	Target/X86: Fix assertion failures and warnings caused by r151382 _ftol2 lowering for i386-*-win32 targets. Patch by Joe Groff. [Joe Groff] Hi everyone. My previous patch applied as r151382 had a few problems: Clang raised a warning, and X86 LowerOperation would assert out for fptoui f64 to i32 because it improperly lowered to an illegal BUILD_PAIR. Here's a patch that addresses these issues. Let me know if any other changes are necessary. Thanks. llvm-svn: 151432	2012-02-25 03:37:25 +00:00
Akira Hatanaka	8fc9a35d3f	Add definitions of floating point multiply add/sub and negative multiply add/sub instructions. llvm-svn: 151415	2012-02-25 00:21:52 +00:00
Akira Hatanaka	3b3ee53886	Add an option to use a virtual register as the global base register instead of reserving a physical register ($gp or $28) for that purpose. This will completely eliminate loads that restore the value of $gp after every function call, if the register allocator assigns a callee-saved register, or eliminate unnecessary loads if it assigns a temporary register. example: .cpload $25 // set $gp. ... .cprestore 16 // store $gp to stack slot 16($sp). ... jalr $25 // function call. clobbers $gp. lw $gp, 16($sp) // not emitted if callee-saved reg is chosen. ... lw $2, 4($gp) ... jalr $25 // function call. lw $gp, 16($sp) // not emitted if $gp is not live after this instruction. ... llvm-svn: 151402	2012-02-24 22:34:47 +00:00
Michael J. Spencer	d2f0ce2674	Add WIN_FTOL_* psudo-instructions to model the unique calling convention used by the Win32 _ftol2 runtime function. Patch by Joe Groff! llvm-svn: 151382	2012-02-24 19:01:22 +00:00
Hal Finkel	784c4bf068	X11/X2 loads around indirect calls on ppc64 should not be deleted. llvm-svn: 151374	2012-02-24 17:54:01 +00:00
Hal Finkel	8c2c90c035	Don't crash when a glue node contains an internal CopyToReg This is necessary to support the existing ppc lowering code for indirect calls. Fixes PR12071. llvm-svn: 151373	2012-02-24 17:53:59 +00:00
Kristof Beyls	3f16b0ead0	test commit. removing unnecessary whitespace. llvm-svn: 151363	2012-02-24 13:52:45 +00:00
NAKAMURA Takumi	d8b4183963	test/CodeGen/X86/2012-02-23-mmx-inlineasm.ll: Fixup to add -march=x86. -mcpu does not choose arch automatically, on non-x86 hosts. llvm-svn: 151362	2012-02-24 13:29:50 +00:00
Pete Cooper	135769381b	Turn avx insert intrinsic calls into INSERT_SUBVECTOR DAG nodes and remove duplicate patterns for selecting the intrinsics llvm-svn: 151342	2012-02-24 03:51:49 +00:00
Jim Grosbach	4ff2fb2fbc	Thumb2 size reduction fix for tied operands of tMUL. The tied source operand of tMUL is the second source operand, not the first like every other two-address thumb instruction. Special case it in the size reduction pass to make sure we create the tMUL instruction properly. llvm-svn: 151315	2012-02-24 00:33:36 +00:00
Dan Gohman	8da4093a80	When emitting a cmp with 0 for a lowered select, mask out the high bits of the value carying the boolean condition, as their contents are undefined. This fixes rdar://10887484. llvm-svn: 151310	2012-02-24 00:09:36 +00:00
Bill Wendling	1a35321235	Allow an integer to be converted into an MMX type when it's used in an inline asm. <rdar://problem/10106006> llvm-svn: 151303	2012-02-23 23:25:25 +00:00
Roman Divacky	35c45da372	MCize function entry label emission on PowerPC64 properly. llvm-svn: 151278	2012-02-23 20:28:39 +00:00
Jakob Stoklund Olesen	3809cf9ffe	Make tests less sensitive to scheduling changes. llvm-svn: 151260	2012-02-23 17:19:34 +00:00
Anton Korobeynikov	fb863cd279	Fix to make sure that a comdat group gets generated correctly for a static member of instantiated C++ templates. Patch by Kristof Beyls! llvm-svn: 151250	2012-02-23 10:36:04 +00:00
Evan Cheng	9d9b58cc0d	Canonicalize (srl (bswap x), 16) to (rotr (bswap x), 16) if the high 16 bits of x are zero. This optimizes rev + lsr 16 to rev16. rdar://10750814 llvm-svn: 151230	2012-02-23 02:58:19 +00:00
Evan Cheng	d18a688213	Optimize a couple of common patterns involving conditional moves where the false value is zero. Instead of a cmov + op, issue an conditional op instead. e.g. cmp r9, r4 mov r4, #0 moveq r4, #1 orr lr, lr, r4 should be: cmp r9, r4 orreq lr, lr, #1 That is, optimize (or x, (cmov 0, y, cond)) to (or.cond x, y). Similarly extend this to xor as well as (and x, (cmov -1, y, cond)) => (and.cond x, y). It's possible to extend this to ADD and SUB but I don't think they are common. rdar://8659097 llvm-svn: 151224	2012-02-23 01:19:06 +00:00
Daniel Dunbar	cac06bf0c6	MC: Fix the MCNullStreamer which was broken in r147763. llvm-svn: 151213	2012-02-22 23:49:50 +00:00
Hal Finkel	cfc8c850f6	Allow the use of an alternate symbol for calculating a function's size. The standard function epilog includes a .size directive, but ppc64 uses an alternate local symbol to tag the actual start of each function. Until recently, binutils accepted the .size directive as: .size test1, .Ltmp0-test1 however, using this directive with recent binutils will result in the error: .size expression for XXX does not evaluate to a constant so we must use the label which actually tags the start of the function. llvm-svn: 151200	2012-02-22 21:11:47 +00:00
Michael J. Spencer	24f6d49962	Properly emit _fltused with FastISel. Refactor to share code with SDAG. Patch by Joe Groff! llvm-svn: 151183	2012-02-22 19:06:13 +00:00
Aaron Ballman	a76a5b7265	Adding support for Microsoft's thiscall calling convention. LLVM side of the patch. llvm-svn: 151123	2012-02-22 03:04:40 +00:00
Jakob Stoklund Olesen	4404c980b2	Remove a bad PowerPC test. This test case was way too strict, matching the entire assembly output. Every non-trivial change to the ppc backend or -O0 pipeline required the test to be updated. It should be replaced with a test of the specific vaarg feature. llvm-svn: 151105	2012-02-21 23:49:18 +00:00
Evan Cheng	9759637dc1	Proper support for a bastardized darwin-eabi hybird ABI. llvm-svn: 151083	2012-02-21 20:46:00 +00:00
NAKAMURA Takumi	0fac05f8e2	test/CodeGen/X86/2012-02-20-MachineCPBug.ll: Fix on generic(non-x86) hosts to add -mattr=+sse. llvm-svn: 151053	2012-02-21 11:56:42 +00:00
Evan Cheng	3bffc22fc2	Fix machine-cp by having it to check sub-register indicies. e.g. ecx = mov eax al = mov ch The second copy is not a nop because the sub-indices of ecx,ch is not the same of that of eax/al. Re-enabled machine-cp. PR11940 llvm-svn: 151002	2012-02-20 23:28:17 +00:00
Eric Christopher	c2e76f573d	Testcase for the previous commit. llvm-svn: 150852	2012-02-18 00:05:45 +00:00
David Chisnall	86b0f069d6	It turns out that putting an 8-byte symbol in a 4-byte section makes Solaris ld sulk. GNU ld is perfectly happy with it, which is worrying for a whole other set of reasons... Thanks to Anton, Duncan and Rafael for helping me track this down. Pointy hat to Rafael for introducing the bug in the first place. llvm-svn: 150811	2012-02-17 16:05:50 +00:00
Chad Rosier	7867a0bd92	[fast-isel] Add support for returning non-legal types with no sign- or zero- entend flag. llvm-svn: 150774	2012-02-17 01:21:28 +00:00
Bill Wendling	c137296347	Use –mcpu=generic, so that the test will not fail when run on an Intel Atom processor, due to the Atom scheduler producing an instruction sequence that is different from that which is expected. Patch by Michael Spencer! llvm-svn: 150736	2012-02-16 22:42:48 +00:00
Benjamin Kramer	814de25917	Disable machine copy propagation for now. It's known to be buggy (PR11940) and introduces subtle miscompiles in many places. llvm-svn: 150703	2012-02-16 17:29:50 +00:00
Eli Bendersky	4afdeeb682	Replace all instances of dg.exp file with lit.local.cfg, since all tests are run with LIT now and now Dejagnu. dg.exp is no longer needed. Patch reviewed by Daniel Dunbar. It will be followed by additional cleanup patches. llvm-svn: 150664	2012-02-16 06:28:33 +00:00
Bill Wendling	0ea63367ec	Add a test for generating Objective-C metadata from module flags. llvm-svn: 150635	2012-02-15 23:43:37 +00:00
Pete Cooper	21409dd760	Stop custom lowering forr x86 DEC64m from happening if the load in the lowered sequence has more than 1 user llvm-svn: 150537	2012-02-15 00:33:37 +00:00
Lang Hames	11ccc79191	Tighten physical register invariants: Allocatable physical registers can only be live in to a block if it is the function entry point or a landing pad. llvm-svn: 150494	2012-02-14 18:51:53 +00:00
Nadav Rotem	5da800572a	Fix PR12000. Some vector operations may use scalar operands with types that are greater than the vector element type. For example BUILD_VECTOR of type <1 x i1> with a constant i8 operand. This patch fixes the assertion. llvm-svn: 150477	2012-02-14 13:06:32 +00:00
Nadav Rotem	2141a8413e	Fix a bug in DAGCombine for the optimization of BUILD_VECTOR. We cant generate a shuffle node from two vectors of different types. llvm-svn: 150383	2012-02-13 12:42:26 +00:00
Craig Topper	1487726cdf	Revert accidental commit of a pruned testcase from r150360. llvm-svn: 150361	2012-02-13 04:33:33 +00:00
Craig Topper	250c8fb194	Update CanXFormVExtractWithShuffleIntoLoad to ensure bitcasts of loads only have one use. Matches DAGCombiner and prevents vector_shuffles from reaching isel. llvm-svn: 150360	2012-02-13 04:30:38 +00:00
Pete Cooper	b1229a8866	Fixed bug when custom lowering DEC64m on x86. If the DEC node had more than one user, it was doing this lowering but leaving the original DEC node around and so decrementing twice. Fixes PR11964. llvm-svn: 150356	2012-02-13 00:10:03 +00:00
Nadav Rotem	ea4aecb3e5	This patch addresses the problem of poor code generation for the zext v8i8 -> v8i32 on AVX machines. The codegen often scalarizes ANY_EXTEND nodes. The DAGCombiner has two optimizations that can mitigate the problem. First, if all of the operands of a BUILD_VECTOR node are extracted from an ZEXT/ANYEXT nodes, then it is possible to create a new simplified BUILD_VECTOR which uses UNDEFS/ZERO values to eliminate the scalar ZEXT/ANYEXT nodes. Second, another dag combine optimization lowers BUILD_VECTOR into a shuffle vector instruction. In the case of zext v8i8->v8i32 on AVX, a value in an XMM register is to be shuffled into a wide YMM register. This patch modifes the second optimization and allows the creation of shuffle vectors even when the newly generated vector and the original vector from which we extract the values are of different types. llvm-svn: 150340	2012-02-12 15:05:31 +00:00
Anton Korobeynikov	5996573d4b	Add support for implicit TLS model used with MS VC runtime. Patch by Kai Nacke! llvm-svn: 150307	2012-02-11 17:26:53 +00:00
Andrew Trick	c3cc8fa604	RegAlloc superpass: includes phi elimination, coalescing, and scheduling. Creates a configurable regalloc pipeline. Ensure specific llc options do what they say and nothing more: -reglloc=... has no effect other than selecting the allocator pass itself. This patch introduces a new umbrella flag, "-optimize-regalloc", to enable/disable the optimizing regalloc "superpass". This allows for example testing coalscing and scheduling under -O0 or vice-versa. When a CodeGen pass requires the MachineFunction to have a particular property, we need to explicitly define that property so it can be directly queried rather than naming a specific Pass. For example, to check for SSA, use MRI->isSSA, not addRequired<PHIElimination>. CodeGen transformation passes are never "required" as an analysis ProcessImplicitDefs does not require LiveVariables. We have a plan to massively simplify some of the early passes within the regalloc superpass. llvm-svn: 150226	2012-02-10 04:10:36 +00:00
NAKAMURA Takumi	81f7ad5b9b	test/CodeGen/X86/atom-lea-sp.ll: Add explicit -mtriple=i686-linux. llvm-svn: 150151	2012-02-09 05:12:58 +00:00
Evan Cheng	1be96ff50e	Commit Andy Zhang's test for the lea patch. llvm-svn: 150107	2012-02-08 22:33:17 +00:00

1 2 3 4 5 ...

5658 Commits