llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-06 03:08:43 +00:00

Author	SHA1	Message	Date
Chad Rosier	21cd759234	Add fast-isel support for returning i1, i8, and i16. llvm-svn: 143669	2011-11-04 00:50:21 +00:00
Dan Gohman	a5f382da8b	Reapply r143206, with fixes. Disallow physical register lifetimes across calls, and only check for nested dependences on the special call-sequence-resource register. llvm-svn: 143660	2011-11-03 21:49:52 +00:00
Pete Cooper	ad3d5b2eee	Reverted r143600 - selector reference change llvm-svn: 143646	2011-11-03 20:47:50 +00:00
Dan Bailey	986e6b02b8	fixed global array handling for ptx to use the correct bit widths llvm-svn: 143640	2011-11-03 19:24:46 +00:00
Craig Topper	124b2fd08c	Add new X86 AVX2 VBROADCAST instructions. llvm-svn: 143612	2011-11-03 07:35:53 +00:00
Chad Rosier	74c4e2c2d9	Add support for sign-extending non-legal types in SelectSIToFP(). llvm-svn: 143603	2011-11-03 02:04:59 +00:00
Pete Cooper	c8a657a2b2	Treat objc selector reference globals as invariant so that MachineLICM can hoist them out of loops. Fixes <rdar://problem/6027699> llvm-svn: 143600	2011-11-03 00:56:36 +00:00
Lang Hames	ceec8ec67e	Try to lower memset/memcpy/memmove to vector instructions on ARM where the alignment permits. llvm-svn: 143582	2011-11-02 22:52:45 +00:00
Nick Lewycky	691d7f80c2	Don't emit a directory entry for the value in DW_AT_comp_dir, that is always implied by directory index zero. llvm-svn: 143570	2011-11-02 20:55:33 +00:00
Chad Rosier	8a613c5ec5	Add support for comparing integer non-legal types. llvm-svn: 143559	2011-11-02 18:08:25 +00:00
Craig Topper	a2a55bd0b4	More AVX2 instructions and intrinsics. llvm-svn: 143536	2011-11-02 06:54:17 +00:00
Craig Topper	c5482eb697	Add a bunch more X86 AVX2 instructions and their corresponding intrinsics. llvm-svn: 143529	2011-11-02 04:42:13 +00:00
Eli Friedman	c60a0ad611	Teach the x86 backend a couple tricks for dealing with v16i8 sra by a constant splat value. Fixes PR11289. llvm-svn: 143498	2011-11-01 21:18:39 +00:00
Richard Osborne	5a9e575e81	Don't fold negative offsets into cp / dp accesses to avoid relocation errors. This can happen if the address + addend is less than the start of the cp / dp. llvm-svn: 143459	2011-11-01 11:31:53 +00:00
Richard Osborne	8175a9601d	Combine various XCore tests for floating point intrinsic support into a single test. llvm-svn: 143458	2011-11-01 10:51:48 +00:00
Richard Osborne	280d51dd14	Move various XCore tests to FileCheck llvm-svn: 143457	2011-11-01 10:41:28 +00:00
Craig Topper	361c873b52	Fix operand type for x86 pmadd_ub_sw intrinsic. llvm-svn: 143455	2011-11-01 07:25:22 +00:00
Craig Topper	dbf10927d7	Fix operand type for int_x86_ssse3_phadd_sw_128 intrinsic llvm-svn: 143336	2011-10-31 07:16:37 +00:00
Craig Topper	c0f93132bd	Test case for X86 FS/GS Base intrinsics llvm-svn: 143332	2011-10-31 02:15:47 +00:00
Craig Topper	6eaf58df7c	Begin adding AVX2 instructions. No selection support yet other than intrinsics. llvm-svn: 143331	2011-10-31 02:15:10 +00:00
Nick Lewycky	7308946be2	Switch new .file directive emission off by default, change llc's flag for it to -enable-dwarf-directory. llvm-svn: 143326	2011-10-31 01:06:02 +00:00
Benjamin Kramer	c0001c42c6	X86: Emit logical shift by constant splat of <16 x i8> as a <8 x i16> shift and zero out the bits where zeros should've been shifted in. llvm-svn: 143315	2011-10-30 17:31:21 +00:00
Craig Topper	e77289b243	Fix return type for X86 mpsadbw instrinsic. The instruction takes in a vector of 8-bit integers, but produces a vector of 16-bit integers. llvm-svn: 143313	2011-10-30 17:22:45 +00:00
Nadav Rotem	8282fc9e3b	Fix pr11266. On x86: (shl V, 1) -> add V,V Hardware support for vector-shift is sparse and in many cases we scalarize the result. Additionally, on sandybridge padd is faster than shl. llvm-svn: 143311	2011-10-30 13:24:22 +00:00
Nadav Rotem	68400d352b	Stabilize the test by specifying an exact cpu target llvm-svn: 143307	2011-10-30 08:07:50 +00:00
Nadav Rotem	6c79131e39	Add a new DAGCombine optimization for BUILD_VECTOR. If all of the inputs are zero/any_extended, create a new simple BV which can be further optimized by other BV optimizations. llvm-svn: 143297	2011-10-29 21:23:04 +00:00
Benjamin Kramer	24c4266ada	Force SSE for this test. llvm-svn: 143291	2011-10-29 19:43:44 +00:00
Dan Gohman	826cec9a4b	Revert r143206, as there are still some failing tests. llvm-svn: 143262	2011-10-29 00:41:52 +00:00
NAKAMURA Takumi	78a0f170d6	test/CodeGen/PowerPC/2008-10-17-AsmMatchingOperands.ll: [PR11218] Mark "REQUIRES: asserts" for now. llvm-svn: 143247	2011-10-28 23:11:03 +00:00
Dan Gohman	dedcc22bcd	Reapply r143177 and r143179 (reverting r143188), with scheduler fixes: Use a separate register, instead of SP, as the calling-convention resource, to avoid spurious conflicts with actual uses of SP. Also, fix unscheduling of calling sequences, which can be triggered by pseudo-two-address dependencies. llvm-svn: 143206	2011-10-28 17:55:38 +00:00
NAKAMURA Takumi	bcfac720a7	Dwarf: [PR11022] Fix emitting DW_AT_const_value(>i64), to be host-endian-neutral. Don't assume APInt::getRawData() would hold target-aware endianness nor host-compliant endianness. rawdata[0] holds most lower i64, even on big endian host. FIXME: Add a testcase for big endian target. FIXME: Ditto on CompileUnit::addConstantFPValue() ? llvm-svn: 143194	2011-10-28 14:12:22 +00:00
NAKAMURA Takumi	b5df9f3cc1	test/CodeGen/X86/2010-08-10-DbgConstant.ll: Add explicit -mtriple=i686-linux. It must be for elf! llvm-svn: 143189	2011-10-28 10:50:52 +00:00
Duncan Sands	a6507c4bcb	Speculatively disable Dan's commits 143177 and 143179 to see if it fixes the dragonegg self-host (it looks like gcc is miscompiled). Original commit messages: Eliminate LegalizeOps' LegalizedNodes map and have it just call RAUW on every node as it legalizes them. This makes it easier to use hasOneUse() heuristics, since unneeded nodes can be removed from the DAG earlier. Make LegalizeOps visit the DAG in an operands-last order. It previously used operands-first, because LegalizeTypes has to go operands-first, and LegalizeTypes used to be part of LegalizeOps, but they're now split. The operands-last order is more natural for several legalization tasks. For example, it allows lowering code for nodes with floating-point or vector constants to see those constants directly instead of seeing the lowered form (often constant-pool loads). This makes some things somewhat more complicated today, though it ought to allow things to be simpler in the future. It also fixes some bugs exposed by Legalizing using RAUW aggressively. Remove the part of LegalizeOps that attempted to patch up invalid chain operands on libcalls generated by LegalizeTypes, since it doesn't work with the new LegalizeOps traversal order. Instead, define what LegalizeTypes is doing to be correct, and transfer the responsibility of keeping calls from having overlapping calling sequences into the scheduler. Teach the scheduler to model callseq_begin/end pairs as having a physical register definition/use to prevent calls from having overlapping calling sequences. This is also somewhat complicated, though there are ways it might be simplified in the future. This addresses rdar://9816668, rdar://10043614, rdar://8434668, and others. Please direct high-level questions about this patch to management. Delete #if 0 code accidentally left in. llvm-svn: 143188	2011-10-28 09:55:57 +00:00
Nick Lewycky	5758d6af22	Always use the string pool, even when it makes the .o larger. This may help tools that read the debug info in the .o files by making the DIE sizes more consistent. llvm-svn: 143186	2011-10-28 05:29:47 +00:00
Dan Gohman	484df993bd	Eliminate LegalizeOps' LegalizedNodes map and have it just call RAUW on every node as it legalizes them. This makes it easier to use hasOneUse() heuristics, since unneeded nodes can be removed from the DAG earlier. Make LegalizeOps visit the DAG in an operands-last order. It previously used operands-first, because LegalizeTypes has to go operands-first, and LegalizeTypes used to be part of LegalizeOps, but they're now split. The operands-last order is more natural for several legalization tasks. For example, it allows lowering code for nodes with floating-point or vector constants to see those constants directly instead of seeing the lowered form (often constant-pool loads). This makes some things somewhat more complicated today, though it ought to allow things to be simpler in the future. It also fixes some bugs exposed by Legalizing using RAUW aggressively. Remove the part of LegalizeOps that attempted to patch up invalid chain operands on libcalls generated by LegalizeTypes, since it doesn't work with the new LegalizeOps traversal order. Instead, define what LegalizeTypes is doing to be correct, and transfer the responsibility of keeping calls from having overlapping calling sequences into the scheduler. Teach the scheduler to model callseq_begin/end pairs as having a physical register definition/use to prevent calls from having overlapping calling sequences. This is also somewhat complicated, though there are ways it might be simplified in the future. This addresses rdar://9816668, rdar://10043614, rdar://8434668, and others. Please direct high-level questions about this patch to management. llvm-svn: 143177	2011-10-28 01:29:32 +00:00
Dan Gohman	892b86e74c	Remove the Alpha backend. llvm-svn: 143164	2011-10-27 22:56:32 +00:00
Jakob Stoklund Olesen	de21509dcd	Also set addrmode6 alignment when align==size. Previously, we were only setting the alignment bits on over-aligned loads and stores. llvm-svn: 143160	2011-10-27 22:39:16 +00:00
Evan Cheng	75271d09f1	Avoid partial CPSR dependency from loop backedges. rdar://10357570 llvm-svn: 143145	2011-10-27 21:21:05 +00:00
Pete Cooper	cca60da8cd	Changed test to check for correct load size instead of shift as the shift might change if optimised llvm-svn: 143116	2011-10-27 18:15:58 +00:00
Nick Lewycky	651475977d	Teach our Dwarf emission to use the string pool. llvm-svn: 143097	2011-10-27 06:44:11 +00:00
Eli Friedman	76e3969f05	Don't crash on 128-bit sdiv by constant. Found by inspection. llvm-svn: 143095	2011-10-27 02:06:39 +00:00
Chad Rosier	e76ba1b654	A branch predicated on a constant can just FastEmit an unconditional branch. llvm-svn: 143086	2011-10-27 00:21:16 +00:00
Rafael Espindola	8c0e2c2fe7	Run test with -verify-machineinstrs. Patch by Sanjoy Das. llvm-svn: 143066	2011-10-26 21:20:26 +00:00
Rafael Espindola	1958dc7193	Fixes an issue reported by -verify-machineinstrs. Patch by Sanjoy Das. llvm-svn: 143064	2011-10-26 21:16:41 +00:00
Rafael Espindola	90896edc6c	This commit introduces two fake instructions MORESTACK_RET and MORESTACK_RET_RESTORE_R10; which are lowered to a RET and a RET followed by a MOV respectively. Having a fake instruction prevents the verifier from seeing a MachineBasicBlock end with a non-terminator (MOV). It also prevents the rather eccentric case of a MachineBasicBlock ending with RET but having successors nevertheless. Patch by Sanjoy Das. llvm-svn: 143062	2011-10-26 21:12:27 +00:00
Lang Hames	d87e366c7f	Make sure short memsets on ARM lower to stores, even when optimizing for size. llvm-svn: 143055	2011-10-26 20:56:52 +00:00
Jim Grosbach	5a61a956cb	Thumb2 remove redundant ".w" suffix from t2MVNCCi pattern. llvm-svn: 143034	2011-10-26 17:28:15 +00:00
James Molloy	9afc8b08f7	Revert r142530 at least temporarily while a discussion is had on llvm-commits regarding exactly how much optsize should optimize for size over performance. llvm-svn: 143023	2011-10-26 08:53:19 +00:00
Evan Cheng	941d5c148f	Revert part of r142530. The patch potentially hurts performance especially on Darwin platforms where -Os means optimize for size without hurting performance. llvm-svn: 143002	2011-10-26 01:17:44 +00:00
Dan Gohman	77125e4240	Remove the Blackfin backend. llvm-svn: 142880	2011-10-25 00:05:42 +00:00
Dan Gohman	b54d296fd4	Remove the SystemZ backend. llvm-svn: 142878	2011-10-24 23:48:32 +00:00
Eli Friedman	652497e03c	Don't crash on variable insertelement on ARM. PR10258. llvm-svn: 142871	2011-10-24 23:08:52 +00:00
Bill Wendling	e37d737f13	Check the visibility of the global variable before placing it into the stubs table. A hidden variable could potentially end up in both lists. <rdar://problem/10336715> llvm-svn: 142869	2011-10-24 23:05:43 +00:00
Dan Gohman	f742ffd7fa	Remove the explicit request for "Latency" scheduling from MSP430, as the Latency scheduler is going away. llvm-svn: 142811	2011-10-24 17:53:16 +00:00
Dan Gohman	6e1bd851dc	Change the default scheduler from Latency to ILP, since Latency is going away. llvm-svn: 142810	2011-10-24 17:45:02 +00:00
Chandler Carruth	68ba25c47d	Completely re-write the algorithm behind MachineBlockPlacement based on discussions with Andy. Fundamentally, the previous algorithm is both counter productive on several fronts and prioritizing things which aren't necessarily the most important: static branch prediction. The new algorithm uses the existing loop CFG structure information to walk through the CFG itself to layout blocks. It coalesces adjacent blocks within the loop where the CFG allows based on the most likely path taken. Finally, it topologically orders the block chains that have been formed. This allows it to choose a (mostly) topologically valid ordering which still priorizes fallthrough within the structural constraints. As a final twist in the algorithm, it does violate the CFG when it discovers a "hot" edge, that is an edge that is more than 4x hotter than the competing edges in the CFG. These are forcibly merged into a fallthrough chain. Future transformations that need te be added are rotation of loop exit conditions to be fallthrough, and better isolation of cold block chains. I'm also planning on adding statistics to model how well the algorithm does at laying out blocks based on the probabilities it receives. The old tests mostly still pass, and I have some new tests to add, but the nested loops are still behaving very strangely. This almost seems like working-as-intended as it rotated the exit branch to be fallthrough, but I'm not convinced this is actually the best layout. It is well supported by the probabilities for loops we currently get, but those are pretty broken for nested loops, so this may change later. llvm-svn: 142743	2011-10-23 09:18:45 +00:00
Nadav Rotem	7a79f94aad	Fix pr11193. SHL inserts zeros from the right, thus even when the original sign_extend_inreg value was of 1-bit, we need to sra. llvm-svn: 142724	2011-10-22 12:39:25 +00:00
Nadav Rotem	57f652cfe4	Fix pr11194. When promoting and splitting integers we need to use ZExtPromotedInteger and SExtPromotedInteger based on the operation we legalize. SetCC return type needs to be legalized via PromoteTargetBoolean. llvm-svn: 142660	2011-10-21 17:35:19 +00:00
Chandler Carruth	2f20f63a01	Don't hard code the desired alignment for loops -- it isn't 16-bytes on all x86 systems. Sorry for the breakage. llvm-svn: 142656	2011-10-21 16:41:39 +00:00
Nadav Rotem	52d820c0dd	1. Fix the widening of SETCC in WidenVecOp_SETCC. Use the correct return CC type. 2. Fix a typo in CONCAT_VECTORS which exposed the bug in #1. llvm-svn: 142648	2011-10-21 11:42:07 +00:00
Chandler Carruth	21c689d1ac	Add loop aligning to MachineBlockPlacement based on review discussion so it's a bit more plausible to use this instead of CodePlacementOpt. The code for this was shamelessly stolen from CodePlacementOpt, and then trimmed down a bit. There doesn't seem to be much utility in returning true/false from this pass as we may or may not have rewritten all of the blocks. Also, the statistic of counting how many loops were aligned doesn't seem terribly important so I removed it. If folks would like it to be included, I'm happy to add it back. This was probably the most egregious of the missing features, and now I'm going to start gathering some performance numbers and looking at specific loop structures that have different layout between the two. Test is updated to include both basic loop alignment and nested loop alignment. llvm-svn: 142645	2011-10-21 08:57:37 +00:00
Chandler Carruth	f352d2d7e3	Add a very basic test for MachineBlockPlacement. This is essentially the canonical example I used when developing it, and is one of the primary motivating real-world use cases for __builtin_expect (when burried under a macro). I'm working on more test cases here, but I'm trying to make sure both that the pass is doing the right thing with the test cases and that they aren't too brittle to changes elsewhere in the code generation pipeline. Feedback and/or suggestions on how to test this are very welcome. Especially feedback on whether testing the block comments is a good strategy; I couldn't find any good examples to steal from but all the other ideas I had were a lot uglier or more fragile. llvm-svn: 142644	2011-10-21 08:01:56 +00:00
Craig Topper	fd96157f13	Remove intrinsics for X86 BLSI, BLSMSK, and BLSR intrinsics and replace with custom isel lowering code. llvm-svn: 142642	2011-10-21 06:55:01 +00:00
Chad Rosier	38661ab3ce	Revert 142337. Thumb1 still doesn't support dynamic stack realignment. :( llvm-svn: 142557	2011-10-20 00:07:12 +00:00
Evan Cheng	057c12c2a0	Fix TLS lowering bug. The CopyFromReg must be glued to the TLSCALL. rdar://10291355 llvm-svn: 142550	2011-10-19 22:22:54 +00:00
Nadav Rotem	df65a641dd	Improve code generation for vselect on SSE2: When checking the availability of instructions using the TLI, a 'promoted' instruction IS available. It means that the value is bitcasted to another type for which there is an operation. The correct check for the availablity of an instruction is to check if it should be expanded. llvm-svn: 142542	2011-10-19 20:43:16 +00:00
James Molloy	73a2a8a45e	Use literal pool loads instead of MOVW/MOVT for materializing global addresses when optimizing for size. On spec/gcc, this caused a codesize improvement of ~1.9% for ARM mode and ~4.9% for Thumb(2) mode. This is codesize including literal pools. The pools themselves doubled in size for ARM mode and quintupled for Thumb mode, leaving suggestion that there is still perhaps redundancy in LLVM's use of constant pools that could be decreased by sharing entries. Fixes PR11087. llvm-svn: 142530	2011-10-19 14:11:07 +00:00
Nadav Rotem	05587f317b	Add support for the vector-widening of vselect and vector-setcc llvm-svn: 142488	2011-10-19 09:45:11 +00:00
Craig Topper	b1fa647871	Rename PEXTR to PEXT. Add intrinsics for BMI instructions. llvm-svn: 142480	2011-10-19 07:48:35 +00:00
Lang Hames	03f36ab3f6	Added testcase for <rdar://problem/10215997> llvm-svn: 142462	2011-10-18 23:50:52 +00:00
Nadav Rotem	f9d8f801d9	Add additional element-promotion tests. llvm-svn: 142442	2011-10-18 23:05:33 +00:00
Nadav Rotem	e435b9e2fd	Fix a bug in the legalization of vector anyext-load and trunc-store. Mem Index starts with zero. llvm-svn: 142434	2011-10-18 22:32:43 +00:00
Bob Wilson	f78f688c02	Fix incorrect check for sign-extended constant BUILD_VECTOR. <rdar://problem/10298332> llvm-svn: 142371	2011-10-18 17:34:51 +00:00
Bob Wilson	0273c767c8	Fix a DAG combiner assertion failure when constant folding BUILD_VECTORS. svn r139159 caused SelectionDAG::getConstant() to promote BUILD_VECTOR operands with illegal types, even before type legalization. For this testcase, that led to one BUILD_VECTOR with i16 operands and another with promoted i32 operands, which triggered the assertion. llvm-svn: 142370	2011-10-18 17:34:47 +00:00
Justin Holewinski	361b3c9ff2	PTX: Fix disabling of MAD instruction selection llvm-svn: 142352	2011-10-18 13:39:20 +00:00
Chad Rosier	eb469f466b	Add support for dynamic stack realignment when in thumb1 mode. rdar://10288916 llvm-svn: 142337	2011-10-18 05:28:00 +00:00
Nick Lewycky	c551c1c5f9	Add support for a new extension to the .file directive: .file filenumber "directory" "filename" This removes one join+split of the directory+filename in MC internals. Because bitcode files have independent fields for directory and filenames in debug info, this patch may change the .o files written by existing .bc files. llvm-svn: 142300	2011-10-17 23:05:28 +00:00
Nadav Rotem	b03ef2cd0b	Add CHECKs and document PR11158. llvm-svn: 142240	2011-10-17 20:23:23 +00:00
Nadav Rotem	2d3d696b51	stabalize tests by specifying the exact sse level llvm-svn: 142229	2011-10-17 19:45:38 +00:00
Hal Finkel	d65adcde2d	use FileCheck and not grep in new tests llvm-svn: 142189	2011-10-17 16:01:41 +00:00
Nadav Rotem	b2979db708	Clean the triple, add check lines. llvm-svn: 142183	2011-10-17 07:07:51 +00:00
Nadav Rotem	77004e845a	Previously v2i32 vectors were legalized to v4i32. Now, they are legalized to v2i64. These tests do not check MMX nor zmoving into them. llvm-svn: 142182	2011-10-17 06:59:01 +00:00
Hal Finkel	8be5b30fa8	Test case for CanLowerReturn fix (r141981) llvm-svn: 142172	2011-10-17 04:03:59 +00:00
Hal Finkel	b128cda81b	Add PPC 440 scheduler and some associated tests (new files) llvm-svn: 142171	2011-10-17 04:03:55 +00:00
Nadav Rotem	eeb400322e	Add tripple and stabalize a few more tests. llvm-svn: 142158	2011-10-16 21:20:54 +00:00
Nadav Rotem	d3f24f0fe5	Add triple to tests. llvm-svn: 142154	2011-10-16 20:53:20 +00:00
Nadav Rotem	60c9b81158	fix a typo in the test llvm-svn: 142153	2011-10-16 20:43:41 +00:00
Nadav Rotem	8ed6f090ef	Enable element promotion type legalization by deafault. Changed tests which assumed that vectors are legalized by widening them. llvm-svn: 142152	2011-10-16 20:31:33 +00:00
Nadav Rotem	22398874f8	Remove the the test which checks the saving of a vector of booleans into memory. The decision was to pack the bits. Currently no codegen supports this. Currently, all of the bits in the vector are saved into the same address in memory. llvm-svn: 142149	2011-10-16 19:06:06 +00:00
Nadav Rotem	a733f43137	Fix a bug in LowerV2I64Splat, which generated a BUILD_VECTOR for which there was no pattern. llvm-svn: 142130	2011-10-16 10:02:06 +00:00
Eli Friedman	c8d798d91e	Add missing correctness check to ARMTargetLowering::ReconstructShuffle. Fixes PR11129. llvm-svn: 142022	2011-10-14 23:58:49 +00:00
Jakob Stoklund Olesen	6f0c76740b	Update live-in lists when splitting critical edges. Fixes PR10814. Patch by Jan Sjödin! llvm-svn: 141960	2011-10-14 17:25:46 +00:00
Craig Topper	0a11eb1b21	Add X86 ANDN instruction. Including instruction selection. llvm-svn: 141947	2011-10-14 07:06:56 +00:00
Craig Topper	6b2120a8e1	Add X86 TZCNT instruction and patterns to select it. Also added core-avx2 processor which is gcc's name for Haswell. llvm-svn: 141939	2011-10-14 03:21:46 +00:00
Jakob Stoklund Olesen	98e48aef6f	Add value numbers when spilling dead defs. When spilling around an instruction with a dead def, remember to add a value number for the def. The missing value number wouldn't normally create problems since there would be an incoming live range as well. However, due to another bug we could spill a dead V_SET0 instruction which doesn't read any values. The missing value number caused an empty live range to be created which is dangerous since it doesn't interfere with anything. This fixes part of PR11125. llvm-svn: 141923	2011-10-14 00:34:31 +00:00
Benjamin Kramer	a705595096	Force CPU type on test so it doesn't accidentally emit movbe instead of bswap on Intel Atom CPUs. llvm-svn: 141863	2011-10-13 14:27:54 +00:00
Kalle Raiskila	15993a5d28	Mark 'branch indirect' instruction as an indirect branch. Not having it confused assembly printing of jumptables. llvm-svn: 141862	2011-10-13 11:40:03 +00:00
Bill Wendling	dcd9c25744	More closely follow libgcc, which has code after the `ret' instruction to release the stack segment and reset the stack pointer. Place the code in its own MBB to make the verifier happy. llvm-svn: 141859	2011-10-13 08:24:19 +00:00
Bill Wendling	2a571af745	Revert r141854 because it was causing failures: http://lab.llvm.org:8011/builders/llvm-x86_64-linux/builds/101 --- Reverse-merging r141854 into '.': U test/MC/Disassembler/X86/x86-32.txt U test/MC/Disassembler/X86/simple-tests.txt D test/CodeGen/X86/bmi.ll U lib/Target/X86/X86InstrInfo.td U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86.td U lib/Target/X86/X86Subtarget.h llvm-svn: 141857	2011-10-13 07:48:07 +00:00
Bill Wendling	a6bf25f30a	Should not add instructions to a BB after a return instruction. The machine instruction verifier doesn't like this, nor do I. llvm-svn: 141856	2011-10-13 07:42:32 +00:00
Craig Topper	eb29e18c9b	Add X86 TZCNT instruction and patterns to select it. Also added core-avx2 processor which is gcc's name for Haswell. llvm-svn: 141854	2011-10-13 07:09:14 +00:00
Jakob Stoklund Olesen	f38db50fed	Also inflate register classes around inline asm. Now that MI->getRegClassConstraint() can also handle inline assembly, don't bail when recomputing the register class of a virtual register used by inline asm. This fixes PR11078. llvm-svn: 141836	2011-10-12 23:37:40 +00:00
Bill Wendling	60bfb5d707	We need to verify that the machine instruction we're using as a replacement for our current machine instruction defines a register with the same register class as what's being replaced. This showed up in the SPEC 403.gcc benchmark, where it would ICE because a tail call was expecting one register class but was given another. (The machine instruction verifier catches this situation.) <rdar://problem/10270968> llvm-svn: 141830	2011-10-12 23:03:40 +00:00
Jakob Stoklund Olesen	2611cc2520	Fix -widen-vmovs liveness issues. When widening a copy, we are reading a larger register that may not be live. Use an <undef> flag to tell the register scavenger and machine code verifier that we know the value isn't defined. We now widen: %S6<def> = COPY %S4<kill>, %D3<imp-def> into: %D3<def> = VMOVD %D2<undef>, pred:14, pred:%noreg, %S4<imp-use,kill> This also keeps the <kill> flag on %S4 so we don't inadvertently kill a live value in %S5. Finally, ensure that ARMBaseInstrInfo::setExecutionDomain() preserves the <undef> flag when converting VMOVD to VORR. llvm-svn: 141746	2011-10-12 00:06:23 +00:00
Bob Wilson	20cc1d9f51	Make this test more specific. There are 3 stats that matched "machine-licm". llvm-svn: 141741	2011-10-11 23:34:31 +00:00
Eric Christopher	57c57a3260	Add a new wrapper node for a DILexicalBlock that encapsulates it and a file. Since it should only be used when necessary propagate it through the backend code generation and tweak testcases accordingly. This helps with code like in clang's test/CodeGen/debug-info-line.c where we have multiple #line directives within a single lexical block and want to generate only a single block that contains each file change. Part of rdar://10246360 llvm-svn: 141729	2011-10-11 22:59:11 +00:00
Bill Wendling	a1c238d8bf	Revert r141529. This is causing failures in the test-suite, like bigstack and ReedSolomon. Boo... llvm-svn: 141716	2011-10-11 21:40:47 +00:00
Devang Patel	3811b42be0	Add dominance check for the instruction being hoisted. For example, MachineLICM should not hoist a load that is not guaranteed to be executed. Radar 10254254. llvm-svn: 141689	2011-10-11 18:09:58 +00:00
Nadav Rotem	6ddbd1308b	Add support for legalization of vector SHL/SRA/SRL instructions llvm-svn: 141667	2011-10-11 14:36:35 +00:00
Craig Topper	f95d9bd513	Test case for X86 LZCNT instruction selection. llvm-svn: 141652	2011-10-11 06:47:01 +00:00
Bill Wendling	2606813218	Test simplification that Ana Pazos noticed. llvm-svn: 141644	2011-10-11 04:43:15 +00:00
NAKAMURA Takumi	00636555f2	test/CodeGen/X86/movbe.ll: Give explicit -mtriple=x86_64-linux, to unbreak win32 hosts. llvm-svn: 141640	2011-10-11 03:41:03 +00:00
Akira Hatanaka	2da85501f4	Test cases for 64-bit load and store instructions. llvm-svn: 141631	2011-10-11 01:52:31 +00:00
Bill Wendling	cb617ae8e0	Add testcase for PR11107. llvm-svn: 141607	2011-10-11 00:26:57 +00:00
Devang Patel	e6091c61c7	Revert r141569 and r141576. llvm-svn: 141594	2011-10-10 23:18:02 +00:00
Bill Wendling	7121342ad5	Reapply r141365 now that PR11107 is fixed. llvm-svn: 141591	2011-10-10 22:59:55 +00:00
Eli Friedman	7188ba35cb	Make sure the X86 backend doesn't explode on 128-bit shuffles in AVX mode. Fixes PR11102. llvm-svn: 141585	2011-10-10 22:28:47 +00:00
Devang Patel	d6f05a47c1	If loop header is also loop exiting block then it may not be safe to hoist instructions. llvm-svn: 141576	2011-10-10 20:32:03 +00:00
Nadav Rotem	38187aec17	Fix 10892 - When lowering SIGN_EXTEND_INREG do not lower v2i64 because the instruction set has no 64-bit SRA support. llvm-svn: 141570	2011-10-10 19:31:45 +00:00
Devang Patel	c149f390c6	Add dominance check for the instruction being hoisted. For example, MachineLICM should not hoist a load that is not guaranteed to be executed. Radar 10254254. llvm-svn: 141569	2011-10-10 19:09:20 +00:00
Benjamin Kramer	c498279bec	X86: Add patterns for the movbe instruction (mov + bswap, only available on atom) llvm-svn: 141563	2011-10-10 18:34:56 +00:00
Bill Wendling	7cba44defc	Revert r141365. It was causing MultiSource/Benchmarks/MiBench/consumer-lame to hang, and possibly SPEC/CINT2006/464_h264ref. llvm-svn: 141560	2011-10-10 18:27:30 +00:00
Bill Wendling	94258753c7	When getting the number of bits necessary for addressing mode ARMII::AddrModeT1_s, we need to take into account that if the frame register is ARM::SP, then the number of bits is 8. If it's not ARM::SP, then the number of bits is 5. llvm-svn: 141529	2011-10-10 07:24:23 +00:00
Jakob Stoklund Olesen	b49557d06d	Add TEST8ri_NOREX pseudo to constrain sub_8bit_hi copies. In 64-bit mode, sub_8bit_hi sub-registers can only be used by NOREX instructions. The COPY created from the EXTRACT_SUBREG DAG node cannot target all GR8 registers, only those in GR8_NOREX. TO enforce this, we ensure that all instructions using the EXTRACT_SUBREG are GR8_NOREX constrained. This fixes PR11088. llvm-svn: 141499	2011-10-08 18:28:28 +00:00
Jakob Stoklund Olesen	2817bbdeaf	Add missing test case for r141410. llvm-svn: 141498	2011-10-08 18:06:54 +00:00
Evan Cheng	99b25c827c	High bits of movmskp{s\|d} and pmovmskb are known zero. rdar://10247336 llvm-svn: 141371	2011-10-07 17:21:44 +00:00
Bob Wilson	b55a64ae72	Reenable tail calls for iOS 5.0 and later. llvm-svn: 141370	2011-10-07 17:17:49 +00:00
Bob Wilson	d8856e17fe	Reenable use of divmod compiler_rt functions for iOS 5.0 and later. llvm-svn: 141368	2011-10-07 16:59:21 +00:00
Anton Korobeynikov	0944a4c5cc	Peephole optimization for ABS on ARM. Patch by Ana Pazos! llvm-svn: 141365	2011-10-07 16:15:08 +00:00
Cameron Zwarich	cc5f846d58	Remove a check from ARM shifted operand isel helper methods, which were blocking merging an lsl #2 that has multiple uses on A9. This shift is free, so there is no problem merging it in multiple places. Other unprofitable shifts will not be merged. llvm-svn: 141247	2011-10-05 23:38:50 +00:00
Dan Gohman	779ae47721	Make this test less sensitive to codegen optimizations. llvm-svn: 141195	2011-10-05 18:13:08 +00:00
Akira Hatanaka	0a697c10ae	Move CHECK after entry label. llvm-svn: 141030	2011-10-03 21:24:30 +00:00
Akira Hatanaka	76c268271d	Add support for 64-bit logical NOR. llvm-svn: 141029	2011-10-03 21:23:18 +00:00
Akira Hatanaka	ae50204423	Add support for 64-bit count leading ones and zeros instructions. llvm-svn: 141028	2011-10-03 21:16:50 +00:00
Akira Hatanaka	6c6ff6fde7	Add support for 64-bit divide instructions. llvm-svn: 141024	2011-10-03 21:06:13 +00:00
Akira Hatanaka	e3c1d7eac0	Add support for 64-bit integer multiply instructions. llvm-svn: 141017	2011-10-03 20:01:11 +00:00
Bill Wendling	98db234c02	Filecheck-ize. llvm-svn: 140904	2011-09-30 23:40:29 +00:00
Bill Wendling	252b649025	Add new line at end of file. llvm-svn: 140903	2011-09-30 23:21:11 +00:00
Bill Wendling	fcf3096d9b	When inferring the pointer alignment, if the global doesn't have an initializer and the alignment is 0 (i.e., it's defined globally in one file and declared in another file) it could get an alignment which is larger than the ABI allows for that type, resulting in aligned moves being used for unaligned loads. For instance, in file A.c: struct S s; In file B.c: struct { // something long }; extern S s; void foo() { struct S p = s; // ... } this copy is a 'memcpy' which is turned into a series of 'movaps' instructions on X86. But this is wrong, because 'struct S' has alignment of 4, not 16. llvm-svn: 140902	2011-09-30 23:19:55 +00:00
Akira Hatanaka	5479850400	Remove unnecessary checking of register operands. llvm-svn: 140872	2011-09-30 19:18:24 +00:00
Akira Hatanaka	c9268767d6	Add definitions of Mips64 rotate instructions. llvm-svn: 140870	2011-09-30 18:51:46 +00:00
Jim Grosbach	44047da675	ARM Darwin default relocation model is PIC. This matches clang, so default options in llc and friends are now closer to clang's defaults. llvm-svn: 140863	2011-09-30 17:41:35 +00:00
Akira Hatanaka	ffa28d49f1	Check values of immediate operands. llvm-svn: 140860	2011-09-30 17:19:21 +00:00
Akira Hatanaka	dcd69c2e07	Mips64 shift instructions. llvm-svn: 140841	2011-09-30 03:18:46 +00:00
Akira Hatanaka	4bdb4b1958	Mips64 arithmetic and logical instructions with one source register and immediate. llvm-svn: 140839	2011-09-30 02:08:54 +00:00
Akira Hatanaka	50a4f6d570	Fill delay slot with useful instructions. Modified from Sparc's version of delay slot filler. Patch by Reed Kotler at Mips Technologies. llvm-svn: 140825	2011-09-29 23:52:13 +00:00
Akira Hatanaka	70be05d5d0	Mips64 arithmetic and logical instructions with two source registers. llvm-svn: 140806	2011-09-29 20:37:56 +00:00
Andrew Trick	e1d5ae73ac	LSR: rewrite inner loops only. Rewriting the entire loop nest now requires -enable-lsr-nested. See PR11035 for some performance data. A few unit tests specifically test nested LSR, and are now under a flag. llvm-svn: 140762	2011-09-29 01:33:38 +00:00
Andrew Trick	75c6a45d08	whitespace llvm-svn: 140761	2011-09-29 01:31:48 +00:00
Justin Holewinski	4966d44b44	PTX: Add new patterns for bitconvert and any_extend llvm-svn: 140753	2011-09-29 01:13:12 +00:00

1 2 3 4 5 ...

5299 Commits