memory builtins as equivalent to malloc/free.
This is different from any attribute we have. For example, you can delete the
allocators when their result is unused, but you can't collapse two calls to the
same function, even if no global/memory state has changed in between. The
noalias return attribute states that the result does not alias any other
pointer, but instcombine optimizes malloc() as though the result were non-null
for the purpose of eliminating unused pointers.
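As a rough sketch of that distinction (illustrative only; the function names
below are made up, not from this commit):

#include <cstdlib>

void unused_result() {
  // The allocation is only ever handed to free, so an optimizer is allowed
  // to delete the malloc/free pair entirely.
  void *p = std::malloc(16);
  std::free(p);
}

void distinct_results(void **a, void **b) {
  // No memory state changes between the two calls, yet they still cannot be
  // collapsed into one: each call must return a distinct allocation.
  *a = std::malloc(16);
  *b = std::malloc(16);
}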
llvm-svn: 127673
v2 = bitcast v1
...
v3 = bitcast v2
...
= v3
=>
v2 = bitcast v1
...
= v1
if v1 and v3 are in the same register class.
Bitcasts between i32 and fp (and others) are often not nops since they
are in different register classes. These bitcast instructions are often
left behind because they are in different basic blocks and cannot be
eliminated by dag combine.
rdar://9104514
llvm-svn: 127668
instruction set. This code adds support for the VEX prefix
and for the YMM registers accessible on AVX-enabled
architectures. Instruction table support that enables AVX
instructions for the disassembler is in an upcoming patch.
llvm-svn: 127644
and then go kablooie. The problem was that it was tracking the PHI nodes anew
each time it entered this function, but it didn't need to. Because the
recursion didn't know that a PHINode had already been visited, it would go
ahead and call itself.
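A minimal sketch of the shape of the fix (a visited set threaded through the
recursion); the types and names here are stand-ins, not the actual LLVM code:

#include <unordered_set>
#include <vector>

// Illustrative stand-in for an IR PHI node; not the real llvm::PHINode.
struct PHINode {
  std::vector<PHINode *> Incoming;
};

// Remember which PHI nodes were already visited so that a cycle of PHIs is
// walked once instead of recursing until the stack blows up.
static void visitPHI(PHINode *PN, std::unordered_set<PHINode *> &Visited) {
  if (!Visited.insert(PN).second)
    return; // already seen this PHI; stop instead of recursing again
  for (PHINode *In : PN->Incoming)
    visitPHI(In, Visited);
}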
There is a testcase, but unfortunately it's too big to add. This problem will go
away with the EH rewrite.
<rdar://problem/8856298>
llvm-svn: 127640
Also more cleanly separate the ARM vs. Thumb functionality. Previously, the
encoding would be incorrect for some Thumb instructions (the indirect calls).
llvm-svn: 127637
Remove the unused reserved_ bit vector, no functional change intended.
This doesn't break 'svn blame'; this file really is all my fault.
llvm-svn: 127607
properties.
Added the self-wrap flag for SCEV::AddRecExpr.
A slew of temporary FIXMEs indicate the intention of the no-self-wrap flag
without changing behavior in this revision.
llvm-svn: 127590
load and store reference the same memory location, the memory location
is represented by a getelementptr with two uses (the load and the store), and
the getelementptr's base is an alloca with a single use. At this point, the
instructions from the alloca to the store can be removed.
(This pattern is generated when a bitfield is accessed.)
For example,
%u = alloca %struct.test, align 4 ; [#uses=1]
%0 = getelementptr inbounds %struct.test* %u, i32 0, i32 0;[#uses=2]
%1 = load i8* %0, align 4 ; [#uses=1]
%2 = and i8 %1, -16 ; [#uses=1]
%3 = or i8 %2, 5 ; [#uses=1]
store i8 %3, i8* %0, align 4
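For context, roughly the kind of source that produces this read-modify-write
sequence (a guess at the layout; the field names and widths are made up):

struct test {
  unsigned a : 4; // low nibble of the first byte
  unsigned b : 4; // high nibble of the first byte
};

void set_field() {
  // Storing to one bitfield loads the containing byte, masks the field
  // (and i8 %1, -16), inserts the new value (or i8 %2, 5), and stores the
  // byte back. With nothing else using %u, the whole sequence is dead.
  test u;
  u.a = 5;
}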
llvm-svn: 127565
llvm-gcc-i386-linux-selfhost and llvm-x86_64-linux-checks buildbots.
The original log entry:
Remove the optimization emitting a reference instead of a label difference,
since it can create more relocations. Removed the isBaseAddressKnownZero method,
because it is no longer used.
llvm-svn: 127540
Live range splitting can create a number of small live ranges containing only a
single real use. Spill these small live ranges along with the large range they
are connected to by copies. This enables memory operand folding and maximizes
the spill-to-fill distance.
Work in progress with known bugs.
llvm-svn: 127529
There are too many compatibility problems with using mixed types in
std::upper_bound, and I don't want to spend 110 lines of boilerplate setting up
a call to a 10-line function. Binary search is not /that/ hard to implement
correctly.
I tried terminating the binary search with a linear search, but contrary to my
expectation that actually made the algorithm slower. Most live intervals have
fewer than 4 segments. The early test against endIndex() does pay off, and this
version is 25% faster than plain std::upper_bound().
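A hedged sketch of the shape of such a search (the real LiveInterval code
differs; Segment, the endIndex-style early test, and findSegment are
illustrative names):

#include <cstddef>
#include <vector>

// Illustrative segment type; not the real live interval representation.
struct Segment {
  unsigned Start, End;
};

// Find the first segment whose end is greater than Pos: a cheap early test
// against the interval's overall end, followed by a plain binary search.
static const Segment *findSegment(const std::vector<Segment> &Segs,
                                  unsigned Pos) {
  if (Segs.empty() || Pos >= Segs.back().End)
    return nullptr; // past the end of the interval; the common cheap exit
  std::size_t Lo = 0, Hi = Segs.size();
  while (Lo < Hi) {
    std::size_t Mid = Lo + (Hi - Lo) / 2;
    if (Segs[Mid].End <= Pos)
      Lo = Mid + 1;
    else
      Hi = Mid;
  }
  return &Segs[Lo];
}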
llvm-svn: 127522
actual instruction as the non-Darwin defs, but have different call-clobber
semantics and so need separate patterns. They don't need to duplicate the
encoding information, however.
llvm-svn: 127515