llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-04-02 07:41:38 +00:00

Author	SHA1	Message	Date
Jakob Stoklund Olesen	11a08ab6be	Add a verification pass after ExpandISelPseudos. This pass often has weird CFG hacks and hand-written MI building code that can go wrong in many ways. llvm-svn: 162224	2012-08-20 20:52:08 +00:00
Jakob Stoklund Olesen	698d32d8c4	Add CFG checks to MachineVerifier. Verify that the predecessor and successor lists are consistent and free of duplicates. llvm-svn: 162223	2012-08-20 20:52:06 +00:00
Jakob Stoklund Olesen	4d875f1e57	Use a SmallPtrSet to dedup successors in EmitSjLjDispatchBlock. The test case ARM/2011-05-04-MultipleLandingPadSuccs.ll was creating duplicate successor list entries. llvm-svn: 162222	2012-08-20 20:52:03 +00:00
Sebastian Pop	6800067200	enable cross compilation with cmake This patch allows us to use cmake to specify a cross compiler: target different than host. In particular, it moves LLVM_DEFAULT_TARGET_TRIPLE and TARGET_TRIPLE variables from cmake/config-ix.cmake to the toplevel CMakeLists.txt to make them available at configure time. Here is the command line that I have used to test my patches to create a Hexagon cross compiler hosted on x86: $ cmake -G Ninja -D LLVM_TARGETS_TO_BUILD:STRING=Hexagon -D TARGET_TRIPLE:STRING=hexagon-unknown-linux-gnu -D LLVM_DEFAULT_TARGET_TRIPLE:STRING=hexagon-unknown-linux-gnu -D LLVM_TARGET_ARCH:STRING=hexagon-unknown-linux-gnu .. $ ninja check llvm-svn: 162219	2012-08-20 19:56:52 +00:00
Sebastian Pop	e6804b03a8	enable Hexagon target from cmake The patch adds a missing case for the Hexagon target in cmake/config-ix.cmake. llvm-svn: 162218	2012-08-20 19:56:49 +00:00
Sebastian Pop	2f1237d5f4	fix HexagonSubtarget parsing of -mv flag llvm-svn: 162217	2012-08-20 19:56:47 +00:00
Michael Liao	3d421a0c4d	fix a case where all operands of BUILD_VECTOR are undefined llvm-svn: 162214	2012-08-20 17:59:18 +00:00
Akira Hatanaka	b64681df6d	Fix coding style violations in 162135 and 162136. Patch by Petar Jovanovic. llvm-svn: 162213	2012-08-20 17:53:24 +00:00
Benjamin Kramer	3c9d9b798d	DataExtractor: Fix integer truncation issues in LEB128 extraction. llvm-svn: 162201	2012-08-20 10:52:11 +00:00
Stepan Dyatkovskiy	0a4850d1ef	Forget to add testcase for r162195. Sorry. llvm-svn: 162196	2012-08-20 08:03:18 +00:00
Stepan Dyatkovskiy	d39f5417bb	Fixed DAGCombiner bug (found and localized by James Malloy): The DAGCombiner tries to optimise a BUILD_VECTOR by checking if it consists purely of get_vector_elts from one or two source vectors. If so, it either makes a concat_vectors node or a shufflevector node. However, it doesn't check the element type width of the underlying vector, so if you have this sequence: Node0: v4i16 = ... Node1: i32 = extract_vector_elt Node0 Node2: i32 = extract_vector_elt Node0 Node3: v16i8 = BUILD_VECTOR Node1, Node2, ... It will attempt to: Node0: v4i16 = ... NewNode1: v16i8 = concat_vectors Node0, ... Where this is actually invalid because the element width is completely different. This causes an assertion failure on DAG legalization stage. Fix: If output item type of BUILD_VECTOR differs from input item type. Make concat_vectors based on input element type and then bitcast it to the output vector type. So the case described above will transformed to: Node0: v4i16 = ... NewNode1: v8i16 = concat_vectors Node0, ... NewNode2: v16i8 = bitcast NewNode1 llvm-svn: 162195	2012-08-20 07:57:06 +00:00
Craig Topper	77406bef3b	Remove FMA3 intrinsic instructions in favor of patterns. llvm-svn: 162194	2012-08-20 06:21:25 +00:00
Craig Topper	64c93f9d07	Use correct intrinsic for 256-bit VFMSUBADDPS. llvm-svn: 162193	2012-08-20 06:03:04 +00:00
Craig Topper	832951e7da	Remove trailing white space and tab characters. No functional change. llvm-svn: 162192	2012-08-19 23:37:46 +00:00
Nadav Rotem	589dc766e0	When unsafe math is used, we can use commutative FMAX and FMIN. In some cases this allows for better code generation. Added a new DAGCombine transformation to convert FMAX and FMIN to FMANC and FMINC, which are commutative. For example: movaps %xmm0, %xmm1 movsd LC(%rip), %xmm0 minsd %xmm1, %xmm0 becomes: minsd LC(%rip), %xmm0 llvm-svn: 162187	2012-08-19 13:06:16 +00:00
Benjamin Kramer	dca12ad159	Fabs folding is implemented. llvm-svn: 162186	2012-08-19 09:51:44 +00:00
Benjamin Kramer	54d9d1a993	InstCombine: Fix a crasher when encountering a function pointer. llvm-svn: 162180	2012-08-18 22:04:34 +00:00
Jakob Stoklund Olesen	abf0a9ec82	Remove the CAND/COR/CXOR custom ISD nodes and their select code. These nodes are no longer needed because the peephole pass can fold CMOV+AND into ANDCC etc. llvm-svn: 162179	2012-08-18 21:49:50 +00:00
Craig Topper	4362ba5082	Remove virtual from many methods. These methods replace methods in the base class, but the base class methods aren't virtual so it just increased call overhead. llvm-svn: 162178	2012-08-18 21:38:45 +00:00
Jakob Stoklund Olesen	e78d4a5b08	Also combine zext/sext into selects for ARM. This turns common i1 patterns into predicated instructions: (add (zext cc), x) -> (select cc (add x, 1), x) (add (sext cc), x) -> (select cc (add x, -1), x) For a function like: unsigned f(unsigned s, int x) { return s + (x>0); } We now produce: cmp r1, #0 it gt addgt.w r0, r0, #1 Instead of: movs r2, #0 cmp r1, #0 it gt movgt r2, #1 add r0, r2 llvm-svn: 162177	2012-08-18 21:25:22 +00:00
Jakob Stoklund Olesen	ece4a53017	Also pass logical ops to combineSelectAndUse. Add these transformations to the existing add/sub ones: (and (select cc, -1, c), x) -> (select cc, x, (and, x, c)) (or (select cc, 0, c), x) -> (select cc, x, (or, x, c)) (xor (select cc, 0, c), x) -> (select cc, x, (xor, x, c)) The selects can then be transformed to a single predicated instruction by peephole. This transformation will make it possible to eliminate the ISD::CAND, COR, and CXOR custom DAG nodes. llvm-svn: 162176	2012-08-18 21:25:16 +00:00
Benjamin Kramer	f79b68f912	Remove overly conservative hasOneUse check, this always expands into a single IR instruction. llvm-svn: 162175	2012-08-18 20:24:19 +00:00
Benjamin Kramer	1a05d12328	InstCombine: Add a couple of fabs identities for comparing with 0.0. llvm-svn: 162174	2012-08-18 20:06:47 +00:00
Benjamin Kramer	31074f526a	SimplifyLibcalls: Add fabs and trunc to the list of libcalls that are safe to shrink from double to float. llvm-svn: 162173	2012-08-18 19:27:32 +00:00
Nadav Rotem	d01a7b5942	Reapply r162160 with a fix: Optimize Arith->Trunc->SETCC sequence to allow better compare/branch code. llvm-svn: 162172	2012-08-18 17:53:03 +00:00
Anton Korobeynikov	c0e610e681	fp16-to-fp32 conversion instructions are available in Thumb mode as well. Make sure the generic pattern is used. llvm-svn: 162170	2012-08-18 13:08:43 +00:00
Craig Topper	e341db552a	Refactor code a bit to reduce number of calls in the final compiled code. No functional change intended. llvm-svn: 162166	2012-08-18 06:39:34 +00:00
Craig Topper	d35582ae96	Reorder initialization list to silence -Wreorder llvm-svn: 162165	2012-08-18 06:20:54 +00:00
Nadav Rotem	e9cdefa762	Revert r162160 because it made a few buildbots fail. llvm-svn: 162164	2012-08-18 05:02:36 +00:00
Nadav Rotem	76f1b84f58	The X86 backend has a number of optimizations for SETCC nodes which use arithmetic instructions. However, when small data types are used, a truncate node appears between the SETCC node and the arithmetic operation. This patch adds support for this pattern. Before: xorl %esi, %edi testb %dil, %dil setne %al ret After: xorb %dil, %sil setne %al ret rdar://12081007 llvm-svn: 162160	2012-08-18 02:43:28 +00:00
Eli Friedman	925738bb5c	Make atomic load and store of pointers work. Tighten verification of atomic operations so other unexpected operations don't slip through. Based on patch by Logan Chien. PR11786/PR13186. llvm-svn: 162146	2012-08-17 23:24:29 +00:00
Richard Smith	aa91b43d1c	Fix undefined behavior (binding a reference to a dereferenced null pointer) if SSAUpdater was created and destroyed without being initialized. llvm-svn: 162137	2012-08-17 21:42:44 +00:00
Akira Hatanaka	ab6dca06f4	Add MipsELFWriterInfo.{h,cpp}. llvm-svn: 162136	2012-08-17 21:38:47 +00:00
Akira Hatanaka	a50e7bd0a6	Correct MCJIT functionality for MIPS32 architecture. No new tests are added. All tests in ExecutionEngine/MCJIT that have been failing pass after this patch is applied (when "make check" is done on a mips board). Patch by Petar Jovanovic. llvm-svn: 162135	2012-08-17 21:28:04 +00:00
Bill Wendling	5ae9ccaa74	Implement stack protectors for structures with character arrays in them. <rdar://problem/10545247> llvm-svn: 162131	2012-08-17 20:59:56 +00:00
Jakob Stoklund Olesen	40eb30013e	Avoid folding ADD instructions with FI operands. PEI can't handle the pseudo-instructions. This can be removed when the pseudo-instructions are replaced by normal predicated instructions. Fixes PR13628. llvm-svn: 162130	2012-08-17 20:55:34 +00:00
Akira Hatanaka	4e1b032521	Add stub methods for mips assembly matcher. Patch by Vladimir Medic. llvm-svn: 162124	2012-08-17 20:16:42 +00:00
Benjamin Kramer	4e9e4d1818	MemoryBuiltins: Properly guard ObjectSizeOffsetVisitor against cycles in the IR. The previous fix only checked for simple cycles, use a set to catch longer cycles too. Drop the broken check from the ObjectSizeOffsetEvaluator. The BoundsChecking pass doesn't have to deal with invalid IR like InstCombine does. llvm-svn: 162120	2012-08-17 19:26:41 +00:00
Bill Wendling	0569e9a6f3	Change the `linker_private_weak_def_auto' linkage to` linkonce_odr_auto_hide' to make it more consistent with its intended semantics. The `linker_private_weak_def_auto' linkage type was meant to automatically hide globals which never had their addresses taken. It has nothing to do with the `linker_private' linkage type, which outputs the symbols with a `l' (ell) prefix among other things. The intended semantic is more like the `linkonce_odr' linkage type. Change the name of the linkage type to `linkonce_odr_auto_hide'. And therefore changing the semantics so that it produces the correct output for the linker. Note: The old linkage name `linker_private_weak_def_auto' will still parse but is not a synonym for `linkonce_odr_auto_hide'. This should be removed in 4.0. <rdar://problem/11754934> llvm-svn: 162114	2012-08-17 18:33:14 +00:00
Rafael Espindola	fccad41366	Assert that dominates is not given a multiple edge. Finding out if we have multiple edges between two blocks is linear. If the caller is iterating all edges leaving a BB that would be a square time algorithm. It is more efficient to have the callers handle that case. Currently the only callers are: * GVN: already avoids the multiple edge case. * Verifier: could only hit this assert when looking at an invalid invoke. Since it already rejects the invoke, just avoid computing the dominance for it. llvm-svn: 162113	2012-08-17 18:21:28 +00:00
Jakob Stoklund Olesen	36d81e300e	Add comment, clean up code. No functional change. llvm-svn: 162107	2012-08-17 16:59:09 +00:00
Benjamin Kramer	ba78a8432b	TargetLowering: Use the large shift amount during legalize types. The legalizer may call us with an overly large type. llvm-svn: 162101	2012-08-17 15:54:21 +00:00
Jakob Stoklund Olesen	476a5d42a7	Use standard pattern for iterate+erase. Increment the MBB iterator at the top of the loop to properly handle the current (and previous) instructions getting erased. This fixes PR13625. llvm-svn: 162099	2012-08-17 14:38:59 +00:00
Benjamin Kramer	d431f3a1f2	Guard MemoryBuiltins against self-looping GEPs, which can occur in unreachable code due to constant propagation. Fixes PR13621. llvm-svn: 162098	2012-08-17 14:16:37 +00:00
Benjamin Kramer	b42939c43b	Fix broken check lines. I really need to find a way to automate this, but I can't come up with a regex that has no false positives while handling tricky cases like custom check prefixes. llvm-svn: 162097	2012-08-17 12:28:26 +00:00
Tim Northover	1de091468c	Implement NEON domain switching for scalar <-> S-register vmovs on ARM llvm-svn: 162094	2012-08-17 11:32:52 +00:00
Jin-Gu Kang	7892202c05	Insertion of NoFolder functions to avoid ambiguous overload warnings or errors about whether to convert Idx to ArrayRef<Constant > or ArrayRef<Value > like ConstantFolder and TargetFolder. llvm-svn: 162090	2012-08-17 08:54:57 +00:00
Craig Topper	efc1bf9ee1	Use nested switch to select arguments to reduce calls to EmitPCMP. llvm-svn: 162089	2012-08-17 07:15:56 +00:00
Craig Topper	8fa010b216	Make ReplaceATOMIC_BINARY_64 a static function. Use a nested switch to reduce to only a single call to it thus allowing it to be inlined by the compiler. llvm-svn: 162088	2012-08-17 06:55:11 +00:00
Pranav Bhandarkar	5b5070348b	Test commit. include/llvm/IntrinsicsHexagon.td: Hexagon_Intrinsic is the base class for all Hexagon intrinsics and not altivec intrinsics. llvm-svn: 162087	2012-08-17 06:36:26 +00:00

... 3 4 5 6 7 ...

84439 Commits