llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-08 12:21:04 +00:00

Author	SHA1	Message	Date
Junmo Park	16a6e05403	Minor code cleanups. NFC. llvm-svn: 259033	2016-01-28 09:42:39 +00:00
Asaf Badouh	547a7d4edb	[X86][AVX512] small fix in ptestm intrinsics move ptestm{q\|d} intrinsics from patterns form (in td file) to the intrinsics table Differential Revision: http://reviews.llvm.org/D16633 llvm-svn: 259029	2016-01-28 08:33:22 +00:00
Matthias Braun	54b35cf6ca	ValueTracking: Use fixed array for assumption exclude set in Query. The Query structure is constructed often and is relevant for compiletime performance. We can replace the SmallPtrSet for assumption exclusions in this structure with a fixed size array because we know the maximum number of elements. This improves typical clang -O3 -emit-llvm compiletime by 1.2% in my measurements. Differential Revision: http://reviews.llvm.org/D16204 llvm-svn: 259025	2016-01-28 06:29:33 +00:00
Junmo Park	f494fe59f5	[DAGCombiner] Don't add volatile or indexed stores to ChainedStores Summary: findBetterNeighborChains does not handle volatile or indexed stores. However, it did not check when adding stores to ChainedStores. Reviewers: arsenm Differential Revision: http://reviews.llvm.org/D16463 llvm-svn: 259024	2016-01-28 06:23:33 +00:00
JF Bastien	e5f472b9cb	WebAssembly: fix build r259016 didn't also revert r258957 which broken the WebAssembly build. llvm-svn: 259020	2016-01-28 05:05:17 +00:00
Matthias Braun	75154d690e	SmallPtrSet: Make destructor available for inlining llvm-svn: 259019	2016-01-28 04:49:14 +00:00
Matthias Braun	0a16f48325	SmallPtrSet: Share some code between copy/move constructor/assignment operator llvm-svn: 259018	2016-01-28 04:49:11 +00:00
Matthias Braun	df154ae0af	SmallPtrSet: Remove trailing whitespace, fix indentation llvm-svn: 259017	2016-01-28 04:49:07 +00:00
NAKAMURA Takumi	a814b67e03	Revert r258951 (and r258950), "Refactor backend diagnostics for unsupported features" It broke layering violation in LLVMIR. clang r258950 "Add backend dignostic printer for unsupported features" llvm r258951 "Refactor backend diagnostics for unsupported features" llvm-svn: 259016	2016-01-28 04:41:32 +00:00
Dan Gohman	120265029b	[WebAssembly] Don't stackify a register def past a get_local use in the same tree. llvm-svn: 259013	2016-01-28 03:59:09 +00:00
Junmo Park	4f5a66835c	Minor code formatting cleanup. NFC. llvm-svn: 259010	2016-01-28 01:23:18 +00:00
Dan Gohman	2e8acd83fd	[WebAssembly] Enhanced register stackification This patch revamps the RegStackifier pass with a new tree traversal mechanism, enabling three major new features: - Stackification of values with multiple uses, using the result value of set_local - More aggressive stackification of instructions with side effects - Reordering operands in commutative instructions to enable more stackification. llvm-svn: 259009	2016-01-28 01:22:44 +00:00
George Burgess IV	68851f60b4	Minor style cleanup of CFLAA. NFC. llvm-svn: 259008	2016-01-28 00:54:01 +00:00
Sanjay Patel	9fa4a25ba4	less indenting; NFCI llvm-svn: 259002	2016-01-28 00:03:16 +00:00
Adam Nemet	b70814180d	[TTI] Add getPrefetchDistance from PPCLoopDataPrefetch, NFC This patch is part of the work to make PPCLoopDataPrefetch target-independent (http://thread.gmane.org/gmane.comp.compilers.llvm.devel/92758). As it was discussed in the above thread, getPrefetchDistance is currently using instruction count which may change in the future. llvm-svn: 258995	2016-01-27 22:21:25 +00:00
Evgeniy Stepanov	e376478941	Tweak unnamed label syntax in textual IR for easier matching in tests. Change the unnamed label comments like ; <label>:8 ; preds = %1 to ; <label>:8: ; preds = %1 This way lit tests can match [[LABEL]]: in both asserts and no-asserts builds. llvm-svn: 258993	2016-01-27 21:53:08 +00:00
Derek Schuff	8ba74b75fe	[WebAssembly] Implement byval arguments Summary: Just does the simple allocation of a stack object and passes a pointer to the callee. Differential Revision: http://reviews.llvm.org/D16610 llvm-svn: 258989	2016-01-27 21:17:39 +00:00
Tim Northover	873c40a8de	ARMv7k: base ABI decision on v7k Arch rather than watchos OS. Various bits we want to use the new ABI actually compile with "-arch armv7k -miphoneos-version-min=9.0". Not ideal, but also not ridiculous given how slices work. llvm-svn: 258975	2016-01-27 19:32:29 +00:00
Benjamin Kramer	7bb55f0a55	One more batch of self-containing headers. llvm-svn: 258974	2016-01-27 19:29:56 +00:00
Benjamin Kramer	430e3104e7	Don't put classes in headers into anonymous namespaces. You want ODR violations? That's how you get ODR violations. llvm-svn: 258973	2016-01-27 19:29:42 +00:00
Sanjay Patel	8c8f34a124	[SimplifyCFG] limit recursion depth when speculating instructions (PR26308) This is a fix for: https://llvm.org/bugs/show_bug.cgi?id=26308 With the switch to using the TTI cost model in: http://reviews.llvm.org/rL228826 ...it became possible to hit a zero-cost cycle of instructions (gep -> phi -> gep...), so we need a cap for the recursion in DominatesMergePoint(). A recursion depth parameter was already added for a different reason in: http://reviews.llvm.org/rL255660 ...so we can just set a limit for it. I pulled "10" out of the air and made it an independent parameter that we can play with. It might be higher than it needs to be given the currently low default value of PHINodeFoldingThreshold (2). That's the starting cost value that we enter the recursion with, and most instructions have cost set to TCC_Basic (1), so I don't think we're going to speculate more than 2 instructions with the current parameters. As noted in the review and the TODO comment, we can do better than just limiting recursion depth. Differential Revision: http://reviews.llvm.org/D16637 llvm-svn: 258971	2016-01-27 19:22:45 +00:00
John McCall	02db7c55a3	Add support for objc_unsafeClaimAutoreleasedReturnValue to the ObjC ARC Optimizer. The main implication of this is: 1. Ensuring that we treat it conservatively in terms of optimization. 2. We put the ASM marker on it so that the runtime can recognize objc_unsafeClaimAutoreleasedReturnValue from releaseRV. <rdar://problem/21567064> Patch by Michael Gottesman! llvm-svn: 258970	2016-01-27 19:05:08 +00:00
Benjamin Kramer	164a0ed653	Unbreak wasm build after r258951. llvm-svn: 258957	2016-01-27 18:03:40 +00:00
Benjamin Kramer	4b661f540a	Make more headers self-contained. A lot of this comes from the new complete type requirement of DenseMap. llvm-svn: 258956	2016-01-27 18:03:37 +00:00
Oliver Stannard	93adbfee25	Refactor backend diagnostics for unsupported features The BPF and WebAssembly backends had identical code for emitting errors for unsupported features, and AMDGPU had very similar code. This merges them all into one DiagnosticInfo subclass, that can be used by any backend. There should be minimal functional changes here, but some AMDGPU tests have been updated for the new format of errors (it used a slightly different format to BPF and WebAssembly). The AMDGPU error messages will now benefit from having precise source locations when debug info is available. The implementation of DiagnosticInfoUnsupported::print must be in lib/Codegen rather than in the existing file in lib/IR/ to avoid introducing a dependency from IR to CodeGen. Differential Revision: http://reviews.llvm.org/D16590 llvm-svn: 258951	2016-01-27 17:30:33 +00:00
Sanjoy Das	9e137d144d	[IndVars] Hoist DataLayout load out of loop; NFC llvm-svn: 258946	2016-01-27 17:05:09 +00:00
Sanjoy Das	2b5c58bf39	[IndVars] Use isSCEVable; NFC llvm-svn: 258945	2016-01-27 17:05:06 +00:00
Sanjoy Das	ef4f540f1a	[IndVars] Use range-for; NFC llvm-svn: 258944	2016-01-27 17:05:03 +00:00
Benjamin Kramer	c98a806c74	Move SafeStack to CodeGen. It depends on the target machinery, that's not available for instrumentation passes. llvm-svn: 258942	2016-01-27 16:53:42 +00:00
Benjamin Kramer	1d1115c0c4	Rename TargetSelectionDAGInfo into SelectionDAGTargetInfo and move it to CodeGen/ It's a SelectionDAG thing, not a Target thing. llvm-svn: 258939	2016-01-27 16:32:26 +00:00
Benjamin Kramer	88aefe078a	Move passes that live in lib/CodeGen out of Scalar.h llvm-svn: 258938	2016-01-27 16:05:42 +00:00
Benjamin Kramer	cc4037f846	Make some headers self-contained, remove unused includes that violate layering. llvm-svn: 258937	2016-01-27 16:05:37 +00:00
Tom Stellard	5b34ccce2a	AMDGPU/SI: Fix commuting of 32-bit VOPC instructions Summary: We didn't have entries in the commuting table for the 32-bit instructions. I don't think we hit this problem now, but we will once uniform branching is enabled. Tests will come in a later commit. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D16600 llvm-svn: 258936	2016-01-27 15:53:52 +00:00
Igor Laevsky	730e8cf8fe	[DebugInfo] Support zero-length CIE in the _eh_frame parser MCJIT emits zero-length CIE at the end of the _eh_frame section. This change ensures that parser inside DebugInfo will not crash and correctly record such cases. We are now recording DW_EH_PE_omit as a default value for FDE and LSDA encodings. Also Offset != EndAugmentationOffset assertion check will only happen if augmentation string had 'z' letter in it. Differential Revision: http://reviews.llvm.org/D16588 llvm-svn: 258931	2016-01-27 14:05:35 +00:00
Matthew Simpson	2374105880	Reapply commit r258404 with fix This patch is the second attempt to reapply commit r258404. There was bug in the initial patch and subsequent fix (mentioned below). The initial patch caused an assertion because we were computing smaller type sizes for instructions that cannot be demoted. The fix first determines the instructions that will be demoted, and then applies the smaller type size to only those instructions. This should fix PR26239 and PR26307. llvm-svn: 258929	2016-01-27 13:43:27 +00:00
Benjamin Kramer	f14c1e99a1	Revert "Allow X86::COND_NE_OR_P and X86::COND_NP_OR_E to be reversed." and "Add a missing test case for r258847." This reverts commit r258847, r258848. Causes miscompilations and backend errors. llvm-svn: 258927	2016-01-27 12:44:12 +00:00
Marek Olsak	0cb3416583	AMDGPU/SI: Stoney has only 16 LDS banks Summary: This is a candidate for stable, along with all patches that add the "stoney" processor. Reviewers: tstellarAMD Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D16485 llvm-svn: 258922	2016-01-27 11:19:45 +00:00
Benjamin Kramer	e3023baf50	Move MCTargetAsmParser.h to llvm/MC/MCParser where it belongs. llvm-svn: 258917	2016-01-27 10:01:28 +00:00
Igor Breger	4d028fc37a	AVX512: Fix vpmovzxbw predicate for AVX1/2 instructions. Differential Revision: http://reviews.llvm.org/D16595 llvm-svn: 258915	2016-01-27 08:57:46 +00:00
Igor Breger	04cebad2fa	AVX512: Add store mask patterns. Differential Revision: http://reviews.llvm.org/D16596 llvm-svn: 258914	2016-01-27 08:43:25 +00:00
Chen Li	0516a9ad17	[IndVarSimplify] Rewrite loop exit values with their initial values from loop preheader Summary: This is a revised version of D13974, and the following quoted summary are from D13974 "This patch adds support to check if a loop has loop invariant conditions which lead to loop exits. If so, we know that if the exit path is taken, it is at the first loop iteration. If there is an induction variable used in that exit path whose value has not been updated, it will keep its initial value passing from loop preheader. We can therefore rewrite the exit value with its initial value. This will help remove phis created by LCSSA and enable other optimizations like loop unswitch." D13974 was committed but failed one lnt test. The bug was that we only checked the condition from loop exit's incoming block was a loop invariant. But there could be another condition from loop header to that incoming block not being a loop invariant. This would produce miscompiled code. This patch fixes the issue by checking if the incoming block is loop header, and if not, don't perform the rewrite. The could be further improved by recursively checking all conditions leading to loop exit block, but I'd like to check in this simple version first and improve it with future patches. Reviewers: sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16570 llvm-svn: 258912	2016-01-27 07:40:41 +00:00
Haicheng Wu	acc848df4e	[SLPVectorizer] Swap the checking order of isCommutative and isConsecutiveAccess NFC llvm-svn: 258909	2016-01-27 04:59:05 +00:00
Matthias Braun	f834c97f30	SmallPtrSet: Inline the part of insert_imp in the small case Most of the time we only hit the small case, so it is beneficial to pull it out of the insert_imp() implementation. This improves compile time at least for non-LTO builds. Differential Revision: http://reviews.llvm.org/D16619 llvm-svn: 258908	2016-01-27 04:20:24 +00:00
David Majnemer	be8a721ed2	Revert "Revert "[SimplifyCFG] allow speculation of exactly one expensive instruction (PR24818)"" This reverts commit r258903 which reverted r255660. r258903 was an accidental commit and should not have been committed. llvm-svn: 258905	2016-01-27 02:59:41 +00:00
David Majnemer	4ee6f6446b	[SimplifyCFG] Don't mistake icmp of and for a tree of comparisons SimplifyCFG tries to turn complex branch conditions into a switch. Some of it's logic attempts to reason about bitwise arithmetic produced by InstCombine. InstCombine can turn things like (X == 2) \|\| (X == 3) into (X & 1) == 2 and so SimplifyCFG tries to detect when this occurs so that it can produce a switch instruction. However, the legality checking was not sufficient to determine whether or not this had occured. Correctly check this case by requiring that the right-hand side of the comparison be a power of two. This fixes PR26323. llvm-svn: 258904	2016-01-27 02:43:28 +00:00
David Majnemer	991bc63f1d	Revert "[SimplifyCFG] allow speculation of exactly one expensive instruction (PR24818)" This reverts commit r255660. llvm-svn: 258903	2016-01-27 02:43:22 +00:00
Matt Arsenault	06feef3a96	AMDGPU: Fix default device handling When no device name is specified, default to kaveri for HSA since SI is not supported and it woud fail. Default to "tahiti" instead of "SI" since these are effectively the same, and tahiti is an actual device. Move default device handling to the TargetMachine rather than the AMDGPUSubtarget. The module ISA version is computed from the device name provided with the target machine, so the attributes printed by the AsmPrinter were inconsistent with those computed in the subtarget. Also remove DevName field from subtarget since it's redundant with getCPU() in the superclass. llvm-svn: 258901	2016-01-27 02:17:49 +00:00
Reid Kleckner	b49dfe295d	[cmake] Remove /Og- flag which was working around PR24785 With r258897, MSVC 2013 now successfully compiles Function.cpp. llvm-svn: 258899	2016-01-27 01:52:46 +00:00
Reid Kleckner	822ed5c846	[llvm-tblgen] Avoid StringMatcher for GCC and MS builtin names This brings the compile time of Function.cpp from ~40s down to ~4s for me locally. It also shaves off about 400KB of object file size in a release+asserts build. I also realized that the AMDGPU backend does not have any GCC builtin names to match, so the extra lookup was a no-op. I removed it to silence a zero-length string table array warning. There should be no functional change here. This change really ends the story of PR11951. llvm-svn: 258897	2016-01-27 01:43:12 +00:00
Justin Lebar	cb0814e6c7	Disable all standard lib functions for NVVM. Summary: NVVM doesn't have a standard library, as currently implemented, so this just isn't going to work. I'd like to revisit this, since it's hiding opportunities for optimization, but correctness comes first. Thank you to hfinkel for pointing me in the right direction here. Reviewers: tra Subscribers: echristo, jhen, llvm-commits, hfinkel Differential Revision: http://reviews.llvm.org/D16604 llvm-svn: 258884	2016-01-26 23:51:06 +00:00

1 2 3 4 5 ...

86660 Commits