llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-14 01:20:40 +00:00

Author	SHA1	Message	Date
Lang Hames	65f929415a	[ORC] Replace lookupFlags in JITSymbolResolver with getResponsibilitySet. The new method name/behavior more closely models the way it was being used. It also fixes an assertion that can occur when using the new ORC Core APIs, where flags alone don't necessarily provide enough context to decide whether the caller is responsible for materializing a given symbol (which was always the reason this API existed). The default implementation of getResponsibilitySet uses lookupFlags to determine responsibility as before, so existing JITSymbolResolvers should continue to work. llvm-svn: 340874	2018-08-28 21:18:05 +00:00
Fedor Sergeev	3dcc0c90de	[NFC][PassTiming] factor out generic PassTimingInfo Moving PassTimingInfo from legacy pass manager code into a separate header. Making it suitable for both legacy and new pass manager. Adding a test on -time-passes main functionality. llvm-svn: 340872	2018-08-28 21:06:51 +00:00
Alina Sbirlea	da787801e8	[SimpleLoopUnswitch] Form dedicated exits after trivial unswitches. Summary: Form dedicated exits after trivial unswitches. Fixes PR38737, PR38283. Reviewers: chandlerc, fedor.sergeev Subscribers: sanjoy, jlebar, uabelho, llvm-commits Differential Revision: https://reviews.llvm.org/D51375 llvm-svn: 340871	2018-08-28 20:41:05 +00:00
Lang Hames	01906b44f8	[ORC] Add an addObjectFile method to LLJIT. The addObjectFile method adds the given object file to the JIT session, making its code available for execution. Support for the -extra-object flag is added to lli when operating in -jit-kind=orc-lazy mode to support testing of this feature. llvm-svn: 340870	2018-08-28 20:20:31 +00:00
Craig Topper	28d610220f	[X86] Add intrinsics for KADD instructions These are intrinsics for supporting kadd builtins in clang. These builtins are already in gcc to implement intrinsics from icc, though they are missing from the Intel Intrinsics Guide. This instruction adds two mask registers together as if they were scalar rather than a vXi1. We might be able to get away with a bitcast to scalar and a normal add instruction, but that would require DAG combine smarts in the backend to recognize add+bitcast. For now I'd prefer to go with the easiest implementation so we can get these builtins into clang with good codegen. Differential Revision: https://reviews.llvm.org/D51370 llvm-svn: 340869	2018-08-28 19:22:55 +00:00
Fangrui Song	3ecec21253	[AMDGPU] Fix -Wunused-variable when -DLLVM_ENABLE_ASSERTIONS=off llvm-svn: 340868	2018-08-28 19:19:03 +00:00
Matt Morehouse	0fb860039c	Revert "[libFuzzer] Port to Windows" This reverts commit r340860 due to failing tests. llvm-svn: 340867	2018-08-28 19:07:24 +00:00
Matt Arsenault	06ed91a16b	AMDGPU: Don't delete instructions if S_ENDPGM has implicit uses This can leave behind the uses with the defs removed. Since this should only really happen in tests, it's not worth the effort of trying to handle this. llvm-svn: 340866	2018-08-28 18:55:55 +00:00
Aditya Nandakumar	4e615a2119	[GISel]: Add missing opcodes for overflow intrinsics https://reviews.llvm.org/D51197 Currently, IRTranslator (and GISel) seems to be arbitrarily picking which overflow intrinsics get mapped into opcodes which either have a carry as an input or not. For intrinsics such as Intrinsic::uadd_with_overflow, translate it to an opcode (G_UADDO) which doesn't have any carry inputs (similar to LLVM IR). This patch adds 4 missing opcodes for completeness - G_UADDO, G_USUBO, G_SSUBE and G_SADDE. llvm-svn: 340865	2018-08-28 18:54:10 +00:00
Thomas Lively	64c3acad23	[WebAssembly][NFC] Document stackifier tablegen backend Summary: Add comments to help readers avoid having to read tablegen backends to understand the code. Also remove unnecessary breaks from the output. Reviewers: dschuff, aheejin Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51371 llvm-svn: 340864	2018-08-28 18:49:47 +00:00
Matt Arsenault	921d151a2a	AMDGPU: Force shrinking of add/sub even if the carry is used The original motivating example uses a 64-bit add, so the carry is used. Insert a copy from VCC. This may allow shrinking of the used carry instruction. At worst, we are replacing a mov to materialize the constant with a copy of vcc. llvm-svn: 340862	2018-08-28 18:44:16 +00:00
Matt Morehouse	9172941eda	[libFuzzer] Port to Windows Summary: Port libFuzzer to windows-msvc. This patch allows libFuzzer targets to be built and run on Windows, using -fsanitize=fuzzer and/or -fsanitize=fuzzer-no-link. It allows these forms of coverage instrumentation to work on Windows as well. It does not fix all issues, such as those with -fsanitize-coverage=stack-depth, which is not usable on Windows as of this patch. It also does not fix any libFuzzer integration tests. Nearly all of them fail to compile; fixing them will come in a later patch, so libFuzzer tests are disabled on Windows until then. Patch By: metzman Reviewers: morehouse, rnk Reviewed By: morehouse, rnk Subscribers: morehouse, kcc, eraman Differential Revision: https://reviews.llvm.org/D51022 llvm-svn: 340860	2018-08-28 18:34:32 +00:00
Matt Arsenault	0ecb1320d0	AMDGPU: Shrink insts to fold immediates This needs to be done in the SSA fold operands pass to be effective, so there is a bit of overlap with SIShrinkInstructions but I don't think this is practically avoidable. llvm-svn: 340859	2018-08-28 18:34:24 +00:00
Thomas Lively	e0508f0444	[WebAssembly] v128.not Implementation and tests. llvm-svn: 340857	2018-08-28 18:31:15 +00:00
Matt Arsenault	0250069099	AMDGPU: Move canShrink into TII llvm-svn: 340855	2018-08-28 18:22:34 +00:00
Nirav Dave	2914a691b3	[DAGCombine] Rework MERGE_VALUES to inline in single pass. NFCI. Avoid hyperlinear cost of inlining MERGE_VALUE node by constructing temporary vector and doing a single replacement. llvm-svn: 340853	2018-08-28 18:13:26 +00:00
Nirav Dave	741515d89c	[DAG] Avoid recomputing Divergence checks. NFCI. When making multiple updates to the same SDNode, recompute node divergence only once after all changes have been made. llvm-svn: 340852	2018-08-28 18:13:00 +00:00
Nirav Dave	52a17527a0	[DAG] Fix updateDivergence calculation Check correct SDNode when deciding if we should update the divergence property. llvm-svn: 340851	2018-08-28 18:12:35 +00:00
Matt Arsenault	f5f26c463d	AMDGPU: Remove nan tests in class if src is nnan llvm-svn: 340850	2018-08-28 18:10:02 +00:00
Heejin Ahn	ee50fd6214	[WebAssembly] Use getCalleeOpNo utility function (NFC) Reviewers: tlively Subscribers: dschuff, sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51366 llvm-svn: 340848	2018-08-28 17:49:39 +00:00
Craig Topper	8a4d47470b	[X86] Mark the FUCOMI instructions as requiring CMOV to be enabled. NFCI These instructions were added on the PentiumPro along with CMOV. This was already comprehended by the lowering process which should emit an alternate sequence using FCOM and FNSTW. This just makes it an explicit error if that doesn't work for some reason. llvm-svn: 340844	2018-08-28 17:17:13 +00:00
Brian Cain	e833d9d952	[debuginfo] generate debug info with asm+.file Summary: For assembly input files, generate debug info even when the .file directive is present, provided it does not include a file-number argument. Fixes PR38695. Reviewers: probinson, sidneym Subscribers: aprantl, hiraditya, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D51315 llvm-svn: 340839	2018-08-28 16:23:39 +00:00
David Bolvansky	64733b19f6	[Inliner] Attribute callsites with inline remarks Summary: Sometimes, when reading an output *.ll file, it is not easy to understand why some callsites are not inlined. We can read output of inline remarks (option --pass-remarks-missed=inline) and try correlating its messages with the callsites. An easier way proposed by this patch is to add to every callsite processed by Inliner an attribute with the latest message that describes the cause of not inlining this callsite. The attribute is called //inline-remark//. By default this feature is off. It can be switched on by the option //-inline-remark-attribute//. For example in the provided test the result method //@test1// has two callsites //@bar// and inline remarks report different inlining missed reasons: remark: <unknown>:0:0: bar not inlined into test1 because too costly to inline (cost=-5, threshold=-6) remark: <unknown>:0:0: bar not inlined into test1 because it should never be inlined (cost=never): recursive It is not clear which remark corresponds to which callsite. With the inline remark attribute enabled we get the reasons attached to their callsites: define void @test1() { call void @bar(i1 true) #0 call void @bar(i1 false) #2 ret void } attributes #0 = { "inline-remark"="(cost=-5, threshold=-6)" } .. attributes #2 = { "inline-remark"="(cost=never): recursive" } Patch by: yrouban (Yevgeny Rouban) Reviewers: xbolva00, tejohnson, apilipenko Reviewed By: xbolva00, tejohnson Subscribers: eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D50435 llvm-svn: 340834	2018-08-28 15:27:25 +00:00
Ryan Taylor	a30bc8842e	[AMDGPU] Add support for a16 modifier for gfx9 Summary: Adding support for a16 for gfx9. A16 bit replaces r128 bit for gfx9. Change-Id: Ie8b881e4e6d2f023fb5e0150420893513e5f4841 Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D50575 llvm-svn: 340831	2018-08-28 15:07:30 +00:00
Mikael Holmen	4e23f235e0	[CloneFunction] Constant fold terminators before checking single predecessor Summary: This fixes PR31105. There is code trying to delete dead code that does so by e.g. checking if the single predecessor of a block is the block itself. That check fails on a block like this bb: br i1 undef, label %bb, label %bb since that has two (identical) predecessors. However, after the check for dead blocks there is a call to ConstantFoldTerminator on the basic block, and that call simplifies the block to bb: br label %bb Therefore we now do the call to ConstantFoldTerminator before the check if the block is dead, so it can realize that it really is. The original behavior led to the block not being removed, but it was simplified as above, and then we did a call to Dest->replaceAllUsesWith(&*I); with old and new being equal, and an assertion triggered. Reviewers: chandlerc, fhahn Reviewed By: fhahn Subscribers: eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D51280 llvm-svn: 340820	2018-08-28 12:40:11 +00:00
Alexandros Lamprineas	249f0b96b4	[GVNHoist] Prune out useless CHI insertions Fix for the out-of-memory error when compiling SemaChecking.cpp with GVNHoist and ubsan enabled. I've used a cache for inserted CHIs to avoid excessive memory usage. Differential Revision: https://reviews.llvm.org/D50323 llvm-svn: 340818	2018-08-28 11:07:54 +00:00
Simon Pilgrim	1873dcec6b	[X86][SSE] Improve variable scalar shift of vXi8 vectors (PR34694) This patch creates the shift mask and actual shift using the vXi16 vector shift ops. Differential Revision: https://reviews.llvm.org/D51263 llvm-svn: 340813	2018-08-28 10:37:29 +00:00
Simon Pilgrim	e5a8149fc0	[X86][SSE] Avoid vector extraction/insertion for non-constant uniform shifts As discussed on D51263, we're better off using byte shifts to clear the upper bits on pre-SSE41 hardware. llvm-svn: 340810	2018-08-28 10:14:09 +00:00
Max Kazantsev	b4a8cdeb02	[NFC] A loop can never contain Ret instruction llvm-svn: 340808	2018-08-28 09:26:28 +00:00
David Chisnall	e2f48dce8f	Fix in getAllocationDataForFunction Summary: Correct to use set like behaviour of AllocType. Should check for subset, not precise value. Reviewers: theraven Reviewed By: theraven Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D50959 llvm-svn: 340807	2018-08-28 08:59:06 +00:00
Craig Topper	7da8e47622	[X86] Fix some comments to refer to KORTEST not KTEST. NFC KTEST is a different instruction. All of this code uses KORTEST. llvm-svn: 340799	2018-08-28 06:39:35 +00:00
Craig Topper	707737eef4	[DAGCombiner][AMDGPU][Mips] Fold bitcast with volatile loads if the resulting load is legal for the target. Summary: I'm not sure if this patch is correct or if it needs more qualifying somehow. Bitcast shouldn't change the size of the load so it should be ok? We already do something similar for stores. We'll change the type of a volatile store if the resulting store is Legal or Custom. I'm not sure we should be allowing Custom there... I was playing around with converting X86 atomic loads/stores(except seq_cst) into regular volatile loads and stores during lowering. This would allow some special RMW isel patterns in X86InstrCompiler.td to be removed. But there's some floating point patterns in there that didn't work because we don't fold (f64 (bitconvert (i64 volatile load))) or (f32 (bitconvert (i32 volatile load))). Reviewers: efriedma, atanasyan, arsenm Reviewed By: efriedma Subscribers: jvesely, arsenm, sdardis, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, arichardson, jrtc27, atanasyan, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D50491 llvm-svn: 340797	2018-08-28 03:47:20 +00:00
Craig Topper	a72b09e6fc	[InstCombine] Extend (add (sext x), cst) --> (sext (add x, cst')) and (add (zext x), cst) --> (zext (add x, cst')) to work for vectors Differential Revision: https://reviews.llvm.org/D51236 llvm-svn: 340796	2018-08-28 02:02:29 +00:00
Kit Barton	5fe828ca5e	[PPC] Remove Darwin support from POWER backend. This patch issues an error message if Darwin ABI is attempted with the PPC backend. It also cleans up existing test cases, either converting the test to use an alternative triple or removing the test if the coverage is no longer needed. Updated Tests ------------- The majority of test cases were updated to use a different triple that does not include the Darwin ABI. Many tests were also updated to use FileCheck, in place of grep. Deleted Tests ------------- llvm/test/tools/dsymutil/PowerPC/sibling.test was originally added to test specific functionality of dsymutil using an object file created with an old version of llvm-gcc for a Powerbook G4. After a discussion with @JDevlieghere he suggested removing the test. llvm/test/CodeGen/PowerPC/combine_loads_from_build_pair.ll was converted from a PPC test to a SystemZ test, as the behavior is also reproducible there. All other tests that were deleted were specific to the darwin/ppc ABI and no longer necessary. Phabricator Review: https://reviews.llvm.org/D50988 llvm-svn: 340795	2018-08-28 01:18:29 +00:00
David Blaikie	cfd04c50e1	Revert "[CodeGenPrepare] Scan past debug intrinsics to find select candidates (NFC)" This causes crashes due to the interleaved dbg.value intrinsics being left at the end of basic blocks, causing the actual terminators (br, etc) to be not where they should be (not at the end of the block), leading to later crashes. Further discussion on the original commit thread. This reverts commit r340368. llvm-svn: 340794	2018-08-28 00:55:19 +00:00
George Burgess IV	af6dad54c0	[MemorySSA] Add NDEBUG checks to verifiers; NFC verify*() methods are intended to have no side-effects (unless we detect broken MSSA, in which case they assert()), and all of the other verify methods are wrapped by `#ifndef NDEBUG`. llvm-svn: 340793	2018-08-28 00:32:32 +00:00
Sanjay Patel	fcc0556b47	[InstCombine] fix formatting; NFC llvm-svn: 340790	2018-08-27 23:01:10 +00:00
Sanjay Patel	3fe1ea1aae	[InstCombine] allow shuffle+binop canonicalization with widening shuffles This lines up with the behavior of an existing transform where if both operands of the binop are shuffled, we allow moving the binop before the shuffle regardless of whether the shuffle changes the size of the vector. llvm-svn: 340787	2018-08-27 22:41:44 +00:00
Evandro Menezes	9e7c2d3864	[PATCH] [InstCombine] Fix issue in the simplification of pow() with nested exp{,2}() Fix the issue of duplicating the call to `exp{,2}()` when it's nested in `pow()`, as exposed by rL340462. Differential revision: https://reviews.llvm.org/D51194 llvm-svn: 340784	2018-08-27 22:11:15 +00:00
George Burgess IV	452ca2d6d2	s/std::set/DenseSet/; NFC We only use this set for `insert` and `count`, so a hashing container seems better here. llvm-svn: 340783	2018-08-27 22:10:59 +00:00
Brendon Cahoon	887911c8b0	[Pipeliner] Fix incorrect phi values in the epilog and kernel The code that generates the loop definition operand for phis in the epilog and kernel is incorrect in some cases. In the kernel, when a phi refers to another phi, the code that updates PhiOp2 needs to include the stage difference between the two phis. In the epilog, the check for using the loop definition instead of the phi definition uses the StageDiffAdj value (the difference between the phi stage and the loop definition stage), but the adjustment is not needed to determine if the current stage contains an iteration with the loop definition. Differential Revision: https://reviews.llvm.org/D51167 llvm-svn: 340782	2018-08-27 22:04:50 +00:00
Thomas Lively	672ab0c74a	[WebAssembly] TableGen backend for stackifying instructions Summary: The new stackification backend generates the giant switch statement used to translate instructions to their stackified forms. I did this because it was more interesting than adding all the different vector versions of the various SIMD instructions to the switch statement manually. Reviewers: aardappel, aheejin, dschuff Subscribers: mgorny, sbc100, jgravelle-google, sunfish, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D51318 llvm-svn: 340781	2018-08-27 22:02:09 +00:00
Roman Tereshin	91525f438c	Revert "[SCEV][NFC] Check NoWrap flags before lexicographical comparison of SCEVs" This reverts r319889. Unfortunately, wrapping flags are not a part of SCEV's identity (they do not participate in computing a hash value or in equality comparisons) and in fact they could be assigned after the fact w/o rebuilding a SCEV. Grep for const_cast's to see quite a few of examples, apparently all for AddRec's at the moment. So, if 2 expressions get built in 2 slightly different ways: one with flags set in the beginning, the other with the flags attached later on, we may end up with 2 expressions which are exactly the same but have their operands swapped in one of the commutative N-ary expressions, and at least one of them will have "sorted by complexity" invariant broken. 2 identical SCEV's won't compare equal by pointer comparison as they are supposed to. A real-world reproducer is added as a regression test: the issue described causes 2 identical SCEV expressions to have different order of operands and therefore compare not equal, which in its turn prevents LoadStoreVectorizer from vectorizing a pair of consecutive loads. On a larger example (the source of the test attached, which is a bugpoint) I have seen even weirder behavior: adding a constant to an existing SCEV changes the order of the existing terms, for instance, getAddExpr(1, ((A * B) + (C * D))) returns (1 + (C * D) + (A * B)). Differential Revision: https://reviews.llvm.org/D40645 llvm-svn: 340777	2018-08-27 21:41:37 +00:00
Matt Arsenault	3442f70fe6	DAG: Check transformed type for forming fminnum/fmaxnum from vselect Follow up to r340655 to fix vector types which are split. llvm-svn: 340766	2018-08-27 18:11:31 +00:00
Matt Arsenault	385d2085b8	MachineVerifier: Fix assert on implicit virtreg use If the liveness of a physical register was invalid, this was attempting to iterate the subregisters of all register uses of the instruction, which would assert when it encountered an implicit virtual register operand. llvm-svn: 340763	2018-08-27 17:40:09 +00:00
Sean Fertile	da4254727a	[PowerPC][MC] Support expressions in getMemRIX16Encoding. Loosens an assert in getMemRIX16Encoding that restricts DQ-form instructions to using an immediate, so that we can assemble instructions like lxv/stxv where the offset is an expression. Differential Revision: https://reviews.llvm.org/D51122 llvm-svn: 340761	2018-08-27 17:37:43 +00:00
Benjamin Kramer	bc59f97bc1	[NVPTX] Implement isLegalToVectorizeLoadChain This lets LSV nicely split up underaligned chains. Differential Revision: https://reviews.llvm.org/D51306 llvm-svn: 340760	2018-08-27 17:29:43 +00:00
Craig Topper	e74f5c63dc	[X86] When lowering v32i8 MULHS/MULHU, shuffle after the PACKUS rather than before. We're using a 256-bit PACKUS to do the truncation, but that instruction operates on 128-bit lanes. So previously we shuffled first to rearrange the lanes. But that requires 2 shuffles. Instead we can shuffle after the PACKUS using a single VPERMQ. This matches what our normal LowerTRUNCATE code does when it uses PACKUS. Differential Revision: https://reviews.llvm.org/D51284 llvm-svn: 340757	2018-08-27 17:20:41 +00:00
Craig Topper	6370edfa28	[X86] Add support for matching paddus patterns where one of the vectors is a constant. InstCombine mucks these up a bit. So we need to do some additional pattern matching to fix it. There are still a few special cases not handled, but this covers the general case. Differential Revision: https://reviews.llvm.org/D50952 llvm-svn: 340756	2018-08-27 17:20:38 +00:00
Hans Wennborg	49f2cdbc52	Use a lambda for calls to ::open in RetryAfterSignal In Bionic, open can be overloaded for _FORTIFY_SOURCE support, causing compile errors of RetryAfterSignal due to overload resolution. Wrapping the call in a lambda avoids this. Based on a patch by Chih-Wei Huang <cwhuang@linux.org.tw>! llvm-svn: 340751	2018-08-27 15:55:39 +00:00
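
The last entry above (r340751) describes wrapping a call to ::open in a lambda so that a retry helper can accept it even when open is an overloaded name. The following is a minimal sketch of that idea, not the code from the commit: retryAfterSignal is a simplified stand-in for the helper, and fakeOpen, CallCount, and openWithRetry are hypothetical names invented for illustration (fakeOpen plays the role of an overloaded open and simulates two EINTR failures).

```cpp
#include <cerrno>

// Simplified stand-in for a retry helper (an assumption, not LLVM's code):
// call F(As...) again for as long as it fails with EINTR.
template <typename FailT, typename Fun, typename... Args>
auto retryAfterSignal(const FailT &Fail, const Fun &F, const Args &...As)
    -> decltype(F(As...)) {
  decltype(F(As...)) Res;
  do {
    errno = 0;
    Res = F(As...);
  } while (Res == Fail && errno == EINTR);
  return Res;
}

// Hypothetical overloaded function standing in for a fortified open().
// The first two calls pretend to be interrupted by a signal.
static int CallCount = 0;

int fakeOpen(const char *Path, int Flags) {
  (void)Path;
  (void)Flags;
  ++CallCount;
  if (CallCount < 3) {
    errno = EINTR;
    return -1;
  }
  return 42; // pretend file descriptor
}

int fakeOpen(const char *Path, int Flags, int Mode) {
  (void)Mode;
  return fakeOpen(Path, Flags);
}

int openWithRetry(const char *Path, int Flags) {
  // Passing the bare name does not compile, because Fun cannot be deduced
  // from an overload set:
  //   retryAfterSignal(-1, fakeOpen, Path, Flags);
  // A lambda has a single call operator, so deduction succeeds and overload
  // resolution happens inside the lambda body instead.
  auto Open = [&]() { return fakeOpen(Path, Flags); };
  return retryAfterSignal(-1, Open);
}
```

The lambda moves overload resolution to an ordinary call expression with known argument types, which is why this form keeps working however many overloads the platform headers declare.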


116184 Commits