llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-29 22:30:33 +00:00

Author	SHA1	Message	Date
Hans Wennborg	41105ccc60	Declare __builtin_strlen in StringRef.h as constexpr Otherwise Visual Studio 2017 will complain about llvm::StringRef::strlen not being constexpr: StringRef.h(80): error C3615: constexpr function 'llvm::StringRef::strLen' cannot result in a constant expression StringRef.h(84): note: failure was caused by call of undefined function or one not declared 'constexpr'	2020-02-03 16:58:01 +01:00
Kazushi (Jam) Marukawa	3f0180f097	[VE] (fp)trunc+store & load+(fp)ext isel Summary: load+sext/zext/fpext and (fp)trunc+store isel legalization and tests Reviewers: arsenm, craig.topper, rengolin, k-ishizaka Reviewed By: arsenm Subscribers: merge_guards_bot, wdng, hiraditya, llvm-commits Tags: #ve, #llvm Differential Revision: https://reviews.llvm.org/D73774	2020-02-03 16:55:44 +01:00
Simon Pilgrim	cebe3b26a7	[X86] computeKnownBitsForTargetNode - add BEXTR support (PR39153) Add a KnownBits::extractBits helper	2020-02-03 15:43:59 +00:00
Hans Wennborg	238f3b0462	build_llvm_package.bat: Use a short form of the git revision	2020-02-03 16:40:10 +01:00
Craig Topper	cf7fa877a2	[X86] FUCOMI/FCOMI instructions should Def FPSW not FPCW. These instructions can set the exception in FPSW. But I don't think they can change FPCW. So this looks like a typo. Differential Revision: https://reviews.llvm.org/D73864	2020-02-03 07:39:00 -08:00
Sanjay Patel	cb8bd29a62	[InstCombine] regenerate complete test checks; NFC	2020-02-03 10:30:26 -05:00
Kazushi (Jam) Marukawa	b2d7ee731b	[VE] vaarg functions callers and callees Summary: Isel patterns and tests for vaarg functions as callers and callees. Reviewers: arsenm, rengolin, k-ishizaka Subscribers: merge_guards_bot, wdng, hiraditya, llvm-commits Tags: #ve, #llvm Differential Revision: https://reviews.llvm.org/D73710	2020-02-03 16:26:44 +01:00
Simon Pilgrim	8301fd0d00	[X86] Add some initial BEXTR combine tests	2020-02-03 15:16:40 +00:00
Simon Pilgrim	1f9f866ff9	[X86] Move BEXTR DemandedBits handling inside SimplifyDemandedBitsForTargetNode Some prep work for PR39153.	2020-02-03 15:16:40 +00:00
Matt Arsenault	80bf477ac5	AMDGPU: Fix extra type mangling on llvm.amdgcn.if.break These have to be the same mask type.	2020-02-03 07:02:05 -08:00
Johannes Doerfert	b1f217520e	Revert "[OpenMP][OMPIRBuilder] Add Directives (master and critical) to OMPBuilder." This reverts commit 1ca740387b9bbdc142ac81c8bdd6370a8813e328. The bots break [0], investigation is needed. [0] http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/22899	2020-02-03 08:59:14 -06:00
Fady Ghanim	0e8b45d86c	[OpenMP][OMPIRBuilder] Add Directives (master and critical) to OMPBuilder. Add support for Master and Critical directive in the OMPIRBuilder. Both make use of a new common interface for emitting inlined OMP regions called `emitInlinedRegion` which was added in this patch as well. Also this patch modifies clang to use the new directives when `-fopenmp-enable-irbuilder` commandline option is passed. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D72304	2020-02-03 08:44:23 -06:00
John Brawn	882073b1e0	[FPEnv][AArch64] Add lowering of f128 STRICT_FSETCC These get lowered to function calls, like the non-strict versions. Differential Revision: https://reviews.llvm.org/D73784	2020-02-03 14:39:16 +00:00
Krzysztof Parzyszek	fde0721a0a	[Hexagon] Rename FeatureHasPreV65 to FeaturePreV65	2020-02-03 08:20:59 -06:00
Sanjay Patel	244e402b5c	[InstCombine] reassociate splatted vector ops bo (splat X), (bo Y, OtherOp) --> bo (splat (bo X, Y)), OtherOp This patch depends on the splat analysis enhancement in D73549. See the test with comment: ; Negative test - mismatched splat elements ...as the motivation for that first patch. The motivating case for reassociating splatted ops is shown in PR42174: https://bugs.llvm.org/show_bug.cgi?id=42174 In that example, a slight change in order-of-associative math results in a big difference in IR and codegen. This patch gets all of the unnecessary shuffles out of the way, but doesn't address the potential scalarization (see D50992 or D73480 for that). Differential Revision: https://reviews.llvm.org/D73703	2020-02-03 09:08:36 -05:00
Matt Arsenault	7ff7b7c59d	AMDGPU/GlobalISel: Reduce indentation	2020-02-03 05:41:14 -08:00
Matt Arsenault	729362c237	AMDGPU/GlobalISel: Fix mem size in test This wasn't intended to tests an extload.	2020-02-03 05:41:14 -08:00
Simon Moll	99c2d7bdcd	[NFC][VE] format VEInstrInfo	2020-02-03 14:25:49 +01:00
Simon Moll	a84333d938	[NFC] unsigned->Register in storeRegTo/loadRegFromStack Summary: This patch makes progress on the 'unsigned -> Register' rewrite for `TargetInstrInfo::loadRegFromStack` and `TII::storeRegToStack`. Reviewers: arsenm, craig.topper, uweigand, jpienaar, atanasyan, venkatra, robertlytton, dylanmckay, t.p.northover, kparzysz, tstellar, k-ishizaka Reviewed By: arsenm Subscribers: wuzish, merge_guards_bot, jyknight, sdardis, nemanjai, jvesely, wdng, nhaehnle, hiraditya, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, jsji, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73870	2020-02-03 14:22:16 +01:00
Guillaume Chatelet	8cbfdb9b6f	[Alignment][NFC] Use Align for code creating MemOp Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73874	2020-02-03 14:10:30 +01:00
John Brawn	cdc62345ba	[FPEnv][ARM] Add lowering of STRICT_FSETCC and STRICT_FSETCCS These can be lowered to code sequences using CMPFP and CMPFPE which then get selected to VCMP and VCMPE. The implementation isn't fully correct, as the chain operand isn't handled correctly, but resolving that looks like it would involve changes around FPSCR-handling instructions and how the FPSCR is modelled. The fp-intrinsics test was already testing some of this but as the entire test was being XFAILed it wasn't noticed. Un-XFAIL the test and instead leave the cases where we aren't generating the right instruction sequences as FIXME. Differential Revision: https://reviews.llvm.org/D73194	2020-02-03 12:59:12 +00:00
James Henderson	46d04a31b6	[DebugInfo][test] Adjust line table unit length to account for contents Previously, if a debug line Prologue was created via createBasicPrologue, its TotalLength field did not account for any contents in the table itself. This change fixes this issue. Reviewed by: probinson Differential Revision: https://reviews.llvm.org/D73772	2020-02-03 12:16:36 +00:00
Simon Tatham	6709634733	[ARM,MVE] Fix vreinterpretq in big-endian mode. Summary: In big-endian MVE, the simple vector load/store instructions (i.e. both contiguous and non-widening) don't all store the bytes of a register to memory in the same order: it matters whether you did a VSTRB.8, VSTRH.16 or VSTRW.32. Put another way, the in-register formats of different vector types relate to each other in a different way from the in-memory formats. So, if you want to 'bitcast' or 'reinterpret' one vector type as another, you have to carefully specify which you mean: did you want to reinterpret the //register// format of one type as that of the other, or the //memory// format? The ACLE `vreinterpretq` intrinsics are specified to reinterpret the register format. But I had implemented them as LLVM IR bitcast, which is specified for all types as a reinterpretation of the memory format. So a `vreinterpretq` intrinsic, applied to values already in registers, would code-generate incorrectly if compiled big-endian: instead of emitting no code, it would emit a `vrev`. To fix this, I've introduced a new IR intrinsic to perform a register-format reinterpretation: `@llvm.arm.mve.vreinterpretq`. It's implemented by a trivial isel pattern that expects the input in an MQPR register, and just returns it unchanged. In the clang codegen, I only emit this new intrinsic where it's actually needed: I prefer a bitcast wherever it will have the right effect, because LLVM understands bitcasts better. So we still generate bitcasts in little-endian mode, and even in big-endian when you're casting between two vector types with the same lane size. For testing, I've moved all the codegen tests of vreinterpretq out into their own file, so that they can have a different set of RUN lines to check both big- and little-endian. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D73786	2020-02-03 11:20:06 +00:00
Simon Tatham	e3f9be3c6f	[ARM,MVE] Add intrinsics for v[id]dupq and v[id]wdupq. Summary: These instructions generate a vector of consecutive elements starting from a given base value and incrementing by 1, 2, 4 or 8. The `wdup` versions also wrap the values back to zero when they reach a given limit value. The instruction updates the scalar base register so that another use of the same instruction will continue the sequence from where the previous one left off. At the IR level, I've represented these instructions as a family of target-specific intrinsics with two return values (the constructed vector and the updated base). The user-facing ACLE API provides a set of intrinsics that throw away the written-back base and another set that receive it as a pointer so they can update it, plus the usual predicated versions. Because the intrinsics return two values (as do the underlying instructions), the isel has to be done in C++. This is the first family of MVE intrinsics that use the `imm_1248` immediate type in the clang Tablegen framework, so naturally, I found I'd given it the wrong C integer type. Also added some tests of the check that the immediate has a legal value, because this is the first time those particular checks have been exercised. Finally, I also had to fix a bug in MveEmitter which failed an assertion when I nested two `seq` nodes (the inner one used to extract the two values from the pair returned by the IR intrinsic, and the outer one put on by the predication multiclass). Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D73357	2020-02-03 11:20:06 +00:00
Simon Tatham	c68d29ea5f	[ARM,MVE] Add intrinsics for vdupq. Summary: The unpredicated case of this is trivial: the clang codegen just makes a vector splat of the input, and LLVM isel is already prepared to handle that. For the predicated version, I've generated a `select` between the same vector splat and the `inactive` input parameter, and added new Tablegen isel rules to match that pattern into a predicated `MVE_VDUP` instruction. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D73356	2020-02-03 11:20:06 +00:00
Simon Pilgrim	749efc6fd7	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warning. NFCI.	2020-02-03 11:09:08 +00:00
Raphael Isemann	c45af6b6ba	Reland "[llvm] Add a way to speed up the speed in which BumpPtrAllocator increases slab sizes"" Disable the red zone in the unit test allocator to fix the test errors in sanitizer builds. The red zone changed the amount of allocated bytes which made the test fail as it checked the number of allocated bytes of the allocator.	2020-02-03 12:06:15 +01:00
Clement Courbet	66d7e6fd7f	[llvm-exegesis] Restrict the range of allowable rounding countrols. Summary: It turns out that CUR_DIRECTION is just an internal placeholder, not an actual valid encoded value. Reviewers: gchatelet Subscribers: tschuett, mstojanovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73343	2020-02-03 11:53:27 +01:00
Dmitri Gribenko	dd086cbae5	Fixed a -Wunused-variable warning in no-assertion builds	2020-02-03 11:49:11 +01:00
Hans Wennborg	1b2ece857a	Make quick-append.test resilient to running in paths with '1.o' in the name	2020-02-03 11:35:24 +01:00
Raphael Isemann	6b6b415926	Revert "[llvm] Add a way to speed up the speed in which BumpPtrAllocator increases slab sizes" This reverts commit b848b510a8d52dbf50ee53a9a1ce844abb60d9bd as the unit tests fail on the sanitizer bots: /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/unittests/Support/AllocatorTest.cpp:145: Failure Expected: SlabSize Which is: 4096 To be equal to: Alloc.getTotalMemory() Which is: 4097	2020-02-03 11:10:50 +01:00
Guillaume Chatelet	3b360aa73f	Fix broken invariant Summary: A Copy with a source that is zeros is the same as a Set of zeros. This fixes the invariant that SrcAlign should always be non-null. Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73791	2020-02-03 11:01:05 +01:00
Clement Courbet	290c00761d	[llvm-exegesis] Add pfm counters for Zen2 (znver2). Summary: There are no counters for individual ports, but this is already enough to find a lot of issues in the current model (upcoming patch). Reviewers: dblaikie, gchatelet Subscribers: hiraditya, tschuett, RKSimon, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72032	2020-02-03 10:57:41 +01:00
Jay Foad	c7ca050389	[AMDGPU] Don't remove short branches over kills Summary: D68092 introduced a new SIRemoveShortExecBranches optimization pass and broke some graphics shaders. The problem is that it was removing branches over KILL pseudo instructions, and the fix is to explicitly check for that in mustRetainExeczBranch. Reviewers: critson, arsenm, nhaehnle, cdevadas, hakzsam Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73771	2020-02-03 09:26:52 +00:00
Sam Parker	363ccb5889	[JumpThreading] Half the duplicate threshold at Oz Duplicating instructions can lead to code size increases but using a threshold of 3 is good for reducing code size. Differential Revision: https://reviews.llvm.org/D72916	2020-02-03 08:40:20 +00:00
Raphael Isemann	3de927ff0d	[llvm] Add a way to speed up the speed in which BumpPtrAllocator increases slab sizes Summary: In D68549 we noticed that our BumpPtrAllocator we use for LLDB's ConstString implementation is growing its slabs at a rate that is too slow for our use case. It causes that we spend a lot of time calling `malloc` for all the tiny slabs that our ConstString BumpPtrAllocators create. We also can't just increase the slab size in the ConstString implementation (which is what D68549 originally did) as this really increased the amount of (mostly unused) allocated memory in any process using ConstString. This patch adds a template argument for the BumpPtrAllocatorImpl that allows specifying a faster rate at which the BumpPtrAllocator increases the slab size. This allows LLDB to specify a faster rate at which the slabs grow which should keep both memory consumption and time spent calling malloc low. Reviewers: george.karpenkov, chandlerc, NoQ Subscribers: NoQ, llvm-commits, llunak Tags: #llvm Differential Revision: https://reviews.llvm.org/D71654	2020-02-03 09:11:38 +01:00
Martin Storsjö	61497b26e1	[PM][CGSCC] Add parentheses to avoid a GCC warning. NFC. This avoids a warning about "suggest parentheses around && within ||".	2020-02-03 09:55:02 +02:00
Martin Storsjö	88959b7cec	[OpenMP] Fix GCC warnings. NFC. Remove an extra semicolon, and add llvm_unreachable to avoid warnings about control reaching the end of a non-void function.	2020-02-03 09:55:02 +02:00
Johannes Doerfert	c620ec9383	[Attributor][FIX] Try to resolve non-determinism problem for now There seems to be another instance of non-determinism which causes the number of iterations to be either 1 or 3 for one benchmark, depending on the system. This needs to be investigated and resolved. In the meantime we do not verify the number of iterations for this benchmark.	2020-02-03 00:47:29 -06:00
Johannes Doerfert	43fc0a102c	[Attributor] AANoRecurse check all call sites for `norecurse` If all call sites are in `norecurse` functions we can derive `norecurse` as the ReversePostOrderFunctionAttrsPass does. This should make ReversePostOrderFunctionAttrsLegacyPass obsolete once the Attributor is enabled. Reviewed By: uenoku Differential Revision: https://reviews.llvm.org/D72017	2020-02-02 23:57:17 -06:00
Johannes Doerfert	55895a0196	[Attributor] Propagate known information from `checkForAllCallSites` If we know that all call sites have been processed we can derive an early fixpoint. The use in this patch is likely not to trigger right now but a follow up patch will make use of it. Reviewed By: uenoku, baziotis Differential Revision: https://reviews.llvm.org/D72016	2020-02-02 23:57:17 -06:00
Igor Kudrin	b75688bde5	[DebugInfo] Remove an unused method DWARFUnit::getDWARF5HeaderSize(). NFC. The method was initially added for DWARFVerifier::verifyUnitHeader() but its results were never actually used. Differential Revision: https://reviews.llvm.org/D73773	2020-02-03 12:42:03 +07:00
Craig Topper	65de147729	[X86] Remove a couple unnecessary calls to ConvertCmpIfNecessary. We only need to call this on floating point comparisons. In this case these are known to be integer compares. One of them even has a SUB opcode instead of CMP.	2020-02-02 21:36:51 -08:00
Johannes Doerfert	f80ba61e01	[PM][CGSCC] Add a helper to update the call graph from SCC passes With this patch new trivial edges can be added to an SCC in a CGSCC pass via the updateCGAndAnalysisManagerForCGSCCPass method. It shares almost all the code with the existing updateCGAndAnalysisManagerForFunctionPass method but it implements the first step towards the TODOs. This was initially part of D70927. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D72025	2020-02-02 23:32:18 -06:00
Juneyoung Lee	92f700c60b	[llvm-extract] Add -keep-const-init commandline option Summary: This adds -keep-const-init option to llvm-extract which preserves initializers of used global constants. For example: ``` $ cat a.ll @g = constant i32 0 define i32 @f() { %v = load i32, i32* @g ret i32 %v } $ llvm-extract --func=f a.ll -S -o - @g = external constant i32 define i32 @f() { .. } $ llvm-extract --func=f a.ll -keep-const-init -S -o - @g = constant i32 0 define i32 @f() { .. } ``` This option is useful in checking whether a function that uses a constant global is optimized correctly. Reviewers: jsji, MaskRay, david2050 Reviewed By: MaskRay Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73833	2020-02-03 14:30:28 +09:00
LLVM GN Syncbot	5e45b1bf87	[gn build] Port c953409ff89	2020-02-03 05:24:55 +00:00
Johannes Doerfert	feb2aff206	[Inliner][NoAlias] Use call site attributes too If we had `noalias` on an argument the inliner created alias scope metadata already. However, the call site `noalias` annotation was not considered. Since the Attributor can derive such call site `noalias` annotation we should treat them the same as argument annotations. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D73528	2020-02-02 23:21:29 -06:00
Johannes Doerfert	f0f6b46554	[OpenMP][Part 1] Reusable OpenMP context/traits handling This is the first of multiple parts to make OpenMP context/trait handling reusable and generic. This patch was originally part of D71830 but with the unit tests it can be tested independently. This patch implements an almost complete handling of OpenMP contexts/traits such that we can reuse most of the logic in Flang through the OMPContext.{h,cpp} in llvm/Frontend/OpenMP. All but construct SIMD specifiers, e.g., inbranch, and the device ISA selector are define in llvm/lib/Frontend/OpenMP/OMPKinds.def. From these definitions we generate the enum classes TraitSet, TraitSelector, and TraitProperty as well as conversion and helper functions in llvm/lib/Frontend/OpenMP/OMPContext.{h,cpp}. The OpenMP context is now an explicit object (see `struct OMPContext`). This is in anticipation of construct traits that need to be tracked. The OpenMP context, as well as the VariantMatchInfo, are basically made up of a set of active or respectively required traits, e.g., 'host', and an ordered container of constructs which allows duplication. Matching and scoring is kept as generic as possible to allow easy extension in the future. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D71847	2020-02-02 23:18:25 -06:00
Shengchen Kan	1a8fc2cd80	[NFC] Fix helptext for opt/llc after https://reviews.llvm.org/D68411 Remove "cl::value_desc("jcc, fused, jmp, call, ret, indirect"),", which makes the option+it's cl::value_desc too long in all of help.	2020-02-03 12:31:42 +08:00
Saleem Abdulrasool	e525cc1017	build: attempt to repair the build The change in 9b84dabc5f36f7e027c62b5c00ea97e21cfcacdd uses a newer syntax which not all the builders support. Use an explicit `OR` over the languages instead to repair the builders.	2020-02-02 16:18:00 -08:00
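
Note on 3de927ff0d / c45af6b6ba (BumpPtrAllocator slab growth): the commit message says only that a new template argument lets the allocator grow its slab size at a faster rate. The sketch below shows one way such a rate could work, assuming the slab size doubles once every `GrowthDelay` slabs. The function name `slabSizeFor`, the parameter names, and the cap of 30 doublings are illustrative assumptions and are not taken from the commit.

```cpp
#include <algorithm>
#include <cstddef>

// Hypothetical helper, not LLVM's actual code: size of the slab with index
// SlabIdx, assuming the size doubles once every GrowthDelay slabs.
// GrowthDelay must be nonzero; a smaller value means faster growth.
constexpr std::size_t slabSizeFor(std::size_t BaseSize,
                                  std::size_t GrowthDelay,
                                  std::size_t SlabIdx) {
  // Bound the number of doublings so the shift amount stays small
  // (assumes a 64-bit size_t for the multiplication not to overflow).
  return BaseSize *
         (std::size_t{1} << std::min<std::size_t>(30, SlabIdx / GrowthDelay));
}

// With a large delay the first 128 slabs all keep the base size.
static_assert(slabSizeFor(4096, 128, 127) == 4096, "no growth yet");
static_assert(slabSizeFor(4096, 128, 128) == 8192, "first doubling");
// With a delay of 1 every new slab is twice as large as the previous one.
static_assert(slabSizeFor(4096, 1, 3) == 32768, "doubles every slab");
```

Under this model a client that allocates many small slabs would pick a small delay to reduce the number of `malloc` calls, while the base slab size, and therefore the memory held by mostly idle allocators, stays unchanged.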


191185 Commits