Actually there's one functional change here, which is that users can
no longer depend on <random> to include all of C++20 <concepts>. That
inclusion is so new that we believe nobody should be depending on it
yet, even in the presence of Hyrum's Law. We keep the includes of <vector>,
<algorithm>, etc., so as not to break pre-C++20 Hyrum's Law users.
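For illustration, a hypothetical snippet of the kind of code this change
could break (relying on the transitive include):

  // This used to compile because <random> transitively included
  // <concepts>; the include below is now required explicitly.
  #include <concepts>
  #include <random>

  static_assert(std::same_as<int, int>);  // std::same_as lives in <concepts>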
Differential Revision: https://reviews.llvm.org/D114281
This legacy option (available in other Fortran compilers with various
spellings) implies the SAVE attribute for local variables on subprograms
that are not explicitly RECURSIVE. The SAVE attribute essentially implies
static rather than stack storage. This was the default setting in Fortran
until surprisingly recently, so explicit SAVE statements & attributes
could be and often were omitted from older codes. Note that initialized
objects already have an implied SAVE attribute, and objects in COMMON
effectively do too, as data overlays are extinct. And since objects that are
expected to survive from one invocation of a procedure to the next in static
storage should probably be explicitly initialized in the first place, the
use cases for this option are somewhat rare, and all of them could be
handled with explicit SAVE statements or attributes.
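As a rough analogy (in C++ rather than Fortran, and only approximate),
a SAVEd local behaves like a function-local static, retaining its value
across invocations:

  // Approximate analogy of the implied SAVE attribute: the local lives
  // in static storage and survives from one call to the next.
  int counter() {
    static int n = 0;  // roughly an initialized local with implied SAVE
    return ++n;        // the value persists across invocations
  }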
This implicit SAVE attribute must not apply to automatic (in the Fortran sense)
local objects, whose sizes cannot be known at compilation time. To get the
semantics of IsSaved() right, the IsAutomatic() predicate was moved into
Evaluate/tools.cpp to allow for dynamic linking of the compiler. A
redundant predicate duplicating it was noticed and removed, and its uses
were replaced with IsAutomatic().
GNU Fortran's spelling of the option (-fno-automatic) was added to
the clang-based driver and used for basic sanity testing.
Differential Revision: https://reviews.llvm.org/D114209
We cannot unconditionally generate memref.load ops in such cases; we
need to check the source's type first.
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D114376
MLIR supports recursive types but they could not be handled by the conversion
infrastructure directly as it would result in infinite recursion in
`convertType` for elemental types. Support this case by keeping the "call
stack" of nested type conversions in the TypeConverter class and by passing it
as an optional argument to the individual conversion callback. The callback can
then check if a specific type is present on the stack more than once to detect
and handle the recursive case.
This approach is preferred to the alternative approach of having a separate
callback dedicated to handling only the recursive case, as the latter was
observed to introduce ~3% time overhead on a 50MB IR file even when the file
did not contain recursive types.
This approach is also preferred to keeping a local stack in type converters
that need to handle recursive types as that would compose poorly in case of
out-of-tree or cross-project extensions.
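A self-contained sketch of the idea (this is not MLIR's actual API; the
type and function names are illustrative):

  // Keep a "call stack" of types currently being converted; the
  // conversion logic checks the stack to detect and cut recursion.
  #include <algorithm>
  #include <string>
  #include <vector>

  using Type = std::string;

  Type convertType(const Type &type, std::vector<Type> &callStack) {
    // The type is already being converted further up the stack: this is
    // the recursive case, so return a self-reference instead of
    // recursing forever.
    if (std::find(callStack.begin(), callStack.end(), type) !=
        callStack.end())
      return "!self";
    callStack.push_back(type);
    Type converted = "converted<" + type + ">";  // convert element types here
    callStack.pop_back();
    return converted;
  }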
Reviewed By: rriddle
Differential Revision: https://reviews.llvm.org/D113579
This is a near-universal language extension; external unit 0
is preconnected to the standard error output.
Differential Revision: https://reviews.llvm.org/D114298
In the test suite, we generally don't use printf or other reporting
utilities. It's not that it wouldn't be useful, it's just that some
platforms don't support IO.
Instead, we try to keep test cases small and self-contained so that
we can reasonably easily reproduce failures locally and debug them.
This patch removes printf in some of the last places in the test suite
that used it. The only remaining places are in a deque test and in the
filesystem tests. The filesystem tests are arguably fine to keep using
IO, since we're testing <filesystem>. The deque test will be handled
separately.
Differential Revision: https://reviews.llvm.org/D114282
This basically reverts 1778831a3d, which split them.
Since they were split 9 years ago, EmitGCCInlineAsmStr() grew a bunch of
features that usually weren't added to EmitMSInlineAsmStr(), which was
usually a mistake. D71677, D113932, and D114167 are all examples of
things that had to be backported to EmitMSInlineAsmStr().
The names were also not great. EmitMSInlineAsmStr() used to be called for `asm
inteldialect`, which clang produces for Microsoft-style __asm { ... } blocks as
well as for GCC-style __asm__ / asm statements with -masm=intel. On the other
hand, EmitGCCInlineAsmStr() used to be called for `asm`, which clang produces
for GCC-style __asm__ / asm statements with -masm=att (the default).
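For reference, a small example of the two source forms (the
Microsoft-style block needs -fasm-blocks and an x86 target):

  // GCC-style statement: lowered as `asm` under -masm=att (the
  // default), or as `asm inteldialect` under -masm=intel.
  void gcc_style() { __asm__ __volatile__("nop"); }

  // Microsoft-style block: always lowered as `asm inteldialect`.
  void ms_style() { __asm { nop } }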
It's also less code (23 insertions, 188 deletions).
No behavior change.
Differential Revision: https://reviews.llvm.org/D114330
This makes a line in llvm/test/CodeGen/X86/asm-block-labels.ll pass
with `asm inteldialect` too.
I don't know if this is something one can hit in practice with inline
asm. The test is from 2007 (4646aa3e33) but in 2009 blockaddr was
introduced and e.g. `__asm__ __volatile__("brl %0" :: "X"(&&foo) : "memory");`
compiles to
call void asm sideeffect "brl $0", "X,..."(i8* blockaddress(@func, %1))
nowadays (thanks to jrtc27 for that example!).
(6c4d255bf3 switched clang to blockaddress on an opt-in basis,
e4801f7844 added docs for it, 31b132c0b7 added IR support.)
I half-heartedly tried to build clang 2.8 locally, but it didn't build
out of the box. And 2.8 didn't have a prebuilt clang binary yet.
The motivation is to make EmitGCCInlineAsmStr() and EmitMSInlineAsmStr()
more alike, and maybe we should delete this code from EmitGCCInlineAsmStr()
instead. But since it's just 3 lines and it's reachable from LLVM IR,
let's do the safer thing for now.
Differential Revision: https://reviews.llvm.org/D114329
This patch has been tested in D70631, but it should be reviewed
separately.
Reviewed By: ldionne, #libc
Differential Revision: https://reviews.llvm.org/D114248
Make the SimpleSValBuilder capable of simplifying existing IntSym
expressions based on a newly added constraint on the sub-expression.
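A hypothetical example of the kind of code this helps with:

  // `10 - b` is an Int-Sym expression; once the analyzer assumes the
  // constraint `b == 4` on the sub-expression, it can simplify the
  // earlier expression down to the constant 6.
  int f(int b) {
    int a = 10 - b;
    if (b == 4)
      return a;  // now known to be 6
    return 0;
  }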
Differential Revision: https://reviews.llvm.org/D113754
Most changes are comment rewordings, but I also rephrased some assertions.
Reviewed By: kparzysz
Differential Revision: https://reviews.llvm.org/D114132
Currently, we restore the return address register as the last restoring
instruction in the epilog. The next instruction is usually `ret`, which
is a use of the return address register. Some microarchitectures have a
load-to-use data hazard. To avoid it, we can separate the load
instruction from its use as far as possible. In this patch, we reverse
the order of restoring callee-saved registers to increase the distance
between `load ra` and `ret` in the epilog.
Differential Revision: https://reviews.llvm.org/D113967
This change switches tsan to the new runtime which features:
- 2x smaller shadow memory (2x of app memory)
- faster fully vectorized race detection
- small fixed-size vector clocks (512b)
- fast vectorized vector clock operations
- unlimited number of alive threads/goroutines
Depends on D112602.
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D112603
All runtime callbacks must be non-instrumented with the new tsan runtime
(it's now more picky with respect to recursion into the runtime).
Disable instrumentation in Darwin tests as we do in all other tests now.
Differential Revision: https://reviews.llvm.org/D114348
This adds validation for consistency of ValueExprMap and
ExprValueMap, and fixes identified issues:
* Addrec construction directly wrote to ValueExprMap in a few places,
  without updating ExprValueMap. Add a helper to ensure they stay
  consistent (a generic sketch of this invariant follows below). The
  adjustment in forgetSymbolicName() explicitly drops the old value
  from the map, so that we don't rely on it being overwritten.
* forgetMemoizedResultsImpl() was dropping the SCEV from
ExprValueMap, but not dropping the corresponding entries from
ValueExprMap.
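A generic sketch of the invariant (plain standard-library maps, not the
actual SCEV code):

  // A value->expr map and its expr->values inverse must be updated
  // together: inserting drops any stale inverse entry instead of
  // relying on it being overwritten, and forgetting an expr also drops
  // the corresponding forward entries.
  #include <unordered_map>
  #include <unordered_set>

  struct PairedMaps {
    std::unordered_map<int, int> valueToExpr;                       // like ValueExprMap
    std::unordered_map<int, std::unordered_set<int>> exprToValues;  // like ExprValueMap

    void insert(int value, int expr) {
      auto it = valueToExpr.find(value);
      if (it != valueToExpr.end())
        exprToValues[it->second].erase(value);  // drop the old inverse entry
      valueToExpr[value] = expr;
      exprToValues[expr].insert(value);
    }

    void forget(int expr) {
      auto it = exprToValues.find(expr);
      if (it == exprToValues.end())
        return;
      for (int value : it->second)
        valueToExpr.erase(value);  // keep the forward map in sync
      exprToValues.erase(it);
    }
  };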
Differential Revision: https://reviews.llvm.org/D113349
Using an lldb_private object in the bindings involves three steps
- wrapping the object in its lldb::SB variant
- using swig to convert/wrap that to a PyObject
- wrapping *that* in a lldb_private::python::PythonObject
Our SBTypeToSWIGWrapper was only handling the middle part. This doesn't
just result in increased boilerplate in the callers, but is also a
functionality problem, as it's very hard to get the lifetime of all
of these objects right. Most of the callers are creating the SB object
(step 1) on the stack, which means that we end up with dangling python
objects after the function terminates. Most of the time this isn't a
problem, because the python code does not need to persist the objects.
However, there are legitimate cases where it does (and even if
the use case is not completely legitimate, crashing is not the best
response to that).
For this reason, some of our code creates the SB object on the heap, but
it has another problem -- it never gets cleaned up.
This patch begins to add a new function (ToSWIGWrapper), which performs all
three steps while properly taking care of ownership. In the
first step, I have converted most of the leaky code (except for
SBStructuredData, which needs a bit more work).
Differential Revision: https://reviews.llvm.org/D114259
This is a preparatory commit to enable mocking of qemu startup. That
will involve running the mock server in a separate process, so there's
no need for multithreading.
Initialization is moved from the start function into the constructor
(which can then take an actual socket instead of a class), and the run
method is made public.
Depends on D114156.
Differential Revision: https://reviews.llvm.org/D114157
After padding, we introduce an ExtractSliceOp to get the final unpadded result. This revision uses getAsOpFoldResult to compute the sizes of the unpadded result, which guarantees the result type has a partially static shape if some of the sizes of the unpadded result are statically known. At the moment, we rely on canonicalization to clean up the types after padding.
Depends On D114085
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D114153
Adapt tiling to always generate an extract/insert slice pair for output tensors even if the tensor is not tiled. Having an explicit extract/insert slice pair simplifies followup transformations such as padding and bufferization. In particular, it makes read and written iteration argument slices explicit.
Depends On D114067
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D114085
The purpose of the change is to make clear whether the user is
retrieving the original function or the wrapper function, in line with
the invoke commands. This new functionality is useful for users that
have already defined their own packed interface and do not want the
extra layer of indirection, or for users wanting to look at the
resulting primary function rather than the wrapper function.
All locations except the python bindings now have a `lookupPacked`
method that matches the original `lookup` functionality. `lookup`
still exists, but with new semantics.
- `lookup` returns the function with a given name. If `bool f(int,int)`
is compiled, `lookup` will return a reference to `bool(*f)(int,int)`.
- `lookupPacked` returns the packed wrapper of the function with the
given name. If `bool f(int,int)` is compiled, `lookupPacked` will return
`void(*mlir_f)(void**)`.
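A hedged usage sketch based on the signatures above (error handling is
elided, and the packed argument layout -- pointers to the arguments
followed by a pointer to the result slot -- is an assumption, not
verbatim API):

  // Calling the function obtained via `lookup`.
  bool callDirect(bool (*f)(int, int)) { return f(1, 2); }

  // Calling the wrapper obtained via `lookupPacked`.
  bool callPacked(void (*mlir_f)(void **)) {
    int x = 1, y = 2;
    bool result;
    void *args[] = {&x, &y, &result};
    mlir_f(args);
    return result;
  }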
Differential Revision: https://reviews.llvm.org/D114352
Remove the tile and fuse test pass that has been replaced by codegen strategy.
Depends On D114067
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D114068
Update the reference publication for the SyncDependenceAnalysis and Divergence Analysis. Fix phrasing and formatting. Add comments on the reducible loop limitation.
Reviewed By: sameerds
Differential Revision: https://reviews.llvm.org/D114146
Apparently my methodology was suboptimal: not only did I miss all the +VL
tuples, I also missed some plain tuples. I believe this adds everything
that was missing.
Indeed, these manual cost models are just not okay long-term.
Reviewed By: RKSimon
Differential Revision: https://reviews.llvm.org/D114334
Much like the VPMOVM2[BW] / VPMOV[BW]2M from AVX512BW,
these either sign-extend the mask register into a vector,
or pack the mask from a vector register.
Apparently, we didn't even have MCA tests for these,
added in rG2f364f6f0d3a2420ca78cbd80abb186657180e05,
so I'm just guessing that their perf characteristics
are optimal.
Reviewed By: RKSimon
Differential Revision: https://reviews.llvm.org/D114314
Add a pattern to apply the new tile and fuse on tensors method. Integrate the pattern into the CodegenStrategy and use the CodegenStrategy to implement the tests.
Depends On D114012
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D114067
This patch fixes PR52111. The problem is that LV propagates poison-generating flags (`nuw`/`nsw`, `exact`
and `inbounds`) on instructions that contribute to the address computation of widened loads/stores that are
guarded by a condition. When the code is vectorized and the control flow within the loop is linearized,
these flags may lead to generating a poison value that is effectively used as the base address of the
widened load/store. The fix drops all the integer poison-generating flags from instructions that contribute
to the address computation of a widened load/store whose original instruction was in a basic block that
needed predication and is not predicated after vectorization.
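A hypothetical source pattern where this matters:

  // The guarantees implied by flags like `inbounds` on the computation
  // of `&a[i + k]` only hold under the guard. Once vectorization
  // linearizes the control flow, the address is computed
  // unconditionally, so keeping those flags could yield poison that
  // feeds the base address of the widened, masked store.
  void f(int *a, const int *cond, int n, int k) {
    for (int i = 0; i < n; ++i)
      if (cond[i])
        a[i + k] = 0;  // guarded address computation
  }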
Reviewed By: fhahn, spatel, nlopes
Differential Revision: https://reviews.llvm.org/D111846
Tile and fuse failed if the outermost tile loop is a reduction dimension. Add the necessary check to handle outermost reductions and introduce a test case to verify the change.
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D114012
This adds and uses look-up tables for non-loop branch probabilities, which
have the probabilities for the different condition codes encoded directly
into the tables. Compared to having this logic inlined in different
functions, as it used to be, I think this is more compact and thus also
easier to check and cross-reference. This also adds a test for pointer
heuristics that was missing.
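A generic sketch of the table-driven shape (condition codes and
probability values here are illustrative, not the actual numbers):

  #include <array>

  enum CondCode { EQ = 0, NE, LT, GE, NumCondCodes };

  // Probability encoded as taken/total; one entry per condition code,
  // replacing per-code if/else chains spread across functions.
  struct BranchProb { unsigned Taken, Total; };

  constexpr std::array<BranchProb, NumCondCodes> PtrHeuristicTable = {{
      {3, 8},  // EQ: pointers are usually not equal (made-up value)
      {5, 8},  // NE
      {4, 8},  // LT
      {4, 8},  // GE
  }};

  inline BranchProb lookupProb(CondCode CC) { return PtrHeuristicTable[CC]; }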
Differential Revision: https://reviews.llvm.org/D114009
1. IndexTokenSource::getNextToken cannot return nullptr; some code was
still written assuming it can; make getNextToken more resilient against
incorrect input and fix its call-sites.
2. Change various asserts that can fire due to user-provided input into
conditionals in the code.
This teaches AArch64TargetLowering::shouldSinkOperands to sink splat
shuffles to certain neon intrinsics, so that they can make use of the
lane variants of the instructions that are available.
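A hypothetical example of the pattern (assuming sqdmulh is among the
covered intrinsics):

  #include <arm_neon.h>

  // The splat (vdupq_n_s32) feeds intrinsic uses that may end up in
  // other blocks; sinking it next to each use lets codegen pick the
  // by-lane instruction form instead of keeping the splat live across
  // blocks.
  int32x4_t scale(int32x4_t a, int32_t b, int twice) {
    int32x4_t s = vdupq_n_s32(b);   // splat shuffle
    int32x4_t r = vqdmulhq_s32(a, s);
    if (twice)
      r = vqdmulhq_s32(r, s);       // use in a different block
    return r;
  }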
Differential Revision: https://reviews.llvm.org/D112994