llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-12-12 09:41:26 +00:00

Author	SHA1	Message	Date
Timm Bäder	b93d2d37e7	[clang][Interp] Handle SourceLocExprs Differential Revision: https://reviews.llvm.org/D155627	2023-09-06 14:45:49 +02:00
Matthias Springer	1e1a3112f1	[mlir][bufferization] Privatize buffers for parallel regions One-Shot Bufferize correctly handles RaW conflicts around repetitive regions (loops). Specical handling is needed for parallel regions. These are a special kind of repetitive regions that can have additional RaW conflicts that would not be present if the regions would be executed sequentially. Example: ``` %0 = bufferization.alloc_tensor() scf.forall ... { %1 = linalg.fill ins(...) outs(%0) ... scf.forall.in_parallel { tensor.parallel_insert_slice %1 into ... } } ``` A separate (private) buffer must be allocated for each iteration of the `scf.forall` loop. This change adds a new interface method to `BufferizableOpInterface` to detect parallel regions. By default, regions are assumed to be sequential. A buffer is privatized if an OpOperand bufferizes to a memory read inside a parallel region that is different from the parallel region where operand's value is defined. Differential Revision: https://reviews.llvm.org/D159286	2023-09-06 14:28:43 +02:00
Simon Pilgrim	e4d0e12099	[DAG] Fold (shl (sext (add_nsw x, c1)), c2) -> (add (shl (sext x), c2), c1 << c2) (REAPPLIED) Assuming the ADD is nsw then it may be sign-extended to merge with a SHL op in a similar fold to the existing (shl (add x, c1), c2) -> (add (shl x, c2), c1 << c2) fold. This is most useful for helping to expose address math for X86, but has also touched several aarch64 test cases as well. Alive2: https://alive2.llvm.org/ce/z/2UpSbJ Differential Revision: https://reviews.llvm.org/D159198	2023-09-06 13:19:42 +01:00
Abhina Sree	05eaff18c4	[test] Use host platform specific error message substitution in lit tests (#65328 ) On z/OS, the following error message is not matched correctly in lit tests. `EDC5129I No such file or directory.` This patch uses a lit config substitution to check for platform specific error messages.	2023-09-06 07:55:32 -04:00
Jay Foad	11171d81ae	[AMDGPU] Cope with SelectionDAG::UpdateNodeOperands returning a different SDNode (#65340 ) SITargetLowering::adjustWritemask calls SelectionDAG::UpdateNodeOperands to update an EXTRACT_SUBREG node in-place to refer to a new IMAGE_LOAD instruction, before we delete the old IMAGE_LOAD instruction. But in UpdateNodeOperands can do CSE on the fly and return a different EXTRACT_SUBREG node, so the original EXTRACT_SUBREG node would still exist and would refer to the old deleted IMAGE_LOAD instruction. This caused errors like: t31: v3i32,ch = <<Deleted Node!>> # D:1 This target-independent node should have been selected! UNREACHABLE executed at lib/CodeGen/SelectionDAG/InstrEmitter.cpp:1209! Fix it by detecting the CSE case and replacing all uses of the original EXTRACT_SUBREG node with the CSE'd one.	2023-09-06 12:51:44 +01:00
Viktoriia Bakalova	d71adebb9f	[include-cleaner] Map the 4-argument move overload to the algorithm header. Differential Revision: https://reviews.llvm.org/D159463	2023-09-06 11:38:56 +00:00
Matthias Springer	87568ff3ef	[mlir][SCF] convert-scf-to-cf: Lower scf.forall to scf.parallel (#65449 ) scf.forall ops without shared outputs (i.e., fully bufferized ops) are lowered to scf.parallel. scf.forall ops are typically lowered by an earlier pass depending on the execution target. E.g., there are optimized lowerings for GPU execution. This new lowering is for completeness (convert-scf-to-cf can now lower all SCF loop constructs) and provides a simple CPU lowering strategy for testing purposes. scf.parallel is currently lowered to scf.for, which executes sequentially. The scf.parallel lowering could be improved in the future to run on multiple threads.	2023-09-06 13:27:59 +02:00
Eymen Ünay	3e7cd5ea1e	[JITLink][AArch32] Fixes for initial AArch32 backend Fix masking error in Thumb_Jump24 Fix halfword comparisons in asserts Add Data_Pointer32 to getEdgeKindName Reviewed By: sgraenitz Differential Revision: https://reviews.llvm.org/D157540	2023-09-06 13:10:14 +02:00
Tom Eccles	43d729dda4	[flang][HLFIR] add more memory effects interfaces Anything that produces a hlfir.expr should have an allocation side effect so that it is not removed by CSE (which would result in two hlfir.destroy operations for the same expression). Similarly for hlfir.associate, which has hlfir.end_associate. Also adds read effects on arguments which are pointer-like or boxes. I see no regressions from this change when running llvm-testsuite with optimization enabled, or from SPEC2017 rate benchmarks. To test this, I have added MLIR's pass for testing side effect interfaces to fir-opt. Differential Revision: https://reviews.llvm.org/D158662	2023-09-06 10:29:57 +00:00
Phoebe Wang	c5fabaccef	[X86][Driver] Move mno-gather/mno-scatter from m_x86_Features_Group to m_Group. NFCI (#65457 ) m_x86_Features_Group always turn `mno-xxxx` into `-target-feature -xxxx`. In this case, we don't have `-gather/-scatter` but `+prefer-no-gather/scatter`. This patch solves unexpected warning when using `mno-gather/mno-scatter`: ``` '-gather' is not a recognized feature for this target (ignoring feature) '-scatter' is not a recognized feature for this target (ignoring feature) ```	2023-09-06 18:08:29 +08:00
Luke Lau	74f985b793	[RISCV] Remove -riscv-v-vector-bits-min in tests. NFC (#65404 ) V implies Zvl128b, but a lot of the fixed vector tests also redundantly specify -riscv-v-vector-bits-min=128. This patch removes them where there isn't another minimum vlen being tested for, and for cases where Zve* is being used Zvl128b was added to maintain the old test diff (and because an awkward vlen probably isn't interesting to test for). Other places where -risc-v-vector-bits-min were being used were replaced with Zvl.	2023-09-06 10:43:41 +01:00
Congcong Cai	b0831c3996	[clang-tidy][misc-include-cleaner]Avoid to insert same include header multiple times (#65431 ) `HeaderIncludes` won't update `ExistingIncludes` during inserting. We need to manage it in tidy check. Fixed: #65285	2023-09-06 17:37:14 +08:00
Dmitri Gribenko	97bf104d97	Revert "[DAG] Fold (shl (sext (add_nsw x, c1)), c2) -> (add (shl (sext x), c2), c1 << c2)" This reverts commit `b027ce0ab9`. This commit breaks Transforms/InferAddressSpaces/AMDGPU/flat_atomic.ll.	2023-09-06 11:28:55 +02:00
Fabio D'Urso	fdb29f7db5	[scudo] Rename AllocatorRingBuffer into scudo:ring_buffer To maintain the convention of Scudo names starting with "scudo:", which is used by some tooling to categorize memory usage. Reviewed By: Chia-hungDuan Differential Revision: https://reviews.llvm.org/D157102	2023-09-06 11:23:27 +02:00
Guillaume Chatelet	4acc3ffbb0	[reland][libc][NFC] split type_traits / utility in separate files (#65314 ) `type_traits` and `utility` refer to each other in their implementations. Also `type_traits` starts to become too big to be manageable. This PR splits each function into individual files. FTR this is [how libcxx handles large headers as well](https://github.com/llvm/llvm-project/tree/main/libcxx/include/__type_traits). The reland adds two missing functions : is_destructible_v and is_reference_v	2023-09-06 09:23:00 +00:00
Dmitri Gribenko	6cfb4119d3	[llvm][test] Write temporary files to %t instead of CWD	2023-09-06 11:20:19 +02:00
Simon Pilgrim	b027ce0ab9	[DAG] Fold (shl (sext (add_nsw x, c1)), c2) -> (add (shl (sext x), c2), c1 << c2) Assuming the ADD is nsw then it may be sign-extended to merge with a SHL op in a similar fold to the existing (shl (add x, c1), c2) -> (add (shl x, c2), c1 << c2) fold. This is most useful for helping to expose address math for X86, but has also touched several aarch64 test cases as well. Alive2: https://alive2.llvm.org/ce/z/2UpSbJ Differential Revision: https://reviews.llvm.org/D159198	2023-09-06 10:06:21 +01:00
Muhammad Omair Javaid	e82191469e	[LLDB] Skip TestBSDArchives.py on windows This fixes LLDB windows buildbot after updates to TestBSDArchives.py. https://lab.llvm.org/buildbot/#/builders/219/builds/5408 I have marked new failing test as an expected failure on Windows.	2023-09-06 14:03:21 +05:00
Kadir Cetinkaya	9a26d2c6d3	[clangd][unittests] Limit paralelism for clangd unittests We started seeing a lot of timeouts that align with the change in lit to execute gtests in shards. The logic there assumes tests are single-threaded, which is the case for most of the LLVM, hence they pick #shards ~ #cores (by slightly overshooting). There are enough unittests in clangd that rely on multi-threading, they can create arbitrarily many threads but we limit amount of meaningful work to ~4 thread per process. This change ensures that we're accounting for that paralelism when executing clangd tests and not overloading test executors. In theory the change overestimates the requirements, not all tests are multi-threaded, but it doesn't seem to be resulting in any regressions on my local runs. Fixes https://github.com/llvm/llvm-project/issues/64964. Fixes https://github.com/clangd/clangd/issues/1712.	2023-09-06 10:53:18 +02:00
Guillaume Chatelet	bcdbd0bd0b	Revert "[libc][NFC] split type_traits / utility in separate files (#65314 )" This reverts commit `b793c7e530`. It broke the libc-x86_64-debian-gcc-fullbuild-dbg build bot. https://lab.llvm.org/buildbot/#/builders/250/builds/9776	2023-09-06 08:47:45 +00:00
Kito Cheng	af9b25f9db	[RISCV] Optimize floating point scalar move and splat In D158086, we limit all floating point scalar move and splat can't fuse vsetvli with different SEW, and this patch try to relax the constraint as possible by introducing new SEW demand type: SEWGreaterThanOrEqualAndLessThan64, that allow SEW fused with larger SEW, but constraint it can't fused with SEW=64. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D158177	2023-09-06 16:39:30 +08:00
Guillaume Chatelet	b793c7e530	[libc][NFC] split type_traits / utility in separate files (#65314 ) `type_traits` and `utility` refer to each other in their implementations. Also `type_traits` starts to become too big to be manageable. This PR splits each function into individual files. FTR this is [how libcxx handles large headers as well](https://github.com/llvm/llvm-project/tree/main/libcxx/include/__type_traits).	2023-09-06 10:31:08 +02:00
Guillaume Chatelet	56426c6cdf	[libc] customizable namespace 1/4 (#65321 ) This implements the first step of https://discourse.llvm.org/t/rfc-customizable-namespace-to-allow-testing-the-libc-when-the-system-libc-is-also-llvms-libc/73079 Make required Bazel and CMake preparatory changes which define the var for the customizable namespace and use it (pass it as a compile option -DLIBC_NAMESPACE=<...>). Differential Revision: https://reviews.llvm.org/D159112	2023-09-06 10:27:56 +02:00
martinboehme	c0703eaec1	[clang][dataflow] Emit an error if source code is not compiled as C++. (#65301 ) The shape of certain elements of the AST can vary depending on the langugage. We currently only support C++.	2023-09-06 10:02:21 +02:00
martinboehme	be12f26dc0	[clang][dataflow][NFC] Remove stale comment. (#65322 )	2023-09-06 09:52:30 +02:00
David Spickett	6c3232b515	[lldb][Docs] Add simpler "automatic" cross-compile option to build docs (#65311 ) The main way I cross build lldb is to point CMake at an existing host build to get the native tablegen tools. This is what we had documented before. There is another option where you start from scratch and the host tools are built for you. This patch documents that and explains which one to choose. Added another arm64 example which uses this. So the frst one is the "automatic" build and the second is the traditional approach. For ease of copy paste and understanding, I've kept the full command in each section and noted the one difference between them. Along the way I updated some of the preamble to explain the two approaches and updated some language e.g. removing "just ...". Eveyone's "just" is different, doubly so when cross-compiling.	2023-09-06 08:48:19 +01:00
martinboehme	f470c361d9	[clang][dataflow] Eliminate uses of `RecordValue::getChild()`. (#65329 ) We want to work towards eliminating the `RecordStorageLocation` from `RecordValue`. These particular uses of `RecordValue::getChild()` can simply be replaced with `RecordStorageLocation::getChild()`.	2023-09-06 09:43:05 +02:00
Sergei Barannikov	a479be0f39	[MC] Change tryParseRegister to return ParseStatus (NFC) This finishes the work of replacing OperandMatchResultTy with ParseStatus, started in D154101. As a drive-by change, rename some RegNo variables to just Reg (a leftover from the days when RegNo had 'unsigned' type).	2023-09-06 10:28:12 +03:00
Ingo Müller	ca23c933bd	[mlir][python] Create all missing attribute builders. This patch adds attribute builders for all buildable attributes from the builtin dialect that did not previously have any. These builders can be used to construct attributes of a particular type identified by a string from a Python argument without knowing the details of how to pass that Python argument to the attribute constructor. This is used, for example, in the generated code of the Python bindings of ops. The list of "all" attributes was produced with: ( grep -h "ods_ir.AttrBuilder.get" $(find ../build/ -name "_ops_gen.py") \ \| cut -f2 -d"'" git grep -ho "^def [a-zA-Z0-9_]" -- include/mlir/IR/CommonAttrConstraints.td \ \| cut -f2 -d" " ) \| sort -u Then, I only retained those that had an occurence in `mlir/include/mlir/IR`. In particular, this drops many dialect-specific attributes; registering those builders is something that those dialects should do. Finally, I removed those attrbiutes that had a match in `mlir/python/mlir/ir.py` already and implemented the remaining ones. The only ones that still miss a builder now are the following: * Represent more than one possible attribute type: - `Any.Attr` (9x) - `IntNonNegative` - `IntPositive` - `IsNullAttr` - `ElementsAttr` I am not sure what "constant attributes" are: - `ConstBoolAttrFalse` - `ConstBoolAttrTrue` - `ConstUnitAttr` * `Location` not exposed by Python bindings: - `LocationArrayAttr` - `LocationAttr` * `get` function not implemented in Python bindings: - `StringElementsAttr` This patch also fixes a compilation problem with `I64SmallVectorArrayAttr`. Reviewed By: makslevental, rkayaith Differential Revision: https://reviews.llvm.org/D159403	2023-09-06 07:09:25 +00:00
jeanPerier	d26c78b2ad	[flang] handle indirect module variable use in internal procedure (#65324 ) When a module variable is referenced inside an internal procedure, but the use statement for the module is inside the host, semantics may not create any symbols with HostAssocDetails directly under the internal procedure scope. So pft::getScopeVariableList, that is called in the bridge when lowering the internal procedure scope, failed to instantiate the module variables. This lead to "symbol is not mapped to any IR value" compile time errors. This patch fixes the issue by adding the variables to the list of "captured" global variables from the host program, so that they are instantiated as part of the `internalProcedureBindings` in the bridge. The rational of doing it that way instead of changing `getScopeVariableList` is that `getScopeVariableList` would have to import all the module variables used inside the host since it cannot know which ones are referenced inside the internal procedure from the semantics::Scope information. The fix in this patch only instantiates the module variables from the host that are actually referenced inside the internal procedure.	2023-09-06 09:07:45 +02:00
Tobias Gysi	0c81e6dd91	[mlir][llvm] Add icmp folder (#65343 ) This revision adds a simple icmp folder that performs the following folds to the LLVM dialect icmp op: - cmpi(eq/ne, x, x) -> true/false - cmpi(eq/ne, alloca, null) -> false/true - cmpi(eq/ne, null, alloca) -> cmpi(eq/ne, alloca, null)	2023-09-06 08:58:39 +02:00
laichunfeng	71b5f57f0d	[RISCV] Adjust first sp size to use c.addi16sp. addi sp, sp, 512 may be used to recover the sp in the epilogue when stack size is larger than 2047(2^11 - 1), however, it can not be compressed using C extension, and addi sp, sp, 496 is able to be compressed, so try to use 496 as the ajust amount of the fisrt sp if function doesn't need extra instructions after adjust. Reviewed By: wangpc Differential Revision: https://reviews.llvm.org/D159431	2023-09-06 14:26:52 +08:00
Sergei Barannikov	9ddcacce08	[Sparc] Replace OperandMatchResultTy with ParseStatus (NFC) ParseStatus is slightly more convenient to use due to implicit conversion from bool, which allows to do something like: ``` return Error(L, "msg"); ``` when with MatchOperandResultTy it had to be: ``` Error(L, "msg"); return MatchOperand_ParseFail; ``` It also has more appropriate name since parse* methods are not only for parsing operands. Reviewed By: brad Differential Revision: https://reviews.llvm.org/D154321	2023-09-06 09:22:34 +03:00
Guray Ozen	1dc0071216	[MLIR] Guard Cuda 12.0+ newer driver APIs with CUDA_VERSION macro checks Fixes #64529 https://github.com/llvm/llvm-project/issues/64529 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D159440	2023-09-06 08:17:06 +02:00
Ting Wang	71be020dda	[SelectionDAG][PowerPC] Memset reuse vector element for tail store On PPC there are instructions to store element from vector(e.g. stxsdx/stxsiwx), and these instructions can be leveraged to avoid tail constant in memset and constant splat array initialization. This patch tries to explore these opportunities. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D138883	2023-09-06 01:52:38 -04:00
Marek Sedláček	ebf01690d9	Bug fix for multi-line labels in CFG dot graph After D154102 multi-line labels would get split incorrectly. When CFG is generated for a function with basic block name longer than 80 lines, then the header separator will be placed after the line break for the label name instead of after the whole label name. The fix is simple by just moving the insert of \| character before the line splitting happens. Differential Revision: https://reviews.llvm.org/D159207	2023-09-05 22:01:51 -07:00
Trung Nguyen	976dbae246	[libunwind] Haiku: Initial support Adds build support for Haiku. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D157866	2023-09-06 00:56:09 -04:00
Pravin Jagtap	b230472f22	[AMDGPU] Extend v2i16 & v2f16 support for llvm.amdgcn.update.dpp intr (#65318 ) Authored-by: Pravin Jagtap <Pravin.Jagtap@amd.com>	2023-09-06 10:20:34 +05:30
Craig Topper	2a7b8ab07c	[RISCV] Use add.uw for (or (and X, 0xFFFFFFFF), Y) if Y has zeroes in the lower 32 bits. (#65402 )	2023-09-05 21:05:53 -07:00
Michael Maitland	5b7982f2b2	[RISCV] Fix SiFive7 formula for Reductions and ordered Reductions (#65385 ) Reductions specified the correct formula in the comments but was implemented incorrectly. Ordered reductions formula was refined to be more accurate to actual hardware. A llvm-mca test case for reductions is added so that we have a better idea of how the model is performing.	2023-09-05 23:04:21 -04:00
Ed Maste	43d05f4cc9	[libc++] Switch FreeBSD CI job to Clang 16 (#65422 ) libc++ will drop support for Clang 15 before long.	2023-09-05 22:07:28 -04:00
Jie Fu	1438544e23	[DFSAN] Silence parameter 'va_labels' set but not used warning (NFC) /data/llvm-project/compiler-rt/lib/dfsan/dfsan_custom.cpp:2546:37: error: parameter 'va_labels' set but not used [-Werror,-Wunused-but-set-parameter] dfsan_label va_labels, dfsan_label ret_label, ^ 1 error generated.	2023-09-06 09:28:43 +08:00
Tomasz Kuchta	8dbcf8eba7	[DFSAN] Add support for sscanf. Reviewed By: browneee Differential Revision: https://reviews.llvm.org/D153775	2023-09-06 01:16:31 +00:00
Chris Bieneman	400d3261a0	[HLSL] Cleanup support for `this` as an l-value The goal of this change is to clean up some of the code surrounding HLSL using CXXThisExpr as a non-pointer l-value. This change cleans up a bunch of assumptions and inconsistencies around how the type of `this` is handled through the AST and code generation. This change is be mostly NFC for HLSL, and completely NFC for other language modes. This change introduces a new member to query for the this object's type and seeks to clarify the normal usages of the this type. With the introudction of HLSL to clang, CXXThisExpr may now be an l-value and behave like a reference type rather than C++'s normal method of it being an r-value of pointer type. With this change there are now three ways in which a caller might need to query the type of `this`: * The type of the `CXXThisExpr` * The type of the object `this` referrs to * The type of the implicit (or explicit) `this` argument This change codifies those three ways you may need to query respectively as: * CXXMethodDecl::getThisType() * CXXMethodDecl::getThisObjectType() * CXXMethodDecl::getThisArgType() This change then revisits all uses of `getThisType()`, and in cases where the only use was to resolve the pointee type, it replaces the call with `getThisObjectType()`. In other cases it evaluates whether the desired returned type is the type of the `this` expr, or the type of the `this` function argument. The `this` expr type is used for creating additional expr AST nodes and for member lookup, while the argument type is used mostly for code generation. Additionally some cases that used `getThisType` in simple queries could be substituted for `getThisObjectType`. Since `getThisType` is implemented in terms of `getThisObjectType` calling the later should be more efficient if the former isn't needed. Reviewed By: aaron.ballman, bogner Differential Revision: https://reviews.llvm.org/D159247	2023-09-05 19:38:50 -05:00
LLVM GN Syncbot	68e94f1f27	[gn build] Port `065dc485bd`	2023-09-06 00:01:57 +00:00
Nico Weber	d50b56d18c	[gn] port `fbdf684fae`	2023-09-05 20:01:17 -04:00
Greg Clayton	d4a141ef93	Switch over to using the LLVM archive parser for BSD archives. Our LLDB parser didn't correctly handle archives of all flavors on different systems, it currently only correctly handled BSD archives, normal and thin, on macOS, but I noticed that it was getting incorrect information when decoding a variety of archives on linux. There were subtle changes to how names were encoded that we didn't handle correctly and we also didn't set the result of GetObjectSize() correctly as there was some bad math. This didn't matter when exracting .o files from .a files for LLDB because the size was always way too big, but it was big enough to at least read enough bytes for each object within the archive. This patch does the following: - switch over to use LLVM's archive parser and avoids previous code duplication - remove values from ObjectContainerBSDArchive::Object that we don't use like: - uid - gid - mode - fix ths ObjectContainerBSDArchive::Object::file_size value to be correct - adds tests to test that we get the correct module specifications Differential Revision: https://reviews.llvm.org/D159408	2023-09-05 16:54:05 -07:00
Jonas Devlieghere	511b4dd928	[llvm] Add LLVM_CTOR_NODISCARD to WithColor Similarly to WithMarkup, add LLVM_CTOR_NODISCARD to the WithColor constructor.	2023-09-05 16:47:41 -07:00
Jonas Devlieghere	11ce3d99b7	[llvm] Add LLVM_CTOR_NODISCARD (#65418 ) [[nodiscard]] on constructors is a defect report against C++17. That means that it should be applied retroactively, though older compilers might not know about it and emit warnings. This adds a back-compatibility macro.	2023-09-05 16:47:04 -07:00
Jakub Mazurkiewicz	065dc485bd	[libc++][ranges] Implement P2443R1: `views::chunk_by` This patch implements https://www.open-std.org/jtc1/sc22/wg21/docs/papers/2021/p2443r1.html (`views::chunk_by`). Reviewed By: #libc, var-const Differential Revision: https://reviews.llvm.org/D144767	2023-09-05 16:19:49 -07:00

1 2 3 4 5 ...

473767 Commits