The method implementations remain in the .cpp file; explicit instantiations have been added for these two types.
makeMatrix has been duplicated to makeIntMatrix and makeFracMatrix.
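A minimal sketch of the pattern, with illustrative stand-in names rather than
the actual classes: the template member definitions stay in the .cpp file, and
explicit instantiations are listed for the two element types callers need.

```cpp
// Matrix.h -- users of the template only see the declaration.
template <typename T> class Matrix {
public:
  T at(unsigned row, unsigned col) const;
  // ...
};

// Matrix.cpp -- member definitions stay here, out of the header.
template <typename T>
T Matrix<T>::at(unsigned row, unsigned col) const {
  return T(); // placeholder body for the sketch
}

// Explicit instantiations: only these two element types are usable by callers.
template class Matrix<long>;    // stand-in for the integer element type
template class Matrix<double>;  // stand-in for the fractional element type
```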
…cast) expansion
This revision adds a rewrite for sequences of vector `ext(bitcast)` to
use a more efficient sequence of vector operations comprising `shuffle`
and `bitwise` ops.
Such patterns appear naturally when writing quantization /
dequantization functionality with the vector dialect.
The rewrite performs a simple enumeration of each of the bits in the
result vector and determines its provenance in the source vector. The
enumeration is used to generate the proper sequence of `shuffle`,
`andi`, `ori` with shifts.
The rewrite currently only applies to 1-D non-scalable vectors and bails
out if the final vector element type is not a multiple of 8. This is a
failsafe heuristic determined empirically: if the resulting type is not
an even number of bytes, further complexities arise that are not
improved by this pattern: the heavy lifting still needs to be done by
LLVM.
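As a standalone illustration of the enumeration step (plain C++, not the
actual rewrite code), mapping a bit of the pre-extension result vector back to
its source element and bit position is simple index arithmetic:

```cpp
#include <cstdio>

// For a 1-D bitcast, bit `resultBit` of element `resultElement` in the result
// vector originates from a specific element/bit of the source vector; the
// rewrite enumerates these positions to build the shuffle/andi/ori sequence.
struct BitProvenance {
  unsigned srcElement; // index into the source vector
  unsigned srcBit;     // bit position inside that source element
};

static BitProvenance provenanceOf(unsigned resultElement, unsigned resultBit,
                                  unsigned resultElemBits,
                                  unsigned srcElemBits) {
  unsigned flatBit = resultElement * resultElemBits + resultBit;
  return {flatBit / srcElemBits, flatBit % srcElemBits};
}

int main() {
  // vector<8xi4> viewed as a bitcast of vector<4xi8>: element 3, bit 2 of the
  // i4 vector comes from element 1, bit 6 of the i8 source vector.
  BitProvenance p = provenanceOf(3, 2, /*resultElemBits=*/4, /*srcElemBits=*/8);
  std::printf("source element %u, bit %u\n", p.srcElement, p.srcBit);
  return 0;
}
```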
This patch adds a `host_assoc` attribute for operations that implement
FortranVariableInterface (e.g. `hlfir.declare`). The attribute is used
by the alias analysis to make better conclusions about memory overlap.
For example, a dummy argument of an inner subroutine and a host's
variable used inside the inner subroutine cannot refer to the same
object (if the dummy argument does not satisfy the exceptions in F2018
15.5.2.13).
This closes a performance gap between HLFIR optimization pipeline
and FIR ArrayValueCopy for Polyhedron/nf.
…lues
Emit an error at runtime when a list-directed input value is not
followed by a value separator or end of record. Previously, the runtime
I/O library was consuming as many input characters as were valid for
the type of the value, and leaving any remaining characters for the next
input edit, if any.
As suggested by Greg in https://github.com/llvm/llvm-project/pull/66534,
I'm adding a setting at the Target level that controls whether to show
leading zeroes in hex ValueObject values.
This has the benefit of reducing the number of characters displayed in
certain interfaces, like VSCode.
When checking to see if our index expressions can be converted into strided
operations, we previously gave up if the index type wasn't an exact match for
the intptrty for the address. Per gep semantics, this mismatch implies a sext
or trunc cast to the respective index type. For constants, go ahead and
evaluate that cast instead of giving up.
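A hedged sketch of the constant case, assuming the usual APInt facilities: the
cast implied by gep semantics is just a sign-extend or truncate of the constant
index to the pointer's index width.

```cpp
#include "llvm/ADT/APInt.h"

// Sketch only: when a constant gep index does not match intptrty, evaluate
// the implied sext/trunc and keep analyzing the adjusted constant instead of
// giving up on the strided lowering.
static llvm::APInt indexAsIntPtr(const llvm::APInt &Idx, unsigned IntPtrBits) {
  return Idx.sextOrTrunc(IntPtrBits);
}
```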
Note that the motivation for this is mostly test cleanup. We canonicalize at
the IR level such that the gep index will match the intptrty. This is mostly
useful so that we can write both RV32 and RV64 tests from the same source.
It's also helpful in preventing confusion - I've stumbled across this at least
four times now and wasted time each time.
Note: The test change for the scatter unit-stride cases contains a minor
regression for rv32 with 64-bit indices. This is an artifact of the order in
which changes are landing. It will be addressed in a near-future change for all
configurations.
When the old extension of D debug lines in fixed form source is enabled,
recognize continuation lines that begin with a D.
(No test is added, since the community's driver doesn't support the GNU
Fortran option -fd-lines-as-code.)
The rewriting of the extension intrinsic function SIZEOF was producing
results that would reference symbols that were not available in the
current scope, leading to crashes in lowering. The symbols could be
function result variables, for SIZEOF(func()), or bare derived type
component names, for SIZEOF(array(n)%component). Fixing this without
regressing on a current test case involved careful threading of some
state through the TypeAndShape characterization code and the
shape/bounds analyzer, and some clean-up was done along the way.
The Z#_HI register definitions were created during the very early SVE
enablement work and before the SVE calling convention was locked in.
As they look entirely unused, they need to go.
If the SRL for Hi constant folds, but we don't remove those bits from
the Lo, we can end up with strange constant folding through DAGCombine later.
I've only seen this with constants being lowered to constant pools
during lowering on RISC-V.
On the first split we create two i32 truncstores and an srl to shift
the high part down. The srl gets constant folded, producing a new i32
constant, but the truncstore for the low half still uses the original
constant.
This original constant then gets converted to a constant pool
before we revisit the stores to further split them. The constant
pool prevents further constant folding of the additional srls.
After legalization is done, we run DAGCombiner and get some constant
folding of srl via computeKnownBits which can peek through the constant
pool load. This can create new constants that also need a constant pool.
When running multiple shards in parallel, one shard might write to the
cache while another one is reading this cache. Instead of updating the
file in place, write to a temporary file and swap the cache file using
os.replace(). This is an atomic operation and means shards will either
see the old state or the new one.
This is a minor step towards deprecating bufferization.alloc_tensor().
It replaces the examples with tensor.empty() and adjusts the underlying
rewriting logic to prepare for this upcoming change.
The intrinsic uses ImmArg so TargetConstant would be consistent
with how other intrinsics are handled.
This hides the constants from type legalization so we can remove
the promotion support.
isel patterns are updated accordingly.
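A hedged sketch of the change in custom lowering terms (generic names, not the
actual RISC-V code): the immediate operand is emitted with getTargetConstant
rather than getConstant, so legalization treats it as opaque.

```cpp
#include "llvm/CodeGen/SelectionDAG.h"

using namespace llvm;

// Sketch only: for an intrinsic operand marked ImmArg, build a TargetConstant
// (instead of DAG.getConstant) so type legalization never sees or promotes it.
static SDValue lowerImmArgOperand(SelectionDAG &DAG, const SDLoc &DL,
                                  uint64_t Imm, EVT VT) {
  return DAG.getTargetConstant(Imm, DL, VT);
}
```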
Fold references to the (relatively new) intrinsic function NORM2 at
compilation time when the argument(s) are all constants. (Getting this
done right involved some changes to the API of the accumulator function
objects used by the DoReduction<> template, which rippled through some
other reduction function folding code.)
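For illustration only (this is not the actual flang API), the shape of such an
accumulator object is roughly as follows: the reduction template feeds it one
constant element at a time and then asks for the finished value.

```cpp
#include <cmath>
#include <vector>

// Toy stand-in for a NORM2 accumulator driven by a DoReduction<>-style
// template; the real folding code is more careful (scaling to avoid overflow,
// handling multiple kinds and DIM=).
class Norm2Accumulator {
public:
  void Accumulate(double x) { sumOfSquares_ += x * x; }
  double Result() const { return std::sqrt(sumOfSquares_); }

private:
  double sumOfSquares_{0.0};
};

static double FoldNorm2(const std::vector<double> &constantArg) {
  Norm2Accumulator acc;
  for (double x : constantArg)
    acc.Accumulate(x);
  return acc.Result();
}
```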
Use the recently introduced llvm.mlir.zero operation for values with
LLVM target extension type. Replaces the previous workaround that uses a
single zero-valued integer attribute constant operation.
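A hedged sketch of the replacement in builder terms (the builder call is
assumed from the MLIR LLVM dialect):

```cpp
#include "mlir/Dialect/LLVMIR/LLVMDialect.h"
#include "mlir/IR/Builders.h"

// Sketch only: materialize a zero of an LLVM target extension type with
// llvm.mlir.zero instead of the old zero-valued integer constant workaround.
static mlir::Value createZeroOfTargetExtType(mlir::OpBuilder &builder,
                                             mlir::Location loc,
                                             mlir::Type targetExtType) {
  return builder.create<mlir::LLVM::ZeroOp>(loc, targetExtType);
}
```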
Signed-off-by: Lukas Sommer <lukas.sommer@codeplay.com>
This is a fix for a subset of legalization problems around 64 bit indices on
rv32 targets. For RV32+V, we were using the wrong mask type for the manual
truncation lowering for fixed length vectors. Instead, just use the generic
TRUNCATE node, and let it be lowered as needed.
Note that legalization is still broken for rv32+zve32. That appears to be
a different issue.
The POINTER= and TARGET= arguments to the intrinsic function
ASSOCIATED() can be the results of references to functions that return
object pointers or procedure pointers. NULL() was handled correctly, but
program-defined pointer-valued functions were not. Correct the validation of
ASSOCIATED() and extend the infrastructure used to detect and
characterize procedures and pointers.
Some VP intrinsic definitions were missing the
VP_PROPERTY_FUNCTIONAL_INTRINSIC property. This patch fills them in, and
adds a static_assert that all VP intrinsics have an equivalent opcode or
intrinsic defined so we don't forget them in future.
Some VP intrinsics don't have an equivalent, namely merge and strided
load/store. For those, a new property was added to mark that they don't
have a non-VP equivalent.
This adds a helper method to get the ID of the functionally equivalent
intrinsic, similar to the existing getFunctionalOpcodeForVP and
getConstrainedIntrinsicIDForVP methods.
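A hedged usage sketch (the name of the new helper is assumed here, by analogy
with the existing queries): a client that wants any scalar equivalent can try
the functional opcode first and fall back to the functional intrinsic ID.

```cpp
#include "llvm/IR/IntrinsicInst.h"

using namespace llvm;

// Sketch only: getFunctionalIntrinsicIDForVP is an assumed name, mirroring
// the existing getFunctionalOpcodeForVP / getConstrainedIntrinsicIDForVP.
static bool hasNonVPEquivalent(Intrinsic::ID VPID) {
  if (VPIntrinsic::getFunctionalOpcodeForVP(VPID))
    return true; // e.g. vp.add -> Instruction::Add
  if (VPIntrinsic::getFunctionalIntrinsicIDForVP(VPID))
    return true; // e.g. vp.smax -> llvm.smax
  return false;  // e.g. vp.merge and the strided loads/stores have neither
}
```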
Summary:
The GPU build has a lot of magic around how we package the output.
Generally, the GPU needs to exist as a secondary fatbinary image for
offloading languages. This is because offloading languages pretend that
offloading to an accelerator is a single file. This then needs to be put
into a single file to make it mesh with the existing build
infrastructure. To work with this, the `libc` makes an installed version
of the library that simply embeds the GPU code into an empty stub file.
This wasn't being updated correctly, which led to the installed `libc`
static library not being updated correctly when the underlying file was
changed. The previous behaviour only updated when the entrypoint itself
was modified, but not any of its headers. By adding a dependency on the
actual *object* file we should now capture the regular CMake semantics.
This commit removes the deallocation capabilities of
one-shot-bufferization. One-shot-bufferization should never deallocate
any memrefs as this should be entirely handled by the
ownership-based-buffer-deallocation pass going forward. This means the
`allow-return-allocs` pass option now defaults to true and `create-deallocs`
defaults to false; both options, as well as the escape attribute indicating
whether a memref escapes the current region, will be removed. A new
`allow-return-allocs-from-loops` option is added as a
temporary workaround for some bufferization limitations.
We'll need to generalize this fold to check for any zero upper bits to address some of the D155472 regressions, but this exposes a number of issues. For now, just use the general MaskedValueIsZero test instead of the assertzext.
Thanks to Giuseppe D'Angelo for pointing this out on the cpplang Slack!
The example implementation in https://eel.is/c++draft/string.view.comparison#example-1
was necessary when it was written, in C++17, but in C++20 we don't need that
complexity anymore, because of the reversed candidates that are
synthesized by the compiler.
Update LiveIntervals after rewriting:
%reg = INSERT_SUBREG undef %reg, %subreg, subidx
to:
undef %reg:subidx = COPY %subreg
D113044 implemented this for the non-undef case.
This gets rid of the separate parameter enable_modules_lsv in favor of
adding a named option to the enable_modules parameter. The patch also
removes the getModuleFlag helper, which was just a really complicated
way of hardcoding "none".
This revision adds support for empty tensor elimination to
"bufferization.materialize_in_destination" by implementing the
`SubsetInsertionOpInterface`.
Furthermore, the One-Shot Bufferize conflict detection is improved for
"bufferization.materialize_in_destination".
This patch simplifies the pattern `icmp X and/or C1, X and/or C2` when
one constant mask is the subset of the other.
If `C1 & C2 == C1`, `A = X and/or C1`, `B = X and/or C2`, we can do the
following folds:
`icmp ule A, B -> true`
`icmp ugt A, B -> false`
We can apply similar folds for signed predicates when `C1` and `C2` are
the same sign:
`icmp sle A, B -> true`
`icmp sgt A, B -> false`
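As a quick standalone sanity check of the unsigned case (not the InstCombine
code itself), pick C1 = 0x08 and C2 = 0x0C so that `C1 & C2 == C1` and verify
exhaustively over i8:

```cpp
#include <cassert>
#include <cstdint>

int main() {
  const uint8_t C1 = 0x08, C2 = 0x0C; // C1 & C2 == C1
  for (unsigned x = 0; x < 256; ++x) {
    uint8_t a = uint8_t(x) | C1; // A = X or C1
    uint8_t b = uint8_t(x) | C2; // B = X or C2
    assert(a <= b);              // icmp ule A, B -> true
  }
  return 0;
}
```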
Alive2: https://alive2.llvm.org/ce/z/Q4ekP5
Fixes #65833.
This patch moves the group of OpenMP MLIR passes used after lowering of
Fortran to MLIR into a pipeline shared by `flang-new` and `bbc`.
Currently, the `bbc` tool does not produce the expected FIR for offloading-
enabled OpenMP codes due to not running these passes.
Unit tests exercising these passes are updated to check `bbc` output as well.
…(trunci) expansion
This revision adds a rewrite for sequences of vector `bitcast(trunci)`
to use a more efficient sequence of vector operations comprising
`shuffle` and `bitwise` ops.
Such patterns appear naturally when writing quantization /
dequantization functionality with the vector dialect.
The rewrite performs a simple enumeration of each of the bits in the
result vector and determines its provenance in the pre-trunci vector.
The enumeration is used to generate the proper sequence of `shuffle`,
`andi`, `ori` followed by an optional final `trunci`/`extui`.
The rewrite currently only applies to 1-D non-scalable vectors and bails
out if the final vector element type is not a multiple of 8. This is a
failsafe heuristic determined empirically: if the resulting type is not
an even number of bytes, further complexities arise that are not
improved by this pattern: the heavy lifting still needs to be done by
LLVM.
Summary:
This patch copies a config file for the GPU similar to the
baremetal/embedded implementation. This will configure the
implementations of functions like `sprintf` and `snprintf` to be
compiled into simpler versions that can be run on the GPU. These
functions cannot be enabled yet, as Vararg support hasn't landed, but this
configuration will be used once it does.