llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-11-29 16:41:27 +00:00

Author	SHA1	Message	Date
Marius Brehler	849f8183fb	[mlir] Fix ConstantOp verifier This restricts the attributes to integers for constants of type IndexType. So far an attribute like StringAttr as in %c1 = constant "" : index is valid. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98216	2021-03-12 08:49:25 +01:00
Sergei Grechanik	fd2b08969b	[mlir][Vector] Lowering of transfer_read/write to vector.load/store This patch introduces progressive lowering patterns for rewriting vector.transfer_read/write to vector.load/store and vector.broadcast in certain supported cases. Reviewed By: dcaballe, nicolasvasilache Differential Revision: https://reviews.llvm.org/D97822	2021-03-11 18:17:51 -08:00
Sergei Grechanik	46ef6ffdaf	[NFC] Test commit. Add empty lines.	2021-03-11 17:31:20 -08:00
Mehdi Amini	e1364f1068	Replace use of OperationState with builder::create in GPU Kernel Outlining (NFC) OperationState is a low level API that is rarely indicated, the builder API convenient wrapper is preferred when possible.	2021-03-12 00:14:02 +00:00
Diego Caballero	0fd0fb5329	Reland: [mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer. This patch adds support for vectorizing loops with 'iter_args' when those loops are not a vector dimension. This allows vectorizing outer loops with an inner 'iter_args' loop (e.g., reductions). Vectorizing scenarios where 'iter_args' loops are vector dimensions would require more work (e.g., analysis, generating horizontal reduction, etc.) not included in this patch. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97892	2021-03-12 01:08:28 +02:00
Diego Caballero	96891f0418	Reland: [mlir][Vector][Affine] Improve affine vectorizer algorithm This patch replaces the root-terminal vectorization approach implemented in the Affine vectorizer with a topological order approach that vectorizes all the operations within the target loop nest. These are the most important changes introduced by the new algorithm: * Removed tracking of root and terminal ops. Existing vectorization functionality is preserved and extended so that loop nests without root-terminal chains can be vectorized. * Vectorizing a loop nest now only requires a single topological traversal. * A new vector loop nest is incrementally built along the vectorization process. The original scalar loop is kept intact. No cloning guard is needed to recover the scalar loop if vectorization fails. This approach also simplifies the challenging task of replacing a loop operation amid the vectorization process without invalidating the analysis information that depends on the original loop. * Vectorization of specific operations has been implemented as independent, preparing them to be moved to a potential vectorization interface. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97442	2021-03-12 00:19:50 +02:00
River Riddle	31bb8efd69	[mlir][StorageUniquer] Properly call the destructor on non-trivially destructible storage instances This allows for storage instances to store data that isn't uniqued in the context, or contain otherwise non-trivial logic, in the rare situations that they occur. Storage instances with trivial destructors will still have their destructor skipped. A consequence of this is that the storage instance definition must be visible from the place that registers the type. Differential Revision: https://reviews.llvm.org/D98311	2021-03-11 11:35:32 -08:00
Diego Caballero	ed193bce9d	[mlir][Vector][Affine] Fix heap-use-after-free in vectorizer This patch fixes a heap-use-after-free introduced by the recent changes in the vectorizer: https://reviews.llvm.org/rG95db7b4aeaad590f37720898e339a6d54313422f The problem is due to the way candidate loops are visited. All candidate loops are pattern-matched beforehand using the 'NestedMatch' utility. These matches may intersect with each other so it may happen that we try to vectorize a loop that was previously vectorized. The new vectorization algorithm replaces the original loops that are vectorized with new loops and, therefore, any reference to the original loops in the pre-computed matches becomes invalid. This patch fixes the problem by classifying the candidate matches into buckets before vectorization. Each bucket contains all the matches that intersect. The vectorizer uses these buckets to make sure that we only vectorize one match from each bucket, at most. Differential Revision: https://reviews.llvm.org/D98382	2021-03-11 20:44:07 +02:00
Nikita Popov	f3f0c6cd47	[mlir] Remove uses of type-less CreateLoad() APIs (NFC) For the use in LLVMOps.td I used the getPointerElementType() escape hatch, as it's not obvious to me how the load type should be properly obtained here.	2021-03-11 18:39:20 +01:00
Alex Zinenko	27104390e8	[mlir] fix cmake build	2021-03-11 18:22:00 +01:00
Alex Zinenko	3ba14fa0ce	[mlir] Introduce data layout modeling subsystem Data layout information allows to answer questions about the size and alignment properties of a type. It enables, among others, the generation of various linear memory addressing schemes for containers of abstract types and deeper reasoning about vectors. This introduces the subsystem for modeling data layouts in MLIR. The data layout subsystem is designed to scale to MLIR's open type and operation system. At the top level, it consists of attribute interfaces that can be implemented by concrete data layout specifications; type interfaces that should be implemented by types subject to data layout; operation interfaces that must be implemented by operations that can serve as data layout scopes (e.g., modules); and dialect interfaces for data layout properties unrelated to specific types. Built-in types are handled specially to decrease the overall query cost. A concrete default implementation of these interfaces is provided in the new Target dialect. Defaults for built-in types that match the current behavior are also provided. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97067	2021-03-11 16:54:47 +01:00
Arpith C. Jacob	b4a516cc43	[mlir] Add LLVM loop codegen options to control software pipelining Support specifying the II and disabling pipelining. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98420	2021-03-11 16:46:44 +01:00
Tres Popp	25a20b8aa6	[mlir] Correct verifyCompatibleShapes verifyCompatibleShapes is not transitive. Create an n-ary version and update SameOperandShapes and SameOperandAndResultShapes traits to use it. Differential Revision: https://reviews.llvm.org/D98331	2021-03-11 13:04:10 +01:00
Julian Gross	2aef202981	[mlir] Fix invalid hoisting of dependent allocs in buffer hoisting pass. Buffer hoisting moves allocs upwards although it has dependency within its nested region. This patch fixes this issue. https://bugs.llvm.org/show_bug.cgi?id=49142 Differential Revision: https://reviews.llvm.org/D98248	2021-03-11 11:46:16 +01:00
Christian Sigg	bafe418d12	[mlir] Change test-gpu-to-cubin to derive from SerializeToBlobPass Clean-up after D98279, remove one call to createConvertGPUKernelToBlobPass(). Depends On D98203 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98360	2021-03-11 10:42:20 +01:00
Frederik Gossen	b975e3b5aa	[MLIR] Add canoncalization for `shape.is_broadcastable` Canonicalize `is_broadcastable` to constant true if fewer than 2 unique shape operands. Eliminate redundant operands, otherwise. Differential Revision: https://reviews.llvm.org/D98361	2021-03-11 10:10:34 +01:00
Christian Sigg	2224221fb3	[mlir] Add NVVM to CUBIN conversion to mlir-opt If MLIR_CUDA_RUNNER_ENABLED, register a 'gpu-to-cubin' conversion pass to mlir-opt. The next step is to switch CUDA integration tests from mlir-cuda-runner to mlir-opt + mlir-cpu-runner and remove mlir-cuda-runner. Depends On D98279 Reviewed By: herhut, rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D98203	2021-03-11 10:07:11 +01:00
Matthias Springer	c40e0d7609	[mlir][AVX512] Implement sparse vector dot product integration test. This test operates on two hardware-vector-sized vectors and utilizes vp2intersect and mask.compress. PHAB_REVIEW=D98099	2021-03-11 13:00:17 +09:00
River Riddle	4e02eb8014	[mlir] Optimize the implementation of RegionDCE The current implementation has some inefficiencies that become noticeable when running on large modules. This revision optimizes the code, and updates some out-dated idioms with newer utilities. The main components of this optimization include: * Add an overload of Block::eraseArguments that allows for O(N) erasure of disjoint arguments. * Don't process entry block arguments given that we don't erase them at this point. * Don't track individual operation results, given that we don't erase them. We can just track the parent operation. Differential Revision: https://reviews.llvm.org/D98309	2021-03-10 16:39:50 -08:00
Emilio Cota	c0891706bc	[mlir] Add polynomial approximation for math::Log2 ``` name old cpu/op new cpu/op delta BM_mlir_Log2_f32/10 134ns ±15% 45ns ± 4% -66.39% (p=0.000 n=20+17) BM_mlir_Log2_f32/100 1.03µs ±16% 0.12µs ±10% -88.78% (p=0.000 n=20+18) BM_mlir_Log2_f32/1k 10.3µs ±16% 0.7µs ± 5% -93.24% (p=0.000 n=20+17) BM_mlir_Log2_f32/10k 104µs ±15% 7µs ±14% -93.25% (p=0.000 n=20+20) BM_eigen_s_Log2_f32/10 95.3ns ±17% 90.9ns ± 6% ~ (p=0.228 n=20+18) BM_eigen_s_Log2_f32/100 907ns ± 3% 911ns ± 6% ~ (p=0.539 n=16+20) BM_eigen_s_Log2_f32/1k 9.88µs ± 4% 9.85µs ± 3% ~ (p=0.790 n=16+17) BM_eigen_s_Log2_f32/10k 105µs ±10% 110µs ±16% ~ (p=0.459 n=16+20) BM_eigen_v_Log2_f32/10 32.5ns ±31% 33.9ns ±14% +4.31% (p=0.028 n=17+20) BM_eigen_v_Log2_f32/100 176ns ± 8% 180ns ± 7% +2.19% (p=0.045 n=16+17) BM_eigen_v_Log2_f32/1k 1.44µs ± 4% 1.50µs ± 9% +3.91% (p=0.001 n=16+17) BM_eigen_v_Log2_f32/10k 14.5µs ±10% 15.0µs ± 8% +3.92% (p=0.002 n=16+19) ``` Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D98282	2021-03-10 14:49:22 -08:00
Christian Sigg	6a291ed0f0	[mlir] Remove unnecessary copying of pass options I missed a comment in D98279 that you don't need to copy pass options. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D98366	2021-03-10 21:55:28 +01:00
Weiwei Li	619c1505f9	[mlir][spirv] Define spv.Image Operation co-authered-by: Alan Liu <alanliu.yf@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98270	2021-03-10 15:48:04 -05:00
Alex Zinenko	79da91c59a	Revert "[mlir][Vector][Affine] Improve affine vectorizer algorithm" This reverts commit `95db7b4aea`. This breaks vectorize_2d.mlir and vectorize_3d.mlir test under ASAN (use after free).	2021-03-10 20:25:49 +01:00
Alex Zinenko	ed715536f1	Revert "[mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer." This reverts commit `77a9d1549f`. Parent commit is broken.	2021-03-10 20:25:32 +01:00
Diego Caballero	77a9d1549f	[mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer. This patch adds support for vectorizing loops with 'iter_args' when those loops are not a vector dimension. This allows vectorizing outer loops with an inner 'iter_args' loop (e.g., reductions). Vectorizing scenarios where 'iter_args' loops are vector dimensions would require more work (e.g., analysis, generating horizontal reduction, etc.) not included in this patch. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97892	2021-03-10 20:40:21 +02:00
Diego Caballero	95db7b4aea	[mlir][Vector][Affine] Improve affine vectorizer algorithm This patch replaces the root-terminal vectorization approach implemented in the Affine vectorizer with a topological order approach that vectorizes all the operations within the target loop nest. These are the most important changes introduced by the new algorithm: * Removed tracking of root and terminal ops. Existing vectorization functionality is preserved and extended so that loop nests without root-terminal chains can be vectorized. * Vectorizing a loop nest now only requires a single topological traversal. * A new vector loop nest is incrementally built along the vectorization process. The original scalar loop is kept intact. No cloning guard is needed to recover the scalar loop if vectorization fails. This approach also simplifies the challenging task of replacing a loop operation amid the vectorization process without invalidating the analysis information that depends on the original loop. * Vectorization of specific operations has been implemented as independent, preparing them to be moved to a potential vectorization interface. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97442	2021-03-10 20:29:58 +02:00
Vladislav Vinogradov	b599f464d4	[mlir][CMAKE] Fix build with BUILD_SHARED_LIBS=ON Link `MLIRStandardToLLVM` to `MLIRAVX512Transforms`, since the latter uses `LLVMTypeConverter` defined in the first one. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D98336	2021-03-10 14:52:36 +01:00
Alex Zinenko	e02dd790b1	[mlir] fix typo in OpDefinitions.md	2021-03-10 14:44:08 +01:00
Alex Zinenko	78f3fb4f46	[mlir] Update comments in ArmNeon dialect. NFC These were not updated when squashing LLVMArmNeon and ArmNeon dialects.	2021-03-10 13:35:57 +01:00
Alex Zinenko	a776942ba1	[mlir] squash LLVM_AVX512 dialect into AVX512 The dialect separation was introduced to demarkate ops operating in different type systems. This is no longer the case after the LLVM dialect has migrated to using built-in vector types, so the original reason for separation is no longer valid. Squash the two dialects into one. The code size decrease isn't quite large: the ops originally in LLVM_AVX512 are preserved because they match LLVM IR intrinsics specialized for vector element bitwidth. However, it is still conceptually beneficial to have only one dialect. I originally considered to use Tablegen multiclasses to define both the type-polymorphic op and its two intrinsic-related instantiations, but decided against it given both the complexity of the required Tablegen input and its dissimilarity with the rest of ODS-defined ops, both potentially resulting in very poor maintainability. Depends On D98327 Reviewed By: nicolasvasilache, springerm Differential Revision: https://reviews.llvm.org/D98328	2021-03-10 13:07:26 +01:00
Alex Zinenko	0af53de369	[mlir] simplify type constraints in AVX512 dialect VectorOfLengthAndType accepts a cartesian product of given lengths and types rather than types produced by co-indexed values in the corresponding lists. Update the definitions accordingly. The type validity is already enforced by op traits. Reviewed By: nicolasvasilache, springerm Differential Revision: https://reviews.llvm.org/D98327	2021-03-10 13:07:25 +01:00
Inho Seo	2ce4caf414	Moved getStaticLoopRanges and getStaticShape methods to LinalgInterfaces.td to add static shape verification It is to use the methods in LinalgInterfaces.cpp for additional static shape verification to match the shaped operands and loop on linalgOps. If I used the existing methods, I would face circular dependency linking issue. Now we can use them as methods of LinalgOp. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D98163	2021-03-10 04:06:22 -08:00
Christian Sigg	4d295cf5b5	[mlir] Add base class for GpuKernelToBlobPass Instead of configuring kernel-to-cubin/rocdl lowering through callbacks, introduce a base class that target-specific passes can derive from. Put the base class in GPU/Transforms, according to the discussion in D98203. The mlir-cuda-runner will go away shortly, and the mlir-rocdl-runner as well at some point. I therefore kept the existing code path working and will remove it in a separate step. Depends On D98168 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98279	2021-03-10 12:14:43 +01:00
Vladislav Vinogradov	f3bf5c053b	[mlir] Model MemRef memory space as Attribute Based on the following discussion: https://llvm.discourse.group/t/rfc-memref-memory-shape-as-attribute/2229 The goal of the change is to make memory space property to have more expressive representation, rather then "magic" integer values. It will allow to have more clean ASM form: ``` gpu.func @test(%arg0: memref<100xf32, "workgroup">) // instead of gpu.func @test(%arg0: memref<100xf32, 3>) ``` Explanation for `Attribute` choice instead of plain `string`: * `Attribute` classes allow to use more type safe API based on RTTI. * `Attribute` classes provides faster comparison operator based on pointer comparison in contrast to generic string comparison. * `Attribute` allows to store more complex things, like structs or dictionaries. It will allows to have more complex memory space hierarchy. This commit preserve old integer-based API and implements it on top of the new one. Depends on D97476 Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D96145	2021-03-10 12:57:27 +03:00
Hanhan Wang	d5d4fb635e	[mlir][linalg] Add support for using scalar attributes in TC ops. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97876	2021-03-10 01:51:12 -08:00
Mehdi Amini	75f3f77805	Fix MLIR test post `890afad954`	2021-03-09 23:30:51 +00:00
Mehdi Amini	890afad954	Fix Flang build after MLIR API changes around `generatedTypeParser`	2021-03-09 23:19:30 +00:00
River Riddle	a776ecb6c2	[mlir][IR] Add an Operation::eraseOperands that supports batch erasure This method allows for removing multiple disjoint operands at once, reducing the need to erase operands individually (which results in shifting the operand list). Differential Revision: https://reviews.llvm.org/D98290	2021-03-09 15:07:53 -08:00
River Riddle	4a7aed4ee7	[mlir][IR] Add a new SymbolUserMap class This class provides efficient implementations of symbol queries related to uses, such as collecting the users of a symbol, replacing all uses, etc. This provides similar benefits to use related queries, as SymbolTableCollection did for lookup queries. Differential Revision: https://reviews.llvm.org/D98071	2021-03-09 15:07:52 -08:00
Mehdi Amini	cd9a69289c	Fix LLVM Dialect LoopOptionsAttr round-tripping: the keywords were missing in the output This indicated some missing test coverage, which are now added to the roundtrip test.	2021-03-09 22:00:22 +00:00
Mehdi Amini	fe81e8f3b5	Add default LoopOptionsAttrBuilder constructor and method to check if empty() (NFC) Also move setters out-of-line to make sure the templated helper is actually instantiated.	2021-03-09 21:12:15 +00:00
Christian Sigg	840ff84d33	[mlir] Default for gpu-binary-annotation option. Provide default for gpuBinaryAnnotation so that we don't need to specify it in tests. The annotation likely only needs to be target specific if we want to lower to e.g. both CUDA and ROCDL. Reviewed By: herhut, bondhugula Differential Revision: https://reviews.llvm.org/D98168	2021-03-09 21:01:50 +01:00
Mehdi Amini	79f736c150	Switch generatedTypeParser/generatedAttributeParser to return an OptionalParseResult This allows the caller to distinguish between a parse error or an unmatched keyword. It fixes the redundant error that was emitted by the caller when the generated parser would fail. Differential Revision: https://reviews.llvm.org/D98162	2021-03-09 19:43:45 +00:00
Mehdi Amini	8205c1a90a	Rework LLVM Dialect LoopOptions attribute Instead of storing an array of LoopOpt attributes, which were just wrapping std::pair<enum, int> anyway, we can have an attribute storing a sorted ArrayRef<std::pair<enum, int>> as a single unit. This improves here the textual format and the general API. Note that we're limiting the options to fit into an int64_t by design, but this isn't a new constraint. Building the LoopOptions attribute is likely worth a specific builder for efficient reason, that'll be the subject of a future patch. Differential Revision: https://reviews.llvm.org/D98105	2021-03-09 19:43:45 +00:00
Lei Zhang	50000abe3c	[mlir] Use affine.apply when distributing to processors This makes it easy to compose the distribution computation with other affine computations. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D98171	2021-03-09 08:37:20 -05:00
Alex Zinenko	8184247f0b	[mlir] move LLVM target import header and tests Move Target/LLVMIR.h to target/LLVMIR/Import.h to better reflect the purpose of this file. Also move all LLVM IR target tests under the LLVMIR directory. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98178	2021-03-09 09:22:14 +01:00
Alex Zinenko	90fec5ed65	[mlir] make MLIRPresburger depend on MLIRIR The analysis library uses Location, which is defined in the MLIRIR library.	2021-03-09 09:19:53 +01:00
Vladislav Vinogradov	2241b3986c	[mlir][CMAKE] Fix cross-compilation build Use `MLIR_LINALG_ODS_GEN` and `MLIR_LINALG_ODS_YAML_GEN` variables instead of `MLIR_LINALG_ODS_GEN_EXE` and `MLIR_LINALG_ODS_YAML_GEN_EXE`. The former are defined in PARENT SCOPE only, so the `if` condition is never evaluates to `TRUE`. The logic should be the following (taken from tblgen part): 1. `TOOL_NAME` - CACHE variable (default equal to target name). User can override it to actual executable path. 2. `TOOL_NAME_EXE` - internal variable, initialized to `${TOOL_NAME}` first. In case of cross-compilation (`LLVM_USE_HOST_TOOLS == TRUE`) if user didn't set own path to native executable via `TOOL_NAME` variable, CMake will create separate targets to build native tool and will override `TOOL_NAME_EXE` to the executable produced by this target. 3. `TOOL_NAME_TARGET` - internal variable, which points to tool target name. If the native tool is built as described above, it will point to the target correspondant to that native tool. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98025	2021-03-09 10:51:56 +03:00
Tobias Gysi	c1a4cd551f	[mlir][linalg] refactor the result handling during vectorization. Return the vectorization results using a vector passed by reference instead of returning them embedded in a structure. Differential Revision: https://reviews.llvm.org/D98182	2021-03-09 07:11:57 +00:00
Stella Laurenzo	e31c77b182	[mlir][python] Reorganize MLIR python into namespace packages. * Only leaf packages are non-namespace packages. This allows most of the top levels to be split into different directories or deployment packages. In the previous state, the presence of __init__.py files at each level meant that the entire tree could only ever exist in one physical directory on the path. * This changes the API usage slightly: `import mlir` will no longer do a deep import of `mlir.ir`, etc. This may necessitate some client code changes. * Dialect gen code was restructured so that the user is responsible for providing the `my_dialect.py` file, which then must import its peer `_my_dialect_ops_gen`. This gives complete control of the dialect namespace to the user instead of to tablegen code, allowing further dialect-specific python APIs. * Correspondingly, the previous extension modules `_my_dialect.py` are now `_my_dialect_ops_ext.py`. * Now that the `linalg` namespace is open, moved the `linalg_opdsl` tool into it. * This may require some corresponding downstream adjustments to npcomp, circt, et al: * Probably some shallow imports need to be converted to deep imports (i.e. not `import mlir` brings in the world). * Each tablegen generated dialect now needs an explicit `foo.py` which does a `from ._foo_ops_gen import `. This is similar to the way that generated code operates in the C++ world. If providing dialect op extensions, those need to be moved from `_foo.py` -> `_foo_ops_ext.py`. Differential Revision: https://reviews.llvm.org/D98096	2021-03-08 23:01:34 -08:00
Mehdi Amini	038f2a337d	Move LLVM::FMFAttr definition to TableGen (NFC) This is using the new Attribute storage generation support in TableGen to define the LLVM FastMathFlags. Differential Revision: https://reviews.llvm.org/D98007	2021-03-09 05:29:54 +00:00
River Riddle	0d01dfbc37	[mlir][IR][NFC] Move the remaining builtin types to ODS This will allow for removing the duplicated type documentation from LangRef and instead link to the builtin dialect documentation. Differential Revision: https://reviews.llvm.org/D98093	2021-03-08 14:32:40 -08:00
River Riddle	a4bb667d83	[mlir][IR][NFC] Define the Location classes in ODS instead of C++ This also removes the need for LocationDetail.h. Differential Revision: https://reviews.llvm.org/D98092	2021-03-08 14:32:40 -08:00
Rob Suderman	cb3542e1ca	[MLIR][TOSA] Added lowerings for Reduce operations to Linalg Lowerings for min, max, prod, and sum reduction operations on int and float values. This includes reduction tests for both cases. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D97893	2021-03-08 10:57:19 -08:00
Christian Sigg	7cdcb4a3b9	[mlir] NFC: Add #endif comment.	2021-03-08 19:25:24 +01:00
Benjamin Kramer	42c195f0ec	[mlir][Shape] Allow shape.split_at to return extent tensors and lower it to std.subtensor split_at can return an error if the split index is out of bounds. If the user knows that the index can never be out of bounds it's safe to use extent tensors. This has a straight-forward lowering to std.subtensor. Differential Revision: https://reviews.llvm.org/D98177	2021-03-08 16:48:05 +01:00
Frederik Gossen	3b9667a84c	Clarify documentation for `Elementwise`, `Scalarizable`, `Vectorizable`, and `Tensorizable` traits. Differential Revision: https://reviews.llvm.org/D97841	2021-03-08 10:35:22 +01:00
Mehdi Amini	e94e55712c	Forward the `LLVM_ENABLE_LIBCXX` CMake parameter to the mlir standalone test This allows to build and test MLIR with `-DLLVM_ENABLE_LIBCXX=ON`.	2021-03-08 05:07:26 +00:00
KareemErgawy-TomTom	3fb384d50e	[MLIR][SPIRV] Rename `spv.selection` to `spv.mlir.selection`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. For ops that don't have a SPIR-V spec counterpart, we use spv.mlir.snake_case. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98014	2021-03-06 16:05:31 +01:00
Lei Zhang	bb6f5c8314	[mlir][spirv] Convert tensor.extract for very small tensors Normally tensors will be stored in buffers before converting to SPIR-V, given that is how a large amount of data is sent to the GPU. However, SPIR-V supports converting from tensors directly too. This is for the cases where the tensor just contains a small amount of elements and it makes sense to directly inline them as a small data array in the shader. To handle this, internally the conversion might create new local variables. SPIR-V consumers in GPU drivers may or may not optimize that away. So this has implications over register pressure. Therefore, a threshold is used to control when the patterns should kick in. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D98052	2021-03-06 08:03:36 -05:00
Mehdi Amini	f8fe6d9f3f	Use gen-dialect-doc instead of gen-op-doc for the Builtin dialect This is fixing the missing title and menu entry on the MLIR website.	2021-03-06 05:32:46 +00:00
Matthias Springer	acce0ea70c	[mlir][AVX512] Add mask.compress to AVX512 dialect. Adds mask.compress to the AVX512 dialect and defines a lowering to the LLVM dialect. Differential Revision: https://reviews.llvm.org/D97611	2021-03-06 10:02:48 +09:00
Mehdi Amini	a7cac0d9a5	Fix Dialect doc generation to special case for the Builtin dialect empty name This should fix the issue with an empty entry for the builtin dialect on the website. Differential Revision: https://reviews.llvm.org/D98074	2021-03-05 23:47:50 +00:00
Alex Zinenko	6410ee0d09	[mlir] Squash LLVM_ArmNeon dialect into ArmNeon The two dialects are largely redundant. The former was introduced as a mirror of the latter operating on LLVM dialect types. This is no longer necessary since the LLVM dialect operates on built-in types. Combine the two dialects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D98060	2021-03-05 23:33:32 +01:00
Aart Bik	e5c8fc776f	[mlir][vector] canonicalize unmasked gather/scatter/compress/expand directly into l/s With the new vector.load/store operations, there is no need to go through unmasked transfer operations (which will canonicalized to l/s anyway). Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D98056	2021-03-05 14:23:50 -08:00
Diego Caballero	2de6dbda66	[mlir] Add 'Skip' result to Operation visitor This patch is a follow-up on D97217. It adds a new 'Skip' result to the Operation visitor so that a callback can stop the ongoing visit of an operation/block/region and continue visiting the next one without fully interrupting the walk. Skipping is needed to be able to erase an operation/block in pre-order and do not continue visiting the internals of that operation/block. Related to the skipping mechanism, the patch also introduces the following changes: * Added new TestIRVisitors pass with basic testing for the IR visitors. * Fixed missing early increment ranges in visitor implementation. * Updated documentation of walk methods to include erasure information and walk order information. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97820	2021-03-06 00:02:20 +02:00
Diego Caballero	71a86245ca	[mlir] Extend Operation visitor with pre-order traversal This patch extends the Region, Block and Operation visitors to also support pre-order walks. We introduce a new template argument that dictates the walk order (only pre-order and post-order are supported for now). The default order for Regions, Blocks and Operations is post-order. Mixed orders (e.g., Region/Block pre-order + Operation post-order) could easily be implemented, as shown in NumberOfExecutions.cpp. Reviewed By: rriddle, frgossen, bondhugula Differential Revision: https://reviews.llvm.org/D97217	2021-03-06 00:02:20 +02:00
Diego Caballero	b635492c3f	[mlir][Affine][NFC] Return BlockArgument in AffineForOp::getInductionVar This avoids unnecessary casts when a BlockArgument is required. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D97879	2021-03-06 00:02:19 +02:00
KareemErgawy-TomTom	d48ceb45e3	[MLIR][SPIRV] Rename `spv.undef` to `spv.Undef`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. For ops that don't have a SPIR-V spec counterpart, we use spv.mlir.snake_case. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D98016	2021-03-05 15:49:44 -05:00
River Riddle	f175ba4a54	[mlir][AsmPrinter] Don't use string comparison when filtering list attributes In .mlir modules with larges amounts of attributes, e.g. a function with a larger number of argument attributes, the string comparison filtering greatly affects compile time. This revision switches to using a SmallDenseSet in these situations, resulting in over a 10x speed up in some situations. Differential Revision: https://reviews.llvm.org/D97980	2021-03-05 12:47:05 -08:00
KareemErgawy-TomTom	29812a6195	[MLIR][SPIRV] Rename `spv.loop` to `spv.mlir.loop`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97918	2021-03-05 15:44:30 -05:00
Stella Laurenzo	0b5f1b859f	[mlir][linalg] Add linalg_opdsl tool first draft. * Mostly imported from experimental repo as-is with cosmetic changes. * Temporarily left out emission code (for building ops at runtime) to keep review size down. * Documentation and lit tests added fresh. * Sample op library that represents current Linalg named ops included. Differential Revision: https://reviews.llvm.org/D97995	2021-03-05 11:45:09 -08:00
Stella Laurenzo	a9ccdfbc7d	NFC: Glob all python sources in the MLIR Python bindings. * Also switches to use symlinks vs copy as that enables edit-and-continue python development. * Broken out of https://reviews.llvm.org/D97995 per request from reviewer. Differential Revision: https://reviews.llvm.org/D98005	2021-03-05 10:21:02 -08:00
Aart Bik	adc35b689f	[mlir][sparse] mask reduction update Reduction updates should be masked, just like the load and stores. Note that alternatively, we could use the fact that masked values are zero of += updates and mask invariants to get this working but that would not work for *= updates. Masking the update itself is cleanest. This change also replaces the constant mask with a broadcast of "true" since this constant folds much better for various folding patterns. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D98000	2021-03-05 08:56:10 -08:00
Nicolas Vasilache	c86d3c1a38	[mlir][Linalg] Fix order of dimensions in hoistPaddingOnTensors.	2021-03-05 15:11:35 +00:00
Christian Sigg	5fedf30748	[mlir] Make cuInit() call thread-safe. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D98024	2021-03-05 16:06:15 +01:00
Nicolas Vasilache	35908406dc	[mlir][scf] Canonicalize scf.for last tensor iteration result. Canonicalize the iter_args of an scf::ForOp that involve a tensor_load and for which only the last loop iteration is actually visible outside of the loop. The canonicalization looks for a pattern such as: ``` %t0 = ... : tensor_type %0 = scf.for ... iter_args(%bb0 : %t0) -> (tensor_type) { ... // %m is either tensor_to_memref(%bb00) or defined above the loop %m... : memref_type ... // uses of %m with potential inplace updates %new_tensor = tensor_load %m : memref_type ... scf.yield %new_tensor : tensor_type } ``` `%bb0` may have either 0 or 1 use. If it has 1 use it must be exactly a `%m = tensor_to_memref %bb0` op that feeds into the yielded `tensor_load` op. If no aliasing write of `%new_tensor` occurs between tensor_load and yield then the value %0 visible outside of the loop is the last `tensor_load` produced in the loop. For now, we approximate the absence of aliasing by only supporting the case when the tensor_load is the operation immediately preceding the yield. The canonicalization rewrites the pattern as: ``` // %m is either a tensor_to_memref or defined above %m... : memref_type scf.for ... { // no iter_args ... // uses of %m with potential inplace updates } %0 = tensor_load %m : memref_type ``` Differential revision: https://reviews.llvm.org/D97953	2021-03-05 09:42:19 +00:00
KareemErgawy-TomTom	c74eb466d2	[MLIR][SPIRV] Rename `spv.globalVariable` to `spv.GlobalVariable`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97919	2021-03-04 16:24:59 -05:00
KareemErgawy-TomTom	5abdca47b3	[MLIR][SPIRV] Rename `spv.constant` to `spv.Constant`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from `spv.camelCase` to `spv.CamelCase` everywhere. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97917	2021-03-04 16:15:56 -05:00
KareemErgawy-TomTom	4d90e460bc	[MLIR][SPIRV] Rename `spv.spcConstant...` to `spv.SpcConstant...`. To unify the naming scheme across all ops in the SPIR-V dialect, we are moving from spv.camelCase to spv.CamelCase everywhere. Differential Revision: https://reviews.llvm.org/D97920	2021-03-04 16:07:41 -05:00
River Riddle	2f37cdd569	[mlir][IR][NFC] Move a majority of the builtin attributes to ODS Now that attributes can be generated using ODS, we can move the builtin attributes as well. This revision removes a majority of the builtin attributes with a few left for followup revisions. The attributes moved to ODS in this revision are: AffineMapAttr, ArrayAttr, DictionaryAttr, IntegerSetAttr, StringAttr, SymbolRefAttr, TypeAttr, and UnitAttr. Differential Revision: https://reviews.llvm.org/D97591	2021-03-04 13:04:06 -08:00
River Riddle	1447ec5182	[mlir][AttrDefGen] Add support for specifying the value type of an attribute The value type of the attribute can be specified by either overriding the typeBuilder field on the AttrDef, or by providing a parameter of type `AttributeSelfTypeParameter`. This removes the need to define custom storage class constructors for attributes that have a value type other than NoneType. Differential Revision: https://reviews.llvm.org/D97590	2021-03-04 13:04:05 -08:00
River Riddle	6bc767cd07	[mlir] Add a DialectAsmParser::getChecked method This function simplifies calling the getChecked methods on Attributes and Types from within the parser, and removes any need to use `getEncodedSourceLocation` for these methods (by using an SMLoc instead). This is much more efficient than using an mlir::Location, as the encoding process to produce an mlir::Location is inefficient and undesirable for parsing (locations used during parsing should not persist afterwards unless otherwise necessary). Differential Revision: https://reviews.llvm.org/D97900	2021-03-04 11:53:24 -08:00
Ahmed Taei	da1e37a8b0	Fold full-size subview of static shapes. Differential Revision: https://reviews.llvm.org/D97429	2021-03-04 09:52:06 -08:00
Nicolas Vasilache	f21d78633a	[mlir] Tighten the rules around folding TensorLoadOp `tensor_load(tensor_to_memref(x)) -> x` is an incorrect folding because it ignores potential aliasing. This revision approximates no-aliasing by restricting the folding to occur only when tensor_to_memref is immediately preceded by tensor_load in the same block. This is a conservative step back towards correctness until better alias analysis becomes available. Context: https://llvm.discourse.group/t/properly-using-bufferization-related-passes/2913/6 Differential Revision: https://reviews.llvm.org/D97957	2021-03-04 17:48:09 +00:00
Arpith C. Jacob	4e393350c5	[mlir] Add an AccessGroup attribute to load/store LLVM dialect ops and generate the access_group LLVM metadata. This also includes LLVM dialect ops created from intrinsics. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D97944	2021-03-04 18:17:23 +01:00
Hanhan Wang	b47c6c686c	[mlir][linalg] Add suffix "Op" to pooling TC ops. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D97946	2021-03-04 07:08:30 -08:00
Nicolas Vasilache	f3cc854364	[mlir][Vector] Add folding of vector transfers from/into tensor producing ops. Add a folder to rewrite a sequence such as: ``` %t1 = ... %v = vector.transfer_read %t0[%c0...], {masked = [false...]} : tensor<static_sizesxf32>, vector<static_sizesxf32> %t2 = vector.transfer_write %v, %t1[%c0...] {masked = [false...]} : vector<static_sizesxf32>, tensor<static_sizesxf32> ``` into: ``` %t0 ``` The producer of t1 may or may not be DCE'd depending on whether it is a block argument or has side effects. Differential revision: https://reviews.llvm.org/D97934	2021-03-04 14:17:42 +00:00
Nicolas Vasilache	a756f12b4d	[mlir][Linalg] Add folding of linalg.copy that are in fact identities. Differential Revision: https://reviews.llvm.org/D97939	2021-03-04 13:37:26 +00:00
Nicolas Vasilache	4f4f3f1e59	[mlir] NFC - Add runner util functions to only print MemRef metadata. These are useful to debug execution, without having to print the whole content of a memref.	2021-03-04 12:35:45 +00:00
Nicolas Vasilache	05882157db	[mlir][Linalg] NFC - Add isOutputTensor to LinalgInterfaces.td	2021-03-04 12:33:21 +00:00
Christian Sigg	f69d5a7fc7	[mlir] Initialize CUDA context lazily. So we can remove the ignore-warning pragma again. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D97864	2021-03-04 13:07:56 +01:00
Alex Zinenko	32c49c7d73	[mlir] ODS: change OpBuilderDAG to OpBuilder We no longer have the non-DAG version. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97856	2021-03-04 10:55:02 +01:00
Alex Zinenko	19db802e7b	[mlir] make implementations of translation to LLVM IR interfaces private There is no need for the interface implementations to be exposed, opaque registration functions are sufficient for all users, similarly to passes. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97852	2021-03-04 09:16:32 +01:00
Arpith C. Jacob	4a2930f495	[mlir] Add loop codegen options to some LLVM dialect ops. Add a Loop Option attribute and generate llvm metadata attached to branch instructions to control code generation. Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D96820	2021-03-04 09:01:57 +01:00
Aart Bik	553cb6d473	[mlir][sparse] fix bug in reduction chain Found with exhaustive testing, it is possible that a while loop appears in between chainable for loops. As long as we don't scalarize reductions in while loops, this means we need to terminate the chain at the while. This also refactors the reduction code into more readable helper methods. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97886	2021-03-03 17:38:22 -08:00
River Riddle	83ef862fad	[mlir] Add support for generating Attribute classes for ODS The support for attributes closely maps that of Types (basically 1-1) given that Attributes are defined in exactly the same way as Types. All of the current ODS TypeDef classes get an Attr equivalent. The generation of the attribute classes themselves share the same generator as types. Differential Revision: https://reviews.llvm.org/D97589	2021-03-03 16:41:49 -08:00
River Riddle	e07c968a6d	[mlir][pdl][NFC] Rename InputOp to OperandOp This better matches the actual IR concept that is being modeled, and is consistent with how the rest of PDL is structured. Differential Revision: https://reviews.llvm.org/D95718	2021-03-03 15:48:00 -08:00
River Riddle	55f878bad9	[mlir][pdl] Add a new !pdl.range<> type This type represents a range of positional values. It will be used in followup revisions to add support for variadic constructs to PDL, such as operand and result ranges. Differential Revision: https://reviews.llvm.org/D95717	2021-03-03 15:48:00 -08:00
River Riddle	3dfa86149e	[mlir][IR] Refactor the internal implementation of Value The current implementation of Value involves a pointer int pair with several different kinds of owners, i.e. BlockArgumentImpl, Operation , TrailingOpResult. This design arose from the desire to save memory overhead for operations that have a very small number of results (generally 0-2). There are, unfortunately, many problematic aspects of the current implementation that make Values difficult to work with or just inefficient. Operation result types are stored as a separate array on the Operation. This is very inefficient for many reasons: we use TupleType for multiple results, which can lead to huge amounts of memory usage if multi-result operations change types frequently(they do). It also means that simple methods like Value::getType/Value::setType now require complex logic to get to the desired type. Value only has one pointer bit free, severely limiting the ability to use it in things like PointerUnion/PointerIntPair. Given that we store the kind of a Value along with the "owner" pointer, we only leave one bit free for users of Value. This creates situations where we end up nesting PointerUnions to be able to use Value in one. As noted above, most of the methods in Value need to branch on at least 3 different cases which is both inefficient, possibly error prone, and verbose. The current storage of results also creates problems for utilities like ValueRange/TypeRange, which want to efficiently store base pointers to ranges (of which Operation isn't really useful as one). This revision greatly simplifies the implementation of Value by the introduction of a new ValueImpl class. This class contains all of the state shared between all of the various derived value classes; i.e. the use list, the type, and the kind. This shared implementation class provides several large benefits: * Most of the methods on value are now branchless, and often one-liners. * The "kind" of the value is now stored in ValueImpl instead of Value This frees up all of Value's pointer bits, allowing for users to take full advantage of PointerUnion/PointerIntPair/etc. It also allows for storing more operation results as "inline", 6 now instead of 2, freeing up 1 word per new inline result. * Operation result types are now stored in the result, instead of a side array This drops the size of zero-result operations by 1 word. It also removes the memory crushing use of TupleType for operations results (which could lead up to hundreds of megabytes of "dead" TupleTypes in the context). This also allowed restructured ValueRange, making it simpler and one word smaller. This revision does come with two conceptual downsides: * Operation::getResultTypes no longer returns an ArrayRef<Type> This conceptually makes some usages slower, as the iterator increment is slightly more complex. * OpResult::getOwner is slightly more expensive, as it now requires a little bit of arithmetic From profiling, neither of the conceptual downsides have resulted in any perceivable hit to performance. Given the advantages of the new design, most compiles are slightly faster. Differential Revision: https://reviews.llvm.org/D97804	2021-03-03 14:33:37 -08:00
David Blaikie	4fda0dc14b	Fix use of deprecated API	2021-03-03 14:07:28 -08:00
MaheshRavishankar	c118fdcd59	[mlir] Remove incorrect folding for SubTensorInsertOp The SubTensorInsertOp has a requirement that dest type and result type match. Just folding the tensor.cast operation violates this and creates verification errors during canonicalization. Also fix other canonicalization methods that werent inserting casts properly. Differential Revision: https://reviews.llvm.org/D97800	2021-03-03 13:58:05 -08:00
Hanhan Wang	83c56aa4ee	[mlir][linalg] Add depthwise_conv_2d_input_nhwc_filter_hwcf to Linalg TC ops. Different from the definition in Tensorflow and TOSA, the output is [N,H,W,C,M]. This can make transforms easier in LinAlg because the indexing maps are plain. E.g., to determine if the fill op has dependency between the depthwise conv op, the current pipeline only recognizes the dep if they are all projected affine map. Reviewed By: asaadaldien Differential Revision: https://reviews.llvm.org/D97798	2021-03-03 11:47:02 -08:00
Mehdi Amini	13cb431719	Add basic JIT Python Bindings This offers the ability to create a JIT and invoke a function by passing ctypes pointers to the argument and the result. Differential Revision: https://reviews.llvm.org/D97523	2021-03-03 18:19:40 +00:00
Mehdi Amini	86c8a7857d	Add C bindings for mlir::ExecutionEngine This adds minimalistic bindings for the execution engine, allowing to invoke the JIT from the C API. This is still quite early and experimental and shouldn't be considered stable in any way. Differential Revision: https://reviews.llvm.org/D96651	2021-03-03 18:19:40 +00:00
Hanhan Wang	497b7b8c00	[mlir][linalg] Delete unused vars if there are shaped-only operands. Reviewed By: stella.stamenova Differential Revision: https://reviews.llvm.org/D97851	2021-03-03 09:36:08 -08:00
MaheshRavishankar	5d7e0a23c6	[mlir] Add LinalgInterface method to clone with a given BlockAndValueMapping. Since Linalg operations have regions by default which are not isolated from above, add an another method to the interface that will take a BlockAndValueMapping to remap the values within the region as well. Differential Revision: https://reviews.llvm.org/D97709	2021-03-03 09:25:20 -08:00
Benjamin Kramer	73cb58dc48	[mlir][Shape] Lower cstr_eq to shape_eq + assert Differential Revision: https://reviews.llvm.org/D97860	2021-03-03 17:22:28 +01:00
Benjamin Kramer	24acadef8a	[mlir][Shape] Make shape_eq nary This gets rid of a dubious shape_eq %a, %a fold, that folds shape_eq even if %a is not an Attribute. Differential Revision: https://reviews.llvm.org/D97728	2021-03-03 16:26:40 +01:00
Benjamin Kramer	c714b441ef	[mlir][Shape] Make cstr_eq more like cstr_broadcastable This includes allowing extents and not just shapes. Differential Revision: https://reviews.llvm.org/D97716	2021-03-03 16:20:05 +01:00
Vladislav Vinogradov	5d613e42d3	[mlir][ODS] Use StringLiteral instead of StringRef when applicable Use `StringLiteral` for function return type if it is known to return constant string literals only. This will make it visible to API users, that such values can be safely stored, since they refers to constant data, which will never be deallocated. `StringRef` is general is not safe to store for a long term, since it might refer to temporal data allocated in heap. Add `inline` and `constexpr` methods support to `OpMethod`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97390	2021-03-03 16:15:12 +03:00
Vladislav Vinogradov	e3b350ce83	[mlir][NFC] Use `BaseMemRefType::isValidElementType` in Parser Instead of hard-coded check. Reviewed By: ftynse, rriddle Differential Revision: https://reviews.llvm.org/D97612	2021-03-03 16:00:54 +03:00
Christian Sigg	b6ac26fce5	[mlir] Silence -Wglobal-constructors error in CudaRuntimeWrapper.cpp Until I have a better solution with dynamic initialization, to get the nvidia build bot green again.	2021-03-03 13:48:03 +01:00
Christian Sigg	9d7be77bf9	[mlir] Move cuda tests Move test inputs to test/Integration directory. Move runtime wrappers to ExecutionEngine. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97463	2021-03-03 13:16:51 +01:00
Aart Bik	5b333d3449	[mlir][sparse] do not ignore ordering for "dense" tensor linked with sparse type Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97795	2021-03-02 15:21:51 -08:00
Hanhan Wang	c0f8115c73	[mlir][linalg] Only generate one var for an attrUse. Some variables are unused after D97383 landed. We should generate one symbol for one attrUse. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D97794	2021-03-02 12:48:20 -08:00
Frederik Gossen	bcc9b371e4	Split `ElementwiseMappable` trait into four more precise traits. Some elementwise operations are not scalarizable, vectorizable, or tensorizable. Split `ElementwiseMappable` trait into the following, more precise traits. - `Elementwise` - `Scalarizable` - `Vectorizable` - `Tensorizable` This allows for reuse of `Elementwise` in dialects like HLO. Differential Revision: https://reviews.llvm.org/D97674	2021-03-02 15:31:19 +01:00
Christian Sigg	caa5144d56	[mlir] Use mlir::OpState::operator->() to get to Operation::getAttrs(). This is a preparation step to remove getAttrs() from OpState.	2021-03-02 13:29:27 +01:00
KareemErgawy-TomTom	3b021fbdc0	[MLIR][LinAlg] Detensorize interal function control flow. This patch continues detensorizing implementation by detensoring internal control flow in functions. In order to detensorize functions, all the non-entry block's arguments are detensored and branches between such blocks are properly updated to reflect the detensored types as well. Function entry block (signature) is left intact. This continues work towards handling github/google/iree#1159. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D97148	2021-03-02 11:46:20 +01:00
Vladislav Vinogradov	37eca08e5b	[mlir][NFC] Rename `MemRefType::getMemorySpace` to `getMemorySpaceAsInt` Just a pure method renaming. It is a preparation step for replacing "memory space as raw integer" with more generic "memory space as attribute", which will be done in separate commit. The `MemRefType::getMemorySpace` method will return `Attribute` and become the main API, while `getMemorySpaceAsInt` will be declared as deprecated and will be replaced in all in-tree dialects (also in separate commits). Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D97476	2021-03-02 11:08:54 +03:00
Stella Laurenzo	6d2fd3d9cd	[mlir][linalg] Replace monomorphic contration ops with polymorphic variants. * Moves `batch_matmul`, `matmul`, `matvec`, `vectmat`, `dot` to the new mechanism. * This is not just an NFC change, in addition to using a new code generation mechanism, it also activates symbolic casting, allowing mixed precision operands and results. * These definitions were generated from DSL by the tool: https://github.com/stellaraccident/mlir-linalgpy/blob/main/mlir_linalg/oplib/core.py (will be upstreamed in a subsequent set of changes). Reviewed By: nicolasvasilache, ThomasRaoux Differential Revision: https://reviews.llvm.org/D97719	2021-03-01 21:19:53 -08:00
Stella Laurenzo	d36a15de1f	[mlir][linalg] Memoize indexing map generation. Differential Revision: https://reviews.llvm.org/D97602	2021-03-01 21:15:40 -08:00
MaheshRavishankar	a9e68db973	[mlir] Add canonicaliations for subtensor_insert operation. Add canonicalizers to subtensor_insert operations need canonicalizers that propagate the constant arguments within offsets, sizes and strides. Also add pattern to propogate tensor_cast operations. Differential Revision: https://reviews.llvm.org/D97704	2021-03-01 14:59:18 -08:00
Stella Stamenova	801067f4c0	[mlir][lldb] Fix several gcc warnings in mlir and lldb These warnings are raised when compiling with gcc due to either having too few or too many commas, or in the case of lldb, the possibility of a nullptr. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97586	2021-03-01 13:48:22 -08:00
Rob Suderman	087bc20fe4	[MLIR][TOSA] Lower tosa.transpose to linalg.generic Lowers the transpose operation to a generic linalg op when permutations is a constant value. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D97508	2021-03-01 11:09:49 -08:00
Ahmed Taei	ea7f211b2e	[mlir] Add polynomial approximation for math::ExpOp Similar to fast_exp in https://github.com/boulos/syrah Differential Revision: https://reviews.llvm.org/D97599	2021-03-01 11:02:07 -08:00
Jacques Pienaar	87e05eb03b	Revert "Remove use of tuple for multiresult type storage" This reverts commit `08f0764ff5`.	2021-03-01 10:39:41 -08:00
Jacques Pienaar	08f0764ff5	Remove use of tuple for multiresult type storage Move the results in line with the op instead. This results in each operation having its own types recorded vs single tuple type, but comes at benefit that every mutation doesn't incurs uniquing. Ran into cases where updating result type of operation led to very large memory usage. Differential Revision: https://reviews.llvm.org/D97652	2021-03-01 09:30:24 -08:00
Jacques Pienaar	2f0b4db5ea	[mlir] Add convenience grouping for tensor type inference For ops that produces tensor types and implement the shaped type component interface, the type inference interface can be used. Create a grouping of these together to make it easier to specify (it cannot be added into a list of traits, but must rather be appended/concated to one as it isn't a trait but a list of traits). Differential Revision: https://reviews.llvm.org/D97636	2021-03-01 05:21:08 -08:00
Aart Bik	6afaea6682	[mlir][sparse] fixed inaccury in maintaining universal index The universal index was maintained if dense indices were still in place, and lattice points followed. However, it should only be kept if any of those following lattice points actually consumes the universal index. This change also fixes an inaccuracy with a missing broadcast around vector invariant. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97594	2021-02-27 17:32:57 -08:00
Stella Laurenzo	769cd20bdb	NFC: Adds labels to a linalg test and extend a missing case.	2021-02-27 16:00:18 -08:00
Stella Laurenzo	2ceedc3a20	[mlir][linalg] Add symbolic type conversion to linalg named ops. This enables this kind of construct in the DSL to generate a named op that is polymorphic over numeric type variables `T` and `U`, generating the correct arithmetic casts at construction time: ``` @tc_def_op def polymorphic_matmul(A=TensorDef(T1, S.M, S.K), B=TensorDef(T2, S.K, S.N), C=TensorDef(U, S.M, S.N, output=True)): implements(ContractionOpInterface) C[D.m, D.n] += cast(U, A[D.m, D.k]) * cast(U, B[D.k, D.n]) ``` Presently, this only supports type variables that are bound to the element type of one of the arguments, although a further extension that allows binding a type variable to an attribute would allow some more expressiveness and may be useful for some formulations. This is left to a future patch. In addition, this patch does not yet materialize the verifier support which ensures that types are bound correctly (for such simple examples, failing to do so will yield IR that fails verification, it just won't yet fail with a precise error). Note that the full grid of extensions/truncation/int<->float conversions are supported, but many of them are lossy and higher level code needs to be mindful of numerics (it is not the job of this level). As-is, this should be sufficient for most integer matmul scenarios we work with in typical quantization schemes. Differential Revision: https://reviews.llvm.org/D97603	2021-02-27 15:52:35 -08:00
Stella Laurenzo	5867c18e2c	[mlir][linalg] Generate additional interfaces for named ops. * Adds ContractionOpInterface to polymorphic_matmul. Differential Revision: https://reviews.llvm.org/D97601	2021-02-27 15:43:41 -08:00
Mehdi Amini	014575932f	Fix Block::eraseArguments: keep track the first removed element while removing Not only this is likely more efficient than BitVector::find_first(), but also if the BitVector is empty find_first() returns -1, which llvm::drop_begin isn't robust against.	2021-02-27 19:18:09 +00:00
Mehdi Amini	7b06786de2	Fix Block::eraseArguments to properly update the cached positions This is fixing correctness and ASAN failure post-ee90bb3486948.	2021-02-27 19:04:12 +00:00
Mehdi Amini	a6d76cf610	Change the error message when `-split-input-file` is used with mlir-opt to make it recognizable by IDEs By adding the line number of the split point immediately after the file name (separated by `:`) this is recognized by various tool as a proper location. Ideally we would want to point to the line of the error, but that would require some very invasive changes I suspect. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D93363	2021-02-27 18:32:34 +00:00
Jacques Pienaar	dd2f50a4d0	[mlir] Improve test coverage for print-op-graph	2021-02-27 10:18:38 -08:00
Mehdi Amini	ee90bb3486	Store (cache) the Argument number (index in the argument list) inside the BlockArgumentImpl This avoids linear search in BlockArgument::getArgNumber(). Differential Revision: https://reviews.llvm.org/D97596	2021-02-27 17:21:08 +00:00
River Riddle	e6260ad043	[mlir] Simplify various pieces of code now that Identifier has access to the Context/Dialect This also exposed a bug in Dialect loading where it was not correctly identifying identifiers that had the dialect namespace as a prefix. Differential Revision: https://reviews.llvm.org/D97431	2021-02-26 18:00:05 -08:00
Rob Suderman	16abacaea9	[MLIR][TOSA] Resubmit Tosa to Standard/SCF Lowerings (const, if, while)" Includes a lowering for tosa.const, tosa.if, and tosa.while to Standard/SCF dialects. TosaToStandard is used for constant lowerings and TosaToSCF handles the if/while ops. Resubmission of https://reviews.llvm.org/D97518 with ASAN fixes. Differential Revision: https://reviews.llvm.org/D97529	2021-02-26 17:44:12 -08:00
Jacques Pienaar	91ab48ea6b	[mlir] Add regions to OpAdaptor Allows querying regions too via OpAdaptor's generated. This does not yet move region verification to adaptor nor require regions for ops where needed. Differential Revision: https://reviews.llvm.org/D97519	2021-02-26 16:25:00 -08:00
Rob Suderman	f685c9ac86	[MLIR][TOSA] Lower tosa.identity and tosa.identitiyn to linalg Both identity ops can be loweried by replacing their results with their inputs. We keep this as a linalg lowering as other backends may choose to create copies. Differential Revision: https://reviews.llvm.org/D97517	2021-02-26 15:45:07 -08:00
Aart Bik	df5ccf5a94	[mlir][vector] add higher dimensional support to gather/scatter Similar to mask-load/store and compress/expand, the gather and scatter operation now allow for higher dimension uses. Note that to support the mixed-type index, the new syntax is: vector.gather %base [%i,%j] [%kvector] .... The first client of this generalization is the sparse compiler, which needs to define scatter and gathers on dense operands of higher dimensions too. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97422	2021-02-26 14:20:19 -08:00
Rob Suderman	caccddc52a	[MLIR][TOSA] Lower tosa.reshape to linalg.reshape Lowering from the tosa.reshape op to linalg.reshape. For same-rank or non-collapsed/expanded cases two linalg.reshapes are inserted. Differential Revision: https://reviews.llvm.org/D97439	2021-02-26 12:57:57 -08:00
Geoffrey Martin-Noble	21bb63893e	[MLIR][linalg] Make integer matmul ops cast before multiplying Right now they multiply before casting which means they would frequently overflow. There are various reasonable ways to do this, but until we have robust op description infra, this is a simple and safe default. More careful treatments are likely to be hardware specific, as well (e.g. using an i8*i8->i16 mul instruction). Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D97505	2021-02-26 08:36:31 -08:00
Hanhan Wang	bfd3771c9e	[mlir][linalg] Add pooling ops to Linalg TC ops. - Add EDSC builders for std_cmpf_ogt and std_cmpf_olt. - Add pooling_nhwc_min/max/sum ops Depends On D97384 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97385	2021-02-26 07:18:03 -08:00
Hanhan Wang	855a119604	[mlir][linalg] Allow TC ops taking an unused shaped operand. If one operand is not used in the formula, it will be considered a shaped operand. And the result of indexing map of the operand will be the first reduction dims. Depends On D97383 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97384	2021-02-26 06:45:56 -08:00
Benjamin Kramer	4941fef9c4	[mlir] Silence some deprecation warnings after `dffc487b07`	2021-02-26 15:15:56 +01:00
Marius Brehler	56774bdda5	[mlir] Replace deprecated 'getAttrs' 'getAttrs' has been explicitly marked deprecated. This patch refactors to use Operation::getAttrs(). Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D97546	2021-02-26 14:52:40 +01:00
Christian Sigg	dffc487b07	[mlir] Mark OpState::removeAttr() deprecated. Fix call sites. The method will be removed 2 weeks later. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D97530	2021-02-26 12:04:41 +01:00

1 2 3 4 5 ...

7097 Commits