llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-11-29 16:41:27 +00:00

Author	SHA1	Message	Date
Geoffrey Martin-Noble	db011775e4	Reland "[MLIR] Make structured op tests permutation invariant" Relands with fix swapping DEPENDS for LINK_LIBS. This reverts commit `cd8cc00b9e`. Differential Revision: https://reviews.llvm.org/D97011	2021-02-18 18:09:49 -08:00
Jing Pu	d690cbf821	Add DivOp to the Shape dialect Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D96907	2021-02-18 16:58:47 -08:00
Mehdi Amini	cd8cc00b9e	Revert "[MLIR] Make structured op tests permutation invariant" This reverts commit `b9ff67099a`. The build is broken with -DBUILD_SHARED_LIBS=ON	2021-02-19 00:16:45 +00:00
Geoffrey Martin-Noble	b9ff67099a	[MLIR] Make structured op tests permutation invariant Extracts the relevant dimensions from the map under test to build up the maps to test against in a permutation-invariant way. This also includes a fix to the indexing maps used by isColumnMajorMatmul. The maps as currently written do not describe a column-major matmul. The linalg named op column_major_matmul has the correct maps (and notably fails the current test). If `C = matmul(A, B)` we want an operation that given A in column major format and B in column major format produces C in column major format. Given that for a matrix, faux column major is just transpose. `column_major_matmul(transpose(A), transpose(B)) = transpose(C)`. If `A` is `NxK` and `B` is `KxM`, then `C` is `NxM`, so `transpose(A)` is `KxN`, `transpose(B)` is `MxK` and `transpose(C)` is `MxN`, not `NxM` as these maps currently have. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96984	2021-02-18 14:36:07 -08:00
Nicolas Vasilache	b006902b2d	[mlir] Fold trivial subtensor / subtensor_insert ops. Static subtensor / subtensor_insert of the same size as the source / destination tensor and root @[0..0] with strides [1..1] are folded away. Differential revision: https://reviews.llvm.org/D96991	2021-02-18 21:34:55 +00:00
Nicolas Vasilache	8e01e2ec0f	[mlir][Vector] Fold tensor_cast + vector.transfer_read Differential Revision: https://reviews.llvm.org/D96988	2021-02-18 20:47:16 +00:00
Andrew Pritchard	08c681f645	Perform memory accesses in the same addrspace as the corresponding memref. It's not necessarily the case on all architectures that all memory is addressable in addrspace 0, so casting the pointer to addrspace 0 is liable to cause problems. Reviewed By: aartbik, ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D96380	2021-02-18 12:36:16 -08:00
natashaknk	25b4a6a7f0	[MLIR][TOSA] Add lowering from TOSA to Linalg for math-based and elementwise ops This patch adds lowering to Linalg for the following TOSA ops: negate, rsqrt, mul, select, clamp and reluN and includes support for signless integer and floating point types Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D96924	2021-02-18 12:10:10 -08:00
Alexander Belyaev	624fccba87	[mlir] Add `linalg.tiled_loop` op. `subtensor_insert` was used instead of `linalg.subtensor_yield` to make this PR smaller. Verification will be added in a follow-up PR. Differential Revision: https://reviews.llvm.org/D96943	2021-02-18 13:23:00 +01:00
Alexander Belyaev	a89035d750	Revert "[MLIR] Create memref dialect and move several dialect-specific ops from std." This commit introduced a cyclic dependency: Memref dialect depends on Standard because it used ConstantIndexOp. Std depends on the MemRef dialect in its EDSC/Intrinsics.h Working on a fix. This reverts commit `8aa6c3765b`.	2021-02-18 12:49:52 +01:00
Julian Gross	8aa6c3765b	[MLIR] Create memref dialect and move several dialect-specific ops from std. Create the memref dialect and move several dialect-specific ops without dependencies to other ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp DeallocOp -> MemRef_DeallocOp MemRefCastOp -> MemRef_CastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp TransposeOp -> MemRef_TransposeOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D96425	2021-02-18 11:29:39 +01:00
Alex Zinenko	12875ed976	[mlir] generate enum translation functions with unused attribute The functions translating enums to LLVM IR are generated in a single file included in many places, not all of which use all translations. Generate functions with "unused" attribute to silence compiler warnings. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96880	2021-02-18 09:44:40 +01:00
Aart Bik	ff6c84b803	[mlir][sparse] generalize sparse storage format to many more types Rationale: Narrower types for overhead storage yield a smaller memory footprint for sparse tensors and thus needs to be supported. Also, more value types need to be supported to deal with all kinds of kernels. Since the "one-size-fits-all" sparse storage scheme implementation is used instead of actual codegen, the library needs to be able to support all combinations of desired types. With some crafty templating and overloading, the actual code for this is kept reasonably sized though. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D96819	2021-02-17 18:20:23 -08:00
Alex Zinenko	4a3473ff3b	[mlir] silence unused-function warnings in table-generated code Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96695	2021-02-17 19:28:31 +01:00
Eugene Zhulenev	519f5917b4	[mlir] Add fma operation to std dialect Will remove `vector.fma` operation in the followup CLs. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96801	2021-02-17 10:06:01 -08:00
Hanhan Wang	c80484e16e	[mlir][StandardToSPIRV] Add support for lowering trunci to SPIR-V to i1 types. Add a pattern to converting some value to a boolean. spirv.S/UConvert does not work on i1 types. Thus, the pattern is lowered to cmpi + select. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D96851	2021-02-17 07:23:41 -08:00
Weiwei Li	7742620620	[mlir][spirv] Add spv.GLSL.FrexpStruct co-authored-by: Alan Liu <alanliu.yf@gmail.com> Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D96527	2021-02-17 09:02:03 -05:00
Benjamin Kramer	63a35f35ec	[mlir][Shape] Generalize cstr_broadcastable folding for n-ary broadcasts This is still fairly tricky code, but I tried to untangle it a bit. Differential Revision: https://reviews.llvm.org/D96800	2021-02-17 11:44:52 +01:00
Stella Laurenzo	4c3f1be84f	[mlir][python] Add python binding for AffineMapAttribute. Differential Revision: https://reviews.llvm.org/D96815	2021-02-16 15:43:30 -08:00
MaheshRavishankar	81264dfbe8	[mlir][Linalg] Add utility method to reshape ops to express output shape in terms of input shape. Resolving the dim of outputs of a tensor_reshape op in terms of its input shape allows the op to be eliminated when its used only in its dims. The init_tensor -> tensor_reshape canonicalization can be simplified to use the dims of the output of the tensor_reshape which gets canonicalized away later making the tensor_reshape dead. Differential Revision: https://reviews.llvm.org/D96635	2021-02-16 13:42:08 -08:00
Adam Straw	99c0458f2f	separate AffineMapAccessInterface from AffineRead/WriteOpInterface Separating the AffineMapAccessInterface from AffineRead/WriteOp interface so that dialects which extend Affine capabilities (e.g. PlaidML PXA = parallel extensions for Affine) can utilize relevant passes (e.g. MemRef normalization). Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D96284	2021-02-16 13:05:27 -08:00
Thomas Raoux	adfd3c7083	[mlir] Fix memref_cast + subview folder when reducing rank When the destination of the subview has a lower rank than its source we need to fix the result type of the new subview op. Differential Revision: https://reviews.llvm.org/D96804	2021-02-16 12:00:59 -08:00
Alex Zinenko	ce8f10d6cb	[mlir] Simplify ModuleTranslation for LLVM IR A series of preceding patches changed the mechanism for translating MLIR to LLVM IR to use dialect interface with delayed registration. It is no longer necessary for specific dialects to derive from ModuleTranslation. Remove all virtual methods from ModuleTranslation and factor out the entry point to be a free function. Also perform some cleanups in ModuleTranslation internals. Depends On D96774 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96775	2021-02-16 18:42:52 +01:00
Alex Zinenko	2ab57c503e	[mlir] tighten LLVM dialect verifiers to generate valid LLVM IR Verification of the LLVM IR produced when translating various MLIR dialects was only active when calling the translation programmatically. This has led to several cases of invalid LLVM IR being generated that could not be caught with textual mlir-translate tests. Add verifiers for these cases and fix the tests in preparation for enforcing the validation of LLVM IR. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96774	2021-02-16 18:18:21 +01:00
Alex Zinenko	9cd47a26d5	[mlir] add verifiers for NVVM and ROCDL kernel attributes Make sure they can only be attached to LLVM functions as a result of converting GPU functions to the LLVM Dialect.	2021-02-16 18:06:54 +01:00
Thomas Raoux	397336dcab	[mlir][vector] Add missing support for contract of integer lowering. Some of the lowering of vector.contract didn't support integer case. Since reduction of integer cannot accumulate we always break up the reduction op, it should be merged by a separate canonicalization if possible. Differential Revision: https://reviews.llvm.org/D96461	2021-02-16 07:13:30 -08:00
Thomas Raoux	807e5467f3	[mlir] Add canonicalization for tensor_cast + tensor_to_memref This helps bufferization passes by removing tensor_cast operations. Differential Revision: https://reviews.llvm.org/D96745	2021-02-16 07:11:09 -08:00
Lei Zhang	cb1a42359b	[mlir][vector] Move splitting transfer ops into a separate entry point These patterns unrolls transfer read/write ops if the vector consumers/ producers are extract/insert slices op. Transfer ops can map to hardware load/store functionalities, where the vector size matters for bandwidth considerations. So these patterns should be collected separately, instead of being generic canonicalization patterns. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96782	2021-02-16 10:04:34 -05:00
Lei Zhang	d8c7f442ea	[mlir][vector] Add support for unrolling vector.fma Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96781	2021-02-16 09:56:25 -05:00
Tres Popp	787d771dce	[mlir] Don't return nullptrs from scf::IfOp::getSuccessorRegions Previously this might happen if there was no elseRegion and the method was asked for all successor regions. Differential Revision: https://reviews.llvm.org/D96764	2021-02-16 12:06:30 +01:00
Nicolas Vasilache	21debeae78	[mlir][Linalg] Generalize vector::transfer hoisting on tensors. This revision adds support for hoisting "subtensor + vector.transfer_read" / "subtensor_insert + vector.transfer_write pairs" across scf.for. The unit of hoisting becomes a HoistableRead / HoistableWrite struct which contains a pair of "vector.transfer_read + optional subtensor" / "vector.transfer_write + optional subtensor_insert". scf::ForOp canonicalization patterns are applied greedily on the successful application of the transformation to cleanup the IR more eagerly and potentially expose more transformation opportunities. Differential revision: https://reviews.llvm.org/D96731	2021-02-16 09:45:14 +00:00
Adrian Kuegel	07cc77187a	Lower math.expm1 to intrinsics in the GPUToNVVM and GPUToROCDL conversions. This adds the lowering for expm1 for GPU backends. Differential Revision: https://reviews.llvm.org/D96756	2021-02-16 10:23:42 +01:00
Nicolas Vasilache	d01ea0edaa	[mlir] Drop reliance of SliceAnalysis on specific ops. SliceAnalysis originally was developed in the context of affine.for within mlfunc. It predates the notion of region. This revision updates it to not hardcode specific ops like scf::ForOp. When rooted at an op, the behavior of the slice computation changes as it recurses into the regions of the op. This does not support gathering all values transitively depending on a loop induction variable anymore. Additional variants rooted at a Value are added to also support the existing behavior. Differential revision: https://reviews.llvm.org/D96702	2021-02-16 06:34:32 +00:00
Nicolas Vasilache	02d053ed2d	[mlir][Vector] Add a canonicalization pattern for vector.contract + add Differential Revision: https://reviews.llvm.org/D96701	2021-02-15 21:22:36 +00:00
Jacques Pienaar	381a65fa06	[mlir] Add clone method to ShapedType Allow clients to create a new ShapedType of the same "container" type but with different element or shape. First use case is when refining shape during shape inference without needing to consider which ShapedType is being refined. Differential Revision: https://reviews.llvm.org/D96682	2021-02-15 11:04:16 -08:00
Tres Popp	3842d4b679	Make shape.is_broadcastable/shape.cstr_broadcastable nary This corresponds with the previous work to make shape.broadcast nary. Additionally, simplify the ConvertShapeConstraints pass. It now doesn't lower an implicit shape.is_broadcastable. This is still the same in combination with shape-to-standard when the 2 passes are used in either order. Differential Revision: https://reviews.llvm.org/D96401	2021-02-15 16:05:32 +01:00
Alex Zinenko	176379e0c8	[mlir] Use the interface-based translation for LLVM "intrinsic" dialects Port the translation of five dialects that define LLVM IR intrinsics (LLVMAVX512, LLVMArmNeon, LLVMArmSVE, NVVM, ROCDL) to the new dialect interface-based mechanism. This allows us to remove individual translations that were created for each of these dialects and just use one common MLIR-to-LLVM-IR translation that potentially supports all dialects instead, based on what is registered and including any combination of translatable dialects. This removal was one of the main goals of the refactoring. To support the addition of GPU-related metadata, the translation interface is extended with the `amendOperation` function that allows the interface implementation to post-process any translated operation with dialect attributes from the dialect for which the interface is implemented regardless of the operation's dialect. This is currently applied to "kernel" functions, but can be used to construct other metadata in dialect-specific ways without necessarily affecting operations. Depends On D96591, D96504 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96592	2021-02-15 14:43:07 +01:00
Tres Popp	89d900b2a1	[mlir] Add error message on shape.broadcast verification failure	2021-02-15 10:58:53 +01:00
Alex Zinenko	34ea608a47	[mlir] Support repeated delayed registration of dialect interfaces Dialects themselves do not support repeated addition of interfaces with the same TypeID. However, in case of delayed registration, the registry may contain such an interface, or have the same interface registered several times due to, e.g., dependencies. Make sure we delayed registration does not attempt to add an interface with the same TypeID more than once. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96606	2021-02-15 10:46:26 +01:00
Tobias Gysi	99f3510b41	Reland "[mlir] add support for verification in integration tests" The patch extends the runner utils by verification methods that compare two memrefs. The methods compare the content of the two memrefs and print success if the data is identical up to a small numerical error. The methods are meant to simplify the development of integration tests that compare the results against a reference implementation (cf. the updates to the linalg matmul integration tests). Originally landed in `5fa893c` (https://reviews.llvm.org/D96326) and reverted in `dd719fd` due to a Windows build failure. Changes: - Remove the max function that requires the "algorithm" header on Windows - Eliminate the truncation warning in the float specialization of verifyElem by using a float constant Reviewed By: Kayjukh Differential Revision: https://reviews.llvm.org/D96593	2021-02-14 20:30:05 +01:00
Nicolas Vasilache	428bc6feed	[mlir][Linalg] Fix constant detection in linalg.pad_tensor vectorization.	2021-02-14 15:53:39 +00:00
Praveen Narayanan	a65fb1916c	Add a "kind" attribute to ContractionOp and OuterProductOp. Currently, vector.contract joins the intermediate result and the accumulator argument (of ranks K) using summation. We desire more joining operations --- such as max --- to help vector.contract express reductions. This change extends Vector_ContractionOp to take an optional attribute (called "kind", of enum type CombiningKind) specifying the joining operation to be add/mul/min/max for int/fp , and and/or/xor for int only. By default this attribute has value "add". To implement this we also need to extend vector.outerproduct, since vector.contract gets transformed to vector.outerproduct (and that to vector.fma). The extension for vector.outerproduct is also an optional kind attribute that uses the same enum type and possible values. The default is "add". In case of max/min we transform vector.outerproduct to a combination of compare and select. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D93280	2021-02-12 20:23:59 +00:00
Mehdi Amini	aa4e466caa	[mlir][Linalg] Improve region support in Linalg ops This revision takes advantage of the newly extended `ref` directive in assembly format to allow better region handling for LinalgOps. Specifically, FillOp and CopyOp now build their regions explicitly which allows retiring older behavior that relied on specific op knowledge in both lowering to loops and vectorization. This reverts commit `3f22547fd1` and reland `973e133b76` with a workaround for a gcc bug that does not accept lambda default parameters: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=59949 Differential Revision: https://reviews.llvm.org/D96598	2021-02-12 19:11:24 +00:00
Diego Caballero	656674a7c4	[mlir][Vector] Align gather/scatter/expand/compress API Align the vector gather/scatter/expand/compress API with the vector load/store/maskedload/maskedstore API. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D96396	2021-02-12 20:48:38 +02:00
Diego Caballero	ee66e43a96	[mlir][Vector] Introduce 'vector.load' and 'vector.store' ops This patch adds the 'vector.load' and 'vector.store' ops to the Vector dialect [1]. These operations model contiguous vector loads and stores from/to memory. Their semantics are similar to the 'affine.vector_load' and 'affine.vector_store' counterparts but without the affine constraints. The most relevant feature is that these new vector operations may perform a vector load/store on memrefs with a non-vector element type, unlike 'std.load' and 'std.store' ops. This opens the representation to model more generic vector load/store scenarios: unaligned vector loads/stores, perform scalar and vector memory access on the same memref, decouple memory allocation constraints from memory accesses, etc [1]. These operations will also facilitate the progressive lowering of both Affine vector loads/stores and Vector transfer reads/writes for those that read/write contiguous slices from/to memory. In particular, this patch adds the 'vector.load' and 'vector.store' ops to the Vector dialect, implements their lowering to the LLVM dialect, and changes the lowering of 'affine.vector_load' and 'affine.vector_store' ops to the new vector ops. The lowering of Vector transfer reads/writes will be implemented in the future, probably as an independent pass. The API of 'vector.maskedload' and 'vector.maskedstore' has also been changed slightly to align it with the transfer read/write ops and the vector new ops. This will improve reusability among all these operations. For example, the lowering of 'vector.load', 'vector.store', 'vector.maskedload' and 'vector.maskedstore' to the LLVM dialect is implemented with a single template conversion pattern. [1] https://llvm.discourse.group/t/memref-type-and-data-layout/ Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96185	2021-02-12 20:48:37 +02:00
Mehdi Amini	3f22547fd1	Revert "[mlir][Linalg] Improve region support in Linalg ops." This reverts commit `973e133b76`. It triggers an issue in gcc5 that require investigation, the build is broken with: /tmp/ccdpj3B9.s: Assembler messages: /tmp/ccdpj3B9.s:5821: Error: symbol `_ZNSt17_Function_handlerIFvjjEUljjE2_E9_M_invokeERKSt9_Any_dataOjS6_' is already defined /tmp/ccdpj3B9.s:5860: Error: symbol `_ZNSt14_Function_base13_Base_managerIUljjE2_E10_M_managerERSt9_Any_dataRKS3_St18_Manager_operation' is already defined	2021-02-12 18:15:51 +00:00
Alex Zinenko	66900b3eae	[mlir] Use dialect interfaces to translate OpenMP dialect to LLVM IR Migrate the translation of the OpenMP dialect operations to LLVM IR to the new dialect-based mechanism. Depends On D96503 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96504	2021-02-12 18:37:47 +01:00
Alex Zinenko	b77bac0572	[mlir] Introduce dialect interfaces for translation to LLVM IR The existing approach to translation to the LLVM IR relies on a single translation supporting the base LLVM dialect, extensible through inheritance to support intrinsic-based dialects also derived from LLVM IR such as NVVM and AVX512. This approach does not scale well as it requires additional translations to be created for each new intrinsic-based dialect and does not allow them to mix in the same module, contrary to the rest of the MLIR infrastructure. Furthermore, OpenMP translation ingrained itself into the main translation mechanism. Start refactoring the translation to LLVM IR to operate using dialect interfaces. Each dialect that contains ops translatable to LLVM IR can implement the interface for translating them, and the top-level translation driver can operate on interfaces without knowing about specific dialects. Furthermore, the delayed dialect registration mechanism allows one to avoid a dependency on LLVM IR in the dialect that is translated to it by implementing the translation as a separate library and only registering it at the client level. This change introduces the new mechanism and factors out the translation of the "main" LLVM dialect. The remaining dialects will follow suit. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96503	2021-02-12 17:49:44 +01:00
Nicolas Vasilache	f3fb2dd147	[mlir][Linalg] NFC - Add an OpFoldResult-based builder for InitTensorOp	2021-02-12 16:03:51 +00:00
Nicolas Vasilache	973e133b76	[mlir][Linalg] Improve region support in Linalg ops. This revision takes advantage of the newly extended `ref` directive in assembly format to allow better region handling for LinalgOps. Specifically, FillOp and CopyOp now build their regions explicitly which allows retiring older behavior that relied on specific op knowledge in both lowering to loops and vectorization. Differential Revision: https://reviews.llvm.org/D96598	2021-02-12 14:51:03 +00:00

1 2 3 4 5 ...

4955 Commits