llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2025-02-11 12:16:07 +00:00

Author	SHA1	Message	Date
gysit	d629645fcd	[mlir][OpDSL] Add support for adding canonicalization patterns. Extend OpDSL with a `defines` method that can set the `hasCanonicalizer` flag for an OpDSL operation. If the flag is set via `defines(Canonicalizer)` the operation needs to implement the `getCanonicalizationPatterns` method. The revision specifies the flag for linalg.fill_tensor and adds an empty `FillTensorOp::getCanonicalizationPatterns` implementation. This revision is a preparation step to replace linalg.fill by its OpDSL counterpart linalg.fill_tensor. The two are only functionally equivalent if both specify the same canonicalization patterns. The revision is thus a prerequisite for the linalg.fill replacement. Depends On D120725 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D120726	2022-03-08 15:56:59 +00:00
gysit	f4ae02afe7	[mlir][linalg] Add a FillOpInterface. Add a FillOpInterface similar to the contraction and convolution op interfaces. The FillOpInterface is a preparation step to replace linalg.fill by its OpDSL version linalg.fill_tensor. The interface implements the `value()`, `output()`, and `result()` methods that by default are not available on linalg.fill_tensor. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D120725	2022-03-08 15:48:02 +00:00
Javier Setoain	f2b89c7ae0	[mlir][Vector] Use create_mask in transfer mask materializations Currently, the transfer mask is materialized by generating the vector comparison: [offset + 0, .., offset + length - 1] < [dim, .., dim] A better alternative is to materialize the transfer mask by using the operation: `vector.create_mask (dim - offset)`, which will generate simpler code and compose better with scalable vectors. Differential Revision: https://reviews.llvm.org/D120487	2022-03-08 09:02:50 +00:00
Bixia Zheng	5b87e0521d	[mlir][sparse][taco] Split the evaluate method into compile and compute. This is to align with the PyTACO API better. Modify an existing unit test to test the new routines. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D121083	2022-03-07 16:58:41 -08:00
Hanhan Wang	1538bd518c	[mlir][Vector] Add patterns to reorder elementwise ops and broadcast/transpose ops. In quantized comutation, there are casting ops around computation ops. Reorder the ops to make reduce-to-contract actually work. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D120760	2022-03-07 12:52:12 -08:00
River Riddle	5a7b919409	[mlir][NFC] Rename StandardToLLVM to FuncToLLVM The current StandardToLLVM conversion patterns only really handle the Func dialect. The pass itself adds patterns for Arithmetic/CFToLLVM, but those should be/will be split out in a followup. This commit focuses solely on being an NFC rename. Aside from the directory change, the pattern and pass creation API have been renamed: * populateStdToLLVMFuncOpConversionPattern -> populateFuncToLLVMFuncOpConversionPattern * populateStdToLLVMConversionPatterns -> populateFuncToLLVMConversionPatterns * createLowerToLLVMPass -> createConvertFuncToLLVMPass Differential Revision: https://reviews.llvm.org/D120778	2022-03-07 11:25:23 -08:00
Bixia Zheng	4b7745c176	[mlir][sparse][taco] Add more unit tests. These unit tests resides in an internal repository. Porting the tests to the public repository. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D121021	2022-03-07 10:10:01 -08:00
Diego Caballero	917d95fc8a	[mlir][Vector] Improve default lowering of vector transpose operations The default lowering of vector transpose operations generates a large sequence of scalar extract/insert operations, one pair for each scalar element in the input tensor. In other words, the vector transpose is scalarized. However, there are transpose patterns where one or more adjacent high-order dimensions are not transposed (for example, in the transpose pattern [1, 0, 2, 3], dimensions 2 and 3 are not transposed). This patch improves the lowering of those cases by not scalarizing them and extracting/ inserting a full n-D vector, where 'n' is the number of adjacent high-order dimensions not being transposed. By doing so, we prevent the scalarization of the code and generate a more performant vector version. Paradoxically, this patch shouldn't improve the performance of transpose operations if we are using LLVM. The LLVM pipeline is able to optimize away some of the extract/insert operations and the SLP vectorizer is converting the scalar operations back to its vector form. However, scalarizing a vector version of the code in MLIR and relying on the SLP vectorizer to reconstruct the vector code again is highly undesirable for several reasons. Reviewed By: nicolasvasilache, ThomasRaoux Differential Revision: https://reviews.llvm.org/D120601	2022-03-07 17:56:02 +00:00
Benjamin Kramer	03ed395149	[mlir] Add missing override keyword. NFC.	2022-03-07 17:58:32 +01:00
Sergei Grechanik	27df7158fe	[mlir] Fix dumping invalid ops This patch fixes the crash when printing some ops (like affine.for and scf.for) when they are dumped in invalid state, e.g. during pattern application. Now the AsmState constructor verifies the operation first and switches to generic operation printing when the verification fails. Also operations are now printed in generic form when emitting diagnostics and the severity level is Error. Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D117834	2022-03-07 08:32:31 -08:00
Uday Bondhugula	9b740c035c	Update normalizeAffineFor to canonicalize maps/operands before using them Update normalizeAffineFor utility to canonicalize maps and operands before using them. Differential Revision: https://reviews.llvm.org/D121086	2022-03-07 18:49:50 +05:30
Mehdi Amini	e1f389a89f	Apply clang-tidy fixes for readability-simplify-boolean-expr to MLIR (NFC)	2022-03-07 10:41:45 +00:00
William S. Moses	87ec6f41bb	[OpenMPIRBuilder] Allocate temporary at the correct block in a nested parallel The OpenMPIRBuilder has a bug. Specifically, suppose you have two nested openmp parallel regions (writing with MLIR for ease) ``` omp.parallel { %a = ... omp.parallel { use(%a) } } ``` As OpenMP only permits pointer-like inputs, the builder will wrap all of the inputs into a stack allocation, and then pass this allocation to the inner parallel. For example, we would want to get something like the following: ``` omp.parallel { %a = ... %tmp = alloc store %tmp[] = %a kmpc_fork(outlined, %tmp) } ``` However, in practice, this is not what currently occurs in the context of nested parallel regions. Specifically to the OpenMPIRBuilder, the entirety of the function (at the LLVM level) is currently inlined with blocks marking the corresponding start and end of each region. ``` entry: ... parallel1: %a = ... ... parallel2: use(%a) ... endparallel2: ... endparallel1: ... ``` When the allocation is inserted, it presently inserted into the parent of the entire function (e.g. entry) rather than the parent allocation scope to the function being outlined. If we were outlining parallel2, the corresponding alloca location would be parallel1. This causes a variety of bugs, including https://github.com/llvm/llvm-project/issues/54165 as one example. This PR allows the stack allocation to be created at the correct allocation block, and thus remedies such issues. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D121061	2022-03-06 18:34:25 -05:00
River Riddle	9c9a431735	[mlir][Pass] Add support for an InterfacePass and pass filtering based on OperationName This commit adds a new hook Pass `bool canScheduleOn(RegisteredOperationName)` that indicates if the given pass can be scheduled on operations of the given type. This makes it easier to define constraints on generic passes without a) adding conditional checks to the beginning of the `runOnOperation`, or b) defining a new pass type that forwards from `runOnOperation` (after checking the invariants) to a new hook. This new hook is used to implement an `InterfacePass` pass class, that represents a generic pass that runs on operations of the given interface type. The PassManager will also verify that passes added to a pass manager can actually be scheduled on that pass manager, meaning that we will properly error when an Interface is scheduled on an operation that doesn't actually implement that interface. Differential Revision: https://reviews.llvm.org/D120791	2022-03-04 15:14:04 -08:00
Mogball	e7c7b16a84	[mlir] Region/BranchOpInterface: Allow implicit type conversions along control-flow edges RegionBranchOpInterface and BranchOpInterface are allowed to make implicit type conversions along control-flow edges. In effect, this adds an interface method, `areTypesCompatible`, to both interfaces, which should return whether the types of corresponding successor operands and block arguments are compatible. Users of the interfaces, here on forth, must be aware that types may mismatch, although current users (in MLIR core), are not affected by this change. By default, type equality is used. `async.execute` already has unequal types along control-flow edges (`!async.value<f32>` vs. `f32`), but it opted out of calling `RegionBranchOpInterface::verifyTypes` in its verifier. That method has now been removed and `RegionBranchOpInterface` will verify types along control edges by default in its verifier. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D120790	2022-03-04 20:33:14 +00:00
William S. Moses	62f84c73d2	[MLIR][SCF] Allow combining subsequent if statements that yield & negated condition This patch extends the existing if combining canonicalization to also handle the case where a value returned by the first if is used within the body of the second if. This patch also extends if combining to support if's whose conditions are logical negations of each other. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120924	2022-03-04 12:07:47 -05:00
William S. Moses	1d1791572c	[MLIR][MemRef] Ensure alloca_scope is inlined with no allocating ops Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120841	2022-03-04 11:58:59 -05:00
William S. Moses	4a94a33ca6	[MLIR][LLVM] Fold extractvalue to ignore insertvalue at distinct index We can simplify an extractvalue of an insertvalue to extract out of the base of the insertvalue, if the insert and extract are at distinct and non-prefix'd indices Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120915	2022-03-04 11:03:34 -05:00
Uday Bondhugula	5a99b776eb	[MLIR] Extend isLoopMemoryParallel to account for locally allocated memrefs Extend isLoopMemoryParallel check to include locally allocated memrefs. This strengthens and also speeds up the dependence check used by the utility by excluding locally allocated memrefs where appropriate. Additional memref dialect ops can be supported exhaustively via proper interfaces. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D120617	2022-03-04 09:16:28 +05:30
River Riddle	81f2f4dfb2	[PDLL] Add support for tablegen includes and importing ODS information This commit adds support for processing tablegen include files, and importing various information from ODS. This includes operations, attribute+type constraints, attribute/operation/type interfaces, etc. This will allow for much more robust tooling, and also allows for referencing ODS constructs directly within PDLL (imported interfaces can be used as constraints, operation result names can be used for member access, etc). Differential Revision: https://reviews.llvm.org/D119900	2022-03-03 16:14:03 -08:00
Lei Zhang	7d249dfd7d	[mlir][linalg] NFC: minor cleanups after moving pad to tensor dialect Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D120627	2022-03-03 09:44:54 -05:00
Matthias Springer	16cbe883b5	[mlir][linalg][bufferize] Migrate --linalg-bufferize to BufferizableOpInterface-based bufferization This commit deletes the old dialect conversion-based bufferization patterns, which are now obsolete. Differential Revision: https://reviews.llvm.org/D120883	2022-03-03 20:12:37 +09:00
Andrzej Warzynski	1b2e35e4d4	Revert "[mlir] Add extensible dialects" This reverts commit dbe9f0914fcfd8444fd9656821af0f1a34a27e7a. The flang-x86_64-windows buildbot has been failing since this has been merged: * https://lab.llvm.org/buildbot/#/builders/172/builds/9124 Similar failure was reported by the pre-commit CI.	2022-03-03 10:30:50 +00:00
Peixin-Qiao	a5605c9a15	[MLIR] Fix afterIP for dynamic worksharing-loop after collaping loops The loopInfos gets invalidated after collapsing nested loops. Use the saved afterIP since the returned afterIP by applyDynamicWorkshareLoop may be not valid. Reviewed By: shraiysh Differential Revision: https://reviews.llvm.org/D120294	2022-03-03 15:22:20 +08:00
River Riddle	3ba66435d9	[mlir][SPIRV] Split up StandardToSPIRV now that the Standard dialect is gone StandardToSPIRV currently contains an assortment of patterns converting from different dialects to SPIRV. This commit splits up StandardToSPIRV into separate conversions for each of the dialects involved (some of which already exist). Differential Revision: https://reviews.llvm.org/D120767	2022-03-02 13:14:36 -08:00
Mathieu Fehr	dbe9f0914f	[mlir] Add extensible dialects Add support for extensible dialects, which are dialects that can be extended at runtime with new operations and types. These operations and types cannot at the moment implement traits or interfaces. Differential Revision: https://reviews.llvm.org/D104554	2022-03-02 12:42:59 -08:00
natashaknk	8d7a833eed	[tosa][mlir] Add support for dynamic width/height for Conv2D inputs in tosa-to-linalg Infers output shape for dynamic width/height inputs. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D119977	2022-03-02 12:16:35 -08:00
William S. Moses	758ddba381	[MLIR] Use Datalayout defaults when importing LLVM LLVM defines several default datalayouts for integer and floating point types that are not being considered when importing into MLIR. This patch remedies this. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120832	2022-03-02 13:00:53 -05:00
William S. Moses	bf6477ebeb	[MLIR][OpenMP] Place alloca scope within wsloop in scf.parallel to omp lowering https://reviews.llvm.org/D120423 replaced the use of stacksave/restore with memref.alloca_scope, but kept the save/restore at the same location. This PR places the allocation scope within the wsloop, thus keeping the same allocation scope as the original scf.parallel (e.g. no longer over stack allocating). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120772	2022-03-02 12:46:58 -05:00
William S. Moses	2af81c6978	[MLIR][Arith] Canonicalize cmpi of extui/extsi Canonicalize cmpi(eq, ext a, ext b) and cmpi(ne, ext a, ext b) Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120620	2022-03-02 12:30:03 -05:00
William S. Moses	db31da279f	[MLIR][Arith] Add constant folder for left shift Add constant folder for left shift Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120661	2022-03-02 12:00:23 -05:00
Alex Zinenko	eb27da7dec	[mlir] Ignore index data layout in translation to LLVM It can be present, but is irrelevant for the translation.	2022-03-02 16:56:21 +01:00
Alex Zinenko	ea998709eb	[mlir] Add partial support for translating data layout Add support for translating data layout specifications for integer and float types between MLIR and LLVM IR. This is a first step towards removing the string-based LLVM dialect data layout attribute on modules. The latter is still available and will remain so until the first-class MLIR modeling can fully replace it. Depends On D120739 Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D120740	2022-03-02 14:56:50 +01:00
Alex Zinenko	f64170aa1d	[mlir] Data layout for integer and float types Add support for integer and float types into the data layout subsystem with default logic similar to LLVM IR. Given the flexibility of the sybsystem, the logic can be easily overwritten by operations if necessary. This provides the connection necessary, e.g., for the GPU target where alignment requirements for integers and floats differ from those provided by default (although still compatible with the LLVM IR model). Previously, it was impossible to use non-default alignment requirements for integer and float types, which could lead to incorrect address and size calculations when targeting GPUs. Depends On D120737 Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D120739	2022-03-02 14:56:49 +01:00
Alex Zinenko	554839ecdf	[mlir] support f80 and f128 in parseOptionalType This seems to have been forgotten when the types were introduced. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D120737	2022-03-02 14:56:47 +01:00
Shraiysh Vaishay	d2f0fe23d2	[mlir][OpenMP] Added assemblyFormat for atomic and critical operations This patch adds assemblyFormat for `omp.critical.declare`, `omp.atomic.read`, `omp.atomic.write`, `omp.atomic.update` and `omp.atomic.capture`. Also removing those clauses from `parseClauses` that aren't needed anymore, thanks to the new assemblyFormats. Reviewed By: NimishMishra, rriddle Differential Revision: https://reviews.llvm.org/D120248	2022-03-02 11:22:09 +05:30
Aart Bik	d8b229a1d5	[mlir][sparse][pytaco] added test with various sparse annotations This also found a bug in the new toMLIR code (missing permutation) Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D120783	2022-03-01 16:36:19 -08:00
Bixia Zheng	c25f3dfff3	[mlir][sparse][taco] Support tensor dimension storage ordering and more general sparsity values. Previously, we can't properly handle input tensors with a dimension ordering that is different from the natural ordering or with a mixed of compressed and dense dimensions. This change fixes the problems by passing the dimension ordering and sparsity values to the runtime routine. Modify an existing test to test the situation. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D120777	2022-03-01 15:36:38 -08:00
River Riddle	2f5715dc78	[mlir][NFC] Rename the old Standard dialect test directory to Func The remanants of Standard was renamed to Func, but the test directory remained named as Standard. In adidition to fixing the name, this commit also moves the tests for operations not in the Func dialect to the proper parent dialect test directory.	2022-03-01 13:48:34 -08:00
River Riddle	1f971e23f0	[mlir] Trim a huge number of unnecessary dependencies on the Func dialect The Func has a large number of legacy dependencies carried over from the old Standard dialect, which was pervasive and contained a large number of varied operations. With the split of the standard dialect and its demise, a lot of lingering dead dependencies have survived to the Func dialect. This commit removes a large majority of then, greatly reducing the dependence surface area of the Func dialect.	2022-03-01 12:10:04 -08:00
River Riddle	23aa5a7446	[mlir] Rename the Standard dialect to the Func dialect The last remaining operations in the standard dialect all revolve around FuncOp/function related constructs. This patch simply handles the initial renaming (which by itself is already huge), but there are a large number of cleanups unlocked/necessary afterwards: * Removing a bunch of unnecessary dependencies on Func * Cleaning up the From/ToStandard conversion passes * Preparing for the move of FuncOp to the Func dialect See the discussion at https://discourse.llvm.org/t/standard-dialect-the-final-chapter/6061 Differential Revision: https://reviews.llvm.org/D120624	2022-03-01 12:10:04 -08:00
Bixia Zheng	20eaa88fff	[mlir][sparse] Extend convertToMLIRSparseTensor to support permutation and more general sparsity values. Previously, convertToMLIRSparseTensor assumes identity storage ordering and all compressed dimensions. This change extends the function with two parameters for users to specify the storage ordering and the sparsity of each dimension. Modify PyTACO to reflect this change. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D120643	2022-03-01 10:51:39 -08:00
William S. Moses	78fb4f9d5d	[SCF][MemRef] Enable SCF.Parallel Lowering to use Scope Op As discussed in https://reviews.llvm.org/D119743 scf.parallel would continuously stack allocate since the alloca op was placd in the wsloop rather than the omp.parallel. This PR is the second stage of the fix for that problem. Specifically, we now introduce an alloca scope around the inlined body of the scf.parallel and enable a canonicalization to hoist the allocations to the surrounding allocation scope (e.g. omp.parallel). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120423	2022-03-01 13:25:09 -05:00
Lei Zhang	c809c9bd3b	[mlir][spirv] Convert gpu.barrier to spv.ControlBarrier Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D120722	2022-03-01 12:04:00 -05:00
Alex Zinenko	5c73db24df	[mlir] disallow side-effecting ops in llvm.mlir.global The llvm.mlir.global operation accepts a region as initializer. This region corresponds to an LLVM IR constant expression and therefore should not accept operations with side effects. Add a corresponding verifier. Reviewed By: wsmoses, bondhugula Differential Revision: https://reviews.llvm.org/D120632	2022-03-01 14:16:09 +01:00
gysit	e9085d0d25	[mlir][OpDSL] Rename function to make signedness explicit (NFC). The revision renames the following OpDSL functions: ``` TypeFn.cast -> TypeFn.cast_signed BinaryFn.min -> BinaryFn.min_signed BinaryFn.max -> BinaryFn.max_signed ``` The corresponding enum values on the C++ side are renamed accordingly: ``` #linalg.type_fn<cast> -> #linalg.type_fn<cast_signed> #linalg.binary_fn<min> -> #linalg.binary_fn<min_signed> #linalg.binary_fn<max> -> #linalg.binary_fn<max_signed> ``` Depends On D120110 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D120562	2022-03-01 08:15:53 +00:00
gysit	24357fec8d	[mlir][OpDSL] Add arithmetic function attributes. The revision extends OpDSL with unary and binary function attributes. A function attribute, makes the operations used in the body of a structured operation configurable. For example, a pooling operation may take an aggregation function attribute that specifies if the op shall implement a min or a max pooling. The goal of this revision is to define less and more flexible operations. We may thus for example define an element wise op: ``` linalg.elem(lhs, rhs, outs=[out], op=BinaryFn.mul) ``` If the op argument is not set the default operation is used. Depends On D120109 Reviewed By: nicolasvasilache, aartbik Differential Revision: https://reviews.llvm.org/D120110	2022-03-01 07:45:47 +00:00
Michael Kruse	a66f7769a3	[OpenMPIRBuilder] Implement static-chunked workshare-loop schedules. Add applyStaticChunkedWorkshareLoop method implementing static schedule when chunk-size is specified. Unlike a static schedule without chunk-size (where chunk-size is chosen by the runtime such that each thread receives one chunk), we need two nested loops: one for looping over the iterations of a chunk, and a second for looping over all chunks assigned to the threads. This patch includes the following related changes: * Adapt applyWorkshareLoop to triage between the schedule types, now possible since all schedules have been implemented. The default schedule is assumed to be non-chunked static, as without OpenMPIRBuilder. * Remove the chunk parameter from applyStaticWorkshareLoop, it is ignored by the runtime. Change the value for the value passed to the init function to 0, as without OpenMPIRBuilder. * Refactor CanonicalLoopInfo::setTripCount and CanonicalLoopInfo::mapIndVar as used by both, applyStaticWorkshareLoop and applyStaticChunkedWorkshareLoop. * Enable Clang to use the OpenMPIRBuilder in the presence of the schedule clause. Differential Revision: https://reviews.llvm.org/D114413	2022-02-28 18:18:33 -06:00
Okwan Kwon	4c901bf447	[mlir] Match Arithmetic::ConstantOp and Tensor::ExtractSliceOp. Add a pattern matcher for ExtractSliceOp when its source is a constant. The matching heuristics can be governed by the control function since generating a new constant is not always beneficial. Differential Revision: https://reviews.llvm.org/D119605	2022-02-28 23:09:03 +00:00
Lei Zhang	96bc2233c4	[mlir][linalg] Enhance FoldInsertPadIntoFill to support op chain If we have a chain of `tensor.insert_slice` ops inserting some `tensor.pad` op into a `linalg.fill` and ranges do not overlap, we can also elide the `tensor.pad` later. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D120446	2022-02-28 16:51:17 -05:00

1 2 3 4 5 ...

5560 Commits