llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-11-26 23:21:11 +00:00


Guray Ozen 12c241b365 [MLIR][NVVM] Explicit Data Type for Output in `wgmma.mma_async` (#78713)	2024-01-22 08:37:20 +01:00

The current implementation of `nvvm.wgmma.mma_async` Op deduces the data type of the output matrix from the data type of struct member, which can be non-intuitive, especially in cases where types like `2xf16` are packed into `i32`.

This PR addresses this issue by improving the Op to include an explicit data type for the output matrix.

The modified Op now includes an explicit data type for Matrix-D (<f16>), and looks as follows:

```
%result = llvm.mlir.undef : !llvm.struct<(struct<(i32, i32, ...
nvvm.wgmma.mma_async
    %descA, %descB, %result,
    #nvvm.shape<m = 64, n = 32, k = 16>,
    D [<f16>, #nvvm.wgmma_scale_out<zero>],
    A [<f16>, #nvvm.wgmma_scale_in<neg>, <col>],
    B [<f16>, #nvvm.wgmma_scale_in<neg>, <col>]
```
dialects	[MLIR][NVVM] Explicit Data Type for Output in `wgmma.mma_async` (#78713)	2024-01-22 08:37:20 +01:00
integration	[mlir][python] meta region_op (#75673)	2023-12-21 11:20:29 -06:00
ir	[mlir][arith] Add overflow flags support to arith ops (#78376)	2024-01-17 06:12:23 +03:00
lib	[mlir][python] enable registering dialects with the default `Context` (#72488)	2023-11-27 19:26:05 -06:00
CMakeLists.txt
develoment_files.py	[NFC][Py Reformat] Reformat python files in mlir subdir	2023-05-26 08:05:40 +02:00
execution_engine.py	[mlir][Python] Fix conversion of non-zero offset memrefs to np.arrays	2023-09-05 08:02:59 +00:00
lit.local.cfg	[NFC][Py Reformat] Reformat python files in mlir subdir	2023-05-26 08:05:40 +02:00
pass_manager.py	[mlir][python][nfc] Test `-print-ir-after-all` (#75742)	2023-12-17 20:24:47 +01:00
python_test_ops.td	[mlir][python] fix python_test dialect and I32/I64ElementsBuilder (#70871)	2023-10-31 19:55:42 -05:00