llvm-capstone/mlir/test/python
Guray Ozen 12c241b365
[MLIR][NVVM] Explicit Data Type for Output in wgmma.mma_async (#78713)
The current implementation of `nvvm.wgmma.mma_async` Op deduces the data
type of the output matrix from the data type of struct member, which can be
non-intuitive, especially in cases where types like `2xf16` are packed
into `i32`.

This PR addresses this issue by improving the Op to include an explicit
data type for the output matrix.

The modified Op now includes an explicit data type for Matrix-D (<f16>),
and looks as follows:

```
%result = llvm.mlir.undef : !llvm.struct<(struct<(i32, i32, ...
nvvm.wgmma.mma_async
    %descA, %descB, %result,
    #nvvm.shape<m = 64, n = 32, k = 16>,
    D [<f16>, #nvvm.wgmma_scale_out<zero>],
    A [<f16>, #nvvm.wgmma_scale_in<neg>, <col>],
    B [<f16>, #nvvm.wgmma_scale_in<neg>, <col>]
```
2024-01-22 08:37:20 +01:00
..
dialects [MLIR][NVVM] Explicit Data Type for Output in wgmma.mma_async (#78713) 2024-01-22 08:37:20 +01:00
integration [mlir][python] meta region_op (#75673) 2023-12-21 11:20:29 -06:00
ir [mlir][arith] Add overflow flags support to arith ops (#78376) 2024-01-17 06:12:23 +03:00
lib [mlir][python] enable registering dialects with the default Context (#72488) 2023-11-27 19:26:05 -06:00
CMakeLists.txt
develoment_files.py [NFC][Py Reformat] Reformat python files in mlir subdir 2023-05-26 08:05:40 +02:00
execution_engine.py [mlir][Python] Fix conversion of non-zero offset memrefs to np.arrays 2023-09-05 08:02:59 +00:00
lit.local.cfg [NFC][Py Reformat] Reformat python files in mlir subdir 2023-05-26 08:05:40 +02:00
pass_manager.py [mlir][python][nfc] Test -print-ir-after-all (#75742) 2023-12-17 20:24:47 +01:00
python_test_ops.td [mlir][python] fix python_test dialect and I32/I64ElementsBuilder (#70871) 2023-10-31 19:55:42 -05:00