llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2025-04-05 06:42:38 +00:00

Author	SHA1	Message	Date
Matteo Franciolini	0e0b6070fd	Implements MLIR Bytecode versioning capability A dialect can opt in to handle versioning through the `BytecodeDialectInterface`. A few hooks are exposed to the dialect to manage a version encoded into the bytecode file. The version is loaded lazily, which makes the version information available while parsing the input IR and gives each dialect for which a version is present an opportunity to perform IR upgrades post-parsing through the `upgradeFromVersion` method. Custom Attribute and Type encodings can also be upgraded according to the dialect version using the `readAttribute` and `readType` methods. There is no restriction on what kind of information a dialect is allowed to encode to model its versioning. Currently, versioning is supported only for bytecode formats. Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D143647	2023-03-10 23:28:56 +01:00
Rahul Kayaith	55cf53fd0f	[mlir][Parser] Make parse{Attribute,Type} null-terminate input `parseAttribute` and `parseType` require null-terminated strings as input, but this isn't great considering the argument type is `StringRef`. This changes them to copy to a null-terminated buffer by default, with an `isKnownNullTerminated` flag added to disable the copying. closes #58964 Reviewed By: rriddle, kuhar, lattner Differential Revision: https://reviews.llvm.org/D145182	2023-03-03 17:03:27 -05:00
Rahul Kayaith	f5f8a46bb0	[mlir][AsmParser] Improve parse{Attribute,Type} error handling Currently these functions report errors directly to stderr; this updates them to use diagnostics instead. This also makes partially-consumed strings an error if the `numRead` parameter isn't provided (the docstrings already claimed this happened, but it didn't). While here I also tried to reduce the number of overloads by switching to using default parameters. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D144804	2023-03-01 17:14:59 -05:00
Kazu Hirata	55e2cd1609	Use llvm::count{lr}_{zero,one} (NFC)	2023-01-28 12:41:20 -08:00
Kazu Hirata	83d56fb17a	Drop the ZeroBehavior parameter from countLeadingZeros and the like (NFC) This patch drops the ZeroBehavior parameter from bit counting functions like countLeadingZeros. ZeroBehavior specifies the behavior when the input to count{Leading,Trailing}Zeros is zero and when the input to count{Leading,Trailing}Ones is all ones. ZeroBehavior was first introduced on May 24, 2013 in commit eb91eac9fb866ab1243366d2e238b9961895612d. While that patch did not state the intention, I would guess ZeroBehavior was for performance reasons. The x86 machines around that time required a conditional branch to implement a countLeadingZeros<uint32_t> that returns 32 on zero: `test edi, edi; je .LBB0_2; bsr eax, edi; xor eax, 31; .LBB0_2: mov eax, 32`. That is, we can remove the conditional branch if we don't care about the behavior on zero. IIUC, Intel's Haswell architecture, launched on June 4, 2013, introduced several bit manipulation instructions, including lzcnt and tzcnt, which eliminated the need for the conditional branch. I think it's time to retire ZeroBehavior as its utility is very limited. If you care about compilation speed, you should build LLVM with an appropriate -march= to take advantage of lzcnt and tzcnt. Even if not, modern host compilers should be able to optimize away quite a few conditional branches because the input is often known to be nonzero from dominating conditional branches. Differential Revision: https://reviews.llvm.org/D141798	2023-01-18 19:58:44 -08:00
Kazu Hirata	0a81ace004	[mlir] Use std::optional instead of llvm::Optional (NFC) This patch replaces (llvm::\|)Optional< with std::optional<. I'll post a separate patch to remove #include "llvm/ADT/Optional.h". This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-14 01:25:58 -08:00
serge-sans-paille	984b800a03	Move from llvm::makeArrayRef to ArrayRef deduction guides - last part This is a follow-up to https://reviews.llvm.org/D140896, split into several parts as it touches a lot of files. Differential Revision: https://reviews.llvm.org/D141298	2023-01-10 11:47:43 +01:00
River Riddle	18546ff8dd	[mlir:Bytecode] Add shared_ptr<SourceMgr> overloads to allow safe mmap of data The bytecode reader currently has no mechanism that allows for directly referencing data from the input buffer safely. This commit adds shared_ptr<SourceMgr> overloads that provide an explicit and safe way of extending the lifetime of the input. The usage of these new overloads is adopted in all of our tooling, and is implicitly used in the filename only parser methods. Differential Revision: https://reviews.llvm.org/D139366	2022-12-11 22:45:34 -08:00
Kazu Hirata	4f81805a3f	[mlir] Use std::optional instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 17:11:23 -08:00
Kazu Hirata	b2379415b9	[Reader] Use std::optional in BytecodeReader.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 10:31:26 -08:00
Kazu Hirata	192d9dd731	[mlir] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 19:58:32 -08:00
bzcheeseman	ae461d8b99	[MLIR][Bytecode] Ensure `dataIt` is aligned coming out of `EncodingReader::alignTo`. This addresses the TODO in the code previously and checks that the address of `dataIt` is properly aligned to the requested alignment. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D137855	2022-11-13 09:27:23 -08:00
River Riddle	54cdc03dfa	[mlir:Parser] Always splice parsed operations to the end of the parsed block The current splicing behavior dates back to when all blocks had terminators, so we would "helpfully" splice before the terminator. This doesn't make sense anymore, and leads to somewhat unexpected results when parsing multiple pieces of IR into the same block. Differential Revision: https://reviews.llvm.org/D135096	2022-10-05 13:11:38 -07:00
River Riddle	1ae60e044e	[mlir] Making verification after parsing optional This is very useful when you want to parse IR even if it's invalid (e.g. bytecode). It's also useful if you don't want to pay the cost of verification in certain situations. Differential Revision: https://reviews.llvm.org/D134847	2022-09-28 20:38:12 -07:00
River Riddle	5fb1bbe6d4	[mlir] Add bytecode encodings for the builtin ElementsAttr attributes This adds bytecode support for DenseArrayAttr, DenseIntOrFpElementsAttr, DenseStringElementsAttr, and SparseElementsAttr. Differential Revision: https://reviews.llvm.org/D133744	2022-09-13 11:39:20 -07:00
River Riddle	6ab2bcffe4	[mlir:Bytecode] Add support for encoding resources Resources are encoded in two separate sections similarly to attributes/types, one for the actual data and one for the data offsets. Unlike other sections, the resource sections are optional given that in many cases they won't be present. For testing, bytecode serialization is added for DenseResourceElementsAttr. Differential Revision: https://reviews.llvm.org/D132729	2022-09-13 11:39:19 -07:00
Mehdi Amini	35cf7e8b2f	Apply clang-tidy fixes for modernize-use-emplace in BytecodeReader.cpp (NFC)	2022-08-30 00:10:45 +00:00
River Riddle	2f90764ce8	[mlir:Bytecode] Add encoding support for a majority of the builtin attributes This adds support for the non-location, non-elements, non-affine builtin attributes. Differential Revision: https://reviews.llvm.org/D132539	2022-08-26 13:31:05 -07:00
River Riddle	02c2ecb9c6	[mlir:Bytecode] Add initial support for dialect defined attribute/type encodings Dialects can opt-in to providing custom encodings by implementing the `BytecodeDialectInterface`. This interface provides hooks, namely `readAttribute`/`readType` and `writeAttribute`/`writeType`, that will be used by the bytecode reader and writer. These hooks are provided a reader and writer implementation that can be used to encode various constructs in the underlying bytecode format. A unique feature of this interface is that dialects may choose to only encode a subset of their attributes and types in a custom bytecode format, which can simplify adding new or experimental components that aren't fully baked. Differential Revision: https://reviews.llvm.org/D132498	2022-08-23 16:56:04 -07:00
River Riddle	b3449392f5	[mlir:Bytecode][NFC] Cleanup Attribute/Type reading This moves some parsing functionality from BytecodeReader to AttrTypeReader, and removes some duplication between the attribute/type code paths. Differential Revision: https://reviews.llvm.org/D132497	2022-08-23 16:56:03 -07:00
River Riddle	83dc999948	[mlir:Bytecode][NFC] Refactor string section writing and reading This extracts the string section writer and reader into dedicated classes, which better separates the logic and will also simplify future patches that want to interact with the string section. Differential Revision: https://reviews.llvm.org/D132496	2022-08-23 16:56:03 -07:00
River Riddle	96fd3f2d5b	[mlir:Bytecode] Fix asan failure We were accessing the region state after it got popped from the stack.	2022-08-22 10:02:42 -07:00
Goran Flegar	59548fe873	[mlir] Fix compile errors with bytecode support	2022-08-22 17:59:51 +02:00
River Riddle	93cf0e8a28	[mlir] Fix bots after bytecode support was added in D131747 * Fix ambiguous Twine constructor call * Ensure shift is 64-bit (for MSVC) * Disable bytecode tests on s390x (we don't support big endian right now)	2022-08-22 01:31:39 -07:00
River Riddle	f3acb54c1b	[mlir] Add initial support for a binary serialization format This commit adds a new bytecode serialization format for MLIR. The actual serialization of MLIR to binary is relatively straightforward, given the very general structure of MLIR. The underlying basis for this format is a variable-length encoding for integers, which gets heavily used for nearly all aspects of the encoding (given that most of the encoding is just indexing into lists). The format currently does not provide support for custom attribute/type serialization, and thus always uses an assembly format fallback. It also doesn't provide support for resources. These will be added in followups; the intention for this patch is to provide something that supports the basic cases and can be built on. https://discourse.llvm.org/t/rfc-a-binary-serialization-format-for-mlir/63518 Differential Revision: https://reviews.llvm.org/D131747	2022-08-22 00:36:26 -07:00

25 Commits
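
Note on f3acb54c1b: the commit message says the format is built on a variable-length encoding for integers, since most of the encoding is indexing into lists. The sketch below illustrates the general idea with a plain LEB128-style scheme (7 payload bits per byte, high bit set when more bytes follow). This is an assumption chosen for illustration only; the log does not describe the byte layout MLIR actually uses, and the names `encodeVarInt` and `decodeVarInt` are hypothetical.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Illustrative LEB128-style varint. NOT taken from the MLIR sources; the
// real bytecode layout may differ. Small values (e.g. list indices) take
// one byte, larger values take more.
std::vector<uint8_t> encodeVarInt(uint64_t value) {
  std::vector<uint8_t> out;
  do {
    uint8_t byte = value & 0x7F; // low 7 bits of the remaining value
    value >>= 7;
    if (value != 0)
      byte |= 0x80;              // continuation bit: more bytes follow
    out.push_back(byte);
  } while (value != 0);
  return out;
}

// Decodes one value starting at `pos` and advances `pos` past it.
// Returns false if the input is truncated or longer than 10 bytes.
bool decodeVarInt(const std::vector<uint8_t> &data, size_t &pos,
                  uint64_t &result) {
  result = 0;
  for (unsigned shift = 0; shift < 64; shift += 7) {
    if (pos >= data.size())
      return false;              // ran out of bytes mid-value
    uint8_t byte = data[pos++];
    result |= static_cast<uint64_t>(byte & 0x7F) << shift;
    if ((byte & 0x80) == 0)
      return true;               // last byte of this value
  }
  return false;                  // continuation bit still set after 10 bytes
}
```

With this scheme, an index below 128 costs a single byte, which is why a varint suits a format dominated by small list indices.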