archived-SPIRV-Tools

mirror of https://github.com/RPCSX/SPIRV-Tools.git synced 2026-01-31 01:05:19 +01:00

Author	SHA1	Message	Date
Steven Perron	ca004da9f9	Add knowledge of cooperative matrices (#5720 ) * Add knowledge of cooperative matrices Some optimizations are not aware of cooperative matrices, and either do nothing or assert. This commits fixes that up. * Add int tests, and a handle a couple more cases. * Add float tests, and a handle a couple more cases. * Add NV coop matrix as well.	2024-06-26 08:00:29 -04:00
Steven Perron	581279dedd	[OPT] Zero-extend unsigned 16-bit integers when bitcasting (#5714 ) The folding rule `BitCastScalarOrVector` was incorrectly handling bitcasting to unsigned integers smaller than 32-bits. It was simply copying the entire 32-bit word containing the integer. This conflicts with the requirement in section 2.2.1 of the SPIR-V spec which states that unsigned numeric types with a bit width less than 32-bits must have the high-order bits set to 0. This change include a refactor of the bit extension code to be able to test it better, and to use it in multiple files. Fixes https://github.com/microsoft/DirectXShaderCompiler/issues/6319.	2024-06-19 19:17:05 +02:00
Steven Perron	148c97f687	Avoid use of type manager in extact->construct folding (#5684 ) * Avoid use of type manager in extact->construct folding When dealing with structs the type manager merge two different structs into a single entry if they have all of the same decorations and element types. This is because they hash to the same value in the hash table. This can cause problems if you need to get the id of a type from the type manager because you could get either one. In this case, it returns the wrong one. The fix avoids using the type manager in one place. I have not looked closely at other places the type manager is used to make sure it is used safely everywhere. Fixes #5624 * Remove use of TypeManager::GetId This removes a use of TypeManager::GetId by keeping the id around. This avoid a potential problem if the type manager gets confused. These types of bugs are hard to generate test cases for, so I do not have a test. However, existing tests make sure that do not regress.	2024-05-31 14:13:20 +02:00
Steven Perron	336b5710a5	Do not fold mul and adds to generate fmas (#5682 ) This removes the folding rules added in #4783 and #4808. They lead to poor code generation on Adreno devices when 16-bit floating point values were used. Since this change is transformation is suppose to be neutral, there is no general reason to continue doing it. I have talked to the owners of SwiftShader, and they do not mind if the transform is removed. They were the ones the requested the change in the first place. Fixes #5658	2024-05-22 13:01:26 -04:00
Steven Perron	9e7a1f2ddd	Fix array size calculation (#5463 ) The function that get the number of elements in a composite variable returns an incorrect values for the arrays. This is fixed, so that it returns the correct number of elements for arrays where the number of elements is represented as a 32-bit integer and is known at compile time. Fixes #4953	2023-11-02 13:29:57 -04:00
Steve Urquhart	44c9da6fee	Remove const zero image operands (#5232 )	2023-05-24 10:30:10 -04:00
Spencer Fricke	fa69b09cff	spirv-opt: Remove unused includes and code (#5177 )	2023-03-28 12:40:30 -04:00
Laura Hermanns	bd83b772c3	Fix operand index out of bounds when folding OpCompositeExtract. (#5107 ) GetExtractOperandsForElementOfCompositeConstruct() states "Returns the empty vector if \|result_index\| is out-of-bounds", but violates that contract for non-vector result types.	2023-03-03 15:52:49 +00:00
Nathan Gauër	1a7f71afb4	clean: constexpr-ify and unify anon namespace use (#4991 ) Constexpr guaranteed no runtime init in addition to const semantics. Moving all opt/ to constexpr. Moving all compile-unit statics to anonymous namespaces to uniformize the method used (anonymous namespace vs static has the same behavior here AFAIK). Signed-off-by: Nathan Gauër <brioche@google.com>	2022-11-17 19:02:50 +01:00
alan-baker	d35a78db57	Switch SPIRV-Tools to use spirv.hpp11 internally (#4981 ) Fixes #4960 * Switches to using enum classes with an underlying type to avoid undefined behaviour	2022-11-04 17:27:10 -04:00
gmitrano-unity	1cecf91701	Support Narrow Types in BitCast Folding Rule (#4941 ) * Support Narrow Types in BitCast Folding Rule This change adds support for narrow types in the BitCastScalarOrVector folding rule. According to Section 2.2.1 of the SPIR-V spec, types that are narrower than 32 bits are automatically either sign extended, or zero extended depending on the type. With that guaranteed, we should be able to use the first 32-bit word of any narrow type for the folding logic without performing any special conversions. In order to reduce code duplication, this change moves the GetU32BitValue and GetU64BitValue functions from IntConstant to ScalarConstant. Without this move, we would have needed an identical version of GetU32BitValue on FloatConstant. * Add Tests for 16-bit BitCast Folding This change adds several new test cases to the IntegerInstructionFoldingTest which trigger the 16-bit BitCast logic. The logic for half types was also added to the integer case since we can't easily validate half float types in C++ code. It's easier to validate them as unsigned integers instead. Pllus this also allows us to verify the SPIR-V constant sign extension logic too. * Add 8-Bit Folding Test Cases This change adds a couple more test cases to the integer instruction folding test suite in order to ensure that the BitCast logic also works correctly with the Int8 shader capability.	2022-10-06 10:35:18 -04:00
Steven Perron	0a43a84e02	Fix shuffle feeding shuffle with undef literal (#4883 ) When folding a vector shuffle with an undef literal, it is possible that the literal is adjusted so that it will then be interpreted as an index into the input operands. This is fixed by special casing that case, and not adjusting those operands. Fixes #4859	2022-08-10 09:04:35 -04:00
Steven Perron	4f321f862a	Avoid undefined divide-by-0 (#4821 ) The Reciprocal function expects a divide-by-0 to return nan,and then Reciprocal will return 0. Since the divide-by-0 is actually undefined, we will identify this case early, and return 0. No new tests are needed because we already tests folding divide-by-0. Fixes #4715	2022-06-21 11:37:34 -04:00
Nicolas Capens	130a05d2e3	Fold multiply and subtraction into FMA with negation (#4808 ) This change adds a folding rule which transforms x * y - a and a - x * y into FMA(x, y, -a) and FMA(-x, y, a), respectively. While the SPIR-V instruction count remains the same, target instruction sets typically feature FMA instruction variants that can negate an operand. Also this transformation may unlock further optimizations which eliminate the negation. (Google bug: b/226145988)	2022-05-31 12:03:56 -04:00
Steven Perron	088cb1a5c8	Add more folding for composite instructions (#4802 ) * Add move folding for composite instructions Fold chains of insert into construct If a chain of OpCompositeInsert instruction write to every element of a composite object, then we can replace it with an OpCompositeConstruct. Fold a construct fed by extracts to a single extract We already fold an OpCompositeConstruct when it is simlpy reconstructing an object that was decomposed by a series of OpCompositeExtract instructions. However, we do not do that if that object is an element of a larger object. I have updated the rule, so that if the original object is a an element of a larger object, then the OpCompositeConstruct is replaced with a single OpCompositeExtract from the larger object. Fixes #4371.	2022-05-26 10:29:02 -04:00
Steven Perron	1295dca8e2	Reapply "Add folding rule to generate Fma instructions (#4783 )" (#4789 ) This reverts commit `671f6e633f`. PR #4783 was reverted because it caused OpenCL CTS failures for clvk. The was in clspv, which was not adding the no contract decoration when it was required. This has been fixed in https://github.com/google/clspv/pull/845. We can now reapply #4783.	2022-05-03 10:20:23 -04:00
Daniele Vettorel	671f6e633f	Revert "Add folding rule to generate Fma instructions (#4783 )" (#4785 ) This reverts commit `2b2b0282af`.	2022-04-20 10:55:20 -04:00
Steven Perron	2b2b0282af	Add folding rule to generate Fma instructions (#4783 ) Adding Fma instruction can speed up the code. This was requested by swiftshader, so they do not have to do this analysis themselves. It can also help reduce the code size, and the work the ICD compilers have to do.	2022-04-19 11:25:07 -04:00
Steven Perron	48a36c72e4	Better handling of 0xFFFFFFFF when folding vector shuffle (#4743 ) When folding a vector shuffle feeding a vector shuffle, we do not propagate an 0xFFFFFFFF, which has a special meaning, correctly. We adjust the value making it lose it meaning as an undefined value. Fixes #4581	2022-03-07 19:35:57 +00:00
Steven Perron	3291b6951e	Do not fold snegate feeding sdiv. (#4600 ) When the variable value is INT_MIN, we cannot fold the negate into the divide, so we have to turn off that folding rule. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/4487.	2021-10-28 10:02:57 -04:00
Steven Perron	59f51bb4f8	Fix extract with out-of-bounds index (#4529 ) * Fix extract with out-of-bounds index When folding a OpCompositeExtract that is fed by an OpCompositeConstruct, we handle and out of bounds index, but only in the case where the result of the OpCompostiteConstruct is a struct. This change refactors that folding rule and then improves it to handle an out-of-bounds access when the result of the OpCompositeConstruct is a vector.	2021-09-20 13:02:47 -04:00
Alastair Donaldson	36ff135341	spirv-opt: Avoid integer overflow during constant folding (#4511 ) In SPIR-V, integers use 2s complement representation, so that signed integer overflow and underflow is well defined. However, the constant folder was causing overflow / underflow at the C++ level. This change avoids such overflows by performing constant folding for IAdd, ISub and IMul in the context of unsigned values, which works because signedness is irrelevant according to the SPIR-V semantics for these instructions. Fixes #4510.	2021-09-14 21:09:05 +00:00
Nicolas Capens	869a550d26	Don't fold unsigned divides of an constant and a negation (#4457 ) Negating an unsigned constant results in its two's complement which is still interpreted as unsigned. For example -2u becomes 4294967294u. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/4456	2021-08-16 09:56:05 -04:00
Ryan Harrison	8b3dc6bbed	Check that valid bitcasted constant was returned (#4311 ) This call returns nullptr to indicate errors. Fixes https://crbug.com/1213365	2021-06-01 09:09:31 -04:00
Jaebaek Seo	07ec4f83c5	Support folding OpBitcast with numeric constants (#4247 ) Add constant folding rule for OpBitcast with numeric scalar or vector constants.	2021-04-27 14:24:46 -04:00
Corentin Wallez	4a59fd4763	Fix -Wextra-semi-stmt -Wsuggest-destructor-override -Wdeprecated-copy-dtor (#4164 ) * Fix -Wextra-semi-stmt * Fix -Wsuggest-destructor-override * Fix -Wdeprecated-copy-dtor	2021-03-09 13:16:43 +00:00
Vasyl Teliman	948577c5df	Fix the bug (#3680 )	2020-08-13 09:09:57 -04:00
Steven Perron	00ca4e5bdf	Don't crash when folding construct of empty struct (#3092 ) * Don't crash when folding construct of empty struct An OpCompositeConstruct of an empty struct will be folded to a constant under normal circumstances. However, if the id limit has been reached and the constant cannot be generated, then other folding rules will be tried. These rules do not handle the case of an empty struct. We add allow it to be handled. Fixes http://crbug/1030194 * Changes based on the review.	2019-12-10 14:58:30 -05:00
Steven Perron	3ed4586044	Folding: perform add and sub on mismatched integer types (#3084 ) Fixes #3040	2019-12-02 17:51:20 -05:00
Ehsan	12e54dae16	Update Offset to ConstOffset bitmask if operand is constant. (#3024 ) Update Offset to ConstOffset bitmask if operand is constant. Fixes #3005	2019-11-11 22:35:14 -05:00
greg-lunarg	5ea7099374	Add two new simplifications. (#2984 ) Implements the following simplifications: (a - b) + b => a (a * b) + (a * c) => a * (b + c) Also adds logic to simplification to handle rules that create new operations that might need simplification, such as the second rule above. Only perform the second simplification if the multiplies have the add as their only use. Otherwise this is a deoptimization of size and performance.	2019-10-28 08:19:38 -07:00
Jakub Kuderski	e3da3143b2	Disallow use of OpCompositeExtract/OpCompositeInsert with no indices (#2980 )	2019-10-17 13:53:34 -04:00
Steven Perron	55ea57a785	Handle extract with no indexes (#2910 ) * Handle extract with no indexes It is possible that OpCompositeExtract instructions will not have any indexes. This is not handled well by scalar replacement and instruction folding. Fixes https://crbug.com/1006435 * Fix typo.	2019-09-24 16:19:31 -04:00
Steven Perron	15fc19d091	Refactor instruction folders (#2815 ) * Refactor instruction folders We want to refactor the instruction folder to allow different sets of rules to be added to the instruction folder. We might want different sets of rules in different circumstances. We also need a way to add rules for extended instructions. Changes are made to the FoldingRules class and ConstFoldingRules class to enable that. We added tests to check that we can fold extended instructions using the new framework. At the same time, I noticed that there were two tests that did not tests what they were suppose to. They could not be easily salvaged. #2813 was opened to track adding the new tests.	2019-08-26 18:54:11 -04:00
Steven Perron	5186ffedb3	Remove duplicates from list of interface IDs in OpEntryPoint instruction (#2449 ) * Remove duplicates from list of interface IDs in OpEntryPoint instruction Fixes #2002.	2019-03-13 15:46:31 -04:00
Steven Perron	dc9d155d62	Fix folding of volatile store. (#2048 ) When looking for the Volatile mask on a store, the instruction folder accesses an out-of-bounds element. We fix that up. Fixes crbug.com/903530.	2018-11-14 13:52:18 -05:00
dan sinclair	eda2cfbe12	Cleanup includes. (#1795 ) This Cl cleans up the include paths to be relative to the top level directory. Various include-what-you-use fixes have been added.	2018-08-03 15:06:09 -04:00
Alan Baker	755e5c9420	Transform to combine consecutive access chains * Combines OpAccessChain, OpInBoundsAccessChain, OpPtrAccessChain and OpInBoundsPtrAccessChain * New folding rule to fold add with 0 for integers * Converts to a bitcast if the result type does not match the operand type V	2018-07-31 13:42:47 -04:00
Alan Baker	b49f76fd62	Handle undef literal value in vector shuffle Fixes #1731 * Updated folding rules related to vector shuffle to account for the undef literal value: * FoldVectorShuffleFeedingShuffle * FoldVectorShuffleFeedingExtract * FoldVectorShuffleWithConstants * These rules would commit memory violations due to treating the undef literal value as an accessible composite component	2018-07-20 11:32:43 -04:00
Steven Perron	95b4d47e34	Fix infinite loop while folding OpVectorShuffle (#1722 ) When folding an OpVectorShuffle where the first operand is defined by an OpVectorShuffle, is unused, and is equal to the second, we end up with an infinite loop. This is because we think we change the instruction, but it does not actually change. So we keep trying to folding the same instruction. This commit fixes up that specific issue. When the operand is unused, we replace it with Null.	2018-07-13 12:43:00 -04:00
Steven Perron	63c1d8fb15	Fix size error when folding vector shuffle. (#1721 ) When folding a vector shuffle that feeds another vector shuffle causes the size of the first operand to change, when other indices have to be adjusted reletive to the new size.	2018-07-13 11:20:02 -04:00
dan sinclair	c7da51a085	Cleanup extraneous namespace qualifies in source/opt. (#1716 ) This CL follows up on the opt namespacing CLs by removing the unnecessary opt:: and opt::analysis:: namespace prefixes.	2018-07-12 15:14:43 -04:00
dan sinclair	4cc6cd184a	Pass the IRContext into the folding rules. (#1709 ) This CL updates the folding rules to receive the IRContext as a paramter instead of retrieving off of the Instruction. Issue #1703	2018-07-12 09:12:23 -04:00
Steven Perron	e63551deac	Add folding rule to merge a vector shuffle feeding another one.	2018-07-11 14:44:46 -04:00
dan sinclair	e6b953361d	Move the ir namespace to opt. (#1680 ) This CL moves the files in opt/ to consistenly be under the opt:: namespace. This frees up the ir:: namespace so it can be used to make a shared ir represenation.	2018-07-09 11:32:29 -04:00
dan sinclair	3dad1cda11	Change libspirv to spvtools namespace (#1678 ) This CL changes all of the libspirv namespace code to spvtools to match the rest of the code base.	2018-07-07 09:38:00 -04:00
dan sinclair	76e0bde196	Move utils/ to spvtools::utils Currently the utils/ folder uses both spvutils:: and spvtools::utils. This CL changes the namespace to consistenly be spvtools::utils to match the rest of the codebase.	2018-07-06 16:47:46 -04:00
Steven Perron	a45d4cac61	Move folding routines into a class The folding routines are currently global functions. They also rely on data in an std::map that holds the folding rules for each opcode. This causes that map to not have a clear owner, and therefore never gets deleted. There has been a request to delete this map. To implement this, we will create a InstructionFolder class that owns the maps. The IRContext will own the InstructionFolder instance. Then the global functions will become public memeber functions of the InstructionFolder. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1659.	2018-07-05 17:52:43 -04:00
Steven Perron	2eb9bfb5b6	Remove stores of undef. When storing an undef, any value is valid, including the one already in that memory location. So we can avoid the store.	2018-06-29 09:49:19 -04:00
Steven Perron	fe2fbee294	Delete the insert-extract-elim pass. Replaces anything that creates an insert-extract-elim pass and create a simplifiation pass instead. Then delete the implementation of the pass. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1570.	2018-06-01 10:13:39 -04:00

1 2

69 Commits