FEX-Emu/FEX - FEX - Gitea: Git with a cup of tea

mirror of https://github.com/FEX-Emu/FEX.git synced 2025-02-24 00:32:22 +00:00

Author	SHA1	Message	Date
Ryan Houdek	ee6debe8fd	FEXCore: Adds DividePow2 helper	2023-10-04 20:56:29 -07:00
Mai	3ba1c7912c	Merge pull request #3178 from Sonicadvance1/fix_avx_alias_precolour Minor AVX optimizations	2023-10-04 21:31:20 -04:00
Ryan Houdek	a408afaeb0	InstCountCI: Update for optimized AVX	2023-10-04 10:05:09 -07:00
Ryan Houdek	fba7c4bedc	IR/RA: Fixes register aliasing and pre-colouring for AVX This is the cause of a bunch of redundant moves that shows up in InstCountCI. Fixing this aliasing and pre-colouring issue causes a ton of 256-bit operations to become optimal.	2023-10-04 10:04:06 -07:00
Ryan Houdek	c52753e9c8	OpcodeDispatcher: Minor optimization in vzeroall Using the cached zero value is less efficient than loading it in to the register for all these cases. Lets us use rename hardware more efficiently and removes a dependency chain on a single register. Original: ``` movi v2.2d, #0x0 mov z16.d, p7/m, z2.d <... 16 more times> mov z31.d, p7/m, z2.d ``` Result: ``` movi v16.2d, #0x0 <... 16 more times> movi v31.2d, #0x0 ```	2023-10-04 10:01:13 -07:00
Ryan Houdek	e39634d314	Arm64: Fixes assert in VSQSHL/VSQSHR with SVE When Dst != Vector then we need to pass Dst in to both Zd and Zdn. Would have worked fine in a release build but assert build managed to capture it.	2023-10-04 09:59:59 -07:00
Ryan Houdek	507cf82dad	Merge pull request #3176 from neobrain/fix_thunks_unused_artifacts Thunks: Only build guest target for libfex_thunk_test if FEXLinuxTests are enabled	2023-10-04 07:07:18 -07:00
Ryan Houdek	48fa4f1121	Merge pull request #3156 from neobrain/feature_thunk_data_layout_analysis Thunks: Analyze data layout to detect platform differences	2023-10-04 07:06:49 -07:00
Tony Wasserka	e06d609bf0	Thunks: Drop unused STRUCT_VERIFIER define from CMake	2023-10-03 11:43:29 +02:00
Tony Wasserka	0a09e04e33	Thunks: Only build guest target for libfex_thunk_test if FEXLinuxTests are enabled	2023-10-03 11:43:27 +02:00
Ryan Houdek	a1a709f948	Merge pull request #3170 from Sonicadvance1/vixl_sim_instcountci InstCountCI: Enable running on x86 hosts	2023-10-02 16:38:25 -07:00
Ryan Houdek	5925eef213	Github/InstCountCI: Enables x86 runner To ensure we don't break this path for developers.	2023-10-02 16:26:14 -07:00
Ryan Houdek	df369bd6a0	InstCountCI: Enable running on x86 hosts This is a quality of life improvement for people that want to tinker with the InstCountCI but they may not necessarily have an Arm64 device available immediately for poking. As long as the vixl disassembler is enabled then the InstCountCI tests can run and get bit-accurate encodings just like on an Arm64 device. This also ensures that behaviour is consistent with or without the vixl simulator enabled which is very important when running on x86 hosts.	2023-10-02 16:26:14 -07:00
Ryan Houdek	978489fce1	InstCountCI: Explicitly disable SVE256 for one test group These instructions are specifically testing the SVE128 implementations, don't want SVE256 mucking up the instructions.	2023-10-02 16:26:14 -07:00
Ryan Houdek	d5a4d9b17f	InstCountCI: Adds option to disable cssc for tests One x87 instruction was using CSSC abs	2023-10-02 16:26:14 -07:00
Ryan Houdek	9933ef07ea	Tools: Enable indirect vixl runtime calls if simulator is used So tests can still run.	2023-10-02 16:26:14 -07:00
Ryan Houdek	6964e65660	HostFeatures: Hardcode icache and dcache line size on x86 64-byte is effectively part of x86's ABI anyway. No need to query it for our uses.	2023-10-02 16:26:14 -07:00
Ryan Houdek	11db8e7506	FEXCore: Wire up the new option to disable vixl indirect runtimes Also so it compiles without the vixl simulator enabled.	2023-10-02 16:26:12 -07:00
Ryan Houdek	b6b5e93dbb	Config: Adds an option to disable vixl sim indirect runtime calls	2023-10-02 16:23:11 -07:00
Ryan Houdek	935b3a313a	Merge pull request #3171 from Sonicadvance1/merge_dispatcher FEXCore: Merge Arm64Dispatcher in to Dispatcher	2023-10-02 16:22:36 -07:00
Tony Wasserka	fe681ab335	unittests/ThunkLibs: Specify clang resource directory when compiling test code	2023-10-02 22:18:23 +02:00
Tony Wasserka	2d9e816ff5	unittests/ThunkLibs: Add various tests for structs repacking and for void parameters	2023-10-02 22:18:23 +02:00
Tony Wasserka	b04b0549a9	unittests/ThunkLibs: Add data layout tests	2023-10-02 22:18:22 +02:00
Tony Wasserka	2b472cb962	Thunks/gen: Enforce type compatibility for function parameters	2023-10-02 22:18:22 +02:00
Tony Wasserka	7f931b5623	Thunks/gen: Add detection logic for data layout differences This runs the data layout analysis pass added in the previous change twice: Once for the host architecture and once for the guest architecture. This allows the new DataLayoutCompareAction to query architecture differences for each type, which can then be used to instruct code generation accordingly. Currently, type compatibility is classified into 3 categories: * Fully compatible (same size/alignment for the type itself and any members) * Repackable (incompatibility can be resolved with emission of automatable repacking code, e.g. when struct members are located at differing offsets due to padding bytes) * Incompatible	2023-10-02 22:18:22 +02:00
Tony Wasserka	070fa9f924	Thunks/gen: Add data layout analysis This adds a ComputeDataLayout function that maps a set of clang::Types to an internal representation of their data layout (size, member list, ...).	2023-10-02 22:18:22 +02:00
Tony Wasserka	371bf50c76	Thunks/gen: Track data types passed across architecture boundaries The set of these types is tracked in AnalysisAction, to which extensive verification logic is added to detect potential incompatibilities and to enforce use of annotatations where needed.	2023-10-02 22:18:22 +02:00
Tony Wasserka	d65d29903b	Thunks/gen: Rename EmitOutput to OnAnalysisComplete	2023-10-02 22:03:10 +02:00
Tony Wasserka	7791e0090d	Thunks: Disable 32-bit host thunks These are not supported yet.	2023-10-02 22:03:10 +02:00
Alyssa Rosenzweig	02da6d6ce7	Merge pull request #3174 from Sonicadvance1/remove_steam_appconfig AppConfig: Removes Steam config	2023-10-01 18:48:30 -04:00
Ryan Houdek	a478cbb694	AppConfig: Removes Steam config This was only required on x86 devices trying to escape the emulation. Since x86 is now remove, this is entirely unnecessary. When Steam launches applications with `/bin/sh`, this will remain under the emulation and not escape these days.	2023-10-01 08:46:53 -07:00
Ryan Houdek	3a25dd6d2b	Merge pull request #3173 from CallumDev/x87f64-fabs X87F64: Implement FABS with vector instruction	2023-10-01 01:54:11 -07:00
CallumDev	9c25db83d9	JIT: VectorOps remove extraneous element size logs	2023-10-01 15:03:21 +10:30
CallumDev	7346476546	Update InstCountCI	2023-10-01 14:41:13 +10:30
CallumDev	c42b581378	X87F64: Implement FABS with vector instruction	2023-10-01 14:39:55 +10:30
Ryan Houdek	ccfd770d9d	Merge pull request #3172 from CallumDev/x87f64-opts X87F64: Use Bfe for rounding mode, FCHS use float instruction	2023-09-30 18:41:29 -07:00
CallumDev	d4a623a3fb	InstCountCI Update	2023-10-01 11:22:18 +10:30
CallumDev	c09c25005e	X87F64: Use Bfe for rounding mode, FCHS use float instruction	2023-10-01 11:11:33 +10:30
Ryan Houdek	90570fd5f4	FEXCore: Merge Arm64Dispatcher in to Dispatcher With the removal of the x86 JIT, there is no need to have these be independent classes. Merges the Arm64Dispatcher in to the base Dispatcher class. No functional change, just moving code.	2023-09-30 09:31:55 -07:00
Mai	ab4642af38	Merge pull request #3167 from Sonicadvance1/gatherqdps unittests/ASM: Implements tests for vpgatherqd/vgatherqps	2023-09-29 12:16:43 -04:00
Mai	d94e5ce7f4	Merge pull request #3168 from Sonicadvance1/gatherqqpd unittests/ASM: Implements tests for vpgatherqq/vgatherqpd	2023-09-29 12:16:12 -04:00
Mai	dad7086fd0	Merge pull request #3166 from Sonicadvance1/gatherdqpd unittests/ASM: Implements tests for vpgatherdq/vgatherpq	2023-09-29 12:15:39 -04:00
Ryan Houdek	a21def7d74	unittests/ASM: Implements tests for vpgatherqq/vgatherqpd Similar to previous tests, vpgatherqq and vgatherqpd are equivalent instructions. So the tests are the same with the mnemonic changed. This adds tests for an additional two sets of instructions. Getting us full coverage of all eight instructions if we include the tests from PR #3167 and #3166 Tests the same things as described in #3165 In addition, since these tests use 64-bit indices for address calculation, we can easily generate and indice vector that tests overflow. So every test at every displacement ALSO gains an additional overflow test to ensure correct behaviour around pointer overflow calculation.	2023-09-29 08:04:47 -07:00
Ryan Houdek	0d8d5444a4	unittests/ASM: Implements tests for vpgatherqd/vgatherqps Similar to previous tests, vgatherqd and vgatherqps are equivalent instructions. So the tests are the same with the mnemonic changed. This adds tests for an additional two sets of instructions, Getting us up to six total over the eight if we include the tests from #3166. Tests the same things as described in #3165 In addition, since these tests use 64-bit indices for address calculation, we can easily generate and indice vector that tests overflow. So every test at every displacement ALSO gains and additional overflow test to ensure correct behaviour around pointer overflow calculation.	2023-09-29 07:20:07 -07:00
Ryan Houdek	eedfad5036	unittests/ASM: Implements tests for vpgatherdq/vgatherpq Just like the previous tests, vpgatherdq and vgatherpq are equivalent instructions. So the tests are the same except for the instruction mnemonic again. This adds unittests for two more of the eight gather instructions. Getting us up to testing four in total. Specifically this adds tests for 32-bit indices while loading 64-bit element instructions. Same thing as PR #3165 for what it tests versus doesn't.	2023-09-28 22:49:03 -07:00
Ryan Houdek	85da0f0640	Merge pull request #3165 from Sonicadvance1/gatherddps unittests/ASM: Implements tests for vpgatherdd/vgatherps	2023-09-28 22:44:38 -07:00
Ryan Houdek	9a01b440e3	unittests/ASM: Implements tests for vpgatherdd/vgatherps vpgatherdd and vgatherps are effectively the same instructions, so the tests are the same except for the instruction mnemonic. This adds unit tests for two of the eight gather instructions. Specifically this adds tests for the 32-bit indices loading 32-bit elements instructions. What it tests: - Tests all displacement scales - Tests multiple mask arrangements - Ensures the mask register is zero'd after the instruction What it doesn't test: - Doesn't test address size calculation overflow - Only would happen on 32-bit with 32-bit indices, or /really/ high base addresses - The instruction should behave as a mask to the address size - Effectively behaves like `(uint64_t)(base + index << ilog2(scale))` - Better idea is to just not expose AVX to 32-bit applications - Doesn't test VSIB immediate displacement - This just ends up being base_addr + imm so it isn't too interesting - We can add more tests in the future if we think we messed that up - Doesn't test partial fault behaviour - Because that's a nightmare. Specifically keeps each instruction test small and isolated so if a single register fails it is very easily to nail down which operation did it. I know some of our ASM tests do a chunk of work and spit out a result at the end which can be difficult to debug in some cases. Didn't want to do that which is why the tests are spread out across 16 files for these single class of instructions.	2023-09-28 19:58:34 -07:00
Ryan Houdek	228ee7fa47	TestHarnessRunner: Support AVX2 flag detection	2023-09-28 19:58:34 -07:00
Ryan Houdek	98789a8039	FEXCore: Implement support for AVX2 feature detection	2023-09-28 19:57:08 -07:00
Ryan Houdek	14398742c3	Merge pull request #3164 from neobrain/fix_thunks_asan Thunks: Fix AddressSanitizer build	2023-09-28 12:05:55 -07:00

1 2 3 4 5 ...

7973 Commits