Ryan Houdek
4afc7adb05
unittests: Fixes vpblend unittest
...
This typo caused undefined data to be used in the unittest, which showed
up in debug builds.
2024-08-10 07:43:25 -07:00
Mai
4882f10536
Merge pull request #3888 from Sonicadvance1/avx128_optimize_blends
...
AVX128: Optimize blends
2024-08-07 17:08:19 -04:00
Ryan Houdek
230bde6aef
InstcountCI: Adds vpermq coverage
2024-08-06 09:08:30 -07:00
Ryan Houdek
c24d7aacba
unittests/ASM: Adds vpermq test that covers all immediate encodings
...
To ensure we cover all tests when optimizing.
2024-08-06 09:08:30 -07:00
Billy Laws
af7462ee6a
unittests: Add test using the address-override flag with segment addressing
2024-07-31 20:04:30 +01:00
Paulo Matos
5bbbced1bd
Fix nasm warning in Rounding.asm
2024-07-31 16:23:39 +02:00
Paulo Matos
3332220d06
instcountci: Intersperse flag retrieval and FSW insertion
2024-07-31 12:05:17 +02:00
Paulo Matos
aee8c9def2
instcountci: Reuse Top in ReconstructFSW_Helper
2024-07-31 11:57:19 +02:00
Paulo Matos
9a8e7eaace
instcountci: Add instcountci for fnstsw from fast path
2024-07-31 09:59:32 +02:00
Paulo Matos
4227e012aa
Add instcountci for fnstsw from fast path
2024-07-31 09:59:28 +02:00
Ryan Houdek
d24d0a95a0
Merge pull request #3894 from pmatos/RoundingModeTests
...
ASM Tests: X87 Rounding modes
2024-07-29 23:26:52 -07:00
Ryan Houdek
c2092bfed0
Merge pull request #3893 from pmatos/FNINITFix
...
Fix call to FNINITF64 and refactor
2024-07-26 13:25:49 -07:00
Paulo Matos
9af7ee6bd2
instcountci: Fix call to FNINITF64 and refactor
2024-07-26 14:56:10 +02:00
Paulo Matos
b1ec50c7c2
Test running scripts tell ctest of skipped tests
...
CMake sets 125 as the skipped test exit code that the scripts use.
2024-07-26 14:04:54 +02:00
Paulo Matos
ceac38a6ac
ASM Tests: X87 Rounding modes
2024-07-26 10:07:58 +02:00
Ryan Houdek
4634688aca
InstcountCI: Update for AVX128 blends
2024-07-23 19:24:19 -07:00
Ryan Houdek
dd3e3ed189
unittests/ASM: Implements a vpblendw test
...
Runs through all immediate encodings for vpblendw and CRCs the results
to ensure correct behaviour. This was just a concern because of the typo
in the documentation, but it is also good to have.
2024-07-23 19:24:19 -07:00
Ryan Houdek
8ebf049fb9
InstcountCI: Update for Scalar FMA with AFP
2024-07-22 12:58:20 -07:00
Paulo Matos
39bc2a82c1
instcountci: X87 Pass and refactoring
2024-07-22 08:50:01 +02:00
Paulo Matos
774325dcf2
Tests: X87 Refactoring and Pass
2024-07-22 08:44:45 +02:00
Paulo Matos
a1378f94ce
X87 Code Refactoring and Optimization Pass
2024-07-22 08:44:45 +02:00
Alyssa Rosenzweig
592d6cc43f
InstCountCI: Update
...
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
2024-07-21 15:50:10 -04:00
Ryan Houdek
f8c6baae97
Merge pull request #3883 from Sonicadvance1/implement_daz
...
Arm64: Implements support for DAZ using AFP.FIZ
2024-07-21 10:03:34 -07:00
Ryan Houdek
56df57e980
InstcountCI: Update
2024-07-20 17:26:27 -07:00
Ryan Houdek
4fffe68f81
InstcountCI: Update
2024-07-20 15:57:01 -07:00
Ryan Houdek
ae9312bdab
unittests: Implements a DAZ test
...
Specifically does a vector add with and without DAZ enabled and ensures
the value is different when the source values contain a denormal.
2024-07-20 15:34:54 -07:00
Ryan Houdek
b78da2e5ad
Arm64: Implements support for DAZ using AFP.FIZ
...
When AFP is supported, we can actually support DAZ. This might also
fix the audio corruption in Animal Well, but I can't test it until Steam
is running on Oryon. Requires a bit of plumbing for MXCSR, which we were
hacking around before, but now we actually want to store the value.
Fixes #3856
2024-07-20 15:34:54 -07:00
Alyssa Rosenzweig
e4b7a65a49
Merge pull request #3880 from pmatos/InstCountMemcpy
...
Add x87 memcpy instcountci tests
2024-07-19 08:53:23 -04:00
Paulo Matos
c77a707dbe
Add x87 memcpy instcountci tests
2024-07-19 09:09:34 +02:00
Ryan Houdek
0b829d2c46
unittests: Adds a test for full pshufd imm coverage
2024-07-18 04:13:03 -07:00
Ryan Houdek
bddb533fa0
InstcountCI: Add some more of the cases
2024-07-18 04:13:03 -07:00
Ryan Houdek
c7254e31ed
InstcountCI: Update for VPERM/VPSHUFD improvements
2024-07-18 04:10:58 -07:00
Ryan Houdek
da51169ba9
Merge pull request #3875 from alyssarosenzweig/ir/gethostflag
...
IR: garbage collect premature F80Cmp optimizations
2024-07-17 03:05:48 -07:00
Alyssa Rosenzweig
7546160811
InstCountCI: Update
...
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
2024-07-16 14:53:58 -04:00
Alyssa Rosenzweig
19e58cac62
InstCountCI: Update
...
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
2024-07-16 13:54:28 -04:00
Paulo Matos
9017cd14c8
Remove unused function MmapOverride
2024-07-16 11:16:07 +02:00
Paulo Matos
ed87c01470
Simplify Disabled_Tests and remove pr57275 from failures
...
Disabled_Tests was mostly a copy of Known_Failures. Leave only the race
condition on SIGPROF test (mcount_pic.c).
Also remove pr57275.c from known failures. It passes now that we support
AVX.
Also, if a test is disabled, just skip it.
2024-07-13 07:08:02 +02:00
Ryan Houdek
d79b7fcc49
Merge pull request #3808 from alyssarosenzweig/rclse/3
...
Try to delete RCLSE again
2024-07-12 20:38:06 -07:00
Ryan Houdek
3d90d1ab4f
InstcountCI: Update for vmovq fix
2024-07-11 18:34:06 -07:00
Ryan Houdek
3c7318d7c8
AVX128: Fixes vmovq loading too much data
...
This was doing a 128-bit load from memory and then a 64-bit zero extend,
which looked like a spurious move, but it was trying to match the
behaviour of vmovq where it needed the zero extend.
Also adds a unit test to ensure that we aren't loading too much data by
loading right up against a page boundary.
Fixes #3787
2024-07-11 18:34:05 -07:00
Mai
e25918d846
Merge pull request #3858 from Sonicadvance1/implement_nt_load
...
Implement support for SSE4.1/AVX NT loads
2024-07-11 14:22:41 -04:00
Alyssa Rosenzweig
d78b0ea435
InstCountCI: Update
...
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
2024-07-11 13:21:14 -04:00
Ryan Houdek
3ff1ff8f74
InstcountCI: Update for svebitperm
2024-07-11 01:46:35 -07:00
Ryan Houdek
31f82c1d96
InstcountCI: Update for SVE NT load support
2024-07-10 23:07:58 -07:00
Alyssa Rosenzweig
5e0952159d
unittests: add test for a MMX register cache bug
...
This failed on an earlier version of the register cache.
Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
2024-07-10 11:34:24 -04:00
Ryan Houdek
72d6c8ebd6
Merge pull request #3820 from alyssarosenzweig/ir/drop-deferred
...
Drop deferred flag infrastructure
2024-07-09 17:06:25 -07:00
Alyssa Rosenzweig
3ef9ea94e5
Merge pull request #3848 from pmatos/FTSTX87Tests
...
Tests for X87 FTST
2024-07-09 09:10:29 -04:00
Paulo Matos
381ce23fd7
Tests for X87 FTST
2024-07-09 13:36:16 +02:00
Mai
af6a0be832
Merge pull request #3842 from Sonicadvance1/fix_f64_to_i32
...
VCVT{T,}PD2DQ fixes and optimization
2024-07-09 03:49:31 -04:00
Ryan Houdek
287fe5beac
InstcountCI: Update
2024-07-09 00:38:48 -07:00