mirror of
https://github.com/FEX-Emu/FEX.git
synced 2025-01-10 15:50:18 +00:00
40e073c8b2
Causes the dispatcher to drop from 4476 bytes down to 3900 for SVE-256bit supporting targets. This is done by significantly reducing SVE loadstore ops. Going from 8 instructions per 4 registers, down to 2 instructions. This is done by switching from 1 register loadstore instructions up to 4 register loadstore instructions. Which should significantly improve performance on future SVE platforms. Filling and Spilling to the context is still using the old code path because SVE doesn't offer non-interleaving loadstores. Spilling and filling on the stack is fine because we don't need to match context state.