FEX-Emu/FEX

mirror of https://github.com/FEX-Emu/FEX.git synced 2025-02-07 15:09:05 +00:00

Author	SHA1	Message	Date
Paulo Matos	ca31a0404c	ConstProp should generate 32bit constants when required	2024-01-29 17:15:47 +00:00
Mai	750b0b70bc	Merge pull request #3356 from Sonicadvance1/modify_code_lock Jitarm64: Implements spin-loop futex for JIT blocks	2024-01-23 13:46:59 -05:00
Ryan Houdek	56d8080ec9	Merge pull request #3345 from Sonicadvance1/fix_syscall_registers OpcodeDispatcher: Fixes syscall rcx/r11 generation	2024-01-22 15:21:13 -08:00
Ryan Houdek	c0be974272	Merge pull request #3368 from bylaws/preprcr FEXCore: Fix RCL/RCR shift wraparound behaviour	2024-01-21 13:44:49 -08:00
Billy Laws	e323938173	FEXCore: Fix RCL/RCR shift wraparound behaviour This ends up being cleaner to handle outside of CalculateFlags_ShiftVariable as constant masking is only needed for RCL/RCR.	2024-01-21 18:15:50 +00:00
Billy Laws	407e26bfee	FEXCore: Use TMP1-4 for values that need preserving across spills The ARM64EC SRA layout will use x0-3 for x86_64 registers, as such any arguments passed to C ABI functions need to proxy their arguments through the temporaries and move as appropriate.	2024-01-21 16:21:13 +00:00
Ryan Houdek	a6c57f71e9	SpinWaitLock: Fixes potential extra wait that would occur on contended lock We had a chance of doing an additional bogus wfe if the expected value was hit in one iteration of a loop. Not the biggest problem on current hardware where WFE only ever sleeps for 1-4 system cycles, but on future hardware where WFE might actually sleep for longer, this could have been an issue.	2024-01-17 10:41:16 -08:00
Ryan Houdek	2af7e997f4	Spinlocks: Fix assembly Need to have a source be +r so it doesn't get overwritten.	2024-01-17 10:19:38 -08:00
Ryan Houdek	ab6c00bbcf	FEXCore/Utils: Rename FutexSpinWait to SpinWaitLock	2024-01-17 10:19:38 -08:00
Ryan Houdek	e18453cb57	Jitarm64: Implements spin-loop futex for JIT blocks This will ensure that multiple concurrent SIGBUS handlers in the same code block don't modify the same code.	2024-01-17 10:19:38 -08:00
Ryan Houdek	39f49782da	Arm64: Move ParanoidTSO checks up out of the non-paranoid code path	2024-01-17 10:19:38 -08:00
Ryan Houdek	2c5dd20f3c	FutexSpinWait: Implement spin-loop Unique mutex.	2024-01-17 10:19:38 -08:00
Ryan Houdek	136fa78825	FEXCore: Implements an efficient spin-loop API This will only be used internally inside of FEXCore for efficient shared codecache backpatch spin-loops.	2024-01-17 10:19:38 -08:00
Ryan Houdek	f956f008ea	Merge pull request #3372 from alyssarosenzweig/opt/cmpxchg-review Optimize GPR cmpxchg	2024-01-15 05:11:12 -08:00
Ryan Houdek	1f7a619c79	OpcodeDispatcher: Fixes syscall rcx/r11 generation Noticed this while writing #3342. Fixes #3343 The syscall instruction is defined in the documentation that it will set RCX to the next instruction's RIP and R11 to be RFLAGS. We entirely skipped this which I noticed while writing unit tests. Adds unittests to test both 32-bit and 64-bit behaviour because our helper shares code with both. I don't know if anything actually relied on this behaviour but we should definitely support it.	2024-01-12 19:14:30 -08:00
Alyssa Rosenzweig	58127bd0e8	OpcodeDispatcher: optimize trivial cmpxchgs Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2024-01-12 12:23:34 -04:00
Alyssa Rosenzweig	e8945dfb6d	OpcodeDispatcher: optimize gpr cmpxchg NZCV stuff. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2024-01-12 12:03:28 -04:00
Ryan Houdek	8c3163096b	Merge pull request #3363 from Sonicadvance1/fix_label_allocations ArmEmitter: Support single use forward labels	2024-01-12 00:26:31 -08:00
Ryan Houdek	615cfe0246	Merge pull request #3361 from Sonicadvance1/decompose_std_function FEXCore: Decompose some std::function usage to regular pointers	2024-01-10 16:55:29 -08:00
Ryan Houdek	3d5f876585	Fixes some new glibc allocations that cropped up I guess this was handled by brk things before.	2024-01-09 13:55:04 -08:00
Ryan Houdek	37102400b5	Arm64: Switches uses of forward label over to SingleUse if possible Primary goal for this is to ensure that the delinker doesn't need to allocate any memory. This delinker can end up getting hit heavily with JIT code so we don't want it to be allocating memory.	2024-01-08 22:18:20 -08:00
Ryan Houdek	c01e6283ae	CodeEmitter: Support a single use forward label Currently all uses of the forward label calls in to jemalloc to allocate memory. This allows a forward label that doesn't require any memory allocation, which is the common case in FEX.	2024-01-08 22:18:20 -08:00
Ryan Houdek	248dc97993	FEXCore: Decompose some std::function usage to regular pointers The delinker step of the JIT was using std::function with capture lambdas that required memory allocation when unnecessary. Because the compiler can't see through our std::function usage it could never decompose these by itself. By passing the Thread's frame and record to the function as arguments then we can have the signature be a raw function pointer. This fixes an area of concern from: https://github.com/FEX-Emu/FEX/blob/main/docs/ProgrammingConcerns.md#stdfunction-and-lambdas	2024-01-06 19:39:54 -08:00
Ryan Houdek	d488592eda	Merge pull request #3339 from Sonicadvance1/pass_thread_unaligned_fault_handler FEXCore: Pass thread object to HandleUnalignedAccess	2024-01-04 18:20:37 -08:00
Ryan Houdek	743df8dfae	Merge pull request #3327 from Sonicadvance1/remove_syscall_indirection Arm64: Removes a vtable indirection in syscalls	2024-01-04 18:19:40 -08:00
Ryan Houdek	4b3792196f	Merge pull request #3303 from Sonicadvance1/initial_runtime_longmode_switch OpcodeDispatcher: Initial support for runtime long-mode switch	2024-01-04 18:17:54 -08:00
Ryan Houdek	db7d7a6bd7	Merge pull request #3349 from Sonicadvance1/revert_frontend_ownership Revert "FEXLoader: Moves thread management to the frontend"	2024-01-03 14:25:04 -08:00
Alyssa Rosenzweig	04a88ed3ab	Merge pull request #3353 from Sonicadvance1/public_interface_cleaning FEXCore interface cleaning	2024-01-03 15:14:54 -04:00
Alyssa Rosenzweig	9da08b40bd	Merge pull request #3344 from Sonicadvance1/xbyak_upstream Externals: Update xbyak to v7.02 and switch away from fork	2024-01-03 15:13:58 -04:00
Alyssa Rosenzweig	5467c3e478	Merge pull request #3357 from Sonicadvance1/remove_non_sra FEXCore: Removes SRA option, it's now permanently enabled	2024-01-03 15:10:04 -04:00
wannacu	4e7bab849c	JIT: Fixes broken register in VTBX1 If the Dst register is allocated as VectorIndices or VectorTable, using Dst as an operand to perform the tbx operation will result in an error. For example: %131(FPR0) i128 = LoadNamedVectorIndexedConstant u8:Tmp:RegisterSize, #0x6, #0xaa0 %132(FPR0) i128 = VTBX1 u8:Tmp:RegisterSize, %129(FPRFixed6) i32v4, %126(FPRFixed10) i16v8, %131(FPR0) i128 Since the tbx instruction's destination register is also the original operand, this is consistent with the semantics of VTBX1. Therefore, directly using VectorSrcDst as the destination operand for the tbx instruction is safe.	2023-12-29 16:18:40 +08:00
Ryan Houdek	d098545c20	FEXCore: Removes SRA option, it's now permanently enabled	2023-12-28 18:28:02 -08:00
Ryan Houdek	5358af7794	Revert "FEXLoader: Moves thread management to the frontend" This reverts commit 58f26939548cba8b26ec4771aeed6b54e3700046.	2023-12-27 04:33:50 -08:00
Ryan Houdek	25bcddf3a5	FEXCore: Removes context wide and map lookup While locking a shared_lock and doing an empty table lookup is fairly fast, just remove them from the hot path entirely if no custom IR handlers are installed. This is only used for our IRLoader, which is losing its importance significantly and should probably be removed anyway.	2023-12-26 11:11:44 -08:00
Ryan Houdek	f785b38e4d	Merge pull request #3352 from Sonicadvance1/remove_irloader Removes IRLoader, unittests, and public interface	2023-12-26 11:08:26 -08:00
Ryan Houdek	b115c144fb	FEXCore: Removes NetStream from public API Only used by GDBServer. NFC.	2023-12-25 07:07:17 -08:00
Ryan Houdek	d8f20751fe	FEXCore: Moves IREmitter from the public API to backend No functional change	2023-12-25 07:00:29 -08:00
Ryan Houdek	1977747fc2	Removes IRLoader, unittests, and public interface This unit test hasn't really served any purpose for a while now and mostly just causes pain when reworking things in the IR. Just remove the IRLoader, its unit tests, the github action steps and the public FEXCore interface to it. Since it isn't used by anything other than Thunks. Also moves some IR definitions from the public API to the backend.	2023-12-25 07:00:29 -08:00
Ryan Houdek	257016bf12	FEXCore: Moves BucketList out of public API NFC	2023-12-25 06:58:22 -08:00
Ryan Houdek	69d65fba4a	FEXCore: Removes unused SyscallVisitor This was expected to be part of the syscall optimizations we did but ended up getting manifested in a different way. Remove it.	2023-12-25 06:42:11 -08:00
Ryan Houdek	bce694ebb5	FEXCore: Moves BitUtils to FHU No functional change	2023-12-25 06:38:51 -08:00
Ryan Houdek	5d37d5db1a	FEXCore: Optimize HostFeatures and CPUID feature calculation Need #3348 merged first. As I was casually thinking, this code made me realize that it was quite branch heavy and could likely be optimized to logic. The previous code generated some fairly nasty branch heavy code. This can be optimized to be branchless and take roughly five instructions per flag. Using a bitfield for each feature would turn each calculation in to 3-4 instructions but that seems overkill. Very minor thing.	2023-12-25 04:58:15 -08:00
Ryan Houdek	4d109c9ce0	Config: Fixes parsing strenum inside of json files This wasn't wired up before.	2023-12-23 22:32:59 -08:00
Ryan Houdek	db9b326534	FEXCore: Support disabling CPUID features based on config Need to be able to disable sha by config.	2023-12-23 22:32:29 -08:00
Ryan Houdek	1c34b25538	FEX: Removes legacy kernel 32-bit allocator We only used this so that our Xavier CI systems, which were running old kernels, could run unit tests. We have now removed the Xaviers from CI and this is no longer necessary. Stop pretending that we support kernels older than 5.0 and allowing this fallback. The 32-bit allocator is still used for the MAP_32BIT mmap flag, so the load bearing code can't be fully removed. Just remove the config and the frontend things using it.	2023-12-21 06:21:01 -08:00
Ryan Houdek	38ad3f0e05	FEXCore: Pass thread object to HandleUnalignedAccess Currently no functional change but public API breaks should come early. The thread state object will be used for looking up thread specific codebuffers in the future when we support MDWE with code mirrors.	2023-12-21 01:55:25 -08:00
Ryan Houdek	266f7feecb	Arm64: Removes a vtable indirection in syscalls We can safely call virtual functions through the JIT with a little bit of work. FEX's JIT has quite a few steps before it gets to a syscall handler. Before this commit: JIT->static HandleSyscall->SyscallHandler::HandleSyscall->SyscallHandler After this commit: JIT->SyscallHandler::HandleSyscall->SyscallHandler A bit hard to notice this when this interface can spin at 67-million calls per second though.	2023-12-21 01:55:02 -08:00
Ryan Houdek	f9902142f7	Utils: Add ability to get VTable entries to PMF helper This will be useful to remove an indirection.	2023-12-21 01:55:02 -08:00
Ryan Houdek	9e5d7aa5fe	OpcodeDispatcher: Initial support for runtime long-mode switch This has the Frontend and OpcodeDispatcher select their operating mode depending on the incoming code segment long-mode flag. Adds some asserts since currently it is unexpected if the configuration changes at runtime. This is fairly straightforward for an initial setup but isn't fully fleshed out. Right now FEX's x86 tables aren't setup in a way to support choosing a different instruction decoding depending on runtime operating mode change, so that would break in interesting ways. Primarily this just gets FEX setup to start piping the operating mode through from the frontend to the backend. This is a long term task, so it is going to take a long time to iron out all the issues.	2023-12-21 01:54:19 -08:00
Ryan Houdek	8648fb1485	FEXCore: Accurately store segment descriptors Previously we were only storing the 32-bit base address which isn't actually how segment descriptors work. In reality segment descriptors are 64-bit descriptors that are laid out in a particular layout depending on the 4-bit type value. In reality we only care about code and data segment layouts since the rest are bonkers. Describe these descriptors correctly and setup a default code descriptor for the operating mode that FEX is starting in.	2023-12-21 01:54:18 -08:00
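
Commit 1f7a619c79 above notes that the syscall instruction sets RCX to the next instruction's RIP and R11 to RFLAGS. The following is a minimal sketch of just those two register writes, not FEX's actual implementation: `GuestState`, `ApplySyscallSideEffects`, and `kSyscallInstructionSize` are hypothetical names invented for illustration, and the sketch assumes the 64-bit case with the two-byte `0F 05` encoding. It leaves out everything else the instruction does.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical guest register file for illustration only.
// FEX's real CPU state layout is different.
struct GuestState {
  uint64_t rip;     // address of the SYSCALL instruction being emulated
  uint64_t rcx;
  uint64_t r11;
  uint64_t rflags;
};

// Assumption: the SYSCALL opcode is encoded as the two bytes 0F 05.
constexpr uint64_t kSyscallInstructionSize = 2;

// Applies the two register side effects described in the commit message:
// RCX receives the RIP of the instruction after SYSCALL, and
// R11 receives the current RFLAGS value.
inline void ApplySyscallSideEffects(GuestState& state) {
  state.rcx = state.rip + kSyscallInstructionSize;
  state.r11 = state.rflags;
}

// Usage example: a SYSCALL located at 0x1000 with RFLAGS = 0x246.
inline void Example() {
  GuestState state{0x1000, 0, 0, 0x246};
  ApplySyscallSideEffects(state);
  assert(state.rcx == 0x1002);  // next instruction after the 2-byte opcode
  assert(state.r11 == 0x246);   // copy of RFLAGS
}
```

A guest program that reads RCX or R11 after the syscall returns would see these values, which is the behaviour the commit's unit tests are described as covering.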

809 Commits