Unknown W. Brackets
27550a2336
Update some flags in GLES_GPU.
...
It's useful to know if we need to have a correct PC value or downcount.
2014-03-29 14:10:35 -07:00
Unknown W. Brackets
18f6407705
Clamp morph colors in non-vertexjit.
...
Not sure if any platforms use this...
2014-03-29 01:53:28 -07:00
Unknown W. Brackets
69c2500dc6
Make sure vai->flags are set while hashing.
...
Since we decode the verts in this case we have a fresh flag. Might
be #5718 ?
2014-03-25 08:19:38 -07:00
Unknown W. Brackets
61f5d3d360
Initial stab at tracking vertex alpha.
...
Not sure what efficient method to use on x86...
2014-03-24 01:26:55 -07:00
Unknown W. Brackets
996fa39684
Reduce some unnecessary includes in Core/.
2014-03-15 10:41:07 -07:00
Henrik Rydgård
79549e4acc
PosS16Through: Treat Z as unsigned.
2014-02-13 11:59:47 +01:00
Unknown W. Brackets
ad8488ecc4
Reset reporting on a new game.
...
This way "once" and "spam" no longer persist between runs. Makes it
easier to test a bunch of games in a row without restarting in between.
2014-02-09 17:39:16 -08:00
Unknown W. Brackets
4f585fd7a6
Fix a few minor typos.
2014-01-11 22:59:52 -08:00
Henrik Rydgard
e3e9408b87
Resurrect the vtx decoder jit option for debugging
2013-11-30 16:11:54 +01:00
Henrik Rydgard
eea8413a82
Make the texcoord speedhack a bit less aggressive at eliminating uvscaleoffset uniform updates.
...
Should fix #4688 and the new problem in #2919 while keeping Frontier Gate Boost working.
2013-11-30 15:54:20 +01:00
Henrik Rydgard
030e6460cc
ARM: NEON-optimize software skinning
2013-11-24 18:03:42 +01:00
Henrik Rydgard
acb3994172
Split VertexDecoder into files for ARM and x86, got too large.
2013-11-24 15:58:15 +01:00
Henrik Rydgard
dfea160491
ARM: Use PLD (cache preload) in vertex decoder loop.
2013-11-24 15:08:47 +01:00
Henrik Rydgard
f650b23c90
ARM: Add NEON widening and narrowing moves, and float/int convert.
...
Experiment a little in the vertex decoder.
2013-11-24 13:30:28 +01:00
Henrik Rydgard
e3d471f590
Fix issues with texturing from render targets with prescaled UV (texcoord speed hack)
2013-11-19 23:38:36 +01:00
Henrik Rydgård
f1ed1a9a17
Flush the lit pool before flushing the icache.
2013-11-15 11:14:24 +01:00
Henrik Rydgård
bd23f91e0c
Flush the lit pool in the vtx dec
2013-11-15 10:34:27 +01:00
Henrik Rydgard
adde016338
vtxdec sw skin arm: Simplify code, try a slightly different approach.
2013-11-14 23:07:23 +01:00
Henrik Rydgard
d6d54c8bb4
Fix a couple of bugs in vtxdec
2013-11-14 15:06:51 +01:00
Henrik Rydgard
871d8b28ab
Assorted vtxdec x86 optimizations
2013-11-14 11:44:13 +01:00
Sacha
eaa52c4a61
VertexDecoder: Aligned code seems to help Symbian (armv6) performance. It is also used for CPU JIT so there's likely a reason.
2013-11-14 15:30:07 +10:00
Henrik Rydgard
ff1d53c9c7
vtxdec x86: Don't load bones unless software skinning is enabled
2013-11-14 00:56:24 +01:00
Henrik Rydgard
a9b365aa71
Fix for possible misalignment crash on x86
2013-11-14 00:37:04 +01:00
Henrik Rydgård
9bbdd1907d
Kind-of optimized ARM software skinning (non-NEON)
2013-11-13 18:11:35 +01:00
Henrik Rydgård
821a2f10f8
Delete obsolete code
2013-11-13 18:10:58 +01:00
Henrik Rydgard
6221dbaf5d
Optimize software skinning for x86.
...
Can't seem to get a win on Windows vs hardware skinning though, even though
draw calls drop by 2/3rd...
2013-11-13 18:10:58 +01:00
Henrik Rydgard
9333d3ea76
Vtx dec jit: Combine the scale and offset registers to save 1 more xmm register.
2013-11-13 18:10:58 +01:00
Henrik Rydgard
4f78eda23b
Save a couple of registers in the x86 vertex decoder jit by SIMD-ing prescale UV
2013-11-13 18:10:57 +01:00
Henrik Rydgard
7e67476b00
Simple unoptimized software skinning.
...
Does not take advantage of the possible reduction in state changes yet.
2013-11-13 18:10:57 +01:00
Henrik Rydgård
ab3fe9ba86
Extract the software transform code into its own file.
2013-11-13 14:56:34 +01:00
Henrik Rydgard
63334698e1
Add temporary setting to disable the vertex decoder jit while we debug it
2013-11-09 18:16:26 +01:00
Henrik Rydgard
0e542b6ecc
vtxdec: small omission
2013-11-08 20:38:49 +01:00
Henrik Rydgard
22d6b36005
Vertexdecoder "double": fix for x86 and very minor optimization for arm.
2013-11-08 20:03:28 +01:00
Henrik Rydgard
6b45c321b6
vtxdecjit: turn off excessive logging
2013-11-08 18:49:17 +01:00
Henrik Rydgard
381b6d0f05
VertexDecoder JIT: Add the last missing ones except morph, I think.
2013-11-08 12:43:46 +01:00
Henrik Rydgard
c4e3dd14fd
Add commented-out code to save XMM4/XMM5.
...
According to all calling convention manuals I can find, we don't really
need to preserve them. If they become problematic as mentioned, we can
activate this.
2013-11-07 09:54:58 +01:00
Henrik Rydgard
b203da05e9
Prescale UV in vtx-dec-jit: Fix bugs, add ARM support
2013-11-07 01:24:53 +01:00
Henrik Rydgård
367bcf6d4f
Prescale in the vertex dec jit. Needs debugging.
2013-11-07 01:24:53 +01:00
Unknown W. Brackets
34398b7d0c
Avoid literal loads in the arm vertexjit.
2013-11-06 08:45:00 -08:00
Henrik Rydgård
51995a3d43
Vtx dec: After generating ARM, remember to flush the icache.
...
Will hopefully fix the random crashes in #4461 .
2013-11-06 16:14:40 +01:00
Henrik Rydgård
e3f6f25390
Buildfix for non-Windows non-ARM
2013-11-06 13:54:26 +01:00
Henrik Rydgård
ea9da85bdb
Missed one possible unaligned access
2013-11-06 13:14:49 +01:00
Henrik Rydgård
b3fdfc01c8
ARM vtx dec: Avoid all unaligned accesses entirely.
...
Seeing so much contradictory information on the support and performance
of these.
2013-11-06 12:17:41 +01:00
Henrik Rydgård
1e158fa652
ARM vtx dec: Preserving our FP scratch register appears to improve
...
stability.
Also added some logging.
2013-11-06 11:47:26 +01:00
Henrik Rydgård
b19d41f9a8
Now that LDRH works, use it where appropriate
2013-11-06 10:51:21 +01:00
Henrik Rydgard
0eb3d79de9
x86 VertexDecoder jit: Fix typo in 16-bit weight decoder. May fix crashes.
2013-11-05 21:06:43 +01:00
Henrik Rydgård
7bf8a4dc5e
We need to use a signed VCVT to float in PosS*Through
2013-11-05 12:17:18 +01:00
Unknown W. Brackets
c7edf73cdb
Small optimizations to the vertexjit.
2013-11-05 00:32:08 -08:00
Unknown W. Brackets
f6662054bd
Fix arm emitter bug in LDRH and friends.
2013-11-05 00:32:08 -08:00
Henrik Rydgård
5886ccffdc
Merge pull request #4441 from unknownbrackets/vertex-decoder-jit
...
Force 5 byte jumps to avoid jump target issues (vertex jit, x86)
2013-11-04 01:25:40 -08:00