Commit Graph

12292 Commits

Author SHA1 Message Date
Unknown W. Brackets
a8a299c2e3 Fix ToRGB/ToRGBA possible accuracy loss.
It was always like this, but not used as much before.  Shifts are fast and
it eneds to sum anyway, there should not be any benefit to multiplying as
floats, and it will probably lose accuracy.
2014-03-18 22:56:27 -07:00
Henrik Rydgård
44e9484942 Merge pull request #5677 from unknownbrackets/perf
Minor perf improvements to software transform
2014-03-18 09:21:41 +01:00
Unknown W. Brackets
678237aa6c Improve SSE usage in software transform.
It's actually already pretty decent (unlike the softgpu), but there were a
few places it could use a bit of help.  Speeds up things with hardware
transform off, or areas that need to use software transform.
2014-03-17 23:05:48 -07:00
Unknown W. Brackets
416df17088 Inline From/ToRGB(A) to avoid losing SSE.
Otherwise it has to store it, which I'd like to avoid.
2014-03-17 23:03:04 -07:00
Unknown W. Brackets
1ce6bf399a Buildfix for 32-bit x86, arg. 2014-03-17 21:52:45 -07:00
Unknown W. Brackets
833c93bd98 Dumb mistake, forgot the divide.
Probably caused the blending issues.
2014-03-17 12:53:49 -07:00
Henrik Rydgård
800e2160ea Merge pull request #5675 from unknownbrackets/softgpu
Just add a packed version of Vec3f
2014-03-17 15:05:21 +01:00
Unknown W. Brackets
6630e45eff Just add a packed version of Vec3f.
This way we can have it aligned to memory where needed.  I think it'd be
better to avoid this if possible so that we can actually vectorize
spline/etc. code.

Fixes #5673.
2014-03-17 06:59:40 -07:00
Henrik Rydgård
8cd8521294 Merge pull request #5671 from unknownbrackets/perf
Optimize some 4444/8888 color conversions
2014-03-17 10:32:18 +01:00
Unknown W. Brackets
38d0bac1df Optimize some 4444/8888 color conversions.
Small performance boost in softgpu.
2014-03-17 01:21:52 -07:00
Henrik Rydgård
4bea9035ee Merge pull request #5670 from unknownbrackets/softgpu
Use SSE in the softgpu to improve perf
2014-03-17 09:13:55 +01:00
Unknown W. Brackets
6de2129f98 softgpu: Don't re-pack 8888 colors.
It's like a bad joke, but MSVC was not optimizing this out.
2014-03-16 23:03:07 -07:00
Unknown W. Brackets
10456a09ac Oops, forgot to multiply in float ToRGBA().
Not actually used...
2014-03-16 21:12:23 -07:00
Unknown W. Brackets
627027307c softgpu: Use SSE in ToRGB()/FromRGB() etc. 2014-03-16 19:21:35 -07:00
Unknown W. Brackets
07ca96e226 softgpu: Use SSE in alpha blending. 2014-03-16 18:57:11 -07:00
Unknown W. Brackets
601ff10f1e softgpu: Use SSE in tex modulation.
Could do others, this seems the most common.  Gives a few more percent.
2014-03-16 18:28:06 -07:00
Unknown W. Brackets
47728528d7 softgpu: Use SSE in Vec?::Length().
Minor perf boost but if I do everything in Vec things get slower.
2014-03-16 17:56:34 -07:00
Unknown W. Brackets
6ef0aa123f softgpu: Use SSE for the secondary color.
It's easy to speed up this code since it's so hot.
2014-03-16 16:21:12 -07:00
Unknown W. Brackets
7f3e158a0f softgpu: Get all tex samples at the same time.
Kills a bunch of overhead, improving speed more.
2014-03-16 15:51:47 -07:00
Unknown W. Brackets
d9e29a2edf softgpu: Optimize alpha blending handling.
This alone makes it a good bit faster.
2014-03-16 15:22:31 -07:00
Unknown W. Brackets
f21649e563 softgpu: Minor simplification for alpha blend. 2014-03-16 15:09:42 -07:00
Unknown W. Brackets
1ab7325d4a softgpu: Use a full Vec4 for the prim color.
Simpler, and slightly faster.
2014-03-16 15:04:41 -07:00
Unknown W. Brackets
c3530a6674 softgpu: Don't multithread small triangles.
It ends up being slower with all the overhead, of course.
2014-03-16 14:49:49 -07:00
Unknown W. Brackets
b33d0c4046 softgpu: Use SSE for texture sampling. 2014-03-16 14:33:42 -07:00
Unknown W. Brackets
b357b00ace softgpu: Use SSE for through texture coords. 2014-03-16 14:30:20 -07:00
Unknown W. Brackets
dd140b73bb softgpu: Use SSE for gouraud shading. 2014-03-16 14:29:22 -07:00
Unknown W. Brackets
743854afc8 Fix off-by-one on fast matrix loads.
May matter mostly if there's a stall right at the end of the matrix.
2014-03-15 15:23:55 -07:00
Henrik Rydgård
78ce9b3f3c Spline patches: Ignore too-small patch_div_s/t. May help #5663 2014-03-15 21:29:48 +01:00
Unknown W. Brackets
737c96eced More cstring includes, grr. 2014-03-15 12:03:33 -07:00
Unknown W. Brackets
06b9e731e8 Linux buildfix.
Arg, stupid memcmp/sprintf...
2014-03-15 11:56:52 -07:00
Henrik Rydgård
fe7a4b654f Merge pull request #5661 from unknownbrackets/warnings
Reduce a bunch of includes, fix a couple warnings
2014-03-15 19:53:49 +01:00
Unknown W. Brackets
abfa203520 Cut down on a few more includes in Core/HLE/. 2014-03-15 11:49:35 -07:00
Unknown W. Brackets
a843cbd580 Shrink the very common sceKernelThread.h include. 2014-03-15 11:44:02 -07:00
Unknown W. Brackets
d65af7353b Avoid some ChunkFile includes. 2014-03-15 11:32:57 -07:00
Unknown W. Brackets
05ab192c9c Reduce includes in Core/HLE/.
Especially templates.
2014-03-15 11:22:19 -07:00
Unknown W. Brackets
8d29b583e8 Reduce a few Core/Dialog/ includes. 2014-03-15 10:52:13 -07:00
Unknown W. Brackets
c9be2f606c Fix a type conversion warning. 2014-03-15 10:41:13 -07:00
Unknown W. Brackets
996fa39684 Reduce some unnecessary includes in Core/. 2014-03-15 10:41:07 -07:00
Unknown W. Brackets
4674ff50cc Ingore literal-suffix on Android, the sdk has it.
Just makes it hard to find real warnings.
2014-03-15 10:37:13 -07:00
Unknown W. Brackets
e465562f98 Merge pull request #5659 from thedax/androidBuildFix
Build-fix for Android.
2014-03-15 09:37:34 -07:00
The Dax
bca5b38c90 Build-fix for Android. 2014-03-15 11:32:55 -04:00
Henrik Rydgård
d791093362 Merge pull request #5658 from thedax/unixBuildFix
Fix Unix-like builds (e.g. Linux, Android, etc.).
2014-03-15 15:24:24 +01:00
The Dax
285d192e83 Further fixes. 2014-03-15 10:17:03 -04:00
The Dax
21ce99cabd Fix Unix-like builds. 2014-03-15 10:02:47 -04:00
Henrik Rydgård
923accf6ff Merge pull request #5657 from kaienfr/mp3_support
Mp3 audio support
2014-03-15 14:54:52 +01:00
kaienfr
1a90e0c7ba Update VS project files 2014-03-15 14:44:47 +01:00
kaienfr
b2b6b09072 Mp3 Audio Support.
Now, we can play mp3 audio format. :)
2014-03-15 14:40:31 +01:00
Henrik Rydgard
5705496233 Reduce the delay in __KernelVolatileTryMemLock. However, it's still too large. 2014-03-15 10:51:16 +01:00
Henrik Rydgard
b4d99b1981 Revert "Avoid caching when HW T&L with morph enabled."
This reverts commit 557eae7ca9.
2014-03-15 10:46:04 +01:00
Henrik Rydgard
c80510fb3b MemMap should not be included in MIPS.h. 2014-03-15 10:45:39 +01:00