Unknown W. Brackets
|
df1a91ee25
|
samplerjit: Correct nearest negative texture clamp.
Was not clamping to zero when negative.
|
2022-02-20 10:25:00 -08:00 |
|
Unknown W. Brackets
|
a88c9a0680
|
softgpu: Remove incorrect offsetting for X/Y.
|
2022-02-20 09:13:20 -08:00 |
|
Unknown W. Brackets
|
ad18833a4f
|
samplerjit: Fix non-SSE4 bugs in jit.
|
2022-02-15 20:13:38 -08:00 |
|
Unknown W. Brackets
|
99d7703d33
|
samplerjit: Precalculate DXT1/3/5 offsets.
This improves WALL-E by 8% overall.
|
2022-02-05 13:04:17 -08:00 |
|
Unknown W. Brackets
|
c91b51c8e1
|
samplerjit: Reduce DXT5 decode code size a bit.
|
2022-02-03 20:42:34 -08:00 |
|
Unknown W. Brackets
|
c2dd59084d
|
samplerjit: Optimize DXT calc using BMI2.
|
2022-02-01 00:18:56 -08:00 |
|
Unknown W. Brackets
|
3e4afe2a0c
|
samplerjit: Avoid RCX gymnastics with BMI2.
|
2022-01-31 22:33:09 -08:00 |
|
Unknown W. Brackets
|
4cadcea6da
|
samplerjit: Decode colors with BMI2.
This only happens with nearest, though, so very small benefit.
|
2022-01-31 22:05:34 -08:00 |
|
Unknown W. Brackets
|
1b2cf52bfe
|
samplerjit: Fix non-shared CLUT on Linux.
Oops, good that CI will catch this now - I've broken this more than once.
|
2022-01-29 22:20:46 -08:00 |
|
Unknown W. Brackets
|
26a8d498d7
|
samplerjit: Correct level lookup in nearest.
|
2022-01-29 20:29:43 -08:00 |
|
Unknown W. Brackets
|
3387ab1711
|
samplerjit: Fix reg corruption in DXT funcs.
We'd cache something in a reg, but it'd no longer be there.
|
2022-01-29 20:29:08 -08:00 |
|
Unknown W. Brackets
|
5976cad797
|
samplerjit: Reduce register waste.
A few registers were allocated longer than needed, which made spilling
to the stack more likely.
|
2022-01-29 09:47:06 -08:00 |
|
Unknown W. Brackets
|
eb70a90347
|
samplerjit: Avoid frac uv transfer to gen regs.
It should just stay in vec, this is more convenient anyway.
|
2022-01-28 23:50:54 -08:00 |
|
Unknown W. Brackets
|
99d6d569f0
|
samplerjit: Reduce transfers in nearest texel calc.
This benefits a few games, mostly where there's lots of UI or similar.
|
2022-01-24 21:28:04 -08:00 |
|
Unknown W. Brackets
|
c1e657ed47
|
samplerjit: Better vectorize UV linear calc.
Gives about 1-2% when mips are used.
|
2022-01-24 20:42:07 -08:00 |
|
Unknown W. Brackets
|
733046962f
|
samplerjit: Reuse XMM reg for sizes.
Gives just under 1% overall improvement in games using mips.
|
2022-01-24 19:01:23 -08:00 |
|
Unknown W. Brackets
|
d8c5c35b1a
|
samplerjit: Optimize texenv blending a bit.
This reduces to a single multiply, which is much faster.
|
2022-01-23 11:43:34 -08:00 |
|
Unknown W. Brackets
|
4262e657b4
|
samplerjit: Oops, forgot about 64 unpack.
Just a minor codegen tweak. Always forget there are more of these than
pack instructions.
|
2022-01-22 10:49:36 -08:00 |
|
Unknown W. Brackets
|
0425b8d630
|
samplerjit: Fix Linux stack corruption.
Oops, nearest was not using the red zone correctly.
|
2022-01-22 10:47:32 -08:00 |
|
Unknown W. Brackets
|
212e730e98
|
samplerjit: Fix some Linux register issues.
|
2022-01-22 00:14:15 -08:00 |
|
Unknown W. Brackets
|
6ec819878a
|
samplerjit: Reduce prolog/epilog spill.
Track reg usage so we only push/pop what we need.
|
2022-01-19 00:03:59 -08:00 |
|
Unknown W. Brackets
|
357e2e9d68
|
softjit: Simplify constant writes.
|
2022-01-19 00:03:59 -08:00 |
|
Unknown W. Brackets
|
c2985bca31
|
softjit: Centralize some common funcs from sampler.
No need to duplicate this code.
|
2022-01-19 00:03:59 -08:00 |
|
Unknown W. Brackets
|
0ba2d05da5
|
samplerjit: Simplify AVX shift-copies.
These have been the most common and the fallback is safe. Let's just add
a helper.
|
2022-01-17 15:15:36 -08:00 |
|
Unknown W. Brackets
|
d6fa301ab1
|
softgpu: Track CLUTs as states for binning.
This way we can have multiple CLUTs in process at once, which helps.
|
2022-01-16 08:14:09 -08:00 |
|