FFmpeg

mirror of https://github.com/xenia-project/FFmpeg.git synced 2024-11-24 03:59:43 +00:00

Author	SHA1	Message	Date
Christophe Gisquet	4cb6964244	dcadec: simplify decoding of VQ high frequencies The vector dequantization has a test in a loop preventing effective SIMD implementation. By moving it out of the loop, this loop can be DSPized. Therefore, modify the current DSP implementation. In particular, the DSP implementation no longer has to handle null loop sizes. The decode_hf implementations have following timings: For x86 Arrandale: C SSE SSE2 SSE4 win32: 260 162 119 104 win64: 242 N/A 89 72 The arm NEON optimizations follow in a later patch as external asm. The now unused check for the y modifier in arm inline asm is removed from configure.	2014-02-28 13:03:22 +01:00
Janne Grunau	5c1c6e8226	dca: include dcadsp.h in {arm,x86}/dca.h for checkheaders	2014-02-08 13:38:36 +01:00
Christophe Gisquet	2bd44cb705	dcadsp: add int8x8_fmul_int32 to dsp context It is currently declared as a macro who is set to inlinable functions, among which a Neon and a default C implementations. Add a DSP parameter to each inline function, unused except by the default C implementation which calls a function from the DSP context. On an Arrandale CPU, gain for an inlined SSE2 function vs. a call: - Win32: 29 to 26 cycles - Win64: 25 to 23 cycles Signed-off-by: Janne Grunau <janne-libav@jannau.net>	2014-02-07 22:51:59 +01:00
Christophe Gisquet	b6293e2798	fmtconvert: Explicitly use int32_t instead of int Signed-off-by: Martin Storsjö <martin@martin.st>	2013-07-17 11:02:47 +03:00
Diego Biurrun	76b19a3984	Fix a number of incorrect intmath.h #includes.	2013-02-26 00:51:34 +01:00
Mans Rullgard	a7831d509f	arm: use HAVE*_INLINE/EXTERNAL macros for conditional compilation These macros reflect the actual capabilities required here. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-12-07 16:54:03 +00:00
Mans Rullgard	f7de52354f	ARM: dca: disable optimised decode_blockcodes() for old gcc Old gcc versions have trouble compiling this function, and no simple, targeted test is possible. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-12-15 01:02:58 +00:00
Mans Rullgard	00a856e3f9	dca: ARMv6 optimised decode_blockcode() This is a hand-tuned version of the code with impossible parts of the FASTDIV function ommitted. 2-5% faster overall on Cortex-A8. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-11-25 13:19:53 +00:00
Mans Rullgard	6308729e68	ARM: check for inline asm 'y' operand modifier support The inline asm added in `bf5d46d` uses the 'y' modifier which is only supported from gcc 4.5. This check allows building with older compilers. Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-10-03 08:56:24 +01:00
Mans Rullgard	bf5d46d8e6	dca: NEON optimised high freq VQ decoding Signed-off-by: Mans Rullgard <mans@mansr.com>	2011-09-30 19:01:23 +01:00

10 Commits