FFmpeg/libavcodec/x86
Ronald S. Bultje ae11291865 Unroll loop in h264_idct_add16intra_sse2(). Basically identical to r25171, this
inlines scan8[] and removes loop setup. 15% faster, 0.4% overall.

See "[PATCH] unroll loop in h264_idct_add8_sse2()" thread on ML.

Originally committed as revision 25172 to svn://svn.ffmpeg.org/ffmpeg/trunk
2010-09-24 14:07:23 +00:00
..
cavsdsp_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
dct32_sse.c Move SSE optimized 32-point DCT to its own file. Should fix breakage with YASM 2010-07-06 17:48:23 +00:00
deinterlace.asm Convert deinterlacing MMX code to YASM 2010-07-31 14:50:51 +00:00
dnxhd_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
dsputil_mmx_avg_template.c Add bitexact versions of put_no_rnd_pixels8 _x2 and _y2 for vp3/theora 2010-06-04 04:46:26 +00:00
dsputil_mmx_qns_template.c
dsputil_mmx_rnd_template.c Replace several #ifdef PIC with the more obvious and correct 2009-09-30 09:49:12 +00:00
dsputil_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
dsputil_mmx.h Port latest x264 deblock asm (before they moved to using NV12 as internal 2010-09-03 16:52:46 +00:00
dsputil_yasm.asm Update x264asm header files to latest versions. 2010-06-23 19:20:46 +00:00
dsputilenc_mmx.c x86: disable SSE functions using stack when stack is not aligned 2010-09-21 17:57:21 +00:00
dsputilenc_yasm.asm Don't access upper 32 bits of a 32-bit int on 64-bit systems. 2010-09-17 12:24:22 +00:00
fdct_mmx.c Replace remaining uses of ATTR_ALIGNED with DECLARE_ALIGNED 2010-03-18 15:00:17 +00:00
fft_3dn2.c imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". 2010-08-23 15:51:09 +00:00
fft_3dn.c
fft_mmx.asm cosmetics in imdct_sse 2010-08-28 21:03:13 +00:00
fft_sse.c imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". 2010-08-23 15:51:09 +00:00
fft.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
fft.h SSE optimized 32-point DCT 2010-07-06 16:58:54 +00:00
h264_chromamc.asm Move H264 chroma MC from inline asm to yasm. This fixes VP3/5/6 and VC-1 2010-08-30 16:31:04 +00:00
h264_deblock.asm Port latest x264 deblock asm (before they moved to using NV12 as internal 2010-09-03 16:52:46 +00:00
h264_i386.h Remove explicit filename from Doxygen @file commands. 2010-04-20 14:45:34 +00:00
h264_idct.asm Unroll loop in h264_idct_add16intra_sse2(). Basically identical to r25171, this 2010-09-24 14:07:23 +00:00
h264_intrapred_init.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
h264_intrapred.asm Fix h264/vp8 intra pred on Athlon XP 2010-07-01 10:29:47 +00:00
h264_qpel_mmx.c Split h264dsp_mmx.c (which was #included in dsputil_mmx.c) in h264_qpel_mmx.c, 2010-09-01 20:48:59 +00:00
h264_weight.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
h264dsp_mmx.c x86: disable SSE functions using stack when stack is not aligned 2010-09-21 17:57:21 +00:00
idct_mmx_xvid.c Add some missing #includes 2010-03-06 22:36:36 +00:00
idct_mmx.c Fix compilation in x86_64. I broke it with r24580. 2010-07-29 22:45:21 +00:00
idct_sse2_xvid.c Put ff_ prefix on non-static {put_signed,put,add}_pixels_clamped_mmx() 2010-08-30 16:22:27 +00:00
idct_xvid.h Remove explicit filename from Doxygen @file commands. 2010-04-20 14:45:34 +00:00
lpc_mmx.c Reduce number of ASM constraints for ff_lpc_compute_autocorr_sse2 since it 2010-01-02 17:48:08 +00:00
Makefile Move sse16_sse2() from inline asm to yasm. It is one of the functions causing 2010-09-17 01:44:17 +00:00
mathops.h Adding missing () to mathops.h. 2010-05-11 00:22:50 +00:00
mlpdsp.c Use fewer macros in x86-optimized mlpdsp. 2009-06-03 23:48:28 +00:00
motion_est_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
mpegaudiodec_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
mpegvideo_mmx_template.c Remove DECLARE_ALIGNED_{8,16} macros 2010-03-06 14:24:59 +00:00
mpegvideo_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
simple_idct_mmx.c Put ff_ prefix on non-static {put_signed,put,add}_pixels_clamped_mmx() 2010-08-30 16:22:27 +00:00
snowdsp_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vc1dsp_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vc1dsp_yasm.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
vp3dsp.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
vp8dsp-init.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vp8dsp.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
vp56_arith.h VP5/6/8: ~7% faster arithmetic decoding 2010-08-12 01:11:32 +00:00
vp56dsp_init.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vp56dsp.asm Fix typos when converting inline asm to yasm, fixes MMX-only fate-ea-vp61. 2010-08-26 14:33:39 +00:00
x86inc.asm sync yasm macros from x264 2010-07-21 22:45:16 +00:00
x86util.asm Move hadamard_diff{,16}_{mmx,mmx2,sse2,ssse3}() from inline asm to yasm, 2010-09-17 01:56:06 +00:00