FFmpeg/libavcodec/x86
Ronald S. Bultje 1d16a1cf99
Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from
h264dsp_mmx.c to h264_idct.asm (as yasm code). Because the loops are now
coded in asm instead of C, this is (depending on the function) up to 50%
faster for cases where gcc didn't do a great job at looping.

Since h264_idct_add8() is now faster than the manual loop setup in h264.c,
in-asm idct calling can now be enabled for chroma as well (see r16207). For
MMX, this is 5% faster. For SSE2 (which isn't done for chroma if h264.c does
the looping), this makes it up to 50% faster. Speed gain overall is ~0.5-1.0%.

Originally committed as revision 25119 to svn://svn.ffmpeg.org/ffmpeg/trunk
2010-09-14 13:36:26 +00:00
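
For readers unfamiliar with what "the loops" refers to in the commit message above, the following is a minimal C sketch of a per-block IDCT dispatch loop of the kind the commit describes moving into asm. It is not FFmpeg's code: the names idct4x4_add, idct4x4_dc_add, idct_add_blocks and clip_u8, the row-major coefficient layout, and the parameter list are all assumptions made for illustration. The transform itself is the textbook H.264 4x4 inverse transform with a final >> 6.

```c
#include <assert.h>
#include <stdint.h>

/* Clamp an intermediate value to the 8-bit pixel range. */
static uint8_t clip_u8(int v)
{
    return v < 0 ? 0 : v > 255 ? 255 : (uint8_t)v;
}

/* DC-only shortcut: every pixel receives the same rounded offset. */
static void idct4x4_dc_add(uint8_t *dst, const int16_t *b, int stride)
{
    int dc = (b[0] + 32) >> 6;
    for (int y = 0; y < 4; y++)
        for (int x = 0; x < 4; x++)
            dst[y * stride + x] = clip_u8(dst[y * stride + x] + dc);
}

/* Full 4x4 inverse transform, added to dst.
 * Assumes row-major coefficients; modifies b in place. */
static void idct4x4_add(uint8_t *dst, int16_t *b, int stride)
{
    b[0] += 32;                          /* rounding term for the final >> 6 */
    for (int i = 0; i < 4; i++) {        /* vertical pass over column i */
        int z0 = b[i] + b[i + 8];
        int z1 = b[i] - b[i + 8];
        int z2 = (b[i + 4] >> 1) - b[i + 12];
        int z3 = b[i + 4] + (b[i + 12] >> 1);
        b[i]      = (int16_t)(z0 + z3);
        b[i + 4]  = (int16_t)(z1 + z2);
        b[i + 8]  = (int16_t)(z1 - z2);
        b[i + 12] = (int16_t)(z0 - z3);
    }
    for (int i = 0; i < 4; i++) {        /* horizontal pass over row i */
        const int16_t *r = b + 4 * i;
        uint8_t *p = dst + i * stride;
        int z0 = r[0] + r[2];
        int z1 = r[0] - r[2];
        int z2 = (r[1] >> 1) - r[3];
        int z3 = r[1] + (r[3] >> 1);
        p[0] = clip_u8(p[0] + ((z0 + z3) >> 6));
        p[1] = clip_u8(p[1] + ((z1 + z2) >> 6));
        p[2] = clip_u8(p[2] + ((z1 - z2) >> 6));
        p[3] = clip_u8(p[3] + ((z0 - z3) >> 6));
    }
}

/* Hypothetical driver: walk n 4x4 blocks and pick a path per block.
 * nnz[i] != 0 means block i has AC coefficients; otherwise only a
 * nonzero DC term triggers any work. */
static void idct_add_blocks(uint8_t *dst, const int *block_offset,
                            int16_t *coeffs, int stride,
                            const uint8_t *nnz, int n)
{
    assert(stride >= 4);
    for (int i = 0; i < n; i++) {
        int16_t *blk = coeffs + i * 16;
        if (nnz[i])
            idct4x4_add(dst + block_offset[i], blk, stride);
        else if (blk[0])
            idct4x4_dc_add(dst + block_offset[i], blk, stride);
    }
}
```

The loop body is tiny, so call overhead and the branch per block make up a large share of the cost. That is consistent with the commit's observation that hand-written looping can beat what the compiler generated, though the figures quoted above come from the commit message, not from this sketch.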
cavsdsp_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
dct32_sse.c Move SSE optimized 32-point DCT to its own file. Should fix breakage with YASM 2010-07-06 17:48:23 +00:00
deinterlace.asm Convert deinterlacing MMX code to YASM 2010-07-31 14:50:51 +00:00
dnxhd_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
dsputil_mmx_avg_template.c Add bitexact versions of put_no_rnd_pixels8 _x2 and _y2 for vp3/theora 2010-06-04 04:46:26 +00:00
dsputil_mmx_qns_template.c
dsputil_mmx_rnd_template.c Replace several #ifdef PIC with the more obvious and correct 2009-09-30 09:49:12 +00:00
dsputil_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
dsputil_mmx.h Port latest x264 deblock asm (before they moved to using NV12 as internal 2010-09-03 16:52:46 +00:00
dsputil_yasm.asm Update x264asm header files to latest versions. 2010-06-23 19:20:46 +00:00
dsputilenc_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
fdct_mmx.c Replace remaining uses of ATTR_ALIGNED with DECLARE_ALIGNED 2010-03-18 15:00:17 +00:00
fft_3dn2.c imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". 2010-08-23 15:51:09 +00:00
fft_3dn.c
fft_mmx.asm cosmetics in imdct_sse 2010-08-28 21:03:13 +00:00
fft_sse.c imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". 2010-08-23 15:51:09 +00:00
fft.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
fft.h SSE optimized 32-point DCT 2010-07-06 16:58:54 +00:00
h264_chromamc.asm Move H264 chroma MC from inline asm to yasm. This fixes VP3/5/6 and VC-1 2010-08-30 16:31:04 +00:00
h264_deblock.asm Port latest x264 deblock asm (before they moved to using NV12 as internal 2010-09-03 16:52:46 +00:00
h264_i386.h Remove explicit filename from Doxygen @file commands. 2010-04-20 14:45:34 +00:00
h264_idct.asm Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from 2010-09-14 13:36:26 +00:00
h264_intrapred_init.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
h264_intrapred.asm Fix h264/vp8 intra pred on Athlon XP 2010-07-01 10:29:47 +00:00
h264_qpel_mmx.c Split h264dsp_mmx.c (which was #included in dsputil_mmx.c) in h264_qpel_mmx.c, 2010-09-01 20:48:59 +00:00
h264_weight.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
h264dsp_mmx.c Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from 2010-09-14 13:36:26 +00:00
idct_mmx_xvid.c Add some missing #includes 2010-03-06 22:36:36 +00:00
idct_mmx.c Fix compilation in x86_64. I broke it with r24580. 2010-07-29 22:45:21 +00:00
idct_sse2_xvid.c Put ff_ prefix on non-static {put_signed,put,add}_pixels_clamped_mmx() 2010-08-30 16:22:27 +00:00
idct_xvid.h Remove explicit filename from Doxygen @file commands. 2010-04-20 14:45:34 +00:00
lpc_mmx.c Reduce number of ASM constraints for ff_lpc_compute_autocorr_sse2 since it 2010-01-02 17:48:08 +00:00
Makefile Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from 2010-09-14 13:36:26 +00:00
mathops.h Adding missing () to mathops.h. 2010-05-11 00:22:50 +00:00
mlpdsp.c Use fewer macros in x86-optimized mlpdsp. 2009-06-03 23:48:28 +00:00
motion_est_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
mpegaudiodec_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
mpegvideo_mmx_template.c Remove DECLARE_ALIGNED_{8,16} macros 2010-03-06 14:24:59 +00:00
mpegvideo_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
simple_idct_mmx.c Put ff_ prefix on non-static {put_signed,put,add}_pixels_clamped_mmx() 2010-08-30 16:22:27 +00:00
snowdsp_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vc1dsp_mmx.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vc1dsp_yasm.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
vp3dsp.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
vp8dsp-init.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vp8dsp.asm Use "d" suffix for general-purpose registers used with movd. 2010-09-05 10:10:16 +00:00
vp56_arith.h VP5/6/8: ~7% faster arithmetic decoding 2010-08-12 01:11:32 +00:00
vp56dsp_init.c Move mm_support() from libavcodec to libavutil, make it a public 2010-09-08 15:07:14 +00:00
vp56dsp.asm Fix typos when converting inline asm to yasm, fixes MMX-only fate-ea-vp61. 2010-08-26 14:33:39 +00:00
x86inc.asm sync yasm macros from x264 2010-07-21 22:45:16 +00:00
x86util.asm MMX/SSE VC1 loop filter 2010-07-11 22:53:01 +00:00