- 13 Aug, 2011 1 commit
-
-
Committed by Ronald S. Bultje
This allows using it in libswscale/ also.
-
- 30 Jul, 2011 1 commit
-
-
Committed by Jason Garrett-Glaser
-
- 14 Jun, 2011 3 commits
-
-
Committed by Jason Garrett-Glaser
Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
-
Committed by Jason Garrett-Glaser
Needs some ARM/PPC asm modifications.
-
Committed by Jason Garrett-Glaser
Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
-
- 01 Jun, 2011 1 commit
-
-
Committed by Daniel Kang
Signed-off-by: Ronald S. Bultje <rbultje@google.com>
-
- 18 May, 2011 1 commit
-
-
Committed by Daniel Kang
Arguments for variable size instructions are added to many macros, along with other various changes. The x86util.asm code was ported from x264. Signed-off-by: Diego Biurrun <diego@biurrun.de>
-
- 15 May, 2011 1 commit
-
-
Committed by Diego Biurrun
-
- 19 Mar, 2011 1 commit
-
-
Committed by Mans Rullgard
Signed-off-by: Mans Rullgard <mans@mansr.com>
-
- 15 Jan, 2011 1 commit
-
-
Committed by Jason Garrett-Glaser
About 2.5x the speed. NOTE: the way that the asm code handles large qmuls is a bit suboptimal. If x264-style dequant was used (separate shift and qmul values), it might be possible to get some extra speed. Originally committed as revision 26336 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 26 Sep, 2010 1 commit
-
-
Committed by Reimar Döffinger
Originally committed as revision 25206 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 24 Sep, 2010 2 commits
-
-
Committed by Ronald S. Bultje
inlines scan8[] and removes loop setup. 15% faster, 0.4% overall. See "[PATCH] unroll loop in h264_idct_add8_sse2()" thread on ML. Originally committed as revision 25172 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Committed by Ronald S. Bultje
code directly also and remove loop setup. 20% faster in function, 0.8% overall. See "[PATCH] unroll loop in h264_idct_add8_sse2()" thread on ML. Originally committed as revision 25171 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 14 Sep, 2010 1 commit
-
-
Committed by Ronald S. Bultje
h264dsp_mmx.c to h264_idct.asm (as yasm code). Because the loops are now coded in asm instead of C, this is (depending on the function) up to 50% faster for cases where gcc didn't do a great job at looping. Since h264_idct_add8() is now faster than the manual loop setup in h264.c, in-asm idct calling can now be enabled for chroma as well (see r16207). For MMX, this is 5% faster. For SSE2 (which isn't done for chroma if h264.c does the looping), this makes it up to 50% faster. Speed gain overall is ~0.5-1.0%. Originally committed as revision 25119 to svn://svn.ffmpeg.org/ffmpeg/trunk
-