提交 · 3a39195b1db5620901a049cd690752b1935f5e0f · 小白菜888 / Ffmpeg

13 8月, 2011 1 次提交
- R
  Move x86inc.asm to libavutil/. · 3a39195b
  由 Ronald S. Bultje 提交于 7月 23, 2011
```
This allows using it in libswscale/ also.
```
  3a39195b
30 7月, 2011 1 次提交
- J
  
  H.264: tweak some other x86 asm for Atom · a3bf7b86
  由 Jason Garrett-Glaser 提交于 7月 27, 2011
  
  a3bf7b86
14 6月, 2011 3 次提交
- J
  4:4:4 H.264 decoding support · c90b9442
  由 Jason Garrett-Glaser 提交于 6月 03, 2011
```
Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
```
  c90b9442
- J
  Roll back 4:4:4 H.264 for now · 504811ba
  由 Jason Garrett-Glaser 提交于 6月 13, 2011
```
Needs some ARM/PPC asm modifications.
```
  504811ba
- J
  4:4:4 H.264 decoding support · c9c49387
  由 Jason Garrett-Glaser 提交于 6月 03, 2011
```
Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
```
  c9c49387
01 6月, 2011 1 次提交
- D
  Update 8-bit H.264 IDCT function names to reflect bit-depth. · 348493db
  由 Daniel Kang 提交于 5月 24, 2011
```
Signed-off-by: NRonald S. Bultje <rbultje@google.com>
```
  348493db
18 5月, 2011 1 次提交

Modify x86util.asm to ease transitioning to 10-bit H.264 assembly. · d0005d34

由 Daniel Kang 提交于 5月 16, 2011

Arguments for variable size instructions are added to many macros, along
with other various changes. The x86util.asm code was ported from x264.
Signed-off-by: NDiego Biurrun <diego@biurrun.de>

d0005d34

15 5月, 2011 1 次提交
- D
  
  Fix FSF address copy paste error in some license headers. · 888fa31e
  由 Diego Biurrun 提交于 5月 14, 2011
  
  888fa31e
19 3月, 2011 1 次提交
- M
  Replace FFmpeg with Libav in licence headers · 2912e87a
  由 Mans Rullgard 提交于 3月 18, 2011
```
Signed-off-by: NMans Rullgard <mans@mansr.com>
```
  2912e87a
15 1月, 2011 1 次提交

H.264: split luma dc idct out and implement MMX/SSE2 versions · 19fb234e

由 Jason Garrett-Glaser 提交于 1月 14, 2011

About 2.5x the speed.

NOTE: the way that the asm code handles large qmuls is a bit suboptimal.
If x264-style dequant was used (separate shift and qmul values), it might
be possible to get some extra speed.

Originally committed as revision 26336 to svn://svn.ffmpeg.org/ffmpeg/trunk

19fb234e

26 9月, 2010 1 次提交
- R
  Add d suffix to movd target register to make it work with nasm. · 02b424d9
  由 Reimar Döffinger 提交于 9月 26, 2010
```
Originally committed as revision 25206 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  02b424d9
24 9月, 2010 2 次提交

Unroll loop in h264_idct_add16intra_sse2(). Basically identical to r25171, this · ae112918

由 Ronald S. Bultje 提交于 9月 24, 2010

inlines scan8[] and removes loop setup. 15% faster, 0.4% overall.

See "[PATCH] unroll loop in h264_idct_add8_sse2()" thread on ML.

Originally committed as revision 25172 to svn://svn.ffmpeg.org/ffmpeg/trunk

ae112918

Unroll loop in h264_idct_add8_sse2(). This means we can inline scan8[] in the · 4bca6774

由 Ronald S. Bultje 提交于 9月 24, 2010

code directly also and remove loop setup. 20% faster in function, 0.8% overall.

See "[PATCH] unroll loop in h264_idct_add8_sse2()" thread on ML.

Originally committed as revision 25171 to svn://svn.ffmpeg.org/ffmpeg/trunk

4bca6774

14 9月, 2010 1 次提交

Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from · 1d16a1cf

由 Ronald S. Bultje 提交于 9月 14, 2010

h264dsp_mmx.c to h264_idct.asm (as yasm code). Because the loops are now
coded in asm instead of C, this is (depending on the function) up to 50%
faster for cases where gcc didn't do a great job at looping.

Since h264_idct_add8() is now faster than the manual loop setup in h264.c,
in-asm idct calling can now be enabled for chroma as well (see r16207). For
MMX, this is 5% faster. For SSE2 (which isn't done for chroma if h264.c does
the looping), this makes it up to 50% faster. Speed gain overall is ~0.5-1.0%.

Originally committed as revision 25119 to svn://svn.ffmpeg.org/ffmpeg/trunk

1d16a1cf