[vorbis-dev] vorbis SSE diff
Felix von Leitner
felix-vorbis at fefe.de
Sat Apr 19 18:25:59 PDT 2003
I just uploaded a diff I made for libvorbis to
http://www.fefe.de/diffs/libvorbis-1.0-simd.diff.bz2
It adds a 3dnow! decoding speedup (about 10%), SSE decoding speedup
(about 20%) and SSE encoding speedup (about 25%).
The code selection happens purely at compile time, no CPU flag detection
or so. I figure someone else will have to do that, I'm short on time.
To activate 3dnow!, add -Dsimd_3dn to CFLAGS. To activate SSE, compile
with gcc 3.2.2 (others have had alignment problems), compile with
optimizer enabled, and add something like -march=pentium3 -msse to
CFLAGS.
I probably haven't even remotely touched all the parts that would be
beneficial to speed up, but I don't even know how vorbis works, I'm just
some comp.arch guy. ;) However, I use vorbis to encode the sound track
to recorded TV episodes and encoding time for a 45 min episode is now
down to 5 minutes on my Athlon XP 2000+. Vorbis already was
impressively fast; now it's on par with lame using --vbr-new.
Hope this helps,
Felix
PS: I'm not on the list, please Cc: me and send questions directly to
me.
--- >8 ----
List archives: http://www.xiph.org/archives/
Ogg project homepage: http://www.xiph.org/ogg/
To unsubscribe from this list, send a message to 'vorbis-dev-request at xiph.org'
containing only the word 'unsubscribe' in the body. No subject is needed.
Unsubscribe messages sent to the list will be ignored/filtered.
More information about the Vorbis-dev
mailing list