[theora-dev] SSE2 assembly support
Timothy B. Terriberry
tterribe at email.unc.edu
Thu Feb 11 01:10:46 PST 2010
Kay Tiong Khoo wrote:
> The profile concurs with Timothy's assessment. The optimized MMX functions account for ~30% of the samples, so the room for improvement by conversion to SSE2 is limited. I will try some opportunistic optimizations before starting on the conversion work.
Make sure you are working from the current 1.2 development branch:
On x86-64, this should be using SSE2 already for SATD (your profile
shows the MMXEXT versions). It still uses MMXEXT SATD on x86-32 because
the SSE2 versions profiled as slower on an actual 32-bit processor
(where each instruction often requires multiple clocks).
More information about the theora-dev