[flac-dev] [PATCH] new SSE code to calculate autocorrelation
lvqcl
lvqcl.mail at gmail.com
Sat Aug 2 02:11:05 PDT 2014
This patch speeds up the FLAC__lpc_compute_autocorrelation_intrin_sse_lag_NN
routines on AMD CPUs and on newer Intel CPUs, but makes them slower on older
Intel CPUs. ('Newer Intel CPUs' means Core i, aka Nehalem, and newer.)
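
For readers unfamiliar with what these routines compute, here is a minimal scalar sketch of autocorrelation over lags 0..lag-1. It is not the code from the patch: the name autocorr_scalar and its signature are made up for illustration, and the SSE routines compute the same sums several lags at a time.

```c
#include <stddef.h>

/*
 * Hypothetical reference helper (not taken from the patch).
 * For each k in [0, lag):  autoc[k] = sum over i of data[i] * data[i + k],
 * where only terms with i + k < n are included.
 */
void autocorr_scalar(const float *data, size_t n, size_t lag, float *autoc)
{
    /* Start every accumulator at zero. */
    for (size_t k = 0; k < lag; k++)
        autoc[k] = 0.0f;

    /* For each sample, add its product with the next lag-1 samples. */
    for (size_t i = 0; i < n; i++) {
        const float d = data[i];
        for (size_t k = 0; k < lag && i + k < n; k++)
            autoc[k] += d * data[i + k];
    }
}
```

With data = {1, 2, 3, 4, 5} and lag = 4 this fills autoc with {55, 40, 26, 14}. A fixed lag (the NN in the routine names) makes the inner loop a natural fit for SIMD, since one sample can be multiplied against a whole vector of neighbours at once.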
According to tests at HA: <http://www.hydrogenaud.io/forums/index.php?s=&showtopic=101082&view=findpost&p=870753>
CPU            flac -5    flac -8
Athlon XP      +5 %       +2.4 %
Athlon 64 X2   +9 %       +4 %
Core i         +7 %       +1 % ... +2.7 %
Core 2         ?          -3.5 %
According to the Steam HW survey <http://store.steampowered.com/hwsurvey/>,
69% of Steam users have SSE4.2 (first available on Nehalem), which means that
the new code is faster for them. There are also AMD users whose CPUs don't
have SSE4.2, so I think that about 75% of Steam users will benefit from this
patch.
(If anybody knows a better source of statistics, please tell me.)
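
As an illustration of using SSE4.2 as a stand-in for "Nehalem or newer", here is a small sketch of a runtime check. It assumes GCC or Clang on x86, where __builtin_cpu_init and __builtin_cpu_supports are available; the function name cpu_has_sse42 is made up, and this is not how the patch or libFLAC itself detects CPU features.

```c
/*
 * Hypothetical helper: returns 1 if the running CPU reports SSE4.2,
 * and 0 if it does not or if the check is unavailable on this compiler/target.
 */
int cpu_has_sse42(void)
{
#if (defined(__GNUC__) || defined(__clang__)) && \
    (defined(__x86_64__) || defined(__i386__))
    __builtin_cpu_init();  /* initialise the compiler's CPU feature data */
    return __builtin_cpu_supports("sse4.2") ? 1 : 0;
#else
    return 0;              /* assumption: treat unknown targets as "no SSE4.2" */
#endif
}
```

Note that this flag alone would miss the AMD CPUs without SSE4.2 that also gain from the new code according to the table above, so a real selection would need a vendor check as well.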
-------------- next part --------------
A non-text attachment was scrubbed...
Name: autocorr_sse.zip
Type: application/zip
Size: 1121 bytes
Desc: not available
Url : http://lists.xiph.org/pipermail/flac-dev/attachments/20140802/b0cc1b2d/attachment.zip