[flac-dev] About SSE intrinsincs in decoder

Martijn van Beurden mvanb1 at gmail.com
Tue Jul 5 08:26:33 UTC 2022


Op di 5 jul. 2022 om 09:41 schreef olivier tristan <o.tristan at uvi.net>:

> You do not talk about the SSE 4.1 version in your bench.
>
> Have you tried this use case ?
>

I compared 4 compiles: one without any changes (so with all variants of the
lpc functions, including the SSE4.1 ones) and three with variants of plain
C code. As both CPUs that were tested had SSE4.1 capability, these
functions were compared with. So yes, current GCC outperforms those SSE4.1
intrinsics functions on 16-bit inputs and comes close on 24-bit inputs.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xiph.org/pipermail/flac-dev/attachments/20220705/30bea852/attachment.htm>


More information about the flac-dev mailing list