[opus] [PATCH] Refactor silk_LPC_analysis_filter() & Optimize celt_fir_permit_overflow() for ARM NEON
Linfeng Zhang
linfengz at google.com
Wed Mar 1 19:30:34 UTC 2017
Hi Timothy,
> Do you think it would be possible to improve the API of xcorr_kernel() so
> that calling it in a loop is more efficient?
>
If it could be inlined, it would be more efficient. Besides the memory
bouncing, frequent function calls are expensive.
> The other advantage to wiring up xcorr_kernel() is that it applies in more
> places than your intrinsics-only celt_fir() implementation.
>
I agree.
One solution is to move the outer for(N) loop inside xcorr_kernel() so that
it returns N results instead of 4 (similar to what the celt_fir() NEON
intrinsics do). This would make it both efficient and universally applicable.
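
As a rough sketch of that idea (this is not the actual Opus code: the names
xcorr_kernel4_sketch() and xcorr_block_sketch(), the float types, and the
choice to overwrite the outputs rather than accumulate into them are all
assumptions made for illustration), the outer loop could sit next to the
4-output kernel like this:

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a 4-output kernel:
 * sum[k] += x[0]*y[k] + x[1]*y[k+1] + ... + x[len-1]*y[k+len-1], k = 0..3.
 * Reads y[0 .. len+2]. */
static inline void xcorr_kernel4_sketch(const float *x, const float *y,
                                        float sum[4], int len)
{
    int j, k;
    for (j = 0; j < len; j++) {
        for (k = 0; k < 4; k++) {
            sum[k] += x[j] * y[j + k];
        }
    }
}

/* Hypothetical block variant with the outer loop moved inside:
 * out[i] = x[0]*y[i] + ... + x[len-1]*y[i+len-1] for i = 0..n-1.
 * y must hold at least n + len - 1 samples. The caller pays for one
 * call per block instead of one call per 4 outputs. */
static void xcorr_block_sketch(const float *x, const float *y,
                               float *out, int len, int n)
{
    int i, j;
    assert(x != NULL && y != NULL && out != NULL);
    assert(len > 0 && n >= 0);

    /* Main loop: 4 outputs per iteration. */
    for (i = 0; i + 4 <= n; i += 4) {
        float acc[4] = {0.0f, 0.0f, 0.0f, 0.0f};
        xcorr_kernel4_sketch(x, y + i, acc, len);
        out[i]     = acc[0];
        out[i + 1] = acc[1];
        out[i + 2] = acc[2];
        out[i + 3] = acc[3];
    }

    /* Tail: remaining 0..3 outputs, one at a time. */
    for (; i < n; i++) {
        float acc = 0.0f;
        for (j = 0; j < len; j++) {
            acc += x[j] * y[i + j];
        }
        out[i] = acc;
    }
}
```

With the loop inside, a platform-specific version (NEON, for instance) would
only need to replace xcorr_block_sketch() as a whole, and could keep its
accumulators in registers across the 4-output steps.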
Thanks,