[opus] 2 patches related to silk_biquad_alt() optimization
linfengz at google.com
Tue Apr 25 00:03:17 UTC 2017
Tested on my chromebook, when stride (channel) == 1, the optimization has
no gain compared with C function.
When stride (channel) == 2, the optimization is 1.2%-1.8% faster (1.6% at
Complexity 8) compared with C function.
Please let me know and I can remove the optimization of stride 1 case.
If it's allowed to skip the split of A_Q28 and replace by 32-bit
multiplication (result is 64-bit), probably it could be faster on NEON.
This may change the encoder results because of different order of adding,
shifting and rounding.
On Wed, Apr 19, 2017 at 10:23 PM, Jean-Marc Valin <jmvalin at jmvalin.ca>
> Hi Linfeng,
> Thanks for the patches. I'll have a look and get back to you. What kind
> of speedup are you getting for these functions? On what command line?
> On 19/04/17 12:29 PM, Linfeng Zhang wrote:
> > Hi,
> > Attached are 2 patches related to silk_biquad_alt() optimization. Please
> > review.
> > Thanks,
> > Linfeng Zhang
> > _______________________________________________
> > opus mailing list
> > opus at xiph.org
> > http://lists.xiph.org/mailman/listinfo/opus
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the opus