[opus] [RFC PATCH v1 2/2] armv7(float): Optimize encode usecase using NE10 library
Viswanath Puttagunta
viswanath.puttagunta at linaro.org
Fri Jan 30 15:41:22 PST 2015
On 30 January 2015 at 02:38, Timothy B. Terriberry <tterribe at xiph.org> wrote:
>
> Viswanath Puttagunta wrote:
>>
>> Is the peak stack usage a complete blocker in current form?
>
>
> Since this only affects people who enable NE10, I don't think this is a blocker.
I discussed this, and the comment about _t, with Phil at NE10_ARM. He
committed to two changes:
1. Change the API so that the following NE10 types no longer use the
_t suffix:
ne10_int32_t             -> ne10_int32
ne10_fft_cpx_float32_t   -> ne10_fft_cpx_float32
ne10_fft_state_float32_t -> ne10_fft_state_float32
ne10_fft_cfg_float32_t   -> ne10_fft_cfg_float32
2. Provide scaling as an argument to ne10_fft_c2c_1d_float32_neon(),
which removes the need for one buffer on the stack (ALLOC(tempin,
st->nfft, ne10_fft_cpx_float32_t);). We will still need cfg->buffer,
so peak stack usage goes up by ~3.8K instead of ~7.5K.
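
As a rough check on those numbers, here is a minimal sketch of the
arithmetic. It assumes that both tempin and cfg->buffer hold nfft complex
float32 samples and that the largest FFT size is 480; neither the value
480 nor the helper names below come from the patch, and
cpx_float32_sketch is only a stand-in for ne10_fft_cpx_float32_t.

```c
#include <stddef.h>

/* Stand-in for NE10's complex float32 sample: two 32-bit floats.
 * Hypothetical name, used here only to get the element size. */
typedef struct {
    float r;
    float i;
} cpx_float32_sketch;

/* Bytes needed for one buffer of nfft complex samples. */
static size_t fft_buffer_bytes(size_t nfft)
{
    return nfft * sizeof(cpx_float32_sketch);
}

/* Extra peak stack when num_buffers such buffers are live at once. */
static size_t peak_extra_stack_bytes(size_t nfft, size_t num_buffers)
{
    return num_buffers * fft_buffer_bytes(nfft);
}
```

Under those assumptions, peak_extra_stack_bytes(480, 2) is 7680 bytes
(7.5 KiB) for tempin plus cfg->buffer, and peak_extra_stack_bytes(480, 1)
is 3840 bytes (about 3.8K) once tempin is gone.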
I hope to send RFC v2 by the end of next week.