[opus] [RFC PATCH v1 0/5] aarch64: celt_pitch_xcorr: Fixed point series

Viswanath Puttagunta viswanath.puttagunta at linaro.org
Tue Mar 31 20:35:32 PDT 2015


Hi Timothy,

FYI, I just submitted pull request for my Ne10 patch that enables
builds for Aarch64 at
https://github.com/projectNe10/Ne10/pull/108

Phil at ARM said he will do more testing on it and will merge it soon.

As I mentioned in previous email, in mean time, for convinience, I
provided pre-build NE10 library binaries at
http://people.linaro.org/~viswanath.puttagunta/opus/NE10_root/

Regards,
Vish


On 31 March 2015 at 17:57, Viswanath Puttagunta
<viswanath.puttagunta at linaro.org> wrote:
> Hi Timothy,
>
> As I mentioned earlier [1], I now fixed compile issues
> with fixed point and resubmitting the patch.
>
> I also have new patch that does intrinsics optimizations
> for celt_pitch_xcorr targetting aarch64.
>
> You can find my latest work-in-progress branch at [2]
>
> For reference, you can use the Ne10 pre-built libraries
> at [3]
>
> Note that I am working with Phil at ARM to get my patch at [4]
> upstreamed to Ne10.
>
> [1]: http://lists.xiph.org/pipermail/opus/2015-March/002941.html
> [2]: https://git.linaro.org/people/viswanath.puttagunta/opus.git
>      Branch: rfcv1_final_xcorr_fixed_armv8
> [3]: http://people.linaro.org/~viswanath.puttagunta/opus/NE10_root/
> [4]: git://git.linaro.org/people/viswanath.puttagunta/Ne10.git
>      Branch: rfcv1_rc1_armv8
>
> Jonathan Lennox (1):
>   Intrinsics/RTCD related fixes. Mostly x86
>
> Viswanath Puttagunta (4):
>   armv7(float): Optimize encode usecase using NE10 library
>   armv7(float): Optimize decode usecase using NE10 library
>   aarch64: Enable intrinsics for aarch64
>   aarch64: celt_pitch_xcorr: Fixed point intrinsics
>
>  Makefile.am                              |  72 ++++--
>  celt/arm/arm_celt_map.c                  |  71 +++++-
>  celt/arm/armcpu.c                        |   6 +-
>  celt/arm/celt_ne10_fft.c                 | 148 +++++++++++
>  celt/arm/celt_ne10_mdct.c                | 263 ++++++++++++++++++++
>  celt/arm/celt_neon_intr.c                | 275 +++++++++++++++++++++
>  celt/arm/fft_arm.h                       |  74 ++++++
>  celt/arm/mdct_arm.h                      |  60 +++++
>  celt/arm/pitch_arm.h                     |  14 +-
>  celt/bands.c                             |   6 +-
>  celt/celt.c                              |  16 +-
>  celt/celt.h                              |  12 +-
>  celt/celt_decoder.c                      |  24 +-
>  celt/celt_encoder.c                      |  20 +-
>  celt/celt_lpc.h                          |   2 +-
>  celt/cpu_support.h                       |  15 +-
>  celt/dump_modes/Makefile                 |  23 +-
>  celt/dump_modes/dump_modes.c             |  21 ++
>  celt/dump_modes/dump_modes_arch.h        |  41 ++++
>  celt/dump_modes/dump_modes_arm_ne10.c    | 125 ++++++++++
>  celt/kiss_fft.c                          |  31 ++-
>  celt/kiss_fft.h                          |  69 +++++-
>  celt/mdct.c                              |  20 +-
>  celt/mdct.h                              |  61 ++++-
>  celt/mips/celt_mipsr1.h                  |   2 +-
>  celt/modes.c                             |   8 +-
>  celt/pitch.c                             |   4 +-
>  celt/pitch.h                             |  22 +-
>  celt/static_modes_float.h                |  25 ++
>  celt/static_modes_float_arm_ne10.h       | 404 +++++++++++++++++++++++++++++++
>  celt/tests/test_unit_dft.c               |  56 +++--
>  celt/tests/test_unit_mathops.c           |  22 +-
>  celt/tests/test_unit_mdct.c              |  88 ++++---
>  celt/tests/test_unit_rotation.c          |  22 +-
>  celt/x86/celt_lpc_sse.c                  |   4 +
>  celt/x86/celt_lpc_sse.h                  |  12 +-
>  celt/x86/pitch_sse.c                     | 334 ++++++++++---------------
>  celt/x86/pitch_sse.h                     | 256 ++++++++------------
>  celt/x86/pitch_sse2.c                    |  95 ++++++++
>  celt/x86/pitch_sse4_1.c                  | 195 +++++++++++++++
>  celt/x86/x86_celt_map.c                  |  76 +++++-
>  celt/x86/x86cpu.c                        |  47 +++-
>  celt/x86/x86cpu.h                        |  26 +-
>  celt_headers.mk                          |   3 +
>  celt_sources.mk                          |   9 +-
>  configure.ac                             | 391 +++++++++++++++++++++---------
>  m4/opus-intrinsics.m4                    |  29 +++
>  silk/x86/SigProc_FIX_sse.h               |  17 ++
>  silk/x86/main_sse.h                      |  48 ++++
>  silk/x86/x86_silk_map.c                  |  25 +-
>  src/analysis.c                           |   8 +-
>  src/analysis.h                           |   2 +-
>  src/opus_encoder.c                       |   2 +-
>  src/opus_multistream_encoder.c           |   9 +-
>  win32/VS2010/celt.vcxproj                |  17 +-
>  win32/VS2010/celt.vcxproj.filters        |  27 +++
>  win32/VS2010/silk_common.vcxproj         |  17 +-
>  win32/VS2010/silk_common.vcxproj.filters |  23 +-
>  win32/VS2010/silk_fixed.vcxproj          |  13 +-
>  win32/VS2010/silk_fixed.vcxproj.filters  |  17 +-
>  win32/config.h                           |  25 +-
>  61 files changed, 3150 insertions(+), 699 deletions(-)
>  create mode 100644 celt/arm/celt_ne10_fft.c
>  create mode 100644 celt/arm/celt_ne10_mdct.c
>  create mode 100644 celt/arm/fft_arm.h
>  create mode 100644 celt/arm/mdct_arm.h
>  create mode 100644 celt/dump_modes/dump_modes_arch.h
>  create mode 100644 celt/dump_modes/dump_modes_arm_ne10.c
>  create mode 100644 celt/static_modes_float_arm_ne10.h
>  create mode 100644 celt/x86/pitch_sse2.c
>  create mode 100644 celt/x86/pitch_sse4_1.c
>  create mode 100644 m4/opus-intrinsics.m4
>
> --
> 1.9.1
>


More information about the opus mailing list