[opus] Ambisonics with Head Locked Stereo to Opus Channel Mapping Family 2 for WebVR Chrome App and YouTube

Martin Backhaus contact at martinbackhaus.com
Fri Aug 7 02:10:03 UTC 2020


Hello,

I am trying to encode an Opus file with Ambisonics including Head-Locked (non-diegetic) Stereo sound for a Virtual Reality 360° video.

YouTube describes the spatial audio requirements here:
https://support.google.com/youtube/answer/6395969

It's the last list item 5.

> 5. Supported First Order Ambisonics (FOA) with Head-Locked Stereo format:
> W, Y, Z, X, L, R as a 6-channel audio track in your uploaded file, sample rate: 48 kHz
> PCM encoded audio in a MOV container:
> Sample rate: 48 kHz
> OPUS encoded audio in an MP4 container:
> Min. bitrate 768 kbps
> Channel mapping family: 2

I am trying to encode 6 Channels as Opus to an .opus and .webm file.

Until now, YouTube and Chrome Browser cannot playback the results correctly. VLC is not playing anything.

Using this MOV container and PCM works.

I am using the following ffmpeg command:

ffmpeg -i ambix_6ch_withHLS.wav -c:a libopus -b:a 768k -mapping_family 2 -y ambix.opus

The opusenc.exe binary does not mention a command line argument for channel mapping.

In this mailing list archive, Emily Bowman mentioned 19 December 2019, that Ambisonics is an API-only feature. Is this still correct?


Here's the specification:
https://tools.ietf.org/html/rfc8486

Chapter 3.1 and 3.3 are important, also concerning Head-locked stereo.

Here's the output of opusinfo (converted from WAV using ffmpeg):

Processing file "ambix_6ch.opus"...

New logical stream (#1, serial: 20487e76): type opus
Encoded with Lavf58.45.100
User comments section follows...
        encoder=Lavc58.91.100 libopus
        encoded_by=Adobe Premiere Pro 2020.0 (Windo
        time_reference=11163152
        date=2020-08-07
Opus stream 1:
        Pre-skip: 312
        Playback gain: 0 dB
        Channels: 6
        Original sample rate: 48000 Hz
        Streams: 5, Coupled: 1
        Channel Mapping Family: 2 Map: [2, 3, 4, 5, 0, 1]
        Packet duration:   20.0ms (max),   20.0ms (avg),   20.0ms (min)
        Page duration:    700.0ms (max),  609.2ms (avg),  100.0ms (min)
        Total data length: 8535952 bytes (overhead: 0.466%)
        Playback length: 1m:25.885s
        Average bitrate: 795.1 kbit/s, w/o overhead: 791.4 kbit/s
Logical stream 1 ended


So the goal is to have this 4-channel Ambisonics track ("Ambix" ACN SN3D format, first order) with 2-channel "head locked" Stereo (total of 6 channels) encoded to opus, (and muxed to mp4), so YouTube understands it and - more important - a WebVR Chrome App.

Any encoding guide, further Opus + Ambisonics documentation would help, especially as YouTube mentions it as a supported format. Am I missing something?

Thanks and best regards,

Martin

-- 
Martin Backhaus
Mail: contact at martinbackhaus.com


More information about the opus mailing list