[opus] Submitting a patch that exposes VAD voiced/unvoiced signal type
jmvalin at mozilla.com
Wed Jun 7 06:46:52 UTC 2017
There are two main issues with a patch like the one you're proposing.
First, the data is only valid when SILK is being used and is essentially
undefined in CELT mode. The second issue is that exposing internals
makes it impossible to improve these algorithms later without breaking
API compatibility. I'm not fundamentally against trying to expose some
information, but there would have to be a way to address those two issues.
On a slightly different topic, have you looked at the VAD probability
that's computed in analysis.c (along with the speech/music probability)?
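
To illustrate the first issue above (the signal type only means something
when SILK is running), here is a minimal sketch of how a caller could tell
from an encoded packet which mode produced it. It relies only on the TOC
byte layout in RFC 6716, section 3.1: the top five bits are the "config"
number, with 0..11 meaning SILK-only, 12..15 hybrid, and 16..31 CELT-only.
The names toc_mode, toc_get_mode and packet_uses_silk are hypothetical and
not part of libopus, and treating hybrid as "SILK in use" is an assumption
based on hybrid mode running SILK for the lower band.

```c
#include <stddef.h>

/* Coding modes from RFC 6716, section 3.1.
 * Hypothetical type name, not part of the libopus API. */
typedef enum {
    TOC_MODE_SILK_ONLY,
    TOC_MODE_HYBRID,
    TOC_MODE_CELT_ONLY
} toc_mode;

/* Classify a packet from the "config" field, i.e. the top five bits
 * of the TOC byte (the first byte of every Opus packet). */
toc_mode toc_get_mode(unsigned char toc)
{
    unsigned config = (toc >> 3) & 0x1F;   /* 0..31 */

    if (config < 12)
        return TOC_MODE_SILK_ONLY;         /* configs 0..11  */
    if (config < 16)
        return TOC_MODE_HYBRID;            /* configs 12..15 */
    return TOC_MODE_CELT_ONLY;             /* configs 16..31 */
}

/* Returns 1 if the packet was coded with SILK active (SILK-only or
 * hybrid), 0 if it is CELT-only, and -1 if there is no TOC byte to read.
 * Assumption: a SILK-derived signal type would only be meaningful
 * when this returns 1. */
int packet_uses_silk(const unsigned char *packet, size_t len)
{
    if (packet == NULL || len == 0)
        return -1;
    return toc_get_mode(packet[0]) != TOC_MODE_CELT_ONLY;
}
```

A caller would pass the buffer filled in by the encoder and skip any
SILK-specific information whenever packet_uses_silk() does not return 1.
This only addresses the validity question, not the API-compatibility one.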
> I'm reaching out because we'd like to contribute back to the project
> a patch that exposes the signal type of the audio packet when
> encoding the PCM audio to OPUS. We've found the Opus VAD algorithm to
> be exceptional in this regard and have written a library that
> leverages this information for audio end-pointing. Attached is the
> patch. Please let us know if you'd be willing to accept it, or if
> you'd prefer we fork libopus or recommend some other option.