[opus] Opus for ASR

Jean-Marc Valin jmvalin at jmvalin.ca
Fri Sep 14 14:13:14 PDT 2012

Hi Milan,

On 12-09-14 04:09 PM, Young, Milan wrote:
> A big part of optimizing for ASR will be an infrastructure that reports
> feedback on candidate improvements and facilitates regression testing. 
> To that end, Nuance is willing to publish a service which allows
> developers to upload codec binaries to our computational grid and report
> back a score. 

Do you have any thoughts yet on how you would give access to that?
I assume Nuance doesn't want to run binaries from random people on the
Internet :-)

> If such a service is of interest to you, please let me
> know of any design constraints you have in mind.

Well, we're definitely interested in adding that to our regression suite.

> In particular, I’d
> like to know preferences in accuracy vs. latency in the service.  For
> those of you familiar with speech recognition, you will be aware that
> testing involves tens and hundreds of thousands of utterances, hence my
> concern.

I suspect we'll want a "quick" test for automated regression testing
and a longer test for any experiments specifically designed to optimize
ASR accuracy. What kind of times are we talking about here (just the
order of magnitude would help)?


