[opus] Opus for ASR

Jean-Marc Valin jmvalin at jmvalin.ca
Fri Sep 14 14:13:14 PDT 2012


Hi Milan,

On 12-09-14 04:09 PM, Young, Milan wrote:
> A big part of optimizing for ASR will be an infrastructure that reports
> feedback on candidate improvements and facilitates regression testing. 
> To that end, Nuance is willing to publish a service which allows
> developers to upload codec binaries to our computational grid and report
> back a score. 

Did you have any thoughts yet how you were going to give access to that?
I assume Nuance doesn't want to run binaries from random people on the
Internet :-)

> If such a service is of interest to you, please let me
> know of any design constraints you have in mind.

Well, we're definitely interested in adding that to our regression suite.

> In particular, I’d
> like to know preferences in accuracy vs. latency in the service.  For
> those of you familiar with speech recognition, you will be aware that
> testing involves tens and hundreds of thousands of utterances, hence my
> concern.

I suspect we'll want the "quick" test for automated regression testing
and a longer test for any experiments specifically designed to optimize
ASR accuracy. What kind of times are we talking about here (just the
order of magnitude would help)?

Cheers,

	Jean-Marc


More information about the opus mailing list