[opus] Opus for ASR
Jean-Marc Valin
jmvalin at jmvalin.ca
Fri Sep 14 14:13:14 PDT 2012
Hi Milan,
On 12-09-14 04:09 PM, Young, Milan wrote:
> A big part of optimizing for ASR will be an infrastructure that reports
> feedback on candidate improvements and facilitates regression testing.
> To that end, Nuance is willing to publish a service which allows
> developers to upload codec binaries to our computational grid and report
> back a score.
Did you have any thoughts yet how you were going to give access to that?
I assume Nuance doesn't want to run binaries from random people on the
Internet :-)
> If such a service is of interest to you, please let me
> know of any design constraints you have in mind.
Well, we're definitely interested in adding that to our regression suite.
> In particular, I’d
> like to know preferences in accuracy vs. latency in the service. For
> those of you familiar with speech recognition, you will be aware that
> testing involves tens and hundreds of thousands of utterances, hence my
> concern.
I suspect we'll want the "quick" test for automated regression testing
and a longer test for any experiments specifically designed to optimize
ASR accuracy. What kind of times are we talking about here (just the
order of magnitude would help)?
Cheers,
Jean-Marc
More information about the opus
mailing list