[Speex-dev] voice activity detection
Joachim Boomberschloss
boomberschloss at yahoo.com
Wed Jan 19 13:03:48 PST 2005
Thanks Tom and Steve. Your replies make me quite
happy. It is strange that this information is not
available anywhere other than this list!
Joe ;)
--- Tom Grandgent <tgrand at canvaslink.com> wrote:
> Yes, you can use it independently by using
> speex_preprocess().
> This function can do noise removal, AGC, and VAD.
> I've been
> using it to do all three and it usually works very
> well. There
> can be a train going by outside, producing lots of
> sound that
> makes it through the noise filter, and yet the VAD
> knows it's
> not speech.
>
> However, sometimes the VAD seems to get into a bad
> state and
> will either always indicate speech (even when the
> input is
> very nearly silent) or rarely indicate it such that
> the person
> has to shout to be heard. When this happens, it
> never recovers,
> and I have to destroy and recreate the state to fix
> it. I don't
> know what could be wrong... it probably works fine
> for everyone
> else...
>
> Also, be aware that there are probabilities that can
> be tweaked
> to alter the sensitivity of the VAD, as described in
> this post
> to the list awhile ago:
>
> At 06:37 AM 5/18/2004, Steve Kann wrote:
> >I don't seem to get that; I suppose you've already
> changed the thresholds
> >to make things more sensitive though:
> >
> >This is what I have now in iaxclient; the first
> number is the "initial"
> >probability to go from not speech -> speech. The
> second is to go from
> >speech -> not speech. [the difference implements
> hysteresis].
> >
> >/* if (st->speech_prob> .35 ||
> (st->last_speech < 20 &&
> >st->speech_prob>.1)) */
> > if (st->speech_prob> .30 || (st->last_speech
> < 20 &&
> > st->speech_prob>.07))
>
> Might be helpful for you.
>
> Tom
>
> Joachim Boomberschloss <boomberschloss at yahoo.com>
> wrote:
> >
> > Hello,
> >
> > I am wondering whether Speex's voice activity
> > detection mechanism can be used independently of
> > encoding and decoding speech, and how reliable it
> is
> > in general. I need this kind of functionality for
> a
> > project I am working on, and I was wondering if
> Speex
> > provides it in a ready-to-use fashion, or if it
> > requires modification of the source code. If so, I
> > would be glad to consider doing this work for the
> > benefit of others with some guidance (depending on
> > complexity and usefulness).
> >
> > Thanks,
> >
> > Joe.
>
>
__________________________________
Do you Yahoo!?
All your favorites on one personal page Try My Yahoo!
http://my.yahoo.com
More information about the Speex-dev
mailing list