[Speex-dev] Recommended GUI for Speex
me at sbooth.org
me at sbooth.org
Wed Jan 18 11:10:51 PST 2006
Conrad,
Thanks for the information- it was a big help. I am fairly happy with GUI
I ended up with- I posted a screenshot at http://sbooth.org/SpeexGUI.png
My approach is slightly different; instead of a drop-down list I just
threw everything out there. I try and maintain consistency by auto
enabling/selection of other options as appropriate. When the user clicks
the "Enable VBR" checkbox for example the VAD checkbox is automatically
selected if it isn't already; the ABR checkbox is only enabled if VBR is
selected, etc.
Again, thanks.
Stephen
> On Tue, Jan 17, 2006 at 02:19:17PM -0500, me at sbooth.org wrote:
>> Hi all,
>>
>> I have recently added Speex support to my app (http://sbooth.org/Max/)
>> and
>> am in the process of creating a GUI for the user to control the codec
>> parametere. I am new to Speex and as such I am not really sure which
>> parameters are more important than others, and deserve prominent
>> placement, etc. Is there a recommended GUI for people to follow or use
>> as
>> an example?
>
> Hi Stephen,
>
> I designed the encoding GUI for Sweep a few years ago, with a lot of
> help from Jean-Marc about the various options:
>
> http://www.metadecks.org/software/sweep/images/screenshots/tour/speex.png
>
> (that screenshot is from sweep 0.9.0 which is being released today ;-)
>
> The first thing is to choose the mode (narrowband, wideband or
> ultra-wideband). These modes are designed for 8kHz, 16kHz and 32kHz
> sampling rates respectively.
>
> Next is a matter of choosing the encoding features. Not all combinations
> are meaningful ("meaningful" meaning that, for example, VAD or DTX are
> ignored if VBR is not specified); hence in order to make a GUI that only
> provides correct options, it's better to combine the feature flags. In
> Sweep this was done with a drop-down list.
>
> For these feature flags:
>
> /* Feature flags */
> #define FEAT_VBR 1
> #define FEAT_VAD 2
> #define FEAT_DTX 4
>
> These are the meaningful combinations presented in the list:
>
> choices[] = {
> { 0, N_("Constant bitrate (CBR) with no features"),
> NULL
> },
> { FEAT_VAD, N_("CBR with Voice Activity Detection (VAD)"),
> N_("VAD generates low bitrate comfort noise to replace non-speech")
> },
> { FEAT_VAD | FEAT_DTX,
> N_("CBR with VAD and Discontinuous Transmission (DTX)"),
> N_("DTX marks extended pauses with a minimum bitrate signal")
> },
> { FEAT_VBR | FEAT_VAD,
> N_("Variable bitrate (VBR) with VAD"),
> N_("VBR allows the bitrate to adapt to the complexity of the speech; "
> "this selection uses VBR without DTX, which may improve performance
> "
> "compared to full VBR in the presence of background noise.")
> },
> { FEAT_VBR | FEAT_VAD | FEAT_DTX,
> N_("Variable bitrate (VBR) with all features"),
> N_("VBR allows the bitrate to adapt to the complexity of the speech, "
> "and handles pauses using VAD and DTX")
> }
>
> Encoding complexity and quality are slightly different. The quality
> relates
> to how good the actual audio is, complexity relates to how much compute
> power to put into working it out; or something like that. In any case,
> apparently complexity is less important than all of these (we hid it in
> the "extras" tab along with the number of frames per packet).
>
> hope that helps :-) though I haven't even described any of the newer
> options in speex 1.1 ...
>
> cheers,
>
> Conrad.
>
More information about the Speex-dev
mailing list