[xiph-cvs] cvs commit: speex/doc manual.lyx

Jean-Marc Valin jm at xiph.org
Tue Dec 31 10:41:52 PST 2002



jm          02/12/31 13:41:50

  Modified:    doc      manual.lyx
  Log:
  ...

Revision  Changes    Path
1.40      +47 -7     speex/doc/manual.lyx

Index: manual.lyx
===================================================================
RCS file: /usr/local/cvsroot/speex/doc/manual.lyx,v
retrieving revision 1.39
retrieving revision 1.40
diff -u -r1.39 -r1.40
--- manual.lyx	31 Dec 2002 06:07:32 -0000	1.39
+++ manual.lyx	31 Dec 2002 18:41:46 -0000	1.40
@@ -3203,6 +3203,16 @@
  
 \layout Subsection*
 
+Quality
+\layout Subsection*
+
+Complexity (variable)
+\layout Standard
+
+With Speex, it is possible to vary the complexity allowed for the encoder.
+ 
+\layout Subsection*
+
 Variable Bit-Rate (VBR)
 \layout Standard
 
@@ -3229,23 +3239,53 @@
 \layout Subsection*
 
 Average Bit-Rate (ABR)
-\layout Standard
-
 \layout Subsection*
 
 Voice Activity Detection (VAD)
+\layout Standard
+
+When enabled, voice activity detection detects whether the audio being encoded
+ is speech or silence/background noise.
+ VAD is always implicitly activated when encoding in VBR, so the option
+ is only useful in non-VBR operation.
+ In this case, Speex detects non-speech periods and encode them with just
+ enough bits to reproduce the background noise.
+ This is calles 
+\begin_inset Quotes eld
+\end_inset 
+
+comfort noise generation
+\begin_inset Quotes erd
+\end_inset 
+
+ (CNG).
 \layout Subsection*
 
 Discontinuous Transmission (DTX)
-\layout Subsection*
+\layout Standard
 
-Perceptual enhancement
+Discontinuous transmission is an addition to VAD operation, that allows
+ to stop transmitting completely when the background noise is stationnary.
+ In file-based operation, since we cannot just stop writing to the file,
+ only 5 bits are used for such frames (corresponding to 250 bps).
 \layout Subsection*
 
-Complexity (variable)
-\layout Subsection*
+Perceptual enhancement
+\layout Standard
 
-Quality
+Perceptual enhancement is a part of the decoder which, when turned on, tries
+ to reduce (the perception of) the noise produced by the coding/decoding
+ process.
+ In most cases, perceptual enhancement make the sound further from the original
+ 
+\emph on 
+objectively
+\emph default 
+ (if you use SNR), but in the end it still 
+\emph on 
+sounds
+\emph default 
+ better (subjective improvement).
 \layout Section
 
 Command-line encoder/decoder

<p><p>--- >8 ----
List archives:  http://www.xiph.org/archives/
Ogg project homepage: http://www.xiph.org/ogg/
To unsubscribe from this list, send a message to 'cvs-request at xiph.org'
containing only the word 'unsubscribe' in the body.  No subject is needed.
Unsubscribe messages sent to the list will be ignored/filtered.



More information about the commits mailing list