[Vorbis] Hz vs bitrate?

Sat Jan 21 17:24:03 PST 2006

James wrote:

> the Vorbis FAQ says:
> "mid to high quality (8kHz-48.0kHz, 16+ bit, polyphonic) audio and music at 
> fixed and variable bitrates from 16 to 128 kbps/channel."

hmmm, I actually think those numbers need to be revised.

> What is the difference between Hz and bitrate?

hmmm.  This is going to take a bit of explaining.  I'm not an expert on the 
technicalities, but I understand it enough to try and describe it.

Hz is probably not a good term to use, but it's become a standard so we 
will.  The true count is samples per second.

To represent a sound, you need to take a snapshot of audio, called a 
sample.  Actually, you need two snapshots or samples before you can do 
anything.  Since sound is a series of vibrations, you need to be able to 
capture both the on and off nature of a given vibration before you can 
reproduce it.

This means that the higher the sound you want to record digitally, the more 
samples you need to take.  8000 samples per second will be able to 
reproduce sounds of up to 4000 vibrations per second, the kind of signal 
you get over a telephone line.  If you want to represent the theoretical 
limit of human hearing, 20000 vibrations per second, you need at least 
40000 samples per second.  This is where the CD standard of 44100 samples 
per second (or Hz) comes from.  Note that I think it's more complicated 
than this, hence the extra 4100 (I'm sure someone else on the list will 
explain this).

It's a bit like frame rate with video.  If you record something at 15 fps, 
movement will look jerkier than movement recorded at 30fps, etc.  Except in 
the case of audio, the lower the rate, the more muffled it will sound.

Bit rate is simply a measure of data, or at least data per second.  With 
uncompressed audio, there is a direct relationship between the sample rate 
and the amount of data it takes to represent that sample rate.  A 44.1kHz 
16-bit stereo signal takes 1411.2 kbps, or approximately 10.4 megs per 
minute to record.  A 44.1kHz 16-bit mono file would take half of this, as 
would a 44.1kHz 8-bit stereo file or a 22.05kHz 16-bit stereo file.  I 
won't get into a discussion of bits per sample, just to say that samples of 
lower bit depth are noisier than samples of higher bit depth.

Now, formats like Ogg Vorbis and MP3 compress audio by making calculated 
guesses about the sounds humans aren't likely to hear.  As part of this 
process, such formats allow us to make some of the decisions by deciding 
how much to throw away, or to put it more simply, how much data to use to 
represent the original sound.  So, using our 44.1kHz stereo sample, We can 
choose to use as little as 48kbps or as much as approx 500kbps to store 
this sound.  At 500kbps, more of the original sound should be present than 
at 48kbps.

This is also why we can go lower with lower sampling rates.  A 44.1kHz mono 
sound or a 22.05kHz stereo sound is less complex than a 44.1kHz stereo 
sound, and is therefore easier to store.

Also note that a file's bit rate is simply an indicater of data size.  You 
can, for example, have a 64kbps 44.1kHz stereo sound, or a 64kbps 22.05kHz 
stereo sound, or a 64kbps 44.1kHz mono sound.  All will take up the same 
amount of disk space.  But the first file will be using less data to 
represent more complex sound than the other two.  So at lower rates, one 
needs to decide whether it's better to have higher complexity sound 
represented more poorly, or lower complexity sound represented more 
accurately.  And of course, when you get lower, you also have to decide 
whether it's better to have say 11.025kHz stereo sound, or 22.05kHz mono 
sound, at a given rate.

ONe more comment about bit rates and sound quality.  Bit rates are, as I've 
said above, merely a measure of quantity of data.  This means that it's 
only at all valid to compare bit rates of a given codec to itself. 
Encoding at 128kbps may be overkill with one codec, and insufficient with 
another.  Especially with MP3, even the sound quality that you'll get at a 
given bit rate will vary from encoder to encoder.  So ultimately, unless 
you need to use a specific data size (e.g. for streaming), the best measure 
of how good something sounds is to listen to it and see.

> Doesn't MP3 support higher bitrates?

Yes, and so does Ogg Vorbis.  Quality 10 in the Xiph encoder aims at approx 
500kbps for a 44.1kHz stereo sound.  I've only ever seen MP3 go to 320kbps, 
but there may be encoders which go higher.  But it's all a matter of how 
much data you want to represent a sound - you could write an encoder which 
uses 4 times the uncompressed bit rate for storage, but there wouldn't be 
much point.  It's only an indicator of how much data is being used to store 
the sound.

Geoff.