[vorbis] Re: UTF8, vorbiscomment, oggenc, and 'vcedit.c'

Peter Harris peter.harris at hummingbird.com
Mon Jan 14 09:33:57 PST 2002



> > We can detect UNICODE via the magic word 'FEFF' (looks like FFFE in
> > little-endian files; looks like EFBBBF in UTF-8) and use that. In the
> 
> Aack!  That's a Windowsism; requiring it in anything but Windows will
> annoy a *lot* of people, myself included.

Actually, it's a UNICODEism. It's optional, but it's in the spec. 

http://www.unicode.org/unicode/faq/utf_bom.html#22
http://www.unicode.org/charts/PDF/UFE70.pdf

> How about this: read it in the current locale.  If a program (ie a
> script) wants to call it with UTF-8 data, it can change the locale to
> UTF-8 for that call.  That way, it works like every other modern locale-
> sensitive package, and isn't an exception.

Sounds like a good default for files without a BOM (Byte Order Mark).

Peter Harris

<p>--- >8 ----
List archives:  http://www.xiph.org/archives/
Ogg project homepage: http://www.xiph.org/ogg/
To unsubscribe from this list, send a message to 'vorbis-request at xiph.org'
containing only the word 'unsubscribe' in the body.  No subject is needed.
Unsubscribe messages sent to the list will be ignored/filtered.



More information about the Vorbis mailing list