[vorbis] Re: UTF8, vorbiscomment, oggenc, and 'vcedit.c'

Glenn Maynard g_ogg at zewt.org
Sat Jan 12 19:17:29 PST 2002



On Sat, Jan 12, 2002 at 12:37:24AM -0500, Glenn Maynard wrote:
> On Fri, Jan 11, 2002 at 06:58:28PM -0500, Peter Harris wrote:
> > > > umlaut-u in wchar_t *argv[] appears as superscript-n when wprintf()ed
> > >
> > > That seems to mean they're doing some weird conversion that we can't
> > > really fix.  It might be worth adding a @file option (read arguments),
> > > which is always in UTF-8, so that scripts that want to interface with
> > > this always have at least one reliable way of getting these characters
> > > through.  (Or perhaps in the locale/codepage encoding, I'm not sure.)
> > 
> > We can detect UNICODE via the magic word 'FEFF' (looks like FFFE in
> > little-endian files; looks like EFBBBF in UTF-8) and use that. In the
> 
> Aack!  That's a Windowsism; requiring it in anything but Windows will
> annoy a *lot* of people, myself included.
> 
> How about this: read it in the current locale.  If a program (i.e. a
> script) wants to call it with UTF-8 data, it can change the locale to
> UTF-8 for that call.  That way, it works like every other modern locale-
> sensitive package, and isn't an exception.
> 
> For Windows, you could get away with using FEFF to change to Unicode,
> but then you have fairly different behavior between Windows and Unix.
> That might be inevitable, though.

Actually, taking a look at vcomment, the @file notation isn't needed;
that's what -c is for.  The encoding question above still applies to the
input that option reads, however.
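
For reference, the byte-order-mark check discussed above could look
roughly like the sketch below.  This is only an illustration and is not
taken from vcomment or vcedit.c; the names comment_encoding and
detect_bom are made up for the example.  It reports which of the three
marks (EF BB BF, FE FF, FF FE) starts the buffer, and otherwise falls
back to "interpret in the current locale".

```c
#include <stddef.h>

/* Hypothetical names, for illustration only. */
typedef enum {
    ENC_LOCALE,   /* no mark found: interpret in the current locale */
    ENC_UTF8,     /* buffer starts with EF BB BF */
    ENC_UTF16_BE, /* buffer starts with FE FF */
    ENC_UTF16_LE  /* buffer starts with FF FE */
} comment_encoding;

/*
 * Inspect the first bytes of buf for a byte order mark.
 * On return, *bom_len holds the number of bytes to skip (0 if none).
 */
static comment_encoding detect_bom(const unsigned char *buf, size_t len,
                                   size_t *bom_len)
{
    *bom_len = 0;

    if (len >= 3 && buf[0] == 0xEF && buf[1] == 0xBB && buf[2] == 0xBF) {
        *bom_len = 3;
        return ENC_UTF8;
    }
    if (len >= 2 && buf[0] == 0xFE && buf[1] == 0xFF) {
        *bom_len = 2;
        return ENC_UTF16_BE;
    }
    if (len >= 2 && buf[0] == 0xFF && buf[1] == 0xFE) {
        *bom_len = 2;
        return ENC_UTF16_LE;
    }
    return ENC_LOCALE;
}
```

A caller would skip bom_len bytes and then convert the rest according to
the returned value.  Whether the mark should be honored outside Windows
at all is exactly the open question in this thread.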


-- 
Glenn Maynard
