[Icecast-dev] Broken UTF-8 in yp.xml?

Assen Totin assen.totin at gmail.com
Sun Jan 2 09:22:49 PST 2011


Hi all,

I apologize if this subject has been discussed before.

I recently wrote a small add-on for XBMC Media Centre (www.xbmc.org)
which allows the users to listen to Icecast radio stations. My primary
source of information is the yp.xml file.

While the yp.xml file is UTF-8 there seem to be some entries with
broken UTF-8 (which most often looks like multiple-encoded UTF-8, but
with one "wrong" byte which prevents reversing the string back to
regular UTF-8).

As an example, look for the topmost entry in ypxml which has
85.239.108.31 in its listen_url - that should be a radio station Welle
Türingens Rock with the letter "ü" being broken.

To keep the posting short, I have written a short page with
information what I was able to debug so far:
http://bilbo.online.bg/~assen/icecast-addon/unicode.htm

I'll be grateful if anyone could shed some light about such entries in
yp.xml and ideas how to handle them.

Thank you in advance for your attention,

Assen Totin


More information about the Icecast-dev mailing list