<div class="gmail_quote"><div>Hi, Monty thanks for the reply - at least there weren&#39;t any huge surprises.&nbsp; As for your last statement on memory concerns, it is a big deal for our implementation.&nbsp; The two areas I need to watch out for seem to be related to the large codebooks &amp; large comment headers.&nbsp; I know the ogg spec says that you&#39;re not supposed to embed images, but the first ogg tagger I downloaded gave that option so it&#39;s bound to happen.&nbsp; It seems that the decoder allocates enough pages to gather the full comment header and then puts that into the comment structure in _vorbis_unpack_comment() right?&nbsp; If so then this is an unbounded memory access issue and I need to avoid this scenario.&nbsp; I know that some work has been done recently on-list to handle large comment headers but if you have any additional ideas on how to deal with this without unbounded memory usage that would be great.<br>

<br>Since I don&#39;t need to worry about comments at all I tried to create a dummy page that read everything from the comments into a single static ogg page/reference but that ended up failing badly and I didn&#39;t like the idea of fighting against the spec so much so I dropped that attempt for the moment.&nbsp; If you have any bright ideas I&#39;d love to hear them.<br>

<br>As for the large codebooks, my only idea there was to look at the libVorbis name at the beginning of the ogg file.&nbsp; If the date is beyond a certain point where the codebooks would be guaranteed to be smaller then I would decode otherwise I&#39;d dealloc and bail immediately.&nbsp; Any idea if that&#39;s a workable solution?&nbsp; I&#39;m concerned that some encoder implementations might not have well formed names in them.&nbsp; The version I am doing most of my testing on is &quot;Xiph.Org libVorbis I 20070622&quot; and the name for the beta4 ogg file is &quot;Xiphophorus libVorbis I 20010225&quot; - all I&#39;d do is look at the value of the numerical string and make a decision from there.&nbsp; This would avoid all of the extra pages required to read the large codebooks at init time, but if the check fails because someone&#39;s encoder doesn&#39;t populate this field properly then I&#39;m screwed.&nbsp; Is the name placed at that point under any sort of control or do different implementations put all sorts of odd things in there?<br>

<br>Ethan<br>&nbsp;</div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div class="Ih2E3d">&gt; I&#39;d really like to avoid needing to allocate all this extra memory for a<br>


&gt; very old encoder, but if there are a lot of ogg files out there then we&#39;re<br>

&gt; going to at least need to come up with some elegant solution. &nbsp;When is the<br>

&gt; earliest the decoder realizes that the input file is from beta 4? &nbsp;Is it<br>

&gt; when it captures the vendor name in _vorbis_unpack_comment()?<br>

<br>

</div>well, *I* have a bunch, but I don&#39;t count :-) &nbsp;And yes, this was back<br>

in 2001ish.<br>

<br>

The decoder doesn&#39;t really know/care about specific releases.<br>

However, if memory usage is the concern, you can watch for large<br>

tesselated codebooks. Codebook maptype 0 == tesselation.<br>

<br>

Monty<br>

</blockquote></div><br>