[foms] What data is needed for adaptive stream switching?
chris at pearce.org.nz
Mon Nov 22 16:35:45 PST 2010
On 23/11/2010 12:26 p.m., Benjamin M. Schwartz wrote:
> AFAICT, WebM streams may be "variable frame rate" (VFR), i.e. frames may have arbitrary
> duration and be intermittent.
This is correct.
> Suppose your decode pipeline looks like:
> network source -> demux -> queue1 (encoded frames) -> decode -> queue2
> (decoded frames) -> painter
> I favor a solution that either (1) exposes the temporal range currently in
> each queue,
To reduce memory footprint we only keep about decoded 10 video frames in
advance of the playback position (but about 2s of audio). We don't demux
the entirety of our downloaded data, also to save memory.
> or (2) exposes the browser's estimate of the time remaining
> before each queue underruns (possibly +inf). Option 1 is just like your
> design (using @buffered), but extended to distinguish network and CPU limits.
There are two separate problems here, detecting slow network, and
detecting slow CPU. My @buffered proposal can only detect slow network.
So you want us to expose an attribute which simply reports whether the
we expect to drop frames in future?
I think predicting frame dropping due to slow CPU in a timely manner is
hard. Given that we only keep 10 decoded frames queued, even if we post
an event notification when things go bad, you wouldn't have enough time
to setup a new connection to switch to a lower quality stream before you
start to drop frames anyway.
>> Or we could also expose an attribute which simply reports whether the
>> browser is dropping frames, and so you can use this to determine whether
>> you need to switch down to a lower quality stream in order to get smooth
> Please don't make dropped frames the primary metric. A correctly
> functioning player should adapt so that frames are never dropped.
So you mean guess in advance what resolution and bitrate the user's
hardware will be able to handle playing? How do you tell the bitrate of
a VBR video in advance? How do you factor in the affect of variable
>> Given the painting architecture in Firefox, it's very inconvenient to
>> separately count the number of frames which were not painted due to the
>> video being hidden/off screen (this was suggested by Andy Berkheimer).
>> If you wanted to detect this, you could detect when the "painted frames"
>> counter completely stops incrementing. Even when we're CPU starved, we
>> should be able to paint the occasional frame...
> My impression is that in FF4 and other upcoming browsers, sites doing
> basic video playback use almost no CPU for painting, thanks to GPU
> acceleration. As such, if the system is CPU-starved it will be due to
People do crazy things with web browsers these days. Decoding is not
only heavy user of the CPU!
> Also, switching to a stream with lower bitrate or resolution
> may not affect the CPU load from painting.
Sometimes we have to fallback to scaling in the CPU, for example if the
user's graphics card is blacklisted due to buggy drivers. :(
On a related note, a lower quality stream will be quicker to decode as
well, so frames will be less likely to be late. This is particularly
important on mobile platforms, or on underpowered netbooks.
> Of course,, you cannot safely hard-drop frames at the decode stage. The
> system must decode every frame, then skip ahead when a keyframe arrives.*
If Firefox thinks it's going to audio underrun, we skip the video decode
up to the next key frame, and resume decoding from there. How else can
you save time during the decode? We drop frames which are presented late
> Due to variance in decode times, it's likely that at the edge of
> CPU-starvation you will see some frames miss their display deadline even
> though the stream can be decoded in real-time on average. If you can't
> afford len(queue2)>1, then you can substitute deadline-miss statistics to
> indicate when the system is at the edge of CPU starvation, without hard
> frame-dropping. (You can even avoid frame-dropping altogether by
> displaying those frames even if they show up a few ms late, or by setting
> their deadline a few ms ahead of when they are actually needed.)
What would the deadline-miss statistics look like?
We drop frames for two reasons:
1. It looks like we're going to audio underrun, so we need to shed
work to keep the audio playing smoothly.
2. The frame arrived late for painting (possibly due to slow decode,
or delays in the pipeline).
In case 1 what would you report for the deadline-miss?
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the foms