Stateless refers to the HW interface. Stateless HW can accept decoding jobs in any order, as long as all the information is properly provided (parameters extracted and deduced from the bitstream, along with the previously decoded references). As a side effect, it is trivial to multiplex multiple streams on this type of HW.

The V4L2 layer keeps a bit of state (more like caching, to avoid re-uploading too much information for each job). Userspace is responsible for bitstream parsing and DPB management (including re-ordering).
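
For the curious, a rough sketch of what "properly provided" looks like with the mainline stateless H.264 controls (the control IDs and structs are the ones in linux/v4l2-controls.h; the media-request plumbing that ties controls to a specific buffer is elided, submit_decode_params is my own name, and error handling is omitted):

    #include <string.h>
    #include <sys/ioctl.h>
    #include <linux/videodev2.h>
    #include <linux/v4l2-controls.h>

    /* Attach one decode job's worth of parsed state to a media request:
       SPS/PPS plus the per-frame decode params, which include the DPB
       (i.e. which previously decoded buffers act as references). */
    static int submit_decode_params(int video_fd, int request_fd,
                                    struct v4l2_ctrl_h264_sps *sps,
                                    struct v4l2_ctrl_h264_pps *pps,
                                    struct v4l2_ctrl_h264_decode_params *dec)
    {
        struct v4l2_ext_control ctrl[3];
        struct v4l2_ext_controls ctrls;

        memset(ctrl, 0, sizeof(ctrl));
        ctrl[0].id   = V4L2_CID_STATELESS_H264_SPS;
        ctrl[0].size = sizeof(*sps);
        ctrl[0].ptr  = sps;
        ctrl[1].id   = V4L2_CID_STATELESS_H264_PPS;
        ctrl[1].size = sizeof(*pps);
        ctrl[1].ptr  = pps;
        ctrl[2].id   = V4L2_CID_STATELESS_H264_DECODE_PARAMS;
        ctrl[2].size = sizeof(*dec);
        ctrl[2].ptr  = dec;

        memset(&ctrls, 0, sizeof(ctrls));
        ctrls.which      = V4L2_CTRL_WHICH_REQUEST_VAL;
        ctrls.request_fd = request_fd;
        ctrls.count      = 3;
        ctrls.controls   = ctrl;

        return ioctl(video_fd, VIDIOC_S_EXT_CTRLS, &ctrls);
    }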



Oh, so you just provide whatever reference frames, if any, are needed, and it's on you to make sure you've decoded what's necessary first? The difference here basically being that the hardware will not do the "bookkeeping"?


Correct.


Thanks for explaining.


Are there performance implications from needing to upload the entire state needed for a single frame? Or do none of these decoders have such caching anyway, so this just pushes the complex pieces of resource management out to user space, where they arguably belong?


I don't think anything is uploaded anywhere; you just need more RAM to keep frames around for as long as they are necessary. The decoder operates on data in system RAM.
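
In other words, the bookkeeping moves up a layer: userspace keeps its own table of decoded buffers and only recycles one once it has been displayed and no future frame can reference it. A toy sketch of that table (the names are mine, not from any V4L2 header):

    #include <stdbool.h>
    #include <stdint.h>

    #define MAX_DPB 16  /* H.264 allows up to 16 reference frames */

    /* Userspace-side decoded picture buffer: frames stay pinned in
       system RAM for as long as the bitstream can still use them. */
    struct dpb_slot {
        void    *pixels;             /* decoded frame in system RAM */
        int32_t  frame_num;          /* identifier from the bitstream */
        bool     is_reference;       /* future frames may predict from it */
        bool     needed_for_output;  /* not yet displayed (re-ordering) */
    };

    struct dpb_slot dpb[MAX_DPB];

    /* A slot's memory can be recycled only when the frame has been
       displayed AND can no longer serve as a prediction reference. */
    static bool slot_free(const struct dpb_slot *s)
    {
        return !s->is_reference && !s->needed_for_output;
    }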


Is it generally faster that way, rather than having dedicated RAM alongside the ASIC? Or are the unit economics not worth it, and unified-memory systems are simply the dominant design these days?


Considering most pixels in the reference frames will be read, on average, less than once per generated frame, it makes no sense to have dedicated RAM.


But each reference frame is on average used for many generated frames, no? I mean that's kinda the point of them, isn't it?


Input is way smaller than output, so memory-performance considerations on the input side probably don't even register in the larger scheme of things (compared to having to write the decompressed frame to RAM and read it back again to scan it out to the display).

Say 4-8 KiB per frame on input turns into a ~4 MiB frame on output.
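
Back-of-envelope, assuming 1080p with 8-bit 4:2:0 output and a ~1.5 Mbit/s stream at 30 fps:

    #include <stdio.h>

    int main(void)
    {
        /* 8-bit 4:2:0: 1 byte of luma per pixel + half that for chroma */
        const double out = 1920.0 * 1080.0 * 1.5;  /* ~3 MiB per frame */
        const double in  = 6.0 * 1024.0;           /* ~6 KiB per frame at
                                                      ~1.5 Mbit/s, 30 fps */
        printf("output/input ratio: ~%.0fx\n", out / in);  /* ~500x */
        return 0;
    }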


I admit I'm not familiar with H.264; I thought the motion vectors and such were applied to the decompressed reference image. At least that's how we implemented the pseudo-MPEG1 encoder/decoder in class.

Not having to decompress the reference frame for every decoded frame seems like a win.
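
For illustration, the core of that kind of motion compensation is just a displaced block copy from the decoded reference (luma only; no sub-pel interpolation, bounds clipping, or residual add):

    #include <stdint.h>

    /* Predict one 16x16 macroblock at (bx, by) in the current frame by
       copying from the decoded reference frame, displaced by the motion
       vector (mvx, mvy). The decoded residual is then added on top. */
    static void motion_compensate(uint8_t *cur, const uint8_t *ref,
                                  int stride, int bx, int by,
                                  int mvx, int mvy)
    {
        for (int y = 0; y < 16; y++) {
            for (int x = 0; x < 16; x++) {
                int sx = bx + x + mvx;  /* source pixel in the reference */
                int sy = by + y + mvy;
                cur[(by + y) * stride + (bx + x)] = ref[sy * stride + sx];
            }
        }
    }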



