A few thoughts that aren't related to each other. 1. This is a brilliant hack. K...

leeoniya · on Sept 20, 2022

I also thought it was odd that AVIF was not compared. It would show a major quality and size improvement over WebP.

goombacloud · on Sept 20, 2022

The comparison doesn't make much sense, because a fair comparison has to measure decompressor size plus encoded image size. The decompressor here is huge because it includes the whole AI model. Also, everyone needs to have the exact same copy of the model in the decompressor for it to work reliably.

wongarsu · on Sept 20, 2022

Only if the decompressor and the image are transmitted over the same channel at the same time, and you only have a small number of images. When compressing images for the web I don't care if a WebP decompressor is smaller than a JPEG or PNG decompressor, because the recipient already has all of those.

Of course Stable Diffusion's 4 GB is much more extreme than Brotli's 120 kB dictionary size, and would bloat a browser's install size substantially. But for someone like Instagram or a camera maker it could still make sense. Or imagine phones having the dictionary shipped in the OS to save just a couple of kB on bad data connections.
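
To put rough numbers on the amortization argument, here is a minimal sketch of the break-even point: how many images have to be sent before a codec with a huge decoder transfers fewer total bytes than one with a small decoder. Only the 4 GB model size comes from this thread. The 1 MB baseline decoder and the 5 kB / 10 kB per-image sizes are made-up placeholders, and the function names are hypothetical.

```python
from typing import Optional


def total_bytes(decoder_bytes: int, image_bytes: int, n_images: int) -> int:
    """Bytes the recipient receives: one decoder plus n encoded images."""
    return decoder_bytes + image_bytes * n_images


def break_even_images(big_decoder: int, small_decoder: int,
                      big_codec_image: int, small_codec_image: int) -> Optional[int]:
    """Smallest image count at which the big-decoder codec's total is
    strictly smaller than the small-decoder codec's total.

    Returns None if the big-decoder codec never saves bytes per image.
    """
    saving_per_image = small_codec_image - big_codec_image
    if saving_per_image <= 0:
        return None  # no per-image saving, so it can never catch up
    extra_decoder = big_decoder - small_decoder
    # Need n * saving_per_image > extra_decoder, i.e. n > extra / saving.
    return max(0, extra_decoder // saving_per_image + 1)


# Assumed figures: 4 GB model (from the thread), 1 MB baseline decoder,
# 5 kB vs 10 kB per encoded image (placeholders, not measurements).
n = break_even_images(4_000_000_000, 1_000_000, 5_000, 10_000)
print(n)
```

If the decoder is already on the device (shipped with the OS or the app), its size drops out of the per-transfer cost entirely and only the per-image saving matters, which is the point about Instagram or a camera maker.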

operator-name · on Sept 20, 2022

Even if dictionaries were shipped, the biggest difficulty would be performance and resources. Most of these models require beefy compute and a large amount of VRAM that isn't likely to ever exist on end devices.

Unless that can be resolved, it just doesn't make sense to use it as a (de)compressor.