Algorithm based on LLMs doubles lossless data compression rates

NoSpotOfGround@lemmy.world · 19 hours ago


barsoap@lemm.ee · 17 hours ago

I would guess it’s pretty hard to reconstruct data with a different LLM

I think the idea is to have compressor and decompressor use the exact same neural network. Looks like arithmetic coding with a learned function.

But yes, model size is probably going to be an issue.
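To make the "arithmetic coding with a learned function" idea concrete, here is a minimal sketch. It is not the paper's algorithm: `CountModel` is a hypothetical toy that just counts symbols and stands in for the LLM, and `ALPHABET`, `encode` and `decode` are names made up for illustration. It uses exact fractions instead of the fixed-precision arithmetic a real coder would use. The point is only that the decoder has to replay exactly the same predictions as the encoder.

```python
import math
from fractions import Fraction

ALPHABET = "abcdr "  # hypothetical symbol set for this toy example


class CountModel:
    """Toy stand-in for the LLM: adaptive symbol counts, no real context."""

    def __init__(self, alphabet):
        self.counts = {s: 1 for s in alphabet}

    def intervals(self):
        # Split [0, 1) into one slice per symbol, sized by predicted probability.
        total = sum(self.counts.values())
        low, out = Fraction(0), {}
        for s, c in self.counts.items():
            out[s] = (low, low + Fraction(c, total))
            low = out[s][1]
        return out

    def update(self, symbol):
        self.counts[symbol] += 1


def encode(message, model):
    low, width = Fraction(0), Fraction(1)
    for sym in message:
        lo, hi = model.intervals()[sym]
        low, width = low + width * lo, width * (hi - lo)
        model.update(sym)
    # Emit the shortest binary fraction n / 2**bits inside [low, low + width).
    bits = 0
    while True:
        bits += 1
        n = math.ceil(low * (1 << bits))
        if Fraction(n, 1 << bits) < low + width:
            return n, bits


def decode(code, bits, length, model):
    value = Fraction(code, 1 << bits)
    low, width = Fraction(0), Fraction(1)
    out = []
    for _ in range(length):
        target = (value - low) / width
        # Find the symbol whose slice contains the encoded value.
        for sym, (lo, hi) in model.intervals().items():
            if lo <= target < hi:
                break
        out.append(sym)
        low, width = low + width * lo, width * (hi - lo)
        model.update(sym)
    return "".join(out)


message = "abracadabra abracadabra"
code, bits = encode(message, CountModel(ALPHABET))
restored = decode(code, bits, len(message), CountModel(ALPHABET))
# A decoder whose model differs (here: symbols in reverse order) gets garbage.
garbled = decode(code, bits, len(message), CountModel(ALPHABET[::-1]))
print(bits, "bits vs", 8 * len(message), "bits raw")
print(restored == message, garbled == message)  # True False
```

The better the model predicts the next symbol, the wider the slice it assigns to it and the fewer bits the final number needs. Swap in a different model on the decoding side and the slices no longer line up, so the output is junk from the first symbol on.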

Harlehatschi@lemmy.ml · 15 hours ago

Yeah, but that would limit it to very few use cases. Most of the time you compress data either to transfer it to a different system or to store it for some time. In both cases you wouldn’t want to be tied to the exact same LLM, which leaves us with almost no use case.

I mean… cool research… kinda… but pretty useless.