Meta's latest legal wheeze is to insist that pirating books is fair use, actually. And it might be working.

artifex@piefed.social · 5 hours ago

Meta's latest legal wheeze is to insist that pirating books is fair use, actually. And it might be working.

OfCourseNot@fedia.io · 3 hours ago

There’s an argument to be made that it is, in fact, not ‘reading’. The training of the model could be considered a lossy compression of the data. And streaming movies in a lossy compression format is not fair use, is it?

ryathal@sh.itjust.works · 1 hour ago

The model doesn’t stream out anyone’s content though. The article mentions that the plaintiffs have provided no examples of a prompt that creates anything substantial.

Streaming a lossy compression would generally be infringement, but there is definitely a point where it becomes not infringement if it’s lossy enough.

What a model generally stores, is factual information that isn’t copyright in the first place. It’s storing word counts, sentence lengths, sentiment analysis, and so on.

Fatal@piefed.social · 2 hours ago

It’s not the storage of the information that matters as much as the presentation. Google’s search index stores a huge amount of copyrighted material, even losslessly. But they only present small snippets at a time which is not considered copyright infringement. The question really is whether or not the information being presented by the models is in a format which is considered copyright infringement. So far, courts have not found that they are.