For the purposes of this question, lets assume all future computers are gonna become locked down and you’d need corporate approval to run things… so with such a hypothetical dark future in mind: How to hoard as much as info as possible?
For the purposes of this question, lets assume all future computers are gonna become locked down and you’d need corporate approval to run things… so with such a hypothetical dark future in mind: How to hoard as much as info as possible?
I think that, while yes, LLMs are an option for data storage, I don’t think that they’re worth the effort. Sure, they might have a very wide breadth of information that would be hard to gather manually, but how can you be sure that the information you’re getting is a good replica of the source, or that the source that it was trained on was good in the first place? A piece of information could come from either 4chan or Wikipedia, and unless you had the sources yourself to confirm (in which case, why use the LLM as all), you’d have no way of telling which it came from.
Aside from that, just getting the information out of it would be a challenge, at least for the hardware of today and the near future. Running a model large enough to have a useful amount of world knowledge requires a some pretty substantial hardware if you want any amount of speed that would be useful, and with rising hardware costs, that might not be possible for most people even years from now.
So sure, maybe as an afterthought if you happen to have some extra space on your drives and oodles of spare RAM, but I doubt that it’d be worth thinking that much about.