We will use Grok 3.5 (maybe we should call it 4), which has advanced reasoning, to rewrite the entire corpus of human knowledge, adding missing information and deleting errors.
Then retrain on that.
Far too much garbage in any foundation model trained on uncorrected data.
I wonder how many papers he’s read since ChatGPT released about how bad it is to train AI on AI output.
Spoiler: He’s gonna fix the “missing” information with MISinformation.
She sounds Hot
She’s unfortunately can’t see you because of financial difficulties. You gotta give her money like I do. One day, I will see her in person.
Delusional and grasping for attention.
“and then on retrain on that”
Thats called model collapse.
So just making shit up.
So they’re just going to fill it with Hitler’s world view, got it.
Typical and expected.
I mean, this is the same guy who said we’d be living on Mars in 2025.
Humm…this doesn’t sound great
Lol turns out elon has no fucking idea about how llms work
It’s pretty obvious where the white genocide “bug” came from.
“We’ll fix the knowledge base by adding missing information and deleting errors - which only an AI trained on the fixed knowledge base could do.”
“Deleting Errors” should sound alarm bells in your head.
And the adding missing information doesn’t. Isn’t that just saying we are going to make shit up.
Huh. I’m not sure if he’s understood the alignment problem quite right.
The thing that annoys me most is that there have been studies done on LLMs where, when trained on subsets of output, it produces increasingly noisier output.
Sources (unordered):
- What is model collapse?
- AI models collapse when trained on recursively generated data
- Large Language Models Suffer From Their Own Output: An Analysis of the Self-Consuming Training Loop
- Collapse of Self-trained Language Models
Whatever nonsense Muskrat is spewing, it is factually incorrect. He won’t be able to successfully retrain any model on generated content. At least, not an LLM if he wants a successful product. If anything, he will be producing a model that is heavily trained on censored datasets.
It’s not so simple, there are papers on zero data ‘self play’ or other schemes for using other LLM’s output.
Distillation is probably the only one you’d want for a pretrain, specifically.
deleted by creator
What the fuck? This is so unhinged. Genuine question, is he actually this dumb or he’s just saying complete bullshit to boost stock prices?
my guess is yes.
Yes! We should all wholeheartedly support this GREAT INNOVATION! There is NOTHING THAT COULD GO WRONG, so this will be an excellent step to PERMANENTLY PERFECT this WONDERFUL AI.
He knows more … about knowledge… than… anyone alive now