TikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAI

Luu Tuyen@lemmy.world · 3 months ago

TikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAI

Breve@pawb.social · 3 months ago

They’re too late, there’s going to be way too much AI generated garbage in their data and so many social media platforms like Reddit and Twitter have already taken measures to curb scrapers.

Drunemeton@lemmy.world · 3 months ago

I think that’s the “25-times faster” bit. They seem to be in a hurry to collect as much human-generated data as possible.

GHiLA@sh.itjust.works · 3 months ago

How does it know what is and isn’t?

Uh oh.

Drunemeton@lemmy.world · 3 months ago

Yeah…

Hey! Perhaps they’ll use A.I. to weed out the A.I. generated bits.

JackbyDev@programming.dev · 2 months ago

I mean, if I could theoretically take a snapshot of the entire Internet I’d rather do it now than later because there’s just gonna be more AI later.

chickenf622@sh.itjust.works · 3 months ago

Like those platforms aren’t already full of AI garbage as well. Training new models will require a cut-off date before the genie was let out of the bottle.

TikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAI

TikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAI

TikTok’s parent launched a web scraper that's gobbling up the world’s online data 25-times faster than OpenAI