Any regular person can scrape public data and use it for AI. It’s not illegal for companies or individuals, and it shouldn’t be.
Except companies have sued people for web scraping.
https://www.informationweek.com/it-leadership/linkedin-sues-after-scraping-of-user-data
Even over web scraping done for research purposes, not competitive ones.
https://www.engadget.com/2016-05-17-publicly-released-okcupid-profiles-taken-down-dmca-claim.html
The first one was because the data that was scraped wasn’t public. The second was because the data contained personal information and, I’m guessing, also because it wasn’t public.
As long as you don’t need a user account to access the data and it doesn’t contain personal information, it is fair game.
It will be the way we have all come to hate AI. Patriot Act 2.0.
proceeds to download the entirety of One Piece to train an AI