• grysbok@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    2
    ·
    14 hours ago

    Tbh, I’d be less testy about bots scraping my sites for AI input IF they respected my robots.txt file and didn’t slam the server. They’re just rude and I don’t like it. Sometimes they’re so rude it’s effectively a DOS attack.

    Tbh, my sites exist to get information out there and I don’t care if someone mirrors my sites, as long as the information is still accurate.

    • MudMan@fedia.io
      link
      fedilink
      arrow-up
      1
      ·
      8 hours ago

      I mean, that’s great and you’re well within your rights, but that’s not what people generally say when they express outrage about AI scraping. People straight up call it theft very often and seem to consider using online content for training is the equivalent of copying or distributing it.

      Which stands out to me because that was not what happened when the EU decided that Google News was effectively piracy after a whole bunch of news outlets complained. The consensus there seemed to be that it was a bummer to lose the service despite all the scraping.

      • grysbok@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        9 minutes ago

        Oh yeah, I get that there’s more than 2 reasons to be upset about AI scraping. I work in the academic library world and the vibe here is

        1. bots are rude
        2. AI is not a reliable source of facts

        We work with facts and information, and I have no expectation that my collection of facts is something to defend against replication.

        On the other hand, I’d be pissed AF if someone stole my research paper on 1800s family drama and reprinted it without attribution, or AI-hallucinated new pseudo-facts that were not in the source materials.

        Edit: my situation isn’t that of others and I totally get why artists and authors would be upset about AI bots stealing their work.