• 0 Posts
  • 129 Comments
Joined 2 years ago
cake
Cake day: August 9th, 2023

help-circle


  • This gives legal backing to any lawsuits against ai companies.

    Currently everything on the Internet is assumed to be free. Robots.txt is just a suggestion and not legally enforceable. I assume RSL is supposed to communicate terms of use explicitly, like a EULA.

    It’s like open source licenses on github. Sure you can access the source, but here are the rules you have to follow. Yes, a lot of companies still ignore it, notably GNU licensed software has been abused by the likes of Apple.










  • The answer lies within the article

    Publishers legally control content that AI companies desperately want, but AI companies don’t always want to negotiate a license. The first-sale doctrine offered a workaround: Once you buy a physical book, you can do what you want with that copy—including destroy it. That meant buying physical books offered a legal workaround.

    And yet buying things is expensive, even if it is legal. So like many AI companies before it, Anthropic initially chose the quick and easy path. In the quest for high-quality training data, the court filing states, Anthropic first chose to amass digitized versions of pirated books to avoid what CEO Dario Amodei called “legal/practice/business slog”—the complex licensing negotiations with publishers. But by 2024, Anthropic had become “not so gung ho about” using pirated ebooks “for legal reasons” and needed a safer source.