Home GADGETS Meta staff torrented nearly 82TB of pirated books for AI training —...

Meta staff torrented nearly 82TB of pirated books for AI training — court records reveal copyright violations

Meta staff torrented nearly 82TB of pirated books for AI training — court records reveal copyright violations

Facebook parent-company Meta is currently fighting a class action lawsuit alleging copyright infringement and unfair competition, among others, with regards to how it trained LLaMA. According to an X (formerly Twitter) post by vx-underground, court records reveal that the social media company used pirated torrents to download 81.7TB of data from shadow libraries including Anna’s Archive, Z-Library, and LibGen. It then used this information to train its AI models.

The evidence, in the form of written communication, shows the researchers’ concerns about Meta’s use of pirated materials. One senior AI researcher said way back in October 2022, “I don’t think we should use pirated material. I really need to draw a line here.” While another one said, “Using pirated material should be beyond our ethical threshold,” then they added, “SciHub, ResearchGate, LibGen are basically like PirateBay or something like that, they are distributing content that is protected by copyright and they’re infringing it.”

Did Meta pirate eBooks?

(Image credit: Future)

Source link