Meta Accused of Using 81.7TB of Pirated Books to Train AI Models
Meta is facing allegations of downloading over 81.7 terabytes of pirated books to train its artificial intelligence models, according to newly unsealed court documents. These revelations have emerged in a copyright infringement lawsuit filed by authors, including Sarah Silverman, Richard Kadrey, and Christopher Golden, who claim that Meta utilized their works without permission to develop its AI technologies.
The unsealed emails suggest that Meta's executives, including CEO Mark Zuckerberg, were aware of and approved the use of data from Library Genesis (LibGen), a well-known repository of pirated books, for training their AI models. Internal communications indicate that Meta employees discussed strategies to obscure the origins of the data, such as removing explicit copyright markings and altering metadata, to mitigate legal risks.
Meta has defended its actions by asserting that training AI models on publicly available datasets constitutes "fair use" under copyright law. The company has filed motions to dismiss the lawsuit, arguing that their use of the data is transformative and does not infringe upon the authors' rights.
This case is part of a broader wave of legal challenges against tech companies accused of using copyrighted materials without authorization to train AI systems. The outcomes of these lawsuits could have significant implications for the development of AI technologies and the protection of intellectual property rights in the digital age.
RECOMMENDED NEWS
Google is releasing previously Pixel-exclusive AI tools to all Google Photos users
Soon, all users of the photo viewing and editing app Google Photos may use a number of AI tools to ...
WhatsApp for iOS adds support for Passkeys
WhatsApp has announced that it is rolling out Passkeys support on iOS. The Meta owned messaging com...
Apple patches 2 zero-day vulnerabilities that were used to attack Intel-based Macs
Apple has released a critical update for macOS to patch a couple of zero-day vulnerabilities. The s...
Reddit's New Paywall: Some Subreddits to Require Subscription
Reddit is set to introduce a paywall feature for certain subreddits by the end of 2025, as confirme...
Vivaldi integrates Proton VPN for enhanced browsing privacy
Vivaldi Technologies has announced a partnership with Proton AG to integrate Proton VPN directly in...
Netflix introduces dialogue-only subtitles for a simplified viewing experience
Netflix has announced a quality-of-life update that could enhance the experience for users. Normall...
Comments on "Meta Accused of Using 81.7TB of Pirated Books to Train AI Models" :