Home / Business and Economy / AI's 'Theft': Publishers Fight Tech Giants
AI's 'Theft': Publishers Fight Tech Giants
8 Apr
Summary
- Publishers are suing Google over AI training data use.
- AI companies argue training on data is fair use.
- The lawsuit impacts the economic model of the creative industry.

Major publishing houses are challenging generative AI companies over the use of copyrighted content for model training. Hachette, a prominent publisher, has joined a lawsuit against Google, alleging that the tech giant used its books and other copyrighted materials without permission to train its AI model, Gemini.
Google contends that training large language models (LLMs) on vast datasets is a transformative process, analyzing language patterns rather than reproducing original works, and thus falls under fair use. Publishers, however, view this as a form of theft, arguing that their authors' intellectual property is being monetized by AI companies.
This conflict has significant implications for the creative economy, potentially disrupting the established model where creators are compensated for their work. David Shelley, CEO of Hachette UK, emphasizes the need to protect authors' rights and the future of creative industries, warning that ignoring copyright could devastate the sector.
Legal actions are proliferating across the U.S. and Europe, with other authors and organizations also suing AI firms. A recent case saw authors win a $1.5 billion settlement against Anthropic, not for copyright breach but due to the high volume of pirated books used in training data. Shelley likens the AI companies' approach to piracy, asserting that if LLMs can reproduce content based on ingested works, it infringes on the original creators' intellectual property.
The financial stakes are immense, with the generative AI market valued at over $100 billion. Shelley advocates for creators to be remunerated or have the right to refuse the use of their work in AI training. He stresses that the current legal framework of copyright, while needing evolution, is crucial for ensuring creators can earn a living.
Hachette is also exploring AI's operational uses, such as metadata entry and customer service, but firmly draws a line against AI-generated content creation, believing it undermines the role of human authors. Initiatives like the "Human Authored" certification are emerging, highlighting the growing value placed on human creativity in an AI-dominated landscape.
The fight extends beyond business preservation; it's about safeguarding societal access to new ideas and stories. Shelley calls for collective action, including lobbying governments and engaging with the press, to update legal precedents and uphold copyright, which he describes as essential for a flourishing society.
Despite the challenges, Hachette continues to innovate, focusing on customer needs across digital and analog formats. Shelley's approach balances a 200-year legacy with a forward-looking strategy, aiming to ensure creators can benefit from their ideas while adapting to a changing technological world.