Encyclopedia Britannica and Merriam-Webster sue OpenAI, claiming ChatGPT cannibalises nearly 100,000 of their articles
Back to Home
legal

Encyclopedia Britannica and Merriam-Webster sue OpenAI, claiming ChatGPT cannibalises nearly 100,000 of their articles

March 16, 202621 views2 min read

Encyclopedia Britannica and Merriam-Webster have sued OpenAI for allegedly using their copyrighted content to train ChatGPT without permission. The lawsuit follows a similar case against Perplexity and marks a significant legal battle over AI data usage.

In a significant legal escalation, Encyclopedia Britannica and Merriam-Webster have filed a copyright and trademark lawsuit against OpenAI, accusing the AI company of unlawfully using their copyrighted content to train ChatGPT. The lawsuit, filed in New York on March 13, 2026, alleges that OpenAI incorporated nearly 100,000 articles from these two reference giants into its AI training datasets without permission. The suit claims that ChatGPT not only reproduces this content verbatim in its responses but also undermines the value of the publishers' intellectual property.

Previous Legal Precedent

This move follows a similar lawsuit filed by the same publishers against Perplexity in late 2025, which also accused the AI search engine of using copyrighted material without consent. The timing of the lawsuits underscores growing concerns among traditional publishers about how AI companies are leveraging their content for training without explicit authorization. The legal actions mark a pivotal moment in the ongoing debate over AI data usage and intellectual property rights.

Implications for the AI Industry

The lawsuit could have far-reaching consequences for the AI industry, particularly for companies that rely on large-scale data scraping for model training. If successful, the case could force AI developers to seek explicit permissions from content creators or face significant legal repercussions. Moreover, it highlights the tension between the rapid advancement of AI technology and the need to protect creators’ rights. Legal experts suggest that this case may set a precedent for how copyrighted materials are treated in the training of large language models.

As the AI landscape continues to evolve, the outcome of this lawsuit could reshape how publishers and AI companies navigate the complex terrain of data rights and content usage. With more publishers likely to follow suit, the legal challenges may intensify, forcing the industry to redefine its approach to content licensing and AI development.

Source: TNW Neural

Related Articles