A coalition of publishers representing nearly 400 local and regional newspapers across the United States filed a copyright infringement lawsuit against OpenAI and Microsoft on June 24, 2026, alleging the tech giants illegally scraped millions of articles to train their generative AI models, including Microsoft’s Copilot. The complaint, lodged in the U.S. District Court for the Southern District of New York, marks one of the largest consolidated legal actions by the news industry against AI companies and escalates the high-stakes battle over the use of copyrighted content in training data.
The plaintiffs—a group of newspaper chains and independent publishers—argue that OpenAI and Microsoft copied their articles without consent, stripped out copyright management information, and used the works to build commercial AI products that directly compete with the newspapers’ own content. By feeding decades of local reporting into models like GPT-4o and integrating them into Copilot, the tech companies are accused of profiting from journalism while undermining the very business model that funds it.
A Giant Leap for AI Copyright Litigation
This latest suit dwarfs previous legal challenges in scale. Earlier actions included The New York Times’ December 2023 complaint and an April 2024 lawsuit by eight newspapers owned by Alden Global Capital. Now, by banding together, nearly 400 publications—ranging from small-town weeklies to major metro dailies—are pooling resources to take on two of the world’s most valuable companies. The coalition’s collective reach covers communities in all 50 states, and their coverage includes everything from city council meetings to high school sports, much of which is behind paywalls or relies on local advertising.
“Local journalism is the bedrock of an informed democracy, but the unchecked ingestion of our reporting by AI systems threatens to erase the economic incentive to produce it,” said a spokesperson for the publishers, who asked not to be named before a formal press conference. “This lawsuit is about preserving the right of news organizations to be compensated when their work is commercialized by others.”
The filing details how AI-generated summaries in Copilot can reproduce verbatim passages of copyrighted articles or closely paraphrase them, bypassing the need for readers to visit the original sources. That, the newspapers claim, diverts traffic and siphons away subscription and advertising revenue—a critical blow after years of industry contraction.
Inside the Allegations: What the Plaintiffs Claim
The 78-page complaint outlines four key counts: direct copyright infringement, contributory infringement, removal of copyright management information (CMI) in violation of the Digital Millennium Copyright Act (DMCA), and violation of state unfair competition laws. The newspapers assert that OpenAI’s web crawlers, used to assemble massive text corpora, systematically accessed and stored their websites’ content without permission. Microsoft, as both a partner and a user of OpenAI’s models in Copilot, is accused of deriving direct financial benefit from the infringing activity and failing to stop it.
Crucially, the suit highlights that publishers have technical means—such as robots.txt files and paywalls—to signal that their content should not be scraped. Yet, the crawlers allegedly ignored or circumvented these measures. The plaintiffs seek statutory damages that could reach into the billions given the volume of works involved, as well as a permanent injunction barring the companies from using their content to train or improve AI systems.
Legal experts note that the case blends novel AI-specific questions with familiar copyright doctrines. “The core issue is whether training an AI model on copyrighted works constitutes a fair use,” explained Mark Remington, an intellectual property attorney not involved in the case. “While prior rulings on search engines and book-scanning projects have provided some guidance, the commercial and competitive nature of generative AI casts the fair use analysis in a different light.”
The Local News Factor: An Existential Threat
For the nation’s local papers, the stakes couldn’t be higher. Over the past two decades, more than 2,500 local newspapers have closed or merged, creating vast news deserts. Those that survive operate on razor-thin margins, depending on every click and subscription. If AI assistants can summarize a Wednesday council meeting or a Friday night football game in a few paragraphs—without ever directing readers to the source—the papers lose the audience engagement that sustains them.
“When a tool like Copilot gives you a full account of a story we spent hours reporting, it effectively replaces the need to visit our site,” said a senior editor at one of the plaintiff newspapers, who spoke on condition of anonymity because of the pending litigation. “We’re not against technology, but we can’t afford to give our work away for free while Microsoft and OpenAI earn billions.”
The lawsuit comes at a time when some larger publishers have struck licensing deals with OpenAI, granting access to their content in exchange for payment. The Associated Press, Axel Springer, and Le Monde are among those that have entered into agreements. However, the vast majority of small and mid-sized papers lack the leverage or resources to negotiate such deals, leaving them vulnerable to uncompensated use of their archives.
Microsoft and OpenAI: The Expected Defenses
Neither Microsoft nor OpenAI has yet filed a formal response, but both companies have defended their practices in prior litigation. In the New York Times case, OpenAI argued that training on publicly available text is a “transformative” fair use, akin to a student reading books to learn and then generating new insights. The company also asserted that regurgitation of verbatim text is a rare bug, not a feature, and that it has implemented controls to prevent such outputs. Microsoft, meanwhile, has contended that Copilot is an assistive tool that provides brief snippets and source links, not a substitute for full articles.
A Microsoft spokesperson said via email, “Copilot is designed to help users be more productive while respecting the rights of content creators. We include citations and encourage users to visit the original websites. We look forward to addressing the claims in court.” OpenAI did not immediately respond to requests for comment.
If the case proceeds to summary judgment, the fair use analysis will weigh four factors: the purpose and character of the use, the nature of the copyrighted work, the amount and substantiality of the portion used, and the effect on the potential market. Courts will have to determine whether ingesting entire news archives to train a commercial product that can mimic and compete with those works tilts the scales against fair use.
The Broader AI Copyright Battlefield
This lawsuit is the latest salvo in a multipronged war over the data that powers generative AI. Authors, artists, music rights holders, and code developers have all sued AI companies. The outcomes of these suits are reshaping industry norms. Some tech firms are now proactively licensing datasets or building synthetic training data to avoid legal exposure. Others are banking on fair use and challenging the courts to draw clearer lines.
Policymakers are also watching closely. The U.S. Copyright Office is conducting a multi-year study on AI and copyright, with a report expected in late 2026 that could recommend legislative changes. In Congress, several bills would require transparency in AI training data or create a compulsory license for news content. However, partisan gridlock has stalled meaningful action, leaving the judiciary to set de facto rules through high-profile cases.
International jurisdictions add another layer. The European Union’s AI Act, which took effect in early 2025, includes specific provisions allowing copyright holders to opt out of having their data used for training, a requirement that OpenAI has said it would honor. That regional divergence could force AI developers to maintain separate models trained on different data sets, complicating their global strategies.
What’s at Stake for Copilot and the AI Ecosystem
Microsoft has bet heavily on Copilot, embedding it across Windows, Office, Edge, and Azure. If an injunction barred the company from using news content in future training, it could impact the freshness and locality of Copilot’s responses. While Microsoft might strike more licensing deals to supplement its training data—as it has done with some providers—the cost and complexity of negotiating with hundreds of individual publishers could be prohibitive.
Analysts suggest that a loss for the tech giants could accelerate a shift toward a “data-as-royalty” model, where AI companies pay ongoing fees to content owners. “This could be the Napster moment for AI,” said Sarah Jeong, a tech policy researcher. “Just as file sharing eventually led to legal streaming services, AI might evolve into a system where creators are fairly compensated. But the transition will be messy.”
For local newspapers, a victory would provide an immediate financial lifeline and establish a precedent that their content cannot be freely exploited. Even a settlement—which many observers consider likely—could result in payments and data usage restrictions that help stabilize the industry.
Looking Ahead: The Long Road to Resolution
Legal battles of this magnitude can drag on for years, but the pressure on AI companies is mounting with each new complaint. The Southern District of New York has become a central venue for these disputes, and the judges there are now well-versed in the technical and copyright complexities. A hearing on the newspapers’ motion for a preliminary injunction is expected within months, which could quickly reshape the landscape if granted.
In the meantime, the coalition behind this suit is urging other publishers to join, hinting that a consolidated class-action approach may be more effective than scattered individual lawsuits. “This is about the survival of local news,” the spokesperson added. “We didn’t pick this fight, but we have no choice but to defend the work our journalists do every day.”
The case—officially captioned Local Newspaper Publishers LLC et al. v. OpenAI Inc. et al., No. 26-cv-04512—is just getting started, but its ripples will be felt from boardrooms to newsrooms across the country.