Judge forces OpenAI to produce 20 million chat logs in copyright lawsuit — CEO Sam Altman's "fair use" defense relies on this

Sam Altman, chief executive officer of OpenAI Inc., during the Federal Reserve Integrated Review of the Capital Framework for Large Banks Conference in Washington, DC, US, on Tuesday, July 22, 2025.
(Image credit: Getty Images | Bloomberg)
Disclaimer

Enjoy our content? Make sure to set Windows Central as a preferred source in Google Search, and find out why you should so that you can stay up-to-date on the latest news, reviews, features, and more.

Over the past few years, OpenAI has faced multiple copyright infringement lawsuits, with news outlets and publishers claiming ChatGPT used their content without authorization or compensation.

However, OpenAI has sought to dismiss the lawsuit by arguing that scraping online content to train AI models qualifies as fair use. Until recently, OpenAI has had the last laugh in court. In 2024, a New York federal judge recently dismissed a copyright infringement lawsuit filed against OpenAI by news outlets, including Raw Story and AlterNet.

The judge indicated that the plaintiffs didn't establish where the AI firm sourced its content, and that it potentially then used the scraped data to train ChatGPT without compensation.

OpenAI was recently compelled to produce 20 million chat logs from ChatGPT users in its ongoing court battle with the New York Times, in a case before U.S. Magistrate Judge Ona Wang in Manhattan (via Reuters).

Let us be clear about what is really at stake here. The alleged injury for which Plaintiffs truly seek redress is not the exclusion of CMI but the use of Plaintiffs' articles to develop ChatGPT without compensation. Whether there is another statute or legal theory that does elevate this type of harm remains to be seen. But that question is not before the Court today.

New York Federal Judge, Colleen McMahon

While OpenAI previously argued that this was a huge ask that would "disregard long-standing privacy protections," the judge indicated that the logs were important to establish the outlets' claims. Wang further indicated that producing the logs wouldn't violate ChatGPT users' privacy. "There are multiple layers of protection in this case precisely because of the highly sensitive and private nature of much of the discovery," the judge added.

OpenAI has already filed an appeal to wiggle its way out of Wang's order to produce 20 million chat logs to the presiding judge, U.S. District Judge Sidney Stein.

In the past, OpenAI CEO Sam Altman argued that copyright law doesn't categorically prohibit the use of copyrighted content for training AI models. Interestingly, the executive admitted that developing ChatGPT-like tools without copyrighted content is virtually impossible.

MediaNews Group executive editor Frank Pine indicated that OpenAI's leadership was "hallucinating when they thought they could get away with withholding evidence about how their business model relies on stealing from hardworking journalists."

It'll be interesting to see how the court case pans out, especially following recent reports suggesting that top AI labs might be unable to advance AI models due to a lack of high-quality training content amid plans to inject ads into ChatGPT.


Click to follow Windows Central on Google News

Follow Windows Central on Google News to keep our latest news, insights, and features at the top of your feeds!


Kevin Okemwa
Contributor

Kevin Okemwa is a seasoned tech journalist based in Nairobi, Kenya with lots of experience covering the latest trends and developments in the industry at Windows Central. With a passion for innovation and a keen eye for detail, he has written for leading publications such as OnMSFT, MakeUseOf, and Windows Report, providing insightful analysis and breaking news on everything revolving around the Microsoft ecosystem. While AFK and not busy following the ever-emerging trends in tech, you can find him exploring the world or listening to music.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.