OpenAI Accused of Deleting ChatGPT Training Data in Copyright Case
OpenAI allegedly deleted ChatGPT training data amid copyright infringement claims by publishers, raising concerns about evidence retention.
OpenAI is facing accusations of deleting crucial training data for its ChatGPT model amid copyright infringement lawsuits from The New York Times and the Daily News. The deletion, which OpenAI says was accidental, raises concerns about evidence retention in legal cases involving AI. The publishers claim ChatGPT was trained on their copyrighted content, and OpenAI had granted them access to virtual machines so they could search for that content within the training data. However, according to a letter filed with the U.S. District Court for the Southern District of New York, OpenAI engineers erased the publishers' search data from one of the virtual machines. TechCrunch's Kyle Wiggers reported: "Earlier this fall, OpenAI agreed to provide two virtual machines ... But on November 14, OpenAI engineers erased all the publishers’ search data ... which was filed in the U.S. District Court ... late Wednesday." Although OpenAI claims the deletion was accidental and that the data was recovered, the recovered format is allegedl…