With large language models needing quality data, some publishers are offering theirs at a price while others are blocking access
OpenAI, the developer of ChatGPT, knows that high-quality data matters in the artificial intelligence business – and news publishers have vast amounts of it.
“It would be impossible to train today’s leading AI models without using copyrighted materials,” the company said this year in a submission to the UK’s House of Lords, adding that limiting its options to books and drawings in the public domain would create underwhelming products.