With large language models needing quality data, some publishers are offering theirs at a price while others are blocking access
OpenAI, the developer of ChatGPT, knows that high-quality data matters in the artificial intelligence business – and news publishers have vast amounts of it.
“It would be impossible to train today’s leading AI models without using copyrighted materials,” the company said this year in a submission to the UK’s House of Lords, adding that limiting its options to books and drawings in the public domain would create underwhelming products.
More Stories
TikTok breached EU advertising transparency laws, commission says
Dutch climate campaigners vow to take Shell to court again
Top winemaker ‘may have to leave its Spanish vineyards due to climate crisis’