With large language models needing quality data, some publishers are offering theirs at a price while others are blocking access
OpenAI, the developer of ChatGPT, knows that high-quality data matters in the artificial intelligence business – and news publishers have vast amounts of it.
“It would be impossible to train today’s leading AI models without using copyrighted materials,” the company said this year in a submission to the UK’s House of Lords, adding that limiting its options to books and drawings in the public domain would create underwhelming products.
More Stories
Chinese fishing fleets using North Korean forced labour in potential breach of sanctions, report claims
Qantas posts $1.39bn profit as holidaymakers flock to Jetstar
‘They’ve lost my trust’: consumers shun companies as bosses kowtow to Trump