With large language models needing quality data, some publishers are offering theirs at a price while others are blocking access
OpenAI, the developer of ChatGPT, knows that high-quality data matters in the artificial intelligence business – and news publishers have vast amounts of it.
“It would be impossible to train today’s leading AI models without using copyrighted materials,” the company said this year in a submission to the UK’s House of Lords, adding that limiting its options to books and drawings in the public domain would create underwhelming products.
More Stories
Former bosses at video games firm Ubisoft on trial in France accused of sexual harassment
High-rise, high expectations: is Casablanca’s finance hub a model for African development?
Millions of Australian workers to get an above-inflation pay rise as minimum wage lifts by 3.5%