With large language models needing quality data, some publishers are offering theirs at a price while others are blocking access
OpenAI, the developer of ChatGPT, knows that high-quality data matters in the artificial intelligence business – and news publishers have vast amounts of it.
“It would be impossible to train today’s leading AI models without using copyrighted materials,” the company said this year in a submission to the UK’s House of Lords, adding that limiting its options to books and drawings in the public domain would create underwhelming products.
More Stories
Australia has been hesitant – but could robots soon be delivering your pizza?
Dutch climate campaigners vow to take Shell to court again
Ben & Jerry’s co-founder arrested for Gaza protest at US Senate hearing