However, the use of publishers' content for AI training has met resistance: The New York Times, CNN, Reuters, and Vox Media have blocked OpenAI's GPT crawler from accessing their data. The New York Times has also sued OpenAI and Microsoft Corp., alleging that they used its copyrighted material illegally. Reddit Inc. and several authors have likewise taken action against companies using their content for AI training, suggesting that training LLMs could become increasingly costly.
Key takeaways:
- OpenAI is reportedly offering news publishers as little as $1 million to license their content for training its large language models.
- The company is currently negotiating with about a dozen media companies.
- Apple Inc. is also in the race to develop generative AI and has inked deals worth around $50 million.
- Several media companies and authors have begun blocking or suing companies such as OpenAI and Microsoft Corp. over the use of their copyrighted material to train AI models.