Epoch AI acknowledges the mistake in communication and commits to improving transparency in future collaborations. The article also raises questions about the integrity of the agreement between Epoch AI and OpenAI, suggesting that a written agreement would be more reassuring. There is skepticism about OpenAI's use of the data, with concerns that it could indirectly aid in capabilities advancement. The discussion emphasizes the importance of maintaining private and uncontaminated test sets to ensure accurate progress measurement and avoid conflicts of interest.
Key takeaways:
- FrontierMath was funded by OpenAI, but this information was not transparently communicated to contributors and contractors until after the o3 announcement.
- OpenAI has access to a large fraction of the FrontierMath dataset, though there is a holdout set for independent verification of model capabilities.
- There is a verbal agreement that OpenAI will not use the dataset for training, but concerns remain about the potential use of the data for capabilities advancement.
- Epoch AI acknowledges the lack of transparency as a mistake and commits to improving transparency in future collaborations.