1
Feature Story
Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory
Nov 14, 2024 · dwarkeshpatel.com
Gwern's insights challenge the conventional belief that algorithms are more important than compute in AI development. He argues that the increasing capabilities of AI models like GPT-3 are a result of scaling laws and the application of more compute power. He also criticizes the research literature for not accurately representing the origins of ideas, suggesting that trial and error, serendipity, and compute power play a much larger role in discovery than is often acknowledged.
Key takeaways
- Gwern is a pseudonymous researcher and writer who was one of the first people to predict the scaling of large language models (LLMs).
- He believes that all intelligence is a search over Turing machines, and that the success of scaling points to this theory.
- He argues that the importance of algorithms has been overestimated, and that compute and data, trial and error, and serendipity play enormous roles in AI development.
- He also discusses the potential future of AI, including the possibility of AI firms and the evolution of AI intelligence.