Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory
Nov 14, 2024 - dwarkeshpatel.com
The article features an interview with Gwern, a pseudonymous researcher and writer known for his polymathic thinking and one of the first to predict the scaling of large language models (LLMs). The conversation covers a range of topics, including the benefits of anonymity, the evolution of artificial intelligence (AI), and the role of compute in AI development. Gwern shares his belief that intelligence is essentially a search over Turing machines and that the increasing power of AI comes from applying more compute to more data and parameters. He also discusses the importance of trial and error, serendipity, and large-scale compute in the discovery of new algorithms.
Gwern's insights challenge the conventional belief that algorithms are more important than compute in AI development. He argues that the increasing capabilities of AI models like GPT-3 are a result of scaling, that is, of applying more compute to more data and parameters. He also criticizes the research literature for not accurately representing the origins of ideas, suggesting that trial and error, serendipity, and compute play a much larger role in discovery than is often acknowledged.
Key takeaways:
Gwern is a pseudonymous researcher and writer who was one of the first people to predict the scaling of large language models (LLMs).
He believes that all intelligence is a search over Turing machines, and that the success of scaling supports this theory.
He argues that the importance of algorithms has been overestimated, and that compute and data, trial and error, and serendipity play enormous roles in AI development.
He also discusses the potential future of AI, including the possibility of AI firms and how AI may evolve.