Sign up to save tools and stay up to date with the latest in AI
bg
bg
1

Explaining DeepSeek: The Chinese model's efficiency is scaring markets

Jan 27, 2025 - businessinsider.com
China's DeepSeek model is challenging US AI firms by offering a cost-effective and efficient alternative to models like OpenAI's. DeepSeek's model is 20-40 times cheaper to run and uses modest hardware, raising questions about US investments in AI infrastructure. The DeepSeek-V3 model, comparable to OpenAI's ChatGPT, was trained on a cluster of 2,048 Nvidia H800 GPUs, demonstrating innovation under constraints. Despite its smaller size of 671 billion parameters compared to ChatGPT-4's 1.76 trillion, DeepSeek-V3 achieves impressive benchmarks due to its "mixture of experts" architecture. This efficient model is virtually being given away for free, posing a competitive threat to US companies.

The DeepSeek-R1 model, a reasoning model, further intensifies the competition by using advanced techniques like generating its own training data. Despite the shock from DeepSeek's advancements, investments in AI infrastructure continue, with projects like Stargate in Texas. Bernstein analysts caution against overreacting, noting that DeepSeek's $5 million cost figure excludes prior research expenses. The competition for AI model supremacy remains fierce, but demand for computing power is expected to rise, driven by Jevon's paradox, where increased efficiency leads to higher demand.

Key takeaways:

  • China's DeepSeek model is significantly cheaper and more efficient than US AI models, raising questions about US investments in AI infrastructure.
  • DeepSeek-V3 uses a "mixture of experts" architecture, making it smaller and easier to run while maintaining impressive performance.
  • DeepSeek-R1, a reasoning model, competes with OpenAI's latest models and uses innovative techniques like generating its own training data.
  • Despite DeepSeek's achievements, large-scale investments in data centers continue, and demand for computing power is expected to rise due to Jevon's paradox.
View Full Article

Comments (0)

Be the first to comment!