Sign up to save tools and stay up to date with the latest in AI
bg
bg
1

Kneron advances edge AI with neural processing unit and Edge GPT server updates

Jun 05, 2024 - venturebeat.com
Silicon vendor Kneron has unveiled its next generation of silicon and server technology at the Computex conference in Taiwan. The company, which was founded in 2015 and counts Qualcomm and Sequoia Capital among its investors, is expanding its AI server portfolio with the KNEO 330 Edge GPT server that enables offline inference capabilities. The server integrates multiple KL830 edge AI chips and allows for affordable on-premises GPT deployments for enterprises. Kneron's technology is part of a growing trend of vendors looking to use technology other than a GPU to improve the power and efficiency of AI workloads.

Kneron's new hardware can also support RAG retrieval-augmented generation workflows and the company now has multiple capabilities for training and fine-tuning models that run on its hardware. The company's CEO, Albert Liu, highlighted that one of the key differentiators for Kneron's technology is its low power consumption. The new KL830 has a peak power consumption of only 2 watts, allowing Kneron's chips to be integrated into various devices, including PCs, without the need for additional cooling solutions.

Key takeaways:

  • Kneron, a silicon vendor, is rolling out its next generation KL830 neural processing unit (NPU) and providing a glimpse into the future KL 1140 which is set to debut in 2025.
  • The company is also expanding its AI server portfolio with the KNEO 330 Edge GPT server that enables offline inference capabilities.
  • Kneron's technology aims to improve power and efficiency of AI workloads, offering an alternative to GPU technology.
  • The company's new KL830 chip has a peak power consumption of only 2 watts, allowing it to be integrated into various devices without the need for additional cooling solutions.
View Full Article

Comments (0)

Be the first to comment!