Sign up to save tools and stay up to date with the latest in AI
bg
bg
1

Leveling up Workers AI: General Availability and more new capabilities

Apr 02, 2024 - blog.cloudflare.com
Cloudflare has announced the general availability of its Workers AI inference platform, offering improved performance, reliability, and lower costs on popular models. The platform, which has been in open beta for several months, now includes a new dashboard and AI playground, and supports fine-tuned inference with Bring Your Own (BYO) LoRAs. The company also announced the expansion of its partnership with Hugging Face, the addition of Python support in Workers, and updates to its AI Gateway and Vectorize products.

The company plans to deploy GPUs to its data centers worldwide by the end of 2024, making it the most widely distributed cloud-AI inference platform. It also plans to launch its next generation of compute servers with GPUs in Q2 2024. The AI Gateway now supports Anthropic, Azure, AWS Bedrock, Google Vertex, and Perplexity, and will add persistent logs, custom metadata, and secrets management in Q2 2024. Vectorize, which allows developers to persist embeddings and query for the closest match, is set for general availability in June 2024.

Key takeaways:

  • Cloudflare's Workers AI inference platform is now Generally Available, with improved performance, reliability, and lower costs on popular models.
  • Cloudflare continues its partnership with Hugging Face, adding 4 more models to their platform and making it easier to run these models on Workers AI.
  • Cloudflare now supports Python in Workers, allowing developers to write Cloudflare Workers in the second most popular programming language in the world.
  • Cloudflare's AI Gateway now supports more providers including Anthropic, Google Vertex, and Perplexity, and plans to add persistent logs, custom metadata, and secrets management in Q2 of 2024.
View Full Article

Comments (0)

Be the first to comment!