Sign up to save tools and stay up to date with the latest in AI
bg
bg
1

OpenAI launches DALL-E 3 API, new text-to-speech models | TechCrunch

Nov 06, 2023 - news.bensbites.co
OpenAI has introduced a range of new APIs, including DALL-E 3, a text-to-image model, and a text-to-speech API with six preset voices and two AI model variants. The DALL-E 3 API offers various format and quality options, starting at $0.04 per image, while the text-to-speech API is priced at $0.015 per 1,000 characters. Both APIs feature built-in moderation to prevent misuse.

In addition, OpenAI has released the next version of its open-source automatic speech recognition model, Whisper large-v3. This updated model reportedly offers enhanced performance across multiple languages. These developments were announced during OpenAI's inaugural developer day.

Key takeaways:

  • OpenAI has launched a new API for its text-to-image model, DALL-E 3, which includes built-in moderation to prevent misuse.
  • The DALL-E 3 API offers various format and quality options, with prices starting at $0.04 per image generated.
  • OpenAI is also offering a text-to-speech API with six preset voices and two generative AI model variants, with pricing starting at $0.015 per input 1,000 characters.
  • OpenAI has launched the next version of its open source automatic speech recognition model, Whisper large-v3, which reportedly has improved performance across languages.
View Full Article

Comments (0)

Be the first to comment!