In addition, OpenAI has released the next version of its open-source automatic speech recognition model, Whisper large-v3. This updated model reportedly offers enhanced performance across multiple languages. These developments were announced during OpenAI's inaugural developer day.
Key takeaways:
- OpenAI has launched a new API for its text-to-image model, DALL-E 3, which includes built-in moderation to prevent misuse.
- The DALL-E 3 API offers various format and quality options, with prices starting at $0.04 per image generated.
- OpenAI is also offering a text-to-speech API with six preset voices and two generative AI model variants, with pricing starting at $0.015 per 1,000 input characters.
- OpenAI has released the next version of its open-source automatic speech recognition model, Whisper large-v3, which reportedly performs better across multiple languages.
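
To put the starting prices above in perspective, here is a minimal Python sketch of a lower-bound cost estimate. It uses only the two figures quoted in the takeaways ($0.04 per image and $0.015 per 1,000 input characters). The function name `estimate_min_cost` and the assumption that text-to-speech usage is billed proportionally per character are illustrative and do not come from OpenAI. Higher quality or format options would cost more than this floor.

```python
# Starting prices quoted in the article (USD); other tiers may cost more.
DALLE3_MIN_PRICE_PER_IMAGE = 0.04
TTS_MIN_PRICE_PER_1K_CHARS = 0.015


def estimate_min_cost(num_images: int, tts_characters: int) -> float:
    """Return a lower-bound cost in USD for a batch of images and speech input.

    Assumes (hypothetically) that text-to-speech is billed proportionally
    to the number of input characters, with no rounding per request.
    """
    if num_images < 0 or tts_characters < 0:
        raise ValueError("counts must be non-negative")
    image_cost = num_images * DALLE3_MIN_PRICE_PER_IMAGE
    tts_cost = (tts_characters / 1000) * TTS_MIN_PRICE_PER_1K_CHARS
    return round(image_cost + tts_cost, 4)


if __name__ == "__main__":
    # 100 images plus 50,000 characters of narration.
    print(f"${estimate_min_cost(100, 50_000):.2f}")
```

Under these assumptions, 100 images and 50,000 characters of narration would cost at least $4.00 plus $0.75, or $4.75 in total.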