Gemini 1.5 Pro can be used for a variety of tasks, such as analyzing code libraries, reasoning across lengthy documents, and holding long conversations with a chatbot. It is also multilingual and multimodal, meaning it can understand images, videos, and audio streams in addition to text. Early users of the model, including United Wholesale Mortgage, TBS, and Replit, are using it for tasks such as mortgage underwriting, automating metadata tagging on media archives, and generating, explaining, and transforming code. Google is also working to optimize the model's latency and plans to integrate it into other parts of its corporate product ecosystem.
Key takeaways:
- Gemini 1.5 Pro, Google's most advanced generative AI model, is now available in public preview on Vertex AI, Google's enterprise-focused AI development platform.
- The model's key feature is its context window, which ranges from 128,000 tokens up to 1 million tokens, significantly larger than those of other models like Anthropic’s Claude 3 and OpenAI’s GPT-4 Turbo.
- Gemini 1.5 Pro is multilingual and multimodal, meaning it can understand images, videos, and audio streams in addition to text.
- Early users of Gemini 1.5 Pro are using the large context window for tasks like mortgage underwriting, automating metadata tagging on media archives, and generating, explaining, and transforming code.