Deep Learning

Our most affordable AI model yet

Today, we're introducing the Gemini 3.1 Flash-Lite, our fastest and most economical Gemini 3 series model. Designed for high-volume developer workloads at scale, the Flash-Lite 3.1 delivers high quality for its price and model class.

Starting today, Flash-Lite 3.1 is being released in developer preview with the Gemini API in Google AI Studio and for businesses with Vertex AI.

Cost efficiency without compromise

Priced at just $0.25/1M input tokens and $1.50/1M output tokens, Flash-Lite 3.1 delivers improved performance at a fraction of the cost of larger models. It outperforms Flash 2.5 with a token 2.5X faster Time to First Answer and a 45% increase in output speed, according to the Performance Analysis benchmark while maintaining the same or better quality. This low latency is necessary for high-frequency performance, making it an ideal model for developers to create responsive, real-time experiences.

Source link

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button