Google Unveils Gemini 1.5 Flash-8B, the Ultra-Compact Budget-Friendly Model, at 50% Discount from the Standard Flash Model.

Google has rolled out Gemini 1.5 Flash-8B, the smallest model in the Gemini Flash lineup, and it is available to try for free. The model has been scaled down to 8 billion parameters, so it may be slightly less capable than the standard Gemini 1.5 Flash. In exchange, it offers a 50% lower price, faster response times, and a doubled rate limit (up to 4,000 requests per minute, up from 2,000).

Gemini 1.5 Flash-8B is priced as the most affordable of all Gemini models:

– $0.0375 per 1 million input tokens for prompts smaller than 128K (versus $0.075 for the standard Flash model)
– $0.15 per 1 million output tokens (versus $0.30 for the standard Flash model)
– $0.01 per 1 million tokens on cached prompts
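
To make the per-token prices above concrete, here is a minimal Python sketch that estimates a bill from token counts. The function name `estimate_cost` and the example workload are hypothetical, and the sketch assumes that cached tokens are billed at the cached rate instead of the regular input rate and that every prompt stays under 128K tokens; actual billing may differ.

```python
# Prices in USD per 1 million tokens, taken from the list above.
# They apply only to prompts smaller than 128K tokens.
INPUT_PRICE = 0.0375
OUTPUT_PRICE = 0.15
CACHED_PRICE = 0.01


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Return an estimated cost in USD for the given token counts.

    Assumption: cached tokens are charged at CACHED_PRICE and are not
    also counted in input_tokens.
    """
    total = (
        input_tokens * INPUT_PRICE
        + output_tokens * OUTPUT_PRICE
        + cached_tokens * CACHED_PRICE
    )
    return total / 1_000_000


# Hypothetical workload: 10,000 requests, each with 2,000 input tokens
# and 500 output tokens.
requests = 10_000
cost = estimate_cost(requests * 2_000, requests * 500)
print(f"${cost:.2f}")  # prints $1.50
```

At the standard Flash prices quoted in the list ($0.075 and $0.30), the same workload would come to about twice that amount, which is where the 50% figure comes from.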

Source: Google for Developers

TLDR: Google introduces the Gemini 1.5 Flash-8B model, offering a smaller size, lower price, faster response times, and increased rate limits compared to its predecessors.
