Home ยป Google Unveils Gemini 1.5 Flash-8B, the Ultra-Compact Budget-Friendly Model, at 50% Discount from the Standard Flash Model.

Google Unveils Gemini 1.5 Flash-8B, the Ultra-Compact Budget-Friendly Model, at 50% Discount from the Standard Flash Model.

Google has rolled out the Gemini 1.5 Flash-8B, the smallest model in the Gemini Flash lineup, for free trial. This model has been downsized to 8 million parameters, which may slightly reduce its intelligence compared to the standard Gemini 1.5 Flash. However, it offers a 50% lower price, faster response times, and a doubled rate limit (up to 4,000 requests per minute from the original 2,000).

The price for the Gemini 1.5 Flash-8B is considered the most affordable among all Gemini models:

– $0.0375 per 1 million input tokens for prompts smaller than 128K (previously $0.075)
– $0.15 per 1 million output tokens (previously $0.30)
– $0.01 per 1 million tokens on cached prompts

Source: Google for Developers

TLDR: Google introduces the Gemini 1.5 Flash-8B model, offering a smaller size, lower price, faster response times, and increased rate limits compared to its predecessors.

More Reading

Post navigation

Leave a Comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Enhancing User Security: Google Introduces Passkey Recommendations to Elevate Login Experience

Transforming CUDA Code to Run on AMD GPU with SCALE Compiler Project

French Regulator Fines Google 9.8 Billion Baht for Training AI with News Content without Prior Notification