Google Unveils Gemini 1.5 Flash-8B, an Ultra-Compact, Budget-Friendly Model at Half the Price of the Standard Flash Model

Google has rolled out Gemini 1.5 Flash-8B, the smallest model in the Gemini Flash lineup, and it is available to try for free. The model has been scaled down to 8 billion parameters, which may make it slightly less capable than the standard Gemini 1.5 Flash. In exchange, it offers a 50% lower price, faster response times, and a doubled rate limit (up to 4,000 requests per minute, from the original 2,000).

Gemini 1.5 Flash-8B is priced as the most affordable of all Gemini models:

– $0.0375 per 1 million input tokens for prompts smaller than 128K (versus $0.075 for the standard Flash model)
– $0.15 per 1 million output tokens (versus $0.30 for the standard Flash model)
– $0.01 per 1 million tokens on cached prompts
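
To make the per-million-token prices concrete, here is a minimal sketch of a cost estimate for a single request. The function name `estimate_cost` and the assumption that cached tokens are counted separately from regular input tokens are illustrative only and not taken from Google's billing documentation; the rates are the ones listed above for prompts smaller than 128K.

```python
# Prices in USD per 1 million tokens, as listed above (prompts under 128K).
INPUT_PRICE_PER_M = 0.0375
OUTPUT_PRICE_PER_M = 0.15
CACHED_PRICE_PER_M = 0.01


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (hypothetical, for illustration): cached tokens are billed at
    the cached rate and are NOT included in input_tokens.
    """
    total = (
        input_tokens * INPUT_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
        + cached_tokens * CACHED_PRICE_PER_M
    )
    return total / 1_000_000


if __name__ == "__main__":
    # A request with 10,000 input tokens and 2,000 output tokens.
    print(f"${estimate_cost(10_000, 2_000):.6f}")  # prints $0.000675
```

At these rates, one million input tokens plus one million output tokens would come to about $0.19, half of what the same traffic would cost at the standard Flash prices quoted above.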

Source: Google for Developers

TLDR: Google introduces Gemini 1.5 Flash-8B, a smaller model with a lower price, faster response times, and higher rate limits than the standard Gemini 1.5 Flash.
