Google Unveils Gemini 1.5 Flash-8B, the Ultra-Compact Budget-Friendly Model, at 50% Discount from the Standard Flash Model.

Google has rolled out Gemini 1.5 Flash-8B, the smallest model in the Gemini Flash lineup, and made it available to try for free. The model has been scaled down to 8 billion parameters, which may make it slightly less capable than the standard Gemini 1.5 Flash. In exchange, it offers a 50% lower price, faster response times, and a doubled rate limit (4,000 requests per minute, up from 2,000).

Gemini 1.5 Flash-8B is priced as the most affordable of all Gemini models:

– $0.0375 per 1 million input tokens for prompts smaller than 128K (versus $0.075 for the standard Flash model)
– $0.15 per 1 million output tokens (versus $0.30 for the standard Flash model)
– $0.01 per 1 million tokens on cached prompts
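
To make the per-token prices listed above concrete, here is a minimal sketch of a cost estimate in Python. The helper name `estimate_cost` and the sample workload are hypothetical, and the sketch assumes cached tokens are counted separately from regular input tokens and that all prompts stay under 128K tokens. It does not account for any other charges, such as cache storage.

```python
# Prices in USD per 1 million tokens, taken from the list above
# (prompts smaller than 128K tokens).
FLASH_8B_INPUT_PER_M = 0.0375
FLASH_8B_OUTPUT_PER_M = 0.15
FLASH_8B_CACHED_PER_M = 0.01


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the cost in USD of a workload (hypothetical helper).

    Assumes cached tokens are billed separately from regular input tokens.
    """
    total_per_million = (
        input_tokens * FLASH_8B_INPUT_PER_M
        + output_tokens * FLASH_8B_OUTPUT_PER_M
        + cached_tokens * FLASH_8B_CACHED_PER_M
    )
    return total_per_million / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10 million input tokens, 2 million output tokens.
    print(f"Estimated cost: ${estimate_cost(10_000_000, 2_000_000):.4f}")
```

For this sample workload the estimate comes to roughly $0.675, which is half of what the same token counts would cost at the standard Flash prices quoted in the list.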

Source: Google for Developers

TLDR: Google introduces the Gemini 1.5 Flash-8B model, which is smaller and cheaper than the standard Gemini 1.5 Flash and offers faster response times and a higher rate limit.
