Google has rolled out the Gemini 1.5 Flash-8B, the smallest model in the Gemini Flash lineup, for free trial. This model has been downsized to 8 million parameters, which may slightly reduce its intelligence compared to the standard Gemini 1.5 Flash. However, it offers a 50% lower price, faster response times, and a doubled rate limit (up to 4,000 requests per minute from the original 2,000).
The price for the Gemini 1.5 Flash-8B is considered the most affordable among all Gemini models:
– $0.0375 per 1 million input tokens for prompts smaller than 128K (previously $0.075)
– $0.15 per 1 million output tokens (previously $0.30)
– $0.01 per 1 million tokens on cached prompts
Source: Google for Developers
TLDR: Google introduces the Gemini 1.5 Flash-8B model, offering a smaller size, lower price, faster response times, and increased rate limits compared to its predecessors.
Leave a Comment