GEMINI LABJP
MODEL — Gemini 3.5 Flash is generally available, beating 3.1 Pro on nearly all benchmarks while running fasterAPI — The Interactions API reaches GA as the primary way to work with Gemini models and agentsAGENTS — Managed Agents enter public preview, running autonomous agents in Google-hosted isolated Linux sandboxesCOST — Project Spend Caps let you set a monthly dollar limit on Gemini API usage per projectSHEETS — Gemini in Sheets diagnoses and fixes formula errors in one click by analyzing surrounding dataSTUDIO — Google AI Studio gets a developer-first refresh with an expanded gallery of starter appsMODEL — Gemini 3.5 Flash is generally available, beating 3.1 Pro on nearly all benchmarks while running fasterAPI — The Interactions API reaches GA as the primary way to work with Gemini models and agentsAGENTS — Managed Agents enter public preview, running autonomous agents in Google-hosted isolated Linux sandboxesCOST — Project Spend Caps let you set a monthly dollar limit on Gemini API usage per projectSHEETS — Gemini in Sheets diagnoses and fixes formula errors in one click by analyzing surrounding dataSTUDIO — Google AI Studio gets a developer-first refresh with an expanded gallery of starter apps
TAG

Model Routing

2 articles
Back to all tags
Related:
Gemini 3.5 Flash2Cost Optimization1Retry Design1Indie Development1model selection1cost optimization1automation1indie developer1Gemini API1
Gemini API/2026-06-26Advanced

Is Gemini 3.5 Flash Actually Cheaper? Measuring Retry Amplification to Find the Flash vs Pro Break-Even

Now that 3.5 Flash is generally available, it is tempting to route everything to it. But once you measure effective cost per success instead of per-call price, the decision changes. Here is a small harness to measure retry amplification and find the break-even.

Gemini Dev/2026-06-20Intermediate

Routing Gemini by Pipeline Stage: Draft on Flash, Finish on the Top Tier

A record of reworking which Gemini model handles which stage of an automation pipeline, prompted by the general availability of Gemini 3.5 Flash and the rollout of 3.1 Flash-Lite. Includes a small router that splits work into draft, classify, and finalize stages, how the cost picture changes, and the guardrails I settled on.