◉GEMINI LAB JP

●NANOLITE — Nano Banana 2 Lite is here: Google's fastest and most cost-efficient Gemini Image model, made for running lightweight image generation cheaply●OMNIFLASH — Gemini Omni Flash is in public preview, a natively multimodal model that lets enterprises and developers build custom, dynamic video workflows●AGENTS — Managed Agents expand with background: true for async server-side runs and polling, remote MCP server integration, and refreshing credentials across interactions●MEMORY — The Memory Bank IngestEvents API is generally available, decoupling event ingestion from memory generation so you can stream content continuously●THROUGHPUT — Provisioned Throughput now lets you submit up to seven pending orders for the same model and region●DEPRECATE — Image generation models shut down on August 17, and the Grok 4.1 family on the Gemini Enterprise Agent Platform on August 20●NANOLITE — Nano Banana 2 Lite is here: Google's fastest and most cost-efficient Gemini Image model, made for running lightweight image generation cheaply●OMNIFLASH — Gemini Omni Flash is in public preview, a natively multimodal model that lets enterprises and developers build custom, dynamic video workflows●AGENTS — Managed Agents expand with background: true for async server-side runs and polling, remote MCP server integration, and refreshing credentials across interactions●MEMORY — The Memory Bank IngestEvents API is generally available, decoupling event ingestion from memory generation so you can stream content continuously●THROUGHPUT — Provisioned Throughput now lets you submit up to seven pending orders for the same model and region●DEPRECATE — Image generation models shut down on August 17, and the Grok 4.1 family on the Gemini Enterprise Agent Platform on August 20

TAG

Image Recognition

3 articles

← Back to all tags

Related:

multimodal² Gemini API² Gemini¹ Multimodal¹ Indie Development¹ App Store¹ Review¹ Gemini Vision¹ art¹ indie developer¹ document analysis¹ PDF processing¹

⬡ Gemini Advanced/2026-06-04Intermediate

Pre-Screening Wallpaper App Submissions with Gemini Vision: A Two-Week Field Memo

Before submitting a new batch of wallpapers, I spent two weeks running Gemini's image understanding as a first-pass filter for store review risk. What it caught, what it missed, and where a human still has to decide.

⬡ Gemini Advanced/2026-05-13Intermediate

What Happens When You Show Your Own Artwork to Gemini Vision — An Honest Review from a Maker and a Developer

I fed my own art images into Gemini Vision to test what it reads and what it misses. An honest, indie-developer look at where it's genuinely useful for running a wallpaper app, and where it still falls short.

◈ Gemini API/2026-03-28Advanced

Building a Multimodal Document Analysis System with Gemini API — Processing Images, PDFs, and Videos in a Unified Architecture

Learn how to build a multimodal document analysis system using Gemini API. This guide covers file upload, structured data extraction, and batch processing pipelines for images, PDFs, and videos.