Using Gemini API with Spring Boot: A Java Developer's Guide to Building AI Chat Applications

Why Java Developers Should Use the Gemini API

When it comes to building AI-powered applications, Python tends to dominate the conversation. But the reality is that a huge portion of enterprise backends — banking systems, healthcare platforms, logistics services — are built with Java and Spring Boot. If you can integrate the Gemini API directly into your existing Java stack, you get powerful AI capabilities without switching languages or fragmenting your infrastructure.

We'll build a simple REST endpoint with Spring Boot that takes a message and returns a Gemini AI response. From that foundation grow customer support bots, internal document Q&A systems, code review assistants, and a good deal more.

If you're brand new to the Gemini API, start with the Gemini API Quickstart guide to get your API key set up first.

Prerequisites

Before diving in, make sure you have the following ready:

Java 17 or later (LTS recommended)
Maven 3.8+ or Gradle 8+
A Google AI Studio account with an API key
Spring Boot 3.2+ (the version used in this guide)

Getting Your Gemini API Key

Head to Google AI Studio and click "Get API Key" to generate a new key. You'll add it to your application.properties file shortly.

Setting Up the Spring Boot Project

Configuring pom.xml

Generate a project using Spring Initializr and add the following dependencies. We'll use WebClient from Spring WebFlux to communicate with the Gemini API.

<!-- pom.xml (excerpt) -->
<dependencies>
  <!-- Spring Web for REST API exposure -->
  <dependency>
    <groupId>org.springframework.boot</groupId>
    <artifactId>spring-boot-starter-web</artifactId>
  </dependency>
 
  <!-- WebFlux for WebClient (non-blocking HTTP) -->
  <dependency>
    <groupId>org.springframework.boot</groupId>
    <artifactId>spring-boot-starter-webflux</artifactId>
  </dependency>
 
  <!-- JSON processing -->
  <dependency>
    <groupId>com.fasterxml.jackson.core</groupId>
    <artifactId>jackson-databind</artifactId>
  </dependency>
</dependencies>

Configuring application.properties

Store your API key in the configuration file rather than hardcoding it.

# src/main/resources/application.properties
 
# Gemini API settings
gemini.api.key=YOUR_GEMINI_API_KEY
gemini.api.url=https://generativelanguage.googleapis.com/v1beta
gemini.model=gemini-2.5-flash
 
# Application settings
spring.application.name=gemini-chat-api
server.port=8080

Important: Replace YOUR_GEMINI_API_KEY with your actual key. In production, use environment variables or a secrets manager — never commit API keys to version control.

Implementing the Gemini API Client

Configuration Class (GeminiConfig.java)

package com.example.geminichat.config;
 
import org.springframework.beans.factory.annotation.Value;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;
import org.springframework.web.reactive.function.client.WebClient;
 
@Configuration
public class GeminiConfig {
 
    @Value("${gemini.api.url}")
    private String apiUrl;
 
    // Register WebClient as a Spring Bean
    @Bean
    public WebClient geminiWebClient() {
        return WebClient.builder()
                .baseUrl(apiUrl)
                .defaultHeader("Content-Type", "application/json")
                .build();
    }
}

Data Models

Define Java Records that match the Gemini API's JSON request/response format.

package com.example.geminichat.model;
 
import java.util.List;
 
// Request body from the client
public record ChatRequest(String message) {}
 
// Request body sent to Gemini API
public record GeminiRequest(List<Content> contents) {
    public record Content(List<Part> parts) {}
    public record Part(String text) {}
}
 
// Gemini API response (simplified)
public record GeminiResponse(List<Candidate> candidates) {
    public record Candidate(Content content) {}
    public record Content(List<Part> parts) {}
    public record Part(String text) {}
}
 
// Response returned to the client
public record ChatResponse(String reply, boolean success) {}

Service Layer (GeminiService.java)

Encapsulate the Gemini API communication logic in a dedicated service class.

package com.example.geminichat.service;
 
import com.example.geminichat.model.ChatResponse;
import com.example.geminichat.model.GeminiRequest;
import com.example.geminichat.model.GeminiResponse;
import org.springframework.beans.factory.annotation.Value;
import org.springframework.stereotype.Service;
import org.springframework.web.reactive.function.client.WebClient;
 
import java.util.List;
 
@Service
public class GeminiService {
 
    private final WebClient webClient;
 
    @Value("${gemini.api.key}")
    private String apiKey;
 
    @Value("${gemini.model}")
    private String model;
 
    public GeminiService(WebClient geminiWebClient) {
        this.webClient = geminiWebClient;
    }
 
    public ChatResponse chat(String userMessage) {
        // Build the request body
        var request = new GeminiRequest(
            List.of(new GeminiRequest.Content(
                List.of(new GeminiRequest.Part(userMessage))
            ))
        );
 
        try {
            // POST to Gemini API
            GeminiResponse response = webClient.post()
                    .uri("/models/{model}:generateContent?key={key}", model, apiKey)
                    .bodyValue(request)
                    .retrieve()
                    .bodyToMono(GeminiResponse.class)
                    .block(); // Synchronous for simplicity — go reactive in production
 
            // Extract text from response
            if (response != null && !response.candidates().isEmpty()) {
                String text = response.candidates().get(0)
                        .content().parts().get(0).text();
                return new ChatResponse(text, true);
            }
            return new ChatResponse("No response received from Gemini", false);
 
        } catch (Exception e) {
            return new ChatResponse("Error: " + e.getMessage(), false);
        }
    }
}

REST Controller

package com.example.geminichat.controller;
 
import com.example.geminichat.model.ChatRequest;
import com.example.geminichat.model.ChatResponse;
import com.example.geminichat.service.GeminiService;
import org.springframework.http.ResponseEntity;
import org.springframework.web.bind.annotation.*;
 
@RestController
@RequestMapping("/api/chat")
public class ChatController {
 
    private final GeminiService geminiService;
 
    public ChatController(GeminiService geminiService) {
        this.geminiService = geminiService;
    }
 
    // POST /api/chat
    @PostMapping
    public ResponseEntity<ChatResponse> chat(@RequestBody ChatRequest request) {
        if (request.message() == null || request.message().isBlank()) {
            return ResponseEntity.badRequest()
                    .body(new ChatResponse("Message cannot be empty", false));
        }
 
        ChatResponse response = geminiService.chat(request.message());
        return ResponseEntity.ok(response);
    }
 
    // Health check
    @GetMapping("/health")
    public ResponseEntity<String> health() {
        return ResponseEntity.ok("Gemini Chat API is running!");
    }
}

Testing the API

Start the application and test it with curl.

# Start the application
./mvnw spring-boot:run
 
# Send a chat request
curl -X POST http://localhost:8080/api/chat \
  -H "Content-Type: application/json" \
  -d '{"message": "What are the top 3 features of Java?"}'

Expected response:

{
  "reply": "Here are the top 3 features of Java:\n1. **Platform Independence**: Java's 'Write Once, Run Anywhere' philosophy means your compiled bytecode runs on any platform with a JVM.\n2. **Object-Oriented Design**: Classes, inheritance, and encapsulation make it well-suited for building maintainable large-scale applications.\n3. **Rich Ecosystem**: Frameworks like Spring Boot, Hibernate, and Maven give you a mature, battle-tested toolchain.",
  "success": true
}

For handling errors gracefully in production, check out the Gemini API Error Handling Complete Guide.

Going Further: Production-Ready Enhancements

Add System Instructions

You can steer the AI's behavior by adding a systemInstruction field to your request body — useful for constraining the assistant to a specific domain or tone.

// Extend GeminiRequest to include systemInstruction
public record GeminiRequest(
    List<Content> contents,
    SystemInstruction systemInstruction
) {
    public record SystemInstruction(List<Part> parts) {}
    // ... (same structure as above)
}
 
// Usage in service
var systemInstruction = new GeminiRequest.SystemInstruction(
    List.of(new GeminiRequest.Part(
        "You are a Java expert. Answer questions concisely and accurately."
    ))
);

To master system instruction design for production applications, the Gemini 2.5 Pro System Instructions Mastery Guide covers advanced patterns in depth.

Token and Cost Control

Add generationConfig to your request to limit output tokens and manage costs. See the Gemini API Cost Optimization Guide for a full breakdown of cost-saving strategies.

Switching to Reactive

The block() call in the service makes this synchronous, which is fine for low-traffic scenarios. For high-throughput production services, return Mono<ChatResponse> from your service and Mono<ResponseEntity<ChatResponse>> from your controller. Spring MVC fully supports reactive return types starting from Spring Boot 3.x.

Looking back

You now have a working Gemini AI chat REST API on Spring Boot, without a single line of Python. What carried the weight:

Use spring-boot-starter-webflux's WebClient for clean, non-blocking HTTP to the Gemini API
Externalize API keys and model names via application.properties for easy swapping
Java Records make for concise, immutable data models aligned with the API's JSON structure
Keeping Gemini logic in a dedicated service class makes future upgrades straightforward

From here, you can extend this into multi-turn conversations, integrate RAG (retrieval-augmented generation) for document Q&A, or connect Function Calling to external databases. The Spring Boot + Gemini combination is a practical choice for enterprise teams who want AI without leaving the Java ecosystem.