Blog Zenika


⚗️ A local Agent with ADK and Docker Model Runner

A French version is available on Jean-Phi’s website.

⛅ Using AI in the Cloud

I’m the first to admit it: I use AI models provided in the Cloud. As a user, it suits me perfectly. The models are powerful and give me good answers.

But is the scope of these models really adapted to my needs? They do the job, that’s a fact, but aren’t they a bit too general-purpose for my specific use cases?

Running models locally is not new. It was actually one of the very first needs to emerge: being able to deploy a model on one's own machine.

The emergence of Small Language Models (SLMs) has offered a concrete answer to this problem, making it possible to adapt models to specific use cases and to simplify their deployment.

There are now various alternatives for deploying these models. I first heard about Ollama, which was, if I’m not mistaken, one of the first tools to simplify this process.

If you’re interested in Small Language Models, I highly recommend Philippe Charrière’s articles available on his blog.

I took some time to run my own experiment, and that’s what I’m sharing with you in this article.

🧩 My Context

For this experiment, I wanted to use my usual agent framework for its simplicity: Google’s Agent Development Kit (ADK): https://google.github.io/adk-docs/.

A Java agent can be quickly created with ADK as follows:

public static BaseAgent initAgent() {
    return LlmAgent.builder()
            .name("trip-planner-agent")
            .description("A simple trip helper with ADK")
            .model(System.getProperty("gemini.model"))
            .instruction("""
                    You are a simple Trip Planner Agent. Your goal is to give me information about one destination.
                    """)
            .build();
}

🐳 Docker Model Runner

I had already tested Ollama to run models on my machine. But for this experiment, I wanted to try Docker’s solution: Docker Model Runner (DMR). This project provides new Docker commands to pull, run, and list models available on your machine.

docker model pull
docker model list
docker model run

A quick configuration is required in the Docker Desktop settings to enable this service:

[Screenshot: Docker Desktop settings page showing the AI feature options, including the Docker Model Runner configuration to enable TCP support and specify a port.]

After that, DMR allows you to easily fetch a model and run it on your machine. For example, to use the “gemma3” model, simply run the following command. The model will be downloaded if you don’t already have it:

docker model run gemma3

Once started, the model responds to your prompts, as shown in this example:

[Screenshot: terminal session discussing the differences between Quarkus and Spring Boot, highlighting Quarkus features such as its cloud-native focus, native image compilation, low memory footprint, improved security, and smaller dependencies.]
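Beyond the CLI, DMR also exposes an OpenAI-compatible HTTP API on the TCP port configured in the Docker Desktop settings. As a sketch of what a raw call looks like (the port 12434 and the request shape are assumptions based on DMR's OpenAI compatibility, not taken from the original post):

```java
import java.net.URI;
import java.net.http.HttpRequest;

class DmrRequestSketch {

    // Build a minimal chat-completions body for the OpenAI-compatible endpoint.
    static String buildBody(String model, String prompt) {
        return """
                {"model": "%s", "messages": [{"role": "user", "content": "%s"}]}
                """.formatted(model, prompt).strip();
    }

    public static void main(String[] args) {
        String body = buildBody("gemma3", "What is Quarkus?");
        // The base URL assumes TCP support is enabled in Docker Desktop;
        // adjust the port to match your own settings.
        HttpRequest request = HttpRequest.newBuilder()
                .uri(URI.create("http://localhost:12434/engines/v1/chat/completions"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
        System.out.println(request.uri());
    }
}
```

This is exactly the kind of endpoint that langchain4j will target for us in the next section, so we never have to build these requests by hand.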

🧩 The ADK / DMR Connection

The langchain4j library links the agent defined in ADK to the model served by DMR. The following dependency must be added:

<dependency>
    <groupId>dev.langchain4j</groupId>
    <artifactId>langchain4j-open-ai</artifactId>
    <version>1.1.0</version>
</dependency>

The code needs a small change. Instead of passing the model name as a string, we pass an instance of the LangChain4j class, built from an OpenAiChatModel instance.

// Chat model pointing at the local DMR endpoint (OpenAI-compatible API)
OpenAiChatModel chatModel = OpenAiChatModel.builder()
        .baseUrl(modelUrl)              // the DMR endpoint exposed on your machine
        .apiKey("not-needed-for-local") // DMR does not check the API key
        .modelName(modelName)
        .maxRetries(1)
        .timeout(Duration.ofMinutes(2))
        .build();
// Wrap the LangChain4j model so ADK can use it
var adkModel = new LangChain4j(chatModel);
return LlmAgent.builder()
        .name("Travel Agent")
        .description("Travel expert using a local model")
        .model(adkModel)
        .build();

🙌 A Locally Available Agent

The agent is now plugged into a local model. The application can be started with the Maven command `mvn compile exec:java -Dexec.mainClass=adk.agent.TravelAgent` after starting your model with Docker Model Runner. For development, this is perfect.

Once the application was functional, I decided to set up a docker-compose.yml file to start the model and the application with a single command:

docker compose up -d --build && docker attach adk-app
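The compose file itself isn't shown here, but a minimal sketch could look like the following (service and variable names are illustrative, and the `model-runner.docker.internal` hostname is an assumption based on how DMR is reachable from inside containers):

```yaml
services:
  adk-app:
    build: .                 # Dockerfile that packages the ADK application
    container_name: adk-app  # matches the `docker attach adk-app` command above
    stdin_open: true         # keep STDIN open so the agent stays interactive
    tty: true
    environment:
      # From inside a container, DMR is reachable through its internal hostname
      # rather than localhost.
      MODEL_URL: http://model-runner.docker.internal/engines/v1
      MODEL_NAME: gemma3
```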

💡 Pro tip: To avoid repackaging the application every time, a cache is implemented in the Dockerfile.
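One common way to implement such a cache is a multi-stage build that resolves the Maven dependencies in their own layer, so they are re-downloaded only when the pom.xml changes. A sketch (image tags and paths are assumptions, not taken from the original project):

```dockerfile
# Build stage: resolve dependencies first so they are cached as a layer.
FROM maven:3.9-eclipse-temurin-21 AS build
WORKDIR /app
COPY pom.xml .
RUN mvn -q dependency:go-offline   # cached as long as pom.xml is unchanged
COPY src ./src
RUN mvn -q package -DskipTests

# Runtime stage: only the packaged application.
FROM eclipse-temurin:21-jre
WORKDIR /app
COPY --from=build /app/target/*.jar app.jar
ENTRYPOINT ["java", "-jar", "app.jar"]
```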

TADA! 🥳

🤔 A “Local” Future?

This project was just an experiment, but it raises further questions. My primary use of AI is for development. Wouldn’t it be better to dedicate a locally deployed agent to this activity?

When I create agents intended to be deployed in the Cloud, perhaps a local model would be more than sufficient during the development phase? Others who have tested this approach have already answered these questions, but can it be implemented (and quickly) in a corporate context?

We can, of course, go even further by building our own custom model for our specific context, but that’s a topic for another day! 😁
