Inceptron compiler, now open for early access. Auto-compile models for maximum efficiency. Join early access →

Jun 22, 2026

Inceptron Partners with Kilo to Bring EU-Hosted Inference to Engineering Teams

We’re excited to share that Inceptron has partnered with Kilo to bring high-performance, EU-hosted inference to teams practicing agentic engineering.

Kilo users can now access Inceptron-hosted open-weight models directly through Kilo. This gives teams a way to use strong models for coding and agent workflows while keeping inference workloads on European infrastructure.

For many companies, AI adoption is no longer blocked by model quality. It is blocked by where data goes, who processes it, and whether the setup can pass internal security review.

That is the problem this partnership is built around.

EU-hosted models inside Kilo

Through the partnership, Kilo users can access Inceptron-hosted models such as:

  • Kimi K2.7 from MoonshotAI

  • GLM 5.2 from Z.ai

  • MiniMax M2.5 from MiniMax

These models are used for coding, agent workflows, and production inference where teams need performance, cost control, and data residency.

Inceptron hosts the models on infrastructure built for AI workloads in the EU. Kilo makes them available where developers already work.

Built for teams that need control

Many engineering teams want to move faster with AI but cannot send source code, prompts, or customer data through infrastructure they cannot govern.

That matters most for companies with EU data residency requirements, GDPR obligations, or strict internal security processes.

With Inceptron and Kilo, teams can route inference through European infrastructure and use open-weight models without adding unnecessary data exposure.

Two ways to use Inceptron in Kilo

Teams can use Inceptron through the Kilo Gateway or through BYOK.

With Kilo Gateway, teams can access Inceptron-hosted models directly from Kilo. The gateway handles routing and makes it easier to switch between models.

With BYOK, teams that already work with Inceptron can add their Inceptron API key in Kilo and route requests through their existing setup.

Both options are designed to make model access simple without changing how engineers work.
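
To make the difference between the two options concrete, here is a minimal sketch of the decision as a small Python function. It is an illustration only: `Route`, `choose_route`, the credential labels, and the placeholder model name are hypothetical and do not correspond to any actual Kilo or Inceptron API or configuration format. The only assumption taken from the text above is that BYOK applies when a team supplies its own Inceptron API key, and the Kilo Gateway applies otherwise.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Route:
    """Hypothetical description of how a request reaches an Inceptron-hosted model."""
    mode: str        # "gateway" or "byok"
    model: str       # placeholder model identifier, not a real model ID
    credential: str  # human-readable label for the credential that authorises the request


def choose_route(model: str, inceptron_api_key: Optional[str] = None) -> Route:
    """Pick BYOK when the team supplies its own Inceptron key, otherwise the Kilo Gateway.

    The key itself is deliberately not stored on the returned Route.
    """
    if inceptron_api_key:
        return Route(mode="byok", model=model, credential="team Inceptron API key")
    return Route(mode="gateway", model=model, credential="Kilo account")


# A team without its own key is routed through the gateway.
gateway_route = choose_route("example-model")

# A team that already works with Inceptron supplies its key and uses BYOK.
byok_route = choose_route("example-model", inceptron_api_key="placeholder-key")
```

In both cases the model requested is the same; only the credential and the routing path differ, which is why switching between the two does not change how engineers work.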

Available across the Kilo workflow

Once enabled in Kilo, Inceptron-hosted models can be used across the Kilo ecosystem, including:

  • Kilo CLI

  • Cloud agents

  • VS Code and JetBrains extensions

That means teams can use EU-hosted inference from the terminal, inside their IDE, or as part of automated agent workflows.

Why we partnered with Kilo

We use Kilo ourselves for agentic engineering.

That matters to us. We do not want to offer infrastructure for tools we would not use in our own workflows.

Kilo gives developers a strong interface for agentic engineering. Inceptron gives teams the infrastructure layer for fast, compliant inference in Europe.

Together, we can support teams that want to ship with AI while keeping control of their data and deployment setup.

If your team wants to run open-weight models through EU-hosted infrastructure, you can now access Inceptron directly through Kilo.