Model Launch · January 30, 2026
Kimi K2.5 Launch

Kimi K2.5 is live on LeemerChat

LeemerChat is Ireland's first AI platform to offer Kimi K2.5, the world's most powerful open-source 1T-parameter model, now available free to every user.

Built on Kimi K2 with continued pretraining over ~15T mixed visual and text tokens, K2.5 is a native multimodal model that delivers state-of-the-art coding and vision capability plus a self-directed agent swarm paradigm.

Context: 262K tokens
Pretraining: ~15T multimodal tokens
Agent Swarm: Up to 100 sub-agents
Tool Calls: Up to 1,500

Frontier status

Why Kimi K2.5 is a frontier model

Frontier models are defined by their ability to solve real-world tasks end-to-end: multimodal understanding, tool orchestration, and long-horizon reasoning. K2.5 meets that bar with leading or near-leading scores on agent, coding, and vision benchmarks, a 262K-token context window, and autonomous agent orchestration.

Native multimodal reasoning

Kimi K2.5 fuses vision and language natively, enabling visual coding, UI parsing, and multimodal reasoning without separate adapters.

Long-horizon agentic depth

A 262K-token context window plus self-directed agent swarms let it sustain complex plans and multi-step workflows across long tool chains.

Frontier-grade coding

With 76.8 on SWE-bench Verified and 73 on SWE-bench Multilingual, K2.5 is a serious frontier contender for real software work.

Agent swarm

Self-directed agent swarms at scale

For complex tasks, Kimi K2.5 can self-direct an agent swarm with up to 100 sub-agents, executing parallel workflows across up to 1,500 tool calls. The swarm is automatically created and orchestrated by K2.5 without any predefined subagents or workflow.
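To make the fan-out pattern concrete, here is a minimal sketch of an orchestrator that runs sub-tasks in parallel under the two limits quoted above (100 concurrent sub-agents, 1,500 tool calls in total). It is an illustration only, not how Kimi K2.5 is implemented: `ToolBudget`, `call_tool`, `run_sub_agent`, and `run_swarm` are hypothetical names, and the "tool" is a stand-in that returns a string.

```python
import asyncio

MAX_SUB_AGENTS = 100    # concurrency cap quoted in the post
MAX_TOOL_CALLS = 1500   # total tool-call budget quoted in the post


class ToolBudget:
    """Shared counter so all sub-agents draw from one tool-call budget."""

    def __init__(self, limit: int) -> None:
        self.limit = limit
        self.used = 0

    def try_spend(self) -> bool:
        # No lock needed: asyncio runs one coroutine at a time and there
        # is no await between the check and the increment.
        if self.used >= self.limit:
            return False
        self.used += 1
        return True


async def call_tool(task: str, step: int) -> str:
    """Hypothetical tool call; a real one would hit a search API, a shell, etc."""
    await asyncio.sleep(0)  # yield control, standing in for I/O latency
    return f"{task}:step{step}"


async def run_sub_agent(task: str, steps: int, budget: ToolBudget) -> list[str]:
    """One sub-agent: makes up to `steps` tool calls, stopping when the budget is spent."""
    results = []
    for step in range(steps):
        if not budget.try_spend():
            break
        results.append(await call_tool(task, step))
    return results


async def run_swarm(tasks: list[str], steps_per_task: int) -> dict[str, list[str]]:
    """Fan sub-tasks out in parallel, with at most MAX_SUB_AGENTS running at once."""
    budget = ToolBudget(MAX_TOOL_CALLS)
    slots = asyncio.Semaphore(MAX_SUB_AGENTS)

    async def guarded(task: str) -> list[str]:
        async with slots:
            return await run_sub_agent(task, steps_per_task, budget)

    outputs = await asyncio.gather(*(guarded(t) for t in tasks))
    return dict(zip(tasks, outputs))


if __name__ == "__main__":
    tasks = [f"subtask-{i}" for i in range(120)]
    results = asyncio.run(run_swarm(tasks, steps_per_task=20))
    print(sum(len(r) for r in results.values()))  # never exceeds MAX_TOOL_CALLS
```

The semaphore bounds how many sub-agents run at once, while the shared budget bounds the total number of tool calls across the whole run, so the two limits are enforced independently.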

Self-directed agent swarm

Kimi K2.5 can automatically spawn and orchestrate up to 100 sub-agents for complex tasks. No predefined subagents. No manual workflow design.

Up to 1,500 tool calls across parallel workflows

The swarm can execute parallel workflows across up to 1,500 tool calls, compressing research, coding, and data synthesis into a single run.

Up to 4.5x faster execution

Compared with a single-agent setup, K2.5 runs up to 4.5x faster by coordinating parallel sub-tasks.
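As a quick worked example of what an "up to 4.5x" figure means for wall-clock time, the sketch below divides a single-agent runtime by the speedup factor. The 45-minute baseline is a hypothetical number chosen for illustration, and `parallel_runtime` is an assumed helper name, not part of any Kimi tooling.

```python
def parallel_runtime(single_agent_seconds: float, speedup: float) -> float:
    """Wall-clock time after a `speedup`-fold improvement over the single-agent run."""
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return single_agent_seconds / speedup


baseline = 45 * 60                         # hypothetical 45-minute single-agent run, in seconds
best_case = parallel_runtime(baseline, 4.5)

print(best_case / 60)                      # 10.0 (minutes)
print(round(1 - best_case / baseline, 3))  # 0.778, i.e. roughly 78% less time
```

Because the claim is "up to" 4.5x, this is the best case; tasks with little parallelizable work would see a smaller gain.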

Benchmark snapshot

Recreated benchmark highlights

The table below recreates the published benchmark snapshot comparing Kimi K2.5 with GPT-5.2 (xhigh), Claude Opus 4.5, and Gemini 3 Pro. All values are given as percentile (%).

Category  Benchmark                    Kimi K2.5  GPT-5.2 (xhigh)  Claude Opus 4.5  Gemini 3 Pro
Agents    Humanity's Last Exam (Full)  50.2       45.5             43.2             45.8
Agents    BrowseComp                   74.9       65.8             57.8             59.2
Agents    DeepSearchQA                 77.1       71.3             76.1             63.2
Coding    SWE-bench Verified           76.8       80               80.9             76.2
Coding    SWE-bench Multilingual       73         72               77.5             65
Image     MMMU Pro                     78.5       79.5             74               81
Image     MathVision                   84.2       83               77.1             86.1
Image     OmniDocBench 1.5*            88.8       85.7             87.7             88.5
Video     VideoMMMU                    86.6       85.9             84.4             87.6
Video     LongVideoBench               79.8       76.5             67.2             77.7

* OmniDocBench score is computed as (1 − normalized Levenshtein distance) × 100, where a higher score denotes superior accuracy.
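The footnote's formula can be sketched in a few lines. This is a minimal illustration, assuming the distance is normalized by the length of the longer string; the benchmark's own normalization and per-document aggregation may differ, and `levenshtein` and `edit_score` are names chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance via the standard two-row dynamic program."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def edit_score(predicted: str, reference: str) -> float:
    """(1 - normalized Levenshtein distance) * 100; higher is better."""
    longest = max(len(predicted), len(reference))
    if longest == 0:
        return 100.0  # two empty strings match perfectly
    # Assumption: normalize by the longer string's length.
    normalized = levenshtein(predicted, reference) / longest
    return (1 - normalized) * 100


# One wrong character in a 20-character line scores about 95.
print(edit_score("Invoice tota1: 42.00", "Invoice total: 42.00"))
```

A perfect transcription scores 100 and a completely different string of the same length scores 0, which is why a higher value denotes better accuracy.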

Ready to build with Kimi K2.5?

The most powerful multimodal Kimi model is now live — free to try on LeemerChat for every user.

