Frontier Model Refresh

We just moved a chunk of the frontier into the free tier.

This launch is about more than adding model IDs. We re-cut the lineup around what users actually do: long-context coding, screenshot-heavy debugging, multimodal planning, and agent loops that keep running after the first response.

Free frontier: 3

MiMo-V2-Pro, GLM-5V-Turbo, and Gemma 4 31B IT now ship as free partner models.

Premium refresh: 3

GPT-5.4, GLM-5, and MiniMax M2.7 now anchor the paid frontier tier.

Biggest jump: 1M context

Both GPT-5.4 and MiMo-V2-Pro push the active catalog into million-token territory.

Free partner models

Xiaomi MiMo-V2-Pro

xiaomi/mimo-v2-pro
1M context

Xiaomi positions MiMo-V2-Pro as a global top-tier agent model. In practice, it gives free users a genuinely large-context partner option for long-horizon coding, planning, and agent loops.


Z.AI GLM-5V-Turbo

z-ai/glm-5v-turbo
202K context

GLM-5V-Turbo is the free multimodal agent pick. It is built for image, video, and text inputs, which matters when workflows start from screenshots, UI bugs, diagrams, and documents instead of plain prompts.


Google Gemma 4 31B IT

google/gemma-4-31b-it
262K context

Gemma 4 31B IT brings dense multimodal reasoning, native function calling, and strong document understanding into the free tier. It is the pragmatic open-weight choice for structured work.

Premium frontier models

OpenAI GPT-5.4

openai/gpt-5.4
1M context

GPT-5.4 is now the flagship premium OpenAI slot in LeemerChat. OpenAI explicitly recommends it for complex reasoning and coding, and the 1M-token window changes what counts as a single-session problem.


Z.AI GLM-5

z-ai/glm-5
80K context

GLM-5 replaces the older GLM-4.7 line across our active configs. The point is not just a newer name: it is the stronger frontier alternative for coding, agent planning, and synthesis-heavy work.


MiniMax M2.7

minimax/minimax-m2.7
200K context

MiniMax positions M2.7 as its next-generation productivity model. For us, it becomes the single MiniMax recommendation for coding, productivity, and tool-oriented execution instead of a split M2.1 versus M2.5 story.
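
For readers wiring these models into their own tooling, here is a minimal sketch of the refreshed lineup as a lookup table, with a helper that lists which models can hold a prompt of a given size. The model IDs and context figures are the ones quoted in this post; the exact token counts (for example, treating 1M as 1,000,000), the `ModelEntry` type, the `models_that_fit` helper, and the 8,000-token output reserve are illustrative assumptions, not LeemerChat's actual configuration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    model_id: str
    tier: str            # "free" or "premium"
    context_tokens: int  # nominal window, rounded from the figures in this post


# IDs and windows as listed in the post; exact token counts are assumed.
CATALOG = [
    ModelEntry("xiaomi/mimo-v2-pro", "free", 1_000_000),
    ModelEntry("z-ai/glm-5v-turbo", "free", 202_000),
    ModelEntry("google/gemma-4-31b-it", "free", 262_000),
    ModelEntry("openai/gpt-5.4", "premium", 1_000_000),
    ModelEntry("z-ai/glm-5", "premium", 80_000),
    ModelEntry("minimax/minimax-m2.7", "premium", 200_000),
]


def models_that_fit(prompt_tokens, tier=None, reserve_for_output=8_000):
    """Return catalog entries whose window holds the prompt plus an output reserve.

    Results are sorted from smallest to largest window, so the first entry
    is the tightest fit. The reserve size is a hypothetical default.
    """
    needed = prompt_tokens + reserve_for_output
    fits = [
        m for m in CATALOG
        if m.context_tokens >= needed and (tier is None or m.tier == tier)
    ]
    return sorted(fits, key=lambda m: m.context_tokens)


if __name__ == "__main__":
    for entry in models_that_fit(250_000, tier="free"):
        print(entry.model_id, entry.context_tokens)
```

Sorting by window size is a design choice for this sketch: it surfaces the smallest model that still fits, which is often the cheaper or faster pick.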

Why These Count As Frontier

Frontier is now about operating range, not just benchmark bragging.

Frontier no longer means premium-only

The biggest change is economic, not cosmetic. Free users now get serious frontier-grade options instead of fallback models that only make sense for lightweight chat.

The lineup is simpler

We removed overlapping GPT-5 chat variants, retired the GLM-4.7 split, and collapsed MiniMax into one active premium recommendation. Fewer tiers, less selector noise, clearer picks.
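
If you have saved configs that still point at a retired model, a small alias table is one way to handle the consolidation on your side. This is a sketch only: the legacy IDs below are hypothetical placeholders (this post does not list the retired IDs), and `resolve_model_id` is an illustrative helper, not a LeemerChat API.

```python
# Hypothetical legacy IDs mapped to the current picks named in this post.
REPLACEMENTS = {
    "z-ai/glm-4.7": "z-ai/glm-5",                    # assumed legacy ID
    "minimax/minimax-m2.1": "minimax/minimax-m2.7",  # assumed legacy ID
    "minimax/minimax-m2.5": "minimax/minimax-m2.7",  # assumed legacy ID
}


def resolve_model_id(requested):
    """Follow the alias table until reaching an ID that is not retired.

    The `seen` set guards against an accidental cycle in the table.
    """
    seen = set()
    current = requested
    while current in REPLACEMENTS and current not in seen:
        seen.add(current)
        current = REPLACEMENTS[current]
    return current


if __name__ == "__main__":
    print(resolve_model_id("minimax/minimax-m2.5"))
```

IDs that are not in the table pass through unchanged, so current model IDs keep working without special handling.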

Agentic work is the organizing principle

These additions are here because they are good at execution loops: repo reading, long planning arcs, multimodal debugging, and document-grounded reasoning.

Launch Offer

Claim your first month for $1

We start from a base price of $1 USD and convert it to your local currency at a live exchange rate. If you are in Ireland, you will see the price in euros. If you are in India, you will see it in rupees.

Prices are localized for your region, with the base USD price as the fallback.

Exchange-rate data comes from ExchangeRate-API. The offer display adapts at runtime based on your detected location.
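
The localization logic described here can be sketched roughly as follows. Everything in this block is an assumption for illustration: the country-to-currency map, the shape of the `usd_rates` dictionary, and the `localized_price` helper are hypothetical, and the sketch does not reproduce the ExchangeRate-API response format or our production code.

```python
from decimal import Decimal, ROUND_HALF_UP

BASE_PRICE_USD = Decimal("1.00")

# Hypothetical lookup tables covering only the regions named in this post.
COUNTRY_TO_CURRENCY = {"IE": "EUR", "IN": "INR", "US": "USD"}
SYMBOLS = {"EUR": "€", "INR": "₹", "USD": "$"}


def localized_price(country_code, usd_rates):
    """Format the $1 base price in the visitor's currency.

    `usd_rates` maps a currency code to units of that currency per 1 USD.
    Falls back to the USD price when the country or the rate is unknown.
    """
    currency = COUNTRY_TO_CURRENCY.get(country_code, "USD")
    rate = usd_rates.get(currency)
    if currency == "USD" or rate is None:
        return f"${BASE_PRICE_USD:.2f}"
    amount = (BASE_PRICE_USD * Decimal(str(rate))).quantize(
        Decimal("0.01"), rounding=ROUND_HALF_UP
    )
    return f"{SYMBOLS[currency]}{amount:.2f}"


if __name__ == "__main__":
    # Placeholder rates for demonstration only, not real market data.
    demo_rates = {"EUR": 0.5, "INR": 80}
    print(localized_price("IE", demo_rates))
    print(localized_price("ZZ", demo_rates))
```

Using `Decimal` rather than floats keeps the rounding to two places predictable, which matters when the displayed amount is a price.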

Source notes

This post is based on provider docs, model cards, and public model pages. Where a provider’s public page makes a high-level positioning claim rather than exposing a full benchmark sheet, we phrase that cautiously in the copy above.
