Release Snapshot

DeepSeek V3.1 Terminus launched on September 22, 2025 as a targeted refinement of the August 21 DeepSeek V3.1 release. DeepSeek has already upgraded its app, web, and API experiences to Terminus, so existing agents inherit the improvements without additional migration work.

Multilingual Enhancements

Terminus focuses on stronger multilingual alignment, delivering more consistent responses when prompts switch languages mid-session. The model keeps the 128K token context window and introduces decoding tweaks that reduce hallucinations in cross-language question answering. For teams shipping global products, these changes cut the time spent rewriting prompts for each locale.

Agent Performance

Benchmark gains validate the release: Terminus records 57.8 on SWE-bench Multilingual (up from 54.5) and 62.9 on MixInstruct 2/8-shot benchmarks (up from 59.2). The model also posts 68.4 on SWE Verified and 91.2 on HumanEval, showing broader reasoning improvements that support longer agent chains.

Feature Stack

The core architecture remains a 685B-parameter Mixture-of-Experts design with roughly 37B active parameters per token. Builders still get dual Swift (fast) and Think (deliberative) inference profiles, plus integrated dataset and vector management so retrieval and fine-tuning share the same control plane. You can adopt Terminus without refactoring existing pipelines.

Deployment and Access

DeepSeek publishes Terminus checkpoints in BF16, FP8 (E4M3), and FP32 under the MIT license on Hugging Face, with ModelScope mirrors for mainland China workloads. That flexibility makes it easier to target different accelerator stacks while meeting precision and cost constraints.

Next Steps

Revisit usage budgets ahead of the Terminus, Swift, and Think pricing that took effect on September 5, 2025.
Re-run multilingual QA and instruction-following tests to confirm prompts behave as expected under the new decoding defaults.
Download the latest checkpoints to stage fine-tuning or evaluation pipelines before large-scale rollout.

Guides

2026-07-19

How to Use Kimi K3 for Free (Honest Options in July 2026)

Want Kimi K3 without a big bill? What’s free today—signup rewards, free tier limits, what still costs money, and what “open weights July 27” actually changes.

10 min min read

Analysis

2026-07-18

Kimi K3 vs Claude & GPT: Cost and When to Switch

Kimi K3 vs Claude and GPT isn’t “delete your API keys.” Here’s the practical switch rule—list price, coding signals, and when to keep Sonnet or Sol.

12 min min read

Analysis

2026-07-17

Kimi K3 Open Weights July 27: What You Can Use Today

Feeds say “open 3T-class model,” but Hugging Face is still empty. Here’s what Moonshot actually promised for July 27—and what you should run this week instead of waiting on a local GPU fantasy.

10 min min read

DeepSeek V3.1 Terminus: Multilingual Agents Ready for Production

Release Snapshot

Multilingual Enhancements

Agent Performance

Feature Stack

Deployment and Access

Next Steps

Popular Kimi K2 paths

Kimi K3

Kimi K2.7 Code

Kimi Code

Kimi K3 Status

Related Articles