Model Update
6 minutes min read
DeepSeek Insights Team

DeepSeek V3.1 Terminus: Multilingual Agents Ready for Production

DeepSeek V3.1 Terminus: What's New for Builders

Release Snapshot

DeepSeek V3.1 Terminus launched on September 22, 2025 as a targeted refinement of the August 21 DeepSeek V3.1 release. DeepSeek has already upgraded its app, web, and API experiences to Terminus, so existing agents inherit the improvements without additional migration work.

Multilingual Enhancements

Terminus focuses on stronger multilingual alignment, delivering more consistent responses when prompts switch languages mid-session. The model keeps the 128K token context window and introduces decoding tweaks that reduce hallucinations in cross-language question answering. For teams shipping global products, these changes cut the time spent rewriting prompts for each locale.

Agent Performance

Benchmark gains validate the release: Terminus records 57.8 on SWE-bench Multilingual (up from 54.5) and 62.9 on MixInstruct 2/8-shot benchmarks (up from 59.2). The model also posts 68.4 on SWE Verified and 91.2 on HumanEval, showing broader reasoning improvements that support longer agent chains.

Feature Stack

The core architecture remains a 685B-parameter Mixture-of-Experts design with roughly 37B active parameters per token. Builders still get dual Swift (fast) and Think (deliberative) inference profiles, plus integrated dataset and vector management so retrieval and fine-tuning share the same control plane. You can adopt Terminus without refactoring existing pipelines.

Deployment and Access

DeepSeek publishes Terminus checkpoints in BF16, FP8 (E4M3), and FP32 under the MIT license on Hugging Face, with ModelScope mirrors for mainland China workloads. That flexibility makes it easier to target different accelerator stacks while meeting precision and cost constraints.

Next Steps

  • Revisit usage budgets ahead of the Terminus, Swift, and Think pricing that took effect on September 5, 2025.
  • Re-run multilingual QA and instruction-following tests to confirm prompts behave as expected under the new decoding defaults.
  • Download the latest checkpoints to stage fine-tuning or evaluation pipelines before large-scale rollout.

Related Articles

Moonshot AI has officially shipped Kimi K2.6, graduating the Code Preview branch into a general-availability model built for 12-hour autonomous coding sessions, 300-agent swarms, and full-stack generation. Here is what changed, what it means, and how to put it to work.
The interesting question about Kimi K2.6 is not what it does — it is what kind of model it is clearly being built to host. Treat the 12-hour runs, 300-agent swarms, and context compressor as load-bearing infrastructure, and the shape of K3 becomes visible.
On April 13, 2026, Moonshot AI officially confirmed that Kimi K2.6 Code Preview has entered beta testing. Built on a trillion-parameter MoE architecture, this next-generation model delivers significant improvements in code generation and agent capabilities.