moonshotai/Kimi-K2-Instruct
by moonshot
Pricing
Kimi-K2-Instruct by Moonshot AI
Overview
Kimi-K2-Instruct is a 1T-parameter-scale Mixture-of-Experts (MoE) instruction-tuned language model developed by Moonshot AI. Released in July 2025, it is optimized for multi-turn chat, code generation, reasoning, and agentic tool use. Despite its trillion-parameter size, only 32B parameters are active per forward pass, making it highly efficient.
Kimi-K2 ranks among the top open-source models, rivaling proprietary systems like GPT-4 and Claude Opus in many key benchmarks โ while remaining fully accessible to the community.
Key Features
- ๐ง 1T parameter MoE model with 32B active per token
- ๐งช State-of-the-art benchmark scores in coding, math, and agentic tasks
- ๐ง Instruction-tuned for multi-turn conversations and complex reasoning
- ๐ Tool use support with function-calling and plugin frameworks
- ๐งฐ Optimized for long context and API integration
- ๐ค Open weights via Hugging Face with flexible licensing
Use Cases
- ๐ฌ Chatbots and AI agents
- ๐ ๏ธ Developer copilots and code assistants
- ๐งฎ Mathematical reasoning and symbolic tasks
- ๐ Academic research and fine-tuning experiments
- ๐ข Enterprise agent frameworks with API deployment
Limitations
- โ๏ธ Requires high-end GPU or distributed setup for local use
- ๐ No official fine-tuned version for RLHF yet (though 3rd-party versions exist)
- ๐ Not yet optimized for low-resource (edge) deployment
Performance Benchmarks
Benchmark | Kimi-K2-Instruct | GPT-4.1 | Claude 3 Opus | Gemini 1.5 Pro |
---|---|---|---|---|
MMLU | 89.5% | 86.4% | 88.7% | 84.5% |
HumanEval (Code) | 82.4% | 83.1% | 79.5% | 75.3% |
LiveCodeBench v6 | 53.7% (pass@1) | 44.7% | 42.2% | 38.0% |
SWE-bench Verified | 65.8% | 63.7% | 58.1% | 55.0% |
AIME 2024 | 69.6% | 68.5% | 64.9% | 60.2% |
Model Comparison
Model Name | Architecture | Parameters | MoE Routing | Audio/Multimodal | Open Source | API Access |
---|---|---|---|---|---|---|
Kimi-K2-Instruct | MoE | 1T (32B active) | Top-2 of 64 experts | No (text only) | โ Yes | Hugging Face / OpenRouter |
GPT-4.1 (OpenAI) | Dense | ~1.8T (est.) | โ | Tool use, no native audio | โ No | OpenAI only |
Claude 3 Opus | Dense | ~1.2T (est.) | โ | Tool use, no audio | โ No | Anthropic API |
Gemini 1.5 Pro | Mixture | ~1.5T+ | Yes | Vision + Text | โ No | Google Gemini |
LLaMA 3 70B | Dense | 70B | โ | Text only | โ Yes | Ollama / HF / Replicate |
DeepSeek-V3 | MoE | 236B (16B active) | Top-2 | Text only | โ Yes | HF / OpenRouter |
Kimi-K2-Instruct redefines the boundary of open LLMs, bringing trillion-scale reasoning to the global community. Whether youโre building tools, agents, or assistants โ itโs built for you.