moonshotai/Kimi-K2-Instruct

by moonshot

Pricing

Input $0.24 / 1M tokens
Output $1.00 / 1M tokens

Kimi-K2-Instruct by Moonshot AI


Overview

Kimi-K2-Instruct is a 1T-parameter-scale Mixture-of-Experts (MoE) instruction-tuned language model developed by Moonshot AI. Released in July 2025, it is optimized for multi-turn chat, code generation, reasoning, and agentic tool use. Despite its trillion-parameter size, only 32B parameters are active per forward pass, making it highly efficient.

Kimi-K2 ranks among the top open-source models, rivaling proprietary systems like GPT-4 and Claude Opus in many key benchmarks โ€” while remaining fully accessible to the community.


Key Features

  • ๐Ÿง  1T parameter MoE model with 32B active per token
  • ๐Ÿงช State-of-the-art benchmark scores in coding, math, and agentic tasks
  • ๐Ÿ”ง Instruction-tuned for multi-turn conversations and complex reasoning
  • ๐Ÿ”— Tool use support with function-calling and plugin frameworks
  • ๐Ÿงฐ Optimized for long context and API integration
  • ๐Ÿ“ค Open weights via Hugging Face with flexible licensing

Use Cases

  • ๐Ÿ’ฌ Chatbots and AI agents
  • ๐Ÿ› ๏ธ Developer copilots and code assistants
  • ๐Ÿงฎ Mathematical reasoning and symbolic tasks
  • ๐Ÿ“š Academic research and fine-tuning experiments
  • ๐Ÿข Enterprise agent frameworks with API deployment

Limitations

  • โš™๏ธ Requires high-end GPU or distributed setup for local use
  • ๐Ÿ“‰ No official fine-tuned version for RLHF yet (though 3rd-party versions exist)
  • ๐Ÿ“„ Not yet optimized for low-resource (edge) deployment

Performance Benchmarks

BenchmarkKimi-K2-InstructGPT-4.1Claude 3 OpusGemini 1.5 Pro
MMLU89.5%86.4%88.7%84.5%
HumanEval (Code)82.4%83.1%79.5%75.3%
LiveCodeBench v653.7% (pass@1)44.7%42.2%38.0%
SWE-bench Verified65.8%63.7%58.1%55.0%
AIME 202469.6%68.5%64.9%60.2%

Model Comparison

Model NameArchitectureParametersMoE RoutingAudio/MultimodalOpen SourceAPI Access
Kimi-K2-InstructMoE1T (32B active)Top-2 of 64 expertsNo (text only)โœ… YesHugging Face / OpenRouter
GPT-4.1 (OpenAI)Dense~1.8T (est.)โœ–Tool use, no native audioโŒ NoOpenAI only
Claude 3 OpusDense~1.2T (est.)โœ–Tool use, no audioโŒ NoAnthropic API
Gemini 1.5 ProMixture~1.5T+YesVision + TextโŒ NoGoogle Gemini
LLaMA 3 70BDense70Bโœ–Text onlyโœ… YesOllama / HF / Replicate
DeepSeek-V3MoE236B (16B active)Top-2Text onlyโœ… YesHF / OpenRouter

Kimi-K2-Instruct redefines the boundary of open LLMs, bringing trillion-scale reasoning to the global community. Whether youโ€™re building tools, agents, or assistants โ€” itโ€™s built for you.