deepseek-reasoner
by deepseek
Pricing
Input
$0.04 / 1M tokens
Output
$0.20 / 1M tokens
DeepSeek-Reasoner Release by DeepSeek AI
Overview
DeepSeek-Reasoner is a reasoning-first language model released by DeepSeek AI in mid-2024. It explicitly separates chain-of-thought reasoning from final answers, enabling interpretable and auditable outputs.
- Release Date: May 28, 2024
- Chain-of-Thought: Auto-generated and accessible before final output
- Context Length: 64,000 tokens (CoT reasoning doesn’t count toward this)
- Final Output: Max 64,000 tokens including reasoning
Features
- Explicit reasoning pathway (CoT output)
- High coding, math, and logic strength
- Part of the DeepSeek-R1 family with advanced RL tuning
- OpenAI-compatible API
- Transparent, auditable output generation
Benchmarks
Task | DeepSeek-Reasoner | Claude 3.5 Haiku | GPT-4o Mini |
---|---|---|---|
MMLU (reasoning) | 84.5% | 82.5% | 82.0% |
GSM8K (math) | 92.4% | 89.0% | 87.0% |
HumanEval (coding) | 89.1% | 87.5% | 87.2% |
MMMU (multimodal) | N/A | 59.2% | 59.4% |
Pricing
Type | Price per 1M tokens |
---|---|
Input | $0.10 |
Output (includes CoT) | $0.50 |
Reasoning tokens (CoT) are priced equally with final answers.
Use Cases
- Transparent academic tutors with visible reasoning
- Compliance-focused AI tools with auditable decisions
- Step-by-step logic generation for math/code solvers
- Research assistants and scientific automation
- AI systems requiring explainability and trust
Safety and Stability
- Follows strict instruction-tuning methods
- Internal reasoning can be reviewed or filtered
- CoT generation improves reliability of answers
- No browsing; offline inference only
Limitations
- Slower response due to dual-stage output
- No multimodal (vision/audio) support
- Relatively new; limited third-party benchmarks
License
DeepSeek-Reasoner is commercially usable via API or local deployment under DeepSeek’s open license. Available via API or HuggingFace.