deepseek-reasoner - Simpleterms

DeepSeek-Reasoner Release by DeepSeek AI

Overview

DeepSeek-Reasoner is a reasoning-first language model released by DeepSeek AI in mid-2024. It explicitly separates chain-of-thought reasoning from final answers, enabling interpretable and auditable outputs.

Release Date: May 28, 2024
Chain-of-Thought: Auto-generated and accessible before final output
Context Length: 64,000 tokens (CoT reasoning doesn’t count toward this)
Final Output: Max 64,000 tokens including reasoning

Features

Explicit reasoning pathway (CoT output)
High coding, math, and logic strength
Part of the DeepSeek-R1 family with advanced RL tuning
OpenAI-compatible API
Transparent, auditable output generation

Benchmarks

Task	DeepSeek-Reasoner	Claude 3.5 Haiku	GPT-4o Mini
MMLU (reasoning)	84.5%	82.5%	82.0%
GSM8K (math)	92.4%	89.0%	87.0%
HumanEval (coding)	89.1%	87.5%	87.2%
MMMU (multimodal)	N/A	59.2%	59.4%

Pricing

Type	Price per 1M tokens
Input	$0.10
Output (includes CoT)	$0.50

Reasoning tokens (CoT) are priced equally with final answers.

Use Cases

Transparent academic tutors with visible reasoning
Compliance-focused AI tools with auditable decisions
Step-by-step logic generation for math/code solvers
Research assistants and scientific automation
AI systems requiring explainability and trust

Safety and Stability

Follows strict instruction-tuning methods
Internal reasoning can be reviewed or filtered
CoT generation improves reliability of answers
No browsing; offline inference only

Limitations

Slower response due to dual-stage output
No multimodal (vision/audio) support
Relatively new; limited third-party benchmarks

License

DeepSeek-Reasoner is commercially usable via API or local deployment under DeepSeek’s open license. Available via API or HuggingFace.