DeepSeek-R1
by Community
First-generation reasoning models from DeepSeek.
OSS
Added 1 June 2026
Overview
DeepSeek-R1 is an open-source reasoning model for multi-step logical inference and problem solving. It works through complex problems step by step using chain-of-thought reasoning before producing a final answer. The model can be deployed locally and integrated into other applications.
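Because the model emits its reasoning before the answer, applications often need to separate the two. The sketch below is a minimal illustration, assuming the reasoning is wrapped in `<think>...</think>` tags in the raw output; the helper name `split_reasoning` and the sample text are hypothetical and not part of any official API.

```python
import re

# Assumption: reasoning is delimited by <think>...</think> in the raw text.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(raw_output: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from raw model text.

    If no <think> block is found, the reasoning is empty and the whole
    text is treated as the answer.
    """
    match = THINK_BLOCK.search(raw_output)
    if match is None:
        return "", raw_output.strip()
    reasoning = match.group(1).strip()
    # Everything outside the first <think> block counts as the answer.
    answer = (raw_output[:match.start()] + raw_output[match.end():]).strip()
    return reasoning, answer


# Hypothetical sample output, written by hand for illustration.
sample = "<think>\n17 * 3 = 51, and 51 + 4 = 55.\n</think>\nThe result is 55."
reasoning, answer = split_reasoning(sample)
print("Reasoning:", reasoning)
print("Answer:", answer)
```

Keeping the reasoning separate lets an application log or display the intermediate steps while passing only the final answer downstream.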
Best for
Developers building open-source applications that need interpretable multi-step reasoning without vendor lock-in
Use cases
- Mathematical problem solving and verification
- Code generation with reasoning about correctness
- Logical inference and constraint satisfaction tasks
Notes
92,010 stars on GitHub. Last updated 2025-06-27. Licensed MIT.
Pros
- Open-source and freely available for local deployment
- Transparent reasoning process shows intermediate steps
- Strong community adoption with 92k GitHub stars
Cons
- First-generation model with unproven performance against commercial reasoning systems
- Requires significant compute resources for inference
- Limited documentation on specific reasoning capabilities and failure modes
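To put the compute requirement in rough numbers, the sketch below estimates the memory needed just to hold the model weights at different precisions. It assumes a total of 671B parameters, a figure taken from the model's public release rather than from this entry, and it ignores the KV cache, activations, and runtime overhead, so real requirements will be higher. The function name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Weights-only memory in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Assumption: total parameter count of the full model (not stated in this entry).
ASSUMED_TOTAL_PARAMS = 671e9

for bits in (16, 8, 4):
    gb = weight_memory_gb(ASSUMED_TOTAL_PARAMS, bits)
    print(f"{bits}-bit weights: ~{gb:,.0f} GB")
```

This is a lower bound only; the serving engine, batch size, and context length all add to the total.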
Indexed from awesome-llm and enriched against its public facts.
Pairs with
Other entries in the index that connect to this one.
ollama
Community
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
llama.cpp
Community
LLM inference in C/C++
vLLM
Community
A high-throughput and memory-efficient inference and serving engine for LLMs
arikusi/deepseek-mcp-server
Various
MCP Server for DeepSeek API - enables MCP clients to use DeepSeek Chat and Reasoner models
OthmaneBlial/term_mcp_deepseek
Various
An MCP-like server using the DeepSeek API for Terminal
PsChina/deepseek-as-subagent
Various
Run DeepSeek as a real sub-agent inside Claude Code / Codex CLI — DeepSeek gets its own 7-tool agent loop in a sandboxed workspace, not just a single LLM call.
spranab/brainstorm-mcp
Various
MCP server for multi-round AI brainstorming debates between multiple models (GPT, DeepSeek, Groq, Ollama, etc.)
Chain-of-Thoughts Papers
Community
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
DeepSeek-v2-236B-MoE
Community
We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of whic
DeepSeek-V3 Technical Report
Community
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-eff
Reasoning using Language Models
Community
From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓
unslothai
Community
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Vercel AI SDK
Vercel
The de facto TypeScript SDK for AI apps. Streaming, tools, multi-model, and now an agent loop.
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Community
General reasoning represents a long-standing and formidable challenge in artificial intelligence. Recent breakthroughs, exemplified by large language models (LLMs) and chain-of-t
Baichuan-7|13B
Community
AGI Large Language Models
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Community
BigScience
CodeQwen1.5-7B
Community
The advent of advanced programming tools, which harnesses the power of large language models (LLMs), has significantly en
DeepSeek-V2.5
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
GLM-130B: An Open Bilingual Pre-trained Model
Community
GLM-130B
GLM-2|6|10|13|70B
Community
Org profile for THUDM on Hugging Face, the AI community building the future.
Grok-1-314B-MoE
Community
Grok-1-314B-MoE — indexed from awesome-llm
InternLM2-1.8|7|20B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Kimi-K2
Community
Kimi K2 is the large language model series developed by Moonshot AI team
Llama 1-7|13|33|65B
Community
[OPT-1.3 6.7 13 30 66B](https://arxiv.org/abs/2205.01068)
Llama 3.2-1|3|11|90B
Community
[Llama 3.1-8 70 405B](https://llama.meta.com/)
LLaMA: Open and Efficient Foundation Language Models
Community
2023-02
Mixtral-8x7B
Community
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
MPT-7B
Community
Introducing MPT-7B, the first entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available fo
Nemotron-4-340B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
open-r1
Community
Fully open reproduction of DeepSeek-R1
OpenAI o3-mini
Community
Pushing the frontier of cost-effective reasoning.
OpenELM-1.1|3B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Qwen-1.8B|7B|14B|72B
Community
Qwen - a Qwen Collection
Qwen2-0.5B|1.5B|7B|57B-A14B-MoE|72B
Community
After months of efforts, we are pleased to announce the evolution from Qwen1.5 to Qwen2. This time, we bring to you: Pret
Qwen2.5 Technical Report
Community
In this report, we introduce Qwen2.5, a comprehensive series of large language models (LLMs) designed to meet diverse needs. Compared to previous iterations, Qwen 2.5 has been si
Qwen2.5-Max
Community
It is widely recognized that continuously scaling both data size and model size can lead to significant improvements in model intelligence. However, th
Qwen2-Math-1.5B|7B|72B
Community
This model mainly supports English. We will release bilingual (English and Chinese) math models soon. Over the past year, w
RecurrentGemma-2B
Community
Open weights language model from Google DeepMind, based on Griffin.
Solving Quantitative Reasoning Problems with Language Models
Community
Language models have achieved remarkable performance on a wide range of tasks that require natural language understanding. Nevertheless, state-of-the-art models have generally st
The Llama 3 Herd of Models
Community
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models