Open source alternatives to DeepSeek-R1
26 open-source alternatives in the index, ranked by GitHub stars and freshness.
open-r1
Community
Fully open reproduction of DeepSeek-R1
Best for: Researchers and builders needing transparent, locally-controlled reasoning models
Kimi-K2
Community
Kimi K2 is the large language model series developed by Moonshot AI team
Best for: Developers exploring open-source large language models for custom applications
RecurrentGemma-2B
Community
Open weights language model from Google DeepMind, based on Griffin.
Best for: Researchers and developers exploring efficient open-weight language models.
Baichuan-7|13B
Community
AGI Large Language Models
Best for: Developers and researchers needing an open-source, Chinese-capable large language model for fine-tuning and deployment
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Community
BigScience
Best for: Researchers and developers needing a large, open multilingual model for diverse language and code tasks
CodeQwen1.5-7B
Community
The advent of advanced programming tools, which harness the power of large language models (LLMs), has significantly en
Best for: Developers and teams seeking a cost-effective, privacy-respecting open-source alternative to proprietary coding assistants
DeepSeek-V2.5
Community
Best for: Developers and researchers who need a capable, open-source language model for customization and self-hosting.
GLM-130B: An Open Bilingual Pre-trained Model
Community
GLM-130B
Best for: Researchers and developers needing an open, large-scale bilingual model for experimentation and benchmarking
GLM-2|6|10|13|70B
Community
Org profile for THUDM on Hugging Face, the AI community building the future.
Best for: Developers and researchers who need open-source, customizable Chinese-English LLMs for fine-tuning or deployment.
Grok-1-314B-MoE
Community
Grok-1-314B-MoE — indexed from awesome-llm
Best for: Researchers and teams with high-end hardware who need an extremely large open-source language model
InternLM2-1.8|7|20B
Community
Best for: Developers and researchers who need a range of open language model sizes for testing, deployment, or experimentation
Llama 1-7|13|33|65B
Community
[OPT-1.3 6.7 13 30 66B](https://arxiv.org/abs/2205.01068)
Best for: Researchers and developers needing a performant, open-source language model for fine-tuning and self-hosted deployment.
Llama 3.2-1|3|11|90B
Community
[Llama 3.1-8 70 405B](https://llama.meta.com/)
Best for: Developers who need a versatile, open-source LLM for deployment across devices
LLaMA: Open and Efficient Foundation Language Models
Community
2023-02
Best for: Researchers and developers needing an open, efficient base model for fine-tuning and experimentation
Mixtral-8x7B
Community
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
Best for: Developers and teams who need a high-quality open-source model for fine-tuning and self-hosted deployment
MPT-7B
Community
Introducing MPT-7B, the first entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available fo
Best for: Developers and organizations needing a high-quality open-source LLM with commercial rights for text and code tasks.
Nemotron-4-340B
Community
Best for: Developers and researchers who need a powerful, open foundation model for instruction following and reasoning
OpenAI o3-mini
Community
Pushing the frontier of cost-effective reasoning.
Best for: Developers who need a budget-friendly reasoning model for production use.
OpenELM-1.1|3B
Community
Best for: Developers needing efficient, open-source language models for research or small-scale applications
Qwen-1.8B|7B|14B|72B
Community
Qwen - a Qwen Collection
Best for: Developers seeking scalable open-source LLMs for diverse deployment environments
Qwen2-0.5B|1.5B|7B|57B-A14B-MoE|72B
Community
After months of efforts, we are pleased to announce the evolution from Qwen1.5 to Qwen2. This time, we bring to you: Pret
Best for: Developers needing a flexible, open-source LLM family with strong multilingual and coding capabilities
Qwen2.5 Technical Report
Community
In this report, we introduce Qwen2.5, a comprehensive series of large language models (LLMs) designed to meet diverse needs. Compared to previous iterations, Qwen 2.5 has been si
Best for: Researchers and developers evaluating large language model capabilities and training strategies
Qwen2.5-Max
Community
It is widely recognized that continuously scaling both data size and model size can lead to significant improvements in model intelligence. However, th
Best for: Developers exploring open-source MoE large language models for integration
Qwen2-Math-1.5B|7B|72B
Community
This model mainly supports English. We will release bilingual (English and Chinese) math models soon. Over the past year, w
Best for: Developers who need a math-focused reasoning model within a resource-constrained or open-source pipeline
Solving Quantitative Reasoning Problems with Language Models
Community
Language models have achieved remarkable performance on a wide range of tasks that require natural language understanding. Nevertheless, state-of-the-art models have generally st
Best for: Researchers and developers exploring quantitative reasoning in language models
The Llama 3 Herd of Models
Community
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models
Best for: Developers and researchers seeking a capable, open foundation model for multilingual, coding, and reasoning tasks.