Qwen-1.8B|7B|14B|72B
by Community
Qwen - a Qwen Collection
OSS
Qwen-1.8B|7B|14B|72B
Added 1 June 2026
Overview
A collection of open-source large language models ranging from 1.8B to 72B parameters, hosted on Hugging Face. Users can download and run inference locally or via API, selecting model size based on compute constraints and performance needs.
Best for
Best for
Developers seeking scalable open-source LLMs for diverse deployment environments
Use cases
- Deploying a lightweight 1.8B model for real-time chat on edge devices
- Running the 72B model for complex reasoning and code generation tasks
- Fine-tuning one of the model sizes on domain-specific data for custom applications
Notes
A collection of open-source large language models ranging from 1.8B to 72B parameters, hosted on Hugging Face. Users can download and run inference locally or via API, selecting model size based on compute constraints and performance needs.
Use cases
- Deploying a lightweight 1.8B model for real-time chat on edge devices
- Running the 72B model for complex reasoning and code generation tasks
- Fine-tuning one of the model sizes on domain-specific data for custom applications
Pros
- Wide range of sizes allows matching model capacity to resource limits
- Open-source and freely available for self-hosting or modification
- Good performance for cost compared to many proprietary alternatives
Cons
- Community-maintained project may lack official support or documentation
- Smaller models (1.8B, 7B) have limited capability for nuanced tasks
- Requires significant GPU memory for the 72B variant
Indexed from awesome-llm and enriched against its public facts.
Pros
- Wide range of sizes allows matching model capacity to resource limits
- Open-source and freely available for self-hosting or modification
- Good performance for cost compared to many proprietary alternatives
Cons
- Community-maintained project may lack official support or documentation
- Smaller models (1.8B, 7B) have limited capability for nuanced tasks
- Requires significant GPU memory for the 72B variant
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
vLLM
Community
A high-throughput and memory-efficient inference and serving engine for LLMs
llama.cpp
Community
LLM inference in C/C++
ollama
Community
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Litgpt
Community
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.