Qwen2-0.5B|1.5B|7B|57B-A14B-MoE|72B
by Community
GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD Introduction After months of efforts, we are pleased to announce the evolution from Qwen1.5 to Qwen2. This time, we bring to you: Pret
OSS
Qwen2-0.5B|1.5B|7B|57B-A14B-MoE|72B
Added 1 June 2026
Overview
Qwen2 is a family of open-source large language models released by the Qwen team, available in five sizes from 0.5B to 72B parameters. It includes both pretrained and instruction-tuned variants, supports up to 128K token context windows, and covers 29 languages. The models show strong benchmark performance, especially in coding and mathematics.
Best for
Best for
Developers needing a flexible, open-source LLM family with strong multilingual and coding capabilities
Use cases
- Deploying a small, efficient model for on-device or edge inference
- Building multilingual chatbots or assistants with extended context handling
- Fine-tuning for specialized coding or math reasoning tasks
Notes
Qwen2 is a family of open-source large language models released by the Qwen team, available in five sizes from 0.5B to 72B parameters. It includes both pretrained and instruction-tuned variants, supports up to 128K token context windows, and covers 29 languages. The models show strong benchmark performance, especially in coding and mathematics.
Use cases
- Deploying a small, efficient model for on-device or edge inference
- Building multilingual chatbots or assistants with extended context handling
- Fine-tuning for specialized coding or math reasoning tasks
Pros
- Multiple size options allow matching model to compute budget
- Strong coding and math performance relative to parameter count
- Long 128K context window supported on instruction-tuned variants
Cons
- Community-driven release may have less formal support than vendor-backed models
- Larger models (72B) require significant hardware for inference
- Benchmark gains may not translate equally to all downstream tasks
Indexed from awesome-llm and enriched against its public facts.
Pros
- Multiple size options allow matching model to compute budget
- Strong coding and math performance relative to parameter count
- Long 128K context window supported on instruction-tuned variants
Cons
- Community-driven release may have less formal support than vendor-backed models
- Larger models (72B) require significant hardware for inference
- Benchmark gains may not translate equally to all downstream tasks
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
ollama
Community
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
vLLM
Community
A high-throughput and memory-efficient inference and serving engine for LLMs
llama.cpp
Community
LLM inference in C/C++