The Llama 3 Herd of Models
by Community
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models
OSS
The Llama 3 Herd of Models
Added 1 June 2026
Overview
The Llama 3 Herd of Models is a set of foundation language models developed by a community effort. The largest model is a dense Transformer with 405B parameters and a 128K token context window. It supports multilinguality, coding, reasoning, and tool usage, and its quality is comparable to GPT-4 on many tasks.
Best for
Best for
Developers and researchers seeking a capable, open foundation model for multilingual, coding, and reasoning tasks.
Use cases
- Multilingual natural language processing and generation
- Code generation, completion, and software development assistance
- Building AI agents that reason and use external tools
Notes
The Llama 3 Herd of Models is a set of foundation language models developed by a community effort. The largest model is a dense Transformer with 405B parameters and a 128K token context window. It supports multilinguality, coding, reasoning, and tool usage, and its quality is comparable to GPT-4 on many tasks.
Use cases
- Multilingual natural language processing and generation
- Code generation, completion, and software development assistance
- Building AI agents that reason and use external tools
Pros
- Publicly released with pre-trained and post-trained weights available
- Performance comparable to GPT-4 across a wide range of benchmarks
- Long 128K token context window for extended inputs
Cons
- Very large 405B parameter model demands substantial compute resources
- Community release may have less formal support and documentation than proprietary alternatives
- Large model size limits deployment to high-end hardware
Indexed from awesome-llm and enriched against its public facts.
Pros
- Publicly released with pre-trained and post-trained weights available
- Performance comparable to GPT-4 across a wide range of benchmarks
- Long 128K token context window for extended inputs
Cons
- Very large 405B parameter model demands substantial compute resources
- Community release may have less formal support and documentation than proprietary alternatives
- Large model size limits deployment to high-end hardware
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
llama.cpp
Community
LLM inference in C/C++
vLLM
Community
A high-throughput and memory-efficient inference and serving engine for LLMs
ollama
Community
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
LangChain
Community
The agent engineering platform.