Pythia-1|1.4|2.8|6.9|12B
by Community
The hub for EleutherAI's work on interpretability and learning dynamics
OSS
Pythia-1|1.4|2.8|6.9|12B
Added 1 June 2026
Overview
Pythia is a suite of models and tools from EleutherAI for studying interpretability and learning dynamics. It provides multiple model sizes (from 1.4B to 12B parameters) to enable comparative analysis of how language models evolve during training. The project is open source and hosted on GitHub.
Best for
Best for
Researchers studying interpretability and learning dynamics of large language models
Use cases
- Analyzing learning dynamics across different model scales
- Studying internal representations and interpretability of language models
- Comparing training trajectories of models from the same architecture
Notes
Pythia is a suite of models and tools from EleutherAI for studying interpretability and learning dynamics. It provides multiple model sizes (from 1.4B to 12B parameters) to enable comparative analysis of how language models evolve during training. The project is open source and hosted on GitHub.
2,812 stars on GitHub. Last updated 2025-11-15. Licensed Apache-2.0.
Use cases
- Analyzing learning dynamics across different model scales
- Studying internal representations and interpretability of language models
- Comparing training trajectories of models from the same architecture
Pros
- Open source with community contributions
- Multiple model sizes enable scale-dependent analysis
- Focused on interpretability research, not just model release
Cons
- Primarily a research tool, not optimized for production deployment
- Documentation may be sparse for non-researchers
- Requires significant computational resources to run large models
Indexed from awesome-llm and enriched against its public facts.
Pros
- Open source with community contributions
- Multiple model sizes enable scale-dependent analysis
- Focused on interpretability research, not just model release
Cons
- Primarily a research tool, not optimized for production deployment
- Documentation may be sparse for non-researchers
- Requires significant computational resources to run large models
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.