Litgpt
by Community
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
OSS
Litgpt
Added 1 June 2026
Overview
LitGPT is a Python framework providing 20+ pre-built high-performance LLM implementations with recipes for pretraining, finetuning, and deployment at scale. It abstracts away infrastructure complexity to let builders work directly with model training and inference pipelines.
Best for
Best for
Teams building or customizing LLMs at scale with access to compute resources
Use cases
- Finetuning open-source LLMs on custom datasets
- Pretraining models from scratch with distributed compute
- Deploying trained models to production environments
Notes
LitGPT is a Python framework providing 20+ pre-built high-performance LLM implementations with recipes for pretraining, finetuning, and deployment at scale. It abstracts away infrastructure complexity to let builders work directly with model training and inference pipelines.
13,395 stars on GitHub. Last updated 2026-06-01. Licensed Apache-2.0.
Use cases
- Finetuning open-source LLMs on custom datasets
- Pretraining models from scratch with distributed compute
- Deploying trained models to production environments
Pros
- Covers full model lifecycle from pretraining through deployment
- 20+ vetted model implementations reduce setup time
- Built on Lightning for distributed training out of the box
Cons
- Requires Python and familiarity with training workflows
- Community-maintained, not backed by commercial support
- Learning curve steeper than inference-only tools
Indexed from awesome-llm and enriched against its public facts.
Pros
- Covers full model lifecycle from pretraining through deployment
- 20+ vetted model implementations reduce setup time
- Built on Lightning for distributed training out of the box
Cons
- Requires Python and familiarity with training workflows
- Community-maintained, not backed by commercial support
- Learning curve steeper than inference-only tools
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
femtoGPT
Community
Pure Rust implementation of a minimal Generative Pretrained Transformer
Finetuned Language Models are Zero-Shot Learners
Community
This paper explores a simple method for improving the zero-shot learning abilities of language models. We show that instruction tuning—finetuning language models on a collection
MPT-7B
Community
Introducing MPT-7B, the first entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available fo
Qwen-1.8B|7B|14B|72B
Community
Qwen - a Qwen Collection
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Community
The performance of a large language model (LLM) depends heavily on the quality and size of its pretraining dataset. However, the pretraining datasets for state-of-the-art open LL
femtoGPT
Community
Pure Rust implementation of a minimal Generative Pretrained Transformer
Llama 3-8|70B
Community
[Llama 2-7 13 70B](https://llama.meta.com/llama2/)
Meta Lingua
Community
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
torchtune
Community
PyTorch native post-training library