
nanotron

by Community

Minimalistic large language model 3D-parallelism training

Added 1 June 2026

Overview

Nanotron is a minimalistic framework for training large language models using 3D parallelism. It implements data, tensor, and pipeline parallelism in Python to distribute training across multiple GPUs.
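To make the three axes concrete, here is a minimal sketch of how a flat GPU rank could be mapped onto data, pipeline, and tensor coordinates. It is not Nanotron's code or API: the names `ParallelCoords` and `rank_to_coords` are hypothetical, and the rank ordering (tensor-parallel index varying fastest) is an assumption chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ParallelCoords:
    """Position of one GPU in a 3D-parallel grid (hypothetical helper type)."""
    dp: int  # data-parallel replica index
    pp: int  # pipeline stage index
    tp: int  # tensor-parallel shard index


def rank_to_coords(rank: int, dp_size: int, pp_size: int, tp_size: int) -> ParallelCoords:
    """Map a flat rank to (dp, pp, tp) coordinates.

    Assumed layout: tp varies fastest, then pp, then dp. A real framework
    may order the axes differently.
    """
    world_size = dp_size * pp_size * tp_size
    if not 0 <= rank < world_size:
        raise ValueError(f"rank {rank} outside world size {world_size}")
    tp = rank % tp_size
    pp = (rank // tp_size) % pp_size
    dp = rank // (tp_size * pp_size)
    return ParallelCoords(dp=dp, pp=pp, tp=tp)


if __name__ == "__main__":
    # 8 GPUs split as 2 data replicas x 2 pipeline stages x 2 tensor shards.
    for rank in range(8):
        print(rank, rank_to_coords(rank, dp_size=2, pp_size=2, tp_size=2))
```

The product of the three sizes must equal the number of GPUs, which is why changing one degree of parallelism forces a change in at least one of the others.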

Best for

Researchers and engineers who need a simple, hackable framework for distributed LLM training experiments.

Use cases

Training large language models from scratch with distributed parallelism
Experimenting with 3D parallelism strategies for model scaling
Reproducing research results in distributed LLM training

Notes

2,705 stars on GitHub. Last updated 2026-05-26. Licensed Apache-2.0.

Pros

Lightweight and focused on core parallelism techniques
Active community, with 2,705 GitHub stars
Integrates well with the Hugging Face ecosystem

Cons

Limited to training; no inference or deployment features
Minimal documentation beyond code comments
Requires deep understanding of distributed training concepts

Indexed from awesome-llm and enriched against its public facts.

Pairs with

Other entries in the index that connect to this one.

Built with (1 entry)

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 1mo ago

Pairs with (2 entries)

O OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 1mo ago

O OSS Framework medium

Megatron-LM

Community

Ongoing research training transformer models at scale

★ 16,545 updated 1mo ago

Alternative to (3 entries)

O OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 1mo ago

O OSS Framework medium

Colossal-AI

Community

Making large AI models cheaper, faster and more accessible

★ 41,382 updated 1mo ago

O OSS Framework medium

Megatron-LM

Community

Ongoing research training transformer models at scale

★ 16,545 updated 1mo ago
