O Open Source Frameworks medium

Megatron-LM

by Community

Ongoing research training transformer models at scale

Visit Community View repo Submit your build →

OSS

Megatron-LM

Added 1 June 2026

#large-language-models #model-para #transformers

Overview

Megatron-LM is a Python framework for training large transformer models at scale, developed and maintained by NVIDIA. It provides distributed training optimizations and memory-efficient techniques to handle models that exceed single-GPU capacity.

Best for

Best for
ML engineers training large transformer models who need production-grade distributed training infrastructure

Use cases

Training billion-parameter language models across multiple GPUs
Reducing memory footprint and training time for large transformers
Implementing pipeline parallelism and tensor parallelism strategies

Notes

16,545 stars on GitHub. Last updated 2026-06-01.

Use cases

Training billion-parameter language models across multiple GPUs
Reducing memory footprint and training time for large transformers
Implementing pipeline parallelism and tensor parallelism strategies

Pros

Production-grade distributed training infrastructure from NVIDIA
Significant memory and compute optimizations for large models
Active research codebase with ongoing improvements

Cons

Steep learning curve for distributed training concepts
Requires multi-GPU or multi-node setup to be practical
Community-driven with less formal support than commercial alternatives

Indexed from awesome-llm and enriched against its public facts.

Pros

Production-grade distributed training infrastructure from NVIDIA
Significant memory and compute optimizations for large models
Active research codebase with ongoing improvements

Cons

Steep learning curve for distributed training concepts
Requires multi-GPU or multi-node setup to be practical
Community-driven with less formal support than commercial alternatives

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Uses1entry

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 1mo ago

Built with1entry

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 1mo ago

Pairs with1entry

O OSS Framework medium

NeMo Framework

Community

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech

★ 17,285 updated 1mo ago

Alternative to2entries

O OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 1mo ago

O OSS Framework medium

Colossal-AI

Community

Making large AI models cheaper, faster and more accessible

★ 41,382 updated 1mo ago

Powers2entries

O OSS Framework medium

BLOOMZ&mT0

Community

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

O OSS Framework medium

GPT-NeoX

Community

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

★ 7,432 updated 1mo ago

Pairs with3entries

O OSS Framework medium

Large Language Model Training in 2023

Community

Learn about large language model training with insights on large language model examples, model architecture, and model training guide.

O OSS Framework medium

Liger-Kernel

Community

Efficient Triton Kernels for LLM Training

★ 6,400 updated 1mo ago

O OSS Framework medium

nanotron

Community

Minimalistic large language model 3D-parallelism training

★ 2,705 updated 1mo ago

Alternatives9entries

O OSS Framework medium

Axolotl

Community

Go ahead and axolotl questions

★ 11,997 updated 1mo ago

O OSS Framework medium

BMTrain

Community

Efficient Training (including pre-training and fine-tuning) for Big Models

★ 624 updated 2mo ago

O OSS Framework medium

Colossal-AI

Community

Making large AI models cheaper, faster and more accessible

★ 41,382 updated 1mo ago

O OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 1mo ago

O OSS Framework medium

maxtext

Community

A simple, performant and scalable Jax LLM!

★ 2,303 updated 1mo ago

O OSS Framework medium

Mesh Tensorflow

Community

Mesh TensorFlow: Model Parallelism Made Easier

★ 1,625 updated 2y ago

O OSS Framework medium

nanotron

Community

Minimalistic large language model 3D-parallelism training

★ 2,705 updated 1mo ago

O OSS Framework medium

torchtitan

Community

A PyTorch native platform for training generative AI models

★ 5,394 updated 1mo ago

O OSS Framework medium

Transformer Engine

Community

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide b

★ 3,374 updated 1mo ago

Free 27-page guide

Get the free Developer’s Field Guide

A 27-page field guide to the AI coding workflow with Claude. Claude Code, MCP servers, the prompt patterns that work, and what to delegate. Free.

Enter your work email. We send it straight over, plus a few short notes worth knowing. Unsubscribe any time.

← Back to Open Source Submit your own entry →