O Open Source Frameworks medium

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

by Community

Megatron-LM

Visit Community View repo Submit your build →

OSS

Added 2 June 2026

Overview

Megatron-LM is a framework for training multi-billion parameter language models using model parallelism. It partitions model layers across multiple GPUs to overcome memory limits and enable efficient distributed training of large transformer models.

Best for

Best for
Researchers and engineers training very large transformer-based language models.

Use cases

Training large language models with billions of parameters
Scaling transformer models across multiple GPUs
Implementing model parallelism for deep learning research

Notes

Use cases

Training large language models with billions of parameters
Scaling transformer models across multiple GPUs
Implementing model parallelism for deep learning research

Pros

Enables training of models that exceed single GPU memory
Efficient model parallelism reduces communication overhead
Proven for state-of-the-art language models like GPT-3 sizes

Cons

Requires careful tuning of tensor and pipeline parallelism
Primarily designed for NVIDIA GPUs and CUDA
Steep learning curve for customizing parallelism strategies

Indexed from awesome-llm and enriched against its public facts.

Pros

Enables training of models that exceed single GPU memory
Efficient model parallelism reduces communication overhead
Proven for state-of-the-art language models like GPT-3 sizes

Cons

Requires careful tuning of tensor and pipeline parallelism
Primarily designed for NVIDIA GPUs and CUDA
Steep learning curve for customizing parallelism strategies

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Built with1entry

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 23d ago

Pairs with2entries

O OSS Framework medium

vLLM

Community

A high-throughput and memory-efficient inference and serving engine for LLMs

★ 81,619 updated 23d ago

O OSS Framework medium

TensorRT-LLM

Community

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NV

★ 13,781 updated 23d ago

Alternative to3entries

O OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 23d ago

O OSS Framework medium

Colossal-AI

Community

Making large AI models cheaper, faster and more accessible

★ 41,382 updated 30d ago

O OSS Framework medium

NeMo Framework

Community

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech

★ 17,285 updated 23d ago

← Back to Open Source Submit your own entry →