Enterprise DNA

Open source alternatives to Megatron-LM

7 open-source alternatives to Megatron-LM in the index, ranked by GitHub stars and freshness.

OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 2d ago
open-source

Best for: Teams training large models who need to maximize GPU efficiency and scale across multiple devices.

OSS Framework medium

Colossal-AI

Community

Making large AI models cheaper, faster and more accessible

★ 41,382 updated 9d ago
open-source

Best for: Teams with access to multiple GPUs who are training large models and need to optimize resource efficiency.

OSS Framework medium

torchtitan

Community

A PyTorch native platform for training generative AI models

★ 5,394 updated 2d ago
open-source

Best for: Teams using PyTorch to train custom generative AI models at scale.

OSS Framework medium

nanotron

Community

Minimalistic large language model 3D-parallelism training

★ 2,705 updated 8d ago
open-source

Best for: Researchers and engineers who need a simple, hackable framework for distributed LLM training experiments.

OSS Framework medium

maxtext

Community

A simple, performant and scalable Jax LLM!

★ 2,303 updated 2d ago
open-source

Best for: Developers already using JAX who need a streamlined, scalable LLM training framework.

OSS Framework medium

BMTrain

Community

Efficient Training (including pre-training and fine-tuning) for Big Models

★ 624 updated 1mo ago
open-source

Best for: Developers and researchers training or fine-tuning large models who need a specialized, efficiency-focused Python framework.

OSS Framework medium

OLMo: Accelerating the Science of Language Models

Community

Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have be…

open-source

Best for: Researchers and developers who need transparent, open language models.