NeMo Framework
by Community
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech
OSS
NeMo Framework
Added 1 June 2026
Overview
NeMo is a Python framework for building and training large language models, multimodal systems, and speech AI applications. It provides modular components for ASR, TTS, and LLM development with built-in support for distributed training and inference optimization.
Best for
Best for
Researchers and ML engineers building custom LLMs, speech systems, or multimodal models who need low-level control and scalability.
Use cases
- Training custom LLMs from scratch or fine-tuning existing models
- Building speech recognition and text-to-speech systems
- Developing multimodal AI applications combining text and audio
Notes
NeMo is a Python framework for building and training large language models, multimodal systems, and speech AI applications. It provides modular components for ASR, TTS, and LLM development with built-in support for distributed training and inference optimization.
17,285 stars on GitHub. Last updated 2026-06-01. Licensed Apache-2.0.
Use cases
- Training custom LLMs from scratch or fine-tuning existing models
- Building speech recognition and text-to-speech systems
- Developing multimodal AI applications combining text and audio
Pros
- Modular architecture lets you mix and match components for different AI tasks
- Optimized for distributed training and inference at scale
- Strong community adoption with 17k+ GitHub stars
Cons
- Steeper learning curve than higher-level APIs, requires Python expertise
- Primarily designed for research and experimentation rather than production deployment
- Documentation and examples focus on NVIDIA hardware ecosystems
Indexed from awesome-llm and enriched against its public facts.
Pros
- Modular architecture lets you mix and match components for different AI tasks
- Optimized for distributed training and inference at scale
- Strong community adoption with 17k+ GitHub stars
Cons
- Steeper learning curve than higher-level APIs, requires Python expertise
- Primarily designed for research and experimentation rather than production deployment
- Documentation and examples focus on NVIDIA hardware ecosystems
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
Large Language Model Training in 2023
Community
Learn about large language model training with insights on large language model examples, model architecture, and model training guide.
Megatron-LM
Community
Ongoing research training transformer models at scale
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
Community
Microsoft
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Community
The performance of a large language model (LLM) depends heavily on the quality and size of its pretraining dataset. However, the pretraining datasets for state-of-the-art open LL
LaMDA: Language Models for Dialog Applications
Community
2022-01
Megatron-DeepSpeed
Community
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Community
Megatron-LM