Mesh Tensorflow
by Community
Mesh TensorFlow: Model Parallelism Made Easier
OSS
Mesh Tensorflow
Added 1 June 2026
Overview
Mesh TensorFlow is a framework for model parallelism in TensorFlow. It allows developers to split large neural network models across multiple devices by defining how tensors are partitioned. It provides a domain-specific language for describing distributed layouts.
Best for
Best for
Developers training large transformer models on TPU or GPU clusters
Use cases
- Training models that exceed single-device memory
- Distributing transformer layers across multiple GPUs
- Implementing sharded computation for large-scale neural networks
Notes
Mesh TensorFlow is a framework for model parallelism in TensorFlow. It allows developers to split large neural network models across multiple devices by defining how tensors are partitioned. It provides a domain-specific language for describing distributed layouts.
1,625 stars on GitHub. Last updated 2023-11-17. Licensed Apache-2.0.
Use cases
- Training models that exceed single-device memory
- Distributing transformer layers across multiple GPUs
- Implementing sharded computation for large-scale neural networks
Pros
- Enables training of models too large for one device
- Integrates directly with TensorFlow ecosystem
- Provides explicit control over tensor partitioning
Cons
- Requires manual specification of mesh layouts
- Added complexity compared to data parallelism
- Limited adoption outside Google’s TPU environments
Indexed from awesome-llm and enriched against its public facts.
Pros
- Enables training of models too large for one device
- Integrates directly with TensorFlow ecosystem
- Provides explicit control over tensor partitioning
Cons
- Requires manual specification of mesh layouts
- Added complexity compared to data parallelism
- Limited adoption outside Google's TPU environments
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.