TensorFlow Model Optimization
by Community
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
OSS
TensorFlow Model Optimization
Added 1 June 2026
Overview
A toolkit for optimizing machine learning models built with Keras and TensorFlow for deployment. It provides techniques such as quantization and pruning to reduce model size and improve inference speed.
Best for
Best for
Developers deploying TensorFlow models to mobile, embedded, or edge devices
Use cases
- Reducing model size for mobile or edge deployment
- Speeding up inference on resource-constrained devices
- Applying post-training quantization to TensorFlow models
Notes
A toolkit for optimizing machine learning models built with Keras and TensorFlow for deployment. It provides techniques such as quantization and pruning to reduce model size and improve inference speed.
1,572 stars on GitHub. Last updated 2026-06-01. Licensed Apache-2.0.
Use cases
- Reducing model size for mobile or edge deployment
- Speeding up inference on resource-constrained devices
- Applying post-training quantization to TensorFlow models
Pros
- Open source with community support
- Integrates directly with TensorFlow and Keras workflows
- Offers both quantization and pruning techniques
Cons
- Limited to TensorFlow and Keras models only
- May require careful tuning to avoid accuracy loss
- Documentation can be sparse for advanced use cases
Indexed from awesome-llmops and enriched against its public facts.
Pros
- Open source with community support
- Integrates directly with TensorFlow and Keras workflows
- Offers both quantization and pruning techniques
Cons
- Limited to TensorFlow and Keras models only
- May require careful tuning to avoid accuracy loss
- Documentation can be sparse for advanced use cases
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.