PocketFlow
by Community
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
OSS
PocketFlow
Added 1 June 2026
Overview
PocketFlow is an automatic model compression framework written in Python. It helps developers reduce model size and inference latency for AI applications. The tool uses automated techniques to prune, quantize, and distill models.
Best for
Best for
Developers deploying efficiently compressed models to resource‑constrained or latency‑sensitive environments
Use cases
- Compressing deep learning models for mobile or edge deployment
- Reducing inference time in production pipelines
- Shrinking model storage footprint for cloud or embedded systems
Notes
PocketFlow is an automatic model compression framework written in Python. It helps developers reduce model size and inference latency for AI applications. The tool uses automated techniques to prune, quantize, and distill models.
2,912 stars on GitHub. Last updated 2023-03-31.
Use cases
- Compressing deep learning models for mobile or edge deployment
- Reducing inference time in production pipelines
- Shrinking model storage footprint for cloud or embedded systems
Pros
- Automates complex compression workflows
- Open source with a large community of over 2,900 stars
- Python‑based and integrates with major deep learning frameworks
Cons
- Limited to the Python ecosystem
- Compression results can vary by model architecture and task
- Community‑supported with no official vendor backing
Indexed from awesome-llmops and enriched against its public facts.
Pros
- Automates complex compression workflows
- Open source with a large community of over 2,900 stars
- Python‑based and integrates with major deep learning frameworks
Cons
- Limited to the Python ecosystem
- Compression results can vary by model architecture and task
- Community‑supported with no official vendor backing
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
PyTorch
Community
Tensors and Dynamic neural networks in Python with strong GPU acceleration
TensorFlow
Community
An Open Source Machine Learning Framework for Everyone
Keras
Community
Deep Learning for humans