Enterprise DNA
Open Source Observability (medium)

Kubeflow

by Community

Machine Learning Toolkit for Kubernetes

Added 1 June 2026

#google-kubernetes-engine #jupyter #kubeflow #kubernetes #machine-learning #minikube #ml #notebook

Overview

Kubeflow is an open-source ML toolkit that runs on Kubernetes, providing components for building and deploying machine learning workflows. It abstracts Kubernetes complexity to let teams define, train, and serve models as containerized pipelines without managing infrastructure directly.

Best for

Teams with Kubernetes infrastructure who need to standardize ML workflows across on-prem or multi-cloud environments

Use cases

  • Orchestrating multi-step training pipelines across distributed clusters
  • Managing model serving and inference at scale on Kubernetes
  • Automating hyperparameter tuning and experiment tracking workflows
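
To make the first use case concrete, here is a minimal sketch of what orchestrating a multi-step pipeline involves: steps declare their dependencies, and a scheduler runs them in an order that respects those dependencies. This is plain Python using the standard library, not Kubeflow's SDK. The step names, the toy step functions, and the helpers `execution_order` and `run_pipeline` are all hypothetical and chosen only for illustration. In a real deployment each step would typically run as its own container on the cluster.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each step maps to the set of steps it depends on.
PIPELINE = {
    "preprocess": set(),
    "train": {"preprocess"},
    "evaluate": {"train"},
    "deploy": {"evaluate"},
}

# Toy stand-ins for real work; each receives the results of its upstream steps.
STEPS = {
    "preprocess": lambda upstream: [1.0, 2.0, 3.0],
    "train": lambda upstream: sum(upstream["preprocess"]) / len(upstream["preprocess"]),
    "evaluate": lambda upstream: abs(upstream["train"] - 2.0) < 1e-9,
    "deploy": lambda upstream: "deployed" if upstream["evaluate"] else "rejected",
}


def execution_order(dependencies):
    """Return step names in an order that respects every dependency."""
    return list(TopologicalSorter(dependencies).static_order())


def run_pipeline(dependencies, steps):
    """Run each step after its dependencies, passing upstream results along."""
    results = {}
    for name in execution_order(dependencies):
        upstream = {dep: results[dep] for dep in dependencies[name]}
        results[name] = steps[name](upstream)
    return results


if __name__ == "__main__":
    print(execution_order(PIPELINE))
    print(run_pipeline(PIPELINE, STEPS)["deploy"])
```

The sketch runs everything sequentially in one process. A cluster-based orchestrator adds what this leaves out, such as running independent steps in parallel, retrying failures, and moving artifacts between containers.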

Notes

15,700 stars on GitHub. Last updated 2026-05-24. Licensed Apache-2.0.

Pros

  • Runs on any Kubernetes cluster, avoiding vendor lock-in
  • Handles distributed training and serving natively
  • Active community with broad ecosystem integration

Cons

  • Requires existing Kubernetes expertise to operate effectively
  • Steep learning curve for teams new to container orchestration
  • Observability tooling is basic compared to managed ML platforms

Indexed from awesome-llmops and enriched with publicly available facts about the project.

Pairs with

Other entries in the index that connect to this one.

Pairs with (23 entries)

  • Argo Workflows (by Community): Workflow Engine for Kubernetes (★ 16,728, updated 2d ago)
  • Awesome Argo (by Community): A curated list of awesome projects and resources related to Argo (a CNCF graduated project) (★ 2,441, updated 3mo ago)
  • Awesome Federated Learning Systems (by Community): Federated Learning Systems Paper List (★ 75, updated 2y ago)
  • Awesome Production Machine Learning (by Community): A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning (★ 20,585, updated 2d ago)
  • Harmonia (by Community): Federated Learning Made Easy (★ 17, updated 5y ago)
  • JuiceFS (by Community): JuiceFS is a distributed POSIX file system built on top of Redis and S3. (★ 13,645, updated 2d ago)
  • Kaito (by Community): Kubernetes AI Toolchain Operator (★ 950, updated 6d ago)
  • Katib (by Community): Automated Machine Learning on Kubernetes (★ 1,685, updated 5d ago)
  • Kedro-Viz (by Community): Visualise your Kedro data and machine-learning pipelines and track your experiments. (★ 749, updated 5d ago)
  • Kserve (by Community): Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes (★ 5,534, updated 2d ago)
  • KubeAI (by Community): AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text. (★ 1,201, updated 2d ago)
  • Kueue (by Community): Kubernetes-native Job Queueing (★ 2,536, updated 2d ago)
  • Maxim AI (by Community): At Maxim AI, we are building the production infrastructure for AI. Maxim’s stack comprising gateway and governance, observability, and evals empowers AI teams to ship agents with...
  • NNI (by Community): An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning. (★ 14,352, updated 1y ago)
  • Primehub (by Community): open-source MLOps platform (★ 410, updated 4mo ago)
  • Puzzlet AI (by Community)
  • Seldon-core (by Community): An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models (★ 4,752, updated 2mo ago)
  • TFServing (by Community): A flexible, high-performance serving system for machine learning models (★ 6,353, updated 6d ago)
  • visenger/awesome-mlops (by Community): A curated list of references for MLOps (★ 13,923, updated 1y ago)
  • Volcano (by Community): A Cloud Native Batch System (Project under CNCF) (★ 5,621, updated 2d ago)
  • Weco Observe (by Community): Build and Optimize your machine learning pipeline with the Weco Platform - based on AIDE ML, the LLM-powered code optimization Agent for Machine Learning Engineering.
  • Yunikorn (by Community): Apache YuniKorn Core (★ 1,014, updated 5d ago)
  • ZenML (by Community): ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io. (★ 5,429, updated 2d ago)

Alternatives (14 entries)

  • Airflow (by Community): Platform created by the community to programmatically author, schedule and monitor workflows.
  • ClearML (by Community): ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution (★ 6,715, updated 3d ago)
  • Determined (by Community): Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch... (★ 3,225, updated 1y ago)
  • dstack (by Community): Open framework for confidential AI (★ 496, updated 2d ago)
  • Flyte (by Community): Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows. (★ 7,056, updated 2d ago)
  • KubeAI (by Community): AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text. (★ 1,201, updated 2d ago)
  • Metaflow (by Community): Build, Manage and Deploy AI/ML Systems (★ 10,111, updated 2d ago)
  • MLRun (by Community): MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environ... (★ 1,670, updated 2d ago)
  • PAI (by Community): Resource scheduling and cluster management for AI (★ 2,687, updated 1y ago)
  • Polyaxon (by Community): Open Source AI Infra & Engineering Control Plane (★ 3,706, updated 5d ago)
  • Prefect (by Community): Prefect is a workflow orchestration framework for building resilient data pipelines in Python. (★ 22,518, updated 2d ago)
  • Starwhale (by Community): an MLOps/LLMOps platform (★ 238, updated 1y ago)
  • VDP (by Community): 🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications (★ 2,313, updated 2d ago)
  • ZenML (by Community): ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io. (★ 5,429, updated 2d ago)