O Open Source Observability medium

auto-sklearn

by Community

Automated Machine Learning with scikit-learn

Visit Community View repo Submit your build →

OSS

auto-sklearn

Added 1 June 2026

#automated-machine-learning #automl #bayesian-optimization #hyperparameter-optimization #hyperparameter-search #hyperparameter-tuning #meta-learning #metalearning

Overview

auto-sklearn is an automated machine learning toolkit that extends scikit-learn with automatic model selection and hyperparameter optimization. It uses Bayesian optimization and meta-learning to efficiently search the space of scikit-learn classifiers and regressors, then builds an ensemble of the best performing models.

Best for

Best for
Data scientists and ML engineers who need an automated baseline for tabular classification and regression tasks using scikit-learn

Use cases

Quickly finding a strong baseline classifier or regressor on a new tabular dataset
Automating hyperparameter tuning for scikit-learn pipelines in research or production
Comparing AutoML performance against manually tuned scikit-learn models

Notes

8,102 stars on GitHub. Last updated 2026-04-21. Licensed BSD-3-Clause.

Use cases

Quickly finding a strong baseline classifier or regressor on a new tabular dataset
Automating hyperparameter tuning for scikit-learn pipelines in research or production
Comparing AutoML performance against manually tuned scikit-learn models

Pros

Proven AutoML algorithm with strong benchmark results on tabular data
Seamless integration with the scikit-learn ecosystem and its API
Meta-learning warm-starts the search using prior dataset performance

Cons

Can be computationally expensive and slow for large datasets without sufficient resources
Limited to scikit-learn estimators; no support for deep learning or gradient boosting outside XGBoost/LightGBM wrappers
Ensembling step increases model size and prediction latency

Indexed from awesome-llmops and enriched against its public facts.

Pros

Proven AutoML algorithm with strong benchmark results on tabular data
Seamless integration with the scikit-learn ecosystem and its API
Meta-learning warm-starts the search using prior dataset performance

Cons

Can be computationally expensive and slow for large datasets without sufficient resources
Limited to scikit-learn estimators; no support for deep learning or gradient boosting outside XGBoost/LightGBM wrappers
Ensembling step increases model size and prediction latency

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Uses1entry

O OSS Obs medium

scikit-learn

Community

scikit-learn: machine learning in Python

★ 66,218 updated 1mo ago

Built with1entry

O OSS Obs medium

scikit-learn

Community

scikit-learn: machine learning in Python

★ 66,218 updated 1mo ago

Free 27-page guide

Get the free Developer’s Field Guide

A 27-page field guide to the AI coding workflow with Claude. Claude Code, MCP servers, the prompt patterns that work, and what to delegate. Free.

Enter your work email. We send it straight over, plus a few short notes worth knowing. Unsubscribe any time.

← Back to Open Source Submit your own entry →