LightGBM

by Community

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other

Added 1 June 2026

#data-mining #decision-trees #distributed #gbdt #gbm #gbrt #gradient-boosting #kaggle

Overview

LightGBM is a gradient boosting framework written in C++ that trains decision tree ensembles for classification, regression, and ranking tasks. It uses leaf-wise tree growth and histogram-based learning to achieve fast training on large datasets with lower memory overhead than traditional gradient boosting.
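
To make the histogram idea concrete, here is a minimal sketch of histogram-based split finding in plain Python. It is an illustration under stated assumptions, not LightGBM's implementation: the function names `bin_feature` and `best_histogram_split` are hypothetical, the bins are equal-width for simplicity, and the gain formula assumes squared-error loss with unit hessians.

```python
from typing import List, Tuple


def bin_feature(values: List[float], n_bins: int) -> List[int]:
    """Map raw feature values to equal-width bin indices in [0, n_bins)."""
    lo, hi = min(values), max(values)
    # Fall back to a width of 1.0 when the feature is constant.
    width = (hi - lo) / n_bins or 1.0
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def best_histogram_split(
    bins: List[int], gradients: List[float], n_bins: int
) -> Tuple[int, float]:
    """Find the best split by scanning bin boundaries, not raw values.

    Assumes unit hessians, so the gain of a split is
    G_left^2 / n_left + G_right^2 / n_right - G_total^2 / n_total.
    Returns (bin index, gain); bin index is -1 if no split improves the gain.
    """
    # One pass over the rows builds the per-bin gradient sums and counts.
    grad_hist = [0.0] * n_bins
    count_hist = [0] * n_bins
    for b, g in zip(bins, gradients):
        grad_hist[b] += g
        count_hist[b] += 1

    total_g, total_n = sum(grad_hist), sum(count_hist)
    best_bin, best_gain = -1, 0.0
    left_g, left_n = 0.0, 0
    # A split at bin b sends bins <= b to the left child.
    for b in range(n_bins - 1):
        left_g += grad_hist[b]
        left_n += count_hist[b]
        right_n = total_n - left_n
        if left_n == 0 or right_n == 0:
            continue
        right_g = total_g - left_g
        gain = (
            left_g ** 2 / left_n
            + right_g ** 2 / right_n
            - total_g ** 2 / total_n
        )
        if gain > best_gain:
            best_bin, best_gain = b, gain
    return best_bin, best_gain
```

After the single pass that fills the histogram, the split search costs one step per bin rather than one per distinct feature value, and the bin indices fit in small integers. That is the general reason histogram methods tend to save both time and memory.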

Best for

Data scientists building production ML systems on large tabular datasets where training speed and memory efficiency matter.

Use cases

Training classification models on tabular data at scale
Building ranking systems for search and recommendation
Rapid prototyping of gradient boosting pipelines

Notes

18,416 stars on GitHub. Last updated 2026-06-01. Licensed MIT.

Pros

Often trains faster than XGBoost on large datasets, though the gap depends on the data and settings
Lower memory consumption through histogram-based learning
Supports distributed training across multiple machines

Cons

Leaf-wise growth can overfit on small datasets without careful tuning
Steeper learning curve for hyperparameter optimization compared to simpler models
Less mature ecosystem and fewer pre-built integrations than XGBoost
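
The overfitting risk from leaf-wise growth is usually handled by constraining tree complexity. The sketch below builds a conservative starting configuration. The parameter names (`num_leaves`, `max_depth`, `min_data_in_leaf`, `learning_rate`, `feature_fraction`, `lambda_l2`) are LightGBM parameters, but the helper `conservative_params` and the specific values and heuristics it picks are assumptions for illustration, not recommended defaults.

```python
def conservative_params(max_depth: int = 6, n_rows: int = 1000) -> dict:
    """Build a hypothetical LightGBM config that limits leaf-wise growth."""
    # Keep num_leaves below 2 ** max_depth so a leaf-wise tree cannot
    # become as complex as a full depth-wise tree of the same depth.
    num_leaves = min(31, 2 ** max_depth - 1)
    # Assumed heuristic: each leaf must hold at least 2% of the rows,
    # and never fewer than 20.
    min_data_in_leaf = max(20, n_rows // 50)
    return {
        "objective": "binary",
        "max_depth": max_depth,
        "num_leaves": num_leaves,
        "min_data_in_leaf": min_data_in_leaf,
        "learning_rate": 0.05,    # illustrative value
        "feature_fraction": 0.8,  # sample 80% of features per tree
        "lambda_l2": 1.0,         # L2 regularization on leaf values
    }
```

A dictionary like this would typically be passed as the `params` argument of `lightgbm.train` along with a `lightgbm.Dataset`, then tuned against a validation set rather than used as-is.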

Indexed from awesome-llmops and enriched against its public facts.

Pairs with

Other entries in the index that connect to this one.

Alternative to (1 entry)

XGBoost

Community

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and D

★ 28,431 updated 1mo ago

Used by (2 entries)

Deepchecks

Community

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test

★ 4,017 updated 6mo ago

FLAML

Community

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

★ 4,360 updated 1mo ago

Pairs with (7 entries)

FeatureTools

Community

An open source python library for automated feature engineering

★ 7,655 updated 5mo ago

Hyperopt

Community

Distributed Asynchronous Hyperparameter Optimization in Python

★ 7,576 updated 1mo ago

MOE

Community

A global, black box optimization engine for real world metric optimization.

★ 1,320 updated 3y ago

NNI

Community

An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

★ 14,352 updated 2y ago

scikit-optimize(skopt)

Community

Sequential model-based optimization with a scipy.optimize interface

★ 2,826 updated 2y ago

Spearmint

Community

Spearmint Bayesian optimization codebase

★ 1,568 updated 6y ago

XGBoost

Community

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and D

★ 28,431 updated 1mo ago
