License
BSD-3-Clause
25 entries under this license.
scikit-learn
Community
scikit-learn: machine learning in Python
Best for: Python developers building traditional machine learning pipelines and prototyping models quickly.
Weaviate
Community
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance an
Best for: Teams building production search systems who need open-source control and can manage infrastructure.
Jupyter Notebooks
Community
Jupyter Interactive Notebook
Best for: Data scientists and researchers who need interactive exploration with reproducible documentation
Triton Server (TRTIS)
Community
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Best for: Teams deploying large-scale inference services that need high throughput and multi-framework support.
auto-sklearn
Community
Automated Machine Learning with scikit-learn
Best for: Data scientists and ML engineers who need an automated baseline for tabular classification and regression tasks using scikit-learn
FeatureTools
Community
An open source python library for automated feature engineering
Best for: Data scientists and ML engineers working with structured tabular data
torchtune
Community
PyTorch native post-training library
Best for: Developers already working in PyTorch who need a lightweight, modular library for fine-tuning and adapting large models.
torchtitan
Community
A PyTorch native platform for training generative AI models
Best for: Teams using PyTorch to train custom generative AI models at scale
Meta Lingua
Community
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Best for: Researchers and engineers who need a minimal, modifiable LLM training codebase for experimentation.
Jupyter AI
Various
An open source extension that connects AI agents to computational notebooks in JupyterLab.
Best for: Data scientists and researchers who want to augment Jupyter notebooks with AI assistance
TurboPilot
Various
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU
Best for: Developers who want private, CPU-based code completion with open source models
CodeT5
Community
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
Best for: Developers and researchers needing an open-source model for code comprehension and generation tasks
scikit-optimize(skopt)
Community
Sequential model-based optimization with a scipy.optimize interface
Best for: Python developers who need a simple, scipy-compatible optimizer for moderate-scale hyperparameter or parameter tuning
SymbolicAI
Various
A neurosymbolic perspective on LLMs
Best for: Developers building applications that need structured, logical reasoning from LLMs
isaacphi/mcp-language-server
Various
mcp-language-server gives MCP enabled clients access semantic tools like get definition, references, rename, and diagnostics.
Best for: Developers building MCP-based IDEs or automating code analysis via AI agents
datalayer/jupyter-mcp-server
Various
🪐 🔧 Model Context Protocol (MCP) Server for Jupyter.
Best for: Developers building AI agents that need to interact with Jupyter notebooks
EvalML
Community
EvalML is an AutoML library written in python.
Best for: Data scientists who want to rapidly prototype and compare models without writing extensive code.
FEDOT
Community
Automated modeling and machine learning framework FEDOT
Best for: Data scientists and researchers who need automated model composition and structure optimization.
HpBandSter
Community
a distributed Hyperband implementation on Steroids
Best for: Researchers and engineers running distributed hyperparameter optimization with limited compute budgets
RoBO
Community
RoBO: a Robust Bayesian Optimization framework
Best for: Researchers and developers experimenting with Bayesian optimization techniques
Upgini
Community
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, in
Best for: Data scientists and ML engineers who need to quickly augment datasets with external features to improve model performance
intruder-io/intruder-mcp
Various
An MCP server to let AI agents control Intruder
Best for: Developers and security teams using Intruder who want to automate scanning workflows through AI agents
stadiamaps/stadiamaps-mcp-server-ts
Various
A TypeScript MCP server for interacting with the Stadia Maps APIs
Best for: Developers building location-aware AI assistants or agents using the Model Context Protocol
hmk/box-mcp-server
Various
A Box model context protocol server to search, read and access files
Best for: Developers who want to connect their Box storage to MCP-compatible AI agents
rust-works/omni-dev
Various
AI-powered git commit rewriter, PR generator, and MCP server for Jira, Confluence, and Datadog. Single Rust binary.
Best for: Developers who want a lightweight, all-in-one CLI for commit rewriting and PR generation with Atlassian and Datadog integration