Enterprise DNA
Directories / Self-hosted

Directories by Enterprise DNA

Self-hosted AI tools

Run it on your own infra. No seat pricing, no data leaving your network, no vendor that can sunset you. Every tool here is open-source or explicitly self-hostable.

3185 self-hostable entries in the index.

O OSS Obs medium

TensorFlow

Community

An Open Source Machine Learning Framework for Everyone

★ 195,356 updated 2d ago
open-source

Best for: Teams building production ML systems that need cross-platform deployment and performance optimization

O OSS Framework medium

AutoGPT

Community

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

★ 184,701 updated 2d ago
open-source

Best for: Developers exploring autonomous agent architectures and prototyping experimental workflows

O OSS Framework medium

ollama

Community

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

★ 172,846 updated 2d ago
open-source

Best for: Developers building local-first applications or prototyping with open-source LLMs without cloud costs

O OSS Framework medium

Awesome ChatGPT Prompts

Community

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

★ 163,161 updated 2d ago
open-source

Best for: Teams wanting a searchable, shareable prompt reference library they can self-host and customize

O OSS Orchestration medium

Langflow

Community

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

★ 149,019 updated 2d ago
open-source

Best for: Developers building experimental or production LLM workflows who want visual design with code flexibility

O OSS Framework medium

Dify

Community

Production-ready platform for agentic workflow development.

★ 143,435 updated 2d ago
open-source

Best for: Teams building production LLM agents who want open-source control and visual workflow design.

O OSS Framework medium

LangChain

Community

The agent engineering platform.

★ 138,234 updated 2d ago
open-source

Best for: Python developers building prototype or production LLM applications that need to orchestrate multiple tools and data sources

M MCP Dev low

microsoft/markitdown

Various

Python tool for converting files and office documents to Markdown.

★ 138,078 updated 8d ago
open-source

Best for: Developers building document pipeline tools or migrating content to Markdown-based systems

O OSS Framework medium

llama.cpp

Community

LLM inference in C/C++

★ 114,160 updated 2d ago
open-source

Best for: Developers building privacy-first or offline-capable applications with constrained hardware

O OSS Obs medium

whisper

Community

Robust Speech Recognition via Large-Scale Weak Supervision

★ 101,156 updated 1mo ago
open-source

Best for: Developers building privacy-first or offline-capable voice features with multilingual requirements

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 2d ago
open-source

Best for: ML researchers and engineers building custom neural network architectures with GPU training needs

O OSS Framework medium

DeepSeek-R1

Community

First-generation reasoning models from DeepSeek.

★ 92,010 updated 11mo ago
open-source

Best for: Developers building open-source applications needing interpretable multi-step reasoning without vendor lock-in

O OSS Framework medium

vLLM

Community

A high-throughput and memory-efficient inference and serving engine for LLMs

★ 81,619 updated 2d ago
open-source

Best for: Teams building production LLM APIs and services that need to maximize throughput and minimize latency under concurrent load.

O OSS Framework medium

llm-course

Community

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

★ 79,792 updated 3mo ago
open-source

Best for: Developers new to LLMs seeking structured, hands-on learning without infrastructure setup

M MCP Dev low

netdata/netdata#Netdata

Various

The fastest path to AI-powered full stack observability, even for lean teams.

★ 79,017 updated 2d ago
open-source

Best for: DevOps teams and developers needing lightweight, real-time infrastructure monitoring without heavy agent overhead

O OSS Orchestration medium

Lobe Chat

Community

🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire AI team.

★ 78,069 updated 2d ago
open-source

Best for: Teams building multi-agent systems who need open-source orchestration and want to avoid vendor lock-in

O OSS Obs medium

code server

Community

VS Code in the browser

★ 77,785 updated 5d ago
open-source

Best for: Teams needing consistent remote development environments or developers working across multiple machines

O OSS Obs medium

stable-diffusion

Community

A latent text-to-image diffusion model

★ 73,065 updated 1y ago
open-source

Best for: Developers building image generation features who need local control and can manage infrastructure requirements

O OSS Obs medium

Docker

Community

The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems

★ 71,617 updated 5d ago
open-source

Best for: Teams building microservices or needing reproducible, portable application deployment

O OSS Orchestration medium

MetaGPT

Community

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

★ 68,466 updated 4mo ago
open-source

Best for: Developers building proof-of-concepts or prototypes who want AI agents to handle multiple development stages in parallel.

O OSS Obs medium

scikit-learn

Community

scikit-learn: machine learning in Python

★ 66,218 updated 2d ago
open-source

Best for: Python developers building traditional machine learning pipelines and prototyping models quickly.

O OSS Framework medium

unslothai

Community

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

★ 65,515 updated 2d ago
open-source

Best for: Developers who want to fine-tune or experiment with open models locally without cloud costs.

O OSS Obs medium

Keras

Community

Deep Learning for humans

★ 64,079 updated 2d ago
open-source

Best for: Python developers building standard deep learning models who prioritize development speed over maximum performance optimization

O OSS Orchestration medium

Anything LLM

Community

The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.

★ 60,905 updated 2d ago
open-source

Best for: Developers building privacy-sensitive LLM applications who can run compute locally and want to avoid vendor lock-in.

O OSS Obs medium

LLMApp

Community

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, re

★ 59,487 updated 4mo ago
open-source

Best for: Teams building enterprise search or RAG systems that need live data synchronization without custom connector development.

O OSS Framework medium

Embedchain

Community

Universal memory layer for AI Agents

★ 57,321 updated 2d ago
open-source

Best for: Python developers building prototype or early-stage agents that need document retrieval without managing vector infrastructure directly

O OSS Orchestration medium

Private GPT

Community

Interact with your documents using the power of GPT, 100% privately, no data leaks

★ 57,218 updated 3mo ago
open-source

Best for: Teams handling confidential documents who need privacy guarantees over speed

M MCP Dev low

upstash/context7

Various

Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors

★ 56,548 updated 2d ago
open-source

Best for: Teams using AI code editors who need their LLMs to reference accurate, current codebase documentation without hallucination.

O OSS Obs medium

segment-anything (SAM)

Community

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to

★ 54,274 updated 1y ago
open-source

Best for: Developers building image annotation tools, content moderation systems, or computer vision applications needing zero-shot segmentation.

O OSS Orchestration medium

Flowise

Community

Build AI Agents, Visually

★ 53,254 updated 4d ago
open-source

Best for: Teams building AI agents who want visual composition without writing orchestration code

O OSS Obs medium

LiteLLM 🚅

Community

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, Vertex

★ 48,950 updated 2d ago
open-source

Best for: Teams managing multiple LLM providers or needing cost visibility across model calls

O OSS Obs medium

Milvus

Community

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

★ 44,579 updated 2d ago
open-source

Best for: Teams building search or recommendation features who need to manage vector data at scale and prefer open-source control over managed services.

O OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 2d ago
open-source

Best for: Teams training large models who need to maximize GPU efficiency and scale across multiple devices.

O OSS Framework medium

Colossal-AI

Community

Making large AI models cheaper, faster and more accessible

★ 41,382 updated 9d ago
open-source

Best for: Teams training large models who have access to multiple GPUs and need to optimize resource efficiency

O OSS Obs medium

GLM-6B (ChatGLM)

Community

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

★ 41,068 updated 1y ago
open-source

Best for: Developers building Chinese-English chatbots who need local deployment and cost control over quality.

O OSS Orchestration medium

Phidata

Community

Build, run, and manage agent platforms.

★ 40,451 updated 2d ago
open-source

Best for: Python developers building multi-agent systems who want structured orchestration without building from scratch

O OSS Framework medium

FastChat

Community

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

★ 39,479 updated 1mo ago
open-source

Best for: Researchers and ML engineers building custom LLM applications who need training, serving, and evaluation in one framework.

M MCP Dev low

mindsdb/mindsdb

Various

Platform dedicated to building an open foundation for applied Artificial Intelligence, designed for people seeking production-ready AI systems they can truly control, extend and de

★ 39,231 updated 6d ago
open-source

Best for: Data engineers and analysts who want to build ML pipelines without leaving SQL or their existing databases

O OSS Orchestration medium

Quiver

Community

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama.

★ 39,173 updated 10mo ago
open-source

Best for: Python developers building RAG features into existing applications who want to avoid vendor lock-in and infrastructure boilerplate.

O OSS Obs medium

bark

Community

🔊 Text-Prompted Generative Audio Model

★ 39,142 updated 1y ago
open-source

Best for: Developers building offline audio generation features or prototyping multilingual voice applications

O OSS Framework medium

Langchain-Chatchat

Community

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like

★ 38,121 updated 6mo ago
open-source

Best for: Teams building private knowledge systems with local LLMs who prioritize data sovereignty over ease of deployment

O OSS Orchestration medium

AgentGPT

Community

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

★ 36,162 updated 1y ago
open-source

Best for: Developers prototyping autonomous agent workflows and learning agent orchestration patterns

M MCP Dev low

agent-infra/mcp-server-browser

Various

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

★ 35,863 updated 16d ago
open-source

Best for: Developers building AI agents that need reliable, protocol-standardized web automation capabilities

O OSS Obs medium

Jax

Community

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

★ 35,725 updated 2d ago
open-source

Best for: Researchers and engineers building custom numerical algorithms that need automatic differentiation and hardware acceleration.

O OSS Obs medium

Caffe

Community

Caffe: a fast open framework for deep learning.

★ 34,585 updated 1y ago
open-source

Best for: Teams building production computer vision systems who prioritize inference speed and have existing Caffe expertise

O OSS Orchestration medium

GPT Pilot

Community

The first real AI developer

★ 33,752 updated 1mo ago
open-source

Best for: Developers building prototypes or automating routine code generation tasks who can validate and refine AI output

O OSS Obs medium

tabby

Community

Self-hosted AI coding assistant

★ 33,554 updated 3mo ago
open-source

Best for: Teams prioritizing data privacy and control over ease of deployment

O OSS Obs medium

Continue

Community

⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI

★ 33,482 updated 2d ago
open-source

Best for: Teams prioritizing code privacy and wanting AI assistance integrated into existing IDE workflows and CI systems

O OSS Obs medium

netron

Community

Visualizer for neural network, deep learning and machine learning models

★ 33,013 updated 2d ago
open-source

Best for: ML engineers and researchers who need to quickly inspect and understand model architectures across different frameworks

O OSS Orchestration medium

CopilotKit

Community

The Frontend Stack for Agents & Generative UI. React + Angular. Makers of the AG-UI Protocol

★ 31,886 updated 2d ago
open-source

Best for: Frontend developers building React or Angular apps that need embedded AI agents and dynamic UI generation

O OSS Obs medium

Qdrant

Community

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

★ 31,735 updated 2d ago
open-source

Best for: Builders needing fast, scalable vector search for embeddings in production AI systems

O OSS Obs medium

PyTorch Lightning

Community

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

★ 31,168 updated 2d ago
open-source

Best for: Teams training models at scale who want to avoid rewriting training code for different hardware configurations

O OSS Framework medium

SGLang

Community

SGLang is a high-performance serving framework for large language models and multimodal models.

★ 28,885 updated 2d ago
open-source

Best for: Teams building production LLM services who need performance-optimized serving infrastructure

O OSS Obs medium

XGBoost

Community

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and D

★ 28,431 updated 6d ago
open-source

Best for: Data scientists and ML engineers building production models on structured datasets.

O OSS Obs medium

Chroma

Community

Search infrastructure for AI

★ 28,173 updated 2d ago
open-source

Best for: Developers building RAG systems and semantic search features who want a straightforward, open-source vector store

O OSS Orchestration medium

GPT Researcher

Community

An autonomous agent that conducts deep research on any data using any LLM providers

★ 27,439 updated 6d ago
open-source

Best for: Developers building research automation tools who need flexible LLM provider switching and don't mind managing Python infrastructure.

M MCP Dev low

eyaltoledano/claude-task-master

Various

An AI-powered task-management system you can drop into Cursor, Lovable, Windsurf, Roo, and others.

★ 27,304 updated 1mo ago
open-source

Best for: Developers using Claude-based code editors who need persistent task tracking across sessions

O OSS Framework medium

open-r1

Community

Fully open reproduction of DeepSeek-R1

★ 26,029 updated 2mo ago
open-source

Best for: Researchers and builders needing transparent, locally-controlled reasoning models

O OSS Orchestration medium

AgentScope

Community

Build and run agents you can see, understand and trust.

★ 25,983 updated 2d ago
open-source

Best for: Teams building multi-agent systems who prioritize understanding and debugging agent interactions over rapid deployment.

M MCP Dev low

FastMCP

Various

🚀 The fast, Pythonic way to build MCP servers and clients.

★ 25,425 updated 2d ago
open-source

Best for: Python developers building LLM integrations who want to expose tools and data sources via the Model Context Protocol without writing boilerplate.

M MCP Dev low

oraios/serena

Various

A powerful MCP toolkit for coding, providing semantic retrieval and editing capabilities - the IDE for your agent

★ 24,823 updated 2d ago
open-source

Best for: Developers building AI agents that need to understand and modify code programmatically

O OSS Orchestration medium

AI

Community

The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents

★ 24,590 updated 2d ago
open-source

Best for: TypeScript developers building AI features in Next.js applications who want lightweight, unopinionated orchestration

O OSS Obs medium

LeRobot

Community

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

★ 24,565 updated 2d ago
open-source

Best for: Researchers and engineers building robot learning systems who want accessible tooling and pre-trained baselines.

O OSS Obs medium

PaddlePaddle

Community

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

★ 23,930 updated 2d ago
open-source

Best for: Teams building large-scale production ML systems who need distributed training and cross-platform deployment out of the box

O OSS Obs medium

NCNN

Community

ncnn is a high-performance neural network inference framework optimized for the mobile platform

★ 23,318 updated 4d ago
open-source

Best for: Mobile and embedded developers building latency-critical inference applications on constrained hardware

O OSS Obs medium

Faster Whisper

Community

Faster Whisper transcription with CTranslate2

★ 23,312 updated 6mo ago
open-source

Best for: Developers building production speech-to-text systems where inference speed and resource efficiency matter more than simplicity.

O OSS Orchestration medium

MemGPT

Community

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

★ 23,081 updated 20d ago
open-source

Best for: Developers building conversational or autonomous agents that need to learn and maintain state across extended interactions.

O OSS Obs medium

Dolt

Community

Dolt – Git for Data

★ 22,967 updated 2d ago
open-source

Best for: Teams needing audit trails and collaborative workflows on structured data

O OSS Obs medium

Prefect

Community

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

★ 22,518 updated 2d ago
open-source

Best for: Python teams building production data pipelines who need observability and fault tolerance without heavyweight infrastructure

M MCP Dev low

ahujasid/blender-mcp

Various

🐍 - MCP server for working with Blender

★ 22,223 updated 4mo ago
open-source

Best for: Developers building AI agents that need to generate or modify 3D content in Blender programmatically

O OSS Orchestration medium

Local GPT

Community

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

★ 22,208 updated 2mo ago
open-source

Best for: Teams handling confidential documents who need offline document chat without external dependencies

O OSS Framework medium

promptfoo

Community

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative config

★ 21,784 updated 2d ago
open-source

Best for: Teams building LLM applications who need systematic prompt validation and security testing before deployment

O OSS Framework medium

veRL

Community

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

★ 21,691 updated 2d ago
open-source

Best for: ML engineers building custom RL post-training pipelines for LLMs at scale

O OSS Obs medium

pgvector

Community

Open-source vector similarity search for Postgres

★ 21,551 updated 4d ago
open-source

Best for: Teams already using Postgres who want vector search without adding infrastructure

O OSS Framework medium

Guidance

Community

A guidance language for controlling large language models.

★ 21,486 updated 13d ago
open-source

Best for: Developers building production systems that need deterministic, schema-compliant LLM outputs

O OSS Obs medium

peft

Community

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

★ 21,218 updated 2d ago
open-source

Best for: Developers adapting large language models on resource-constrained hardware or managing multiple task-specific variants efficiently.

O OSS Obs medium

Apache MXNet

Community

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

★ 20,809 updated 2y ago
open-source

Best for: Teams building distributed training pipelines or mobile ML applications who need multi-language flexibility

O OSS Obs medium

Awesome Production Machine Learning

Community

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

★ 20,585 updated 2d ago
open-source

Best for: Teams building ML infrastructure who need a starting point for evaluating open source ops tools

O OSS Orchestration medium

AI Chatbot

Community

A full-featured, hackable Next.js AI chatbot built by Vercel

★ 20,420 updated 16d ago
open-source

Best for: Developers building AI chatbots in Next.js who want a working reference implementation to fork and customize

O OSS Obs medium

Candle

Community

Minimalist ML framework for Rust

★ 20,387 updated 2d ago
open-source

Best for: Rust developers building production ML systems where safety and performance matter more than rapid prototyping

O OSS Framework medium

Opik

Community

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

★ 19,417 updated 2d ago
open-source

Best for: Python developers building production LLM systems who need observability and systematic evaluation.

O OSS Orchestration medium

SWE Agent

Community

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [Ne

★ 19,387 updated 3d ago
open-source

Best for: Teams wanting to automate routine bug fixes and explore LM-driven code generation without vendor lock-in

M MCP Dev low

mediar-ai/screenpipe

Various

YC (S26) | AI that knows what you've seen, said, or heard. Records everything you do, say, hear 24/7, local, private, secure

★ 19,049 updated 2d ago
open-source

Best for: Developers building local-first AI agents that need rich context from user activity without cloud transmission.

O OSS Orchestration medium

DB GPT

Community

open-source agentic AI data assistant for the next generation of AI + Data products.

★ 18,886 updated 7d ago
open-source

Best for: Teams building internal AI data assistants who can maintain Python codebases and want to avoid vendor lock-in

O OSS Framework medium

OpenAI Evals

Community

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

★ 18,584 updated 1mo ago
open-source

Best for: Teams building LLM applications who need systematic, reproducible evaluation workflows

O OSS Obs medium

LightGBM

Community

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other

★ 18,416 updated 2d ago
open-source

Best for: Data scientists building production ML systems on large tabular datasets where training speed and memory efficiency matter.

O OSS Orchestration medium

DocsGPT

Community

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

★ 17,914 updated 2d ago
open-source

Best for: Teams building internal document search and analysis tools who want open-source control and can manage their own infrastructure.

M MCP Dev low

topoteretes/cognee

Various

Memory platform for AI Agents in 6 lines of code

★ 17,624 updated 2d ago
open-source

Best for: Python developers building multi-turn agents that need persistent, queryable memory without database infrastructure overhead

O OSS Orchestration medium

SuperAGI

Community

SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

★ 17,554 updated 1y ago
open-source

Best for: Teams building production autonomous agents who want open source control and Python-first development

M MCP Dev low

pydantic/pydantic-ai/mcp-run-python

Various

AI Agent Framework, the Pydantic way

★ 17,445 updated 2d ago
open-source

Best for: Python developers building type-safe AI agents with strict validation requirements

O OSS Framework medium

NeMo Framework

Community

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech

★ 17,285 updated 2d ago
open-source

Best for: Researchers and ML engineers building custom LLMs, speech systems, or multimodal models who need low-level control and scalability.

O OSS Obs medium

Argo Workflows

Community

Workflow Engine for Kubernetes

★ 16,728 updated 2d ago
open-source

Best for: Teams running workloads on Kubernetes who need declarative, auditable job orchestration without external services

O OSS Framework medium

Megatron-LM

Community

Ongoing research training transformer models at scale

★ 16,545 updated 2d ago
open-source

Best for: ML engineers training large transformer models who need production-grade distributed training infrastructure

O OSS Obs medium

Weaviate

Community

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance an

★ 16,258 updated 2d ago
open-source

Best for: Teams building production search systems who need open-source control and can manage infrastructure.

O OSS Obs medium

Kubeflow

Community

Machine Learning Toolkit for Kubernetes

★ 15,700 updated 10d ago
open-source

Best for: Teams with Kubernetes infrastructure who need to standardize ML workflows across on-prem or multi-cloud environments

O OSS Obs medium

DVC

Community

🦉 Data Versioning and ML Experiments

★ 15,643 updated 2d ago
open-source

Best for: ML teams building reproducible pipelines who need Git-like versioning for data and models

O OSS Obs medium

ChatGLM2-6B

Community

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

★ 15,576 updated 1y ago
open-source

Best for: Developers building Chinese-English applications who need local control and want to avoid cloud API dependencies.

O OSS Orchestration medium

Plandex

Community

Open source AI coding agent. Designed for large projects and real world tasks.

★ 15,434 updated 8mo ago
open-source

Best for: Teams building large features or refactoring substantial codebases who want transparent, plan-driven AI assistance

M MCP Dev low

googleapis/genai-toolbox

Various

MCP Toolbox for Databases is an open source MCP server for databases.

★ 15,425 updated 2d ago
open-source

Best for: Developers building AI agents that need direct database query capabilities via MCP

M MCP Dev low

vectorize-io/hindsight

Various

Hindsight: Agent Memory That Learns

★ 15,407 updated 2d ago
open-source

Best for: Developers building stateful agents that need to learn and adapt from experience over multiple interactions

O OSS Framework medium

MNN-LLM

Community

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

★ 15,353 updated 2d ago
open-source

Best for: Developers building production on-device LLM and edge AI applications where latency and resource efficiency are critical.

M MCP Dev low

GLips/Figma-Context-MCP

Various

MCP server to provide Figma layout information to AI coding agents like Cursor

★ 14,946 updated 7d ago
open-source

Best for: Teams using Figma and AI coding assistants who want to automate component generation from design files.

O OSS Orchestration medium

Llmware

Community

Unified framework for building enterprise RAG pipelines with small, specialized models

★ 14,848 updated 17d ago
open-source

Best for: Teams building enterprise document search and QA systems who want to optimize costs by using specialized models instead of large LLMs.

O OSS Obs medium

fauxpilot

Community

FauxPilot - an open-source alternative to GitHub Copilot server

★ 14,733 updated 2y ago
open-source

Best for: Developers who need Copilot-like completion on private code or want to avoid cloud-based code submission

O OSS Orchestration medium

Botpress

Community

The open-source hub to build & deploy GPT/LLM Agents ⚡️

★ 14,719 updated 2d ago
open-source

Best for: Teams building production LLM agents who want open-source control and TypeScript integration

O OSS Obs medium

Horovod

Community

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

★ 14,696 updated 6mo ago
open-source

Best for: ML engineers training large models who need to scale across multiple GPUs or nodes without rewriting training logic

O OSS Obs medium

NNI

Community

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

★ 14,352 updated 1y ago
open-source

Best for: ML engineers and researchers who need flexible, self-hosted AutoML for hyperparameter tuning and neural architecture search

M MCP Dev low

mickaelkerjean/filestash

Various

:filefolder: File Management Platform / Universal Data Access Layer (without FUSE)

★ 14,271 updated 2d ago
open-source

Best for: Developers building multi-cloud file management systems or needing unified access to heterogeneous storage without FUSE overhead

O OSS Framework medium

Ragas

Community

Supercharge Your LLM Application Evaluations 🚀

★ 14,186 updated 3mo ago
open-source

Best for: Teams building RAG systems who need continuous evaluation without manual labeling

O OSS Obs medium

visenger/awesome-mlops

Community

A curated list of references for MLOps

★ 13,923 updated 1y ago
open-source

Best for: Teams building or evaluating MLOps infrastructure who need a structured starting point for tool discovery

O OSS Framework medium

Outlines

Community

Structured Outputs

★ 13,914 updated 16d ago
open-source

Best for: Developers building applications that need reliable structured data extraction from LLMs without validation failures

M MCP Dev low

skill-seekers/Skill_Seekers

Various

Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection

★ 13,877 updated 3d ago
open-source

Best for: Developers building Claude agents who need to rapidly convert existing documentation into reusable skills

O OSS Orchestration medium

AI Scientist

Community

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

★ 13,864 updated 5mo ago
open-source

Best for: Researchers and ML engineers exploring automated workflows for hypothesis-driven scientific discovery

O OSS Framework medium

TensorRT-LLM

Community

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NV

★ 13,781 updated 2d ago
open-source

Best for: Teams deploying LLMs at scale on NVIDIA infrastructure who need maximum inference performance.

O OSS Obs medium

JuiceFS

Community

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

★ 13,645 updated 2d ago
open-source

Best for: Teams running distributed workloads on Kubernetes who need shared, cloud-backed storage without rewriting applications

O OSS Obs medium

scalene

Community

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

★ 13,436 updated 3d ago
open-source

Best for: Python developers optimizing computationally intensive or memory-heavy applications who need precise per-line performance visibility.

O OSS Obs medium

TVM

Community

Open Machine Learning Compiler Framework

★ 13,405 updated 2d ago
open-source

Best for: ML engineers deploying models to resource-constrained or heterogeneous hardware environments

O OSS Framework medium

Litgpt

Community

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

★ 13,395 updated 2d ago
open-source

Best for: Teams building or customizing LLMs at scale with access to compute resources

O OSS Framework medium

wechat-chatgpt

Community

Use ChatGPT On Wechat via wechaty

★ 13,240 updated 2y ago
open-source

Best for: Developers building WeChat bots or integrations who want to add conversational AI without building the WeChat connection layer from scratch.

O OSS Obs medium

Jupyter Notebooks

Community

Jupyter Interactive Notebook

★ 13,173 updated 5d ago
open-source

Best for: Data scientists and researchers who need interactive exploration with reproducible documentation

Showing the top 120 by GitHub stars. 3065 more self-hostable entries live across the directories.