Directories by Enterprise DNA
Self-hosted AI tools
Run it on your own infra. No seat pricing, no data leaving your network, no vendor that can sunset you. Every tool here is open-source or explicitly self-hostable.
3185 self-hostable entries in the index.
TensorFlow
Community
An Open Source Machine Learning Framework for Everyone
Best for: Teams building production ML systems that need cross-platform deployment and performance optimization
AutoGPT
Community
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Best for: Developers exploring autonomous agent architectures and prototyping experimental workflows
ollama
Community
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Best for: Developers building local-first applications or prototyping with open-source LLMs without cloud costs
Awesome ChatGPT Prompts
Community
Formerly known as Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source; self-host for your organization with complete privacy.
Best for: Teams wanting a searchable, shareable prompt reference library they can self-host and customize
Langflow
Community
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Best for: Developers building experimental or production LLM workflows who want visual design with code flexibility
Dify
Community
Production-ready platform for agentic workflow development.
Best for: Teams building production LLM agents who want open-source control and visual workflow design.
LangChain
Community
The agent engineering platform.
Best for: Python developers building prototype or production LLM applications that need to orchestrate multiple tools and data sources
microsoft/markitdown
Various
Python tool for converting files and office documents to Markdown.
Best for: Developers building document pipeline tools or migrating content to Markdown-based systems
llama.cpp
Community
LLM inference in C/C++
Best for: Developers building privacy-first or offline-capable applications with constrained hardware
whisper
Community
Robust Speech Recognition via Large-Scale Weak Supervision
Best for: Developers building privacy-first or offline-capable voice features with multilingual requirements
PyTorch
Community
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Best for: ML researchers and engineers building custom neural network architectures with GPU training needs
DeepSeek-R1
Community
First-generation reasoning models from DeepSeek.
Best for: Developers building open-source applications needing interpretable multi-step reasoning without vendor lock-in
vLLM
Community
A high-throughput and memory-efficient inference and serving engine for LLMs
Best for: Teams building production LLM APIs and services that need to maximize throughput and minimize latency under concurrent load.
llm-course
Community
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Best for: Developers new to LLMs seeking structured, hands-on learning without infrastructure setup
netdata/netdata
Various
The fastest path to AI-powered full stack observability, even for lean teams.
Best for: DevOps teams and developers needing lightweight, real-time infrastructure monitoring without heavy agent overhead
Lobe Chat
Community
🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire AI team.
Best for: Teams building multi-agent systems who need open-source orchestration and want to avoid vendor lock-in
code server
Community
VS Code in the browser
Best for: Teams needing consistent remote development environments or developers working across multiple machines
stable-diffusion
Community
A latent text-to-image diffusion model
Best for: Developers building image generation features who need local control and can manage infrastructure requirements
Docker
Community
The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems
Best for: Teams building microservices or needing reproducible, portable application deployment
MetaGPT
Community
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Best for: Developers building proof-of-concepts or prototypes who want AI agents to handle multiple development stages in parallel.
scikit-learn
Community
scikit-learn: machine learning in Python
Best for: Python developers building traditional machine learning pipelines and prototyping models quickly.
unslothai
Community
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Best for: Developers who want to fine-tune or experiment with open models locally without cloud costs.
Keras
Community
Deep Learning for humans
Best for: Python developers building standard deep learning models who prioritize development speed over maximum performance optimization
Anything LLM
Community
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
Best for: Developers building privacy-sensitive LLM applications who can run compute locally and want to avoid vendor lock-in.
LLMApp
Community
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with SharePoint, Google Drive, S3, Kafka, PostgreSQL, re
Best for: Teams building enterprise search or RAG systems that need live data synchronization without custom connector development.
Embedchain
Community
Universal memory layer for AI Agents
Best for: Python developers building prototype or early-stage agents that need document retrieval without managing vector infrastructure directly
Private GPT
Community
Interact with your documents using the power of GPT, 100% privately, no data leaks
Best for: Teams handling confidential documents who need privacy guarantees over speed
upstash/context7
Various
Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
Best for: Teams using AI code editors who need their LLMs to reference accurate, current codebase documentation without hallucination.
segment-anything (SAM)
Community
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to
Best for: Developers building image annotation tools, content moderation systems, or computer vision applications needing zero-shot segmentation.
Flowise
Community
Build AI Agents, Visually
Best for: Teams building AI agents who want visual composition without writing orchestration code
LiteLLM 🚅
Community
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, Vertex
Best for: Teams managing multiple LLM providers or needing cost visibility across model calls
Milvus
Community
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Best for: Teams building search or recommendation features who need to manage vector data at scale and prefer open-source control over managed services.
DeepSpeed
Community
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Best for: Teams training large models who need to maximize GPU efficiency and scale across multiple devices.
Colossal-AI
Community
Making large AI models cheaper, faster and more accessible
Best for: Teams training large models who have access to multiple GPUs and need to optimize resource efficiency
ChatGLM-6B
Community
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Best for: Developers building Chinese-English chatbots who need local deployment and cost control over quality.
Phidata
Community
Build, run, and manage agent platforms.
Best for: Python developers building multi-agent systems who want structured orchestration without building from scratch
FastChat
Community
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Best for: Researchers and ML engineers building custom LLM applications who need training, serving, and evaluation in one framework.
mindsdb/mindsdb
Various
Platform dedicated to building an open foundation for applied Artificial Intelligence, designed for people seeking production-ready AI systems they can truly control, extend and de
Best for: Data engineers and analysts who want to build ML pipelines without leaving SQL or their existing databases
Quivr
Community
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama.
Best for: Python developers building RAG features into existing applications who want to avoid vendor lock-in and infrastructure boilerplate.
bark
Community
🔊 Text-Prompted Generative Audio Model
Best for: Developers building offline audio generation features or prototyping multilingual voice applications
Langchain-Chatchat
Community
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like
Best for: Teams building private knowledge systems with local LLMs who prioritize data sovereignty over ease of deployment
AgentGPT
Community
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Best for: Developers prototyping autonomous agent workflows and learning agent orchestration patterns
agent-infra/mcp-server-browser
Various
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
Best for: Developers building AI agents that need reliable, protocol-standardized web automation capabilities
JAX
Community
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Best for: Researchers and engineers building custom numerical algorithms that need automatic differentiation and hardware acceleration.
Caffe
Community
Caffe: a fast open framework for deep learning.
Best for: Teams building production computer vision systems who prioritize inference speed and have existing Caffe expertise
GPT Pilot
Community
The first real AI developer
Best for: Developers building prototypes or automating routine code generation tasks who can validate and refine AI output
tabby
Community
Self-hosted AI coding assistant
Best for: Teams prioritizing data privacy and control over ease of deployment
Continue
Community
⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI
Best for: Teams prioritizing code privacy and wanting AI assistance integrated into existing IDE workflows and CI systems
netron
Community
Visualizer for neural network, deep learning and machine learning models
Best for: ML engineers and researchers who need to quickly inspect and understand model architectures across different frameworks
CopilotKit
Community
The Frontend Stack for Agents & Generative UI. React + Angular. Makers of the AG-UI Protocol
Best for: Frontend developers building React or Angular apps that need embedded AI agents and dynamic UI generation
Qdrant
Community
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Best for: Builders needing fast, scalable vector search for embeddings in production AI systems
PyTorch Lightning
Community
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Best for: Teams training models at scale who want to avoid rewriting training code for different hardware configurations
SGLang
Community
SGLang is a high-performance serving framework for large language models and multimodal models.
Best for: Teams building production LLM services who need performance-optimized serving infrastructure
XGBoost
Community
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and D
Best for: Data scientists and ML engineers building production models on structured datasets.
Chroma
Community
Search infrastructure for AI
Best for: Developers building RAG systems and semantic search features who want a straightforward, open-source vector store
GPT Researcher
Community
An autonomous agent that conducts deep research on any data using any LLM providers
Best for: Developers building research automation tools who need flexible LLM provider switching and don't mind managing Python infrastructure.
eyaltoledano/claude-task-master
Various
An AI-powered task-management system you can drop into Cursor, Lovable, Windsurf, Roo, and others.
Best for: Developers using Claude-based code editors who need persistent task tracking across sessions
open-r1
Community
Fully open reproduction of DeepSeek-R1
Best for: Researchers and builders needing transparent, locally-controlled reasoning models
AgentScope
Community
Build and run agents you can see, understand and trust.
Best for: Teams building multi-agent systems who prioritize understanding and debugging agent interactions over rapid deployment.
FastMCP
Various
🚀 The fast, Pythonic way to build MCP servers and clients.
Best for: Python developers building LLM integrations who want to expose tools and data sources via the Model Context Protocol without writing boilerplate.
oraios/serena
Various
A powerful MCP toolkit for coding, providing semantic retrieval and editing capabilities - the IDE for your agent
Best for: Developers building AI agents that need to understand and modify code programmatically
AI
Community
The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
Best for: TypeScript developers building AI features in Next.js applications who want lightweight, unopinionated orchestration
LeRobot
Community
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Best for: Researchers and engineers building robot learning systems who want accessible tooling and pre-trained baselines.
PaddlePaddle
Community
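PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of PaddlePaddle: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)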
Best for: Teams building large-scale production ML systems who need distributed training and cross-platform deployment out of the box
NCNN
Community
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Best for: Mobile and embedded developers building latency-critical inference applications on constrained hardware
Faster Whisper
Community
Faster Whisper transcription with CTranslate2
Best for: Developers building production speech-to-text systems where inference speed and resource efficiency matter more than simplicity.
MemGPT (now Letta)
Community
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
Best for: Developers building conversational or autonomous agents that need to learn and maintain state across extended interactions.
Dolt
Community
Dolt – Git for Data
Best for: Teams needing audit trails and collaborative workflows on structured data
Prefect
Community
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Best for: Python teams building production data pipelines who need observability and fault tolerance without heavyweight infrastructure
ahujasid/blender-mcp
Various
MCP server for working with Blender
Best for: Developers building AI agents that need to generate or modify 3D content in Blender programmatically
Local GPT
Community
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
Best for: Teams handling confidential documents who need offline document chat without external dependencies
promptfoo
Community
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative config
Best for: Teams building LLM applications who need systematic prompt validation and security testing before deployment
veRL
Community
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
Best for: ML engineers building custom RL post-training pipelines for LLMs at scale
pgvector
Community
Open-source vector similarity search for Postgres
Best for: Teams already using Postgres who want vector search without adding infrastructure
Guidance
Community
A guidance language for controlling large language models.
Best for: Developers building production systems that need deterministic, schema-compliant LLM outputs
peft
Community
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Best for: Developers adapting large language models on resource-constrained hardware or managing multiple task-specific variants efficiently.
Apache MXNet
Community
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Best for: Teams building distributed training pipelines or mobile ML applications who need multi-language flexibility
Awesome Production Machine Learning
Community
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Best for: Teams building ML infrastructure who need a starting point for evaluating open source ops tools
AI Chatbot
Community
A full-featured, hackable Next.js AI chatbot built by Vercel
Best for: Developers building AI chatbots in Next.js who want a working reference implementation to fork and customize
Candle
Community
Minimalist ML framework for Rust
Best for: Rust developers building production ML systems where safety and performance matter more than rapid prototyping
Opik
Community
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Best for: Python developers building production LLM systems who need observability and systematic evaluation.
SWE Agent
Community
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [Ne
Best for: Teams wanting to automate routine bug fixes and explore LM-driven code generation without vendor lock-in
mediar-ai/screenpipe
Various
YC (S26) | AI that knows what you've seen, said, or heard. Records everything you do, say, hear 24/7, local, private, secure
Best for: Developers building local-first AI agents that need rich context from user activity without cloud transmission.
DB GPT
Community
Open-source agentic AI data assistant for the next generation of AI + Data products.
Best for: Teams building internal AI data assistants who can maintain Python codebases and want to avoid vendor lock-in
OpenAI Evals
Community
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Best for: Teams building LLM applications who need systematic, reproducible evaluation workflows
LightGBM
Community
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other
Best for: Data scientists building production ML systems on large tabular datasets where training speed and memory efficiency matter.
DocsGPT
Community
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Best for: Teams building internal document search and analysis tools who want open-source control and can manage their own infrastructure.
topoteretes/cognee
Various
Memory platform for AI Agents in 6 lines of code
Best for: Python developers building multi-turn agents that need persistent, queryable memory without database infrastructure overhead
SuperAGI
Community
SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
Best for: Teams building production autonomous agents who want open source control and Python-first development
pydantic/pydantic-ai/mcp-run-python
Various
AI Agent Framework, the Pydantic way
Best for: Python developers building type-safe AI agents with strict validation requirements
NeMo Framework
Community
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech
Best for: Researchers and ML engineers building custom LLMs, speech systems, or multimodal models who need low-level control and scalability.
Argo Workflows
Community
Workflow Engine for Kubernetes
Best for: Teams running workloads on Kubernetes who need declarative, auditable job orchestration without external services
Megatron-LM
Community
Ongoing research training transformer models at scale
Best for: ML engineers training large transformer models who need production-grade distributed training infrastructure
Weaviate
Community
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance an
Best for: Teams building production search systems who need open-source control and can manage infrastructure.
Kubeflow
Community
Machine Learning Toolkit for Kubernetes
Best for: Teams with Kubernetes infrastructure who need to standardize ML workflows across on-prem or multi-cloud environments
DVC
Community
🦉 Data Versioning and ML Experiments
Best for: ML teams building reproducible pipelines who need Git-like versioning for data and models
ChatGLM2-6B
Community
ChatGLM2-6B: An Open Bilingual Chat LLM
Best for: Developers building Chinese-English applications who need local control and want to avoid cloud API dependencies.
Plandex
Community
Open source AI coding agent. Designed for large projects and real world tasks.
Best for: Teams building large features or refactoring substantial codebases who want transparent, plan-driven AI assistance
googleapis/genai-toolbox
Various
MCP Toolbox for Databases is an open source MCP server for databases.
Best for: Developers building AI agents that need direct database query capabilities via MCP
vectorize-io/hindsight
Various
Hindsight: Agent Memory That Learns
Best for: Developers building stateful agents that need to learn and adapt from experience over multiple interactions
MNN-LLM
Community
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.
Best for: Developers building production on-device LLM and edge AI applications where latency and resource efficiency are critical.
GLips/Figma-Context-MCP
Various
MCP server to provide Figma layout information to AI coding agents like Cursor
Best for: Teams using Figma and AI coding assistants who want to automate component generation from design files.
Llmware
Community
Unified framework for building enterprise RAG pipelines with small, specialized models
Best for: Teams building enterprise document search and QA systems who want to optimize costs by using specialized models instead of large LLMs.
fauxpilot
Community
FauxPilot - an open-source alternative to GitHub Copilot server
Best for: Developers who need Copilot-like completion on private code or want to avoid cloud-based code submission
Botpress
Community
The open-source hub to build & deploy GPT/LLM Agents ⚡️
Best for: Teams building production LLM agents who want open-source control and TypeScript integration
Horovod
Community
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Best for: ML engineers training large models who need to scale across multiple GPUs or nodes without rewriting training logic
NNI
Community
An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Best for: ML engineers and researchers who need flexible, self-hosted AutoML for hyperparameter tuning and neural architecture search
mickaelkerjean/filestash
Various
📁 File Management Platform / Universal Data Access Layer (without FUSE)
Best for: Developers building multi-cloud file management systems or needing unified access to heterogeneous storage without FUSE overhead
Ragas
Community
Supercharge Your LLM Application Evaluations 🚀
Best for: Teams building RAG systems who need continuous evaluation without manual labeling
visenger/awesome-mlops
Community
A curated list of references for MLOps
Best for: Teams building or evaluating MLOps infrastructure who need a structured starting point for tool discovery
Outlines
Community
Structured Outputs
Best for: Developers building applications that need reliable structured data extraction from LLMs without validation failures
skill-seekers/Skill_Seekers
Various
Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection
Best for: Developers building Claude agents who need to rapidly convert existing documentation into reusable skills
AI Scientist
Community
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Best for: Researchers and ML engineers exploring automated workflows for hypothesis-driven scientific discovery
TensorRT-LLM
Community
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NV
Best for: Teams deploying LLMs at scale on NVIDIA infrastructure who need maximum inference performance.
JuiceFS
Community
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Best for: Teams running distributed workloads on Kubernetes who need shared, cloud-backed storage without rewriting applications
scalene
Community
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Best for: Python developers optimizing computationally intensive or memory-heavy applications who need precise per-line performance visibility.
TVM
Community
Open Machine Learning Compiler Framework
Best for: ML engineers deploying models to resource-constrained or heterogeneous hardware environments
Litgpt
Community
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Best for: Teams building or customizing LLMs at scale with access to compute resources
wechat-chatgpt
Community
Use ChatGPT On Wechat via wechaty
Best for: Developers building WeChat bots or integrations who want to add conversational AI without building the WeChat connection layer from scratch.
Jupyter Notebooks
Community
Jupyter Interactive Notebook
Best for: Data scientists and researchers who need interactive exploration with reproducible documentation
Showing the top 120 by GitHub stars. 3065 more self-hostable entries live across the directories.