O Open Source Orchestration medium

Llmware

by Community

Unified framework for building enterprise RAG pipelines with small, specialized models

Visit Community View repo Submit your build →

OSS

Llmware

Added 1 June 2026

#agents #generative-ai-tools #llamacpp #llm #onnx #openvino #parsing #retrieval-augmented-generation

Overview

Llmware is a Python framework for building enterprise RAG (Retrieval-Augmented Generation) pipelines using small, specialized models instead of large general-purpose ones. It provides orchestration tools to connect retrieval, parsing, and inference components into production workflows. The framework emphasizes cost efficiency and control by enabling deployment of focused models optimized for specific tasks.

Best for

Best for
Teams building enterprise document search and QA systems who want to optimize costs by using specialized models instead of large LLMs.

Use cases

Building document retrieval and question-answering systems with custom model selection
Orchestrating multi-step RAG pipelines with document parsing and embedding steps
Deploying enterprise search applications with fine-tuned or specialized models

Notes

14,848 stars on GitHub. Last updated 2026-05-17. Licensed Apache-2.0.

Use cases

Building document retrieval and question-answering systems with custom model selection
Orchestrating multi-step RAG pipelines with document parsing and embedding steps
Deploying enterprise search applications with fine-tuned or specialized models

Pros

Designed specifically for enterprise RAG workflows with orchestration built in
Supports small and specialized models, reducing inference costs and latency
Active open-source project with substantial community adoption (14k+ stars)

Cons

Python-only, limiting integration into non-Python backend systems
Requires manual model selection and configuration, adding complexity for teams unfamiliar with model specialization
Community-maintained project without commercial support guarantees

Indexed from awesome-langchain and enriched against its public facts.

Pros

Designed specifically for enterprise RAG workflows with orchestration built in
Supports small and specialized models, reducing inference costs and latency
Active open-source project with substantial community adoption (14k+ stars)

Cons

Python-only, limiting integration into non-Python backend systems
Requires manual model selection and configuration, adding complexity for teams unfamiliar with model specialization
Community-maintained project without commercial support guarantees

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Uses4entries

O OSS Framework medium

LangChain

Community

The agent engineering platform.

★ 138,234 updated 1mo ago

O OSS Obs medium

Chroma

Community

Search infrastructure for AI

★ 28,173 updated 1mo ago

O OSS Obs medium

Qdrant

Community

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

★ 31,735 updated 1mo ago

O OSS Obs medium

Milvus

Community

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

★ 44,579 updated 1mo ago

Built with1entry

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 1mo ago

Pairs with3entries

O OSS Framework medium

ollama

Community

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

★ 172,846 updated 1mo ago

O OSS Orchestration medium

Flowise

Community

Build AI Agents, Visually

★ 53,254 updated 1mo ago

O OSS Orchestration medium

Anything LLM

Community

The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.

★ 60,905 updated 1mo ago

Alternative to2entries

O OSS Orchestration medium

Langflow

Community

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

★ 149,019 updated 1mo ago

O OSS Framework medium

LangChain

Community

The agent engineering platform.

★ 138,234 updated 1mo ago

Free 27-page guide

Get the free Developer’s Field Guide

A 27-page field guide to the AI coding workflow with Claude. Claude Code, MCP servers, the prompt patterns that work, and what to delegate. Free.

Enter your work email. We send it straight over, plus a few short notes worth knowing. Unsubscribe any time.

← Back to Open Source Submit your own entry →