O Open Source Orchestration medium

Llama2 Embedding Server

by Community

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

Visit Community View repo Submit your build →

OSS

Added 1 June 2026

#embedding-similarity #embedding-vectors #embeddings #llama2 #llamacpp #semantic-search

Overview

Llama2 Embedding Server is a FastAPI service for semantic text search. It uses precomputed embeddings and advanced similarity measures to find similar texts. It supports multiple file types through textract for extraction.

Best for

Best for
Developers needing a lightweight semantic search server for static text collections.

Use cases

Build a semantic search API over a document corpus
Perform similarity searches on precomputed text embeddings
Integrate file extraction and embedding into a single service

Notes

1,053 stars on GitHub. Last updated 2025-02-27.

Use cases

Build a semantic search API over a document corpus
Perform similarity searches on precomputed text embeddings
Integrate file extraction and embedding into a single service

Pros

Provides a ready-to-deploy FastAPI server for embeddings
Supports multiple file formats via textract
Uses advanced similarity measures beyond cosine

Cons

Only supports precomputed embeddings, not real-time generation
Community project may have limited support or updates
Requires manual embedding computation upfront

Indexed from awesome-langchain and enriched against its public facts.

Pros

Provides a ready-to-deploy FastAPI server for embeddings
Supports multiple file formats via textract
Uses advanced similarity measures beyond cosine

Cons

Only supports precomputed embeddings, not real-time generation
Community project may have limited support or updates
Requires manual embedding computation upfront

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Pairs with5entries

O OSS Orchestration medium

Anything LLM

Community

The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.

★ 60,905 updated 1mo ago

O OSS Orchestration medium

Private GPT

Community

Interact with your documents using the power of GPT, 100% privately, no data leaks

★ 57,218 updated 4mo ago

P Apps Productivity low

quivr

Various

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama.

★ 39,173 updated 1y ago

O OSS Orchestration medium

Langflow

Community

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

★ 149,019 updated 1mo ago

O OSS Orchestration medium

Flowise

Community

Build AI Agents, Visually

★ 53,254 updated 1mo ago

Free 27-page guide

Get the free Developer’s Field Guide

A 27-page field guide to the AI coding workflow with Claude. Claude Code, MCP servers, the prompt patterns that work, and what to delegate. Free.

Enter your work email. We send it straight over, plus a few short notes worth knowing. Unsubscribe any time.

← Back to Open Source Submit your own entry →