Llama2 Embedding Server
by Community
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
OSS
Added 1 June 2026
Overview
Llama2 Embedding Server is a FastAPI service for semantic text search. It finds similar texts by comparing a query against precomputed embeddings using advanced similarity measures, and it uses textract to extract text from multiple file types.
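As a rough illustration of what searching over precomputed embeddings involves, here is a minimal sketch in plain Python. It is not the project's code or API: the names `cosine_similarity`, `top_k`, and `PRECOMPUTED` are hypothetical, the three-dimensional vectors are toy values rather than real model output, and plain cosine similarity stands in as a baseline for the more advanced measures the project is described as using.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec, index, k=3):
    """Return the k (text, score) pairs from `index` most similar to `query_vec`."""
    scored = [(text, cosine_similarity(query_vec, vec)) for text, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy "precomputed" embeddings (assumption: real ones would come from an embedding model).
PRECOMPUTED = {
    "cats purr": [0.9, 0.1, 0.0],
    "dogs bark": [0.1, 0.9, 0.0],
    "stock prices fell": [0.0, 0.1, 0.9],
}

# The query vector would normally be the embedding of the user's search text.
results = top_k([1.0, 0.0, 0.0], PRECOMPUTED, k=2)
for text, score in results:
    print(f"{score:.3f}  {text}")
```

Because the stored vectors are computed ahead of time, each query only needs one embedding for the search text plus a pass over the stored vectors, which is why this approach suits static text collections.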
Best for
Developers needing a lightweight semantic search server for static text collections.
Use cases
- Build a semantic search API over a document corpus
- Perform similarity searches on precomputed text embeddings
- Integrate file extraction and embedding into a single service
Notes
1,053 stars on GitHub. Last updated 2025-02-27.
Pros
- Provides a ready-to-deploy FastAPI server for embeddings
- Supports multiple file formats via textract
- Uses advanced similarity measures beyond cosine
Cons
- Only supports precomputed embeddings, not real-time generation
- Community project may have limited support or updates
- Requires manual embedding computation upfront
Indexed from awesome-langchain and enriched against its public facts.
Pairs with
Other entries in the index that connect to this one.
Embedchain
Community
Universal memory layer for AI Agents
Chroma
Community
Search infrastructure for AI
Milvus
Community
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Private GPT
Community
Interact with your documents using the power of GPT, 100% privately, no data leaks
Anything LLM
Community
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.