LLama Cpp Agent
by Community
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs).
OSS
Added 1 June 2026
Overview
A Python framework for interacting with LLMs via llama.cpp. It enables chat, structured function calls, and structured output, even with models not fine-tuned for JSON or function calling.
Best for
Developers who need local, structured LLM interactions without cloud dependencies
Use cases
- Building local chatbots with structured responses
- Executing function calls from LLMs without fine-tuning
- Extracting structured data from model outputs
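The function-call and structured-extraction use cases can be illustrated with a minimal, standard-library-only sketch. It is not the llama-cpp-agent API: the `get_weather` tool, the `{"function": ..., "arguments": ...}` JSON shape, and the hard-coded `raw_output` string (standing in for text returned by a local model) are all assumptions made for illustration.

```python
import json
from dataclasses import dataclass
from typing import Any, Callable


# Hypothetical tool; the name and signature are illustrative only.
def get_weather(city: str, unit: str = "celsius") -> str:
    return f"Weather in {city}: 21 degrees {unit}"


# Registry of functions the model is allowed to call.
TOOLS: dict[str, Callable[..., Any]] = {"get_weather": get_weather}


@dataclass
class FunctionCall:
    name: str
    arguments: dict[str, Any]


def extract_json(text: str) -> dict[str, Any]:
    """Return the first JSON object embedded in free-form model text."""
    decoder = json.JSONDecoder()
    for start, ch in enumerate(text):
        if ch != "{":
            continue
        try:
            obj, _ = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            continue  # not valid JSON from this brace; keep scanning
        if isinstance(obj, dict):
            return obj
    raise ValueError("no JSON object found in model output")


def parse_call(text: str) -> FunctionCall:
    """Validate the assumed call shape before anything is executed."""
    data = extract_json(text)
    name = data.get("function")
    args = data.get("arguments", {})
    if name not in TOOLS:
        raise ValueError(f"unknown function: {name!r}")
    if not isinstance(args, dict):
        raise ValueError("arguments must be a JSON object")
    return FunctionCall(name=name, arguments=args)


def dispatch(call: FunctionCall) -> Any:
    """Run the registered function with the parsed arguments."""
    return TOOLS[call.name](**call.arguments)


# Stand-in for raw text returned by a local model.
raw_output = 'Sure. {"function": "get_weather", "arguments": {"city": "Oslo"}}'
call = parse_call(raw_output)
result = dispatch(call)
print(result)
```

Validating the function name and argument type before dispatch keeps malformed or unexpected model output from reaching application code, which matters most when the model was not fine-tuned to emit JSON.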
Notes
639 stars on GitHub. Last updated 2026-03-09.
Pros
- Works with models not fine-tuned for JSON or function calls
- Lightweight and focused on local execution
- Active community with 639 GitHub stars
Cons
- Limited to the llama.cpp backend; no support for cloud-hosted models
- May require manual setup for complex workflows
- Smaller ecosystem compared to broader orchestration tools
Indexed from awesome-langchain and enriched against its public facts.