Open Source Alternatives
Open source alternatives to Promptfoo
Open source alternatives to Promptfoo, ranked by GitHub stars and freshness.
9 open-source alternatives in the index, ranked by GitHub stars and freshness.
Opik
Community
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Best for: Python developers building production LLM systems who need observability and systematic evaluation.
OpenAI Evals
Community
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Best for: Teams building LLM applications who need systematic, reproducible evaluation workflows
Giskard
Community
🐢 Open-Source Evaluation & Testing library for LLM Agents
Best for: Python developers building LLM agents who need automated safety and quality testing.
Promptify
Community
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
Best for: Python developers seeking a straightforward way to produce structured outputs from LLM prompts while managing prompt versions.
Agenta
Community
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
Best for: Teams building LLM applications who want an integrated open-source toolchain
LangWatch
Community
The platform for LLM evaluations and AI agent testing
Best for: Developers building and testing LLM-based agents in TypeScript who need a lightweight evaluation framework
FELM
Community
FELM: Benchmarking Factuality Evaluation of Large Language Models
Best for: Researchers and developers needing a standardized way to measure LLM factuality
LangSmith
Community
Complete AI agent and LLM observability platform with tracing and real-time monitoring. Debug agents, find failures fast, and track costs and latency.
Best for: Teams building complex multi-step agents or LLM pipelines that need production observability
PromptPerfect
Community
PromptPerfect - AI Prompt Generator and Optimizer
Best for: Developers and power users who frequently interact with LLMs and want to improve prompt reliability