Enterprise DNA
Directories / Alternatives / Promptfoo

Open Source Alternatives

Open source alternatives to Promptfoo

Open source alternatives to Promptfoo, ranked by GitHub stars and freshness.

9 open-source alternatives in the index, ranked by GitHub stars and freshness.

O OSS Framework medium

Opik

Community

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

★ 19,417 updated 2d ago
open-source

Best for: Python developers building production LLM systems who need observability and systematic evaluation.

O OSS Framework medium

OpenAI Evals

Community

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

★ 18,584 updated 1mo ago
open-source

Best for: Teams building LLM applications who need systematic, reproducible evaluation workflows

O OSS Framework medium

Giskard

Community

🐢 Open-Source Evaluation & Testing library for LLM Agents

★ 5,414 updated 5d ago
open-source

Best for: Python developers building LLM agents who need automated safety and quality testing.

O OSS Framework medium

Promptify

Community

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

★ 4,612 updated 2mo ago
open-source

Best for: Python developers seeking a straightforward way to produce structured outputs from LLM prompts while managing prompt versions.

O OSS Framework medium

Agenta

Community

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

★ 4,171 updated 2d ago
open-source

Best for: Teams building LLM applications who want an integrated open-source toolchain

O OSS Framework medium

LangWatch

Community

The platform for LLM evaluations and AI agent testing

★ 3,275 updated 2d ago
open-source

Best for: Developers building and testing LLM-based agents in TypeScript who need a lightweight evaluation framework

O OSS Framework medium

FELM

Community

FELM: Benchmarking Factuality Evaluation of Large Language Models

open-source

Best for: Researchers and developers needing a standardized way to measure LLM factuality

O OSS Framework medium

LangSmith

Community

Complete AI agent and LLM observability platform with tracing and real-time monitoring. Debug agents, find failures fast, and track costs and latency.

open-source

Best for: Teams building complex multi-step agents or LLM pipelines that need production observability

O OSS Framework medium

PromptPerfect

Community

PromptPerfect - AI Prompt Generator and Optimizer

open-source

Best for: Developers and power users who frequently interact with LLMs and want to improve prompt reliability