O Open Source Observability medium

ai-evaluation

by Community

Evaluation Framework for all your AI related Workflows

Visit Community View repo Submit your build →

OSS

ai-evaluation

Added 1 June 2026

#agentic-ai #ai #ai-agents #cicd #evaluation #ml

Overview

A community-built evaluation framework for AI workflows, written in Python. It provides tools to assess and validate outputs from AI models and pipelines.

Best for

Best for
Developers seeking a simple, open-source evaluation framework for AI workflows

Use cases

Testing and scoring LLM responses against expected criteria
Monitoring performance of AI systems in production
Validating outputs from custom AI workflows

Notes

A community-built evaluation framework for AI workflows, written in Python. It provides tools to assess and validate outputs from AI models and pipelines.

105 stars on GitHub. Last updated 2026-05-29. Licensed Apache-2.0.

Use cases

Testing and scoring LLM responses against expected criteria
Monitoring performance of AI systems in production
Validating outputs from custom AI workflows

Pros

Open source and free to use
Lightweight Python implementation
Focused specifically on AI evaluation

Cons

Small community with only 105 stars
Limited documentation and examples
May lack advanced features found in larger frameworks

Indexed from awesome-llmops and enriched against its public facts.

Pros

Open source and free to use
Lightweight Python implementation
Focused specifically on AI evaluation

Cons

Small community with only 105 stars
Limited documentation and examples
May lack advanced features found in larger frameworks

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Pairs with3entries

O OSS Framework medium

LangChain

Community

The agent engineering platform.

★ 138,234 updated 1mo ago

O OSS Obs medium

LiteLLM 🚅

Community

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, Vertex

★ 48,950 updated 1mo ago

O OSS Obs medium

Chroma

Community

Search infrastructure for AI

★ 28,173 updated 1mo ago

Free 27-page guide

Get the free Developer’s Field Guide

A 27-page field guide to the AI coding workflow with Claude. Claude Code, MCP servers, the prompt patterns that work, and what to delegate. Free.

Enter your work email. We send it straight over, plus a few short notes worth knowing. Unsubscribe any time.

← Back to Open Source Submit your own entry →