hidai25/eval-view

by Various

Regression testing for AI agents. Snapshot behavior, diff tool calls, and catch regressions in CI. Works with LangGraph, CrewAI, OpenAI, and Anthropic.

Added 1 June 2026

#agent-benchmark #agent-evaluation #agentic-ai #ai-agents #anthropic #autogen #cli #crewai

Overview

A regression testing tool for AI agents. It snapshots agent behavior and diffs tool calls to catch regressions in CI pipelines. Supports LangGraph, CrewAI, OpenAI, and Anthropic frameworks.
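
To illustrate the general idea of diffing tool calls against a stored snapshot, here is a minimal sketch using only the Python standard library. It is not eval-view's actual API: the function names (`tool_call_signature`, `diff_tool_calls`) and the dictionary layout with `name` and `args` keys are assumptions made for this example.

```python
import difflib
import json


def tool_call_signature(call):
    """Render one tool call as a stable, comparable string (hypothetical format)."""
    args = json.dumps(call.get("args", {}), sort_keys=True)
    return f"{call['name']}({args})"


def diff_tool_calls(baseline, current):
    """Return unified-diff lines between two tool-call sequences.

    An empty list means the current run matches the baseline snapshot.
    """
    old = [tool_call_signature(c) for c in baseline]
    new = [tool_call_signature(c) for c in current]
    return list(
        difflib.unified_diff(old, new, fromfile="baseline", tofile="current", lineterm="")
    )


# Hypothetical snapshot recorded from a known-good agent run.
baseline = [
    {"name": "search", "args": {"query": "refund policy"}},
    {"name": "send_email", "args": {"to": "user@example.com"}},
]

# Hypothetical run after a code change: the agent no longer sends the email.
current = [
    {"name": "search", "args": {"query": "refund policy"}},
]

diff = diff_tool_calls(baseline, current)
for line in diff:
    print(line)
```

In a CI job, a non-empty diff could be treated as a failed check, which is roughly the workflow the description above refers to.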

Best for
Developers building AI agents who need regression testing in CI

Use cases

  • Catching regressions in agent tool calls during CI
  • Comparing behavior snapshots across agent versions
  • Validating agent outputs after code changes

Notes

112 stars on GitHub. Last updated 2026-05-27. Licensed Apache-2.0.

Pros

  • Integrates with popular agent frameworks
  • Provides diff-based regression detection
  • Lightweight Python library

Cons

  • Limited to Python ecosystem
  • Requires manual snapshot management
  • Only supports specific agent frameworks

Indexed from awesome-mcp-servers-punkpeye and enriched against its public facts.
