hidai25/eval-view
by Various
Regression testing for AI agents. Snapshot behavior,diff tool calls,catch regressions in CI. Works with LangGraph, CrewAI, OpenAI, Anthropic.
MCP
hidai25/eval-view
Added 1 June 2026
Overview
A regression testing tool for AI agents. It snapshots agent behavior and diffs tool calls to catch regressions in CI pipelines. Supports LangGraph, CrewAI, OpenAI, and Anthropic frameworks.
Best for
Best for
Developers building AI agents who need regression testing in CI
Use cases
- Catching regressions in agent tool calls during CI
- Comparing behavior snapshots across agent versions
- Validating agent outputs after code changes
Notes
A regression testing tool for AI agents. It snapshots agent behavior and diffs tool calls to catch regressions in CI pipelines. Supports LangGraph, CrewAI, OpenAI, and Anthropic frameworks.
112 stars on GitHub. Last updated 2026-05-27. Licensed Apache-2.0.
Use cases
- Catching regressions in agent tool calls during CI
- Comparing behavior snapshots across agent versions
- Validating agent outputs after code changes
Pros
- Integrates with popular agent frameworks
- Provides diff-based regression detection
- Lightweight Python library
Cons
- Limited to Python ecosystem
- Requires manual snapshot management
- Only supports specific agent frameworks
Indexed from awesome-mcp-servers-punkpeye and enriched against its public facts.
Pros
- Integrates with popular agent frameworks
- Provides diff-based regression detection
- Lightweight Python library
Cons
- Limited to Python ecosystem
- Requires manual snapshot management
- Only supports specific agent frameworks
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.