iris-eval/mcp-server

by Various

The agent eval standard for MCP — score output quality, catch safety failures, enforce cost budgets

Added 1 June 2026

#agent-evaluation #ai-agent #claude #eval #evaluation #llm #mcp #mcp-server

Overview

iris-eval/mcp-server provides a standardized evaluation framework for agents using the Model Context Protocol (MCP). It scores output quality, detects safety failures, and enforces cost budgets to help developers assess and control agent behavior.

Best for
Developers building and evaluating agents that use the Model Context Protocol

Use cases

Benchmark agent output quality against defined criteria
Automatically catch safety violations during agent execution
Enforce per-call or cumulative cost limits to prevent budget overruns
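The difference between a per-call limit and a cumulative limit can be shown with a small TypeScript sketch. This is a generic illustration, not iris-eval's API: `CostBudget`, `BudgetLimits`, and `BudgetVerdict` are hypothetical names, and the listing does not describe how the server actually tracks cost.

```typescript
// Hypothetical illustration only: none of these names come from iris-eval.
interface BudgetLimits {
  perCallUsd: number;    // maximum cost allowed for any single call
  cumulativeUsd: number; // maximum total cost across all recorded calls
}

type BudgetVerdict = "ok" | "per-call-exceeded" | "cumulative-exceeded";

class CostBudget {
  private spentUsd = 0;
  private readonly limits: BudgetLimits;

  constructor(limits: BudgetLimits) {
    this.limits = limits;
  }

  // Record one call's cost, then report which limit (if any) was breached.
  // The per-call check takes precedence when both limits are breached.
  record(callCostUsd: number): BudgetVerdict {
    this.spentUsd += callCostUsd;
    if (callCostUsd > this.limits.perCallUsd) return "per-call-exceeded";
    if (this.spentUsd > this.limits.cumulativeUsd) return "cumulative-exceeded";
    return "ok";
  }

  get totalUsd(): number {
    return this.spentUsd;
  }
}
```

A cheap call can still trip the cumulative limit once earlier calls have used up the total, which is why the two checks are kept separate.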

How to use

Install

npx @iris-eval/mcp-server --dashboard

Configuration (environment variables)

IRIS_TRANSPORT
IRIS_PORT
IRIS_HOST
IRIS_DB_PATH
IRIS_LOG_LEVEL
IRIS_DASHBOARD
IRIS_DASHBOARD_PORT
IRIS_API_KEY
IRIS_ALLOWED_ORIGINS
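The `IRIS_`-prefixed names above read as environment variables rather than callable tools. Many MCP clients accept an optional `env` map on a server entry, so one way to pass them is sketched below in TypeScript. The variable names are from the listing; every value shown is a placeholder assumption, because the listing does not document accepted values, and `withEnv` is a hypothetical helper.

```typescript
// Shape of one server entry in an MCP client config; "env" is optional.
interface ServerEntry {
  command: string;
  args: string[];
  env?: Record<string, string>;
}

// Return a copy of the entry with extra environment variables merged in.
// The original entry is not mutated.
function withEnv(entry: ServerEntry, env: Record<string, string>): ServerEntry {
  return { ...entry, env: { ...(entry.env ?? {}), ...env } };
}

const irisEntry: ServerEntry = withEnv(
  { command: "npx", args: ["@iris-eval/mcp-server"] },
  {
    // ASSUMPTION: placeholder values, not documented in the listing.
    IRIS_LOG_LEVEL: "debug",
    IRIS_DB_PATH: "./iris.db",
  },
);

console.log(JSON.stringify({ mcpServers: { "iris-eval": irisEntry } }, null, 2));
```

Check the project's own documentation for the values each variable accepts before relying on any of these.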

Tested with

Claude Desktop, Claude Code, Cursor, Windsurf, Cline, Continue, VS Code, ChatGPT

Example client config

{
  "mcpServers": {
    "iris-eval": {
      "command": "npx",
      "args": ["@iris-eval/mcp-server"]
    }
  }
}
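If a client config already lists other servers, the entry above has to be merged in rather than pasted over the file. A minimal TypeScript sketch of that merge follows; `addIrisEval` and the `ClientConfig` type are hypothetical names introduced for illustration, and only the `command` and `args` values come from the listing.

```typescript
interface ServerSpec {
  command: string;
  args: string[];
}

// Client configs may carry unrelated keys, so those are allowed and preserved.
interface ClientConfig {
  mcpServers?: Record<string, ServerSpec>;
  [key: string]: unknown;
}

// Return a new config with the iris-eval entry added.
// Existing servers and unrelated settings are kept; the input is not mutated.
function addIrisEval(config: ClientConfig): ClientConfig {
  return {
    ...config,
    mcpServers: {
      ...(config.mcpServers ?? {}),
      "iris-eval": { command: "npx", args: ["@iris-eval/mcp-server"] },
    },
  };
}
```

Reading the existing file, calling `addIrisEval` on the parsed object, and writing the result back keeps any servers that were already configured.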

Notes

6 stars on GitHub. Last updated 2026-05-25. Licensed MIT.

Pros

Offers a formal evaluation standard for MCP-based agents
Combines quality, safety, and cost checks in one tool
Written in TypeScript for type-safe integration

Cons

Very low GitHub star count (6) suggests limited community adoption
Tightly coupled to the MCP ecosystem, not useful outside it
Requires agent infrastructure already built on MCP

Indexed from awesome-mcp-servers-punkpeye and enriched against its public facts.

Pairs with

Other entries in the index that connect to this one.

Works in 3 entries

Cline

Open-source autonomous coding agent that lives inside VS Code. BYO model key, watch it work.

Claude Code

Anthropic

Anthropic's terminal-native coding agent. Reads your repo, edits files, runs tests, ships PRs.

Continue

Continue.dev

Open-source AI code assistant for VS Code and JetBrains. Customisable, BYO model, built for enterprise.

Built with 1 entry

FastMCP

Various

🚀 The fast, Pythonic way to build MCP servers and clients.

★ 25,425 updated 1mo ago
