O Open Source Observability medium

Rhesis

by Community

The testing platform for AI teams. Bring engineers, PMs, and domain experts together to generate tests, simulate (adversarial) conversations, and trace every failure to its root cause.

Added 1 June 2026

#generative-ai #llm-evaluation #llm-evaluation-framework #llmops #open-source #quality-assessment #responsible-ai #test-execution

Overview

Rhesis is an open-source testing platform for AI teams. It allows engineers, product managers, and domain experts to collaboratively generate tests, simulate adversarial conversations, and trace failures to their root cause.

Best for

AI teams that want a collaborative, open-source testing and debugging platform.

Use cases

Collaboratively create test cases for AI models across roles
Simulate adversarial conversations to probe model robustness
Trace model failures back to specific inputs or system components

Notes

357 stars on GitHub. Last updated 2026-06-01.

Pros

Open source, with a Python codebase that is easy to inspect and customize
Designed for cross-functional team collaboration on testing
Provides root-cause tracing for failures, aiding debugging

Cons

Relatively small community (357 stars) may mean limited support or integrations
Python-only implementation may not fit non-Python stacks
Newer tool; features and reliability are still evolving

Indexed from awesome-llmops and enriched against its public facts.

Pairs with

Other entries in the index that connect to this one.

Built with (2 entries)

O OSS Framework medium

LangChain

Community

The agent engineering platform.

★ 138,234 updated 1mo ago

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 1mo ago

Pairs with (3 entries)

O OSS Obs medium

LiteLLM 🚅

Community

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, Vertex

★ 48,950 updated 1mo ago

O OSS Orchestration medium

Langflow

Community

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

★ 149,019 updated 1mo ago

O OSS Framework medium

Dify

Community

Production-ready platform for agentic workflow development.

★ 143,435 updated 1mo ago
