O Open Source Frameworks medium

awesome-hallucination-detection

by Community

List of papers on hallucination detection in LLMs.

Visit Community View repo Submit your build →

OSS

Added 1 June 2026

#hallucinations #llms #nlp

Overview

A community-curated list of papers and resources focused on hallucination detection in large language models. It organizes research by categories such as detection methods, benchmarks, and surveys, providing a structured reference for builders and researchers.

Best for

Best for
Researchers and developers who need a curated bibliography on hallucination detection

Use cases

Exploring state-of-the-art hallucination detection techniques
Finding benchmark datasets for evaluating model hallucinations
Surveying recent academic literature on LLM reliability

Notes

1,096 stars on GitHub. Last updated 2026-05-25. Licensed Apache-2.0.

Use cases

Exploring state-of-the-art hallucination detection techniques
Finding benchmark datasets for evaluating model hallucinations
Surveying recent academic literature on LLM reliability

Pros

Comprehensive coverage of research papers across detection approaches
Regularly updated with contributions from the community
Well-organized into categories for quick reference

Cons

Not a runnable tool or library; no code implementations included
Requires manual paper reading and evaluation
May lack practical guidance for real-world deployment

Indexed from awesome-llm and enriched against its public facts.

Pros

Comprehensive coverage of research papers across detection approaches
Regularly updated with contributions from the community
Well-organized into categories for quick reference

Cons

Not a runnable tool or library; no code implementations included
Requires manual paper reading and evaluation
May lack practical guidance for real-world deployment

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Pairs with4entries

O OSS Framework medium

promptfoo

Community

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative config

★ 21,784 updated 1mo ago

O OSS Framework medium

Ragas

Community

Supercharge Your LLM Application Evaluations 🚀

★ 14,186 updated 4mo ago

O OSS Framework medium

lm-evaluation-harness

Community

A framework for few-shot evaluation of language models.

★ 12,772 updated 2mo ago

O OSS Framework medium

OpenAI Evals

Community

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

★ 18,584 updated 3mo ago

Free 27-page guide

Get the free Developer’s Field Guide

A 27-page field guide to the AI coding workflow with Claude. Claude Code, MCP servers, the prompt patterns that work, and what to delegate. Free.

Enter your work email. We send it straight over, plus a few short notes worth knowing. Unsubscribe any time.

← Back to Open Source Submit your own entry →