Enterprise DNA

Have I Been Trained?

by Various

Data Governance for Generative AI

Added 1 June 2026

Overview

Have I Been Trained? lets creators search for their images or text in major AI training datasets. It checks against a publicly accessible index of known datasets used to train generative models. Users upload or link content to see if it appears in those collections.
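The lookup idea described above can be sketched in a few lines. This is only an illustration under stated assumptions, not how Have I Been Trained? is implemented: it assumes you already hold a list of source URLs taken from a dataset index, and the helper names `normalize_url`, `build_index`, and `find_matches` are hypothetical. It matches by URL only, so it would miss re-hosted or visually similar copies.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url: str) -> str:
    """Lowercase scheme and host and drop the fragment so trivially
    different spellings of the same URL compare equal."""
    parts = urlsplit(url.strip())
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), parts.path, parts.query, "")
    )


def build_index(dataset_urls):
    """Build a set of normalized URLs from a (hypothetical) dataset listing."""
    return {normalize_url(u) for u in dataset_urls}


def find_matches(my_urls, index):
    """Return the caller's URLs whose normalized form appears in the index."""
    return [u for u in my_urls if normalize_url(u) in index]
```

For example, calling `find_matches(["https://example.com/art/cat.png"], build_index(["https://Example.com/art/cat.png#top"]))` would report the first URL as a match, because the two differ only in host casing and fragment.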

Best for

Artists, photographers, writers, and other content creators who want to know whether their work appears in datasets used to train AI models.

Use cases

  • Check if personal artwork or writings are included in common AI training datasets
  • Periodically check for unauthorized use of creative work in generative model training
  • Assess exposure of proprietary data in public training corpora

Notes

Pros

  • Provides transparency into opaque training data practices
  • Simple interface requiring only a URL or file upload to start a search
  • Covers multiple well-known datasets in a single query

Cons

  • Only checks datasets that have been indexed, not all training data in use
  • Results may be incomplete if datasets are updated or not publicly documented
  • Requires creators to search proactively; offers no continuous monitoring

Indexed from awesome-generative-ai and enriched against its public facts.
