Have I Been Trained?
by Various
Data Governance for Generative AI
Apps
Have I Been Trained?
Added 1 June 2026
Overview
Have I Been Trained? lets creators search for their images or text in major AI training datasets. It checks against a publicly accessible index of known datasets used to train generative models. Users upload or link content to see if it appears in those collections.
Best for
Best for
Artists, photographers, writers, and other content creators who want to know if their work is used to train AI models.
Use cases
- Check if personal artwork or writings are included in common AI training datasets
- Monitor unauthorized use of creative work in generative model training
- Assess exposure of proprietary data in public training corpora
Notes
Have I Been Trained? lets creators search for their images or text in major AI training datasets. It checks against a publicly accessible index of known datasets used to train generative models. Users upload or link content to see if it appears in those collections.
Use cases
- Check if personal artwork or writings are included in common AI training datasets
- Monitor unauthorized use of creative work in generative model training
- Assess exposure of proprietary data in public training corpora
Pros
- Provides transparency into opaque training data practices
- Simple interface requiring only a URL or file upload to start a search
- Covers multiple well-known datasets in a single query
Cons
- Only checks datasets that have been indexed, not all training data in use
- Results may be incomplete if datasets are updated or not publicly documented
- Relies on creators proactively searching rather than offering continuous monitoring
Indexed from awesome-generative-ai and enriched against its public facts.
Pros
- Provides transparency into opaque training data practices
- Simple interface requiring only a URL or file upload to start a search
- Covers multiple well-known datasets in a single query
Cons
- Only checks datasets that have been indexed, not all training data in use
- Results may be incomplete if datasets are updated or not publicly documented
- Relies on creators proactively searching rather than offering continuous monitoring
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.