Extract Website Content → Check AI Training Use → Generate Report
Audit websites and AI platforms to identify if your copyrighted content is being used in AI training datasets without permission.
Workflow Steps
Screaming Frog
Crawl suspicious websites
Use Screaming Frog to crawl websites suspected of hosting pirated content or AI training datasets. Focus on sites like academic repositories, file-sharing platforms, and AI company documentation that might reference your content.
ChatGPT
Analyze content for matches
Upload excerpts of your copyrighted content to ChatGPT and ask it to identify if this content appears in its training data. Use prompts like 'Does this text appear in your training dataset?' and 'Can you complete this passage?' with unique sentences from your work.
Google Sheets
Document findings
Create a spreadsheet to track your audit results. Include columns for content source, suspected AI model, match confidence level, and evidence type. This creates a systematic record for potential legal action.
Notion
Generate compliance report
Compile your findings into a comprehensive report in Notion, including evidence screenshots, AI model responses, and recommended actions. Structure it with sections for executive summary, detailed findings, and next steps for legal review.
Workflow Flow
Step 1
Screaming Frog
Crawl suspicious websites
Step 2
ChatGPT
Analyze content for matches
Step 3
Google Sheets
Document findings
Step 4
Notion
Generate compliance report
Why This Works
This systematic approach provides concrete evidence of unauthorized AI training use, combining automated discovery with AI-powered analysis to build a strong case for copyright protection.
Best For
Publishers and authors who want to audit whether their content is being used in AI training without authorization
Explore More Recipes by Tool
Comments
No comments yet. Be the first to share your thoughts!