GPT Cleanup

Free AI Text Detector

Scan any text for hidden AI fingerprints — zero-width characters, ChatGPT watermarks, and invisible markers. Proof, not probability.

100% client-side · No text is ever sent to a server · Works with ChatGPT, Claude, Gemini, Grok & more

Text to Analyze
Detection Result

Scan results will appear here...

How this works: This tool scans for technical fingerprints — hidden Unicode characters, HTML attributes, and formatting markers that AI chat interfaces inject into copied text. It does not analyze writing style or use machine learning. A low score means no technical markers were found, but the text could still be AI-generated and manually cleaned. A high score means definitive technical evidence of AI tool origin.

How Our AI Detector Works

Most AI detectors rely on machine learning models that analyze writing patterns like perplexity, burstiness, and sentence structure. The fundamental problem with this approach is false positives: human-written text gets flagged as AI-generated, and lightly edited AI text passes undetected. Statistical models guess — they do not prove.

GPT Cleanup's AI Detector takes a completely different approach. Instead of analyzing how text reads, we scan for what text contains — hidden Unicode characters, HTML data attributes, and invisible formatting markers that AI chat interfaces inject into copied text. These are technical fingerprints: deterministic evidence that software, not a human keyboard, produced the content.

When we detect a ChatGPT data-start attribute or zero-width spaces scattered through a paragraph, that's forensic evidence — not a probability score. Our results are binary and provable: either the marker exists in your text, or it doesn't. This makes our detector immune to the false-positive problem that plagues NLP-based tools.

The tool processes both the raw HTML and plain text versions of your input simultaneously. This dual-layer approach is critical because some markers (like ChatGPT's proprietary data-start, data-end, and data-message-* attributes) exist only in the HTML clipboard layer that rides along when you copy from ChatGPT's web interface. Plain-text-only analysis would miss these entirely.

How to Use the AI Detector in 3 Steps

1

Paste Your Text

Copy text directly from ChatGPT, Claude, Gemini, Grok, or any AI tool and paste it into the analysis panel. For best results, copy from the AI chat interface rather than an intermediate editor — this preserves the hidden HTML layer that contains the strongest markers.

2

Click "Scan"

One click runs our detection engine across 10 marker categories in both HTML and plain text layers. The analysis happens in milliseconds, entirely in your browser. No text is ever uploaded to a server.

3

Review Your Results

Get a 0-100 confidence score with a clear verdict (Human / Mixed / AI) plus a detailed breakdown of every finding organized by severity — from critical evidence like ChatGPT data attributes to weaker signals like smart quotes.

Complete List of AI Markers We Detect

Our scanner checks for 10 categories of technical AI markers, each assigned a severity weight based on how reliably it indicates AI origin. Critical markers are near-definitive proof — they virtually never appear in human-typed text. Weak markers are common in formatted text and only become meaningful when combined with stronger signals. The weighted scoring system produces a 0–100 confidence score.

MarkerSeverityReferenceWhy It Matters
ChatGPT HTML AttributesCriticaldata-start, data-endProprietary HTML markup injected by ChatGPT's web interface during copy — the single strongest signal of AI origin, virtually impossible to find in human-authored text
Zero-Width CharactersCriticalU+200B, U+200C, U+200DInvisible Unicode characters that AI chat UIs embed for cursor tracking and rich-text management — splits keywords, breaks JSON, and flags AI detection tools
Byte Order Mark (BOM)StrongU+FEFFLegacy encoding marker that should only appear at the start of a file — when found mid-text, it's a strong indicator of AI text processing pipeline artifacts
Word JoinerStrongU+2060Invisible word-binding character that prevents line breaks — rare in human text, commonly injected by AI chat interfaces during text rendering
Direction MarkersStrongU+200E, U+200FLeft-to-right and right-to-left marks used for bidirectional text control — suspicious when found in monolingual English text from AI tools
Non-Breaking SpacesModerateU+00A0Looks identical to a normal space but prevents line wrapping — AI tools frequently substitute regular spaces with NBSP, a detectable pattern
Soft HyphensModerateU+00ADInvisible hyphenation hints that only render at line breaks — AI text rendering engines insert these for layout optimization, humans almost never type them
Smart QuotesWeakU+201C-201FCurly quotation marks that AI tools auto-convert from straight quotes — common in formatted text but suspicious in quantity alongside other markers
Em DashesWeakU+2013, U+2014Long dashes that AI models heavily favor over hyphens — a stylistic signature that becomes relevant when combined with technical markers
Unicode EllipsisWeakU+2026Single-character ellipsis that AI tools auto-convert from three dots — a minor signal that adds weight when found with stronger evidence

Who Uses AI Text Detection

Educators & Academics

Verify student submissions with concrete technical evidence. Our detector complements NLP-based tools like Turnitin by finding proof rather than probability — zero-width characters don't lie, even when writing style appears human.

Content Teams & Editors

Check freelancer deliverables and contributed content for unedited AI output. Detect when text has been pasted directly from ChatGPT without any manual editing by scanning for the HTML fingerprints that copy operations preserve.

SEO Professionals

Find hidden characters that silently damage search performance. Zero-width spaces in title tags split keywords, break schema markup, and cause indexing anomalies that are invisible to the human eye but visible to search engine crawlers.

Developers & QA Teams

Detect invisible Unicode contamination in code snippets, API responses, JSON payloads, and configuration files sourced from AI tools. A single zero-width character can cause parsing failures that are nearly impossible to debug.

Technical Detection vs. NLP-Based AI Detectors

Traditional AI detectors like GPTZero, Originality.ai, and Copyleaks use natural language processing to analyze statistical patterns in writing — measuring perplexity (how predictable each word is), burstiness (variation in sentence complexity), and stylistic markers. While powerful, this approach has a fundamental limitation: it produces probability scores, not proof. A well-written human essay can score as "likely AI," and a lightly paraphrased AI text can score as "likely human."

Turnitin's AI detection module combines NLP analysis with their massive academic text database, giving it strong performance in educational contexts. However, it still relies on probabilistic scoring and has documented cases of false positives that have caused real harm to students.

GPT Cleanup's approach is orthogonal — we don't replace these tools, we complement them. Our detector finds concrete, verifiable evidence: a ChatGPT data-start attribute either exists in the text or it doesn't. A zero-width space is either present or absent. There is no gray area, no probability score, no chance of a false positive on technical markers.

The ideal workflow combines both approaches: use GPT Cleanup's AI Detector first for definitive technical evidence, then cross-reference with an NLP-based tool for stylistic analysis. If both methods agree, you have strong confidence. If they disagree, the technical evidence tells you what actually happened at the copy-paste level.

Understanding Your Detection Score

The AI Detector produces a weighted confidence score from 0 to 100, calculated by summing severity-weighted points for each detected marker category. The score maps to three verdict bands:

0 – 19Human

No significant technical markers found. The text shows no evidence of AI tool origin at the clipboard level. Note: this doesn't guarantee the text wasn't AI-generated — it may have been cleaned or manually retyped.

20 – 49Mixed

Some technical markers detected, but not enough for a definitive conclusion. This often means the text was partially edited after copying from an AI tool, or contains formatting artifacts that could have other sources.

50 – 100AI

Strong technical evidence of AI tool origin. Multiple high-severity markers detected — ChatGPT data attributes, zero-width characters, or a combination of strong signals. This is near-certain proof of unedited copy-paste from an AI interface.

Frequently Asked Questions

Everything you need to know about detecting AI-generated text

How does GPT Cleanup's AI Detector work?
Unlike AI detectors that use machine learning to guess writing style, our tool scans for technical fingerprints — hidden Unicode characters, HTML data attributes, and invisible markers that AI chat interfaces inject into copied text. These markers are deterministic evidence, not probabilistic guesses.
Is this AI detector free?
Yes, completely free with no usage limits, no sign-up, and no premium tier. All processing happens in your browser — no server costs means no reason to charge.
What's the difference between this and GPTZero or Turnitin?
GPTZero and Turnitin analyze writing patterns (perplexity, burstiness) using NLP models, which can produce false positives. Our detector scans for concrete technical evidence — hidden characters and HTML attributes that AI tools inject. We find proof, not probability. The two approaches are complementary.
Can AI text pass this detector?
If someone manually cleans the text (using our Text Cleaner, for example), the technical fingerprints will be removed and this detector won't flag it. That's by design — we detect copy-paste artifacts, not writing style.
Is my text private?
100% private. All scanning happens client-side in your browser. No text is uploaded, stored, or sent to any server. You can verify this by disconnecting from the internet and using the tool offline.
What hidden characters does this detect?
We scan for zero-width spaces (U+200B), zero-width joiners (U+200D), byte order marks (U+FEFF), word joiners (U+2060), direction markers (U+200E/F), non-breaking spaces (U+00A0), soft hyphens (U+00AD), ChatGPT data-start/data-end HTML attributes, smart quotes, em dashes, and Unicode ellipsis characters.
What does each severity level mean?
Critical: Nearly impossible to find in human-typed text (e.g., ChatGPT data attributes, zero-width characters). Strong: Very rare in human text (BOM mid-text, direction markers). Moderate: Occasionally appears in human text but suspicious in quantity (non-breaking spaces, soft hyphens). Weak: Common in formatted text, only relevant when combined with stronger signals (smart quotes, em dashes).

Detects Fingerprints from Every AI Tool

Our detection engine identifies technical markers from every major AI writing platform. Each tool's chat interface injects its own combination of hidden Unicode characters and HTML attributes during copy operations. We scan for all known patterns across every platform.

ChatGPTGPT-4oClaudeGeminiGrokDeepSeekPerplexityJasperLLaMACopilotBing ChatNotion AIGrammarly AI

Clean AI Text with GPT Cleanup

Found AI fingerprints? Remove all hidden characters, zero-width spaces, and ChatGPT watermarks in one click. Free, instant, and 100% private — no data leaves your browser.