After running the Ruthless Auditor API on 150+ domains last month, I discovered a problem that neither "Adjective Creep" nor "Summary Integrity" training can fix. Citation Contamination.
Most harm to your Entity Boundary isn't caused by your own content. It's caused by your citation network.
The Poisoned Trust Graph
Your entity profile isn't built in isolation. LLMs aggregate signals across:
- Your primary content (Tier 1: Noun Precision)
- Corroborating sources (Tier 2: Entity Consensus)
- Your outbound citations (the validation layer)
When you cite a source claiming "AI tools reduce content costs by 70%" — but that source later gets flagged as generating synthetic citations — your entity inherits a portion of that contamination.
This is Triple Extraction Failure:
- The cited source fails Source Verification (can't be cross-referenced)
- Your entity inherits a Validation Gap (the claim can't be verified)
- The entire citation chain gets marked with Validation Friction > threshold
The Compute Cost of Trust for citing you just spiked from O(1) to O(n²).
Real audit snapshot
I tested two SaaS sites in the same niche:
Site A: Cited "research from McKinsey" (real, verifiable PDF with DOI). Added 3 authoritative sources for key stats.
Site B: Cited "industry research" (blog post citing 5 other blog posts, 2 with expired domains, 1 redirecting to spam).
Entity Consensus score after 90 days:
- Site A: 0.73 (above Transaction Readiness threshold)
- Site B: 0.31 (below Entity Boundary collapse threshold)
Site B's Validation Gap wasn't from their own content quality. It was from who they linked to.
The Hash Verification cascade
LLMs don't just verify your claims. They verify the entire citation graph depth-first.
When your citation points to:
- Stale domains → Hash mismatch → Contamination flag
- Redirect chains → Trust decay at each hop
- Generic landing pages → No entity boundary found → Validation failure
Each failure reduces your Entity Coherence Score. Three failures in a single paragraph? Compute Cost of Trust exceeds the model's inference budget. Your entire section gets skipped.
The audit you actually need
Forget keyword density. Run this instead:
Step 1: Extract every outbound link from your top 10 pages.
Step 2: Batch-verify each with a HEAD request + content hash.
Step 3: Measure Validation Latency (how long before 200 OK vs. redirect vs. 4xx).
Step 4: Calculate your Citation Health Score.
Rule: If >30% of your citations have >2 redirects or >500ms verification time, you have a Citation Contamination problem that no Schema markup can fix.
The Trench Question
Most SEO audits count backlinks. But when was the last time you measured your citations' citations?
If your entity boundary is only as strong as your weakest trust signal — and 40% of your citation sources fail basic Hash Verification — what's the actual Transaction Readiness cost of that decay?
Is there a Citation Contamination audit workflow you're using, or are you still measuring visibility metrics that ignore the validation layer entirely?