---
name: seo-audit-schema
description: Machine-verified JSON-LD schema extraction from every page. Identifies schema types, counts, and pages without structured data.
disable-model-invocation: true
allowed-tools: Bash
argument-hint: "[domain] [sitemap-url] [output-dir]"
---

# SEO Audit: Schema/JSON-LD Extraction

Run the Python schema extractor on every page in the sitemap.

```bash
python3 ${CLAUDE_SKILL_DIR}/scripts/schema.py $ARGUMENTS
```

This produces:
- `{domain}_schema.csv` — Every JSON-LD block on every page with type, name, and content preview
- `{domain}_schema_pages.csv` — Per-page schema count and types
- `{domain}_schema_summary.json` — Type counts and statistics

After completion, analyze:
1. Which pages have schema and which don't
2. Schema types present (Organization, Product, BreadcrumbList, BlogPosting, FAQPage, etc.)
3. Pages that should have schema but don't (About pages, product pages)
4. Any JSON-LD parse errors
