Parser build queue
26 targets across 4 waves, drawn from the PRD's 249-site SaaS research. Waves run by descending aggregate coverage — pick the top unbuilt target in the earliest wave.
0 of 26 parsers shipped · 0 of 26 signatures verified. See coverage map →
Docs platforms, high-leverage
5 targets · 121 sites covered
- 55 sites observed
nextjsdocs__NEXT_DATA__ script payload; MDX-driven routes
Sig: plannedParser: planned - 36 sites observed
mintlifydocsx-mintlify-client-version response header; /llms.txt present
Sig: plannedParser: planned - 17 sites observed
docusaurusdocs<meta name="generator"> (verify strippable signal)
Sig: plannedParser: planned - 7 sites observed
readmedocsfooter attribution plus files.readme.io CDN host
Sig: plannedParser: planned - 6 sites observed
gitbookdocsx-gitbook-route-site response header
Sig: plannedParser: planned
Docs platforms, long tail
9 targets · 33 sites covered
- 7 sites observed
zendesk-guidedocsvia: zorg response header for help portals
Sig: plannedParser: planned - 5 sites observed
hugodocsgenerator meta tag (verify strippable signal)
Sig: plannedParser: planned - 5 sites observed
sphinxdocsPython ecosystem theme markers; Sphinx or Read the Docs generator meta
Sig: plannedParser: planned - 5 sites observed
mkdocsdocsPython ecosystem theme markers; MkDocs generator meta
Sig: plannedParser: planned - 3 sites observed
ferndocsfooter or asset attribution
Sig: plannedParser: planned - 3 sites observed
antoradocsAsciidoc-based multi-version docs markup
Sig: plannedParser: planned - 3 sites observed
astrodocs_astro/ asset paths or generator meta
Sig: plannedParser: planned - 1 sites observed
vitepressdocs<meta name="generator" content="VitePress">
Sig: plannedParser: planned - 1 sites observed
nextradocsNext.js-based MDX docs framework marker
Sig: plannedParser: planned
CMS-backed Resources and Blog, high-leverage
5 targets · 85 sites covered
- 38 sites observed
wordpressresources/wp-json/wp/v2/posts REST API plus /feed/
Sig: plannedParser: planned - 21 sites observed
sanityresourcescdn.sanity.io asset host
Sig: plannedParser: planned - 16 sites observed
contentfulresourcesimages.ctfassets.net asset host
Sig: plannedParser: planned - 6 sites observed
hubspot-cmsresourceshs-scripts and hs-forms markers
Sig: plannedParser: planned - 4 sites observed
webflow-cmsresourcescdn.prod.website-files.com asset paths
Sig: plannedParser: planned
CMS-backed Resources and Blog, long tail
7 targets · 15 sites covered
- 5 sites observed
notion-cmsresourcesembedded Notion content blocks inside Webflow or Next.js fronts
Sig: plannedParser: planned - 3 sites observed
payloadresourcesbackend CMS for Next.js fronts
Sig: plannedParser: planned - 2 sites observed
contentstackresourcesimages.contentstack.io asset host
Sig: plannedParser: planned - 2 sites observed
prismicresourcesasset-host or API signature
Sig: plannedParser: planned - 1 sites observed
strapiresourcesbackend CMS for Next.js fronts
Sig: plannedParser: planned - 1 sites observed
datocmsresourcesasset-host signature
Sig: plannedParser: planned - 1 sites observed
storyblokresourcesasset-host signature
Sig: plannedParser: planned