Scrape

Scrape reference

Fetch, extract, and crawl with optional JS rendering and geo routing.

Endpoints

MethodPathScopeDescription
POST/v1/scrape/fetchscrape:fetchFetch a single URL.
POST/v1/scrape/extractscrape:extractFetch + LLM-extract a structured object.
POST/v1/scrape/crawlscrape:crawlStart an async crawl from a seed URL.

Quickstart

curl -X POST https://api.valienz.io/v1/scrape/fetch \
  -H "Authorization: Bearer $VALIENZ_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com",
    "format": "html",
    "rendered": false
  }'

Notes

  • JS rendering

    Set "rendered": true for SPA-heavy pages. Adds a few seconds of latency and costs more credits.

  • Sandboxed previews

    The dashboard's HTML preview iframe uses sandbox="" so scraped pages cannot execute scripts in your dashboard origin.

Errors

CodeHTTPDescriptionResolution
E_TARGET_BLOCKED403Target host on the platform deny-list.Contact support if you need an allowance for a specific host.