Scrape reference
Fetch, extract, and crawl with optional JS rendering and geo routing.
Endpoints
| Method | Path | Scope | Description |
|---|---|---|---|
| POST | /v1/scrape/fetch | scrape:fetch | Fetch a single URL. |
| POST | /v1/scrape/extract | scrape:extract | Fetch a URL and extract a structured object with an LLM. |
| POST | /v1/scrape/crawl | scrape:crawl | Start an async crawl from a seed URL. |
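
As a quick illustration of the scope column, the sketch below checks which scopes a key still lacks for a set of endpoints. The mapping is copied from the table; the helper name `missing_scopes` and the idea of passing granted scopes as a list are assumptions made for this example, not part of the API.

```python
# Required scope per endpoint, copied from the Endpoints table.
REQUIRED_SCOPE = {
    "/v1/scrape/fetch": "scrape:fetch",
    "/v1/scrape/extract": "scrape:extract",
    "/v1/scrape/crawl": "scrape:crawl",
}


def missing_scopes(paths, granted):
    """Return the sorted scopes needed for `paths` that are not in `granted`.

    Hypothetical helper: how you learn which scopes a key holds is not
    covered by this reference, so `granted` is supplied by the caller.
    """
    needed = {REQUIRED_SCOPE[path] for path in paths}
    return sorted(needed - set(granted))
```

For example, a key holding only `scrape:fetch` would still need `scrape:crawl` before it could start a crawl.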
Quickstart
curl -X POST https://api.valienz.io/v1/scrape/fetch \
  -H "Authorization: Bearer $VALIENZ_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com",
    "format": "html",
    "rendered": false
  }'
Notes
JS rendering
Set "rendered": true for pages that build their content with client-side JavaScript, such as single-page apps. Rendering adds a few seconds of latency and costs more credits than a plain fetch.
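
A minimal Python sketch of the Quickstart request with rendering switched on, using only the fields shown there (`url`, `format`, `rendered`). The helper names `build_fetch_request` and `fetch` are illustrative and not part of any official SDK, the 60-second timeout is an assumption chosen to leave room for rendering latency, and the response is returned as raw bytes because its schema is not described here.

```python
import json
import os
import urllib.request

API_URL = "https://api.valienz.io/v1/scrape/fetch"  # path from the Endpoints table


def build_fetch_request(url, api_key, rendered=False):
    """Build, but do not send, a POST request for /v1/scrape/fetch."""
    payload = {"url": url, "format": "html", "rendered": rendered}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def fetch(url, rendered=False, timeout=60.0):
    """Send the request and return the raw response body.

    ASSUMPTION: the timeout value is a guess; tune it to your own pages.
    """
    req = build_fetch_request(url, os.environ["VALIENZ_API_KEY"], rendered)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read()
```

Calling `fetch("https://example.com", rendered=True)` would send the rendered variant; leaving `rendered` at its default matches the Quickstart.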
Sandboxed previews
The dashboard's HTML preview iframe uses an empty sandbox attribute (sandbox=""), which applies every sandbox restriction, so scraped pages cannot execute scripts or act within your dashboard origin.
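
If you display scraped HTML in your own tooling, the same isolation can be approximated with a fully sandboxed iframe. The sketch below is a hypothetical helper, not the dashboard's actual code: it embeds the page through the standard `srcdoc` attribute and escapes the markup so it cannot break out of the attribute value.

```python
import html


def sandboxed_preview(scraped_html):
    """Wrap scraped HTML in an iframe with every sandbox restriction on.

    Hypothetical helper for illustration only. sandbox="" (no tokens)
    disables scripts and forms and gives the content an opaque origin.
    """
    # Escape &, <, >, and both quote characters for use in an attribute.
    escaped = html.escape(scraped_html, quote=True)
    return f'<iframe sandbox="" srcdoc="{escaped}"></iframe>'
```

The browser decodes the escaped `srcdoc` value back into the original markup, but renders it inside the sandbox rather than in the embedding page.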
Errors
| Code | HTTP | Description | Resolution |
|---|---|---|---|
| E_TARGET_BLOCKED | 403 | Target host on the platform deny-list. | Contact support if you need an allowance for a specific host. |
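
A client can treat E_TARGET_BLOCKED as a permanent failure rather than something to retry. The sketch below assumes error responses are JSON objects with a top-level `code` field; that body shape is not documented here, so adjust the lookup to match what the API actually returns. The names `TargetBlockedError` and `raise_for_scrape_error` are illustrative.

```python
import json


class TargetBlockedError(Exception):
    """The target host is on the platform deny-list (E_TARGET_BLOCKED)."""


def raise_for_scrape_error(status, body):
    """Raise an exception for an error response; return None on success.

    ASSUMPTION: error bodies look like {"code": "E_TARGET_BLOCKED", ...}.
    """
    if status < 400:
        return
    try:
        parsed = json.loads(body.decode("utf-8"))
        code = parsed.get("code") if isinstance(parsed, dict) else None
    except ValueError:
        # Covers both invalid JSON and invalid UTF-8.
        code = None
    if status == 403 and code == "E_TARGET_BLOCKED":
        # Retrying will not help; the resolution is to contact support.
        raise TargetBlockedError("target host is on the platform deny-list")
    raise RuntimeError(f"scrape request failed: HTTP {status}, code={code}")
```

Keeping the deny-list case as its own exception type lets callers skip retry logic for it while still handling other failures generically.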