Web Scraper (Static HTML)

Fetches a page's static HTML and extracts fields with CSS selectors (via Cheerio). It does not execute JavaScript.

Type: ACTION · Handler: BUILT_IN · Category: Network

Note: For SPAs and client-rendered pages, use Web Scraper (Browser) once it is available. It depends on a Playwright container and is currently deferred.


Selectors config

selectors is a JSON object. Each key becomes an output field in results.

Simple (text from first match):

{
  "title": "h1",
  "price": ".product-price"
}

Advanced rule:

{
  "link": {
    "selector": "a.product-link",
    "extract": "attr",
    "attr": "href",
    "multiple": true
  }
}

extract   Result
text      Normalized text (default)
html      Inner HTML
attr      Attribute value (name given by the attr field, default href)

Set "multiple": true to return an array of all matches.
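
The shorthand and advanced forms can be treated as the same rule once the defaults are filled in. The sketch below shows one way to do that in TypeScript. The type names and the normalizeSelectors helper are hypothetical, and the logic is an assumption based on the defaults described above (text extraction, href for attr, first match unless multiple is set), not the scraper's actual code.

```typescript
// Hypothetical types: the field names mirror the config above, but the
// normalization logic is an illustration, not the scraper's implementation.
type ExtractMode = "text" | "html" | "attr";

interface SelectorRule {
  selector: string;
  extract?: ExtractMode;
  attr?: string;
  multiple?: boolean;
}

type SelectorsConfig = Record<string, string | SelectorRule>;

interface NormalizedRule {
  selector: string;
  extract: ExtractMode;
  attr?: string; // only set when extract is "attr"
  multiple: boolean;
}

function normalizeSelectors(
  config: SelectorsConfig
): Record<string, NormalizedRule> {
  const out: Record<string, NormalizedRule> = {};
  for (const [key, rule] of Object.entries(config)) {
    // A bare string is shorthand for { selector: <string> }.
    const r: SelectorRule = typeof rule === "string" ? { selector: rule } : rule;
    const extract: ExtractMode = r.extract ?? "text";
    const normalized: NormalizedRule = {
      selector: r.selector,
      extract,
      multiple: r.multiple ?? false, // assumed default: first match only
    };
    if (extract === "attr") {
      normalized.attr = r.attr ?? "href";
    }
    out[key] = normalized;
  }
  return out;
}

const rules = normalizeSelectors({
  title: "h1",
  link: { selector: "a.product-link", extract: "attr", multiple: true },
});
console.log(rules);
```

Here title expands to a text rule that returns the first match, while link keeps its explicit settings and picks up href as the attribute name.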


Configuration

Field            Description
url              Page URL
selectors        JSON map (required)
method           GET or POST
userAgent        Custom User-Agent
timeoutMs        Request timeout in milliseconds (default 30000)
followRedirects  Default true
includeHtml      Include full page HTML in output
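
As a rough illustration of how these fields fit together, the TypeScript sketch below checks a config object before use and fills in the two documented defaults. The ScraperConfig interface, checkConfig, and withDefaults are hypothetical names. The validation rules are assumptions drawn from the table (selectors required, method limited to GET or POST), and treating url as required is a further assumption.

```typescript
// Hypothetical shape; field names follow the configuration table.
interface ScraperConfig {
  url: string;
  selectors: Record<string, unknown>;
  method?: "GET" | "POST";
  userAgent?: string;
  timeoutMs?: number;
  followRedirects?: boolean;
  includeHtml?: boolean;
}

// Returns a list of human-readable problems; an empty list means the
// config passed these (assumed) checks.
function checkConfig(raw: Record<string, unknown>): string[] {
  const problems: string[] = [];
  const { url, selectors, method } = raw;
  if (typeof url !== "string" || url.trim() === "") {
    problems.push("url must be a non-empty string");
  }
  if (
    typeof selectors !== "object" ||
    selectors === null ||
    Array.isArray(selectors)
  ) {
    problems.push("selectors is required and must be a JSON object");
  }
  if (method !== undefined && method !== "GET" && method !== "POST") {
    problems.push("method must be GET or POST");
  }
  return problems;
}

// Applies only the defaults the table states explicitly.
function withDefaults(cfg: ScraperConfig): ScraperConfig {
  return {
    ...cfg,
    timeoutMs: cfg.timeoutMs ?? 30000,
    followRedirects: cfg.followRedirects ?? true,
  };
}

const okProblems = checkConfig({
  url: "https://example.com/products",
  selectors: { title: "h1" },
});
const badProblems = checkConfig({
  url: "https://example.com/products",
  method: "PUT",
});
const filled = withDefaults({
  url: "https://example.com/products",
  selectors: { title: "h1" },
});
console.log(okProblems, badProblems, filled);
```

The defaults for method and includeHtml are not stated in the table, so the sketch leaves them unset rather than guessing.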

Outputs

Field       Description
results     Map of keys → string or string[]
statusCode  HTTP status
finalUrl    URL after redirects
html        Full HTML (if includeHtml)
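
Because each entry in results can be a single string or an array of strings, downstream code usually has to handle both. The TypeScript sketch below is one way to do that. The helper names asList and firstOrNull are hypothetical, and the sample values are made up for illustration.

```typescript
// A results entry is a string (single match) or string[] (multiple: true).
type ResultValue = string | string[];
type Results = Record<string, ResultValue>;

// Always returns an array, whatever shape the entry has.
function asList(value: ResultValue | undefined): string[] {
  if (value === undefined) return [];
  return Array.isArray(value) ? value : [value];
}

// Returns the first value, or null when the key is absent or empty.
function firstOrNull(value: ResultValue | undefined): string | null {
  return asList(value)[0] ?? null;
}

// Made-up sample shaped like the results output field.
const sample: Results = {
  title: "Example product",
  link: ["/p/1", "/p/2"],
};

const title = firstOrNull(sample["title"]);
const links = asList(sample["link"]);
const missing = asList(sample["missing"]);
console.log(title, links, missing);
```

Normalizing to an array first keeps later steps, such as a condition on the number of matches, independent of whether a rule set multiple.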

Example chain

timer_trigger → web_scraper → if_condition → discord_send_message

Pair with HTML to Markdown or AI Summarizer for article text.

Last updated: May 10, 2026