web-scrape-links
Active
Tool of The Stall
Extracts all hyperlinks from any public webpage. Returns each href normalized to an absolute URL, together with its visible link text. Filters out links that use the javascript:, mailto:, or data: schemes. Optionally restricts results to same-domain links, deduplicates them, or includes #anchor links. Useful for crawlers, sitemap builders, link graph analysis, and content audits.
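The behavior described above can be approximated locally with the Python standard library. This is only a minimal sketch and not the tool's actual implementation: the function name extract_links, the helper class, the output keys "href" and "text", and the order in which filters are applied are all assumptions made for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Schemes the description says are filtered out.
SKIPPED_SCHEMES = {"javascript", "mailto", "data"}


class _AnchorCollector(HTMLParser):
    """Collects (href, visible text) pairs for every <a href=...> element."""

    def __init__(self):
        super().__init__()
        self.links = []        # finished (href, text) tuples
        self._current = None   # [href, text_parts] while inside an <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._current = [href, []]

    def handle_data(self, data):
        if self._current is not None:
            self._current[1].append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            href, parts = self._current
            # Collapse runs of whitespace in the visible text.
            self.links.append((href, " ".join("".join(parts).split())))
            self._current = None


def extract_links(html, base_url, limit=200, deduplicate=True,
                  include_anchors=False, same_domain_only=False):
    """Hypothetical local equivalent of the documented options."""
    parser = _AnchorCollector()
    parser.feed(html)
    parser.close()

    base_host = urlparse(base_url).netloc.lower()
    seen = set()
    results = []
    for href, text in parser.links:
        href = href.strip()
        if href.startswith("#") and not include_anchors:
            continue  # anchor-only link such as "#section"
        absolute = urljoin(base_url, href)
        parsed = urlparse(absolute)
        if parsed.scheme.lower() in SKIPPED_SCHEMES:
            continue
        if same_domain_only and parsed.netloc.lower() != base_host:
            continue
        if deduplicate:
            if absolute in seen:
                continue
            seen.add(absolute)
        results.append({"href": absolute, "text": text})
        if len(results) >= limit:
            break
    return results
```

Here the same-domain check compares the host of each resolved link with the host of the input URL exactly; whether the real tool treats subdomains as the same domain is not stated in the listing.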
Parameters schema
{
  "type": "object",
  "$schema": "http://json-schema.org/draft-07/schema#",
  "properties": {
    "url": {
      "type": "string",
      "description": "URL of the webpage to extract links from (http or https)."
    },
    "limit": {
      "type": "integer",
      "description": "Maximum links to return (default 200, max 500)."
    },
    "deduplicate": {
      "type": "string",
      "description": "Pass \"false\" to allow duplicate hrefs. Default: true (each unique URL returned once)."
    },
    "include_anchors": {
      "type": "string",
      "description": "Pass \"true\" to include anchor-only links (#section). Default: false."
    },
    "same_domain_only": {
      "type": "string",
      "description": "Pass \"true\" to return only links pointing to the same domain as the input URL. Default: false."
    }
  },
  "additionalProperties": false
}
No endpoints wrapped at confidence ≥ 0.70.
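Note that the schema declares the three flags as strings, not booleans, and sets "additionalProperties" to false. As a rough sketch of how a caller might assemble a conforming arguments object, the following uses a hypothetical helper named build_arguments; the lower bound of 1 on limit is an assumption, since the schema states only the default and the maximum.

```python
import json


def build_arguments(url, limit=200, deduplicate=True,
                    include_anchors=False, same_domain_only=False):
    """Build an arguments object matching the parameters schema above.

    Hypothetical helper: the name and the Python-side defaults are
    illustrative; only the keys and value types come from the schema.
    """
    if not url.startswith(("http://", "https://")):
        raise ValueError("url must use http or https")
    # Upper bound of 500 is documented; the lower bound of 1 is assumed.
    if not isinstance(limit, int) or not 1 <= limit <= 500:
        raise ValueError("limit must be an integer between 1 and 500")

    def as_flag(value):
        # The schema types these flags as strings: "true" or "false".
        return "true" if value else "false"

    # Only schema-defined keys are emitted, since additional
    # properties are not allowed.
    return {
        "url": url,
        "limit": limit,
        "deduplicate": as_flag(deduplicate),
        "include_anchors": as_flag(include_anchors),
        "same_domain_only": as_flag(same_domain_only),
    }


if __name__ == "__main__":
    args = build_arguments("https://example.com/", limit=50,
                           same_domain_only=True)
    print(json.dumps(args, indent=2))
```

How the arguments object is delivered to the tool depends on the client and is not covered by the listing.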
Parent server
The Stall
1/7 registries