PRSM

web-scrape-links

Active

Tool of The Stall

declared in 4.82.0

Extracts all hyperlinks from a public webpage. Returns each href normalized to an absolute URL, together with its visible link text. Links using the javascript:, mailto:, and data: schemes are filtered out. By default, duplicate URLs are removed and anchor-only (#section) links are skipped; both behaviors can be changed, and results can optionally be restricted to same-domain links. Useful for crawlers, sitemap builders, link graph analysis, and content audits.
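The behavior described above can be approximated with the Python standard library. This is a minimal sketch, not the tool's actual implementation: the function name `extract_links`, the `href`/`text` result keys, and the choice to parse an HTML string rather than fetch the page are all assumptions made for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Schemes the description says are filtered out.
SKIPPED_SCHEMES = ("javascript:", "mailto:", "data:")


class _AnchorCollector(HTMLParser):
    """Collects (href, visible text) pairs from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []     # finished (href, text) pairs
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = (dict(attrs).get("href") or "").strip()
            if href:
                self._href, self._text = href, []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace in the visible text.
            text = " ".join("".join(self._text).split())
            self.links.append((self._href, text))
            self._href = None


def extract_links(html, base_url, limit=200, deduplicate=True,
                  include_anchors=False, same_domain_only=False):
    """Hypothetical re-creation of the documented filtering rules."""
    collector = _AnchorCollector()
    collector.feed(html)
    base_host = urlparse(base_url).netloc.lower()
    cap = min(limit, 500)  # documented maximum is 500
    seen, results = set(), []
    for href, text in collector.links:
        if len(results) >= cap:
            break
        if href.lower().startswith(SKIPPED_SCHEMES):
            continue
        if href.startswith("#") and not include_anchors:
            continue
        absolute = urljoin(base_url, href)  # normalize to an absolute URL
        if same_domain_only and urlparse(absolute).netloc.lower() != base_host:
            continue
        if deduplicate:
            if absolute in seen:
                continue
            seen.add(absolute)
        results.append({"href": absolute, "text": text})
    return results
```

Comparing hosts with `netloc` treats subdomains as different domains; whether the real tool does the same is not stated in this listing.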

Parameters schema

{
  "type": "object",
  "$schema": "http://json-schema.org/draft-07/schema#",
  "properties": {
    "url": {
      "type": "string",
      "description": "URL of the webpage to extract links from (http or https)."
    },
    "limit": {
      "type": "integer",
      "description": "Maximum links to return (default 200, max 500)."
    },
    "deduplicate": {
      "type": "string",
      "description": "Pass \"false\" to allow duplicate hrefs. Default: true (each unique URL returned once)."
    },
    "include_anchors": {
      "type": "string",
      "description": "Pass \"true\" to include anchor-only links (#section). Default: false."
    },
    "same_domain_only": {
      "type": "string",
      "description": "Pass \"true\" to return only links pointing to the same domain as the input URL. Default: false."
    }
  },
  "additionalProperties": false
}
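Note that the three flags are typed as strings ("true" / "false"), not JSON booleans, and that the schema rejects unknown properties. The helper below is a hedged sketch of how a caller might build a conforming arguments object; `build_arguments` is a hypothetical name, and the lower bound of 1 on `limit` is an assumption, since the schema states only the maximum.

```python
from urllib.parse import urlparse


def build_arguments(url, limit=None, deduplicate=None,
                    include_anchors=None, same_domain_only=None):
    """Build an arguments dict matching the schema above (illustrative)."""
    if urlparse(url).scheme not in ("http", "https"):
        raise ValueError("url must use http or https")
    args = {"url": url}

    if limit is not None:
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(limit, bool) or not isinstance(limit, int):
            raise ValueError("limit must be an integer")
        if limit < 1 or limit > 500:  # lower bound of 1 is an assumption
            raise ValueError("limit must be between 1 and 500")
        args["limit"] = limit

    flags = {
        "deduplicate": deduplicate,
        "include_anchors": include_anchors,
        "same_domain_only": same_domain_only,
    }
    for name, value in flags.items():
        if value is not None:
            # The schema expects the strings "true"/"false", not booleans.
            args[name] = "true" if value else "false"
    return args
```

Omitted parameters are left out of the result entirely, so the tool's documented defaults apply.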

What this tool wraps · 0 endpoints


No endpoints wrapped at confidence ≥ 0.70.

Parent server

The Stall

1/7 registries