Crawlingu
Crawlingu is a hypothetical domain‑specific language for scripting web crawling tasks. It is designed to express, in a readable and platform‑neutral form, which pages to visit, how to fetch them, how to extract data, and how to route results to storage or downstream processors. As a DSL, Crawlingu combines declarative rules with lightweight procedural actions and typically supports selectors modeled after CSS or XPath to identify data within HTML documents.
Its core concepts include seeds (starting URLs), scope and politeness settings (allowed domains, crawl depth, request
Crawlingu syntax is described as a series of statements grouped into named sections, such as seeds, crawl, parse, and store.
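Because Crawlingu has no published grammar, the section structure can only be illustrated indirectly. The following Python sketch models the four sections named above as fields of a single record and shows how a scope rule might be evaluated. Every name here (CrawlSpec, in_scope, allowed_domains, max_depth, the example selectors and the results.jsonl target) is a hypothetical choice made for illustration, not part of any Crawlingu definition.

```python
from dataclasses import dataclass, field
from urllib.parse import urlparse


@dataclass
class CrawlSpec:
    """Hypothetical in-memory form of a Crawlingu script, grouped by section."""
    seeds: list[str]                  # seeds section: starting URLs
    allowed_domains: set[str]         # crawl section: scope (assumed setting)
    max_depth: int                    # crawl section: depth limit (assumed setting)
    parse: dict[str, str] = field(default_factory=dict)  # parse section: field name -> selector
    store: str = "stdout"             # store section: where results are routed


def in_scope(spec: CrawlSpec, url: str, depth: int) -> bool:
    """Return True if url may be fetched at the given depth under spec."""
    if depth > spec.max_depth:
        return False
    host = urlparse(url).hostname or ""
    # Accept an allowed domain itself or any of its subdomains.
    return any(host == d or host.endswith("." + d) for d in spec.allowed_domains)


# Illustrative values only; example.com is a reserved placeholder domain.
spec = CrawlSpec(
    seeds=["https://example.com/"],
    allowed_domains={"example.com"},
    max_depth=2,
    parse={"title": "h1", "links": "a[href]"},
    store="results.jsonl",
)
```

Keeping the specification as plain data, separate from the function that interprets it, mirrors the declarative style the DSL is described as having: the same record could be handed to different fetchers or storage back ends without change.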
Status: Crawlingu remains a conceptual construct used in discussions of DSLs for web crawling. There is no
See also: web scraping, domain‑specific language, data extraction, CSS selectors, XPath, crawling politeness.