Scrapy
Scrapy is an open-source, collaborative framework for extracting data from websites. Written in Python, it provides tools to define how to crawl sites, parse pages, and export structured data. It is widely used for web scraping, data mining, and automated content harvesting.
The framework is built around the Scrapy Engine and components such as Spiders, Selectors, Item Pipelines, and
Scrapy offers built-in features such as robust CSS/XPath extraction, structured data export (JSON, CSV, XML), respect
Typical use involves writing a Spider class, running it with the Scrapy command-line tool, and configuring pipelines
Scrapy is maintained as an open-source project under the BSD license. It is developed and supported by Zyte