datapars
DataPars is a data processing and parsing framework designed for extracting structured information from unstructured or semi‑structured sources such as logs, web pages, and scientific datasets. The tool is written in Python and provides a modular architecture that allows developers to compose reusable parsers using declarative grammar specifications or programmatic extraction rules. DataPars supports common formats including CSV, JSON, XML, HTML, and custom binary protocols, and can be extended with plug‑in modules to handle proprietary data streams.
The project was initially released in 2015 by a group of open‑source contributors who identified a need
Typical use cases for DataPars involve extracting key metrics from server logs, cleaning questionnaire responses, converting
DataPars is maintained under an MIT license, with source code hosted on a public Git hosting service.