Datafilterx
Datafilterx is an open-source software library designed to filter, transform, and route structured data within data processing pipelines. It provides a declarative approach to expressing filtering logic and aims to be fast, extensible, and easy to integrate with existing workflows.
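To make the idea of declarative filtering concrete, here is a minimal sketch in plain Python. It does not use Datafilterx's own filter language or API, which this article does not specify; the spec format (`all`, `any`, `not`, `field`/`op`/`value`) and the names `compile_filter` and `apply_filter` are hypothetical and chosen only for illustration. The filter is described as data, compiled once into a predicate, and then applied to records.

```python
import operator
from typing import Any, Callable, Dict, Iterable, Iterator

Record = Dict[str, Any]
Predicate = Callable[[Record], bool]

# Hypothetical operator names for this sketch; not Datafilterx syntax.
OPS: Dict[str, Callable[[Any, Any], bool]] = {
    "eq": operator.eq,
    "ne": operator.ne,
    "gt": operator.gt,
    "ge": operator.ge,
    "lt": operator.lt,
    "le": operator.le,
}


def compile_filter(spec: Dict[str, Any]) -> Predicate:
    """Turn a nested dict describing a filter into a callable predicate."""
    if "all" in spec:
        parts = [compile_filter(s) for s in spec["all"]]
        return lambda rec: all(p(rec) for p in parts)
    if "any" in spec:
        parts = [compile_filter(s) for s in spec["any"]]
        return lambda rec: any(p(rec) for p in parts)
    if "not" in spec:
        inner = compile_filter(spec["not"])
        return lambda rec: not inner(rec)

    # Leaf condition: compare one field against a constant.
    field = spec["field"]
    op = OPS[spec["op"]]
    value = spec["value"]

    def leaf(rec: Record) -> bool:
        # A record that lacks the field never matches.
        return field in rec and op(rec[field], value)

    return leaf


def apply_filter(records: Iterable[Record], spec: Dict[str, Any]) -> Iterator[Record]:
    """Lazily yield the records that satisfy the spec."""
    predicate = compile_filter(spec)
    return (rec for rec in records if predicate(rec))


if __name__ == "__main__":
    spec = {
        "all": [
            {"field": "country", "op": "eq", "value": "DE"},
            {"field": "amount", "op": "gt", "value": 100},
        ]
    }
    records = [
        {"country": "DE", "amount": 250},
        {"country": "DE", "amount": 50},
        {"country": "FR", "amount": 500},
    ]
    print(list(apply_filter(records, spec)))
    # prints [{'country': 'DE', 'amount': 250}]
```

Because `apply_filter` returns a generator over any iterable, the same compiled predicate can be applied to an in-memory list or to records arriving one at a time. Keeping the filter as data rather than code is what lets it be stored, inspected, or generated by other tools.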
The library offers core features such as a declarative filter language, support for streaming and batch processing,
The architecture centers on a lightweight core, an expression compiler, and a collection of pluggable adapters
Datafilterx can be used as a standalone library or embedded in ETL tools and data platforms. It
Common use cases include data cleansing and filtering, content-based routing of records, deduplication, and transformation of
The project is maintained by an international open-source community with ongoing releases, documentation, and user-contributed examples.