DetX
DetX is a modular software framework designed to support automated data extraction, transformation, and validation in information systems. It emphasizes determinism and reproducibility, offering a domain-specific language and a plugin architecture to build end-to-end data pipelines that can be versioned and audited.
The core comprises four layers: connectors, processors, orchestrator, and sink adapters. Connectors ingest from databases, files,
Key features include schema inference and evolution, data lineage and provenance, reproducible runs via configuration hash/versioning,
Applications include data integration and quality assurance, compliance auditing, and supply chain transparency, where reproducibility and
Limitations include a learning curve for the DSL and potential overhead for very large pipelines, along with
See also ETL, data provenance, data lineage, reproducible research.