Distx
Distx is a distributed data processing framework designed to simplify the construction of scalable, data-intensive applications. It provides a unified API for both batch and streaming workloads, with built-in support for task orchestration, failure recovery, and data movement across a cluster. Distx emphasizes portability across cloud, on-premises, and hybrid environments and aims to interoperate with common data formats and messaging systems.
Its architecture typically comprises a control plane, a pool of worker processes, and a metadata store. A
Key features include a DAG-based execution model, pluggable backends for storage and compute, streaming and batch
Typical use cases include Extract-Transform-Load pipelines, feature-engineering workflows for machine learning, real-time analytics, and event-driven processing
Distx is an open-source project maintained by a community of contributors and organizations. Development follows public