rdoWork
rdoWork is a software framework to automate and orchestrate distributed data workflows. It targets data engineering teams building pipelines, ETL processes, and remote data operations (RDO), providing a lightweight runtime, a task orchestration layer, and pluggable connectors for cloud and on‑prem environments.
Core features include DAG-based scheduling, dependency resolution, retry and timeout policies, parallel execution, and resource-aware scheduling
Its architecture centers on a small core engine that coordinates workers and a plugin system for executors.
Tasks run in isolated environments with support for idempotent execution, comprehensive logging, and audit trails to
The project emphasizes extensibility through connectors to data stores and services such as S3, GCS, Azure
rdoWork is a community-driven open-source project released under a permissive license, with ongoing development and governance
In practice, it is used for data pipelines, migrations, and remote processing tasks. Reviewers note ease of
Within the landscape of workflow orchestration tools, rdoWork is often compared with Apache Airflow, Prefect, and