Oozie
Apache Oozie is an open-source workflow scheduler system designed to manage Hadoop job orchestration. It runs as a Java web application with a REST API and a web UI, and it coordinates the execution of various Hadoop jobs across a cluster. Oozie stores workflow definitions and job state in a relational database and uses the cluster's resource manager to launch individual tasks.
Core concepts include Workflow Jobs, Coordinators, and Bundles. A Workflow Job defines a directed acyclic graph
Coordinator Jobs enable time- and data-driven scheduling. They use datasets to describe input data availability or
Oozie is designed to work with the Hadoop ecosystem, submitting actions to the ResourceManager and using Hadoop-compatible
First released as an Apache project in the late 2000s, Oozie remains a core tool for workflow