JupiterG
JupiterG is an open-source distributed computing framework designed to simplify deployment and scaling of data processing and machine learning workloads across heterogeneous clusters. It provides a unified workflow engine, a resource manager, and a data layer with data locality optimizations.
Development and history: Initiated in 2022 by the JupiterG Consortium, a collaboration of academia and industry.
Architecture: JupiterG is built around a central scheduler, multiple worker nodes or pods, a distributed in-memory
Features: The framework provides dynamic resource management, automatic fault recovery, task checkpointing, and streaming data support.
Use and reception: JupiterG is used in academic research and enterprise environments for large-scale data pipelines