HTCondor

HTCondor is an open-source workload management system designed to manage large-scale compute tasks across clusters and distributed resources. It is used for high-throughput computing, where many jobs with varying runtimes are submitted and executed opportunistically on idle resources. HTCondor supports heterogeneous environments, checkpointing, and fault tolerance, making it suitable for scientific computing, data analysis, and academic research workflows.

Core components include the central manager, which hosts the collector and negotiator, and coordinates resource advertisements

HTCondor provides user-facing tools such as condor_q, condor_status, condor_rm, and condor_submit for job management, monitoring, and

a

a

a

Wisconsin–Madison,

a

high-throughput