Batch Systems Comparison

| Functionality | Torque/Maui | SLURM | HTCondor | USGE/SoGE | LSF |
| Number of sites | 92 | 14 | 17 | 14 | 6 |
| Scalability | no | Debated. Probably depends on the hardware type (HPC vs HTC) as much as on the configuration of the DB and plug-ins. | yes | yes | 6500 nodes |
| Getting arguments from CE | ARC-CE/Cream if site writes blah script | ARC-CE | ARC-CE | ARC-CE/Cream if site writes blah script | ARC-CE/Cream if site writes blah script |
| Info System support | Cream | Cream | no | Cream | Cream |
| Command line tools | custom commands | custom commands, PBS-like wrappers | custom commands | custom commands | custom commands |
| Cream integration | partial | partial | partial | full | full |
| ARC-CE integration | full | full | full | full | full |
| Partitioning | reservations | yes, can overlap | fully configurable | fully configurable | fully configurable |
| Limits | configurable | standard + custom | fully configurable | fully configurable | fully configurable |
| Documentation | Good | Good | Good | less than satisfying/? | Good |
| High Availability | no | head node failover | central manager & job queue failover | head node failover | head node failover |
| Stability | low | high | high | high | high |
| Licenses/Costs | free | free | free | Univa/free | IBM |
| Community Support | yes for torque | yes through slurm-dev list | yes through the HTCondor-users list | no/? | no |
| Requires DB | no | yes for advanced features | no | optional | no |
| Requires shared FS | no | yes according to the manual | no | yes according to the manual but files can also be distributed across the cluster | no |
| Distribution format | source, rpm | source | source, rpm, tarball | tarball | tarball |
| Priorities | configurable | based on fairshare | job, user & group priorities | yes | user, group, queue priorities |
| Community puppet module | yes | no | yes | no | wrote our own |
| Developer Support | maui not supported | yes | yes | yes | yes |
| APEL support | yes | yes | yes | yes | yes |
| IPv6 support | no | no | yes, with limitations | no | yes |
| Installation easiness | yes without DB | yes | yes | yes | yes |
| Configuration easiness |  | yes with puppet | yes | yes | yes |
| Efficient Backfilling | tunable | tunable | not out-of-the-box, but similar behaviour can probably be configured | yes | yes |
| Queues | yes | no | can use accounting groups to produce similar functionality | yes | yes |
| Fairshares | yes | yes | yes | yes | yes |
| Hierarchical Fairshares |  | yes | yes | yes | yes |
| Wholenode/multicore | yes | yes | yes | yes | yes |
| Multicluster | no | yes | yes (flocking, job router) | yes | yes |
| Cgroup support | yes (>6.0.0) | full support | cpu & memory | yes (>8.2.0) | yes (>9.1.1) |
| Min Num daemons | 2 Master / 1 WNs | 1 Master / 1 WNs (without DB for fairshares) | 4 Master / 2 WNs | 1...3 Master / 1 WNs | 10 master / 5 WNs |
| Min Num config files | 3 on head, 1 on WN | 1 shared (+DB for fairshares) | 2 default, can be split in /etc/condor/config.d | DB or a bunch of files managed with GE tools | 10 on master, 1 on WN |

Number of sites: an approximate number extracted from the BDII by summing T1 and T2 sites, with no distinction between varieties (SGE*, PBS*). Last updated on 27/01/2016.
Info System support: see the IS responsibility matrix.
Command line tools: see the comparison of commands across the different batch systems (HTCondor missing).
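
The counting described in the Number of sites note (sum T1 and T2, treat varieties such as SGE* and PBS* as one system) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `(site, tier, lrms_name)` record layout is hypothetical and is not the BDII schema, the choice to count each site once per family is a guess at how the figures were produced, and the example records are made up.

```python
from collections import Counter

# Assumed family prefixes; only SGE* and PBS* are named in the note.
FAMILY_PREFIXES = ("sge", "pbs")


def family(lrms_name):
    """Collapse varieties such as 'sge-variant' or 'pbspro' into one label."""
    name = lrms_name.strip().lower()
    for prefix in FAMILY_PREFIXES:
        if name.startswith(prefix):
            return prefix
    return name


def count_sites(records):
    """Count distinct T1/T2 sites per batch-system family.

    `records` is an iterable of (site, tier, lrms_name) tuples.
    This layout is hypothetical, not the real BDII schema.
    """
    seen = set()
    counts = Counter()
    for site, tier, lrms_name in records:
        if tier not in ("T1", "T2"):
            continue  # only T1 and T2 are summed
        key = (site, family(lrms_name))
        if key not in seen:  # assumption: a site counts once per family
            seen.add(key)
            counts[key[1]] += 1
    return counts


# Made-up records for illustration only.
example = [
    ("site-a", "T1", "sge"),
    ("site-a", "T1", "SGE-variant"),
    ("site-b", "T2", "pbs"),
    ("site-c", "T2", "pbspro"),
    ("site-d", "T3", "slurm"),
    ("site-e", "T2", "slurm"),
]
print(count_sites(example))
```

With these made-up records, `site-a` is counted once under `sge` even though it publishes two varieties, and `site-d` is skipped because it is neither T1 nor T2.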

Other comparison documents

CESGA Torque/MAUI, SGE, SLURM comparison document

-- AlessandraForti - 13 Mar 2014

Topic revision: r35 - 2016-05-27 - AlessandraForti