YAIM cluster configuration: phase 1
This wiki page describes the steps needed to test the new yaim cluster module that contains the configuration of the Glue cluster and Glue subcluster entities.
The relevant yaim modules that are needed to test the new cluster configuration are:
- yaim lcg ce: this module has been modified to include new variables and to remove the code that configures the Glue cluster and Glue subcluster entities. A branch has been created in CVS
glite-yaim-lcg-ce_branch_cluster_configuration
to include these changes.
- yaim cluster: this is a new module that contains the configuration of the Glue cluster and Glue subcluster entities.
- yaim torque server: it hasn't been changed. This implies that some new and old variables will have to coexist for some time.
- yaim torque utils: it hasn't been changed but it's not affected by the new cluster configuration.
Installation instructions
In order to test the new cluster configuration, you can install the following metapackages:
lcg-CE
glite-TORQUE_server
glite-TORQUE_utils
Optionally, you can also install a
glite-BDII
, if you want to run a site
BDII.
Then you should upgrade the glite-yaim-lcg-ce rpm by running:
rpm -U /afs/cern.ch/project/gd/www/yaim/testing/glite-yaim-lcg-ce-4.0.5-1.noarch.rpm
And install the new cluster configuration yaim module by running:
rpm -i /afs/cern.ch/project/gd/www/yaim/testing/glite-yaim-cluster-1.0.0-1.noarch.rpm
Configuration instructions
Since there's a set of new variables, you would need to change your usual site-info.def:
lcg CE
Mandatory variables for the lcg CE: You'll find them under
/opt/glite/yaim/examples/services/lcg-ce
:
The new variable names follow this syntax:
- In general, variables based on hostnames, queues or VOViews containing '.' and '_' # should be transformed into '-'
- <host-name>: identifier that corresponds to the CE hostname in lower case. Example: ctb-generic-1.cern.ch -> ctb_generic_1_cern_ch
- <queue-name>: identifier that corresponds to the queue in upper case. Example: dteam -> DTEAM
- <voview-name>: identifier that corresponds to the VOView id in upper case. '/' and '=' should also be transformed into '_'. Example: /dteam/Role=admin -> DTEAM_ROLE_ADMIN
Variable Name |
Description |
Value type |
Version |
CE_HOST_<host-name>_CLUSTER_UniqueID |
UniqueID of the cluster the CE belongs to |
string |
glite-yaim-lcg-ce 4.0.5-1 |
CE_InfoApplicationDir |
Prefix of the experiment software directory in a site. This variable has been renamed in the new infosys configuration. The old variable name was: VO_SW_DIR . This parameter can be defined per CE, queue, site or voview. See /opt/glite/yaim/examples/services/lcg-ce for examples. |
string |
glite-yaim-lcg-ce 4.0.5-1 |
The following variables will be distributed in the future in site-info.def since they affect other yaim modules. At this moment we are in a transition face to migrate to the new variable names.
Variable Name |
Description |
Value type |
Version |
CE_HOST_<host-name>_CE_TYPE |
CE type: 'jobmanager' for lcg CE and 'cream' for cream CE |
string |
glite-yaim-lcg-ce 4.0.5-1 |
CE_HOST_<host-name>_QUEUES |
Space separated list of the queue names configured in the CE. This variable has been renamed in the new infosys configuration. The old variable name was: QUEUES |
string |
glite-yaim-lcg-ce 4.0.5-1 |
CE_HOST_<host-name>_QUEUE_<queue-name>_CE_AccessControlBaseRule |
Space separated list of FQANS and/or VO names which are allowed to access the queues configured in the CE. This variable has been renamed in the new infosys configuration. The old variable name was: _GROUP_ENABLE |
string |
glite-yaim-lcg-ce 4.0.5-1 |
CE_HOST_<host-name>_CE_InfoJobManager |
The name of the job manager used by the gatekeeper. This variable has been renamed in the new infosys configuration. The old variable name was: JOB_MANAGER . Please, define: lcgpbs, lcglfs, lcgsge or lcgcondor |
string |
glite-yaim-lcg-ce 4.0.5-1 |
Default variables for the lcg CE: You'll find them under
/opt/glite/yaim/defaults/lcg-ce.pre
:
It contains a list of
CE_*
variables with some default values. This variables can also be specified per CE-queue-voview or CE-queue, depending if we want that a certain VOVIEW has a specific value for a certain variable, or if we want that all the VOVIEWS of a queue share a specific value. For example, if I define in site-info.def:
# In the CE vtb-generic-17.cern.ch, in the queue dteam, in the VOVIEW dteam, I want that the default value StateWaitingJobs is 666666
CE_HOST_vtb_generic_17_cern_ch_QUEUE_DTEAM_VOVIEW_DTEAM_CE_StateWaitingJobs=666666
Or I can also define:
# In the CE vtb-generic-17.cern.ch, in the queue dteam, in all the supported VOVIEWS, I want that the default value StateWaitingJobs is 666666
CE_HOST_vtb_generic_17_cern_ch_QUEUE_DTEAM_CE_StateWaitingJobs=666666
If I don't define anything, the default value for the whole CE, defined in
/opt/glite/yaim/defaults/lcg-ce.pre
, is taken.
The variables that can be redefined per CE-queue are:
CE_VAR="
ImplementationName
ImplementationVersion
InfoGatekeeperPort
InfoLRMSType
InfoLRMSVersion
InfoJobManager
InfoApplicationDir
InfoDataDir
InfoDefaultSE
InfoTotalCPUs
StateEstimatedResponseTime
StateRunningJobs
StateStatus
StateTotalJobs
StateWaitingJobs
StateWorstResponseTime
StateFreeJobSlots
StateFreeCPUs
PolicyMaxCPUTime
PolicyMaxObtainableCPUTime
PolicyMaxRunningJobs
PolicyMaxWaitingJobs
PolicyMaxTotalJobs
PolicyMaxWallClockTime
PolicyMaxObtainableWallClockTime
PolicyPriority
PolicyAssignedJobSlots
PolicyMaxSlotsPerJob
PolicyPreemption"
The variables that moreover can also be redefined per CE-queue-VOVIEW are:
VOVIEW_VAR="
StateRunningJobs
StateWaitingJobs
StateTotalJobs
StateFreeJobSlots
StateEstimatedResponseTime
StateWorstResponseTime
InfoDefaultSE
InfoApplicationDir
InfoDataDir
"
If the Glue schema supports other variables than the ones defined here, you can just add new ones, by redefining
CE_VAR
and/or
VOVIEW_VAR
in site-info.def. It's the list of variables contained in
CE_VAR
and
VOVIEW_VAR
what
YAIM uses to create the ldif file.
Cluster
Mandatory variables for the cluster: You'll find them under
/opt/glite/yaim/examples/services/glite-cluster
:
The new variable names follow this syntax:
- In general, variables based on hostnames, queues or VOViews containing '.' and '_' # should be transformed into '-'
- <host-name>: identifier that corresponds to the CE hostname in lower case. Example: ctb-generic-1.cern.ch -> ctb_generic_1_cern_ch
- <cluster-name>: identifier that corresponds to the cluster name in upper case. Example: my_cluster -> MY_CLUSTER
- <subcluster-name>: identifier that corresponds to the subcluster name in upper case. Example: my_subcluster -> MY_SUBCLUSTER
Variable Name |
Description |
Value type |
Version |
CLUSTERS |
Space separated list of your cluster names, Ex. "cluster1 [cluster2 [...]]" |
string list |
glite-yaim-cluster 1.0.0-1 |
CLUSTER_<cluster-name>_CLUSTER_UniqueID |
Cluster UniqueID |
string |
glite-yaim-cluster 1.0.0-1 |
CLUSTER_<cluster-name>_CLUSTER_Name |
Cluster human readable name |
string |
glite-yaim-cluster 1.0.0-1 |
CLUSTER_<cluster-name>_SITE_UniqueID |
Site name where the cluster belongs to. It should be consistent with your variable SITE_NAME. NOTE: This may be changed to SITE_UniqueID when the GlueSite is configured with the new infosys variables |
string |
glite-yaim-cluster 1.0.0-1 |
CLUSTER_<cluster-name>_CE_HOSTS |
Space separated list of CE hostnames configured in the cluster |
hostname list |
glite-yaim-cluster 1.0.0-1 |
CLUSTER_<cluster-name>_SUBCLUSTERS |
Space separated list of your subcluster names, Ex="subcluster1 [subcluster2 [...]]"= |
string list |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_SUBCLUSTER_UniqueID |
Subcluster UniqueID |
string |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_ApplicationSoftwareRunTimeEnvironment |
"sw1 [| sw2 [| ...]" old CE_RUNTIMEENV |
string list |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_ArchitectureSMPSize |
old CE_SMPSIZE |
number |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_ArchitecturePlatformType |
old CE_OS_ARCH |
string |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_BenchmarkSF00 |
old CE_SF00 |
number |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_BenchmarkSI00 |
old CE_SI00 |
number |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_MainMemoryRAMSize |
old CE_MINPHYSMEM |
number |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_MainMemoryVirtualSize |
old CE_MINVIRTMEM |
number |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_NetworkAdapterInboundIP |
old CE_INBOUNDIP |
boolean |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_NetworkAdapterOutboundIP |
old CE_OUTBOUNDIP |
boolean |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_OperatingSystemName |
old CE_OS |
OS name |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_OperatingSystemRelease |
old CE_OS_RELEASE |
OS release |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_OperatingSystemVersion |
old CE_OS_VERSION |
OS version |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_ProcessorClockSpeed |
old CE_CPU_SPEED |
number |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_ProcessorModel |
old CE_CPU_MODEL |
string |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_HOST_ProcessorVendor |
old CE_CPU_VENDOR |
string |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_SUBCLUSTER_Name |
subcluster human readable name |
string |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_SUBCLUSTER_PhysicalCPUs |
old CE_PHYSCPU |
number |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_SUBCLUSTER_LogicalCPUs |
old CE_LOGCPU |
number |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_SUBCLUSTER_TmpDir |
tmp directory |
path |
glite-yaim-cluster 1.0.0-1 |
SUBCLUSTER_<subcluster-name>_SUBCLUSTER_WNTmpDir |
WN tmp directory |
path |
glite-yaim-cluster 1.0.0-1 |
--
MariaALANDESPRADILLO - 01 Sep 2008