Worker Node Configuration Database
Message Format
The message format complies to the
GridMonitoringProbeSpecification. Additional the key-value pair
testName
has always to be present:
testName:<name of the test>
Besides there are a couple of common key-value pairs. Most of them will created automatically:
key |
auto |
description |
format |
example |
testId |
x |
Unique id for each test |
32 hex |
7dd4984b03115b34f7993439055f622f |
testVersion |
- |
Version of the test |
Major.Minor |
1.0 |
hostName |
x |
Host name of the WN |
URI |
lxbra2108.cern.ch |
hostId |
x |
Unique id for each host |
32 hex |
d965877f592d3fdf85d43a28e9dc8380 |
pyamqId |
x |
A unique id for each execution of the client |
32 hex |
30ff8d0689a5b23b0024bcf9746ceeb7 |
pyamqEncrypt |
- |
States whether a test was/will be encrypted |
0 or 1 |
1 |
executionFile |
- |
Path to the file which was/will be executed |
file path |
/path/to/test/system_os |
executionArguments |
- |
Arguments which will be passed to the execution file |
arg1 arg2 ... |
arg1 |
executionCommand |
x |
The command which was executed to get the test results |
command |
/path/to/test/system_os arg1 |
executionStarttime |
x |
Start time of a test |
Timestamp |
1234972037.6445301 |
executionEndtime |
x |
End time of a test |
Timestamp |
1234972037.650563 |
executionTime |
x |
Difference between start and end time |
Float |
0.0060329437255859375 |
summary |
- |
A short summary of the test used by the servers WebAdmin |
Short String |
ok |
The key
detailsdata
is a special key. It must, if it exists, be the final line in the message. It usually contains log messages of the test, e.g.:
detailsdata:java version could not be gathered
java not found, try JAVA_HOME: /usr/java/jdk1.5.0_16
/usr/java/jdk1.5.0_16/bin/java: No such file or directory
exitcode:127
Tests
Scheduling
- Interval Scheduling
- Run tests after a certain timeout is exeeded. (e.g 1 day)
- Probability Scheduling
- Run tests randomly with a certain probability. (e.g 0.2 means 20%)
Test Descriptions
This is a list of all tests, their type and a short description what they are doing. (besides internal tests)
nane |
type |
description |
wn_grid_afs |
Python |
Checks whether /afs/cern.ch/ is mounted. |
wn_grid_dteamdir |
Python |
Checks availability of the DTEAM Software Directory |
wn_grid_gfal |
Bash |
GFAL version |
wn_grid_glite |
Bash |
gLite version |
wn_grid_infra |
Python |
GlueSiteUniqueId , GlueCEUniqueId , JobManagerType , BatchQueue |
wn_grid_job |
Python |
JobId and JobContact |
wn_grid_vo |
Bash |
VoName , VomsCapabilities , VomsGroup , VomsRole of the user |
wn_grid_wn_info |
Bash |
GlueCEUniqueClusterId (dedicated to glite-wn-info ) |
wn_software_java |
Bash |
Java version |
wn_software_perl |
Bash |
Perl version |
wn_software_python |
Python |
Python version |
wn_software_python_openssl |
Python |
Checks whether the python module OpenSSL is installed |
wn_system_cpu |
Bash |
ProcessesTotal , ProcessesRunning , ProccesorCount , ProcessorCores , ProcessorLoad1 , ProcessorLoad5 , ProcessorLoad15 , ProcessorClockSpeed , ProcessorVendor |
wn_system_env |
Python |
Retrieves the system environment by calling setenv |
wn_system_memory |
Bash |
HostMainMemoryRAMFree , GlueHostMainMemoryRAMSize , HostMainMemoryVirtualFree , GlueHostMainMemoryVirtualSize in MB |
wn_system_network |
Bash |
Ping to cern.ch in milliseconds |
wn_system_os |
Bash |
GlueHostOperatingSystemName , GlueHostOperatingSystemRelease , GlueHostOperatingSystemVersion , GlueHostArchitecturePlatformType |
wn_system_timeoffset |
Bash |
Time offset using ntpdate in seconds |
Deployment Status
These are the test which are currently deployed in PPS:
name |
old alias |
probability |
interval |
wn_grid_afs |
grid_sfs |
1 |
- |
wn_grid_dteamdir |
grid_dteamdir |
1 |
- |
wn_grid_glite |
grid_glite |
1 |
- |
wn_grid_infra |
grid_infra |
1 |
- |
wn_grid_job |
grid_job |
1 |
- |
wn_grid_vo |
grid_vo |
1 |
- |
wn_software_java |
version_java |
0.2 |
- |
wn_software_perl |
version_perl |
0.2 |
- |
wn_software_python |
version_python |
1 |
- |
wn_software_python_openssl |
python_openssl |
1 |
- |
wn_system_cpu |
system_cpu |
1 |
- |
wn_system_env |
system_env |
1 |
- |
wn_system_memory |
system_mem |
1 |
- |
wn_system_network |
system_net |
1 |
- |
wn_system_os |
system_os |
1 |
- |
Desired Status
This table shows whether a test should be deployed and how it should be scheduled.
Please edit
name |
status |
probability |
interval |
note |
wn_grid_afs |
- |
- |
- |
necessary? |
wn_grid_gfal |
Prod |
0.1 |
1 day |
|
wn_grid_glite |
Prod |
0.1 |
1 day |
|
wn_grid_infra |
Prod |
1 |
- |
|
wn_grid_job |
Prod |
1 |
- |
|
wn_grid_vo |
Prod |
1 |
- |
|
wn_grid_wn_info |
Prod |
0.1 |
1 day |
|
wn_software_java |
Prod |
0.1 |
1 day |
|
wn_software_perl |
Prod |
0.1 |
1 day |
|
wn_software_python |
Prod |
0.1 |
1 day |
|
wn_software_python_openssl |
- |
- |
- |
unnecessary |
wn_system_cpu |
Prod |
0.1 |
1 day |
|
wn_system_env |
- |
- |
- |
too sensible information |
wn_system_memory |
Prod |
0.1 |
1 day |
|
wn_system_network |
Prod |
0.1 |
1 day |
|
wn_system_os |
Prod |
0.1 |
1 day |
|
wn_system_timeoffset |
Prod |
0.1 |
1 day |
|
Documentation
Slides:
Developer Guides:
Comments