Worker Node Configuration Database

Message Format

The message format complies to the GridMonitoringProbeSpecification. Additional the key-value pair testName has always to be present:

testName:<name of the test>

Besides there are a couple of common key-value pairs. Most of them will be created automatically:

key auto description format example
testId x Unique id for each test 32 hex 7dd4984b03115b34f7993439055f622f
testVersion - Version of the test Major.Minor 1.0
hostname x Host name of the WN URI lxbra2108.cern.ch
hostid x Unique id for each host 32 hex d965877f592d3fdf85d43a28e9dc8380
pyamqId x A unique id for each execution of the client 32 hex 30ff8d0689a5b23b0024bcf9746ceeb7
pyamqEncrypt - States whether a test was/will be encrypted 0 or 1 1
pyamqSubtopic - Adds a subtopic to the default destination \w+(\.\w+)* site.MySite
executionFile - File name which was/will be executed file name system_os
executionArguments - Arguments which will be passed to the execution file arg1 arg2 ... arg1
executionCommand x The command which was executed to get the test results command /path/to/test/system_os arg1
executionStarttime x Start time of a test Timestamp 1234972037.6445301
executionEndtime x End time of a test Timestamp 1234972037.650563
executionTime x Difference between start and end time Float 0.0060329437255859375
summary - A short summary of the test used by the servers WebAdmin Short String ok

The key detailsdata is a special key. It must, if it exists, be the final line in the message. It usually contains log messages of the test, e.g.:

detailsdata:java version could not be gathered
java not found, try JAVA_HOME: /usr/java/jdk1.5.0_16
/usr/java/jdk1.5.0_16/bin/java: No such file or directory
exitcode:127

Tests

Scheduling

  • Interval Scheduling
    • Run tests after a certain timeout is exeeded. (e.g 1 day)
  • Probability Scheduling
    • Run tests randomly with a certain probability. (e.g 0.2 means 20%)

Test Descriptions

This is a list of all tests, their type and a short description what they are doing. (besides internal tests)

name type description
wn_grid_afs Python Checks whether /afs/cern.ch/ is mounted.
wn_grid_dteamdir Python Checks availability of the DTEAM Software Directory
wn_grid_gfal Bash GFAL version
wn_grid_glite Bash gLite version
wn_grid_infra Python GlueSiteUniqueId, GlueCEUniqueId, JobManagerType, BatchQueue
wn_grid_job Python JobId and JobContact
wn_grid_vo Bash VoName, VomsCapabilities, VomsGroup, VomsRole of the user
wn_grid_wn_info Bash GlueCEUniqueClusterId (dedicated to glite-wn-info)
wn_software_java Bash Java version
wn_software_perl Bash Perl version
wn_software_python Python Python version
wn_software_python_openssl Python Checks whether the python module OpenSSL is installed
wn_system_cpu Bash ProcessesTotal, ProcessesRunning, ProccesorCount, ProcessorCores, ProcessorLoad1, ProcessorLoad5, ProcessorLoad15, ProcessorClockSpeed, ProcessorVendor
wn_system_env Python Retrieves the system environment by calling setenv
wn_system_memory Bash HostMainMemoryRAMFree, GlueHostMainMemoryRAMSize, HostMainMemoryVirtualFree, GlueHostMainMemoryVirtualSize in MB
wn_system_network Bash Ping to cern.ch in milliseconds
wn_system_os Bash GlueHostOperatingSystemName, GlueHostOperatingSystemRelease, GlueHostOperatingSystemVersion, GlueHostArchitecturePlatformType
wn_system_timeoffset Bash Time offset using ntpdate in seconds

Deployment Status

These are the test which are currently deployed in PPS:

name old alias probability interval encrypted
wn_grid_afs grid_sfs 1 - -
wn_grid_dteamdir grid_dteamdir 1 - -
wn_grid_glite grid_glite 1 - x
wn_grid_infra grid_infra 1 - x
wn_grid_job grid_job 1 - x
wn_grid_vo grid_vo 1 - x
wn_software_java version_java 0.2 - x
wn_software_perl version_perl 0.2 - x
wn_software_python version_python 1 - x
wn_software_python_openssl python_openssl 1 - -
wn_system_cpu system_cpu 1 - x
wn_system_env system_env 1 - x
wn_system_memory system_mem 1 - x
wn_system_network system_net 1 - x
wn_system_os system_os 1 - x

Desired Status

This table shows whether a test should be deployed and how it should be scheduled.

Please edit

name status probability interval encrypted note
wn_grid_afs Prod 0.1 n.a. -  
wn_grid_gfal Prod 0.1 1 day x  
wn_grid_glite Prod 0.1 1 day x  
wn_grid_infra Prod 1 - x  
wn_grid_job Prod 1 - x  
wn_grid_vo Prod 1 - x  
wn_grid_wn_info Prod 0.1 1 day x  
wn_software_java Prod 0.1 1 day x  
wn_software_perl Prod 0.1 1 day x  
wn_software_python Prod 0.1 n.a. x  
wn_software_python_openssl - - - - unnecessary
wn_system_cpu Prod 0.1 1 day x  
wn_system_env - - - - too sensible information
wn_system_memory Prod 0.1 1 day x  
wn_system_network Prod 0.1 1 day x  
wn_system_os Prod 0.1 1 day x  
wn_system_timeoffset Prod 0.1 1 day -  

Documentation

Links

Comments

Topic attachments
I Attachment History Action Size Date Who Comment
PDFpdf grid-cm-client-wn.pdf r1 manage 274.9 K 2009-03-10 - 16:37 ThomasLow GCM Client Worker Node Documentation
PDFpdf grid-cm-server.pdf r1 manage 168.3 K 2009-03-10 - 16:37 ThomasLow GCM Server and Database Documentation
PDFpdf grid-cm-user-guide.pdf r1 manage 131.8 K 2009-03-10 - 16:38 ThomasLow GCM User Guide
PDFpdf grid-cm-web.pdf r1 manage 139.6 K 2009-03-10 - 16:38 ThomasLow GCM Web Portal Documentation
Edit | Attach | Watch | Print version | History: r10 < r9 < r8 < r7 < r6 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r10 - 2009-04-01 - SteveTraylen
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    EGEE All webs login

This site is powered by the TWiki collaboration platform Powered by Perl This site is powered by the TWiki collaboration platformCopyright & by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Ask a support question or Send feedback