SAM to Nagios migration of sensors/tests - CMS
The page concerns migration of CMS sensors and tests that are making use of SAM submission/execution framework.
Goals
- migrate all tests in SAM CMS CE and SRMv2 sensors to Nagios based monitoring framework
- make tests Nagios compliant
- using special wrappers from org.sam framework
- rewriting tests from scratch
- integrate migrated tests into the Nagios based monitoring framework
- Result:
- all CMS SAM tests using SAM submission framework migrated
- RPM with the tests released and put into egee-SA1 repository
- migrated tests integrated into new Nagios based monitoring framework
Planning
Two sensors to be migrated:
- CE. Two jobs are submitted with different proxies (Role=lcgadmin, Role=production)
-
CE-sft-job
- Role=lcgadmin; 6 tests on WN; no dependency between tests on WN
-
CE-cms-prod
- Role=production; 1 test on WN
- SRM (SRMv2). Seven tests run with different proxies (Role=lcgadmin, Role=production)
- six tests - Role=production; five tests depend on result of
SRMv2-get-pfn-from-tfc
-
SRMv2-user
- Role=lcgadmin
During integration account for test->metric name changes. Eg.:
SAM test |
Nagios check |
CE |
CE-sft-job |
org.sam.CE-JobState-cms-lcgadmin (UI/Nagios) |
CE-cms-analysis |
org.cms.WN-analysis (UI/Nagios/WN) |
CE-cms-basic |
org.cms.WN-basic (UI/Nagios/WN) |
CE-cms-frontier |
org.cms.WN-frontier (UI/Nagios/WN) |
CE-cms-squid |
org.cms.WN-squid (UI/Nagios/WN) |
CE-cms-swinst |
org.cms.WN-swinst (UI/Nagios/WN) |
CE-sft-wn |
org.cms.WN-wn (UI/Nagios/WN) |
|
CE-cms-prod |
org.sam.CE-JobState-cms-production (UI/Nagios) |
CE-cms-mc |
org.sam.WN-mc-cms (UI/Nagios/WN) |
SRM |
SRMv2-get-pfn-from-tfc |
org.cms.SRMv2-GetPfnFromTfc-cms |
SRMv2-lcg-cp |
org.cms.SRMv2-Cp-cms |
SRMv2-lcg-gt |
org.cms.SRMv2-Gt-cms |
SRMv2-lcg-gt-rm-gt |
org.cms.SRMv2-GtRmGt-cms |
SRMv2-lcg-ls |
org.cms.SRMv2-Ls-cms |
SRMv2-lcg-ls-dir |
org.cms.SRMv2-LsDir-cms |
SRMv2-user |
org.cms.SRMv2-User-cms |
Packaging
RPM root
/usr/libexec/grid-monitoring/probes/org.cms
~ tree /usr/libexec/grid-monitoring/probes/org.cms/wnjob
/usr/libexec/grid-monitoring/probes/org.cms/wnjob
|-- etc
| `-- wn.d
| `-- org.cms
| |-- commands.cfg
| `-- services.cfg
`-- probes
`-- org.cms
|-- check_A
|-- check_B
`-- checks_lib.sh
Plan
P.ID |
Name |
Notes |
Result |
1 |
migration of CE tests for Role=lcgadmin |
try using org.sam/samtest-run wrapper |
tests submitted with org.sam/CE-probe and produce Nagios compliant output; results come from MB; part of RPM in egee-SA1 repo |
2 |
migration of CE tests for Role=production |
try using org.sam/samtest-run wrapper |
test submitted with org.sam/CE-probe; result comes from MB; part of RPM in egee-SA1 repo |
3 |
integration of PI1 with Nagios |
management of proxy with Role=lcgadmin on Nagios box |
tests run with Role=lcgadmin under Nagios |
4 |
integration of PI2 with Nagios |
management of proxy with Role=production on Nagios box |
test runs with Role=production under Nagios |
5 |
migration of SRM tests for Role=production |
try using org.sam/samtest-run wrapper; NB! there is a dependency between the tests |
tests are submitted from command line against SRMs and produce Nagios compliant output; dependency between tests is taken into account for future integration of the test into Nagios; part of RPM in egee-SA1 repo |
6 |
migrate SRM test for Role=lcgadmin |
try using org.sam/samtest-run wrapper |
test is submitted from command line against SRMs and produces Nagios compliant output; part of RPM in egee-SA1 repo |
7 |
integration of PI5 with Nagios |
management of proxy with Role=production on Nagios box |
tests run with Role=production under Nagios |
8 |
integration of PI6 with Nagios |
management of proxy with Role=lcgadmin on Nagios box |
test runs with Role=lcgadmin under Nagios |
Milestones
Milestone |
Date |
Result |
M1 |
middle of Oct'09 |
all tests migrated and the first release of RPM is made |
M2 |
TBD |
migrated tests integrated into new Nagios based monitoring framework |
Progress
Planned |
Ongoing |
Done |
PI1 |
CE-cms-analysis |
- |
- |
|
CE-cms-basic |
CE-cms-dummy |
CE-cms-frontier |
CE-cms-squid |
CE-cms-swinst |
CE-sft-wn |
PI2 |
CE-cms-mc |
PI5 |
PI6 |
PI3 |
PI4 |
PI7 |
PI8 |
--
KonstantinSkaburskas - 2009-09-11
Topic revision: r4 - 2010-03-02
- unknown