SAM to Nagios migration of sensors/tests - CMS

The page concerns migration of CMS sensors and tests that are making use of SAM submission/execution framework.

Goals

  • migrate all tests in SAM CMS CE and SRMv2 sensors to Nagios based monitoring framework
    • make tests Nagios compliant
      • using special wrappers from org.sam framework
      • rewriting tests from scratch
  • integrate migrated tests into the Nagios based monitoring framework

  • Result:
    • all CMS SAM tests using SAM submission framework migrated
    • RPM with the tests released and put into egee-SA1 repository
    • migrated tests integrated into new Nagios based monitoring framework

Planning

Two sensors to be migrated:

  • CE. Two jobs are submitted with different proxies (Role=lcgadmin, Role=production)
    • CE-sft-job - Role=lcgadmin; 6 tests on WN; no dependency between tests on WN
    • CE-cms-prod - Role=production; 1 test on WN
  • SRM (SRMv2). Seven tests run with different proxies (Role=lcgadmin, Role=production)
    • six tests - Role=production; five tests depend on result of SRMv2-get-pfn-from-tfc
    • SRMv2-user - Role=lcgadmin

During integration account for test->metric name changes. Eg.:

SAM test Nagios check
CE
CE-sft-job org.sam.CE-JobState-cms-lcgadmin (UI/Nagios)
CE-cms-analysis org.cms.WN-analysis (UI/Nagios/WN)
CE-cms-basic org.cms.WN-basic (UI/Nagios/WN)
CE-cms-frontier org.cms.WN-frontier (UI/Nagios/WN)
CE-cms-squid org.cms.WN-squid (UI/Nagios/WN)
CE-cms-swinst org.cms.WN-swinst (UI/Nagios/WN)
CE-sft-wn org.cms.WN-wn (UI/Nagios/WN)
CE-cms-prod org.sam.CE-JobState-cms-production (UI/Nagios)
CE-cms-mc org.sam.WN-mc-cms (UI/Nagios/WN)
SRM
SRMv2-get-pfn-from-tfc org.cms.SRMv2-GetPfnFromTfc-cms
SRMv2-lcg-cp org.cms.SRMv2-Cp-cms
SRMv2-lcg-gt org.cms.SRMv2-Gt-cms
SRMv2-lcg-gt-rm-gt org.cms.SRMv2-GtRmGt-cms
SRMv2-lcg-ls org.cms.SRMv2-Ls-cms
SRMv2-lcg-ls-dir org.cms.SRMv2-LsDir-cms
SRMv2-user org.cms.SRMv2-User-cms

Packaging

RPM root /usr/libexec/grid-monitoring/probes/org.cms

   ~ tree /usr/libexec/grid-monitoring/probes/org.cms/wnjob
   /usr/libexec/grid-monitoring/probes/org.cms/wnjob
   |-- etc
   |   `-- wn.d
   |       `-- org.cms
   |           |-- commands.cfg
   |           `-- services.cfg
   `-- probes
       `-- org.cms
           |-- check_A
           |-- check_B
           `-- checks_lib.sh

Plan

P.ID Name Notes Result
1 migration of CE tests for Role=lcgadmin try using org.sam/samtest-run wrapper tests submitted with org.sam/CE-probe and produce Nagios compliant output; results come from MB; part of RPM in egee-SA1 repo
2 migration of CE tests for Role=production try using org.sam/samtest-run wrapper test submitted with org.sam/CE-probe; result comes from MB; part of RPM in egee-SA1 repo
3 integration of PI1 with Nagios management of proxy with Role=lcgadmin on Nagios box tests run with Role=lcgadmin under Nagios
4 integration of PI2 with Nagios management of proxy with Role=production on Nagios box test runs with Role=production under Nagios
5 migration of SRM tests for Role=production try using org.sam/samtest-run wrapper; NB! there is a dependency between the tests tests are submitted from command line against SRMs and produce Nagios compliant output; dependency between tests is taken into account for future integration of the test into Nagios; part of RPM in egee-SA1 repo
6 migrate SRM test for Role=lcgadmin try using org.sam/samtest-run wrapper test is submitted from command line against SRMs and produces Nagios compliant output; part of RPM in egee-SA1 repo
7 integration of PI5 with Nagios management of proxy with Role=production on Nagios box tests run with Role=production under Nagios
8 integration of PI6 with Nagios management of proxy with Role=lcgadmin on Nagios box test runs with Role=lcgadmin under Nagios

Milestones

Milestone Date Result
M1 middle of Oct'09 all tests migrated and the first release of RPM is made
M2 TBD migrated tests integrated into new Nagios based monitoring framework

Progress

Planned Ongoing Done
PI1
CE-cms-analysis - -
CE-cms-basic
CE-cms-dummy
CE-cms-frontier
CE-cms-squid
CE-cms-swinst
CE-sft-wn
PI2
CE-cms-mc
PI5
PI6
PI3
PI4
PI7
PI8

-- KonstantinSkaburskas - 2009-09-11

Edit | Attach | Watch | Print version | History: r4 < r3 < r2 < r1 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r4 - 2010-03-02 - unknown
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    LCG All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright &© 2008-2023 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback