There are six active instances of SSB:
- CMS (dashb-ssb)
- ATLAS (dashb-atlas-ssb)
- ALICE (dashb-alice-ssb)
- LHCb (dashb-lhcb-ssb)
- SAM3 (wlcg-mon)
- SAM3 preproduction (wlcg-mon-dev)
Four of them (CMS, ATLAS, SAM3, and SAM3 preproduction) are configured to send alarms if the metrics are too old. The messages look like:
X SSB metrics for X are not being updated !
An alarm can be caused by nothing more than a short delay in consuming the messages, so a single alarm does not necessarily indicate a problem.
If you get these messages:
- For SAM3 preproduction, ignore them
- For any of the others:
   - If you get the alarm for more than 10 metrics more than 5 times in a row, send an SMS to Pablo and Marian
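
The escalation rule above can be sketched as a small check. This is only an illustration, not part of the SSB tooling. It assumes that the first X in the message is the number of stale metrics and the second X is the instance name, and that the preproduction instance appears in the message as wlcg-mon-dev. The names parse_alarm and should_escalate are hypothetical. The check only looks at the alarms received, oldest first, for one instance.

```python
import re

# Assumed layout: "<count> SSB metrics for <instance> are not being updated !"
ALARM_RE = re.compile(r"^(\d+) SSB metrics for (\S+) are not being updated\s*!$")

IGNORED_INSTANCES = {"wlcg-mon-dev"}  # assumed label for SAM3 preproduction
METRIC_THRESHOLD = 10  # "more than 10 metrics"
REPEAT_THRESHOLD = 5   # "more than 5 times in a row"


def parse_alarm(message):
    """Return (count, instance) for an alarm line, or None if it does not match."""
    match = ALARM_RE.match(message.strip())
    if match is None:
        return None
    return int(match.group(1)), match.group(2)


def should_escalate(messages):
    """Return True if the latest run of alarms calls for an SMS.

    `messages` holds the alarms received for one instance, oldest first.
    """
    streak = 0
    for message in messages:
        parsed = parse_alarm(message)
        if parsed is None:
            continue  # not an SSB alarm line, skip it
        count, instance = parsed
        if instance in IGNORED_INSTANCES:
            return False  # preproduction alarms are ignored
        # An alarm at or below the threshold breaks the run.
        streak = streak + 1 if count > METRIC_THRESHOLD else 0
    return streak > REPEAT_THRESHOLD
```

For example, six consecutive alarms reporting 12 metrics each would trigger the escalation, while five would not.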
--
PabloSaiz - 2014-12-10