LCG Grid Deployment -
CERN ROC -
CERN ROC Procedures
CERN ROC template for e-mailing a site to notify them of their monitoring suspension
Dear Site Administrator(s),
we are contacting you as we are the responsible ROC for the site
SITE_NAME site with special reference to SAM History of your CE
CE_NAME
https://lcg-sam.cern.ch:8443/sam/sam.py?funct=ShowHistory&sensors=CE&vo=ops&nodename=CE_NAME
Your site has been kept into a long downtime during several months (the
last one finished on DATE), and during that period several
GGUS tickets have been addressed to you by this ROC and the monitoring
structure of the Grid Operators on Duty.
From the operations point of view we estimate that, until your site is not
again fully up and running, it is not necessary to keep the production
monitoring active on the site SITE_NAME.
Therefore we have decided to suspend, until further notice, the monitoring
of your site done by the Grid Operators on Duty.
This has been done by setting the site status in the GOC db to
"Suspended".
This also means that the information published in the site BDII is not
going to be visible in the top level BDIIs until the site goes through a
new certification.
As soon as we get notified that the problems at your sites have been
fixed, we would start the certification procedure for the site (one week
of observation through the SAM tests) and then the regular
production monitoring procedures will be re-activated.
If needed, our experts will be available to help you, at the best of
their knowledge, with the issues you might encounter during the
configuration of the site.
Please let us know should this solution be not satisfactory for you.
Thanks.
Best regards,
the Cern ROC team
-- Main.diana - 02 May 2006
Topic revision: r5 - 2007-11-01
- unknown