Procedure in case of problems with the DPM

Important :

Remember to call the operators (75011) before rebooting any machine, so that they (temporarily) ignore the alarms (and don't start a parallel intervention.)

Or send a mail to (remove the NONSPAM !).

Lemon monitoring

Note that the DPM daemons are monitored by Lemon : if any of the DPM daemons is down for any reason, it will be restarted automatically. After 3 unsuccessfull restarts, the operators will call the support phones.

You can check the list of alarms in: LemonAlarmList

Go through the DPMSmokeTest.

This includes the description of some kind of problem with the relative solution. It is not meant to be exhaustive and is under constant update. If you can solve the problem then:

  1. Log your action in the intervention log
  2. Email the Second Level Support mailing list :

General questions to ask yourself

  1. Try to think at general problems like network connections, firewalls, connection with database etc ... and look at the appropriate log files (both of the current service machine and the backup) for hints (/var/log/messages and /var/log/lfc/log are just an example)
  2. If you still are clueless then

-- SophieLemaitre - 21 Oct 2005

Edit | Attach | Watch | Print version | History: r2 < r1 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r2 - 2006-02-07 - PeterJones
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    LCG All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2023 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback