LCG Management Board

Date/Time

Tuesday  2 March 2010 16:00-17:00 – Phone Meeting   

Agenda

http://indico.cern.ch/conferenceDisplay.py?confId=81979

Members

http://cern.ch/LCG/mb.htm

 

(Version 1 – 04.03.2010)

Participants

J. Bakken, I.Bird (chair),  D. Bonacorsi, K.Bos, D.Britton,  T. Cass, M. Cattaneo, L. Dell’Agnello, D. Duellmann, M. Ernst, I.Fisk, Qin Gang, J. Gordon, D. Heagerty (notes), F.Hernandez, H. Meinhard, G.Merino, A.Pace, R. Pordes, H. Renshall, M.Schulz, Y. Schutz, R. Tafirout, J. Templon

Invited

David Collados

Action List

https://twiki.cern.ch/twiki/bin/view/LCG/MbActionList

Mailing List Archive

https://groups.cern.ch/group/worldwide-lcg-management-board/default.aspx

Next Meeting

Tuesday 23 March 2010 16:00-18:00 – F2F Meeting (NIKHEF, Amsterdam)

1           Minutes and Matters arising (Minutes)

 

1.1         Minutes from the last meeting

The minutes of the last meeting were approved.

1.2         Statement on support for Castor and DPM

D. Duellmann and M. Schulz clarified that support will continue for CASTOR and DPM.

 

2          Action List Review (List of actions)

 

·         OPN Mandate: Experiments and some Sites should provide names for working with the OPN on the needs and actions needed on Tier-1 to Tier-2 links. Names are requested prior to the OPN meeting on 8 March (http://indico.cern.ch/conferenceDisplay.py?confId=80755).

The action remains open.

 

3          GDB Summary
 

 

J. Gordon summarized the GDB meeting which took place at CERN on 10 February 2010. His slides are linked from the agenda page. He requested feedback on topics for the next GDB. Attendees travelling to the next GDB meeting at NIKHEF, Amsterdam are asked to register on the Indico meeting page (http://indico.cern.ch/conferenceDisplay.py?confId=84636).

 

J. Gordon and I. Bird will decide whether a review of the suspension of the security policy related to glexec will take place at the next MB or GDB.

 

 

4          LCG Operations Weekly Report

 

Summary of status and progress of the LCG Operations. This report covers the WLCG activities since the last MB meeting. All daily meeting summaries are available here: https://twiki.cern.ch/twiki/bin/view/LCG/WLCGOperationsMeetings. Some comments to the slides linked from this meeting’s agenda are noted below.

 

  • Slide 15: the problems with xrootd are still under investigation.
  • Slide 8: The Chimera upgrade at TRIUMF took less time than expected but a bug prevented this being reported correctly.

 

 

5          Proposal/timelines for migrating the WLCG grid monitoring infrastructure from SAM to Nagios

 

The status and plans for migrating the Grid monitoring infrastructure from SAM to Nagios are described in the slides linked from the agenda page. EGEE operations are now using Nagios to follow up alarms. Official availability reports for March will be provided using SAM probe results. The March availability reports will also be generated based on Nagios results. A report will need to be provided in April to the MB analyzing the March results as a basis for deciding the sign-off on Nagios reports.

 

  • It was clarified that although sites can run their own Nagios tests, the ROCs decide which results are used in the WLCG availability reports.
  • It is planned that Nagios VO availability tests will be run from CERN (managed by the VO representative).

 

After the meeting, D. Collados sent the following information and links:

 

The Nagios infrastructure provides equivalent functional tests to the ones executed by SAM. We also provide visualization of each test executed through the Nagios and MyEGEE interfaces, where you can check at any time the status of your services and the full logging output of each execution. You can see the list of ROC Nagioses deployed in this page:

https://twiki.cern.ch/twiki/bin/view/EGEE/NagiosROCURL

which currently tests all the EGEE infrastructure. For each Nagios instance there is an equivalent MyEGEE portal. For instance, for the Central Europe Nagios:

https://sam-ce-roc.cern.ch/nagios/

you have the MyEGEE portal:

https://sam-ce-roc.cern.ch/myegee/

 

Each EGEE site can currently check the status of their services through those interfaces or can setup their site Nagios instance to only visualize the result of the services monitored in their site. So this is in fact something that sites should start doing now.

 

 

6          Requirements for 2010 & 2011

 

 

I. Bird presented a slide, attached to the agenda, on requirements for 2010, 2011 and 2012. The numbers correspond to those agreed at the LHCC Meeting (17-18 Feb) with an alignment to the RRB years. The breakdown on the slide was provisionally agreed.

 

 

7          AOB

 

 

7.1         OB Meeting agenda - 19th March

I. Bird requested comments to be sent to him on the draft agenda for the Overview Board meeting on 19 March 2010. The draft is attached to this meeting’s agenda page.

7.2         Next meeting: 23 March 2010, NIKHEF, Amsterdam 

The next MB meeting will be held at NIKHEF in Amsterdam. Travel information has been linked from the meeting agenda page. Those who will attend locally in Amsterdam are requested to register on the meeting agenda page to help the local organizers.

 

 

8          Summary of New Actions

 

There were no new actions.