WLCG-OSG-EGEE Ops' Minutes Mon 06 Apr 2009

Summary

The WLCG OPS meeting will not have a WLCG section anymore. ATLAS requested for the RSS feed to be updated if a downtime is modified, and the modification itself logged. gLite releases are on schedule. There are now reference cards for gLite services.

Attendance

EGEE

  • Asia Pacific ROC: Absent
  • Central Europe ROC: Lukasz Flis
  • OCC / CERN ROC: John Shade, Antonio Retico, Nick Thackray, Diana Bosio
  • French ROC: Pierre Girard, Rolf Rumler, Helene Cordier
  • German/Swiss ROC: Wen Mei
  • Italian ROC: Absent
  • Northern Europe ROC: Ron Trompert
  • Russian ROC: Lev Shamardin
  • South East Europe ROC: Kostas Koumantaros
  • South West Europe ROC: Kai Neuffer
  • UK/Ireland ROC: Jeremy Coles, Kashif Mohammad
  • GGUS: Torsten Antoni
  • GOCDB: Absent
  • OSG: ROb Quick
  • SA3: Lorenzo Sbolgi

WLCG

  • WLCG Service Coordination: Harry Renshall

WLCG Tier 1 Sites

  • ASGC: Absent
  • BNL: Absent
  • CERN site: Jan Iven
  • FNAL: Catalin Dumitrescu
  • FZK: Angela Poschlad
  • IN2P3: Pierre Girard
  • INFN: Absent
  • NDGF: Jens Larsson
  • PIC: Absent
  • RAL: Gareth Smith
  • SARA/NIKHEF: Ron Trompert
  • TRIUMF: Absent

LHC Experiments

  • ATLAS: Alessandro di Girolamo
  • LHCb: absent
  • CMS: absent
  • ALICE: absent

Feedback on Last Week's Minutes

None was given.

EGEE Items

Grid Operator Hand Over on Duty

  Primary Team Secondary Team
From ROC UKI ROC Russia
To ROC ??? ROC ???

  • There was an issue with the CIC portal, so the handover was given verbally by Helene and ROC UKI.

  • Report from UK/I: A quiet week.

Sites Considered For Suspension

* Report from Helène on the ROC Russia handover log: a very old ticket, concerning a site under ROC UKI: GGUS 46692. There had been no activity on the ticket since 26/2. As the site administrators answered on 2/4, the ticket was de-escalated.

PPS Reports and Issues

Highlights:

  • Recent check point of the pilot service on CREAM. Second phase is closed. A third phase is now open, consisting of two branches, one focused on user and testing of ICE (i.e controlled roll out of ICE in production), and the second is testing with SA3 to try and test as many possible criteria for the transition from LCG CE to CREAM. Detailed plan will be made available as soon as possible.

gLite Release News

  • Today UPDATE 43 is being realeased to production as scheduled. It includes a new version of YAIM clients to enabled service discovery, the removal of an obsolete dependency. A new info plugin. Roll out of patch 2652 on VOMS, already released 2 weeks ago.

  • UPDATE 44 is scheduled for April 13.

EGEE Items From ROC Reports

  • The CIC portal page is not accessible. So oral reports from the different ROCs.
  • SWE: request by LIP, they ask why there are such a low number of jobs for CMS, also ATLAS seems to send only the SAM tests.

ANSWER: Probably the experiment activities are low. Maybe the LIP administrators can join the WLCG daily meeting at 15:00 and ask the experiments directly.

  • ROC DECH: SCAI site: problem with some big jobs submitted by the user crashed the WN. Maybe it was a big job submitted by ATLAS. The WN needed to be reinstalled.

ANSWER: There are ATLAS jobs with memory requirements, more details on the problem are needed. Please ask the site to open a GGUS ticket.

Grid Service Interventions

ALL TIMES IN UTC+2

  • SARA: OUTAGE: From 08:00 9 April to 18:00 9 April. Services: vobox-alice.grid.sara.nl , ce.gina.sara.nl , creamce.gina.sara.nl
  • RAL: OUTAGE: From 12:00 8 April to 18:00 16 April. Services: lcgwms02.gridpp.rl.ac.uk
  • RAL: At Risk: From 11:00 7 April to 12:00 7 April. Services: lcgfts.gridpp.rl.ac.uk
  • FNAL: OUTAGE: From 12:00 7 April to 02:00 8 April. Services: Intermittent outages on all services. See http://www.uscms.org/SoftwareComputing/UserComputing/Downtimes/UAF_downtimes.html
  • PIC: OUTAGE: From 15:00 6 April to 12:00 8 April. Services: ENTIRE SITE.
  • INFN: Should be coming out of long scheduled downtime at 17:00 today.

  • on the agenda page one can consult links to CIC Portal (broadcasts/news), scheduled downtimes (GOCDB) and CERN IT Status Board

  • ATLAS: if a downtime is extended, but the associated RSS feed is not updated. This is causing a problem.

Nick Thackray: there is some work in implementing downtime extension as a new downtime.

ATLAS: maybe it would also be useful to keep the info that the dowtime has been modified.

The issue will be followed in an action.

REMINDER: Retirement of gLite 3.0

  • As previously announced, it is planned that all remaining gLite 3.0 services will be retired by the end of April. At this point, all support for these services will cease. All sites should ensure that they are running up-to-date versions of their services. If any site sees a need to keep a gLite 3.0 service in the middleware stack, please submit a GGUS ticket as soon as possible.

If there are issues, let the us know.

Pierre: a site is using gLite 3.0, the only concern is time. The site will update in May.

Nick: It is only the support that is dropped, there will not be any bug fixes or answer to any ticket will be "please update your site".

gLite service Reference Cards

  • A twiki has been created
https://twiki.cern.ch/twiki/bin/view/EGEE/ServiceReferenceCards

These collect useful information, such as daemons running, configuration information. The twiki documentation is complete. There is also a new section on OSCT security on how to ban a user or how to configure a firewall.

This is not a complete documentation for each service, but only a quick reference card. Links to more in depth documentation are provided in the reference cards themselves.

Questions should be directed to the SA3 mailing list project-eu-egee-sa3@cernNOSPAMPLEASE.ch

OSG Items

  • Reminder to report on progress on GOCdb-to-OIM for OSG sites in savannah support 107531
  • Discussion of open tickets for OSG The following tickets re-appear in the escalation report so the same comments are still valid.
    • Exactly this was discussed for the last 3 weeks and Rob had an action to check. GGUS #46647: The ticket is now assigned to Rob. The action required is in the 2009-03-24 by MariaDZ.
    • Comment on 2009-03-30 in stalled urgent ATLAS ticket since 2009-03-09 GGUS #46988:
            Tim and other OSG colleagues,
            my understanding from https://savannah.cern.ch/support/index.php?107511#comment3 is that
            had you chosen status 'customer' in OIM, 
            the ggus ticket would have gone to status 'waiting for reply' and the submitter would have 
            been prompted to react. Please do so now.
            yours
            maria
Comments #9 and #3 in theGGUS savannah item 107511 explain how to handle the OIM-GGUS field mapping for the above and other tickets.

Newly Created Action Items

Assigned to Due date Description State Closed Notify  
Main.OCC 2009-04-27 Check with the GOCDB if the RSS feed is updated when the downtime is modified (extended or shortened)

Update 20/4/09: Nick to check with Gilles (but he thinks that the answer is no).

Update 27/4/09: An update has been given to Nick by Gilles, this will be added here.

18/05/09: An RSS notification is sent by the Operations Portal whenever there is a change to a down-time (see minutes of today's meeting for more details).

2009-05-22 edit

Review of Open Action Items

Open Action Items

IdSubmitterDescriptionCreationDueAssigned To 

Actions Closed in Last 20 Days

IdSubmitterDescriptionCreationDueAssigned ToClosed 

AOB

Next Meeting

The next meeting will be Monday, 20 APR 2009 14:00 UTC (16:00 Swiss local time).

  • Attendees can join from 14:45 UTC (15:45 Swiss local time) onwards.
  • The meeting will start promptly at 15:00 UTC (16:00 Swiss local time).
  • The WLCG section will start at the fixed time of 15:30 UTC (16:30 Swiss local time).
  • To dial in to the conference:
    • Dial +41227676000
    • Enter access code 0148141


These minutes can only be changed by members of:

Edit | Attach | Watch | Print version | History: r5 < r4 < r3 < r2 < r1 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r5 - 2009-05-22 - NickThackray
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    EGEE All webs login

This site is powered by the TWiki collaboration platform Powered by Perl This site is powered by the TWiki collaboration platformCopyright &© by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Ask a support question or Send feedback