The WLCG OPS meeting will not have a WLCG section anymore. ATLAS requested for the RSS feed to be updated if a downtime is modified, and the modification itself logged. gLite releases are on schedule. There are now reference cards
for gLite services.
Attendance
EGEE
Asia Pacific ROC: Absent
Central Europe ROC: Lukasz Flis
OCC / CERN ROC: John Shade, Antonio Retico, Nick Thackray, Diana Bosio
French ROC: Pierre Girard, Rolf Rumler, Helene Cordier
There was an issue with the CIC portal, so the handover was given verbally by Helene and ROC UKI.
Report from UK/I: A quiet week.
Sites Considered For Suspension
* Report from Helène on the ROC Russia handover log: a very old ticket, concerning a site under ROC UKI: GGUS 46692. There had been no activity on the ticket since 26/2. As the site administrators answered on 2/4, the ticket was de-escalated.
Recent check point of the pilot service on CREAM. Second phase is closed. A third phase is now open, consisting of two branches, one focused on user and testing of ICE (i.e controlled roll out of ICE in production), and the second is testing with SA3 to try and test as many possible criteria for the transition from LCG CE to CREAM. Detailed plan will be made available as soon as possible.
Today UPDATE 43 is being realeased to production as scheduled. It includes a new version of YAIM clients to enabled service discovery, the removal of an obsolete dependency. A new info plugin. Roll out of patch 2652 on VOMS, already released 2 weeks ago.
UPDATE 44 is scheduled for April 13.
EGEE Items From ROC Reports
The CIC portal page is not accessible. So oral reports from the different ROCs.
SWE: request by LIP, they ask why there are such a low number of jobs for CMS, also ATLAS seems to send only the SAM tests.
ANSWER: Probably the experiment activities are low. Maybe the LIP administrators can join the WLCG daily meeting at 15:00 and ask the experiments directly.
ROC DECH: SCAI site: problem with some big jobs submitted by the user crashed the WN. Maybe it was a big job submitted by ATLAS. The WN needed to be reinstalled.
ANSWER: There are ATLAS jobs with memory requirements, more details on the problem are needed.
Please ask the site to open a GGUS ticket.
Grid Service Interventions
ALL TIMES IN UTC+2
SARA: OUTAGE: From 08:00 9 April to 18:00 9 April. Services: vobox-alice.grid.sara.nl , ce.gina.sara.nl , creamce.gina.sara.nl
RAL: OUTAGE: From 12:00 8 April to 18:00 16 April. Services: lcgwms02.gridpp.rl.ac.uk
RAL: At Risk: From 11:00 7 April to 12:00 7 April. Services: lcgfts.gridpp.rl.ac.uk
PIC: OUTAGE: From 15:00 6 April to 12:00 8 April. Services: ENTIRE SITE.
INFN: Should be coming out of long scheduled downtime at 17:00 today.
on the agenda page one can consult links to CIC Portal (broadcasts/news), scheduled downtimes (GOCDB) and CERN IT Status Board
ATLAS: if a downtime is extended, but the associated RSS feed is not updated. This is causing a problem.
Nick Thackray: there is some work in implementing downtime extension as a new downtime.
ATLAS: maybe it would also be useful to keep the info that the dowtime has been modified.
The issue will be followed in an action.
REMINDER: Retirement of gLite 3.0
As previously announced, it is planned that all remaining gLite 3.0 services will be retired by the end of April. At this point, all support for these services will cease. All sites should ensure that they are running up-to-date versions of their services. If any site sees a need to keep a gLite 3.0 service in the middleware stack, please submit a GGUS ticket as soon as possible.
If there are issues, let the us know.
Pierre: a site is using gLite 3.0, the only concern is time. The site will update in May.
Nick: It is only the support that is dropped, there will not be any bug fixes or answer to any ticket will be "please update your site".
gLite service Reference Cards
A twiki has been created
https://twiki.cern.ch/twiki/bin/view/EGEE/ServiceReferenceCards
These collect useful information, such as daemons running, configuration information. The twiki documentation is complete. There is also a new section on OSCT security on how to ban a user or how to configure a firewall.
This is not a complete documentation for each service, but only a quick reference card. Links to more in depth documentation are provided in the reference cards themselves.
Questions should be directed to the SA3 mailing list project-eu-egee-sa3@cernNOSPAMPLEASE.ch
OSG Items
Reminder to report on progress on GOCdb-to-OIM for OSG sites in savannah support 107531
Discussion of open tickets for OSG The following tickets re-appear in the escalation report so the same comments are still valid.
Exactly this was discussed for the last 3 weeks and Rob had an action to check. GGUS #46647: The ticket is now assigned to Rob. The action required is in the 2009-03-24 by MariaDZ.
Comment on 2009-03-30 in stalled urgent ATLAS ticket since 2009-03-09 GGUS #46988:
Tim and other OSG colleagues,
my understanding from https://savannah.cern.ch/support/index.php?107511#comment3 is that
had you chosen status 'customer' in OIM,
the ggus ticket would have gone to status 'waiting for reply' and the submitter would have
been prompted to react. Please do so now.
yours
maria
Comments #9 and #3 in theGGUS savannah item 107511 explain how to handle the OIM-GGUS field mapping for the above and other tickets.
Newly Created Action Items
Assigned to
Due date
Description
State
Closed
Notify
Main.OCC
2009-04-27
Check with the GOCDB if the RSS feed is updated when the downtime is modified (extended or shortened) Update 20/4/09: Nick to check with Gilles (but he thinks that the answer is no). Update 27/4/09: An update has been given to Nick by Gilles, this will be added here. 18/05/09: An RSS notification is sent by the Operations Portal whenever there is a change to a down-time (see minutes of today's meeting for more details).