Apologies:
ROC Italy: we are unavailable today for the Operations Meeting, due to a telephone/switchboard failure.- Alessandro
[Could those dialing in with anonymous phone numbers please also connect with a web-browser and list the names of those in the room? - Ed.]
Reduced attendance:
Due to the EGI Workshop being held today at CERN, there was a lower than usual participation in today's Ops meeting.
Summary
Pre production testing of the CREAM CE is now well under way.
Requests were made for comments on the proxy renew deamon crashing with the recently released VOMS servers.
IN2P3 are investigating 30,000 atlas jobs hitting their site.
A report was given on base storage services needed at sites. GSSDCCRCBaseVersions.
ATLAS requested gstat should flag up discrepancies between LFCs in GOCDB but not in BDII.
Feedback on Last Week's Minutes
None was given.
EGEE Items
Grid Operator Hand Over on Duty
Primary Team
Secondary Team
From
ROC CE
ROC UK/I
To
ROC SWE
ROC AP
Nit when collecting weekly reports: CIC portal uses the security certificate of a different site. Cyril will follow-up.
CIC portal uses the security certificate of a different site. Cyril will follow-up. John will submit a GGUS ticket. Update: GGUS:38050 Problem is that Firefox doesn't recognize the French Certificate Authority. Solution is simply to define an exception in Firefox.
Release of gLite3.1 PPS Update31 to PPS in preparation (contains glexec installation/configuration + rpath stripping)
gLite3.0 Update50 in advanced phase of pre-deployment testing, contains a fix to glite-FTA
EGEE Items From ROC Reports
In response to a question from CE ROC: LCG_GFAL_INFOSYS - is not part of official release. SAM does not intend to use this feature. The question about alarms during SE scheduled downtimes is an issue with the presentation layer - in this case, the COD dashboard.
DESY wanted to know the procedure to follow when users use site resources in a denial-of-service manner. ROCs should follow this up with their respective sites.
NE ROC were inquiring about GGUS:37334. Steve will follow-up.
Steve to look at GGUS:37334 and escalate to someone. 2nd July - This may be resolved by a fix to BUG:37008, waiting for clarification. 8th July - Mentioned in the EMT yesterday, the fix is in an upcoming patch and also a bug will be submitted to link to it. Add bug before next week and close. 14th July - Bug now submitted BUG:38820 . As I understand it this already fixed in an upcoming release. Close the action here after today's meeting since the BUG is now present.
SE ROC (Antun) seeking to validate running accounting for multiple sites on a single monbox. Clemens said that Italian ROCs do it (but they use DGAS). No one was able to help, but Steve reckoned that, technically, it should be OK. Kostas will keep us posted.
WLCG Items
WLCG issues coming from ROC reports
Alessandro volunteered to investigate the 30'000 Atlas jobs at IN2P3 problem, but Cyril said that it was probably an internal problem with IN2P3's batch system.
Upcoming WLCG Service Interventions
Steve reported that the VOMS service "went on a migratiion & software upgrade today". The old VOMS service will be decommissioned on Wednesday.
ATLAS Service (Alessandro)
Nothing special to report. Question for CNAF: Do they have any news? Power-cut, floor collapsing? According to Roberto, CNAF is not yet back. Steve stated that it was for the COD to decide how they should submit tickets to Italy.
ALICE Service
CMS Service
LHCb Service
gsidcap file access issue at IN2P3 has been resolved. The s/w fix (1.8.0-15p8 out next week) will need to be rolled out ASAP.
No SRMv1 pools configured at SARA, which is a problem for LHCb. Ron said that he'd attend to the problem immediately after the meeting.
WLCG Service Coordination
Flavia gave the storage service status weekly report. dCache patch 8 is recommended (7 shouldn't be installed). For Castor, the latest patch is 2.1.7-10 which is about to be released. Tier-1s are recommended to upgrade around the 15th of July.
The base versions of services and client software is maintained GSSDCCRCBaseVersions. The link is now included as an integral part of the agenda pages for these meetings.
Suggestion to use EVO for these meetings was squashed. However, if the CERN callback problems persist, the decision could be reversed.
Kostas complained that there had been no progress with GGUS:37890. UK/I ROC should have a look.
Assigned to
Due date
Description
State
Closed
Notify
Main.UKRoc
2008-07-10
UK/I ROC to look at GGUS:37890. *14th July 2008* Jeremy will take a look. *21st July 2008* BUG:38320 which was the related item is now fixed and closed. This item is, consequently, also closed.
ATLAS mentioned that the LFC from CNAF has disappeared from the infosystem since February. It is hard for experiments to keep track of these things. Steve explained that GSTAT normally checks for discrepancies between GOCDB & BDII. He will submit a ticket to get them to include LFCs in their tests.
Steve should submit a GGUS requesting that gstat monitors for LFCs not publishing as compared to GOCDB. *2nd July* GGUS:38053 now submitted, leave action item until a response is given.