WLCG-OSG-EGEE Ops' Minutes Mon 11 May 2009
Summary
Attendance
EGEE
- Central Europe ROC: Malgorzata Krakowian
- OCC / CERN ROC: John Shade, Diana Bosio, Nick Thackray, Steve Traylen
- French ROC: Pierre Girard
- German/Swiss ROC: Angela Poschlad, Wen Mei
- Northern Europe ROC: Gert Svensson
- South East Europe ROC: Marios Chatziangelou
- South West Europe ROC: Christian Neisser, Oscar Oliver
- UK/Ireland ROC: Derek Ross
- GGUS: Torsten Antoni
- c-COD: Vera Hasper
WLCG
- WLCG Service Coordination: Harry Renshall
WLCG Tier 1 Sites
- CERN site: Sophie Lemaitre
- FNAL: Catalin Dumitriescu
- FZK: Angela Poschlad
- IN2P3: Pierre Girard
- NDGF: Roger Oscarsson
- PIC: Christian Neisser
- RAL: Derek Ross, Gareth Smith
- SARA/NIKHEF: Ron Trompert
LHC Experiments
- ATLAS: absent
- LHCb: absent
- CMS: absent
- ALICE: absent
Feedback on Last Week's Minutes
None was given.
EGEE Items
Grid Operator Hand Over on Duty
| "Old style" COD Team | From | Germany/Switzerland (DECH) | To | Russia |
- Report from "old style" COD:
No unresponsive sites. Nothing to raise.
| c-COD Team | From | North Europe (NE) | To | Asia Pacific (AP) |
Vera: There are a number of ROC tickets that are well overdue. Also, please switch off alarms that are in OK state.
Sites Considered For Suspension
None.
PPS Reports and Issues
- Update 46 will be released to production soon.
- Replies from GRIF and ASGC were received concerning the SCAS installation. As a reminder, Tier1s and larger Tier2s are encouraged by the WLCG management board to deploy SCAS/gLExec for testing.
gLite Release News
- 2009-05-04: gLite 3.1 Update 45 was released to production. The update affects the client nodes (UI, WN and VOBOXes) and contains:
- New GFAL (1.11.4) and lcg_util 1.7.2. (PATCH:2785;PATCH:2783)
- Addition of glite-wn-info to return information about subcluster from a WN (PATCH:2757;PATCH:2758)
- New yaim core and yaim clients
- version 4.0.6 with many bug fixes.
- version 4.0.7 fixing some issues needed by GFAL and lcg_util.
- The two consecutive versions of YAIM are released at the same time in order to have a complete list of fixed issues, but sites will need to install only the latest one.
EGEE Items From ROC Reports
- IT-ROC: Most of the errors (lcg-cr test for CE) at Italian sites (last night until early this morning) were due to our top-bdii egee-bdii.cnaf.infn.it: one of the DNS servers configured on it was unreachable, so the BDII was emptied.
- SEE ROC: Some middleware components cannot cope with logs covering a 3-year period (which is a requirement) because of system limits; please see the corresponding ticket at https://gus.fzk.de/ws/ticket_info.php?ticket=48291.
- The requirement is that the logs are kept, but they can be archived; they do not have to remain on the server.
- Answer from Maria: the section on audit requirements of the JSPG document says that logs have to be kept.
- SWE ROC: 32bit binaries overwrite 64bit binaries for lcg-utils (installation of WNs).
- Answer from Oliver Keeble: the solution will be to split the libraries out into separate rpms (standard practice). We're waiting for this from the DM team. In the meantime, the workaround is to reinstall the 64bit binaries only (Andreas should be able to give you a link to where this is documented).
- SWE ROC: last update to lcg-utils broke the SAM tests.
- Answer from John Shade: It is a bug in lcg-utils, but as a workaround we will remove the time-out in SAM, which should eliminate the segmentation fault experienced by the tests.
WLCG Items
WLCG issues coming from ROC reports
Upcoming WLCG Service Interventions
- Consult links on the agenda page.
WLCG Service Coordination
Nothing to report.
ATLAS Service
ALICE Service
CMS Service
LHCb Service
- Maria asked for the status of the decision on adding the e-mail addresses to OIM.
- Discussion of open tickets for OSG
- ggus #44104. This ticket is waiting on the OSG GOC to roll out changes to their production BDII that will publish entries by their OSG resource group, not the OSG resource name. This will remove this issue before it gets to the BDII. Next action deadline in OIM is in Feb 2010. Should we close as unsolved to free the escalation reports?
- Rob will check as it might be fixed sooner.
- ggus #37059. Urgent ticket re-opened. Please have a look.
- ggus #47786. Site concerned is Nebraska. Urgent. Submitted 2009-04-08! Some OSG reminders remain unanswered by the site (?) The submitter arbitrarily decided no LHCb jobs should be submitted at the Nebraska site but this is not the opinion of the VO management. A generic queue to be used when resources are spare would be appreciated.
- Rob will check but the most likely solution is "Nebraska does not support LHCb".
Newly Created Action Items
Review of Open Action Items
Open Action Items
Id | Submitter | Description | Creation | Due | Assigned To | |
---|
Actions Closed in Last 20 Days
Id | Submitter | Description | Creation | Due | Assigned To | Closed | |
---|
AOB
- To the French participants connecting via the anonymous syp appearing as 0033...: it would be highly appreciated if you could join the web conference by clicking on the audioconf link (choosing 'web conference only') and write your names and roles explicitly. This will make life easier for the minute taker.
- Q: Ron Trompert for SARA: Estimate for a UI on SL5?
- A: there is no estimate so far. It will be reported at the next GDB, on May 13th.
Next Meeting
The next meeting will be Monday, 18 May 2009 14:00 UTC (16:00 Swiss local time).
- Attendees can join from 13:45 UTC (15:45 Swiss local time) onwards.
- The meeting will start promptly at 14:00 UTC (16:00 Swiss local time).
- To dial in to the conference:
- Dial +41227676000
- Enter access code 0148141