TWiki
>
EGEE Web
>
SA1
>
WlcgOsgEgeeOpsMeetingMinutes
>
WlcgOsgEgeeOpsMinutes2009x05x11
(2009-05-13,
DianaBosio
)
(raw view)
E
dit
A
ttach
P
DF
<!-- Page created from template WlcgOsgEgeeOpsMinutesTemplate Having created a new minutes page correct the following information. The index page will be updated automatically. Change the indico id and who ever was the chair. --> ---+!! WLCG-OSG-EGEE Ops' Minutes Mon 11 May 2009 <!-- This next block is the meta data for the meeting. Leave it in this simple format of Keyword: Value It is this that the table on the list of all minutes is constructed from. --> * Date: Mon 11 May 2009 * Agenda: [[http://indico.cern.ch/conferenceDisplay.py?confId=58811][58811]] * Chair: Main.JohnShade * Minutes: Main.DianaBosio * Minutes from previous meetings: WlcgOsgEgeeOpsMeetingMinutes * Distribution: grid-operations-meeting@cern.ch %TOC% <!-- Write a short summary of the meeting in between the following summary section tags for Alberto's WLCG news letter. Note the summary is included within the index page with <pre></pre> tags which is unfortunate. Consequently try to keep you lines to a sensible width to fit on the page. Also wikiwords will NOT be expanded to a hyperlink. --> ---++ Summary %STARTSECTION{name="summary" type="section"}% * We remind that Tier1s and bigger Tier2s are encouraged by the WLCG management board to deploy SCAS/gLExec for testing in a production environment, to be done through the pilot installation, contact egee-pps-pilot-scas@cernSPAMNOT.ch * There is an interesting post on the gridPP blog http://www.gridpp.rl.ac.uk/blog/2009/05/11/new-house-for-lfc-and-fts-backends/ concerning High Availability LFC. %ENDSECTION{name="summary" type="section"}% ---++ Attendance ---++++ EGEE * Central Europe ROC: Malgorzata Krakowian * OCC / CERN ROC: John Shade, Diana Bosio, Nick Thackray, Steve Traylen * French ROC: Pierre Girard * German/Swiss ROC: Angela Poschlad, Wen Mei * Northern Europe ROC: Gert Svensson * South East Europe ROC: Marios Chatziangelou * South West Europe ROC: Christian Neisser, Oscar Oliver * UK/Ireland ROC: Derek Ross * GGUS: Torsten Antoni * c-COD: Vera Hasper ---++++OSG * Rob Quick <!-- * OSCT: project-egee-osct@cern.ch --> ---++++ WLCG * WLCG Service Coordination: Harry Renshall ---++++ WLCG Tier 1 Sites * CERN site: Sophie Lemaitre * FNAL: Catalin Dumitriescu * FZK: Angela Poschlad * IN2P3: Pierre Girard * NDGF: Roger Oscarsson * PIC: Christian Neisser * RAL: Derek Ross, Gareth Smith * SARA/NIKHEF: Ron Trompert ---++++ LHC Experiments * ATLAS: absent * LHCb: absent * CMS: absent * ALICE: absent ---++ Feedback on Last Week's Minutes None was given. ---++ EGEE Items ---++++ Grid Operator Hand Over on Duty | | * "Old style" COD Team* | | *From* | Germany/Switzerland (DECH) | | *To* | Russia | * Report from "old style" COD: No unresponsive sites. Nothing to raise. | | *c-COD Team* | | *From* | North Europe (NE) | | *To* | Asia Pacific (AP) | * Report from cCOD: Vera: There are a number of ROC tickets that are well overdue. Also, please switch off alarms that are in OK state. ---+++++ Sites Considered For Suspension None. ---++++ PPS Reports and Issues * UPDATE 46 will be released soon to production. * Replies from GRIF and ASGC were received concerning the SCAS installation. We remind that Tier1s and bigger Tier2s are encouraged by the WLCG management board to deploy SCAS/gLExec for testing ---++++ gLite Release News * 2009-05-04: gLite 3.1 Update 45 was released to production. The update affects the client nodes (UI WN and VOBOXes) and it will contain: * New GFAL (1.11.4) and lcg_util 1.7.2. (PATCH:2785;PATCH:2783) * Addition of glite-wn-info to return information about subcluster from a WN (PATCH:2757;PATCH:2758) * New yaim core and yaim clients * version 4.0.6 with many bug fixes. * version 4.0.7 fixing some issues needed by GFAL and lcg_util. * The two consecutive versions of YAIM are released at the same time in order to have a complete list of fixed issues, but sites will need to install only the latest one. ---++++ EGEE Items From ROC Reports * IT-ROC: Most of the errors (lcg-cr test for CE) at Italian sites (last night until early this morning), were due to our top-bdii egee-bdii.cnaf.infn.it: one of the dns configured on it was unreachable, so the bdii has been emptied. * SEE ROC:Some middleware components do not like 3 years period logs (which it is a requirement) due to system limits, please see the corresponding ticket at https://gus.fzk.de/ws/ticket_info.php?ticket=48291. * The requirement is that the logs are kept, but they can be archived, they do not have to remain on the server. * Answer from Maria: the section on audit requirements of the JSPG document says that logs have to be kept * for 2 years for Vo membership (section 4.7) https://edms.cern.ch/file/428034/3/VOMembershipManagement-v3.4.pdf * for 90 days for operational (high-frequency) system transaction https://edms.cern.ch/file/428037/3/Traceability-Logging-v2.0.pdf (section 4) * SWE ROC: 32bit binaries overwrite 64bit binaries for lcg-utils (installation of WNs). * Answer from Oliver Keeble: the solution will be to split the libraries out into separate rpms (standard practise). We're waiting for this from the DM team. In the meantime the workaround is to reinstall the 64bit binaries only (Andreas should be able to give you a link to where this is documented). * SWE ROC: last update to lcg-utils broke the SAM tests. * Answer from John Shade: It is a bug in lcg-utils, but as a work-around we will remove the time-out in SAM, this should remove the segmentation fault experienced by the tests. ---++ WLCG Items ---++++ WLCG issues coming from ROC reports ---++++ Upcoming WLCG Service Interventions * Consult links on the agenda page. ---++++ WLCG Service Coordination Nothing to report. ---++++ ATLAS Service ---++++ ALICE Service ---++++ CMS Service ---++++ LHCb Service ---++ OSG Items * Maria asked for a status of the OIM decision on adding the e-mail addresses to OIM. * Discussion of open tickets for OSG * ggus #44104. This ticket is waiting on the OSG GOC to roll out changes to their production BDII that will publish entries by their OSG resource group, not the OSG resource name. This will remove this issue before it gets to the BDII. Next action deadline in OIM is in Feb 2010. Should we close as unsolved to free the escalation reports? * Rob will check as it might be fixed sooner. * ggus #37059. Urgent ticket re-opened. Please have a look. * ggus #47786. Site concerned is Nebraska. Urgent. Submitted 2009-04-08! Some OSG reminders remain unanswered by the site (?) The submitter arbitrarily decided no LHCb jobs should be submitted at the Nebraska site but this is not the opinion of the VO management. A generic queue to be used when resources are spare would be appreciated. * Rob will check but the most likely solution is "Nebraska does not support LHCb". ---+++ Newly Created Action Items <!-- This is an example action item, just add new action items here. Please delete the example one. Note the example gets expanded in the template, Please when you duplicate delete the uid="xxxx" , closed="DD-MMM-YYYY", and closer="Main.SteveTraylen" they will be added automatically with an increment. A valid action item should have a "created", "creator", "due", "state" and "who". Obviously the state should be "open" not "closed". --> ---+++ Review of Open Action Items <!-- Leave these next two so as to display open and recently closed action items.. --> ---+++ Open Action Items %ACTIONSEARCH{topic="(OpsActionItemsDB|.*WlcgOsgEgeeOpsMinutes2.*)" sort="$uid" state="open" format="|$uid|$creator|$text|$created|$due|$who|$edit|" header="|Id|Submitter|Description|Creation|Due|Assigned To||" }% ---+++ Actions Closed in Last 20 Days %ACTIONSEARCH{topic="(OpsActionItemsDB|.*WlcgOsgEgeeOpsMinutes2.*)" sort="$uid" state="closed" closed="> 20 days ago" format="|$uid|$creator|$text|$created|$due|$who|$closed|$edit|" header="|Id|Submitter|Description|Creation|Due|Assigned To|Closed||" }% ---++ AOB * To the French people that are connecting using the anonymopus syp appearing as 0033...: it would be highly appreciated if you could join the web conference by clicking on the audioconf link (choosing 'web conference only') and write your names and roles explicitely. This will make life easier for the minute taker. * Q: Ron Trompert for SARA: Estimate for a UI on SL5? * A: there is no estimate so far. It will be reported at the next GDB, on May 13th. * In reply to the high availability LFC question from the SEE ROC, please consult the gridPP blog http://www.gridpp.rl.ac.uk/blog/2009/05/11/new-house-for-lfc-and-fts-backends/ ---++ Next Meeting The next meeting will be Monday, 18 May 2009 14:00 UTC (16:00 Swiss local time). * Attendees can join from 13:45 UTC (15:45 Swiss local time) onwards. * The meeting will start promptly at 14:00 UTC (16:00 Swiss local time). * To dial in to the conference: * Dial +41227676000 * Enter access code 0148141 --- These minutes can only be changed by members of: * Set ALLOWTOPICCHANGE = Main.WLCGOpsMeetGroup
E
dit
|
A
ttach
|
Watch
|
P
rint version
|
H
istory
: r2
<
r1
|
B
acklinks
|
V
iew topic
|
WYSIWYG
|
M
ore topic actions
Topic revision: r2 - 2009-05-13
-
DianaBosio
Log In
EGEE
EGEE Web
EGEE Web Home
gLite
ProductTeams
SA3
JRA1
TMB
EMT
SA1
SA2
NA2
NA4
EGEE-UIG
List of
registered projects
List of EGEE-RP
interactions
Changes
Index
Search
Main.WebList
Welcome Guest
Login
or
Register
Cern Search
TWiki Search
Google Search
EGEE
All webs
Copyright &© by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Ask a support question
or
Send feedback