(2010-02-08, MaiteBarroso)
<!-- Page created from template WlcgOsgEgeeOpsMinutesTemplate
Having created a new minutes page correct the following information.
The index page will be updated automatically.
Change the indico id and who ever was the chair. -->

---+!! WLCG-OSG-EGEE Ops' Minutes Mon 25 Jan 2010

<!-- This next block is the meta data for the meeting. Leave it in this simple format of
Keyword: Value
It is this that the table on the list of all minutes is constructed from. -->
   * Date: Mon 25 Jan 2010
   * Agenda: [[http://indico.cern.ch/conferenceDisplay.py?confId=82497][82497]]
   * Chair: Main.MaiteBarroso
   * Minutes: Main.NicholasThackray
   * Minutes from previous meetings: WlcgOsgEgeeOpsMeetingMinutes
   * Distribution: grid-operations-meeting@cern.ch

%TOC%

<!-- Write a short summary of the meeting in between the following summary section tags for Alberto's WLCG news letter.
Note the summary is included within the index page with <pre></pre> tags which is unfortunate.
Consequently try to keep you lines to a sensible width to fit on the page.
Also wikiwords will NOT be expanded to a hyperlink. -->

---++ Summary

%STARTSECTION{name="summary" type="section"}%
No summary yet.
%ENDSECTION{name="summary" type="section"}%

---++ Attendance

---++++ EGEE
   * Asia Pacific ROC:
   * Canadian ROC:
   * Central Europe ROC:
   * OCC / CERN ROC: Maite Barroso, Antonio Retico, Nick Thackray
   * French ROC: Helene Cordier
   * German/Swiss ROC: Angela Poschlad
   * Italian ROC:
   * Latin American ROC:
   * Northern Europe ROC: Ron Trompert, Gert Svensson
   * Russian ROC: Lev Shamardin
   * South East Europe ROC: Ioannis Liabotis
   * South West Europe ROC: Christian Neissner
   * UK/Ireland ROC: Jeremy Coles
   * GGUS:
   * GOCDB:
<!-- * OSCT: project-egee-osct@cern.ch -->

---++++ WLCG Tier 1 Sites
   * ASGC:
   * BNL:
   * CERN site:
   * FNAL:
   * FZK: Angela Poschlad
   * !IN2P3:
   * INFN:
   * NDGF:
   * PIC:
   * RAL:
   * SARA/NIKHEF: Ron Trompert
   * TRIUMF:

---++ Feedback on Last Week's Minutes

None was given.

---++ EGEE Items

---++++ Grid Operator Hand Over on Duty

| | *c-COD Team* |
| *From* | ROC Italy |
| *To* | ROC France |

   * Report from cCOD: <i>Handover Log:
      * some expired tickets (ROD_CANADA, ROD_ICALG) that were solved
      * others with expiration dates extended due to downtimes (ROD_NE)
      * another expired ticket that has received no answer: ROC_CANADA - #54707 (APEL)
      * an expired ticket for SAMPA (ROC_LA) that seems to have lost its connection with the alarm; no action has been taken by ROD_LA
      * 2 tickets for CERN_PROD, both apparently due to middleware problems: #53931 (CREAM-CE) has a "suggested fix" that was not applied; #54424 is an APEL problem, expired</i>

*Maite:* What is the procedure when a ticket expires?<BR>
*Helene:* It goes to the OCC and is discussed at this meeting.<BR>
The two CERN tickets will be checked off-line.

---+++++ Sites Considered For Suspension

None.

---++++ Pilot Services Reports and Issues
   * News about active pilots can be found at LCG.OpsMeetingPilots
   * Last week there was a checkpoint meeting for ARGUS. All sites are now up and running and a glexec-ARGUS chain is available.

---++++ gLite Release News
   * Please find gLite release news in: https://twiki.cern.ch/twiki/bin/view/LCG/OpsMeetingGliteReleases
   * gLite 3.2 staged rollout in progress. See link for details. Tentative date for release is 27 Jan.<BR> *Need some Staged Rollout sites for MPI - volunteers please!*

---++++ EGEE Items From ROC Reports
   * *From ROC DECH:* <BR> <u>LCG2-FZK Service Incident</u>: Planned downtime affecting ATLAS: OUTAGE 2010-02-01 8:00 to 2010-02-05 15:00 (UTC). The dCache instance for ATLAS (atlassrm-fzk.gridka.de) will be migrated to Chimera.
   * *From ROC Russia:* <BR> Wrong version detection command for the LB service. https://savannah.cern.ch/bugs/?61586 . This bug duplicates https://savannah.cern.ch/bugs/?55482 from 2009-09-09 09:59, so it has not been corrected in 3(!) months.<BR><BR> This will be fixed in gLite 3.2 but not in gLite 3.1.
     Nick will follow up on this %BLUE% *[ACTION]* %ENDCOLOR%.
   * *From Russia ROC:* <BR> Certificate issue from Belgrade certificate. Causes problems with dCache. Savannah BUG:61819.

---++++ Fixing MPI sites (from the MPI WG)

<i>Dear Maite (CC. Steven)

It seems that already today some sites are starting to fix their MPI problems :)

We also got a few reactions wondering about this sudden urge to fix MPI site problems now. It would certainly help if the ROCs received an explanatory e-mail about the MPI Task Force mission, containing also the link to the official documentation for MPI Support in EGEE, meaning this one: https://twiki.cern.ch/twiki/bin/view/EGEE/MpiTools , which each ROC should distribute to its sites.

Many people are concerned because they have followed some documentation which is also online, coming from SEE Grid and particular to a certain cluster in Budapest.

There are reasons today for being optimistic, because people are fixing the issues, and mpi-start continues to work without any problem in the CREAM CE (see http://indico.ifca.es/indico/getFile.py/access?contribId=10&sessionId=1&resId=1&materialId=slides&confId=249 )

However, in the lifetime of EGEE we can probably only fix the current sites and arrange the documentation properly. Anything else, like new features of the middleware, will have to wait for future developments. See here for the status of mpi-start: http://indico.ifca.es/indico/getFile.py/access?contribId=2&sessionId=0&resId=0&materialId=slides&confId=249

cheers, Isabel

More information about the MPI knowledge DB: http://wiki.ifca.es/e-ciencia/index.php/MPI_Errors </i>

No questions were raised.

---++++ Instances of out of date services in the grid

<i>Attached </i>[TO THE AGENDA]<i> you can find a list of instances of services that are out-of-date according to the list of supported service versions wiki page, here: https://twiki.cern.ch/twiki/bin/view/EGEE/SupportedServiceVersions </i>

   * *Nick:* Could each ROC please go through the list and pick out any of their sites? They should then contact those sites and ask them to update the services that are out-of-date. <BR> *Angela:* What if a site needs to keep an old version of a service for a VO it supports? Do we then force sites to update or take their services off-line? <BR> *Nick:* No. If a site gives a good reason for needing to keep an unsupported version of a service, it can do this. _However_, the site must understand that it will not get any support for this service and that, if a security issue is found with the service, it may then be forced to either upgrade that service or take it off-line.

---+++ Newly Created Action Items

<!-- This is an example action item, just add new action items here. Please delete the example one.
Note the example gets expanded in the template, Please when you duplicate delete the
uid="xxxx" , closed="DD-MMM-YYYY", and closer="Main.SteveTraylen" they will be added automatically
with an increment. A valid action item should have a "created", "creator", "due", "state" and "who".
Obviously the state should be "open" not "closed". -->

%ACTION{ closed="2010-02-08" closer="" created="2010-01-25" creator="Main.NickThackray" due="2010-02-08" notify="" state="closed" uid="000081" who="Main.OCC" }% Wrong version detection command for the LB service. BUG:61586 . This bug duplicates BUG:55482 from 2009-09-09 09:59, so it has not been corrected in 3(!) months. <p /><b>UPDATE AT THE MEETING:</b> This will be fixed in gLite 3.2 but not in gLite 3.1. OCC will follow up.
<br /><b>08/02/2010</b> There is now a fix for gLite 3.1; the bug is set to "Fix Certified". I think this action can be closed.
%ENDACTION%

---+++ Review of Open Action Items

<!-- Leave these next two so as to display open and recently closed action items.. -->

---+++ Open Action Items

%ACTIONSEARCH{topic="(OpsActionItemsDB|.*WlcgOsgEgeeOpsMinutes2.*)" sort="$uid" state="open" format="|$uid|$creator|$text|$created|$due|$who|$edit|" header="|Id|Submitter|Description|Creation|Due|Assigned To||" }%

---+++ Actions Closed in Last 20 Days

%ACTIONSEARCH{topic="(OpsActionItemsDB|.*WlcgOsgEgeeOpsMinutes2.*)" sort="$uid" state="closed" closed="> 20 days ago" format="|$uid|$creator|$text|$created|$due|$who|$closed|$edit|" header="|Id|Submitter|Description|Creation|Due|Assigned To|Closed||" }%

---++ AOB

None.

---++ Next Meeting

The next meeting will be Monday, 8th February 2010 16:00 UTC+1 (Swiss local time).
   * Attendees can join from 15:45 UTC+1 onwards.
   * The meeting will start promptly at 16:00 UTC+1.
   * To dial in to the conference:
      * Dial +41227676000
      * Enter access code 0148141

---

These minutes can only be changed by members of:
   * Set ALLOWTOPICCHANGE = Main.WLCGOpsMeetGroup
Topic revision: r3 - 2010-02-08 - MaiteBarroso