WLCG-OSG-EGEE Ops' Minutes Mon 27 Oct 2008

Summary

Collecting details of what the cream-CE must be capable of before the lcg-CE can be really dropped, e.g CondorG<->CreamCE support.

Attendance

EGEE

  • Asia Pacific ROC: Jason
  • Central Europe ROC: Malgorzata
  • OCC / CERN ROC: John Shade, Antonio Retico, Nick Thackray, Steve Traylen, Maria,
  • French ROC: Osman
  • German/Swiss ROC: Angela
  • Italian ROC:
  • Northern Europe ROC: Ron Trompert
  • Russian ROC: Lev Shamardin,
  • South East Europe ROC:
  • South West Europe ROC: Kai
  • UK/Ireland ROC: Jeremy Coles
  • GGUS: Helmet

WLCG

  • WLCG Service Cordination: Harry Renshall, Jamie Shiers

WLCG Tier 1 Sites

  • ASGC: Jason
  • BNL: Absent
  • CERN site: Harry
  • FNAL: Catalin Dumitrescu
  • FZK: Angela
  • IN2P3:
  • INFN:
  • NDGF:
  • PIC: Kai
  • RAL: Gareth Smith
  • SARA/NIKHEF: Absent
  • TRIUMF: Absent
  • OSG: Rob Quick

LHC Experiments

  • ATLAS: Alessandro di Girolamo
  • LHCb: absent.
  • CMS: absent
  • ALICE: Patricia Mendez

Feedback on Last Week's Minutes

None was given.

EGEE Items

Grid Operator Hand Over on Duty

  Primary Team Secondary Team
From SE DECH
To NE CERN

Report from SouthEast Europe
* During the week the GGUS:42172 was transfered to political instances. The last SAM are ok now, but it is still opened. We believe the problem is fixed. * The GGUS:40707 (Political) is duplicated GGUS:42008. The second one needs more time to solve the problem and the first one was extended.
Report from DECH
  • Two GGUS:42199 and GGUS:42015 unanswered for one site, ITPA-LCG2, both transfered to the next step, political instance.

Assigned to Due date Description State Closed Notify  
RocNorth 2009-11-10 Check IPTA-LCG2 for progress on GGUS:42015. This now led to suspension action on 2008-11-10. Closing this one. 2008-11-12 edit
* The situation with the GGUS:40521 is not clear, the case is at the last step of escalation for long time - corresponding ROC should review the situation on operational meeting.

Assigned to Due date Description State Closed Notify  
RocRussia 2009-11-10 Check RU-Phys-SPbSU for progress on GGUS:40521. RocRussia did suspend. 2008-11-12 edit
  • CERN and NE will have to contact last weeks ROC offline and should also look at the above tickets for their region.

PPS Reports

EGEE Items From ROC Reports

$ ROC NE: Vera Hansper has submitted GGUS:42341 on October 15th. This ticket has been assigned the the GridView support unit. No action has been undertaken yet to solve this ticket. Create an action with John to check there is some progress.

Assigned to Due date Description State Closed Notify  
JohnShade 2009-11-10 Check there is progress on GGUS:42341

Update: Current monitoring architecture does not allow one service to be on multiple nodes/sites. A new database schema is being worked on (based on service end-points), but the restriction of binding a service to a specific site will probbaly remain.

2008-11-12 edit
$ ROC UK/I: Noticed some problems with the formatting of this ROC report. Multiple unlabelled lines for the downtime history against many sites and only some of these line up. Please submit a GGUS ticket.

gLite Release News

  • OpsMeetingGliteReleases
  • Calm week , release in preparation for PPS gLite 3.1 update 38.
    • Bug fixes for the CREAM clients. PATCH:2002.
    • Major release of VOMS. See PATCH: In particular the FQAN ordering.
    • New job manager for Sun Grid Engine
    • New GFAL.

WLCG Items

Cream CE submission.

  • List of criteria needed for CREAM to replace the lcg-CE, e.g GRAM (WS and preWS) and CondorG.
  • Isn't ICE needed? Yes it is needed.
  • A fuller list will be produced next week by Nick following feedback.

Pool Accounts and VO Boxes.

  • Not particularly an ALICE problem,
  • Sites generally deploy anyway without pool accounts.
    • Need to review why pool accounts cannot be used.
    • Open a BUG for why pool accounts can not be used.

WLCG issues coming from ROC reports

* UK/I Noticed few sites are passing LHCb SAM tests. e.g problems registered files with book keeping service. LHCb to give an update next week. Create action item on LHCb.

Assigned to Due date Description State Closed Notify  
Main.LHCb 2009-11-10 From UK/I but general, why are so many sites failing LHCb SAM tests. Please can LHCb give a summary. Roberto Santin. will check again on 11.11.2008. 2008-11-19 edit
* CERN ROC, new Role=pilot deployed on CERN-PROD.

Upcoming WLCG Service Interventions

* See the lists: CERN-PROD, 10 of the CEs will be down tomorrow each for about 30 minutes.

ATLAS Service

* ATLAS have stopped cosmic data runs => data flow dropped accordingly. * Submitted ALARM to CNAF and has not been treated accordingly. GGUS:42620. * Please GGUS submit a round a tests if possible. * (Since the meeting this has been partially understood and discussed.)

ALICE Service

  • How many VO boxes are used?
    • For Tier1s then two.
    • Second VO box needed to submit to CreamCE if it is there basically.

CMS Service

absent

LHCb Service

absent

WLCG Service Coordination

OSG Items

  • Rob quick stated that the new GGUS<->OSG interface is now in place. Looks okay from their end asked if okay from GGUS side. No one present to comment.
    • We used to have 3 or 4 open tickets and now more like 7. Will contact Guenter directly at GGUS.

  • Sensitivity of ticket escalation has been increased so now more tickets appear as in need of investigation. One considered urgent concerning. GGUS:41058. Rob will be checked.

AOB

Action Items

Newly Created Action Items

Review of Open Action Items

Open Action Items

IdSubmitterDescriptionCreationDueAssigned To 

Actions Closed in Last 20 Days

IdSubmitterDescriptionCreationDueAssigned ToClosed 

Next Meeting

The next meeting will be Monday, dd mmm 2007 15:00 UTC (16:00 Swiss local time).

  • Attendees can join from 14:45 UTC (15:45 Swiss local time) onwards.
  • The meeting will start promptly at 15:00 UTC (16:00 Swiss local time).
  • The WLCG section will start at the fixed time of 15:30 UTC (16:30 Swiss local time).
  • To dial in to the conference:
    • Dial +41227676000
    • Enter access code 0157610


These minutes can only be changed by members of:

Edit | Attach | Watch | Print version | History: r6 < r5 < r4 < r3 < r2 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r6 - 2008-11-19 - SteveTraylen
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    EGEE All webs login

This site is powered by the TWiki collaboration platform Powered by Perl This site is powered by the TWiki collaboration platformCopyright & by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Ask a support question or Send feedback