WLCG-OSG-EGEE Ops' Minutes Mon 29 Jun 2009

Summary

  • A patch is available to workaround the problem of mod_ssl with CA update 1.30, which involves padding the list with bogus CAs.
  • gLite 3.1 Update 48 was released to production, and will be followed within two weeks by Update 49 which will contain a bundle of new features, including the first release of Site Central Authorization Service (SCAS).

Attendance

EGEE

  • Asia Pacific ROC: ShuTing Liao
  • Central Europe ROC: Malgorzata Krakowian
  • OCC / CERN ROC: Diana Bosio, John Shade, Antonio Retico, Maria Dimou, Maarten Litmath
  • French ROC: Rolf Rumler, Pierre Girard
  • German/Swiss ROC: Sven Hermann
  • Italian ROC: Absent
  • Northern Europe ROC: Zeeshan Ali Shah, Ron Trompert
  • Russian ROC: Lev Shamardin
  • South East Europe ROC: Marios Chatziangelou
  • South West Europe ROC: Christian Neissner
  • UK/Ireland ROC: Jeremy Coles
  • GGUS: Helmut Dres

WLCG Tier 1 Sites

  • ASGC: Jason Shih
  • BNL: Absent
  • CERN site: Absent
  • FNAL: Absent
  • FZK: Sven Hermann
  • IN2P3: Pierre Girard, Rolf Rumler
  • INFN: Absent
  • NDGF: Thomas Bellman
  • PIC: Absent
  • RAL: Gareth Smith
  • SARA/NIKHEF: Ron Trompert
  • TRIUMF: Absent

OSG GOC

  • Rob Quick
  • Kyle ?

There was a terrible echo, undimished by Diana's repeated requests for people to mute their phones and CERN re-joining the conference. As a result, the meeting got underway 10 minutes late.

Feedback on Last Week's Minutes

None was asked for, and none was given.

EGEE Items

Grid Operator Hand Over on Duty

From ROC CE
To ROC NE

  • Malgorzata (CE) complained that there were several ALARMS for CERN ROC older than 130h, not handled by CERN ROD. The problem was that alarms were in OK status but not closed (ROD has 72 hours to handle the tickets). Diana was surprised, as she'd closed all the relevant tickets on Thursday. The conclusion was that it was probably a two-day glitch in the operations dashboard. Diana pointed out that 90% of tickets for ROC CERN are not worth a ticket - but that's another issue.

PPS Reports and Issues

  • A new version of lcas (glite-security-lcas-1.3.11-2) was released to the pilot and it is now being installed by the three participant sites.

gLite Release News

  • gLite 3.1 Update 48 was released to production. It contains:
    • Improvements to fetch-crl script
    • WN: grid-cm-* packages provide worker node configuration monitoring published on the Active MQ messaging system
    • The new lcg-info-dynamic-software package GIP handles values of software tags for SubClusters
    • dCache 1.9.1-7 Sever and 1.9.0-9 Client release i386
    • Installation of CREAM and CEMon client on VOBOX

  • gLite 3.1 Update 49 (a biggy!) should be available within two weeks. See above link for details but, in a nutshell:
    • First release of Site Central Authorization Service (SCAS) + glexec + LCAS/LCMAPS
    • LFC version 1.7.2-3
    • YAIM for lcg-CE for the gatekeeper to publish information about installed capacity

EGEE Items From ROC Reports

  • DECH: The announcement about the FZK 2nd July 2 hour outage didn’t make it from GOCDB to CIC portal. Sven has opened GGUS:49747 about the problem. [GOCDB has been switching back & forth from its backup instance during the RAL reorganization, so the notification may have got lost in the process - JRS]

Grid Service Interventions

  • RAL should be back fully operational next Monday (they’re moving). See agenda link for additional outages.

1.30 CA update

  • Special guest Maarten Litmath explained in detail the problems with the latest CA release and the suggested workaround. Savannah ticket 48458 has been updated with an RPM for services that might break. It is difficult to predict who will be hit, but CERN and GRIF CERN & GRIF have been bitten so far. IGTF don’t want to provide an official workaround, but an RPM which includes bogus CAs in addition to the real list will be made available. For the moment, a manual installation is necessary on services that need it (eg. WMS). Maarten pointed out that everyone will eventually hit the bug as new CAs are added at every release. [SAM grace period has been extended by one week to allow the situation to decant - JRS]

OSG Items

  • There were three items from Maria. Rob is following up, although there were some address problems. Rob requested more information about the complaint concerning GGUS:49599, since it was resolved quickly. Rob and Maria to follow-up off-line.

Newly Created Action Items

Assigned to Due date Description State Closed Notify  
Main.OCC 2007-03-05 Example Action Item 2007-03-06 SteveTraylen   edit

Review of Open Action Items

There were no action items to review.

Open Action Items

IdSubmitterDescriptionCreationDueAssigned To 

Actions Closed in Last 20 Days

IdSubmitterDescriptionCreationDueAssigned ToClosed 

AOB

  • Just a quick reminder from John for ROCs to send explanations for sites that failed to meet the standard 70/75 availability/reliability criteria. A suggestion was made that it would be nice if the CIC portal could provide a means for sites to enter explanations themselves. Rolf kindly agreed to check the feasibility with the developers.

Next Meeting

The next meeting will be Monday, 06-JUL-2009 14:00 UTC (16:00 Geneva time).

  • To dial in to the conference:
    • Dial +41227676000
    • Enter access code 0148141


These minutes can only be changed by members of:

Edit | Attach | Watch | Print version | History: r3 < r2 < r1 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r3 - 2009-07-01 - JohnShade
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    EGEE All webs login

This site is powered by the TWiki collaboration platform Powered by Perl This site is powered by the TWiki collaboration platformCopyright &© by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Ask a support question or Send feedback