Week of 130610

Daily WLCG Operations Call details

To join the call, at 15.00 CE(S)T Monday to Friday inclusive (in CERN 513 R-068) do one of the following:

  1. Dial +41227676000 (Main) and enter access code 0119168, or
  2. To have the system call you, click here

The scod rota for the next few weeks is at ScodRota

WLCG Availability, Service Incidents, Broadcasts, Operations Web

VO Summaries of Site Usability SIRs Broadcasts Operations Web
ALICE ATLAS CMS LHCb WLCG Service Incident Reports Broadcast archive Operations Web

General Information

General Information GGUS Information LHC Machine Information
CERN IT status board WLCG Baseline Versions WLCG Blogs GgusInformation Sharepoint site - LHC Page 1



  • local: Stefan, Maarten, Xavi, Ivan, Manuel
  • remote: Xavier, Maria, David, Lisa, Vladimir, Gareth, Wei-Jen, Paolo, Rob, Rolf,

Experiments round table:

    • T0/Central services
      • Jobs at OPENSTACK_CLOUD at CERN are failing because of bad credentials GGUS:94716 & INC:12844. Ongoing.
    • T1
      • NDGF-T1: GGUS:94670 Transfers from NDGF-T1 are failing with "An end of file occurred". Pool servers reset. Fixed.

  • CMS
    • Production at moderate levels with upgrade MC production
    • GGUS:94505, GGUS:94615 File read issues at RAL due to high load there.
    • GGUS:94741, GGUS:94748, GGUS:94750 File read issues at IN2P3, possible need for file replication. -- GGUS tickets appear to all stem from the same Savannah ticket
    • GGUS:94595 Test alarm ticket -- been quietly watching this -- anything we need to do here?
      • Maarten: no further action needed from your side

  • ALICE -
    • Russian network: on June 4 the GEANT link to Moscow was cut to 100 Mbit/s (GGUS:94540) and since that time there have been many job failures at most of the Russian sites due to timeouts. To alleviate the congestion and avoid job loss, the most affected ALICE sites have been closed for job processing Sun evening:
      • IHEP
      • JINR
      • MEPHI
      • PNPI

  • LHCb
    • Incremental stripping campaign in progress and MC productions ongoing
    • T0:
    • T1:
      • GRIDKA: Problem with staging during weekend; Solved

Sites / Services round table:

  • KIT: Problems with CMS squids, overloaded, used former ATLAS squids to mitigate the problem.
  • RAL: NTR
  • OSG: NTR
  • IN2P3: Currently in downtime for MSS, due to robotics maintenance. Dcache will be upgraded to 2.2.10. Long DT tomorrow for several services, except FTS and operations portal. Adding CREAM-CE under SL6. Batch back Tue evening. Robotics will be back Wed morning.
  • Storage: Hotfix next Mo/Tue for VO experiments for Castor. 5 mins transparent change.
  • Dashboard: NTR
  • Data processing: NTR




  • local:
  • remote:

Experiments round table:

  • ALICE -

Sites / Services round table:


This topic: LCG > WebHome > WLCGCommonComputingReadinessChallenges > WLCGOperationsMeetings > WLCGDailyMeetingsWeek130610
Topic revision: r3 - 2013-06-10 - StefanRoiser
This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback