Difference: ProductionOperationsWLCGJan10Reports (2 vs. 3)

Revision 3, 2010-02-08 - unknown

Line: 1 to 1
 

January 2010 Reports

To the main
Changed:
<
<

29th January 2009 (Friday)

>
>

29th January 2010 (Friday)

  Experiment activities:
Line: 41 to 41
 
Changed:
<
<

28th January 2009 (Thursday)

>
>

28th January 2010 (Thursday)

  Experiment activities:
Line: 62 to 62
 
    • shared area problem at AUVERGRID, Milano and Pisa
Changed:
<
<

27th January 2009 (Wednesday)

>
>

27th January 2010 (Wednesday)

  Experiment activities:
Line: 82 to 82
 
  • T2 sites issues:
    • none
Changed:
<
<

26th January 2009 (Tuesday)

>
>

26th January 2010 (Tuesday)

  Experiment activities:
Line: 105 to 105
 
  • T2 sites issues:
    • Shared area issues at both UKI-SOUTHGRID-RALPP and AUVERGRID
Changed:
<
<

25th January 2009 (Monday)

>
>

25th January 2010 (Monday)

  Experiment activities:
Line: 124 to 124
 
    • PIC: an issue with one file for some user analysis jobs; under investigation by our local contact person.
    • CNAF: migration to TSM is now complete.
Changed:
<
<

22nd January 2009 (Friday)

>
>

22nd January 2010 (Friday)

  Experiment activities:
Line: 144 to 144
 
    • Shared area and SQLite issues
Changed:
<
<

21th January 2009 (Thursday)

>
>

21th January 2010 (Thursday)

  Experiment activities:
Line: 161 to 161
 
    • CNAF: registering data of the new T1Dx endpoint in LFC.
    • PIC: the low-efficiency jobs observed yesterday were, as suspected, user jobs whose output sandbox upload to CASTOR at RAL was hanging. DIRAC has every possible timeout in place. The job stack is no longer available because the jobs were finally killed by the LRMS, so no further investigation is possible.
    • RAL: similarly, another user job seems to have consumed 49 seconds of CPU time over a wall-clock time of 55 hours.
Changed:
<
<

20th January 2009 (Wednesday)

>
>

20th January 2010 (Wednesday)

  Experiment activities:
Line: 181 to 181
 
  • T2 sites issues:
Changed:
<
<

19th January 2009 (Tuesday)

>
>

19th January 2010 (Tuesday)

  Experiment activities:
Line: 203 to 203
 
  • T2 sites issues:
Changed:
<
<

18th January 2009 (Monday)

>
>

18th January 2010 (Monday)

  Experiment activities:
Line: 225 to 225
 
  • Others:
    • GGUS portal problem when submitting a TEAM ticket. Opened a normal GGUS ticket against GGUS support.
Changed:
<
<

15th January 2009 (Friday)

>
>

15th January 2010 (Friday)

  Experiment activities:
Line: 247 to 247
 
    • CNAF: confirmed that yesterday's problem was due to GPFS being unavailable. Problem fixed.
  • T2 sites issues:
    • SAM failing at INFN-NAPOLI-CMS and jobs aborting at UKI-SOUTHGRID-BHAM-HEP
Changed:
<
<

14th January 2009 (Thursday)

>
>

14th January 2010 (Thursday)

  Experiment activities:
Line: 270 to 271
 
    • CNAF: all Stripping jobs were failing there. It looks like a load problem with StoRM: apparently the jobs managed to retrieve the tURL of the file (GPFS file://) but then got stuck accessing the data, with no further updates. We are gathering more information, after which a GGUS ticket will be submitted.
  • T2 sites issues:
    • none
Changed:
<
<

13th January 2009 (Wednesday)

>
>

13th January 2010 (Wednesday)

  Experiment activities:
Line: 290 to 291
 
  • T2 sites issues:
    • none
Changed:
<
<

12th January 2009 (Tuesday)

>
>

12th January 2010 (Tuesday)

  Experiment activities:
Line: 309 to 310
 
    • none
  • T2 sites issues:
    • none
Changed:
<
<

11th January 2009 (Monday)

>
>

11th January 2010 (Monday)

  Experiment activities:
Line: 330 to 331
 
    • IN2P3: the problem reported before Xmas, due to a third-party library used with gsidcap, has been fixed by the developers but needs some more testing. In the meantime Lyon moved to the dcap protocol and the SE can be unbanned in the LHCb production mask.
  • T2 sites issues:
    • Shared area issue
Changed:
<
<

8th January 2009 (Friday)

>
>

8th January 2010 (Friday)

  Experiment activities:
Line: 350 to 351
 
    • SARA: WMS is no longer submitting CondorG jobs because too many jobs (1500) have piled up in the ICE queue (see PIC)
  • T2 sites issues: none reported
Changed:
<
<

7th January 2009 (Thursday)

>
>

7th January 2010 (Thursday)

  Experiment activities:
  • Because of a wrong date format (a problem discovered by Pablo on the 31st of December), the LHCb SSB was not publishing fresh results for any of its views until yesterday. The problem has been sorted out.
Line: 368 to 369
 
    • CNAF: discussions/evaluation about migrating CASTOR to TSM. About 6 TB of data is involved (one night to copy everything).
  • T2 sites issues:
Changed:
<
<

6th January 2009 (Wednesday)

>
>

6th January 2010 (Wednesday)

  Experiment activities:
 