2009 Q1
Job Statistics
- Summary:
- Over 1000 K jobs run
- About 25% failed
- Routinely running over 10 K jobs per day
- Daily peak of over 80 K jobs
- 440 K Production jobs run to end
- 180 K User jobs run to the end
- 135 K Production Jobs Failed
- 114 K User Jobs Failed
- Total number of Jobs by Final Major Status
- Daily number of Jobs by Final Mayor Status
- Done|Completed Jobs by User Group
- Done|Completed Production Jobs by Job Type
- Failed Jobs by User Group
- Failed Production Jobs by Minor Status
- Failed User Jobs by Minor Status
Running at Tier1's
- Summary:
- 4 K Production Jobs at Tier1s
- Monte Carlo Production was done outside Tier1's
- 130 K User Jobs at Tier1s
- 50 % CERN Share
- very low contributions from CNAF, NIHKEF, IN2P3
- Done|Completed Production Jobs by Site
- Done|Completed User Jobs by Site
Job Failure Analysis
- Summary:
- Production Jobs Failed mostly due to:
- Application Error everywhere (86 K)
- Input Sandbox Download mostly at Shefield, RAL-HEP, IN2P3-T2 (45 K)
- User Jobs Failed mosty due to:
- Input Sandbox Download mostly at CERN, Durham, Dortmund, RAL (75 K)
- Application Error mostly at IN2P3-T2, CERN (20 K)
- Input Data Resolution, notice large fraction of errors at NIHKEF with respect to successful jobs (11 K)
- Comment about any site is appropriated
- Large number of problems with Input Data Resolution outside Tier1's.
- Failed Production Jobs (Application Finished With Error) by Site
- Failed Production Jobs (Input Sandbox Download) by Site
- Failed User Jobs (Input Sandbox Download) by Site
- Failed User Jobs (Application Finished With Error) by Site
- Failed User Jobs (Input Data Resolution) by Site
Job Failure at Tier1 Analysis
- Summary:
- Most activity at Tier1 related to user Jobs.
- Production Jobs Failed mostly due to:
- Application Error everywhere (86 K)
- Input Sandbox Download mostly at Shefield, RAL-HEP, IN2P3-T2 (45 K)
- User Jobs Failed mosty due to:
- Input Sandbox Download mostly at CERN, Durham, Dortmund, RAL (75 K)
- Application Error mostly at IN2P3-T2, CERN (20 K)
- Input Data Resolution, notice large fraction of errors at NIHKEF with respect to successful jobs (11 K)
- Comment about any site is appropriated
- Large number of problems with Input Data Resolution outside Tier1's.
- Failed Jobs at CERN by Minor Status
- Failed Jobs at CNAF by Minor Status
- Failed Jobs at GRIDKA by Minor Status
- Failed Jobs at IN2P3 by Minor Status
- Failed Jobs at NIKHEF by Minor Status
- Failed Jobs at PIC by Minor Status
- Failed Jobs at RAL by Minor Status
- Failed Jobs at GRIDKA by Minor Status
--
RicardoGraciani - 10 Jun 2009
This topic: LHCb
> WebHome >
LHCbComputing >
DIRACWeeklyReport > DIRACQuarterlyReport2009Q1
Topic revision: r2 - 2009-06-10 - RicardoGraciani