GD Group Report for C5-04-Apr-2008
==================================

LCG deployment:
---------------

- Total number of Sites (*): 245

- Software -> Num. Sites (*):
    gLite-3_1_0 -> 166
    gLite-3_0_2 -> 67
    gLite-3_0_0 -> 2
    LCG-2_7_0   -> 2
    unknown     -> 8

- Status -> Num. Sites (*):
    ok       -> 180
    degraded -> 16
    down     -> 49

- Average of concurrently running jobs during this week (+): ~22k

(*) Sites that are Certified _and_ Production _and_ Monitored by SAM:
    https://lcg-sam.cern.ch:8443/sam/sam.py
    To see this page one needs a grid certificate loaded in the browser.
    The calculation of the Site availability (Status) is described at:
    http://goc.grid.sinica.edu.tw/gocwiki/SAM_Metrics_calculation
    The software version comes from the 'CE-sft-softver' CE test.
    Sites not supporting the SAM 'CE' service, or not having sent results
    for this particular test during the last week, are counted as 'unknown'.

(+) Job statistics taken from GStat:
    http://goc.grid.sinica.edu.tw/gstat/
    http://goc.grid.sinica.edu.tw/gstat/total/GIISQuery_Usage_cpu_.html
    For the time being we do not report CPU numbers:
    1. Not all the reported CPUs are actually available for grid jobs.
    2. Sites with multiple CEs may have their CPUs double-counted.
    3. GStat includes sites that are not considered by the SFTs.

EGEE Pre-Production Service Coordination:
-----------------------------------------

* The release of gLite 3.1 Update 18 to production is in preparation.
  The update, to be released soon, will contain:
  - NEW: glite-MON for SL4
  - DPM 1.6.7-4
    . fix for bug #33769: incorrect pool free space after dpm-drain
    . improved ACL management for srmMkdir command
  - UI/WN/VOBOX
    . lcg-tags no longer produces Globus warnings
    . voms-admin client 2.0.6-1 providing ACL support on command line
  - vdt_globus_essentials (affecting several services and notably the CE)
    . bug fix to prevent globus-job-manager processes from piling up on a CE
      (bug observed at CERN after SAM WMS/RB tests were enabled)
  - voms-admin server (VOMS)
    . Refactored voms-admin-ping script
    . ACL management web service (compatible with client >= 2.0.6-1)
    . Registration web service
    . Many bug fixes

EGEE CERN Regional Operations Centre (ROC):
-------------------------------------------

* COD week for CERN ROC

* GGUS ticket processing:
  The GGUS bug that sometimes blocked updates to/from PRMS/Remedy was fixed.
  Just after the fix, GGUS re-processed the backlog of missing updates.
  This caused a "storm" of non-synchronised updates, which in some cases
  re-opened tickets that were already closed. In other cases (more annoying
  for Remedy supporters) the updates created new tickets in Remedy.
  We cleaned up the situation as best we could, but spurious entries in open
  Remedy tickets could still be present.

CERN GRID Pre-Production Site (CERN_PPS):
-----------------------------------------

* A discussion was started with LHCb to set up at CERN a pre-production
  instance of an AMGA server for the VO, based on the freshly released gLite
  distribution.

ETICS:

- A new version of the ETICS system (2.0.3-1) has been deployed today in
  production. This is a maintenance release fixing bugs found since the
  previous release in November 2007.
- The following new external components have been added to the repository in
  response to user requests:
  o gSOAP 2.7.10
  o Condor 7.1.0
- Work is progressing on the new major release, which focuses on performance
  improvements.
- Builds/tests submitted this week: 1324

gLite 3.x Build & Integration:

- Certification repository

  gLite 3.0
  ---------------------------------------
  .. Presently
     .. 0 in preparation
     .. 0 in configuration
     .. 0 in certification

  gLite 3.1
  ---------------------------------------
  .. Presently
     .. 8 in preparation
     .. 0 in configuration
     .. 16 in certification

- PPS repository

  gLite 3.0
  ---------------------------------------
  .. No new release (latest release 3.0.2 PPS Update 46)
     None
  .. Next set of patches scheduled for release to PPS:
     None

  gLite 3.1
  ---------------------------------------
  .. No new release (latest release 3.1 PPS Update 21)
     #1629 VOMS-Admin server 2.0.13-1 & VOMS-Admin client 2.0.6-1
     #1676 new vdt_globus_essentials to fix Globus bug 5771
     #1704 New version of lcg-tags
     #1706 R3.1/slc4/i386: DPM 1.6.7-4 update
     #1707 R3.1/slc4/x86_64: DPM 1.6.7-4 update
     #1708 R3.1/SLC4/i386: glite-AMGA_oracle metapackage
  .. Next set of patches scheduled for release to PPS:
     #1219 fix for DENY tags to lcg-info-dynamic-scheduler
     #1645 R3.1/SLC4/x86_64: GFAL/lcg_util update
     #1680 R3.1/SLC4/x86_64: GFAL 1.10.8
     #1709 [ YAIM ] yaim core and yaim lcg-ce 4.0.4 series (Job Priorities implementation)
     #1728 [ YAIM ] glite-yaim-clients 4.0.3 series
     #1730 new lcg-ManageVOTAg version solving bug 34245
     #1738 R3.1/SLC4/i386: GFAL & lcg-util update
     #1712 R-GMA fix for forwards compatibility

- Production repository

  gLite 3.0
  ---------------------------------------
  .. No new release (latest release 3.0.2 Update 41)
     #1671 R3.0/SLC3/i386: FTS transfer-url-copy update for space tokens
  .. Next set of patches scheduled for release to production:
     None

  gLite 3.1
  ---------------------------------------
  .. No new release (latest release 3.1 Update 17)
     #1571 glite-LSF_utils
  .. Next set of patches scheduled for release to production:
     #1707 R3.1/slc4/x86_64: DPM 1.6.7-4 update
     #1706 R3.1/slc4/i386: DPM 1.6.7-4 update
     #1704 New version of lcg-tags gLite 3.1
     #1676 new vdt_globus_essentials to fix Globus bug 5771
     #1629 VOMS-Admin server 2.0.13-1 & VOMS-Admin client 2.0.6-1
     #1537 glite-MON for gLite 3.1 / SL4

gLite 3.x Testing & Certification:
----------------------------------

* Certification
  .. WMS & LB patches fully updated and back in certification
  .. DPM/LFC 1.6.10 and FTA updates in certification

  Patches certified:
     #1712 R-GMA fix for forwards compatibility
     #1730 new lcg-ManageVOTAg version solving bug 34245
     #1738 R3.1/SLC4/i386: GFAL & lcg-util update

* Other work
  .. Installation of Nagios on the ctb.
  .. Overhaul of the virtualisation infrastructure portal; vNode 2 in beta.

Grid User Support:

Current user support priorities are listed at the site-experiment session of
the CCRC'08 F2F and the GDB Meetings on 1st and 2nd April 2008.

To improve communication between Grid sites, a survey is now taking place:
https://gus.fzk.de/pages/questionnaire_rocs-sites.php
Results, as they come in, are listed here:
https://gus.fzk.de/pages/metrics/result-rocs-sites-survey.php

The User Support Advisory Group (USAG) is the forum for discussing
requirements, so that GGUS evolves in a way that is useful for the VOs.
The next meeting is on 10/4 at 11am. Draft agenda:
http://indico.cern.ch/conferenceDisplay.py?confId=30349

SAM
---

Production Service:
* quattor profiles for UI upgrade tested
  (upgrading production UI Tuesday 8th April)

Client/sensors development:
* working on bugs: #32575, #34978, #34730

Previous (full) reports can be consulted at:
https://twiki.cern.ch/twiki/bin/view/LCG/GDC5Reports

---Zdenek