GD Group Report for C5-07-Mar-2008 ================================== LCG deployment: --------------- - Total number of Sites (*): 249 - Software -> Num. Sites (*): gLite-3_1_0 -> 145 gLite-3_0_2 -> 84 gLite-3_0_0 -> 2 LCG-2_7_0 -> 5 unknown -> 13 - Status -> Num. Sites (*): ok -> 174 degraded -> 20 down -> 55 - Average of concurrently running jobs during this week (+): ~20k (*) Sites that are Certified _and_ Production _and_ Monitored by SAM: https://lcg-sam.cern.ch:8443/sam/sam.py To see this page one needs a grid certificate loaded in the browser. The calculation of the Site availability (Status) is described at: http://goc.grid.sinica.edu.tw/gocwiki/SAM_Metrics_calculation Software version is coming from the 'CE-sft-softver' CE test. Sites not supporting SAM 'CE' service, or not having sent results for this particular test during the last week, are counted as 'unknown'. (+) Job statistics taken from GStat: http://goc.grid.sinica.edu.tw/gstat/ http://goc.grid.sinica.edu.tw/gstat/total/GIISQuery_Usage_cpu_.html For the time being we do not report CPU numbers: 1. Not all the reported CPUs are actually available for grid jobs. 2. Sites with multiple CEs may have their CPUs double-counted. 3. GStat includes sites that are not considered by the SFTs. WLCG Transfer Service: ---------------------- * Transfer ranging from 150 to 1700 MB/s, averaging around 950 MB/s per day. * Involving all major T1 sites. * Mostly traffic from CMS and less from Alice * 1 open ticket in total * Throughput plots: http://gridview.cern.ch/GRIDVIEW/ EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.1.0 PPS Update20 was released to PPS. and after the pre-deployment test is now being installed by PPS sites. This Update introduces the MONBOX on the 3.1 baseline (for SLC4) * gLite 3.0 Update40 was released to production. The update contains: - host certificate of voms server used by egeode and biomed VOs (for SL3) - Fix of missing dependency on lcg-schema for glite-WMS metapackage * Release of gLite 3.1 Update16 to production in preparation. The update will contain: - A new index to speed the BDII up - UI: Bug fixes to JDl API (bulk submission) and gfal cliens - dcache SE: Glue 1.3 clean ups - DPM SE: version 1.6.7 (32-bit and 64-bit) fixing various configuration bugs; introducing new front-ends for Xroot and HTTP/HTTPS; upgrading the version of gSOAP from 2.6.2 -> 2.7.6b - lcgCE: bug fixes * gLite 3.0.2 PPS Update 46 was released to PPS and after the pre-deployment test is now being installed by PPS sites. The update contains: - update to FTS transfer-url-copy for space tokens - Fix of missing dependency on lcg-schema for glite-WMS metapackage CERN GRID Pre-Production Site (CERN_PPS): ----------------------------------------- * Upgrade of the site to glite 3.1 PPS-update19 * Upgraded to gLite 3.0 PPS Update 46 - WMS, FTS, MON box SAM: * Production Services - Unavailabilities: .. From 04-03-2008 (Tue) 07:45h to 04-03-2008 (Tue) 12:34h Scheduled Intervention in LCGR database (Deploy Oracle Critical Patch and creation of new TESTDATA partitions) .. March 15:00 -- updated (SLC4, gLite3.1) SAM UI back to production * Development - new pre-release version of SAM DB component lcg-sam-server-db-1.1.2 has been installed on our SAM Validation instance. It contains only a minor update to OSG service discovery script - 8 bug fixes in total for submission framework (lcg-sam-client-1.2.1-1) and sensors (lcg-sam-client-sensors-1.4.2-1) were put on validation as well. gLite 3.x Build & Integration: ------------------------------ - Certification repository gLite 3.0 --------------------------------------- .. Presently .. 0 in preparation .. 0 in configuration .. 0 in certification gLite 3.1 --------------------------------------- .. Presently .. 3 in preparation .. 0 in configuration .. 25 in certification - PPS repository gLite 3.0 --------------------------------------- .. No new release (latest release 3.0.2 PPS Update 46): None .. Next set of patches scheduled for release to PPS : #1694 R3.0 lcg-vomscerts-4.8.0 adds next cert for biomed + egeode [gLite 3.0] #1705 R3.0 lcg-vomscerts-4.8.0 adds next cert for biomed + egeode [gLite 3.0 WMS] gLite 3.1 --------------------------------------- .. 3.1 PPS Update 20 #1537 glite-MON for gLite 3.1 / SL4 .. Next set of patches scheduled for release to PPS: #1629 VOMS-Admin server 2.0.13-1 & VOMS-Admin client 2.0.6-1 #1676 new vdt_globus_essentials to fix Globus bug 5771 #1704 New version of lcg-tags - Production repository gLite 3.0 --------------------------------------- .. 3.0.2 Update 40 #1673 Fix of missing dependecy on lcg-schema for glite-WMS metapackage #1612 YAIM module for 3.0 WMS to fix the bug of limit on uid for gridftp server #1694 R3.0 lcg-vomscerts-4.8.0 adds next cert for biomed + egeode #1705 R3.0 lcg-vomscerts-4.8.0 adds next cert for biomed + egeode .. Next set of patches scheduled for release to production: None gLite 3.1 --------------------------------------- .. No new release (latest release 3.1 Update 15) .. Next set of patches scheduled for release to production: #1515 patch for bugs 28483 and 30143 (slc4) #1543 Dcache 1.7 upgrade and YAIM module update request for Glue 1.3 #1600 glite-SE_dpm_mysql SLC4/x86_64 metapackage #1601 glite-SE_dpm_disk metapackage for SLC4/x86_64 #1661 [ YAIM ] glite-yaim-lcg-ce 4.0.3 #1670 R3.1/SLC4/x86_64: DPM 1.6.7 update #1679 R3.1/SLC4/i386: GFAL 1.10.8 #1681 Updated BDII package #1669 R3.1/SLC4/i386: DPM 1.6.7 update gLite 3.x Testing & Certification: ---------------------------------- * Certification - patches certified: #1629: VOMS-Admin server 2.0.13-1 & VOMS-Admin client 2.0.6-1 #1676: new vdt_globus_essentials to fix Globus bug 5771 #1704: New version of lcg-tags * gLite 3.1 / SL4 32bit glite-WMS and glite-LB have passed acceptance tests and are being prepared for release to PPS. * Configuration - JPWG compliant yaim now in certification. - Implementation of consistent exit codes in yaim - Testing of YAIM configurator (CIC portal) Grid User Support: ------------------ A number of discussions are being held with LHC Experiment VOs (initiated by Atlas), participants of the Grid Operations' and the ROC managers' meetings to plan selectively direct routes from VO experts to Grid Sites. These on-going discussions were recorded in: http://goc.grid.sinica.edu.tw/gocwiki/Week__2008/02/20_-_2008/03/03 and were presented to CCRC'08 F2F: http://indico.cern.ch/getFile.py/access?contribId=8&resId=3&materialId=slides&confId=29170 Grid Authentication & Authorization Services: --------------------------------------------- A suggestion for planning a simplified voms service set-up at CERN was presented to the March GDB. Slides in http://indico.cern.ch/materialDisplay.py?sessionId=9&materialId=0&confId=20227 More discussions will follow involving the VOM(R)S WG and the LCG-OSG management. Grid Operational Security: -------------------------- Nothing to report this week. ETICS: ------ Nothing reported this week. The full GD report can be consulted on: --------------------------------------- https://twiki.cern.ch/twiki/bin/view/LCG/GDC5Reports ---Zdenek