GD Group Report for C5-25-Apr-2008
==================================

LCG deployment:
---------------
- Total number of Sites (1): 283
- Status -> Num. Sites (1):
    ok -> 170
    degraded -> 22
    down -> 90
    na -> 1
- Software -> Num. Sites (2):
    gLite-3_1_0 -> 188
    gLite-3_0_2 -> 53
    gLite-3_0_0 -> 1
    LCG-2_7_0 -> 2
- Average of concurrently running jobs during this week (3): ~25k

(1) Sites that are Certified, in Production and that have been
    monitored by SAM during the last week under OPS credentials.
    SAM is available at:
    https://lcg-sam.cern.ch:8443/sam/sam.py
    To see this page one needs a grid certificate loaded in the browser.
    The calculation of the Site availability (Status) is described at:
    https://cern.ch/twiki/pub/LCG/GridView/Gridview_Service_Availability_Computation.pdf
(2) Software version comes from the 'CE-sft-softver' CE test.
    Sites not supporting the SAM 'CE' service, or not having sent results
    for this particular test during the last week, are not counted.
(3) Job statistics taken from GStat:
    http://goc.grid.sinica.edu.tw/gstat/
    http://goc.grid.sinica.edu.tw/gstat/total/GIISQuery_Usage_job_.html

SAM Production Services:
------------------------
* SAM UI upgraded to SLC4, gLite 3.1

* Unavailabilities:
  -----------------
  - From: 22-04-2008 (Tue) 07:45 UTC
    To: 23-04-2008 (Wed) 13:30 UTC
    Severity: Minor
    Affected services: all
    Symptoms: problems/fixes propagated to SAM possibly 1 hour later
              than normal (tests ran only every odd hour)
    Reason: upgrade of SAM UI (SLC4, gLite 3.1)
    Solution: sorting out problems that arose during the installation + testing

  - From: 23-04-2008 (Wed) 16:15 UTC
    To: 23-04-2008 (Wed) 21:15 UTC
    Symptoms: presence of OSG sites alternating
    Reason: misconfiguration of the top-BDII config generator
    Solution: configuration fixed

CERN PPS site:
--------------
* Nothing to report

EGEE CERN Regional Operations Centre (ROC):
-------------------------------------------
* Nothing to report

EGEE Pre-Production Service Coordination:
-----------------------------------------
* After pre-deployment testing, PPS sites are now upgrading to
  ** gLite 3.1.0 PPS Update 24 **
* gLite 3.1.0 Update 20 was released to production with HIGH priority.
  The update contains urgent patches for CCRC08.
  Two issues were found in the release:
  - broken submission gL3.0CE --> gL3.1WN (problem at dgass cache)
  - configuration issue in YAIM
  Neither of these issues could have been detected by PPS in its
  current configuration.

gLite 3.x Integration & Build:
------------------------------
- Certification repository

  gLite 3.0
  ---------------------------------------
  .. Presently
     .. 0 in preparation
     .. 0 in configuration
     .. 0 in certification

  gLite 3.1
  ---------------------------------------
  .. Presently
     .. 3 in preparation
     .. 0 in configuration
     .. 9 in certification

- PPS repository

  gLite 3.0
  ---------------------------------------
  .. No new release (latest release 3.0.2 PPS Update 48)
  .. Next set of patches scheduled for release to PPS: None

  gLite 3.1
  ---------------------------------------
  .. 3.1 PPS Update 24
     #1758 R3.1/i386/SLC4: GFAL & lcg_util update
     #1719 R3.1/SLC4/x86_64: DPM/LFC v1.6.10
  .. 3.1 PPS Update 25
     #1278 Service Information Provider
     #1683 Dcache 1.8.0.12.p6 (First dcache 1.8 release)
     #1713 VOMS Core + logging Fix v2
     #1723 Rebuild MPI_utils mpich RPM with Fortran wrappers
     #1729 APEL working with external log4j and BC
     #1759 R3.1/x86_64/SLC4: GFAL & lcg_util update
     #1788 Trustmanager fix for install script
  .. Next set of patches scheduled for release to PPS:
     #1800 New vdt_globus_jobmanager_common to fix globus-cass-cache..

- Production repository

  gLite 3.0
  ---------------------------------------
  .. 3.0.2 Update 42
     #1740 R3.0/SLC3/i386: FTA update
     #1769 R3.0 lcg-vomscerts-4.9.0 adds next cert for lcg-voms
     #1770 R3.0 WMS lcg-vomscerts-4.9.0 adds next cert for lcg-voms
  .. Next set of patches scheduled for release to Production: None

  gLite 3.1
  ---------------------------------------
  .. 3.1 Update 20
     #1680 R3.1/SLC4/x86_64: GFAL 1.10.8 gLite 3.1
     #1645 R3.1/SLC4/x86_64: GFAL/lcg_util update
     #1738 R3.1/SLC4/i386: GFAL & lcg-util update
     #1605 R3.1/SLC4/i386: DPM/LFC v1.6.10
     #1752 Patch to improve the performance of lcg CE
  .. Next set of patches scheduled for release to production:
     #1800 New vdt_globus_jobmanager_common to fix globus-cass-cache..

gLite 3.x Testing & Certification:
----------------------------------
* Certification
  - Patches certified:
      #1278: Service Information Provider
      #1723: Rebuild MPI_utils mpich RPM with Fortran wrappers
      #1759: R3.1/x86_64/SLC4: GFAL & lcg_util update
      #1788: Trustmanager fix for install script
  - 64-bit WN tarball provided to PPS
  - CREAM certification underway
* Configuration
  - Preliminary investigation of moving the VOMS server to YAIM
    configuration.
* Testing
  - Some new tests written to verify MPI support

Grid Operational Security:
--------------------------
Nothing to report this week.

ETICS:
------
Nothing reported.

The full GD report can be consulted on:
---------------------------------------
https://twiki.cern.ch/twiki/bin/view/LCG/GDC5Reports

---Zdenek