LCG deployment
==============

- Total number of Sites (1): 261
- Status -> Num. Sites (1):
    ok -> 207
    degraded -> 13
    down -> 41
- Software -> Num. Sites (2):
    gLite-3_1_0 -> 224
    gLite-3_0_2 -> 22
    gLite-3_0_0 -> 1
- Average of concurrently running jobs during this week (3): 32.6k

EGEE CERN Regional Operations Centre (ROC):
===========================================

* Business as usual with tickets

EGEE Pre-Production Service Coordination:
=========================================

2008-09-05: News about deployment of the CREAM CE (PATCH:1755).

The deployment was delayed because instances of unsupported WMS versions are still in production; these can accidentally match the CREAM CE and cause submission failures. The number of WMS instances potentially affected by this issue has been estimated to be of the order of 4/5, based on an analysis of the published information. The list of sites that are publishing an old version of the WMS running on SL3 has been forwarded to the ROCs for verification and correction. In any case, the extent of the issue seems to be very limited, and appropriate counter-measures are being taken.

In addition, the proxy renewal mechanism is not working properly because it requires ports to be opened on the WNs.

Until these issues are fixed by an upcoming patch, CREAM will be released with a GlueCEServiceState different from 'production', and the state will be changed again later on.
The release with the workaround that is currently in certification can be expected in about 2 weeks.

CERN GRID Pre-Production Site (CERN_PPS):
=========================================

Nothing to report

Testing Report
==============

* Certification

  Patches certified:
  * gLite 3.1 / SL4
    patch #1672: Service release version information provider for glite-PX
    patch #2187: removal of obsoleted DM packages : glite_WN_ia32
    patch #2188: removal of obsoleted DM packages : glite_WN_x86_64
    patch #2189: removal of obsoleted DM packages : glite-UI
    patch #2190: removal of obsoleted DM packages : glite-VOBOX

  Patches rejected:
    patch #2079: condor update for glite-CONDOR_utils

* Other work

  Fixes to gLite rpm lists after problems reported by FIO

Operational Security
====================

The security incident affecting HEP sites (see report from 07 Aug) is still being investigated by the EGEE OSCT and the relevant academic CSIRTs.

SAM
===

Production Releases:

On 2008-09-04, lcg-sam-client-1.2.4-1 and lcg-sam-client-sensors-1.4.5-3 were released to Production. Apart from bug fixes and improvements to Data Management tests, the release introduced the SRMv2 sensor. The whole set of SRMv2 tests was made critical on SAM Production, but alarms for CODs are currently suppressed in order to give site managers a chance to correct any minor configuration issues.

Details: https://twiki.cern.ch/twiki/bin/view/LCG/SamReleaseActivity

Unavailabilities:

  From: 09-09-2008 (Tue) 02:25  To: 09-09-2008 (Tue) 03:25
  and
  From: 05-09-2008 (Fri) 10:30  To: 05-09-2008 (Fri) 15:00
  Problem: sam-bdii unavailable
  Impact: Important
  Details: automatic restarts of sam-bdii daemon were not functioning correctly.
  Tests affected: CE, SE, SRM
  Solution: Manual restart of the service.

ETICS
=====

Nothing to report this week.

Statistics:
  Projects registered: 31 (-)
  Packages available in the repository: 65338 (+1%)
  Build/test reports total: 2964 (+8.5%)
  Build/tests executed this week: 625