LCG deployment ------------------- - Total number of Sites (1): 261 - Status -> Num. Sites (1): ok -> 201 degraded -> 15 down -> 44 - Software -> Num. Sites (2): gLite-3_1_0 -> 229 gLite-3_0_2 -> 16 gLite-3_0_0 -> 1 - Average of concurrently running jobs during this week (3): 33.6k EGEE CERN Regional Operations Centre (ROC): ------------------------------------------- * Business as usual with tickets EGEE Pre-Production Service Coordination: ----------------------------------------- 2008-10-13: The issues with the BDII observed after gLite 3.1 Update 33 were analysed by Laurence Field. * The issue reported by CERN ROC, tracked with BUG:42799, was narrowed down to a race condition which could be solved by doubling the value of the configuration variable "GIP_TIMEOUT" in YAIM. * Another issue reported in BUG:42799 is addressed by PATCH:2519 which will be released with gLite3.1 Update34, scheduled for Thursday 15th 2008-10-10: gLite 3.1 Update 33 was released to production CERN GRID Pre-Production Site (CERN_PPS): ----------------------------------------- * The BDII run on behalf of the Cern ROC was updated to glite 3.1 Update33 - An issue was found reported with BUG:42799 (see PPS report) * The FTS/FTA is set-up and it is now fully functional at CERN_PPS * The PPS SAM server was upgraded to the last release * A collaboration with the Nagios development team was started. The CERN_PPS site is now monitored by a test instance of the new Nagios for grid sites. Analysis of the alarms in progress * Some unused services were decommissioned: - local LFC: lxb2038 - RB: lxb2085 SAM ------ New releases of submission framework, sensors and database scripts to Production. Details available at https://twiki.cern.ch/twiki/bin/view/LCG/SamReleaseActivity for those interested. Other than that, SAM had nothing to report until this morning when tests started to fail because of BDII problems. This is the third time we’ve suffered from BDII problems, the cause of which remains a mystery. FIO and the BDII developers are continuing their investigations. Integration, Test & Release Report ---------------------------------------------------------- * Patches Certified patch #1579: R3.1/SLC4/noarch: Hydra service patch #2017: R3.1/SLC4/i386: Hydra client patch #2519: Updated BDI The SCAS service has entered certification * Configuration A yaim module for SCAS has been created. * Releases Patches scheduled for release to PPS : #1579 R3.1/SLC4/noarch: Hydra service #2002 First update of CREAM Client UI for slc4/i386 platform #2017 R3.1/SLC4/i386: Hydra client #2253 New JobManager, Information Dynamic plugin and yaim utils versions for SGE Patches scheduled for release to Production: #2047 R3.1/SLC4/i386: FTA SL4 bug fixes for gridftp transfers - 3.1.0 #2048 R3.1/SLC4/x86_64: FTA SL4 bug fixes for gridftp transfers #2115 R3.1/SLC4/i386: FTS SL4 bug fixes #2116 R3.1/SLC4/x86_64: FTS SL4 bug fixes #1803 First cummulative update of LB 3.1 #2092 add glite brokerinfo to glite-UI x86_ia32 #2198 Removal of info-plugin-fcr from glite-BDII #2377 Removal of obsoleted lcg-info packages glite-VOMS_oracle #2378 Removal of obsoleted lcg-info packages glite-PX #2379 Removal of obsoleted lcg-info packages glite-VOMS_mysql #2380 Patch for bug #39928, LB patch #1803 related #2519 Updated BDII Operational Security ---------------------------------- Nothing to report this week. IT-GD-OPS-US ----------------- This month's GGUS Release will be made available Thursday 23 October. Complete release notes on: https://gus.fzk.de/pages/owl.php A new revolutionary GGUS development was recently decided at the User Support Advisory Group meeting. Normal GGUS tickets will be allowed to reach the Site's helpdesk address when a GOCDB-GGUS interface, based on Web Services, will be ready. A lot of technical (interface with GOCdb, interface with local ticketing systems) and procedural issues (what if the sites are not in GOCdb or have multiple email contact addresses per service) have to be clarified in this process.