egee-pps-pilot-scas@cern.ch
for instructions in joining the current test effort. I will ask for feedback on progress at the May GDB.pps-support@cern.ch
.
Specifically ROCs and sites who recently raised the need for a speed-up in the delivery of the new BDII are definitely welcome to join.
2009-03-12: Release of gLite 3.2 Update 01 to production in preparation pps-support@cern.ch
pps-support@cern.ch
2009-01-09: Pilot service of SLC5 WN at CERN: in progressgd-release-team
used instad of Felix's and Andreas' personal e-mail. Status: Done . Antonio
PPS Coordination (C5 Report on 22-May-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.1.0 PPS Update28 was released to PPS and is currently in phase of pre-deployment testing. The update contains: - WMS: fix for minor configuration bug * Release of gLite3.0.2 Update43 to production in preparation The update, to be released today, contains * lcg-vomscerts-5.0.0 with new host certificate for the VOMS server vo.racf.bnl.gov Affected metapackages o lcg-RB o glite-SE_classic o glite-VOBOX o glite-WMS o glite-LB o glite-WMSLB The following metapackages, now supported with gLite version 3.1, are affected as well if still deployed at some sites in version 3.0: o lcg-CE o lcg-CE_torque o glite-LFC_mysql o glite-LFC_oracle o glite-SE_dpm_disk o glite-SE_dpm_mysql o glite-SE_dpm_oracle * gLite3.0.2 PPS Update49 was released to PPS. The update contains - lcg-vomscerts-5.0.0 with new host certificate for the VOMS server vo.racf.bnl.gov Normal Affected metapackages * lcg-RB * glite-SE_classic * glite-VOBOX * glite-WMS * glite-LB * glite-WMSLB The following metapackages, now supported with gLite version 3.1, are affected as well if still deployed at some sites in version 3.0 * lcg-CE * lcg-CE_torque * glite-LFC_mysql * glite-LFC_oracle * glite-SE_dpm_disk * glite-SE_dpm_mysql * glite-SE_dpm_oracle * Release of gLite3.1.0 Update24 to production in preparation The update, to be released today, contains * lcg-vomscerts-5.0.0 with new host certificate for the VOMS server vo.racf.bnl.gov Affected metapackages o lcg-CE o lcg-CE_torque o glite-LFC_mysql o glite-LFC_oracle o glite-SE_dpm_disk o glite-SE_dpm_mysql o glite-SE_dpm_oracle * Yaim core and yaim lcg-ce 4.0.4 series - Job Priorities implementation * gLite3.1 Update23 released to Production on 16-May The update,affecting the lcg-CE contains a new marshal package to fix a security issue found on the CE. It is mandatory for sites to upgrade to this version if the improvement packages have been installed on their lcg CE. Those packages were introduced with the gLite 3.1 Update 20. All details of the update can be found in: http://glite.web.cern.ch/glite/packages/R3.1/updates.asp * Pilot of WMS3.1 at CNAF and CERN-PROD in progress wrap-up meeting with sites and experiments scheduled for today * Pilot of AMGA at CERN_PPS in progress with LHCb * ARDA requested a new pilot activity concerning the future version of the gLite LB and its interaction with the experiments' dashboard. Technical requirements are under analysis * Service desctription and implementation plan for pre-production in EGEEIII in progress. Draft available on the PPS website https://twiki.cern.ch/twiki/bin/view/LCG/PreProductionServiceDescription
--- "gLite Release" section for WLCG/EGEE OPS and SCM meetings (19-May-08)
Release News:
Now in production
gLite3.1 Update23 released to Production on 16-May
The update,affecting the lcg-CE contains a new marshal package to fix a security issue found on the CE.
It is mandatory for sites to upgrade to this version if the improvement packages have been installed on their lcg CE. Those packages were introduced with the gLite 3.1 Update 20.
All details of the update can be found in:
http://glite.web.cern.ch/glite/packages/R3.1/updates.asp
Now in pre-production
gLite3.0.2 PPS Update28 was released to PPS and is currently under pre-deployment testing
The update contains
PPS Coordination (C5 Report on 15-May-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.1.0 PPS Update27 was released to PPS and is currently in phase of pre-deployment testing. The update contains: - Torque (server, client, MPI_utils) . Many enhancements and bug fixes - Maui (unchanged with new versioning schema) - VOMS Admin (affecting VOMS, UI, VOBOX) . Updated voms-admin interface documentation. . Deprecated old ACL interface methods. . Added VOMS-Admin User's guide . Improved voms-admin client online documentation . bug fixes - bug fixes on VOMS server . Enabled log rotation on VOMS/VOMS-admin log files (bug 20607) . Enabled setting of proxy timeout via configuration (bug 17247) . Enabled usage of voms server hostname (--uri parameter) via configuration - Client tools: . New version of lcg-info to support multiple BDII endpoints in LCG_GFAL_INFOSYS - yaim core (technically affecting all services) . check of unix permission of directory cointaining YAIM configuration files now removed - New host certificate for VOMS server vo.racf.bnl.gov; affecting: . lcg-CE . lcg-CE_torque . glite-LFC_mysql . glite-LFC_oracle . glite-SE_dpm_disk . glite-SE_dpm_mysql . glite-SE_dpm_oracle * gLite3.0.2 PPS Update49 was released to PPS and is currently in phase of pre-deployment testing. The update contains - lcg-vomscerts-5.0.0 with new host certificate for the VOMS server vo.racf.bnl.gov Normal Affected metapackages * lcg-RB * glite-SE_classic * glite-VOBOX * glite-WMS * glite-LB * glite-WMSLB The following metapackages, now supported with gLite version 3.1, are affected as well if still deployed at some sites in version 3.0 * lcg-CE * lcg-CE_torque * glite-LFC_mysql * glite-LFC_oracle * glite-SE_dpm_disk * glite-SE_dpm_mysql * glite-SE_dpm_oracle * Release of gLite 3.1 Update22 went to production on Tuesday. The update contains: - lcg-CE . SGE Engine enabled on lcg-CE . fix for DENY tags to lcg-info-dynamic-scheduler - dcache . Dcache 1.8.0.12.p6 (First dcache 1.8 release) - MPI_utils . Rebuild MPI_utils mpich RPM with Fortran wrappers - gLite-PX . first version of the dynamic service publisher, replacing the previous static configuration - 64 bit WNs + recet updates to GFAL (already deployed for 32 bit) - VOMS core (affecting clients) . new VOMS core 1.8.3-4 (affecting VOMS servers and clients on UI WN VOBOX CE SE_dpm LFC WMS LB . Many bug fixes. Fully backward compatible. . fix to trustmanager install script - client tools . lcg-infosites: new option to query for the wms and the lb associated to a certain VO. The -f option to filter based on the site name is also available. . bug fixes for edg-gridftp-client * Pilot of WMS3.1 at CNAF and CERN-PROD in progress with Atlas and CMS * Pilot of AMGA at CERN_PPS in progress with LHCb * Use cases for pre-production in EGEEIII communicated to the EGEE ROCS https://twiki.cern.ch/twiki/bin/view/LCG/PreProductionUseCases Implementation plan with effort evaluation currently in progress
PPS Coordination (C5 Report on 8-May-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * Release of gLite 3.1 Update22 in preparation. The update, to be released today, will contain: - lcg-CE . SGE Engine enabled on lcg-CE . fix for DENY tags to lcg-info-dynamic-scheduler - dcache . Dcache 1.8.0.12.p6 (First dcache 1.8 release) - MPI_utils . Rebuild MPI_utils mpich RPM with Fortran wrappers - gLite-PX . first version of the dynamic service publisher, replacing the previous static configuration - 64 bit WNs + recet updates to GFAL (already deployed for 32 bit) - VOMS core (affecting clients) . new VOMS core 1.8.3-4 (affecting VOMS servers and clients on UI WN VOBOX CE SE_dpm LFC WMS LB . Many bug fixes. Fully backward compatible. . fix to trustmanager install script - client tools . lcg-infosites: new option to query for the wms and the lb associated to a certain VO. The -f option to filter based on the site name is also available. . bug fixes for edg-gridftp-client * Pilot of WMS3.1 at CNAF and CERN-PROD in progress with Atlas and CMS * Pilot of AMGA at CERN_PPS in progress with LHCb
"gLite Release" section for WLCG/EGEE OPS and SCM meetings (5-May-08)
Release News:
Now in production
No releases to production last week.
Last one: gLite3.1 Update21
Details in http://glite.web.cern.ch/glite/packages/R3.1/updates.asp
Now in pre-production
No releases to production last week.
Last one: gLite3.1.0 PPS Update26
Details in
https://twiki.cern.ch/twiki/bin/view/EGEE/PPSReleaseNotes_310_PPS_Update26
Soon in production
Release of gLite 3.1 Update22 in preparation.
The update, to be released next Wednesday, will contain:
- lcg-CE
- SGE Engine enabled on lcg-CE
- fix for DENY tags to lcg-info-dynamic-scheduler
- dcache
- Dcache 1.8.0.12.p6 (First dcache 1.8 release)
- MPI_utils
- Rebuild MPI_utils mpich RPM with Fortran wrappers
- gLite-PX
- first version of the dynamic service publisher, replacing the previous static configuration
- VOMS core (affecting clients)
- new VOMS core 1.8.3-4 (affecting VOMS servers and clients on UI WN VOBOX CE SE_dpm LFC WMS LB
- Many bug fixes. Fully backward compatible.
- fix to trustmanager install script
- client tools
- lcg-infosites: new option to query for the wms and the lb associated to a certain VO. The -f option to filter based on the site name is also available.
- bug fixes for edg-gridftp-client
---
"gLite Release" section for WLCG/EGEE OPS and SCM meetings (28-Apr-08)
Patches released to production with Update21:
* PATCH:1800: New vdt_globus_jobmanager_common to fix globus-cass-cache problem on WN
Update20 was accelerated due to requirements coming from CCRC08. The installation issue with the CE was due to a mistake in the release preparation. The issue was fixed by Update21.
The dependency of the installation function from the new version of yaim-core version was not correctly set. Of course, as the correct version of yaim-core was already deployed in pre-production (but not in production) this issue was not visible for the pre-deployment testers in PPS. This particular issue could only have been trapped by a deployment test in production (currently not foreseen by the release procedure). When a release to production is prepared, in fact the patches are selected from preproduction and bundled together in a production update. Generally this is a subset of the patches installed in pre-production: possible dependencies among patches at this stage are recognised only by the documentation. Errors in the documentation cannot be trapped. In order to strengthen the process a further checkpoint in the release procedure should be inserted, which has always be rated too expensive in terms of elapsed time.
BTW: yaim-core was being held back in PPS because it forced a change in the permissions schema for the site-info.def and containing directory to be implemented at all sites, which was not rated acceptable for the operations.
The issue found later on in production affecting the submission from CE3.0 to WN3.1 has another explanation. CE at version 3.1 has been in production for more than two months, which means that regression tests are not being done in certification. Pre-production runs, by mandate, the top version of the services
Release News:
Now in production
gLite 3.1.0 Update20 and 21were released to production with HIGH priority.
Update 21 was an urgent fix for a compatibility issue affecting lcg-CEs still running at version 3.0 introduced by Update 20
The main changes introduced by Update20 (relevant for CCRC08) are:
- UI/WN/VOBOX
- new feature: glite-data-gfal version (1.10.11-1)
provides new functions gfal_abortrequest and gfal_abortfilesseveral,
- new feature: glite-data-dm-util (lcg_util) version (1.6.11-1) now
prints the SE type (SRMv1, SRMv2, Classic SE) in verbose mode (when relevant)
- bug fix: lcg-ls does not work for the classic SE
- bug fix: lcg-cr glibc memory corruption
- bug fix: gfal_stat seg. fault with dummy LFN
- bug fix: lcg-sd doesn't doesn't work with SRMv2 request token
- bug fix: lcg-gt segmentation fault
- fix globus-cass-cache problem on WN
- DPM/LFC v1.6.10
- fix problem of replication of a zero-length file improve logging of updatefilestatus method
- DICOM back-end service for DPM
- producing re-buildable source RPMs
- group writable directories when SRM started with umask 0
- DPM-DSI: DPM's gridftp does not allow for ':' in SURL (GGUS ticket #32335)
- support for CKSM (md5 only yet)
- lcg-CE
- Changes in Globus jobmanager and GASS cache. These modifications
improve the performance of the lcg-CE by a factor of two to three
Details in http://glite.web.cern.ch/glite/packages/R3.1/updates.asp
Now in pre-production
PPS site are now upgrading to gLite 3.1.0 PPS Updates 25 and 26:
- gLite-PX
- dynamic service publisher, replacing the previous static configuration
- dcache 1.8
- Major dcache version change, adds support for SRM 2.2.
- VOMS
- new VOMS core 1.8.3-4 (affecting VOMS servers and clients on UI WN VOBOX CE SE_dpm LFC WMS LB
- Many bug fixes. Fully backward compatible.
- fix to trustmanager install script
- MPI_Utils
- wrapper scripts to compile Fortran MPI programs.
- APEL (CE and MON and BATCH_utils)
- APEL working with external log4j and BC
- GFAL APEL (CE and MON and BATCH_utils)
- APEL working with external log4j and BC
- UI/WN/VOBOX
- dcache client 1.8
- glite-data-gfal v1.10.11-1 (bug fixes)
- glite-data-dm-util v1.6.11-1 (bug fixes)
- fix globus-cass-cache problem on WN
Details in
- https://twiki.cern.ch/twiki/bin/view/EGEE/PPSReleaseNotes_310_PPS_Update25
- https://twiki.cern.ch/twiki/bin/view/EGEE/PPSReleaseNotes_310_PPS_Update26
Soon in production
No release to production scheduled for this week
PPS Coordination (C5 Report on 24-Apr-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * After pre-deployment testing PPS sites are now upgrading to ** gLite3.1.0 PPS Update24 ** * gLite 3.1.0 Update20 was released to production with HIGH priority. The update contains urgent patches for CCRC08: Two issues were found in the release: - broken submission gL3.0CE --> gL3.1WN (problem at dgass cache) - configuration issue in YAIM None of these two issues could have been detected by PPS in the current configuration
PPS Coordination (C5 Report on 17-Apr-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * After pre-deployment testing PPS sites are now upgrading to ** gLite3.1.0 PPS Update23 ** The update contains - WMS LB (SL4): first release to PPS - UI/WN/VOBOX . edg-gridftp-client-1.2.8 fixes bugs 33205, 27274 . DPM/LFC v1.6.10 - DPM/LFC . DICOM back-end service for DPM . re-buildable source RPMs . support for MacOSX . group writable directories when SRM started with umask 0 . bug fixes - CE . patch to Globus job manager to improve performances Details in: https://twiki.cern.ch/twiki/bin/view/EGEE/PPSReleaseNotes_310_PPS_Update23 * gLite 3.1.0 Update19 was released to production with HIGH priority. The update contains: - UI/WN/VOBOX . many bug fixes, including the on epreventing to use aliases for WMS . new lcg-ManageVOTAg version - MON . R-GMA fix for forwards compatibility - 3.1.0 PPS Update 22 - Many services . lcg-vomscerts-4.9.0 adds next cert for lcg-voms * gLite 3.0.2 PPS Updates 47 and 48 were released to PPS, went through the pre-deployment and were released to the sites The updates contain: - FTS . new version of FTA changing the gridFTP session handling (CCRC) - Many services . lcg-vomscerts-4.9.0 adds next cert for lcg-voms * A meeting was held with Atlas and CMS to kick-off the WMS 3.1 service at CERN-PROD Details and timelines available at https://twiki.cern.ch/twiki/bin/view/LCG/PPIslandKickOff2008x04x15 * Release of gLite3.1 Update 20 to production in preparation The update will contain:: - UI/WN/VOBOX . gfal 1.10.8 with many bug fixes . DPM/LFC clients compatible with v1.6.10 - DPM/LFC v1.6.10 . DICOM back-end service for DPM . re-buildable source RPMs . support for MacOSX . group writable directories when SRM started with umask 0 . bug fixes * Release of gLite3.0 Update 42 to production in preparation The update will contain:: - FTS . new version of FTA changing the gridFTP session handling (CCRC) - Many services . lcg-vomscerts-4.9.0 adds next cert for lcg-voms
PPS Coordination (C5 Report on 10-Apr-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * A meeting with LHCb was held to kick-off the deployment of a preproduction AMGA service for the VO. The pilot will be run by CERN_PPS and used by LHCb as a development instance for the BookKeping service. * gLite3.1.0 PPS Update23 was released to PPS and it is currently in phase of pre-deployment testing. The release is currently frozen waiting for one urgent update from the certification team. It contains - WMS LB (SL4): first release to PPS - UI/WN/VOBOX . edg-gridftp-client-1.2.8 fixes bugs 33205, 27274 . DPM/LFC v1.6.10 - DPM/LFC . DICOM back-end service for DPM . re-buildable source RPMs . support for MacOSX . group writable directories when SRM started with umask 0 . bug fixes * gLite3.0.2 PPS Update47 was released to PPS and it is currently in phase of pre-deployment testing. - FTS: . FTA Update: change the gridFTP session handling * gLite3.1.0 PPS Update22 passed the pre-deployment tests and it is now being installed by the PPS sites The release contains, among others, an update of yaim-core, so, technically, all services are concerned. The full list of patch deployed is: - glite-AMGA_oracle (initial release) - UI/WN/VOBOX . GFAL/lcg_util: many bug fixes . new lcg-ManageVOTAg version (solving bug 34245) . lcg-infosites: new option to query the wms and lb associated to a VO. -f option to filter based on the site name . [ YAIM ] glite-yaim-clients: bug fixes + conifugurable list of WMS and LB - R-GMA . Switch back to using MEMORY instead of DATABASE producer - YAIM (affecting all nodes) . new yaim-core with a consistent list of changes and bug fixes - CE . change to lcg-info-dynamic-scheduler to support DENY tags * Release of gLite 3.1 Update18 sent to production on Monday. The update, to be released soon, will contain: - NEW: glite-MON for SL4 - DPM 1.6.7-4 . fix for bug #33769: incorrect pool free space after dpm-drain . improved ACL management for srmMkdir command - UI/WN/VOBOX . lcg-tags non longer produces Globus warnings suppressed . voms-admin client 2.0.6-1 providing ACL support on command line - vdt_globus_essentials (affecting several services and notably the CE) . bug fix to prevent globus-job-manager processes to pile-up on a CE (big observed at CERN after SAM WMS?RB tests were enabled ) - voms-admin server (VOMS) . Refactored voms-admin-ping script . ACL management web service (compatible with client >= 2.0.6-1) . Registration web service. . many bug fixes
PPS Coordination (C5 Report on 20-Mar-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * Release of gLite 3.1 Update18 to production in preparation. The update, to be released soon, will contain: - NEW: glite-MON for SL4 - DPM 1.6.7-4 . fix for bug #33769: incorrect pool free space after dpm-drain . improved ACL management for srmMkdir command - UI/WN/VOBOX . lcg-tags non longer produces Globus warnings suppressed . voms-admin client 2.0.6-1 providing ACL support on command line - vdt_globus_essentials (affecting several services and notably the CE) . bug fix to prevent globus-job-manager processes to pile-up on a CE (big observed at CERN after SAM WMS?RB tests were enabled ) - voms-admin server (VOMS) . Refactored voms-admin-ping script . ACL management web service (compatible with client >= 2.0.6-1) . Registration web service. . many bug fixes
PPS Coordination (C5 Report on 20-Mar-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite 3.1 Update17 was released to production on Monday. The update contains the new service glite-LSF_utils * gLite3.1.0 PPS Update21 was released to PPS. and after the pre-deployment test is now being installed by PPS sites. This Update conatins; - VOMS-Admin server 2.0.13-1 & VOMS-Admin client 2.0.6-1 with a number of bug fixes - vdt_globus_essentials to fix Globus bug 5771 - New version of lcg-tags - DPM 1.6.7-4 - 1708 new glite-AMGA_oracle * Release of gLite 3.0 Update41 to production in preparation. The update, to be released today, will contain: - FTS: transfer-url-copy update for space tokens
PPS Coordination (C5 Report on 13-Mar-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite 3.1 Update16 was released to production on Monday. The update contains: - A new index on the attribute GlueServiceEndpoint, used by lcg-utils - UI: Bug fixes to JDl API (bulk submission) and gfal cliens - dcache SE: Glue 1.3 clean ups - DPM SE: version 1.6.7 (32-bit and 64-bit) fixing various configuration bugs; introducing new front-ends for Xroot and HTTP/HTTPS; upgrading the version of gSOAP from 2.6.2 -> 2.7.6b - GFAL version 1.10.8-1: creation of subdirectories with lcg-utils - lcgCE: bug fixes
PPS Coordination (C5 Report on 06-Mar-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.1.0 PPS Update20 was released to PPS. and after the pre-deployment test is now being installed by PPS sites. This Update introduces the MONBOX on the 3.1 baseline (for SLC4) * gLite 3.0 Update40 was released to production. The update contains: - host certificate of voms server used by egeode and biomed VOs (for SL3) - Fix of missing dependency on lcg-schema for glite-WMS metapackage * Release of gLite 3.1 Update16 to production in preparation. The update will contain: - A new index to speed the BDII up - UI: Bug fixes to JDl API (bulk submission) and gfal cliens - dcache SE: Glue 1.3 clean ups - DPM SE: version 1.6.7 (32-bit and 64-bit) fixing various configuration bugs; introducing new front-ends for Xroot and HTTP/HTTPS; upgrading the version of gSOAP from 2.6.2 -> 2.7.6b - lcgCE: bug fixes * gLite 3.0.2 PPS Update 46 was released to PPS and after the pre-deployment test is now being installed by PPS sites. The update contains: - update to FTS transfer-url-copy for space tokens - Fix of missing dependency on lcg-schema for glite-WMS metapackage
PPS Coordination (C5 Report on 28-Feb-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * An inventory of PPS services and activities was started. A spreadsheet was sent to the sites for feedback and comments (www.cern.ch/pps/index.php?dir=./site/) Ir will be used as a starting point for the re-distribution of tasks in PPS * A discussion was started with the experiment support to agree on a profitable deployment model of the 64-bit WNs in pre-production * gLite 3.1 (SL4) A new update, gLite3.1.0 PPS Update20 is in preparation. This Update will introduce the MONBOX on the 3.1 baseline (for SLC4) * gLlite 3.1.0 PPS Update 19 was released to PPS and it is now being moved forward to PPS sites after pre-deployment testing: The update contains: - WN 3.1 for sl4 64bits - glite-LSF_utils - lcg-vomscerts-4.8.0 adds next cert for biomed + egeode - new version of lcg-ManageVOTag fixing bug #31848 The 64-bit WN distribution *has not been tested* so far because no PPS sites had resources available for the installation. * gLite 3.1 Update15 was released to production on Wednesday. The update contains: - host certificate of voms server used by egeode and biomed VOs * gLite 3.1 Update14 was released to production on Friday. The update contains: - YAIM module to configure LCG CE and gLite WN for MPI support according to the guidelines from the EGEE TCG working group on MPI - Additional MAUI package (better support for the split of CE from Torque server) - Improved globus-gridftp startup script - lcg_util v1.6.8 - Improvements to glite-info-provider-ldap - glite-yaim-core 4.0.3-13 for gLite 3.1 As there is an update of YAIM core *all* metapackages are reported as affected by this update. Actually the yaim-core was changed sue to an incompatibility found in PPS with the released version of the glite-info-provider-ldap, so the list of impacted services can be restricted to those concerned by the new version of glite-info-provider
PPS Coordination (C5 Report on 21-Feb-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * Glite 3.1 (SL4) After the pre-deployment test, the PPS sites are upgrading to * gLite3.1.0-PPS-UPDATE18 * The update contains: - 64bit versions of SE_dpm_mysql/_disk - dcache now installs with yum install (not groupinstall) - bdii v. 3.9.1-5 - yaim-core update * Release of gLite 3.1 Update14 to production in preparation. The update, to be released very soon, will contain: - YAIM module to configure LCG CE and gLite WN for MPI support according to the guidelines from the EGEE TCG working group on MPI - Additional MAUI package (better support for the split of CE from Torque server) - Improved globus-gridftp startup script - lcg_util v1.6.8 - Improvements to glite-info-provider-ldap - glite-yaim-core 4.0.3-13 for gLite 3.1 As there is an update of YAIM core *all* metapackages are reported as affected by this update. Actually the yaim-core was changed sue to an incompatibility found in PPS with the released version of the glite-info-provider-ldap, so the list of impacted services can be restricted to those concerned by the new version of glite-info-provider * Release of gLite 3.0 Update40 to production in preparation. The update, to be released very soon, will contain: - YAIM module for 3.0 WMS to fix the bug of limit on uid for gridftp server
PPS Coordination (C5 Report on 14-Feb-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * Glite 3.0 (SL3) After the pre-deployment test, the PPS sites are upgrading to * 3.0.2 PPS Update45 * - YAIM module for 3.0 WMS to fix the bug of limit on uid for gridftp server * Glite 3.1 (SL4) After the pre-deployment test, the PPS sites are upgrading to * gLite3.1.0-PPS-UPDATE17 * The releases contain - glite-MPI_utils metapackage for gLite 3.1 - Improved globus-gridftp startup script - various improvements for glite-info-provider-ldap - lcg_util v1.6.8 (SLC4) * gLite 3.1.0 PPS Update18 was released to pre-production yesterday (Wednesday): It is currently in phase of pre-deployment testing: The update contains: - 64bit versions of SE_dpm_mysql/_disk - dcache now installs with yum install (not groupinstall) - bdii v. 3.9.1-5 - yaim-core update As there is an update of YAIM core *all* metapackages are reported as affected by this update * release of gLite 3.1 Update13 went to production on Monday. The update, to be released very soon, will contain: - A Major upgrade to dcache (patch#1395) - An updtae from VDT to fix a gridftp issue - voms-admin client for UI and VOBOX - dcacheVoms2Gplasma required for proxies created with grid-proxy-init
PPS Coordination (C5 Report on 07-Feb-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite 3.0.2 PPS Update45 was released to pre-production on Tuesday: It is currently in phase of pre-deployment testing: The update contains: - YAIM module for 3.0 WMS to fix the bug of limit on uid for gridftp server * gLite 3.1.0 PPS Update17 was released to pre-production last Thursday: It is currently in phase of pre-deployment testing: The update contains: - glite-MPI_utils metapackage for gLite 3.1 - Improved globus-gridftp startup script - various improvements for glite-info-provider-ldap - lcg_util v1.6.8 (SLC4) * release of gLite 3.1 Update13 to production in preparation. The update, to be released very soon, will contain: - A Major upgrade to dcache (patch#1395) - An updtae from VDT to fix a gridftp issue - voms-admin client for UI and VOBOX - dcacheVoms2Gplasma required for proxies created with grid-proxy-init * A master thesis on "Financial Derivatives Market for Grid Computing" Authors: Aubert, David Solli, Arnstein Seljeflot Lindset, Snorre Huuse, Henning was completed running the simulation jobs on the EGEE pre-production grid http://cdsweb.cern.ch/record/1080367 The VO created ad hoc for that will be now decommissioned to provide a new operational use case
PPS Coordination (C5 Report on 31-Jan-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * After the pre-deployment test, the PPS sites are upgrading to * gLite3.1.0-PPS-UPDATE15 and gLite3.1.0-PPS-UPDATE16 * The releases contain - fix in GFAL/lcg_util for lcg-cp failing on classic SEs - dcacheVoms2Gplasma required to authenticate grid-proxy-init * gLite 3.1 Update 12 was released in production on Monday: The update contains: o new version of the lcg_util package (1.6.7-1), which fixes a bug in the lcg-cp command (-n option) o fix for site-BDII not publishing its own information * After the pre-deployment test gLite3.0.2-PPS-UPDATE44 was released the 24th to PPS sites and is currently in phase of pre-deployment testing The update contains - fix for FTS (patch for volatile files) - patch to APEL publisher to fix typo in configuration template
PPS Coordination (C5 Report on 24-Jan-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * After the pre-deployment test, the PPS sites are upgrading to * gLite3.1.0-PPS-UPDATE14 * This release cointain a new version of the edg-mkgridmap compatible with OpenSSL 0.9.7 It affects VOBOX, LFC, DPM, CE, VOMS * gLite3.1.0-PPS-UPDATE15 was released yesterday to PPS and is currently in the phase of pre-deployment testing The update contains - fix in GFAL/lcg_util for lcg-cp failing on classic SEs - dcacheVoms2Gplasma required to authenticate grid-proxy-init * gLite3.0.2-PPS-UPDATE44 was released on Tuesday 22nd to PPS and is currently in phase of pre-deployment testing The update contains - fix for FTS (patch for volatile files) - patch to APEL publisher to fix typo in configuration template * release of gLite 3.1 Update11 to production in preparation. The update, to be released today, contains, among other: * glite-LFC 1.6.8-1 for SL4 (oracle and Mysql) * fix in GFAL/lcg_util for lcg-cp failing on classic SEs * fix in yaim-bdii to correctly conifgure FCR in top-level BDIIs * release of gLite3.0 Update39 to production in preparation. The update will contain - fix for FTS (patch for volatile files) - patch to APEL publisher to fix typo in configuration template * gLite 3.1 Update 10 was released in production on Monday: The update contains: o glite-PX for glite 3.1 o gLite-AMGA_postgres for gLite 3.1 o VOBOX o edg-mkgridmap-3.0.0 compatible with OpenSSL 0.9.7 o ~ 20 patches with bug fixes This release unfortunately introduced a bug in production affecting the lcg-cp functionality if used against classic SEs The bug was broadcast and a patch will be made available today with Update11 * gLite3.1.0-PPS-UPDATE13, was released and deployed in PPS on Friday 18th The upgrade includes, among others: - Major upgrade to dcache - VDT/globus update to fix a gridftp issue - DPM 1.6.7-2 - lcg-utils 1.6.5 * A communication flaw in the release process was identified and fixed. Due to this flaw bugs opened in PPS during the preparation of the release to production could have gone unnoticed. A case similar has happened in with gLite3.1 Update 10
PPS Coordination (C5 Report on 17-Jan-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * due to a BDII failure, SAM tests for PPS were not available during the whole day of Monday * gLite3.1.0-PPS-UPDATE13, currently in phase of pre-deployment test has been enriched with new DPM/GFAL patches recently certified. It now contains, among others: - Major upgrade to dcache - VDT/globus update to fix a gridftp issue - DPM 1.6.7-2 - lcg-utils 1.6.5 * gLite3.0 Update38 was released to production The update concerns: - WMS: several bug fixes. - YAIM: - new YAIM configuration modules for 3.0 covering many services - enhanced command-line options - man yaim now available - an impressive list of new features and bug fixes - MON box - VOBOX - New tool added to renew proxies with VOMS extension via the myproxy server. - FTS / FTA / FTM: - new utility to maintain your /opt/glite/etc/services.xml file. - VOMS API/client: Update to voms 1.7.24 and gSOAP 2.7. - DPM - New DPM Python interface - Upgrade from gSOAP 2.6.2 to 2.7.6b * Release of gLite 3.1 Update 10 in production in preparation: The update (to be released today) will contain: o glite-PX for glite 3.1 o gLite-AMGA_postgres for gLite 3.1 o VOBOX o edg-mkgridmap-3.0.0 compatible with OpenSSL 0.9.7 o ~ 20 patches with bug fixes
PPS Coordination (C5 Report on 10-Jan-08) EGEE Pre-Production Service Coordination: ----------------------------------------- * After the pre-deployment test, the PPS sites are upgrading to * gLite3.1.0-PPS-UPDATE12 * This release cointain a new version of the edg-mkgridmap compatible with OpenSSL 0.9.7 It affects VOBOX, LFC, DPM, CE, VOMS * gLite3.0 Update38 to production in preparation:
#EuQr07 Quarterly Report 07 PPS Quarterly Report for EU (Oct - Dec 2007) During Q7 the Pre-Production Service continued the operations in the traditional way, assuring a regular flow of updates to be smoothly released from certification to production. Specifically 9 new releases were handled, moving to production nearly 90 SW updates including patches and new functionalities. More than 30 software problem reports, mostly dealing with configuration issues, were opened during the deployment activities. Further details are available at: http://www.cern.ch/pps/index.php?dir=./release Out of a review of the number of CPUs published in PPS, it turned out that most of the current computing power in PPS (>2000 CPU) is currently provided by CERN-PPS(~1500), PIC-PPS (~300) and PPS-SiGNET (~200). Smaller sites in PPS, at the same time, are actively contributing to the release operations, significantly reducing the load on the central coordination. Further details are available at: - http://lxb2003.cern.ch/gm/gridmap.html?topo=pps&layout=tc&vo=OPS&serv=Site&period=latest - http://www.cern.ch/pps/index.php?dir=./panel/SVC/ Part of the resources in the PPS grid were dedicated to provide service for the EGEE/OSG permanent interoperability testing platform. This activity involves currently 2 sites in PPS and the number is meant to grow in Q8. In the context of interoperations, a permanent communication channel from EGEE to OSG was created to make sure that key notifications about gLite releases with a potential impact on OSG are correctly forwarded. Further details are available at: https://www.cern.ch/grid-interop/egee-osg/ Behind the scenes, initiatives were taken in the aim of re-structuring the service. In particular, within the pre-production of the AMGA service, recently integrated in the gLite stack, LHCb and biomed VOs were contacted with a proposal to collaborate in the definition of a new service model, better focused on the needs of the VO. The same kind of discussion was initiated with Diligent in order to prototype the shift towards the new pre-production model of the whole workflow management service. PPS - Plan for Q8 The activity during Q8 will presumably be heavily focused on the re-structuring of the service, in progress since Q6, aimed to better tailor the service to the actual needs of the HEP users. More sites and services in PPS will be involved in the EGEE/OSG permanent interoperability testing platform, to cover service needs as identified by the platform coordinator. The pre-deployment activity will also be further developed in order to cover more and more deployment scenarios.
PPS Coordination (C5 Report on 13-Dec-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * After the pre-deployment test, the PPS sites are upgrading to * gLite3.0.2-PPS-UPDATE43 * This new version of the middleware contains several bug fixes for - WMS and LB - VOBOX - MONBOX - FTS and FTA * gLite3.1 Update09 released to production: The release contains: - glite-LFC_mysql metapackage for SLC4 - glite-SE_dpm_disk metapackage for SLC4 - glite-SE_dpm_mysql metapackage for SLC4 - glite-LFC_oracle metapackage for SLC4 * gLite3.1.0-PPS-UPDATE12 was released to PPS and is currently in the phase of pre-deployment testing The update contains a new version 3.0.0 of edg-mkgridmap compatible with OpenSSL 0.9.7 * Procedures for releases in production and PPS were modified: - updates to the production and pre-production services are now notified to OSG as well, in order to facilitate the interoperability of the two grids - a new state "pps-deployment test" is introduced in the patch life-cycle in Savannah
PPS Coordination (C5 Report on 13-Dec-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * After the pre-deployment test, the PPS sites are upgrading to * gLite3.1.0-PPS-UPDATE11 * The VOBOX service for SL4(32 bit)was introduced by this update. * Release of gLite3.1 Update09 to production in preparation: The release will contain: - glite-LFC_mysql metapackage for SLC4 - glite-SE_dpm_disk metapackage for SLC4 - glite-SE_dpm_mysql metapackage for SLC4 - glite-LFC_oracle metapackage for SLC4 We are currently preparing the release documentation, if it will be ready by a convenient time the production repository could be prepared tonight or tomorrow morning, otherwise the update will likely be postponed to the new year. * gLite3.0.2-PPS-UPDATE43 was released to PPS and is currently in the phase of pre-deployment testing This new version of the middleware contains several bug fixes for - WMS and LB - VOBOX - MONBOX - FTS and FTA
PPS Coordination (C5 Report on 06-Dec-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * After the pre-deployment test, the PPS sites are upgrading to * gLite3.1.0-PPS-UPDATE10 * A number of new services for SL4(32 bit) were introduced by this update. - glite-AMGA_postgres - glite-LFC_mysql - glite-LFC_oracle - glite-PX - glite-SE_dpm_disk - glite-SE_dpm_mysql - glite-VOMS_mysql - glite-VOMS_oracle * release of gLite3.1 Update07 to production in preparation: (To be announced today) The release contains: - glite-VOMS_mysql metapackage for gLite 3.1 and SL(C)4 - glite-VOMS_oracle metapackage for gLite 3.1 and SL(C)4 - Bug fixes for UI and WN * gLite3.1.0-PPS-UPDATE11 was released to PPS and is currently in the phase of pre-deployment testing This new version of the middleware contains several minor patches, plus a new version of GFAL/lcg_util versions and the glite-VOBOX service tfor SL4 (32 bit). Also a new version of glite-yaim-core has been released, which requests all meta-packages to be updated * Within the study for the re-organisation of PPS, a discussion was initiated with Diligent about the possibility to migrate the subset of PPS sites/services currently supporting the Diligent production into a "pilot" service in the production grid.
PPS Coordination (C5 Report on 29-Nov-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.1.0-PPS-UPDATE10 was released to PPS and is currently in phase of pre-deployment testing This update represents the introduction of a number of new services to gLite 3.1 for SL4 (32 bit). - glite-AMGA_postgres - glite-LFC_mysql - glite-LFC_oracle - glite-PX - glite-SE_dpm_disk - glite-SE_dpm_mysql - glite-VOMS_mysql - glite-VOMS_oracle The pre-deployment testing was finished for all services with these exception - LFC-Oracle (not tested) - dpm_oracle (not tested) - VOMS-oracle (not tested) - dpm_mysql (still in test) We are studying a way to move forward to PPS services already tested * release of gLite3.0 Update37 to production in preparation: (To be announced today) The release contains: - MySQL server/client update - Updated Torque and Maui - FTS Cancellation
PPS Coordination (C5 Report on 08-Nov-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.1.0-PPS-UPDATE09 was released to PPS and is currently in phase of pre-deployment testing This new version of the middleware contains the following fixes - Fix for missing python libraries after restart of the UI (#1257) - Fixed dependancies of edg-mkgridmap (#1403) - Adjusted dependancies of R-GMA on WNs (#1423) - new certificate for voms.cern.ch for 3.1 release (#1452) - Updated lcg-info-provider-software (#1470) - Updated glite-yaim-bdii to publish site entry (#1471) - R3.1 updated a1_grid_env.sh script (#1500) Patches #1470, #1471, #1452 were moved directly to production with gL3.1 Update06 * After the pre-deployment test, the PPS is now upgraded to * gLite3.0.2-PPS-UPDATE42 * this upgrade affects all services using a Mysql server and contains: - new voms certificate for the WMS repository - upgraded Mysql server * gLite3.1 Update06 was released to production: The release will contain mainly: - lcg-CE for SLC4 - BDII for SLC4 - fixes for issue in publishing site name entry found in PPS Several issues were found in PPS and reported to production as known issues. A relevant one is - Gstat error report: https://gus.fzk.de/ws/ticket_info.php?ticket=28922 * gLite3.0 Update36 was released to production: The release contains: - new host certificates of voms.cern.ch server
PPS Coordination (C5 Report on 08-Nov-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.0 Update35 was released to production, Main enhancements and bug fixes: - APEL Update to address corruption of accounting data from sites publishing multiple ClusterIDs (critical bug #25430). - GIN has been updated not to call GIP directly, but to use site ldap servers. - new version of glite-info-generic - fix to avoid older LB clients to crash if a newer LB server returns events not understandable events. In particular, this may occur when gLite 3.0 client is used to query gLite 3.1 job collections. * gLite3.1 Update06 in preparation: The release will contain mainly: - lcg-CE for SLC4 - BDII for SLC4 - fixes for issue in publishing site name entry found in PPS * After the pre-deployment test, the PPS is now upgraded to * gLite3.0.2-PPS-UPDATE41 * This update contains: - R3.0/SLC3: FTS cancellation - Updated Torque (2.1.9-4) and Maui (3.2.6p19-4)l - lcg-vomscerts-4.7.0 adds next cert for voms.cern.ch * gLite3.1.0 PPS-Update09 was released to PPS, currently in phase of pre-deployment test the release contains the following fixes - Fix for missing python libraries after restart of the UI (#1257) - Fixed dependancies of edg-mkgridmap (#1403) - Adjusted dependancies of R-GMA on WNs (#1423) - new certificate for voms.cern.ch for 3.1 release (#1452) - Updated lcg-info-provider-software (#1470) - Updated glite-yaim-bdii to publish site entry (#1471) - R3.1 updated a1_grid_env.sh script (#1500) * gLite3.0.2 PPS-Update42 in preparation: the release will affect the WMSLB and contain: - new voms certificate for the WMS repository - upgraded Mysql server on LB
PPS Coordination (C5 Report on 01-Nov-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.1.0-PPS-UPDATE41 was released to PPS. This update contains: - R3.0/SLC3: FTS cancellation - Updated Torque (2.1.9-4) and Maui (3.2.6p19-4)l - lcg-vomscerts-4.7.0 adds next cert for voms.cern.ch The release is currently under pre-deployment test * gLite3.1.0-PPS-UPDATE08 has passed the pre-deployment tests ( reports available in www.cern.ch/pps/index.php?dir=./release/testreports/gLite3.1.0/gLite3.1.0-PPS-UPDATE08/ ) and it is now being deployed at the PPS sites. The upgrade of the service is in progress This update contains: - R3.1 FTS update (glite-data_R_3_1_35_1) - JobWrapper tests - new version with no R-GMA dependencies - New version of lcg-tags with better error reporting - New version of lcg-info with support for VOViews, sites and services - lcg-CE for glite 3.1 - Updated Torque (2.1.9-4) and Maui (3.2.6p19-4) - gLite 3.1 TORQUE_utils (slc4/ia32) - gLite 3.1 TORQUE_server (slc4/ia32) - glite-yaim-core 4.0.1 for the 3.1 repository - glite-yaim-clients 4.0.1 for the 3.1 repository
PPS Coordination (C5 Report on 25-Oct-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * Minutes of the PPS all-sites meeting available. http://indico.cern.ch/sessionDisplay.py?sessionId=33&confId=18714 * gLite3.1.0-PPS-UPDATE08 released to PPS . This update contains: - R3.1 FTS update (glite-data_R_3_1_35_1) - JobWrapper tests - new version with no R-GMA dependencies - New version of lcg-tags with better error reporting - New version of lcg-info with support for VOViews, sites and services - lcg-CE for glite 3.1 - Updated Torque (2.1.9-4) and Maui (3.2.6p19-4) - gLite 3.1 TORQUE_utils (slc4/ia32) - gLite 3.1 TORQUE_server (slc4/ia32) - glite-yaim-core 4.0.1 for the 3.1 repository - glite-yaim-clients 4.0.1 for the 3.1 repository The release is now under an intense pre-deployment testing activity and it has not been yet deployed to all pps sites. * BDII for version 3.1 from PPS Update07 was not relaesed in production with Update04 as originally planned. Waiting for a bug in YAIM to be fixed and release documentation improved
PPS Quarterly Report for EU (Jul- Sep 2007) Review of PPS in QR06 The Pre-Production Service ran through Q6 taking advantage of the consolidation work performed on processes and procedures during Q5, with a new improved release process: http://www.cern.ch/pps/index.php?dir=./release/process. There were 12 new releases of middleware and these included updates and new services of the gLite middleware and these were promptly deployed across the 30 sites composing the PPS grid and so new services were made accessible early to Grid users: http://www.cern.ch/pps/index.php?dir=./release/bulletin More than 80 software problem reports, mostly dealing with configuration issues, were opened during the deployment activities. This was done without any appreciable service interruptions, in order to avoid disturbing the Data Challenge for Diligent, a 4-month continuous activity successfully completed on the 11th of October: https://twiki.cern.ch/twiki/bin/view/DILIGENT/DiligentFlickrDC A number of operation-related activities were migrated from CERN to other PPS sites. Namely the responsibility for pre-deployment testing, Service Availability Monitoring and technical coordination of the releases is now shared with sites external to CERN. An all-site meeting was held, during EGEE07, where an important agreement was reached with the EGEE Grid Operators (COD) about the modality to follow-up service tickets raised against PPS sites. Interoperability between EGEE/PPS sites and OSG/ITB sites (OSG Integration TestBed) was tested and a proof-of-concept was set-up to test a new method to distribute middleware clients through interoperable grids. PPS - Plan for QR07 Work for Q7 has already started with the main goal of re-structuring the service. Re-structuring is deemed necessary in order to increase the usage rate of PPS by the HEP VOs. Currently, in fact, all the HEP VOs report an increasing difficulty, mainly due to lack of manpower, to interact with a PPS grid which is completely parallel and distinct by the production one. A plan is being drafted for technical changes to be applied in the services during Q7. The guidelines for the service are: - rationalization and optimization of the resources; - focus on users and more involvement of the users in the service acceptance protocol; - integration and formalisation within the PPS framework of non-standard testing activity currently performed by the VOs on the production platform. PPS will take active part, during Q7, to the SA1 EGEE/OSG interoperation program aimed to assure a stable interoperability of the OSG and EGEE middleware stacks throughout releases as well as to develop and maintain suitable interoperation procedures.
PPS Coordination (C5 Report on 18-Oct-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.0.2-PPS-UPDATE07 released to PPS sites. Upgrade in progress This update contains: - glite-FTM Normal - gLite 3.1 BDII (slc4/ia32) Normal * Documentation of site-BDII delivered with Update07 incomplete critical bug http://savannah.cern.ch/bugs/?30524 opened, to be solved before going into production
PPS Coordination (C5 Report on 18-Oct-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite3.0.2-PPS-UPDATE07 released to PPS sites. Upgrade in progress This update contains: - glite-FTM Normal - gLite 3.1 BDII (slc4/ia32) Normal * Documentation of site-BDII delivered with Update07 incomplete critical bug http://savannah.cern.ch/bugs/?30524 opened, to be solved before going into production
PPS Coordination (C5 Report on 11-Oct-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * PPS service upgraded to gLite3.0.2-PPS-UPDATE40 This release contains: - R-GMA fixes (Bug #17323) - APEL Update (glite-apel_R_2_0_17) - YAIM 4.0.0 for the 3.0 repository - lcg-vomscerts-4.6.0 adds cert for US-ATLAS server (Synch to production) - Addition of lcg-version to WN and UI - Fix to avoid LB client crash when unknown events are returned by server - Re-branded GIP that includes improved LDIF parsing * pre-deployment tests of gLite 3.1.0 PPS Update07 (FTM, BDII3.1) still in progress at pps-CNAF and CERN_PPS
PPS Coordination (C5 Report on 4-Oct-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite 3.0.2 PPS Update 40 passed the pre-deployment tests in PPS waiting to be deployed at the remaining sites This release contains: - R-GMA fixes (Bug #17323) - APEL Update (glite-apel_R_2_0_17) - YAIM 4.0.0 for the 3.0 repository - lcg-vomscerts-4.6.0 adds cert for US-ATLAS server (Synch to production) - Addition of lcg-version to WN and UI - Fix to avoid LB client crash when unknown events are returned by server - Re-branded GIP that includes improved LDIF parsing * gLite 3.1.0 PPS Update07 was delivered to PPS, currently undergoing pre-deployment tests This update contains: - glite-FTM Normal - gLite 3.1 BDII (slc4/ia32) Normal Documentation issues were found in the BDII release Fixing issues before deploying to PPS sites * PPS all-sites meeting (3rd Oct, Budapest c/o EGEE 07) http://indico.cern.ch/sessionDisplay.py?sessionId=33&slotId=1&confId=18714#2007-10-03 Minutes will be circulated next weeks.
PPS Coordination (C5 Report on 27-Sep-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * gLite 3.0.2 PPS Update 40 currently in pre-deployment in PPS This release contains: - R-GMA fixes (Bug #17323) - APEL Update (glite-apel_R_2_0_17) - YAIM 4.0.0 for the 3.0 repository - lcg-vomscerts-4.6.0 adds cert for US-ATLAS server (Synch to production) - Addition of lcg-version to WN and UI - Fix to avoid LB client crash when unknown events are returned by server - Re-branded GIP that includes improved LDIF parsing * Agenda for PPS meeting (3rd Oct, Budapest c/o EGEE 07) on-line at http://indico.cern.ch/sessionDisplay.py?sessionId=33&slotId=1&confId=18714#2007-10-03 * No release to production this week because no patches were waiting in PPS
PPS Coordination (C5 Report on 13-Sep-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * Release of gLite 3.1.0 Update06 to PPS done This release contains: - new voms certificate for US-ATLAS server - uberftp for the glite-UI node - lcg-tags added to glite 3.1 UI, WN and VOBOX - lcg-infosites added to the glite 3.1 WN and VOBOX * Javier Lopez and Esteban Freire from CESGA PPS joined the PPS Coordination team. Their main activilty will be to follow up the roll-out of the gLite middleware updates from Certification to PPS * A review of the number of available CPUs published in the information system by PPS sutes was started * Sequence diagram describing the release process Certification --> PPS --> Production now available on the PPS website http://www.cern.ch/pps/index.php?dir=./release/process * Configurations problem with one of the SAM clients (PPS-RAL) caused failure of tests at all sites at the beginnign of this week. Submission tests temporarily suspended. Now the service is fixed and restored.
PPS Coordination (C5 Report on 13-Sep-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * Release of gLite 3.0 Update 34 to production in progress This release contains: - fix in YAIM for the configuraiton of proxy renewal - VOMS certificate for the new server used by US-ATLAS - patches from PPS Update39 * Release of PPS 3.1.0 PPS Update06 started This release contains: - new voms certificate for US-ATLAS server - uberftp for the glite-UI node - lcg-tags added to glite 3.1 UI, WN and VOBOX - lcg-infosites added to the glite 3.1 WN and VOBOX * A training activity was started in order to let people from PPS-CESGA take over the coordination of the releases to PPS.
PPS Coordination (C5 Report on 22-Aug-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * Release of PPS 3.0.2 PPS Update 38 finished - This release contains the SLC3 build of WMS 3.1 * Release of PPS 3.1.0 PPS Update 05 finished - This release was to synchronise PPS with production * FCR: Central location for FCR ldif file corrected. Documentation chaged accordingly http://www.cern.ch/pps/index.php?dir=./bdii/& . * A new document, reporting about known issues in the release potentially affecting the users, requested by the experiments, is currently being studied * Intense negotiation in course with CODs upon proposed changes in the best practices currently in use to report failures affecting the PPS grid. The debate is between options: A) COD opening and following-up tickets to PPS sites (+) No exceptions in the ops procedures to be followed for PPS sites (+) ROCs automatically in the loop (+) Full test of the support path for new services as part of the Pre-production phase (-) The physiologic instability of PPS causes either the sites to receive a lot of "false positives" or the COD to spend extra care before opening a ticket (-) COD Team has the frustrating perception that PPS sites and ROCs do not always take the ticket in the due consideration, whereas, from the COD's point of view, the effort spent to handle the ticket is the same B) COD not opening tickets and PPS sites registering to automatic alarm notifications (+) No tickets to be handled by CODs, TPMs, ROCs. Notifications sent immediately to site admins who can fix the problem while it's still "hot" (+) Step in the direction of automation (as in the guidelines for EGEE3) (+) Support line in PPS not strictly dependent upon the cooperation of ROC CODs and TPMs (-) ROCs, who finally hold the responsibility for PPS sites, are completely cut off from the support. (-) Previous experience with CODs not submitting tickets. The result was a general service degradation The decision has not been made yet: We are aiming to a compromise solution bases on option "B" + a weekly status report to be sent by the CODs to ROCs and PPS Support.
PPS Coordination (C5 Report on 16-Aug-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * Release schedule changed, now releases of gLite3.1 and 3.0 will be done in alternating weeks. * Release process improved in order to handle better: - releases fast-tracked to production - releases of updates affecting different baselines * Release of PPS 3.1 PPS Update 05 in progress - mirroring of the repository requested at CNAF
PPS Coordination (C5 Report on 19-Jul-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * Information about top-level BDII config file now publicly available in http://www.cern.ch/pps/index.php?dir=./bdii/& * Meeting with SA3 and EIS to define a strategy to deploy gLite3.1 Ui in production: Some changes in the tarball distribution were put in place yesterday to address the points raised by Diana (UI service manager at CERN_PPS) Middle term strategy: - YAIM will provide a series of measures to assure the backward compatibility with previous versions of the UI over a period of 3 months minimum - SA1 will update the documentation and will start a campaign to coach the users to switch to the new configuration method.
PPS Coordination (C5 Report on 12-Jul-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * glite3.0.2 PPS U35 released to PPS - it contains the new service-oriented YAIM * Inter-operations PPS <-> OSG testing in progress - OSG ITR sites integrated in PPS information system * Details of DILIGENT data challenge, to be done in PPS, available in https://twiki.cern.ch/twiki/bin/view/DILIGENT/DiligentInfrastructurePps#Data_Challenges
PPS Coordination (C5 Report on 5-Jul-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * glite3.0.2 PPS U34 released to PPS - it contains an urgent security patche for the DPM * PPS-RAL started submitting SAM tests. Together with PPS-CYFRONET they have now taken over CERN_PPS
PPS Coordination (C5 Report on 28-Jun-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * glite3.0.2 PPS U33 released to PPS - it contains urgent fixes for the DPM - pre-deployment shown some issues in the release, currently being fixed by the integration team - the corrected patch should be delivered in production with the next release (today) * glite3.1.0 PPS U02 released to PPS - it contains the feature to remove the VoViews FQAN tag requested by Atlas - the configuration was also successfully tested at T2 site (prague_cesnet_lcg2 - should go in production with the next release (today)
PPS Coordination (C5 Report on 14-Jun-07) EGEE Pre-Production Service Coordination: ----------------------------------------- * The SL4 UI was released to the PPS this week. Significant issues have already been found regarding this new service (see comments under the section "CERN GRID Pre-Production Site") * Setting up of the grid interoperability testbed (between the EGEE grid and OSG grid) is almost complete. Verification testing to start early next week. * Testing of FTS 2.0 and SRM 2.2 continues, with the full collaboration of the experiments. * Procedure now in place to give rapid feedback to development and testing teams from the testing of new middleware releases in the PPS. This should lead to faster bug fixing and clearer list of "known issues".
PPS Coordination (C5 Report on 14-Jun-07) EGEE PRE-PRODUCTION SERVICE COORDINATION ======================================== * PPS Update 32 released on Wednesday. Upgrade of the service in progress (completed at 7 sites) o The update contains: - lsf fixes - improved BDII with indexes (performance improvement of a factor of 3 observed in earlier installation in CERN-PROD) o The pre-deployment tests showed no major issues in installation and configuration * FTS v2 testing in progress. A report of the testing will be issued soon * Long (two-days) un-availability of SAM tests due to CNAF downtime
PPS Coordination (C5 Report on 7-Jun-07) EGEE PRE-PRODUCTION SERVICE COORDINATION ======================================== * PPS Update 31 released - The new pre-deployment test done by selected sites was successfully applied: A rich report on installation and configuration issues was made available 26h after the software was released from certification and avoided a lot of work duplication * Atlas is still intensively testing the fixes for the job priority mechanism at IFIC and PIC * Some PPS sites, after the publication of the activity panels in the web site ( http://egee-pre-production-service.web.cern.ch/egee-pre-production-service/index.php?dir=./panel/& ) envisaged the need for an all-hand meeting of PPS site administrators. The proposal is to book a section in EGEE07 * First successful tranfers for ATLAS using SRMv2/FTS2.0
PPS Coordination (C5 Report on 24-May-07) EGEE PRE-PRODUCTION SERVICE COORDINATION ======================================== * No further releases done after PPS Update 29 (8 May) - This was decided upon request from the sites to have time to recover from several issue caused by the previous update * Integration of SRM2.2 test SEs into the PPS progressing: - A major configuration problem in dCache was found while testing the channel CERN->BNL - Set-up of channels at CNAF in progress * Release process: Revision of timelines for release to PPS and production We made the proposal to move to a 2-weekly cycle, with 3.5 weeks of pre-production for regular patches No objections were moved, so we are going to apply it starting from next week * An activity board resuming all activities on-going within the PPS, both service-related and VO-related, was prepared. It is currenlty being reviewed by the sites and will be announced to the VOs today. http://egee-pre-production-service.web.cern.ch/egee-pre-production-service/index.php?dir=./panel/&
PPS Coordination (C5 Report on 10-May-07) EGEE PRE-PRODUCTION SERVICE COORDINATION ======================================== * gLite 3.0 Update 29 released to production on Tuesday. This contains, among others: - #898 high priority LCG-CE modifications for DGAS support - #1144 high priority R-GMA Server fix for bugs #21558, #20090 and #23052 - # a new version of the gLite 3.1 Worker Node (glite-WN-3.1.0-3) for SL4/i386 which addresses all known issues. * Upgrade of PPS to UPDATE 29 is progressing well: more than 50% of sites upgraded * Integration of SRM2.2 test SEs into the PPS progressing: - CERN_PPS is for the time being publishing end-points in US in the information system - SAM tests are being summitted to all published SRMs. A good par of them is passing the tests, no incompatibilities were found in this regression tests - Atlas transmitted some requirements on FTS channels for preliminary tests. * Release process improved: Starting from next week a pool of 6 sites will perform an early installation test of the whole release before its deployment in PPS. This will allow to us to trap evident issues and avoid their fast-spreading in PPS. The aim is to make the overall service more stable for the users. Mario David, at LIP is coordinating this activity. * A plenary meeting with the PPS sites was held yesterday, where some of the most urgent issues were addressed. The topics covered included (agenda available at http://indico.cern.ch/conferenceDisplay.py?confId=15191): - SRM v2 testing - Migration of the SAM PPS service to external sites - Re-introduction of pre-deployment testing Attendance by the sites was high and all the topics were well received. Unfortunately we experineced major communication issues using VRVS
PPS Coordination (C5 Report on 3-May-07) EGEE PRE-PRODUCTION SERVICE COORDINATION ======================================== * Nothing to report
PPS Coordination (C5 Report on 26-Apr-07) EGEE PRE-PRODUCTION SERVICE COORDINATION ======================================== * Starting to move the SRM2.2 test SEs into the PPS for more thorough testing by the VOs (particularly the WLCG experiments). * No update released to PPS this week. * gLite 3.0 Update 22 released to production. This contains: - #1101 GFAL 1.9.0-2/lcg_utils 1.5.1-1 - #1088 Removal of incorrect myproxy deps in glite-SE* metapackages - #1066 yaim 3.0.1-10 (Major Update) - #1052 fix reading LB super-users file - #915 Voms update to 1.7 branch (Major Update)
PPS Coordination (C5 Report on 19-Apr-07) EGEE PRE-PRODUCTION SERVICE COORDINATION ======================================== * Significant bug found in SL4 native WN (gridFTP segmentation fault) * Upgrade path from "interim" WN (SL3 WN made compatible for SL4) to native SL4 WN found to be difficult. Operations managers are deciding whether to release the interim WN to production or wait for the native SL4 version. * PPS-Update 27 released to the PPS. This contains: - patch #1118 lcg-vomscerts-4.4.1 has correct cert for biomed/egeode - patch #1115 New version of lcg-info with support for VOViews, sites and services - patch #1110 Dcache 1.7.0-34 upgrade with GridFTP bug fixes - patch #1108 glite-yaim 3.0.1-12 5 > This version of YAIM enables DGAS logging on the LCG CEs. * gLite 3.0 Update 21 released to production. This contains: - #1085 Missing package python-fpconst for SL3 installation - #1084 Missing dependency on lcg-expiregridmapdir for glite-WMS - #1074 Missing dependency on glite-yaim in metapackages - #1038 lcg-info-dynamic-scheduler peformance improvement for bug#23636 - #1077 "glite-yaim-3.0.1-9 update > This is a major new version of yaim: glite-yaim-3.0.1. This version contains the following changes: o DNS-like VO names o YAIM's hierarchical configuration storage o Changes in the groups.conf file o Special accounts management o Queue management