gLite Certification - LCG Grid Deployment

gLite Certification Work Log

  • 2006-01-09(1): Task - Testbed Administration. Details: kernel upgrade done on lxb2064. Status: Done. Antonio

  • 2005-12-15(1): Task - VOMS installation. Details: Mail sent to Oracle support to get a relevant number of user accounts (60) for stress testing. Status: Done. Antonio

  • 2005-12-14(1): Task - Manual Test. Details: mysql VOMS testing (biggish number of VOs) Status: IN PROGRESS . Alessandro

  • 2005-12-2(4): Task - MEETING Details: PPS meeting. Alessandro, Antonio, Huimin Lin

  • 2005-12-2(3): Task - Manual Test. Details: BUGS follow up Status: IN PROGRESS . Alessandro

  • 2005-12-2(2): Task - Manual Test. Details: WMS troubleshooting Status: IN PROGRESS . Alessandro

  • 2005-12-2(1): Task - Manual Test. Details: New VOMS server (lxb1732) and update of the CERT testbed to use it Status: DONE . Alessandro

  • 2005-12-2(1): Task - Testbed Administration. Details: Requested kernel upgrade done on several machines. Status: Done. Antonio

  • 2005-12-1(4): Task - Manual Test. Details: All FTSs in CERTIFICATION updated with the quick fixes Status: DONE . Alessandro

  • 2005-12-1(3): Task - Manual Test. Details: All CEs and WNs in CERTIFICATION updated with the quick fixes Status: DONE . Alessandro

  • 2005-12-1(2): Task - Manual Test. Details: Import of the latest glite 1.4 quick fixes into the CERT apt repository (lxb2040) Status: DONE . Alessandro

  • 2005-12-1(1): Task - Manual Test. Details: KERNEL upgrade of the CERT testbed machines Status: DONE . Alessandro

  • 2005-11-24(5): Task - Manual Test. Details: WMS statistics: no major problems in the last two days; continuing the monitoring Status: IN PROGRESS . Alessandro

  • 2005-11-24(4): Task - Manual Test. Details: 1.4 WMS in push mode debugging : the problem is now understood apparently Status: IN PROGRESS . Alessandro, Massimo Sgaravatto

  • 2005-11-24(3): Task - Manual Test. Details: VOMS server debugging: SERIOUS PROBLEM in that lcg-voms.cern.ch and voms.cern.ch are not "clean" Status: IN PROGRESS . Alessandro, Mario (JRA1)

  • 2005-11-24(1): Task - Manual Test. Details: VOMS server debugging: SERIOUS PROBLEM in that lcg-voms.cern.ch and voms.cern.ch are not "clean" Status: IN PROGRESS . Alessandro

  • 2005-11-23(2): Task - Manual Test. Details: Bulk submission tests:3/4 of the jobs failed for various reasons (lots of them in waiting). The wms seems ok though Status: IN PROGRESS . Alessandro

  • 2005-11-23(1): Task - Manual Test. Details: Running Bugs follow ups Status: IN PROGRESS . Alessandro

  • 2005-11-22(5): Task - Manual Test. Details: Running of the certification testsuite to reproduce the memory leak against lxb1913 Status: IN PROGRESS . Alessandro, Di

  • 2005-11-22(3-4): Task - Manual Test. Details: Bulk submission tests Status: IN PROGRESS . Alessandro

  • 2005-11-22(2): Task - Manual Test. Details: Reconfiguration of a WMS and a CE Status: IN PROGRESS . Alessandro

  • 2005-11-22(1): Task - Manual Test. Details: Troubleshooting of lxb1913: memory leak problem Status: IN PROGRESS . Alessandro

  • 2005-11-21(3): Task - Manual Test. Details: Running of the certification testsuite against lxb1913 Status: IN PROGRESS . Alessandro, Di

  • 2005-11-21(1-2): Task - Manual Test. Details: Gangmatching tests: success (listmatch, submission and retrieval of the output sandbox) Status: DONE . Alessandro

  • 2005-11-18(4): Task - Manual Test. Details: Critical bug against the WMS in push mode: the acl information is badly created in the ISM Status: IN PROGRESS . Alessandro, Di

  • 2005-11-18(1-2-3): Task - Manual Test. Details: Gang job test: partially successful (list match works): it required the reconfiguration of a WMS (to work in push mode pointing to a suitable bdii), DPM and two CEs Status: IN PROGRESS . Alessandro

  • 2005-11-17(6): Task - Manual Test. Details: Reinstallation and configuration of lxb1913 (glite 1.4.2 WMS in pull mode) Status: Done . Alessandro

  • 2005-11-17(5): Task - Manual Test. Details: Preproduction troubleshooting Status: Done . Alessandro, Di

  • 2005-11-17(4): Task - Manual Test. Details: Reconfiguration of all the CE(s) in certification Status: Done . Alessandro

  • 2005-11-17(3): Task - Manual Test. Details: Update of all the WMS(s) in certification to 1.4.2 Status: Done . Alessandro

  • 2005-11-17(2): Task - Manual Test. Details: Troubleshooting of Elena's test suite problem Status: Done . Alessandro

  • 2005-11-17(1): Task - Manual Test. Details: Update of the apt repository to 1.4.2 Status: Done . Alessandro

  • 2005-11-17(1): Task - Testbed Admin. Details: Kernel upgrade to 2.4.21-37.EL.cern to all machines + reboot Status: Done . Antonio

  • 2005-11-16(4): Task - Manual Test. Details: Installation of a torque server and CE on separate machines: there is a problem with the ACL not being set in the out.ldif file of the CE; as a result the match making fails! (major bug submitted) Status: IN PROGRESS . Alessandro, Di

  • 2005-11-16(3): Task - Manual Test. Details: Troubleshooting in the PPS Status: DONE . Alessandro

  • 2005-11-16(2): Task - Manual Test. Details: Troubleshooting for Elena's tests. Status: DONE . Alessandro

  • 2005-11-16(1): Task - Meeting Details: Certification testsuite usage. Alessandro, Di, Daniele (CNAF), Matteo (CNAF), Federico (Torino)

  • 2005-11-15(3): Task - Meeting Details: PPS and certification meetings. Alessandro, Nick, Di, Antonio, Daniele (CNAF), Matteo (CNAF), Federico (Torino), Graeme (Glasgow)

  • 2005-11-15(2): Task - Manual Test. Details: SC troubleshooting. Status: IN PROGRESS . Alessandro

  • 2005-11-15(1): Task - Manual Test. Details: WMS troubleshooting. Status: IN PROGRESS . Alessandro

  • 2005-11-14(7): Task - Manual Test. Details: Installation and configuration of the 1.4.1 oracle SC in the certification testbed Status: DONE . Alessandro

  • 2005-11-14(6): Task - Manual Test. Details: Upgrade of the mysql SC to 1.4.1 in the certification testbed Status: DONE . Alessandro

  • 2005-11-14(5): Task - Manual Test. Details: Upgrade of all the CEs to 1.4.1 in the certification testbed Status: DONE . Alessandro

  • 2005-11-14(4): Task - Manual Test. Details: Upgrade of all the WMS to 1.4.1 in the certification testbed Status: DONE . Alessandro

  • 2005-11-14(3): Task - Manual Test. Details: Upgrade of the RGMA server to 1.4.1 in the certification testbed Status: DONE . Alessandro

  • 2005-11-14(2): Task - Manual Test. Details: Upgrade of all the WNs to 1.4.1 in the certification testbed Status: DONE . Alessandro

  • 2005-11-14(1): Task - Manual Test. Details: Remirror of the apt rep: it works now with the CE (something changed in JRA1's rep?) Status: DONE . Alessandro

  • 2005-11-11(5): Task - Manual Test. Details: 1.4.1 CE does not install (unmet dependency with lcg-info-dynamic-software) Status: IN PROGRESS . Alessandro

  • 2005-11-11(4): Task - Manual Test. Details: 1.4.1 WMS upgraded (lxb1926) Status: DONE . Alessandro

  • 2005-11-11(3): Task - Manual Test. Details: 1.4.1 release: mirror of the apt repository (quick fixes under updates) Status: DONE . Alessandro

  • 2005-11-11(2): Task - Manual Test. Details: WMS testing Status: IN PROGRESS . Alessandro

  • 2005-11-11(1): Task - Meeting Details: group meeting Status: IN PROGRESS . Alessandro

  • 2005-11-10(4): Task - Manual Test. Details: Troubleshooting of the Russian certification authority problem on the VOMS server(s) Status: DONE . Alessandro

  • 2005-11-10(3): Task - Manual Test. Details: Problems with the gangmatching submission: critical bug against the WMS for a dead lock (both in 1.4 and 1.4.1) Status: IN PROGRESS . Alessandro

  • 2005-11-10(2): Task - Manual Test. Details: 1.4 WMS installation and configuration as a backup service (lxb1926) Status: IN PROGRESS . Alessandro

  • 2005-11-10(1): Task - Manual Test. Details: Problems with the WMS being unstable; jobs do not complete! Condor is in a bad state and no clear way to drain the queue is available (after submitting 7000 bulk jobs): it seems that the jobs remained on the WMS for so long that the proxy expired. The WMS seems to keep trying to renew the proxy indefinitely (as seen from the Schedd log on the CE): the bug is still classified as an enhancement but might be raised to major! Status: IN PROGRESS . Alessandro, Di

  • 2005-11-9(3): Task - Manual Test. Details: New mysql 1.4 FTS installed and configured (lxb1787). Tested the rgma service discovery mechanism and it does NOT work (a bug submitted): it seems it works in 1.4.1 though! Status: IN PROGRESS . Alessandro

  • 2005-11-9(2): Task - Manual Test. Details: Proxy renewal daemon on the WMS takes up a lot of CPU: debugging session with Mario on Monday (bug submitted by JRA1): it may be a scalability problem! Status: IN PROGRESS . Alessandro

  • 2005-11-9(1): Task - Manual Test. Details: Problems with the WMS being unstable; jobs do not complete! Condor is in a bad state and no clear way to drain the queue is available as the condor commands do not respond (restarting the service does not solve the problem): still not a clear answer on this Status: IN PROGRESS . Alessandro

  • 2005-11-8(1-2): Task - Manual Test. Details: Further tests with the bulk submission: 7000 jobs submitted. Problems with the WMS being unstable; jobs do not complete! Condor is in a bad state and no clear way to drain the queue is available (bug submitted) as the condor commands do not respond (restarting the service does not solve the problem) Status: IN PROGRESS . Alessandro

  • 2005-11-7(3): Task - Manual Test. Details: Further tests with the bulk submission: 7000 jobs submitted. Status: IN PROGRESS . Alessandro

  • 2005-11-7(2): Task - Meeting Details: PPS meeting with CNAF: agreed that CNAF will take over the mixed testing of GLITE with LCG; further developments in the coming days Status: IN PROGRESS . Alessandro, Daniele Cesini (CNAF)

  • 2005-11-7(1): Task - Manual Test. Details: WMS performance test: still a memory leak problem. The swap area is exhausted very quickly, but jobs get executed successfully. Further tests with the bulk submission went ok. Status: IN PROGRESS . Alessandro

  • 2005-11-4(2): Task - Manual Test. Details: WMS performance test: still a memory leak problem. The swap area is exhausted very quickly, but jobs get executed successfully. Status: IN PROGRESS . Alessandro

  • 2005-11-4(1): Task - Meeting PPS meeting. Alessandro, Nick, Antonio, Di

  • 2005-11-3(1-2): Task - Manual Test. Details: WMS performance test: still a memory leak problem. The swap area is exhausted very quickly, but jobs get executed successfully. Status: IN PROGRESS . Alessandro, Di

  • 2005-11-2(3): Task - Meeting Details: CMS meeting . Alessandro, Antonio, Di

  • 2005-11-2(2): Task - Manual Test. Details: Bulk submission test (parametric jobs): Failed as the UI does not know the bulk submission jdl attributes: a bug has been submitted! Status: IN PROGRESS . Alessandro

  • 2005-11-2(1): Task - Manual Test. Details: RGMA server upgraded to 1.4 in the certification testbed Status: DONE . Alessandro

  • 2005-11-1(2): Task - Manual Test. Details: 1.4 FTS troubleshooting in PPS (upgraded from 1.3) Status: IN PROGRESS . Alessandro, Di

  • 2005-11-1(1): Task - Manual Test. Details: 1.4 WMS test in push mode with LCG production bdii and new configuration matrix (lxb1940): no visible difference! Status: IN PROGRESS . Alessandro

  • 2005-10-31(3): Task - Manual Test. Details: DLI regression test: the 1.3 quick fix is NOT in 1.4 (waiting for 1.4.1) Status: IN PROGRESS . Alessandro

  • 2005-10-31(2): Task - Manual Test. Details: Problems with the WMS and glite-job-listmatch in the certification area (MIX testbed) Status: IN PROGRESS . Alessandro

  • 2005-10-31(1): Task - Manual Test. Details: Problems with the WMS in the certification area: critical bug submitted Status: IN PROGRESS . Alessandro, Di

  • 2005-10-28(1): Task - Manual Test. Details: Problems with the WMS in the certification area Status: IN PROGRESS . Alessandro, Di

  • 2005-10-27(1): Task - Manual Test. Details: Problems with the LB in the preproduction WMS at CERN (critical bug by Di): currently retesting the WMS in certification to reproduce the problem! Status: IN PROGRESS . Alessandro, Di

  • 2005-10-26(2): Task - Manual Test. Details: Problems with the LB in the preproduction WMS at CERN (critical bug by Di): currently retesting the WMS in certification to reproduce the problem! Status: IN PROGRESS . Alessandro, Di

  • 2005-10-26(1): Task - Manual Test. Details: 1.4 ORACLE FTS: installed and configured. glite-transfer-submit-placement does not work! (problems with creating a gsiftp entry using the IO server!!!): a bug submitted Status: IN PROGRESS . Alessandro, Di

  • 2005-10-25(1-2): Task - Manual Test. Details: 1.4 ORACLE FTS: installed and configured. glite-transfer-submit-placement does not work! (problems with creating a gsiftp entry using the IO server!!!) Status: IN PROGRESS . Alessandro, Di

  • 2005-10-24(3): Task - Manual Test. Details: 1.4 UI reconfigured Status: IN PROGRESS . Alessandro

  • 2005-10-24(1-2): Task - Manual Test. Details: 1.4 ORACLE FTS: installed and configured. It works ok with urlcopy (simple test with no catalog interaction again) Status: IN PROGRESS . Alessandro, Di

  • 2005-10-21: Task - Manual Test. Details: 1.4 mysql FTA installation and configuration: added CHANNEL_TEST and tested the transfer from one of the CASTOR machine (lxb1909) to another CASTOR machine (lxb1909): the test WAS successful. We reconfigured the service to use urlcopy (again very poor documentation: bugs to come) and succeeded in transferring one file (very time consuming!). The srmcopy mode still has problems (possibly a bug) Status: IN PROGRESS . Alessandro, Di

  • 2005-10-20: Task - Manual Test. Details: 1.4 mysql FTA installation and configuration: added CHANNEL_TEST and tested the transfer from one DPM to another DPM: the test was NOT successful. We sorted out the service discovery problems (interaction with Diana and Gavin) as well as the srm compatibility problem with DPM (it just does not work in that case). Also the fts mode differs from the fps one in the DN which is used to contact the SRM endpoint (fps: the fts dn is used; fts: the user dn is used instead). Testing with castor: still problems with srmcopy Status: IN PROGRESS . Alessandro, Di

  • 2005-10-19(3): Task - Manual Test. Details: 1.4 mysql FTA installation and configuration: added CHANNEL_TEST and tested the transfer from one DPM to another: the test was NOT successful. The FTS works with the myproxy delegation: the myproxy server service discovery seems not to be working, i.e. services.xml is not read, and the MYPROXY_SERVER env variable is not read either Status: IN PROGRESS . Alessandro

  • 2005-10-19(2): Task - Bugs Submitted. Details: #12517--> IO Server release notes(documentation); 12522--> DGAS Client (impossible to exclude) Status: Done . Antonio

  • 2005-10-19(1): Task - Manual Test. Details: 1.4 mysql FTA installation and configuration: added CHANNEL_TEST and tested the transfer from one DPM to another:the test was NOT successful though Status: IN PROGRESS . Alessandro

  • 2005-10-18(2): Task - Meeting Details: Glite extended integration meeting. Alessandro

  • 2005-10-18(1): Task - Manual Test. Details: 1.4 FTA installation and configuration: added CHANNEL_TEST and tested the transfer from one DPM to another: the test was NOT successful (it took a lot of time: the documentation is wanting; submitted a few bugs). Status: IN PROGRESS . Alessandro

  • 2005-10-17(4): Task - Manual Test. Details: 1.4 FTA installation and configuration: I dropped the schema with the 1.3 oracle script; I then had to stop the 1.3 services as the maximum number of connections to the database had been reached! In the end the agents started up successfully. Status: DONE . Alessandro

  • 2005-10-17(3): Task - Manual Test. Details: 1.4 FTS installed (OK). FTA installation and configuration failed because of a schema incompatibility. Status: IN PROGRESS . Alessandro

  • 2005-10-17(2): Task - Manual Test. Details: 1.4 DGAS SERVER installed and configured (lxb1763.cern.ch). Status: DONE. Alessandro

  • 2005-10-17(1): Task - Meeting Details: Monitoring meeting: discussion of glite critical/major bugs . Alessandro, Nick

  • 2005-10-14(3): Task - Manual Test. Details: DGAS SERVER Won't install: there is a conflict with perl-DBD-MySQL in the glite REP and the CERN one (older version), which is NOT installed in our linux inst. Status: IN PROGRESS . Alessandro

  • 2005-10-14(2): Task - Manual Test. Details: FTS configuration script fails (bug submitted) Status: IN PROGRESS . Alessandro

  • 2005-10-14(1): Task - Manual Test. Details: DGAS SERVER Won't install (bug submitted) Status: IN PROGRESS . Alessandro

  • 2005-10-14(1): Task - Testbed Administration. Details: UI lxb1937 inserted in the backup system. Now a daily backup of the home directories is done. Status: Done. Antonio.

  • 2005-10-13(5): Task - Manual Test. Details: Troubleshooting of the lcg-voms.cern.ch voms server. Status: SORTED . Alessandro, Maria

  • 2005-10-13(4): Task - Manual Test. Details: DGAS server won't install! Status: IN PROGRESS . Alessandro

  • 2005-10-13(3): Task - Manual Test. Details: Upgrade of the SC to 1.4 (lxb1934). Status: Done. Alessandro.

  • 2005-10-13(2): Task - Manual Test. Details: Retesting of the WMS in push mode with a production bdii (change in the configuration setting DisablePurchasingFromGris = false;): no difference!. Status: Done. Alessandro.

  • 2005-10-13(1): Task - Manual Test. Details: Troubleshooting of the lcg-voms.cern.ch voms server. Status: Progress . Alessandro, Maria

  • 2005-10-12(4): Task: Support to production. Notes: Installation of a "production" WMS in version 1.4 done with Yvan on lxb1175. Status: Progress. Antonio

  • 2005-10-12(3): Task - Manual Test. Details: Retesting of the WMS in push mode with a production bdii. Status: Done.Alessandro.

  • 2005-10-12(2): Task - Manual Test. Details: Reconfiguration of the LCG RB. Reconfiguration of the glite and LCG CE (for the LCG only reinstallation of the log daemon and restart of it) in the MIX testbed. Job submission test successful. Status: Done. Alessandro.

  • 2005-10-12(1): Task - Manual Test. Details: IO SERVERS in the certification area upgraded to 1.4. Status: Done.Alessandro.

  • 2005-10-11(3): Task - Manual Test. Details: JRA1 integration meeting: PPS status, certification, 1.4.1 release, pyXML rpm conflict with LCG issue, name convention change (postponed). Alessandro.

  • 2005-10-11(2): Task - Manual Test. Details: UPGRADE of the MIX testbed to 1.4: the certification testsuite is to be run against the glite and lcg RBs. Status: Done. Alessandro.

  • 2005-10-11(1): Task - Manual Test. Details: cemon.wms.host.subject is ignored. Check redone: the bug is invalid! Status: Done.Alessandro.

  • 2005-10-10(4): Task - Manual Test. Details: cemon.wms.host.subject is ignored. This is a security issue which affects the 1.4 WMS working in push mode (pure glite): as a consequence in push mode any WMS can retrieve any CE's information and therefore submit jobs to the CEs (a major bug has been submitted) Status: In Progress. Alessandro, Antonio.

  • 2005-10-10(2): Task - Manual Test. Details: The CEs in the certification testbed have been reconfigured because of a misconfiguration (lxb1923 had the wrong WMS's DN configured when working in push mode). Status: Done. Alessandro, Antonio.

  • 2005-10-10(1-2): Task - Manual Test. Details: problem with 1.4 WMS in push mode (lxb1940). Status: Solved. Alessandro.

  • 2005-10-07(5): Task - Manual Test. Details: RGMA purchaser and BDII cannot be used together by the WMS (bug submitted): the rgma purchaser mechanism cannot be deployed as a consequence Status: Progress. Alessandro.

  • 2005-10-07(4): Task - Manual Test. Details: RGMA gin on the CEs does not publish correctly the whole of the ACLs information. As a result the RGMA purchaser mechanism cannot work (major bug submitted) Status: Progress. Alessandro.

  • 2005-10-07(3): Task - Manual Test. Details: The 1.4 WMS working in push mode with a production bdii (UPDATE after monitoring for few hours): the ISM is empty almost all the time (critical bug has been submitted) . Status: Done.Alessandro.

  • 2005-10-07(2): Recurrent Task: 004 - Briefing. Details: Introduction to Gergely Status: Done. Antonio.

  • 2005-10-07(1): Task - Manual Test. Details: The 1.4 WMS working in push mode with a production bdii has problems: the ISM is empty almost all the time (a bug has been submitted) Status: IN PROGRESS. Alessandro.

  • 2005-10-06(3): Task - Manual Test. Details: The 1.4 WMS working in push mode with a production bdii has problems: the ISM is empty almost all the time (a bug has been submitted) Status: IN PROGRESS. Alessandro.

  • 2005-10-06(2): Task - Manual Test. Details: The RGMA service discovery mechanism (using RGMA to populate the information supermarket) in the 1.4 WMS WORKS OK! (1.4 WMS is lxb1914.cern.ch)! (tested with a 1.3 RGMA server lxb1944.cern.ch with two CEs in it) Status: Done. Alessandro.

  • 2005-10-05(5): Long-term Task: 001 - Define Certification Process and Procedures Notes: "Stupid Tests" section added to the wiki pages Status: Progress. Antonio

  • 2005-10-05(4): Recurrent Task: 001 - Testbed Installation and configuration. Details: Installation and configuration of 2 CEs 1.4 in pull mode (lxb1935,lxb1923). Status: Done. Antonio

  • 2005-10-05(3): Recurrent Task: 002 - Manual Test. Details: Testing of the WMS with a bdii: it works. The rgma and bdii mechanisms are incompatible (you cannot have both). Status: Done. Alessandro.

  • 2005-10-05(2): Recurrent Task: 003 - MIX. Details: Upgrade of the push mode WMS in the MIXED testbed (lxb1933) to R1.4. Testing of the WMS with a bdii (problems). Status: Progress. Alessandro

  • 2005-10-05(1): Recurrent Task: 001 - Testbed Installation and configuration. Details: Upgrade of the UI (lxb1937) to R1.4. Status: Done. Alessandro.

  • 2005-10-04(5): Recurrent Task: 002 - Manual Test. Details: Simple job submission (hello world job) with a CE(1.3), WN(1.3), WMS (1.4) and UI (1.3) in PULL mode works. Status: Done. Alessandro

  • 2005-10-04(4): Task: 002 - SC-ORA installation on CERT. Notes: The problem deals with the CERN character set. Causes have been found by Krisztof and a request for support has been sent to Oracle support at CERN. Status: Progress. Antonio

  • 2005-10-04(3): Meeting: gLite XIT meeting. - Bugs against the bulk submission will be solved with gLite 1.4.1. Antonio,Alessandro

  • 2005-10-04(2): Recurrent Task: 001 - Testbed Installation and configuration. Details: Installation and configuration of WMS 1.4 in push mode (lxb1940). Status: Done. Alessandro

  • 2005-10-04(1): Recurrent Task: 001 - Testbed Installation and configuration. Details: Installation and configuration of WMS 1.4 in pull mode (lxb1913). Status: Done. Alessandro

  • 2005-10-03(3): Recurrent Task: 001 - Testbed Installation and configuration. Details: Reinstalling of the certification testbed nodes and editing of the XML global file. Status: Done. Alessandro, Antonio and Di

  • 2005-10-03(2): Meeting: - It has been decided that the Certification be done per component (the 1.4 WMS is to be released asap to the preproduction). Antonio, Marcus, Alessandro and Di

  • 2005-10-03(1): Recurrent Task: 001 - Testbed Installation and configuration. Details: Mirror of the JRA1 1.4 apt repository to lxb2040. Status: Done. Alessandro.

  • 2005-10-03(1): Meeting: - Certification meeting (discussion on the modification of the certification process). Alessandro, Antonio and Di

  • 2005-09-30(2): Task: - FTS success (there were problems with dropping the 1.2 DB schema). Alessandro

  • 2005-09-30(2): Meeting: - VOs handling in glite 1.4. Alessandro, Antonio and Robert (JRA1)

  • 2005-09-30(1): Task: - FTS test. Alessandro

  • 2005-09-29(2): Task: - Further testing. bug #10607 seems to be still outstanding despite the release of the quick fix QF1.3.0_21_2005. Alessandro, Di

  • 2005-09-29(1): Task: - Testing of the quick fixes (bug #10063 is solved by quick fix gLite QF1.3.0_21_2005 but bug #10607 seems to be still outstanding). Status: Done.Alessandro, Di

  • 2005-09-28(2): Task: - Testing of the quick fixes and APT repository update (bug #9684 and bug #10816 are solved by the new rpm glite-ce-config-2.0.3-1.noarch.rpm). Status: Done. Alessandro

  • 2005-09-22(2): Task: - FPS debugging. Alessandro, Di

  • 2005-09-22(1): Meeting: - Definition of the roadmap to follow for the Preproduction (upgrade to 1.3, diligent VO to be inserted, ACLs) and Certification (data management) in the few weeks to come. Alessandro, Antonio, Di

  • 2005-09-21(1): Task: 016 - Tutoring Notes: Brief Igor on his activity in the next days Status: Done. Antonio

  • 2005-09-20(1): Task: 018 - Reconfiguration of the IO servers in the Certification (and indirectly preproduction) and their testing. Alessandro

  • 2005-09-19(1): Task: 017 - Certification WNs reconfiguration to support the RGMA service discovery mechanism. Alessandro

  • 2005-09-16(5): Long-term Task: 001 - Define Certification Process and Procedures Notes: Procedure to configure SFT in pre-production Status: Progress. Antonio

  • 2005-09-16(4): Task: 016 - Tutoring Notes: Brief Valentin on YAIM documentation modules: LCG activity Status: Done. Antonio

  • 2005-09-16(3): Task: 015 - Critical bug report Notes: to JRA1 for configuration of custom env variables in the WN. bug #10843 Status: Done. Di

  • 2005-09-16(2): Meeting: JRA1 and CERTIFICATION - Understanding and possible solutions for the running of CMS (env variable on WN), tag variable in the glue schema and possibility to change out.ldif on the fly. Alessandro, Antonio, Di, Alberto (JRA1)

  • 2005-09-16(1): Task: 015 Working on the cms requirements and the CESE binding, WN env variable problem. Alessandro

  • 2005-09-15(2): Meeting: JRA1 and CERTIFICATION - VO specific information to be placed in a centralized location (major progress on the refactoring of the configuration files to come) Alessandro, Antonio, Robert Harakaly (JRA1)

  • 2005-09-15(1): Task: 014 - CESE binding test in progress. Alessandro

  • 2005-09-14(4): Long-term Task: 001 - Define Certification Process and Procedures Notes: New section added to wiki . procedure to handle SFT started Status: Progress. Antonio

  • 2005-09-14(3): Meeting: Certification - ACL problem on the preproduction CEs and impossibility to currently have this information propagated to the WMS discussed (bug #10798 submitted). Alessandro Antonio Di Nick

  • 2005-09-14(2): Meeting: Certification - Analysis of the cms use case and possible solution found (multiple CESE bindings). Alessandro Antonio Di Nick

  • 2005-09-14(1): Task: 012 - RGMA service discovery mechanism issue cleared Notes Alessandro

  • 2005-09-14(1): Long-term Task: 001 - Define Certification Process and Procedures Notes: Procedure to handle APT added Status: Progress. Alessandro

  • 2005-09-02(2): Task: 010 - WMS match making with a LFN works ok with LCG but still fails in glite. A bug has been submitted. Alessandro

  • 2005-09-02(1): Task: 009 - I mirrored the jra1 apt rep in CERT. More info can be found on the node status page for lxb2040. Alessandro

  • 2005-09-02(1): Meeting: Certification team. Decisions - Major Action Points:
    • Meeting with Maarten to identify the possible deployment problems of the glite WMS within the LCG production grid.

  • 2005-09-01(2): Meeting: Certification team. Decisions - Major Action Points:
    • to test FTS and FPS in CERT -> Di
    • to try and run the Certification Test Suite and (likely) understand why it fails -> Alessandro, Di
    • to mirror the apt rep in CERT -> Alessandro
    • to write a list of tests to be done on WMS -> ALL+ Maarten
    • to organize scalability test of the WMS -> Nick
    • to set-up SFT in PPS -> Antonio
    • (PPS) to ask Javier to upgrade R-GMA server -> Nick
    • (PPS) ask some sites to start the upgrade of the CEs -> Nick. Antonio

  • 2005-09-01(1): Task: 008 - Analysis of a workaround to solve the SFT matchmaking problem. Notes: Three possible solutions proposed:
    1. to retrieve with an RGMA query the list of WMS to forward the tests -> not feasible: the schema does not support it and major changes to SFT would be needed
    2. to configure a dedicated WMS in PUSH mode with all the CEs -> not feasible: not scalable and bug #10063.
    3. Register the CERN WMS in all the PPS CEs: -> feasible: only available solution in the short term. Status: Done. e-mail sent out to PPS site admins to anticipate the needed configuration change. Antonio

  • 2005-08-31(2): Task: 007 - Test on the rgma service discovery mechanism Notes: The certification testbed UI has been reconfigured to rely on the rgma service discovery mechanism. However there are problems in that the rgma query fails to retrieve any entry points from the rgma server. Also the mysql Single Catalog has wrong default parameters. Status: Suspended. Alessandro

  • 2005-08-31(1): Task: 006 - glite WMS interoperability with LCG LFC on the MIX CERT testbed (lxb1933). Notes: It seems it is not working on the glite WMS as well as on the LCG RB (followed Sophie and Min recipe). The LFC used was lfc-dteam-test.cern.ch, which does not have a gris. However, its information is present in the production bdii lcg-bdii.cern.ch (probably inserted there by hand). I created a dir on the LFC and added a file to the "grid" (LFC catalog and castor) which was then put as a requirement in a jdl. Status: Suspended. Alessandro

  • 2005-08-30(1): Task: 002 - SC-ORA installation on CERT. Notes: Installation on Oracle 10g tried unsuccessfully. The error message is slightly different. Follow-up posted to the bug. Status: Suspended. Moving to more urgent matters. Waiting for release 1.4. Antonio

  • 2005-08-29(3): Task: 001 - SFT run on CERT. Notes: The CE list is retrieved, but the job list match fails because the WMSs are hardcoded in the CEs, so it is impossible to see a given CE unless you are using a particular WMS. Service discovery is missing. Status: Suspended. This could be a showstopper; maybe Piotr can give a solution, but it would be better if service discovery were in place. Antonio

  • 2005-08-29(2): Task: 005 - Configure SFT to get the list of CEs from the GOC DB. Notes: The query has been configured in /opt/lcg/sft/defaults.glite on lxb1937. Status: Done. Antonio

  • 2005-08-29(1): Task: 002 - SC-ORA installation on CERT. Notes: The fix proposed to the critical bug does not work. SC developers (christophe) propose to switch to Oracle 10g (version officially supported). A new Oracle account on "devdb10" has been requested. Status: Suspended. Waiting for the account to be created. Antonio

  • 2005-08-26(2): Bug #10501 submitted into Savannah. Antonio

  • 2005-08-26(1): Task: 004 - Migrate PPS site data from the GOC db into the RGMA-server in CERT. Notes: Min's scripts used. Re-configuration of the flexible-archiver on the RGMA server was needed. Procedure written. Status: done. About 4 hours. Antonio

  • 2005-08-25(3): Task: 003 - Contact Javier Sanchez. Notes: Javier is available to run the test as soon as we give him a working configuration for the SC Status: done. Antonio

  • 2005-08-25(2): Task: 002 - SC-ORA installation on CERT. Notes: the 'ls' fails. Cross-check with JRA1 configuration done unsuccessfully. BUGS #10491 (critical) #10492 #10493 opened. Status: Suspended. Waiting for the critical bug to be fixed. Antonio.

  • 2005-08-25(1): Meeting: Certification team. Decisions: Try and interface sft in PPS with the GOC db. Explore the solution of an Oracle BE at CERN for the PPS SC. Major Action Points:
    • to get the SC-ORA working in CERT; to contact Javier Sanchez for an external test of the SC; query to GOC for the SFT -> Antonio
    • to get FTS and FPS working in CERT -> Di
    • to re-style the internal process documentation (certification wiki) -> Vladimir
    • to read the test documentation from JRA1 -> Nick
    • Proposals: Vladimir's group in Bulgaria could take part in the development of the certification test suite. Antonio

  • 2005-08-24(2): Task: 001 - SFT run on CERT. Notes: the script fails to publish results for the basic job submission tests using the SFT server installed by Piotr in pre-production Status: Suspended. Waiting for Piotr. Antonio

  • 2005-08-24(1): Task: 001 - SFT run on CERT. Notes: the flag to switch the suite from LCG to gLite sometimes does not work, so 'lcg' tests are sent to gLite resources. Only the basic job submission tests can be run Status: Suspended. Waiting for Piotr. Antonio

  • 2005-08-23(1): Task: 001 - SFT server installed on top of a gLite RGMA server. Notes: the installation procedure refers to lcg-archiver and should be modified for the flexible archiver. Status: Suspended. Waiting for Piotr. Antonio

  • 2005-08-19(2): Certification wiki page with work log created. Antonio

  • 2005-08-19(1): 4 Bugs submitted into Savannah. Antonio

-- Main.aretico - 19 Aug 2005

Topic revision: r93 - 2006-11-28 - LaurenceField
 