Certification Report for Patch 3022 (LB 2.0, SL5, 64-bit)
Environment
* Clean SL5 installation, then:
yum -y install createrepo
cat << EOF > /etc/yum.repos.d/glite-local.repo
[main]
[glite-local]
name=Local glite RPMS repository
baseurl=file:///var/cache/glite-local
enabled=1
EOF
mkdir /var/cache/glite-local
cd /var/cache/glite-local
wget http://etics-repository.cern.ch/repository/download/registered/externals/c-ares/1.3.0/sl5_x86_64_gcc412/c-ares-1.3.0-4.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/externals/classads/1.0/sl5_x86_64_gcc412/classads-1.0-2.sl5.x86_64.rpm
wget http://linuxsoft.cern.ch/EGEE/gLite/R3.2/glite-GENERIC/sl5/x86_64/RPMS.externals/fetch-crl-2.7.0-2.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/glite-info-generic/2.0.2/noarch/glite-info-generic-2.0.2-5.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/glite-info-provider-release/1.0.0/noarch/glite-info-provider-release-1.0.0-8.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/glite-info-templates/1.0.0/noarch/glite-info-templates-1.0.0-11.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.jobid.api-c/1.0.0/sl5_x86_64_gcc412/glite-jobid-api-c-1.0.0-4.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.jobid.api-cpp/1.0.0/sl5_x86_64_gcc412/glite-jobid-api-cpp-1.0.0-4.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.client/4.0.3/sl5_x86_64_gcc412/glite-lb-client-4.0.3-1.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.common/7.0.3/sl5_x86_64_gcc412/glite-lb-common-7.0.3-1.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.doc/1.0.3/sl5_x86_64_gcc412/glite-lb-doc-1.0.3-1.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lbjp-common.db/1.0.1/sl5_x86_64_gcc412/glite-lbjp-common-db-1.0.1-3.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lbjp-common.jp-interface/1.0.0/sl5_x86_64_gcc412/glite-lbjp-common-jp-interface-1.0.0-6.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lbjp-common.maildir/1.0.0/sl5_x86_64_gcc412/glite-lbjp-common-maildir-1.0.0-3.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lbjp-common.server-bones/1.0.0/sl5_x86_64_gcc412/glite-lbjp-common-server-bones-1.0.0-4.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lbjp-common.trio/1.0.0/sl5_x86_64_gcc412/glite-lbjp-common-trio-1.0.0-5.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.logger/2.0.4/sl5_x86_64_gcc412/glite-lb-logger-2.0.4-1.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.server/2.0.3/sl5_x86_64_gcc412/glite-lb-server-2.0.3-2.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.state-machine/1.0.2/sl5_x86_64_gcc412/glite-lb-state-machine-1.0.2-1.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.utils/2.0.3/sl5_x86_64_gcc412/glite-lb-utils-2.0.3-1.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.ws-interface/3.0.1/sl5_x86_64_gcc412/glite-lb-ws-interface-3.0.1-1.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.ws-test/1.0.1/sl5_x86_64_gcc412/glite-lb-ws-test-1.0.1-3.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.lb.glite-LB/3.2.0/sl5_x86_64_gcc412/glite-LB-3.2.0-4.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.security.gsoap-plugin/2.0.1/sl5_x86_64_gcc412/glite-security-gsoap-plugin-2.0.1-2.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.security.gss/2.0.0/sl5_x86_64_gcc412/glite-security-gss-2.0.0-5.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.security.voms-api/1.8.12/sl5_x86_64_gcc412/glite-security-voms-api-1.8.12-1.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.security.voms-api-c/1.8.12/sl5_x86_64_gcc412/glite-security-voms-api-c-1.8.12-2.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.yaim.core/4.0.10/noarch/glite-yaim-core-4.0.10-2.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.yaim.lb/4.2.1/noarch/glite-yaim-lb-4.2.1-1.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/vdt/gpt/3.2-VDT-1.10.1/sl5_x86_64_gcc412/gpt-3.2autotools2004_NMI_9.0_x86_64_rhap_5-1.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.gridsite.core/1.5.10/sl5_x86_64_gcc412/gridsite-shared-1.5.10-1.sl5.x86_64.rpm
wget http://grid-deployment.web.cern.ch/grid-deployment/glite/cert/3.2/patches/3244/sl5/x86_64/RPMS.externals/vdt_globus_essentials-VDT1.10.1x86_64_rhap_5-4.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.yaim.core/4.0.11/noarch/glite-yaim-core-4.0.11-2.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/glite-info-provider-service/1.2.2/noarch/glite-info-provider-service-1.2.2-0.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/glue-schema/2.0.3/noarch/glue-schema-2.0.3-1.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/bdii/5.0.6/noarch/bdii-5.0.6-1.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/glite-version/3.2.1/noarch/glite-version-3.2.1-1.noarch.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.security.voms-api-cpp/1.8.12/sl5_x86_64_gcc412/glite-security-voms-api-cpp-1.8.12-2.sl5.x86_64.rpm
wget http://etics-repository.cern.ch/repository/download/registered/org.glite/org.glite.security.lcas/1.3.7/sl5_x86_64_gcc412/glite-security-lcas-1.3.7-2.sl5.x86_64.rpm
createrepo .
cd ~/
yum -y install glite-LB
mkdir -m 700 yaim
cd yaim
cat << EOF > site-info.def
WN_LIST=${INSTALL_ROOT}/glite/yaim/examples/wn-list.conf
USERS_CONF=${INSTALL_ROOT}/glite/yaim/examples/users.conf
GROUPS_CONF=${INSTALL_ROOT}/glite/yaim/examples/groups.conf
MYSQL_PASSWORD=[Edited]
SITE_NAME=delwin
SITE_EMAIL="[Edited]"
SITE_LAT=0.0
SITE_LONG=0.0
CE_HOST=my-ce.$MY_DOMAIN
CE_CPU_MODEL=cpu_model
CE_CPU_VENDOR=vendor
CE_CPU_SPEED=speed
CE_OS=os_name
CE_OS_RELEASE=x.y.z
CE_OS_VERSION="version"
CE_OS_ARCH=i686
CE_MINPHYSMEM=number
CE_MINVIRTMEM=number
CE_PHYSCPU=number
CE_LOGCPU=number
CE_SMPSIZE=number
CE_SI00=number
CE_SF00=number
CE_OUTBOUNDIP=boolean
CE_INBOUNDIP=boolean
CE_RUNTIMEENV="tag1 [tag2 [...]]"
CE_CAPABILITY="CPUScalingFactorSI00=value [Share=vo-name1:value [Share=vo-name2:value [...]]]"
CE_OTHERDESCR="Cores=value[,Benchmark=value-HEP-SPEC06]"
BATCH_SERVER=$CE_HOST
JOB_MANAGER=my_job_manager
CE_BATCH_SYS=my_batch_system
BATCH_LOG_DIR=my_batch_system_log_directory
BATCH_VERSION=my_batch_system_version
APEL_DB_PASSWORD="APELDB_PWD"
RB_HOST=my-rb.$MY_DOMAIN
WMS_HOST=my-wms.$MY_DOMAIN
PX_HOST=my-px.$MY_DOMAIN
MON_HOST=my-mon.$MY_DOMAIN
FTS_SERVER_URL="https://fts.${MY_DOMAIN}:8443/path/glite-data-transfer-fts"
DPM_HOST="my-dpm.$MY_DOMAIN"
SE_LIST="SE1 SE2 SE3"
SE_MOUNT_INFO_LIST="[SE1:export_dir1,mount_dir1 [SE2:export_dir2,mount_dir2 [...]]|none]"
SE_GRIDFTP_LOGFILE=path_to_gridftp_logfile.log
VOS="vo1 [vo2 [...]]"
VO_SW_DIR=exp_soft_dir
QUEUES="q1 [q2 [...]]"
EOF
/opt/glite/yaim/bin/yaim -c -s ./site-info.def -n glite-LB
Tests
User Space TestPlan Tests
Jan 22 11:33:44 harad.ics.muni.cz lb-test-event-delivery.sh:
start
Testing if all binaries are available
done
Testing credentials
done
Registering testing job
done
Registered job:
https://delwin.fi.muni.cz:9000/2mpN6dhtHYVY-wWtX9JMJA
Logging events resulting in READY state
Sleeping for 10 seconds (waiting for events to deliver)...
Is the testing job (
https://delwin.fi.muni.cz:9000/2mpN6dhtHYVY-wWtX9JMJA
) in a correct state? Ready
done
Logging events resulting in RUNNING state
Logging events resulting in DONE state
Sleeping for 10 seconds (waiting for events to deliver)...
Testing job (
https://delwin.fi.muni.cz:9000/2mpN6dhtHYVY-wWtX9JMJA
) is in state: Done
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Jan 22 11:34:08 harad.ics.muni.cz lb-test-event-delivery.sh:
end
Jan 22 11:34:08 harad.ics.muni.cz lb-test-il-recovery.sh:
start
Testing if all binaries are available
done
Testing credentials
done
Registering testing job (
https://delwin.fi.muni.cz:9000/MWHdUh4ZKD9KWNg9846bxQ)  
;
done
Info: No command to stop was given
Generating events resulting in CLEARED state
./lb-test-il-recovery.sh: line 161: /tmp/logd_sustr.MWHdUh4ZKD9KWNg9846bxQ: No such file or directory
chmod: cannot access `/tmp/logd_sustr.MWHdUh4ZKD9KWNg9846bxQ': No such file or directory
Info: No command to start was given
Sleeping for 70 seconds (waiting for interlogger to notice and deliver events)...
Testing job (
https://delwin.fi.muni.cz:9000/MWHdUh4ZKD9KWNg9846bxQ
) is in state: Cleared
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Jan 22 11:35:18 harad.ics.muni.cz lb-test-il-recovery.sh:
end
Jan 22 11:35:18 harad.ics.muni.cz lb-test-job-registration.sh:
start
Testing if all binaries are available
done
Testing credentials
done
Registering testing job
done
Is the testing job (
https://delwin.fi.muni.cz:9000/oRqoWNFTkSogf4JuQ122Ng
) in a correct state? Submitted
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Jan 22 11:35:18 harad.ics.muni.cz lb-test-job-registration.sh:
end
Jan 22 11:35:18 harad.ics.muni.cz lb-test-https.sh:
start
Testing if all binaries are available
done
Testing credentials
done
Getting proxy cert path... /tmp/x509up_u62210
done
Registering testing job
done
Evaluating job list...
done
Looking up the test job...
done
Evaluating job status listing...
done
Checking for jobid (verifying content)...
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Registering notification glite-lb-notify new -j
https://delwin.fi.muni.cz:9000/ledtJUwkhfoDlS1-Ju_4bw
(
https://delwin.fi.muni.cz:9000/NOTIF:7jLNDE2PgQpmQ9wl2xVKDg
)
done
Evaluating notification status listing...
done
Checking for jobid (verifying content)...
done
Dropping the test notification (
https://delwin.fi.muni.cz:9000/NOTIF:7jLNDE2PgQpmQ9wl2xVKDg)  
;
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Jan 22 11:35:19 harad.ics.muni.cz lb-test-https.sh:
end
Jan 22 11:35:19 harad.ics.muni.cz lb-test-job-states.sh:
start
Testing if all binaries are available
done
Testing credentials
done
Registering testing job (
https://delwin.fi.muni.cz:9000/Iki84AQPI7pfgeUVqp6zVw)  
;
done
Testing job is in state: Submitted (should be Submitted)
done
logging Accepted
done
Testing job is in state: Waiting (should be Waiting)
done
logging
EnQueued done
Testing job is in state: Waiting (should be Waiting)
done
logging
DeQueued done
Testing job is in state: Waiting (should be Waiting)
done
logging
HelperCall done
Testing job is in state: Waiting (should be Waiting)
done
logging Match
done
Testing job is in state: Waiting (should be Waiting)
done
logging
HelperReturn done
Testing job is in state: Waiting (should be Waiting)
done
logging
EnQueued done
Testing job is in state: Ready (should be Ready)
done
logging
DeQueued done
Testing job is in state: Ready (should be Ready)
done
logging Transfer
done
Testing job is in state: Ready (should be Ready)
done
logging Accepted
done
Testing job is in state: Ready (should be Ready)
done
logging Transfer
done
Testing job is in state: Scheduled (should be Scheduled)
done
logging Running
done
Testing job is in state: Running (should be Running)
done
logging Done
done
Testing job is in state: Done (should be Done)
done
logging Clear
done
Testing job is in state: Cleared (should be Cleared)
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Jan 22 11:37:42 harad.ics.muni.cz lb-test-job-states.sh:
end
Jan 22 11:37:42 harad.ics.muni.cz lb-test-notif-recovery.sh:
start
Testing if all binaries are available
done
Testing credentials
done
Registering testing job (
https://delwin.fi.muni.cz:9000/VFUYg1PPVo2-qr0tZYukfA
)
done
Registering notification (
https://delwin.fi.muni.cz:9000/NOTIF:15fi0t3qxMV7zDHw_AeXrg
)
done
Logging events resulting in DONE state
notification is valid until: '2010-01-22 11:37:55 UTC' (1264160275)
./lb-test-notif-recovery.sh: line 141: kill: (21446) - No such process
Notifications were delivered
done
Dropping the test notification (
https://delwin.fi.muni.cz:9000/NOTIF:15fi0t3qxMV7zDHw_AeXrg)  
;
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Jan 22 11:38:05 harad.ics.muni.cz lb-test-notif-recovery.sh:
end
Jan 22 11:38:05 harad.ics.muni.cz lb-test-notif.sh:
start
Testing if all binaries are available
done
Testing credentials
done
Registering testing job (
https://delwin.fi.muni.cz:9000/7A2gVPMu3iWGloISWx0ZCQ
)
done
Registering notification (
https://delwin.fi.muni.cz:9000/NOTIF:eIaUGyIOfydgnG6rfr5Fkw
)
done
Logging events resulting in DONE state
notification is valid until: '2010-01-22 11:38:05 UTC' (1264160285)
./lb-test-notif.sh: line 140: kill: (21507) - No such process
Notifications were delivered
done
Dropping the test notification (
https://delwin.fi.muni.cz:9000/NOTIF:eIaUGyIOfydgnG6rfr5Fkw)  
;
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Jan 22 11:38:17 harad.ics.muni.cz lb-test-notif.sh:
end
Jan 22 11:38:20 harad.ics.muni.cz lb-test-notif-switch.sh:
start
Testing if all binaries are available
done
Testing credentials
done
Registering testing jobs
old:
https://delwin.fi.muni.cz:9000/eTJrQ5rv0hMWOFNkoHCWuQ
new:https://delwin.fi.muni.cz:9000/opew_vMIi_I7vTKsjWfgcA
done
Registering notification (
https://delwin.fi.muni.cz:9000/NOTIF:HhTzC7X9Wq-eTR_cS9RtOg
)
done
Logging events resulting in RUNNING state
notification is valid until: '2010-01-22 11:38:20 UTC' (1264160300)
./lb-test-notif-switch.sh: line 141: kill: (21599) - No such process
Notifications were delivered
done
Changing notification ... glite-lb-notify change returned OK
done
Logging events resulting in DONE state for both jobs
notification is valid until: '2010-01-22 11:38:32 UTC' (1264160312)
./lb-test-notif-switch.sh: line 176: kill: (21656) - No such process
Notifications for the old job were not delivered
done
Notifications for the new job were delivered
done
Dropping the test notification (
https://delwin.fi.muni.cz:9000/NOTIF:HhTzC7X9Wq-eTR_cS9RtOg)  
;
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Jan 22 11:38:59 harad.ics.muni.cz lb-test-notif-switch.sh:
end
Jan 22 11:38:59 harad.ics.muni.cz lb-test-ws.sh:
start
Testing if all binaries are available
done
Testing credentials
done
Registering testing job
done
WS interface to query: delwin.fi.muni.cz:9003
Has the job (
https://delwin.fi.muni.cz:9000/DOezOMvk-xw55fjGyuXtAQ
) been submitted?
done
Is it possible to retrieve events?
done
Purging test job (Trying the best, result will not be tested)
Error running the edg_wll_Purge().
Operation not permitted (only superusers may purge)
The jobs were not dumped.
Jan 22 11:39:00 harad.ics.muni.cz lb-test-ws.sh:
end
In the Wild—Real-World Test
* tests separated to smaller pieces and using not only local resources (overloaded our CE)
1) simple job, done
ui1.egee.cesnet.cz$ ./lb-test-wild.sh -n 1 -f html --test done delwin.fi.muni.cz:9000
Jan 28 13:59:41 ui1.egee.cesnet.cz lb-test-wild.sh:
start
[wild] submit (done test):
https://delwin.fi.muni.cz:9000/CqU7Hez1sa8lflWmIVSUiA
[wild] sleep before cancel...
[wild] submitted
done
[wild] ================================
[wild] 2010-01-28 13:59:57
https://delwin.fi.muni.cz:9000/CqU7Hez1sa8lflWmIVSUiA
Ready (done test)
[wild] 2010-01-28 14:00:17
https://delwin.fi.muni.cz:9000/CqU7Hez1sa8lflWmIVSUiA
Scheduled (done test)
[wild] 2010-01-28 14:00:37
https://delwin.fi.muni.cz:9000/CqU7Hez1sa8lflWmIVSUiA
Running (done test)
[wild] 2010-01-28 14:03:39
https://delwin.fi.muni.cz:9000/CqU7Hez1sa8lflWmIVSUiA
Done (done test)
[wild] all jobs finished
done
[wild] ================================
[wild]
https://delwin.fi.muni.cz:9000/CqU7Hez1sa8lflWmIVSUiA
: 'Done' OK (done)
done
[wild] components:
JobController LogMonitor LRMS
NetworkServer WorkloadManager OK
done
[wild] ================================
[wild] job output test
[wild] fetching output from
https://delwin.fi.muni.cz:9000/CqU7Hez1sa8lflWmIVSUiA
[wild] output of 'https://delwin.fi.muni.cz:9000/CqU7Hez1sa8lflWmIVSUiA' fetched
done
[wild] 2010-01-28 14:04:01
https://delwin.fi.muni.cz:9000/CqU7Hez1sa8lflWmIVSUiA
Cleared (done test)
[wild] all jobs in done cleared
done
Jan 28 14:04:11 ui1.egee.cesnet.cz lb-test-wild.sh:
end
2) simple job, cancel
ui1.egee.cesnet.cz$ ./lb-test-wild.sh -n 1 -f html --test cancel delwin.fi.muni.cz:9000
Jan 28 14:07:26 ui1.egee.cesnet.cz lb-test-wild.sh:
start
[wild] submit (done test):
https://delwin.fi.muni.cz:9000/mJ5ckPYNWPHv4FLxtfiKKQ
[wild] sleep before cancel...
[wild] cancel
https://delwin.fi.muni.cz:9000/mJ5ckPYNWPHv4FLxtfiKKQ
[wild] submitted
done
[wild] ================================
[wild] 2010-01-28 14:07:40
https://delwin.fi.muni.cz:9000/mJ5ckPYNWPHv4FLxtfiKKQ
Ready (cancel test)
[wild] 2010-01-28 14:08:00
https://delwin.fi.muni.cz:9000/mJ5ckPYNWPHv4FLxtfiKKQ
Cancelled (cancel test)
[wild] all jobs finished
done
[wild] ================================
[wild]
https://delwin.fi.muni.cz:9000/mJ5ckPYNWPHv4FLxtfiKKQ
: 'Cancelled' OK (cancel)
done
[wild] components:
JobController LogMonitor NetworkServer WorkloadManager OK
done
[wild] ================================
[wild] job output test
[wild] all jobs in done cleared
done
Jan 28 14:08:30 ui1.egee.cesnet.cz lb-test-wild.sh:
end
3) simple job, fail
ui1.egee.cesnet.cz$ ./lb-test-wild.sh -n 1 -f html --test fail -w delwin.fi.muni.cz:9000
Jan 28 16:14:59 ui1.egee.cesnet.cz lb-test-wild.sh:
start
[wild] submit (fail test):
https://delwin.fi.muni.cz:9000/mIfX-9qNlgmel_dzOydXOA
[wild] sleep before cancel...
[wild] submitted
done
[wild] ================================
[wild] 2010-01-28 16:15:10
https://delwin.fi.muni.cz:9000/mIfX-9qNlgmel_dzOydXOA
Ready (fail test)
[wild] 2010-01-28 16:15:31
https://delwin.fi.muni.cz:9000/mIfX-9qNlgmel_dzOydXOA
Scheduled (fail test)
[wild] 2010-01-28 16:18:32
https://delwin.fi.muni.cz:9000/mIfX-9qNlgmel_dzOydXOA
Running (fail test)
[wild] 2010-01-28 16:43:27
https://delwin.fi.muni.cz:9000/mIfX-9qNlgmel_dzOydXOA
Aborted (fail test)
[wild] all jobs finished
done
[wild] ================================
[wild]
https://delwin.fi.muni.cz:9000/mIfX-9qNlgmel_dzOydXOA
: 'Aborted' OK (fail)
done
[wild] components:
JobController LogMonitor LRMS
NetworkServer WorkloadManager OK
done
[wild] ================================
[wild] job output test
[wild] all jobs in done cleared
done
Jan 28 16:43:57 ui1.egee.cesnet.cz lb-test-wild.sh:
end
4) collection, done
ui1.egee.cesnet.cz$ ./lb-test-wild.sh -n 1 -f html --test done_coll -w delwin.fi.muni.cz:9000
Jan 28 14:09:53 ui1.egee.cesnet.cz lb-test-wild.sh:
start
[wild] submit (collection to done test):
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
[wild] sleep before cancel...
[wild] submitted
done
[wild] ================================
[wild] 2010-01-28 14:10:09
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
Waiting (done_coll test)
[wild] 2010-01-28 14:10:29
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
Running (done_coll test)
[wild] 2010-01-28 14:39:49
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
Done (done_coll test)
[wild] all jobs finished
done
[wild] ================================
[wild]
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
: 'Done' OK (done_coll)
done
[wild] components: LBServer
NetworkServer WorkloadManager OK
done
[wild] ================================
[wild] job output test
[wild] fetching output from
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
[wild] output of 1. offspring of
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
fetched
done
[wild] output of 2. offspring of
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
fetched
done
[wild] output of 3. offspring of
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
fetched
done
[wild] output of 4. offspring of
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
fetched
done
[wild] output of 5. offspring of
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
fetched
done
[wild] 2010-01-28 14:40:31
https://delwin.fi.muni.cz:9000/I-wxltHnXaYD_FnpOUKGjg
Cleared (done_coll test)
[wild] all jobs in done cleared
done
Jan 28 14:40:41 ui1.egee.cesnet.cz lb-test-wild.sh:
end
5) collection, cancelled by user
ui1.egee.cesnet.cz$ ./lb-test-wild.sh -n 1 -f html --test cancel_coll delwin.fi.muni.cz:9000
Jan 28 16:35:36 ui1.egee.cesnet.cz lb-test-wild.sh:
start
[wild] submit (collection to done test):
https://delwin.fi.muni.cz:9000/Lz84sTpWDpHXmXgHtOmbvQ
[wild] sleep before cancel...
[wild] cancel
https://delwin.fi.muni.cz:9000/Lz84sTpWDpHXmXgHtOmbvQ
[wild] submitted
done
[wild] ================================
[wild] 2010-01-28 16:35:49
https://delwin.fi.muni.cz:9000/Lz84sTpWDpHXmXgHtOmbvQ
Waiting (cancel_coll test)
[wild] 2010-01-28 16:36:09
https://delwin.fi.muni.cz:9000/Lz84sTpWDpHXmXgHtOmbvQ
Cancelled (cancel_coll test)
[wild] all jobs finished
done
[wild] ================================
[wild]
https://delwin.fi.muni.cz:9000/Lz84sTpWDpHXmXgHtOmbvQ
: 'Cancelled' OK (cancel_coll)
done
[wild] components:
NetworkServer WorkloadManager OK
done
[wild] ================================
[wild] job output test
[wild] all jobs in done cleared
done
Jan 28 16:36:40 ui1.egee.cesnet.cz lb-test-wild.sh:
end
6) collection, fail
ui1.egee.cesnet.cz$ ./lb-test-wild.sh -n 1 -f html --test fail_coll -w delwin.fi.muni.cz:9000
Jan 28 17:25:31 ui1.egee.cesnet.cz lb-test-wild.sh:
start
[wild] submit (collection to fail test):
https://delwin.fi.muni.cz:9000/UeEmZP3ckt8Hph5MShwGEw
[wild] sleep before cancel...
[wild] submitted
done
[wild] ================================
[wild] 2010-01-28 17:25:48
https://delwin.fi.muni.cz:9000/UeEmZP3ckt8Hph5MShwGEw
Waiting (fail_coll test)
[wild] 2010-01-28 17:26:09
https://delwin.fi.muni.cz:9000/UeEmZP3ckt8Hph5MShwGEw
Running (fail_coll test)
[wild] 2010-01-28 18:00:38
https://delwin.fi.muni.cz:9000/UeEmZP3ckt8Hph5MShwGEw
Aborted (fail_coll test)
[wild] all jobs finished
done
[wild] ================================
[wild]
https://delwin.fi.muni.cz:9000/UeEmZP3ckt8Hph5MShwGEw
: 'Aborted' OK (fail_coll)
done
[wild] components: LBServer
NetworkServer WorkloadManager OK
done
[wild] ================================
[wild] job output test
[wild] all jobs in done cleared
done
Jan 28 18:01:09 ui1.egee.cesnet.cz lb-test-wild.sh:
end
Server Purge Test (Requiring LB Server Admin Privileges)
** Hey, purging the whole database...The jobs were not dumped.
The following jobs were purged:
https://delwin.fi.muni.cz:9000/l6mYzKtrmaxgaIxz1q_ZUA
[1;32m[ done ]
[0m** Logging test jobs
** So far so good [1;32m[ done ]
[0m** sleeping 60 seconds...
** OK, another set of jobs[1;32m[ done ]
[0m** draining other 6 seconds ...
** test jobs:
aborted:
https://delwin.fi.muni.cz:9000/JpF_enDi3B5PfqUg_rDp3w
https://delwin.fi.muni.cz:9000/0cC4qky_HvHHElf9vx9ydw
cleared:
https://delwin.fi.muni.cz:9000/26pU9hfs9eXKLm-1n92O1A
https://delwin.fi.muni.cz:9000/0Jfp36-dWRJyn_vzc9pv6Q
cancelled:
https://delwin.fi.muni.cz:9000/ggTWzuKWAobvzQz_QKJgrA
https://delwin.fi.muni.cz:9000/UpvrMQe7SnB5fleXGpsgsQ
other:
https://delwin.fi.muni.cz:9000/TjgIYkc0Nv5gwTRu9vUhHQ
https://delwin.fi.muni.cz:9000/FD_rU-66bmq3QCkZb-UacA
** Dry run
30s aborted https://delwin.fi.muni.cz:9000/JpF_enDi3B5PfqUg_rDp3w [1;32m[ done ]
[0m0s aborted https://delwin.fi.muni.cz:9000/0cC4qky_HvHHElf9vx9ydw [1;32m[ done ]
[0m0s aborted https://delwin.fi.muni.cz:9000/JpF_enDi3B5PfqUg_rDp3w [1;32m[ done ]
[0m30s cleared https://delwin.fi.muni.cz:9000/26pU9hfs9eXKLm-1n92O1A [1;32m[ done ]
[0m0s cleared https://delwin.fi.muni.cz:9000/0Jfp36-dWRJyn_vzc9pv6Q [1;32m[ done ]
[0m0s cleared https://delwin.fi.muni.cz:9000/26pU9hfs9eXKLm-1n92O1A [1;32m[ done ]
[0m30s cancelled https://delwin.fi.muni.cz:9000/ggTWzuKWAobvzQz_QKJgrA [1;32m[ done ]
[0m0s cancelled https://delwin.fi.muni.cz:9000/UpvrMQe7SnB5fleXGpsgsQ [1;32m[ done ]
[0m0s cancelled https://delwin.fi.muni.cz:9000/ggTWzuKWAobvzQz_QKJgrA [1;32m[ done ]
[0m30s other https://delwin.fi.muni.cz:9000/TjgIYkc0Nv5gwTRu9vUhHQ [1;32m[ done ]
[0m0s other https://delwin.fi.muni.cz:9000/FD_rU-66bmq3QCkZb-UacA [1;32m[ done ]
[0m0s other https://delwin.fi.muni.cz:9000/TjgIYkc0Nv5gwTRu9vUhHQ [1;32m[ done ]
[0m** Server defaults
Nothing purged as expected [1;32m[ done ]
[0m** Purge the first set of jobs
diff OK [1;32m[ done ]
[0m** Purge the rest
diff OK [1;32m[ done ]
[0m** Anything left?
No, OK [1;32m[ done ]
[0m** Check zombies
https://delwin.fi.muni.cz:9000/TjgIYkc0Nv5gwTRu9vUhHQ Purged [1;32m[ done ]
[0mhttps://delwin.fi.muni.cz:9000/JpF_enDi3B5PfqUg_rDp3w Purged [1;32m[ done ]
[0mhttps://delwin.fi.muni.cz:9000/ggTWzuKWAobvzQz_QKJgrA Purged [1;32m[ done ]
[0mhttps://delwin.fi.muni.cz:9000/26pU9hfs9eXKLm-1n92O1A Purged [1;32m[ done ]
[0mhttps://delwin.fi.muni.cz:9000/FD_rU-66bmq3QCkZb-UacA Purged [1;32m[ done ]
[0mhttps://delwin.fi.muni.cz:9000/0cC4qky_HvHHElf9vx9ydw Purged [1;32m[ done ]
[0mhttps://delwin.fi.muni.cz:9000/UpvrMQe7SnB5fleXGpsgsQ Purged [1;32m[ done ]
[0mhttps://delwin.fi.muni.cz:9000/0Jfp36-dWRJyn_vzc9pv6Q Purged [1;32m[ done ]
[0m
** All tests passed **[1;32m[ done ]
[0m
Review of Linked Bugs (manual regression tests)
The sandbox_retrieved is correctly displayed by glite-lb-job_status (checked glite-lb-client-4.0.3-1.sl5.x86_64).
The output of glite-wms-job-status should be checked by the UI developers/testers.
It is hard to reproduce the situation when the "server gone/lost" error occurs during a transaction. So instead of a regression test I checked the code and it seems to be correct.
A simple regression test of LB server from glite-lb-server-2.0.3-2.sl5.x86_64 RPM showed the following:
The output from LB HTML looks correct, however it is still not a w3c valid HTML. The initial problem of displaying full information in job status (default or unset values are now in grey) has been solved. The HTML validity has been added to the feature request list.
The glite-info-service-lbserver.conf.template from glite-info-provider-service-1.2.2-0.noarch RPM is correct and the version of the lb-server is now published correctly.
--
ZdenekSustr - 22-Jan-2010