* Make script to runcrabtask regardless TW, Crabserver (To be able test schedd)

* Make script for WmAgent check (Long going)

* Test WMCore match lfn fix (Ongoing need to be tested also with WmAgent)

* Test ASO, TW slc6

* Have new TW machine for dev (Marco agreed)

* Create patch for java file:// missing protocol

* Create CrabServer installation twiki

* Test Crab3 T2_US_Vanderbilt (Couldn't load 'lfs'. DSI activation failed.) an all T1 match_lfn

* Create LifeCycle for sitedb (started on lxplus)


* Discussions about LifeCycle for other services

* New TW for dev (slc6)

* Test all sites (need to check them)


* Schedd test of sleep succeeded (submittion. not running)

* Crab3 dev meeting

* Some fixes to TW

* Quick fix for sitedb


* Changed CAF to point to tesbed crabcache (Failure, reported!)

* Started to work vocms95 (hostcert problem) still testing vocms96

* Updated TW for prod (retry to True)

* Reached again limit of crabcache (600mb), but it was pointing to testbed ?! (Fixed by Marco)

* Deployed ASO and TW on slc6 (will have a new machine for tests)

* New Schedd works (second try after James fix)


* New TW deployment

* New schedd test (Still failure)

* Update documentation of TW (Added Banned site list)

* Update documentation of ASO (Added fix to prod list)

2014-04-14 -> 2014-04-16

* New TW

* Test the new schedd

* WMAgent end validation

* Meeting with HammerCloud people

* Ran opportunistic tests

* Updated rest configuration (banned out destination all T1 and added submit5 schedd to prod instance)

* CRAB Dev meeting


* Running simple jobs on crab3

* Simple crab test

* Tested new schedd (still have problems)

* Waiting for reply from Alison (about Oppurtunistic tests)

* Got new from gen input dataset, and start to test

* Testing Jose skimming dataset


* Found the bug with RFC proxy (all other test is for this about SSL Handshake)

* Investigated and found solution

* Ran WMAgent tests

* Removed all jobs from condor (too much failure)


* CRAB Dev meeting

* Testing Hammercloud

* Need to discuss about exceeding of vocms95

* Ran WmAgent tests

* Wrote mail to group about ssl handshake error (something bad is going with HM, how it creates a delegation)

* MonteCarloFromGEN json fix

* Wrote Seangchan about error on MonteCarloFromGEN


* Testing Hammercloud

* Got CRAB Server task to develop (discussed with Marco how to do it)

* Reported ticket to one site (have 4 more, which I need more information before answering to ticket) (Need to test valid site, and failure, compare) Took most of the time :/

* Talked with Maric about ASO deployment


* Testing Hammercloud (T1, T2) Found bug,reported, fixed by Daniele

* Patched WMAgent

* Found bug in crab get report (missing arg)

* Found bug in ASO DBSPublisher, reported to Hassen (restarted DBSPublisher)

* Wrote Seangchan about MontecarloFromGEN

* Still testing issue about SSLHandshake


* Updated ASO documentation (deployment of OpsProxy)

* Testing Hammercloud

* Testing new schedd (Failures to authentificate) Found bug reported to James

* Updated twiki of known condor commands

* New WMAgent problem with schedd

* CRAB3 documention update (condor commands, ASO, TW)


* ASO deployment

* Ran WMAgent test (2 json files still need to update)

* Talked to Ivan to change the list of used machines by me

* Talked with Andres and tested Hammercloud(Need to write letter)

* Reconfigured eos cleanup script

* Check keep load test

* Updated ASO documentation


* For tomorrow

--- Create twiki with phedex, dbs, couch calls, start the script creation (4)

* CRAB 3 Dev meeting

* Install new ASO on vocms31 (Need new host cert! waiting for Ivan) (Sertificate got, need to start installation)

* Talked with Alan about phedex, dbs, script and json templates

* Reviewed site allow to write, asked in the crabdev group

* Talked with Andres and Marco about hist task difference on monitoring

* Checked ASO replication status (Created again new replication)

* Created request to remove datasets from phedex

* Stopped EOS script, refactored it to start remove then reaching 800gb used quota


* Ran script to keep load on cmsweb-testbed

* Tried to test new Schedd, but without success, it is missing in poll (Trying to get new pool)

* Started to test each CMSSW with simple task witk 100 jobs

* Showed Andres the tests which doing with each site

* Checked each sites (T1 and T2) crab3, and checked hack of write allowance

* Updated JSON files in private repo for WmAgent tests

* Tested RPM creation, works as expected

* committed sitedb changes (Remove CE, fix remove button, print only ascii to logs)

* checked logs on T2_US_Florida and T1_US_FNAL (Andres letter title: jobs not resubmitted because excessive memory use )


* Fixed monitoring for ASO from cmsweb (Also left bug in ASO manage script)

* Tested WMCore fix for lfn2pfn (Working as expected, will be tested with Marco tomorrow for other sites)

* Talked with Diego about RPM creation, need to create rpm for sitedb (ONGOING) (FAILURE)

* ASO fixing for cmsweb (wrong configuration file)

* TaskWorker redeployment (vocms245 and vocms36 and vocms244) 3.3.5.rc2

* In the meeting announced about new Schedd (vocms96), talked with Marco to leave vocms20 only with testing machines, all other only with taskworkers vocms244, vocms245, vocms36

-- JustasBalcas - 31 Mar 2014

Edit | Attach | Watch | Print version | History: r35 < r34 < r33 < r32 < r31 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r35 - 2014-04-24 - JustasBalcas
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    Main All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback