WMAgent end to end Validation Tests for HG1403 cmsweb upgrade

Upgrade schedule

  • 24 Feb: release candidate RPMs due for pre-prod deployment * deadline for requests *
  • 25 Feb: cmsweb-testbed pre-prod release candidate deployment
  • 16 Mar: validation results due * deadline for validation *
  • 18 Mar (Tuesday): production deployment

Validation results trac ticket
Release changes trac ticket
HG1403_milestone

Versions tested

HG1403a

ReqMgr version 0.9.94
Global WQ version 0.9.94
WMStats version 0.9.94
WMAgent version used for the testing: v0.9.91

Release Notes

RequestManager
Global_WorkQueue
WMStats
WMAgent

Observed changes from previous versions

Tests

Test Tester Completed Status Comments
Bug fixes in WMAgent
Bug fixes / New features in WMStats
Show drain status of an Agent in WMStats 4552   WMA >= 0.9.79
remove stalled warning if there is job running 4750  
New features in WMAgent
Configure T1 disk sites overrride 4739   WMA >= 0.9.79
Specify required OS in Condor JDL 4698   WMA >= 0.9.79
Actively manage site lists in pending jobs 4603   WMA >= 0.9.79
LheInputFiles feature added to TaskChain requests 4871   WMA >= 0.9.9X
EventsPerLumi capability added to TaskChain requests 4872   WMA >= 0.9.9X
Bug fixes in ReqMgr
New features in ReqMgr
Priority: It has become a required parameter, it can only take values up to 1 million Alan 2013-09-10 FAIL The correct parameter is "RequestPriority" and it is not mandatory
Argument validation is stricter, in general the idea is that a parameter is either with a valid value or not present, dummy values will most likely fail validation Alan 2013-09-11 OK It does not accept ProcessingString and AcquisitionEra with empty strings/dicts
JobSplitting can now be specified at request creation. Use "SplittingAlgo" and other parameters for that Alan 2013-09-11 OK  
LheInputFiles feature added to TaskChain requests 4871   WMA >= 0.9.9X
EventsPerLumi capability added to TaskChain requests 4872   WMA >= 0.9.9X
StoreResult processing ???   I have no idea how to test it
Do not allow rejection of requests in "assigned" state 4976    
lexicon refactoring 4941  
Bug fixes in WorkQueue
lexicon refactoring 4941  
Standard workflows
Old request moved from completed to closed-out and announced      
Old request moved from completed to rejected      
Old request moved from assignment-approved to rejected      
MC workflow      
ReDigi workflow      
ACDC Resubmission (using acdcserver)      
ReReco+skim workflow      
LHE Step 0 workflow      
Abort acquired workflow      
Abort running workflow      
MC from GEN Workflow      
High Scale Test      
TaskChain: MC recycling Alan    
TaskChain: MC from scratch Alan    
TaskChain: FastSim workflow + event splitting Alan    
TaskChain: Data workflow Alan    
TaskChain: Pileup workflow by recycling Alan    
TaskChain: Pileup workflow from scratch Alan    
TaskChain: Pileup Pyquen workflow (PrimaryDataset override) Alan    
TaskChain: automatic harvesting Alan    
TaskChain: different ProcessingString per task Alan    
TaskChain: KeepOutput = False feature (single and cascade) Alan    
TaskChain: 'TransientOutputModules': ['RAWoutput'] and TransientOutputModules = ['RECOSIMoutput'] Alan    
TaskChain: ACDC via WMStats Alan    

Optional things to test

Test Tester Completed Status Comments
TaskChain: cascade "closed-out" and "announced" changes via script Alan    
Propagate Memory (RequestMemory in MB), Disk (RequestDisk in KB) and Job length (MaxWallTimeMins in minutes) estimates to Condor through the JDL #4472  
Apply smart error handling for jobs that failed due to high memory usage or excessive run time #4473      
ACDC creation in WMStats      
WMAgent
Robust merge jobs - add missing merge files to ACDC, proceed with existing files #4476      
!ReqMgr
WorkQueue
Fixed an issue that prevented the location of pileup of datasets to be updated after a workflow had been acquired #4629      
Fixed timeouts when connecting to the ReqMgr which prevented workflows from being acquired #4660      
Track pileup location and NOT fail out requests #3733 and #4507      

-- AlanMalta - 28 Feb 2014

Edit | Attach | Watch | Print version | History: r4 < r3 < r2 < r1 | Backlinks | Raw View | Raw edit | More topic actions...
Topic revision: r1 - 2014-02-28 - AlanMalta
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    Sandbox All webs login

  • Edit
  • Attach
This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback