cmssrv219

Agent tweaks

UPDATE wmbs_location SET state=(SELECT id from wmbs_location_state where name='Normal') WHERE state!=(SELECT id from wmbs_location_state where name='Normal');
UPDATE wmbs_location SET running_slots=2000, pending_slots=1000;
UPDATE rc_threshold SET max_slots=2000, pending_slots=1000;
  • Set maxRetries to 0 ==> OK
  • Run PhEDExFix ==> NOPE

Jobs in Condor

[cmsdataops@cmssrv219 current]$ condorq
[cmsdataops@cmssrv219 current]$ 

Jobs ordered by status

SQL> select wmbs_job_state.name, count(*)
from wmbs_job
join wmbs_job_state on (wmbs_job.state = wmbs_job_state.id)
group by wmbs_job.state, wmbs_job_state.name;
+----------+----------+
| name     | count(*) |
+----------+----------+
| cleanout |        2 |
+----------+----------+
1 row in set (0.00 sec)

Workflows in the System

MariaDB [wmagent]> SELECT DISTINCT name from wmbs_workflow;
+-------------------------------------------------------------------------+
| name                                                                    |
+-------------------------------------------------------------------------+
| jen_a_ACDC_HIG-RunIISpring16DR80-01778_00601_v0__160818_110237_9792     |  aborted
| prozober_ACDC_task_HIG-RunIISpring16DR80-01626__v1_T_160808_121300_6940 |  announced
+-------------------------------------------------------------------------+
2 rows in set (0.00 sec)

Workflows not fully injected

MariaDB [wmagent]> select distinct name from wmbs_workflow where injected = 0;     
Empty set (0.00 sec)

Subscriptions not finished

MariaDB [wmagent]> select distinct wmbs_workflow.name AS wfName
   FROM wmbs_subscription
   INNER JOIN wmbs_fileset ON wmbs_subscription.fileset = wmbs_fileset.id
   INNER JOIN wmbs_workflow ON wmbs_workflow.id = wmbs_subscription.workflow
   where wmbs_subscription.finished = 0 ORDER BY wmbs_workflow.name;
Empty set (0.00 sec)

Files available in WMBS (waiting for job creation)

SQL> SELECT wmbs_workflow.name, wmbs_sub_files_available.subscription, count(wmbs_sub_files_available.fileid)
  FROM wmbs_sub_files_available
  INNER JOIN wmbs_subscription ON wmbs_sub_files_available.subscription = wmbs_subscription.id
  INNER JOIN wmbs_workflow ON wmbs_subscription.workflow = wmbs_workflow.id
  GROUP BY wmbs_sub_files_available.subscription;
Empty set (0.00 sec)

Getting distinct workflow names with files available:

SQL> SELECT DISTINCT wmbs_workflow.name
  FROM wmbs_sub_files_available
  INNER JOIN wmbs_subscription ON wmbs_sub_files_available.subscription = wmbs_subscription.id
  INNER JOIN wmbs_workflow ON wmbs_subscription.workflow = wmbs_workflow.id
  GROUP BY wmbs_sub_files_available.subscription;
Empty set (0.00 sec)

Files acquired or acquired in WMBS (waiting for job to finish)

SQL> SELECT wmbs_workflow.name, wmbs_sub_files_acquired.subscription, count(wmbs_sub_files_acquired.fileid)
  FROM wmbs_sub_files_acquired
  INNER JOIN wmbs_subscription ON wmbs_sub_files_acquired.subscription = wmbs_subscription.id
  INNER JOIN wmbs_workflow ON wmbs_subscription.workflow = wmbs_workflow.id
  GROUP BY wmbs_sub_files_acquired.subscription;
Empty set (0.00 sec)

Getting distinct workflow names with files acquired:

SQL> SELECT DISTINCT wmbs_workflow.name
  FROM wmbs_sub_files_acquired
  INNER JOIN wmbs_subscription ON wmbs_sub_files_acquired.subscription = wmbs_subscription.id
  INNER JOIN wmbs_workflow ON wmbs_subscription.workflow = wmbs_workflow.id
  GROUP BY wmbs_sub_files_acquired.subscription;
Empty set (0.00 sec)

Files and Blocks in Phedex and DBS

Blocks open in DBS

SQL> SELECT * FROM dbsbuffer_block WHERE status!='Closed';
Empty set (0.02 sec)

Files not updated DBS

SQL> SELECT * from dbsbuffer_file where status = 'NOTUPLOADED';
Empty set (0.02 sec)

Files not injected in Phedex, with parent block id (can be recovered)

SQL> SELECT * FROM dbsbuffer_file
WHERE in_phedex=0
AND block_id IS NOT NULL
AND lfn NOT LIKE '%unmerged%'
AND lfn NOT LIKE 'MCFakeFile%'
AND lfn NOT LIKE '%BACKFILL%'
AND lfn NOT LIKE '/store/user%';
Empty set (0.24 sec)

Files not in phedex without parent block id (cannot be recovered) Possible input files.

SQL> SELECT count(*) FROM dbsbuffer_file
WHERE in_phedex=0
AND block_id IS NULL
AND lfn NOT LIKE '%unmerged%' 
AND lfn NOT LIKE 'MCFakeFile%'
AND lfn NOT LIKE '%BACKFILL%'
AND lfn NOT LIKE '/store/backfill%'
AND lfn NOT LIKE '/store/user%';
+----------+
| count(*) |
+----------+
|    66414 |
+----------+
1 row in set (0.11 sec)

So we run fix Phedex to update the files not in phedex

[cmsdataops@cmssrv219 current]$ curl https://raw.githubusercontent.com/amaltaro/ProductionTools/master/newFixPhEDEx.py > newFixPhedex.py
[cmsdataops@cmssrv219 current]$ source /data/srv/wmagent/current/apps/wmagent/etc/profile.d/init.sh
[cmsdataops@cmssrv219 current]$ python newFixPhedex.py 
Shutting down PhEDExInjector...
Checking 445 dataset in both PhEDEx and DBS ...
...

And we check afterwards

SQL> SELECT lfn FROM dbsbuffer_file
WHERE in_phedex=0
AND block_id IS NULL
AND lfn NOT LIKE '%unmerged%' 
AND lfn NOT LIKE 'MCFakeFile%'
AND lfn NOT LIKE '%BACKFILL%'
AND lfn NOT LIKE '/store/user%';
+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| lfn                                                                                                                                                                                                    |
+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| /store/mc/RunIISpring16DR80/WprimeToWZToWlepZhad_narrow_M-4000_13TeV-madgraph/RAWAODSIM/PUSpring16RAWAODSIM_80X_mcRun2_asymptotic_2016_v3-v2/00000/7EABE310-3222-E611-9638-02163E00AD57.root           |
| /store/mc/RunIISpring16DR80/WprimeToWZToWlepZhad_narrow_M-4000_13TeV-madgraph/RAWAODSIM/PUSpring16RAWAODSIM_80X_mcRun2_asymptotic_2016_v3-v2/00000/705AF243-CB23-E611-A41E-02163E011441.root           |
| /store/mc/RunIISpring16DR80/WprimeToWZToWlepZhad_narrow_M-4000_13TeV-madgraph/RAWAODSIM/PUSpring16RAWAODSIM_80X_mcRun2_asymptotic_2016_v3-v2/00000/A6C635C2-D023-E611-A348-02163E013E30.root           |
| /store/mc/RunIISpring16DR80/Graviton2PBToZZTo2L2Nu_width0_M-1200_13TeV-JHUgenV6-pythia8/RAWAODSIM/PUSpring16RAWAODSIM_80X_mcRun2_asymptotic_2016_v3-v1/60000/A079EA53-6622-E611-AF8D-2C600C6E9883.root |
| /store/mc/RunIISpring16DR80/Graviton2PBToZZTo2L2Nu_width0_M-1200_13TeV-JHUgenV6-pythia8/RAWAODSIM/PUSpring16RAWAODSIM_80X_mcRun2_asymptotic_2016_v3-v1/60000/A8D7238F-6322-E611-84F4-FA163EE5E8A4.root |
| /store/mc/RunIISpring16DR80/GluGluH2JetsToTauTau_M125_13TeV_CPmixing_maxmix_JHU/RAWAODSIM/PUSpring16RAWAODSIM_80X_mcRun2_asymptotic_2016_v3-v1/70000/0CD676D7-6F18-E611-B8F0-0025905A6126.root         |
+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
6 rows in set (0.15 sec)

all input dataset.

Agent is READY to go.

Edit | Attach | Watch | Print version | History: r32 < r31 < r30 < r29 < r28 | Backlinks | Raw View | Raw edit | More topic actions...
Topic revision: r31 - 2016-09-16 - AlanMalta
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    CMSPublic All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback