Procedure to investigate production jobs

Input and Output LFNs of job(s)

To get information about jobs Input and Output LFNs:

dirac-lhcb-production-job-lfn jobid [jobid2 ...]

for example:

dirac-lhcb-production-job-lfn 22160

2008-09-05 18:11:49 UTC dirac-lhcb-production-job-lfn/DiracAPI INFO: Files retrieved 
and extracted in /afs/cern.ch/user/r/roma/Dirac/InputSandbox22160
OutputLFN: ['/lhcb/data/DST/00003028/0000/00003028_00000073_1.rdst']
InputLFN: ['/lhcb/data/CCRC08/RAW/LHCb/CCRC/28933/028933_0000064834.raw']

The result of this command is the list of output and input LFNs from job(s). LFNs from lists can be investigated with dms commands (dirac-dms-lfn-logging-info, dirac-dms-lfn-metadata, dirac-dms-lfn-replicas)

Checking Input and Output LFNs of job(s)

To check jobs Input and Output LFNs:

dirac-lhcb-production-job-lfn-check jobid [jobid2 ...]

for example:

dirac-lhcb-production-job-lfn-check 22160

LFC:
Successful:
/lhcb/data/DST/00003028/0000/00003028_00000073_1.rdst:
{'CNAF_M-DST':'srm://storm-fe-lhcb.cr.cnaf.infn.it/t1d1/lhcb/data/
DST/00003028/0000/00003028_00000073_1.rdst'}
Bookkeping:
/lhcb/data/DST/00003028/0000/00003028_00000073_1.rdst : True
ProductionDB for production 3028
{'TargetSE': 'Unknown', 'TransformationStatus': 'Stopped', 'JobStatus': 'Done',
 'JobID': '73', 'UsedSE': 'CNAF-RAW', 'FileStatus': 'Processed'}

This command gets the lists of output and input LFNs from the job. Then it checks each output LFN for presence in Local File Catalog and in Bookkeeping. Each input LFN is tested for correct information in ProductionDB.

Checking Input and Output LFNs of selected jobs

To check the input and output LFNs from the selected jobs:

dirac-lhcb-production-job-select-check options

Possible options can be listed with help of --help option. Command options are the same as dirac-wms-select-jobs, except for two additional options (sea later).

For example:

dirac-lhcb-production-job-select-check --JobGroup=00003029 --Date=2008-04-30

23781, 23782, 23783, 23784, 23785, 23786, 23787, 23788,
 23789, 23790, 23791, 23792, 23793, 23794
OK: 0 job(s)
[]
Problem with LFC: 14 job(s)
[23781, 23782, 23783, 23784, 23785, 23786, 23787, 23788, 
23789, 23790, 23791, 23792, 23793, 23794]
Problem with Bookkeeping: 14 job(s)
[23781, 23782, 23783, 23784, 23785, 23786, 23787, 23788, 
23789, 23790, 23791, 23792, 23793, 23794]
Problem with ProductionDB: 8 job(s)
[23783, 23784, 23785, 23786, 23787, 23788, 23793, 23794]

This command selects jobs with selection criteria defined by options. Then it does the same as the previous command and counts the number of jobs with full information in all DB and jobs which have a problem in the LFC, Bookkeeping and ProductionDB.

For test one job, options --JobID=jobid can be used. In this case other selection options are not used. Option --Verbose adds print for each job and this command becomes very close to previous command.

-- VladimirRomanovskiy - 05 Sep 2008

Edit | Attach | Watch | Print version | History: r4 < r3 < r2 < r1 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r4 - 2008-09-09 - VladimirRomanovskiy
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    LHCb All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2019 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback