---+!! <nop>Task/Job monitoring *Description of Dashboard API* %TOC% %STARTINCLUDE% ---++ List of requirements ---++ Development server Task monitoring dashboard development server launched on http://pcadc01.cern.ch/ ---++ Front page: List of users List of users, who run jobs during last month. URL :http://pcadc01.cern.ch/client/index.html *Action name:* gangataskmonitoring *URL request:* http://pcadc01.cern.ch/dashboard/request.py/gangataskmonitoring *Output JSON object format:* {"basicData": [[{"GridName": "UserName1"},... {"GridName": "UserNameN"}]]} ---++ List of tasks List of tasks for defined user during given time period (from-to) OR timerange *Action name:* gangataskstable *Parameters:* * * usergridname, * timerange OR from , to time period in format * typeofrequest=A (currently mandatory) Example: * gangataskstable?usergridname=%22KonstantinosKousouris%22&from=2010-04-02%2018:20&to=2010-04-07%2012:30&typeofrequest=A *JSON output example:* <pre>{"user_taskstable": [{"Executable": "cmsRun", "UNKNOWN": 239, "SubmissionType": "direct", "Application": "CMSSW", "NUMOFJOBS": 264, "TargetCE": "15_Selected_SE", "SubmissionTool": "crab", "SubmissionUI": "T1_US_FNAL", "PENDING": 0, "TASKMONID": "kkousour_crab_0_100411_030027_1mv54a", "TaskType": "analysis", "ApplicationVersion": "CMSSW_3_5_6", "TaskId": 3338662, "SUCCESS": 0, "TaskCreatedTimeStamp": "2010-04-11 10:08", "SchedulerName": "LOCALFNAL", "TaskMonitorId": "kkousour_crab_0_100411_030027_1mv54a", "FAILED": 25, "RUNNING": 0, "NEventsPerJob": 35761, "InputCollection": "/MinimumBias/Commissioning10-PromptReco-v8/RECO", "SubToolVersion": "2.7.1"}]}</pre> ---++ Jobs of chosen tasks *Action name:* gangataskjobs *URL request:* *Parameters:* <blockquote> * taskmonitorid; * what. Could be *:* * all<strong> -</strong> Displays all job states<strong>;</strong> * P - pending; * R - running; * U - unknown; * S - successful; * F - failed; </blockquote> Example: gangataskjobs?taskmonid=kkousour_crab_0_100411_030027_1mv54a&what=all *Output JSON object format example for _what = all_ parameter :* <pre>{"taskjobs": [[{"STATUS": "U", "resubmissions": 1, "EventRange": "1", "started": "2010-04-11 08:03", "GridEndId": "U", "AppGenericStatusReasonValue": "Error return without specification", "finished": "2010-04-12 08:03", "submitted": "2010-04-11 08:03", "Site": "T3_US_FNALLPC", "TaskJobId": 153301646, "JobExecExitCode": null, "SchedulerJobId": "https://cmslpc16.fnal.gov/be0adc69f8cfa9f7a184ad7ce27dd2b2c81c68fa/1", "GridEndReason": "unknown"}], , {"username": "\"KonstantinosKousouris\"", "what": "ALL", "taskmonid": "kkousour_crab_0_100411_030027_1mv54a"}]} </pre> ---+++ Page details in case of parameter _what=all_ ---++++ Plots * Terminated Jobs by Site ( grouping by "Site" and "STATUS" values); * Graphical Overvew (used status values(# running, #pending, etc ) from Page2 for appropriate task; * Successful Jobs Distributed by Site ( grouping by sites with :success" status); * Processed Events Cumulative Plot. * *Action name:* proceventscumulativeAlt * *URL request:* * *Parameters:* * taskmonid * JSON output: {"totaljobs": [[{"TOTAL": #jobs}], {"taskmonid": "taskmonid"}], "procevents": [[{"NEventsPerJob": #events}], {"taskmonid": "taskmonid"}], "succjobs": [[{"TOTAL": #jobs, "TOTALEVENTS": #total events}], {"taskmonid": "andersj_pip_e_1_10_5o7w2r"}], "meta": { ... }, "allfinished": [[ {"finished": Timestamp, "Events": #events}, ...}], "lastfinished": [[{"finished": Timestamp}], {"taskmonid": taskmonid}], "firststarted": [[{"started": Timestamp}], {"taskmonid": taskmonid}]} ---++++ Jobs' table | *Column name* | *Key* | | SchedulerJobId | SchedulerJobId | | Id in Task | EventRange | | Status | STATUS;<br /> * P - pending;<br /> * R - running;<br /> * U - unknown;<br /> * S - successful;<br /> * F - failed; | | Appl Exit Code | <p>Check STATUS, if "P" or "R" - display "Not yet"</p> <p>JobExecExitCode >-1?JobExecExitCode : Unknown</p> <p>toolTipText:</p> <!-- @page { size: 8.27in 11.69in; margin: 0.79in } P { margin-bottom: 0.08in } --> <pre>AppGenericStatusReasonValue</pre> | | Grid End Status | <p>GridEndId</p> <p>toolTipText: GridEndReason</p> | | Retries | resubmissions | | Site | Site | | Submitted | submitted | | Started | started | | Finished | finished | *JSON output example for _what = F_ parameter :* <pre>{"taskjobs": [[{"JobExitReason": " Output file(s) not found", "resubmissions": 1, "EventRange": "23", "started": "2010-04-11 08:03", "GridEndId": "U", "finished": "2010-04-11 08:26", "submitted": "2010-04-11 08:03", "Site": "T3_US_FNALLPC", "TaskJobId": 153301734, "AppStatusReason": "unknown", "JobExitCode": 60302, "SchedulerJobId": "https://cmslpc16.fnal.gov/be0adc69f8cfa9f7a184ad7ce27dd2b2c81c68fa/23", "GridEndReason": "unknown"}], {"username": "\"KonstantinosKousouris\"", "what": "F", "taskmonid": "kkousour_crab_0_100411_030027_1mv54a"}]}</pre> ---++ Resubmitted jobs *Action name:* resubmittedjobsAtl *URL request:* *Parameters:* | *Colomn name* | *Key* | | JobExitReason | JobExitReason | | Id in Task | EventRange | | Appl Exit Code | JobExitCode | | Appl Exit Reason | AppStatusReason | | Grid End Status | GridEndId | | Site | Site | | Submitted | submitted | | Started | started | | Finished | finished | *Example*: Request : resubmittedjobsAtl?what=ALL&taskjobid=152060487&taskmonid=aproskur_crab_0_100406_145411_tl7n64 *JSON output example*: <pre>{"rsJobs": [{ "JobExitReason": "CMS exception (CMSSW)", "EventRange": "1", "started": "2010-04-06 13:00:05", "GridEndId": "D", "Site": "T2_UK_London_Brunel", "submitted": "2010-04-06 12:55:10", "finished": "2010-04-06 13:10:06", "AppStatusReason": "unknown", "JobExitCode": 8001, "SchedulerJobId":"https://wms218.cern.ch:9000/p4J3bxUEvPlLICjIXb1olg", "GridEndReason": "unknown"}, ...}]}</pre> -- Main.LauraSargsyan - 27-Apr-2010
This topic: ArdaGrid
>
WebHome
>
Dashboard
>
TaskJobMonitoring
Topic revision: r12 - 2011-11-09 - LauraSargsyan
Copyright &© 2008-2022 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use
Discourse
or
Send feedback