Current status for Alice metrics

Currently the following metrics are provided:

  • For the job_processing activity: parallel_jobs, completed_jobs, successfully_completed_jobs, CPU_time, wall_time, CPU_time_KSI2K, wall_time_KSI2K ok

  • The pledged values are given only for the wall_time, not for the other metrics. Isn't it available? Hasn't Alice defined some pledged values for other metrics like the number of running jobs?
    ANSWER (Costin on Nov 9th): There are no other pledged resources defined apart from the wall time expressed in KSI2k units. We don't have a target on parallel running jobs on each site, transfer rates or anything else similar.
ok

  • The status of the individual activity is not provided. ok, this is understood Costin already explained that the make an evaluation of the site on the basis of some specific tests. Then, they have a dump of this results, for each test, in this URL. In each line:
    VOBox,hostname,ip,alive,ALIEN_CE,ALIEN_SE,ALIEN_PackMan,ALIEN_Monitor,ALIEN_FTD,SAM_DPD,SAM_PM,SAM_PR,SAM_PSR,SAM_RBS,SAM_SA,SAM_UPR,SAM_WMS,SE_add,SE_ls,SE_get,SE_whereis,SE_rm
    Prague,goliasx31.farm.particle.cz,147.231.25.31,0,0,0,0,0,-1,-1,-1,-1,-1,-1,-1,-1,-1,2,2,2,2,2
    RAL,lcgvo0597.gridpp.rl.ac.uk,130.246.183.199,0,0,0,0,0,1,-1,-1,-1,-1,-1,-1,-1,-1,2,0,0,0,0
    RRC-KI,house.grid.kiae.ru,144.206.66.3,0,0,0,0,0,-1,-1,-1,-1,-1,-1,-1,-1,-1,0,0,0,0,0
    
where the number mean:
0=ok
1=err
2=warn
-1=n/a
So, in principle, we could compute the status of some activities on the basis of these numbers.
More information about these tests (Costin, Nov 9th): The tests in the other url are grouped in 3 categories:
- ALIEN_* are tests executed locally on the VoBox for each AliEn service that is supposed to run there
- SAM_* are SAM tests, taken from the XML dump provided by SAM; so for these one could also use the original SAM URL
- SE_* are remote SE availability tests (if there are several SEs per site thenall are aggregated under a single value)
ok, from Alice side it's clear the meaning of the tests Now it's up to us to decide if we want to compute a status

  • For the general activity (that is the overall site status) the status is provided ok

  • For the data transfer activity, no metrics yet. Will they be available in the future?
    ANSWER (Costin Nov 9th) Data transfers will be available in the future, when we will resume transferring data to T1s and we have some data I will update the export page to also include this data. ok. just wait

Some remarks:

I was wondering if it's normal to have so many sites idle, and also big sites with a very little activity about job processing. ANSWER: (Nov 9th) For now we don't run many jobs, actually the production is stopped and what we see are users who run some jobs from time to time. So it's ok to have most of the sites idle now.

-- ElisaLanciotti - 07 Nov 2008

Edit | Attach | Watch | Print version | History: r3 < r2 < r1 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r3 - 2020-08-30 - TWikiAdminUser
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    Sandbox/SandboxArchive All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2023 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback