SAM/GridView visualisation proposal

Use Cases

Service failure analysis

  • Purpose: check the overall status of all grid services to manually correlate failures.
  • Possible views (current status only):
    • high-level overview:
      • scope:
        • all sites
        • all services of a site
      • status: represented by coloured dots/boxes (green - all fine, orange - something wrong, red - completely unavailable)
    • one selected service type (current SAM portal):
      • scope: all sites
      • status:
        • simple - one box per service
        • detailed - one box per test (flexible test selection: critical or custom)
    • only failures: show all failing services
      • scope: all sites
      • more detailed view with failing tests
    • differential view: the difference in service statuses between two points in time (for example: now and 1 hour ago)
      • show only those services that changed
  • Required navigation:
    • define a VO perspective
    • fast switching between all services and one selected service type, fast switching the service type
    • fast switching between high-level view of sites (one dot/box per site) and services view (one dot/box per service grouped by site)
    • possibility to narrow down to given EGEE ROC(s) or LCG Tier (Tier-2 cloud?)
    • flexible and fast "show only failures" controller: selecting the threshold of severity level
    • links to the history view of given service or site (for site: availability history)
    • selecting a point in time for the differential view on availability graphs
  • Comments:
    • all views shown in one selected VO perspective
    • multi-VO overall view of sites may be considered but difficult to design
    • take the current SAM Portal and SEE-Grid SAM interface as example
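
The status colours, the "show only failures" threshold and the differential view described above can be sketched as follows. This is only a minimal illustration, not part of SAM or GridView: the aggregation rule (all red gives red, all green gives green, anything else gives orange), the severity ordering, and the site and service names (SITE-A, CE, SE) are assumptions made for the example.

```python
from enum import IntEnum


class Status(IntEnum):
    """Severity levels matching the proposed colours (ordering is an assumption)."""
    GREEN = 0   # all fine
    ORANGE = 1  # something wrong
    RED = 2     # completely unavailable


def aggregate(service_statuses):
    """Collapse per-service statuses into one site-level dot.

    Assumed rule: all red -> RED, all green -> GREEN, otherwise ORANGE.
    """
    statuses = list(service_statuses)
    if not statuses:
        raise ValueError("no services to aggregate")
    if all(s == Status.RED for s in statuses):
        return Status.RED
    if all(s == Status.GREEN for s in statuses):
        return Status.GREEN
    return Status.ORANGE


def only_failures(snapshot, threshold=Status.ORANGE):
    """Keep services whose severity is at or above the threshold."""
    return {key: s for key, s in snapshot.items() if s >= threshold}


def differential(before, after):
    """Return services whose status differs between two snapshots.

    Services present in only one snapshot are reported with None
    on the missing side.
    """
    changed = {}
    for key in before.keys() | after.keys():
        old, new = before.get(key), after.get(key)
        if old != new:
            changed[key] = (old, new)
    return changed


# Hypothetical snapshots keyed by (site, service type).
hour_ago = {
    ("SITE-A", "CE"): Status.GREEN,
    ("SITE-A", "SE"): Status.GREEN,
    ("SITE-B", "CE"): Status.RED,
}
now = {
    ("SITE-A", "CE"): Status.GREEN,
    ("SITE-A", "SE"): Status.ORANGE,
    ("SITE-B", "CE"): Status.RED,
}
```

With these snapshots, `differential(hour_ago, now)` reports only the SE at SITE-A, and `only_failures(now, Status.RED)` narrows the view to the CE at SITE-B. A real implementation would also need the VO perspective as an extra key or filter.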

Grid Operator (COD, ROC, PPS)

  • COD:
    • purpose:
      • get, understand and act on alarms
      • recognize correlations between failing services
    • views:
      • COD dashboard with embedded history bars from SAM/GridView
      • detailed history of single test execution (digging into test details)
      • overview of services status of a single site, or central services

  • ROC (in addition to COD):
    • purpose:
      • easy & quick overview of the status of sites in the ROC
      • drilling down into problems to the test level
    • views:
      • high-level view (sites in Region)
        • all sites
        • only failing sites
      • services status per site
        • all services
        • only failing services
      • test results for a particular service on a site
      • services status for all sites in the ROC (see SAM Portal latest view for a Region)
      • test results for one type of service in all sites in the ROC (see SAM Portal latest view for a Region)

  • all sites in a region (PPS):
    • purpose:
      • PPS status overview
      • CODs monitoring PPS sites
      • PPS SAM client (+ infrastructure) management
    • views:
      • high-level view (PPS sites)
      • services status per site
      • test results for a particular service on a site
      • services status for all PPS sites (see SAM Portal latest view for a Region)
      • test results for one service for all PPS sites (see SAM Portal latest view for a Region)
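
The operator views listed above share one drill-down path: region, then sites, then services, then test results, each with an optional "only failing" filter. The sketch below models that path under stated assumptions; the class names, the rule that a service fails when any of its tests fails, and all site, region and test names are hypothetical and do not come from SAM or GridView.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CheckResult:
    """Result of one test run against a service (hypothetical model)."""
    name: str
    ok: bool


@dataclass
class Service:
    kind: str
    checks: List[CheckResult] = field(default_factory=list)

    @property
    def failing(self) -> bool:
        # Assumption: a service fails if any of its tests fails.
        return any(not c.ok for c in self.checks)


@dataclass
class Site:
    name: str
    region: str
    services: List[Service] = field(default_factory=list)

    @property
    def failing(self) -> bool:
        return any(s.failing for s in self.services)


def sites_in_region(sites, region, only_failing=False):
    """High-level view: all sites of a region, or only the failing ones."""
    return [s for s in sites
            if s.region == region and (s.failing or not only_failing)]


def services_of(site, only_failing=False):
    """Per-site view: all services, or only the failing ones."""
    return [s for s in site.services if s.failing or not only_failing]


def failing_checks(service):
    """Test-level view: names of the failing tests of one service."""
    return [c.name for c in service.checks if not c.ok]


# Hypothetical data for two regions.
sites = [
    Site("SITE-A", "ROC-X", [
        Service("CE", [CheckResult("job-submit", True)]),
        Service("SE", [CheckResult("file-put", False),
                       CheckResult("file-get", True)]),
    ]),
    Site("SITE-B", "ROC-X", [Service("CE", [CheckResult("job-submit", True)])]),
    Site("SITE-C", "ROC-Y", [Service("CE", [CheckResult("job-submit", False)])]),
]

failing = sites_in_region(sites, "ROC-X", only_failing=True)
```

Here `failing` contains only SITE-A, `services_of(failing[0], only_failing=True)` narrows it to the SE, and `failing_checks` names the failing test. The PPS views would follow the same path with the PPS sites as the starting set.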

Site Operator

  • site view:
    • status of services
      • multiple VOs!
    • test results per service

Virtual Organisation and test developer

Similar to the use case Service failure analysis.

Interface Design Proposal

TODO: put the results of Judit/Phool/... discussions here

Migration Plan

Phase 1
Keep all current tools (SAM Portal, GridView), replace the SAM History View with GridView History Bars, and make the necessary modifications in all the other tools that link to SAM Portal (COD Dashboard, GStat, ...).
Phase 2
Replace the current SAM Portal completely with GridView/(something else).

-- PiotrNyczyk - 17 Jul 2007

Topic revision: r3 - 2007-07-20 - PiotrNyczyk
 