Transfer Service Operations Procedures

[ in progress - last update: AlexanderUzhinskiy on 2007-05-21 - 09:38

Number of topics: 1

This page describes the procedure to check the status of the transfer service. The scope is the CERN FTS managed transfers: the tier-0 export and the CAF FTS service.

Both the FTS infrastructure at CERN and the actual state of the transfers should be checked.

The goal is to make sure that any issues affecting the service (whether they are FTS or SRM related) are reported to the correct site as soon as possible. The issues should also be tracked here to make sure that they are followed up.

The daily logs should be recorded in TransferOperationsDailyLog.

FTS infrastructure tests

The purpose of this is to check that the basic FTS service infrastructure at CERN is operating correctly. There are a number of alarms and status checks that are made. If any of these tests show problems, report to the FTS operations staff here, or using fts-support@cern.ch.

External tests

  • The main external FTS probe is made by SAM. This checks that the service is correctly registered in the information system and that it is responding to user requests. ( SAM FTS test - check that the CERN-PROD entries are both OK).

Internal tests

  • These are fabric-level tests that make sure that the daemons are running correctly and responding. The main ones are reported at the IT daily morning meeting (09.00) for the gridfts cluster - all of the alarms have operator procedures which should fix the problem. However, if a problem is still open, it should be investigated. You can review the current tickets at Logger (search domain: FIO, cluster: gridfts).

FTS: overall service monitoring

The

  • Check regularly the Gridview overview: GRIDVIEW. This will indicate which channels are (successfully) transferring data, though it will not show failed transfers.

  • Check the FTS daily report for failing sites: FTS report. This report is currently only generated once per day for the previous 24 hours transfers.

  • Run the log-parsing utility to check for the top reasons for a site being down.

How to log a problem

Report internal FTS problems to the CERN people here or via fts-support@cern.ch.

Report site storage problems using the GGUS submission portal.

Track the ticket number and status in the daily report.

A summary of outstanding issues should be reported in the weekly report for the Joint Operations Meeting.

....


Last edit: AlexanderUzhinskiy on 2007-05-21 - 09:38

Number of topics: 1

Maintainer: GavinMcCance

Edit | Attach | Watch | Print version | History: r6 < r5 < r4 < r3 < r2 | Backlinks | Raw View | Raw edit | More topic actions...
Topic revision: r3 - 2007-02-22 - GavinMcCance
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    LCG All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2020 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback