Andrew, Luis, John, edgar

Issues last week

  • Bristol: some links problem today --> set to drain
  • T2_BE_IIHE: no xrootd fallback config --> can not be used for present HIN WF's --> set to drain and asked admin to implement
  • T2_TW_Taiwan: xrootd fallback config was missing but was quickly implemented by admin --> work again
  • RAL: testing CVMFS black node auto discovery script to take offlines bad nodes, no more problem seen but let see ...
  • Several sites with CVMFS problem earlier this week (KIT, ...) but none since second half of the week


  • Coming off Shift- Xavier
  • Coming on Shift - Sunil
  • Thanks to Dorain and John for covering while Jen was on vacation!
  • Jen will be training Luis in Operations

Site Issues

Sites for Production

Site in MC Slots Status Notes Issues
T2_RU_PNPI 176 skip to be commissioned
T2_RU_SINP ? drain to be commissioned

List of sites that have never been commissioned (and which just need to be put out of drain)? (ask Edgar)


IEEE Paper

Draft Outline #1

  • Introduction (Why we need to run so much simulations, why we need to do a rereconstruction of the data) (Edgar/Jen)
  • a brief discussion of what the different types of workflows are, and how they are processed differently (Diego/Jen/Edgar)
  • monitoring for T1 & T2 sites(Diego/Jen/Edgar)
  • How we ran prior to 2011
    • ProdAgent vs WMAgent ( Diego/Alan) (Focus on differences and improvements)
    • Reprocessing and Production (Jen/Xavier) (How this was handled with ProdAgent and why the need to move to another framework
    • How we ran with WMAgent (after 2011)
  • WMAgent /ReqMgr/Workqueue (Diego/Edgar/Alan) General comment on how it works * PREP/ReqmG Interaction (Vincenzo?) * Organization of the workflow team and operations around it (Edgar)
  • Achievements
  • Events reconstructed (L3s)
  • Usage of the grid (Edgar/Jen/L3s)
  • Conclusions / Outlook (Edgar/Jen)

Action Items

  • Write twiki disk/tape separation T1_IT_CNAF. Edgar
  • Recovery workflows - Jen - suspend
    • first 2 workflows are completely through and now we are waiting for people to really look and make sure that there are no show stoppers before we do the other 50.
    • Guillemo is bothering JeanRoc about if people have actually looked at the data
  • we need to add a daily report on Workflow stats - needs work on debugging
  • A new state for completed and already dealt with ACDC.
  • How many workflows running, pending, waiting, stuck
    • Is it documented yet?
  • solve the problem of how to use a non-production scram architecture (waiting for Alan to come back)
  • Luis now has accounts and can log in, time to make him useful (Jen)


Edit | Attach | Watch | Print version | History: r5 < r4 < r3 < r2 < r1 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r5 - 2013-08-28 - EdgarFajardo
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    CMSPublic All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback