Indico: https://indico.cern.ch/conferenceDisplay.py?confId=254668

News

  • Diego (FNAL) is leaving us; we thank him for all his work.

Attending

Dorian, Sunil, Andrew, Edgar (from home), Luis (FNAL), Xavier, John, Diego (FNAL)

Issues last week

  • Black node at Wisconsin
  • JobCreator problem on vocms237; Edgar patched it.

Personnel

  • Coming off shift - Sunil
  • Coming on shift - Xavier

Site Issues

Sites for Production

Site | in MC Slots | Status | Notes | Issues
T2_GR_Ioannina | 94 | skip | stopped commissioning, 100% failure rate - Aug 13: no response from site admins | https://savannah.cern.ch/support/?138614
T1_RU_JINR | 4800 | skip | waiting for link commissioning - different procedure for T1 |
T2_PK_NCP | 318 | not in list | commission if out of the waiting room for at least 2 weeks |
T2_RU_PNPI | 176 | skip | to be commissioned |

Agents

  • The LHEStepZero type will disappear; these workflows will be distinguished by a flag instead. That information has to be passed to the agent. Diego will create a GitHub issue for it.

Workflows

IEEE Paper

Draft Outline #1

  • Introduction (why we need to run so many simulations, why we need to re-reconstruct the data) (Edgar/Jen)
  • A brief discussion of the different types of workflows and how they are processed differently (Diego/Jen/Edgar)
  • Monitoring for T1 & T2 sites (Diego/Jen/Edgar)
  • How we ran prior to 2011
    • ProdAgent vs WMAgent (Diego/Alan) (focus on differences and improvements)
    • Reprocessing and Production (Jen/Xavier) (how this was handled with ProdAgent and why we needed to move to another framework)
  • How we ran with WMAgent (after 2011)
    • WMAgent/ReqMgr/WorkQueue (Diego/Edgar/Alan): general comment on how it works
    • PREP/ReqMgr interaction (Vincenzo?)
    • Organization of the workflow team and operations around it (Edgar)
  • Achievements
    • Events reconstructed (L3s)
    • Usage of the grid (Edgar/Jen/L3s)
  • Conclusions / Outlook (Edgar/Jen)

Action Items

  • Write a twiki on disk/tape separation at T1_IT_CNAF - Edgar
  • Recovery workflows - Jen - suspend
    • The first 2 workflows are completely through; we are now waiting for people to check the output and confirm there are no showstoppers before we run the other 50.
    • Guillemo is pressing JeanRoc on whether people have actually looked at the data.
  • We need to add a daily report on workflow stats - still needs debugging work
  • Add a new state for ACDCs that are completed and already dealt with.
  • How many workflows are running, pending, waiting, stuck?
    • Is it documented yet?
  • Solve the problem of how to use a non-production scram architecture (waiting for Alan to come back)

AOB

  • Diego will continue to work on the paper.
  • Andrew asked whether two runs could be merged into more than one file. Normally yes; preventing it would need an agent hack.
  • Luis (FNAL) will need 3 more days to get a CERN account.
Topic revision: r6 - 2013-08-21 - AndrewLevin
 