Welcome Julian! Please introduce yourself to the group!


Issues last week

  • issues with too many jobs hitting the system at once again
    • I believe Luis is supposed to be looking into this. What can we all do to help speed this process along?
  • The switch to EOS at FNAL is over, Test workflows went through with only minor issues that have been fixed so we are back in business.
  • Still having issues with couch Databases.
  • Issues with couch and Late Binding are causing workflows to take a very long time to go through and making debugging difficult.
  • git -
    • Is git installed on the production machines? Seangchan, Luis and I couldn't find it
    • Who needs to install this centrally? FNAL and CERN machines
    • what is the command we need to replace:
      • svn co svn+ssh:// ~/WmAgentScripts
    • Seangchan was wondering why we are all keeping our "own version" of the WmAgentScripts in our own home areas instead of all working out of the cmst1 directory. Can we revisit this topic?


  • Sep 10 --> Sep 17 Xavier
  • Sep 17 --> Sep 24 Sara

Site Issues

Sites for Production


  • still having issues with couch and stability of agents



IEEE Paper

Draft Outline #1

  • Introduction (Why we need to run so much simulations, why we need to do a rereconstruction of the data) (Edgar/Jen)
  • a brief discussion of what the different types of workflows are, and how they are processed differently (Diego/Jen/Edgar)
  • monitoring for T1 & T2 sites(Diego/Jen/Edgar)
  • How we ran prior to 2011
    • ProdAgent vs WMAgent ( Diego/Alan) (Focus on differences and improvements)
    • Reprocessing and Production (Jen/Xavier) (How this was handled with ProdAgent and why the need to move to another framework
    • How we ran with WMAgent (after 2011)
  • WMAgent /ReqMgr/Workqueue (Diego/Edgar/Alan) General comment on how it works * PREP/ReqmG Interaction (Vincenzo?) * Organization of the workflow team and operations around it (Edgar)
  • Achievements
  • Events reconstructed (L3s)
  • Usage of the grid (Edgar/Jen/L3s)
  • Conclusions / Outlook (Edgar/Jen)

Action Items

  • Write twiki disk/tape separation T1_IT_CNAF. Edgar
  • Recovery workflows - Jen - suspend
    • first 2 workflows are completely through and now we are waiting for people to really look and make sure that there are no show stoppers before we do the other 50.
    • Guillemo is bothering JeanRoc about if people have actually looked at the data
  • A new state for completed and already dealt with ACDC.
  • How many workflows running, pending, waiting, stuck
    • Is it documented yet? yes
    • Luis is working on a script to pull these numbers automatically. - script done but we are still tweeking it
  • solve the problem of how to use a non-production scram architecture (waiting for Alan to come back)
  • Updating documentation on scripts with github now that we aren't using svn anymore
    • docuentation needs to be updated and everyone needs to start ramping up on github


  • Diego will continue to work in the paper
  • problem with the creation of WF's if you change the number of files per job
  • CouchDB will be rotated on Wed so we will be running without being able to watch WMStats on Wed/Thurs should be back Friday
Edit | Attach | Watch | Print version | History: r3 < r2 < r1 | Backlinks | Raw View | Raw edit | More topic actions...
Topic revision: r1 - 2013-09-17 - JenniferAdelmanMcCarthy
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    CMSPublic All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback