Workflow Team Meeting - Jan 7 4PM CERN, 9 FNAL time

Vidyo Link

Attending

  • FNAL: Jen, SeangChan
  • US: Ajit, Stephan
  • CERN : Dima, JR
  • Colombia:

Personnel

  • Gaston out Dec 22-Jan 8 - not have connection may read e-mail
  • Jorge out Dec 21-Jan 6
  • Eliana out Dec 28th to Jan 14th.
  • Matteo - Will be around, will be in Italy from Dec 21-Jan 7 but will be working remotely, will have good internet, availablity won't change just be on EU time
  • Alan - Going to Brazil Dec 21-Jan 21 will be working from Brazil Jan 14-20 - SeangChan has Alan's grandma's number
  • do we know when Julian's Replacement is starting yet? - Jen will email Christoph

News - Dima

  • Nothing Much to Say - We need to close out the ReReco 2015C_25ns, and 2015D are the highest priority
  • JR will loook into adding a priority column on the assistance page

3 top issues effecting production

  • redirector issues: recoverable with multiple acdc's but annoying
    • Jen will tell Jorge when he gets in that he needs to look at it.
  • Creation failure - too many events per lumi trying to up number of events per lumi - see how it is working in morning
  • bad fwjr xml's see fabozzi_Run2015D-MET-16Dec2015_763_151218_000541_8216 as example not recovering
    • Jen will talk to SeangChan about looking into this
  • stuck WF's -219
    • 219 has disappeared we need to get it back
    • weird error at Estonia no solution yet. Stephan will look - we need to poke site support on this.
  • still going through backlog of work from over holidays
  • what is going on with injection of new jobs, pending is empty. Dima and JR are looking, there are problems with transfers pending, too much data transferring, the big datasets are staging and then we will be going again.

Site support - Gaston

Transfers - Jorge

Workflows

ReDigi

  • Still need to go through

TaskChains

  • Still need to go through

StepChain

Rereco

Store Results

  • check documentation

MonteCarlo

  • still need to go through

Agent Issues

  • 219 - database issues/stuck workflows issues ongoing, JR and SeangChan

Agent redeployment

  • Next production stable release aimed at Feb/2016
  • Ready to be redeployed: submit2, vocms0311
  • cmssrv218 and 219 are in drain (Workqueues overloaded).
production SL6
FNAL CERN
cmsgwms-submit1 (up) vocms0308 (up)
cmsgwms-submit2 (ready to redeploy) vocms0309 (up)
cmssrv217 (up) vocms0310 (up)
cmssrv218 (drain - overloaded) vocms0311 (ready to redeploy)
cmssrv219 (drain - overloaded) vocms0304 (on HLT tests)
  vocms0303 (up / highprio)

RelVal Andrew

L3 discussion - Ajit, Jean-Roch, Matteo

Opportunistic Resources

Automatic Assignment And Unified Software

  • We need documenation!!!!!!! Matteo is working on it, will continue to look at it.

AOB

-- JenniferAdelmanMcCarthy - 2016-01-07

Edit | Attach | Watch | Print version | History: r3 < r2 < r1 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r3 - 2016-01-20 - JenniferAdelmanMcCarthy
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    CMSPublic All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2020 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback