Workflow Team Meeting - Feb 25 4PM CERN, 9 FNAL time

Vidyo Link


  • FNAL: Jen, Jorge, Eliana, Gaston
  • US:Matteo
  • CERN : Paola, Alan, Dima


  • Jen to CERN Feb 29-March 4 - tickets are booked!
  • Jorge to Columbia April 15-May 2, Talk on April 27

News - Dima

  • PPD is now wanted to start earlier, nominal date April 1 but could start as much as much as 2 wks earlier

3 top issues affecting production

  • emergency downtime for FNAL due to security issues
    • machines are not coming up as nicely as they usually do, those agents with lots of jobs running still did not want to come up nicely
    • Need to find out when CERN is doing it's security patch fixes and reboots
  • Schedd on submit1 was not reporting for a day but it's back up
  • issues with cmsweb on Tues made getting through the list of workflows with issues impossible

Site support - Gaston

  • Into the waiting room: T2_IN_TIFR, T2_PL_Swierk - HIP and SINP are still in
  • Brunel also in and out in 16 out 20
  • Estonia - glsf failures at Estonia, and missing files
  • Beiihe, eerj is in downtime, t2_es_ifca wonky, PK_NCP in drain still, they are due to go into the waiting room tomorrow

Transfers - Jorge

  • No big news
  • we had some files that were in PhEDEx, and not in DBS, if there is none in the acq era of either the workflow or the parent they should be invalidated.
    • Jorge will point Alan to the ggus ticket and we will get this fixed,








Store Results

  • NA


Agent Issues

  • rebooted all FNAL everything yesterday

Agent redeployment

  • no news
  • we have to redeploy 217 , 310, 308
    • Alan and Paola are working on the draining process of the 310 and 308 agents. There are still pending 2 and 13 workflows, respectively.
  • after these are starting a new version of the agents in March, should - mid March, plan is to have everything redeployed by the beginning of the April Campaign
  • who is going to write up the redeployment schedule? Alan & Paola
  • cmsweb update - most of the crab experts won't be available next week due to Barcelona workshop the deployment will be March 8
    • the next one would be ~First Tues of the month in general

ReqestMgr2 Migration

  • we are not dropping request manager1 anytime soon
  • Jen and Matteo have started testing
  • Jen was able to get a workflow assigned via web pages, but we were unable to mess with splitting - ggus ticket filed
  • Jen, Matteo and SeangChan will meet at 10 on Friday and go through scripts
  • the WmAgentScripts - are not working well with requestmanager 2, going through them
  • There are a number of scripts that Unified copied over from WmAgentScripts, we do not want to maintain 2 copies of the same script. We need to decide what one is valid and just use that one. How do we want to do this? I vote for Unified scripts moving into the same level as the WmAgent scripts so we truely have a unified group of scripts that we are all using! JR : there isn't much copied from previous scripts, core calls to reqmgr are already centralized to one interface shared by ops and unified

RelVal Andrew

  • he is going to make some requests into testbed to see if we can get ReqMgr 2 goin

L3 discussion - Ajit, Jean-Roch, Matteo

Opportunistic Resources

Automatic Assignment And Unified Software

  • Alli will be in charge of documentation


-- JenniferAdelmanMcCarthy - 2016-02-24

Edit | Attach | Watch | Print version | History: r7 < r6 < r5 < r4 < r3 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r7 - 2016-03-07 - JeanrochVlimant
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    CMSPublic All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback