Team Meeting June 4, 2013

Indico Link to meeting

Attending:

John, Jen, Edgar, Dorian, Xavier, Seangchan

Sara - is at another meeting

Andrew - in upgrade meeting

Alan - preparing testbed deployment and is unable to come

Personel for this week:

  • Jacob on Vacation until June 16
  • Andrew will be on vacation June 15-22
  • Jen taking Wednesday off, will be working 1/2 days from home Thurs/Friday of this week
    • With the end of the school year Jen will be working longer day hours, and fewer evening/weekend hours over the summer.
    • John, Dorian will you be able to keep an eye on things this week?
      • Dorian can be online and will keep an eye on things
  • Diego is back in Colombia and is taking the week off, will be reading e-mail on and off in the evening, will be back at work next week.
  • Coming off shift: Sara
  • Coming on shift: Xavier
  • once people know their summer vacation plans let's get them on the schedule!
  • Stenick off next week
  • Xavier will update the EU shift past the end of June

Site support info

Sites IN This week IN # weeks IN This week OUT T2_CH_CERN X T2_CH_CERN_AI X T2_CH_CERN_HLT X T2_GR_Ioannina X T2_RU_ITEP X T2_UK_SGrid_Bristol X T2_EE_Estonia 2 T2_IN_TIFR 2 T2_MY_UPM_BIRUNI 2 T2_PK_NCP 2 T2_PL_Cracow 2 T2_RU_RRC_KI 2 T2_TH_CUNSTDA 2 T2_TR_METU 2 T2_PL_Warsaw X 1 T2_BR_UERJ X 1

Issues with Monitoring using WMStats

  • problems with couch are causing ongoing issues with monitoring with WMStats

Agent issues

  • vocms216 is being upgraded - all new jobs will be submitted and shouldn't have the isses of 201 & 235 because the problems we were having were due to large number of lumis and that has been addressed
  • vocms85 not running many jobs, but seems OK
  • vocms237 Not creating new jobs
    • not creating cleanup jobs - Edgar has asked Seangchan to look
  • vocms201 & 235
    • couch has problem with documents, some of documents are bigger than it can handle unless we upgrade couch
    • Diego suggested disabling monitoring so jobs can move on,
    • no easy solution for right now, Seangchan has been without computer for 2 days so he will start to look at it again today
    • let's put both agents in drain, finish what we have and clean up when they are done
    • we will not be able to monitor workflows on these 2 machines for the time being in WMStats
      • the lower level detail page information is still there there is just no information on the front page.

WMStats

  • we can not get information from dashboard to WMStats
  • we would like to retire old GlobalMonitor
    • reg expression search
    • access to local queue so we can determine why workflows are not closing out
    • Seangchan is trying to put these two features in WMStats hopefully we can retire old Global Monitor after the July release
  • Creation of ACDC from WMStats no longer works and that needs to get fixed again

Krista has requested help testing WMAgent on cmssrv101 with the new condor/glideinwms rpm install

  • If Seanchan and I are not able to do this today, Edgar would you be able to do so late your day/early ours later this week?
    • Jen, Seangchan and Krista are going to try to meet this afternoon at 2, If we don't get things done I will let you know.

Andrew issues:

  • workflows stuck in assigned on 113
    • workflows are not in the global queue Seangchan can you look at it today, if it's not fixed will reply to the e-log with what he tried so Edgar knows where to start when he comes back online
    • everything looks correctly assigned but they have been sitting in assigned for 4 hours we need to keep an eye on it today
  • Dataset naming - JeanRoch is complaining about dataset naming, there are some cases where we want to write to an existing dataset ie ACDC or extend statistics for MC GENSIM
  • we need to put a warning to an existing dataset Edgar will help write the script that will give a warning if we are writing to an existing dataset
Edit | Attach | Watch | Print version | History: r7 < r6 < r5 < r4 < r3 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r7 - 2013-06-04 - JenniferAdelmanMcCarthy
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    CMSPublic All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2020 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback