Luis is going to Colombia for a seminar April 21-25, then on vacation April 28-May 2
Dave & Oli at SLAC April 6-10
Dave at CERN April 21-29
Julian will be on vacation from 16-May to 18-May
CERN shutdown for Easter Holidays April 17-21
News
Issues with WFs stuck in running open - Fixed
This was a problem with the dbs2 shutdown. There were a handful of WFs that needed to be fixed by hand.
Need to come up with requirements for request manager 2.
Things that we currently do with scripts need to move to request manager.
Sara's Notes
Lots of agent issues; had to restart. TaskArchiver is crashing a lot.
Known issue; in the meantime, just keep restarting and e-logging.
Central couch compacts at night CERN time. View creation is slow/stuck, and until compaction finishes all agents will report that AnalyticsDataCollector is "down". In fact it is running; it just hasn't reported in 20 min. If you see this, you just need to wait. If you see TaskArchiver/JobUpdater down, those are actually down.
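The triage rule above could be sketched as a small check. This is only an illustration: the function name and the compaction_running flag are hypothetical, the component name is written without a space as an assumption, and how to treat a "down" report outside the compaction window is also an assumption (the notes only cover the compaction case).

```python
def is_reporting_false_alarm(component: str, compaction_running: bool) -> bool:
    """Return True when a "down" report is probably just delayed reporting.

    Per the notes: while central couch is compacting, AnalyticsDataCollector
    shows as "down" because it has not reported in 20 min, but it is running.
    Anything else (e.g. TaskArchiver, JobUpdater) is treated as really down.
    Assumption: outside compaction, every "down" report is taken at face value.
    """
    return component == "AnalyticsDataCollector" and compaction_running


# Example: decide what to do with a few "down" reports during compaction.
for name in ("AnalyticsDataCollector", "TaskArchiver", "JobUpdater"):
    action = "wait" if is_reporting_false_alarm(name, True) else "actually down"
    print(name, "->", action)
```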
Couch is maxing out. This depends on how many documents we have, and it seems to be happening more often now. It has been increased to 100 and we are still maxing out. We need to restart couch. Developers are testing Big Couch. If this works, problem solved, but as of now it doesn't appear that this is going to work. For now, just restart couch.
A change in documentation needs to be made. We only need to shut down the agent, restart couch, and then restart the agent if replication is down. Otherwise just restarting couch is OK, and then JobSubmitter doesn't have to rebuild things.
There are 3 WFs that have failures at all sites with SCRAM_ARCH issues.
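For the documentation update, the procedure above might be written down as a simple decision. This is a sketch only: the function name is hypothetical and the steps are plain-text descriptions, not real commands.

```python
from typing import List


def couch_recovery_steps(replication_down: bool) -> List[str]:
    """Return the ordered steps for recovering couch, as described in the notes.

    The step strings are descriptive placeholders, not actual commands.
    """
    if replication_down:
        # Replication is down: the agent has to be stopped around the restart.
        return ["shut down the agent", "restart couch", "restart the agent"]
    # Otherwise restarting couch alone is enough, and JobSubmitter
    # does not have to rebuild things.
    return ["restart couch"]


print(couch_recovery_steps(replication_down=False))
print(couch_recovery_steps(replication_down=True))
```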