SL6 stress testing, backfill - more on this further down the agenda
Site support
We began AAA testing with SL6 but need to stop doing so...
Sara's notes
Agent Issues
Agents have been well behaved... that's what happens when you aren't running much
Redeployment plan
cmssrv112 - in drain for redeployment because its disk filled up
Workflows
Processing string - We had 2 tasks assigned last week; where do we stand with them?
We need to talk to MCM about how they want the policy set: Dave will reply to the e-mail that they are postponing it and asking for clarification as to how we will know that we are using it.
Julian will make a list of WFs already in the system that have a processing string in the schema.
We have begun ramping up the load testing on the 3 SL6 agents; so far no major crashes!
We had issues with couch replication
Fri-Weekend - The submitted Backfill ran at a nice steady state with no crashes, but the thresholds were improperly set
Mon-Tues - Realized that the thresholds were set incorrectly when we couldn't get the 2nd Backfill to go; once the thresholds were reset, things ramped up nicely and held a steady state
We are running the T1s at threshold using these 3 agents
Wed - Started another redigi backfill with higher priority than the others; it is currently taking over slots as we want it to!
Wed - Julian started an MC backfill and is ramping up more jobs
cmssrv217.fnal.gov cmssrv217. 11618 27143 0
cmssrv218.fnal.gov cmssrv218. 13986 20560 1
cmssrv219.fnal.gov cmssrv219. 10986 23072 0
Julian was going to work on changes to the resubmit script to help us create backfill. How is it going?
We think we are ready to start running low-priority real work on these machines. Anybody want to throw some work at us?