Week of 051121
Open Actions from last week:
- Lyon wants to switch to SRM Copy (fts-support)
DONE
- New LFC sensor to detect current thread usage, and external service availability via CLI tools (James)
Chair: James
On Call: Maarten + Antonio
Monday:
Log: VAR FULL lxb0729, LCG_GRIDFTP_MON_WRONG on oplapro[61,73]
New Actions:
- Work out how to do log expiry with log4j (Gavin)
- New version of LCG_MON_GRIDFTP (Maarten)
Discussion:
- Intervention on Pilot LFC was carried out smoothly and successfully on Friday afternoon.
- FTS Team promise a gLite QF tomorrow.
Tuesday:
Log: LFC_DB_ERROR on lfc006, resource usage problems on lfc-atlas, reports of problems to BNL
New Actions:
Discussion:
- restarted lfcdaemon on lfc006
- added extra server for lfc-atlas (lfc004) - now on lfc00[4,8]
- problems to BNL were due to the srmcopy code in FTS not handling all errors well. BNL + IN2P3 have now reverted to srm 3rd-party copy (Gavin)
- FTS QF is now available - will be tested today, and deployed tomorrow
- new version of LFC which should fix some of ATLAS's problems will be tested today as well.
Wednesday:
Log: Support team did a reboot of all LFC servers following more LFC_DB_ERRORs
Actions:
- Patricia has a query on which FTS version to use for UI/VOBOX - Gavin to respond
DONE
- Check with ZS what is needed and why for gridview (James)
Discussion:
- Gridview are asking for more resources/memory. Harry says it's dependent on the mid-range servers going into production
- FTS QF and new LFC not tested yet
Thursday:
Log: LCG_GRIDFTP_WRONG on many oplapro nodes - given to sysadmin by mistake
Actions:
Discussion:
- New rpms for lcg-gridftp-mon should fix problems with LCG_GRIDFTP_WRONG
Friday:
Log: Nothing to report
Actions:
- PS DB team will do a reboot of the DB for all LFC+FTS on Monday 9.30 AM
Discussion:
Topic revision: r6 - 2005-11-25