Week of 051003

Open Actions from last week:
  • check possiblility of using LCG Quattor WG componets for LFC/DPM/... (Jan/Vlado/Sophie) TO TEST
  • FTS server not visible from the VO box -> deploy another one (Simone + Gavin). WAITINGT
  • Check problem with grid-mapfile for unosat on LFC boxes (Sophie /Patricia) FIXED
  • Quick Fixes to solve the FTS NDGF (Gavin) DONE
  • change FTS configuration to use the new myproxy server (Gavin) DONE
  • Monday : intervention myproxy FTS (Paolo) DONE
  • Olof : suspecting that Fermilab is still using castorgrid instead of castorgridsc -> to check (Olof). DONE

On Call: Jean-Philippe + Sophie

Monday:

Log: Nothing

New Actions:

  • FTS myproxy intervention. Already announced (Paolo) DONE

Discussion:

  • Quick Fixes for NDGF being tested.
  • Fermilab was indeed using castorgrid, now a lot of transfers from them. ia32 nodes : no data -> suspect firewall. to check (Olof). All CMS transfers failed. Rest of the transfers are fine.
  • LHCb starts today. They should be ready.

Tuesday:

Log: Nothing

New Actions:

  • check last ia32 node (Olof) DONE
  • check list sent by Roberto. check CNAF pb. (Jean-Philippe/Sophie)
  • FTS firewall issue (Gavin/Maarten) PENDING

Discussion:

  • Paolo : still firewall issues with the FTS web service that should be visible from outside.
  • Olof : ia32 nodes problem understood : strange netmasks. Fixed except on one node : to be rebooted today
  • Eric : database monitoring for growing processes in place on the CASTOR production and test setup. The test setup is being stressed, in order to see if the problem can be reproduced.

Wednesday

Log: No calls. SHIVA had a problem.

Actions:

  • Fix the problem in the stager to remove the disk-only files (Olof)
  • cleanup and re-lay files again.
  • lxshare218d is very popular with CMS
  • SARA have network problems after reconnect of 10G link. DONE
  • publish procedure for FTS to publish in BDII (James/Gavin)
  • Danielle and Olof to do a debug session at 11. ON HOLD

Discussion:

  • security scan exposed some non-essential ports open
  • some of 1000 diskonly copies have been GC'ed

Thursday

Log: Problem reported by Fermi with non-fqdn hostnames. Problem on IA32 nodes is still there with transfers on loopback interface time out. Will escalate to linux.support

Actions:

  • Problem with names found by CASTOR2 team and will be patched today (Olof)
  • Remove IA32 nodes from wan cluster until problem solved (Olof)
  • Escalate problem to linux.support (Olof)
  • do schema update for FTS production (Olof)
  • problem with castor NS node - broken disk - should be hot-swappable and will be fixed later today

Discussion:

  • shiva now cleaned of duplicates. James will go through old bugs and close them.
  • SARA reverted to Geant
  • debug session with daniella did not happen due to interventions
  • New oracle security fixes coming in 2 weeks (week of 17th or 24th Oct) Will need to be applied then
  • DPM/CASTOR rfio incompatibility has been escalated by CMS.

Friday

Log:

Actions:

Discussion:

-- SophieLemaitre - 04 Oct 2005

Edit | Attach | Watch | Print version | History: r7 < r6 < r5 < r4 < r3 | Backlinks | Raw View | Raw edit | More topic actions...
Topic revision: r5 - 2005-10-06 - JamieShiers
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    LCG All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2019 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback