PPS Pilot Follow-up Meeting Minutes Wed 26 Feb 2009

  • Date: Wed 26 Feb 2009
  • Agenda: 52981
  • Description: Pilot of glexec/SCAS: check-point
  • Chair: Antonio Retico, Gianni Pucciani


  • PPS: Antonio Retico
  • CERN: Gianni Pucciani (SA3)
  • CMS: -
  • Atlas: Jose Caballero
  • Alice: Patricia Mendez Lorenzo
  • LHCb: Roberto Santinelli, Stuart Paterson, Andrei Tsaregorodtsev
  • Nikhef: Oscar Koeroo

Review of action items (tasks)

SA1/SA3 tasks

Status of the subtasks of TASK:8986 (see them in the PPS tracker).

Other tasks


Status and results of the pilot service (by VOs and sites)

Angela needs to install the new glexec rpms.

LHCb still needs to do some development to work with the new glexec rpms. Andrei: on our side we have found that we need to tune a few bits, in particular around identity handling. We saw that different identities are mapped to different Unix groups, and we want to be sure that this does not happen. We may need to restructure the code. We also need to install our software and make sure that it stays visible once the identity is switched; this has been a pain point for us in the past, even when we are just using it as it is. LHCb will use Nikhef to test the integration (Dirac-glexec). Andrei and Stuart: we will try it out with Nikhef straight away next week. We don't expect surprises.

Patricia stated that Alice is also interested in doing some development to use glexec. Early tests should start in June, maybe earlier, but a firm date is not yet clear. FZK would be a very convenient site, since the requirements are the same as LHCb's. Angela: which role do you expect to use? Patricia: the pilot role created a few days ago.

Status and results of the development (by developers)

Oscar explained how glexec and the SCAS client have been improved with retry mechanisms to mitigate the errors caused by the SCAS internal refresh of the serving process. A new glexec patch including all these improvements is expected within one day. An SCAS server load-balancing system is also work in progress and has already been tested at Nikhef. All these developments should contribute significantly to fault tolerance in the glexec-SCAS interactions.
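As an illustration only, the kind of client-side retry described above can be sketched as follows. This is not the glexec or SCAS client code: the names TransientError, call_with_retries, and the endpoint strings are hypothetical, and the sketch simply assumes that a request which fails during a server refresh can be repeated, first against the same server and then against the next one in a list.

```python
import time


class TransientError(Exception):
    """Hypothetical stand-in for a failure caused by a server-side refresh."""


def call_with_retries(request, endpoints, attempts_per_endpoint=3,
                      delay=1.0, sleep=time.sleep):
    """Call request(endpoint), retrying on TransientError.

    Each endpoint is tried up to attempts_per_endpoint times before moving
    on to the next one. The wait grows linearly with the attempt number.
    All parameter names and defaults here are assumptions for illustration.
    """
    last_error = None
    for endpoint in endpoints:
        for attempt in range(attempts_per_endpoint):
            try:
                return request(endpoint)
            except TransientError as exc:
                last_error = exc
                sleep(delay * (attempt + 1))
    # Every endpoint failed on every attempt: report the last failure seen.
    raise RuntimeError("all endpoints failed") from last_error
```

A caller would pass its own request function and an ordered list of servers, for example call_with_retries(ask_server, ["scas-a", "scas-b"]). Any error other than TransientError is deliberately not retried, so that genuine authorization refusals are reported immediately.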

Open Issues (by VOs, sites, deployment teams)

FZK raised the issue of publishing the CE that has been prepared in the production BDII, saying that this could cause ATLAS production jobs to end up on that CE even if the CE is published with glueCEStateStatus="PilotSCAS". Having the CE in the production BDII is strongly requested by LHCb and Alice for this pilot activity. ATLAS, not present at the meeting, will be asked whether this could be a real issue for them.
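Whether production jobs reach the pilot CE depends on whether the submitting system filters on the published status. A minimal sketch of such a filter, assuming the CE information has already been read from the BDII into plain dictionaries, is shown here. The host names and the helper production_ces are hypothetical; only the attribute names GlueCEUniqueID and GlueCEStateStatus come from the GLUE schema.

```python
def production_ces(records):
    """Return the IDs of CEs whose published status is exactly 'Production'.

    records is a list of dicts keyed by GLUE attribute name. A CE published
    with any other status (for example 'PilotSCAS') is skipped, as is a
    record with no status at all.
    """
    return [
        record["GlueCEUniqueID"]
        for record in records
        if record.get("GlueCEStateStatus") == "Production"
    ]


# Hypothetical records, for illustration only.
example_records = [
    {"GlueCEUniqueID": "ce-prod.example.org:2119/jobmanager-pbs-atlas",
     "GlueCEStateStatus": "Production"},
    {"GlueCEUniqueID": "ce-pilot.example.org:2119/jobmanager-pbs-atlas",
     "GlueCEStateStatus": "PilotSCAS"},
]

selected = production_ces(example_records)
```

A submission system that applies a filter of this kind would not select the pilot CE. One that matches on other attributes only could still send jobs there, which is the concern FZK raised.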

Next events

Antonio: I have to prepare a presentation for the GDB on 11 March; everybody is welcome to send me their results. The next pilot follow-up meeting will be held on 19 March at 16:00.

Topic revision: r5 - 2009-03-05
