WN Working Group
The worker node working group is a
TCG-sponsored activity that aims to address the matching and utilization
of worker node resources within the EGEE grid.
The mandate, list of members and mailing list are available here.
Mandate
Comments
A number of comments have already been addressed to the group and might be considered for inclusion in its discussions
and outputs.
- CPU Numbers - The working group will most likely touch on describing heterogeneous batch farms with multiple GlueSubClusters. Consequently, publishing reliable CPU counts in each GlueSubCluster, for use by e.g. gstat, becomes a sensible objective. See the long lcg-rollout thread.
- Passing wall clock time for jobs - It is very likely that the group will consider passing memory and/or disk requirements to the LRMS. Particularly for sites supporting MPI jobs, it is vital that all jobs are also submitted with a wall clock time so that the scheduler can backfill. Passing wall clock time is not an objective of the group, since different WNs do not generally support different wall clock times, but it is related to argument passing and so can be considered.
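To make the argument-passing comment concrete, the following is a minimal sketch, assuming a Torque LRMS, of how a job's wall clock, memory and disk requirements might be turned into a `qsub -l` resource list. The function names and the choice of the `walltime`, `mem` and `file` resources are illustrative assumptions, not something the group has agreed on, and this is not how any existing CE passes arguments.

```python
def format_walltime(seconds):
    """Format a duration in seconds as HH:MM:SS."""
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def torque_resource_args(wallclock_seconds, memory_mb=None, disk_mb=None):
    """Build a hypothetical `qsub -l` argument pair from job requirements.

    A wall clock limit is treated as mandatory here, since the scheduler
    needs it for every job in order to backfill. Memory and disk are optional.
    """
    if wallclock_seconds is None or wallclock_seconds <= 0:
        raise ValueError("a positive wall clock limit is required for backfill")
    resources = [f"walltime={format_walltime(wallclock_seconds)}"]
    if memory_mb is not None:
        resources.append(f"mem={memory_mb}mb")
    if disk_mb is not None:
        resources.append(f"file={disk_mb}mb")
    return ["-l", ",".join(resources)]


if __name__ == "__main__":
    # A two-hour job asking for 2 GB of memory.
    print(" ".join(["qsub"] + torque_resource_args(7200, memory_mb=2048)))
```

Rejecting jobs that carry no wall clock limit is one possible design choice; a site could equally apply a queue default instead.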
Strategy
- VOs to produce a list of the WN capacity constraints they wish to describe their jobs by, e.g. memory, disk space, anything else?
- Produce an outline of what can be achieved today. "Today" here means the Glue 1.3 Schema, WMS 3.1 and the LCG CE.
- From this we can consider whether a short-term solution is worth implementing, given the anticipated constraints of the lcg-CE. Any such solution would likely result in recommendations to the YAIM team for such a deployment.
- Some sites, notably RAL, already run with a configuration in which matching different worker node resources within the same site is possible, but far from optimal.
- Run within the PPS a CREAM CE, which is expected to be available, at the very earliest, as a pre-pre-release at the end of October 2007.
- This will be configured as a CE, a Torque batch system and two batch workers with different hardware configurations.
- The information published by this CREAM CE can be tweaked by hand to establish how this heterogeneous cluster should be published.
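As a rough illustration of what matching against a heterogeneous site involves, the sketch below models two subclusters with different hardware and filters them by a job's memory and disk constraints. The `SubCluster` class, its field names and all the numbers are invented for illustration; they are not Glue 1.3 attribute names, and this is not the WMS matchmaking algorithm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SubCluster:
    """Hypothetical summary of one homogeneous group of worker nodes."""
    name: str
    ram_mb: int        # memory per worker node
    disk_mb: int       # scratch disk per worker node
    logical_cpus: int  # total logical CPUs in this subcluster


def matching_subclusters(subclusters, min_ram_mb=0, min_disk_mb=0):
    """Return the subclusters whose nodes satisfy the job's constraints."""
    return [
        sc for sc in subclusters
        if sc.ram_mb >= min_ram_mb and sc.disk_mb >= min_disk_mb
    ]


def total_logical_cpus(subclusters):
    """Sum CPUs over subclusters, so each worker node is counted once."""
    return sum(sc.logical_cpus for sc in subclusters)


# Invented example site with two kinds of worker node.
SITE = [
    SubCluster("old-nodes", ram_mb=2048, disk_mb=20000, logical_cpus=40),
    SubCluster("new-nodes", ram_mb=8192, disk_mb=100000, logical_cpus=64),
]

if __name__ == "__main__":
    suitable = matching_subclusters(SITE, min_ram_mb=4096)
    print([sc.name for sc in suitable])
    print(total_logical_cpus(SITE))
```

Keeping a per-subcluster CPU count also bears on the CPU-numbers comment: a site total is only reliable if each worker node belongs to exactly one subcluster.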
Software Tags
The software tags need to be addressed with respect to running multiple
SubClusters on the same physical host.
Test Rig
A test installation is being set up within the PPS.
See:
WNWorkingGroupInstallLog
Presentations
Relevant Documents
Meetings
--
SteveTraylen - 25 Sep 2007