
WN Working Group.

The worker node working group is a TCG-sponsored activity that aims to address the matching and utilization of worker node resources within the EGEE grid.

The mandate, list of members and mailing list are available here: Mandate

Comments

A number of comments have already been addressed to the group that might be considered for inclusion within discussions and outputs.
  • CPU Numbers - The working group will most likely touch on describing heterogeneous batch farms with multiple GlueSubClusters. Consequently, publishing reliable CPU numbers in each GlueSubCluster for use by e.g. gstat becomes a sensible objective. Long lcg-rollout thread.
  • Passing wall clock time for jobs. It is very likely that the group will consider passing memory and/or disk requirements to the LRMS. Particularly for sites supporting MPI jobs, it is vital that all jobs are also submitted with a wall clock time to allow for backfill. Although this is not an objective of the group, since different WNs do not generally support different wall clock times, it is related to argument passing and so can be considered.
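
The argument passing mentioned in the last comment could be sketched roughly as follows. This is only an illustration, assuming a Torque/PBS batch system whose `qsub -l` option accepts `walltime` and `mem` resources. The helper names `format_walltime` and `build_qsub_resources` are hypothetical, and how the values would reach the CE from the job description is not something this page defines.

```python
from typing import List, Optional


def format_walltime(seconds: int) -> str:
    """Render a duration in seconds as HH:MM:SS."""
    if seconds <= 0:
        raise ValueError("wall clock time must be positive")
    hours, rem = divmod(seconds, 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def build_qsub_resources(walltime_s: int,
                         memory_mb: Optional[int] = None) -> List[str]:
    """Build a '-l' argument pair for qsub (hypothetical helper).

    The wall clock time is always included so that the scheduler
    has the information it needs for backfill; memory is optional.
    """
    resources = [f"walltime={format_walltime(walltime_s)}"]
    if memory_mb is not None:
        resources.append(f"mem={memory_mb}mb")
    return ["-l", ",".join(resources)]


# Example: a two hour job asking for 2048 MB of memory.
print(build_qsub_resources(7200, memory_mb=2048))
```

Making the wall clock time a required argument, while memory stays optional, reflects the point above that every job should carry a wall clock time even when no other requirement is passed.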

Strategy

  • VOs to produce a list of the WN capacity constraints by which they wish to describe their jobs, e.g. memory, disk space, anything else?
  • Produce an outline of what can be achieved today. By "today" we mean the Glue 1.3 schema, WMS 3.1 and the LCG CE.
    • From this we can consider whether a short-term solution is worth implementing given the anticipated constraints of the lcg-CE. Any such solution would likely result in recommendations to the YAIM team for such a deployment.
    • Some sites, notably RAL, already run with a configuration in which matching different worker node resources within the same site is possible but far from optimal.
  • Run within the PPS a CREAM CE, which is expected to be available, at the very earliest, as a pre-pre-release at the end of October 2007.
    • This will be configured as a CE, a Torque batch system and two batch workers with different hardware configurations.
    • The information published by this CREAM CE can be tweaked by hand to establish how this heterogeneous cluster should be published.
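
As a rough illustration of what matching against a heterogeneous cluster involves, the sketch below models two groups of worker nodes with different hardware and selects those that satisfy a job's requirements. It is a toy model under stated assumptions: the `SubCluster` class, its field names and all the numbers are invented for the example and do not correspond to the Glue 1.3 attribute names or to the real test rig.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class SubCluster:
    """One homogeneous group of worker nodes (all values are made up)."""
    name: str
    logical_cpus: int
    ram_mb: int
    scratch_gb: int


def total_logical_cpus(subclusters: Iterable[SubCluster]) -> int:
    """Sum the CPUs over all subclusters, e.g. for a site-wide total."""
    return sum(sc.logical_cpus for sc in subclusters)


def matching_subclusters(subclusters: Iterable[SubCluster],
                         min_ram_mb: int = 0,
                         min_scratch_gb: int = 0) -> List[str]:
    """Return the names of subclusters meeting the job's minimum needs."""
    return [
        sc.name
        for sc in subclusters
        if sc.ram_mb >= min_ram_mb and sc.scratch_gb >= min_scratch_gb
    ]


# Two hypothetical batch workers with different hardware configurations.
site = [
    SubCluster("small-wn", logical_cpus=4, ram_mb=2048, scratch_gb=20),
    SubCluster("large-wn", logical_cpus=8, ram_mb=8192, scratch_gb=100),
]

print(total_logical_cpus(site))
print(matching_subclusters(site, min_ram_mb=4096))
```

Keeping each hardware configuration as its own record is what allows both a reliable per-group CPU count and a per-group match; publishing a single averaged description of the site would lose that distinction.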

Test Rig

A test installation is being set up within the PPS.

| Node Name | OS | Purpose | Notes |
| lxb1914 | slc4 | lcg-gatekeeper, pbs_server, pbs_mom and GlueCE Info Providers | Installing now |

See: WNWorkingGroupInstallLog

Relevant Documents

Meetings.

-- SteveTraylen - 25 Sep 2007

Topic revision: r10 - 2007-10-18 - SteveTraylen