WLCG MW Readiness WG 18th meeting Minutes - July 6th 2016

WG twiki

Agenda

Summary

  • Old and inactive tickets in JIRA will be closed by the MW Officer after a final verification with Product owners and/or Volunteer site managers. See here which ones.
  • We need to know about upcoming MW product releases. We also have a permanent poll open for new Volunteer sites, especially for MW Readiness verification on CentOS7. See here what we know so far for the near future.
  • The agenda topic on the WG Mandate review and meeting frequency was postponed due to lack of participants in this meeting. To attract more people to this effort, we hope to get some publicity at the WLCG Workshop in October.
  • Due to summer holidays, WLCG Workshop & CHEP preparations and aftermath, the proposed date for the next meeting is Wed Nov. 2nd @ 4pm CET. Please email any comments to the e-group of the WG.

Attendance

  • local: Maria Dimou (chair & notes), Maarten Litmaath (ARGUS report), Andrea Manzi (MW Officer), Vincent Brillault (WLCG Security), David Cameron (ATLAS), Andrea Sciabà (CMS).
  • remote: Matt Doidge (Lancaster), Di Qing (Triumf).

Minutes of previous meeting

The minutes of the last (17th) meeting HERE are corrected in the Summary and Action List to remove the additional action around the pakiti API documentation given that the pakiti client won't be used on production systems.

Verification status report

The MWREADY JIRA dashboard shows the latest status of open tickets. A summary of progress since our last meeting is in the tables below. The following idle tickets on the dashboard are waiting for WG members' comments:

ATLAS workflow Readiness Verification Status:

| MW Product | Version | Volunteer Site(s) | Comments | Verification status |
| dCache | 2.16.x | NDGF | JIRA:MWR-131 | ongoing |
| dCache | 2.13.30 | Triumf | JIRA:MWR-130 | Installed already in prod for Triumf |
| UI bundle | centos7-ui-0.1 | CERN | JIRA:MWR-128, verification on CentOS also for CMS. The version number is just a placeholder by Matt Doidge, who made the bundle available in CVMFS at the end of May. Cristina Aiftimiei built the rpms for the emi repo | Please see dedicated discussion slot in this meeting |
| FTS | 3.4.7 | CERN | JIRA:MWR-133, also for CMS | ongoing |
| StoRM | 1.11.11 | CNAF | JIRA:MWR-127 | Completed in prod |
| DPM | 1.8.11 | Edinburgh=UK-SCOTGRID-ECDF | JIRA:MWR-125 | Completed |
| DPM (srm-less) | 1.8.11 | LAPP Annecy | JIRA:MWR-104, last update in the ticket reports the new DPM 1.8.11 now installed on LAPP testbed | on-going |

CMS workflow Readiness Verification Status

| MW Product | Version | Volunteer Site(s) | Comments | Verification status |
| DPM | 1.8.11 | GRIF_LLR | JIRA:MWR-124 | Completed |
| dCache | 2.16.4 | PIC | JIRA:MWR-134 | ongoing |

Discussions around CentOS 7 UI and WN bundles

  • UI bundle on CentOS7 available in CVMFS and now via RPM (to be tested and pushed to the UMD Preview repo)
    • Missing dependencies (cream-cli, wms-cli)
  • WN bundle to be prepared; Matt stated that he could work on it. The RPM will then follow (to contact Cristina)
  • UI bundle can be tested at CERN, but for the WN one we would need some volunteer sites
    • ATLAS already has a testing queue; Edinburgh was interested in this
  • It would be great to have both the WN and UI bundles ready and released in UMD after the summer

Matt confirmed that Edinburgh is indeed interested in this, because they are moving to large CentOS7 installations on site. Andrea M. will open a JIRA ticket to monitor progress.

From the discussions with the experiments about their workflows: ATLAS has a dedicated twiki linked from their workflow document. CMS and ALICE are in the position described in the Action List below, namely not there yet. LHCb hasn't answered, but they did ask for clients on CentOS7 in the grid application area of CVMFS, in afs. Indeed, David Smith is working on this port.

Sites that need to move to CentOS7 are interested in getting the MW verified.

WLCG MW Readiness Software Status

  • NTR

Sites' feedback

  • No report.

Special topic

  • Major releases coming out this year (that we are aware of)
    • new Cream-CE release (both SL6 and CentOS7) scheduled by the end of the year
    • dCache new golden release 2.16 (already out) (SL6 and CentOS7)
    • DPM 1.9.0 (SL6 and CentOS7) scheduled for November
    • ??
  • Looking for CentOS7 volunteers for:
    • dCache
    • DPM
    • ARC-CE
  • MW readiness mandate review and products review? Postponed to the next meeting due to lack of participants today.

Report from recent ARGUS meetings

  • main items for MW Readiness:
    • new Argus 1.7 beta rpms were created that fix:
      • the mapping bug that affected simple CMS proxies
      • a few long-standing minor issues e.g. with the startup scripts
    • the new rpms have been tested on one CentOS7 host in the Argus cluster at CERN
      • the host was repeatedly included in the cluster during a few days
      • its logs were checked for unexpected failures
      • its effects on the shared gridmapdir were checked as well
      • all looked OK!
    • to facilitate the upgrade of the complete cluster, we have asked EGI to copy the new rpms into the UMD Preview repository
      • pending the release notes
    • after a short Staged Rollout phase the release should become official
      • to be continued...

Maarten foresees smooth operation in the future, as 10% of the ARGUS cluster nodes will point to the UMD Preview repository, so new versions will automatically be tried and rolled into production.

Actions

Action items Done from past meetings can be found HERE.

  • 20160518-02: Expansion of the experiments' CentOS7 intentions to: Pending
    • ALICE: Maarten to check and bring the experiment's intentions to the next meeting. So far, ALICE runs on SL6 with binaries built on SL5 and it works, but in the future this might not be the case.
    • LHCb: Joel/Stefan to give us experiment intentions
  • 20160127-02: David C. and Andrea S. to obtain their experiments' plans concerning EL7 and/or CentOS7. On-going
    • ATLAS: Information is collected in this ATLAS twiki. See in particular the statement on ATLAS migration
    • CMS: The CMS software built on SLC6 is known to be not binary compatible with an OS other than SLC6. CMS is evaluating a container based approach to allow running SLC6 (or other) binaries on WNs with CC7 or other OS versions. In addition, CMSSW is routinely built on the CC7 architecture as a possible future production architecture. Formal physics validation of CMSSW on CentOS7 hasn't started yet, but CMS is definitely doing more than just building on it.
  • 20160127-01: Andrea M., Andrea S., David C., Paul M. to see how the nightly data scratch can be handled so that the Prometheus dCache tests can start (JIRA:MWREADY:36). The last update of this ticket dates from June 2015. If there is no interest currently, we should probably close the ticket and this action. Decided to close the action and verify via the JIRA ticket whether Prometheus contains CentOS7 nodes. Prometheus's bio is here, as promised at the meeting. Closed

Next meeting

  • Proposed date is Wed Nov. 2nd @ 4pm CET. Objections to the e-group a.s.a.p. please!

AOB

-- MariaDimou - 2016-06-06

Topic revision: r110 - 2018-02-28 - MaartenLitmaath