WorkflowTeamMeeting20150702 (2015-07-02, JulianBadillo)
---+!! Workflow Team Meeting - July 2nd 4 PM CERN, 9 AM FNAL time
%TOC{depth="3" title="Contents:"}%

---++ Vidyo Link
   * https://indico.cern.ch/event/402959/

---++ Attending
   * US: Ajit, Matteo
   * EU: Dima, Alan, Julian, Andrew

---++ Personnel
   * Jen off June 26 - July 5 - will have e-mail access but painfully slow internet
   * Jen off Aug 10-26 (tentative)
   * SeangChan July 27-31
   * Julian Sept 14-30
   * Alan Jul 11-20, he guesses

---++ News
   * No news

---++ 3 top issues affecting production
   * GlideIn - Collector problems after an update; things look better now.
      * Some stuff delayed in acquired.
   * RunIISpring15DR74: (Exit Code: 8003) = Step3 miniaod problem, across sites, reported to PPD HN [[https://hypernews.cern.ch/HyperNews/CMS/get/dataopsrequests/7457.html][HN link]]
      * ACDCs for some of them don't do any good.
   * ACDCs stuck in acquired in Global Queue - SeangChan worked on them - any updates?
   * Global Queue got wrong from PhEDEx
      * https://cms-logbook.cern.ch/elog/Workflow+processing/20907
      * No progress on this.

---++ Site support - John
   * Some test wfs on TH_CUNSTDA, RU_IHEP
      * Maybe we need to kill them to upgrade the agent.

---++ Workflows

---+++ ReDigi
   * Batch of ~140 RunIISpring15DR74 got in

---+++ TaskChains
   * GEN-SIM sample partially lost due to site storage issues, after being announced. [[https://cms-logbook.cern.ch/elog/Workflow+processing/20935][see elog]]
      * Reported here: [[https://hypernews.cern.ch/HyperNews/CMS/get/dataopsrequests/7295/1.html]]
   * Batch of ~130 RunIIWinter15wmLHE got in

---+++ MonteCarlo
   * EXO-RunIIWinter15GS workflows with high failure rates:
      * maxRSS exceeded.
      * All wfs aborted.
      * Discussion still going on [[https://hypernews.cern.ch/HyperNews/CMS/get/comp-ops/2384/1/4/3/1/1/1/1.html][HN link]]

---+++ ReReco's
   * Tidy-up going on.
      * https://cms-logbook.cern.ch/elog/Workflow+processing/20961
      * General killing/cloning/closing, any feedback?

---+++ Store Results
   * NTR

---++ Agent Issues

---+++ Redeployment Plan
   * Only two agents remaining. Everything else was upgraded.
   * We got rid of the wfs waiting for the same dataset.

---++ RelVal - Andrew
   * SeangChan is not here
   * Andrew wants to add an exit code for not retrying jobs killed by timeout.
      * This delays the completion of a wf
   * Andrew also asked about the archiving delay.

---++ L3 discussion - Ajit, Jean-Roch, Matteo

---+++ Opportunistic Resources - Stefan
   * Olivier found some issues that are preventing jobs from running properly on HLT.
      * Some details still need to be tuned.
      * Meanwhile we keep pending jobs on HLT.
   * NERSC was able to handle some wfs
   * Matteo is dealing with SDSC
      * Julian is going to re-assign something there, but over the last few days there were many pending jobs and very few running.

---+++ Automatic Assignment And Unified Software
   * JR implemented the deletion of invalidated data.

---+++ AOB

Main.JulianBadillo - 2015-07-02
Topic revision: r4 - 2015-07-02 - JulianBadillo