Difference: AtlasProduction14010002Status (1 vs. 16)

Revision 162008-06-14 - unknown

Line: 1 to 1
 
META TOPICPARENT name="AndresPacheco"
Changed:
<
<
<!-- /ActionTrackerPlugin -->
>
>

E ditW YSIWYGA ttachPDFP rintable

<!-- /patternToolBar-->

<!-- /patternTop-->

r15 - 21 May 2008 - 17:17:09 - AndresPacheco You are here: TWiki >   Main Web >  TWikiUsers > AndresPacheco  > AtlasProduction14010002Status

<!-- /patternHomePath-->

<!-- /ActionTrackerPlugin -->
 

21 May 2008, Revision 9

Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

Andreu Pacheco-Pages / IFAE-CERN











Changed:
<
<

Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

>
>

Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

 



Changed:
<
<

Current situation: (21 May 2008 17h12)

>
>

Current situation: (21 May 2008 17h12)

 


Cache is public since 21 May 2008.

New geometry ATLAS-CSC-05-01-00 in FCT since 16 May 2008

Changed:
<
<

Proposed strategy: (16 May 2008 9h56)

>
>

Proposed strategy: (16 May 2008 9h56)

 

Follow and create bugs in savannah for RTT, FCT and grid validation jobs.

Changed:
<
<

RTT Tests (rel_1, 20 May 2008 11:03)

  • EvgenJobTransforms errors:

    • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

      • Atlfast.5870.ttH_poslepnu_jj_bb

  • RecJobTransforms errors:

    • BUG#36625: CPU time limit exceeded, and core dumped.

      • FDR1toESDandAOD

      • RecoTransf_130030

      • RecoTransf_HighStat

    • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

      • recoFastCaloSim_NoTrig

      • recoFastCaloSim

    • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

      • RecoTagTransf_130003

  • SimuJobTransforms: All OK.

FCT Tests AtlasProduction

  • Pileup pcache (checked 20 May, 11:06) All tests OK.
  • Basic-pcache (cheked 20 May 2008, 11h06)

    • Tests with ErrorCode=0:
      • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      •  Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      •  Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • BStoESD-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_BSrecoESD
      • RDOtoBS-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_RDOtoBS
      • Atlfast-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlfast
      • Gen-CSC.006384.PythiaH120gamgam-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_evgen
      • Gen-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_evgen
      • Gen-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_evgen
      • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlasG4
      • SimulReco-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_simul_reco
      • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlasG4
      • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_digi
      • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlasG4
      • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_digi_trf
      • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD

    • BUG#36284: csc_digi TRF_INFILE_TOOFEW
      • csc_digi TRF_INFILE_TOOFEW, who=JobTransform.csc_digi, message=Inputfile/afs/cern.ch/atlas/offline/external/FullChainTest/pcache_14.1.0.Y /last_good_root/ShowerParam-CSC.005200.T1_McAtNlo_Jimmy.HITS.pool.root : too few events (49 < 50) in input file
      • NOTE: This is the effect of an masked error in csc_atlasG4 transform making csc_digi failing.
      • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_digi
        
        
>
>

RTT Tests (rel_1, 20 May 2008 11:03)

  • EvgenJobTransforms errors:

    • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

      • Atlfast.5870.ttH_poslepnu_jj_bb

  • RecJobTransforms errors:

    • BUG#36625: CPU time limit exceeded, and core dumped.

      • FDR1toESDandAOD

      • RecoTransf_130030

      • RecoTransf_HighStat

    • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink?, cannot set index

      • recoFastCaloSim_NoTrig

      • recoFastCaloSim

    • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection? to key: /TRIGGER/HLT/Menu

      • RecoTagTransf_130003

  • SimuJobTransforms: All OK.

FCT Tests AtlasProduction?

  • Pileup pcache (checked 20 May, 11:06) All tests OK.
  • Basic-pcache (cheked 20 May 2008, 11h06)

    • Tests with ErrorCode?=0:
      • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      •  Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      •  Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
      • BStoESD-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_BSrecoESD
      • RDOtoBS-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_RDOtoBS
      • Atlfast-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlfast
      • Gen-CSC.006384.PythiaH120gamgam-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_evgen
      • Gen-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_evgen
      • Gen-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_evgen
      • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlasG4
      • SimulReco-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_simul_reco
      • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlasG4
      • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_digi
      • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlasG4
      • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_digi_trf
      • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
      • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD

    • BUG#36284: csc_digi TRF_INFILE_TOOFEW
      • csc_digi TRF_INFILE_TOOFEW, who=JobTransform.csc_digi, message=Inputfile/afs/cern.ch/atlas/offline/external/FullChainTest/pcache_14.1.0.Y /last_good_root/ShowerParam-CSC.005200.T1_McAtNlo_Jimmy.HITS.pool.root : too few events (49 < 50) in input file
      • NOTE: This is the effect of an masked error in csc_atlasG4 transform making csc_digi failing.
      • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_digi
        
        
 
  • Changed:
    <
    <
    BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu
    >
    >
    BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection? to key: /TRIGGER/HLT/Menu
     
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_buildTAG
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_buildTAG
      

  • Long pcache (19 May, 17:18)

    • OK with Error Code =0
      • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoAOD
    •  *BUG#36726*: TRF_SEGFAULT, message=FATAL 2008-May-19 06:55:13 [static void 
      ers::ErrorHandler::SignalHandler::action(...) 
      at ers/src/ErrorHandler.cxx:88] 
      Got signal 11 Segmentation fault (invalid memory reference)
      • VMem = 2434.637 MB, RSS  = 1915.984 MB, malloc =  694.045 MB
      • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoESD
    Changed:
    <
    <

    Open Issues (18 May 2008, 13h11)

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests is low at the level of 57% (rel_0).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.

    ISSUE 080519-2051: Is Tag Building working without trigger?A task has been submitted to verify if the problem with Tag building is related to trigger or not.
    >
    >

    Open Issues (18 May 2008, 13h11)

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1?. The rate of successful RTT tests is low at the level of 57% (rel_0).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.

    ISSUE 080519-2051: Is Tag Building working without trigger?A task has been submitted to verify if the problem with Tag building is related to trigger or not.
     
    • Task valid1.005200.T1_McAtNlo_Jimmy.merge.TAG.e322_s412_r413_t40_tid022339 submitted 19 May 2008.
    Changed:
    <
    <

    Validation bugs 14.1.0.1 (16b May 2008)

    >
    >

    Validation bugs 14.1.0.1 (16b May 2008)

     

    36726 Atlas Validation - csc_recoESD TRF_SEGFAULT Crash in Full Chain Test job in Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1

    Changed:
    <
    <
    >
    >
     
    • With same tags suceeded in rel_0 but failed in rel_1
    Changed:
    <
    <
    >
    >
     
    • Bug opened by Andreu Pacheco on 19 May 2008.

    36666 Atlas Validation - 14.1.0.1 valid1 reco task 22282 failures: TRF_UNKNOWN | Failed to convert object to persistent type: St9bad_alloc

    Line: 54 to 65
     

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.
    • Iacopo reports that people is investigating on 19 May 2008.

    36648 (36633) - Atlas Trigger (Atlas Validation) - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!

    Changed:
    <
    <
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    >
    >
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

     
    • Bug submitted 14 May 2008
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.
    • Claimed to be a duplicate of 36608
    Changed:
    <
    <

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.
    • Zachary reports maybe duplicate bug with #35909 but keeps this bug open on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) GONE Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      • Assigned to Michael Duehrssen on 19 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      • Claire reported that one test will be removed and number of events reduced on 16 May 2008.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

      • Sebastian Binet made two tags to solve it (PerfMonComps-00-14-04, PyUtils-00-03-11), they were submitted to 14.1.0.10 on 16 May 2008. Not in Atlas Production.

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue. To ignore.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      • Brigitte epp copied all input files to lxplus on 19 May 2008.

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      • David Quarrie requested confirmation that bug was fixed on 16 May 2008.

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    TAGS PENDING IN ATLASPOINT1 (checked 18 May 2008 14h54)

    >
    >

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.
    • Zachary reports maybe duplicate bug with #35909 but keeps this bug open on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) GONE Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms? recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink?, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder?) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate?, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject? (clid/key):1334834594 TrackParticleCandidate?, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD? reported an ERROR, but returned a StatusCode? "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      • Assigned to Michael Duehrssen on 19 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform? failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      • Claire reported that one test will be removed and number of events reduced on 16 May 2008.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

      • Sebastian Binet made two tags to solve it (PerfMonComps-00-14-04, PyUtils?-00-03-11), they were submitted to 14.1.0.10 on 16 May 2008. Not in Atlas Production.

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore? service not found! | ERROR Unable to initialize Service: GeoModelSvc? | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError?: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue. To ignore.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder? reported an ERROR, but returned a StatusCode? "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter?-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit?-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject?

      TRF_UNKNOWN | Could not create Rep for DataObject? (clid/key):210948284 LumiBlocks? | Could not create Rep for DataObject? (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit?-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      • Brigitte epp copied all input files to lxplus on 19 May 2008.

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer? cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      • David Quarrie requested confirmation that bug was fixed on 16 May 2008.

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder? errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    TAGS PENDING IN ATLASPOINT1 (checked 18 May 2008 14h54)

     

    Line: 78 to 89
      InDetAlignmentMonitoring-00-02-07 (update, Tobis Golling, in Atlas Production)

    OutputStreamAthenaPool-00-01-29 (in Atlasproduction)

    Changed:
    <
    <

    PENDING TAGS ATLASPRODUCTION 14.1.0.2: (16 May 2008, 11:09)

    >
    >

    PENDING TAGS ATLASPRODUCTION 14.1.0.2: (16 May 2008, 11:09)

     

    None

    Changed:
    <
    <

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (19 May 2008 08:10)


    CaloMonitoring-00-00-94 | <none> /Calorimeter/CaloMonitoring
    OutputStreamAthenaPool-00-01-29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool
    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions
    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms
    InDetTrigRecExample-00-06-90-05 | InDetTrigRecExample-00-06-90-04 /InnerDetector/InDetExample/InDetTrigRecExample
    InDetAlignmentMonitoring-00-02-07 | InDetAlignmentMonitoring-00-02-04 /InnerDetector/InDetMonitoring/InDetAlignmentMonitoring
    InDetPerformanceMonitoring-00-00-08 | InDetPerformanceMonitoring-00-00-04 /InnerDetector/InDetMonitoring/InDetPerformanceMonitoring
    PixelMonitoring-00-03-27 | <none> /InnerDetector/InDetMonitoring/PixelMonitoring
    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils/TRT_TR_Process
    LArG4FastSimulation-00-02-30 | <none> /LArCalorimeter/LArG4/LArG4FastSimulation
    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4ShowerLibSvc
    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Validation
    RecExCommission-00-02-59 | RecExCommission-00-02-60 /Reconstruction/RecExample/RecExCommission
    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample/RecExRecoTest
    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample/RecExTrigTest
    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApps
    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel
    TrigT1Monitoring-00-00-12 | <none> /Trigger/TrigT1/TrigT1Monitoring

    PROCEDURES TO BE FOLLOWED (14 May 2008, 14:51)

    >
    >

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (19 May 2008 08:10)


    CaloMonitoring?-00-00-94 | <none> /Calorimeter/CaloMonitoring
    OutputStreamAthenaPool?-00-01-29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool
    EvgenJobOptions?-00-00-46 | <none> /Generators/EvgenJobOptions
    EvgenJobTransforms?-00-06-04 | <none> /Generators/EvgenJobTransforms
    InDetTrigRecExample?-00-06-90-05 | InDetTrigRecExample?-00-06-90-04 /InnerDetector/InDetExample/InDetTrigRecExample
    InDetAlignmentMonitoring?-00-02-07 | InDetAlignmentMonitoring?-00-02-04 /InnerDetector/InDetMonitoring/InDetAlignmentMonitoring
    InDetPerformanceMonitoring?-00-00-08 | InDetPerformanceMonitoring?-00-00-04 /InnerDetector/InDetMonitoring/InDetPerformanceMonitoring
    PixelMonitoring?-00-03-27 | <none> /InnerDetector/InDetMonitoring/PixelMonitoring
    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils/TRT_TR_Process
    LArG4FastSimulation?-00-02-30 | <none> /LArCalorimeter/LArG4/LArG4FastSimulation
    LArG4ShowerLibSvc?-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4ShowerLibSvc
    LArG4Validation?-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Validation
    RecExCommission?-00-02-59 | RecExCommission?-00-02-60 /Reconstruction/RecExample/RecExCommission
    <none> | RecExRecoTest?-00-00-59 /Reconstruction/RecExample/RecExRecoTest
    <none> | RecExTrigTest?-00-00-30 /Reconstruction/RecExample/RecExTrigTest
    G4AtlasApps?-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApps
    TileGeoModel?-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel
    TrigT1Monitoring?-00-00-12 | <none> /Trigger/TrigT1/TrigT1Monitoring

    PROCEDURES TO BE FOLLOWED (14 May 2008, 14:51)

     
    Changed:
    <
    <
    Look at RTT tests called Event,Simu or Reco JobTransforms,open and follow up tickets.
    >
    >
    Look at RTT tests called Event,Simu or Reco JobTransforms?,open and follow up tickets.
     
  • Look at FCT tests daily

  • Changed:
    <
    <
    Recommended that tags go into AtlasPoint1 first and then in AtlasProduction with the exception of Simulation tags.
    >
    >
    Recommended that tags go into AtlasPoint1? first and then in AtlasProduction? with the exception of Simulation tags.
     
  • Follow up bugs and check daily their status.

  • Line: 112 to 123
     Expert in Atlfast II - Michael Duehrsenn







    Changed:
    <
    <
    -- AndresPacheco - 15 May 2008
    >
    >
    -- AndresPacheco - 15 May 2008

    <!-- /patternTopic-->

    <!-- /patternContent-->

    E dit | W YSIWYG | A ttach | P rintable | C lone | R aw View | Backlinks: We b, A l l Webs | H istory: r15  <  r14  <  r13  <  r12  <  r11 | M ore topic actions

    <!--/patternTopicActions-->

    <!-- /patternMoved-->
    <!-- /patternMainContents-->
    <!-- /patternMain-->
    <!-- /patternLeftBar-->
    <!-- /patternFloatWrap-->

     

    <!-- /patternOuter-->
    <!-- /patternWrapper-->
    | CERN |

    |
    <!-- /patternTopBar-->
    This site is powered by the TWiki collaboration platformCopyright &© by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
    Ideas, requests, problems regarding TWiki? Ask a support question or Send feedback
    <!-- /patternBottomBarContents-->
    <!-- /patternBottomBar-->
    <!-- /patternPage-->
    <!-- /patternPageShadow-->
    <!-- /patternScreen-->
     \ No newline at end of file

    Revision 152008-05-21 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Changed:
    <
    <

    20 May 2008, Revision 9

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











    >
    >

    21 May 2008, Revision 9

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











     

    Line: 10 to 10
     



    Changed:
    <
    <

    Current situation: (19 May 2008 17h17)


    Next cache 14.1.0.2 expected Wed 21 May 2008

    New geometry ATLAS-CSC-05-01-00 in FCT since 16 May 2008

    >
    >

    Current situation: (21 May 2008 17h12)


    Cache is public since 21 May 2008.

    New geometry ATLAS-CSC-05-01-00 in FCT since 16 May 2008

     

    Proposed strategy: (16 May 2008 9h56)

    Follow and create bugs in savannah for RTT, FCT and grid validation jobs.

    RTT Tests (rel_1, 20 May 2008 11:03)

    Revision 142008-05-20 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Changed:
    <
    <

    18 May 2008, Revision 8

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











    >
    >

    20 May 2008, Revision 9

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











     

    Line: 14 to 14
     


    Next cache 14.1.0.2 expected Wed 21 May 2008

    New geometry ATLAS-CSC-05-01-00 in FCT since 16 May 2008

    Proposed strategy: (16 May 2008 9h56)

    Follow and create bugs in savannah for RTT, FCT and grid validation jobs.

    Changed:
    <
    <

    RTT Tests (rel_0, 19 May 2008 12:54)

    • EvgenJobTransforms errors:

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors:

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • SimuJobTransforms errors

      • Error with error code 0 ???
        • T1_McAtNlo_Jimmy
        • HeavyIons_Simulation-Digits
    >
    >

    RTT Tests (rel_1, 20 May 2008 11:03)

    • EvgenJobTransforms errors:

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors:

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • SimuJobTransforms: All OK.

     

    FCT Tests AtlasProduction

    Changed:
    <
    <
    • Pileup pcache (checked 19 May, 12:59) All tests OK.
    • Basic-pcache (cheked 18 May 2008, 13h22)

      • Tests with ErrorCode=0:
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • BStoESD-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_BSrecoESD
        • RDOtoBS-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_RDOtoBS
        • Atlfast-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlfast
        • Gen-CSC.006384.PythiaH120gamgam-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • Gen-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • Gen-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimulReco-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_simul_reco
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD

      • BUG#36284: csc_digi TRF_INFILE_TOOFEW
        • csc_digi TRF_INFILE_TOOFEW, who=JobTransform.csc_digi, message=Inputfile/afs/cern.ch/atlas/offline/external/FullChainTest/pcache_14.1.0.Y /last_good_root/ShowerParam-CSC.005200.T1_McAtNlo_Jimmy.HITS.pool.root : too few events (49 < 50) in input file
        • NOTE: This is the effect of an masked error in csc_atlasG4 transform making csc_digi failing.
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_digi
          
      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoESD
    >
    >
    • Pileup pcache (checked 20 May, 11:06) All tests OK.
    • Basic-pcache (cheked 20 May 2008, 11h06)

      • Tests with ErrorCode=0:
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
        •  Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
        •  Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD
        • BStoESD-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_BSrecoESD
        • RDOtoBS-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_RDOtoBS
        • Atlfast-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlfast
        • Gen-CSC.006384.PythiaH120gamgam-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_evgen
        • Gen-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_evgen
        • Gen-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_evgen
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlasG4
        • SimulReco-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_simul_reco
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlasG4
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_digi
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_atlasG4
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_digi_trf
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoESD
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_recoAOD

      • BUG#36284: csc_digi TRF_INFILE_TOOFEW
        • csc_digi TRF_INFILE_TOOFEW, who=JobTransform.csc_digi, message=Inputfile/afs/cern.ch/atlas/offline/external/FullChainTest/pcache_14.1.0.Y /last_good_root/ShowerParam-CSC.005200.T1_McAtNlo_Jimmy.HITS.pool.root : too few events (49 < 50) in input file
        • NOTE: This is the effect of an masked error in csc_atlasG4 transform making csc_digi failing.
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_digi
          
      BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu
    Changed:
    <
    <
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
      
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
  • Long pcache (19 May, 17:18)

    • OK with Error Code =0
      • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoAOD
    •  *BUG#36726*: TRF_SEGFAULT, message=FATAL 2008-May-19 06:55:13 [static void 
      
      
    >
    >
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_buildTAG
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_2-19May08.23h15-csc_buildTAG
      

  • Long pcache (19 May, 17:18)

    • OK with Error Code =0
      • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoAOD
    •  *BUG#36726*: TRF_SEGFAULT, message=FATAL 2008-May-19 06:55:13 [static void 
      
      
     ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
    • VMem = 2434.637 MB, RSS  = 1915.984 MB, malloc =  694.045 MB
    • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoESD
    Line: 59 to 59
     
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.
    • Claimed to be a duplicate of 36608
    Changed:
    <
    <

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.
    • Zachary reports maybe duplicate bug with #35909 but keeps this bug open on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      • Assigned to Michael Duehrssen on 19 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      • Claire reported that one test will be removed and number of events reduced on 16 May 2008.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        
        
    >
    >

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.
    • Zachary reports maybe duplicate bug with #35909 but keeps this bug open on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) GONE Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      • Assigned to Michael Duehrssen on 19 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      • Claire reported that one test will be removed and number of events reduced on 16 May 2008.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        
        
     producer=csc_recoESD
  • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

  • Sebastian Binet made two tags to solve it (PerfMonComps-00-14-04, PyUtils-00-03-11), they were submitted to 14.1.0.10 on 16 May 2008. Not in Atlas Production.

  • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
    • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

    • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

    • Claimed to be fixed by 36418 on 16 May 2008.

    36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

    • Last updated May 15th. Andreu submitted identical task.

    • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

    • Clased bug as new task has a new bug report.

    36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

    • Updated May 15th. Assigned to Andrea di Simone.

    • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

    TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Assigned to Simon George. Last update May 14th.

    • Probable high memory consumption related issue. To ignore.

    • DQ recomments to ignore because duplicate of 35289 (?)

    36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

    36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

    Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

    • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

    36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

    Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

    • Last update May 13th. Assigned to Sofia Valldecorsa.

    • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

    36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

    TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

    • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

    • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

    36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

    • Last updated May 15th. Assigned to John Apostolakis

    • Brigitte epp copied all input files to lxplus on 19 May 2008.

    34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

    • Last update May 14th. Assigned to Edward Moyse

    • David Quarrie requested confirmation that bug was fixed on 16 May 2008.

    35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

    • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

  • TAGS PENDING IN ATLASPOINT1 (checked 18 May 2008 14h54)

    Revision 132008-05-19 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Line: 22 to 22
      BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
      
      
    Changed:
    <
    <
  • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
  • Long pcache (19 May, 17:18)

    • OK with Error Code =0
      • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoAOD
    • TRF_SEGFAULT, message=FATAL 2008-May-19 06:55:13 [static void 
      
      
    >
    >
  • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
  • Long pcache (19 May, 17:18)

    • OK with Error Code =0
      • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoAOD
    •  *BUG#36726*: TRF_SEGFAULT, message=FATAL 2008-May-19 06:55:13 [static void 
      
      
     ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
    • VMem = 2434.637 MB, RSS  = 1915.984 MB, malloc =  694.045 MB
    • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoESD

    Open Issues (18 May 2008, 13h11)

    Changed:
    <
    <

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests is low at the level of 57% (rel_0).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.
    >
    >

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests is low at the level of 57% (rel_0).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.

    ISSUE 080519-2051: Is Tag Building working without trigger?A task has been submitted to verify if the problem with Tag building is related to trigger or not.
    • Task valid1.005200.T1_McAtNlo_Jimmy.merge.TAG.e322_s412_r413_t40_tid022339 submitted 19 May 2008.
     

    Validation bugs 14.1.0.1 (16b May 2008)

    Added:
    >
    >
    36726 Atlas Validation - csc_recoESD TRF_SEGFAULT Crash in Full Chain Test job in Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1
     36666 Atlas Validation - 14.1.0.1 valid1 reco task 22282 failures: TRF_UNKNOWN | Failed to convert object to persistent type: St9bad_alloc
    • INFO Total Virtual memory = 3100.93 Real 2498.63
    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: St9bad_alloc,producer=csc_recoESD
    Line: 47 to 54
     

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.
    • Iacopo reports that people is investigating on 19 May 2008.

    36648 (36633) - Atlas Trigger (Atlas Validation) - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!

    Changed:
    <
    <
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    >
    >
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

     
    • Bug submitted 14 May 2008
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.
    • Claimed to be a duplicate of 36608
    Changed:
    <
    <

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.
    • Zachary reports maybe duplicate bug with #35909 but keeps this bug open on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      • Assigned to Michael Duehrssen on 19 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      • Claire reported that one test will be removed and number of events reduced on 16 May 2008.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

      • Sebastian Binet made two tags to solve it (PerfMonComps-00-14-04, PyUtils-00-03-11), they were submitted to 14.1.0.10 on 16 May 2008. Not in Atlas Production.

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue. To ignore.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      • Brigitte epp copied all input files to lxplus on 19 May 2008.

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      • David Quarrie requested confirmation that bug was fixed on 16 May 2008.

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    >
    >

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.
    • Zachary reports maybe duplicate bug with #35909 but keeps this bug open on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      • Assigned to Michael Duehrssen on 19 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      • Claire reported that one test will be removed and number of events reduced on 16 May 2008.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

      • Sebastian Binet made two tags to solve it (PerfMonComps-00-14-04, PyUtils-00-03-11), they were submitted to 14.1.0.10 on 16 May 2008. Not in Atlas Production.

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue. To ignore.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      • Brigitte epp copied all input files to lxplus on 19 May 2008.

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      • David Quarrie requested confirmation that bug was fixed on 16 May 2008.

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

     

    TAGS PENDING IN ATLASPOINT1 (checked 18 May 2008 14h54)

    Revision 122008-05-19 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Line: 10 to 10
     



    Changed:
    <
    <

    Current situation: (16 May 2008 12h20)


    Next cache 14.1.0.2 expected Tue 20 May 2008

    New geometry ATLAS-CSC-05-01-00 in FCT since 16 May 2008

    >
    >

    Current situation: (19 May 2008 17h17)


    Next cache 14.1.0.2 expected Wed 21 May 2008

    New geometry ATLAS-CSC-05-01-00 in FCT since 16 May 2008

     

    Proposed strategy: (16 May 2008 9h56)

    Follow and create bugs in savannah for RTT, FCT and grid validation jobs.

    RTT Tests (rel_0, 19 May 2008 12:54)

    Line: 22 to 22
      BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
      
      
    Changed:
    <
    <
  • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
  • Long pcache (18 May, 17:41) All tests OK

  • >
    >
  • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
  • Long pcache (19 May, 17:18)

    • OK with Error Code =0
      • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoAOD
    • TRF_SEGFAULT, message=FATAL 2008-May-19 06:55:13 [static void 
      ers::ErrorHandler::SignalHandler::action(...) 
      at ers/src/ErrorHandler.cxx:88] 
      Got signal 11 Segmentation fault (invalid memory reference)
      • VMem = 2434.637 MB, RSS  = 1915.984 MB, malloc =  694.045 MB
      • Rec_on_12.0.6.5-BasicPlus1KGridRDOEvts-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoESD
     

    Open Issues (18 May 2008, 13h11)

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests is low at the level of 57% (rel_0).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.

    Validation bugs 14.1.0.1 (16b May 2008)

    Revision 112008-05-19 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Line: 11 to 11
     



    Current situation: (16 May 2008 12h20)

    Changed:
    <
    <

    The Atlasproduction 14.1.0.2 cache contains all tags pending in AtlasPoint1 14.1.0.1.

    Next cache 14.1.0.2 expected Tue 20 May 2008

    New geometry ATLAS-CSC-05-01-00 in FCT since 16 May 2008

    >
    >


    Next cache 14.1.0.2 expected Tue 20 May 2008

    New geometry ATLAS-CSC-05-01-00 in FCT since 16 May 2008

     

    Proposed strategy: (16 May 2008 9h56)

    Follow and create bugs in savannah for RTT, FCT and grid validation jobs.

    Changed:
    <
    <

    RTT Tests (rel_5, 17 May 2008 12:37)

    • EvgenJobTransforms errors:

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors:

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • No SimuJobTransforms errors

    >
    >

    RTT Tests (rel_0, 19 May 2008 12:54)

    • EvgenJobTransforms errors:

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors:

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • SimuJobTransforms errors

      • Error with error code 0 ???
        • T1_McAtNlo_Jimmy
        • HeavyIons_Simulation-Digits
     

    FCT Tests AtlasProduction

    Changed:
    <
    <
    • Pileup pcache (checked 18 May, 09:11) All tests OK.
    • Basic-pcache (cheked 18 May 2008, 13h22)

      • Tests with ErrorCode=0:
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • BStoESD-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_BSrecoESD
        • RDOtoBS-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_RDOtoBS
        • Atlfast-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlfast
        • Gen-CSC.006384.PythiaH120gamgam-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • Gen-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • Gen-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimulReco-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_simul_reco
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD

      • BUG#36284: csc_digi TRF_INFILE_TOOFEW
        • csc_digi TRF_INFILE_TOOFEW, who=JobTransform.csc_digi, message=Inputfile/afs/cern.ch/atlas/offline/external/FullChainTest/pcache_14.1.0.Y /last_good_root/ShowerParam-CSC.005200.T1_McAtNlo_Jimmy.HITS.pool.root : too few events (49 < 50) in input file
        • NOTE: This is the effect of an masked error in csc_atlasG4 transform making csc_digi failing.
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
          
      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
    >
    >
    • Pileup pcache (checked 19 May, 12:59) All tests OK.
    • Basic-pcache (cheked 18 May 2008, 13h22)

      • Tests with ErrorCode=0:
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • BStoESD-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_BSrecoESD
        • RDOtoBS-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_RDOtoBS
        • Atlfast-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlfast
        • Gen-CSC.006384.PythiaH120gamgam-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • Gen-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • Gen-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimulReco-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_simul_reco
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD

      • BUG#36284: csc_digi TRF_INFILE_TOOFEW
        • csc_digi TRF_INFILE_TOOFEW, who=JobTransform.csc_digi, message=Inputfile/afs/cern.ch/atlas/offline/external/FullChainTest/pcache_14.1.0.Y /last_good_root/ShowerParam-CSC.005200.T1_McAtNlo_Jimmy.HITS.pool.root : too few events (49 < 50) in input file
        • NOTE: This is the effect of an masked error in csc_atlasG4 transform making csc_digi failing.
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_digi
          
      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_recoESD
      BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu
    Changed:
    <
    <
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_buildTAG
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_buildTAG
  • Long pcache (17 May, 13:11) All tests OK

  • >
    >
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
      
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_1-19May08.01h15-csc_buildTAG
  • Long pcache (18 May, 17:41) All tests OK

  •  

    Open Issues (18 May 2008, 13h11)

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests is low at the level of 57% (rel_0).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.

    Validation bugs 14.1.0.1 (16b May 2008)

    Line: 35 to 36
     
    • 100% failures Task 22282 valid1.005640.CharybdisJimmy.recon.e322_s429_r426
    • Bug opened by Yuri on 16 May 2008.
    • Andreu opinion is a memory allocation problem due to a high memory usage raised on 17 May 2008.
    Added:
    >
    >
    • David Quarrie recommends veto to reconstruction of exotics must be done with trigger on on 19 May 2008.
      36663 Atlas Reconstruction - csc_BSreco_trf issue - 14.1.0.1 reco from 13.0.40.5 BS - Wrong reconstruction for staco muons
    • Bug Opened by Iacopo on 16 May 2008. No person assigned.
    • 100% impact on Task 22305 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r423
    Changed:
    <
    <

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.
    >
    >

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.
    • Iacopo reports that people is investigating on 19 May 2008.
      36648 (36633) - Atlas Trigger (Atlas Validation) - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    Line: 47 to 49
     
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.
    • Claimed to be a duplicate of 36608
    Changed:
    <
    <

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    >
    >

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.
    • Zachary reports maybe duplicate bug with #35909 but keeps this bug open on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      • Assigned to Michael Duehrssen on 19 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      • Claire reported that one test will be removed and number of events reduced on 16 May 2008.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

      • Sebastian Binet made two tags to solve it (PerfMonComps-00-14-04, PyUtils-00-03-11), they were submitted to 14.1.0.10 on 16 May 2008. Not in Atlas Production.

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue. To ignore.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      • Brigitte epp copied all input files to lxplus on 19 May 2008.

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      • David Quarrie requested confirmation that bug was fixed on 16 May 2008.

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

     

    TAGS PENDING IN ATLASPOINT1 (checked 18 May 2008 14h54)

    Changed:
    <
    <
    TriggerTest-00-01-90 (David Strom)
    >
    >

    InDetPerformanceMonitoring-00-00-09 ( Tobias Golling , needed for FDR2 )
     
    Changed:
    <
    <
    CaloMonitoring-00-00-94 ( Francesco Spano)
    >
    >
    TrigEFMissingET-00-02-27 (Ignacio Aracena, several bug fixes which need to go in for calibration studies)
     
    Changed:
    <
    <
    InDetTrigRecExample-00-06-90-05(Jiri Masik, in Atlas Production)
    >
    >
    MissingETMonitoring-00-00-01 (Michele Consonni, New monitoring histograms )
     
    Changed:
    <
    <
    TrigT1Monitoring-00-00-12 (add, Damien Prieur)
    >
    >
    TriggerTest-00-01-90 (David Strom,affects monitoring only )
     
    Changed:
    <
    <
    InDetPerformanceMonitoring-00-00-08(update,Tobias Golling, in Atlas Production)
    >
    >
    InDetTrigRecExample-00-06-90-05(Jiri Masik, in Atlas Production)
      InDetAlignmentMonitoring-00-02-07 (update, Tobis Golling, in Atlas Production)
    Changed:
    <
    <
    PixelMonitoring -00-03-27 (add, Triplett Nathan, in Atlasproduction)

    TrkExRungeKuttaPropagator-01-01-33 (Emil Obreshkov, in AtlasProduction)

    OutputStreamAthenaPool-00-01-29 (in Atlasproduction)

    >
    >

    OutputStreamAthenaPool-00-01-29 (in Atlasproduction)

     

    PENDING TAGS ATLASPRODUCTION 14.1.0.2: (16 May 2008, 11:09)

    None

    Changed:
    <
    <

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (18 May 2008 13:10)

    >
    >

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (19 May 2008 08:10)

     
    CaloMonitoring-00-00-94 | <none> /Calorimeter/CaloMonitoring
    OutputStreamAthenaPool-00-01-29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool
    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions
    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms
    InDetTrigRecExample-00-06-90-05 | InDetTrigRecExample-00-06-90-04 /InnerDetector/InDetExample/InDetTrigRecExample
    InDetAlignmentMonitoring-00-02-07 | InDetAlignmentMonitoring-00-02-04 /InnerDetector/InDetMonitoring/InDetAlignmentMonitoring
    InDetPerformanceMonitoring-00-00-08 | InDetPerformanceMonitoring-00-00-04 /InnerDetector/InDetMonitoring/InDetPerformanceMonitoring
    PixelMonitoring-00-03-27 | <none> /InnerDetector/InDetMonitoring/PixelMonitoring
    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils/TRT_TR_Process
    LArG4FastSimulation-00-02-30 | <none> /LArCalorimeter/LArG4/LArG4FastSimulation
    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4ShowerLibSvc
    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Validation
    RecExCommission-00-02-59 | RecExCommission-00-02-60 /Reconstruction/RecExample/RecExCommission
    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample/RecExRecoTest
    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample/RecExTrigTest
    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApps
    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel
    TrigT1Monitoring-00-00-12 | <none> /Trigger/TrigT1/TrigT1Monitoring

    PROCEDURES TO BE FOLLOWED (14 May 2008, 14:51)

    Revision 102008-05-19 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Line: 42 to 42
     

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.

    36648 (36633) - Atlas Trigger (Atlas Validation) - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!

    Changed:
    <
    <
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    >
    >
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

     
    • Bug submitted 14 May 2008
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.
    • Claimed to be a duplicate of 36608
    Changed:
    <
    <

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    >
    >

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

     

    TAGS PENDING IN ATLASPOINT1 (checked 18 May 2008 14h54)

    Revision 92008-05-19 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Line: 42 to 42
     

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.

    36648 (36633) - Atlas Trigger (Atlas Validation) - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!

    Changed:
    <
    <
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    >
    >
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

     
    • Bug submitted 14 May 2008
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.
    • Claimed to be a duplicate of 36608
    Changed:
    <
    <

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    >
    >

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

     

    TAGS PENDING IN ATLASPOINT1 (checked 18 May 2008 14h54)

    Line: 71 to 71
     

    PENDING TAGS ATLASPRODUCTION 14.1.0.2: (16 May 2008, 11:09)

    None

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (18 May 2008 13:10)

    Changed:
    <
    <
    CaloMonitoring-00-00-94 | <none> /Calorimeter/CaloMonitoring
    OutputStreamAthenaPool-00-01-29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool
    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions
    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms
    InDetTrigRecExample-00-06-90-05 | InDetTrigRecExample-00-06-90-04 /InnerDetector/InDetExample/InDetTrigRecExample
    InDetAlignmentMonitoring-00-02-07 | InDetAlignmentMonitoring-00-02-04 /InnerDetector/InDetMonitoring/InDetAlignmentMonitoring
    InDetPerformanceMonitoring-00-00-08 | InDetPerformanceMonitoring-00-00-04 /InnerDetector/InDetMonitoring/InDetPerformanceMonitoring
    PixelMonitoring-00-03-27 | <none> /InnerDetector/InDetMonitoring/PixelMonitoring
    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils/TRT_TR_Process
    LArG4FastSimulation-00-02-30 | <none> /LArCalorimeter/LArG4/LArG4FastSimulation
    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4ShowerLibSvc
    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Validation
    RecExCommission-00-02-59 | RecExCommission-00-02-60 /Reconstruction/RecExample/RecExCommission
    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample/RecExRecoTest
    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample/RecExTrigTest
    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApps
    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel
    TrkExRungeKuttaPropagator-01-01-33 | <none> /Tracking/TrkExtrapolation/TrkExRungeKuttaPropagator
    TrigT1Monitoring-00-00-12 | <none> /Trigger/TrigT1/TrigT1Monitoring

    >
    >

    CaloMonitoring-00-00-94 | <none> /Calorimeter/CaloMonitoring
    OutputStreamAthenaPool-00-01-29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool
    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions
    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms
    InDetTrigRecExample-00-06-90-05 | InDetTrigRecExample-00-06-90-04 /InnerDetector/InDetExample/InDetTrigRecExample
    InDetAlignmentMonitoring-00-02-07 | InDetAlignmentMonitoring-00-02-04 /InnerDetector/InDetMonitoring/InDetAlignmentMonitoring
    InDetPerformanceMonitoring-00-00-08 | InDetPerformanceMonitoring-00-00-04 /InnerDetector/InDetMonitoring/InDetPerformanceMonitoring
    PixelMonitoring-00-03-27 | <none> /InnerDetector/InDetMonitoring/PixelMonitoring
    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils/TRT_TR_Process
    LArG4FastSimulation-00-02-30 | <none> /LArCalorimeter/LArG4/LArG4FastSimulation
    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4ShowerLibSvc
    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Validation
    RecExCommission-00-02-59 | RecExCommission-00-02-60 /Reconstruction/RecExample/RecExCommission
    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample/RecExRecoTest
    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample/RecExTrigTest
    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApps
    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel
    TrigT1Monitoring-00-00-12 | <none> /Trigger/TrigT1/TrigT1Monitoring

     

    PROCEDURES TO BE FOLLOWED (14 May 2008, 14:51)

    Revision 82008-05-18 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Changed:
    <
    <

    May 16th 2008, Revision 7

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











    >
    >

    18 May 2008, Revision 8

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











     

    Line: 17 to 17
     

    RTT Tests (rel_5, 17 May 2008 12:37)

    • EvgenJobTransforms errors:

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors:

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • No SimuJobTransforms errors

    FCT Tests AtlasProduction

    Changed:
    <
    <
    • Pileup pcache (checked 17 May, 12:42) All tests OK.
    • Basic-pcache (cheked 17 May 2008, 12h46)

      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_6-17May08.01h15
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_recoESD
    >
    >
    • Pileup pcache (checked 18 May, 09:11) All tests OK.
    • Basic-pcache (cheked 18 May 2008, 13h22)

      • Tests with ErrorCode=0:
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec_on_13.0.30.2-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD
        • BStoESD-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_BSrecoESD
        • RDOtoBS-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_RDOtoBS
        • Atlfast-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlfast
        • Gen-CSC.006384.PythiaH120gamgam-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • Gen-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • Gen-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_evgen
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimulReco-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_simul_reco
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimDig-CSC.005300.PythiaH130zz4l-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_atlasG4
        • SimDig-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Rec-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoAOD

      • BUG#36284: csc_digi TRF_INFILE_TOOFEW
        • csc_digi TRF_INFILE_TOOFEW, who=JobTransform.csc_digi, message=Inputfile/afs/cern.ch/atlas/offline/external/FullChainTest/pcache_14.1.0.Y /last_good_root/ShowerParam-CSC.005200.T1_McAtNlo_Jimmy.HITS.pool.root : too few events (49 < 50) in input file
        • NOTE: This is the effect of an masked error in csc_atlasG4 transform making csc_digi failing.
        • SimDig-ShowerParam-CSC.005200.T1_McAtNlo_Jimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_digi
          
      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_recoESD
      BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu
    Changed:
    <
    <
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_6
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_buildTAG_trf
  • Long pcache (17 May, 22:25) All tests OK

  • Open Issues (17 May 2008, 12h48)

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests is low at the level of 57% (rel_5).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.
    >
    >
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_buildTAG
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_0-18May08.01h15-csc_buildTAG
  • Long pcache (17 May, 13:11) All tests OK

  • Open Issues (18 May 2008, 13h11)

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests is low at the level of 57% (rel_0).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.
     

    Validation bugs 14.1.0.1 (16b May 2008)

    Line: 36 to 37
     
    • Andreu opinion is a memory allocation problem due to a high memory usage raised on 17 May 2008.

    36663 Atlas Reconstruction - csc_BSreco_trf issue - 14.1.0.1 reco from 13.0.40.5 BS - Wrong reconstruction for staco muons

    Changed:
    <
    <
    • Bug Opened by Iacopo on 16 May 2008. No person assigned.
    >
    >
    • Bug Opened by Iacopo on 16 May 2008. No person assigned.
    • 100% impact on Task 22305 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r423
     

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.

    36648 (36633) - Atlas Trigger (Atlas Validation) - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!

    Changed:
    <
    <
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    >
    >
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

     
    • Bug submitted 14 May 2008
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.
    • Claimed to be a duplicate of 36608
    Changed:
    <
    <

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    TAGS PENDING IN ATLASPOINT1 (checked 17 May 2008 13h08)

    >
    >

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    TAGS PENDING IN ATLASPOINT1 (checked 18 May 2008 14h54)

     

    Changed:
    <
    <
    InDetTrigRecExample-00-06-90-05(Jiri Masik, in Atlas Production)
    >
    >
    TriggerTest-00-01-90 (David Strom)

    CaloMonitoring-00-00-94 ( Francesco Spano)

     
    Changed:
    <
    <
    LArG4FastSimulation-00-02-30 (Zachary Marshall, in Atlas Production)
    >
    >
    InDetTrigRecExample-00-06-90-05(Jiri Masik, in Atlas Production)
      TrigT1Monitoring-00-00-12 (add, Damien Prieur)
    Line: 66 to 70
     

    TrkExRungeKuttaPropagator-01-01-33 (Emil Obreshkov, in AtlasProduction)

    OutputStreamAthenaPool-00-01-29 (in Atlasproduction)

    PENDING TAGS ATLASPRODUCTION 14.1.0.2: (16 May 2008, 11:09)

    None

    Changed:
    <
    <

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (15 May 2008 9:30)

    OutputStreamAthenaPool-00-01-29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool
    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions
    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms
    PixelMonitoring-00-03-27 | <none> /InnerDetector/InDetMonitoring/PixelMonitoring
    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils/TRT_TR_Process
    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4ShowerLibSvc
    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Validation
    RecExCommission-00-02-59 | RecExCommission-00-02-60 /Reconstruction/RecExample/RecExCommission
    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample/RecExRecoTest
    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample/RecExTrigTest
    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApps
    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel
    TrkExRungeKuttaPropagator-01-01-33 | <none> /Tracking/TrkExtrapolation/TrkExRungeKuttaPropagator

    >
    >

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (18 May 2008 13:10)

    CaloMonitoring-00-00-94 | <none> /Calorimeter/CaloMonitoring
    OutputStreamAthenaPool-00-01-29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool
    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions
    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms
    InDetTrigRecExample-00-06-90-05 | InDetTrigRecExample-00-06-90-04 /InnerDetector/InDetExample/InDetTrigRecExample
    InDetAlignmentMonitoring-00-02-07 | InDetAlignmentMonitoring-00-02-04 /InnerDetector/InDetMonitoring/InDetAlignmentMonitoring
    InDetPerformanceMonitoring-00-00-08 | InDetPerformanceMonitoring-00-00-04 /InnerDetector/InDetMonitoring/InDetPerformanceMonitoring
    PixelMonitoring-00-03-27 | <none> /InnerDetector/InDetMonitoring/PixelMonitoring
    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils/TRT_TR_Process
    LArG4FastSimulation-00-02-30 | <none> /LArCalorimeter/LArG4/LArG4FastSimulation
    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4ShowerLibSvc
    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Validation
    RecExCommission-00-02-59 | RecExCommission-00-02-60 /Reconstruction/RecExample/RecExCommission
    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample/RecExRecoTest
    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample/RecExTrigTest
    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApps
    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel
    TrkExRungeKuttaPropagator-01-01-33 | <none> /Tracking/TrkExtrapolation/TrkExRungeKuttaPropagator
    TrigT1Monitoring-00-00-12 | <none> /Trigger/TrigT1/TrigT1Monitoring

     

    PROCEDURES TO BE FOLLOWED (14 May 2008, 14:51)

    Revision 72008-05-17 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Line: 20 to 20
     
    • Pileup pcache (checked 17 May, 12:42) All tests OK.
    • Basic-pcache (cheked 17 May 2008, 12h46)

      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_6-17May08.01h15
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_recoESD
      • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

    Changed:
    <
    <
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_6
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_buildTAG_trf
  • Long pcache (15 May, 15:48) All tests OK

  • >
    >
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_6
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_buildTAG_trf
  • Long pcache (17 May, 22:25) All tests OK

  •  

    Open Issues (17 May 2008, 12h48)

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests is low at the level of 57% (rel_5).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.

    Validation bugs 14.1.0.1 (16b May 2008)

    Line: 40 to 40
     

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.

    36648 (36633) - Atlas Trigger (Atlas Validation) - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!

    Changed:
    <
    <
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    >
    >
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

     
    • Bug submitted 14 May 2008
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.
    • Claimed to be a duplicate of 36608
    Changed:
    <
    <

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    >
    >

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

     

    TAGS PENDING IN ATLASPOINT1 (checked 17 May 2008 13h08)

    Revision 62008-05-17 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Line: 11 to 11
     



    Current situation: (16 May 2008 12h20)

    Changed:
    <
    <

    The 14.1.0.Y successful ATN tests did go up from 59% (rel_3) to 55% (rel_5).

    The Atlasproduction 14.1.0.2 cache contains all tags pending in AtlasPoint1 14.1.0.1.

    Next cache 14.1.0.2 expected Mon/Tue 19/20 May 2008

    >
    >

    The Atlasproduction 14.1.0.2 cache contains all tags pending in AtlasPoint1 14.1.0.1.

    Next cache 14.1.0.2 expected Tue 20 May 2008

    New geometry ATLAS-CSC-05-01-00 in FCT since 16 May 2008

     

    Proposed strategy: (16 May 2008 9h56)

    Follow and create bugs in savannah for RTT, FCT and grid validation jobs.

    Changed:
    <
    <

    RTT Tests (rel_5, 16 May 2008 08:37)

    >
    >

    RTT Tests (rel_5, 17 May 2008 12:37)

     
    • EvgenJobTransforms errors:

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors:

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • No SimuJobTransforms errors

    FCT Tests AtlasProduction

    Changed:
    <
    <
    • Pileup pcache (16 May, 8:46) All tests OK.
    • Basic-pcache (16 May 2008, 11h47)

      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Failed Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_5 on 16 May 2008

    >
    >
    • Pileup pcache (checked 17 May, 12:42) All tests OK.
    • Basic-pcache (cheked 17 May 2008, 12h46)

      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_6-17May08.01h15
        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_recoESD
        • Trig_full_no_Bphysics_Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_recoESD
      BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu
    Changed:
    <
    <
    • Failed TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_5 on 16 May 2008
    • Failed Tag-AllBasicSamples-pcache_14.1.0.Y.rel_5 on 16 May 2008
  • Long pcache (15 May, 15:48) All tests OK

  • Open Issues (15 May 2008, 20h26)

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests did go down from 59% (rel_4) to 55% (rel_5).

    ISSUE 080513-1236: New geometry ATLAS-CSC-05-01-00 in FCT.

    • Requested to Seth to use the geometry in FCT on 15 May 2008

    • Seth changed the geometry for AtlasProduction 14.1.0.Y to ATLAS-CSC-05-01-00 on 16 May 2008
    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.
    >
    >
    • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_6
    • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_6-17May08.01h15-csc_buildTAG_trf
  • Long pcache (15 May, 15:48) All tests OK

  • Open Issues (17 May 2008, 12h48)

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests is low at the level of 57% (rel_5).


    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.
     

    Validation bugs 14.1.0.1 (16b May 2008)

    Changed:
    <
    <

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.
    >
    >

    36666 Atlas Validation - 14.1.0.1 valid1 reco task 22282 failures: TRF_UNKNOWN | Failed to convert object to persistent type: St9bad_alloc

    • INFO Total Virtual memory = 3100.93 Real 2498.63
    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: St9bad_alloc,producer=csc_recoESD
    • TRF_UNKNOWN,who=AthenaPoolConverter,message=CreateRep failed, key = HLTAutoKey_L2DsPhiPiFex_1299073032_135,producer=csc_recoESD
    • TRF_UNKNOWN,who=ToolSvc.StreamESDTool,message=Could not create Rep for DataObject (clid/key):1299073032 HLTAutoKey_L2DsPhiPiFex_1299073032_135,producer=csc_recoESD
    • 100% failures Task 22282 valid1.005640.CharybdisJimmy.recon.e322_s429_r426
    • Bug opened by Yuri on 16 May 2008.
    • Andreu opinion is a memory allocation problem due to a high memory usage raised on 17 May 2008.

    36663 Atlas Reconstruction - csc_BSreco_trf issue - 14.1.0.1 reco from 13.0.40.5 BS - Wrong reconstruction for staco muons

    • Bug Opened by Iacopo on 16 May 2008. No person assigned.

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.
      36648 (36633) - Atlas Trigger (Atlas Validation) - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    Line: 37 to 47
     
    • Claimed to be a duplicate of 36608

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    Changed:
    <
    <

    TAGS PENDING IN ATLASPOINT1 (16 May 2008 15h21)

    >
    >

    TAGS PENDING IN ATLASPOINT1 (checked 17 May 2008 13h08)

     

    Changed:
    <
    <
    PixelMonitoring -00-03-27 (add, Triplett Nathan, in Atlasproduction)
    >
    >

    InDetTrigRecExample-00-06-90-05(Jiri Masik, in Atlas Production)

     
    Changed:
    <
    <
    TrigT1CaloCalibTools -00-00-34 (update,Prieur Damien, in AtlasProduction)
    >
    >
    LArG4FastSimulation-00-02-30 (Zachary Marshall, in Atlas Production)
     
    Changed:
    <
    <
    MDTcabling-00-02-46 (add,alessandro.cerri@cern.ch, in AtlasProduction)
    >
    >
    TrigT1Monitoring-00-00-12 (add, Damien Prieur)
     
    Changed:
    <
    <
    TriggerMenuPython-00-01-19-02(update, Takanori Kono, in AtlasProduction)
    >
    >
    InDetPerformanceMonitoring-00-00-08(update,Tobias Golling, in Atlas Production)
     
    Changed:
    <
    <
    RecJobTransforms-00-06-22(update, Stathes Paganis, in AtlasProduction)
    >
    >
    InDetAlignmentMonitoring-00-02-07 (update, Tobis Golling, in Atlas Production)
     
    Changed:
    <
    <
    TrigMoore-00-01-54(add, Gabriella Catald, in AtlasProduction)

    EvgenJobOptions-00-00-46(add, osamu jinnouchi, in AtlasProduction)

    CaloTPCnv-00-00-30(add, Guillaume Unal, in AtlasProduction)

    MuGirlGlobalFit-00-00-06 (add, David Quarrie, Sofia, in AtlasProduction)

    PyJobTransformsCore-00-06-80 (update,Manuel Gallas, in AtlasProduction)

    TriggerTest-00-01-89 (update, David Strom, in AtlasProduction)

    TrigT1CaloSim-00-00-15 (add,Ignacio Aracena, in AtlasProduction)

    InDetTrigRecExample-00-06-90-04 (update, Jiri Masik, in AtlasProduction)

    TauDPDMaker-00-02-15 (add, David Cote, in AtlasProduction)

    TrkExRungeKuttaPropagator-01-01-33 (Emil Obreshkov, in AtlasProduction)

    iPatTrackFitter-01-02-09 (in AtlasProduction)

    RecExTrigTest-00-00-30 (removed from AtlasProduction)

    RecExRecoTest-00-00-59 (removed from AtlasProduction)

    MuidTrackBuilder-01-02-04 (in AtlasProduction)

    MuidExample-00-00-88 (in AtlasProduction)

    TrkiPatFitter-01-07-03 (in AtlasProduction)

    OutputStreamAthenaPool-00-01-29 (in Atlasproduction)

    >
    >
    PixelMonitoring -00-03-27 (add, Triplett Nathan, in Atlasproduction)

    TrkExRungeKuttaPropagator-01-01-33 (Emil Obreshkov, in AtlasProduction)

    OutputStreamAthenaPool-00-01-29 (in Atlasproduction)

     

    PENDING TAGS ATLASPRODUCTION 14.1.0.2: (16 May 2008, 11:09)

    None

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (15 May 2008 9:30)

    Changed:
    <
    <
    CaloTPCnv-00-00-30 | <none> /Calorimeter/CaloCnv/CaloTPCnv


    OutputStreamAthenaPool-00-01
    -29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool


    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions


    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms


    InDetTrigRecExample-00-06-90
    | InDetTrigRecExample -00-06-90 * -03 /InnerDetector/InDetExample/InDetTrigRecExample


    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils
    /TRT_TR_Process


    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4Sho
    werLibSvc


    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Val
    idation


    TauDPDMaker-00-02-15 | <none> /PhysicsAnalysis/TauID/TauDPDM
    aker


    MuGirlGlobalFit-00-00-06 | <none> /Reconstruction/MuonIdentifica
    tion/MuGirlGlobalFit


    MuidExample-00-00-88 | <none> /Reconstruction/MuonIdentifica
    tion/MuidExample


    MuidTrackBuilder-01-02-04 | <none> /Reconstruction/MuonIdentifica
    tion/MuidTrackBuilder


    RecExCommission-00-02-54 | RecExCommission
    -00-02-59 /Reconstruction/RecExample/RecExCommission


    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample
    /RecExRecoTest


    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample
    /RecExTrigTest


    RecJobTransforms-00-06-22 | RecJobTransforms-00-06-16 /Reconstruction/RecJobTransfor
    ms


    iPatTrackFitter-01-02-09 | <none> /Reconstruction/iPat/iPatTrack
    Fitter


    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApp
    s


    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel


    PyJobTransformsCore-00-06-80 | PyJobTransformsCore-00-06-79 /Tools/PyJobTransformsCore


    TrkExRungeKuttaPropagator-01
    -01-33 | <none> /Tracking/TrkExtrapolation/TrkExRungeKuttaPropagator


    TrkiPatFitter-01-07-03 | <none> /Tracking/TrkFitter/TrkiPatFit
    ter


    <none> | TrigHLTMonitoring
    *-00-00-09
    /Trigger/TrigMonitoring/TrigHLTMonitoring


    TrigT1CaloSim-00-00-15 | <none> /Trigger/TrigT1/TrigT1CaloSim


    TriggerTest-00-01-89 | TriggerTest-00-01-87 /Trigger/TrigValidation
    /TriggerTest


    TriggerMenuPython-00-01-19-02 | TriggerMenuPython-00-01-19-01 /Trigger/TriggerCommon/TriggerMenuPython

    >
    >
    OutputStreamAthenaPool-00-01-29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool
    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions
    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms
    PixelMonitoring-00-03-27 | <none> /InnerDetector/InDetMonitoring/PixelMonitoring
    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils/TRT_TR_Process
    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4ShowerLibSvc
    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Validation
    RecExCommission-00-02-59 | RecExCommission-00-02-60 /Reconstruction/RecExample/RecExCommission
    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample/RecExRecoTest
    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample/RecExTrigTest
    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApps
    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel
    TrkExRungeKuttaPropagator-01-01-33 | <none> /Tracking/TrkExtrapolation/TrkExRungeKuttaPropagator

     

    PROCEDURES TO BE FOLLOWED (14 May 2008, 14:51)

    Revision 52008-05-16 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Changed:
    <
    <

    May 16th 2008, Revision 5

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











    >
    >

    May 16th 2008, Revision 7

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











     

    Line: 22 to 22
     BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu
    • Failed TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_5 on 16 May 2008
    • Failed Tag-AllBasicSamples-pcache_14.1.0.Y.rel_5 on 16 May 2008
  • Long pcache (15 May, 15:48) All tests OK

  • Open Issues (15 May 2008, 20h26)

    Changed:
    <
    <

    ISSUE 080513-1021: Low level of RTT test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests did go down from 94% (rel_6) to 59% (rel_4).

    ISSUE 080513-1236: New geometry ATLAS-CSC-05-01-00 in FCT.

    >
    >

    ISSUE 080513-1021: Low level of ATN test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests did go down from 59% (rel_4) to 55% (rel_5).

    ISSUE 080513-1236: New geometry ATLAS-CSC-05-01-00 in FCT.

     
    • Requested to Seth to use the geometry in FCT on 15 May 2008

    Changed:
    <
    <
    • ACTION SETH 080515-1958: USE GEOMETRY ATLAS-CSC-05-01-00 in FCT 14.1.0.Y
    >
    >
    • Seth changed the geometry for AtlasProduction 14.1.0.Y to ATLAS-CSC-05-01-00 on 16 May 2008
     ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.

    Validation bugs 14.1.0.1 (16b May 2008)

    Changed:
    <
    <

    36636 (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    >
    >

    36649 (36615) – Atlas Inner Detector (Atlas Validation) - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    • Claimed to be un duplicate of 36649 fixed.
    • Moved to Atlas Inner Detector and assigned to Vadim by David Quarrie on 16 May 2008.

    36648 (36633) - Atlas Trigger (Atlas Validation) - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!

    • Submitted by Andreu Pacheco on 16 May 2008.
    • 13% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008
    • Moved to Atlas Trigger by David Quarrie and assigned to Diego Casadei on 16 May 2008.

    36636 FIXED (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

     
    • Bug submitted 14 May 2008
    Changed:
    <
    <
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36633 - Atlas Validation - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!

    • Submitted by Andreu Pacheco on 16 May 2008.
    • 100% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation

    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008. No person assigned.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008. No person assigned.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

      36615 – Atlas Validation - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Updated May 15th.

      • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

      36608 (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      36575 (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    TAGS PENDING IN ATLASPOINT1 (16 May 2008 11h08)

    >
    >
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.
    • Claimed to be a duplicate of 36608

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation
    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    • Assigned to Zachary Marshall by David Quarrie on 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      • Requested update by David Quarrie on 16 May 2008. No person assigned.

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      • David Quarrie contacted David Rousseau on 16 May 2008.

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    • 36608 FIXED (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).
      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      • Claimed to be fixed by 36418 on 16 May 2008.

      36575 CLOSED (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      • Clased bug as new task has a new bug report.

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    TAGS PENDING IN ATLASPOINT1 (16 May 2008 15h21)

     

    Changed:
    <
    <
    PixelMonitoring -00-03-27 (add, Triplett Nathan)
    >
    >
    PixelMonitoring -00-03-27 (add, Triplett Nathan, in Atlasproduction)
     
    Changed:
    <
    <
    TrigT1CaloCalibTools -00-00-34 (update,Prieur Damien)
    >
    >
    TrigT1CaloCalibTools -00-00-34 (update,Prieur Damien, in AtlasProduction)
     
    Changed:
    <
    <
    MDTcabling-00-02-46 (add,alessandro.cerri@cern.ch)
    >
    >
    MDTcabling-00-02-46 (add,alessandro.cerri@cern.ch, in AtlasProduction)
      TriggerMenuPython-00-01-19-02(update, Takanori Kono, in AtlasProduction)

    Revision 42008-05-16 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Changed:
    <
    <

    May 16th 2008, Revision 4

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











    >
    >

    May 16th 2008, Revision 5

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











     

    Line: 10 to 10
     



    Changed:
    <
    <

    Current situation: (16 May 2008 9h56)

    The 14.1.0.Y successful ATN tests did go up from 23% (rel_3) to 59% (rel_4).

    The Atlasproduction 14.1.0.2 cache contains all tags pending in AtlasPoint1 14.1.0.1.

    Next cache 14.1.0.2 expected Mon/Tue 19/20 May 2008

    >
    >

    Current situation: (16 May 2008 12h20)

    The 14.1.0.Y successful ATN tests did go up from 59% (rel_3) to 55% (rel_5).

    The Atlasproduction 14.1.0.2 cache contains all tags pending in AtlasPoint1 14.1.0.1.

    Next cache 14.1.0.2 expected Mon/Tue 19/20 May 2008

     

    Proposed strategy: (16 May 2008 9h56)

    Follow and create bugs in savannah for RTT, FCT and grid validation jobs.

    RTT Tests (rel_5, 16 May 2008 08:37)

    Changed:
    <
    <
    • EvgenJobTransforms errors:

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors:

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#36571: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • No SimuJobTransforms errors

    >
    >
    • EvgenJobTransforms errors:

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors:

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • No SimuJobTransforms errors

     

    FCT Tests AtlasProduction

    Changed:
    <
    <
    • Pileup pcache (16 May, 8:46) All tests OK.
    • Basic-pcache (15 May 2008, 11h36)

      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_4

      • BUG#36522: csc_buildTAG_trf TRF_UNKNOWN, 69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD"

        • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_4-

        • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_4

    • Long pcache (15 May, 15:48) All tests OK

    >
    >
    • Pileup pcache (16 May, 8:46) All tests OK.
    • Basic-pcache (16 May 2008, 11h47)

      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Failed Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_5 on 16 May 2008

      • BUG#35289: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • Failed TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_5 on 16 May 2008
        • Failed Tag-AllBasicSamples-pcache_14.1.0.Y.rel_5 on 16 May 2008
    • Long pcache (15 May, 15:48) All tests OK

     

    Open Issues (15 May 2008, 20h26)

    ISSUE 080513-1021: Low level of RTT test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests did go down from 94% (rel_6) to 59% (rel_4).

    ISSUE 080513-1236: New geometry ATLAS-CSC-05-01-00 in FCT.

    • Requested to Seth to use the geometry in FCT on 15 May 2008

    Line: 30 to 33
     
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    Changed:
    <
    <
    36633 - Atlas Validation - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 100% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation

    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008. No person assigned.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008. No person assigned.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

      36615 – Atlas Validation - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Updated May 15th.

      • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

      36608 (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      36575 (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    >
    >
    36633 - Atlas Validation - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!
    • Submitted by Andreu Pacheco on 16 May 2008.
    • 100% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation

    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008. No person assigned.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008. No person assigned.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

      36615 – Atlas Validation - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Updated May 15th.

      • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

      36608 (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      36575 (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

     

    TAGS PENDING IN ATLASPOINT1 (16 May 2008 11h08)

    Changed:
    <
    <

    >
    >

    PixelMonitoring -00-03-27 (add, Triplett Nathan)

    TrigT1CaloCalibTools -00-00-34 (update,Prieur Damien)

     
    Changed:
    <
    <
    MDTcabling-00-02-46 (add,alessandro.cerri@cern.ch)
    >
    >
    MDTcabling-00-02-46 (add,alessandro.cerri@cern.ch)
      TriggerMenuPython-00-01-19-02(update, Takanori Kono, in AtlasProduction)
    Line: 48 to 54
     

    PENDING TAGS ATLASPRODUCTION 14.1.0.2: (16 May 2008, 11:09)

    None

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (15 May 2008 9:30)

    Changed:
    <
    <
    CaloTPCnv-00-00-30 | <none> /Calorimeter/CaloCnv/CaloTPCnv


    OutputStreamAthenaPool-00-01
    -29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool


    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions


    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms


    InDetTrigRecExample-00-06-90
    | InDetTrigRecExample -00-06-90 * -03 /InnerDetector/InDetExample/InDetTrigRecExample


    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils
    /TRT_TR_Process


    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4Sho
    werLibSvc


    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Val
    idation


    TauDPDMaker-00-02-15 | <none> /PhysicsAnalysis/TauID/TauDPDM
    aker


    MuGirlGlobalFit-00-00-06 | <none> /Reconstruction/MuonIdentifica
    tion/MuGirlGlobalFit


    MuidExample-00-00-88 | <none> /Reconstruction/MuonIdentifica
    tion/MuidExample


    MuidTrackBuilder-01-02-04 | <none> /Reconstruction/MuonIdentifica
    tion/MuidTrackBuilder


    RecExCommission-00-02-54 | RecExCommission
    *-00-02-59 /Reconstruction/RecExample/RecExCommission


    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample
    /RecExRecoTest


    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample
    /RecExTrigTest


    RecJobTransforms-00-06-22 | RecJobTransforms-00-06-16 /Reconstruction/RecJobTransfor
    ms


    iPatTrackFitter-01-02-09 | <none> /Reconstruction/iPat/iPatTrack
    Fitter


    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApp
    s


    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel


    PyJobTransformsCore-00-06-80 | PyJobTransformsCore-00-06-79 /Tools/PyJobTransformsCore


    TrkExRungeKuttaPropagator-01
    -01-33 | <none> /Tracking/TrkExtrapolation/TrkExRungeKuttaPropagator


    TrkiPatFitter-01-07-03 | <none> /Tracking/TrkFitter/TrkiPatFit
    ter


    <none> | TrigHLTMonitoring -00-00-09 /Trigger/TrigMonitoring
    /TrigHLTMonitoring


    TrigT1CaloSim-00-00-15 | <none> /Trigger/TrigT1/TrigT1CaloSim


    TriggerTest-00-01-89 | TriggerTest-00-01-87 /Trigger/TrigValidation
    /TriggerTest


    >
    >
    CaloTPCnv-00-00-30 | <none> /Calorimeter/CaloCnv/CaloTPCnv


    OutputStreamAthenaPool-00-01
    -29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool


    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions


    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms


    InDetTrigRecExample-00-06-90
    | InDetTrigRecExample -00-06-90 * -03 /InnerDetector/InDetExample/InDetTrigRecExample


    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils
    /TRT_TR_Process


    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4Sho
    werLibSvc


    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Val
    idation


    TauDPDMaker-00-02-15 | <none> /PhysicsAnalysis/TauID/TauDPDM
    aker


    MuGirlGlobalFit-00-00-06 | <none> /Reconstruction/MuonIdentifica
    tion/MuGirlGlobalFit


    MuidExample-00-00-88 | <none> /Reconstruction/MuonIdentifica
    tion/MuidExample


    MuidTrackBuilder-01-02-04 | <none> /Reconstruction/MuonIdentifica
    tion/MuidTrackBuilder


    RecExCommission-00-02-54 | RecExCommission
    -00-02-59 /Reconstruction/RecExample/RecExCommission


    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample
    /RecExRecoTest


    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample
    /RecExTrigTest


    RecJobTransforms-00-06-22 | RecJobTransforms-00-06-16 /Reconstruction/RecJobTransfor
    ms


    iPatTrackFitter-01-02-09 | <none> /Reconstruction/iPat/iPatTrack
    Fitter


    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApp
    s


    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel


    PyJobTransformsCore-00-06-80 | PyJobTransformsCore-00-06-79 /Tools/PyJobTransformsCore


    TrkExRungeKuttaPropagator-01
    -01-33 | <none> /Tracking/TrkExtrapolation/TrkExRungeKuttaPropagator


    TrkiPatFitter-01-07-03 | <none> /Tracking/TrkFitter/TrkiPatFit
    ter


    <none> | TrigHLTMonitoring
    *-00-00-09
    /Trigger/TrigMonitoring/TrigHLTMonitoring


    TrigT1CaloSim-00-00-15 | <none> /Trigger/TrigT1/TrigT1CaloSim


    TriggerTest-00-01-89 | TriggerTest-00-01-87 /Trigger/TrigValidation
    /TriggerTest


      TriggerMenuPython-00-01-19-02 | TriggerMenuPython-00-01-19-01 /Trigger/TriggerCommon/TriggerMenuPython

    Revision 32008-05-16 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    <!-- /ActionTrackerPlugin -->
    Changed:
    <
    <

    May 16th 2008, Revision 2

    Brief Status of Atlas Production Cache 14.1.0.2

    Andreu Pacheco-Pages / IFAE-CERN











    >
    >

    May 16th 2008, Revision 4

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

    Andreu Pacheco-Pages / IFAE-CERN











     

    Changed:
    <
    <

    Brief Status of Atlas Production Cache 14.1.0.2

    >
    >

    Brief Status of Atlas Production Cache 14.1.0.Y (14.1.0.2 candidate)

     



    Changed:
    <
    <

    Current situation: (Wed May 14th, 2008 17h50)

    >
    >

    Current situation: (16 May 2008 9h56)

     

    The 14.1.0.Y successful ATN tests did go up from 23% (rel_3) to 59% (rel_4).

    The Atlasproduction 14.1.0.2 cache contains all tags pending in AtlasPoint1 14.1.0.1.

    Next cache 14.1.0.2 expected Mon/Tue 19/20 May 2008

    Changed:
    <
    <

    Proposed strategy: (Sun 14 May 2008 19h02)

    >
    >

    Proposed strategy: (16 May 2008 9h56)

     

    Follow and create bugs in savannah for RTT, FCT and grid validation jobs.

    RTT Tests (rel_5, 16 May 2008 08:37)

    Changed:
    <
    <
    • EvgenJobTransforms errors (1/5):

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors (16/17):

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#36571: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • No SimuJobTransforms errors

    >
    >
    • EvgenJobTransforms errors:

      • BUG#36569:TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors:

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: TRF_UNKNOWN, producer=csc_recoFastCaloSim,who=T_AthenaPoolCustomCnv,message=Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#36571: TRF_SVRINIT,producer=csc_buildTAG,who=ServiceManager, message=ERROR Unable to initialize service "DSConfigSvc" and TRF_UNKNOWN,producer=csc_buildTAG,who=DetectorStore, who=DetectorStore, message=could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu

        • RecoTagTransf_130003

    • No SimuJobTransforms errors

     

    FCT Tests AtlasProduction

    • Pileup pcache (16 May, 8:46) All tests OK.
    • Basic-pcache (15 May 2008, 11h36)

      • BUG#36629: csc_recoESD_trf TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

        • Rec_on_12.0.6.3-DC3.006640.CharybdisJimmy-pcache_14.1.0.Y.rel_4

      • BUG#36522: csc_buildTAG_trf TRF_UNKNOWN, 69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD"

        • TagMerge-AllRecReleases-pcache_14.1.0.Y.rel_4-

        • Tag-AllBasicSamples-pcache_14.1.0.Y.rel_4

    • Long pcache (15 May, 15:48) All tests OK

    Open Issues (15 May 2008, 20h26)

    Changed:
    <
    <

    ISSUE 080613-1021: Low level of RTT test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests did go down from 94% (rel_6) to 59% (rel_4).

    ISSUE 080613-1236: New geometry ATLAS-CSC-05-01-00 in FCT. Postponed until new tag and the approval from David Quarrie. Needs new tag for cable mapping. OK for David.
    ACTION SETH 080515-1958: USE GEOMETRY ATLAS-CSC-05-01-00 in FCT 14.1.0.Y

    Validation bugs 14.1.0.1 (Friday May 16th)

    >
    >

    ISSUE 080513-1021: Low level of RTT test success % after passing all tags from AtlasPoint1. The rate of successful RTT tests did go down from 94% (rel_6) to 59% (rel_4).

    ISSUE 080513-1236: New geometry ATLAS-CSC-05-01-00 in FCT.

    • Requested to Seth to use the geometry in FCT on 15 May 2008

    • ACTION SETH 080515-1958: USE GEOMETRY ATLAS-CSC-05-01-00 in FCT 14.1.0.Y
    ISSUE 080516-1025: Many bugs related with memory allocation problems. There is a high incidence of bugs opened due to crashes after failing to allocate memory. Batch systems usually limit the virtual memory of jobs exceeding 2.2-2.4 GB. This causes malloc() to fail when asking for more memory and then Athena crashes trying to use the returned pointer which is of course invalid. This will always happen, so it would be worth to handle the malloc failures in a way that the error can be easily identified to improve bug reporting.

    Validation bugs 14.1.0.1 (16b May 2008)

     

    36636 (36523,36411) – Atlas Trigger (Atlas Muon Spectrometer,Atlas Validation) TRF_SEGFAULT, producer=csc_recoESD, message=FATAL 2008-May-08 19:18:56 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    • Bug submitted 14 May 2008
    Changed:
    <
    <
    • Bug moved to Atlas Trigger 16th May. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN (69999). POOL commit failed 0x9a769b0. commitOutput FAILED to commit OutputStream. commitOutput failed.

    • Bog opened 14 May 2008 by Andreu Pacheco
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36633 - Atlas Validation - csc_recoESD failure TRF_UNKNOWN,69999, Unknown Transform error, who=EFMissingET_Fex, message=Failed to attach feature!

    • Created May 16th.
    • 100% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

    36631 (36618) – Atlas Simulation (Atlas Validation) - csc_atlasG4 failure. TRF_SEGVIO | * Break * segmentation violation.

    • Updated May 15th. Moved to Atlas Simulation.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task - TRF_SEGFAULT, 60010, segmentation fault. FATAL 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)

    • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

    36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim. TRF_UNKNOWN (69999), severity=FATAL who=T_AthenaPoolCustomCnv

    message=Failed to convert object to persistent type: ElementLink, cannot set


    message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored


    message=Could not create Rep for DataObject (clid/key):1334834594


    message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE"

    • 2 RTT job failures. Moved to Atlas Reconstruction May 15th.

    36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

    • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

    36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

    • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

    36615 – Atlas Validation - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Updated May 15th.

    • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

    36608 (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

    • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

    36575 (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

    • Last updated May 15th. Andreu submitted identical task.

    • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

    36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

    • Updated May 15th. Assigned to Andrea di Simone.

    • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

    TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

    • Assigned to Simon George. Last update May 14th.

    • Probable high memory consumption related issue.

    • DQ recomments to ignore because duplicate of 35289 (?)

    36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

    36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

    Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

    • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

    36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

    Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

    • Last update May 13th. Assigned to Sofia Valldecorsa.

    • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

    36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

    TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

    • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

    • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

    36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

    • Last updated May 15th. Assigned to John Apostolakis

    34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

    • Last update May 14th. Assigned to Edward Moyse

    35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

    • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    TAGS PENDING IN ATLASPOINT1 (15 May 2008 19h52)

    TriggerMenuPython-00-01-19-02(update, Takanori Kono, in AtlasProduction)

    RecJobTransforms-00-06-22(update, Stathes Paganis, in AtlasProduction)

    TrigMoore-00-01-54(add, Gabriella Catald, in AtlasProduction)

    EvgenJobOptions-00-00-46(add, osamu jinnouchi, in AtlasProduction)

    CaloTPCnv-00-00-30(add, Guillaume Unal, in AtlasProduction)

    MuGirlGlobalFit-00-00-06 (add, David Quarrie, Sofia, in AtlasProduction)

    PyJobTransformsCore-00-06-80 (update,Manuel Gallas, in AtlasProduction)

    TriggerTest-00-01-89 (update, David Strom, in AtlasProduction)

    TrigT1CaloSim-00-00-15 (add,Ignacio Aracena, in AtlasProduction)

    InDetTrigRecExample-00-06-90-04 (update, Jiri Masik, in AtlasProduction)

    TauDPDMaker-00-02-15 (add, David Cote, in AtlasProduction)

    TrkExRungeKuttaPropagator-01-01-33 (Emil Obreshkov, in AtlasProduction)

    iPatTrackFitter-01-02-09 (in AtlasProduction)

    RecExTrigTest-00-00-30 (removed from AtlasProduction)

    RecExRecoTest-00-00-59 (removed from AtlasProduction)

    MuidTrackBuilder-01-02-04 (in AtlasProduction)

    MuidExample-00-00-88 (in AtlasProduction)

    TrkiPatFitter-01-07-03 (in AtlasProduction)

    OutputStreamAthenaPool-00-01-29 (in Atlasproduction)

    PENDING TAGS ATLASPRODUCTION 14.1.0.2: (15 May 2008, 19h36)

    >
    >
    • Bug moved to Atlas Trigger in 16 May 2008. No person assigned.
    • Bug impact: 3% failures Task 22129 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r421. Updated 16 May 2008.

    36635 (36568,36567) – Atlas Simulation (Atlas Generators,Atlas Validation) – genAtlfast task – RTT Atlfast.5870.ttH_poslepnu_jj_bb failure. TRF_UNKNOWN, producer=csc_genAtlfast, message=POOL commit failed 0x9a769b0.

    • Bug opened by Andreu Pacheco on 14 May 2008.
    • Moved to Atlas Simulation and assigned to Simon Dean by DQ on May 16th, 2008

    36633 - Atlas Validation - csc_recoESD failure TRF_UNKNOWN,who=EFMissingET_Fex, message=Failed to attach feature!

    • Submitted by Andreu Pacheco on 16 May 2008.
    • 100% failures Task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421. Updated 16 May 2008

    36631 (36618) – Atlas Simulation (Atlas Validation) -TRF_SEGVIO, producer=csc_atlasG4, message=*** Break * segmentation violation

    • Bug submitted by Alessandra Doria on 15 May 2008.
    • Bug moved to Atlas Simulation on 15 May 2008. No person assigned.

    • 100% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429. Updated 16 May 2008.

    36629 (36628) Atlas Reconstruction (Atlas Validation) – FCT csc_recoESD task -

    • TRF_SEGFAULT, 2008-May-14 23:49:14 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference)
      • Bug opened and moved to Reconstruction by Andreu Pacheco 15 May 2008. No person assigned.
      • 1 FCT failure. Moved to Atlas Reconstruction May 15th. Maybe a duplicate of 36608

      36627 (36626) Atlas Reconstruction (Atlas Validation) csc_recoFastCaloSim RTT Transform failures due to RecJobTransforms recoFastCaloSim_NoTrig and recoFastCaloSim.

    • TRF_UNKNOWN,who=T_AthenaPoolCustomCnv, message=Failed to convert object to persistent type: ElementLink, cannot set index, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=Py:Configurable, message=attempt to add a duplicate (AtlfastAodBuilder) ... dupe ignored, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ForwardIndexingPolicy, message=reverseLookup: element not found, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaPoolConverter, message=CreateRep failed, key = TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=ToolSvc.StreamAODTool, message=Could not create Rep for DataObject (clid/key):1334834594 TrackParticleCandidate, producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=StreamAOD, message=streamObjects failed., producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AlgErrorAuditor, message=Illegal Return Code: Algorithm StreamAOD reported an ERROR, but returned a StatusCode "FAILURE", producer=csc_recoFastCaloSim
    • TRF_UNKNOWN, who=AthenaEventLoopMgr, message=Terminating event processing loop due to errors, producer=csc_recoFastCaloSim
      • 2 RTT job failures: RecJobTransforms and recoFastCaloSim_NoTrig

      • Moved to Atlas Reconstruction May 15 May 2008. No person assigned

      36625 (36624) Atlas Reconstruction (Atlas Validation) RTT RecJobTransform failures due to cpu limit exceeded.

      • 3 RTT job failures. Moved to Atlas Reconstruction May 15th. Manuel thinks that the solution is to reduce the number of events.

      36620 - Athena - csc_reco_trf.py failure due to Perfmon hitting memory ceiling at finalize. A number of production job with 14.1.0.1 are failing at finalize, when Perfmon is trying to call lshosts ! Updated May 15th. Assigned to Sebastien Binet <binet>.

      • TRF_SEGFAULT, message=FATAL 2008-May-14 14:12:06 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference),
        producer=csc_recoESD
      • 5% failure. Task 22259 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s429_r426

      36615 – Atlas Validation - csc_BSreco_trf.py failure with TRF_SEGFAULT. TRF_SEGFAULT | FATAL 2008-May-15 10:21:05 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Updated May 15th.

      • 100% failures Task 22307 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s413_b30_r428

      36608 (36576,36430) – Atlas Trigger (Atlas Reconstruction,Atlas Validation) – csc_recoESD task - TRF_SEGFAULT (60010) segmentation fault. [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Moved to Atlas Trigger May 15th. Assigned to Jiri Masik. Tag to submit.

      • 5% failures Task 21993 valid1.005200.T1_McAtNlo_Jimmy.recon.e322_s412_r413

      36575 (36462) – Atlas Reconstruction (Atlas Validation) – csc_recoESD task - "TRF_SVRINIT | ERROR Unable to initialize service "DetectorStore" | DetectorStore service not found! | ERROR Unable to initialize Service: GeoModelSvc | finalize: Invalid state "Configured".

      • Last updated May 15th. Andreu submitted identical task.

      • Resubmitted task 22311 valid2.018101.PythiaB_Bd_Jpsie3e3K0s.recon.e315_s412_r421

      36574 (36480) – Atlas Simulation (Atlas Validation) – Simulation task - TRF_SEGVIO | * Break * segmentation violation. IOVSvc. WARNING setRange(CLID,key,range) for unregistered proxies is deprecated - you need to specify a store! This will be an ERROR soon! SystemError: problem in C++; program state has been reset

      • Updated May 15th. Assigned to Andrea di Simone.

      • 6% failures Task 22204 valid1.018101.PythiaB_Bd_Jpsie3e3K0s.digit.e337_s429

      TO IGNORE - 36524 (36460) – Atlas Trigger (Atlas Validation) – csc_recoESD task - 75% failure rate - TRF_SEGFAULT (60010) segmentation fault. tatic void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference).

      • Assigned to Simon George. Last update May 14th.

      • Probable high memory consumption related issue.

      • DQ recomments to ignore because duplicate of 35289 (?)

      36522 FIXED (36520) –Atlas Physics Analysis (Atlas Validation)– csc_buildTAG task - TRF_UNKNOWN,69999, Unknown Transform error (2 times) unable to locate configurable for type "StreamAOD".

      36388 FIXED (36387) – Atlas Inner Detector (Validation) – csc_recoESD task- csc_recoESD

      Error details: TRF_UNKNOWN | Track parameters are not charged tracks ... fit aborted | Illegal Return Code: Algorithm InDetV0Finder reported an ERROR, but returned a StatusCode "SUCCESS".To be followed with Zui.

      • Updated 15 May. Assigned to Eva Bouhova. Fixed with tag TrkV0Fitter-00-03-10

      36384 FIXED (36367) – Atlas Reconstruction (Validation) – Reco failure in MuGirl

      Error details: TRF_SEGFAULT | FATAL 2008-May-07 09:43:21 [static void ers::ErrorHandler::SignalHandler::action(...) at ers/src/ErrorHandler.cxx:88] Got signal 11 Segmentation fault (invalid memory reference). To be followed with Marge Shapiro.

      • Last update May 13th. Assigned to Sofia Valldecorsa.

      • May 15th: Fixed with tag MuGirlGlobalFit-00-00-06

      36577 FIXED (36381) – Atlas Reconstruction (Atlas Validation) csc_recoESD task. Reco failure Could not create Rep for DataObject

      TRF_UNKNOWN | Could not create Rep for DataObject (clid/key):210948284 LumiBlocks | Could not create Rep for DataObject (clid/key):1316383046 /GLOBAL/DETSTATUS/LBSUMM | streamObjects failed.

      • Moved to Atlas Reconstruction on May 14th. Ian asked Marge to look at it on 8 May but no update was seen.

      • May 15th. Claire reports that may be fixed by MuGirlGlobalFit-00-00-06

      36284 – Atlas Simulation – Bug opened by Manuel Gallas based on FCT failures - Geant4 got stuck in event in the 14.1.0.1 cache. Now filtered. Error details: ATH_G4_STUCK (15010) Geant4 got stuck in event. This error disappears if I change radius of fiber from 0.5557 to 0.556mm.

      • Last updated May 15th. Assigned to John Apostolakis

      34830 (36204) – Atlas Muon Spectrometer (Atlas Reconstruction) – Bug opened by D.Rousseau- if running ID+Muon MuidExtrCombinedMuonContainer cannot be written. To be followed with Stephane Willocq.

      • Last update May 14th. Assigned to Edward Moyse

      35289 (36143) – Atlas Trigger TAPM (Atlas Physics Analysis) - 14.1.0 AOD->TAG trigger config is failing. GlobalTriggerTagBuilder errors using csc_buildTAG_trf.py in R14.0.0. , trigger config fail, thus trigger info is not filled in tag. To be followed with Ignacio Aracena.

      • Assigned to Joerg Stelzer. Last update 13 May. Priority was given to another bug #35211.

    TAGS PENDING IN ATLASPOINT1 (16 May 2008 11h08)

    MDTcabling-00-02-46 (add,alessandro.cerri@cern.ch)

    TriggerMenuPython-00-01-19-02(update, Takanori Kono, in AtlasProduction)

    RecJobTransforms-00-06-22(update, Stathes Paganis, in AtlasProduction)

    TrigMoore-00-01-54(add, Gabriella Catald, in AtlasProduction)

    EvgenJobOptions-00-00-46(add, osamu jinnouchi, in AtlasProduction)

    CaloTPCnv-00-00-30(add, Guillaume Unal, in AtlasProduction)

    MuGirlGlobalFit-00-00-06 (add, David Quarrie, Sofia, in AtlasProduction)

    PyJobTransformsCore-00-06-80 (update,Manuel Gallas, in AtlasProduction)

    TriggerTest-00-01-89 (update, David Strom, in AtlasProduction)

    TrigT1CaloSim-00-00-15 (add,Ignacio Aracena, in AtlasProduction)

    InDetTrigRecExample-00-06-90-04 (update, Jiri Masik, in AtlasProduction)

    TauDPDMaker-00-02-15 (add, David Cote, in AtlasProduction)

    TrkExRungeKuttaPropagator-01-01-33 (Emil Obreshkov, in AtlasProduction)

    iPatTrackFitter-01-02-09 (in AtlasProduction)

    RecExTrigTest-00-00-30 (removed from AtlasProduction)

    RecExRecoTest-00-00-59 (removed from AtlasProduction)

    MuidTrackBuilder-01-02-04 (in AtlasProduction)

    MuidExample-00-00-88 (in AtlasProduction)

    TrkiPatFitter-01-07-03 (in AtlasProduction)

    OutputStreamAthenaPool-00-01-29 (in Atlasproduction)

    PENDING TAGS ATLASPRODUCTION 14.1.0.2: (16 May 2008, 11:09)

     

    None

    ATLASPRODUCTION 14.1.0.2 versus ATLASPOINT1 - 14.1.0.1 (15 May 2008 9:30)

    Changed:
    <
    <
    CaloTPCnv-00-00-30 | <none> /Calorimeter/CaloCnv/CaloTPCnv


    OutputStreamAthenaPool-00-01
    -29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool


    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions


    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms


    InDetTrigRecExample-00-06-90
    -04 | InDetTrigRecExample-00-06-90-03 /InnerDetector/InDetExample/InDetTrigRecExample


    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils
    /TRT_TR_Process


    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4Sho
    werLibSvc


    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Val
    idation


    TauDPDMaker-00-02-15 | <none> /PhysicsAnalysis/TauID/TauDPDM
    aker


    MuGirlGlobalFit-00-00-06 | <none> /Reconstruction/MuonIdentifica
    tion/MuGirlGlobalFit


    MuidExample-00-00-88 | <none> /Reconstruction/MuonIdentifica
    tion/MuidExample


    MuidTrackBuilder-01-02-04 | <none> /Reconstruction/MuonIdentifica
    tion/MuidTrackBuilder


    RecExCommission-00-02-54 | RecExCommission-00-02-59 /Reconstruction/RecExample
    /RecExCommission


    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample
    /RecExRecoTest


    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample
    /RecExTrigTest


    RecJobTransforms-00-06-22 | RecJobTransforms-00-06-16 /Reconstruction/RecJobTransfor
    ms


    iPatTrackFitter-01-02-09 | <none> /Reconstruction/iPat/iPatTrack
    Fitter


    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApp
    s


    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel


    PyJobTransformsCore-00-06-80 | PyJobTransformsCore-00-06-79 /Tools/PyJobTransformsCore


    TrkExRungeKuttaPropagator-01
    -01-33 | <none> /Tracking/TrkExtrapolation/TrkExRungeKuttaPropagator


    TrkiPatFitter-01-07-03 | <none> /Tracking/TrkFitter/TrkiPatFit
    ter


    <none> | TrigHLTMonitoring-00-00-09 /Trigger/TrigMonitoring
    /TrigHLTMonitoring


    TrigT1CaloSim-00-00-15 | <none> /Trigger/TrigT1/TrigT1CaloSim


    TriggerTest-00-01-89 | TriggerTest-00-01-87 /Trigger/TrigValidation
    /TriggerTest


    >
    >
    CaloTPCnv-00-00-30 | <none> /Calorimeter/CaloCnv/CaloTPCnv


    OutputStreamAthenaPool-00-01
    -29 | <none> /Database/AthenaPOOL/OutputStreamAthenaPool


    EvgenJobOptions-00-00-46 | <none> /Generators/EvgenJobOptions


    EvgenJobTransforms-00-06-04 | <none> /Generators/EvgenJobTransforms


    InDetTrigRecExample-00-06-90
    | InDetTrigRecExample -00-06-90 * -03 /InnerDetector/InDetExample/InDetTrigRecExample


    TRT_TR_Process-00-00-27 | <none> /InnerDetector/InDetSimUtils
    /TRT_TR_Process


    LArG4ShowerLibSvc-00-02-03 | <none> /LArCalorimeter/LArG4/LArG4Sho
    werLibSvc


    LArG4Validation-00-00-50 | <none> /LArCalorimeter/LArG4/LArG4Val
    idation


    TauDPDMaker-00-02-15 | <none> /PhysicsAnalysis/TauID/TauDPDM
    aker


    MuGirlGlobalFit-00-00-06 | <none> /Reconstruction/MuonIdentifica
    tion/MuGirlGlobalFit


    MuidExample-00-00-88 | <none> /Reconstruction/MuonIdentifica
    tion/MuidExample


    MuidTrackBuilder-01-02-04 | <none> /Reconstruction/MuonIdentifica
    tion/MuidTrackBuilder


    RecExCommission-00-02-54 | RecExCommission
    *-00-02-59 /Reconstruction/RecExample/RecExCommission


    <none> | RecExRecoTest-00-00-59 /Reconstruction/RecExample
    /RecExRecoTest


    <none> | RecExTrigTest-00-00-30 /Reconstruction/RecExample
    /RecExTrigTest


    RecJobTransforms-00-06-22 | RecJobTransforms-00-06-16 /Reconstruction/RecJobTransfor
    ms


    iPatTrackFitter-01-02-09 | <none> /Reconstruction/iPat/iPatTrack
    Fitter


    G4AtlasApps-00-02-62-02 | <none> /Simulation/G4Atlas/G4AtlasApp
    s


    TileGeoModel-00-01-28-01 | <none> /TileCalorimeter/TileGeoModel


    PyJobTransformsCore-00-06-80 | PyJobTransformsCore-00-06-79 /Tools/PyJobTransformsCore


    TrkExRungeKuttaPropagator-01
    -01-33 | <none> /Tracking/TrkExtrapolation/TrkExRungeKuttaPropagator


    TrkiPatFitter-01-07-03 | <none> /Tracking/TrkFitter/TrkiPatFit
    ter


    <none> | TrigHLTMonitoring -00-00-09 /Trigger/TrigMonitoring
    /TrigHLTMonitoring


    TrigT1CaloSim-00-00-15 | <none> /Trigger/TrigT1/TrigT1CaloSim


    TriggerTest-00-01-89 | TriggerTest-00-01-87 /Trigger/TrigValidation
    /TriggerTest


      TriggerMenuPython-00-01-19-02 | TriggerMenuPython-00-01-19-01 /Trigger/TriggerCommon/TriggerMenuPython

    Revision 22008-05-16 - unknown

    Line: 1 to 1
     
    META TOPICPARENT name="AndresPacheco"
    Changed:
    <
    <

    May 16th 2008, Revision 1



    Brief Status of Atlas Production Cache 14.1.0.2



    Andreu Pacheco-Pages / IFAE-CERN







    >
    >
    <!-- /ActionTrackerPlugin -->

    May 16th 2008, Revision 2

    Brief Status of Atlas Production Cache 14.1.0.2

    Andreu Pacheco-Pages / IFAE-CERN











     

    Brief Status of Atlas Production Cache 14.1.0.2

    Changed:
    <
    <






    Current situation: (Wed May 14th, 2008 17h50)

    The 14.1.0.2 of successful ATN tests did go up from 23% (rel_3) to 59% (rel_4).

    The Atlasproduction 14.1.0.2 cache contains all tags pending in AtlasPoint1 14.1.0.1.

    Proposed strategy: (Sun 14 May 2008 19h02)

    Follow and create bugs in savannah for RTT, FCT and grid validation jobs.

    RTT Tests (rel_4, 15 May 2008 10:21)

    • EvgenJobTransforms errors (1/5):

      • BUG#36569: csc_genAtlfast : TRF_UNKNOWN (69999). POOL commit failed 0x9a769b0. commitOutput FAILED to commit OutputStream. commitOutput failed.

        • Atlfast.5870.ttH_poslepnu_jj_bb

    • RecJobTransforms errors (16/17):

      • BUG#36625: CPU time limit exceeded, and core dumped.

        • FDR1toESDandAOD

        • RecoTransf_130030

        • RecoTransf_HighStat

      • BUG#36627: csc_recoFastCaloSim: TRF_UNKNOWN,69999, Unknown Transform error. Failed to convert object to persistent type: ElementLink, cannot set index

        • recoFastCaloSim_NoTrig

        • recoFastCaloSim

      • BUG#36571:csc_buildTAG : TRF_SVRINIT (61200) ServiceManager Unable to initialize Service. ERROR Unable to initialize service "DSConfigSvc". could not bind handle to CondAttrListCollection to key: /TRIGGER/HLT/Menu.

        • RecoTagTransf_130003

          <