Igprof detained results
RECO
this is the results (in igprof ticks, so real time and therefore affected by clock variations) for the very same data sample and code above.
Note that VirtualJetProducer is dominated by cod from
fastjet which has not been recompiled.
total refers just to the "producers"
producer |
SB |
HW |
|
sse |
sse |
avx |
avx2 |
avx2 prefetch |
total |
275918 |
214294 |
220113 |
218826 |
218369 |
cms::CkfTrackCandidateMakerBase |
106244 |
76419 |
80907 |
78750 |
77708 |
VirtualJetProducer (fastjet: sse) |
23799 |
21075 |
20673 |
20704 |
20673 |
SeedGeneratorFromRegionHitsEDProducer |
20619 |
15348 |
16138 |
16749 |
16761 |
TrackProducer |
19607 |
14511 |
14891 |
14091 |
14373 |
ConversionTrackCandidateProducer |
15816 |
12346 |
13198 |
13082 |
12827 |
MuonIdProducer |
14797 |
11848 |
12879 |
12929 |
12892 |
GsfTrackProducer |
8876 |
7294 |
5396 |
5308 |
5325 |
PrimaryVertexProducer |
4157 |
3267 |
3233 |
2922 |
2940 |
ConversionProducer |
4200 |
3396 |
2984 |
3990 |
3897 |
PFECALSuperClusterProducer |
3291 |
2730 |
2732 |
2706 |
2732 |
RecoTauProducer |
3072 |
2417 |
2703 |
2702 |
2715 |
EcalUncalibRecHitProducer |
2690 |
2845 |
2634 |
2908 |
2881 |
PFClusterProducer |
2722 |
2656 |
2579 |
2387 |
2401 |
PFBlockProducer |
3098 |
2514 |
2561 |
2540 |
2543 |
GoodSeedProducer |
2657 |
2162 |
2175 |
2142 |
2161 |
reco::modules::MultiTrackSelector |
2058 |
1868 |
1909 |
1866 |
1897 |
TauDiscriminationProducerBase<reco::PFTau, … |
1865 |
1577 |
1617 |
1588 |
1597 |
PFDisplacedVertexCandidateProducer |
1792 |
1532 |
1546 |
1726 |
1731 |
TrackExtrapolator |
1708 |
1451 |
1432 |
1601 |
1599 |
PFDisplacedVertexProducer |
1759 |
1301 |
1422 |
1291 |
1300 |
ElectronSeedProducer |
1712 |
1253 |
1394 |
1347 |
1542 |
PFElecTkProducer |
1452 |
1294 |
1371 |
1354 |
1341 |
cms::TrackListMerger |
1351 |
1244 |
1259 |
1249 |
1251 |
CosmicMuonProducer |
1499 |
1156 |
1210 |
1198 |
1198 |
ClusterSummaryProducer |
1445 |
1130 |
1146 |
1149 |
1424 |
PhotonConversionTrajectorySeedProducerFromSingleLeg |
1265 |
1088 |
1121 |
1131 |
1151 |
reco::modules::DuplicateTrackMerger |
1231 |
972 |
1041 |
1009 |
1028 |
PixelTrackProducer |
1121 |
925 |
963 |
1058 |
1057 |
PFRecHitProducer |
967 |
842 |
867 |
904 |
883 |
SiStripRecHitConverter |
896 |
805 |
826 |
896 |
876 |
SiStripClusterizer |
941 |
656 |
657 |
731 |
720 |
CaloTowersCreator |
786 |
660 |
647 |
721 |
709 |
TopProjector<reco::PFCandidate,… |
770 |
651 |
591 |
638 |
633 |
SETMuonSeedProducer |
627 |
506 |
532 |
554 |
605 |
JetTracksAssociatorAtVertex |
534 |
540 |
529 |
500 |
513 |
PFProducer |
575 |
450 |
479 |
479 |
474 |
PFPileUp |
550 |
402 |
461 |
407 |
420 |
SecondaryVertexProducer |
444 |
338 |
399 |
364 |
358 |
JetPlusTrackProducer |
425 |
362 |
396 |
395 |
388 |
cms::SiPixelClusterProducer |
474 |
399 |
393 |
427 |
410 |
HLT
Here is results for the same code for high pileup events in HLT.
total refers just to the "producers"
We did not run
perf as the job is dominated by I/O and root machinery. Events are read in before hot code starts, so no I/O is involved in the results shown below
producer |
SB |
HW |
|
sse |
sse |
avx |
avx2 |
avx2 prefetch |
total |
28507 |
22628 |
23361 |
23798 |
23840 |
cms::CkfTrackCandidateMakerBase |
8484 |
6052 |
6403 |
6496 |
6397 |
VirtualJetProducer |
2143 |
1786 |
1826 |
1793 |
1796 |
PixelTrackProducer |
1717 |
1373 |
1433 |
1585 |
1598 |
CaloTowersCreator |
1354 |
1131 |
1126 |
1228 |
1260 |
PFClusterProducer |
1139 |
1067 |
1087 |
1005 |
1005 |
SeedGeneratorFromRegionHitsEDProducer |
1018 |
803 |
854 |
899 |
892 |
PFBlockProducer |
1044 |
841 |
851 |
838 |
841 |
EcalRawToRecHitProducer |
880 |
795 |
832 |
843 |
820 |
TriggerSummaryProducerAOD |
921 |
703 |
754 |
714 |
749 |
cms::SiPixelClusterProducer |
788 |
685 |
684 |
769 |
766 |
ElectronSeedProducer |
949 |
673 |
682 |
695 |
726 |
TrackProducer |
868 |
589 |
588 |
604 |
613 |
cms::SiPixelRecHitConverter |
600 |
516 |
519 |
605 |
598 |
SiPixelRawToDigi |
644 |
563 |
514 |
521 |
502 |
TriggerSummaryProducerRAW |
567 |
497 |
483 |
488 |
492 |
HcalHitReconstructor |
446 |
388 |
398 |
402 |
391 |
PrimaryVertexProducer |
442 |
396 |
394 |
384 |
361 |
CSCRecHitDProducer |
418 |
344 |
363 |
409 |
413 |
PFRecHitProducer |
311 |
262 |
267 |
278 |
276 |
CSCDCCUnpacker |
258 |
221 |
215 |
231 |
220 |
L2MuonProducer |
251 |
197 |
206 |
171 |
180 |
L1GlobalTrigger |
225 |
206 |
176 |
200 |
217 |
HcalRawToDigi |
165 |
165 |
168 |
158 |
172 |
cms::METProducer |
161 |
170 |
156 |
153 |
154 |
TSGFromL2Muon |
153 |
115 |
137 |
119 |
131 |
L3MuonProducer |
158 |
128 |
132 |
128 |
118 |
PreshowerClusterProducer |
167 |
123 |
121 |
128 |
134 |
cms::SimpleTrackListMerger |
118 |
107 |
109 |
111 |
123 |
MuonIdProducer |
120 |
102 |
105 |
111 |
110 |
cms::JetCorrectionProducer<reco::CaloJet> |
103 |
65 |
95 |
92 |
96 |
MeasurementTrackerSiStripRefGetterProducer |
81 |
79 |
94 |
60 |
92 |
HLTCaloJetIDProducer |
124 |
87 |
93 |
113 |
111 |
DTRecSegment4DProducer |
108 |
81 |
86 |
93 |
95 |
PFProducer |
89 |
75 |
82 |
75 |
74 |
CSCSegmentProducer |
74 |
67 |
80 |
70 |
59 |
EgammaHLTEcalRecIsolationProducer |
48 |
41 |
69 |
52 |
51 |
CaloTowerCreatorForTauHLT |
69 |
62 |
69 |
65 |
71 |
Multi5x5ClusterProducer |
68 |
69 |
64 |
77 |
80 |
DTUnpackingModule |
62 |
64 |
58 |
53 |
47 |
DTRecHitProducer |
47 |
46 |
53 |
45 |
45 |