DeepCore: Convolutional Neural Network for high pT jet tracking

Introduction

High pT jet problem

The tracking inside jet core becomes inefficient in high transverse momentum jets, because the collimated environment produces merged cluster from different tracks on the pixel detector. The splitting of the merged cluster is inefficient and it degrades the quality of seeding of the CMS tracking algorithm, based on a combinatorial Kalman Filter (cKF).

CNN High pT jet tracking: DeepCore
  • The presented algorithm is a Convolutional Neural Network which produces the track-seeds directly from the raw pixel information of the four layers in the jet core region. In this way, it is possible to improve the tracking performance by skipping the pixel clustering.
  • DeepCore has been trained on 22 millions of inputs from 2 millions of Monte Carlo QCD events (1.8 TeV <p̂T< 2.4 TeV, pTjet> 1 TeV, |ηjet|<1.4)
Integration and validation
  • DeepCore replaces the seeding step of the cKF of CMS reconstruction in the iteration dedicate to the jet core region only.
  • The performance of the algorithm integrated in the CMS reconstruction has been validated on a Monte Carlo sample of 2 x 104 QCD events (1.8 TeV < p̂T < 2.4 TeV, pTjet> 1 TeV, |ηjet|<1.4)
  • The typical selection is applied on simulated tracks used for validation: |ηjet|<2.5, rprod<3 cm, |zprod|<30 cm, pT> 0.9 GeV.

DeepCore Network

Input

A single input of the network is composed by:

  • four 30x30 pixel maps (one per layer), aligned with the directions of a merged cluster present on layer one (or layer 2 if layer 1 is missing)
  • jet η and jet pT
Note: for a single jet are produced multiple inputs for the Neural Network
Target

For each pixel in 30x30 window of layer 2:

  • 1 if a track cross that pixel, 0 otherwise (TCP map)
  • the 5 track parameters in local parametrization (with respect to the centre of the pixel) for the TCP and in a circle of radius 2 pixel (NCP)
  • 3 repetitions of this target are given to take into account of overlap
Prediction
  • Same structure of the target
  • The seeds are selected as the track parameters of the most probable (TCP>0.85, 0.75, 0.65 in the 3 repetitions, or 0.50,0.40,0.30 if layer 2 is missing) pixels.
Architecture
  • 9 2D Convolutional layers branched at layer 5 in two, to predict the TCP Map and the Track Parameters Map
  • Activation functions: ReLU for all the layers, except Sigmoid for the last TCP layer
  • Loss functions: 2 custom functions for the two branches
  • TCP: Binary Cross Entropy weighted on TCP/NCP/other pixels fractions
  • Track Parameters: Mean Square Error, clipped between [-5,5], averaged on TCP and NCP only

Definitions

Tracking Efficiency

Number of simulated tracks associated to a reconstructed one divided by the number of simulated tracks. A simulated track is flagged as “associated” if the χ2 between its parameters and the reconstructed is lower than 25.

Fake Rate

Number of reconstructed tracks not associated to a simulated one divided by the number of reconstructed tracks.

Duplicate rate

Number of tracks duplicate divided by the number of reconstructed tracks. A reconstructed track is flagged as “duplicate” if at least another reconstructed track is associated to the same simulated track of the first one.

Current status

  • DeepCore has been fully trained and integrated in CMSSW in the barrel region only ( |ηjet|<1.4)
  • DeepCore-endcap is currently under development (optimization of the training)
  • DeepCore can be enabled, in cmsDriver commands, with the option ----procModifiers seedingDeepCore. In the endcap the standard jetCore is still used.
Workflow
Last pull requests:
Code location:

References

Detector Performance Summary: [ https://cds.cern.ch/record/2669826/files/DP2019_007.pdf ]

Connecting The Dots 2019 Proceeding: [ https://arxiv.org/abs/1910.08058 ]

Connecting The Dots 2019 Talk: [ https://indico.cern.ch/event/742793/contributions/3274301/attachments/1822584/2981871/bertacchi_deepcore_ConnectingTheDots.pdf ]

CMS track reconstruction reference: [ arXiv:1405.6569v2 ]

Performance Plots

Training

plots Description

pdf png
Residual between the seed η parameter predicted by DeepCore and the target (simulated) track η parameter.

pdf png
Correlation between prediction of DeepCore and target parameters shown with seed η parameter predicted against the simulated track η parameter.

CMS reconstruction performance

plots plots Description

pdf png

pdf png
Tracking efficiency (left figure) and fake rate (right figure) in the jet core region (ΔR<0.1, between the reconstructed jet axis and the simulated track direction). The contribution of the different iterations of the cKF are shown as stacked histograms. The DeepCore algorithm is used in the iteration dedicated to the cores of the jets [jetCore (purple)]. In the efficiency the shared reconstructed tracks (duplicated) between various iterations are not removed.

CMS Compared reconstruction performance

plots plots Description

pdf png

pdf png
Tracking efficiency (left figure) and fake rate (right figure) in the jet core region (ΔR<0.1, between the reconstructed jet axis and the simulated track direction). The light blue filled histogram is obtained with the standard CMS tracking algorithm. The dark blu histogram is obtained removing the cKF iteration dedicated to the jet cores. The red histogram is obtained using the DeepCore in the seeding for the interation dedicated to the jet cores. The green histrogram is obtained producing the seed for the jet core iteration using the simulated track information (i.e. equivalent to 100% of seeding efficiency). In the lower pads are shown the differences between various tracking efficiencies (fake rates) and the MC truth seeding one, divided by the MC truch seeding efficiency (fake rate).

Qualitative Plots

plots plots plots plots plots Description

pdf png

pdf png

pdf png

pdf png

pdf png
Example of of the pixel maps used as input for the DeepCore neural network (four left plots): the maps shows a windows on the four pixel detecor layer of CMS, aligned to the jet direction. The most right figure is the map of the predicted crossing point on the window of layer 2, expressed as probability.

pdf png

pdf png

pdf png

pdf png

pdf png
Example of of the pixel maps used as input for the DeepCore neural network (four left plots): the maps shows a windows on the four pixel detecor layer of CMS, aligned to the jet direction. On the top are also shown the crosses of the crossing point of the target (simulated) tracks and the correspondent predicion of DeepCore for the most probable hits. The prediction is produced on layer 2 and propagated linearly on the other layer. The most right figure is the map of the predicted crossing point on the window of layer 2, expressed as probability, with the crosses of the predictions and the targets.
-- ValerioBertacchi - 2019-04-09
Topic attachments
I Attachment History Action Size Date Who Comment
PDFpdf Probabiltiy_crosses_event30.pdf r1 manage 21.5 K 2020-11-05 - 20:16 ValerioBertacchi Probability with and without prediction and target
PNGpng Probabiltiy_crosses_event30.png r1 manage 19.6 K 2020-11-05 - 20:16 ValerioBertacchi Probability with and without prediction and target
PDFpdf Probabiltiy_only_event30.pdf r1 manage 18.3 K 2020-11-05 - 20:16 ValerioBertacchi Probability with and without prediction and target
PNGpng Probabiltiy_only_event30.png r1 manage 15.2 K 2020-11-05 - 20:16 ValerioBertacchi Probability with and without prediction and target
PDFpdf RGB_PixelMap_crosses_layer0_event30.pdf r1 manage 17.0 K 2020-11-05 - 20:16 ValerioBertacchi RGB pixel maps
PNGpng RGB_PixelMap_crosses_layer0_event30.png r1 manage 16.2 K 2020-11-05 - 20:17 ValerioBertacchi RGB pixel maps
PDFpdf RGB_PixelMap_crosses_layer1_event30.pdf r1 manage 17.0 K 2020-11-05 - 20:16 ValerioBertacchi RGB pixel maps
PNGpng RGB_PixelMap_crosses_layer1_event30.png r1 manage 16.6 K 2020-11-05 - 20:17 ValerioBertacchi RGB pixel maps
PDFpdf RGB_PixelMap_crosses_layer2_event30.pdf r1 manage 16.0 K 2020-11-05 - 20:16 ValerioBertacchi RGB pixel maps
PNGpng RGB_PixelMap_crosses_layer2_event30.png r1 manage 15.6 K 2020-11-05 - 20:17 ValerioBertacchi RGB pixel maps
PDFpdf RGB_PixelMap_crosses_layer3_event30.pdf r1 manage 15.3 K 2020-11-05 - 20:16 ValerioBertacchi RGB pixel maps
PNGpng RGB_PixelMap_crosses_layer3_event30.png r1 manage 14.9 K 2020-11-05 - 20:17 ValerioBertacchi RGB pixel maps
PDFpdf RGB_PixelMap_input_layer0_event30.pdf r1 manage 13.8 K 2020-11-05 - 20:16 ValerioBertacchi RGB pixel maps
PNGpng RGB_PixelMap_input_layer0_event30.png r1 manage 12.0 K 2020-11-05 - 20:17 ValerioBertacchi RGB pixel maps
PDFpdf RGB_PixelMap_input_layer1_event30.pdf r1 manage 13.8 K 2020-11-05 - 20:16 ValerioBertacchi RGB pixel maps
PNGpng RGB_PixelMap_input_layer1_event30.png r1 manage 12.1 K 2020-11-05 - 20:17 ValerioBertacchi RGB pixel maps
PDFpdf RGB_PixelMap_input_layer2_event30.pdf r1 manage 13.7 K 2020-11-05 - 20:16 ValerioBertacchi RGB pixel maps
PNGpng RGB_PixelMap_input_layer2_event30.png r1 manage 12.0 K 2020-11-05 - 20:17 ValerioBertacchi RGB pixel maps
PDFpdf RGB_PixelMap_input_layer3_event30.pdf r1 manage 13.6 K 2020-11-05 - 20:16 ValerioBertacchi RGB pixel maps
PNGpng RGB_PixelMap_input_layer3_event30.png r1 manage 11.9 K 2020-11-05 - 20:17 ValerioBertacchi RGB pixel maps
PDFpdf eff_chi2_comparison.pdf r1 manage 15.4 K 2020-11-05 - 20:12 ValerioBertacchi Efficiency and fakerate comparison
PNGpng eff_chi2_comparison.png r1 manage 32.8 K 2020-11-05 - 20:12 ValerioBertacchi Efficiency and fakerate comparison
PDFpdf fake_chi2_comparison.pdf r1 manage 15.5 K 2020-11-05 - 20:12 ValerioBertacchi Efficiency and fakerate comparison
PNGpng fake_chi2_comparison.png r1 manage 31.3 K 2020-11-05 - 20:12 ValerioBertacchi Efficiency and fakerate comparison
PDFpdf predVStarget_520.0_2.pdf r1 manage 29.1 K 2020-11-05 - 20:13 ValerioBertacchi correlation prediction-target
PNGpng predVStarget_520.0_2.png r1 manage 43.7 K 2020-11-05 - 20:13 ValerioBertacchi correlation prediction-target
PDFpdf residual_520.0_2.pdf r1 manage 28.4 K 2020-11-05 - 20:13 ValerioBertacchi residuals
PNGpng residual_520.0_2.png r1 manage 46.0 K 2020-11-05 - 20:13 ValerioBertacchi residuals
PDFpdf stacked_efficiency_dr.pdf r1 manage 16.5 K 2020-11-05 - 20:14 ValerioBertacchi stacked efficiency and fakerate
PNGpng stacked_efficiency_dr.png r1 manage 29.2 K 2020-11-05 - 20:14 ValerioBertacchi stacked efficiency and fakerate
PDFpdf stacked_fakerate_dr.pdf r1 manage 16.1 K 2020-11-05 - 20:14 ValerioBertacchi stacked efficiency and fakerate
PNGpng stacked_fakerate_dr.png r1 manage 29.9 K 2020-11-05 - 20:14 ValerioBertacchi stacked efficiency and fakerate
Edit | Attach | Watch | Print version | History: r6 < r5 < r4 < r3 < r2 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r6 - 2021-08-02 - ValerioBertacchi
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    CMSPublic All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright &© 2008-2023 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback