-- CristinaAnaMantillaSuarez - 2015-02-23

Projects



Summer Student Project: Measurement of mt using leptonic observables

We are interested on how is the top quark mass mt related to the kinematics of the leptons in the final state. The idea is that is this analysis the measurement won't be affected by the jet energy scale JES uncertainty.

This is the theory article from which the idea is based on: http://inspirehep.net/record/1305642

Slides: https://indico.cern.ch/event/301787/session/3/contribution/17/material/slides/0.pdf

  • It presents a procedure for determination of mt based on leptonic observables in dilepton tt events.
    • Construct the distributions from leptons only and require b-jets [anti-kT, R=0.5] within the detector (i.e. integrate over) - Minimal sensitivity to the modeling of both perturbative and non-perturbative QCD effects.
    • We shall be working with the top quark pole mass (defined as the real part of the pole in the top-quark propagator)
  • Employs kinematic distributions of leptons - we are interested in their shapes.
Label Observable
1 $p_T (l^{+})$
2 $p_T (l^{+}l^{-})$
3 $M_T (l^{+}l^{-})$
4 $E(l^{+})+E(l^{-})$
5 $p_T(l^{+})+p_T(l^{-})$
    • Working with distributions is cumbersome, but the information on the top mass that such shapes encode can be effectively provided by Mellin moments of the corresponding distributions.

ABOUT UNFOLDING

Unfolding can be defined as correcting data for detector effects. Due to the finite resolution of real world particle detectors, any measurement conducted in experimental high energy physics is contaminated by stochastic smearing. The observations recorded with any real world particle detector are always subject to undesired experimental effects, such as limited detector resolution and detection inefficiencies. The observation of such distorted collision events instead of the desired true events is called smearing or folding of the data and often results in broadening of the physical spectra measured by the LHC experiments.

Unfolding then refers to using the smeared observations to infer the true physical distribution of the events. It refers to the problem of estimating the particle level distribution of some physical quantity of interest on the basis of observations smeared by an imperfect measurement device.

There are three main reasons when unfolding is desirable:

  • To publish an estimate of the true physical distribution of events.
  • To compare measurements of 2 different experiments with different experimental resolutions
  • To compare unfolded histograms with theoretical predictions

In our case, we apply an unfolding technique because we want to compare our results with the theoretical predictions from our theory paper of reference. The measured distributions are distorted from the true underlying distributions by the limited acceptance of our detector and by bin-to-bin smearing due to a finite resolution of the variables. We perform our unfolding using TUnfold package, which consists on a matrix inversion based on a least square fit with Tikhonov regularisation. Similar (but not identical) to the SVD method. Some documentation can be found here:

http://www.desy.de/~sschmitt/tunfoldv16docu.html http://www.desy.de/~sschmitt/TUnfold/tunfold_manual_v17.3.pdf

Also we will try to follow TOP group general recomendations related to unfolding for top signatures.

https://twiki.cern.ch/twiki/bin/viewauth/CMS/TopUnfolding

Actually, the unfolding code we use is based in the code spinnet in this page:

https://twiki.cern.ch/twiki/bin/view/CMS/TopUnfoldingExampleCodes

For more information about TUnfold algorithm, have a look at:

The class TUnfold can be used to unfold measured data spectra and obtain the underlying "true" distribution. The unfolding method is summarized here: *The measured spectrum $\vec{y}$ can be expressed by the true spectrum $\vec{x}$ multiplied by a smearing matrix S, that accounts for migration of an event from one bin into another bin due to resolution effects as well as for different acceptances for the different bins:

$\vec{y}$ = S$\cdot$$\vec{x}$

By performing a regularized inversion of the matrix S, TUnfold gives an estimate for the true spectrum $\vec{x}$ and accounts for the above mentioned effects. In addition TUnfold can also take care of the proper subtraction of background contributions with a proper handling of the uncertainties on the background estimation. We use TUnfoldSys which provides methods to do systematic error propagation and to do unfolding with background subtraction.

TUnfoldSys uses a regularization parameter $\tau$, giving the strength of regularization. Will be roughly on the order of 1e-4. We use this value suggested but in principle you can determine this value by performing unfolding with many different values, maybe between 1e-3 and 1e-7 or such, and choosing the value of tau that minimizes tunfold.GetRhoAvg().

UPDATES:

  • NEW WORKING DIRECTORY: /afs/cern.ch/work/c/cmantill/public/CMSSW_5_3_22/src/UserCode/TopMassSecVtx/

7 Sep, 2015: Remarks

Some remarks and conclusions of this project at the end of this summer:

  • We fully implemented the analysis proposed by theorists.

  • We have shown that is feasible to measure m$_t$ using lepton distributions only.

  • The kinematic variables have lower sensitivity to m$_t$ than the quoted in [1].

  • We found larger uncertainties than expected. This might be because we are affected by statistical uncertainty of the samples.

  • Studied how to improve systematics using a shape analysis. The expected syst. uncertainty for the shape analysis is: \textcolor{blue}{$+2.1/ - 1.6$} [GeV].

  • Changing prescription for systematic uncertainties as proposed by A Mitov is expected to yield a more competitive measurement.

I gave some short talks about this project. The slides you can find in the next links:

7 Aug, 2015: Systematics and optimizing unfolding procedure

$p_{T}(l^{+})$
Systematic Uncertainty $\delta \mu^{(1)} _{p_{T}(l^{+})}$ [GeV] $\delta m_t$ [GeV]
Scale Up -0.0888 -1.7438
Scale Down 0.0936 1.8366
Matching Up 0.0554 1.0872
Matching Down 0.0811 1.5917
Pileup Up -0.0369 0.4029
Pileup Down -0.0156 -0.3082
Lepton Selection Up -0.0022 -0.0431
Lepton Selection Down 0.002 0.04375
Top Pt -0.2002 -3.9458
Lepton Energy Scale Up 0.0035 0.0687
Lepton Energy Scale Down -0.0143 -0.2824
p11 0.0286 0.5637
p11nocr -0.0347 -0.6829
p11mpihi -0.0413 -0.8135

30 July, 2015: Systematics and Correction to Moments Plots

I find a bug in the code that was plotting the Mellin moments vs the top mass, I was working with the unfolded distribution divided by the bin width but the distribution at generated level was not.

Unfolded vs Top Mass
$\mu^{(1)}_{p_T (l^{+})}$ $\mu^{(1)}_{p_T (l^{+}l^{-})}$ $\mu^{(1)}_{M (l^{+}l^{-})}$

$\mu^{(1)}_{E (l^{+})+E(l^{-})}$ $\mu^{(1)}_{Pt (l^{+})+Pt(l^{-})}$

I also calculated most of the relevant systematics for this analysis and made some ratio plots for each pair of systematics.


Positive lepton Pt Systematics -

Systematic Uncertainty $\delta \mu^{(1)} _{p_{T}(l^{+})}$ [GeV] $\delta m_t$ [GeV]
Scale Up -0.0675 -1.330
Scale Down 0.1149 2.2641
Matching Up 0.0767 1.5118
Matching Down 0.1024 2.0183
Pileup Up 0.0204 0.4029
Pileup Down -0.0156 -0.3082
Lepton Selection Up -0.0022 -0.0431
Lepton Selection Down 0.002 0.04375
Top Pt -0.2002 -3.9458
Lepton Energy Scale Up 0.0035 0.0687
Lepton Energy Scale Down -0.0143 -0.2824
p11 0.0286 0.5637
p11nocr -0.0347 -0.6829
p11mpihi -0.0413 -0.8135

Scales Matching Scale P11
Pile Up Lepton Energy Scale Lepton Selection Efficiency
Top P$_{T}$

Charged lepton pair Pt Systematics -

Systematic Uncertainty $\delta \mu^{(1)} _{p_{T}(l^{+})}$ [GeV] $\delta m_t$ [GeV]
Scale Up -0.1297 -1.3436
Scale Down 0.3447 3.5707
Matching Up -0.0715 -0.7404
Matching Down 0.2668 2.7638
Pileup Up 0.0166 0.1715
Pileup Down -0.0159 -0.1642
Lepton Selection Up -0.0027 -0.0281
Lepton Selection Down 0.0028 0.0287
Top Pt -0.2496 -0.2586
Lepton Energy Scale Up 0.0499 0.5172
Lepton Energy Scale Down -0.0493 -0.5111
p11 -0.0196 -0.2032
p11nocr -0.0162 -0.1680
p11mpihi -0.020408 -0.2114

Scales Matching Scale P11
Pile Up Lepton Energy Scale Lepton Selection Efficiency
Top P$_{T}$

23 July, 2015: Binning and Unfolding

After correcting the binning, we found that the distributions still contained a high number of events the first and last bin. We corrected this by not counting the overflow and underflow bin in the migration matrix, i.e. when generating plots with runPlotter.py we add the option:

python scripts/runPlotter.py -j test/topss2014/mass_scan_samples.json -o quantiles/mc/ptpos/166/plots quantiles/mc/ptpos/166 --cutUnderOverFlow

Besides, we found a bug while calculating Mellin moments from the unfolded distribution. We were working with the unfolded distribution divided by the bin width, and this was causing a displacement on the mean value from 52 to 44 approx. This mean value we use to calculate the first and second moments.

After correcting this two bugs, we obtained the final plots. However we still observe a difference between the moments calculated from the unfolded and generated level kinematic distributions.

The plots for each of the kinematic variables are here:

For kinematic distribution plots at reconstructed and generated level, and also for the unfolded distribution, I attach here only the results for 171 mass sample. The other mass samples distributions look quite similar (look at the comparison plot between rec and gen).


Positive lepton Pt

  • $p_T (l^{+})$

Distribution comparison at reconstructed level Distribution comparison at generated level

  • Mellin moments: $\mu^{(1)}_{p_T (l^{+})}$ and $\mu^{(2)}_{p_T (l^{+})} $

$\mu^{(1)}$ Unfolded vs Generated $\mu^{(2)}$ Unfolded vs Generated

$\mu^{(1)}$ Unfolded vs Top Mass $\mu^{(2)}$ Unfolded vs Top Mass

  • e.g. 171 mass sample

Generated level Reconstructed level Reconstructed level Bin-corrected

Unfolded Efficiency, Purity and Stability

Charged lepton pair Pt

  • $p_T (l^{+}l^{-})$

Distribution comparison at reconstructed level Distribution comparison at generated level

  • Mellin moments: $\mu^{(1)}_{p_T (l^{+})}$ and $\mu^{(2)}_{p_T (l^{+})} $

$\mu^{(1)}$ Unfolded vs Generated $\mu^{(2)}$ Unfolded vs Generated

$\mu^{(1)}$ Unfolded vs Top Mass $\mu^{(2)}$ Unfolded vs Top Mass

  • e.g. 171 mass sample

Generated level Reconstructed level Reconstructed level Bin-corrected

Unfolded Efficiency, Purity and Stability

Invariant mass of the charged lepton pair

  • $M (l^{+}l^{-})$

Distribution comparison at reconstructed level Distribution comparison at generated level

  • Mellin moments: $\mu^{(1)}_{p_T (l^{+})}$ and $\mu^{(2)}_{p_T (l^{+})} $

$\mu^{(1)}$ Unfolded vs Generated $\mu^{(2)}$ Unfolded vs Generated

$\mu^{(1)}$ Unfolded vs Top Mass $\mu^{(2)}$ Unfolded vs Top Mass

  • e.g. 171 mass sample

Generated level Reconstructed level Reconstructed level Bin-corrected

Unfolded Efficiency, Purity and Stability
/>

Energy sum of the 2 leptons

  • $E (l^{+})+E(l^{-})$

Distribution comparison at reconstructed level Distribution comparison at generated level

  • Mellin moments: $\mu^{(1)}_{p_T (l^{+})}$ and $\mu^{(2)}_{p_T (l^{+})} $

$\mu^{(1)}$ Unfolded vs Generated $\mu^{(2)}$ Unfolded vs Generated

$\mu^{(1)}$ Unfolded vs Top Mass $\mu^{(2)}$ Unfolded vs Top Mass

  • e.g. 171 mass sample

Generated level Reconstructed level Reconstructed level Bin-corrected

Unfolded Efficiency, Purity and Stability

Pt sum of the 2 leptons

  • $Pt (l^{+})+Pt(l^{-})$

Distribution comparison at reconstructed level Distribution comparison at generated level

  • Mellin moments: $\mu^{(1)}_{p_T (l^{+})}$ and $\mu^{(2)}_{p_T (l^{+})} $

$\mu^{(1)}$ Unfolded vs Generated $\mu^{(2)}$ Unfolded vs Generated

$\mu^{(1)}$ Unfolded vs Top Mass $\mu^{(2)}$ Unfolded vs Top Mass

  • e.g. 173 mass sample

Generated level Reconstructed level Reconstructed level Bin-corrected

Unfolded Efficiency, Purity and Stability

22 July, 2015: Stability, Purity and Efficiency

I obtained the stability, purity and efficiency of the binning as a closure test.

*Purity :

$p=\frac{N_{rec gen}}{N_{rec}}$

The purity p denotes the number of events that are generated and correctly reconstructed in a given bin i relative to the number of events that are reconstructed in bin i but generated anywhere.

*Stability

$s=\frac{N_{rec gen}}{N_{gen}}$

(notice this is basically the diagonal of the matrix after normalizing it) The stability s denotes the number of events that are generated and correctly reconstructed in a given bin i relative to the number of events that are generated in bin i but reconstructed anywhere

  • Efficiency :
$eff=N_{rec gen}/N_{gen total}$

That is, the number of events reconstructed in a given bin which are matched at generator level divided by the total number of events which have been matched at generator level

Efficiency, Purity and Stability


Also, I found a bug in my unfolding procedure, where I was subtracting background although I was not taking it into account. I corrected this and summarized my results in this slides:


Now I am having problems to define a correct binning to unfold, since the last bin and the first bin contain also the overflow and underflow bins respectively. So I am doing several tests with the binning, and using just one of the variables to test: $p_T (l^{+})$

At the end I compare the unfolded distributions obtained for each of the cases. And I compare the purity, stability and efficiency and also the

Here I describe how do I choose the bins:

The purpose of using quantiles is to choose these bins in such a way that all bins in the histograms contain the same numbers of events. This will increase the stability of the method. Depending on the case, flattening the truth spectrum after selection might be better than before selection. You have to use a finer binning in the measured variable as compared to the truth variable. As a rule of thumb, we should use at least twice the number of bins for the measured variable.

  • After several tests, I obtained the quantiles for generated and reconstructed distributions (19 and 38 respectively) and I defined the binning with those, making sure that the reconstructed binning has one more bin at the start and the end of the array, i.e.:

bins_gen = [22.72,25.45,28.18,30.92,33.67,36.42,39.17,42.01,44.89,47.77,50.94,55.06,59.19,63.63,68.15,74.76,83.59,95,115.65]

bins_rec = [20,21.36,22.73,24.09,25.45,26.82,28.18,29.55,30.92,32.29,33.67,35.04,36.41,37.79,39.17,40.57,42.01,43.45,44.89,46.33,47.77,49.21,50.93,53,55.06,57.12,59.18,61.37,63.63,65.89,68.15,70.73,74.76,78.74,83.59,88.75,95,102.85,115.65,138.46,250]

The bold bins are the extra bins in the reconstructed level.

Generated level Reconstructed level Reconstructed level Bin-corrected

Unfolded Efficiency, Purity and Stability

For all the mass samples:

Distribution comparison First Mellin Moment Unfolded vs Generated Second Mellin Moment Unfolded vs Generated

u1 Unfolded vs Top Mass u2 Unfolded vs Top Mass

The new version of the binning flattens out the distribution, so it is okay. And also now the bins at reconstructed level are the double than at generated level, so the unfolding procedure is working.

Still there are some things to work in. When we compare the unfolded and the generated plots versus top mass they don't give exactly the same. They are both linear but the scale is somehow different.

I have put the plots for the other distributions here:

http://cmsdoc.cern.ch/~cmantill/top/

17 July, 2015: Comparing shape of the distributions, at reconstructed and generated level, for each of the mass samples

  • $p_T (l^{+})$ - ptpos - Pt of the positive lepton

Distribution comparison at reconstructed level Distribution comparison at generated level

  • $p_T (l^{+}l^{-})$ - ptll - Pt of the charged lepton pair

Distribution comparison at reconstructed level Distribution comparison at generated level

  • $M (l^{+}l^{-})$ - mll - Invariant mass of the charged lepton pair

Distribution comparison at reconstructed level Distribution comparison at generated level

  • $E (l^{+})+E(l^{-})$-EposEm- Energy sum of the 2 leptons

Distribution comparison at reconstructed level Distribution comparison at generated level

  • $Pt (l^{+})+Pt(l^{-})$-ptposptm- Pt sum of the 2 leptons

Distribution comparison at reconstructed level Distribution comparison at generated level

All of the variables show a similar (but fairly weak) dependence on the top mass

16 July, 2015: Calculating Mellin moments for each of the mass samples

Given an observable O (e.g. one of the listed in the table), the Mellin moments are given by:

$\mu^{(1)}_O = <O>$ ( Mean )

$\mu^{(2)}_O = <O^{2}> = \sigma^{2} +  <O>^{2}$

I obtained the first plots, as a test, using just information from reconstructed and generated level. The unfolding procedure is not working, we think it is due to a binning problem.

First Mellin Moment Reconstructed vs Generated Second Mellin Moment Reconstructed vs Generated

u1 Generated level vs Top Mass u1 Reconstructed level vs Top Mass

u2 Generated level vs Top Mass u2 Reconstructed level vs Top Mass

13 July, 2015: Working with mass samples

We will start to work with mass samples from here, corresponding to mt = [166.5,169.5,171.5,173.5,175.5,178.5] GeV.

The mass samples are located in

 
/store/cmst3/group/top/summer2015/treedir_bbbcb36/ttbar/mass_scan/

for ttbar they are named as MC8TeV_TTJets_MSDecays_*.root where * is the mass. In addition, given tW/tbarW is the main background I will also use:

 
/store/cmst3/group/top/summer2015/bbbcb36/mass_scan/MC8TeV_SingleTbar_tW_*.root 
/store/cmst3/group/top/summer2015/bbbcb36/mass_scan/MC8TeV_SingleT_tW_*.root 

I have to reproduce the distribution plots for all the samples and for each of the kinematic variables. I have to unfold this distributions using MC only, and then calculate the Mellin moments such as at the end I will obtain a calibration curve of the moments values vs the top mass.

The migration matrix we use to unfold the distributions of each of the mass samples, is the one from the nominal sample MC8TeV_TTJets_MSDecays_172v5.root

09 July, 2015: Rebinning distributions according to quantiles

To check the binning array I was using, I got the quantiles separately and then round those numbers a bit so I have the same bin size for a long range and then a bit bigger bins towards the end.Then I just hardcode them by adding that array into my code. At the end I got this:

Generated level Reconstructed level Reconstructed level Bin-corrected

In some cases there are events in the 0 pt bin. Could be that there are some events with only one lepton.

I checked the event selection, and there is indeed a cut on abs(EvCat) to be either 11*11, 11*13, or 13*13. But there are some events in the MC samples which have events in the 0 pt bin, we impose the condition:

 
 if not isData: 
           if tree.GenLpPt == 0 or tree.GenLmPt == 0: continue

Now the plots look like this:

Generated level Reconstructed level Reconstructed level Bin-corrected

08 July, 2015: Rebinning distributions according to quantiles

I found a bug in the way that I was calculating the quantiles, so now the distributions look like this for ptpos:

Generated level Reconstructed level Reconstructed level Bin-corrected

It looks like all the bins except the first and last are ok (they have more or less the same number of entries), but those two are wrong. I added one more quantil, by adding the 1.0 into my array

Generated level Reconstructed level Reconstructed level Bin-correctedSorted ascending

07 July, 2015: Rebinning distributions according to quantiles

We have started to study the unfolding procedure.

We are going to define the binning scheme for our histograms, such that we get flat distributions to unfold.

This is the idea: GetQuantiles ROOT Function gets you the values of x which divide the distribution in the quantiles you define.

You can use it to re-define the binning of the distribution, such that now you now that the statistics will be distributed according to your pre-defined quantiles.

i.e.

  • Run once with the histograms defined with a big range and equal binning and get the quantiles for |0.1, 0.2, 0.3, ...,0.9,1 ]
  • Run again with the histograms (and error matrix) defined according to the quantiles found i.e. [0, x_{0.1}, x_{0.2},...,x_{0.9},x_{1}, max]
  • Unfold using the new binning definition

  • At first I made a code which got the quantiles from each of the MC and data samples and rebinned the corresponding histograms according to each of the obtained quantiles.
  • But what they wanted me to do was to get the quantiles from one of the DataMUEG files and fix the binning for all for the plots using them. I did this but then I got too many events in the last bin.

Generated level Reconstructed level Reconstructed level Bin-corrected

I extended the bin range, but I am just increasing the range to 200 without changing the binning, so I just make one big bin from 100 to 200, which will contain all the events there.

Generated level Reconstructed level Reconstructed level Bin-corrected

Note: Github useful tips

 wget -q -O - --no-check-certificate https://raw.github.com/stiegerb/TopMassSecVtx/master/TAGS.txt | sh
 git clone git@github.com:stiegerb/TopMassSecVtx.git UserCode/TopMassSecVtx
 git status
 cp ~/.gitconfig ~/.gitconfig.orig
 cp /afs/cern.ch/user/s/stiegerb/public/forCarlotta/.gitconfig ~/
 git df
 git br
 git remote -v
 git remote add cmantill git@github.com:cmantill/TopMassSecVtx.git
 git remote -v
 git checkout -b mtdilepton
 git add scripts/runDileptonUnfolding.py
 git add scripts/utils.py
 git commit -m'Cristinas first commit'
 git l
 git push cmantill mtdilepton

history | grep git > githistory

Note:

Getting CERN Kerberos ticket in my laptop I connected my computer with lxplus using OpenAFS. In order to access your CERN AFS account you'll need to obtain an AFS token from the CERN server. I had already installed Kerberos packages but I followed instructions from this sites:

http://linux.web.cern.ch/linux/docs/kerberos-access.shtml

https://gist.github.com/KFubuki/10728230 - Also here I found some CERN very useful hacks

https://wiki.chipp.ch/twiki/bin/view/CmsTier3/HowToWorkInCmsEnv

After installing and setting it up, you should create a ticket and log on:

kinit username@CERN.CH
aklog
  

You can also test your access with:

klist
ls /afs/cern.ch/
ls /afs/cern.ch/user/c/cmantill/
  

Now you can work directly from your computer.

Note:

Request Workspace at Lxplus Locally at CERN, the personal working area on CERN's LXPLUS cluster isn't big enough to handle the output files, you'll need to write them to a larger-capacity area.

To ask for "AFS workspace" (up to 100 GB, backed up), login to the Cern account web page and go to "List Services", take the "AFS Workspaces" and then "Settings".

There you can ask for "workspace" in AFS, as well as extend the quota for your backed-up home (up to 10 GB). Please note the different AFS path to your workspace: /afs/cern.ch/work/initial/username where initial is the first letter of your username, i.e. the workspace is not hanging from your home.

Workspace path: /afs/cern.ch/work/c/cmantill

30 June, 2015: Generating first plots

Turns out I was submitting 0 jobs before because the input directory should be this one:

 
input directory with the files: /store/cmst3/group/top/summer2015/treedir_bbbcb36/ttbar
 

I got the plots for the other four distributions. I modified scripts/runDileptonUnfolding.py and got a new version. Here I attach some plots:

The notation I use in my code is the following:

  • $p_T (l^{+})$ - ptpos - Pt of the positive lepton

  • $p_T (l^{+}l^{-})$ - ptll - Pt of the charged lepton pair

Generated level Reconstructed level Reconstructed level Bin-corrected

Observed normalization PDF

  • $M (l^{+}l^{-})$ - mll - Invariant mass of the charged lepton pair

Generated level Reconstructed level Reconstructed level Bin-corrected

Observed normalization PDF

  • $E (l^{+})+E(l^{-})$-EposEm- Energy sum of the 2 leptons

Generated level Reconstructed level Reconstructed level Bin-corrected

Observed normalization PDF

  • $Pt (l^{+})+Pt(l^{-})$-ptposptm- Pt sum of the 2 leptons

Generated level Reconstructed level Reconstructed level Bin-corrected

Observed normalization PDF

29 June, 2015: Reading the twiki page and starting to work

The twiki page: https://twiki.cern.ch/twiki/bin/viewauth/CMS/CMGTopStudents2015

Working directory: /afs/cern.ch/user/c/cmantill/private/top/CMSSW_5_3_22/src/UserCode/TopMassSecVtx

  • Storing locally the normalization to be applied to the MC.
     
       ./scripts/runPlotter.py --rereadXsecWeights /store/cmst3/group/top/summer2015/bbbcb36/ -j test/topss2014/samples.json
       

  ...Processing all MC8TeV and Data8TeV
   ...
   >>> Produced xsec weights and wrote to cache (.xsecweights.pck)
   

The normalization is computed as a weight $\sigma/N_{\rm orig~events}$ where $\sigma$ is the theoretical cross section of a process and $N_{\rm orig~events}$ is the number of generated events. The luminosity $\mathcal{L}$ is the ratio of the number of events detected $\mathcal{N}$ in a certain time interval to the interaction cross-section $\sigma$. With this definition, the number of expected events after acquiring a given integrated luminosity $\mathcal{L}$, is given by $N=N_{\rm sel}\cdot\mathcal{L}\cdot\sigma/N_{\rm orig~events}$ where $N_{\rm sel}$ is the number of selected events in the analysis.

  • $p_T (l^{+})$ distributions
    • Creating the reconstructed and generator level distributions

 
python scripts/runDileptonUnfolding.py -i /store/cmst3/group/top/summer2015/treedir_bbbcb36/ -o unfoldResults/ --jobs 8

input directory with the files: /store/cmst3/group/top/summer2015/treedir_bbbcb36/

--------------------------------------------------------------------------------
Creating ROOT file with migration matrices, data and background distributions from /store/cmst3/group/top/summer2015/treedir_bbbcb36/singlet/
Discarded 0 files duplicated in cmsLs output
 Submitting jobs in 8 threads
Histograms saved in unfoldResults/Data8TeV_SingleElectron2012C.root
...
 --------------------------------------------------------------------------------

I obtained the distributions at generated and reconstructed level for %$p_T (l^{+})$ (ptpos).

Simple counting Bin-corrected

I can subtract the background from the data, and unfold the result by running

python scripts/runDileptonUnfolding.py -r unfoldResults/plots/plotter.root -v ptpos -o unfoldResults

Observed normalization PDF

Note: Eos useful commads:

eoscms ls -l /eos/cms/store/cmst3/group/top/summer2015

TFile::Open("root://eoscms.cern.ch//eos/cms/store/cmst3/group/top/summer2015/ttbar/treedir_bbbcb36/ttbar/MC8TeV_ZZ.root")

Topic attachments
I Attachment History Action Size Date Who Comment
PNGpng EposEm171_gen.png r1 manage 20.9 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distribbutions
PNGpng EposEm171_pur_stab_eff.png r1 manage 12.4 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng EposEm171_rec.png r1 manage 18.6 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng EposEm171_rec_wgt.png r1 manage 20.1 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng EposEm171_unfolded.png r1 manage 24.0 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng EposEm_gen_comparisonNorm_2.png r1 manage 24.3 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng EposEm_rec_comparisonNorm_2.png r1 manage 28.0 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng mll171_gen.png r1 manage 21.0 K 2015-07-24 - 12:06 CristinaAnaMantillaSuarez Mll distributions
PNGpng mll171_pur_stab_eff.png r1 manage 11.9 K 2015-07-24 - 12:05 CristinaAnaMantillaSuarez Mll distributions
PNGpng mll171_rec.png r1 manage 17.8 K 2015-07-24 - 12:05 CristinaAnaMantillaSuarez Mll distributions
PNGpng mll171_rec_wgt.png r1 manage 18.1 K 2015-07-24 - 12:05 CristinaAnaMantillaSuarez Mll distributions
PNGpng mll171_unfolded.png r1 manage 24.1 K 2015-07-24 - 12:05 CristinaAnaMantillaSuarez Mll distributions
PNGpng mll_gen_comparisonNorm_2.png r1 manage 24.9 K 2015-07-24 - 12:05 CristinaAnaMantillaSuarez Mll distributions
PNGpng mll_rec_comparisonNorm_2.png r1 manage 27.6 K 2015-07-24 - 12:05 CristinaAnaMantillaSuarez Mll distributions
PNGpng ptll171_gen.png r1 manage 20.1 K 2015-07-24 - 12:03 CristinaAnaMantillaSuarez Ptll distributions
PNGpng ptll171_pur_stab_eff.png r1 manage 12.2 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng ptll171_rec.png r1 manage 17.5 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng ptll171_rec_wgt.png r1 manage 19.0 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng ptll171_unfolded.png r1 manage 23.7 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng ptll_gen_comparisonNorm_2.png r1 manage 24.6 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng ptll_rec_comparisonNorm_2.png r1 manage 28.1 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng ptpos171_gen.png r1 manage 20.0 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng ptpos171_pur_stab_eff.png r1 manage 12.0 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng ptpos171_rec.png r1 manage 17.1 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng ptpos171_rec_wgt.png r1 manage 17.7 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng ptpos171_unfolded.png r1 manage 24.5 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng ptpos_gen_comparisonNorm_2.png r1 manage 22.8 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng ptpos_rec_comparisonNorm_2.png r1 manage 25.8 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng ptposptm173_gen.png r1 manage 21.5 K 2015-07-24 - 12:34 CristinaAnaMantillaSuarez Ptposptm distributions
PNGpng ptposptm173_pur_stab_eff.png r1 manage 12.4 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions
PNGpng ptposptm173_rec.png r1 manage 18.1 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions
PDFpdf ptposptm173_rec_wgt.pdf r1 manage 14.7 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions
PNGpng ptposptm173_unfolded.png r1 manage 25.0 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions
PNGpng ptposptm_gen_comparisonNorm_2.png r1 manage 23.7 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions
PNGpng ptposptm_rec_comparisonNorm_2.png r1 manage 26.3 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions
PNGpng u1_EposEm_2.png r1 manage 17.5 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng u1_EposEm_unf_2.png r1 manage 12.8 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng u1_mll_2.png r1 manage 16.3 K 2015-07-24 - 12:05 CristinaAnaMantillaSuarez Mll distributions
PNGpng u1_mll_unf_2.png r1 manage 13.6 K 2015-07-24 - 12:05 CristinaAnaMantillaSuarez Mll distributions
PNGpng u1_ptll_2.png r1 manage 15.9 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng u1_ptll_unf_2.png r1 manage 13.6 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng u1_ptpos_2.png r1 manage 17.3 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng u1_ptpos_unf_2.png r1 manage 13.8 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng u1_ptposptm_2.png r1 manage 17.0 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions
PNGpng u1_ptposptm_unf_2.png r1 manage 14.3 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions
PNGpng u2_EposEm_2.png r1 manage 19.4 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng u2_EposEm_unf_2.png r1 manage 17.0 K 2015-07-24 - 12:23 CristinaAnaMantillaSuarez EposEm distributions
PNGpng u2_mll_2.png r1 manage 17.0 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Mll distributions
PNGpng u2_mll_unf_2.png r1 manage 15.1 K 2015-07-24 - 12:26 CristinaAnaMantillaSuarez Mll distributions
PNGpng u2_ptll_2.png r1 manage 17.9 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng u2_ptll_unf_2.png r1 manage 15.4 K 2015-07-24 - 12:02 CristinaAnaMantillaSuarez Ptll distributions
PNGpng u2_ptpos_2.png r1 manage 19.5 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng u2_ptpos_unf_2.png r1 manage 15.8 K 2015-07-24 - 11:58 CristinaAnaMantillaSuarez Ptpos distributions
PNGpng u2_ptposptm_2.png r1 manage 16.8 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions
PNGpng u2_ptposptm_unf_2.png r1 manage 14.9 K 2015-07-24 - 12:33 CristinaAnaMantillaSuarez Ptposptm distributions

This topic: Sandbox > TWikiUsers > CristinaAnaMantillaSuarez > CristinaAnaMantillaSuarezSandbox
Topic revision: r25 - 2015-09-11 - CristinaAnaMantillaSuarez
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright & 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback