(Received 2015 October 7 )
From the Meeting / Gunther
Comments
ll.63-69 improve description
done
l.71 S10 scheme etc - to explain or to use standard
CMS sentence
done
l.78 to explain parameters of the jet veto, put in the AN information about performance of the central-jet veto
done
l.87 what are the uncertainties of the cross sections used
done
l.97 ID should be removed and explained in words
done
l.102 add information about amount of the QCD background
done
l.106-116 explain better why regions are used in this way, more clear to explain the veto , multijets etc . Add few lines on correlation between regions.
done
l.123 to be more precise - acceptance - > efficiency need to explain 50% uncertainty on QCD
done
"plus shape" - need to be described in the text, the same is for scale ...
done
put all uncertainties in the table, describe better in the text
done
l.129 +- 100% explain that why
done
l.133 how the shapes vary - how we fit should be better described
done
table 1 should include all uncertainties used for the final fit
done
figures - clarify the bands uncertainty signal strength - to which cross section?
done
l.138 little acceptance issue
done
Table 3 - need to describe how combination is done
done
l.157 hadronization 0.81 explain why it is small, refer to similar number from Zb
done
l.158-159 uncertainty on hadronization corrections - describe
done
l.165 DPS 100% uncertainty - describe why
done
l.167 explain better
done
add conclusions
done
Anne-Marie
Comments
l13-15 add also ref to
CMS 8
TeV Zbb PAS SMP-14-010 ?
done
l19 the former one [8] used only muons.
done
l22 is electron trigger not up to eta=2.4 ??
the trigger is up to 2.5
l26+3 add ref for electron paper (JINST 10 (2015) P06005,
http://arxiv.org/abs/1502.02701
)
done
l39-40 cut in two sentences: ...[17]. Those which...
done
l48 give b-tag efficiency and mistag for c and light rates for this
working point (as given in btag paper ref). Ref[19] should be to
published paper.
done
l51 ref[20] is not used anywhere here, we don't have MG4 samples
anywhere I think... remove.
done
l54-55 refs 24 and 25 are for Z+b ... relevant ????
removed
l56 ref 28 seems quite old, 2002 ! Use refs [14-16] (just 16 or all I
don't know...) from SMP-14-010 instead ??
updated POWHEG references
l58 Add a paragraph introducing the main background processes to your
signal...
done
l59 ref [30] is a duplicate of [21]
done
l63 line too long.
done
l65-66 is not logical: for LO you give PDF version and for NLO Pythia
tune :/ Should give both for both...
done
l71-72 remove sentence "The S10 scheme..." too detailed.
done
l73 re-weighted to the pileup distribution for data -> reweighted to
match the pileup distribution of the data.
done
l75-77 rephrase along the lines of: "The dominant background arise from
the ttbar process. Data and MC are hence first compared in two
ttbar-dominated control regions. MC scale factors are extracted and
applied to the signal region. Finally a likelihood fit (++add details or
ref to method used?++) to the MT distribution is performed in order to
extract the W+bb cross section."
done
l79-80 replace "falling within the pseudorapidity range" by "and"
done
l80 loosely isolated: give isolation value. Jets: give precision on what
pT,eta are considered for jets...
done
l82 requiring a third jet: it is not clear whether it is then exactly 3
jets or at least 3 jets.... => requiring at least three jets.
done
l87 correspondent -> corresponding
done
l96-97 pdgId=5 and 4 refer to
quarks not hadrons... and is MC
jargon... also "the presence...in one of the jets" is highly unclear:
parton jet ? hadron jet ? reco jet ?? Is there a dR matching involved ?
I think the following would be more accurate (but correct me if I am wrong):
"If an event contains a b quark, from matrix element or parton shower,
it falls into the W+bbbar category. If an event contains no b quarks but
an even, non-zero, number of c quarks, again from matrix element or
parton shower, it falls into the W+cc category. The remainder falls into
the W+udscg category."
done
l101 obtained shapes -> shapes obtained ?
since there is a ? I will explain my meaning: I am using 'shapes' as the object of the sentence with 'obtained' as the adjective referring to the procedure described in the preceding sentences. Other options might be 'shapes thus obtained' or 'shapes obtained as such'
l102 give the percentage of such backgrounds for the inverted isolation:
it is important to know whether it's small enough or could be source of
uncertainties... Same l105 for the non-inverted isolation, give
percentage of background, i.e.. if I understand correctly you have 4
numbers in the mT<20
GeV region: isolation data-noniso = QCD-noniso +
BKG-noniso and data-iso = QCD-iso + bkg-iso, and you want to set
QCD-noniso = QCD-iso .... I'd like to have in the text the fractions
BKG-noniso/data-noniso and bkg-iso/data-iso with their stat uncertainties...
done
l106-107 needs rephrasing, it sounds weird to have to adjust
uncertainties .... What about explaining why b-tagging efficiency and
jet energy scale would not be properly corrected for already, or say
explicitly that at the same time as extracting the final cross section
the fit is sensitive to b-tagging efficiency and jet energy scale. It
would be worth mentioning in the text here the answer you made to the
stat comm question: these two components are difficult to separate in
the signal region but contribute differently to the 2 control regions,
hence the strategy you are adopting to help finding stable fit results....
done
l111 move "obtained" at end of line
sentence deleted
Section 5 : needs to be expanded !! Add one paragraph per uncertainty
source, with exactly what is done and how the variation is calculated.
Add a line in the table with all the ones mentioned lines 120-122...
"scale" in table can be very confusing, what scale ?? Replace with "QCD
scales" ??
done
l127 add uncertainties (stat+syst) to these factors. The final ± 0.15 is
also not explained, or is that somehow related to the 14.1% of table 1 ???
done
l130 Add uncertainty on 1.57.
done
l136 Make an explicit note here that if previous procedure lead to
different scale factors for b-tag and JES than should be in signal
region, it would be visible here as they are left floating again ?? But
say that it's not the case ??
done
fig 2 caption -> spell out JES
done
equation bottom page 5: remove bit between 3rd and 4th "=", I think
everyone should be capable of passing Ngen from denominator to numerator
done
139 typo in acceptance
done
l149+1, l151 cross section*s*
done
l153 add Pythia 8 ref (see SMP-14-010)
done
l155 NNLO23_.... that's too jargon, is this an NNPDF pdfset ?? add ref.,
add ref also for
CUETP8M1 tune...
l158-159 "a four-flavor" "a five-flavor" replace "a" by "the" to refer
to the previously defined samples ??? Or are these new ones Also, it
doesn't seem very appropriate to have a variation of matrix element (4
vs 5 flavor) to get an uncertainty on something which concerns rather
the parton shower... was it not also pythia 6 vs pythia 8 by any chance ??
done
l168 add ref for LHAPDF.
done
l169 add table with theoretical predictions and uncertainties
(separating stat, PDF, scales) and refer to it.
Figure 4 contains this information, and it is written in the last paragraphs of Section 6.
l173 Add conclusion section !!!
done
References:
1-9+19+43-44
CMS Collaboration Collaboration
done
43: missing journal,doi,arxiv.
All: check to give only first page (example, ref 1 gives 1-29 -> replace
by just 1, ref 3 gives only 126 -> ok...)
done
7 published, add journal
done
15 Physics Letters -> Phys.Lett. in others....
done
37 twice arxiv
done
Teresa
Comments
I agree with most of the comments sent by Anne Marie, and i do not repeat them here.
line 50-57: on top of AM comments, make clear you refer here to signal process only.
done
line 88-93: i do not understand why this part is not in the MC Samples section 3.
moved
line 112-113: ... and averaged before later use. The average b-tagging scale factor is used to reweigh the MC, from which the jet energy scale is obtained from a fit to the ttbar-multilepton region.
done
... i do not if this wording is better, but from the existing one is difficult to understand the two step procedure followed -
done
Table 1: as AM says or use explicitly "Re./Frag. scales"
done
line 131: what exactly means "adjusted by 1.57 standard deviations"? scale up? down???
done
pag 5, last para w/o line number. It needs reformulation, better wording ..
done
Table 2: separate data from the rest. it would be good to include a column with the % of variation.
table has been largely expanded
line 137: Nrec is the number of "expected" reconstructed events?? or just the number of reconstructed events??
reconstructed, this has been made more clear in the text
Figure 4: is it understood why the theory is systematically lower? Maybe worth commenting in the conclusion section (missing now)
conclusion section added
Tristan
* 1.Introduction
*
In the first paragraph, you mention b quarks, b hadrons, and b jets, but you don't define them. On l.44, you mention "b quark jet identification". Also the hyphenation is not always consistent. This is maybe not crucial for a PAS, but anyway some concrete suggestions to homogenize this:
- Define in the first paragraph "b jets" = "jets originating from the hadronization of b quarks" (i.e. generator level).
- Later, use "b-tagged jets" = "jets identified as such" (i.e. reco level).
I call this the Alexander Schmidt convention, which I think is the most consistent one and leads to the least confusion. Maybe also have a look at the Z+b-jets paper (
http://arxiv.org/abs/1402.1521
) for inspiration. This might seem a bit detailed grammar/spelling comment, but will also help the understandability of the text later on, when these details can be important.
good convention, done
* 2.Selection & reconstruction
*
l.21 "with loosely" -> "with a loosely"
done
l.32: pT is here defined as a vector, which is a bit unusual, and not consistent with previous appearances of this variable
done
* 3.MC Samples
*
General comment: You write consistently "five- and four-flavour samples". This is not wrong, but looks strange. I would turn it around. In l.152 it is actually the other way around.
done
- l.36+59: "kT" variable not consistent
done
- l.59: Period at the end of sentence missing
done
- l.60: "V" is not defined. You can probably do that already in the first sentence of the Introduction.
done
- l.70-73: The whole paragraph is an unusual description of pileup. I would have a look at other
CMS publications, or simply check the
CMS guidelines.
done
* 4.Strategy
*
- l.79: I think you mean "b-jets" -> "b-tagged jets"
done
- l.96+97: as mentioned by Guenther, drop "with pdgId=x"
done
One more suggestion: to be extra clear, I think you can specify that W+bb is defined by having a W and at least one b hadron without any gen-level cuts on eta/pT. (At least, that is how I understand the text.) This might seem too detailed, but sometimes people define V+b as having a b in the jet in the acceptance... so it is better to make it extra clear if there are any criteria on the generator b when defining your signal.
in fact, we do cut around signal acceptance. PAS has been updated to reflect this
- l.101: You write "The obtained shapes are corrected for the presence of all other backgrounds"... I don't understand what this involves. Does this mean you subtract MC backgrounds from data? Or something else? Please clarify in the text.
done
- l.106-116: Guenther asked some questions about the fit here, some clarification about the motivation and correlation would indeed be useful.
done
* 5.Systematic
*
Table 1: the table is very minimalistic. I am especially a bit puzzled by the 14.1% of the b-tagging. Some questions:
we have a new, more inclusive table now
1) Where does this number come from? Only b-tagging, or also other contributions?
14.1 actually should have been even higher, 14.8 which is the 100% uncertainty on rescale factor. The uncertainty on the original b Tagging scale factor varies by sample, for example 6-7% for Wbb, T, TT 13-15% for Wcc 40% for W+light see Table 8 in AN
2) Since this is the largest contribution... isn't this actually a bit big? Given that the b-tag uncertainties are usually 3.5%, I would expect an uncertainty of 2*3.5%=7%. Or is it because you are using the Tight WP? (I think you already explained me this some time ago...)
I think the confusion is between the b-tag SF with uncertainties as perscribed by the BTV and the b-tag rescale factor that we fit in Step 1, ttbar multijet region.
We find that the b-tag SF should be rescaled by 14.8% and put a 100%
uncertainty on this re-scale factor to cover the imprecision of this
method - all of the change in normalization from the ttbar-multijet
fit is absorbed into this re-scale factor which is a rough estimation.
3) Why do you actually fit the b-tag SFs yourself, instead of taking from
BTV? Would that make the uncertainties possibly smaller? (I have the vague impression you also have explained me this already before, sorry if I forgot.)
We apply the SFs from the BTV and then see that they do not accurately
describe our data in many related ttbar phase spaces. This was the
conclusion of the presentation made to the BTV, that further scaling
was appropriate.
4) Is there any correlation with the ttbar normalization? In other words: are you possibly double counting the uncertainties, thereby overestimating your uncertainty on the cross section? That would be a pity.
The difference between the b tag SF and ttbar normalization comes in
how they are correlated between samples. The b tag SF is correlated
across all MC samples while the ttbar xc uncertainty (this is what I
assume you mean by “ttbar normalization”) is only tied to the ttbar
sample. With the shape of the b tag SF shifted templates being so similar
to the shape of the unshifted template, this is hard to disentangle
in fitting a region dominated by ttbar and one of the reasons for
assigning such a large uncertainty to the rescaling factor.
==> I vaguely remember having asked many of these questions before (sorry for my poor memory)... but maybe some clarification in the text will help. And I just wish the impact on your measurement from uncertainties from b-tagging could be smaller somehow...
I think the confusion is between the b-tag SF and b-tag reSF
- l.123: "affects acceptance" -> "affects the acceptance"
done
- Table 1: It was not clear to me what "+ shape" means.
done
- Table 1 caption: "in fitting" -> "in the fitting"
done
* 6.Results
*
l.127: You write "b-tagging efficiency scale factors", and I know what you mean.... but the phrasing is confusing, because normally this concerns the scale factor of a single b-tagged jet (whereas in this case it concerns the scale factor of a sample with two b-tagged jets per event). So I would phrase it a bit less ambiguous.
done
l.128: I unfortunately still don't understand where the uncertainty in "1.15±0.15" comes from.
done
Caption Figs1+2+3: "muon on the left and electron on the right" -> "the muon sample on the left and the electron sample on the right"
done
l.133: "normalization" -> "normalizations"
done
l.139 "accpetance"
done
l.140: "channel, " -> "channel, and "
done
l.142: "exactly b-jets" -> "exactly two b-tagged jets" ?
done
l.145: "a B-hadrons"
done
l.149+151: "section are" -> "sections are" (twice)
done
l.153: comma usage not consistent with previous occurrences
done
l.157: For the hadronization factor you could maybe refer the reader to the first
CMS Z+b-jets measurement (see my e-mail). Although this only concerns a single b in the final state, so not sure it helps... (Anne-Marie will have better suggestions on this topic)
added
l.163: "of W produced" -> "of a W produced"
done
l.167: "Uncertainty" -> "The uncertainty"? "section" -> "sections"?
done
l.168: For reference to be used for the PDF uncertainties, see my forwarded email from the Higgs mailing list.
thanks
l.168: As discussed during the meeting, I was told that LO ME should be used with LO PDFs. (But I don't know if this was the opinion of the theorists I was talking to, or a fact.) Anyway, I think it is sufficient to use systematics for the corresponding PDF set alone, without additionally invoking higher/lower-order PDFs.
done
l.171: space after Fig
done
l.172: "of the order of" -> "of approximately"?
done
General physics comment: the predictions are almost all the same, and the measurement does not distinguish between them.... This is a pity, but I would try to make some philosophical comments here, about the consequences of your measurement, in order to give more weight to your study. As Theresa suggested, maybe comment on the comparison with Atlas, or with the 7TeV measurement... but at least try to make the reader aware of the relevance of the measurement, and what we should learn from this. This is certainly worth some effort when you go for a paper.
Currently, it looks like "we are 1 sigma away from the predictions, and that's what it is", but I would really try to squeeze out more on the physics interpretation.
(DPS was a hot topic in this final state, given the observation in W+1b. But I am not sure that quantitatively commenting on DPS is possible, given the still limited sensitivity in this final state.)
And as said before, it would also be nice to have some distributions. E.g.
DeltaR (b,b). You already said you would doing it for the paper, just making sure
(And it would be nice to have an interpretation of H±->W±A. But that's more something for Maria's
HigExo group.)
* References
*
1+2+3+4+5+6+7+8+9+19+43+44: "Collaboration Collaboration"
done