| Measurement of Single Top Production Cross Section in MET plus Jets Sample with the Full CDF Run II Data Set. |
Giorgio Bellettini, Daniela Bortoletto, Matteo Cremonesi, Qiuguang Liu, Fabrizio Margaroli, Karolos Potamianos, Marco Trovato [Contact]
Top quarks are produced mostly in pairs at the Tevatron through the strong force. However, the electroweak force also allows the production of a single top quark with a cross section that is roughly half the top-antitop one. In addition to the lower rate, the less distintinctive signature makes this process much harder to be separated from background. In the past, Tevatron experiments have always been looking for single top production in events where one high energy electron or muon is identified, as expected in the top leptonic decay channel t->Wb->l&nu b, in order to improve the signal signature over background. We present here a measurement of single top production cross section selecting events consistent with W+jets topology but where no electron or muon has been identifieded, and where the tau lepton in the t->Wb->&tau&nu b channel is reconstructed as a jet in the calorimeters. Multivariate analysis techniques are necessary to suppress the large background and to discriminate the single top signal. We use the likelihood profile to estimate an expected production cross section of σs+texp = 3.20 +1.39 -1.43 pb. In the CDF full data set we measure σs+tobs = 3.04 +1.46 -1.39 pb.
We have analyzed electroweak single top production using the full CDF Run II data set collected up to the end of the Tevatron run in September 2011, corresponding to a total integrated luminosity of 9.1 fb -1. The cross section being so small and the processes that mimic the signal so large, CDF and D0 have not been able to measure the cross section of this interesting process with full-prove significance. In order to increase the scanty statistics, we look here at events rejected by previous analyses, i.e. events where there are no identified leptons, or where &tau s are reconstructed as a jet. We thus rely solely on the signature of high PT jets and large missing transverse energy. Being statistically independent of the lepton+jets sample, this sample provides, albeit with low precision, an independent measurement of the single top production cross section. Eventually, the result of this measurement can be combined with the lepton+jets one and help reaching a solid single top cross section measurement at the Tevatron.
Many Standard Model processes can produce a final state with large missing tranverse energy and jets, such as single top (our signal), top pair production, W/Z + jets, diboson production. In addition, QCD multijet production can mimic this signature due to severely mismeasured jets which appear to have large MET. Since the QCD heavy flavour production cross-section is orders of magnitude higher than that of the signal, it constitutes the biggest background in this search. Additionally light flavour jets can be falsely identified as b-jets (commonly referred to as "mistags").
Having no identified leptons in the final state, backgrounds are many orders of magnitude larger than the signal even after requiring large missing transverse energy and b-tagged jets in the final state. It is thus necessary to develop an event selection which reduces backgrounds to a more manageable size before trying to build a discriminant to measure the single top cross section. The QCD multijet production is the only background where the MET is mainly instrumental. In a first step we study the kinematics of these events and implement a multivariate technique to cut them out as much as possible.
We develop in a second step a discriminant to act against tt background. Top pair production gives a large contribution to total background, especially when both the leading jets in the final states are required to be b-tagged. In addition, the produc- tion of real top quarks makes this background more signal like if we take into account variables connected to the top mass, that are instead particularly useful in separating single top from all other non-top backgrounds.
In the final step, we use again a machine learning technique to discriminate the signal from the surviving backgrounds, and scan its output distribution to measure the single top production cross section in the MET+jets final state.
lepton veto = use loose identification cuts to reject events with isolated leptons
MET > 50 GeV
Number of jets = 2 or 3 and one of the leading jets (j1 or j2) central (|&eta| < 0.9).
Events with a larger number of jets are rejected
&Delta R(j1 , j2 ) > 1
ET (j1 ) > 35 GeV, ET (j2 ) > 25 GeV
Events are b-tagged either 1S, SS, or SJ
To model the single top dynamics and predict both shape and rate, we use a sample generated with Powheg. The official sample is actually composed of three subsamples, one s-channel sample, a LO and a NLO t-channel sample. This process is characterized by an accurately predicted cross section, used to derive its normalization.
We use the LO Pythia cross-sections scaled by a k-factor corresponding to the ratio between the NLO and LO cross-section prediction in MCFM3 to predict both shape and rate of diboson production. The LO MCFM predictions are also consistent with those from Pythia and are reported to be in good agreement with the data. To estimate the contribution of mistagged light flavor diboson events in the tag sample we apply the corresponding mistag matrix to diboson light flavor MC samples, vetoing events with a real b- or c-quark from HEPG. This process is characterized by an accurately predicted cross section, used to derive its normalization.
Top pair-production yields a significant contribution to the background in the pre- selection region, especially in the double-tag sample. Semi-leptonic top decays are energetic, bear large MET and high jet multiplicity. We use Monte Carlo samples generated for the Top Group to model the shape of this background. The top-antitop events were generated with Pythia with Mtop = 172.5 GeV and normalized with the cross section measured by CDF: 7.71+-0.51 pb.
V+jets background was generated with Alpgen+Pythia. As for diboson production, the contribution of mistagged light flavor V+jets events was determined by applying the mistag rate matrix to W/Z+lf samples vetoing events with a real b- or c-quark from HEPG. Since the imprecise theoretical prediction for the cross section of this process, we derive its normalization from data.
Due to the largecross-sections, it is practically impossible to generate enough statistics to simulate all QCD processes. To deal with this problem a method for estimating QCD background from data was developed. This technique allows us to estimate not only heavy flavor QCD production, but also processes with a light flavor jet falsely tagged as a b-quark. Additionaly it allows us to model Single SecVTX tagged data sample, which adds additional sensitivity to the analysis. We derive QCD multijet normalization from data.
We fitted QCDNN distribution in order to estimate the normalization of both QCD and V+jets backgrounds. We choose this variable to perform the fit because the QCD background is well separated from the other electroweak processes and its normalization can be derived with a good precision. The fit is performed by using a binned likelihood fit (TFractionFitter function in ROOT).
We train an artificial neural network (NN), a multilayer perceptron (MLP) fed with 13 kinematic variables, to separate the signal from the main background: QCD multijet production. We cut on this output at QCDNN > 0.35, to form the Pre-selection 2 region.
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
We train an artificial neural network (NN), a multilayer perceptron (MLP) fed with 12 kinematic variables, to separate the signal from top pair production background. We cut on this output at TTNN > 0.3, to form the Signal region.
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
11 variables are fed to a MLP trained with events with at least one tight tag (SecVTX)to obtain the final NN discriminant. We then use a likelihood profile of this discriminant to measure the production cross section of single top events.
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| 1S | SJ | SS | |
| t-channel | 170.0 ± 8.9 | 8.1 ± 0.7 | 8.8 ± 0.7 |
| s-channel | 94.1 ± 4.9 | 31.4 ± 2.6 | 38.4 ± 2.8 |
| QCD multijet | 2642.1 ± 191.0 | 284.0 ± 21.7 | 85.9 ± 8.1 |
| W+jets | 2425.1 ± 212.3 | 77.0 ± 8.5 | 55.7 ± 5.8 |
| Z+jets | 1009.8 ± 88.1 | 37.8 ± 4.1 | 32.6 ± 3.4 |
| Diboson | 228.5 ± 27.4 | 23.6 ± 3.2 | 23.4 ± 3.0 |
| \ttbar | 453.3 ± 40.3 | 94.5 ± 10.3 | 108.2 ± 11.1 |
| Expected | 7023.0 ± 360.0 | 556.4 ± 27.6 | 353.1 ± 17.5 |
| Data | 7186 | 569 | 351 |
Systematic uncertainties are split in normalization uncertainty and shape uncertainty. The normalization uncertainty relfects changes to the event yield due to the systematic effect while the shape uncertainty reflect changes to the template histograms. Both of these effects can be included, depending on the source the systematic uncertainty.
The table below summarizes the systematic uncertainties and their effects.
| Systematic | Region | Signal | diboson | ttbar | V+jets | QCD |
| Luminosity | 1S+SJ+SS | ±6%6% | ±6%6% | no | no | no |
| 1S+SJ+SS | ±2%2% | ±2%2% | no | no | no | |
| Lepton veto | 1S+SJ+SS | ±2%2% | ±2%2% | no | no | no |
| B-tagging SecVTX | 1S | ±5.2%5.2% | ±5.2%5.2% | ±5.2%5.2% | no | no |
| SJ | ±3%3% | ±3%3% | ±3%3% | no | no | |
| SS | ±10.4%10.4% | ±10.4%10.4% | ±10.4%10.4% | no | no | |
| B-tagging JetProb | 1S | ±3%3% | ±3%3% | ±3%3% | no | no |
| SJ | ±3.3%3.3% | ±3.3%3.3% | ±3.3%3.3% | no | no | |
| SS | ±0%0% | ±0%0% | ±0%0% | no | no | |
| Cross Section | 1S+SJ+SS | no | ±6%6% | ±6.6%6.6% | no | ±20%20% |
| JES shape/rate | 1S | yes/±4%4% | yes/±6%6% | yes/±2%2% | yes/no | no/no |
| SJ | yes/±4%4% | yes/±5%5% | yes/±1%1% | yes/no | no/no | |
| SS | yes/±3%3% | yes/±6%6% | yes/±1%1% | yes/no | no/no | |
| Q2 scale | 1S+SJ+SS | no | no | no | yes (only W+jets) | no |
| TRF | 1S+SJ+SS | no | no | no | no | yes |
| ISR/FSR | 1S+SJ+SS | ±2%2% | no | no | no | no |
| Top mass dependence | 1S+SJ+SS | yes | no | yes | no | no |
| Trigger efficiency | 1S+SJ+SS | ±2%2% | ±2%2% | ±2%2% | no | no |
We apply our analysis to 9.1 fb -1 of CDF Run II data and measure the single top production cross-section.
The result of the binned maximum likelihood fit is shown below. All sources of systematic uncertainties (normalization and shape) are included.
![]() |