Search | arXiv e-print repository

Digital Twin Generators for Disease Modeling

Authors: Nameyeh Alam, Jake Basilico, Daniele Bertolini, Satish Casie Chetty, Heather D'Angelo, Ryan Douglas, Charles K. Fisher, Franklin Fuller, Melissa Gomes, Rishabh Gupta, Alex Lang, Anton Loukianov, Rachel Mak-McCully, Cary Murray, Hanalei Pham, Susanna Qiao, Elena Ryapolova-Webb, Aaron Smith, Dimitri Theoharatos, Anil Tolwani, Eric W. Tramel, Anna Vidovszky, Judy Viduya, Jonathan R. Walsh

Abstract: A patient's digital twin is a computational model that describes the evolution of their health over time. Digital twins have the potential to revolutionize medicine by enabling individual-level computer simulations of human health, which can be used to conduct more efficient clinical trials or to recommend personalized treatment options. Due to the overwhelming complexity of human biology, machine… ▽ More A patient's digital twin is a computational model that describes the evolution of their health over time. Digital twins have the potential to revolutionize medicine by enabling individual-level computer simulations of human health, which can be used to conduct more efficient clinical trials or to recommend personalized treatment options. Due to the overwhelming complexity of human biology, machine learning approaches that leverage large datasets of historical patients' longitudinal health records to generate patients' digital twins are more tractable than potential mechanistic models. In this manuscript, we describe a neural network architecture that can learn conditional generative models of clinical trajectories, which we call Digital Twin Generators (DTGs), that can create digital twins of individual patients. We show that the same neural network architecture can be trained to generate accurate digital twins for patients across 13 different indications simply by changing the training set and tuning hyperparameters. By introducing a general purpose architecture, we aim to unlock the ability to scale machine learning approaches to larger datasets and across more indications so that a digital twin could be created for any patient in the world. △ Less

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2404.17576 [pdf, ps, other]

Enhancing Longitudinal Clinical Trial Efficiency with Digital Twins and Prognostic Covariate-Adjusted Mixed Models for Repeated Measures (PROCOVA-MMRM)

Authors: Jessica L. Ross, Arman Sabbaghi, Run Zhuang, Daniele Bertolini, the Alzheimer's Disease Cooperative Study, Alzheimer's Disease Neuroimaging Initiative, the Critical Path for Alzheimer's Disease Database, the European Prevention of Alzheimer's Disease, Consortium, the Pooled Resource Open-Access ALS Clinical Trials Consortium

Abstract: Clinical trials are critical in advancing medical treatments but often suffer from immense time and financial burden. Advances in statistical methodologies and artificial intelligence (AI) present opportunities to address these inefficiencies. Here we introduce Prognostic Covariate-Adjusted Mixed Models for Repeated Measures (PROCOVA-MMRM) as an advantageous combination of prognostic covariate adj… ▽ More Clinical trials are critical in advancing medical treatments but often suffer from immense time and financial burden. Advances in statistical methodologies and artificial intelligence (AI) present opportunities to address these inefficiencies. Here we introduce Prognostic Covariate-Adjusted Mixed Models for Repeated Measures (PROCOVA-MMRM) as an advantageous combination of prognostic covariate adjustment (PROCOVA) and Mixed Models for Repeated Measures (MMRM). PROCOVA-MMRM utilizes time-matched prognostic scores generated from AI models to enhance the precision of treatment effect estimators for longitudinal continuous outcomes, enabling reductions in sample size and enrollment times. We first provide a description of the background and implementation of PROCOVA-MMRM, followed by two case study reanalyses where we compare the performance of PROCOVA-MMRM versus the unadjusted MMRM. These reanalyses demonstrate significant improvements in statistical power and precision in clinical indications with unmet medical need, specifically Alzheimer's Disease (AD) and Amyotrophic Lateral Sclerosis (ALS). We also explore the potential for sample size reduction with the prospective implementation of PROCOVA-MMRM, finding that the same or better results could have been achieved with fewer participants in these historical trials if the enhanced precision provided by PROCOVA-MMRM had been prospectively leveraged. We also confirm the robustness of the statistical properties of PROCOVA-MMRM in a variety of realistic simulation scenarios. Altogether, PROCOVA-MMRM represents a rigorous method of incorporating advances in the prediction of time-matched prognostic scores generated by AI into longitudinal analysis, potentially reducing both the cost and time required to bring new treatments to patients while adhering to regulatory standards. △ Less

Submitted 26 April, 2024; originally announced April 2024.

Comments: 29 pages, 9 tables

MSC Class: 62J05

arXiv:2012.13455 [pdf, other]

Modeling Disease Progression in Mild Cognitive Impairment and Alzheimer's Disease with Digital Twins

Authors: Daniele Bertolini, Anton D. Loukianov, Aaron M. Smith, David Li-Bland, Yannick Pouliot, Jonathan R. Walsh, Charles K. Fisher

Abstract: Alzheimer's Disease (AD) is a neurodegenerative disease that affects subjects in a broad range of severity and is assessed in clinical trials with multiple cognitive and functional instruments. As clinical trials in AD increasingly focus on earlier stages of the disease, especially Mild Cognitive Impairment (MCI), the ability to model subject outcomes across the disease spectrum is extremely impor… ▽ More Alzheimer's Disease (AD) is a neurodegenerative disease that affects subjects in a broad range of severity and is assessed in clinical trials with multiple cognitive and functional instruments. As clinical trials in AD increasingly focus on earlier stages of the disease, especially Mild Cognitive Impairment (MCI), the ability to model subject outcomes across the disease spectrum is extremely important. We use unsupervised machine learning models called Conditional Restricted Boltzmann Machines (CRBMs) to create Digital Twins of AD subjects. Digital Twins are simulated clinical records that share baseline data with actual subjects and comprehensively model their outcomes under standard-of-care. The CRBMs are trained on a large set of records from subjects in observational studies and the placebo arms of clinical trials across the AD spectrum. These data exhibit a challenging, but common, patchwork of measured and missing observations across subjects in the dataset, and we present a novel model architecture designed to learn effectively from it. We evaluate performance against a held-out test dataset and show how Digital Twins simultaneously capture the progression of a number of key endpoints in clinical trials across a broad spectrum of disease severity, including MCI and mild-to-moderate AD. △ Less

Submitted 24 December, 2020; originally announced December 2020.

arXiv:2009.09780 [pdf, other]

Impact of lung segmentation on the diagnosis and explanation of COVID-19 in chest X-ray images

Authors: Lucas O. Teixeira, Rodolfo M. Pereira, Diego Bertolini, Luiz S. Oliveira, Loris Nanni, George D. C. Cavalcanti, Yandre M. G. Costa

Abstract: COVID-19 frequently provokes pneumonia, which can be diagnosed using imaging exams. Chest X-ray (CXR) is often useful because it is cheap, fast, widespread, and uses less radiation. Here, we demonstrate the impact of lung segmentation in COVID-19 identification using CXR images and evaluate which contents of the image influenced the most. Semantic segmentation was performed using a U-Net CNN archi… ▽ More COVID-19 frequently provokes pneumonia, which can be diagnosed using imaging exams. Chest X-ray (CXR) is often useful because it is cheap, fast, widespread, and uses less radiation. Here, we demonstrate the impact of lung segmentation in COVID-19 identification using CXR images and evaluate which contents of the image influenced the most. Semantic segmentation was performed using a U-Net CNN architecture, and the classification using three CNN architectures (VGG, ResNet, and Inception). Explainable Artificial Intelligence techniques were employed to estimate the impact of segmentation. A three-classes database was composed: lung opacity (pneumonia), COVID-19, and normal. We assessed the impact of creating a CXR image database from different sources, and the COVID-19 generalization from one source to another. The segmentation achieved a Jaccard distance of 0.034 and a Dice coefficient of 0.982. The classification using segmented images achieved an F1-Score of 0.88 for the multi-class setup, and 0.83 for COVID-19 identification. In the cross-dataset scenario, we obtained an F1-Score of 0.74 and an area under the ROC curve of 0.9 for COVID-19 identification using segmented images. Experiments support the conclusion that even after segmentation, there is a strong bias introduced by underlying factors from different sources. △ Less

Submitted 13 September, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

Comments: Submitted to Sensors

arXiv:2006.00654 [pdf, other]

A multimodal approach for multi-label movie genre classification

Authors: Rafael B. Mangolin, Rodolfo M. Pereira, Alceu S. Britto Jr., Carlos N. Silla Jr., Valéria D. Feltrim, Diego Bertolini, Yandre M. G. Costa

Abstract: Movie genre classification is a challenging task that has increasingly attracted the attention of researchers. In this paper, we addressed the multi-label classification of the movie genres in a multimodal way. For this purpose, we created a dataset composed of trailer video clips, subtitles, synopses, and movie posters taken from 152,622 movie titles from The Movie Database. The dataset was caref… ▽ More Movie genre classification is a challenging task that has increasingly attracted the attention of researchers. In this paper, we addressed the multi-label classification of the movie genres in a multimodal way. For this purpose, we created a dataset composed of trailer video clips, subtitles, synopses, and movie posters taken from 152,622 movie titles from The Movie Database. The dataset was carefully curated and organized, and it was also made available as a contribution of this work. Each movie of the dataset was labeled according to a set of eighteen genre labels. We extracted features from these data using different kinds of descriptors, namely Mel Frequency Cepstral Coefficients, Statistical Spectrum Descriptor , Local Binary Pattern with spectrograms, Long-Short Term Memory, and Convolutional Neural Networks. The descriptors were evaluated using different classifiers, such as BinaryRelevance and ML-kNN. We have also investigated the performance of the combination of different classifiers/features using a late fusion strategy, which obtained encouraging results. Based on the F-Score metric, our best result, 0.628, was obtained by the fusion of a classifier created using LSTM on the synopses, and a classifier created using CNN on movie trailer frames. When considering the AUC-PR metric, the best result, 0.673, was also achieved by combining those representations, but in addition, a classifier based on LSTM created from the subtitles was used. These results corroborate the existence of complementarity among classifiers based on different sources of information in this field of application. As far as we know, this is the most comprehensive study developed in terms of the diversity of multimedia sources of information to perform movie genre classification. △ Less

Submitted 31 May, 2020; originally announced June 2020.

Comments: 21 pages and 4 figures

arXiv:2005.08424 [pdf]

Single-sample writers -- "Document Filter" and their impacts on writer identification

Authors: Fabio Pinhelli, Alceu S. Britto Jr, Luiz S. Oliveira, Yandre M. G. Costa, Diego Bertolini

Abstract: The writing can be used as an important biometric modality which allows to unequivocally identify an individual. It happens because the writing of two different persons present differences that can be explored both in terms of graphometric properties or even by addressing the manuscript as a digital image, taking into account the use of image processing techniques that can properly capture differe… ▽ More The writing can be used as an important biometric modality which allows to unequivocally identify an individual. It happens because the writing of two different persons present differences that can be explored both in terms of graphometric properties or even by addressing the manuscript as a digital image, taking into account the use of image processing techniques that can properly capture different visual attributes of the image (e.g. texture). In this work, perform a detailed study in which we dissect whether or not the use of a database with only a single sample taken from some writers may skew the results obtained in the experimental protocol. In this sense, we propose here what we call "document filter". The "document filter" protocol is supposed to be used as a preprocessing technique, such a way that all the data taken from fragments of the same document must be placed either into the training or into the test set. The rationale behind it, is that the classifier must capture the features from the writer itself, and not features regarding other particularities which could affect the writing in a specific document (i.e. emotional state of the writer, pen used, paper type, and etc.). By analyzing the literature, one can find several works dealing the writer identification problem. However, the performance of the writer identification systems must be evaluated also taking into account the occurrence of writer volunteers who contributed with a single sample during the creation of the manuscript databases. To address the open issue investigated here, a comprehensive set of experiments was performed on the IAM, BFL and CVL databases. They have shown that, in the most extreme case, the recognition rate obtained using the "document filter" protocol drops from 81.80% to 50.37%. △ Less

Submitted 17 May, 2020; originally announced May 2020.

arXiv:2004.05835 [pdf, other]

doi 10.1016/j.cmpb.2020.105532

COVID-19 identification in chest X-ray images on flat and hierarchical classification scenarios

Authors: Rodolfo M. Pereira, Diego Bertolini, Lucas O. Teixeira, Carlos N. Silla Jr., Yandre M. G. Costa

Abstract: The COVID-19 can cause severe pneumonia and is estimated to have a high impact on the healthcare system. The standard image diagnosis tests for pneumonia are chest X-ray (CXR) and computed tomography (CT) scan. CXR are useful in because it is cheaper, faster and more widespread than CT. This study aims to identify pneumonia caused by COVID-19 from other types and also healthy lungs using only CXR… ▽ More The COVID-19 can cause severe pneumonia and is estimated to have a high impact on the healthcare system. The standard image diagnosis tests for pneumonia are chest X-ray (CXR) and computed tomography (CT) scan. CXR are useful in because it is cheaper, faster and more widespread than CT. This study aims to identify pneumonia caused by COVID-19 from other types and also healthy lungs using only CXR images. In order to achieve the objectives, we have proposed a classification schema considering the multi-class and hierarchical perspectives, since pneumonia can be structured as a hierarchy. Given the natural data imbalance in this domain, we also proposed the use of resampling algorithms in order to re-balance the classes distribution. Our classification schema extract features using some well-known texture descriptors and also using a pre-trained CNN model. We also explored early and late fusion techniques in order to leverage the strength of multiple texture descriptors and base classifiers at once. To evaluate the approach, we composed a database, named RYDLS-20, containing CXR images of pneumonia caused by different pathogens as well as CXR images of healthy lungs. The classes distribution follows a real-world scenario in which some pathogens are more common than others. The proposed approach achieved a macro-avg F1-Score of 0.65 using a multi-class approach and a F1-Score of 0.89 for the COVID-19 identification in the hierarchical classification scenario. As far as we know, we achieved the best nominal rate obtained for COVID-19 identification in an unbalanced environment with more than three classes. We must also highlight the novel proposed hierarchical classification approach for this task, which considers the types of pneumonia caused by the different pathogens and lead us to the best COVID-19 recognition rate obtained here. △ Less

Submitted 6 May, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

Comments: Accepted for publication in the Computer Methods and Programs in Biomedicine Journal

arXiv:1905.11460 [pdf, other]

Incidence Networks for Geometric Deep Learning

Authors: Marjan Albooyeh, Daniele Bertolini, Siamak Ravanbakhsh

Abstract: Sparse incidence tensors can represent a variety of structured data. For example, we may represent attributed graphs using their node-node, node-edge, or edge-edge incidence matrices. In higher dimensions, incidence tensors can represent simplicial complexes and polytopes. In this paper, we formalize incidence tensors, analyze their structure, and present the family of equivariant networks that op… ▽ More Sparse incidence tensors can represent a variety of structured data. For example, we may represent attributed graphs using their node-node, node-edge, or edge-edge incidence matrices. In higher dimensions, incidence tensors can represent simplicial complexes and polytopes. In this paper, we formalize incidence tensors, analyze their structure, and present the family of equivariant networks that operate on them. We show that any incidence tensor decomposes into invariant subsets. This decomposition, in turn, leads to a decomposition of the corresponding equivariant linear maps, for which we prove an efficient pooling-and-broadcasting implementation. △ Less

Submitted 11 August, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

Comments: Last revised August 10, 2020

arXiv:1704.08262 [pdf, other]

doi 10.1007/JHEP07(2017)099

Soft Functions for Generic Jet Algorithms and Observables at Hadron Colliders

Authors: Daniele Bertolini, Daniel Kolodrubetz, Duff Neill, Piotr Pietrulewicz, Iain W. Stewart, Frank J. Tackmann, Wouter J. Waalewijn

Abstract: We introduce a method to compute one-loop soft functions for exclusive $N$-jet processes at hadron colliders, allowing for different definitions of the algorithm that determines the jet regions and of the measurements in those regions. In particular, we generalize the $N$-jettiness hemisphere decomposition of [Jouttenus 2011] in a manner that separates the dependence on the jet boundary from the o… ▽ More We introduce a method to compute one-loop soft functions for exclusive $N$-jet processes at hadron colliders, allowing for different definitions of the algorithm that determines the jet regions and of the measurements in those regions. In particular, we generalize the $N$-jettiness hemisphere decomposition of [Jouttenus 2011] in a manner that separates the dependence on the jet boundary from the observables measured inside the jet and beam regions. Results are given for several factorizable jet definitions, including anti-$k_T$, XCone, and other geometric partitionings. We calculate explicitly the soft functions for angularity measurements, including jet mass and jet broadening, in $pp \to L + 1$ jet and explore the differences for various jet vetoes and algorithms. This includes a consistent treatment of rapidity divergences when applicable. We also compute analytic results for these soft functions in an expansion for a small jet radius $R$. We find that the small-$R$ results, including corrections up to $\mathcal{O}(R^2)$, accurately capture the full behavior over a large range of $R$. △ Less

Submitted 7 October, 2017; v1 submitted 26 April, 2017; originally announced April 2017.

Comments: 33 pages + appendices, 17 figures, v2: journal version, v3: fixed typo in eq.(4.37)

arXiv:1701.07919 [pdf, other]

doi 10.1103/PhysRevD.95.054024

Integrated and Differential Accuracy in Resummed Cross Sections

Authors: Daniele Bertolini, Mikhail P. Solon, Jonathan R. Walsh

Abstract: Standard QCD resummation techniques provide precise predictions for the spectrum and the cumulant of a given observable. The integrated spectrum and the cumulant differ by higher-order terms which, however, can be numerically significant. In this paper we propose a method, which we call the $σ\text{-improved}$ scheme, to resolve this issue. It consists of two steps: (i) include higher-order terms… ▽ More Standard QCD resummation techniques provide precise predictions for the spectrum and the cumulant of a given observable. The integrated spectrum and the cumulant differ by higher-order terms which, however, can be numerically significant. In this paper we propose a method, which we call the $σ\text{-improved}$ scheme, to resolve this issue. It consists of two steps: (i) include higher-order terms in the spectrum to improve the agreement with the cumulant central value, and (ii) employ profile scales that encode correlations between different points to give robust uncertainty estimates for the integrated spectrum. We provide a generic algorithm for determining such profile scales, and show the application to the thrust distribution in $e^+e^-$ collisions at NLL$'$+NLO and NNLL$'$+NNLO. △ Less

Submitted 26 January, 2017; originally announced January 2017.

Comments: 8 pages, 8 figures

Journal ref: Phys. Rev. D 95, 054024 (2017)

arXiv:1608.01310 [pdf, other]

doi 10.1088/1475-7516/2016/11/030

Principal Shapes and Squeezed Limits in the Effective Field Theory of Large Scale Structure

Authors: Daniele Bertolini, Mikhail P. Solon

Abstract: We apply an orthogonalization procedure on the effective field theory of large scale structure (EFT of LSS) shapes, relevant for the angle-averaged bispectrum and non-Gaussian covariance of the matter power spectrum at one loop. Assuming natural-sized EFT parameters, this identifies a linear combination of EFT shapes - referred to as the principal shape - that gives the dominant contribution for t… ▽ More We apply an orthogonalization procedure on the effective field theory of large scale structure (EFT of LSS) shapes, relevant for the angle-averaged bispectrum and non-Gaussian covariance of the matter power spectrum at one loop. Assuming natural-sized EFT parameters, this identifies a linear combination of EFT shapes - referred to as the principal shape - that gives the dominant contribution for the whole kinematic plane, with subdominant combinations suppressed by a few orders of magnitude. For the covariance, our orthogonal transformation is in excellent agreement with a principal component analysis applied to available data. Additionally we find that, for both observables, the coefficients of the principal shapes are well approximated by the EFT coefficients appearing in the squeezed limit, and are thus measurable from power spectrum response functions. Employing data from N-body simulations for the growth-only response, we measure the single EFT coefficient describing the angle-averaged bispectrum with $\mathcal{O}(10\%)$ precision. These methods of shape orthogonalization and measurement of coefficients from response functions are valuable tools for develo** the EFT of LSS framework, and can be applied to more general observables. △ Less

Submitted 3 August, 2016; originally announced August 2016.

Comments: 18+10 pages, 5 figures

arXiv:1604.01770 [pdf, other]

doi 10.1088/1475-7516/2016/06/052

The Trispectrum in the Effective Field Theory of Large Scale Structure

Authors: Daniele Bertolini, Katelin Schutz, Mikhail P. Solon, Kathryn M. Zurek

Abstract: We compute the connected four point correlation function (the trispectrum in Fourier space) of cosmological density perturbations at one-loop order in Standard Perturbation Theory (SPT) and the Effective Field Theory of Large Scale Structure (EFT of LSS). This paper is a companion to our earlier work on the non-Gaussian covariance of the matter power spectrum, which corresponds to a particular wav… ▽ More We compute the connected four point correlation function (the trispectrum in Fourier space) of cosmological density perturbations at one-loop order in Standard Perturbation Theory (SPT) and the Effective Field Theory of Large Scale Structure (EFT of LSS). This paper is a companion to our earlier work on the non-Gaussian covariance of the matter power spectrum, which corresponds to a particular wavenumber configuration of the trispectrum. In the present calculation, we highlight and clarify some of the subtle aspects of the EFT framework that arise at third order in perturbation theory for general wavenumber configurations of the trispectrum. We consistently incorporate vorticity and non-locality in time into the EFT counterterms and lay out a complete basis of building blocks for the stress tensor. We show predictions for the one-loop SPT trispectrum and the EFT contributions, focusing on configurations which have particular relevance for using LSS to constrain primordial non-Gaussianity. △ Less

Submitted 6 April, 2016; originally announced April 2016.

Comments: 25+3 pages, 7 figures

arXiv:1512.07630 [pdf, other]

doi 10.1103/PhysRevD.93.123505

Non-Gaussian Covariance of the Matter Power Spectrum in the Effective Field Theory of Large Scale Structure

Authors: Daniele Bertolini, Katelin Schutz, Mikhail P. Solon, Jonathan R. Walsh, Kathryn M. Zurek

Abstract: We compute the non-Gaussian contribution to the covariance of the matter power spectrum at one-loop order in Standard Perturbation Theory (SPT), and using the framework of the effective field theory (EFT) of large scale structure (LSS). The complete one-loop contributions are evaluated for the first time, including the leading EFT corrections that involve seven independent operators, of which four… ▽ More We compute the non-Gaussian contribution to the covariance of the matter power spectrum at one-loop order in Standard Perturbation Theory (SPT), and using the framework of the effective field theory (EFT) of large scale structure (LSS). The complete one-loop contributions are evaluated for the first time, including the leading EFT corrections that involve seven independent operators, of which four appear in the power spectrum and bispectrum. We compare the non-Gaussian part of the one-loop covariance computed with both SPT and EFT of LSS to two separate simulations. In one simulation, we find that the one-loop prediction from SPT reproduces the simulation well to $k_i + k_j \sim$ 0.25 h/Mpc, while in the other simulation we find a substantial improvement of EFT of LSS (with one free parameter) over SPT, more than doubling the range of $k$ where the theory accurately reproduces the simulation. The disagreement between these two simulations points to unaccounted for systematics, highlighting the need for improved numerical and analytic understanding of the covariance. △ Less

Submitted 23 May, 2016; v1 submitted 23 December, 2015; originally announced December 2015.

Comments: v2 - 10+9 pages, 6 figures; minor changes + data analysis and conclusions updated. Version accepted for publication in PRD

Journal ref: Phys. Rev. D 93, 123505 (2016)

arXiv:1504.00679 [pdf, other]

Towards an Understanding of the Correlations in Jet Substructure

Authors: D. Adams, A. Arce, L. Asquith, M. Backovic, T. Barillari, P. Berta, D. Bertolini, A. Buckley, J. Butterworth, R. C. Camacho Toro, J. Caudron, Y. -T. Chien, J. Cogan, B. Cooper, D. Curtin, C. Debenedetti, J. Dolen, M. Eklund, S. El Hedri, S. D. Ellis, T. Embry, D. Ferencek, J. Ferrando, S. Fleischmann, M. Freytsis , et al. (61 additional authors not shown)

Abstract: Over the past decade, a large number of jet substructure observables have been proposed in the literature, and explored at the LHC experiments. Such observables attempt to utilize the internal structure of jets in order to distinguish those initiated by quarks, gluons, or by boosted heavy objects, such as top quarks and W bosons. This report, originating from and motivated by the BOOST2013 worksho… ▽ More Over the past decade, a large number of jet substructure observables have been proposed in the literature, and explored at the LHC experiments. Such observables attempt to utilize the internal structure of jets in order to distinguish those initiated by quarks, gluons, or by boosted heavy objects, such as top quarks and W bosons. This report, originating from and motivated by the BOOST2013 workshop, presents original particle-level studies that aim to improve our understanding of the relationships between jet substructure observables, their complementarity, and their dependence on the underlying jet properties, particularly the jet radius and jet transverse momentum. This is explored in the context of quark/gluon discrimination, boosted W boson tagging and boosted top quark tagging. △ Less

Submitted 18 August, 2015; v1 submitted 2 April, 2015; originally announced April 2015.

Comments: Report prepared by the participants of the BOOST 2013 workshop, hosted by the University of Arizona at Flagstaff, AZ, 12-16 August 2013. 54 pages, 51 figures. Version to be published in EPJC

arXiv:1501.01965 [pdf, other]

doi 10.1007/JHEP05(2015)008

The First Calculation of Fractional Jets

Authors: Daniele Bertolini, Jesse Thaler, Jonathan R. Walsh

Abstract: In collider physics, jet algorithms are a ubiquitous tool for clustering particles into discrete jet objects. Event shapes offer an alternative way to characterize jets, and one can define a jet multiplicity event shape, which can take on fractional values, using the framework of "jets without jets". In this paper, we perform the first analytic studies of fractional jet multiplicity… ▽ More In collider physics, jet algorithms are a ubiquitous tool for clustering particles into discrete jet objects. Event shapes offer an alternative way to characterize jets, and one can define a jet multiplicity event shape, which can take on fractional values, using the framework of "jets without jets". In this paper, we perform the first analytic studies of fractional jet multiplicity $\tilde{N}_{\rm jet}$ in the context of $e^+e^-$ collisions. We use fixed-order QCD to understand the $\tilde{N}_{\rm jet}$ cross section at order $α_s^2$, and we introduce a candidate factorization theorem to capture certain higher-order effects. The resulting distributions have a hybrid jet algorithm/event shape behavior which agrees with parton shower Monte Carlo generators. The $\tilde{N}_{\rm jet}$ observable does not satisfy ordinary soft-collinear factorization, and the $\tilde{N}_{\rm jet}$ cross section exhibits a number of unique features, including the absence of collinear logarithms and the presence of soft logarithms that are purely non-global. Additionally, we find novel divergences connected to the energy sharing between emissions, which are reminiscent of rapidity divergences encountered in other applications. Given these interesting properties of fractional jet multiplicity, we advocate for future measurements and calculations of $\tilde{N}_{\rm jet}$ at hadron colliders like the LHC. △ Less

Submitted 14 May, 2015; v1 submitted 8 January, 2015; originally announced January 2015.

Comments: 45 pages, 11 figures, 2 tables; v2: expanded discussion of non-additivity, v3: journal version

Report number: MIT-CTP 4632

Journal ref: JHEP05(2015)008

arXiv:1407.6013 [pdf, other]

doi 10.1007/JHEP10(2014)059

Pileup Per Particle Identification

Authors: Daniele Bertolini, Philip Harris, Matthew Low, Nhan Tran

Abstract: We propose a new method for pileup mitigation by implementing "pileup per particle identification" (PUPPI). For each particle we first define a local shape $α$ which probes the collinear versus soft diffuse structure in the neighborhood of the particle. The former is indicative of particles originating from the hard scatter and the latter of particles originating from pileup interactions. The dist… ▽ More We propose a new method for pileup mitigation by implementing "pileup per particle identification" (PUPPI). For each particle we first define a local shape $α$ which probes the collinear versus soft diffuse structure in the neighborhood of the particle. The former is indicative of particles originating from the hard scatter and the latter of particles originating from pileup interactions. The distribution of $α$ for charged pileup, assumed as a proxy for all pileup, is used on an event-by-event basis to calculate a weight for each particle. The weights describe the degree to which particles are pileup-like and are used to rescale their four-momenta, superseding the need for jet-based corrections. Furthermore, the algorithm flexibly allows combination with other, possibly experimental, probabilistic information associated with particles such as vertexing and timing performance. We demonstrate the algorithm improves over existing methods by looking at jet $p_T$ and jet mass. We also find an improvement on non-jet quantities like missing transverse energy. △ Less

Submitted 29 September, 2014; v1 submitted 22 July, 2014; originally announced July 2014.

Comments: v2 - 23 pages, 10 figures; update to JHEP version, minor revisions throughout, results unchanged

Journal ref: JHEP 1410 (2014) 59

arXiv:1310.7584 [pdf, other]

doi 10.1007/JHEP04(2014)013

Jet Observables Without Jet Algorithms

Authors: Daniele Bertolini, Tucker Chan, Jesse Thaler

Abstract: We introduce a new class of event shapes to characterize the jet-like structure of an event. Like traditional event shapes, our observables are infrared/collinear safe and involve a sum over all hadrons in an event, but like a jet clustering algorithm, they incorporate a jet radius parameter and a transverse momentum cut. Three of the ubiquitous jet-based observables---jet multiplicity, summed sca… ▽ More We introduce a new class of event shapes to characterize the jet-like structure of an event. Like traditional event shapes, our observables are infrared/collinear safe and involve a sum over all hadrons in an event, but like a jet clustering algorithm, they incorporate a jet radius parameter and a transverse momentum cut. Three of the ubiquitous jet-based observables---jet multiplicity, summed scalar transverse momentum, and missing transverse momentum---have event shape counterparts that are closely correlated with their jet-based cousins. Due to their "local" computational structure, these jet-like event shapes could potentially be used for trigger-level event selection at the LHC. Intriguingly, the jet multiplicity event shape typically takes on non-integer values, highlighting the inherent ambiguity in defining jets. By inverting jet multiplicity, we show how to characterize the transverse momentum of the n-th hardest jet without actually finding the constituents of that jet. Since many physics applications do require knowledge about the jet constituents, we also build a hybrid event shape that incorporates (local) jet clustering information. As a straightforward application of our general technique, we derive an event-shape version of jet trimming, allowing event-wide jet grooming without explicit jet identification. Finally, we briefly mention possible applications of our method for jet substructure studies. △ Less

Submitted 9 March, 2014; v1 submitted 28 October, 2013; originally announced October 2013.

Comments: v2 - 31 pages, 18 figures; update to JHEP version, section 3.2 expanded, reference to FastJet contrib updated, results unchanged

Report number: MIT-CTP 4502

arXiv:1302.6229 [pdf, other]

doi 10.1142/9789814525220_0009

TASI 2012: Super-Tricks for Superspace

Authors: Daniele Bertolini, Jesse Thaler, Zoe Thomas

Abstract: These lectures from the TASI 2012 summer school outline the basics of supersymmetry (SUSY) in 3+1 dimensions. Starting from a ground-up development of superspace, we develop all of the tools necessary to construct SUSY lagrangians. While aimed at an introductory level, these lectures incorporate a number of "super-tricks" for SUSY aficionados, including SUSY-covariant derivatives, equations of m… ▽ More These lectures from the TASI 2012 summer school outline the basics of supersymmetry (SUSY) in 3+1 dimensions. Starting from a ground-up development of superspace, we develop all of the tools necessary to construct SUSY lagrangians. While aimed at an introductory level, these lectures incorporate a number of "super-tricks" for SUSY aficionados, including SUSY-covariant derivatives, equations of motion in superspace, background field methods, and non-linear realizations of goldstinos. △ Less

Submitted 16 May, 2013; v1 submitted 25 February, 2013; originally announced February 2013.

Comments: 75 pages, 4 figures, 1 table. v2: formatting improved, hyperlinks added, references updated

Report number: MIT-CTP 4444

arXiv:1207.4209 [pdf, other]

doi 10.1007/JHEP12(2012)118

The Social Higgs

Authors: Daniele Bertolini, Matthew McCullough

Abstract: Using published Higgs search data we investigate whether any evidence supports the possibility that the Higgs may be mixed with other neutral scalars. We combine the positive evidence for the Higgs at 125.5 GeV with search constraints at other masses to explore the viability of two simple models. The first Higgs 'friend' model is simply a neutral scalar mixed with the Higgs. In the second Higgs 'a… ▽ More Using published Higgs search data we investigate whether any evidence supports the possibility that the Higgs may be mixed with other neutral scalars. We combine the positive evidence for the Higgs at 125.5 GeV with search constraints at other masses to explore the viability of two simple models. The first Higgs 'friend' model is simply a neutral scalar mixed with the Higgs. In the second Higgs 'accomplice' model the new scalar has an enhanced coupling to photons due to couplings to additional charged fields. We find that the latter scenario allows improvement in fitting the data by accommodating enhanced diphoton rates and suppression in other channels for a Higgs mass of 125.5 GeV. Small excesses at other masses allow the additional scalar to further improve the fit to the data, particularly if it has mass in the vicinity of 210 GeV. Due to observed event rates at 125.5 GeV and strong limits in high mass Higgs searches, mixing angles greater than pi/4 are typically disfavored at the 95% confidence level, depending on the mass of the scalar. △ Less

Submitted 22 October, 2012; v1 submitted 17 July, 2012; originally announced July 2012.

Comments: 11 pages, 4 figures. v2 references added, Higgs data updated

Report number: MIT-CTP 4383

arXiv:1111.0628 [pdf, ps, other]

doi 10.1007/JHEP04(2012)130

Visible Supersymmetry Breaking and an Invisible Higgs

Authors: Daniele Bertolini, Keith Rehermann, Jesse Thaler

Abstract: If there are multiple hidden sectors which independently break supersymmetry, then the spectrum will contain multiple goldstini. In this paper, we explore the possibility that the visible sector might also break supersymmetry, giving rise to an additional pseudo-goldstino. By the standard lore, visible sector supersymmetry breaking is phenomenologically excluded by the supertrace sum rule, but thi… ▽ More If there are multiple hidden sectors which independently break supersymmetry, then the spectrum will contain multiple goldstini. In this paper, we explore the possibility that the visible sector might also break supersymmetry, giving rise to an additional pseudo-goldstino. By the standard lore, visible sector supersymmetry breaking is phenomenologically excluded by the supertrace sum rule, but this sum rule is relaxed with multiple supersymmetry breaking. However, we find that visible sector supersymmetry breaking is still phenomenologically disfavored, not because of a sum rule, but because the visible sector pseudo-goldstino is generically overproduced in the early universe. A way to avoid this cosmological bound is to ensure that an R symmetry is preserved in the visible sector up to supergravity effects. A key expectation of this R-symmetric case is that the Higgs boson will dominantly decay invisibly at the LHC. △ Less

Submitted 4 May, 2012; v1 submitted 2 November, 2011; originally announced November 2011.

Comments: v1 - 27 pages, 13 figures, 1 table; v2 - references added; v3 - expanded discussion of higgs sector, JHEP version

Report number: MIT-CTP 4320

Showing 1–20 of 20 results for author: Bertolini, D