Search | arXiv e-print repository

Improving Hyperparameter Optimization with Checkpointed Model Weights

Authors: Nikhil Mehta, Jonathan Lorraine, Steve Masson, Ramanathan Arunachalam, Zaid Pervaiz Bhat, James Lucas, Arun George Zachariah

Abstract: When training deep learning models, the performance depends largely on the selected hyperparameters. However, hyperparameter optimization (HPO) is often one of the most expensive parts of model design. Classical HPO methods treat this as a black-box optimization problem. However, gray-box HPO methods, which incorporate more information about the setup, have emerged as a promising direction for mor… ▽ More When training deep learning models, the performance depends largely on the selected hyperparameters. However, hyperparameter optimization (HPO) is often one of the most expensive parts of model design. Classical HPO methods treat this as a black-box optimization problem. However, gray-box HPO methods, which incorporate more information about the setup, have emerged as a promising direction for more efficient optimization. For example, using intermediate loss evaluations to terminate bad selections. In this work, we propose an HPO method for neural networks using logged checkpoints of the trained weights to guide future hyperparameter selections. Our method, Forecasting Model Search (FMS), embeds weights into a Gaussian process deep kernel surrogate model, using a permutation-invariant graph metanetwork to be data-efficient with the logged network weights. To facilitate reproducibility and further research, we open-source our code at https://github.com/NVlabs/forecasting-model-search. △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: See the project website at https://research.nvidia.com/labs/toronto-ai/FMS/

MSC Class: 68T05 ACM Class: I.2.6; G.1.6; D.2.8

arXiv:2405.13978 [pdf, other]

Mitigating Interference in the Knowledge Continuum through Attention-Guided Incremental Learning

Authors: Prashant Bhat, Bharath Renjith, Elahe Arani, Bahram Zonooz

Abstract: Continual learning (CL) remains a significant challenge for deep neural networks, as it is prone to forgetting previously acquired knowledge. Several approaches have been proposed in the literature, such as experience rehearsal, regularization, and parameter isolation, to address this problem. Although almost zero forgetting can be achieved in task-incremental learning, class-incremental learning… ▽ More Continual learning (CL) remains a significant challenge for deep neural networks, as it is prone to forgetting previously acquired knowledge. Several approaches have been proposed in the literature, such as experience rehearsal, regularization, and parameter isolation, to address this problem. Although almost zero forgetting can be achieved in task-incremental learning, class-incremental learning remains highly challenging due to the problem of inter-task class separation. Limited access to previous task data makes it difficult to discriminate between classes of current and previous tasks. To address this issue, we propose `Attention-Guided Incremental Learning' (AGILE), a novel rehearsal-based CL approach that incorporates compact task attention to effectively reduce interference between tasks. AGILE utilizes lightweight, learnable task projection vectors to transform the latent representations of a shared task attention module toward task distribution. Through extensive empirical evaluation, we show that AGILE significantly improves generalization performance by mitigating task interference and outperforming rehearsal-based approaches in several CL scenarios. Furthermore, AGILE can scale well to a large number of tasks with minimal overhead while remaining well-calibrated with reduced task-recency bias. △ Less

Submitted 22 May, 2024; originally announced May 2024.

Comments: Published at 3rd Conference on Lifelong Learning Agents (CoLLAs 2024)

arXiv:2405.10752 [pdf, other]

Constraining possible $γ$-ray burst emission from GW230529 using Swift-BAT and Fermi-GBM

Authors: Samuele Ronchini, Suman Bala, Joshua Wood, James Delaunay, Simone Dichiara, Jamie A. Kennea, Tyler Parsotan, Gayathri Raman, Aaron Tohuvavohu, Naresh Adhikari, Narayana P. Bhat, Sylvia Biscoveanu, Elisabetta Bissaldi, Eric Burns, Sergio Campana, Koustav Chandra, William H. Cleveland, Sarah Dalessi, Massimiliano De Pasquale, Juan García-Bellido, Claudio Gasbarra, Misty M. Giles, Ish Gupta, Dieter Hartmann, Boyan A. Hristov , et al. (13 additional authors not shown)

Abstract: GW230529 is the first compact binary coalescence detected by the LIGO-Virgo-KAGRA collaboration with at least one component mass confidently in the lower mass-gap, corresponding to the range 3-5$M_{\odot}$. If interpreted as a neutron star-black hole merger, this event has the most symmetric mass ratio detected so far and therefore has a relatively high probability of producing electromagnetic (EM… ▽ More GW230529 is the first compact binary coalescence detected by the LIGO-Virgo-KAGRA collaboration with at least one component mass confidently in the lower mass-gap, corresponding to the range 3-5$M_{\odot}$. If interpreted as a neutron star-black hole merger, this event has the most symmetric mass ratio detected so far and therefore has a relatively high probability of producing electromagnetic (EM) emission. However, no EM counterpart has been reported. At the merger time $t_0$, Swift-BAT and Fermi-GBM together covered 100$\%$ of the sky. Performing a targeted search in a time window $[t_0-20 \text{s},t_0+20 \text{s}]$, we report no detection by the Swift-BAT and the Fermi-GBM instruments. Combining the position-dependent $γ-$ray flux upper limits and the gravitational-wave posterior distribution of luminosity distance, sky localization and inclination angle of the binary, we derive constraints on the characteristic luminosity and structure of the jet possibly launched during the merger. Assuming a top-hat jet structure, we exclude at 90$\%$ credibility the presence of a jet which has at the same time an on-axis isotropic luminosity $\gtrsim 10^{48}$ erg s$^{-1}$, in the bolometric band 1 keV-10 MeV, and a jet opening angle $\gtrsim 15$ deg. Similar constraints are derived testing other assumptions about the jet structure profile. Excluding GRB 170817A, the luminosity upper limits derived here are below the luminosity of any GRB observed so far. △ Less

Submitted 17 May, 2024; originally announced May 2024.

Comments: 18 pages, 1 table, 11 figures

arXiv:2404.18161 [pdf, other]

IMEX-Reg: Implicit-Explicit Regularization in the Function Space for Continual Learning

Authors: Prashant Bhat, Bharath Renjith, Elahe Arani, Bahram Zonooz

Abstract: Continual learning (CL) remains one of the long-standing challenges for deep neural networks due to catastrophic forgetting of previously acquired knowledge. Although rehearsal-based approaches have been fairly successful in mitigating catastrophic forgetting, they suffer from overfitting on buffered samples and prior information loss, hindering generalization under low-buffer regimes. Inspired by… ▽ More Continual learning (CL) remains one of the long-standing challenges for deep neural networks due to catastrophic forgetting of previously acquired knowledge. Although rehearsal-based approaches have been fairly successful in mitigating catastrophic forgetting, they suffer from overfitting on buffered samples and prior information loss, hindering generalization under low-buffer regimes. Inspired by how humans learn using strong inductive biases, we propose IMEX-Reg to improve the generalization performance of experience rehearsal in CL under low buffer regimes. Specifically, we employ a two-pronged implicit-explicit regularization approach using contrastive representation learning (CRL) and consistency regularization. To further leverage the global relationship between representations learned using CRL, we propose a regularization strategy to guide the classifier toward the activation correlations in the unit hypersphere of the CRL. Our results show that IMEX-Reg significantly improves generalization performance and outperforms rehearsal-based approaches in several CL scenarios. It is also robust to natural and adversarial corruptions with less task-recency bias. Additionally, we provide theoretical insights to support our design decisions further. △ Less

Submitted 28 April, 2024; originally announced April 2024.

Comments: Published in Transactions on Machine Learning Research

arXiv:2404.02402 [pdf, other]

Token Trails: Navigating Contextual Depths in Conversational AI with ChatLLM

Authors: Md. Kowsher, Ritesh Panditi, Nusrat Jahan Prottasha, Prakash Bhat, Anupam Kumar Bairagi, Mohammad Shamsul Arefin

Abstract: Conversational modeling using Large Language Models (LLMs) requires a nuanced understanding of context to generate coherent and contextually relevant responses. In this paper, we present Token Trails, a novel approach that leverages token-type embeddings to navigate the intricate contextual nuances within conversations. Our framework utilizes token-type embeddings to distinguish between user utter… ▽ More Conversational modeling using Large Language Models (LLMs) requires a nuanced understanding of context to generate coherent and contextually relevant responses. In this paper, we present Token Trails, a novel approach that leverages token-type embeddings to navigate the intricate contextual nuances within conversations. Our framework utilizes token-type embeddings to distinguish between user utterances and bot responses, facilitating the generation of context-aware replies. Through comprehensive experimentation and evaluation, we demonstrate the effectiveness of Token Trails in improving conversational understanding and response generation, achieving state-of-the-art performance. Our results highlight the significance of contextual modeling in conversational AI and underscore the promising potential of Token Trails to advance the field, paving the way for more sophisticated and contextually aware chatbot interactions. △ Less

Submitted 2 April, 2024; originally announced April 2024.

arXiv:2402.05428 [pdf, other]

Mixture Density Networks for Classification with an Application to Product Bundling

Authors: Narendhar Gugulothu, Sanjay P. Bhat, Tejas Bodas

Abstract: While mixture density networks (MDNs) have been extensively used for regression tasks, they have not been used much for classification tasks. One reason for this is that the usability of MDNs for classification is not clear and straightforward. In this paper, we propose two MDN-based models for classification tasks. Both models fit mixtures of Gaussians to the the data and use the fitted distribut… ▽ More While mixture density networks (MDNs) have been extensively used for regression tasks, they have not been used much for classification tasks. One reason for this is that the usability of MDNs for classification is not clear and straightforward. In this paper, we propose two MDN-based models for classification tasks. Both models fit mixtures of Gaussians to the the data and use the fitted distributions to classify a given sample by evaluating the learnt cumulative distribution function for the given input features. While the proposed MDN-based models perform slightly better than, or on par with, five baseline classification models on three publicly available datasets, the real utility of our models comes out through a real-world product bundling application. Specifically, we use our MDN-based models to learn the willingness-to-pay (WTP) distributions for two products from synthetic sales data of the individual products. The Gaussian mixture representation of the learnt WTP distributions is then exploited to obtain the WTP distribution of the bundle consisting of both the products. The proposed MDN-based models are able to approximate the true WTP distributions of both products and the bundle well. △ Less

Submitted 8 February, 2024; originally announced February 2024.

arXiv:2402.01643 [pdf, other]

L-TUNING: Synchronized Label Tuning for Prompt and Prefix in LLMs

Authors: Md. Kowsher, Md. Shohanur Islam Sobuj, Asif Mahmud, Nusrat Jahan Prottasha, Prakash Bhat

Abstract: Efficiently fine-tuning Large Language Models (LLMs) for specific tasks presents a considerable challenge in natural language processing. Traditional methods, like prompt or prefix tuning, typically rely on arbitrary tokens for training, leading to prolonged training times and generalized token use across various class labels. To address these issues, this paper introduces L-Tuning, an efficient f… ▽ More Efficiently fine-tuning Large Language Models (LLMs) for specific tasks presents a considerable challenge in natural language processing. Traditional methods, like prompt or prefix tuning, typically rely on arbitrary tokens for training, leading to prolonged training times and generalized token use across various class labels. To address these issues, this paper introduces L-Tuning, an efficient fine-tuning approach designed for classification tasks within the Natural Language Inference (NLI) framework. Diverging from conventional methods, L-Tuning focuses on the fine-tuning of label tokens processed through a pre-trained LLM, thereby harnessing its pre-existing semantic knowledge. This technique not only improves the fine-tuning accuracy and efficiency but also facilitates the generation of distinct label embeddings for each class, enhancing the model's training nuance. Our experimental results indicate a significant improvement in training efficiency and classification accuracy with L-Tuning compared to traditional approaches, marking a promising advancement in fine-tuning LLMs for complex language tasks. △ Less

Submitted 12 April, 2024; v1 submitted 20 December, 2023; originally announced February 2024.

Comments: Published in the ICLR TinyPaper track

arXiv:2401.01965 [pdf, other]

Quasi-two-dimensionality of three-dimensional, magnetically dominated, decaying turbulence

Authors: Shreya Dwivedi, Chandranathan Anandavijayan, Pallavi Bhat

Abstract: Decaying magnetohydrodynamic (MHD) turbulence is important in various astrophysical contexts, including early universe magnetic fields, star formation, turbulence in galaxy clusters, magnetospheres and solar corona. Previously known in the nonhelical case of magnetically dominated decaying turbulence, we show that magnetic reconnection is important also in the fully helical case and is likely the… ▽ More Decaying magnetohydrodynamic (MHD) turbulence is important in various astrophysical contexts, including early universe magnetic fields, star formation, turbulence in galaxy clusters, magnetospheres and solar corona. Previously known in the nonhelical case of magnetically dominated decaying turbulence, we show that magnetic reconnection is important also in the fully helical case and is likely the agent responsible for the inverse transfer of energy. Again, in the fully helical case, we find that there is a similarity in power law decay exponents in both 2.5D and 3D simulations. To understand this intriguing similarity, we investigate the possible quasi-two-dimensionalization of the 3D system. We perform Minkowski functional analysis and find that the characteristic length scales of a typical magnetic structure in the system are widely different, suggesting the existence of local anisotropies. Finally, we provide a quasi-two-dimensional hierarchical merger model which recovers the relevant power law scalings. In the nonhelical case, we show that a helicity-based invariant cannot constrain the system, and the best candidate is still anastrophy or vector potential squared, which is consistent with the quasi-two-dimensionalization of the system. △ Less

Submitted 3 January, 2024; originally announced January 2024.

Comments: 20 pages, 23 figures

arXiv:2310.18743 [pdf, other]

Optimization of utility-based shortfall risk: A non-asymptotic viewpoint

Authors: Sumedh Gupte, Prashanth L. A., Sanjay P. Bhat

Abstract: We consider the problems of estimation and optimization of utility-based shortfall risk (UBSR), which is a popular risk measure in finance. In the context of UBSR estimation, we derive a non-asymptotic bound on the mean-squared error of the classical sample average approximation (SAA) of UBSR. Next, in the context of UBSR optimization, we derive an expression for the UBSR gradient under a smooth p… ▽ More We consider the problems of estimation and optimization of utility-based shortfall risk (UBSR), which is a popular risk measure in finance. In the context of UBSR estimation, we derive a non-asymptotic bound on the mean-squared error of the classical sample average approximation (SAA) of UBSR. Next, in the context of UBSR optimization, we derive an expression for the UBSR gradient under a smooth parameterization. This expression is a ratio of expectations, both of which involve the UBSR. We use SAA for the numerator as well as denominator in the UBSR gradient expression to arrive at a biased gradient estimator. We derive non-asymptotic bounds on the estimation error, which show that our gradient estimator is asymptotically unbiased. We incorporate the aforementioned gradient estimator into a stochastic gradient (SG) algorithm for UBSR optimization. Finally, we derive non-asymptotic bounds that quantify the rate of convergence of our SG algorithm for UBSR optimization. △ Less

Submitted 30 March, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

arXiv:2310.08217 [pdf, other]

TriRE: A Multi-Mechanism Learning Paradigm for Continual Knowledge Retention and Promotion

Authors: Preetha Vijayan, Prashant Bhat, Elahe Arani, Bahram Zonooz

Abstract: Continual learning (CL) has remained a persistent challenge for deep neural networks due to catastrophic forgetting (CF) of previously learned tasks. Several techniques such as weight regularization, experience rehearsal, and parameter isolation have been proposed to alleviate CF. Despite their relative success, these research directions have predominantly remained orthogonal and suffer from sever… ▽ More Continual learning (CL) has remained a persistent challenge for deep neural networks due to catastrophic forgetting (CF) of previously learned tasks. Several techniques such as weight regularization, experience rehearsal, and parameter isolation have been proposed to alleviate CF. Despite their relative success, these research directions have predominantly remained orthogonal and suffer from several shortcomings, while missing out on the advantages of competing strategies. On the contrary, the brain continually learns, accommodates, and transfers knowledge across tasks by simultaneously leveraging several neurophysiological processes, including neurogenesis, active forgetting, neuromodulation, metaplasticity, experience rehearsal, and context-dependent gating, rarely resulting in CF. Inspired by how the brain exploits multiple mechanisms concurrently, we propose TriRE, a novel CL paradigm that encompasses retaining the most prominent neurons for each task, revising and solidifying the extracted knowledge of current and past tasks, and actively promoting less active neurons for subsequent tasks through rewinding and relearning. Across CL settings, TriRE significantly reduces task interference and surpasses different CL approaches considered in isolation. △ Less

Submitted 12 October, 2023; originally announced October 2023.

Comments: Accepted at 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

arXiv:2307.06248 [pdf]

HELEN: Traveling Wave SRF Linear Collider Higgs Factory

Authors: S. Belomestnykh, P. C. Bhat, A. Grassellino, S. Kazakov, H. Padamsee, S. Posen, A. Romanenko, V. Shiltsev, A. Valishev, V. Yakovlev

Abstract: Traveling wave SRF accelerating structures offer several advantages over the traditional standing wave structures: substantially lower $H_pk/E_acc$ and lower $E_pk/E_acc$, ratios of peak magnetic field and peak electric field to the accelerating gradient, respectively, together with substantially higher $R/Q$. In this paper we discuss how a linear collider Higgs Factory HELEN can be built using TW… ▽ More Traveling wave SRF accelerating structures offer several advantages over the traditional standing wave structures: substantially lower $H_pk/E_acc$ and lower $E_pk/E_acc$, ratios of peak magnetic field and peak electric field to the accelerating gradient, respectively, together with substantially higher $R/Q$. In this paper we discuss how a linear collider Higgs Factory HELEN can be built using TW-based SRF linacs. We cover a plan to address technological challenges and describe ways to upgrade the collider luminosity and energy. △ Less

Submitted 12 July, 2023; originally announced July 2023.

Comments: 14th International Particle Accelerator Conference IPAC'23. arXiv admin note: text overlap with arXiv:2209.01074

Report number: FERMILAB-CONF-23-183-AD-PPD-SQMS-TD

arXiv:2307.01281 [pdf, other]

Unified treatment of mean-field dynamo and angular-momentum transport in magnetorotational instability-driven turbulence

Authors: Tushar Mondal, Pallavi Bhat

Abstract: Magnetorotational instability (MRI)-driven turbulence and dynamo phenomena are analyzed using direct statistical simulations. Our approach begins by develo** a unified mean-field model that combines the traditionally decoupled problems of the large-scale dynamo and angular-momentum transport in accretion disks. The model consists of a hierarchical set of equations, capturing up to the second-ord… ▽ More Magnetorotational instability (MRI)-driven turbulence and dynamo phenomena are analyzed using direct statistical simulations. Our approach begins by develo** a unified mean-field model that combines the traditionally decoupled problems of the large-scale dynamo and angular-momentum transport in accretion disks. The model consists of a hierarchical set of equations, capturing up to the second-order cumulants, while a statistical closure approximation is employed to model the three-point correlators. We highlight the web of interactions that connect different components of stress tensors -- Maxwell, Reynolds, and Faraday -- through shear, rotation, correlators associated with mean fields, and nonlinear terms. We determine the dominant interactions crucial for the development and sustenance of MRI turbulence. Our general mean field model for the MRI-driven system allows for a self-consistent construction of the electromotive force, inclusive of inhomogeneities and anisotropies. Within the realm of large-scale magnetic field dynamo, we identify two key mechanisms -- the rotation-shear-current effect and the rotation-shear-vorticity effect -- that are responsible for generating the radial and vertical magnetic fields, respectively. We provide the explicit (nonperturbative) form of the transport coefficients associated with each of these dynamo effects. Notably, both of these mechanisms rely on the intrinsic presence of large-scale vorticity dynamo within MRI turbulence. △ Less

Submitted 14 November, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

Comments: 32 pages including 25 figures; Version accepted for publication in Phys. Rev. E

arXiv:2305.12262 [pdf, other]

Extreme Variability in a Long Duration Gamma-ray Burst Associated with a Kilonova

Authors: P. Veres, P. N. Bhat, E. Burns, R. Hamburg, N. Fraija, D. Kocevski, R. Preece, S. Poolakkil, N. Christensen, M. A. Bizouard, T. Dal Canton, S. Bala, E. Bissaldi, M. S. Briggs, W. Cleveland, A. Goldstein, B. A. Hristov, C. M. Hui, S. Lesage, B. Mailyan, O. J. Roberts, C. A. Wilson-Hodge

Abstract: The recent discovery of a kilonova from the long duration gamma-ray burst, GRB 211211A, challenges classification schemes based on temporal information alone. Gamma-ray properties of GRB 211211A reveal an extreme event, which stands out among both short and long GRBs. We find very short variations (few ms) in the lightcurve of GRB 211211A and estimate ~1000 for the Lorentz factor of the outflow. W… ▽ More The recent discovery of a kilonova from the long duration gamma-ray burst, GRB 211211A, challenges classification schemes based on temporal information alone. Gamma-ray properties of GRB 211211A reveal an extreme event, which stands out among both short and long GRBs. We find very short variations (few ms) in the lightcurve of GRB 211211A and estimate ~1000 for the Lorentz factor of the outflow. We discuss the relevance of the short variations in identifying similar long GRBs resulting from compact mergers. Our findings indicate that in future gravitational wave follow-up campaigns, some long duration GRBs should be treated as possible strong gravitational wave counterparts. △ Less

Submitted 20 May, 2023; originally announced May 2023.

Comments: 10 pages 5 figures, submitted to AAS journals

arXiv:2305.04769 [pdf, other]

BiRT: Bio-inspired Replay in Vision Transformers for Continual Learning

Authors: Kishaan Jeeveswaran, Prashant Bhat, Bahram Zonooz, Elahe Arani

Abstract: The ability of deep neural networks to continually learn and adapt to a sequence of tasks has remained challenging due to catastrophic forgetting of previously learned tasks. Humans, on the other hand, have a remarkable ability to acquire, assimilate, and transfer knowledge across tasks throughout their lifetime without catastrophic forgetting. The versatility of the brain can be attributed to the… ▽ More The ability of deep neural networks to continually learn and adapt to a sequence of tasks has remained challenging due to catastrophic forgetting of previously learned tasks. Humans, on the other hand, have a remarkable ability to acquire, assimilate, and transfer knowledge across tasks throughout their lifetime without catastrophic forgetting. The versatility of the brain can be attributed to the rehearsal of abstract experiences through a complementary learning system. However, representation rehearsal in vision transformers lacks diversity, resulting in overfitting and consequently, performance drops significantly compared to raw image rehearsal. Therefore, we propose BiRT, a novel representation rehearsal-based continual learning approach using vision transformers. Specifically, we introduce constructive noises at various stages of the vision transformer and enforce consistency in predictions with respect to an exponential moving average of the working model. Our method provides consistent performance gain over raw image and vanilla representation rehearsal on several challenging CL benchmarks, while being memory efficient and robust to natural and adversarial corruptions. △ Less

Submitted 8 May, 2023; originally announced May 2023.

Comments: Accepted at 40th International Conference on Machine Learning (ICML 2023)

arXiv:2304.10299 [pdf, ps, other]

Statement from the American Linear Collider Committee to the P5 subpanel

Authors: J. A. Bagger, S. Belomestnykh, P. C. Bhat, J. E. Brau, M. Demarteau, D. Denisov, S. Gori, P. D. Grannis, T. Junginger, A. J. Lankford, M. Liepe, T. W. Markiewicz, H. E. Montgomery, M. Perelstein, M. E. Peskin, J. Strube, A. P. White, G. W. Wilson

Abstract: This statement from the American Linear Collider Committee to the P5 subpanel has three purposes. It presents a brief summary of the case for an $e^+e^-$ Higgs factory that has emerged from Snowmass 2021. It highlights the special virtues of the ILC that are shared with other linear colliders but not with circular colliders. Finally, it calls attention to the resources available in the ILC White P… ▽ More This statement from the American Linear Collider Committee to the P5 subpanel has three purposes. It presents a brief summary of the case for an $e^+e^-$ Higgs factory that has emerged from Snowmass 2021. It highlights the special virtues of the ILC that are shared with other linear colliders but not with circular colliders. Finally, it calls attention to the resources available in the ILC White Paper for Snowmass (arXiv:2203.07622). The ALCC urges P5 to move the Higgs factory forward as a global project by assigning the idea of an $e^+e^-$ Higgs factory high priority, initiating a global discussion of the technology choice and cost sharing, and offering the option of siting the Higgs factory in the U.S. △ Less

Submitted 17 May, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

Comments: 6 pages

arXiv:2303.14172 [pdf, other]

doi 10.3847/2041-8213/ace5b4

Fermi-GBM Discovery of GRB 221009A: An Extraordinarily Bright GRB from Onset to Afterglow

Authors: S. Lesage, P. Veres, M. S. Briggs, A. Goldstein, D. Kocevski, E. Burns, C. A. Wilson-Hodge, P. N. Bhat, D. Huppenkothen, C. L. Fryer, R. Hamburg, J. Racusin, E. Bissaldi, W. H. Cleveland, S. Dalessi, C. Fletcher, M. M. Giles, B. A. Hristov, C. M. Hui, B. Mailyan, C. Malacaria, S. Poolakkil, O. J. Roberts, A. von Kienlin, J. Wood , et al. (115 additional authors not shown)

Abstract: We report the discovery of GRB 221009A, the highest flux gamma-ray burst ever observed by the Fermi Gamma-ray Burst Monitor (GBM). This GRB has continuous prompt emission lasting more than 600 seconds which smoothly transitions to afterglow visible in the GBM energy range (8 keV--40 MeV), and total energetics higher than any other burst in the GBM sample. By using a variety of new and existing ana… ▽ More We report the discovery of GRB 221009A, the highest flux gamma-ray burst ever observed by the Fermi Gamma-ray Burst Monitor (GBM). This GRB has continuous prompt emission lasting more than 600 seconds which smoothly transitions to afterglow visible in the GBM energy range (8 keV--40 MeV), and total energetics higher than any other burst in the GBM sample. By using a variety of new and existing analysis techniques we probe the spectral and temporal evolution of GRB 221009A. We find no emission prior to the GBM trigger time (t0; 2022 October 9 at 13:16:59.99 UTC), indicating that this is the time of prompt emission onset. The triggering pulse exhibits distinct spectral and temporal properties suggestive of the thermal, photospheric emission of shock-breakout, with significant emission up to $\sim$15 MeV. We characterize the onset of external shock at t0+600 s and find evidence of a plateau region in the early-afterglow phase which transitions to a slope consistent with Swift-XRT afterglow measurements. We place the total energetics of GRB 221009A in context with the rest of the GBM sample and find that this GRB has the highest total isotropic-equivalent energy ($\textrm{E}_{γ,\textrm{iso}}=1.0\times10^{55}$ erg) and second highest isotropic-equivalent luminosity ($\textrm{L}_{γ,\textrm{iso}}=9.9\times10^{53}$ erg/s) based on redshift of z = 0.151. These extreme energetics are what allowed us to observe the continuously emitting central engine of GBM from the beginning of the prompt emission phase through the onset of early afterglow. △ Less

Submitted 12 July, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

Comments: 26 pages 7 figures - accepted for publication in ApJL

arXiv:2303.10158 [pdf, other]

Data-centric Artificial Intelligence: A Survey

Authors: Daochen Zha, Zaid Pervaiz Bhat, Kwei-Herng Lai, Fan Yang, Zhimeng Jiang, Shaochen Zhong, Xia Hu

Abstract: Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler of its great success is the availability of abundant and high-quality data for building machine learning models. Recently, the role of data in AI has been significantly magnified, giving rise to the emerging concept of data-centric AI. The attention of researchers and practitioners has gradually shifte… ▽ More Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler of its great success is the availability of abundant and high-quality data for building machine learning models. Recently, the role of data in AI has been significantly magnified, giving rise to the emerging concept of data-centric AI. The attention of researchers and practitioners has gradually shifted from advancing model design to enhancing the quality and quantity of the data. In this survey, we discuss the necessity of data-centric AI, followed by a holistic view of three general data-centric goals (training data development, inference data development, and data maintenance) and the representative methods. We also organize the existing literature from automation and collaboration perspectives, discuss the challenges, and tabulate the benchmarks for various tasks. We believe this is the first comprehensive survey that provides a global view of a spectrum of tasks across various stages of the data lifecycle. We hope it can help the readers efficiently grasp a broad picture of this field, and equip them with the techniques and further research ideas to systematically engineer data for building AI systems. A companion list of data-centric AI resources will be regularly updated on https://github.com/daochenzha/data-centric-AI △ Less

Submitted 11 June, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

Comments: 38 pages, 6 figues, 5 tables. A companion list of data-centric AI resources is available at https://github.com/daochenzha/data-centric-AI

arXiv:2303.07834 [pdf, other]

Finite-Horizon Constrained MDPs With Both Additive And Multiplicative Utilities

Authors: Uday Kumar M, Sanjay P Bhat, Veeraruna Kavitha, Nandyala Hemachandra

Abstract: This paper considers the problem of finding a solution to the finite horizon constrained Markov decision processes (CMDP) where the objective as well as constraints are sum of additive and multiplicative utilities. Towards solving this, we construct another CMDP, with only additive utilities under a restricted set of policies, whose optimal value is equal to that of the original CMDP. Furthermore,… ▽ More This paper considers the problem of finding a solution to the finite horizon constrained Markov decision processes (CMDP) where the objective as well as constraints are sum of additive and multiplicative utilities. Towards solving this, we construct another CMDP, with only additive utilities under a restricted set of policies, whose optimal value is equal to that of the original CMDP. Furthermore, we provide a finite dimensional bilinear program (BLP) whose value equals the CMDP value and whose solution provides the optimal policy. We also suggest an algorithm to solve this BLP. △ Less

Submitted 15 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

arXiv:2302.11346 [pdf, other]

Task-Aware Information Routing from Common Representation Space in Lifelong Learning

Authors: Prashant Bhat, Bahram Zonooz, Elahe Arani

Abstract: Intelligent systems deployed in the real world suffer from catastrophic forgetting when exposed to a sequence of tasks. Humans, on the other hand, acquire, consolidate, and transfer knowledge between tasks that rarely interfere with the consolidated knowledge. Accompanied by self-regulated neurogenesis, continual learning in the brain is governed by a rich set of neurophysiological processes that… ▽ More Intelligent systems deployed in the real world suffer from catastrophic forgetting when exposed to a sequence of tasks. Humans, on the other hand, acquire, consolidate, and transfer knowledge between tasks that rarely interfere with the consolidated knowledge. Accompanied by self-regulated neurogenesis, continual learning in the brain is governed by a rich set of neurophysiological processes that harbor different types of knowledge, which are then integrated by conscious processing. Thus, inspired by the Global Workspace Theory of conscious information access in the brain, we propose TAMiL, a continual learning method that entails task-attention modules to capture task-specific information from the common representation space. We employ simple, undercomplete autoencoders to create a communication bottleneck between the common representation space and the global workspace, allowing only the task-relevant information to the global workspace, thus greatly reducing task interference. Experimental results show that our method outperforms state-of-the-art rehearsal-based and dynamic sparse approaches and bridges the gap between fixed capacity and parameter isolation approaches while being scalable. We also show that our method effectively mitigates catastrophic forgetting while being well-calibrated with reduced task-recency bias. △ Less

Submitted 14 February, 2023; originally announced February 2023.

Comments: Accepted as a conference paper at ICLR 2023

arXiv:2301.04819 [pdf, other]

Data-centric AI: Perspectives and Challenges

Authors: Daochen Zha, Zaid Pervaiz Bhat, Kwei-Herng Lai, Fan Yang, Xia Hu

Abstract: The role of data in building AI systems has recently been significantly magnified by the emerging concept of data-centric AI (DCAI), which advocates a fundamental shift from model advancements to ensuring data quality and reliability. Although our community has continuously invested efforts into enhancing data in different aspects, they are often isolated initiatives on specific tasks. To facilita… ▽ More The role of data in building AI systems has recently been significantly magnified by the emerging concept of data-centric AI (DCAI), which advocates a fundamental shift from model advancements to ensuring data quality and reliability. Although our community has continuously invested efforts into enhancing data in different aspects, they are often isolated initiatives on specific tasks. To facilitate the collective initiative in our community and push forward DCAI, we draw a big picture and bring together three general missions: training data development, inference data development, and data maintenance. We provide a top-level discussion on representative DCAI tasks and share perspectives. Finally, we list open challenges. More resources are summarized at https://github.com/daochenzha/data-centric-AI △ Less

Submitted 2 April, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

Comments: Accepted by SDM 2023 Blue Sky Track. More resources are summarized at https://github.com/daochenzha/data-centric-AI

arXiv:2212.02141 [pdf, other]

doi 10.3847/2041-8213/acb99d

A Cosmological Fireball with Sixteen-Percent Gamma-Ray Radiative Efficiency

Authors: Liang Li, Yu Wang, Felix Ryde, Asaf Pe'er, Bing Zhang, Sylvain Guiriec, Alberto J. Castro-Tirado, D. Alexander Kann, Magnus Axelsson, Kim Page, Peter Veres, P. N. Bhat

Abstract: Gamma-ray bursts (GRBs) are the most powerful explosions in the universe. How efficiently the jet converts its energy to radiation is a long-standing problem and it is poorly constrained. The standard model invokes a relativistic fireball with a bright photosphere emission component. A definitive diagnosis of GRB radiation components and measurement of GRB radiative efficiency require prompt emiss… ▽ More Gamma-ray bursts (GRBs) are the most powerful explosions in the universe. How efficiently the jet converts its energy to radiation is a long-standing problem and it is poorly constrained. The standard model invokes a relativistic fireball with a bright photosphere emission component. A definitive diagnosis of GRB radiation components and measurement of GRB radiative efficiency require prompt emission and afterglow data with high-resolution and wide-band coverage in time and energy. Here we report a comprehensive temporal and spectral analysis of the TeV-emitting bright GRB 190114C. Its fluence is one of the highest of all GRBs detected so far, which allows us to perform a high-resolution study of the prompt emission spectral properties and their temporal evolution down to a timescale of about 0.1 s. We observe that each of the initial pulses has a thermal component contributing $\sim20\%$ of the total energy, the corresponding temperature and the inferred Lorentz factor of the photosphere evolve following broken power-law shapes. From the observation of the non-thermal spectra and the light curve, the onset of afterglow corresponding to the deceleration of the fireball is considered at $\sim 6$~s. By incorporating the thermal and the non-thermal observations, as well as the photosphere and the synchrotron radiative mechanisms, we can directly derive the fireball energy budget with little dependence on hypothetical parameters and to measure a $\sim 16\%$ radiative efficiency for this GRB. With the fireball energy budget derived, the afterglow microphysics parameters can also be constrained directly from the data. △ Less

Submitted 12 February, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

Comments: 27 pages, 8 figures (including 14 panels), 6 tables, accepted for publication in The Astrophysical Journal Letters

arXiv:2209.14963 [pdf, ps, other]

Approximate Solutions To Constrained Risk-Sensitive Markov Decision Processes

Authors: Uday Kumar M, Sanjay P Bhat, Veeraruna Kavitha, Nandyala Hemachandra

Abstract: This paper considers the problem of finding near-optimal Markovian randomized (MR) policies for finite-state-action, infinite-horizon, constrained risk-sensitive Markov decision processes (CRSMDPs). Constraints are in the form of standard expected discounted cost functions as well as expected risk-sensitive discounted cost functions over finite and infinite horizons. The main contribution is to sh… ▽ More This paper considers the problem of finding near-optimal Markovian randomized (MR) policies for finite-state-action, infinite-horizon, constrained risk-sensitive Markov decision processes (CRSMDPs). Constraints are in the form of standard expected discounted cost functions as well as expected risk-sensitive discounted cost functions over finite and infinite horizons. The main contribution is to show that the problem possesses a solution if it is feasible, and to provide two methods for finding an approximate solution in the form of an ultimately stationary (US) MR policy. The latter is achieved through two approximating finite-horizon CRSMDPs which are constructed from the original CRSMDP by time-truncating the original objective and constraint cost functions, and suitably perturbing the constraint upper bounds. The first approximation gives a US policy which is $ε$-optimal and feasible for the original problem, while the second approximation gives a near-optimal US policy whose violation of the original constraints is bounded above by a specified $ε$. A key step in the proofs is an appropriate choice of a metric that makes the set of infinite-horizon MR policies and the feasible regions of the three CRSMDPs compact, and the objective and constraint functions continuous. A linear-programming-based formulation for solving the approximating finite-horizon CRSMDPs is also given. △ Less

Submitted 29 September, 2022; originally announced September 2022.

Comments: 38 pages

arXiv:2209.14136 [pdf, other]

Snowmass'21 Accelerator Frontier Report

Authors: S. Gourlay, T. Raubenheimer, V. Shiltsev, G. Arduini, R. Assmann, C. Barbier, M. Bai, S. Belomestnykh, S. Bermudez, P. Bhat, A. Faus-Golfe, J. Galambos, C. Geddes, G. Hoffstaetter, M. Hogan, Z. Huang, M. Lamont, D. Li, S. Lund, R. Milner, P. Musumeci, E. Nanni, M. Palmer, N. Pastrone, F. Pellemoine , et al. (13 additional authors not shown)

Abstract: In 2020-2022, extensive discussions and deliberations have taken place in corresponding topical working groups of the Snowmass Accelerator Frontier (AF) and in numerous joint meetings with other Frontiers, Snowmass-wide meetings, a series of Colloquium-style Agoras, cross-Frontier Forums on muon and electron-positron colliders and the collider Implementation Task Force (ITF). The outcomes of these… ▽ More In 2020-2022, extensive discussions and deliberations have taken place in corresponding topical working groups of the Snowmass Accelerator Frontier (AF) and in numerous joint meetings with other Frontiers, Snowmass-wide meetings, a series of Colloquium-style Agoras, cross-Frontier Forums on muon and electron-positron colliders and the collider Implementation Task Force (ITF). The outcomes of these activities are summarized in this Accelerator Frontier report. △ Less

Submitted 17 November, 2022; v1 submitted 28 September, 2022; originally announced September 2022.

Comments: contribution to Snowmass'21, v.2 (final)

arXiv:2209.01318 [pdf, other]

Muon Collider Forum Report

Authors: K. M. Black, S. **dariani, D. Li, F. Maltoni, P. Meade, D. Stratakis, D. Acosta, R. Agarwal, K. Agashe, C. Aime, D. Ally, A. Apresyan, A. Apyan, P. Asadi, D. Athanasakos, Y. Bao, E. Barzi, N. Bartosik, L. A. T. Bauerdick, J. Beacham, S. Belomestnykh, J. S. Berg, J. Berryhill, A. Bertolin, P. C. Bhat , et al. (160 additional authors not shown)

Abstract: A multi-TeV muon collider offers a spectacular opportunity in the direct exploration of the energy frontier. Offering a combination of unprecedented energy collisions in a comparatively clean leptonic environment, a high energy muon collider has the unique potential to provide both precision measurements and the highest energy reach in one machine that cannot be paralleled by any currently availab… ▽ More A multi-TeV muon collider offers a spectacular opportunity in the direct exploration of the energy frontier. Offering a combination of unprecedented energy collisions in a comparatively clean leptonic environment, a high energy muon collider has the unique potential to provide both precision measurements and the highest energy reach in one machine that cannot be paralleled by any currently available technology. The topic generated a lot of excitement in Snowmass meetings and continues to attract a large number of supporters, including many from the early career community. In light of this very strong interest within the US particle physics community, Snowmass Energy, Theory and Accelerator Frontiers created a cross-frontier Muon Collider Forum in November of 2020. The Forum has been meeting on a monthly basis and organized several topical workshops dedicated to physics, accelerator technology, and detector R&D. Findings of the Forum are summarized in this report. △ Less

Submitted 8 August, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

arXiv:2209.01074 [pdf]

HELEN: A Linear Collider Based On Advanced SRF Technology

Authors: S. Belomestnykh, P. C. Bhat, M. Checchin, A. Grassellino, M. Martinello, S. Nagaitsev, H. Padamsee, S. Posen, A. Romanenko, V. Shiltsev, A. Valishev, V. Yakovlev

Abstract: This paper discusses recently proposed Higgs Energy LEptoN (HELEN) $e+e-$ linear collider based on advances in superconducting radio frequency technology. The collider offers cost and AC power savings, smaller footprint (relative to the ILC), and could be built at Fermilab with an interaction region within the site boundaries. After the initial physics run at 250 GeV, the collider could be upgrade… ▽ More This paper discusses recently proposed Higgs Energy LEptoN (HELEN) $e+e-$ linear collider based on advances in superconducting radio frequency technology. The collider offers cost and AC power savings, smaller footprint (relative to the ILC), and could be built at Fermilab with an interaction region within the site boundaries. After the initial physics run at 250 GeV, the collider could be upgraded either to higher luminosity or to higher (up to 500 GeV) energies. △ Less

Submitted 2 September, 2022; originally announced September 2022.

Report number: FERMILAB-CONF-22-643-AD-PPD-SQMS-TD

arXiv:2208.13894 [pdf, other]

"I was Confused by It; It was Confused by Me:" Exploring the Experiences of People with Visual Impairments around Mobile Service Robots

Authors: Prajna M. Bhat, Yuhang Zhao

Abstract: Mobile service robots have become increasingly ubiquitous. However, these robots can pose potential accessibility issues and safety concerns to people with visual impairments (PVI). We sought to explore the challenges faced by PVI around mainstream mobile service robots and identify their needs. Seventeen PVI were interviewed about their experiences with three emerging robots: vacuum robots, deliv… ▽ More Mobile service robots have become increasingly ubiquitous. However, these robots can pose potential accessibility issues and safety concerns to people with visual impairments (PVI). We sought to explore the challenges faced by PVI around mainstream mobile service robots and identify their needs. Seventeen PVI were interviewed about their experiences with three emerging robots: vacuum robots, delivery robots, and drones. We comprehensively investigated PVI's robot experiences by considering their different roles around robots -- direct users and bystanders. Our study highlighted participants' challenges and concerns about the accessibility, safety, and privacy issues around mobile service robots. We found that the lack of accessible feedback made it difficult for PVI to precisely control, locate, and track the status of the robots. Moreover, encountering mobile robots as bystanders confused and even scared the participants, presenting safety and privacy barriers. We further distilled design considerations for more accessible and safe robots for PVI. △ Less

Submitted 29 August, 2022; originally announced August 2022.

Comments: Proc. ACM Hum.-Comput. Interact. 6, CSCW2, Article 481 (November 2022), 26 pages, 1 figure

arXiv:2207.06267 [pdf, other]

Task Agnostic Representation Consolidation: a Self-supervised based Continual Learning Approach

Authors: Prashant Bhat, Bahram Zonooz, Elahe Arani

Abstract: Continual learning (CL) over non-stationary data streams remains one of the long-standing challenges in deep neural networks (DNNs) as they are prone to catastrophic forgetting. CL models can benefit from self-supervised pre-training as it enables learning more generalizable task-agnostic features. However, the effect of self-supervised pre-training diminishes as the length of task sequences incre… ▽ More Continual learning (CL) over non-stationary data streams remains one of the long-standing challenges in deep neural networks (DNNs) as they are prone to catastrophic forgetting. CL models can benefit from self-supervised pre-training as it enables learning more generalizable task-agnostic features. However, the effect of self-supervised pre-training diminishes as the length of task sequences increases. Furthermore, the domain shift between pre-training data distribution and the task distribution reduces the generalizability of the learned representations. To address these limitations, we propose Task Agnostic Representation Consolidation (TARC), a two-stage training paradigm for CL that intertwines task-agnostic and task-specific learning whereby self-supervised training is followed by supervised learning for each task. To further restrict the deviation from the learned representations in the self-supervised stage, we employ a task-agnostic auxiliary loss during the supervised stage. We show that our training paradigm can be easily added to memory- or regularization-based approaches and provides consistent performance gain across more challenging CL settings. We further show that it leads to more robust and well-calibrated models. △ Less

Submitted 13 July, 2022; originally announced July 2022.

Comments: Accepted at Conference on Lifelong Learning Agents (CoLLAs 2022)

arXiv:2207.06213 [pdf, ps, other]

U.S. National Accelerator R\&D Program on Future Colliders

Authors: P. C. Bhat, S. Belomestnykh, A. Bross, S. Dasu, D. Denisov, S. Gourlay, S. **dariani, A. J. Lankford, S. Nagaitsev, E. A. Nanni, M. A. Palmer, T. Raubenheimer, V. Shiltsev, A. Valishev, C. Vernieri, F. Zimmermann

Abstract: Future colliders are an essential component of a strategic vision for particle physics. Conceptual studies and technical developments for several exciting future collider options are underway internationally. In order to realize a future collider, a concerted accelerator R\&D program is required. The U.S. HEP accelerator R\&D program currently has no direct effort in collider-specific R\&D area. T… ▽ More Future colliders are an essential component of a strategic vision for particle physics. Conceptual studies and technical developments for several exciting future collider options are underway internationally. In order to realize a future collider, a concerted accelerator R\&D program is required. The U.S. HEP accelerator R\&D program currently has no direct effort in collider-specific R\&D area. This shortcoming greatly compromises the U.S. leadership role in accelerator and particle physics. In this white paper, we propose a new national accelerator R\&D program on future colliders and outline the important characteristics of such a program. △ Less

Submitted 13 July, 2022; originally announced July 2022.

Comments: 8 pages; Submitted to the Proceedings of the US Community Study on the Future of Particle Physics (Snowmass 2021)

arXiv:2207.04998 [pdf, other]

Consistency is the key to further mitigating catastrophic forgetting in continual learning

Authors: Prashant Bhat, Bahram Zonooz, Elahe Arani

Abstract: Deep neural networks struggle to continually learn multiple sequential tasks due to catastrophic forgetting of previously learned tasks. Rehearsal-based methods which explicitly store previous task samples in the buffer and interleave them with the current task samples have proven to be the most effective in mitigating forgetting. However, Experience Replay (ER) does not perform well under low-buf… ▽ More Deep neural networks struggle to continually learn multiple sequential tasks due to catastrophic forgetting of previously learned tasks. Rehearsal-based methods which explicitly store previous task samples in the buffer and interleave them with the current task samples have proven to be the most effective in mitigating forgetting. However, Experience Replay (ER) does not perform well under low-buffer regimes and longer task sequences as its performance is commensurate with the buffer size. Consistency in predictions of soft-targets can assist ER in preserving information pertaining to previous tasks better as soft-targets capture the rich similarity structure of the data. Therefore, we examine the role of consistency regularization in ER framework under various continual learning scenarios. We also propose to cast consistency regularization as a self-supervised pretext task thereby enabling the use of a wide variety of self-supervised learning methods as regularizers. While simultaneously enhancing model calibration and robustness to natural corruptions, regularizing consistency in predictions results in lesser forgetting across all continual learning scenarios. Among the different families of regularizers, we find that stricter consistency constraints preserve previous task information in ER better. △ Less

Submitted 11 July, 2022; originally announced July 2022.

Comments: Accepted at Conference on Lifelong Learning Agents (CoLLAs 2022)

arXiv:2203.09076 [pdf, other]

C$^3$ Demonstration Research and Development Plan

Authors: Emilio A. Nanni, Martin Breidenbach, Caterina Vernieri, Sergey Belomestnykh, Pushpalatha Bhat, Sergei Nagaitsev, Mei Bai, William Berg, Tim Barklow, John Byrd, Ankur Dhar, Ram C. Dhuley, Chris Doss, Joseph Duris, Auralee Edelen, Claudio Emma, Josef Frisch, Annika Gabriel, Spencer Gessner, Carsten Hast, Chunguang **g, Arkadiy Klebaner, Anatoly K. Krasnykh, John Lewellen, Matthias Liepe , et al. (25 additional authors not shown)

Abstract: C$^3$ is an opportunity to realize an e$^+$e$^-$ collider for the study of the Higgs boson at $\sqrt{s} = 250$ GeV, with a well defined upgrade path to 550 GeV while staying on the same short facility footprint. C$^3$ is based on a fundamentally new approach to normal conducting linear accelerators that achieves both high gradient and high efficiency at relatively low cost. Given the advanced stat… ▽ More C$^3$ is an opportunity to realize an e$^+$e$^-$ collider for the study of the Higgs boson at $\sqrt{s} = 250$ GeV, with a well defined upgrade path to 550 GeV while staying on the same short facility footprint. C$^3$ is based on a fundamentally new approach to normal conducting linear accelerators that achieves both high gradient and high efficiency at relatively low cost. Given the advanced state of linear collider designs, the key system that requires technical maturation for C$^3$ is the main linac. This white paper presents the staged approach towards a facility to demonstrate C$^3$ technology with both Direct (source and main linac) and Parallel (beam delivery, dam** ring, ancillary component) R&D. The white paper also includes discussion on the approach for technology industrialization, related HEP R&D activities that are enabled by C$^3$ R&D, infrastructure requirements and siting options. △ Less

Submitted 6 July, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

Comments: contribution to Snowmass 2021

Report number: SLAC-PUB-17660

arXiv:2203.08211 [pdf, other]

Higgs-Energy LEptoN (HELEN) Collider based on advanced superconducting radio frequency technology

Authors: S. Belomestnykh, P. C. Bhat, A. Grassellino, M. Checchin, D. Denisov, R. L. Geng, S. **dariani, M. Liepe, M. Martinello, P. Merkel, S. Nagaitsev, H. Padamsee, S. Posen, R. A. Rimmer, A. Romanenko, V. Shiltsev, A. Valishev, V. Yakovlev

Abstract: This Snowmass 2021 contributed paper discusses a Higgs-Energy LEptoN (HELEN) $e^+e^-$ linear collider based on advances superconducting radio frequency technology. The proposed collider offers cost and AC power savings, smaller footprint (relative to the ILC), and could be built at Fermilab with an Interaction Region within the site boundaries. After the initial physics run at 250 GeV, the collide… ▽ More This Snowmass 2021 contributed paper discusses a Higgs-Energy LEptoN (HELEN) $e^+e^-$ linear collider based on advances superconducting radio frequency technology. The proposed collider offers cost and AC power savings, smaller footprint (relative to the ILC), and could be built at Fermilab with an Interaction Region within the site boundaries. After the initial physics run at 250 GeV, the collider could be upgraded either to higher luminosity or to higher (up to 500 GeV) energies. If the ILC could not be realized in Japan in a timely fashion, the HELEN collider would be a viable option to build a Higgs factory in the U.S. △ Less

Submitted 15 March, 2022; originally announced March 2022.

Comments: contribution to Snowmass 2021

Report number: FERMILAB-FN-1155-AD-PPD-SQMS-TD

arXiv:2203.08088 [pdf, other]

Future Collider Options for the US

Authors: P. C. Bhat, S. **dariani, G. Ambrosio, G. Apollinari, S. Belomestnykh, A. Bross, J. Butler, A. Canepa, D. Elvira, P. Fox, Z. Gecse, E. Gianfelice-Wendt, P. Merkel, S. Nagaitsev, D. Neuffer, H. Piekarz, S. Posen, T. Sen, V. Shiltsev, N. Solyak, D. Stratakis, M. Syphers, G. Velev, V. Yakovlev, K. Yonehara , et al. (1 additional authors not shown)

Abstract: The United States has a rich history in high energy particle accelerators and colliders -- both lepton and hadron machines, which have enabled several major discoveries in elementary particle physics. To ensure continued progress in the field, U.S. leadership as a key partner in building next generation collider facilities abroad is essential; also critically important is the exploring of options… ▽ More The United States has a rich history in high energy particle accelerators and colliders -- both lepton and hadron machines, which have enabled several major discoveries in elementary particle physics. To ensure continued progress in the field, U.S. leadership as a key partner in building next generation collider facilities abroad is essential; also critically important is the exploring of options to host a future collider in the U.S. The "Snowmass" study and the subsequent Particle Physics Project Prioritization Panel (P5) process provide the timely opportunity to develop strategies for both. What we do now will shape the future of our field and whether the U.S. will remain a world leader in these areas. In this white paper, we briefly discuss the US engagement in proposed collider projects abroad and describe future collider options for the U.S. We also call for initiating an integrated R\&D program for future colliders. △ Less

Submitted 15 March, 2022; originally announced March 2022.

Comments: Contribution to Snowmass 2021

Report number: FERMILAB-CONF-22-144-PPD

arXiv:2203.08042 [pdf, other]

Higgs Self Couplings Measurements at Future proton-proton Colliders: a Snowmass White Paper

Authors: Angela Taliercio, Paola Mastrapasqua, Claudio Caputo, Pietro Vischia, Nicola De Filippis, Pushpalatha Bhat

Abstract: The Higgs boson trilinear and quartic self-couplings are directly related to the shape of the Higgs potential; measuring them with precision is extremely important, as they provide invaluable information on the electroweak symmetry breaking and the electroweak phase transition. In this paper, we perform a detailed analysis of double Higgs boson production, through the gluon gluon fusion process, i… ▽ More The Higgs boson trilinear and quartic self-couplings are directly related to the shape of the Higgs potential; measuring them with precision is extremely important, as they provide invaluable information on the electroweak symmetry breaking and the electroweak phase transition. In this paper, we perform a detailed analysis of double Higgs boson production, through the gluon gluon fusion process, in the most promising decay channels di-bottom-quark di-photons, di-bottom-quark di-tau, and four-bottom-quark for several future colliders: the HL-LHC at 14 TeV and the FCC-hh at 100 TeV, assuming respectively 3 inverse ab and 30 inverse ab of integrated luminosity. In the HL LHC scenario, we expect an upper limit on the di Higgs cross section production of 0.76 at 95% confidence level, corresponding to a significance of 2.8 sigma. In the FCC-hh scenario, depending on the assumed detector performance and systematic uncertainties, we expect that the Higgs self-coupling will be measured with a precision in the range 4.8-8.5% at 95% confidence level. △ Less

Submitted 23 March, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: contribution to Snowmass 2021

arXiv:2203.07646 [pdf, other]

Strategy for Understanding the Higgs Physics: The Cool Copper Collider

Authors: Sridhara Dasu, Emilio A. Nanni, Michael E. Peskin, Caterina Vernieri, Tim Barklow, Rainer Bartoldus, Pushpalatha C. Bhat, Kevin Black, Jim Brau, Martin Breidenbach, Nathaniel Craig, Dmitri Denisov, Lindsey Gray, Philip C. Harris, Michael Kagan, Zhen Liu, Patrick Meade, Nathan Majernik, Sergei Nagaitsev, Isobel Ojalvo, Christoph Paus, Carl Schroeder, Ariel G. Schwartzman, Jan Strube, Su Dong , et al. (4 additional authors not shown)

Abstract: A program to build a lepton-collider Higgs factory, to precisely measure the couplings of the Higgs boson to other particles, followed by a higher energy run to establish the Higgs self-coupling and expand the new physics reach, is widely recognized as a primary focus of modern particle physics. We propose a strategy that focuses on a new technology and preliminary estimates suggest that can lead… ▽ More A program to build a lepton-collider Higgs factory, to precisely measure the couplings of the Higgs boson to other particles, followed by a higher energy run to establish the Higgs self-coupling and expand the new physics reach, is widely recognized as a primary focus of modern particle physics. We propose a strategy that focuses on a new technology and preliminary estimates suggest that can lead to a compact, affordable machine. New technology investigations will provide much needed enthusiasm for our field, resulting in trained workforce. This cost-effective, compact design, with technologies useful for a broad range of other accelerator applications, could be realized as a project in the US. Its technology innovations, both in the accelerator and the detector, will offer unique and exciting opportunities to young scientists. Moreover, cost effective compact designs, broadly applicable to other fields of research, are more likely to obtain financial support from our funding agencies. △ Less

Submitted 7 June, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: 11 pages, 2 figures, contribution to Snowmass 2021

Report number: SLAC-PUB-17661

arXiv:2203.07622 [pdf, other]

The International Linear Collider: Report to Snowmass 2021

Authors: Alexander Aryshev, Ties Behnke, Mikael Berggren, James Brau, Nathaniel Craig, Ayres Freitas, Frank Gaede, Spencer Gessner, Stefania Gori, Christophe Grojean, Sven Heinemeyer, Daniel Jeans, Katja Kruger, Benno List, Jenny List, Zhen Liu, Shinichiro Michizono, David W. Miller, Ian Moult, Hitoshi Murayama, Tatsuya Nakada, Emilio Nanni, Mihoko Nojiri, Hasan Padamsee, Maxim Perelstein , et al. (487 additional authors not shown)

Abstract: The International Linear Collider (ILC) is on the table now as a new global energy-frontier accelerator laboratory taking data in the 2030s. The ILC addresses key questions for our current understanding of particle physics. It is based on a proven accelerator technology. Its experiments will challenge the Standard Model of particle physics and will provide a new window to look beyond it. This docu… ▽ More The International Linear Collider (ILC) is on the table now as a new global energy-frontier accelerator laboratory taking data in the 2030s. The ILC addresses key questions for our current understanding of particle physics. It is based on a proven accelerator technology. Its experiments will challenge the Standard Model of particle physics and will provide a new window to look beyond it. This document brings the story of the ILC up to date, emphasizing its strong physics motivation, its readiness for construction, and the opportunity it presents to the US and the global particle physics community. △ Less

Submitted 16 January, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

Comments: 356 pages, Large pdf file (40 MB) submitted to Snowmass 2021; v2 references to Snowmass contributions added, additional authors; v3 references added, some updates, additional authors

Report number: DESY-22-045, IFT--UAM/CSIC--22-028, KEK Preprint 2021-61, PNNL-SA-160884, SLAC-PUB-17662

arXiv:2203.06164 [pdf, ps, other]

Higgs Factory Considerations

Authors: J. A. Bagger, B. C. Barish, S. Belomestnykh, P. C. Bhat, J. E. Brau, M. Demarteau, D. Denisov, S. C. Eno, C. G. R. Geddes, P. D. Grannis, A. Hutton, A. J. Lankford, M. U. Liepe, D. B. MacFarlane, T. Markiewicz, H. E. Montgomery, J. R. Patterson, M. Perelstein, M. E. Peskin, M. C. Ross, J. Strube, A. P. White, G. W. Wilson

Abstract: We discuss considerations that can be used to formulate recommendations for initiating a lepton collider project that would provide precision studies of the Higgs boson and related electroweak phenomena. We discuss considerations that can be used to formulate recommendations for initiating a lepton collider project that would provide precision studies of the Higgs boson and related electroweak phenomena. △ Less

Submitted 17 March, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

Comments: contribution to Snowmass 2021

arXiv:2202.07503 [pdf, other]

BED: A Real-Time Object Detection System for Edge Devices

Authors: Guanchu Wang, Zaid Pervaiz Bhat, Zhimeng Jiang, Yi-Wei Chen, Daochen Zha, Alfredo Costilla Reyes, Afshin Niktash, Gorkem Ulkar, Erman Okman, Xuanting Cai, Xia Hu

Abstract: Deploying deep neural networks~(DNNs) on edge devices provides efficient and effective solutions for the real-world tasks. Edge devices have been used for collecting a large volume of data efficiently in different domains. DNNs have been an effective tool for data processing and analysis. However, designing DNNs on edge devices is challenging due to the limited computational resources and memory.… ▽ More Deploying deep neural networks~(DNNs) on edge devices provides efficient and effective solutions for the real-world tasks. Edge devices have been used for collecting a large volume of data efficiently in different domains. DNNs have been an effective tool for data processing and analysis. However, designing DNNs on edge devices is challenging due to the limited computational resources and memory. To tackle this challenge, we demonstrate Object Detection System for Edge Devices~(BED) on the MAX78000 DNN accelerator. It integrates on-device DNN inference with a camera and an LCD display for image acquisition and detection exhibition, respectively. BED is a concise, effective and detailed solution, including model training, quantization, synthesis and deployment. The entire repository is open-sourced on Github, including a Graphical User Interface~(GUI) for on-chip debugging. Experiment results indicate that BED can produce accurate detection with a 300-KB tiny DNN model, which takes only 91.9 ms of inference time and 1.845 mJ of energy. The real-time detection is available at YouTube. △ Less

Submitted 25 September, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

arXiv:2108.08740 [pdf, other]

doi 10.1093/mnras/stab3138

Saturation of large-scale dynamo in anisotropically forced turbulence

Authors: Pallavi Bhat

Abstract: Turbulent dynamo theories have faced difficulties in obtaining evolution of large-scale magnetic fields on short dynamical time-scales due to the constraint imposed by magnetic helicity balance. This has critical implications for understanding the large-scale magnetic field evolution in astrophysical systems like the Sun, stars and galaxies. Direct numerical simulations (DNS) in the past with isot… ▽ More Turbulent dynamo theories have faced difficulties in obtaining evolution of large-scale magnetic fields on short dynamical time-scales due to the constraint imposed by magnetic helicity balance. This has critical implications for understanding the large-scale magnetic field evolution in astrophysical systems like the Sun, stars and galaxies. Direct numerical simulations (DNS) in the past with isotropically forced helical turbulence have shown that large-scale dynamo saturation time-scales are dependent on the magnetic Reynolds number (Rm). In this work, we have carried out periodic box DNS of helically forced turbulence leading to a large-scale dynamo with two kinds of forcing function, an isotropic one based on that used in PENCIL-CODE and an anisotropic one based on Galloway-Proctor flows. We show that when the turbulence is forced anisotropically, the nonlinear (saturation) behaviour of the large-scale dynamo is only weakly dependent on Rm. In fact the magnetic helicity evolution on small and large scales in the anisotropic case is distinctly different from that in the isotropic case. This result possibly holds promise for the alleviation of important issues like catastrophic quenching. △ Less

Submitted 2 December, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

Comments: 9 pages, 10 figures, comments welcome. Published in MNRAS

arXiv:2108.04212 [pdf, other]

AutoVideo: An Automated Video Action Recognition System

Authors: Daochen Zha, Zaid Pervaiz Bhat, Yi-Wei Chen, Yicheng Wang, Sirui Ding, Jiaben Chen, Kwei-Herng Lai, Mohammad Qazim Bhat, Anmoll Kumar Jain, Alfredo Costilla Reyes, Na Zou, Xia Hu

Abstract: Action recognition is an important task for video understanding with broad applications. However, develo** an effective action recognition solution often requires extensive engineering efforts in building and testing different combinations of the modules and their hyperparameters. In this demo, we present AutoVideo, a Python system for automated video action recognition. AutoVideo is featured fo… ▽ More Action recognition is an important task for video understanding with broad applications. However, develo** an effective action recognition solution often requires extensive engineering efforts in building and testing different combinations of the modules and their hyperparameters. In this demo, we present AutoVideo, a Python system for automated video action recognition. AutoVideo is featured for 1) highly modular and extendable infrastructure following the standard pipeline language, 2) an exhaustive list of primitives for pipeline construction, 3) data-driven tuners to save the efforts of pipeline tuning, and 4) easy-to-use Graphical User Interface (GUI). AutoVideo is released under MIT license at https://github.com/datamllab/autovideo △ Less

Submitted 16 July, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

Comments: Accepted by IJCAI https://github.com/datamllab/autovideo

arXiv:2104.09866 [pdf, other]

Distill on the Go: Online knowledge distillation in self-supervised learning

Authors: Prashant Bhat, Elahe Arani, Bahram Zonooz

Abstract: Self-supervised learning solves pretext prediction tasks that do not require annotations to learn feature representations. For vision tasks, pretext tasks such as predicting rotation, solving jigsaw are solely created from the input data. Yet, predicting this known information helps in learning representations useful for downstream tasks. However, recent works have shown that wider and deeper mode… ▽ More Self-supervised learning solves pretext prediction tasks that do not require annotations to learn feature representations. For vision tasks, pretext tasks such as predicting rotation, solving jigsaw are solely created from the input data. Yet, predicting this known information helps in learning representations useful for downstream tasks. However, recent works have shown that wider and deeper models benefit more from self-supervised learning than smaller models. To address the issue of self-supervised pre-training of smaller models, we propose Distill-on-the-Go (DoGo), a self-supervised learning paradigm using single-stage online knowledge distillation to improve the representation quality of the smaller models. We employ deep mutual learning strategy in which two models collaboratively learn from each other to improve one another. Specifically, each model is trained using self-supervised learning along with distillation that aligns each model's softmax probabilities of similarity scores with that of the peer model. We conduct extensive experiments on multiple benchmark datasets, learning objectives, and architectures to demonstrate the potential of our proposed method. Our results show significant performance gain in the presence of noisy and limited labels and generalization to out-of-distribution data. △ Less

Submitted 30 June, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

Comments: Spotlight @ Learning from Limited or Imperfect Data (L2ID) Workshop - CVPR 2021

arXiv:2104.09362 [pdf, ps, other]

doi 10.3847/1538-4357/abf938

A new identity card for the bulge globular cluster NGC 6440 from resolved star counts

Authors: Cristina Pallanca, Barbara Lanzoni, Francesco R. Ferraro, Luca Casagrande, Sara Saracino, Bhavana Purohith Bhaskar Bhat, Silvia Leanza, Emanuele Dalessandro, Enrico Vesperini, --

Abstract: We present a new identity card for the cluster NGC 6440 in the Galactic Bulge. We have used a combination of high-resolution Hubble Space Telescope images, wide-field ground-based observations performed with the ESO-FORS2, and the public survey catalog Pan-STARRS, to determine the gravitational center, projected density profile and structural parameters of this globular from resolved star counts.… ▽ More We present a new identity card for the cluster NGC 6440 in the Galactic Bulge. We have used a combination of high-resolution Hubble Space Telescope images, wide-field ground-based observations performed with the ESO-FORS2, and the public survey catalog Pan-STARRS, to determine the gravitational center, projected density profile and structural parameters of this globular from resolved star counts. The new determination of the cluster center differs by ~ 2" (corresponding to 0.08 pc) from the previous estimate, which was based on the surface brightness peak. The star density profile, extending out to 700" from the center and suitably decontaminated from the Galactic field contribution, is best-fitted by a King model with significantly larger concentration ($c=1.86\pm0.06$) and smaller core radius ($r_c=6.4"\pm0.3"$) with respect to the literature values. By taking advantage of high-quality optical and near-infrared color-magnitude diagrams, we also estimated the cluster age, distance and reddening. The luminosity of the RGB-bump was also determined. This study indicates that the extinction coefficient in the bulge, in the direction of the cluster has a value ($R_V=2.7$) that is significantly smaller than that traditionally used for the Galaxy ($R_V=3.1$). The corresponding best-fit values of the age, distance and color excess of NGC 6440 are 13 Gyr, 8.3 kpc and $E(B-V)\sim 1.27$, respectively. These new determinations also allowed us to update the values of the central ($t_{rc}=2.5\ 10^7$ yr) and half-mass ($t_{rh}=10^9$ yr) relaxation times, suggesting that NGC 6440 is in a dynamically evolved stage. △ Less

Submitted 19 April, 2021; originally announced April 2021.

Comments: Accepted for publication in The Astrophysical Journal; 19 pages, 9 figures and 2 tables

arXiv:2104.03142 [pdf, other]

A matrix math facility for Power ISA(TM) processors

Authors: José E. Moreira, Kit Barton, Steven Battle, Peter Bergner, Ramon Bertran, Puneeth Bhat, Pedro Caldeira, David Edelsohn, Gordon Fossum, Brad Frey, Nemanja Ivanovic, Chip Kerchner, Vincent Lim, Shakti Kapoor, Tulio Machado Filho, Silvia Melitta Mueller, Brett Olsson, Satish Sadasivam, Baptiste Saleil, Bill Schmidt, Rajalakshmi Srinivasaraghavan, Shricharan Srivatsan, Brian Thompto, Andreas Wagner, Nelson Wu

Abstract: Power ISA(TM) Version 3.1 has introduced a new family of matrix math instructions, collectively known as the Matrix-Multiply Assist (MMA) facility. The instructions in this facility implement numerical linear algebra operations on small matrices and are meant to accelerate computation-intensive kernels, such as matrix multiplication, convolution and discrete Fourier transform. These instructions h… ▽ More Power ISA(TM) Version 3.1 has introduced a new family of matrix math instructions, collectively known as the Matrix-Multiply Assist (MMA) facility. The instructions in this facility implement numerical linear algebra operations on small matrices and are meant to accelerate computation-intensive kernels, such as matrix multiplication, convolution and discrete Fourier transform. These instructions have led to a power- and area-efficient implementation of a high throughput math engine in the future POWER10 processor. Performance per core is 4 times better, at constant frequency, than the previous generation POWER9 processor. We also advocate the use of compiler built-ins as the preferred way of leveraging these instructions, which we illustrate through case studies covering matrix multiplication and convolution. △ Less

Submitted 7 April, 2021; originally announced April 2021.

arXiv:2103.13528 [pdf, other]

doi 10.3847/1538-4357/abf24d

The Fermi GBM Gamma-Ray Burst Spectral Catalog: 10 Years of Data

Authors: S. Poolakkil, R. Preece, C. Fletcher, A. Goldstein, P. N. Bhat, E. Bissaldi, M. S. Briggs, E. Burns, W. H. Cleveland, M. M. Giles, C. M. Hui, D. Kocevski, S. Lesage, B. Mailyan, C. Malacaria, W. S. Paciesas, O. J. Roberts, P. Veres, A. von Kienlin, C. A. Wilson-Hodge

Abstract: We present the systematic spectral analyses of gamma-ray bursts (GRBs) detected by the Fermi Gamma-Ray Burst Monitor (GBM) during its first ten years of operation. This catalog contains two types of spectra; time-integrated spectral fits and spectral fits at the brightest time bin, from 2297 GRBs, resulting in a compendium of over 18000 spectra. The four different spectral models used for fitting… ▽ More We present the systematic spectral analyses of gamma-ray bursts (GRBs) detected by the Fermi Gamma-Ray Burst Monitor (GBM) during its first ten years of operation. This catalog contains two types of spectra; time-integrated spectral fits and spectral fits at the brightest time bin, from 2297 GRBs, resulting in a compendium of over 18000 spectra. The four different spectral models used for fitting the spectra were selected based on their empirical importance to the shape of many GRBs. We describe in detail our procedure and criteria for the analyses, and present the bulk results in the form of parameter distributions both in the observer frame and in the GRB rest frame. 941 GRBs from the first four years have been re-fitted using the same methodology as that of the 1356 GRBs in years five through ten. The data files containing the complete results are available from the High-Energy Astrophysics Science Archive Research Center (HEASARC). △ Less

Submitted 24 March, 2021; originally announced March 2021.

arXiv:2101.05146 [pdf, other]

doi 10.1038/s41586-020-03077-8

Rapid Spectral Variability of a Giant Flare from a Magnetar in NGC 253

Authors: O. J. Roberts, P. Veres, M. G. Baring, M. S. Briggs, C. Kouveliotou, E. Bissaldi, G. Younes, S. I. Chastain, J. J. DeLaunay, D. Huppenkothen, A. Tohuvavohu, P. N. Bhat, E. Gogus, A. J. van der Horst, J. A. Kennea, D. Kocevski, J. D. Linford, S. Guiriec, R. Hamburg, C. A. Wilson-Hodge, E. Burns

Abstract: Magnetars are slowly-rotating neutron stars with extremely strong magnetic fields ($10^{13-15}$ G), episodically emitting $\sim100$ ms long X-ray bursts with energies of $\sim10^{40-41}$ erg. Rarely, they produce extremely bright, energetic giant flares that begin with a short ($\sim0.2$ s), intense flash, followed by fainter, longer lasting emission modulated by the magnetar spin period (typicall… ▽ More Magnetars are slowly-rotating neutron stars with extremely strong magnetic fields ($10^{13-15}$ G), episodically emitting $\sim100$ ms long X-ray bursts with energies of $\sim10^{40-41}$ erg. Rarely, they produce extremely bright, energetic giant flares that begin with a short ($\sim0.2$ s), intense flash, followed by fainter, longer lasting emission modulated by the magnetar spin period (typically 2-12 s), thus confirming their origin. Over the last 40 years, only three such flares have been observed in our local group; they all suffered from instrumental saturation due to their extreme intensity. It has been proposed that extra-galactic giant flares likely constitute a subset of short gamma-ray bursts, noting that the sensitivity of current instrumentation prevents us from detecting the pulsating tail, while the initial bright flash is readily observable out to distances $\sim 10-20$ Mpc. Here, we report X- and gamma-ray observations of GRB 200415A, which exhibits a rapid onset, very fast time variability, flat spectra and significant sub-millisecond spectral evolution. These attributes match well with those expected for a giant flare from an extra-galactic magnetar, noting that GRB 200415A is directionally associated with the galaxy NGC 253 ($\sim$3.5 Mpc away). The detection of $\sim3$ MeV photons provides definitive evidence for relativistic motion of the emitting plasma. The observed rapid spectral evolution can naturally be generated by radiation emanating from such rapidly-moving gas in a rotating magnetar. △ Less

Submitted 13 January, 2021; originally announced January 2021.

arXiv:2012.03981 [pdf, other]

doi 10.1103/PhysRevLett.127.062003

Comparison of $pp$ and $p \bar{p}$ differential elastic cross sections and observation of the exchange of a colorless $C$-odd gluonic compound

Authors: V. M. Abazov, B. Abbott, B. S. Acharya, M. Adams, T. Adams, J. P. Agnew, G. D. Alexeev, G. Alkhazov, A. Alton, G. A. Alves, G. Antchev, A. Askew, P. Aspell, A. C. S. Assis Jesus, I. Atanassov, S. Atkins, K. Augsten, V. Aushev, Y. Aushev, V. Avati, C. Avila, F. Badaud, J. Baechler, L. Bagby, C. Baldenegro Barrera , et al. (451 additional authors not shown)

Abstract: We describe an analysis comparing the $p\bar{p}$ elastic cross section as measured by the D0 Collaboration at a center-of-mass energy of 1.96 TeV to that in $pp$ collisions as measured by the TOTEM Collaboration at 2.76, 7, 8, and 13 TeV using a model-independent approach. The TOTEM cross sections extrapolated to a center-of-mass energy of $\sqrt{s} =$ 1.96 TeV are compared with the D0 measurement… ▽ More We describe an analysis comparing the $p\bar{p}$ elastic cross section as measured by the D0 Collaboration at a center-of-mass energy of 1.96 TeV to that in $pp$ collisions as measured by the TOTEM Collaboration at 2.76, 7, 8, and 13 TeV using a model-independent approach. The TOTEM cross sections extrapolated to a center-of-mass energy of $\sqrt{s} =$ 1.96 TeV are compared with the D0 measurement in the region of the diffractive minimum and the second maximum of the $pp$ cross section. The two data sets disagree at the 3.4$σ$ level and thus provide evidence for the $t$-channel exchange of a colorless, $C$-odd gluonic compound, also known as the odderon. We combine these results with a TOTEM analysis of the same $C$-odd exchange based on the total cross section and the ratio of the real to imaginary parts of the forward elastic scattering amplitude in $pp$ scattering. The combined significance of these results is larger than 5$σ$ and is interpreted as the first observation of the exchange of a colorless, $C$-odd gluonic compound. △ Less

Submitted 25 June, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

Comments: D0 and TOTEM Collaborations

Journal ref: Phys. Rev. Lett. 127, 062003 (2021)

arXiv:2007.07325 [pdf, other]

doi 10.1093/mnras/staa3849

Inverse energy transfer in decaying, three dimensional, nonhelical magnetic turbulence due to magnetic reconnection

Authors: Pallavi Bhat, Muni Zhou, Nuno F. Loureiro

Abstract: It has been recently shown numerically that there exists an inverse transfer of magnetic energy in decaying, nonhelical, magnetically dominated, magnetohydrodynamic turbulence in 3-dimensions (3D). We suggest that magnetic reconnection is the underlying physical mechanism responsible for this inverse transfer. In the two-dimensional (2D) case, the inverse transfer is easily inferred to be due to s… ▽ More It has been recently shown numerically that there exists an inverse transfer of magnetic energy in decaying, nonhelical, magnetically dominated, magnetohydrodynamic turbulence in 3-dimensions (3D). We suggest that magnetic reconnection is the underlying physical mechanism responsible for this inverse transfer. In the two-dimensional (2D) case, the inverse transfer is easily inferred to be due to smaller magnetic islands merging to form larger ones via reconnection. We find that the scaling behaviour is similar between the 2D and the 3D cases, i.e., the magnetic energy evolves as $t^{-1}$, and the magnetic power spectrum follows a slope of $k^{-2}$. We show that on normalizing time by the magnetic reconnection timescale, the evolution curves of the magnetic field in systems with different Lundquist numbers collapse onto one another. Furthermore, transfer function plots show signatures of magnetic reconnection driving the inverse transfer. We also discuss the conserved quantities in the system and show that the behaviour of these quantities is similar between the 2D and 3D simulations, thus making the case that the dynamics in 3D could be approximately explained by what we understand in 2D. Lastly, we also conduct simulations where the magnetic field is subdominant to the flow. Here, too, we find an inverse transfer of magnetic energy in 3D. In these simulations, the magnetic energy evolves as $ t^{-1.4}$ and, interestingly, a dynamo effect is observed. △ Less

Submitted 14 July, 2020; originally announced July 2020.

Comments: 13 pages, 18 figures

arXiv:2006.10583 [pdf]

doi 10.1038/s41567-020-0863-3

Particle Physics at Accelerators in the United States and Asia

Authors: Pushpalatha C. Bhat, Geoffrey N. Taylor

Abstract: Particle physics experiments in the United States and Asia have greatly contributed to the understanding of elementary particles and their interactions. With the recent discovery of the Higgs boson at CERN, interest in the development of next-generation colliders has been rekindled. A linear electron-positron collider in Japan and a circular collider in China have been proposed for precision studi… ▽ More Particle physics experiments in the United States and Asia have greatly contributed to the understanding of elementary particles and their interactions. With the recent discovery of the Higgs boson at CERN, interest in the development of next-generation colliders has been rekindled. A linear electron-positron collider in Japan and a circular collider in China have been proposed for precision studies of the Higgs boson. In addition to the Higgs programme, new accelerator-based long-baseline neutrino mega-facilities are being built in the United States and Japan. Here, we outline the present status of key particle physics programmes at accelerators and future plans in the United States and Asia that largely complement approaches being explored in the European Strategy for Particle Physics Update. We encourage the pursuit of this global approach, reaching beyond regional boundaries for optimized development and operations of major accelerator facilities worldwide, to ensure active and productive future of the field. △ Less

Submitted 18 June, 2020; originally announced June 2020.

Comments: 12 pages, 4 figures

Journal ref: Nat. Phys. 16, 380-385 (2020)

arXiv:2006.07251 [pdf, other]

doi 10.1038/s41586-019-1754-6

Observation of inverse Compton emission from a long $γ$-ray burst

Authors: V. A. Acciari, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, D. Baack, A. Babić, B. Banerjee, U. Barres de Almeida, J. A. Barrio, J. Becerra González, W. Bednarek, L. Bellizzi, E. Bernardini, A. Berti, J. Besenrieder, W. Bhattacharyya, C. Bigongiari, A. Biland, O. Blanch, G. Bonnoli, Ž. Bošnjak, G. Busetto, R. Carosi, G. Ceribella, Y. Chai , et al. (279 additional authors not shown)

Abstract: Long-duration gamma-ray bursts (GRBs) originate from ultra-relativistic jets launched from the collapsing cores of dying massive stars. They are characterised by an initial phase of bright and highly variable radiation in the keV-MeV band that is likely produced within the jet and lasts from milliseconds to minutes, known as the prompt emission. Subsequently, the interaction of the jet with the ex… ▽ More Long-duration gamma-ray bursts (GRBs) originate from ultra-relativistic jets launched from the collapsing cores of dying massive stars. They are characterised by an initial phase of bright and highly variable radiation in the keV-MeV band that is likely produced within the jet and lasts from milliseconds to minutes, known as the prompt emission. Subsequently, the interaction of the jet with the external medium generates external shock waves, responsible for the afterglow emission, which lasts from days to months, and occurs over a broad energy range, from the radio to the GeV bands. The afterglow emission is generally well explained as synchrotron radiation by electrons accelerated at the external shock. Recently, an intense, long-lasting emission between 0.2 and 1 TeV was observed from the GRB 190114C. Here we present the results of our multi-frequency observational campaign of GRB~190114C, and study the evolution in time of the GRB emission across 17 orders of magnitude in energy, from $5\times10^{-6}$ up to $10^{12}$\,eV. We find that the broadband spectral energy distribution is double-peaked, with the TeV emission constituting a distinct spectral component that has power comparable to the synchrotron component. This component is associated with the afterglow, and is satisfactorily explained by inverse Compton upscattering of synchrotron photons by high-energy electrons. We find that the conditions required to account for the observed TeV component are not atypical, supporting the possibility that inverse Compton emission is commonly produced in GRBs. △ Less

Submitted 12 June, 2020; originally announced June 2020.

Journal ref: Nature 575 (2019) 459-463

arXiv:2002.11460 [pdf, other]

doi 10.3847/1538-4357/ab7a18

The Fourth Fermi-GBM Gamma-Ray Burst Catalog: A Decade of Data

Authors: A. von Kienlin, C. A. Meegan, W. S. Paciesas, P. N. Bhat, E. Bissaldi, M. S. Briggs, E. Burns, W. H. Cleveland, M. H. Gibby, M. M. Giles, A. Goldstein, R. Hamburg, C. M. Hui, D. Kocevski, B. Mailyan, C. Malacaria, S. Poolakkil, R. D. Preece, O. J. Roberts, P. Veres, C. A. Wilson-Hodge

Abstract: We present the fourth in a series of catalogs of gamma-ray bursts (GRBs) observed with Fermi's Gamma-Ray Burst Monitor (Fermi-GBM). It extends the six year catalog by four more years, now covering the ten year time period from trigger enabling on 2008 July 12 to 2018 July 11. During this time period GBM triggered almost twice a day on transient events of which we identifyied 2356 as cosmic GRBs. A… ▽ More We present the fourth in a series of catalogs of gamma-ray bursts (GRBs) observed with Fermi's Gamma-Ray Burst Monitor (Fermi-GBM). It extends the six year catalog by four more years, now covering the ten year time period from trigger enabling on 2008 July 12 to 2018 July 11. During this time period GBM triggered almost twice a day on transient events of which we identifyied 2356 as cosmic GRBs. Additional trigger events were due to solar are events, magnetar burst activities, and terrestrial gamma-ray flashes. The intention of the GBM GRB catalog series is to provide updated information to the community on the most important observables of the GBM-detected GRBs. For each GRB the location and main characteristics of the prompt emission, the duration, peak flux, and fluence are derived. The latter two quantities are calculated for the 50-300 keV energy band, where the maximum energy release of GRBs in the instrument reference system is observed and also for a broader energy band from 10-1000 keV, exploiting the full energy range of GBM's low-energy detectors. Furthermore, information is given on the settings of the triggering criteria and exceptional operational conditions during years 7 to 10 in the mission. This fourth catalog is an official product of the Fermi-GBM science team, and the data files containing the complete results are available from the High-Energy Astrophysics Science Archive Research Center (HEASARC). △ Less

Submitted 14 April, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

Comments: 273 pages, 10 figures, 8 tables. This is a 10 year catalog update of arXiv:1603.07612

Journal ref: ApJ 893, 46 (2020)

arXiv:1912.10398 [pdf, other]

Estimation of Spectral Risk Measures

Authors: Ajay Kumar Pandey, Prashanth L. A., Sanjay P. Bhat

Abstract: We consider the problem of estimating a spectral risk measure (SRM) from i.i.d. samples, and propose a novel method that is based on numerical integration. We show that our SRM estimate concentrates exponentially, when the underlying distribution has bounded support. Further, we also consider the case when the underlying distribution is either Gaussian or exponential, and derive a concentration bo… ▽ More We consider the problem of estimating a spectral risk measure (SRM) from i.i.d. samples, and propose a novel method that is based on numerical integration. We show that our SRM estimate concentrates exponentially, when the underlying distribution has bounded support. Further, we also consider the case when the underlying distribution is either Gaussian or exponential, and derive a concentration bound for our estimation scheme. We validate the theoretical findings on a synthetic setup, and in a vehicular traffic routing application. △ Less

Submitted 22 December, 2019; originally announced December 2019.

Showing 1–50 of 172 results for author: Bhat, P