-
Automating Weak Label Generation for Data Programming with Clinicians in the Loop
Authors:
Jean Park,
Sydney Pugh,
Kaustubh Sridhar,
Mengyu Liu,
Navish Yarna,
Ramneet Kaur,
Souradeep Dutta,
Elena Bernardis,
Oleg Sokolsky,
Insup Lee
Abstract:
Large Deep Neural Networks (DNNs) are often data hungry and need high-quality labeled data in copious amounts for learning to converge. This is a challenge in the field of medicine since high quality labeled data is often scarce. Data programming has been the ray of hope in this regard, since it allows us to label unlabeled data using multiple weak labeling functions. Such functions are often supp…
▽ More
Large Deep Neural Networks (DNNs) are often data hungry and need high-quality labeled data in copious amounts for learning to converge. This is a challenge in the field of medicine since high quality labeled data is often scarce. Data programming has been the ray of hope in this regard, since it allows us to label unlabeled data using multiple weak labeling functions. Such functions are often supplied by a domain expert. Data-programming can combine multiple weak labeling functions and suggest labels better than simple majority voting over the different functions. However, it is not straightforward to express such weak labeling functions, especially in high-dimensional settings such as images and time-series data. What we propose in this paper is a way to bypass this issue, using distance functions. In high-dimensional spaces, it is easier to find meaningful distance metrics which can generalize across different labeling tasks. We propose an algorithm that queries an expert for labels of a few representative samples of the dataset. These samples are carefully chosen by the algorithm to capture the distribution of the dataset. The labels assigned by the expert on the representative subset induce a labeling on the full dataset, thereby generating weak labels to be used in the data programming pipeline. In our medical time series case study, labeling a subset of 50 to 130 out of 3,265 samples showed 17-28% improvement in accuracy and 13-28% improvement in F1 over the baseline using clinician-defined labeling functions. In our medical image case study, labeling a subset of about 50 to 120 images from 6,293 unlabeled medical images using our approach showed significant improvement over the baseline method, Snuba, with an increase of approximately 5-15% in accuracy and 12-19% in F1 score.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Recovery from Adversarial Attacks in Cyber-physical Systems: Shallow, Deep and Exploratory Works
Authors:
Pengyuan Lu,
Lin Zhang,
Mengyu Liu,
Kaustubh Sridhar,
Fanxin Kong,
Oleg Sokolsky,
Insup Lee
Abstract:
Cyber-physical systems (CPS) have experienced rapid growth in recent decades. However, like any other computer-based systems, malicious attacks evolve mutually, driving CPS to undesirable physical states and potentially causing catastrophes. Although the current state-of-the-art is well aware of this issue, the majority of researchers have not focused on CPS recovery, the procedure we defined as r…
▽ More
Cyber-physical systems (CPS) have experienced rapid growth in recent decades. However, like any other computer-based systems, malicious attacks evolve mutually, driving CPS to undesirable physical states and potentially causing catastrophes. Although the current state-of-the-art is well aware of this issue, the majority of researchers have not focused on CPS recovery, the procedure we defined as restoring a CPS's physical state back to a target condition under adversarial attacks. To call for attention on CPS recovery and identify existing efforts, we have surveyed a total of 30 relevant papers. We identify a major partition of the proposed recovery strategies: shallow recovery vs. deep recovery, where the former does not use a dedicated recovery controller while the latter does. Additionally, we surveyed exploratory research on topics that facilitate recovery. From these publications, we discuss the current state-of-the-art of CPS recovery, with respect to applications, attack type, attack surfaces and system dynamics. Then, we identify untouched sub-domains in this field and suggest possible future directions for researchers.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Bottomonia production in Modified NRQCD
Authors:
Sudhansu S. Biswal,
Monalisa Mohanty,
K. Sridhar
Abstract:
Motivated by the success of Modified Non-Relativistic Quantum Chromodynamics (Modified NRQCD) in explaining data from experiments at the Large Hadron Collider (LHC) for charmonia, we now turn to the study of bottomonium production at the LHC. Modified NRQCD does very well in explaining $Υ$ data from the LHC. But this is true also of NRQCD which explains the $Υ$ data equally well. Where the two mod…
▽ More
Motivated by the success of Modified Non-Relativistic Quantum Chromodynamics (Modified NRQCD) in explaining data from experiments at the Large Hadron Collider (LHC) for charmonia, we now turn to the study of bottomonium production at the LHC. Modified NRQCD does very well in explaining $Υ$ data from the LHC. But this is true also of NRQCD which explains the $Υ$ data equally well. Where the two models differ substantially is in their predictions for $η_b$ production. As was the case with $η_c$, the measurement of $η_b$ production at the LHC will be another decisive test of Modified NRQCD.
△ Less
Submitted 19 November, 2023;
originally announced November 2023.
-
ADAPT-QAOA with a classically inspired initial state
Authors:
Vishvesha K. Sridhar,
Yanzhu Chen,
Bryan Gard,
Edwin Barnes,
Sophia E. Economou
Abstract:
Quantum computing may provide advantage in solving classical optimization problems. One promising algorithm is the quantum approximate optimization algorithm (QAOA). There have been many proposals for improving this algorithm, such as using an initial state informed by classical approximation solutions. A variation of QAOA called ADAPT-QAOA constructs the ansatz dynamically and can speed up conver…
▽ More
Quantum computing may provide advantage in solving classical optimization problems. One promising algorithm is the quantum approximate optimization algorithm (QAOA). There have been many proposals for improving this algorithm, such as using an initial state informed by classical approximation solutions. A variation of QAOA called ADAPT-QAOA constructs the ansatz dynamically and can speed up convergence. However, it faces the challenge of frequently converging to excited states which correspond to local minima in the energy landscape, limiting its performance. In this work, we propose to start ADAPT-QAOA with an initial state inspired by a classical approximation algorithm. Through numerical simulations we show that this new algorithm can reach the same accuracy with fewer layers than the standard QAOA and the original ADAPT-QAOA. It also appears to be less prone to the problem of converging to excited states.
△ Less
Submitted 14 October, 2023;
originally announced October 2023.
-
Memory-Consistent Neural Networks for Imitation Learning
Authors:
Kaustubh Sridhar,
Souradeep Dutta,
Dinesh Jayaraman,
James Weimer,
Insup Lee
Abstract:
Imitation learning considerably simplifies policy synthesis compared to alternative approaches by exploiting access to expert demonstrations. For such imitation policies, errors away from the training samples are particularly critical. Even rare slip-ups in the policy action outputs can compound quickly over time, since they lead to unfamiliar future states where the policy is still more likely to…
▽ More
Imitation learning considerably simplifies policy synthesis compared to alternative approaches by exploiting access to expert demonstrations. For such imitation policies, errors away from the training samples are particularly critical. Even rare slip-ups in the policy action outputs can compound quickly over time, since they lead to unfamiliar future states where the policy is still more likely to err, eventually causing task failures. We revisit simple supervised ``behavior cloning'' for conveniently training the policy from nothing more than pre-recorded demonstrations, but carefully design the model class to counter the compounding error phenomenon. Our ``memory-consistent neural network'' (MCNN) outputs are hard-constrained to stay within clearly specified permissible regions anchored to prototypical ``memory'' training samples. We provide a guaranteed upper bound for the sub-optimality gap induced by MCNN policies. Using MCNNs on 10 imitation learning tasks, with MLP, Transformer, and Diffusion backbones, spanning dexterous robotic manipulation and driving, proprioceptive inputs and visual inputs, and varying sizes and types of demonstration data, we find large and consistent gains in performance, validating that MCNNs are better-suited than vanilla deep neural networks for imitation learning applications. Website: https://sites.google.com/view/mcnn-imitation
△ Less
Submitted 16 March, 2024; v1 submitted 9 October, 2023;
originally announced October 2023.
-
Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation
Authors:
Arvind Krishna Sridhar,
Yinyi Guo,
Erik Visser,
Rehana Mahfuz
Abstract:
There has been significant research on develo** pretrained transformer architectures for multimodal-to-text generation tasks. Albeit performance improvements, such models are frequently overparameterized, hence suffer from hallucination and large memory footprint making them challenging to deploy on edge devices. In this paper, we address both these issues for the application of automated audio…
▽ More
There has been significant research on develo** pretrained transformer architectures for multimodal-to-text generation tasks. Albeit performance improvements, such models are frequently overparameterized, hence suffer from hallucination and large memory footprint making them challenging to deploy on edge devices. In this paper, we address both these issues for the application of automated audio captioning. First, we propose a data augmentation technique for generating hallucinated audio captions and show that similarity based on an audio-text shared latent space is suitable for detecting hallucination. Then, we propose a parameter efficient inference time faithful decoding algorithm that enables smaller audio captioning models with performance equivalent to larger models trained with more data. During the beam decoding step, the smaller model utilizes an audio-text shared latent representation to semantically align the generated text with corresponding input audio. Faithful guidance is introduced into the beam probability by incorporating the cosine similarity between latent representation projections of greedy rolled out intermediate beams and audio clip. We show the efficacy of our algorithm on benchmark datasets and evaluate the proposed scheme against baselines using conventional audio captioning and semantic similarity metrics while illustrating tradeoffs between performance and complexity.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Detecting False Alarms and Misses in Audio Captions
Authors:
Rehana Mahfuz,
Yinyi Guo,
Arvind Krishna Sridhar,
Erik Visser
Abstract:
Metrics to evaluate audio captions simply provide a score without much explanation regarding what may be wrong in case the score is low. Manual human intervention is needed to find any shortcomings of the caption. In this work, we introduce a metric which automatically identifies the shortcomings of an audio caption by detecting the misses and false alarms in a candidate caption with respect to a…
▽ More
Metrics to evaluate audio captions simply provide a score without much explanation regarding what may be wrong in case the score is low. Manual human intervention is needed to find any shortcomings of the caption. In this work, we introduce a metric which automatically identifies the shortcomings of an audio caption by detecting the misses and false alarms in a candidate caption with respect to a reference caption, and reports the recall, precision and F-score. Such a metric is very useful in profiling the deficiencies of an audio captioning model, which is a milestone towards improving the quality of audio captions.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Quantized topological energy pum** and Weyl points in Floquet synthetic dimensions with a driven-dissipative photonic molecule
Authors:
Sashank Kaushik Sridhar,
Sayan Ghosh,
Avik Dutt
Abstract:
Topological effects manifest in a wide range of physical systems, such as solid crystals, acoustic waves, photonic materials and cold atoms. These effects are characterized by `topological invariants' which are typically integer-valued, and lead to robust quantized channels of transport in space, time, and other degrees of freedom. The temporal channel, in particular, allows one to achieve higher-…
▽ More
Topological effects manifest in a wide range of physical systems, such as solid crystals, acoustic waves, photonic materials and cold atoms. These effects are characterized by `topological invariants' which are typically integer-valued, and lead to robust quantized channels of transport in space, time, and other degrees of freedom. The temporal channel, in particular, allows one to achieve higher-dimensional topological effects, by driving the system with multiple incommensurate frequencies. However, dissipation is generally detrimental to such topological effects, particularly when the systems consist of quantum spins or qubits. Here we introduce a photonic molecule subjected to multiple RF/optical drives and dissipation as a promising candidate system to observe quantized transport along Floquet synthetic dimensions. Topological energy pum** in the incommensurately modulated photonic molecule is enhanced by the driven-dissipative nature of our platform. Furthermore, we provide a path to realizing Weyl points and measuring the Berry curvature emanating from these reciprocal-space ($k$-space) magnetic monopoles, illustrating the capabilities for higher-dimensional topological Hamiltonian simulation in this platform. Our approach enables direct $k$-space engineering of a wide variety of Hamiltonians using modulation bandwidths that are well below the free-spectral range (FSR) of integrated photonic cavities.
△ Less
Submitted 3 May, 2023;
originally announced May 2023.
-
Striving for Authentic and Sustained Technology Use In the Classroom: Lessons Learned from a Longitudinal Evaluation of a Sensor-based Science Education Platform
Authors:
Yvonne Chua,
Sankha Cooray,
Juan Pablo Forero Cortes,
Paul Denny,
Sonia Dupuch,
Dawn L Garbett,
Alaeddin Nassani,
Jiashuo Cao,
Hannah Qiao,
Andrew Reis,
Deviana Reis,
Philipp M. Scholl,
Priyashri Kamlesh Sridhar,
Hussel Suriyaarachchi,
Fiona Taimana,
Vanessa Tanga,
Chamod Weerasinghe,
Elliott Wen,
Michelle Wu,
Qin Wu,
Haimo Zhang,
Suranga Nanayakkara
Abstract:
Technology integration in educational settings has led to the development of novel sensor-based tools that enable students to measure and interact with their environment. Although reports from using such tools can be positive, evaluations are often conducted under controlled conditions and short timeframes. There is a need for longitudinal data collected in realistic classroom settings. However, s…
▽ More
Technology integration in educational settings has led to the development of novel sensor-based tools that enable students to measure and interact with their environment. Although reports from using such tools can be positive, evaluations are often conducted under controlled conditions and short timeframes. There is a need for longitudinal data collected in realistic classroom settings. However, sustained and authentic classroom use requires technology platforms to be seen by teachers as both easy to use and of value. We describe our development of a sensor-based platform to support science teaching that followed a 14-month user-centered design process. We share insights from this design and development approach, and report findings from a 6-month large-scale evaluation involving 35 schools and 1245 students. We share lessons learnt, including that technology integration is not an educational goal per se and that technology should be a transparent tool to enable students to achieve their learning goals.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
Resolution of the LHCb $η_c$ anomaly
Authors:
Sudhansu S. Biswal,
Sushree S. Mishra,
K. Sridhar
Abstract:
Due to the heavy-quark symmetry of Non-Relativistic Quantum Chromodynamics (NRQCD), the cross-section for the production of $η_c$ can be predicted. This NRQCD prediction when confronted with data from the LHCb is seen to fail miserably. We address this LHCb $η_c$ anomaly in this paper using a new approach called modified NRQCD, an approach that has been shown to work extremely well for studying…
▽ More
Due to the heavy-quark symmetry of Non-Relativistic Quantum Chromodynamics (NRQCD), the cross-section for the production of $η_c$ can be predicted. This NRQCD prediction when confronted with data from the LHCb is seen to fail miserably. We address this LHCb $η_c$ anomaly in this paper using a new approach called modified NRQCD, an approach that has been shown to work extremely well for studying $J/ψ$, $ψ^{\prime}$ and $χ_c$ production at the LHC. We show, in the present paper, that the predictions for $η_c$ production agrees very well with LHCb measurements at the three different values of energy that the experiment has presented data for. Modified NRQCD also explains the intriguing agreement of the LHCb $η_c$ data with the colour-singlet prediction. The remarkable agreement of the theoretical predictions with the LHCb data suggests that modified NRQCD is closer to apprehending the true dynamics of quarkonium production.
△ Less
Submitted 8 January, 2023;
originally announced January 2023.
-
Improved Beam Search for Hallucination Mitigation in Abstractive Summarization
Authors:
Arvind Krishna Sridhar,
Erik Visser
Abstract:
Advancement in large pretrained language models has significantly improved their performance for conditional language generation tasks including summarization albeit with hallucinations. To reduce hallucinations, conventional methods proposed improving beam search or using a fact checker as a postprocessing step. In this paper, we investigate the use of the Natural Language Inference (NLI) entailm…
▽ More
Advancement in large pretrained language models has significantly improved their performance for conditional language generation tasks including summarization albeit with hallucinations. To reduce hallucinations, conventional methods proposed improving beam search or using a fact checker as a postprocessing step. In this paper, we investigate the use of the Natural Language Inference (NLI) entailment metric to detect and prevent hallucinations in summary generation. We propose an NLI-assisted beam re-ranking mechanism by computing entailment probability scores between the input context and summarization model-generated beams during saliency-enhanced greedy decoding. Moreover, a diversity metric is introduced to compare its effectiveness against vanilla beam search. Our proposed algorithm significantly outperforms vanilla beam decoding on XSum and CNN/DM datasets.
△ Less
Submitted 14 November, 2023; v1 submitted 5 December, 2022;
originally announced December 2022.
-
Predict-and-Critic: Accelerated End-to-End Predictive Control for Cloud Computing through Reinforcement Learning
Authors:
Kaustubh Sridhar,
Vikramank Singh,
Balakrishnan Narayanaswamy,
Abishek Sankararaman
Abstract:
Cloud computing holds the promise of reduced costs through economies of scale. To realize this promise, cloud computing vendors typically solve sequential resource allocation problems, where customer workloads are packed on shared hardware. Virtual machines (VM) form the foundation of modern cloud computing as they help logically abstract user compute from shared physical infrastructure. Tradition…
▽ More
Cloud computing holds the promise of reduced costs through economies of scale. To realize this promise, cloud computing vendors typically solve sequential resource allocation problems, where customer workloads are packed on shared hardware. Virtual machines (VM) form the foundation of modern cloud computing as they help logically abstract user compute from shared physical infrastructure. Traditionally, VM packing problems are solved by predicting demand, followed by a Model Predictive Control (MPC) optimization over a future horizon. We introduce an approximate formulation of an industrial VM packing problem as an MILP with soft-constraints parameterized by the predictions. Recently, predict-and-optimize (PnO) was proposed for end-to-end training of prediction models by back-propagating the cost of decisions through the optimization problem. But, PnO is unable to scale to the large prediction horizons prevalent in cloud computing. To tackle this issue, we propose the Predict-and-Critic (PnC) framework that outperforms PnO with just a two-step horizon by leveraging reinforcement learning. PnC jointly trains a prediction model and a terminal Q function that approximates cost-to-go over a long horizon, by back-propagating the cost of decisions through the optimization problem \emph{and from the future}. The terminal Q function allows us to solve a much smaller two-step horizon optimization problem than the multi-step horizon necessary in PnO. We evaluate PnO and the PnC framework on two datasets, three workloads, and with disturbances not modeled in the optimization problem. We find that PnC significantly improves decision quality over PnO, even when the optimization problem is not a perfect representation of reality. We also find that hardening the soft constraints of the MILP and back-propagating through the constraints improves decision quality for both PnO and PnC.
△ Less
Submitted 27 February, 2023; v1 submitted 2 December, 2022;
originally announced December 2022.
-
Guaranteed Conformance of Neurosymbolic Models to Natural Constraints
Authors:
Kaustubh Sridhar,
Souradeep Dutta,
James Weimer,
Insup Lee
Abstract:
Deep neural networks have emerged as the workhorse for a large section of robotics and control applications, especially as models for dynamical systems. Such data-driven models are in turn used for designing and verifying autonomous systems. They are particularly useful in modeling medical systems where data can be leveraged to individualize treatment. In safety-critical applications, it is import…
▽ More
Deep neural networks have emerged as the workhorse for a large section of robotics and control applications, especially as models for dynamical systems. Such data-driven models are in turn used for designing and verifying autonomous systems. They are particularly useful in modeling medical systems where data can be leveraged to individualize treatment. In safety-critical applications, it is important that the data-driven model is conformant to established knowledge from the natural sciences. Such knowledge is often available or can often be distilled into a (possibly black-box) model. For instance, an F1 racing car should conform to Newton's laws (which are encoded within a unicycle model). In this light, we consider the following problem - given a model $M$ and a state transition dataset, we wish to best approximate the system model while being a bounded distance away from $M$. We propose a method to guarantee this conformance. Our first step is to distill the dataset into a few representative samples called memories, using the idea of a growing neural gas. Next, using these memories we partition the state space into disjoint subsets and compute bounds that should be respected by the neural network in each subset. This serves as a symbolic wrapper for guaranteed conformance. We argue theoretically that this only leads to a bounded increase in approximation error; which can be controlled by increasing the number of memories. We experimentally show that on three case studies (Car Model, Drones, and Artificial Pancreas), our constrained neurosymbolic models conform to specified models (each encoding various constraints) with order-of-magnitude improvements compared to the augmented Lagrangian and vanilla training methods. Our code can be found at: https://github.com/kaustubhsridhar/Constrained_Models
△ Less
Submitted 7 November, 2023; v1 submitted 2 December, 2022;
originally announced December 2022.
-
Activity report analysis with automatic single or multispan answer extraction
Authors:
Ravi Choudhary,
Arvind Krishna Sridhar,
Erik Visser
Abstract:
In the era of loT (Internet of Things) we are surrounded by a plethora of Al enabled devices that can transcribe images, video, audio, and sensors signals into text descriptions. When such transcriptions are captured in activity reports for monitoring, life logging and anomaly detection applications, a user would typically request a summary or ask targeted questions about certain sections of the r…
▽ More
In the era of loT (Internet of Things) we are surrounded by a plethora of Al enabled devices that can transcribe images, video, audio, and sensors signals into text descriptions. When such transcriptions are captured in activity reports for monitoring, life logging and anomaly detection applications, a user would typically request a summary or ask targeted questions about certain sections of the report they are interested in. Depending on the context and the type of question asked, a question answering (QA) system would need to automatically determine whether the answer covers single-span or multi-span text components. Currently available QA datasets primarily focus on single span responses only (such as SQuAD[4]) or contain a low proportion of examples with multiple span answers (such as DROP[3]). To investigate automatic selection of single/multi-span answers in the use case described, we created a new smart home environment dataset comprised of questions paired with single-span or multi-span answers depending on the question and context queried. In addition, we propose a RoBERTa[6]-based multiple span extraction question answering (MSEQA) model returning the appropriate answer span for a given question. Our experiments show that the proposed model outperforms state-of-the-art QA models on our dataset while providing comparable performance on published individual single/multi-span task datasets.
△ Less
Submitted 9 September, 2022;
originally announced September 2022.
-
Bias Reducing Multitask Learning on Mental Health Prediction
Authors:
Khadija Zanna,
Kusha Sridhar,
Han Yu,
Akane Sano
Abstract:
There has been an increase in research in develo** machine learning models for mental health detection or prediction in recent years due to increased mental health issues in society. Effective use of mental health prediction or detection models can help mental health practitioners re-define mental illnesses more objectively than currently done, and identify illnesses at an earlier stage when int…
▽ More
There has been an increase in research in develo** machine learning models for mental health detection or prediction in recent years due to increased mental health issues in society. Effective use of mental health prediction or detection models can help mental health practitioners re-define mental illnesses more objectively than currently done, and identify illnesses at an earlier stage when interventions may be more effective. However, there is still a lack of standard in evaluating bias in such machine learning models in the field, which leads to challenges in providing reliable predictions and in addressing disparities. This lack of standards persists due to factors such as technical difficulties, complexities of high dimensional clinical health data, etc., which are especially true for physiological signals. This along with prior evidence of relations between some physiological signals with certain demographic identities restates the importance of exploring bias in mental health prediction models that utilize physiological signals. In this work, we aim to perform a fairness analysis and implement a multi-task learning based bias mitigation method on anxiety prediction models using ECG data. Our method is based on the idea of epistemic uncertainty and its relationship with model weights and feature space representation. Our analysis showed that our anxiety prediction base model introduced some bias with regards to age, income, ethnicity, and whether a participant is born in the U.S. or not, and our bias mitigation method performed better at reducing the bias in the model, when compared to the reweighting mitigation technique. Our analysis on feature importance also helped identify relationships between heart rate variability and multiple demographic grou**s.
△ Less
Submitted 6 August, 2022;
originally announced August 2022.
-
CODiT: Conformal Out-of-Distribution Detection in Time-Series Data
Authors:
Ramneet Kaur,
Kaustubh Sridhar,
Sangdon Park,
Susmit Jha,
Anirban Roy,
Oleg Sokolsky,
Insup Lee
Abstract:
Machine learning models are prone to making incorrect predictions on inputs that are far from the training distribution. This hinders their deployment in safety-critical applications such as autonomous vehicles and healthcare. The detection of a shift from the training distribution of individual datapoints has gained attention. A number of techniques have been proposed for such out-of-distribution…
▽ More
Machine learning models are prone to making incorrect predictions on inputs that are far from the training distribution. This hinders their deployment in safety-critical applications such as autonomous vehicles and healthcare. The detection of a shift from the training distribution of individual datapoints has gained attention. A number of techniques have been proposed for such out-of-distribution (OOD) detection. But in many applications, the inputs to a machine learning model form a temporal sequence. Existing techniques for OOD detection in time-series data either do not exploit temporal relationships in the sequence or do not provide any guarantees on detection. We propose using deviation from the in-distribution temporal equivariance as the non-conformity measure in conformal anomaly detection framework for OOD detection in time-series data.Computing independent predictions from multiple conformal detectors based on the proposed measure and combining these predictions by Fisher's method leads to the proposed detector CODiT with guarantees on false detection in time-series data. We illustrate the efficacy of CODiT by achieving state-of-the-art results on computer vision datasets in autonomous driving. We also show that CODiT can be used for OOD detection in non-vision datasets by performing experiments on the physiological GAIT sensory dataset. Code, data, and trained models are available at https://github.com/kaustubhsridhar/time-series-OOD.
△ Less
Submitted 24 July, 2022;
originally announced July 2022.
-
$χ_c$ production in modified NRQCD
Authors:
Sudhansu S. Biswal,
Sushree S. Mishra,
K. Sridhar
Abstract:
In a previous paper, we had modified Non-Relativistic QCD as it applies to quarkonium production by taking into account the effect of perturbative soft-gluon emission from the colour-octet quarkonium states. We tested the model by fitting the unknown non-perturbative parameter in the model from Tevatron data and using that to make parameter-free predictions for $J/ψ$ and $ψ'$ production at the LHC…
▽ More
In a previous paper, we had modified Non-Relativistic QCD as it applies to quarkonium production by taking into account the effect of perturbative soft-gluon emission from the colour-octet quarkonium states. We tested the model by fitting the unknown non-perturbative parameter in the model from Tevatron data and using that to make parameter-free predictions for $J/ψ$ and $ψ'$ production at the LHC. In this paper, we study $χ_c$ production: we fit as before the unknown matrix-element using data from Tevatron. We, then, extend the results of the previous paper for $J/ψ$ production by calculating the effect of $χ_c$ feed-down to the $J/ψ$ cross-section, which, by comparing with CMS results at $\sqrt{s}=$ 13 TeV, we demonstrate to be small. We have also computed $χ_c^1$ and $χ_c^2$ at $\sqrt{s}=$7 TeV and find excellent agreement with data from the ATLAS experiment.
△ Less
Submitted 30 June, 2022;
originally announced June 2022.
-
Towards Alternative Techniques for Improving Adversarial Robustness: Analysis of Adversarial Training at a Spectrum of Perturbations
Authors:
Kaustubh Sridhar,
Souradeep Dutta,
Ramneet Kaur,
James Weimer,
Oleg Sokolsky,
Insup Lee
Abstract:
Adversarial training (AT) and its variants have spearheaded progress in improving neural network robustness to adversarial perturbations and common corruptions in the last few years. Algorithm design of AT and its variants are focused on training models at a specified perturbation strength $ε$ and only using the feedback from the performance of that $ε$-robust model to improve the algorithm. In th…
▽ More
Adversarial training (AT) and its variants have spearheaded progress in improving neural network robustness to adversarial perturbations and common corruptions in the last few years. Algorithm design of AT and its variants are focused on training models at a specified perturbation strength $ε$ and only using the feedback from the performance of that $ε$-robust model to improve the algorithm. In this work, we focus on models, trained on a spectrum of $ε$ values. We analyze three perspectives: model performance, intermediate feature precision and convolution filter sensitivity. In each, we identify alternative improvements to AT that otherwise wouldn't have been apparent at a single $ε$. Specifically, we find that for a PGD attack at some strength $δ$, there is an AT model at some slightly larger strength $ε$, but no greater, that generalizes best to it. Hence, we propose overdesigning for robustness where we suggest training models at an $ε$ just above $δ$. Second, we observe (across various $ε$ values) that robustness is highly sensitive to the precision of intermediate features and particularly those after the first and second layer. Thus, we propose adding a simple quantization to defenses that improves accuracy on seen and unseen adaptive attacks. Third, we analyze convolution filters of each layer of models at increasing $ε$ and notice that those of the first and second layer may be solely responsible for amplifying input perturbations. We present our findings and demonstrate our techniques through experiments with ResNet and WideResNet models on the CIFAR-10 and CIFAR-10-C datasets.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
A Framework for Checkpointing and Recovery of Hierarchical Cyber-Physical Systems
Authors:
Kaustubh Sridhar,
Radoslav Ivanov,
Vuk Lesi,
Marcio Juliato,
Manoj Sastry,
Lily Yang,
James Weimer,
Oleg Sokolsky,
Insup Lee
Abstract:
This paper tackles the problem of making complex resource-constrained cyber-physical systems (CPS) resilient to sensor anomalies. In particular, we present a framework for checkpointing and roll-forward recovery of state-estimates in nonlinear, hierarchical CPS with anomalous sensor data. We introduce three checkpointing paradigms for ensuring different levels of checkpointing consistency across t…
▽ More
This paper tackles the problem of making complex resource-constrained cyber-physical systems (CPS) resilient to sensor anomalies. In particular, we present a framework for checkpointing and roll-forward recovery of state-estimates in nonlinear, hierarchical CPS with anomalous sensor data. We introduce three checkpointing paradigms for ensuring different levels of checkpointing consistency across the hierarchy. Our framework has algorithms implementing the consistent paradigm to perform accurate recovery in a time-efficient manner while managing the tradeoff with system resources and handling the interplay between diverse anomaly detection systems across the hierarchy. Further in this work, we detail bounds on the recovered state-estimate error, maximum tolerable anomaly duration and the accuracy-resource gap that results from the aforementioned tradeoff. We explore use-cases for our framework and evaluate it on a case study of a simulated ground robot to show that it scales to multiple hierarchies and performs better than an extended Kalman filter (EKF) that does not incorporate a checkpointing procedure during sensor anomalies. We conclude the work with a discussion on extending the proposed framework to distributed systems.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates
Authors:
Souradeep Dutta,
Kaustubh Sridhar,
Osbert Bastani,
Edgar Dobriban,
James Weimer,
Insup Lee,
Julia Parish-Morris
Abstract:
Long horizon robot learning tasks with sparse rewards pose a significant challenge for current reinforcement learning algorithms. A key feature enabling humans to learn challenging control tasks is that they often receive expert intervention that enables them to understand the high-level structure of the task before mastering low-level control actions. We propose a framework for leveraging expert…
▽ More
Long horizon robot learning tasks with sparse rewards pose a significant challenge for current reinforcement learning algorithms. A key feature enabling humans to learn challenging control tasks is that they often receive expert intervention that enables them to understand the high-level structure of the task before mastering low-level control actions. We propose a framework for leveraging expert intervention to solve long-horizon reinforcement learning tasks. We consider \emph{option templates}, which are specifications encoding a potential option that can be trained using reinforcement learning. We formulate expert intervention as allowing the agent to execute option templates before learning an implementation. This enables them to use an option, before committing costly resources to learning it. We evaluate our approach on three challenging reinforcement learning problems, showing that it outperforms state-of-the-art approaches by two orders of magnitude. Videos of trained agents and our code can be found at: https://sites.google.com/view/stickymittens
△ Less
Submitted 17 November, 2022; v1 submitted 25 February, 2022;
originally announced February 2022.
-
More to Less (M2L): Enhanced Health Recognition in the Wild with Reduced Modality of Wearable Sensors
Authors:
Huiyuan Yang,
Han Yu,
Kusha Sridhar,
Thomas Vaessen,
Inez Myin-Germeys,
Akane Sano
Abstract:
Accurately recognizing health-related conditions from wearable data is crucial for improved healthcare outcomes. To improve the recognition accuracy, various approaches have focused on how to effectively fuse information from multiple sensors. Fusing multiple sensors is a common scenario in many applications, but may not always be feasible in real-world scenarios. For example, although combining b…
▽ More
Accurately recognizing health-related conditions from wearable data is crucial for improved healthcare outcomes. To improve the recognition accuracy, various approaches have focused on how to effectively fuse information from multiple sensors. Fusing multiple sensors is a common scenario in many applications, but may not always be feasible in real-world scenarios. For example, although combining bio-signals from multiple sensors (i.e., a chest pad sensor and a wrist wearable sensor) has been proved effective for improved performance, wearing multiple devices might be impractical in the free-living context. To solve the challenges, we propose an effective more to less (M2L) learning framework to improve testing performance with reduced sensors through leveraging the complementary information of multiple modalities during training. More specifically, different sensors may carry different but complementary information, and our model is designed to enforce collaborations among different modalities, where positive knowledge transfer is encouraged and negative knowledge transfer is suppressed, so that better representation is learned for individual modalities. Our experimental results show that our framework achieves comparable performance when compared with the full modalities. Our code and results will be available at https://github.com/compwell-org/More2Less.git.
△ Less
Submitted 16 February, 2022;
originally announced February 2022.
-
Understanding $J/ψ$ and $ψ'$ production using a modified version of Non-Relativistic Quantum Chromodynamics
Authors:
Sudhansu S. Biswal,
Sushree S. Mishra,
K. Sridhar
Abstract:
There is serious disagreement between the predictions of Non-Relativistic Quantum Chromodynamics (NRQCD) and the data on $J/ψ$ polarisation which has persisted for almost a quarter of a century. We find that if we account for the effect of perturbative soft gluons on the intermediate charm-anticharm octet states in NRQCD then the polarisation problem can be resolved. In addition, this model, when…
▽ More
There is serious disagreement between the predictions of Non-Relativistic Quantum Chromodynamics (NRQCD) and the data on $J/ψ$ polarisation which has persisted for almost a quarter of a century. We find that if we account for the effect of perturbative soft gluons on the intermediate charm-anticharm octet states in NRQCD then the polarisation problem can be resolved. In addition, this model, when used to fit the Run 1 data on $J/ψ$ and $ψ'$ production from the CDF experiment at Tevatron, gives good fits and yields values of (energy-independent) non-perturbative parameters. These, in turn, can be used to make parameter-free predictions for $J/ψ$ and $ψ'$ data from the CMS experiment at the Large Hadron Collider and the predictions are in excellent agreement with the CMS data.
△ Less
Submitted 23 January, 2022;
originally announced January 2022.
-
Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech
Authors:
Kusha Sridhar,
Carlos Busso
Abstract:
The prediction of valence from speech is an important, but challenging problem. The externalization of valence in speech has speaker-dependent cues, which contribute to performances that are often significantly lower than the prediction of other emotional attributes such as arousal and dominance. A practical approach to improve valence prediction from speech is to adapt the models to the target sp…
▽ More
The prediction of valence from speech is an important, but challenging problem. The externalization of valence in speech has speaker-dependent cues, which contribute to performances that are often significantly lower than the prediction of other emotional attributes such as arousal and dominance. A practical approach to improve valence prediction from speech is to adapt the models to the target speakers in the test set. Adapting a speech emotion recognition (SER) system to a particular speaker is a hard problem, especially with deep neural networks (DNNs), since it requires optimizing millions of parameters. This study proposes an unsupervised approach to address this problem by searching for speakers in the train set with similar acoustic patterns as the speaker in the test set. Speech samples from the selected speakers are used to create the adaptation set. This approach leverages transfer learning using pre-trained models, which are adapted with these speech samples. We propose three alternative adaptation strategies: unique speaker, oversampling and weighting approaches. These methods differ on the use of the adaptation set in the personalization of the valence models. The results demonstrate that a valence prediction model can be efficiently personalized with these unsupervised approaches, leading to relative improvements as high as 13.52%.
△ Less
Submitted 19 January, 2022;
originally announced January 2022.
-
Active resonator depletion with short microwave pulses
Authors:
Sashank Kaushik Sridhar,
David P. DiVincenzo
Abstract:
We propose a physical model to explain the phenomenon of photon depletion in superconducting microwave resonators in the dispersive regime, coupled to Josephson junction qubits, via short microwave pulses. We discuss the conditions for matching the amplitude and phase of the pulse optimally within the framework of the model, allowing for significant reductions in reset times after measurement of t…
▽ More
We propose a physical model to explain the phenomenon of photon depletion in superconducting microwave resonators in the dispersive regime, coupled to Josephson junction qubits, via short microwave pulses. We discuss the conditions for matching the amplitude and phase of the pulse optimally within the framework of the model, allowing for significant reductions in reset times after measurement of the qubits. We consider how to deal with pulses and transient dynamics within the input-output formalism, along with a reassessment of the underlying assumptions for a wide-band pulse.
△ Less
Submitted 21 October, 2021;
originally announced October 2021.
-
HypoGen: Hyperbole Generation with Commonsense and Counterfactual Knowledge
Authors:
Yufei Tian,
Arvind krishna Sridhar,
Nanyun Peng
Abstract:
A hyperbole is an intentional and creative exaggeration not to be taken literally. Despite its ubiquity in daily life, the computational explorations of hyperboles are scarce. In this paper, we tackle the under-explored and challenging task: sentence-level hyperbole generation. We start with a representative syntactic pattern for intensification and systematically study the semantic (commonsense a…
▽ More
A hyperbole is an intentional and creative exaggeration not to be taken literally. Despite its ubiquity in daily life, the computational explorations of hyperboles are scarce. In this paper, we tackle the under-explored and challenging task: sentence-level hyperbole generation. We start with a representative syntactic pattern for intensification and systematically study the semantic (commonsense and counterfactual) relationships between each component in such hyperboles. Next, we leverage the COMeT and reverse COMeT models to do commonsense and counterfactual inference. We then generate multiple hyperbole candidates based on our findings from the pattern, and train neural classifiers to rank and select high-quality hyperboles. Automatic and human evaluations show that our generation method is able to generate hyperboles creatively with high success rate and intensity scores.
△ Less
Submitted 10 September, 2021;
originally announced September 2021.
-
Testing charge-radius coupling of the composite Higgs boson at hadron colliders
Authors:
G. Cacciapaglia,
S. Gascon-Shotkin,
A. Lesauvage,
N. Manglani,
K. Sridhar
Abstract:
We explore the collider relevance of a charge-radius coupling among light mesons in composite Higgs models. In particular, we focus of a coupling of the photon to the composite Higgs and a composite singlet, arising from isospin violation in the underlying theory. This coupling offers a deep probe of the composite nature of the Higgs mechanism, being sensitive to the electromagnetic and weak isosp…
▽ More
We explore the collider relevance of a charge-radius coupling among light mesons in composite Higgs models. In particular, we focus of a coupling of the photon to the composite Higgs and a composite singlet, arising from isospin violation in the underlying theory. This coupling offers a deep probe of the composite nature of the Higgs mechanism, being sensitive to the electromagnetic and weak isospin structure of its constituents. The main collider effect consists in the production of the Higgs boson in association with a light composite pseudo-scalar. We present an exploratory cut-and-count analysis at hadron colliders, like the LHC, showing that an efficient background suppression can be achieved. More sophisticated techniques, however, are necessary to select a sufficient number of signal events, due to the small production rates. This justifies further investigation of this channel, which is highly complementary to other searches for compositeness in the Higgs sector.
△ Less
Submitted 2 September, 2022; v1 submitted 6 August, 2021;
originally announced August 2021.
-
Joint Learning of Portrait Intrinsic Decomposition and Relighting
Authors:
Mona Zehni,
Shaona Ghosh,
Krishna Sridhar,
Sethu Raman
Abstract:
Inverse rendering is the problem of decomposing an image into its intrinsic components, i.e. albedo, normal and lighting. To solve this ill-posed problem from single image, state-of-the-art methods in shape from shading mostly resort to supervised training on all the components on either synthetic or real datasets. Here, we propose a new self-supervised training paradigm that 1) reduces the need f…
▽ More
Inverse rendering is the problem of decomposing an image into its intrinsic components, i.e. albedo, normal and lighting. To solve this ill-posed problem from single image, state-of-the-art methods in shape from shading mostly resort to supervised training on all the components on either synthetic or real datasets. Here, we propose a new self-supervised training paradigm that 1) reduces the need for full supervision on the decomposition task and 2) takes into account the relighting task. We introduce new self-supervised loss terms that leverage the consistencies between multi-lit images (images of the same scene under different illuminations). Our approach is applicable to multi-lit datasets. We apply our training approach in two settings: 1) train on a mixture of synthetic and real data, 2) train on real datasets with limited supervision. We show-case the effectiveness of our training paradigm on both intrinsic decomposition and relighting and demonstrate how the model struggles in both tasks without the self-supervised loss terms in limited supervision settings. We provide results of comprehensive experiments on SfSNet, CelebA and Photoface datasets and verify the performance of our approach on images in the wild.
△ Less
Submitted 22 June, 2021;
originally announced June 2021.
-
Improving Neural Network Robustness via Persistency of Excitation
Authors:
Kaustubh Sridhar,
Oleg Sokolsky,
Insup Lee,
James Weimer
Abstract:
Improving adversarial robustness of neural networks remains a major challenge. Fundamentally, training a neural network via gradient descent is a parameter estimation problem. In adaptive control, maintaining persistency of excitation (PoE) is integral to ensuring convergence of parameter estimates in dynamical systems to their true values. We show that parameter estimation with gradient descent c…
▽ More
Improving adversarial robustness of neural networks remains a major challenge. Fundamentally, training a neural network via gradient descent is a parameter estimation problem. In adaptive control, maintaining persistency of excitation (PoE) is integral to ensuring convergence of parameter estimates in dynamical systems to their true values. We show that parameter estimation with gradient descent can be modeled as a sampling of an adaptive linear time-varying continuous system. Leveraging this model, and with inspiration from Model-Reference Adaptive Control (MRAC), we prove a sufficient condition to constrain gradient descent updates to reference persistently excited trajectories converging to the true parameters. The sufficient condition is achieved when the learning rate is less than the inverse of the Lipschitz constant of the gradient of loss function. We provide an efficient technique for estimating the corresponding Lipschitz constant in practice using extreme value theory. Our experimental results in both standard and adversarial training illustrate that networks trained with the PoE-motivated learning rate schedule have similar clean accuracy but are significantly more robust to adversarial attacks than models trained using current state-of-the-art heuristics.
△ Less
Submitted 15 October, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Di-Higgs production ($γγ\to h h$) in Composite Models
Authors:
A. Bharucha,
G. Cacciapaglia,
A. Deandrea,
N. Gaur,
D. Harada,
F. Mahmoudi,
K. Sridhar
Abstract:
In Standard Model (SM) Higgs Boson pair production initiated by photons ($γγ\to h h$) is loop-generated process and thereby very sensitive to any new couplings and particles that may come in loops. The Composite Higgs Models provide an alternate mechanism to address the hierarchy problem of SM where Higgs instead of being an elementary field could be a bound state of a strongly interacting sector.…
▽ More
In Standard Model (SM) Higgs Boson pair production initiated by photons ($γγ\to h h$) is loop-generated process and thereby very sensitive to any new couplings and particles that may come in loops. The Composite Higgs Models provide an alternate mechanism to address the hierarchy problem of SM where Higgs instead of being an elementary field could be a bound state of a strongly interacting sector. These set of models apart from modifying the SM Higgs couplings could also introduce new effective couplings that can have substantial impact on the loop processes. In this work we have studied the impact of such modifications by Composite Higgs models in $γγ\to h h$ production process.
△ Less
Submitted 25 October, 2021; v1 submitted 23 May, 2021;
originally announced May 2021.
-
Tera-Zooming in on light (composite) axion-like particles
Authors:
G. Cacciapaglia,
A. Deandrea,
A. M. Iyer,
K. Sridhar
Abstract:
The Tera-Z phase of future $e^+ e^-$ colliders, FCC-ee and CepC, is a goldmine for exploring $Z$ portal physics. We focus on axion-like particles (ALPs) that can be produced via $Z$ decays with a monochromatic photon. As a template model, we consider composite Higgs models with a light pseudo-scalar that couples through the Wess-Zumino-Witten term to the electroweak gauge bosons. For both photophi…
▽ More
The Tera-Z phase of future $e^+ e^-$ colliders, FCC-ee and CepC, is a goldmine for exploring $Z$ portal physics. We focus on axion-like particles (ALPs) that can be produced via $Z$ decays with a monochromatic photon. As a template model, we consider composite Higgs models with a light pseudo-scalar that couples through the Wess-Zumino-Witten term to the electroweak gauge bosons. For both photophilic and photophobic cases, we show that the Tera-Z can probe composite scales up to $100$s of TeV, well beyond the capability of the LHC and current precision physics. Our results also apply to generic ALPs and, in particular, severely constrain models that explain the muon $g-2$ anomaly.
△ Less
Submitted 22 April, 2021;
originally announced April 2021.
-
Composite Higgs revealed in Higgs pair photo-production at future colliders
Authors:
A. Bharucha,
G. Cacciapaglia,
A. Deandrea,
N. Gaur,
D. Harada,
F. Mahmoudi,
K. Sridhar
Abstract:
The next generation electron-positron colliders are designed for precision studies of the Standard Model and its extensions, in particular in the Higgs sector. We consider the potential for discovery of composite Higgs models in Higgs pair production through photon collisions. This process is loop-generated, thus it provides access to all Higgs couplings and can show new physics effects in polariz…
▽ More
The next generation electron-positron colliders are designed for precision studies of the Standard Model and its extensions, in particular in the Higgs sector. We consider the potential for discovery of composite Higgs models in Higgs pair production through photon collisions. This process is loop-generated, thus it provides access to all Higgs couplings and can show new physics effects in polarized and unpolarized cross-sections starting at relatively low collider energies. It is, therefore, relevant for all electron-positron colliders planned or in preparation. Sizeable deviations from the Standard Model predictions are present in a general class of composite Higgs models, as couplings of one or more Higgs bosons to fermions, or fermionic and scalar resonances, modify the destructive interference present in the Standard Model. In particular, large effects are due to the new quartic coupling of the Higgs to tops and to the presence of a light scalar resonance.
△ Less
Submitted 13 October, 2021; v1 submitted 17 December, 2020;
originally announced December 2020.
-
ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results
Authors:
Kusha Sridhar,
Ross Cutler,
Ando Saabas,
Tanel Parnamaa,
Markus Loide,
Hannes Gamper,
Sebastian Braun,
Robert Aichner,
Sriram Srinivasan
Abstract:
The ICASSP 2021 Acoustic Echo Cancellation Challenge is intended to stimulate research in the area of acoustic echo cancellation (AEC), which is an important part of speech enhancement and still a top issue in audio communication and conferencing systems. Many recent AEC studies report good performance on synthetic datasets where the train and test samples come from the same underlying distributio…
▽ More
The ICASSP 2021 Acoustic Echo Cancellation Challenge is intended to stimulate research in the area of acoustic echo cancellation (AEC), which is an important part of speech enhancement and still a top issue in audio communication and conferencing systems. Many recent AEC studies report good performance on synthetic datasets where the train and test samples come from the same underlying distribution. However, the AEC performance often degrades significantly on real recordings. Also, most of the conventional objective metrics such as echo return loss enhancement (ERLE) and perceptual evaluation of speech quality (PESQ) do not correlate well with subjective speech quality tests in the presence of background noise and reverberation found in realistic environments. In this challenge, we open source two large datasets to train AEC models under both single talk and double talk scenarios. These datasets consist of recordings from more than 2,500 real audio devices and human speakers in real environments, as well as a synthetic dataset. We open source two large test sets, and we open source an online subjective test framework for researchers to quickly test their results. The winners of this challenge will be selected based on the average Mean Opinion Score (MOS) achieved across all different single talk and double talk scenarios.
△ Less
Submitted 30 October, 2020; v1 submitted 10 September, 2020;
originally announced September 2020.
-
Performance analysis of weighted low rank model with sparse image histograms for face recognition under lowlevel illumination and occlusion
Authors:
K. V. Sridhar,
Raghu vamshi Hemadri
Abstract:
In a broad range of computer vision applications, the purpose of Low-rank matrix approximation (LRMA) models is to recover the underlying low-rank matrix from its degraded observation. The latest LRMA methods - Robust Principal Component Analysis (RPCA) resort to using the nuclear norm minimization (NNM) as a convex relaxation of the non-convex rank minimization. However, NNM tends to over-shrink…
▽ More
In a broad range of computer vision applications, the purpose of Low-rank matrix approximation (LRMA) models is to recover the underlying low-rank matrix from its degraded observation. The latest LRMA methods - Robust Principal Component Analysis (RPCA) resort to using the nuclear norm minimization (NNM) as a convex relaxation of the non-convex rank minimization. However, NNM tends to over-shrink the rank components and treats the different rank components equally, limiting its flexibility in practical applications. We use a more flexible model, namely the Weighted Schatten p-Norm Minimization (WSNM), to generalize the NNM to the Schatten p-norm minimization with weights assigned to different singular values. The proposed WSNM not only gives a better approximation to the original low-rank assumption but also considers the importance of different rank components. In this paper, a comparison of the low-rank recovery performance of two LRMA algorithms- RPCA and WSNM is brought out on occluded human facial images. The analysis is performed on facial images from the Yale database and over own database , where different facial expressions, spectacles, varying illumination account for the facial occlusions. The paper also discusses the prominent trends observed from the experimental results performed through the application of these algorithms. As low-rank images sometimes might fail to capture the details of a face adequately, we further propose a novel method to use the image-histogram of the sparse images thus obtained to identify the individual in any given image. Extensive experimental results show, both qualitatively and quantitatively, that WSNM surpasses RPCA in its performance more effectively by removing facial occlusions, thus giving recovered low-rank images of higher PSNR and SSIM.
△ Less
Submitted 24 July, 2020;
originally announced July 2020.
-
New proximity potential for alpha decay for superheavy nuclei
Authors:
H. C. Manjunatha,
K. N. Sridhar
Abstract:
We have constructed new proximity function particularly for interaction between two superheavy nuclei based on the experimental alpha decay half-lives. The new proximity function is used to produce the alpha decay half-lives of superheavy nuclei whose experimental values are known. The new proximity function produces the alpha decay half-lives close to the experiments. Hence we can conclude that t…
▽ More
We have constructed new proximity function particularly for interaction between two superheavy nuclei based on the experimental alpha decay half-lives. The new proximity function is used to produce the alpha decay half-lives of superheavy nuclei whose experimental values are known. The new proximity function produces the alpha decay half-lives close to the experiments. Hence we can conclude that the new proximity function can be used to study the interaction between two superheavy nuclei.
△ Less
Submitted 19 March, 2020;
originally announced March 2020.
-
Real-Time Detectors for Digital and Physical Adversarial Inputs to Perception Systems
Authors:
Yiannis Kantaros,
Taylor Carpenter,
Kaustubh Sridhar,
Yahan Yang,
Insup Lee,
James Weimer
Abstract:
Deep neural network (DNN) models have proven to be vulnerable to adversarial digital and physical attacks. In this paper, we propose a novel attack- and dataset-agnostic and real-time detector for both types of adversarial inputs to DNN-based perception systems. In particular, the proposed detector relies on the observation that adversarial images are sensitive to certain label-invariant transform…
▽ More
Deep neural network (DNN) models have proven to be vulnerable to adversarial digital and physical attacks. In this paper, we propose a novel attack- and dataset-agnostic and real-time detector for both types of adversarial inputs to DNN-based perception systems. In particular, the proposed detector relies on the observation that adversarial images are sensitive to certain label-invariant transformations. Specifically, to determine if an image has been adversarially manipulated, the proposed detector checks if the output of the target classifier on a given input image changes significantly after feeding it a transformed version of the image under investigation. Moreover, we show that the proposed detector is computationally-light both at runtime and design-time which makes it suitable for real-time applications that may also involve large-scale image domains. To highlight this, we demonstrate the efficiency of the proposed detector on ImageNet, a task that is computationally challenging for the majority of relevant defenses, and on physically attacked traffic signs that may be encountered in real-time autonomy applications. Finally, we propose the first adversarial dataset, called AdvNet that includes both clean and physical traffic sign images. Our extensive comparative experiments on the MNIST, CIFAR10, ImageNet, and AdvNet datasets show that VisionGuard outperforms existing defenses in terms of scalability and detection performance. We have also evaluated the proposed detector on field test data obtained on a moving vehicle equipped with a perception-based DNN being under attack.
△ Less
Submitted 21 April, 2022; v1 submitted 22 February, 2020;
originally announced February 2020.
-
KK Higgs produced in association with a top quark pair in the bulk RS Model
Authors:
N. Manglani,
A. Misra,
K. Sridhar
Abstract:
We present a search strategy for the first Kaluza-Klein (KK) mode of the Higgs boson in the framework of the Randall-Sundrum (RS) model with a deformed metric. We study the production of this massive excitation in association with a ttbar pair at the Large Hadron Collider (LHC). The KK Higgs primarily decays into a boosted ttbar final state and we then end up with an interesting four-top final sta…
▽ More
We present a search strategy for the first Kaluza-Klein (KK) mode of the Higgs boson in the framework of the Randall-Sundrum (RS) model with a deformed metric. We study the production of this massive excitation in association with a ttbar pair at the Large Hadron Collider (LHC). The KK Higgs primarily decays into a boosted ttbar final state and we then end up with an interesting four-top final state of which two are boosted. The boosted products in the final state improve the sensitivity for the search of the KK Higgs in this channel whose production cross-section is otherwise rather small. Our results suggest that masses of the KK Higgs resonance upto about 1.2 TeV may be explorable at the highest planned luminosities of the LHC. Beyond this mass, the KK Higgs cross-section is too tiny for it to be explored at the LHC and may be possible only at a future higher energy collider.
△ Less
Submitted 30 August, 2019;
originally announced August 2019.
-
Prediction of stable superheavy nuclei
Authors:
H. C. Manjunatha,
L. Seenappa,
K. N. Sridhar
Abstract:
We have investigated most stable superheavy nuclei by studying the decay properties such as alpha decay, cluster decay and spontaneous fission. We have investigated nine stable nuclei in the island of stability which can be detected through fission are 318123(10.5ms), 319123(4.68μs), 317124(1.74x104 y), 318124(2.70x101 y), 319124(2.83x10-2 y), 320124(1.91x10-5 y), 319125(2.46x109 y), 320125(3.81x1…
▽ More
We have investigated most stable superheavy nuclei by studying the decay properties such as alpha decay, cluster decay and spontaneous fission. We have investigated nine stable nuclei in the island of stability which can be detected through fission are 318123(10.5ms), 319123(4.68μs), 317124(1.74x104 y), 318124(2.70x101 y), 319124(2.83x10-2 y), 320124(1.91x10-5 y), 319125(2.46x109 y), 320125(3.81x106 y) and 321125(3.99x103 y). Present work also investigates three stable superheavy nuclei which can be detected through alpha decay which are 318125(1.03x1012 y), 319126(5.77x1011 y) and 320126(3.99x1010 y). These nuclei will become most stable nuclei if they synthesized in the laboratory. The identified twelve stable nuclei is the evidence for the hypothesis of island of stability
△ Less
Submitted 3 July, 2019;
originally announced July 2019.
-
Unearthing the electroweak structure of warped 5D models
Authors:
Abhishek M. Iyer,
K. Sridhar
Abstract:
Heavy charged bosons, with masses in the range of a few TeV, are a characteristic of warped extra-dimensional models with bulk gauge fields. Rendering the latter consistent with electroweak precision tests typically requires either a deformation of the metric or extension of the gauge symmetry. We make here the first attempt at finding empirical discriminants which would tell these models apart. D…
▽ More
Heavy charged bosons, with masses in the range of a few TeV, are a characteristic of warped extra-dimensional models with bulk gauge fields. Rendering the latter consistent with electroweak precision tests typically requires either a deformation of the metric or extension of the gauge symmetry. We make here the first attempt at finding empirical discriminants which would tell these models apart. Demonstrating the power of simple kinematic observables involving same-sign leptons, we construct simple yet powerful statistical discriminants.
△ Less
Submitted 6 November, 2018;
originally announced November 2018.
-
The bulk Higgs in the Deformed RS Model
Authors:
F. Mahmoudi,
N. Manglani,
K. Sridhar
Abstract:
The Randall-Sundrum model with a deformed metric can generate light Kaluza-Klein (KK) Higgs modes consistent with the electroweak precision analysis for a certain range of parameters. The first KK mode of the Higgs ($h_{1}$) in such a model could lie in the mass range varying from 800 GeV to 1.3 TeV. We find that the $h_{1}$ is gaugephobic and decays dominantly into a $t\bar{t}$ pair. The search s…
▽ More
The Randall-Sundrum model with a deformed metric can generate light Kaluza-Klein (KK) Higgs modes consistent with the electroweak precision analysis for a certain range of parameters. The first KK mode of the Higgs ($h_{1}$) in such a model could lie in the mass range varying from 800 GeV to 1.3 TeV. We find that the $h_{1}$ is gaugephobic and decays dominantly into a $t\bar{t}$ pair. The search strategy for $h_{1}$ decaying to $t\bar{t}$ at the Large Hadron Collider (LHC) in this low mass range has been studies. We have used substructure tools to suppress the large QCD background associated with this channel. We find that $h_{1}$ can be probed at the LHC.
△ Less
Submitted 7 September, 2018; v1 submitted 13 December, 2017;
originally announced December 2017.
-
Deterministic Dispersion of Mobile Robots in Dynamic Rings
Authors:
Ankush Agarwalla,
John Augustine,
William K. Moses Jr.,
Madhav Sankar K.,
Arvind Krishna Sridhar
Abstract:
In this work, we study the problem of dispersion of mobile robots on dynamic rings. The problem of dispersion of $n$ robots on an $n$ node graph, introduced by Augustine and Moses Jr. [1], requires robots to coordinate with each other and reach a configuration where exactly one robot is present on each node. This problem has real world applications and applies whenever we want to minimize the tota…
▽ More
In this work, we study the problem of dispersion of mobile robots on dynamic rings. The problem of dispersion of $n$ robots on an $n$ node graph, introduced by Augustine and Moses Jr. [1], requires robots to coordinate with each other and reach a configuration where exactly one robot is present on each node. This problem has real world applications and applies whenever we want to minimize the total cost of $n$ agents sharing $n$ resources, located at various places, subject to the constraint that the cost of an agent moving to a different resource is comparatively much smaller than the cost of multiple agents sharing a resource (e.g. smart electric cars sharing recharge stations). The study of this problem also provides indirect benefits to the study of scattering on graphs, the study of exploration by mobile robots, and the study of load balancing on graphs.
We solve the problem of dispersion in the presence of two types of dynamism in the underlying graph: (i) vertex permutation and (ii) 1-interval connectivity. We introduce the notion of vertex permutation dynamism and have it mean that for a given set of nodes, in every round, the adversary ensures a ring structure is maintained, but the connections between the nodes may change. We use the idea of 1-interval connectivity from Di Luna et al. [10], where for a given ring, in each round, the adversary chooses at most one edge to remove.
We assume robots have full visibility and present asymptotically time optimal algorithms to achieve dispersion in the presence of both types of dynamism when robots have chirality. When robots do not have chirality, we present asymptotically time optimal algorithms to achieve dispersion subject to certain constraints. Finally, we provide impossibility results for dispersion when robots have no visibility.
△ Less
Submitted 16 October, 2017; v1 submitted 20 July, 2017;
originally announced July 2017.
-
Constraining compressed versions of MUED and MSSM using soft tracks at the LHC
Authors:
Sabyasachi Chakraborty,
Saurabh Niyogi,
K. Sridhar
Abstract:
A compressed spectrum is an anticipated hideout for many beyond standard model scenarios. Such a spectrum naturally arises in the minimal universal extra dimension framework and also in supersymmetric scenarios. Low $p_T$ leptons and jets are characteristic features of such situations. Hence, a monojet with $\not E_T$ has been the conventional signal at the Large Hadron Collider (LHC). However, we…
▽ More
A compressed spectrum is an anticipated hideout for many beyond standard model scenarios. Such a spectrum naturally arises in the minimal universal extra dimension framework and also in supersymmetric scenarios. Low $p_T$ leptons and jets are characteristic features of such situations. Hence, a monojet with $\not E_T$ has been the conventional signal at the Large Hadron Collider (LHC). However, we stress that inclusion of $p_T$-binned track observables from such soft objects provide very efficient discrimination of new physics signals against various SM backgrounds. We consider two benchmark points each for minimal universal extra dimension (MUED) and minimal supersymmetric standard model (MSSM) scenarios. We perform a detailed cut-based and multivariate analysis (MVA) to show that the new physics parameter space can be probed in the ongoing run of LHC at 13 TeV center-of-mass energy with an integrated luminosity $\sim$ 20-50 fb$^{-1}$. When studied in conjunction with the dark matter relic density constraint assuming standard cosmology, we find that compressed MUED (with $ΛR=2$) can be already excluded from the existing data. Also, MVA turns out to be a better technique than regular cut-based analysis since tracks provide uncorrelated observables which would extract more information from an event.
△ Less
Submitted 19 July, 2017; v1 submitted 24 April, 2017;
originally announced April 2017.
-
A Higgs in the Warped Bulk and LHC signals
Authors:
F. Mahmoudi,
U. Maitra,
N. Manglani,
K. Sridhar
Abstract:
Warped models with the Higgs in the bulk can generate light Kaluza-Klein (KK) Higgs modes consistent with the electroweak precision analysis. The first KK mode of the Higgs (h_{1}) could lie in the 1-2 TeV range in the models with a bulk custodial symmetry. We find that the h_{1} is gaugephobic and decays dominantly into a t\bar{t} pair. We also discuss the search strategy for h_{1} decaying to t\…
▽ More
Warped models with the Higgs in the bulk can generate light Kaluza-Klein (KK) Higgs modes consistent with the electroweak precision analysis. The first KK mode of the Higgs (h_{1}) could lie in the 1-2 TeV range in the models with a bulk custodial symmetry. We find that the h_{1} is gaugephobic and decays dominantly into a t\bar{t} pair. We also discuss the search strategy for h_{1} decaying to t\bar{t} at the Large Hadron Collider. We used substructure tools to suppress the large QCD background associated with this channel. We find that h_{1} can be probed at the LHC run-2 with an integrated luminosity of 300 fb^{-1}.
△ Less
Submitted 26 August, 2016;
originally announced August 2016.
-
Exploring the Inert Doublet Model through the dijet plus missing transverse energy channel at the LHC
Authors:
P. Poulose,
Shibananda Sahoo,
K. Sridhar
Abstract:
In this study of the Inert Doublet Model (IDM), we propose that the dijet + missing transverse energy channel at the Large Hadron Collider (LHC) will be an effective way of searching for the scalar particles of the IDM. This channel receives contributions from gauge boson fusion, and $t-$channel production, along with contributions from $H^+$ associated production. We perform the analysis includin…
▽ More
In this study of the Inert Doublet Model (IDM), we propose that the dijet + missing transverse energy channel at the Large Hadron Collider (LHC) will be an effective way of searching for the scalar particles of the IDM. This channel receives contributions from gauge boson fusion, and $t-$channel production, along with contributions from $H^+$ associated production. We perform the analysis including study of the Standard Model (SM) background with assumed systematic uncertainty, and optimise the selection criteria employing suitable cuts on the kinematic variables to maximise the signal significance. We find that with high luminosity option of the LHC, this channel has the potential to probe the IDM in the mass range of up to about 400 GeV, which is not accessible through other leptonic channels. In a scenario with light dark matter of mass about 65 GeV, charged Higgs in the mass range of around 200 GeV provides the best possibility with a signal significance of about $2σ$ at an integrated luminosity of about 3000 fb$^{-1}$.
△ Less
Submitted 13 December, 2016; v1 submitted 11 April, 2016;
originally announced April 2016.
-
Warped $R-$Parity Violation
Authors:
B. C. Allanach,
A. M. Iyer,
K. Sridhar
Abstract:
We consider a modified Randall-Sundrum (RS) framework between the Planck scale and the GUT scale. In this scenario, RS works as a theory of flavour and not as a solution to the hierarchy problem. The latter is resolved by supersymmetrising the bulk, so that the minimal supersymmetric standard model being the effective 4-dimensional theory. Matter fields are localised in the bulk in order to fit fe…
▽ More
We consider a modified Randall-Sundrum (RS) framework between the Planck scale and the GUT scale. In this scenario, RS works as a theory of flavour and not as a solution to the hierarchy problem. The latter is resolved by supersymmetrising the bulk, so that the minimal supersymmetric standard model being the effective 4-dimensional theory. Matter fields are localised in the bulk in order to fit fermion-mass and mixing-data. If $R$-parity violating terms are allowed in the superpotential, their orders of magnitude throughout flavour space are then predicted, resulting in rich flavour textures. If the $R$-parity violating contributions to neutrino masses are somewhat suppressed, then lepton-number violating models exist which explain the neutrino oscillation data while not being in contradiction with current experimental bounds. Another promising model is one where baryon number is violated and Dirac neutrino masses result solely from fermion localisation. We sketch the likely discovery signatures of the baryon-number and the lepton-number violating cases.
△ Less
Submitted 12 January, 2016;
originally announced January 2016.
-
Kaluza-Klein gluon + jets associated production at the Large Hadron Collider
Authors:
A. M. Iyer,
F. Mahmoudi,
N. Manglani,
K. Sridhar
Abstract:
The Kaluza-Klein excitations of gluons offer the exciting possibility of probing bulk Randall-Sundrum (RS) models. In these bulk models either a custodial symmetry or a deformation of the metric away from AdS is invoked in order to deal with electroweak precision tests. Addressing both these models, we suggest a new channel in which to study the production of KK-gluons ($g_{KK}$): one where it is…
▽ More
The Kaluza-Klein excitations of gluons offer the exciting possibility of probing bulk Randall-Sundrum (RS) models. In these bulk models either a custodial symmetry or a deformation of the metric away from AdS is invoked in order to deal with electroweak precision tests. Addressing both these models, we suggest a new channel in which to study the production of KK-gluons ($g_{KK}$): one where it is produced in association with one or more hard jets. The cross-section for the $g_{KK}+$ jets channel is significant because of several contributing sub-processes. In particular, the 1-jet and the 2-jet associated processes are important because at these orders in QCD the $qg$ and the $gg$ initial states respectively come into play. We have performed a hadron-level simulation of the signal and present strategies to effectively extract the signal from what could potentially be a huge background. We present results for the kinematic reach of the LHC Run-II for different $g_{KK}$ masses in bulk-RS models.
△ Less
Submitted 20 April, 2016; v1 submitted 8 January, 2016;
originally announced January 2016.
-
Bulk RS models, Electroweak Precision tests and the 125 GeV Higgs
Authors:
Abhishek M. Iyer,
K. Sridhar,
Sudhir K. Vempati
Abstract:
We present upto date electroweak fits of various Randall Sundrum (RS) models. We consider the bulk RS model, deformed RS and the custodial RS models. For the bulk RS case we find the lightest Kaluza Klein (KK) mode of the gauge boson to be $\sim 8$ TeV while for the custodial case it is $\sim 3$ TeV. The deformed model is the least fine tuned of all which can give a good fit for KK masses $< 2$ Te…
▽ More
We present upto date electroweak fits of various Randall Sundrum (RS) models. We consider the bulk RS model, deformed RS and the custodial RS models. For the bulk RS case we find the lightest Kaluza Klein (KK) mode of the gauge boson to be $\sim 8$ TeV while for the custodial case it is $\sim 3$ TeV. The deformed model is the least fine tuned of all which can give a good fit for KK masses $< 2$ TeV depending on the choice of the model parameters. We also comment on the fine tuning in each case.
△ Less
Submitted 20 June, 2016; v1 submitted 22 February, 2015;
originally announced February 2015.
-
R-Parity Violating Supersymmetry Explanation for Large t tbar Forward-Backward Asymmetry
Authors:
B. C. Allanach,
K. Sridhar
Abstract:
We propose a supersymmetric explanation for the anomalously high forward backward asymmetry in top pair production measured by CDF and D0. We suppose that it is due to the t-channel exchange of a right-handed sbottom which couples to d_R and t_R, as is present in the R-parity violating minimal supersymmetric standard model. We show that all Tevatron and LHC experiments' t tbar constraints may be r…
▽ More
We propose a supersymmetric explanation for the anomalously high forward backward asymmetry in top pair production measured by CDF and D0. We suppose that it is due to the t-channel exchange of a right-handed sbottom which couples to d_R and t_R, as is present in the R-parity violating minimal supersymmetric standard model. We show that all Tevatron and LHC experiments' t tbar constraints may be respected for a sbottom mass between 300 and 1200 GeV, and a large Yukawa coupling >2.2, yielding A_{FB} up to 0.18. The non Standard Model contribution to the LHC charge asymmetry parameter is Delta A_C^y=0.017-0.045, small enough to be consistent with current measurements but non-zero and positive, allowing for LHC confirmation in the future within 20 fb^-1. A small additional contribution to the LHC t tbar production cross-section is also predicted, allowing a further test. We estimate that 10 fb^-1 of LHC luminosity would be sufficient to rule out the proposal to 95% confidence level, if the measurements of the t tbar cross-section turn out to be centred on the Standard Model prediction.
△ Less
Submitted 3 September, 2012; v1 submitted 23 May, 2012;
originally announced May 2012.
-
eta_c production at the Large Hadron Collider
Authors:
Sudhansu S. Biswal,
K. Sridhar
Abstract:
We have studied the production of the 1S_0 charmonium state, eta_c, at the Large Hadron Collider (LHC) in the framework of Non-Relativistic Quantum Chromodynamics (NRQCD) using heavy-quark symmetry. We find that NRQCD predicts a large production cross-section for this resonance at the LHC even after taking account the small branching ratio of eta_c into two photons. We show that it will be possibl…
▽ More
We have studied the production of the 1S_0 charmonium state, eta_c, at the Large Hadron Collider (LHC) in the framework of Non-Relativistic Quantum Chromodynamics (NRQCD) using heavy-quark symmetry. We find that NRQCD predicts a large production cross-section for this resonance at the LHC even after taking account the small branching ratio of eta_c into two photons. We show that it will be possible to test NRQCD through its predictions for eta_c, with the statistics that will be achieved at the early stage of the LHC, running at a center of mass energy of 7 TeV with an integrated luminosity of 100 pb^{-1}
△ Less
Submitted 29 July, 2010;
originally announced July 2010.
-
Boosted Top Quark Signals for Heavy Vector Boson Excitations in a Universal Extra Dimension Model
Authors:
Biplob Bhattacherjee,
Manoranjan Guchait,
Sreerup Raychaudhuri,
K. Sridhar
Abstract:
In view of the fact that the $n = 1$ Kaluza-Klein (KK) modes in a model with a Universal Extra Dimension (UED), could mimic supersymmetry signatures at the LHC, it is necessary to look for the $n = 2$ KK modes, which have no analogues in supersymmetry. We discuss the possibility of searching for heavy $n = 2$ vector boson resonances -- especially the $g_2$ -- through their decays to a highly-boost…
▽ More
In view of the fact that the $n = 1$ Kaluza-Klein (KK) modes in a model with a Universal Extra Dimension (UED), could mimic supersymmetry signatures at the LHC, it is necessary to look for the $n = 2$ KK modes, which have no analogues in supersymmetry. We discuss the possibility of searching for heavy $n = 2$ vector boson resonances -- especially the $g_2$ -- through their decays to a highly-boosted top quark-antiquark pair using recently-developed top-jet tagging techniques in the hadronic channel. It is shown that $t\bar{t}$ signals from the $n = 2$ gluon resonance are as efficient a discovery mode at the LHC as dilepton channels from the $γ_2$ and $Z_2$ resonances.
△ Less
Submitted 16 June, 2010;
originally announced June 2010.
-
Gluon-initiated production of a Kaluza-Klein gluon in a Bulk Randall-Sundrum model
Authors:
B. C. Allanach,
F. Mahmoudi,
J. P. Skittrall,
K. Sridhar
Abstract:
In the Bulk Randall-Sundrum model, the Kaluza-Klein excitations of the gauge bosons are the primary signatures. In particular, the search for the Kaluza-Klein (KK) excitation of the gluon at hadron colliders is of great importance in testing this model. At the leading order in QCD, the production of this KK-gluon proceeds only via q qbar-initial states. We study the production of KK-gluons from…
▽ More
In the Bulk Randall-Sundrum model, the Kaluza-Klein excitations of the gauge bosons are the primary signatures. In particular, the search for the Kaluza-Klein (KK) excitation of the gluon at hadron colliders is of great importance in testing this model. At the leading order in QCD, the production of this KK-gluon proceeds only via q qbar-initial states. We study the production of KK-gluons from gluon initial states at next-to-leading order in QCD. We find that, even after including the sub-dominant KK-gluon loops at this order, the next-to-leading order (NLO) cross-section is tiny compared to the leading order cross-section and unlikely to impact the searches for this resonance at hardon colliders.
△ Less
Submitted 5 November, 2009; v1 submitted 7 October, 2009;
originally announced October 2009.