Extended Shock Breakout and Early Circumstellar Interaction in SN 2024ggi

Authors: Manisha Shrestha, K. Azalee Bostroem, David J. Sand, Griffin Hosseinzadeh, Jennifer E. Andrews, Yize Dong, Emily Hoang, Daryl Janzen, Jeniveve Pearson, Jacob E. Jencson, M. J. Lundquist, Darshana Mehta, Aravind P. Ravi, Nicolas Meza Retamal, Stefano Valenti, Peter J. Brown, Saurabh W. Jha, Colin Macrie, Brian Hsu, Joseph Farah, D. Andrew Howell, Curtis McCully, Megan Newsome, Estefania Padilla Gonzalez, Craig Pellegrino, et al. (18 additional authors not shown)

Abstract: We present high-cadence photometric and spectroscopic observations of supernova (SN) 2024ggi, a Type II SN with flash spectroscopy features which exploded in the nearby galaxy NGC 3621 at $\sim$7 Mpc. The light-curve evolution over the first 30 hours can be fit by two power law indices with a break after 22 hours, rising from $M_V \approx -12.95$ mag at +0.66 days to $M_V \approx -17.91$ mag after 7 days. In addition, the densely sampled color curve shows a strong blueward evolution over the first few days and then behaves as a normal SN II with a redward evolution as the ejecta cool. Such deviations could be due to interaction with circumstellar material (CSM). Early high- and low-resolution spectra clearly show high-ionization flash features from the first spectrum to +3.42 days after the explosion. From the high-resolution spectra, we calculate the CSM velocity to be $37 \pm 4~\mathrm{km\,s^{-1}}$. We also see the line strength evolve rapidly from 1.22 to 1.49 days in the earliest high-resolution spectra. Comparison of the low-resolution spectra with CMFGEN models suggests that the pre-explosion mass-loss rate of SN 2024ggi falls in a range of $10^{-3}$ to $10^{-2}$ M$_{\odot}$ yr$^{-1}$, which is similar to that derived for SN 2023ixf. However, the rapid temporal evolution of the narrow lines in the spectra of SN 2024ggi ($R_\mathrm{CSM} \sim 2.7 \times 10^{14}~\mathrm{cm}$) could indicate a smaller spatial extent of the CSM than in SN 2023ixf ($R_\mathrm{CSM} \sim 5.4 \times 10^{14}~\mathrm{cm}$), which in turn implies lower total CSM mass for SN 2024ggi.

Submitted 28 May, 2024; originally announced May 2024.

Comments: 22 pages, 15 figures, 4 tables, submitted to ApJL
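The numbers quoted in the SN 2024ggi abstract can be related with some back-of-the-envelope arithmetic. The sketch below is an illustration only and is not taken from the paper: it converts the quoted absolute magnitude to an apparent magnitude at roughly 7 Mpc (ignoring extinction, which is an assumption), computes the flux ratio implied by the rise from $M_V \approx -12.95$ to $M_V \approx -17.91$, and estimates how long material moving at a constant 37 km/s would take to travel the quoted CSM radius (constant velocity and a negligible starting radius are also assumptions). All function names are hypothetical.

```python
import math

# Values quoted in the abstract above.
DISTANCE_MPC = 7.0     # approximate distance to NGC 3621
M_V_EARLY = -12.95     # absolute V magnitude at +0.66 days
M_V_LATE = -17.91      # absolute V magnitude after 7 days
V_CSM_KM_S = 37.0      # CSM velocity from the high-resolution spectra
R_CSM_CM = 2.7e14      # CSM radius quoted for SN 2024ggi

SECONDS_PER_YEAR = 365.25 * 24 * 3600


def distance_modulus(distance_mpc: float) -> float:
    """Distance modulus mu = 5 log10(d / 10 pc); extinction is ignored (assumption)."""
    distance_pc = distance_mpc * 1.0e6
    return 5.0 * math.log10(distance_pc / 10.0)


def flux_ratio(delta_mag: float) -> float:
    """Flux ratio corresponding to a brightening of delta_mag magnitudes."""
    return 10.0 ** (0.4 * delta_mag)


def travel_time_years(radius_cm: float, velocity_km_s: float) -> float:
    """Time to cover radius_cm at a constant velocity (assumption), in years."""
    velocity_cm_s = velocity_km_s * 1.0e5
    return radius_cm / velocity_cm_s / SECONDS_PER_YEAR


mu = distance_modulus(DISTANCE_MPC)
apparent_early = M_V_EARLY + mu
brightening = flux_ratio(M_V_EARLY - M_V_LATE)
csm_age_years = travel_time_years(R_CSM_CM, V_CSM_KM_S)

print(f"distance modulus: {mu:.2f} mag")
print(f"apparent V magnitude at +0.66 d: {apparent_early:.2f} mag")
print(f"flux increase over the rise: x{brightening:.0f}")
print(f"constant-velocity CSM travel time: {csm_age_years:.1f} yr")
```

Under these assumptions the rise corresponds to a flux increase of roughly two orders of magnitude, and the CSM travel time comes out at a few years. Treat both as order-of-magnitude checks rather than results from the paper.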

arXiv:2405.04583 [pdf, other]

SN2023fyq: A Type Ibn Supernova With Long-standing Precursor Activity Due to Binary Interaction

Authors: Yize Dong, Daichi Tsuna, Stefano Valenti, David J. Sand, Jennifer E. Andrews, K. Azalee Bostroem, Griffin Hosseinzadeh, Emily Hoang, Saurabh W. Jha, Daryl Janzen, Jacob E. Jencson, Michael Lundquist, Darshana Mehta, Aravind P. Ravi, Nicolas E. Meza Retamal, Jeniveve Pearson, Manisha Shrestha, Alceste Bonanos, D. Andrew Howell, Nathan Smith, Joseph Farah, Daichi Hiramatsu, Koichi Itagaki, Curtis McCully, Megan Newsome, et al. (7 additional authors not shown)

Abstract: We present photometric and spectroscopic observations of SN 2023fyq, a type Ibn supernova in the nearby galaxy NGC 4388 (D$\simeq$18 Mpc). In addition, we trace long-standing precursor emission at the position of SN 2023fyq using data from DLT40, ATLAS, ZTF, ASAS-SN, Swift, and amateur astronomer Koichi Itagaki. Precursor activity is observed up to nearly three years before the supernova explosion, with a relatively rapid rise in the final 100 days. The double-peaked post-explosion light curve reaches a luminosity of $\sim10^{43}~\rm erg\,s^{-1}$. The strong intermediate-width He lines observed in the nebular spectrum of SN 2023fyq imply the interaction is still active at late phases. We found that the precursor activity in SN 2023fyq is best explained by the mass transfer in a binary system involving a low-mass He star and a compact companion. An equatorial disk is likely formed in this process ($\sim$0.6$\rm M_{\odot}$), and the interaction of SN ejecta with this disk powers the main peak of the supernova. The early SN light curve reveals the presence of dense extended material ($\sim$0.3$\rm M_{\odot}$) at $\sim$3000$\rm R_{\odot}$ ejected weeks before the SN explosion, likely due to final-stage core silicon burning or runaway mass transfer resulting from binary orbital shrinking, leading to rapid rising precursor emission within $\sim$30 days prior to explosion. The final explosion could be triggered either by the core-collapse of the He star or by the merger of the He star with a compact object. SN 2023fyq, along with SN 2018gjx and SN 2015G, forms a unique class of Type Ibn SNe which originate in binary systems and are likely to exhibit detectable long-lasting pre-explosion outbursts with magnitudes ranging from $-$10 to $-$13.

Submitted 7 May, 2024; originally announced May 2024.

Comments: submitted to ApJ

arXiv:2404.01585 [pdf, other]

FLEXIS: FLEXible Frequent Subgraph Mining using Maximal Independent Sets

Authors: Akshit Sharma, Sam Reinher, Dinesh Mehta, Bo Wu

Abstract: Frequent Subgraph Mining (FSM) is the process of identifying common subgraph patterns that surpass a predefined frequency threshold. While FSM is widely applicable in fields like bioinformatics, chemical analysis, and social network anomaly detection, its execution remains time-consuming and complex. This complexity stems from the need to recognize high-frequency subgraphs and ascertain if they exceed the set threshold. Current approaches to identifying these patterns often rely on edge or vertex extension methods. However, these strategies can introduce redundancies and cause increased latency. To address these challenges, this paper introduces a novel approach for identifying potential k-vertex patterns by combining two frequently observed (k - 1)-vertex patterns. This method optimizes the breadth-first search, which allows for quicker search termination based on vertex count and support value. Another challenge in FSM is the validation of the presumed pattern against a specific threshold. Existing metrics, such as Maximum Independent Set (MIS) and Minimum Node Image (MNI), either demand significant computational time or risk overestimating pattern counts. Our innovative approach aligns with the MIS and identifies independent subgraphs. Through the "Maximal Independent Set" metric, this paper offers an efficient solution that minimizes latency and provides users with control over pattern overlap. Through extensive experimentation, our proposed method achieves an average of 10.58x speedup when compared to GraMi and an average 3x speedup when compared to T-FSM.

Submitted 1 April, 2024; originally announced April 2024.
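The FLEXIS abstract contrasts support metrics: Minimum Node Image (MNI) can overestimate pattern counts, while an independent-set metric counts only non-overlapping occurrences. The following is a minimal sketch of that contrast, not the FLEXIS implementation. It assumes each embedding is given as a tuple of data-graph vertex ids (one per pattern node), treats two embeddings as conflicting when they share any vertex, and uses a simple first-come greedy pass to build a maximal (not necessarily maximum) independent set. The function names are hypothetical.

```python
from typing import List, Sequence, Set, Tuple

# An embedding lists the data-graph vertex matched to each pattern node.
Embedding = Tuple[int, ...]


def mni_support(embeddings: Sequence[Embedding]) -> int:
    """Minimum Node Image: the smallest number of distinct data vertices
    matched to any single pattern node across all embeddings."""
    if not embeddings:
        return 0
    pattern_size = len(embeddings[0])
    return min(
        len({emb[position] for emb in embeddings})
        for position in range(pattern_size)
    )


def greedy_maximal_independent_set(
    embeddings: Sequence[Embedding],
) -> List[Embedding]:
    """Greedily keep embeddings that share no vertex with those already kept.

    The result is maximal: every rejected embedding overlaps a kept one.
    It is a lower bound on the maximum independent set size.
    """
    used_vertices: Set[int] = set()
    chosen: List[Embedding] = []
    for emb in embeddings:
        if used_vertices.isdisjoint(emb):
            chosen.append(emb)
            used_vertices.update(emb)
    return chosen


# Three occurrences of a 2-vertex pattern arranged around a triangle.
triangle_embeddings = [(1, 2), (2, 3), (3, 1)]

print("MNI support:", mni_support(triangle_embeddings))
print(
    "independent-set support:",
    len(greedy_maximal_independent_set(triangle_embeddings)),
)
```

In the triangle example every pair of embeddings shares a vertex, so only one can be kept, whereas MNI still reports three. That gap is the kind of overestimate the abstract refers to.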

arXiv:2402.04297 [pdf, other]

Road Surface Defect Detection -- From Image-based to Non-image-based: A Survey

Authors: Jongmin Yu, Jiaqi Jiang, Sebastiano Fichera, Paolo Paoletti, Lisa Layzell, Devansh Mehta, Shan Luo

Abstract: Ensuring traffic safety is crucial, which necessitates the detection and prevention of road surface defects. As a result, there has been a growing interest in the literature on the subject, leading to the development of various road surface defect detection methods. The methods for detecting road defects can be categorised in various ways depending on the input data types or training methodologies. The predominant approach involves image-based methods, which analyse pixel intensities and surface textures to identify defects. Despite their popularity, image-based methods share the distinct limitation of vulnerability to weather and lighting changes. To address this issue, researchers have explored the use of additional sensors, such as laser scanners or LiDARs, providing explicit depth information to enable the detection of defects in terms of scale and volume. However, the exploration of data beyond images has not been sufficiently investigated. In this survey paper, we provide a comprehensive review of road surface defect detection studies, categorising them based on input data types and methodologies used. Additionally, we review recently proposed non-image-based methods and discuss several challenges and open problems associated with these techniques.

Submitted 6 February, 2024; originally announced February 2024.

Comments: Survey papers

arXiv:2402.04064 [pdf, other]

Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing

Authors: Jongmin Yu, Chen Bene Chi, Sebastiano Fichera, Paolo Paoletti, Devansh Mehta, Shan Luo

Abstract: Road pavement detection and segmentation are critical for developing autonomous road repair systems. However, developing an instance segmentation method that simultaneously performs multi-class defect detection and segmentation is challenging due to the textural simplicity of road pavement images, the diversity of defect geometries, and the morphological ambiguity between classes. We propose a novel end-to-end method for multi-class road defect detection and segmentation. The proposed method comprises multiple spatial and channel-wise attention blocks available to learn global representations across spatial and channel-wise dimensions. Through these attention blocks, more globally generalised representations of morphological information (spatial characteristics) of road defects and colour and depth information of images can be learned. To demonstrate the effectiveness of our framework, we conducted various ablation studies and comparisons with prior methods on a newly collected dataset annotated with nine road defect classes. The experiments show that our proposed method outperforms existing state-of-the-art methods for multi-class road defect detection and segmentation.

Submitted 6 February, 2024; originally announced February 2024.

Comments: Accepted to the ICRA 2024

arXiv:2401.17671 [pdf, other]

Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain

Authors: Gavin Mischler, Yinghao Aaron Li, Stephan Bickel, Ashesh D. Mehta, Nima Mesgarani

Abstract: Recent advancements in artificial intelligence have sparked interest in the parallels between large language models (LLMs) and human neural processing, particularly in language comprehension. While prior research has established similarities in the representation of LLMs and the brain, the underlying computational principles that cause this convergence, especially in the context of evolving LLMs, remain elusive. Here, we examined a diverse selection of high-performance LLMs with similar parameter sizes to investigate the factors contributing to their alignment with the brain's language processing mechanisms. We find that as LLMs achieve higher performance on benchmark tasks, they not only become more brain-like as measured by higher performance when predicting neural responses from LLM embeddings, but also their hierarchical feature extraction pathways map more closely onto the brain's while using fewer layers to do the same encoding. We also compare the feature extraction pathways of the LLMs to each other and identify new ways in which high-performing models have converged toward similar hierarchical processing mechanisms. Finally, we show the importance of contextual information in improving model performance and brain similarity. Our findings reveal the converging aspects of language processing in the brain and LLMs and offer new directions for developing models that align more closely with human cognitive processing.

Submitted 31 January, 2024; originally announced January 2024.

Comments: 19 pages, 5 figures and 4 supplementary figures

arXiv:2312.13225 [pdf, other]

Automated DevOps Pipeline Generation for Code Repositories using Large Language Models

Authors: Deep Mehta, Kartik Rawool, Subodh Gujar, Bowen Xu

Abstract: Automating software development processes through the orchestration of GitHub Action workflows has revolutionized the efficiency and agility of software delivery pipelines. This paper presents a detailed investigation into the use of Large Language Models (LLMs), specifically GPT 3.5 and GPT 4, to generate and evaluate GitHub Action workflows for DevOps tasks. Our methodology involves data collection from public GitHub repositories, prompt engineering for LLM utilization, and evaluation metrics encompassing exact match scores, BLEU scores, and a novel DevOps Aware score. The research scrutinizes the proficiency of GPT 3.5 and GPT 4 in generating GitHub workflows, while assessing the influence of various prompt elements in constructing the most efficient pipeline. Results indicate substantial advancements in GPT 4, particularly in DevOps awareness and syntax correctness. The research introduces a GitHub App built on Probot, empowering users to automate workflow generation within the GitHub ecosystem. This study contributes insights into the evolving landscape of AI-driven automation in DevOps practices.

Submitted 20 December, 2023; originally announced December 2023.

arXiv:2311.17723 [pdf, other]

First results of evaporation residue cross-section measurements of $^{32}$S+$^{208}$Pb system

Authors: R. Sariyal, I. Mazumdar, D. Mehta, N. Madhavan, S. Nath, J. Gehlot, Gonika, S. M. Patel, P. B. Chavan, S. Panwar, V. Ranga, A. Parihari

Abstract: The dynamics of heavy ion-induced reactions play a critical role in forming super heavy elements (SHE), and one clear signature of the SHE formation is the evaporation residue (ER). In our pursuit of SHE, we present the heaviest element populated in India for ER cross-section measurements. These are the first-ever measurements of the Evaporation Residue (ER) cross-sections for the nuclear reactions between $^{32}$S and $^{208}$Pb. These measurements were conducted above the Coulomb barrier at four distinct beam energies in the laboratory frame, ranging from 176 to 191 MeV at the pelletron Linac facility at the Inter-University Accelerator Centre (IUAC), New Delhi. The Hybrid Recoil Mass Analyzer (HYRA) in a gas-filled mode was employed for these experiments. The obtained range of ER cross-sections enriches our knowledge and helps advance the field of heavy ion-induced reactions, especially in the context of super heavy element formation.

Submitted 29 November, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

Comments: 12 pages, 10 figures. arXiv admin note: text overlap with arXiv:2311.09046

arXiv:2311.09046 [pdf, other]

Measurements of evaporation residue cross-sections and evaporation residue-gated $γ$-ray fold distributions for $^{32}$S+$^{154}$Sm system

Authors: R. Sariyal, I. Mazumdar, D. Mehta, N. Madhavan, S. Nath, J. Gehlot, Gonika, S. M. Patel, P. B. Chavan, S. Panwar, V. Ranga, A. Parihari, A. K. Nasirov, B. M. Kayumov

Abstract: Evaporation Residue (ER) cross-sections and ER-gated $γ$-ray fold distributions are measured for the $^{32}$S + $^{154}$Sm nuclear reaction above the Coulomb barrier at six different beam energies from 148 to 191 MeV. $γ$-ray multiplicities and spin distributions are extracted from the ER-gated fold distributions. The ER cross-sections measured in the present work are found to be much higher than what was reported in a previous work using a very different target-projectile ($^{48}$Ti + $^{138}$Ba) combination, leading to the same compound nucleus $^{186}$Pt, with much less mass asymmetry in the entrance channel than the present reaction. This clearly demonstrates the effect of the entrance channel on ER production cross-section. The ER cross-sections measured in the present work are compared with the results of both the statistical model calculations and the dynamical model calculations. Statistical model calculations have been performed to generate a range of parameter space for both the barrier height and Kramers' viscosity parameter over which the ER cross-section data can be reproduced. The calculations performed using the dinuclear system (DNS) model reproduce the data considering both complete and incomplete fusion processes. DNS calculations indicate the need for the inclusion of the incomplete fusion channel at higher energies to reproduce the ER cross-sections.

Submitted 15 November, 2023; originally announced November 2023.

Comments: 13 pages, 18 figures

arXiv:2311.01009 [pdf, other]

Revamping AI Models in Dermatology: Overcoming Critical Challenges for Enhanced Skin Lesion Diagnosis

Authors: Deval Mehta, Brigid Betz-Stablein, Toan D Nguyen, Yaniv Gal, Adrian Bowling, Martin Haskett, Maithili Sashindranath, Paul Bonnington, Victoria Mar, H Peter Soyer, Zongyuan Ge

Abstract: The surge in developing deep learning models for diagnosing skin lesions through image analysis is notable, yet their clinical black faces challenges. Current dermatology AI models have limitations: limited number of possible diagnostic outputs, lack of real-world testing on uncommon skin lesions, inability to detect out-of-distribution images, and over-reliance on dermoscopic images. To address these, we present an All-In-One Hierarchical-Out of Distribution-Clinical Triage (HOT) model. For a clinical image, our model generates three outputs: a hierarchical prediction, an alert for out-of-distribution images, and a recommendation for dermoscopy if clinical image alone is insufficient for diagnosis. When the recommendation is pursued, it integrates both clinical and dermoscopic images to deliver final diagnosis. Extensive experiments on a representative cutaneous lesion dataset demonstrate the effectiveness and synergy of each component within our framework. Our versatile model provides valuable decision support for lesion diagnosis and sets a promising precedent for medical AI applications.

Submitted 2 November, 2023; originally announced November 2023.

arXiv:2310.12428 [pdf, other]

Towards Enhanced Local Explainability of Random Forests: a Proximity-Based Approach

Authors: Joshua Rosaler, Dhruv Desai, Bhaskarjit Sarmah, Dimitrios Vamvourellis, Deran Onay, Dhagash Mehta, Stefano Pasquali

Abstract: We initiate a novel approach to explain the out of sample performance of random forest (RF) models by exploiting the fact that any RF can be formulated as an adaptive weighted K nearest-neighbors model. Specifically, we use the proximity between points in the feature space learned by the RF to re-write random forest predictions exactly as a weighted average of the target labels of training data points. This linearity facilitates a local notion of explainability of RF predictions that generates attributions for any model prediction across observations in the training set, and thereby complements established methods like SHAP, which instead generates attributions for a model prediction across dimensions of the feature space. We demonstrate this approach in the context of a bond pricing model trained on US corporate bond trades, and compare our approach to various existing approaches to model explainability.

Submitted 18 October, 2023; originally announced October 2023.

Comments: 5 pages, 6 figures
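The claim that a random forest prediction can be rewritten exactly as a weighted average of training labels can be checked on a toy ensemble. The sketch below is an illustration under simplifying assumptions, not the paper's method: the "trees" are hand-built one-split stumps on a single feature, every tree sees the full training set (no bootstrap sampling), and each leaf predicts the mean label of the training points it contains. The names `make_stump`, `forest_predict`, and `proximity_weights` are hypothetical.

```python
from typing import Callable, List, Sequence

# A "tree" here is just a function mapping a feature value to a leaf id.
Tree = Callable[[float], int]


def make_stump(threshold: float) -> Tree:
    """One-split tree: leaf 0 if x <= threshold, otherwise leaf 1."""
    return lambda x: 0 if x <= threshold else 1


def forest_predict(
    trees: Sequence[Tree],
    x_train: Sequence[float],
    y_train: Sequence[float],
    x: float,
) -> float:
    """Usual forest prediction: average over trees of the mean training
    label in the leaf that x falls into."""
    total = 0.0
    for tree in trees:
        leaf = tree(x)
        labels = [y for xi, y in zip(x_train, y_train) if tree(xi) == leaf]
        total += sum(labels) / len(labels)
    return total / len(trees)


def proximity_weights(
    trees: Sequence[Tree], x_train: Sequence[float], x: float
) -> List[float]:
    """Weight of each training point: averaged over trees, 1 / (leaf size)
    if the point shares a leaf with x, else 0."""
    weights = [0.0] * len(x_train)
    for tree in trees:
        leaf = tree(x)
        members = [i for i, xi in enumerate(x_train) if tree(xi) == leaf]
        for i in members:
            weights[i] += 1.0 / (len(members) * len(trees))
    return weights


x_train = [1.0, 2.0, 3.0, 4.0]
y_train = [10.0, 20.0, 30.0, 40.0]
trees = [make_stump(1.5), make_stump(2.5), make_stump(3.5)]
query = 2.2

weights = proximity_weights(trees, x_train, query)
weighted_average = sum(w * y for w, y in zip(weights, y_train))
direct = forest_predict(trees, x_train, y_train, query)

print("weights:", [round(w, 4) for w in weights])
print("weighted average:", round(weighted_average, 4))
print("direct forest prediction:", round(direct, 4))
```

The weights sum to one and the two predictions agree, so each weight can be read as the attribution of that training observation to the prediction. With bootstrap sampling or a different proximity definition the weights would have to be adjusted accordingly.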

arXiv:2310.10760 [pdf, other]

Towards reducing hallucination in extracting information from financial reports using Large Language Models

Authors: Bhaskarjit Sarmah, Tianjie Zhu, Dhagash Mehta, Stefano Pasquali

Abstract: For a financial analyst, the question and answer (Q&A) segment of the company financial report is a crucial piece of information for various analysis and investment decisions. However, extracting valuable insights from the Q&A section has posed considerable challenges as the conventional methods such as detailed reading and note-taking lack scalability and are susceptible to human errors, and Optical Character Recognition (OCR) and similar techniques encounter difficulties in accurately processing unstructured transcript text, often missing subtle linguistic nuances that drive investor decisions. Here, we demonstrate the utilization of Large Language Models (LLMs) to efficiently and rapidly extract information from earnings report transcripts while ensuring high accuracy, transforming the extraction process as well as reducing hallucination by combining retrieval-augmented generation technique as well as metadata. We evaluate the outcomes of various LLMs with and without using our proposed approach based on various objective metrics for evaluating Q&A systems, and empirically demonstrate superiority of our method.

Submitted 16 October, 2023; originally announced October 2023.

Comments: 4 pages + references. Accepted for publication in Workshop on Generative AI at the 3rd International Conference on AI-ML Systems 2023, Bengaluru, India

arXiv:2310.00162 [pdf, other]

Evidence of weak circumstellar medium interaction in the Type II SN 2023axu

Authors: Manisha Shrestha, Jeniveve Pearson, Samuel Wyatt, David J. Sand, Griffin Hosseinzadeh, K. Azalee Bostroem, Jennifer E. Andrews, Yize Dong, Emily Hoang, Daryl Janzen, Jacob E. Jencson, M. J. Lundquist, Darshana Mehta, Nicolas Meza Retamal, Stefano Valenti, Jillian C. Rastinejad, Phil Daly, Dallan Porter, Joannah Hinz, Skyler Self, Benjamin Weiner, Grant G. Williams, Daichi Hiramatsu, D. Andrew Howell, Curtis McCully, et al. (12 additional authors not shown)

Abstract: We present high-cadence photometric and spectroscopic observations of SN 2023axu, a classical Type II supernova with an absolute $V$-band peak magnitude of $-16.5 \pm 0.1$ mag. SN 2023axu was discovered by the Distance Less Than 40 Mpc (DLT40) survey within 1 day of the last non-detection in the nearby galaxy NGC 2283 at 13.7 Mpc. We modeled the early light curve using a recently updated shock cooling model that includes the effects of line blanketing and found the explosion epoch to be MJD 59971.48 $\pm$ 0.03 and the probable progenitor to be a red supergiant with a radius of 417 $\pm$ 28 $R_\odot$. The shock cooling model cannot match the rise of observed data in the $r$ and $i$ bands and underpredicts the overall UV data, which points to possible interaction with circumstellar material. This interpretation is further supported by spectral behavior. We see a ledge feature around 4600 Å in the very early spectra (+1.1 and +1.5 days after the explosion) which can be a sign of circumstellar interaction. The signs of circumstellar material are further bolstered by the presence of absorption features blueward of H$α$ and H$β$ at day $>$40 which is also generally attributed to circumstellar interaction. Our analysis shows the need for high-cadence early photometric and spectroscopic data to decipher the mass-loss history of the progenitor.

Submitted 29 September, 2023; originally announced October 2023.

Comments: 18 pages, 12 figures, to be submitted to the AAS Journals

arXiv:2309.10054 [pdf, other]

Strong Carbon Features and a Red Early Color in the Underluminous Type Ia SN 2022xkq

Authors: Jeniveve Pearson, David J. Sand, Peter Lundqvist, Lluís Galbany, Jennifer E. Andrews, K. Azalee Bostroem, Yize Dong, Emily Hoang, Griffin Hosseinzadeh, Daryl Janzen, Jacob E. Jencson, Michael J. Lundquist, Darshana Mehta, Nicolás Meza Retamal, Manisha Shrestha, Stefano Valenti, Samuel Wyatt, Joseph P. Anderson, Chris Ashall, Katie Auchettl, Eddie Baron, Stéphane Blondin, Christopher R. Burns, Yongzhi Cai, Ting-Wan Chen, et al. (63 additional authors not shown)

Abstract: We present optical, infrared, ultraviolet, and radio observations of SN 2022xkq, an underluminous fast-declining type Ia supernova (SN Ia) in NGC 1784 ($\mathrm{D}\approx31$ Mpc), from $<1$ to 180 days after explosion. The high-cadence observations of SN 2022xkq, a photometrically transitional and spectroscopically 91bg-like SN Ia, cover the first days and weeks following explosion which are critical to distinguishing between explosion scenarios. The early light curve of SN 2022xkq has a red early color and exhibits a flux excess which is more prominent in redder bands; this is the first time such a feature has been seen in a transitional/91bg-like SN Ia. We also present 92 optical and 19 near-infrared (NIR) spectra, beginning 0.4 days after explosion in the optical and 2.6 days after explosion in the NIR. SN 2022xkq exhibits a long-lived C I 1.0693 $μ$m feature which persists until 5 days post-maximum. We also detect C II $λ$6580 in the pre-maximum optical spectra. These lines are evidence for unburnt carbon that is difficult to reconcile with the double detonation of a sub-Chandrasekhar mass white dwarf. No existing explosion model can fully explain the photometric and spectroscopic dataset of SN 2022xkq, but the considerable breadth of the observations is ideal for furthering our understanding of the processes which produce faint SNe Ia.

Submitted 6 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

Comments: 38 pages, 16 figures, accepted for publication in ApJ, the figure 15 input models and synthetic spectra are now available at https://zenodo.org/record/8379254

arXiv:2309.09433 [pdf, other]

SN 2022crv: IIb, Or Not IIb: That is the Question

Authors: Yize Dong, Stefano Valenti, Chris Ashall, Marc Williamson, David J. Sand, Schuyler D. Van Dyk, Saurabh W. Jha, Michael Lundquist, Maryam Modjaz, Jennifer E. Andrews, Jacob E. Jencson, Griffin Hosseinzadeh, Jeniveve Pearson, Lindsey A. Kwok, Teresa Boland, Eric Y. Hsiao, Nathan Smith, Nancy Elias-Rosa, Shubham Srivastav, Stephen Smartt, Michael Fulton, WeiKang Zheng, Thomas G. Brink, Alexei V. Filippenko, Melissa Shahbandeh, et al. (30 additional authors not shown)

Abstract: We present optical and near-infrared observations of SN 2022crv, a stripped envelope supernova in NGC 3054, discovered within 12 hrs of explosion by the Distance Less Than 40 Mpc Survey. We suggest SN 2022crv is a transitional object on the continuum between SNe Ib and SNe IIb. A high-velocity hydrogen feature ($\sim$$-$20,000 -- $-$16,000 $\rm km\,s^{-1}$) was conspicuous in SN 2022crv at early phases, and then quickly disappeared around maximum light. By comparing with hydrodynamic modeling, we find that a hydrogen envelope of $\sim 10^{-3}$ $\rm M_{\odot}$ can reproduce the behaviour of the hydrogen feature observed in SN 2022crv. The early light curve of SN 2022crv did not show envelope cooling emission, implying that SN 2022crv had a compact progenitor with an extremely low amount of hydrogen. The analysis of the nebular spectra shows that SN 2022crv is consistent with the explosion of a He star with a final mass of $\sim$4.5 -- 5.6 $\rm M_{\odot}$ that has evolved from a $\sim$16 -- 22 $\rm M_{\odot}$ zero-age main sequence star in a binary system with about 1.0 -- 1.7 $\rm M_{\odot}$ of oxygen finally synthesized in the core. The high metallicity at the supernova site indicates that the progenitor experienced a strong stellar wind mass loss. In order to retain a small amount of residual hydrogen at such a high metallicity, the initial orbital separation of the binary system is likely larger than $\sim$1000 $\rm R_{\odot}$. The near-infrared spectra of SN 2022crv show a unique absorption feature on the blue side of the He I line at $\sim$1.005 $μ$m. This is the first time that such a feature has been observed in a Type Ib/IIb, and could be due to Sr II. Further detailed modelling on SN 2022crv can shed light on the progenitor and the origin of the mysterious absorption feature in the near infrared.

Submitted 17 September, 2023; originally announced September 2023.

Comments: 33 pages, 23 figures, submitted to ApJ

arXiv:2309.08794 [pdf, other]

Privacy-preserving Early Detection of Epileptic Seizures in Videos

Authors: Deval Mehta, Shobi Sivathamboo, Hugh Simpson, Patrick Kwan, Terence O'Brien, Zongyuan Ge

Abstract: In this work, we contribute towards the development of video-based epileptic seizure classification by introducing a novel framework (SETR-PKD), which could achieve privacy-preserved early detection of seizures in videos. Specifically, our framework has two significant components - (1) It is built upon optical flow features extracted from the video of a seizure, which encodes the seizure motion semiotics while preserving the privacy of the patient; (2) It utilizes a transformer based progressive knowledge distillation, where the knowledge is gradually distilled from networks trained on a longer portion of video samples to the ones which will operate on shorter portions. Thus, our proposed framework addresses the limitations of the current approaches which compromise the privacy of the patients by directly operating on the RGB video of a seizure as well as impede real-time detection of a seizure by utilizing the full video sample to make a prediction. Our SETR-PKD framework could detect tonic-clonic seizures (TCSs) in a privacy-preserving manner with an accuracy of 83.9% while they are only half-way into their progression. Our data and code are available at https://github.com/DevD1092/seizure-detection

Submitted 15 September, 2023; originally announced September 2023.

Comments: Accepted to MICCAI 2023

arXiv:2309.06884 [pdf, other]

Autoencoder-Based Visual Anomaly Localization for Manufacturing Quality Control

Authors: Devang Mehta, Noah Klarmann

Abstract: Manufacturing industries require efficient and voluminous production of high-quality finished goods. In the context of Industry 4.0, visual anomaly detection poses an optimistic solution for automatically controlled product quality with high precision. In general, automation based on computer vision is a promising solution to prevent bottlenecks at the product quality checkpoint. We considered recent advancements in machine learning to improve visual defect localization, but challenges persist in obtaining a balanced feature set and database of the wide variety of defects occurring in the production line. Hence, this paper proposes a defect localizing autoencoder with unsupervised class selection by clustering with k-means the features extracted from a pre-trained VGG16 network. Moreover, the selected classes of defects are augmented with natural wild textures to simulate artificial defects. The study demonstrates the effectiveness of the defect localizing autoencoder with unsupervised class selection for improving defect detection in manufacturing industries. The proposed methodology shows promising results with precise and accurate localization of quality defects on melamine-faced boards for the furniture industry. Incorporating artificial defects into the training data shows significant potential for practical implementation in real-world quality control scenarios.

Submitted 3 November, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

arXiv:2308.11197 [pdf]

doi 10.1044/2023_JSLHR-23-00273

Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting

Authors: Hamzeh Ghasemzadeh, Robert E. Hillman, Daryush D. Mehta

Abstract: This study's first purpose is to provide quantitative evidence that would incentivize researchers to instead use the more robust method of nested cross-validation. The second purpose is to present methods and MATLAB codes for doing power analysis for ML-based analysis during the design of a study. Monte Carlo simulations were used to quantify the interactions between the employed cross-validation method, the discriminative power of features, the dimensionality of the feature space, and the dimensionality of the model. Four different cross-validations (single holdout, 10-fold, train-validation-test, and nested 10-fold) were compared based on the statistical power and statistical confidence of the ML models. Distributions of the null and alternative hypotheses were used to determine the minimum required sample size for obtaining a statistically significant outcome (α=0.05, 1-β=0.8). Statistical confidence of the model was defined as the probability of correct features being selected and hence being included in the final model. Our analysis showed that the model generated based on the single holdout method had very low statistical power and statistical confidence and that it significantly overestimated the accuracy. Conversely, the nested 10-fold cross-validation resulted in the highest statistical confidence and the highest statistical power, while providing an unbiased estimate of the accuracy. The required sample size with a single holdout could be 50% higher than what would be needed if nested cross-validation were used. Confidence in the model based on nested cross-validation was as much as four times higher than the confidence in the single holdout-based model. A computational model, MATLAB codes, and lookup tables are provided to assist researchers with estimating the sample size during the design of their future studies.

Submitted 22 December, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

Comments: Accepted at JSLHR

Journal ref: Journal of Speech, Language, and Hearing Research (JSLHR),Volume 67 Issue 3, March 2024, Pages 753-781
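
Illustrative note: the nested k-fold scheme named in the abstract above can be sketched as index bookkeeping. The snippet below is a minimal Python sketch and not the authors' MATLAB code; the function names `k_fold_indices` and `nested_cv_splits` are hypothetical, folds are contiguous and unshuffled for simplicity, and model fitting is left out entirely. It only shows the property that distinguishes nested cross-validation: each outer test fold is never seen by the inner loop that would select features or hyperparameters.

```python
def k_fold_indices(n, k):
    """Split range(n) into k contiguous folds of near-equal size."""
    sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def nested_cv_splits(n, outer_k=10, inner_k=10):
    """Yield (outer_train, outer_test, inner_splits) for nested k-fold CV.

    inner_splits is a list of (train, validation) index lists drawn only
    from outer_train, so outer_test stays untouched during model selection.
    """
    for outer_test in k_fold_indices(n, outer_k):
        held_out = set(outer_test)
        outer_train = [i for i in range(n) if i not in held_out]
        inner_splits = []
        for fold in k_fold_indices(len(outer_train), inner_k):
            validation = [outer_train[j] for j in fold]
            validation_set = set(validation)
            train = [i for i in outer_train if i not in validation_set]
            inner_splits.append((train, validation))
        yield outer_train, outer_test, inner_splits
```

In use, the inner splits would drive feature or hyperparameter selection, and the chosen configuration would then be scored once on the matching outer test fold; averaging those outer scores gives the accuracy estimate.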

arXiv:2308.08031 [pdf, other]

Company Similarity using Large Language Models

Authors: Dimitrios Vamvourellis, Máté Toth, Snigdha Bhagat, Dhruv Desai, Dhagash Mehta, Stefano Pasquali

Abstract: Identifying companies with similar profiles is a core task in finance with a wide range of applications in portfolio construction, asset pricing and risk attribution. When a rigorous definition of similarity is lacking, financial analysts usually resort to 'traditional' industry classifications such as Global Industry Classification System (GICS) which assign a unique category to each company at different levels of granularity. Due to their discrete nature, though, GICS classifications do not allow for ranking companies in terms of similarity. In this paper, we explore the ability of pre-trained and finetuned large language models (LLMs) to learn company embeddings based on the business descriptions reported in SEC filings. We show that we can reproduce GICS classifications using the embeddings as features. We also benchmark these embeddings on various machine learning and financial metrics and conclude that the companies that are similar according to the embeddings are also similar in terms of financial performance metrics including return correlation.

Submitted 15 August, 2023; originally announced August 2023.

Comments: 8 pages, 2 figures, 2 tables

arXiv:2308.06882 [pdf, other]

Quantifying Outlierness of Funds from their Categories using Supervised Similarity

Authors: Dhruv Desai, Ashmita Dhiman, Tushar Sharma, Deepika Sharma, Dhagash Mehta, Stefano Pasquali

Abstract: Mutual fund categorization has become a standard tool for the investment management industry and is extensively used by allocators for portfolio construction and manager selection, as well as by fund managers for peer analysis and competitive positioning. As a result, a (unintended) miscategorization or lack of precision can significantly impact allocation decisions and investment fund managers. Here, we aim to quantify the effect of miscategorization of funds utilizing a machine learning based approach. We formulate the problem of miscategorization of funds as a distance-based outlier detection problem, where the outliers are the data-points that are far from the rest of the data-points in the given feature space. We implement and employ a Random Forest (RF) based method of distance metric learning, and compute the so-called class-wise outlier measures for each data-point to identify outliers in the data. We test our implementation on various publicly available data sets, and then apply it to mutual fund data. We show that there is a strong relationship between the outlier measures of the funds and their future returns and discuss the implications of our findings.

Submitted 13 August, 2023; originally announced August 2023.

Comments: 8 pages, 5 tables, 8 figures

arXiv:2305.18703 [pdf, other]

Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

Authors: Chen Ling, Xujiang Zhao, Jiaying Lu, Chengyuan Deng, Can Zheng, Junxiang Wang, Tanmoy Chowdhury, Yun Li, Hejie Cui, Xuchao Zhang, Tianjiao Zhao, Amit Panalkar, Dhagash Mehta, Stefano Pasquali, Wei Cheng, Haoyu Wang, Yanchi Liu, Zhengzhang Chen, Haifeng Chen, Chris White, Quanquan Gu, Jian Pei, Carl Yang, Liang Zhao

Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of domain objectives, and the diversity of the constraints (e.g., various social norms, cultural conformity, religious beliefs, and ethical standards in the domain applications). Domain specification techniques are key to make large language models disruptive in many applications. Specifically, to solve these hurdles, there has been a notable increase in research and practices conducted in recent years on the domain specialization of LLMs. This emerging field of study, with its substantial potential for impact, necessitates a comprehensive and systematic review to better summarize and guide ongoing work in this area. In this article, we present a comprehensive survey on domain specification techniques for large language models, an emerging direction critical for large language model applications. First, we propose a systematic taxonomy that categorizes the LLM domain-specialization techniques based on the accessibility to LLMs and summarizes the framework for all the subcategories as well as their relations and differences to each other. Second, we present an extensive taxonomy of critical application domains that can benefit dramatically from specialized LLMs, discussing their practical significance and open challenges. Last, we offer our insights into the current research status and future trends in this area.

Submitted 29 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

arXiv:2305.00696 [pdf, other]

TPMIL: Trainable Prototype Enhanced Multiple Instance Learning for Whole Slide Image Classification

Authors: Litao Yang, Deval Mehta, Sidong Liu, Dwarikanath Mahapatra, Antonio Di Ieva, Zongyuan Ge

Abstract: Digital pathology based on whole slide images (WSIs) plays a key role in cancer diagnosis and clinical practice. Due to the high resolution of the WSI and the unavailability of patch-level annotations, WSI classification is usually formulated as a weakly supervised problem, which relies on multiple instance learning (MIL) based on patches of a WSI. In this paper, we aim to learn an optimal patch-level feature space by integrating prototype learning with MIL. To this end, we develop a Trainable Prototype enhanced deep MIL (TPMIL) framework for weakly supervised WSI classification. In contrast to the conventional methods which rely on a certain number of selected patches for feature space refinement, we softly cluster all the instances by allocating them to their corresponding prototypes. Additionally, our method is able to reveal the correlations between different tumor subtypes through distances between corresponding trained prototypes. More importantly, TPMIL also provides more accurate interpretability based on the distance of the instances from the trained prototypes, which serves as an alternative to the conventional attention score-based interpretability. We test our method on two WSI datasets and it achieves a new SOTA. GitHub repository: https://github.com/LitaoYang-Jet/TPMIL

Submitted 1 May, 2023; originally announced May 2023.

Comments: Accepted for MIDL 2023

arXiv:2304.14497 [pdf]

Vehicle Safety Management System

Authors: Chanthini Bhaskar, Bharath Manoj Nair, Dev Mehta

Abstract: Overtaking is a critical maneuver in driving that requires accurate information about the location and distance of other vehicles on the road. This study suggests a real-time overtaking assistance system that uses a combination of the You Only Look Once (YOLO) object detection algorithm and stereo vision techniques to accurately identify and locate vehicles in front of the driver, and estimate their distance. The system then signals the vehicles behind the driver using colored lights to inform them of the safe overtaking distance. The proposed system has been implemented using Stereo vision for distance analysis and You Only Look Once (YOLO) for object identification. The results demonstrate its effectiveness in providing vehicle type and the distance between the camera module and the vehicle accurately with an approximate error of 4.107%. Our system has the potential to reduce the risk of accidents and improve the safety of overtaking maneuvers, especially on busy highways and roads.

Submitted 16 April, 2023; originally announced April 2023.
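
Illustrative note: the abstract above does not state how distance is derived from the stereo pair. A common choice, assumed here and not confirmed by the source, is the rectified pinhole relation Z = f·B/d, where f is the focal length in pixels, B the camera baseline in metres, and d the disparity in pixels. The Python sketch below uses hypothetical function names (`stereo_depth`, `percent_error`) and made-up camera parameters purely to show the arithmetic, including the kind of percentage error the abstract reports.

```python
def stereo_depth(focal_px, baseline_m, disparity_px):
    """Depth in metres from the rectified-stereo relation Z = f * B / d.

    Assumes a calibrated, rectified camera pair; disparity must be positive.
    """
    if disparity_px <= 0:
        raise ValueError("disparity must be positive for a finite depth")
    return focal_px * baseline_m / disparity_px


def percent_error(estimated, reference):
    """Absolute percentage error of an estimate against a reference value."""
    return abs(estimated - reference) / abs(reference) * 100.0


# Hypothetical numbers for illustration only (not taken from the paper).
depth_m = stereo_depth(focal_px=700.0, baseline_m=0.12, disparity_px=8.4)
error_pct = percent_error(estimated=depth_m, reference=9.6)
```

Because depth scales with 1/d, a fixed disparity error of one pixel matters far more for distant vehicles than for near ones, which is one reason such systems usually report error as a percentage.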

arXiv:2211.16172 [pdf, other]

Learnings from Technological Interventions in a Low Resource Language: Enhancing Information Access in Gondi

Authors: Devansh Mehta, Harshita Diddee, Ananya Saxena, Anurag Shukla, Sebastin Santy, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava, Alok Sharma, Vishnu Prasad, Venkanna U, Kalika Bali

Abstract: The primary obstacle to developing technologies for low-resource languages is the lack of representative, usable data. In this paper, we report the deployment of technology-driven data collection methods for creating a corpus of more than 60,000 translations from Hindi to Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. During this process, we help expand information access in Gondi across 2 different dimensions: (a) The creation of linguistic resources that can be used by the community, such as a dictionary, children's stories, Gondi translations from multiple sources and an Interactive Voice Response (IVR) based mass awareness platform; (b) Enabling its use in the digital domain by developing a Hindi-Gondi machine translation model, which is compressed by nearly 4 times to enable its deployment on low-resource edge devices and in areas of little to no internet connectivity. We also present preliminary evaluations of utilizing the developed machine translation model to provide assistance to volunteers who are involved in collecting more data for the target language. Through these interventions, we not only created a refined and evaluated corpus of 26,240 Hindi-Gondi translations that was used for building the translation model but also engaged nearly 850 community members who can help take Gondi onto the internet.

Submitted 29 November, 2022; originally announced November 2022.

Comments: In Submission (Revised) to Language Resources and Evaluation Journal. arXiv admin note: text overlap with arXiv:2004.10270

arXiv:2211.05230 [pdf]

Multimodal Optical Techniques in Pre-Clinical Evaluation of Oral Cancer: Fluorescence Imaging and Spectroscopic Devices

Authors: Pramila Thapa, Veena Singh, Virendra Kumar, Sunil Bhatt, Kiran Maurya, Vivek Nayyar, Kiran Jot, Deepika Mishra, Anurag Shrivastava, Dalip Singh Mehta

Abstract: Objective: Survival rate of oral squamous cell carcinoma (OSCC) patients is very poor and can be improved using highly sensitive, specific and accurate techniques. Autofluorescence and fluorescence techniques are very sensitive and useful in cancer screening. Furthermore, fluorescence spectroscopy is directly linked with molecular levels of human tissue and can be used as a quantitative tool for cancer detection. Materials and Methods: Here, we report development of multi-modal autofluorescence and fluorescence imaging and spectroscopic (MAF-IS) smartphone-based systems for fast and real time oral cancer screening. Fluorescence-autofluorescence images and spectroscopic datasets show significant change in oral cancer and normal tissue in terms of fluorescence-intensity, spectral-shape, and red-shift respectively. Results: In this study, a total of 68 samples (33 cancerous and 35 normal) of 18 OSCC patients and 13 patients of precancerous tissues (dysplasia and fibrosis) are screened. The main finding of the study is the presence of three peaks, viz. ~636 nm, ~680 nm and ~705 nm, with a decrease in intensity around 450 nm to 520 nm in OSCC in the case of autofluorescence. Another finding is a red shift in fluorescence spectroscopy of OSCC, dysplasia and fibrosis from normal, which is 6.59 ± 4.54 nm, 3 ± 4.78 nm and 1.5 ± 0.5 nm respectively and can be used as a cancer marker in real-time screening. Finally, a support vector machine (SVM) based classifier is applied for classification of OSCC tissue from normal tissue. The average sensitivity, specificity and accuracy are found to be 88.89%, 100%, and 95%, respectively. Conclusion: Autofluorescence and fluorescence-based imaging and spectroscopy is used for pre-clinical screening of different oral lesions.

Submitted 3 November, 2022; originally announced November 2022.

Comments: 12 pages, 7 figures

arXiv:2209.04406 [pdf, other]

Longitudinal Acoustic Speech Tracking Following Pediatric Traumatic Brain Injury

Authors: Camille Noufi, Adam C. Lammert, Daryush D. Mehta, James R. Williamson, Gregory Ciccarelli, Douglas Sturim, Jordan R. Green, Thomas F. Quatieri, Thomas F. Campbell

Abstract: Recommendations for common outcome measures following pediatric traumatic brain injury (TBI) support the integration of instrumental measurements alongside perceptual assessment in recovery and treatment plans. A comprehensive set of sensitive, robust and non-invasive measurements is therefore essential in assessing variations in speech characteristics over time following pediatric TBI. In this article, we study the changes in the acoustic speech patterns of a pediatric cohort of ten subjects diagnosed with severe TBI. We extract a diverse set of both well-known and novel acoustic features from child speech recorded throughout the year after the child produced intelligible words. These features are analyzed individually and by speech subsystem, within-subject and across the cohort. As a group, older children exhibit highly significant (p<0.01) increases in pitch variation and phoneme diversity, shortened pause length, and steadying articulation rate variability. Younger children exhibit similar steadied rate variability alongside an increase in formant-based articulation complexity. Correlation analysis of the feature set with age and comparisons to normative developmental data confirm that age at injury plays a significant role in framing the recovery trajectory. Nearly all speech features significantly change (p<0.05) for the cohort as a whole, confirming that acoustic measures supplementing perceptual assessment are needed to identify efficacious treatment targets for speech therapy following TBI.

Submitted 9 September, 2022; originally announced September 2022.

arXiv:2208.13318 [pdf, other]

Multi-dimensional Racism Classification during COVID-19: Stigmatization, Offensiveness, Blame, and Exclusion

Authors: Xin Pei, Deval Mehta

Abstract: Transcending the binary categorization of racist texts, our study takes cues from social science theories to develop a multi-dimensional model for racism detection, namely stigmatization, offensiveness, blame, and exclusion. With the aid of BERT and topic modeling, this categorical detection enables insights into the underlying subtlety of racist discussion on digital platforms during COVID-19. Our study contributes to enriching the scholarly discussion on deviant racist behaviours on social media. First, a stage-wise analysis is applied to capture the dynamics of the topic changes across the early stages of COVID-19 which transformed from a domestic epidemic to an international public health emergency and later to a global pandemic. Furthermore, mapping this trend enables a more accurate prediction of public opinion evolvement concerning racism in the offline world, and meanwhile, the enactment of specified intervention strategies to combat the upsurge of racism during the global public health crisis like COVID-19. In addition, this interdisciplinary research also points out a direction for future studies on social network analysis and mining. Integration of social science perspectives into the development of computational methods provides insights into more accurate data detection and analytics.

Submitted 28 August, 2022; originally announced August 2022.

Comments: Social Network Analysis and Mining (accepted, 2022). arXiv admin note: substantial text overlap with arXiv:2107.08347

arXiv:2208.10639 [pdf, other]

Evaluating Cardiovascular Surgical Planning in Mobile Augmented Reality

Authors: Haoyang Yang, Pratham Darrpan Mehta, Jonathan Leo, Zhiyan Zhou, Megan Dass, Anish Upadhayay, Timothy C. Slesnick, Fawwaz Shaw, Amanda Randles, Duen Horng Chau

Abstract: Advanced surgical procedures for congenital heart diseases (CHDs) require precise planning before the surgeries. The conventional approach utilizes 3D-printing and cutting physical heart models, which is a time and resource intensive process. While rapid advances in augmented reality (AR) technologies have the potential to streamline surgical planning, there is limited research that evaluates such AR approaches with medical experts. This paper presents an evaluation with 6 experts, 4 cardiothoracic surgeons, and 2 cardiologists, from Children's Healthcare of Atlanta (CHOA) Heart Center to validate the usability and technical innovations of CardiacAR, a prototype mobile AR surgical planning application. Potential future improvements based on user feedback are also proposed to further improve the design of CardiacAR and broaden its access.

Submitted 22 August, 2022; originally announced August 2022.

Comments: IEEE VIS 2022. 2 pages, 1 figure

arXiv:2208.08331 [pdf, other]

Leukocyte Classification using Multimodal Architecture Enhanced by Knowledge Distillation

Authors: Litao Yang, Deval Mehta, Dwarikanath Mahapatra, Zongyuan Ge

Abstract: Recently, a lot of automated white blood cells (WBC) or leukocyte classification techniques have been developed. However, all of these methods only utilize a single modality microscopic image i.e. either blood smear or fluorescence based, thus missing the potential of a better learning from multimodal images. In this work, we develop an efficient multimodal architecture based on a first of its kind multimodal WBC dataset for the task of WBC classification. Specifically, our proposed idea is developed in two steps - 1) First, we learn modality specific independent subnetworks inside a single network only; 2) We further enhance the learning capability of the independent subnetworks by distilling knowledge from high complexity independent teacher networks. With this, our proposed framework can achieve a high performance while maintaining low complexity for a multimodal dataset. Our unique contribution is two-fold - 1) We present a first of its kind multimodal WBC dataset for WBC classification; 2) We develop a high performing multimodal architecture which is also efficient and low in complexity at the same time.

Submitted 17 August, 2022; originally announced August 2022.

Comments: Accepted to MICCAI 2022 workshop - MOVI2022

arXiv:2207.07183 [pdf, other]

Learning Embedded Representation of the Stock Correlation Matrix using Graph Machine Learning

Authors: Bhaskarjit Sarmah, Nayana Nair, Dhagash Mehta, Stefano Pasquali

Abstract: Understanding non-linear relationships among financial instruments has various applications in investment processes ranging from risk management, portfolio construction and trading strategies. Here, we focus on interconnectedness among stocks based on their correlation matrix which we represent as a network with the nodes representing individual stocks and the weighted links between pairs of nodes representing the corresponding pair-wise correlation coefficients. The traditional network science techniques, which are extensively utilized in financial literature, require handcrafted features such as centrality measures to understand such correlation networks. However, manually enlisting all such handcrafted features may quickly turn out to be a daunting task. Instead, we propose a new approach for studying nuances and relationships within the correlation network in an algorithmic way using a graph machine learning algorithm called Node2Vec. In particular, the algorithm compresses the network into a lower dimensional continuous space, called an embedding, where pairs of nodes that are identified as similar by the algorithm are placed closer to each other. By using log returns of S&P 500 stock data, we show that our proposed algorithm can learn such an embedding from its correlation network. We define various domain specific quantitative (and objective) and qualitative metrics that are inspired by metrics used in the field of Natural Language Processing (NLP) to evaluate the embeddings in order to identify the optimal one. Further, we discuss various applications of the embeddings in investment management.

Submitted 14 July, 2022; originally announced July 2022.

Comments: 8 pages, 2 column format, 3 figures, 7 tables

arXiv:2207.04959 [pdf, other]

Learning Mutual Fund Categorization using Natural Language Processing

Authors: Dimitrios Vamvourellis, Mate Attila Toth, Dhruv Desai, Dhagash Mehta, Stefano Pasquali

Abstract: Categorization of mutual funds or exchange-traded funds (ETFs) has long helped financial analysts perform peer analysis for various purposes, ranging from competitor analysis to quantifying portfolio diversification. The categorization methodology usually relies on fund composition data in the structured format extracted from the Form N-1A. Here, we initiate a study to learn the categorization system directly from the unstructured data as depicted in the forms using natural language processing (NLP). Posing this as a multi-class classification problem with the input data being only the investment strategy description as reported in the form and the target variable being the Lipper Global categories, and using various NLP models, we show that the categorization system can indeed be learned with high accuracy. We discuss implications and applications of our findings as well as limitations of existing pre-trained architectures in applying them to learn fund categorization.

Submitted 11 July, 2022; originally announced July 2022.

Comments: 8 pages, 5 figures, 2-column format

arXiv:2207.04368 [pdf, other]

Supervised similarity learning for corporate bonds using Random Forest proximities

Authors: Jerinsh Jeyapaulraj, Dhruv Desai, Peter Chu, Dhagash Mehta, Stefano Pasquali, Philip Sommer

Abstract: Financial literature consists of ample research on similarity and comparison of financial assets and securities such as stocks, bonds, mutual funds, etc. However, going beyond correlations or aggregate statistics has been arduous since financial datasets are noisy, lack useful features, have missing data and often lack ground truth or annotated labels. However, though similarity extrapolated from these traditional models heuristically may work well on an aggregate level, such as risk management when looking at large portfolios, they often fail when used for portfolio construction and trading which require a local and dynamic measure of similarity on top of a global measure. In this paper we propose a supervised similarity framework for corporate bonds which allows for inference based on both local and global measures. From a machine learning perspective, this paper emphasizes that random forest (RF), which is usually viewed as a supervised learning algorithm, can also be used as a similarity learning (more specifically, a distance metric learning) algorithm. In addition, this framework proposes a novel metric to evaluate similarities, and analyses other metrics which further demonstrate that RF outperforms all other methods tested in this work.

Submitted 25 October, 2022; v1 submitted 9 July, 2022; originally announced July 2022.

Comments: A few minor typos corrected, 1 figure added. Conclusions unchanged. Matching with the accepted version

arXiv:2206.15186 [pdf, other]

Out-of-Distribution Detection for Long-tailed and Fine-grained Skin Lesion Images

Authors: Deval Mehta, Yaniv Gal, Adrian Bowling, Paul Bonnington, Zongyuan Ge

Abstract: Recent years have witnessed a rapid development of automated methods for skin lesion diagnosis and classification. Due to an increasing deployment of such systems in clinics, it has become important to develop a more robust system towards various Out-of-Distribution (OOD) samples (unknown skin lesions and conditions). However, the current deep learning models trained for skin lesion classification tend to classify these OOD samples incorrectly into one of their learned skin lesion categories. To address this issue, we propose a simple yet strategic approach that improves the OOD detection performance while maintaining the multi-class classification accuracy for the known categories of skin lesion. Specifically, this approach is built upon a realistic scenario of a long-tailed and fine-grained OOD detection task for skin lesion images. Through this approach, 1) first, we target the mixup amongst middle and tail classes to address the long-tail problem, and 2) later, we combine the above mixup strategy with prototype learning to address the fine-grained nature of the dataset. The unique contribution of this paper is two-fold, justified by extensive experiments. First, we present a realistic problem setting of OOD task for skin lesion. Second, we propose an approach to target the long-tailed and fine-grained aspects of the problem setting simultaneously to increase the OOD performance.

Submitted 30 June, 2022; originally announced June 2022.

Comments: Accepted to MICCAI 2022 (top 13% paper; early accept)

arXiv:2206.08236 [pdf, other]

Simple and Efficient Architectures for Semantic Segmentation

Authors: Dushyant Mehta, Andrii Skliar, Haitam Ben Yahia, Shubhankar Borse, Fatih Porikli, Amirhossein Habibian, Tijmen Blankevoort

Abstract: Though the state-of-the-art architectures for semantic segmentation, such as HRNet, demonstrate impressive accuracy, the complexity arising from their salient design choices hinders a range of model acceleration tools, and further they make use of operations that are inefficient on current hardware. This paper demonstrates that a simple encoder-decoder architecture with a ResNet-like backbone and a small multi-scale head performs on-par or better than complex semantic segmentation architectures such as HRNet, FANet and DDRNets. Naively applying deep backbones designed for Image Classification to the task of Semantic Segmentation leads to sub-par results, owing to a much smaller effective receptive field of these backbones. Implicit among the various design choices put forth in works like HRNet, DDRNet, and FANet are networks with a large effective receptive field. It is natural to ask if a simple encoder-decoder architecture would compare favorably if comprised of backbones that have a larger effective receptive field, though without the use of inefficient operations like dilated convolutions. We show that with minor and inexpensive modifications to ResNets, enlarging the receptive field, very simple and competitive baselines can be created for Semantic Segmentation. We present a family of such simple architectures for desktop as well as mobile targets, which match or exceed the performance of complex models on the Cityscapes dataset. We hope that our work provides simple yet effective baselines for practitioners to develop efficient semantic segmentation models.

Submitted 16 June, 2022; originally announced June 2022.

Comments: To be presented at Efficient Deep Learning for Computer Vision Workshop at CVPR 2022

arXiv:2206.08009 [pdf, other]

Balancing Discriminability and Transferability for Source-Free Domain Adaptation

Authors: Jogendra Nath Kundu, Akshay Kulkarni, Suvaansh Bhambri, Deepesh Mehta, Shreyas Kulkarni, Varun Jampani, R. Venkatesh Babu

Abstract: Conventional domain adaptation (DA) techniques aim to improve domain transferability by learning domain-invariant representations; while concurrently preserving the task-discriminability knowledge gathered from the labeled source data. However, the requirement of simultaneous access to labeled source and unlabeled target renders them unsuitable for the challenging source-free DA setting. The trivial solution of realizing an effective original to generic domain mapping improves transferability but degrades task discriminability. Upon analyzing the hurdles from both theoretical and empirical standpoints, we derive novel insights to show that a mixup between original and corresponding translated generic samples enhances the discriminability-transferability trade-off while duly respecting the privacy-oriented source-free setting. A simple but effective realization of the proposed insights on top of the existing source-free DA approaches yields state-of-the-art performance with faster convergence. Beyond single-source, we also outperform multi-source prior-arts across both classification and semantic segmentation benchmarks.

Submitted 16 June, 2022; originally announced June 2022.

Comments: ICML 2022. Project page: https://sites.google.com/view/mixup-sfda

arXiv:2201.01517 [pdf]

Single-shot multispectral quantitative phase imaging using deep neural network

Authors: Sunil Bhatt, Ankit Butola, Anand Kumar, Pramila Thapa, Akshay Joshi, Neetu Singh, Krishna Agarwal, Dalip Singh Mehta

Abstract: Multi-spectral quantitative phase imaging (MS-QPI) is a cutting-edge label-free technique to determine the morphological changes, refractive index variations and spectroscopic information of the specimens. The bottleneck in implementing this technique to extract quantitative information is the need for more than two measurements for generating MS-QPI images. We propose a single-shot MS-QPI technique using a highly spatially sensitive digital holographic microscope assisted with a deep neural network (DNN). Our method first acquires the interferometric datasets corresponding to multiple wavelengths (λ=532, 633 and 808 nm used here). The acquired datasets are used to train a generative adversarial network (GAN) to generate multi-spectral quantitative phase maps from a single input interferogram. The network is trained and validated on two different samples, an optical waveguide and MG63 osteosarcoma cells. Further, validation of the framework is performed by comparing the predicted phase maps with experimentally acquired and processed multi-spectral phase maps. The current MS-QPI+DNN framework can further empower spectroscopic QPI to improve the chemical specificity without complex instrumentation and color-cross talk.

Submitted 5 January, 2022; originally announced January 2022.

Comments: 8 pages, 5 figures

arXiv:2111.15629 [pdf, other]

DiPD: Disruptive event Prediction Dataset from Twitter

Authors: Sanskar Soni, Dev Mehta, Vinush Vishwanath, Aditi Seetha, Satyendra Singh Chouhan

Abstract: Riots and protests, if they get out of control, can cause havoc in a country. We have seen examples of this, such as the BLM movement, climate strikes, CAA Movement, and many more, which caused disruption to a large extent. Our motive behind creating this dataset was to use it to develop machine learning systems that can give their users insight into the trending events going on and alert them about the events that could lead to disruption in the nation. If any event starts going out of control, it can be handled and mitigated by monitoring it before the matter escalates. This dataset collects tweets of past or ongoing events known to have caused disruption and labels these tweets as 1. We also collect tweets that are considered non-eventful and label them as 0 so that they can also be used to train a classification system. The dataset contains 94855 records of unique events and 168706 records of unique non-events, thus giving the total dataset 263561 records. We extract multiple features from the tweets, such as the user's follower count and the user's location, to understand the impact and reach of the tweets. This dataset might be useful in various event related machine learning problems such as event classification, event recognition, and so on.

Submitted 25 November, 2021; originally announced November 2021.

arXiv:2109.05047 [pdf, other]

PAC Mode Estimation using PPR Martingale Confidence Sequences

Authors: Shubham Anand Jain, Rohan Shah, Sanit Gupta, Denil Mehta, Inderjeet Jayakumar Nair, Jian Vora, Sushil Khyalia, Sourav Das, Vinay J. Ribeiro, Shivaram Kalyanakrishnan

Abstract: We consider the problem of correctly identifying the \textit{mode} of a discrete distribution $\mathcal{P}$ with sufficiently high probability by observing a sequence of i.i.d. samples drawn from $\mathcal{P}$. This problem reduces to the estimation of a single parameter when $\mathcal{P}$ has a support set of size $K = 2$. After noting that this special case is tackled very well by prior-posterior-ratio (PPR) martingale confidence sequences \citep{waudby-ramdas-ppr}, we propose a generalisation to mode estimation, in which $\mathcal{P}$ may take $K \geq 2$ values. To begin, we show that the "one-versus-one" principle to generalise from $K = 2$ to $K \geq 2$ classes is more efficient than the "one-versus-rest" alternative. We then prove that our resulting stopping rule, denoted PPR-1v1, is asymptotically optimal (as the mistake probability is taken to $0$). PPR-1v1 is parameter-free and computationally light, and incurs significantly fewer samples than competitors even in the non-asymptotic regime. We demonstrate its gains in two practical applications of sampling: election forecasting and verification of smart contracts in blockchains.

Submitted 11 April, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

arXiv:2107.08347 [pdf, other]

Beyond a binary of (non)racist tweets: A four-dimensional categorical detection and analysis of racist and xenophobic opinions on Twitter in early Covid-19

Authors: Xin Pei, Deval Mehta

Abstract: Transcending the binary categorization of racist and xenophobic texts, this research takes cues from social science theories to develop a four dimensional category for racism and xenophobia detection, namely stigmatization, offensiveness, blame, and exclusion. With the aid of deep learning techniques, this categorical detection enables insights into the nuances of emergent topics reflected in racist and xenophobic expression on Twitter. Moreover, a stage wise analysis is applied to capture the dynamic changes of the topics across the stages of early development of Covid-19 from a domestic epidemic to an international public health emergency, and later to a global pandemic. The main contributions of this research include, first, the methodological advancement. By bridging the state-of-the-art computational methods with a social science perspective, this research provides a meaningful approach for future research to gain insight into the underlying subtlety of racist and xenophobic discussion on digital platforms. Second, by enabling a more accurate comprehension and even prediction of public opinions and actions, this research paves the way for the enactment of effective intervention policies to combat racist crimes and social exclusion under Covid-19.

Submitted 17 July, 2021; originally announced July 2021.

arXiv:2107.07452 [pdf, other]

GI-NNet \& RGI-NNet: Development of Robotic Grasp Pose Models, Trainable with Large as well as Limited Labelled Training Datasets, under supervised and semi supervised paradigms

Authors: Priya Shukla, Nilotpal Pramanik, Deepesh Mehta, G. C. Nandi

Abstract: Our way of grasping objects is challenging for efficient, intelligent and optimal grasp by COBOTs. To streamline the process, here we use deep learning techniques to help robots learn to generate and execute appropriate grasps quickly. We developed a Generative Inception Neural Network (GI-NNet) model, capable of generating antipodal robotic grasps on seen as well as unseen objects. It is trained on Cornell Grasping Dataset (CGD) and attained 98.87% grasp pose accuracy for detecting both regular and irregular shaped objects from RGB-Depth (RGB-D) images while requiring only one third of the network trainable parameters as compared to the existing approaches. However, to attain this level of performance the model requires the entire 90% of the available labelled data of CGD, keeping only 10% labelled data for testing, which makes it vulnerable to poor generalization. Furthermore, getting a sufficient and quality labelled dataset is becoming increasingly difficult while keeping pace with the requirement of gigantic networks. To address these issues, we attach our model as a decoder with a semi-supervised learning based architecture known as Vector Quantized Variational Auto Encoder (VQVAE), which works efficiently when trained both with the available labelled and unlabelled data. The proposed model, which we name as Representation based GI-NNet (RGI-NNet), has been trained with various splits of label data on CGD, from as little as 10% labelled data together with latent embedding generated from VQVAE up to 50% labelled data with latent embedding obtained from VQVAE. The performance level, in terms of grasp pose accuracy of RGI-NNet, varies between 92.13% and 95.6%, which is far better than several existing models trained with only labelled dataset. For the performance verification of both GI-NNet and RGI-NNet models, we use Anukul (Baxter) hardware cobot.

Submitted 15 July, 2021; originally announced July 2021.

arXiv:2107.05592 [pdf, other]

Investor Behavior Modeling by Analyzing Financial Advisor Notes: A Machine Learning Perspective

Authors: Cynthia Pagliaro, Dhagash Mehta, Han-Tai Shiao, Shaofei Wang, Luwei Xiong

Abstract: Modeling investor behavior is crucial to identifying behavioral coaching opportunities for financial advisors. With the help of natural language processing (NLP) we analyze an unstructured (textual) dataset of financial advisors' summary notes, taken after every investor conversation, to gain first ever insights into advisor-investor interactions. These insights are used to predict investor needs during adverse market conditions; thus allowing advisors to coach investors and help avoid inappropriate financial decision-making. First, we perform topic modeling to gain insight into the emerging topics and trends. Based on this insight, we construct a supervised classification model to predict the probability that an advised investor will require behavioral coaching during volatile market periods. To the best of our knowledge, ours is the first work on exploring the advisor-investor relationship using unstructured data. This work may have far-reaching implications for both traditional and emerging financial advisory service models like robo-advising.

Submitted 12 July, 2021; originally announced July 2021.

Comments: 8 pages, 2 column format, 7 figures+5 tables

arXiv:2106.12987 [pdf, other]

Fund2Vec: Mutual Funds Similarity using Graph Learning

Authors: Vipul Satone, Dhruv Desai, Dhagash Mehta

Abstract: Identifying similar mutual funds with respect to the underlying portfolios has found many applications in financial services ranging from fund recommender systems, competitors analysis, portfolio analytics, marketing and sales, etc. The traditional methods are either qualitative, and hence prone to biases and often not reproducible, or, are known not to capture all the nuances (non-linearities) among the portfolios from the raw data. We propose a radically new approach to identify similar funds based on the weighted bipartite network representation of funds and their underlying assets data using a sophisticated machine learning method called Node2Vec which learns an embedded low-dimensional representation of the network. We call the embedding \emph{Fund2Vec}. Ours is the first ever study of the weighted bipartite network representation of the funds-assets network in its original form that identifies structural similarity among portfolios as opposed to merely portfolio overlaps.

Submitted 24 June, 2021; originally announced June 2021.

Comments: 2 column format, 8 pages, 8 figures, 5 tables

arXiv:2106.06914 [pdf, other]

Understanding the L-shell ionization mechanism through osmium atoms bombarded by 4-6 MeV/u fluorine ions

Authors: Soumya Chatterjee, Sunil Kumar, Sarvesh Kumar, M Oswal, Biraja Mohanty, D. Mehta, D. Mitra, A. M. P. Mendez, D. M. Mitnik, C. C. Montanari, L. Sarkadi, T. Nandi

Abstract: The L-subshell ionization mechanism is studied in an ultra-thin osmium target bombarded by 4-6 MeV/u fluorine ions. Multiple ionization effects in the collisions are considered through the change of fluorescence and Coster-Kronig yields while determining L-subshell ionization cross sections from L-line x-ray production cross sections. The L-subshell ionization, as well as L-shell x-ray production cross sections so obtained, are compared with various theoretical approximations. The Coulomb direct ionization contribution is studied by (i) the relativistic semi-classical approximations (RSCA), (ii) the shellwise local plasma approximation (SLPA), and (iii) the ECUSAR theory, along with the inclusion of the vacancy sharing among the subshells by the coupled-states model (CSM) and the electron capture (EC) by a standard formalism. We find that the ECUSAR-CSM-EC describes the measured excitation function curves the best. However, the theoretical calculations are still about a factor of two smaller than the measured values. Such differences are resolved by re-evaluating the fluorescence and the Coster-Kronig yields. This work demonstrates that, in the present energy range, the heavy-ion induced inner-shell ionization of heavy atoms can be understood by combining the basic mechanisms of the direct Coulomb ionization, the electron capture, the multiple ionization, and the vacancy sharing among subshells, together with optimized atomic parameters.

Submitted 13 June, 2021; originally announced June 2021.

Comments: 11 pages, 10 figures

arXiv:2104.05926 [pdf]

doi 10.1038/s41467-022-29320-6

An Adaptive Synaptic Array using Fowler-Nordheim Dynamic Analog Memory

Authors: Darshit Mehta, Kenji Aono, Shantanu Chakrabartty

Abstract: In this paper we present a synaptic array that uses dynamical states to implement an analog memory for energy-efficient training of machine learning (ML) systems. Each of the analog memory elements is a micro-dynamical system that is driven by the physics of Fowler-Nordheim (FN) quantum tunneling, whereas the system level learning modulates the state trajectory of the memory ensembles towards the optimal solution. We show that the extrinsic energy required for modulation can be matched to the dynamics of learning and weight decay leading to a significant reduction in the energy-dissipated during ML training. With the energy-dissipation as low as 5 fJ per memory update and a programming resolution up to 14 bits, the proposed synapse array could be used to address the energy-efficiency imbalance between the training and the inference phases observed in artificial intelligence (AI) systems.

Submitted 13 April, 2021; originally announced April 2021.

Comments: 22 pages (incl. 7 supplementary pages), 11 figures (incl. 6 supplementary figures)

arXiv:2104.04650 [pdf, other]

Towards Automated and Marker-less Parkinson Disease Assessment: Predicting UPDRS Scores using Sit-stand videos

Authors: Deval Mehta, Umar Asif, Tian Hao, Erhan Bilal, Stefan Von Cavallar, Stefan Harrer, Jeffrey Rogers

Abstract: This paper presents a novel deep learning enabled, video based analysis framework for assessing the Unified Parkinson's Disease Rating Scale (UPDRS) that can be used in the clinic or at home. We report results from comparing the performance of the framework to that of trained clinicians on a population of 32 Parkinson's disease (PD) patients. In-person clinical assessments by trained neurologists are used as the ground truth for training our framework and for comparing the performance. We find that the standard sit-to-stand activity can be used to evaluate the UPDRS sub-scores of bradykinesia (BRADY) and posture instability and gait disorders (PIGD). For BRADY we find F1-scores of 0.75 using our framework compared to 0.50 for the video based rater clinicians, while for PIGD we find 0.78 for the framework and 0.45 for the video based rater clinicians. We believe our proposed framework has potential to provide clinically acceptable end points of PD in greater granularity without imposing burdens on patients and clinicians, which empowers a variety of use cases such as passive tracking of PD progression in spaces such as nursing homes, in-home self-assessment, and enhanced tele-medicine.

Submitted 9 April, 2021; originally announced April 2021.

Comments: Accepted by CVPR Workshops 2021

arXiv:2102.06837 [pdf, other]

Learning Speech-driven 3D Conversational Gestures from Video

Authors: Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Lingjie Liu, Hans-Peter Seidel, Gerard Pons-Moll, Mohamed Elgharib, Christian Theobalt

Abstract: We propose the first approach to automatically and jointly synthesize both the synchronous 3D conversational body and hand gestures, as well as 3D face and head animations, of a virtual character from speech input. Our algorithm uses a CNN architecture that leverages the inherent correlation between facial expression and hand gestures. Synthesis of conversational body gestures is a multi-modal problem since many similar gestures can plausibly accompany the same input speech. To synthesize plausible body gestures in this setting, we train a Generative Adversarial Network (GAN) based model that measures the plausibility of the generated sequences of 3D body motion when paired with the input audio features. We also contribute a new way to create a large corpus of more than 33 hours of annotated body, hand, and face data from in-the-wild videos of talking people. To this end, we apply state-of-the-art monocular approaches for 3D body and hand pose estimation as well as dense 3D face performance capture to the video corpus. In this way, we can train on orders of magnitude more data than previous algorithms that resort to complex in-studio motion capture solutions, and thereby train more expressive synthesis algorithms. Our experiments and user study show the state-of-the-art quality of our speech-synthesized full 3D character animations.

Submitted 12 February, 2021; originally announced February 2021.

arXiv:2101.04104 [pdf, other]

Neural Re-Rendering of Humans from a Single Image

Authors: Kripasindhu Sarkar, Dushyant Mehta, Weipeng Xu, Vladislav Golyanik, Christian Theobalt

Abstract: Human re-rendering from a single image is a starkly under-constrained problem, and state-of-the-art algorithms often exhibit undesired artefacts, such as over-smoothing, unrealistic distortions of the body parts and garments, or implausible changes of the texture. To address these challenges, we propose a new method for neural re-rendering of a human under a novel user-defined pose and viewpoint, given one input image. Our algorithm represents body pose and shape as a parametric mesh which can be reconstructed from a single image and easily reposed. Instead of a colour-based UV texture map, our approach further employs a learned high-dimensional UV feature map to encode appearance. This rich implicit representation captures detailed appearance variation across poses, viewpoints, person identities and clothing styles better than learned colour texture maps. The body model with the rendered feature maps is fed through a neural image-translation network that creates the final rendered colour image. The above components are combined in an end-to-end-trained neural network architecture that takes as input a source person image, and images of the parametric body model in the source pose and desired target pose. Experimental evaluation demonstrates that our approach produces higher quality single image re-rendering results than existing methods.

Submitted 11 January, 2021; originally announced January 2021.

Comments: Published in ECCV 2020

arXiv:2012.08859 [pdf, other]

Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces

Authors: Bert Moons, Parham Noorzad, Andrii Skliar, Giovanni Mariani, Dushyant Mehta, Chris Lott, Tijmen Blankevoort

Abstract: Current state-of-the-art Neural Architecture Search (NAS) methods neither efficiently scale to multiple hardware platforms, nor handle diverse architectural search-spaces. To remedy this, we present DONNA (Distilling Optimal Neural Network Architectures), a novel pipeline for rapid, scalable and diverse NAS, that scales to many user scenarios. DONNA consists of three phases. First, an accuracy predictor is built using blockwise knowledge distillation from a reference model. This predictor enables searching across diverse networks with varying macro-architectural parameters such as layer types and attention mechanisms, as well as across micro-architectural parameters such as block repeats and expansion rates. Second, a rapid evolutionary search finds a set of pareto-optimal architectures for any scenario using the accuracy predictor and on-device measurements. Third, optimal models are quickly finetuned to training-from-scratch accuracy. DONNA is up to 100x faster than MNasNet in finding state-of-the-art architectures on-device. Classifying ImageNet, DONNA architectures are 20% faster than EfficientNet-B0 and MobileNetV2 on a Nvidia V100 GPU and 10% faster with 0.5% higher accuracy than MobileNetV2-1.4x on a Samsung S20 smartphone. In addition to NAS, DONNA is used for search-space extension and exploration, as well as hardware-aware model compression.

Submitted 27 August, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

Comments: Accepted at ICCV2021. Main text 9 pages, Full text 21 pages, 18 figures

arXiv:2012.06674 [pdf]

High throughput spatially sensitive single-shot quantitative phase microscopy

Authors: Azeem Ahmad, Vishesh Dubey, Nikhil Jayakumar, Anowarul Habib, Ankit Butola, Mona Nystad, Ganesh Acharya, Purusotam Basnet, Dalip Singh Mehta, Balpreet Singh Ahluwalia

Abstract: High space-bandwidth product with high spatial phase sensitivity is indispensable for a single-shot quantitative phase microscopy (QPM) system. It opens avenue for widespread applications of QPM in the field of biomedical imaging. Temporally low coherence length light sources are generally implemented to achieve high spatial phase sensitivity in QPM at the cost of either reduced temporal resolution or smaller field of view (FOV). On the contrary, high temporal coherence light sources like lasers are capable of exploiting the full FOV of the QPM systems at the expense of less spatial phase sensitivity. In the present work, we employed pseudo-thermal light source (PTLS) in QPM which overcomes the limitations of conventional light sources. The capabilities of PTLS over conventional light sources are systematically studied and demonstrated on various test objects like USAF resolution chart and thin optical waveguide (height ~ 8 nm). The spatial phase sensitivity of QPM in case of PTLS is measured to be equivalent to that for white light source. The high-speed and large FOV capabilities of PTLS based QPM is demonstrated by high-speed imaging of live sperm cells that is limited by the camera speed and by imaging extra-ordinary large FOV phase imaging on histopathology placenta tissue samples.

Submitted 11 December, 2020; originally announced December 2020.

Comments: 15 pages, 7 figures

arXiv:2011.06557 [pdf, other]

A partition-based similarity for classification distributions

Authors: Hayden S. Helm, Ronak D. Mehta, Brandon Duderstadt, Weiwei Yang, Christoper M. White, Ali Geisa, Joshua T. Vogelstein, Carey E. Priebe

Abstract: Herein we define a measure of similarity between classification distributions that is both principled from the perspective of statistical pattern recognition and useful from the perspective of machine learning practitioners. In particular, we propose a novel similarity on classification distributions, dubbed task similarity, that quantifies how an optimally-transformed optimal representation for a source distribution performs when applied to inference related to a target distribution. The definition of task similarity allows for natural definitions of adversarial and orthogonal distributions. We highlight limiting properties of representations induced by (universally) consistent decision rules and demonstrate in simulation that an empirical estimate of task similarity is a function of the decision rule deployed for inference. We demonstrate that for a given target distribution, both transfer efficiency and semantic similarity of candidate source distributions correlate with empirical task similarity.

Submitted 12 November, 2020; originally announced November 2020.
