
Showing 1–50 of 99 results for author: Krishnan, A

Searching in archive cs.
  1. arXiv:2406.13842  [pdf, other]

    cs.CL cs.SD eess.AS

    Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control

    Authors: Alexander Blatt, Aravind Krishnan, Dietrich Klakow

    Abstract: Utilizing air-traffic control (ATC) data for downstream natural-language processing tasks requires preprocessing steps. Key steps are the transcription of the data via automatic speech recognition (ASR) and speaker diarization, respectively speaker role detection (SRD) to divide the transcripts into pilot and air-traffic controller (ATCO) transcripts. While traditional approaches take on these tas…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  2. arXiv:2406.09855  [pdf, other]

    cs.CL

    On the Encoding of Gender in Transformer-based ASR Representations

    Authors: Aravind Krishnan, Badr M. Abdullah, Dietrich Klakow

    Abstract: While existing literature relies on performance differences to uncover gender biases in ASR models, a deeper analysis is essential to understand how gender is encoded and utilized during transcript generation. This work investigates the encoding and utilization of gender in the latent representations of two transformer-based ASR models, Wav2Vec2 and HuBERT. Using linear erasure, we demonstrate the…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  3. arXiv:2406.09330  [pdf, other]

    cs.CL

    Learning from Natural Language Explanations for Generalizable Entity Matching

    Authors: Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C. Wallace, Chris Kong

    Abstract: Entity matching is the task of linking records from different sources that refer to the same real-world entity. Past work has primarily treated entity linking as a standard supervised learning problem. However, supervised entity matching models often do not generalize well to new data, and collecting exhaustive labeled training data is often cost prohibitive. Further, recent efforts have adopted L…

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.02597  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE

    CoNO: Complex Neural Operator for Continous Dynamical Physical Systems

    Authors: Karn Tiwari, N M Anoop Krishnan, A P Prathosh

    Abstract: Neural operators extend data-driven models to map between infinite-dimensional functional spaces. While these operators perform effectively in either the time or frequency domain, their performance may be limited when applied to non-stationary spatial or temporal signals whose frequency characteristics change with time. Here, we introduce Complex Neural Operator (CoNO) that parameterizes the integ…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Under Review

  5. arXiv:2406.01650  [pdf, other]

    q-bio.BM cs.AI cs.LG

    TAGMol: Target-Aware Gradient-guided Molecule Generation

    Authors: Vineeth Dorna, D. Subhalingam, Keshav Kolluru, Shreshth Tuli, Mrityunjay Singh, Saurabh Singal, N. M. Anoop Krishnan, Sayan Ranu

    Abstract: 3D generative models have shown significant promise in structure-based drug design (SBDD), particularly in discovering ligands tailored to specific target binding sites. Existing algorithms often focus primarily on ligand-target binding, characterized by binding affinity. Moreover, models trained solely on target-ligand distribution may fall short in addressing the broader objectives of drug disco…

    Submitted 3 June, 2024; originally announced June 2024.

  6. arXiv:2405.14567  [pdf, other]

    cs.LG

    EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records

    Authors: Adibvafa Fallahpour, Mahshid Alinoori, Arash Afkanpour, Amrit Krishnan

    Abstract: Transformers have significantly advanced the modeling of Electronic Health Records (EHR), yet their deployment in real-world healthcare is limited by several key challenges. Firstly, the quadratic computational cost and insufficient context length of these models pose significant obstacles for hospitals in processing the extensive medical histories typical in EHR data. Additionally, existing model…

    Submitted 23 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 17 Pages, 4 Figures

  7. arXiv:2405.12207  [pdf, other]

    cs.LG cs.IR

    Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search

    Authors: Sebastian Bruch, Aditya Krishnan, Franco Maria Nardini

    Abstract: Clustering-based nearest neighbor search is a simple yet effective method in which data points are partitioned into geometric shards to form an index, and only a few shards are searched during query processing to find an approximate set of top-$k$ vectors. Even though the search efficacy is heavily influenced by the algorithm that identifies the set of shards to probe, it has received little atten…

    Submitted 20 May, 2024; originally announced May 2024.

  8. arXiv:2404.13933  [pdf]

    cs.HC

    Comparison of On-Orbit Manual Attitude Control Methods for Non-Docking Spacecraft Through Virtual Reality Simulation

    Authors: Ajit Krishnan, Himanshu Vishwakarma, Maharudra Kharsade, Pradipta Biswas

    Abstract: On-orbit manual attitude control of manned spacecraft is accomplished using external visual references and some method of three axis attitude control. All past, present, and developmental spacecraft feature the capability to manually control attitude for deorbit. National Aeronautics and Space Administration (NASA) spacecraft permit an aircraft windshield type front view, wherein an arc of the Ear…

    Submitted 22 April, 2024; originally announced April 2024.

    ACM Class: H.5.2

  9. arXiv:2404.01475  [pdf, other]

    cs.LG cond-mat.mtrl-sci cs.AI physics.chem-ph

    Are large language models superhuman chemists?

    Authors: Adrian Mirza, Nawaf Alampara, Sreekanth Kunchapu, Benedict Emoekabu, Aswanth Krishnan, Mara Wilhelmi, Macjonathan Okereke, Juliane Eberhardt, Amir Mohammad Elahi, Maximilian Greiner, Caroline T. Holick, Tanya Gupta, Mehrdad Asgari, Christina Glaubitz, Lea C. Klepsch, Yannik Köster, Jakob Meyer, Santiago Miret, Tim Hoffmann, Fabian Alexander Kreth, Michael Ringleb, Nicole Roesner, Ulrich S. Schubert, Leanne M. Stafast, Dinga Wonanke, et al. (3 additional authors not shown)

    Abstract: Large language models (LLMs) have gained widespread interest due to their ability to process human language and perform tasks on which they have not been explicitly trained. This is relevant for the chemical sciences, which face the problem of small and diverse datasets that are frequently in the form of text. LLMs have shown promise in addressing these issues and are increasingly being harnessed…

    Submitted 1 April, 2024; originally announced April 2024.

  10. arXiv:2403.05591  [pdf, other]

    cs.HC cs.LG

    Data-Driven Ergonomic Risk Assessment of Complex Hand-intensive Manufacturing Processes

    Authors: Anand Krishnan, Xingjian Yang, Utsav Seth, Jonathan M. Jeyachandran, Jonathan Y. Ahn, Richard Gardner, Samuel F. Pedigo, Adriana Blom-Schieber, Ashis G. Banerjee, Krithika Manohar

    Abstract: Hand-intensive manufacturing processes, such as composite layup and textile draping, require significant human dexterity to accommodate task complexity. These strenuous hand motions often lead to musculoskeletal disorders and rehabilitation surgeries. We develop a data-driven ergonomic risk assessment system with a special focus on hand and finger activity to better identify and address ergonomic…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 26 pages, 7 figures

  11. arXiv:2402.14290  [pdf, other]

    cs.CL cs.LG

    CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language Generations

    Authors: Samraj Moorjani, Adit Krishnan, Hari Sundaram

    Abstract: As large-scale language models become the standard for text generation, there is a greater need to tailor the generations to be more or less concise, targeted, and informative, depending on the audience/application. Existing control approaches primarily adjust the semantic (e.g., emotion, topics), structural (e.g., syntax tree, parts-of-speech), and lexical (e.g., keyword/phrase inclusion) propert…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 16 pages, 3 figures, accepted into EACL 2024

  12. arXiv:2402.12394  [pdf, other]

    cs.HC cs.AI cs.LG eess.IV

    Improving Model's Interpretability and Reliability using Biomarkers

    Authors: Gautam Rajendrakumar Gare, Tom Fox, Beam Chansangavej, Amita Krishnan, Ricardo Luis Rodriguez, Bennett P deBoisblanc, Deva Kannan Ramanan, John Michael Galeotti

    Abstract: Accurate and interpretable diagnostic models are crucial in the safety-critical field of medicine. We investigate the interpretability of our proposed biomarker-based lung ultrasound diagnostic pipeline to enhance clinicians' diagnostic capabilities. The objective of this study is to assess whether explanations from a decision tree classifier, utilizing biomarkers, can improve users' ability to id…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted at BIAS 2023 Conference

  13. arXiv:2402.05200  [pdf, other]

    cond-mat.mtrl-sci cs.AI cs.CL cs.LG

    Are LLMs Ready for Real-World Materials Discovery?

    Authors: Santiago Miret, N M Anoop Krishnan

    Abstract: Large Language Models (LLMs) create exciting possibilities for powerful language processing tools to accelerate research in materials science. While LLMs have great potential to accelerate materials understanding and discovery, they currently fall short in being practical materials science tools. In this position paper, we show relevant failure cases of LLMs in materials science that reveal curren…

    Submitted 7 February, 2024; originally announced February 2024.

  14. arXiv:2312.05231  [pdf, other]

    cs.LG

    Modeling Risk in Reinforcement Learning: A Literature Mapping

    Authors: Leonardo Villalobos-Arias, Derek Martin, Abhijeet Krishnan, Madeleine Gagné, Colin M. Potts, Arnav Jhala

    Abstract: Safe reinforcement learning deals with mitigating or avoiding unsafe situations by reinforcement learning (RL) agents. Safe RL approaches are based on specific risk representations for particular problems or domains. In order to analyze agent behaviors, compare safe RL approaches, and effectively transfer techniques between application domains, it is necessary to understand the types of risk speci…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 36 pages, 8 figures, Submitted to Artificial Intelligence Reviews

  15. arXiv:2311.18801  [pdf, other]

    cs.CV cs.AI

    Distributed Global Structure-from-Motion with a Deep Front-End

    Authors: Ayush Baid, John Lambert, Travis Driver, Akshay Krishnan, Hayk Stepanyan, Frank Dellaert

    Abstract: While initial approaches to Structure-from-Motion (SfM) revolved around both global and incremental methods, most recent applications rely on incremental systems to estimate camera poses due to their superior robustness. Though there has been tremendous progress in SfM `front-ends' powered by deep models learned from data, the state-of-the-art (incremental) SfM pipelines still rely on classical SI…

    Submitted 30 November, 2023; originally announced November 2023.

  16. arXiv:2310.08383  [pdf, other]

    cs.CL cond-mat.mtrl-sci

    Reconstructing Materials Tetrahedron: Challenges in Materials Information Extraction

    Authors: Kausik Hira, Mohd Zaki, Dhruvil Sheth, Mausam, N M Anoop Krishnan

    Abstract: The discovery of new materials has a documented history of propelling human progress for centuries and more. The behaviour of a material is a function of its composition, structure, and properties, which further depend on its processing and testing conditions. Recent developments in deep learning and natural language processing have enabled information extraction at scale from published literature…

    Submitted 26 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Journal ref: Digital Discovery, 2024, Advance Article

  17. arXiv:2310.08099  [pdf, other]

    cs.CL

    ClimateNLP: Analyzing Public Sentiment Towards Climate Change Using Natural Language Processing

    Authors: Ajay Krishnan, V. S. Anoop

    Abstract: Climate change's impact on human health poses unprecedented and diverse challenges. Unless proactive measures based on solid evidence are implemented, these threats will likely escalate and continue to endanger human well-being. The escalating advancements in information and communication technologies have facilitated the widespread availability and utilization of social media platforms. Individua…

    Submitted 19 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  18. arXiv:2310.02428  [pdf, other]

    cs.LG cond-mat.mtrl-sci

    EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations

    Authors: Vaibhav Bihani, Utkarsh Pratiush, Sajid Mannan, Tao Du, Zhimin Chen, Santiago Miret, Matthieu Micoulaut, Morten M Smedskjaer, Sayan Ranu, N M Anoop Krishnan

    Abstract: Equivariant graph neural networks force fields (EGraFFs) have shown great promise in modelling complex interactions in atomic systems by exploiting the graphs' inherent symmetries. Recent works have led to a surge in the development of novel architectures that incorporate equivariance-based inductive biases alongside architectural innovations like graph transformers and message passing to model at…

    Submitted 24 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  19. arXiv:2310.02094  [pdf, other]

    cs.LG cs.AI nlin.CD physics.flu-dyn

    CoNO: Complex Neural Operator for Continuous Dynamical Systems

    Authors: Karn Tiwari, N M Anoop Krishnan, Prathosh A P

    Abstract: Neural operators extend data-driven models to map between infinite-dimensional functional spaces. These models have successfully solved continuous dynamical systems represented by differential equations, viz weather forecasting, fluid flow, or solid mechanics. However, the existing operators still rely on real space, thereby losing rich representations potentially captured in the complex space by…

    Submitted 4 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  20. arXiv:2310.01650  [pdf, other]

    cs.LG cs.AI physics.comp-ph

    CoDBench: A Critical Evaluation of Data-driven Models for Continuous Dynamical Systems

    Authors: Priyanshu Burark, Karn Tiwari, Meer Mehran Rashid, Prathosh A P, N M Anoop Krishnan

    Abstract: Continuous dynamical systems, characterized by differential equations, are ubiquitously used to model several important problems: plasma dynamics, flow through porous media, weather forecasting, and epidemic dynamics. Recently, a wide range of data-driven models has been used successfully to model these systems. However, in contrast to established fields like computer vision, limited studies are a…

    Submitted 2 October, 2023; originally announced October 2023.

  21. arXiv:2309.16825  [pdf, other]

    cs.LG

    FENDA-FL: Personalized Federated Learning on Heterogeneous Clinical Datasets

    Authors: Fatemeh Tavakoli, D. B. Emerson, Sana Ayromlou, John Jewell, Amrit Krishnan, Yuchong Zhang, Amol Verma, Fahad Razak

    Abstract: Federated learning (FL) is increasingly being recognized as a key approach to overcoming the data silos that so frequently obstruct the training and deployment of machine-learning models in clinical settings. This work contributes to a growing body of FL research specifically focused on clinical applications along three important directions. First, we expand the FLamby benchmark (du Terrail et al.…

    Submitted 6 February, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 23 pages, 5 figures, 11 tables, 1 algorithm. Update includes a significant number of new experiments, a new format, and additional results

    MSC Class: 68T07

  22. arXiv:2309.12953  [pdf]

    eess.IV cs.CV

    Inter-vendor harmonization of Computed Tomography (CT) reconstruction kernels using unpaired image translation

    Authors: Aravind R. Krishnan, Kaiwen Xu, Thomas Li, Chenyu Gao, Lucas W. Remedios, Praitayini Kanakaraj, Ho Hin Lee, Shunxing Bao, Kim L. Sandler, Fabien Maldonado, Ivana Isgum, Bennett A. Landman

    Abstract: The reconstruction kernel in computed tomography (CT) generation determines the texture of the image. Consistency in reconstruction kernels is important as the underlying CT texture can impact measurements during quantitative image analysis. Harmonization (i.e., kernel conversion) minimizes differences in measurements due to inconsistent reconstruction kernels. Existing methods investigate harmoni…

    Submitted 26 January, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 10 pages, 6 figures, 1 table, Submitted to SPIE Medical Imaging : Image Processing. San Diego, CA. February 2024

  23. arXiv:2309.01188  [pdf, other]

    cs.IR cs.AI

    Pre-trained Neural Recommenders: A Transferable Zero-Shot Framework for Recommendation Systems

    Authors: Junting Wang, Adit Krishnan, Hari Sundaram, Yunzhe Li

    Abstract: Modern neural collaborative filtering techniques are critical to the success of e-commerce, social media, and content-sharing platforms. However, despite technical advances -- for every new application domain, we need to train an NCF model from scratch. In contrast, pre-trained vision and language models are routinely applied to diverse applications directly (zero-shot) or with limited fine-tuning…

    Submitted 29 September, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

  24. arXiv:2308.16884  [pdf, other]

    cs.CL cs.AI cs.LG

    The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants

    Authors: Lucas Bandarkar, Davis Liang, Benjamin Muller, Mikel Artetxe, Satya Narayan Shukla, Donald Husa, Naman Goyal, Abhinandan Krishnan, Luke Zettlemoyer, Madian Khabsa

    Abstract: We present Belebele, a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. Significantly expanding the language coverage of natural language understanding (NLU) benchmarks, this dataset enables the evaluation of text models in high-, medium-, and low-resource languages. Each question is based on a short passage from the Flores-200 dataset and has four multip…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 27 pages, 13 figures

    ACM Class: I.2.7

  25. arXiv:2308.11520  [pdf]

    cs.CL cs.AI

    Exploring the Power of Topic Modeling Techniques in Analyzing Customer Reviews: A Comparative Analysis

    Authors: Anusuya Krishnan

    Abstract: The exponential growth of online social network platforms and applications has led to a staggering volume of user-generated textual content, including comments and reviews. Consequently, users often face difficulties in extracting valuable insights or relevant information from such content. To address this challenge, machine learning and natural language processing algorithms have been deployed to…

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: 13 pages

  26. arXiv:2308.11519  [pdf]

    cs.CL cs.AI

    Optimizing Multi-Class Text Classification: A Diverse Stacking Ensemble Framework Utilizing Transformers

    Authors: Anusuya Krishnan

    Abstract: Customer reviews play a crucial role in assessing customer satisfaction, gathering feedback, and driving improvements for businesses. Analyzing these reviews provides valuable insights into customer sentiments, including compliments, comments, and suggestions. Text classification techniques enable businesses to categorize customer reviews into distinct categories, facilitating a better understandi…

    Submitted 19 August, 2023; originally announced August 2023.

  27. arXiv:2308.09115  [pdf]

    cs.CL cond-mat.mtrl-sci

    MaScQA: A Question Answering Dataset for Investigating Materials Science Knowledge of Large Language Models

    Authors: Mohd Zaki, Jayadeva, Mausam, N. M. Anoop Krishnan

    Abstract: Information extraction and textual comprehension from materials literature are vital for developing an exhaustive knowledge base that enables accelerated materials discovery. Language models have demonstrated their capability to answer domain-specific questions and retrieve information from knowledge bases. However, there are no benchmark datasets in the materials domain that can evaluate the unde…

    Submitted 17 August, 2023; originally announced August 2023.

  28. arXiv:2308.05476  [pdf]

    cs.CL cs.LG

    Exploring Machine Learning and Transformer-based Approaches for Deceptive Text Classification: A Comparative Analysis

    Authors: Anusuya Krishnan

    Abstract: Deceptive text classification is a critical task in natural language processing that aims to identify deceptive or fraudulent content. This study presents a comparative analysis of machine learning and transformer-based approaches for deceptive text classification. We investigate the effectiveness of traditional machine learning algorithms and state-of-the-art transformer models, such as BERT, XLNE…

    Submitted 10 August, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: 12 pages, 8 figures

  29. arXiv:2307.10617  [pdf]

    cs.IR cs.AI cs.LG

    Unmasking Falsehoods in Reviews: An Exploration of NLP Techniques

    Authors: Anusuya Baby Hari Krishnan

    Abstract: In the contemporary digital landscape, online reviews have become an indispensable tool for promoting products and services across various businesses. Marketers, advertisers, and online businesses have found incentives to create deceptive positive reviews for their products and negative reviews for their competitors' offerings. As a result, the writing of deceptive reviews has become an unavoidabl…

    Submitted 24 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 6 pages, 3 figures

    ACM Class: I.2.7

  30. arXiv:2307.05299  [pdf, other]

    cs.LG cond-mat.dis-nn cond-mat.mtrl-sci physics.comp-ph

    Discovering Symbolic Laws Directly from Trajectories with Hamiltonian Graph Neural Networks

    Authors: Suresh Bishnoi, Ravinder Bhattoo, Jayadeva, Sayan Ranu, N M Anoop Krishnan

    Abstract: The time evolution of physical systems is described by differential equations, which depend on abstract quantities like energy and force. Traditionally, these quantities are derived as functionals based on observables such as positions and velocities. Discovering these governing symbolic laws is the key to comprehending the interactions in nature. Here, we present a Hamiltonian graph neural networ…

    Submitted 11 July, 2023; originally announced July 2023.

  31. arXiv:2306.11435  [pdf, other]

    cs.LG cond-mat.dis-nn

    Graph Neural Stochastic Differential Equations for Learning Brownian Dynamics

    Authors: Suresh Bishnoi, Jayadeva, Sayan Ranu, N. M. Anoop Krishnan

    Abstract: Neural networks (NNs) that exploit strong inductive biases based on physical laws and symmetries have shown remarkable success in learning the dynamics of physical systems directly from their trajectory. However, these works focus only on the systems that follow deterministic dynamics, for instance, Newtonian or Hamiltonian dynamics. Here, we propose a framework, namely Brownian graph neural netwo…

    Submitted 20 June, 2023; originally announced June 2023.

  32. arXiv:2306.06892  [pdf, other]

    cs.CL

    On the N-gram Approximation of Pre-trained Language Models

    Authors: Aravind Krishnan, Jesujoba Alabi, Dietrich Klakow

    Abstract: Large pre-trained language models (PLMs) have shown remarkable performance across various natural language understanding (NLU) tasks, particularly in low-resource settings. Nevertheless, their potential in Automatic Speech Recognition (ASR) remains largely unexplored. This study investigates the potential usage of PLMs for language modelling in ASR. We compare the application of large-scale text s…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Accepted at Interspeech 2023

  33. arXiv:2305.02474  [pdf, other]

    cs.LG

    MLHOps: Machine Learning for Healthcare Operations

    Authors: Faiza Khan Khattak, Vallijah Subasri, Amrit Krishnan, Elham Dolatabadi, Deval Pandya, Laleh Seyyed-Kalantari, Frank Rudzicz

    Abstract: Machine Learning Health Operations (MLHOps) is the combination of processes for reliable, efficient, usable, and ethical deployment and maintenance of machine learning models in healthcare settings. This paper provides both a survey of work in this area and guidelines for developers and clinicians to deploy and maintain their own models in clinical practice. We cover the foundational concepts of g…

    Submitted 3 May, 2023; originally announced May 2023.

  34. arXiv:2304.12149  [pdf, other]

    cs.CV eess.IV

    Exploring shared memory architectures for end-to-end gigapixel deep learning

    Authors: Lucas W. Remedios, Leon Y. Cai, Samuel W. Remedios, Karthik Ramadass, Aravind Krishnan, Ruining Deng, Can Cui, Shunxing Bao, Lori A. Coburn, Yuankai Huo, Bennett A. Landman

    Abstract: Deep learning has made great strides in medical imaging, enabled by hardware advances in GPUs. One major constraint for the development of new models has been the saturation of GPU memory resources during training. This is especially true in computational pathology, where images regularly contain more than 1 billion pixels. These pathological images are traditionally divided into small patches to…

    Submitted 24 April, 2023; originally announced April 2023.

  35. arXiv:2304.03760  [pdf, other]

    eess.IV cs.CV

    Zero-shot CT Field-of-view Completion with Unconditional Generative Diffusion Prior

    Authors: Kaiwen Xu, Aravind R. Krishnan, Thomas Z. Li, Yuankai Huo, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman

    Abstract: Anatomically consistent field-of-view (FOV) completion to recover truncated body sections has important applications in quantitative analyses of computed tomography (CT) with limited FOV. Existing solution based on conditional generative models relies on the fidelity of synthetic truncation patterns at training phase, which poses limitations for the generalizability of the method to potential unkn…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: Submitted to MIDL 2023, short paper track

  36. arXiv:2304.03280  [pdf, other]

    cs.CV

    LANe: Lighting-Aware Neural Fields for Compositional Scene Synthesis

    Authors: Akshay Krishnan, Amit Raj, Xianling Zhang, Alexandra Carlson, Nathan Tseng, Sandhya Sridhar, Nikita Jaipuria, James Hays

    Abstract: Neural fields have recently enjoyed great success in representing and rendering 3D scenes. However, most state-of-the-art implicit representations model static or dynamic scenes as a whole, with minor variations. Existing work on learning disentangled world and object neural fields do not consider the problem of composing objects into different world neural fields in a lighting-aware manner. We pr…

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Project website: https://lane-composition.github.io

  37. arXiv:2304.02836  [pdf, other]

    eess.IV cs.CV cs.LG

    Longitudinal Multimodal Transformer Integrating Imaging and Latent Clinical Signatures From Routine EHRs for Pulmonary Nodule Classification

    Authors: Thomas Z. Li, John M. Still, Kaiwen Xu, Ho Hin Lee, Leon Y. Cai, Aravind R. Krishnan, Riqiang Gao, Mirza S. Khan, Sanja Antic, Michael Kammer, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman, Thomas A. Lasko

    Abstract: The accuracy of predictive models for solitary pulmonary nodule (SPN) diagnosis can be greatly increased by incorporating repeat imaging and medical context, such as electronic health records (EHRs). However, clinically routine modalities such as imaging and diagnostic codes can be asynchronous and irregularly sampled over different time scales which are obstacles to longitudinal multimodal learni…

    Submitted 29 June, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to MICCAI 2023

  38. arXiv:2303.16287  [pdf, ps, other]

    cs.DS

    Lower Bounds for Pseudo-Deterministic Counting in a Stream

    Authors: Vladimir Braverman, Robert Krauthgamer, Aditya Krishnan, Shay Sapir

    Abstract: Many streaming algorithms provide only a high-probability relative approximation. These two relaxations, of allowing approximation and randomization, seem necessary -- for many streaming problems, both relaxations must be employed simultaneously, to avoid an exponentially larger (and often trivial) space complexity. A common drawback of these randomized approximate algorithms is that independent e…

    Submitted 15 May, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: 14 pages, ICALP2023

  39. arXiv:2301.12477  [pdf, other]

    cs.LG cond-mat.dis-nn

    StriderNET: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes

    Authors: Vaibhav Bihani, Sahil Manchanda, Srikanth Sastry, Sayan Ranu, N. M. Anoop Krishnan

    Abstract: Optimization of atomic structures presents a challenging problem, due to their highly rough and non-convex energy landscape, with wide applications in the fields of drug design, materials discovery, and mechanics. Here, we present a graph reinforcement learning approach, StriderNET, that learns a policy to displace the atoms towards low energy configurations. We evaluate the performance of Strider…

    Submitted 29 January, 2023; originally announced January 2023.

  40. arXiv:2301.10283  [pdf, other]

    cs.CL cs.LG

    Audience-Centric Natural Language Generation via Style Infusion

    Authors: Samraj Moorjani, Adit Krishnan, Hari Sundaram, Ewa Maslowska, Aravind Sankar

    Abstract: Adopting contextually appropriate, audience-tailored linguistic styles is critical to the success of user-centric language generation systems (e.g., chatbots, computer-aided writing, dialog systems). While existing approaches demonstrate textual style transfer with large volumes of parallel or non-parallel data, we argue that grounding style on audience-independent external factors is innately lim…

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 14 pages, 3 figures, Accepted in Findings of EMNLP 2022

  41. arXiv:2212.14099  [pdf, other]

    cs.CV

    Curator: Creating Large-Scale Curated Labelled Datasets using Self-Supervised Learning

    Authors: Tarun Narayanan, Ajay Krishnan, Anirudh Koul, Siddha Ganju

    Abstract: Applying Machine learning to domains like Earth Sciences is impeded by the lack of labeled data, despite a large corpus of raw data available in such domains. For instance, training a wildfire classifier on satellite imagery requires curating a massive and diverse dataset, which is an expensive and time-consuming process that can span from weeks to months. Searching for relevant examples in over 4…

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: AAAI Fall Symposium 2022

  42. arXiv:2212.11922  [pdf, other]

    cs.CV

    SupeRGB-D: Zero-shot Instance Segmentation in Cluttered Indoor Environments

    Authors: Evin Pınar Örnek, Aravindhan K Krishnan, Shreekant Gayaka, Cheng-Hao Kuo, Arnie Sen, Nassir Navab, Federico Tombari

    Abstract: Object instance segmentation is a key challenge for indoor robots navigating cluttered environments with many small objects. Limitations in 3D sensing capabilities often make it difficult to detect every possible object. While deep learning approaches may be effective for this problem, manually annotating 3D data for supervised learning is time-consuming. In this work, we explore zero-shot instanc…

    Submitted 25 May, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: Accepted in Robotics and Automation Letters April 2023

  43. arXiv:2211.05520  [pdf, other]

    cs.LG physics.comp-ph

    Unravelling the Performance of Physics-informed Graph Neural Networks for Dynamical Systems

    Authors: Abishek Thangamuthu, Gunjan Kumar, Suresh Bishnoi, Ravinder Bhattoo, N M Anoop Krishnan, Sayan Ranu

    Abstract: Recently, graph neural networks have been gaining a lot of attention to simulate dynamical systems due to their inductive nature leading to zero-shot generalizability. Similarly, physics-informed inductive biases in deep-learning frameworks have been shown to give superior performance in learning the dynamics of physical systems. There is a growing volume of literature that attempts to combine the…

    Submitted 2 February, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted at 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  44. arXiv:2211.03223  [pdf]

    cs.CV cond-mat.mtrl-sci eess.IV

    Cementron: Machine Learning the Constituent Phases in Cement Clinker from Optical Images

    Authors: Mohd Zaki, Siddhant Sharma, Sunil Kumar Gurjar, Raju Goyal, Jayadeva, N. M. Anoop Krishnan

    Abstract: Cement is the most used construction material. The performance of cement hydrate depends on the constituent phases, viz. alite, belite, aluminate, and ferrites present in the cement clinker, both qualitatively and quantitatively. Traditionally, clinker phases are analyzed from optical images relying on a domain expert and simple image processing techniques. However, the non-uniformity of the image…

    Submitted 6 November, 2022; originally announced November 2022.

  45. arXiv:2211.00129  [pdf, other]

    cs.CV cs.CY cs.HC cs.LG

    Is Facial Recognition Biased at Near-Infrared Spectrum As Well?

    Authors: Anoop Krishnan, Brian Neas, Ajita Rattani

    Abstract: Published academic research and media articles suggest face recognition is biased across demographics. Specifically, unequal performance is obtained for women, dark-skinned people, and older adults. However, these published studies have examined the bias of facial recognition in the visible spectrum (VIS). Factors such as facial makeup, facial hair, skin color, and illumination variation have been…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 7 pages, 2022 Virtual IEEE International Symposium on Technologies for Homeland Security

  46. arXiv:2210.17511  [pdf, other]

    cs.CY cs.CR cs.HC

    Examining the Landscape of Digital Safety and Privacy Assistance for Black Communities

    Authors: Nikita Samarin, Aparna Krishnan, Moses Namara, Joanne Ma, Elissa M. Redmiles

    Abstract: Recent events have placed a renewed focus on the issue of racial justice in the United States and other countries. One dimension of this issue that has received considerable attention is the security and privacy threats and vulnerabilities faced by the communities of color. Our study focuses on community-level advocates who organize workshops, clinics, and other initiatives that inform Black com…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Appears in the Workshop on Inclusive Privacy and Security (WIPS) 2021 co-located with Symposium on Usable Privacy and Security (SOUPS)

  47. arXiv:2210.15199  [pdf, other]

    cs.RO

    Characterising the Robustness of Reinforcement Learning for Continuous Control using Disturbance Injection

    Authors: Catherine R. Glossop, Jacopo Panerati, Amrit Krishnan, Zhaocong Yuan, Angela P. Schoellig

    Abstract: In this study, we leverage the deliberate and systematic fault-injection capabilities of an open-source benchmark suite to perform a series of experiments on state-of-the-art deep and robust reinforcement learning algorithms. We aim to benchmark robustness in the context of continuous action spaces -- crucial for deployment in robot control. We find that robustness is more prominent for action dis…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 18 pages, 15 figures

  48. arXiv:2210.10507  [pdf]

    cond-mat.mtrl-sci cs.LG

    Predicting Oxide Glass Properties with Low Complexity Neural Network and Physical and Chemical Descriptors

    Authors: Suresh Bishnoi, Skyler Badge, Jayadeva, N. M. Anoop Krishnan

    Abstract: Due to their disordered structure, glasses present a unique challenge in predicting the composition-property relationships. Recently, several attempts have been made to predict the glass properties using machine learning techniques. However, these techniques have the limitations, namely, (i) predictions are limited to the components that are present in the original dataset, and (ii) predictions to…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 15 pages, 3 figures

  49. arXiv:2209.11588  [pdf, other]

    cs.LG

    Learning Articulated Rigid Body Dynamics with Lagrangian Graph Neural Network

    Authors: Ravinder Bhattoo, Sayan Ranu, N. M. Anoop Krishnan

    Abstract: Lagrangian and Hamiltonian neural networks (LNNs and HNNs, respectively) encode strong inductive biases that allow them to outperform other models of physical systems significantly. However, these models have, thus far, mostly been limited to simple systems such as pendulums and springs or a single rigid body such as a gyroscope or a rigid rotor. Here, we present a Lagrangian graph neural network…

    Submitted 11 November, 2022; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: Accepted at NeurIPS 2022

  50. arXiv:2209.10740  [pdf, other]

    cs.LG nlin.CD

    Enhancing the Inductive Biases of Graph Neural ODE for Modeling Dynamical Systems

    Authors: Suresh Bishnoi, Ravinder Bhattoo, Sayan Ranu, N. M. Anoop Krishnan

    Abstract: Neural networks with physics based inductive biases such as Lagrangian neural networks (LNN), and Hamiltonian neural networks (HNN) learn the dynamics of physical systems by encoding strong inductive biases. Alternatively, Neural ODEs with appropriate inductive biases have also been shown to give similar performances. However, these models, when applied to particle based systems, are transductive…

    Submitted 15 June, 2024; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: 33 pages, 23 figures, Published as a conference paper at ICLR 2023