Skip to main content

Showing 1–50 of 195 results for author: Sundararajan

.
  1. arXiv:2406.18679  [pdf, other

    eess.AS cs.AI cs.CL cs.LG

    Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization

    Authors: Xiang Li, Vivek Govindan, Rohit Paturi, Sundararajan Srinivasan

    Abstract: End-to-end neural diarization (EEND) models offer significant improvements over traditional embedding-based Speaker Diarization (SD) approaches but falls short on generalizing to long-form audio with large number of speakers. EEND-vector-clustering method mitigates this by combining local EEND with global clustering of speaker embeddings from local windows, but this requires an additional speaker… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted at INTERSPEECH 2024

  2. arXiv:2406.17266  [pdf, other

    eess.AS cs.AI cs.CL cs.LG

    AG-LSEC: Audio Grounded Lexical Speaker Error Correction

    Authors: Rohit Paturi, Xiang Li, Sundararajan Srinivasan

    Abstract: Speaker Diarization (SD) systems are typically audio-based and operate independently of the ASR system in traditional speech transcription pipelines and can have speaker errors due to SD and/or ASR reconciliation, especially around speaker turns and regions of speech overlap. To reduce these errors, a Lexical Speaker Error Correction (LSEC), in which an external language model provides lexical inf… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted at INTERSPEECH 2024

  3. arXiv:2405.09023  [pdf, other

    econ.GN

    The Rise of Recommerce: Ownership and Sustainability with Overlap** Generations

    Authors: Rubing Li, Arun Sundararajan

    Abstract: The emergence of the branded recommerce channel - digitally enabled and branded marketplaces that facilitate purchasing pre-owned items directly from a manufacturer's e-commerce site - leads to new variants of classic IS and economic questions relating to secondary markets. Such branded recommerce is increasingly platform-enabled, creating opportunities for greater sustainability and stronger bran… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  4. arXiv:2405.08317  [pdf, other

    cs.CL cs.SD eess.AS

    SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

    Authors: Raghuveer Peri, Sai Muralidhar Jayanthi, Srikanth Ronanki, Anshu Bhatia, Karel Mundnich, Saket Dingliwal, Nilaksh Das, Zejiang Hou, Goeric Huybrechts, Srikanth Vishnubhotla, Daniel Garcia-Romero, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

    Abstract: Integrated Speech and Large Language Models (SLMs) that can follow speech instructions and generate relevant text responses have gained popularity lately. However, the safety and robustness of these models remains largely unclear. In this work, we investigate the potential vulnerabilities of such instruction-following speech-language models to adversarial attacks and jailbreaking. Specifically, we… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 9+6 pages, Submitted to ACL 2024

  5. arXiv:2405.08295  [pdf, other

    cs.CL cs.SD eess.AS

    SpeechVerse: A Large-scale Generalizable Audio Language Model

    Authors: Nilaksh Das, Saket Dingliwal, Srikanth Ronanki, Rohit Paturi, Zhaocheng Huang, Prashant Mathur, Jie Yuan, Dhanush Bekal, Xing Niu, Sai Muralidhar Jayanthi, Xilai Li, Karel Mundnich, Monica Sunkara, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

    Abstract: Large language models (LLMs) have shown incredible proficiency in performing tasks that require semantic understanding of natural language instructions. Recently, many works have further expanded this capability to perceive multimodal audio and text inputs, but their capabilities are often limited to specific fine-tuned tasks such as automatic speech recognition and translation. We therefore devel… ▽ More

    Submitted 31 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: Single Column, 13 page

  6. arXiv:2404.19534  [pdf, other

    cs.CV

    MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results

    Authors: Yuekun Dai, Dafeng Zhang, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Peiqing Yang, Zhezhu **, Guanqun Liu, Chen Change Loy, Lize Zhang, Shuai Liu, Chaoyu Feng, Luyang Wang, Shuan Chen, Guangqi Shao, Xiaotao Wang, Lei Lei, Qirui Yang, Qihua Cheng, Zhiqiang Xu, Yihao Liu, Huan**g Yue, **gyu Yang , et al. (38 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 27 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  7. arXiv:2404.14353  [pdf

    q-bio.BM q-bio.CB

    Electroporation-mediated Metformin for effective anticancer treatment of triple-negative breast cancer cells

    Authors: Praveen Sahu, Ignacio G. Camarillo, Pragatheiswar Giri, Raji Sundararajan

    Abstract: In this research, we investigated the efficacy of Metformin, the most commonly administered type-2 diabetes drug for triple negative breast cancer (TNBC) treatment, due to its various anticancer properties. It is a plant-based bio-compound, synthesized as a novel biguanide, called dimethyl biguanide or metformin. One of the ways it operates is by hindering electron transport chain-complex I, in mi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  8. arXiv:2404.06977  [pdf

    cs.CV

    Accurate Tennis Court Line Detection on Amateur Recorded Matches

    Authors: Sameer Agrawal, Ragoth Sundararajan, Vishak Sagar

    Abstract: Typically, tennis court line detection is done by running Hough-Line-Detection to find straight lines in the image, and then computing a transformation matrix from the detected lines to create the final court structure. We propose numerous improvements and enhancements to this algorithm, including using pretrained State-of-the-Art shadow-removal and object-detection ML models to make our line-dete… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted to 5th International conference on Image, Video Processing and Artificial Intelligence

    ACM Class: I.4.6

  9. arXiv:2404.04103  [pdf, other

    cs.CL

    Improving Factual Accuracy of Neural Table-to-Text Output by Addressing Input Problems in ToTTo

    Authors: Barkavi Sundararajan, Somayajulu Sripada, Ehud Reiter

    Abstract: Neural Table-to-Text models tend to hallucinate, producing texts that contain factual errors. We investigate whether such errors in the output can be traced back to problems with the input. We manually annotated 1,837 texts generated by multiple models in the politics domain of the ToTTo dataset. We identify the input problems that are responsible for many output errors and show that fixing these… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Added link to human evaluation guidelines and error annotations

  10. arXiv:2403.09148  [pdf, other

    cs.CL cs.IR

    Evaluating LLMs for Gender Disparities in Notable Persons

    Authors: Lauren Rhue, Sofie Goethals, Arun Sundararajan

    Abstract: This study examines the use of Large Language Models (LLMs) for retrieving factual information, addressing concerns over their propensity to produce factually incorrect "hallucinated" responses or to altogether decline to even answer prompt at all. Specifically, it investigates the presence of gender-based biases in LLMs' responses to factual inquiries. This paper takes a multi-pronged approach to… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  11. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  13. arXiv:2311.00697  [pdf, other

    cs.CL eess.AS

    End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation

    Authors: Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico

    Abstract: Conventional speech-to-text translation (ST) systems are trained on single-speaker utterances, and they may not generalize to real-life scenarios where the audio contains conversations by multiple speakers. In this paper, we tackle single-channel multi-speaker conversational ST with an end-to-end and multi-task training model, named Speaker-Turn Aware Conversational Speech Translation, that combin… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023. Code: https://github.com/amazon-science/stac-speech-translation

  14. arXiv:2310.01892  [pdf, ps, other

    cs.LG cs.AI

    FiGURe: Simple and Efficient Unsupervised Node Representations with Filter Augmentations

    Authors: Chanakya Ekbote, A**kya Pankaj Deshpande, Arun Iyer, Ramakrishna Bairi, Sundararajan Sellamanickam

    Abstract: Unsupervised node representations learnt using contrastive learning-based methods have shown good performance on downstream tasks. However, these methods rely on augmentations that mimic low-pass filters, limiting their performance on tasks requiring different eigen-spectrum parts. This paper presents a simple filter-based augmentation method to capture different parts of the eigen-spectrum. We sh… ▽ More

    Submitted 4 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  15. arXiv:2309.11516  [pdf, other

    cs.IR cs.CR cs.LG

    Private Matrix Factorization with Public Item Features

    Authors: Mihaela Curmei, Walid Krichene, Li Zhang, Mukund Sundararajan

    Abstract: We consider the problem of training private recommendation models with access to public item features. Training with Differential Privacy (DP) offers strong privacy guarantees, at the expense of loss in recommendation quality. We show that incorporating public item features during training can help mitigate this loss in quality. We propose a general approach based on collective matrix factorizatio… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Presented at ACM Recsys 2023

  16. arXiv:2309.10881  [pdf

    cs.RO q-bio.TO

    Nanorobotics in Medicine: A Systematic Review of Advances, Challenges, and Future Prospects

    Authors: Shishir Rajendran, Prathic Sundararajan, Ashi Awasthi, Suraj Rajendran

    Abstract: Nanorobotics offers an emerging frontier in biomedicine, holding the potential to revolutionize diagnostic and therapeutic applications through its unique capabilities in manipulating biological systems at the nanoscale. Following PRISMA guidelines, a comprehensive literature search was conducted using IEEE Xplore and PubMed databases, resulting in the identification and analysis of a total of 414… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  17. arXiv:2308.13773  [pdf, other

    cs.LO cs.CR

    Solving the insecurity problem for assertions

    Authors: R Ramanujam, Vaishnavi Sundararajan, S P Suresh

    Abstract: In the symbolic verification of cryptographic protocols, a central problem is deciding whether a protocol admits an execution which leaks a designated secret to the malicious intruder. Rusinowitch & Turuani (2003) show that, when considering finitely many sessions, this ``insecurity problem'' is NP-complete. Central to their proof strategy is the observation that any execution of a protocol can be… ▽ More

    Submitted 26 January, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

  18. arXiv:2308.10882  [pdf, other

    cs.AI cs.CL

    Giraffe: Adventures in Expanding Context Lengths in LLMs

    Authors: Arka Pal, Deep Karkhanis, Manley Roberts, Samuel Dooley, Arvind Sundararajan, Siddartha Naidu

    Abstract: Modern large language models (LLMs) that rely on attention mechanisms are typically trained with fixed context lengths which enforce upper limits on the length of input sequences that they can handle at evaluation time. To use these models on sequences longer than the train-time context length, one might employ techniques from the growing family of context length extrapolation methods -- most of w… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  19. arXiv:2308.02160  [pdf, other

    cs.CL cs.LG

    Speaker Diarization of Scripted Audiovisual Content

    Authors: Yogesh Virkar, Brian Thompson, Rohit Paturi, Sundararajan Srinivasan, Marcello Federico

    Abstract: The media localization industry usually requires a verbatim script of the final film or TV production in order to create subtitles or dubbing scripts in a foreign language. In particular, the verbatim script (i.e. as-broadcast script) must be structured into a sequence of dialogue lines each including time codes, speaker name and transcript. Current speech recognition technology alleviates the tra… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: 5 pages, 3 figures

  20. arXiv:2307.09954  [pdf, other

    eess.SY cs.MA cs.RO

    Priority-based DREAM Approach for Highly Manoeuvring Intruders in A Perimeter Defense Problem

    Authors: Shridhar Velhal, Suresh Sundaram, Narasimhan Sundararajan

    Abstract: In this paper, a Priority-based Dynamic REsource Allocation with decentralized Multi-task assignment (P-DREAM) approach is presented to protect a territory from highly manoeuvring intruders. In the first part, static optimization problems are formulated to compute the following parameters of the perimeter defense problem; the number of reserve stations, their locations, the priority region, the mo… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  21. arXiv:2306.17776  [pdf, other

    stat.CO

    A multivariate heavy-tailed integer-valued GARCH process with EM algorithm-based inference

    Authors: Yuhyeong Jang, Raanju R. Sundararajan, Wagner Barreto-Souza

    Abstract: A new multivariate integer-valued Generalized AutoRegressive Conditional Heteroscedastic process based on a multivariate Poisson generalized inverse Gaussian distribution is proposed. The estimation of parameters of the proposed multivariate heavy-tailed count time series model via maximum likelihood method is challenging since the likelihood function involves a Bessel function that depends on the… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 32pages, 14figures

    MSC Class: 62M10 (Primary); 62M09; 62P25 (Secondary)

  22. arXiv:2306.09313  [pdf, other

    eess.AS cs.AI cs.CL cs.LG

    Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction

    Authors: Rohit Paturi, Sundararajan Srinivasan, Xiang Li

    Abstract: Speaker diarization (SD) is typically used with an automatic speech recognition (ASR) system to ascribe speaker labels to recognized words. The conventional approach reconciles outputs from independently optimized ASR and SD systems, where the SD system typically uses only acoustic information to identify the speakers in the audio stream. This approach can lead to speaker errors especially around… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted at INTERSPEECH 2023

  23. arXiv:2306.09055  [pdf, other

    cs.RO cs.AI cs.LG

    Predictive Maneuver Planning with Deep Reinforcement Learning (PMP-DRL) for comfortable and safe autonomous driving

    Authors: Jayabrata Chowdhury, Vishruth Veerendranath, Suresh Sundaram, Narasimhan Sundararajan

    Abstract: This paper presents a Predictive Maneuver Planning with Deep Reinforcement Learning (PMP-DRL) model for maneuver planning. Traditional rule-based maneuver planning approaches often have to improve their abilities to handle the variabilities of real-world driving scenarios. By learning from its experience, a Reinforcement Learning (RL)-based driving agent can adapt to changing driving conditions an… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  24. arXiv:2306.06234  [pdf, other

    cs.CL cs.AI

    Using Foundation Models to Detect Policy Violations with Minimal Supervision

    Authors: Sid Mittal, Vineet Gupta, Frederick Liu, Mukund Sundararajan

    Abstract: Foundation models, i.e. large neural networks pre-trained on large text corpora, have revolutionized NLP. They can be instructed directly (e.g. (arXiv:2005.14165)) - this is called hard prompting - and they can be tuned using very little data (e.g. (arXiv:2104.08691)) - this technique is called soft prompting. We seek to leverage their capabilities to detect policy violations. Our contributions ar… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 16 pages

  25. arXiv:2305.04497  [pdf, other

    cs.CV cs.MM

    IIITD-20K: Dense captioning for Text-Image ReID

    Authors: A V Subramanyam, Niranjan Sundararajan, Vibhu Dubey, Brejesh Lall

    Abstract: Text-to-Image (T2I) ReID has attracted a lot of attention in the recent past. CUHK-PEDES, RSTPReid and ICFG-PEDES are the three available benchmarks to evaluate T2I ReID methods. RSTPReid and ICFG-PEDES comprise of identities from MSMT17 but due to limited number of unique persons, the diversity is limited. On the other hand, CUHK-PEDES comprises of 13,003 identities but has relatively shorter tex… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  26. arXiv:2302.07975  [pdf, other

    cs.LG cs.CR stat.ML

    Multi-Task Differential Privacy Under Distribution Skew

    Authors: Walid Krichene, Prateek Jain, Shuang Song, Mukund Sundararajan, Abhradeep Thakurta, Li Zhang

    Abstract: We study the problem of multi-task learning under user-level differential privacy, in which $n$ users contribute data to $m$ tasks, each involving a subset of users. One important aspect of the problem, that can significantly impact quality, is the distribution skew among tasks. Certain tasks may have much fewer data samples than others, making them more susceptible to the noise added for privacy.… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  27. arXiv:2301.03664  [pdf, other

    stat.ME

    Frequency Band Analysis of Nonstationary Multivariate Time Series

    Authors: Raanju R. Sundararajan, Scott A. Bruce

    Abstract: Information from frequency bands in biomedical time series provides useful summaries of the observed signal. Many existing methods consider summaries of the time series obtained over a few well-known, pre-defined frequency bands of interest. However, these methods do not provide data-driven methods for identifying frequency bands that optimally summarize frequency-domain information in the time se… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

    MSC Class: 62M10; 62M15

  28. arXiv:2301.02032  [pdf, other

    math.NA

    On the fractional transversely isotropic functionally graded nature of soft biological tissues

    Authors: Sachin Gunda, Sundararajan Natarajan, Olga Barrera

    Abstract: This paper focuses on the origin of the poroelastic anisotropic behaviour of the meniscal tissue and its spatially varying properties. We present confined compression creep test results on samples extracted from three parts of the tissue (Central body, Anterior horn and Posterior horn) in three orientations (Circumferential, Radial and Vertical). We show that a poroelastic model in which the fluid… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

  29. Peek into the Future Camera-based Occupant Sensing in Configurable Cabins for Autonomous Vehicles

    Authors: Avinash Prabu, Renran Tian, Lingxi Li, Jialiang Le, Srinivasan Sundararajan, Saeed Barbat

    Abstract: The development of fully autonomous vehicles (AVs) can potentially eliminate drivers and introduce unprecedented seating design. However, highly flexible seat configurations may lead to occupants' unconventional poses and actions. Understanding occupant behaviors and prioritize safety features become eye-catching topics in the AV research frontier. Visual sensors have the advantages of cost-effici… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: Conference: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC) Link: https://ieeexplore.ieee.org/document/9564420

  30. arXiv:2212.07084  [pdf, other

    cs.CV eess.IV

    Fully Complex-valued Fully Convolutional Multi-feature Fusion Network (FC2MFN) for Building Segmentation of InSAR images

    Authors: Aniruddh Sikdar, Sumanth Udupa, Suresh Sundaram, Narasimhan Sundararajan

    Abstract: Building segmentation in high-resolution InSAR images is a challenging task that can be useful for large-scale surveillance. Although complex-valued deep learning networks perform better than their real-valued counterparts for complex-valued SAR data, phase information is not retained throughout the network, which causes a loss of information. This paper proposes a Fully Complex-valued, Fully Conv… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Accepted for publication in IEEE Symposium Series On Computational Intelligence 2022, 8 pages, 6 figures

  31. arXiv:2211.13280  [pdf, other

    cs.CL cs.SD eess.AS

    Device Directedness with Contextual Cues for Spoken Dialog Systems

    Authors: Dhanush Bekal, Sundararajan Srinivasan, Sravan Bodapati, Srikanth Ronanki, Katrin Kirchhoff

    Abstract: In this work, we define barge-in verification as a supervised learning task where audio-only information is used to classify user spoken dialogue into true and false barge-ins. Following the success of pre-trained models, we use low-level speech representations from a self-supervised representation learning model for our downstream classification task. Further, we propose a novel technique to infu… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  32. arXiv:2208.14494  [pdf, other

    physics.bio-ph cond-mat.soft q-bio.SC

    Theoretical analysis of cargo transport by catch bonded motors in optical trap** assays

    Authors: Naren Sundararajan, Sougata Guha, Sudipto Muhuri, Mithun K. Mitra

    Abstract: Dynein motors exhibit catch bonding, where the unbinding rate of the motors from microtubule filaments decreases with increasing opposing load. The implications of this catch bond on the transport properties of dynein-driven cargo are yet to be fully understood. In this context, optical trap** assays constitute an important means of accurately measuring the forces generated by molecular motor pr… ▽ More

    Submitted 16 November, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: 12 pages, 7 figures

  33. arXiv:2205.11781  [pdf, other

    cs.LG

    Attributing AUC-ROC to Analyze Binary Classifier Performance

    Authors: Arya Tafvizi, Besim Avci, Mukund Sundararajan

    Abstract: Area Under the Receiver Operating Characteristic Curve (AUC-ROC) is a popular evaluation metric for binary classifiers. In this paper, we discuss techniques to segment the AUC-ROC along human-interpretable dimensions. AUC-ROC is not an additive/linear function over the data samples, therefore such segmenting the overall AUC-ROC is different from tabulating the AUC-ROC of data segments. To segment… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  34. arXiv:2205.10092  [pdf, other

    cs.RO

    An efficient Deep Spatio-Temporal Context Aware decision Network (DST-CAN) for Predictive Manoeuvre Planning

    Authors: Jayabrata Chowdhury, Suresh Sundaram, Nishant Rao, Narasimhan Sundararajan

    Abstract: To ensure the safety and efficiency of its maneuvers, an Autonomous Vehicle (AV) should anticipate the future intentions of surrounding vehicles using its sensor information. If an AV can predict its surrounding vehicles' future trajectories, it can make safe and efficient manoeuvre decisions. In this paper, we present such a Deep Spatio-Temporal Context-Aware decision Network (DST-CAN) model for… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: 11 pages, 9 figures

  35. arXiv:2202.13870  [pdf, other

    cs.NI cs.LG eess.SY

    Simulating Network Paths with Recurrent Buffering Units

    Authors: Divyam Anshumaan, Sriram Balasubramanian, Shubham Tiwari, Nagarajan Natarajan, Sundararajan Sellamanickam, Venkata N. Padmanabhan

    Abstract: Simulating physical network paths (e.g., Internet) is a cornerstone research problem in the emerging sub-field of AI-for-networking. We seek a model that generates end-to-end packet delay values in response to the time-varying load offered by a sender, which is typically a function of the previously output delays. The problem setting is unique, and renders the state-of-the-art text and time-series… ▽ More

    Submitted 6 December, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: Accepted in AAAI 2023, 19 pages, 14 figures

  36. arXiv:2202.11844  [pdf, other

    cs.LG cs.CL cs.CY

    First is Better Than Last for Language Data Influence

    Authors: Chih-Kuan Yeh, Ankur Taly, Mukund Sundararajan, Frederick Liu, Pradeep Ravikumar

    Abstract: The ability to identify influential training examples enables us to debug training data and explain model behavior. Existing techniques to do so are based on the flow of training data influence through the model parameters. For large models in NLP applications, it is often computationally infeasible to study this flow through all model parameters, therefore techniques usually pick the last layer o… ▽ More

    Submitted 27 October, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

  37. arXiv:2202.09480  [pdf, other

    cs.LG cs.AI econ.GN

    Reciprocity in Machine Learning

    Authors: Mukund Sundararajan, Walid Krichene

    Abstract: Machine learning is pervasive. It powers recommender systems such as Spotify, Instagram and YouTube, and health-care systems via models that predict sleep patterns, or the risk of disease. Individuals contribute data to these models and benefit from them. Are these contributions (outflows of influence) and benefits (inflows of influence) reciprocal? We propose measures of outflows, inflows and rec… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

  38. arXiv:2202.04518  [pdf, ps, other

    cs.CR cs.LO

    Insecurity problem for assertions remains in NP

    Authors: R. Ramanujam, Vaishnavi Sundararajan, S. P. Suresh

    Abstract: In the symbolic verification of cryptographic protocols, a central problem is deciding whether a protocol admits an execution which leaks a designated secret to the malicious intruder. Rusinowitch and Turuani (2003) show that, when considering finitely many sessions and a protocol model where only terms are communicated, this ``insecurity problem'' is NP-complete. Central to their proof strategy i… ▽ More

    Submitted 25 January, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

  39. arXiv:2112.05863  [pdf, other

    eess.AS cs.CL cs.LG cs.SD eess.SP

    Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech

    Authors: Rohit Paturi, Sundararajan Srinivasan, Katrin Kirchhoff, Daniel Garcia-Romero

    Abstract: Many of the recent advances in speech separation are primarily aimed at synthetic mixtures of short audio utterances with high degrees of overlap. Most of these approaches need an additional stitching step to stitch the separated speech chunks for long form audio. Since most of the approaches involve Permutation Invariant training (PIT), the order of separated speech chunks is nondeterministic and… ▽ More

    Submitted 6 September, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at Interspeech 2022

  40. arXiv:2112.04960  [pdf, other

    cs.CE

    mechanoChemML: A software library for machine learning in computational materials physics

    Authors: X. Zhang, G. H. Teichert, Z. Wang, M. Duschenes, S. Srivastava, E. Livingston, J. Holber, M. Faghih Shojaei, A. Sundararajan, K. Garikipati

    Abstract: We present mechanoChemML, a machine learning software library for computational materials physics. mechanoChemML is designed to function as an interface between platforms that are widely used for machine learning on one hand, and others for solution of partial differential equations-based models of physics. Of special interest here, and the focus of mechanoChemML, are applications to computational… ▽ More

    Submitted 30 April, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

  41. arXiv:2112.03499  [pdf, other

    cs.LG

    A Piece-wise Polynomial Filtering Approach for Graph Neural Networks

    Authors: Vijay Lingam, Chanakya Ekbote, Manan Sharma, Rahul Ragesh, Arun Iyer, Sundararajan Sellamanickam

    Abstract: Graph Neural Networks (GNNs) exploit signals from node features and the input graph topology to improve node classification task performance. However, these models tend to perform poorly on heterophilic graphs, where connected nodes have different labels. Recently proposed GNNs work across graphs having varying levels of homophily. Among these, models relying on polynomial graph filters have shown… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: 28 pages, 9 figures, Under Review

  42. arXiv:2112.00158  [pdf

    eess.AS

    Representation learning through cross-modal conditional teacher-student training for speech emotion recognition

    Authors: Sundararajan Srinivasan, Zhaocheng Huang, Katrin Kirchhoff

    Abstract: Generic pre-trained speech and text representations promise to reduce the need for large labeled datasets on specific speech and language tasks. However, it is not clear how to effectively adapt these representations for speech emotion recognition. Recent public benchmarks show the efficacy of several popular self-supervised speech representations for emotion classification. In this study, we show… ▽ More

    Submitted 27 January, 2022; v1 submitted 30 November, 2021; originally announced December 2021.

    Comments: Accepted for publication at IEEE ICASSP 2022

  43. arXiv:2109.13995  [pdf, other

    cs.LG

    IGLU: Efficient GCN Training via Lazy Updates

    Authors: S Deepak Narayanan, Aditya Sinha, Prateek Jain, Purushottam Kar, Sundararajan Sellamanickam

    Abstract: Training multi-layer Graph Convolution Networks (GCN) using standard SGD techniques scales poorly as each descent step ends up updating node embeddings for a large portion of the graph. Recent attempts to remedy this sub-sample the graph that reduces compute but introduce additional variance and may offer suboptimal performance. This paper develops the IGLU method that caches intermediate computat… ▽ More

    Submitted 3 April, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: Published as Conference Paper at ICLR 2022, 36 Pages

  44. arXiv:2107.13312  [pdf, other

    cs.LG cs.SI

    Effective Eigendecomposition based Graph Adaptation for Heterophilic Networks

    Authors: Vijay Lingam, Rahul Ragesh, Arun Iyer, Sundararajan Sellamanickam

    Abstract: Graph Neural Networks (GNNs) exhibit excellent performance when graphs have strong homophily property, i.e. connected nodes have the same labels. However, they perform poorly on heterophilic graphs. Several approaches address the issue of heterophily by proposing models that adapt the graph by optimizing task-specific loss function using labelled data. These adaptations are made either via attenti… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:2106.12807

  45. arXiv:2107.10135  [pdf, other

    cs.NI eess.SP

    Global Outliers Detection in Wireless Sensor Networks: A Novel Approach Integrating Time-Series Analysis, Entropy, and Random Forest-based Classification

    Authors: Mahmood Safaei, Maha Driss, Wadii Boulila, Elankovan A Sundararajan, Mitra Safaei

    Abstract: Wireless Sensor Networks (WSNs) have recently attracted greater attention worldwide due to their practicality in monitoring, communicating, and reporting specific physical phenomena. The data collected by WSNs is often inaccurate as a result of unavoidable environmental factors, which may include noise, signal weakness, or intrusion attacks depending on the specific situation. Sending high-noise d… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

  46. arXiv:2106.12807  [pdf, other

    cs.LG

    Simple Truncated SVD based Model for Node Classification on Heterophilic Graphs

    Authors: Vijay Lingam, Rahul Ragesh, Arun Iyer, Sundararajan Sellamanickam

    Abstract: Graph Neural Networks (GNNs) have shown excellent performance on graphs that exhibit strong homophily with respect to the node labels i.e. connected nodes have same labels. However, they perform poorly on heterophilic graphs. Recent approaches have typically modified aggregation schemes, designed adaptive graph filters, etc. to address this limitation. In spite of this, the performance on heteroph… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: Accepted at Deep Learning on Graphs: Method and Applications (DLG-KDD 2021)

  47. Robust EMRAN-aided Coupled Controller for Autonomous Vehicles

    Authors: Sauranil Debarshi, Suresh Sundaram, Narasimhan Sundararajan

    Abstract: This paper presents a coupled, neural network-aided longitudinal cruise and lateral path-tracking controller for an autonomous vehicle with model uncertainties and experiencing unknown external disturbances. Using a feedback error learning mechanism, an inverse vehicle dynamics learning scheme utilizing an adaptive Radial Basis Function (RBF) neural network, referred to as the Extended Minimal Res… ▽ More

    Submitted 8 January, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

    Report number: Engineering Applications of Artificial Intelligence, vol. 110, p. 104717

  48. arXiv:2106.05792  [pdf, other

    eess.AS

    Speaker-conversation factorial designs for diarization error analysis

    Authors: Scott Seyfarth, Sundararajan Srinivasan, Katrin Kirchhoff

    Abstract: Speaker diarization accuracy can be affected by both acoustics and conversation characteristics. Determining the cause of diarization errors is difficult because speaker voice acoustics and conversation structure co-vary, and the interactions between acoustics, conversational structure, and diarization accuracy are complex. This paper proposes a methodology that can distinguish independent margina… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 5 pages, 2 figures, Interspeech 2021

  49. arXiv:2106.01977  [pdf, other

    cs.AI

    Safe RAN control: A Symbolic Reinforcement Learning Approach

    Authors: Alexandros Nikou, Anusha Mujumdar, Vaishnavi Sundararajan, Marin Orlic, Aneta Vulgarakis Feljan

    Abstract: In this paper, we present a Symbolic Reinforcement Learning (SRL) based architecture for safety control of Radio Access Network (RAN) applications. In particular, we provide a purely automated procedure in which a user can specify high-level logical safety specifications for a given cellular network topology in order for the latter to execute optimal safe performance which is measured through cert… ▽ More

    Submitted 25 April, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: To appear in International Conference of Control and Automation (ICCA) 2022

  50. arXiv:2105.14418  [pdf, other

    math.NA

    Virtual element approximation of two-dimensional parabolic variational inequalities

    Authors: Dibyendu Adak, Gianmarco Manzini, Sundararajan Natarajan

    Abstract: We design a virtual element method for the numerical treatment of the two-dimensional parabolic variational inequality problem on unstructured polygonal meshes. Due to the expected low regularity of the exact solution, the virtual element method is based on the lowest-order virtual element space that contains the subspace of the linear polynomials defined on each element. The connection between th… ▽ More

    Submitted 29 May, 2021; originally announced May 2021.

    Comments: 33 pages, 3 figures

    MSC Class: 65M12; 65M60