Showing 1–50 of 67 results for author: Agrawal, M

  1. arXiv:2407.00856  [pdf, other]

    astro-ph.IM

    Drone-Based Antenna Beam Calibration in the High Arctic

    Authors: Lawrence Herman, Christopher Barbarie, Mohan Agrawal, Vlad Calinescu, Simon Chen, H. Cynthia Chiang, Cherie K. Day, Eamon Egan, Stephen Fay, Kit Gerodias, Maya Goss, Michael Hétu, Daniel C. Jacobs, Marc-Olivier R. Lalonde, Francis McGee, Loïc Miara, John Orlowski-Scherer, Jonathan Sievers

    Abstract: The development of low-frequency radio astronomy experiments for detecting 21-cm line emission from hydrogen presents new opportunities for creative solutions to the challenge of characterizing an antenna beam pattern. The Array of Long Baseline Antennas for Taking Radio Observations from the Seventy-ninth parallel (ALBATROS) is a new radio interferometer sited in the Canadian high Arctic that aim…

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2405.18263  [pdf, other]

    physics.chem-ph

    The influence of solvent on surface adsorption and desorption

    Authors: Ardavan Farahvash, Mayank Agrawal, Adam P. Willard, Andrew A. Peterson

    Abstract: The adsorption and desorption of reactants and products from a solid surface is essential for achieving sustained surface chemical reactions. At a liquid-solid interface, these processes can involve the collective reorganization of interfacial solvent molecules in order to accommodate the adsorbing or desorbing species. Identifying the role of solvent in adsorption and desorption is important for…

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2403.10327  [pdf]

    cs.CR cs.AI

    Unsupervised Threat Hunting using Continuous Bag-of-Terms-and-Time (CBoTT)

    Authors: Varol Kayhan, Shivendu Shivendu, Rouzbeh Behnia, Clinton Daniel, Manish Agrawal

    Abstract: Threat hunting is sifting through system logs to detect malicious activities that might have bypassed existing security measures. It can be performed in several ways, one of which is based on detecting anomalies. We propose an unsupervised framework, called continuous bag-of-terms-and-time (CBoTT), and publish its application programming interface (API) to help researchers and cybersecurity analys…

    Submitted 15 March, 2024; originally announced March 2024.

  4. arXiv:2403.01628  [pdf, ps, other]

    cs.LG

    Recent Advances, Applications, and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables at ML4H 2023 Symposium

    Authors: Hyewon Jeong, Sarah Jabbour, Yuzhe Yang, Rahul Thapta, Hussein Mozannar, William Jongwon Han, Nikita Mehandru, Michael Wornow, Vladislav Lialin, Xin Liu, Alejandro Lozano, Jiacheng Zhu, Rafal Dariusz Kocielnik, Keith Harrigian, Haoran Zhang, Edward Lee, Milos Vukadinovic, Aparna Balagopalan, Vincent Jeanselme, Katherine Matton, Ilker Demirel, Jason Fries, Parisa Rashidi, Brett Beaulieu-Jones, Xuhai Orson Xu, et al. (18 additional authors not shown)

    Abstract: The third ML4H symposium was held in person on December 10, 2023, in New Orleans, Louisiana, USA. The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the ML4H community. Encouraged by the successful virtual roundtables in the previous year, we organized eleven in-person roundtables and four vir…

    Submitted 5 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: ML4H 2023, Research Roundtables

  5. arXiv:2402.16903  [pdf, other]

    cs.LG math.NA

    A novel data generation scheme for surrogate modelling with deep operator networks

    Authors: Shivam Choubey, Birupaksha Pal, Manish Agrawal

    Abstract: Operator-based neural network architectures such as DeepONets have emerged as a promising tool for the surrogate modeling of physical systems. In general, towards operator surrogate modeling, the training data is generated by solving the PDEs using techniques such as Finite Element Method (FEM). The computationally intensive nature of data generation is one of the biggest bottlenecks in deploying t…

    Submitted 24 February, 2024; originally announced February 2024.

  6. arXiv:2402.15422  [pdf, other]

    cs.CL cs.AI cs.LG

    A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models

    Authors: Stefan Hegselmann, Shannon Zejiang Shen, Florian Gierse, Monica Agrawal, David Sontag, Xiaoyi Jiang

    Abstract: Patients often face difficulties in understanding their hospitalizations, while healthcare workers have limited resources to provide explanations. In this work, we investigate the potential of large language models to generate patient summaries based on doctors' notes and study the effect of training data on the faithfulness and quality of the generated summaries. To this end, we release (i) a rig…

    Submitted 25 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  7. arXiv:2401.09637  [pdf, other]

    cs.HC cs.AI cs.CL

    Impact of Large Language Model Assistance on Patients Reading Clinical Notes: A Mixed-Methods Study

    Authors: Niklas Mannhardt, Elizabeth Bondi-Kelly, Barbara Lam, Chloe O'Connell, Mercy Asiedu, Hussein Mozannar, Monica Agrawal, Alejandro Buendia, Tatiana Urman, Irbaz B. Riaz, Catherine E. Ricciardi, Marzyeh Ghassemi, David Sontag

    Abstract: Patients derive numerous benefits from reading their clinical notes, including an increased sense of control over their health and improved understanding of their care plan. However, complex medical concepts and jargon within clinical notes hinder patient comprehension and may lead to anxiety. We developed a patient-facing tool to make clinical notes more readable, leveraging large language models…

    Submitted 17 January, 2024; originally announced January 2024.

  8. arXiv:2312.14804  [pdf, other]

    cs.CY

    Use large language models to promote equity

    Authors: Emma Pierson, Divya Shanmugam, Rajiv Movva, Jon Kleinberg, Monica Agrawal, Mark Dredze, Kadija Ferryman, Judy Wawira Gichoya, Dan Jurafsky, Pang Wei Koh, Karen Levy, Sendhil Mullainathan, Ziad Obermeyer, Harini Suresh, Keyon Vafa

    Abstract: Advances in large language models (LLMs) have driven an explosion of interest about their societal impacts. Much of the discourse around how they will impact social equity has been cautionary or negative, focusing on questions like "how might LLMs be biased and how would we mitigate those biases?" This is a vital discussion: the ways in which AI generally, and LLMs specifically, can entrench biase…

    Submitted 22 December, 2023; originally announced December 2023.

  9. arXiv:2308.08494  [pdf, other]

    cs.IR cs.CL cs.LG

    Conceptualizing Machine Learning for Dynamic Information Retrieval of Electronic Health Record Notes

    Authors: Sharon Jiang, Shannon Shen, Monica Agrawal, Barbara Lam, Nicholas Kurtzman, Steven Horng, David Karger, David Sontag

    Abstract: The large amount of time clinicians spend sifting through patient notes and documenting in electronic health records (EHRs) is a leading cause of clinician burnout. By proactively and dynamically retrieving relevant notes during the documentation process, we can reduce the effort required to find relevant patient history. In this work, we conceptualize the use of EHR audit logs for machine learnin…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: To be published in Proceedings of Machine Learning Research Volume 219; accepted to the Machine Learning for Healthcare 2023 conference

  10. arXiv:2307.01292  [pdf, other]

    cs.CR cs.AI cs.LG

    Pareto-Secure Machine Learning (PSML): Fingerprinting and Securing Inference Serving Systems

    Authors: Debopam Sanyal, Jui-Tse Hung, Manav Agrawal, Prahlad Jasti, Shahab Nikkhoo, Somesh Jha, Tianhao Wang, Sibin Mohan, Alexey Tumanov

    Abstract: Model-serving systems have become increasingly popular, especially in real-time web applications. In such systems, users send queries to the server and specify the desired performance metrics (e.g., desired accuracy, latency). The server maintains a set of models (model zoo) in the back-end and serves the queries based on the specified metrics. This paper examines the security, specifically robust…

    Submitted 6 August, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: 17 pages, 9 figures, 6 tables

  11. arXiv:2303.13261  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci physics.comp-ph

    Single device offset-free magnetic field sensing principle with tunable sensitivity and linear range based on spin-orbit-torques

    Authors: Sabri Koraltan, Christin Schmitt, Florian Bruckner, Claas Abert, Klemens Prügl, Michael Kirsch, Rahul Gupta, Sebastian Zeilinger, Joshua M. Salazar-Mejía, Milan Agrawal, Johannes Güttinger, Armin Satz, Gerhard Jakob, Mathias Kläui, Dieter Suess

    Abstract: We propose a novel device concept using spin-orbit-torques to realize a magnetic field sensor, where we eliminate the sensor offset using a differential measurement concept. We derive a simple analytical formulation for the sensor signal and demonstrate its validity with numerical investigations using macrospin simulations. The sensitivity and the measurable linear sensing range in the proposed co…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 12 Pages, 7 Figures

  12. arXiv:2211.15564   

    cs.LG

    Machine Learning for Health symposium 2022 -- Extended Abstract track

    Authors: Antonio Parziale, Monica Agrawal, Shalmali Joshi, Irene Y. Chen, Shengpu Tang, Luis Oala, Adarsh Subbaswamy

    Abstract: A collection of the extended abstracts that were presented at the 2nd Machine Learning for Health symposium (ML4H 2022), which was held both virtually and in person on November 28, 2022, in New Orleans, Louisiana, USA. Machine Learning for Health (ML4H) is a longstanding venue for research into machine learning for health, including both theoretical works and applied works. ML4H 2022 featured two…

    Submitted 28 November, 2022; originally announced November 2022.

    MSC Class: 68Txx ACM Class: I.2; J.3; I.6; I.4

  13. arXiv:2211.09120  [pdf, other]

    cs.CV cs.AI

    AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders

    Authors: Wele Gedara Chaminda Bandara, Naman Patel, Ali Gholami, Mehdi Nikkhah, Motilal Agrawal, Vishal M. Patel

    Abstract: Masked Autoencoders (MAEs) learn generalizable representations for image, text, audio, video, etc., by reconstructing masked input data from tokens of the visible data. Current MAE approaches for videos rely on random patch, tube, or frame-based masking strategies to select these tokens. This paper proposes AdaMAE, an adaptive masking strategy for MAEs that is end-to-end trainable. Our adaptive ma…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Code available at: https://github.com/wgcban/adamae

  14. arXiv:2211.02393  [pdf, other]

    nucl-ex

    Determination of photo-nuclear cross section of $^{61}$Ni($γ$,xp) reaction via surrogate ratio technique

    Authors: Shaima Akbar, M. M Musthafa, Midhun C. V, Antony Joseph, S. V. Suryanarayana, A. Pal, S. Santra, P. C. Rout, Jyoti Pandey, Bhawna Pandey, H. M. Agrawal, K. C Jagadeesan, S. Ganesan

    Abstract: The photo nuclear reaction cross section of $^{61}$Ni($γ$,xp) reaction has been measured by employing surrogate reaction technique. This indirect method is used for the first time to obtain the cross section of photo nuclear reaction. The compound nucleus $^{61}$Ni$^{*}$ was populated using the transfer reaction $^{59}$Co($^{6}$Li,$α$) at E$_{lab}=$ 40.5 MeV. To calculate the surrogate ratio,…

    Submitted 14 November, 2022; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: 7 pages, 7 figures

  15. arXiv:2210.10723  [pdf, other]

    cs.CL cs.AI

    TabLLM: Few-shot Classification of Tabular Data with Large Language Models

    Authors: Stefan Hegselmann, Alejandro Buendia, Hunter Lang, Monica Agrawal, Xiaoyi Jiang, David Sontag

    Abstract: We study the application of large language models to zero-shot and few-shot classification of tabular data. We prompt the large language model with a serialization of the tabular data to a natural-language string, together with a short description of the classification problem. In the few-shot setting, we fine-tune the large language model using some labeled examples. We evaluate several serializa…

    Submitted 17 March, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  16. Discovery of double BSS sequences in the old Galactic open cluster Berkeley 17

    Authors: Khushboo K Rao, Souradeep Bhattacharya, Kaushar Vaidya, Manan Agrawal

    Abstract: Blue straggler stars (BSS) are peculiar objects which normally appear as a single broad sequence along the extension of the main sequence. Only four globular clusters (GCs) have been observed to have two distinct and parallel BSS sequences. For the first time for any open cluster (OC), we report double BSS sequences in Berkeley 17. Using the machine-learning based membership algorithm ML-MOC on Ga…

    Submitted 8 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at MNRAS Letters

  17. arXiv:2210.01594  [pdf, other]

    cs.CR cs.CV

    GANTouch: An Attack-Resilient Framework for Touch-based Continuous Authentication System

    Authors: Mohit Agrawal, Pragyan Mehrotra, Rajesh Kumar, Rajiv Ratn Shah

    Abstract: Previous studies have shown that commonly studied (vanilla) implementations of touch-based continuous authentication systems (V-TCAS) are susceptible to active adversarial attempts. This study presents a novel Generative Adversarial Network assisted TCAS (G-TCAS) framework and compares it to the V-TCAS under three active adversarial environments viz. Zero-effort, Population, and Random-vector. The…

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: 11 pages, 7 figures, 2 tables, 3 algorithms, in IEEE TBIOM 2022

    ACM Class: K.6.5

  18. arXiv:2209.05921  [pdf, other]

    cs.CV cs.AI cs.LG

    Document Image Binarization in JPEG Compressed Domain using Dual Discriminator Generative Adversarial Networks

    Authors: Bulla Rajesh, Manav Kamlesh Agrawal, Milan Bhuva, Kisalaya Kishore, Mohammed Javed

    Abstract: Image binarization techniques are being popularly used in enhancement of noisy and/or degraded images catering different Document Image Analysis (DIA) applications like word spotting, document retrieval, and OCR. Most of the existing techniques focus on feeding pixel images into the Convolution Neural Networks to accomplish document binarization, which may not produce effective results when workin…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted in IAPR endorsed first International Conference on Computer Vision and Machine Intelligence (CVMI2022), held at IIIT Allahabad

  19. arXiv:2205.12689  [pdf, other]

    cs.CL cs.AI

    Large Language Models are Few-Shot Clinical Information Extractors

    Authors: Monica Agrawal, Stefan Hegselmann, Hunter Lang, Yoon Kim, David Sontag

    Abstract: A long-running goal of the clinical NLP community is the extraction of important variables trapped in clinical notes. However, roadblocks have included dataset shift from the general domain and a lack of public clinical corpora and annotations. In this work, we show that large language models, such as InstructGPT, perform well at zero- and few-shot information extraction from clinical text despite…

    Submitted 30 November, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted as a long paper to The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP)

  20. arXiv:2202.00828  [pdf, other]

    cs.CL cs.AI cs.LG

    Co-training Improves Prompt-based Learning for Large Language Models

    Authors: Hunter Lang, Monica Agrawal, Yoon Kim, David Sontag

    Abstract: We demonstrate that co-training (Blum & Mitchell, 1998) can improve the performance of prompt-based learning by using unlabeled data. While prompting has emerged as a promising paradigm for few-shot and zero-shot learning, it is often brittle and requires much larger models compared to the standard supervised setup. We find that co-training makes it possible to improve the original prompt model an…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: 17 pages, 8 figures

  21. arXiv:2111.02599  [pdf, other]

    cs.LG

    Leveraging Time Irreversibility with Order-Contrastive Pre-training

    Authors: Monica Agrawal, Hunter Lang, Michael Offin, Lior Gazit, David Sontag

    Abstract: Label-scarce, high-dimensional domains such as healthcare present a challenge for modern machine learning techniques. To overcome the difficulties posed by a lack of labeled data, we explore an "order-contrastive" method for self-supervised pre-training on longitudinal data. We sample pairs of time segments, switch the order for half of them, and train a model to predict whether a given pair is in…

    Submitted 29 March, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

  22. MedKnowts: Unified Documentation and Information Retrieval for Electronic Health Records

    Authors: Luke Murray, Divya Gopinath, Monica Agrawal, Steven Horng, David Sontag, David R. Karger

    Abstract: Clinical documentation can be transformed by Electronic Health Records, yet the documentation process is still a tedious, time-consuming, and error-prone process. Clinicians are faced with multi-faceted requirements and fragmented interfaces for information exploration and documentation. These challenges are only exacerbated in the Emergency Department -- clinicians often see 35 patients in one sh…

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: 15 Pages, 8 figures, UIST 21, October 10-13

  23. arXiv:2107.07005  [pdf, other]

    cs.LG cs.AI

    WeightScale: Interpreting Weight Change in Neural Networks

    Authors: Ayush Manish Agrawal, Atharva Tendle, Harshvardhan Sikka, Sahib Singh

    Abstract: Interpreting the learning dynamics of neural networks can provide useful insights into how networks learn and the development of better training and design approaches. We present an approach to interpret learning in neural networks by measuring relative weight change on a per layer basis and dynamically aggregating emerging trends through combination of dimensionality reduction and clustering whic…

    Submitted 26 March, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: Intelligent Computing, 2021. arXiv admin note: text overlap with arXiv:2011.06735

  24. arXiv:2107.05469  [pdf, other]

    eess.SP

    A Multi-Lead Fusion Method for the Accurate Delineation of QRS Complex Location in 12 Lead ECG Signal

    Authors: Chhaviraj Chauhan, Monika Agrawal, Pooja Sabherwal

    Abstract: This paper presents a multi-lead fusion method for the accurate and automated detection of the QRS complex location in 12 lead ECG (Electrocardiogram) signals. The proposed multi-lead fusion method accurately delineates the QRS complex by the fusion of detected QRS complexes of the individual 12 leads. The proposed algorithm consists of two major stages. Firstly, the QRS complex location of each l…

    Submitted 27 July, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

  25. arXiv:2106.07867  [pdf, other]

    cs.CR cs.CV cs.HC

    Defending Touch-based Continuous Authentication Systems from Active Adversaries Using Generative Adversarial Networks

    Authors: Mohit Agrawal, Pragyan Mehrotra, Rajesh Kumar, Rajiv Ratn Shah

    Abstract: Previous studies have demonstrated that commonly studied (vanilla) touch-based continuous authentication systems (V-TCAS) are susceptible to population attack. This paper proposes a novel Generative Adversarial Network assisted TCAS (G-TCAS) framework, which showed more resilience to the population attack. G-TCAS framework was tested on a dataset of 117 users who interacted with a smartphone and t…

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: 2021 IEEE International Joint Conference on Biometrics (IJCB), 8 pages

    ACM Class: K.6.5

  26. arXiv:2105.01384  [pdf, other]

    cond-mat.mtrl-sci

    Deep neural networks based predictive-generative framework for designing composite materials

    Authors: Ashank, Soumen Chakravarty, Pranshu Garg, Ankit Kumar, Manish Agrawal, Prabhat K. Agnihotri

    Abstract: Designing composite materials as per the application requirements is fundamentally a challenging and time consuming task. Here we report the development of a deep neural network based computational framework capable of solving the forward (predictive) as well as inverse (generative) design problem. The predictor model is based on the popular convolution neural network architecture and trained with…

    Submitted 4 May, 2021; originally announced May 2021.

  27. arXiv:2104.14259  [pdf, other]

    astro-ph.IM astro-ph.EP

    Chandrayaan-2 Dual-Frequency SAR (DFSAR): Performance Characterization and Initial Results

    Authors: Sriram S. Bhiravarasu, Tathagata Chakraborty, Deepak Putrevu, Dharmendra K. Pandey, Anup K. Das, V. M. Ramanujam, Raghav Mehra, Parikshit Parasher, Krishna M. Agrawal, Shubham Gupta, Gaurav S. Seth, Amit Shukla, Nikhil Y. Pandya, Sanjay Trivedi, Arundhati Misra, Rajeev Jyoti, Raj Kumar

    Abstract: The Dual-Frequency synthetic aperture radar (DFSAR) system manifested on the Chandrayaan-2 spacecraft represents a significant step forward in radar exploration of solid solar system objects. It combines SAR at two wavelengths (L- and S-bands) and multiple resolutions with several polarimetric modes in one lightweight ($\sim$ 20 kg) package. The resulting data from DFSAR support calculation of the…

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: 30 pages, 16 figures; accepted by The Planetary Science Journal

  28. arXiv:2103.04725  [pdf, other]

    cs.HC

    Assessing the Impact of Automated Suggestions on Decision Making: Domain Experts Mediate Model Errors but Take Less Initiative

    Authors: Ariel Levy, Monica Agrawal, Arvind Satyanarayan, David Sontag

    Abstract: Automated decision support can accelerate tedious tasks as users can focus their attention where it is needed most. However, a key concern is whether users overly trust or cede agency to automation. In this paper, we investigate the effects of introducing automation to annotating clinical texts--a multi-step, error-prone task of identifying clinical concepts (e.g., procedures) in medical notes, an…

    Submitted 29 March, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Fixed minor formatting

  29. arXiv:2101.09158  [pdf, other]

    q-bio.PE

    SUTRA: A Novel Approach to Modelling Pandemics with Applications to COVID-19

    Authors: Manindra Agrawal, Madhuri Kanitkar, Deepu Phillip, Tanima Hajra, Arti Singh, Avaneesh Singh, Prabal Pratap Singh, Mathukumalli Vidyasagar

    Abstract: The Covid-19 pandemic has two key properties: (i) asymptomatic cases (both detected and undetected) that can result in new infections, and (ii) time-varying characteristics due to new variants, Non-Pharmaceutical Interventions etc. We develop a model called SUTRA (Susceptible, Undetected though infected, Tested positive, and Removed Analysis) that takes into account both of these two key propertie…

    Submitted 25 October, 2022; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: 38 pages, 20 figures, 5 tables

  30. arXiv:2012.09345  [pdf, other]

    cs.RO physics.app-ph

    Muscle-inspired flexible mechanical logic architecture for colloidal robotics

    Authors: Mayank Agrawal, Sharon C. Glotzer

    Abstract: Materials that respond to external stimuli by expanding or contracting provide a transduction route that integrates sensing and actuation powered directly by the stimuli. This motivates us to build colloidal scale robots using these materials that can morph into arbitrary configurations. For intelligent use of global stimuli in robotic systems, computation ability needs to be incorporated within t…

    Submitted 16 December, 2020; originally announced December 2020.

  31. arXiv:2011.11970  [pdf, other]

    cs.SD cs.IR cs.MM eess.AS

    A Novel Multimodal Music Genre Classifier using Hierarchical Attention and Convolutional Neural Network

    Authors: Manish Agrawal, Abhilash Nandy

    Abstract: Music genre classification is one of the trending topics in regards to the current Music Information Retrieval (MIR) Research. Since the dependency of genre is not only limited to the audio profile, we also make use of textual content provided as lyrics of the corresponding song. We implemented a CNN based feature extractor for spectrograms in order to incorporate the acoustic features and a Hier…

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: 7 pages, 4 figures

  32. arXiv:2011.06735  [pdf, other]

    cs.LG

    Investigating Learning in Deep Neural Networks using Layer-Wise Weight Change

    Authors: Ayush Manish Agrawal, Atharva Tendle, Harshvardhan Sikka, Sahib Singh, Amr Kayid

    Abstract: Understanding the per-layer learning dynamics of deep neural networks is of significant interest as it may provide insights into how neural networks learn and the potential for better training regimens. We investigate learning in Deep Convolutional Neural Networks (CNNs) by measuring the relative weight change of layers while training. Several interesting trends emerge in a variety of CNN architec…

    Submitted 30 November, 2020; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 14 pages, 20 figures

  33. arXiv:2007.16127  [pdf, other]

    cs.CL cs.LG

    Robust Benchmarking for Machine Learning of Clinical Entity Extraction

    Authors: Monica Agrawal, Chloe O'Connell, Yasmin Fatemi, Ariel Levy, David Sontag

    Abstract: Clinical studies often require understanding elements of a patient's narrative that exist only in free text clinical notes. To transform notes into structured data for downstream use, these elements are commonly extracted and normalized to medical vocabularies. In this work, we audit the performance of and indicate areas of improvement for state-of-the-art systems. We find that high task accuracie…

    Submitted 31 July, 2020; originally announced July 2020.

  34. arXiv:2007.15153  [pdf, other]

    cs.LG cs.CL cs.IR stat.ML

    Fast, Structured Clinical Documentation via Contextual Autocomplete

    Authors: Divya Gopinath, Monica Agrawal, Luke Murray, Steven Horng, David Karger, David Sontag

    Abstract: We present a system that uses a learned autocompletion mechanism to facilitate rapid creation of semi-structured clinical documentation. We dynamically suggest relevant clinical concepts as a doctor drafts a note by leveraging features from both unstructured and structured medical data. By constraining our architecture to shallow neural networks, we are able to make these suggestions in real time.…

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: Published in Machine Learning for Healthcare 2020 conference

  35. arXiv:2007.11838  [pdf, other]

    cs.LG cs.AI stat.CO stat.ML

    PClean: Bayesian Data Cleaning at Scale with Domain-Specific Probabilistic Programming

    Authors: Alexander K. Lew, Monica Agrawal, David Sontag, Vikash K. Mansinghka

    Abstract: Data cleaning is naturally framed as probabilistic inference in a generative model of ground-truth data and likely errors, but the diversity of real-world error patterns and the hardness of inference make Bayesian approaches difficult to automate. We present PClean, a probabilistic programming language (PPL) for leveraging dataset-specific knowledge to automate Bayesian cleaning. Compared to gener…

    Submitted 18 November, 2022; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: Published version

    Journal ref: AISTATS 2021

  36. arXiv:2006.09501  [pdf, other]

    cs.CV cs.CY cs.HC

    On the Inference of Soft Biometrics from Typing Patterns Collected in a Multi-device Environment

    Authors: Vishaal Udandarao, Mohit Agrawal, Rajesh Kumar, Rajiv Ratn Shah

    Abstract: In this paper, we study the inference of gender, major/minor (computer science, non-computer science), typing style, age, and height from the typing patterns collected from 117 individuals in a multi-device environment. The inference of the first three identifiers was considered as classification tasks, while the rest as regression tasks. For classification tasks, we benchmark the performance of s…

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: The first two authors contributed equally. The code is available upon request. Please contact the last author

    ACM Class: I.3.6

    Journal ref: The Sixth IEEE International Conference on Multimedia Big Data, August 2020

  37. arXiv:2001.09765  [pdf]

    cs.CY cs.LG

    Model-assisted cohort selection with bias analysis for generating large-scale cohorts from the EHR for oncology research

    Authors: Benjamin Birnbaum, Nathan Nussbaum, Katharina Seidl-Rathkopf, Monica Agrawal, Melissa Estevez, Evan Estola, Joshua Haimson, Lucy He, Peter Larson, Paul Richardson

    Abstract: Objective Electronic health records (EHRs) are a promising source of data for health outcomes research in oncology. A challenge in using EHR data is that selecting cohorts of patients often requires information in unstructured parts of the record. Machine learning has been used to address this, but even high-performing algorithms may select patients in a non-random manner and bias the resulting co…

    Submitted 13 January, 2020; originally announced January 2020.

    Comments: Word count: Abstract, 254; text, 3934 Keywords: electronic health record; machine learning; cancer; real-world evidence

  38. arXiv:1910.07581  [pdf, other]

    cs.CY cs.AI cs.LG

    Scaling up Psychology via Scientific Regret Minimization: A Case Study in Moral Decisions

    Authors: Mayank Agrawal, Joshua C. Peterson, Thomas L. Griffiths

    Abstract: Do large datasets provide value to psychologists? Without a systematic methodology for working with such datasets, there is a valid concern that analyses will produce noise artifacts rather than true effects. In this paper, we offer a way to enable researchers to systematically build models and identify novel phenomena in large datasets. One traditional approach is to analyze the residuals of mode…

    Submitted 8 January, 2020; v1 submitted 16 October, 2019; originally announced October 2019.

  39. arXiv:1910.01116  [pdf, other]

    stat.AP cs.LG stat.ML

    Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health Knowledge Graph

    Authors: Irene Y. Chen, Monica Agrawal, Steven Horng, David Sontag

    Abstract: Increasingly large electronic health records (EHRs) provide an opportunity to algorithmically learn medical knowledge. In one prominent example, a causal health knowledge graph could learn relationships between diseases and symptoms and then serve as a diagnostic tool to be refined with additional clinical input. Prior research has demonstrated the ability to construct such a graph from over 270,0…

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: 12 pages, presented at PSB 2020

  40. arXiv:1902.06744  [pdf, other]

    cs.CY cs.AI cs.LG

    Using Machine Learning to Guide Cognitive Modeling: A Case Study in Moral Reasoning

    Authors: Mayank Agrawal, Joshua C. Peterson, Thomas L. Griffiths

    Abstract: Large-scale behavioral datasets enable researchers to use complex machine learning algorithms to better predict human behavior, yet this increased predictive power does not always lead to a better understanding of the behavior in question. In this paper, we outline a data-driven, iterative procedure that allows cognitive scientists to use machine learning to generate models that are both interpret…

    Submitted 10 May, 2019; v1 submitted 18 February, 2019; originally announced February 2019.

    Comments: Camera ready version for Cognitive Science Conference

  41. arXiv:1812.08480  [pdf, ps, other]

    cs.DS cs.NE

    The Query Complexity of a Permutation-Based Variant of Mastermind

    Authors: Peyman Afshani, Manindra Agrawal, Benjamin Doerr, Carola Doerr, Kasper Green Larsen, Kurt Mehlhorn

    Abstract: We study the query complexity of a permutation-based variant of the guessing game Mastermind. In this variant, the secret is a pair $(z,π)$ which consists of a binary string $z \in \{0,1\}^n$ and a permutation $π$ of $[n]$. The secret must be unveiled by asking queries of the form $x \in \{0,1\}^n$. For each such query, we are returned the score \[ f_{z,π}(x):= \max \{ i \in [0..n]\mid \forall j \…

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: Full version of a result previously announced in 2013. Accepted subject to minor revision at Discrete Applied Mathematics

  42. arXiv:1811.12793  [pdf, other]

    cs.CL

    TIFTI: A Framework for Extracting Drug Intervals from Longitudinal Clinic Notes

    Authors: Monica Agrawal, Griffin Adams, Nathan Nussbaum, Benjamin Birnbaum

    Abstract: Oral drugs are becoming increasingly common in oncology care. In contrast to intravenous chemotherapy, which is administered in the clinic and carefully tracked via structured electronic health records (EHRs), oral drug treatment is self-administered and therefore not tracked as well. Often, the details of oral cancer treatment occur only in unstructured clinic notes. Extracting this information is…

    Submitted 3 December, 2018; v1 submitted 30 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/69

  43. arXiv:1804.06666  [pdf, ps, other]

    eess.SP

    Application of Vector Sensor for Underwater Acoustic Communications

    Authors: Farheen Fauziya, Brejesh Lall, Monika Agrawal

    Abstract: The use of vector sensors as receivers for Underwater Acoustic Communications systems is gaining popularity. It has become important to obtain performance measures for such communication systems to quantify their efficacy. The fundamental advantage of using a vector sensor as a receiver is that a single sensor is able to provide diversity gains offered by MIMO systems. In a recent work novel frame…

    Submitted 18 April, 2018; originally announced April 2018.

    Comments: 29 pages, 15 figures and 2 tables

  44. arXiv:1802.00543  [pdf, other]

    cs.LG q-bio.MN stat.ML

    Modeling polypharmacy side effects with graph convolutional networks

    Authors: Marinka Zitnik, Monica Agrawal, Jure Leskovec

    Abstract: The use of drug combinations, termed polypharmacy, is common to treat patients with complex diseases and co-existing conditions. However, a major consequence of polypharmacy is a much higher risk of adverse side effects for the patient. Polypharmacy side effects emerge because of drug-drug interactions, in which activity of one drug may change if taken with another drug. The knowledge of drug inte…

    Submitted 26 April, 2018; v1 submitted 1 February, 2018; originally announced February 2018.

    Comments: Presented at ISMB 2018

    Journal ref: Bioinformatics, 34:13, 457-466, 2018

  45. arXiv:1712.00843  [pdf, other]

    q-bio.MN cs.LG cs.SI

    Large-scale analysis of disease pathways in the human interactome

    Authors: Monica Agrawal, Marinka Zitnik, Jure Leskovec

    Abstract: Discovering disease pathways, which can be defined as sets of proteins associated with a given disease, is an important problem that has the potential to provide clinically actionable insights for disease diagnosis, prognosis, and treatment. Computational methods aid the discovery by relying on protein-protein interaction (PPI) networks. They start with a few known disease-associated proteins and…

    Submitted 3 December, 2017; originally announced December 2017.

    Journal ref: Pacific Symposium on Biocomputing 23:111-122(2018)

  46. arXiv:1711.05923   

    eess.SP stat.AP

    Enhanced Array Aperture using Higher Order Statistics for DoA Estimation

    Authors: Payal Gupta, Monika Agrawal

    Abstract: Recently, the higher order statistics (HOS) and sparsity based array are most talked about techniques to estimate the Direction of Arrival (DoA). They not only provide enhanced Degree of Freedom (DoF) to handle underdetermined cases but also improve the estimation accuracy of the system. To achieve high accuracy and more number of DoF with limited number of sensors, here we have proposed a method…

    Submitted 19 April, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: I want to withdraw the paper because I have noticed many drawbacks in it. The review I received said "it is not correct technically"

  47. arXiv:1703.09968  [pdf, other]

    cs.MM cs.CR

    An Evaluation of Digital Image Forgery Detection Approaches

    Authors: Abhishek Kashyap, Rajesh Singh Parmar, Megha Agrawal, Hariom Gupta

    Abstract: With the headway of the advanced image handling software and altering tools, a computerized picture can be effectively controlled. The identification of image manipulation is vital in light of the fact that an image can be utilized as legitimate confirmation, in crime scene investigation, and in numerous different fields. The image forgery detection techniques intend to confirm the credibility of…

    Submitted 30 March, 2017; v1 submitted 29 March, 2017; originally announced March 2017.

  48. arXiv:1702.07180  [pdf, ps, other]

    cs.CC

    Small hitting-sets for tiny arithmetic circuits or: How to turn bad designs into good

    Authors: Manindra Agrawal, Michael Forbes, Sumanta Ghosh, Nitin Saxena

    Abstract: We show that if we can design poly($s$)-time hitting-sets for $Σ\wedge^aΣΠ^{O(\log s)}$ circuits of size $s$, where $a=ω(1)$ is arbitrarily small and the number of variables, or arity $n$, is $O(\log s)$, then we can derandomize blackbox PIT for general circuits in quasipolynomial time. This also establishes that either E$\not\subseteq$\#P/poly or that VP$\ne$VNP. In fact, we show that one only ne…

    Submitted 23 February, 2017; originally announced February 2017.

    Comments: 25 pages, No figures

    ACM Class: F.1.1; I.1.2; F.1.3

  49. arXiv:1603.09201  [pdf, ps, other]

    cond-mat.mes-hall

    Low-damping transmission of spin waves through YIG/Pt-based layered structures for spin-orbit-torque applications

    Authors: Dmytro A. Bozhko, Alexander A. Serga, Milan Agrawal, Burkard Hillebrands, Mikhail P. Kostylev

    Abstract: We show that in YIG-Pt bi-layers, which are widely used in experiments on the spin transfer torque and spin Hall effects, the spin-wave amplitude significantly decreases in comparison to a single YIG film due to the excitation of microwave eddy currents in a Pt coat. By introducing a novel excitation geometry, where the Pt layer faces the ground plane of a microstrip line structure, we suppressed…

    Submitted 30 March, 2016; originally announced March 2016.

    Journal ref: Advanced Materials Interfaces 9, 2201323 (2022)

  50. arXiv:1508.07517  [pdf, ps, other]

    cond-mat.mes-hall

    Spin-transfer torque based damping control of parametrically excited spin waves in a magnetic insulator

    Authors: V. Lauer, D. A. Bozhko, T. Brächer, P. Pirro, V. I. Vasyuchka, A. A. Serga, M. B. Jungfleisch, M. Agrawal, Yu. V. Kobljanskyj, G. A. Melkov, C. Dubs, B. Hillebrands, A. V. Chumak

    Abstract: The damping of spin waves parametrically excited in the magnetic insulator Yttrium Iron Garnet (YIG) is controlled by a dc current passed through an adjacent normal-metal film. The experiment is performed on a macroscopically sized YIG(100nm)/Pt(10nm) bilayer of 4x2 mm^2 lateral dimensions. The spin-wave relaxation frequency is determined via the threshold of the parametric instability measured by…

    Submitted 29 August, 2015; originally announced August 2015.

    Journal ref: Appl. Phys. Lett. 108, 012402 (2016)