Skip to main content

Showing 1–50 of 57 results for author: Agarwal, V

.
  1. arXiv:2407.05887  [pdf, other

    cs.CL cs.AI cs.LG

    Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs

    Authors: Sanjeet Singh, Shreya Gupta, Niralee Gupta, Naimish Sharma, Lokesh Srivastava, Vibhu Agarwal, Ashutosh Modi

    Abstract: The consequences of a healthcare data breach can be devastating for the patients, providers, and payers. The average financial impact of a data breach in recent months has been estimated to be close to USD 10 million. This is especially significant for healthcare organizations in India that are managing rapid digitization while still establishing data governance procedures that align with the lett… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted at BioNLP Workshop at ACL 2024; 21 pages (9 pages main content)

  2. What's in the Flow? Exploiting Temporal Motion Cues for Unsupervised Generic Event Boundary Detection

    Authors: Sourabh Vasant Gothe, Vibhav Agarwal, Sourav Ghosh, Jayesh Rajkumar Vachhani, Pranay Kashyap, Barath Raj Kandur Raja

    Abstract: Generic Event Boundary Detection (GEBD) task aims to recognize generic, taxonomy-free boundaries that segment a video into meaningful events. Current methods typically involve a neural model trained on a large volume of data, demanding substantial computational power and storage space. We explore two pivotal questions pertaining to GEBD: Can non-parametric algorithms outperform unsupervised neural… ▽ More

    Submitted 15 February, 2024; originally announced April 2024.

    Comments: Accepted in WACV-2024. Supplementary at https://openaccess.thecvf.com/content/WACV2024/supplemental/Gothe_Whats_in_the_WACV_2024_supplemental.pdf

    Journal ref: 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 2024, pp. 6926-6935

  3. arXiv:2404.05501  [pdf

    q-bio.NC cs.AI cs.LG

    Data Science In Olfaction

    Authors: Vivek Agarwal, Joshua Harvey, Dmitry Rinberg, Vasant Dhar

    Abstract: Advances in neural sensing technology are making it possible to observe the olfactory process in great detail. In this paper, we conceptualize smell from a Data Science and AI perspective, that relates the properties of odorants to how they are sensed and analyzed in the olfactory system from the nose to the brain. Drawing distinctions to color vision, we argue that smell presents unique measureme… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 20 pages, 10 Figures, 2 Appendix, 1 Table

  4. arXiv:2404.03048  [pdf, other

    cs.CY cs.CL

    Decentralised Moderation for Interoperable Social Networks: A Conversation-based Approach for Pleroma and the Fediverse

    Authors: Vibhor Agarwal, Aravindh Raman, Nishanth Sastry, Ahmed M. Abdelmoniem, Gareth Tyson, Ignacio Castro

    Abstract: The recent development of decentralised and interoperable social networks (such as the "fediverse") creates new challenges for content moderators. This is because millions of posts generated on one server can easily "spread" to another, even if the recipient server has very different moderation policies. An obvious solution would be to leverage moderation tools to automatically tag (and filter) po… ▽ More

    Submitted 16 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted at International AAAI Conference on Web and Social Media (ICWSM) 2024. Please cite accordingly!

  5. TrICy: Trigger-guided Data-to-text Generation with Intent aware Attention-Copy

    Authors: Vibhav Agarwal, Sourav Ghosh, Harichandana BSS, Himanshu Arora, Barath Raj Kandur Raja

    Abstract: Data-to-text (D2T) generation is a crucial task in many natural language understanding (NLU) applications and forms the foundation of task-oriented dialog systems. In the context of conversational AI solutions that can work directly with local data on the user's device, architectures utilizing large pre-trained language models (PLMs) are impractical for on-device deployment due to a high memory fo… ▽ More

    Submitted 25 January, 2024; originally announced February 2024.

    Comments: Published in the IEEE/ACM Transactions on Audio, Speech, and Language Processing. (Sourav Ghosh and Vibhav Agarwal contributed equally to this work.)

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1173-1184, 2024

  6. arXiv:2402.01687  [pdf, ps, other

    cs.CY cs.HC cs.LG

    "Which LLM should I use?": Evaluating LLMs for tasks performed by Undergraduate Computer Science Students

    Authors: Vibhor Agarwal, Madhav Krishan Garg, Sahiti Dharmavaram, Dhruv Kumar

    Abstract: This study evaluates the effectiveness of various large language models (LLMs) in performing tasks common among undergraduate computer science students. Although a number of research studies in the computing education community have explored the possibility of using LLMs for a variety of tasks, there is a lack of comprehensive research comparing different LLMs and evaluating which LLMs are most ef… ▽ More

    Submitted 3 April, 2024; v1 submitted 22 January, 2024; originally announced February 2024.

    Comments: Under review

  7. arXiv:2311.17921  [pdf, other

    cs.CV

    Do text-free diffusion models learn discriminative visual representations?

    Authors: Soumik Mukhopadhyay, Matthew Gwilliam, Yosuke Yamaguchi, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Tianyi Zhou, Abhinav Shrivastava

    Abstract: While many unsupervised learning models focus on one family of tasks, either generative or discriminative, we explore the possibility of a unified representation learner: a model which addresses both families of tasks simultaneously. We identify diffusion models, a state-of-the-art method for generative tasks, as a prime candidate. Such models involve training a U-Net to iteratively predict and re… ▽ More

    Submitted 29 November, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Website: see https://mgwillia.github.io/diffssl/ . Code: see https://github.com/soumik-kanad/diffssl . The first two authors contributed equally. 15 pages, 9 figures, 15 tables. Submission under review. (this article supersedes arXiv:2307.08702)

  8. arXiv:2310.14028  [pdf, other

    cs.CL

    GASCOM: Graph-based Attentive Semantic Context Modeling for Online Conversation Understanding

    Authors: Vibhor Agarwal, Yu Chen, Nishanth Sastry

    Abstract: Online conversation understanding is an important yet challenging NLP problem which has many useful applications (e.g., hate speech detection). However, online conversations typically unfold over a series of posts and replies to those posts, forming a tree structure within which individual posts may refer to semantic context from higher up the tree. Such semantic cross-referencing makes it difficu… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  9. arXiv:2310.13985  [pdf, ps, other

    cs.CL

    HateRephrase: Zero- and Few-Shot Reduction of Hate Intensity in Online Posts using Large Language Models

    Authors: Vibhor Agarwal, Yu Chen, Nishanth Sastry

    Abstract: Hate speech has become pervasive in today's digital age. Although there has been considerable research to detect hate speech or generate counter speech to combat hateful views, these approaches still cannot completely eliminate the potential harmful societal consequences of hate speech -- hate speech, even when detected, can often not be taken down or is often not taken down enough; and hate speec… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  10. arXiv:2308.14608  [pdf, other

    cs.LG cs.CL cs.CY cs.SI

    AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics

    Authors: Vahid Ghafouri, Vibhor Agarwal, Yong Zhang, Nishanth Sastry, Jose Such, Guillermo Suarez-Tangil

    Abstract: The introduction of ChatGPT and the subsequent improvement of Large Language Models (LLMs) have prompted more and more individuals to turn to the use of ChatBots, both for information and assistance with decision-making. However, the information the user is after is often not formulated by these ChatBots objectively enough to be provided with a definite, globally accepted answer. Controversial t… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  11. arXiv:2307.08702  [pdf, other

    cs.CV

    Diffusion Models Beat GANs on Image Classification

    Authors: Soumik Mukhopadhyay, Matthew Gwilliam, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Srinidhi Hegde, Tianyi Zhou, Abhinav Shrivastava

    Abstract: While many unsupervised learning models focus on one family of tasks, either generative or discriminative, we explore the possibility of a unified representation learner: a model which uses a single pre-training stage to address both families of tasks simultaneously. We identify diffusion models as a prime candidate. Diffusion models have risen to prominence as a state-of-the-art method for image… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 15 pages, 7 figures, 10 tables, submission under review

  12. arXiv:2306.13995  [pdf, other

    cs.AI cs.LG

    A clustering and graph deep learning-based framework for COVID-19 drug repurposing

    Authors: Chaarvi Bansal, Rohitash Chandra, Vinti Agarwal, P. R. Deepa

    Abstract: Drug repurposing (or repositioning) is the process of finding new therapeutic uses for drugs already approved by drug regulatory authorities (e.g., the Food and Drug Administration (FDA) and Therapeutic Goods Administration (TGA)) for other diseases. This involves analyzing the interactions between different biological entities, such as drug targets (genes/proteins and biological pathways) and dru… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  13. arXiv:2304.14507  [pdf

    cs.CV eess.IV

    Suspicious Vehicle Detection Using Licence Plate Detection And Facial Feature Recognition

    Authors: Vrinda Agarwal, Aaron George Pichappa, Manideep Ramisetty, Bala Murugan MS, Manoj kumar Rajagopal

    Abstract: With the increasing need to strengthen vehicle safety and detection, the availability of pre-existing methods of catching criminals and identifying vehicles manually through the various traffic surveillance cameras is not only time-consuming but also inefficient. With the advancement of technology in every field the use of real-time traffic surveillance models will help facilitate an easy approach… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: eight pages and three figures

  14. arXiv:2212.10405  [pdf, other

    cs.CL cs.SI

    AnnoBERT: Effectively Representing Multiple Annotators' Label Choices to Improve Hate Speech Detection

    Authors: Wenjie Yin, Vibhor Agarwal, Aiqi Jiang, Arkaitz Zubiaga, Nishanth Sastry

    Abstract: Supervised approaches generally rely on majority-based labels. However, it is hard to achieve high agreement among annotators in subjective tasks such as hate speech detection. Existing neural network models principally regard labels as categorical variables, while ignoring the semantic information in diverse label texts. In this paper, we propose AnnoBERT, a first-of-its-kind architecture integra… ▽ More

    Submitted 10 January, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: accepted at ICWSM 2023

    Journal ref: 17th International AAAI Conference on Web and Social Media (ICWSM 2023). Please cite accordingly

  15. arXiv:2211.09207  [pdf, other

    cs.CL cs.AI cs.CY

    A Graph-Based Context-Aware Model to Understand Online Conversations

    Authors: Vibhor Agarwal, Anthony P. Young, Sagar Joglekar, Nishanth Sastry

    Abstract: Online forums that allow for participatory engagement between users have been transformative for the public discussion of many important issues. However, such conversations can sometimes escalate into full-blown exchanges of hate and misinformation. Existing approaches in natural language processing (NLP), such as deep learning models for classification tasks, use as inputs only a single comment o… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 25 pages, 9 figures. arXiv admin note: text overlap with arXiv:2202.08175

    Journal ref: ACM Transactions on the Web 2023

  16. arXiv:2211.08122  [pdf, ps, other

    physics.chem-ph

    The Desorption Rate at Liquid-Solid Interface

    Authors: Krishna Jaiswal, Horia Metiu, Vishal Agarwal

    Abstract: We use a simple generic model to study the desorption of atoms from a solid surface in contact with a liquid, by using a combination of Monte Carlo and molecular dynamics simulations. The behavior of the system depends on two parameters: the strength $ε_{LS}$ of the solid-liquid interaction energy and the strength $ε_{LL}$ of the liquid-liquid interaction energy. The contact with the solid surface… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  17. arXiv:2211.06104  [pdf, other

    cs.CV

    Bounding Box Priors for Cell Detection with Point Annotations

    Authors: Hari Om Aggrawal, Dipam Goswami, Vinti Agarwal

    Abstract: The size of an individual cell type, such as a red blood cell, does not vary much among humans. We use this knowledge as a prior for classifying and detecting cells in images with only a few ground truth bounding box annotations, while most of the cells are annotated with points. This setting leads to weakly semi-supervised learning. We propose replacing points with either stochastic (ST) boxes or… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  18. arXiv:2208.01439  [pdf, other

    q-bio.OT cs.LG stat.ML

    Unsupervised machine learning framework for discriminating major variants of concern during COVID-19

    Authors: Rohitash Chandra, Chaarvi Bansal, Mingyue Kang, Tom Blau, Vinti Agarwal, Pranjal Singh, Laurence O. W. Wilson, Seshadri Vasan

    Abstract: Due to the high mutation rate of the virus, the COVID-19 pandemic evolved rapidly. Certain variants of the virus, such as Delta and Omicron, emerged with altered viral properties leading to severe transmission and death rates. These variants burdened the medical systems worldwide with a major impact to travel, productivity, and the world economy. Unsupervised machine learning methods have the abil… ▽ More

    Submitted 25 May, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Journal ref: PLOS ONE, 2023

  19. arXiv:2206.10225  [pdf, other

    cs.CV cs.HC

    Broken News: Making Newspapers Accessible to Print-Impaired

    Authors: Vishal Agarwal, Tanuja Ganu, Saikat Guha

    Abstract: Accessing daily news content still remains a big challenge for people with print-impairment including blind and low-vision due to opacity of printed content and hindrance from online sources. In this paper, we present our approach for digitization of print newspaper into an accessible file format such as HTML. We use an ensemble of instance segmentation and detection framework for newspaper layout… ▽ More

    Submitted 23 June, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

    Journal ref: Extended Abstract at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)

  20. arXiv:2205.01404  [pdf, other

    cs.CL cs.AI cs.LG q-bio.NC

    Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity?

    Authors: Subba Reddy Oota, Jashn Arora, Veeral Agarwal, Mounika Marreddy, Manish Gupta, Bapi Raju Surampudi

    Abstract: Several popular Transformer based language models have been found to be successful for text-driven brain encoding. However, existing literature leverages only pretrained text Transformer models and has not explored the efficacy of task-specific learned Transformer representations. In this work, we explore transfer learning from representations learned for ten popular natural language processing ta… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: 18 pages, 18 figures

  21. "Way back then": A Data-driven View of 25+ years of Web Evolution

    Authors: Vibhor Agarwal, Nishanth Sastry

    Abstract: Since the inception of the first web page three decades back, the Web has evolved considerably, from static HTML pages in the beginning to the dynamic web pages of today, from mainly the text-based pages of the 1990s to today's multimedia rich pages, etc. Although much of this is known anecdotally, to our knowledge, there is no quantitative documentation of the extent and timing of these changes.… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: To appear at The ACM Web Conference 2022

  22. GraphNLI: A Graph-based Natural Language Inference Model for Polarity Prediction in Online Debates

    Authors: Vibhor Agarwal, Sagar Joglekar, Anthony P. Young, Nishanth Sastry

    Abstract: Online forums that allow participatory engagement between users have been transformative for public discussion of important issues. However, debates on such forums can sometimes escalate into full blown exchanges of hate or misinformation. An important tool in understanding and tackling such problems is to be able to infer the argumentative relation of whether a reply is supporting or attacking th… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: To appear at The ACM Web Conference 2022

  23. PrivPAS: A real time Privacy-Preserving AI System and applied ethics

    Authors: Harichandana B S S, Vibhav Agarwal, Sourav Ghosh, Gopi Ramena, Sumit Kumar, Barath Raj Kandur Raja

    Abstract: With 3.78 billion social media users worldwide in 2021 (48% of the human population), almost 3 billion images are shared daily. At the same time, a consistent evolution of smartphone cameras has led to a photography explosion with 85% of all new pictures being captured using smartphones. However, lately, there has been an increased discussion of privacy concerns when a person being photographed is… ▽ More

    Submitted 8 February, 2022; v1 submitted 5 February, 2022; originally announced February 2022.

    Comments: Accepted at 16th IEEE International Conference on Semantic Computing (ICSC), January 26-28, 2022 [update: Best Paper candidate at ICSC 2022]

    Journal ref: 2022 IEEE 16th International Conference on Semantic Computing (ICSC), Laguna Hills, CA, USA, 2022, pp. 9-16

  24. arXiv:2112.08984  [pdf, other

    eess.AS cs.SD eess.SP physics.app-ph

    Object-based synthesis of scra** and rolling sounds based on non-linear physical constraints

    Authors: Vinayak Agarwal, Maddie Cusimano, James Traer, Josh McDermott

    Abstract: Sustained contact interactions like scra** and rolling produce a wide variety of sounds. Previous studies have explored ways to synthesize these sounds efficiently and intuitively but could not fully mimic the rich structure of real instances of these sounds. We present a novel source-filter model for realistic synthesis of scra** and rolling sounds with physically and perceptually relevant co… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Journal ref: Proceeding of the 24th International Conference on Digital Audio Effects (DAFx-20in21), 2021

  25. arXiv:2111.10374  [pdf, other

    q-bio.QM cs.CV eess.IV

    Urine Microscopic Image Dataset

    Authors: Dipam Goswami, Hari Om Aggrawal, Rajiv Gupta, Vinti Agarwal

    Abstract: Urinalysis is a standard diagnostic test to detect urinary system related problems. The automation of urinalysis will reduce the overall diagnostic time. Recent studies used urine microscopic datasets for designing deep learning based algorithms to classify and detect urine cells. But these datasets are not publicly available for further research. To alleviate the need for urine datsets, we prepar… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: 7 pages, 1 image

  26. arXiv:2111.00861  [pdf, other

    cs.CV cs.LG

    A Frequency Perspective of Adversarial Robustness

    Authors: Shishira R Maiya, Max Ehrlich, Vatsal Agarwal, Ser-Nam Lim, Tom Goldstein, Abhinav Shrivastava

    Abstract: Adversarial examples pose a unique challenge for deep learning systems. Despite recent advances in both attacks and defenses, there is still a lack of clarity and consensus in the community about the true nature and underlying properties of adversarial examples. A deep understanding of these examples can provide new insights towards the development of more effective attacks and defenses. Driven by… ▽ More

    Submitted 26 October, 2021; originally announced November 2021.

  27. LIDSNet: A Lightweight on-device Intent Detection model using Deep Siamese Network

    Authors: Vibhav Agarwal, Sudeep Deepak Shivnikar, Sourav Ghosh, Himanshu Arora, Yashwant Saini

    Abstract: Intent detection is a crucial task in any Natural Language Understanding (NLU) system and forms the foundation of a task-oriented dialogue system. To build high-quality real-world conversational solutions for edge devices, there is a need for deploying intent detection model on device. This necessitates a light-weight, fast, and accurate model that can perform efficiently in a resource-constrained… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in 2021 IEEE 20th International Conference on Machine Learning and Applications (ICMLA)

    Journal ref: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA), Pasadena, CA, USA, 2021, pp. 1112-1117

  28. arXiv:2110.08413  [pdf, other

    cs.CL cs.LG

    Invariant Language Modeling

    Authors: Maxime Peyrard, Sarvjeet Singh Ghotra, Martin Josifoski, Vidhan Agarwal, Barun Patra, Dean Carignan, Emre Kiciman, Robert West

    Abstract: Large pretrained language models are critical components of modern NLP pipelines. Yet, they suffer from spurious correlations, poor out-of-domain generalization, and biases. Inspired by recent progress in causal machine learning, in particular the invariant risk minimization (IRM) paradigm, we propose invariant language modeling, a framework for learning invariant representations that generalize b… ▽ More

    Submitted 14 November, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Published at EMNLP 2022

  29. A novel approach for modelling and classifying sit-to-stand kinematics using inertial sensors

    Authors: Maitreyee Wairagkar, Emma Villeneuve, Rachel King, Balazs Janko, Malcolm Burnett, Ann Ashburn, Veena Agarwal, R. Simon Sherratt, William Holderbaum, William Harwin

    Abstract: Sit-to-stand transitions are an important part of activities of daily living and play a key role in functional mobility in humans. The sit-to-stand movement is often affected in older adults due to frailty and in patients with motor impairments such as Parkinson's disease leading to falls. Studying kinematics of sit-to-stand transitions can provide insight in assessment, monitoring and develo**… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.

    Comments: 25 pages, 11 figures

  30. arXiv:2105.07135  [pdf

    cs.MM cs.AI cs.SD eess.AS eess.IV

    Analyzing Images for Music Recommendation

    Authors: Anant Baijal, Vivek Agarwal, Danny Hyun

    Abstract: Experiencing images with suitable music can greatly enrich the overall user experience. The proposed image analysis method treats an artwork image differently from a photograph image. Automatic image classification is performed using deep-learning based models. An illustrative analysis showcasing the ability of our deep-models to inherently learn and utilize perceptually relevant features when cla… ▽ More

    Submitted 15 May, 2021; originally announced May 2021.

    Comments: IEEE International Conference on Consumer Electronics (IEEE ICCE 2021)

  31. arXiv:2104.14095  [pdf, ps, other

    cs.AI cs.LG

    Analyzing the Nuances of Transformers' Polynomial Simplification Abilities

    Authors: Vishesh Agarwal, Somak Aditya, Navin Goyal

    Abstract: Symbolic Mathematical tasks such as integration often require multiple well-defined steps and understanding of sub-tasks to reach a solution. To understand Transformers' abilities in such tasks in a fine-grained manner, we deviate from traditional end-to-end settings, and explore a step-wise polynomial simplification task. Polynomials can be written in a simple normal form as a sum of monomials wh… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: 16 pages, 18 Tables, Accepted ICLR 2021 MathAI Workshop

  32. arXiv:2103.16150  [pdf

    cs.CV cs.LG

    FONTNET: On-Device Font Understanding and Prediction Pipeline

    Authors: Rakshith S, Rishabh Khurana, Vibhav Agarwal, Jayesh Rajkumar Vachhani, Guggilla Bhanodai

    Abstract: Fonts are one of the most basic and core design concepts. Numerous use cases can benefit from an in depth understanding of Fonts such as Text Customization which can change text in an image while maintaining the Font attributes like style, color, size. Currently, Text recognition solutions can group recognized text based on line breaks or paragraph breaks, if the Font attributes are known multiple… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted for publication in IEEE ICASSP 2021: 46th IEEE International Conference on Acoustics, Speech, & Signal Processing

  33. arXiv:2103.13511  [pdf, other

    cs.LG cs.AI cs.CV

    Addressing catastrophic forgetting for medical domain expansion

    Authors: Sharut Gupta, Praveer Singh, Ken Chang, Liangqiong Qu, Mehak Aggarwal, Nishanth Arun, Ashwin Vaswani, Shruti Raghavan, Vibha Agarwal, Mishka Gidwani, Katharina Hoebel, Jay Patel, Charles Lu, Christopher P. Bridge, Daniel L. Rubin, Jayashree Kalpathy-Cramer

    Abstract: Model brittleness is a key concern when deploying deep learning models in real-world medical settings. A model that has high performance at one institution may suffer a significant decline in performance when tested at other institutions. While pooling datasets from multiple institutions and retraining may provide a straightforward solution, it is often infeasible and may compromise patient privac… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: First three authors contributed equally

  34. arXiv:2103.05806  [pdf, other

    physics.optics cond-mat.mtrl-sci

    Optimization of wide-band quasi-omnidirectional 1-D photonic structures

    Authors: Victor Castillo-Gallardo, Luis Eduardo Puente-Díaz, David Ariza-Flores, Héctor Pérez-Aguilar, W. Luis Mochán, Vivechana Agarwal

    Abstract: We have designed, optimized, fabricated and characterized highly reflective quasi-omnidirectional (angular range of $0-60^\circ$) multilayered structures with a wide spectral range. Two techniques, chir** (a continuous change in thicknesses) and stacking of Bragg-type sub-structures, have been used to enhance the reflectance with minimum thickness for a given pair of refractive indices. Numerica… ▽ More

    Submitted 23 March, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: 11 páginas

  35. arXiv:2103.04442  [pdf, other

    cs.CY

    Differential Tracking Across Topical Webpages of Indian News Media

    Authors: Yash Vekaria, Vibhor Agarwal, Pushkal Agarwal, Sangeeta Mahapatra, Sakthi Balan Muthiah, Nishanth Sastry, Nicolas Kourtellis

    Abstract: Online user privacy and tracking have been extensively studied in recent years, especially due to privacy and personal data-related legislations in the EU and the USA, such as the General Data Protection Regulation, ePrivacy Regulation, and California Consumer Privacy Act. Research has revealed novel tracking and personal identifiable information leakage methods that first- and third-parties emplo… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

  36. arXiv:2102.03656  [pdf, other

    cs.CY

    Under the Spotlight: Web Tracking in Indian Partisan News Websites

    Authors: Vibhor Agarwal, Yash Vekaria, Pushkal Agarwal, Sangeeta Mahapatra, Shounak Set, Sakthi Balan Muthiah, Nishanth Sastry, Nicolas Kourtellis

    Abstract: India is experiencing intense political partisanship and sectarian divisions. The paper performs, to the best of our knowledge, the first comprehensive analysis on the Indian online news media with respect to tracking and partisanship. We build a dataset of 103 online, mostly mainstream news websites. With the help of two experts, alongside data from the Media Ownership Monitor of the Reporters wi… ▽ More

    Submitted 8 March, 2021; v1 submitted 6 February, 2021; originally announced February 2021.

  37. Analytical Model for the Current Density in the Electrochemical Synthesis of Porous Silicon Structures with a Lateral Gradient

    Authors: C. A. Ospina-Delacruz, V. Agarwal, W. L. Mochán

    Abstract: Layered optical devices with a lateral gradient can be fabricated through electrochemical synthesis of porous silicon (PS) using a position dependent etching current density $\bm j(\bm r_\|)$. Predicting the local value of $\bm j(\bm r_\|)$ and the corresponding porosity $p(\bm r_\|)$ and etching rate $v(\bm r_\|)$ is desirable for their systematic design. We develop a simple analytical model for… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: 19 pages, 13 figures

    Journal ref: Optical Materials, Volume 113, 110859 (2021)

  38. arXiv:2101.03025  [pdf, other

    cs.CL cs.LG

    EmpLite: A Lightweight Sequence Labeling Model for Emphasis Selection of Short Texts

    Authors: Vibhav Agarwal, Sourav Ghosh, Kranti Chalamalasetti, Bharath Challa, Sonal Kumari, Harshavardhana, Barath Raj Kandur Raja

    Abstract: Word emphasis in textual content aims at conveying the desired intention by changing the size, color, typeface, style (bold, italic, etc.), and other typographical features. The emphasized words are extremely helpful in drawing the readers' attention to specific information that the authors wish to emphasize. However, performing such emphasis using a soft keyboard for social media interactions is… ▽ More

    Submitted 15 December, 2020; originally announced January 2021.

    Comments: Accepted for publication in ICON 2020: 17th International Conference on Natural Language Processing

    Report number: 2020.icon-1.3 (ACL Anthology)

    Journal ref: 17th International Conference on Natural Language Processing (ICON), Patna, India, December 18 - 21, 2020, pages 19-26, ACL Anthology: 2020.icon-1.3

  39. LiteMuL: A Lightweight On-Device Sequence Tagger using Multi-task Learning

    Authors: Sonal Kumari, Vibhav Agarwal, Bharath Challa, Kranti Chalamalasetti, Sourav Ghosh, Harshavardhana, Barath Raj Kandur Raja

    Abstract: Named entity detection and Parts-of-speech tagging are the key tasks for many NLP applications. Although the current state of the art methods achieved near perfection for long, formal, structured text there are hindrances in deploying these models on memory-constrained devices such as mobile phones. Furthermore, the performance of these models is degraded when they encounter short, informal, and c… ▽ More

    Submitted 29 March, 2021; v1 submitted 15 December, 2020; originally announced January 2021.

    Comments: Published in 2021 IEEE 15th International Conference on Semantic Computing (ICSC); Candidate for Best Paper Award

    Journal ref: 2021 IEEE 15th International Conference on Semantic Computing (ICSC), Laguna Hills, CA, USA, 2021, pp. 1-8

  40. arXiv:2011.05910  [pdf, other

    cs.CL cs.AI

    Audrey: A Personalized Open-Domain Conversational Bot

    Authors: Chung Hoon Hong, Yuan Liang, Sagnik Sinha Roy, Arushi Jain, Vihang Agarwal, Ryan Draves, Zhizhuo Zhou, William Chen, Yujian Liu, Martha Miracky, Lily Ge, Nikola Banovic, David Jurgens

    Abstract: Conversational Intelligence requires that a person engage on informational, personal and relational levels. Advances in Natural Language Understanding have helped recent chatbots succeed at dialog on the informational level. However, current techniques still lag for conversing with humans on a personal level and fully relating to them. The University of Michigan's submission to the Alexa Prize Gra… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

  41. arXiv:2010.10156  [pdf, other

    cs.AI cs.CL

    Extracting Procedural Knowledge from Technical Documents

    Authors: Shivali Agarwal, Shubham Atreja, Vikas Agarwal

    Abstract: Procedures are an important knowledge component of documents that can be leveraged by cognitive assistants for automation, question-answering or driving a conversation. It is a challenging problem to parse big dense documents like product manuals, user guides to automatically understand which parts are talking about procedures and subsequently extract them. Most of the existing research has focuse… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  42. arXiv:2005.00943  [pdf, other

    physics.optics cond-mat.mes-hall

    Stable Calculation of Optical Properties of Large Non-Periodic Dissipative Multilayered Systems

    Authors: Luis Eduardo Puente-Díaz, Victor Castillo-Gallardo, Guillermo P. Ortiz, José Samuel Pérez-Huerta, Héctor Pérez-Aguilar, Vivechana Agarwal, W. Luis Mochán

    Abstract: The calculation of the transfer matrix for a large non-periodic multilayered system may become unstable in the presence of absorption. We discuss the origin of this instability and we explore two methods to overcome it: the use of a total matrix to solve for all the fields at all the interfaces simultaneously and an expansion in the Bloch-like modes of a periodic artificially repeated system. We a… ▽ More

    Submitted 2 May, 2020; originally announced May 2020.

    Comments: 20 pages, 7 figures

  43. arXiv:2003.13440  [pdf

    eess.IV cs.CV

    Computer Aided Detection for Pulmonary Embolism Challenge (CAD-PE)

    Authors: Germán González, Daniel Jimenez-Carretero, Sara Rodríguez-López, Carlos Cano-Espinosa, Miguel Cazorla, Tanya Agarwal, Vinit Agarwal, Nima Tajbakhsh, Michael B. Gotway, Jianming Liang, Mojtaba Masoudi, Noushin Eftekhari, Mahdi Saadatmand, Hamid-Reza Pourreza, Patricia Fraga-Rivas, Eduardo Fraile, Frank J. Rybicki, Ara Kassarjian, Raúl San José Estépar, Maria J. Ledesma-Carbayo

    Abstract: Rationale: Computer aided detection (CAD) algorithms for Pulmonary Embolism (PE) algorithms have been shown to increase radiologists' sensitivity with a small increase in specificity. However, CAD for PE has not been adopted into clinical practice, likely because of the high number of false positives current CAD software produces. Objective: To generate a database of annotated computed tomography… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: 8 pages, 3 figures

  44. arXiv:2001.09174  [pdf, other

    cs.CV

    Weakly Supervised Lesion Co-segmentation on CT Scans

    Authors: Vatsal Agarwal, Youbao Tang, **g Xiao, Ronald M. Summers

    Abstract: Lesion segmentation in medical imaging serves as an effective tool for assessing tumor sizes and monitoring changes in growth. However, not only is manual lesion segmentation time-consuming, but it is also expensive and requires expert radiologist knowledge. Therefore many hospitals rely on a loose substitute called response evaluation criteria in solid tumors (RECIST). Although these annotations… ▽ More

    Submitted 24 January, 2020; originally announced January 2020.

  45. arXiv:2001.08590  [pdf, other

    cs.CV

    Weakly-Supervised Lesion Segmentation on CT Scans using Co-Segmentation

    Authors: Vatsal Agarwal, Youbao Tang, **g Xiao, Ronald M. Summers

    Abstract: Lesion segmentation on computed tomography (CT) scans is an important step for precisely monitoring changes in lesion/tumor growth. This task, however, is very challenging since manual segmentation is prohibitively time-consuming, expensive, and requires professional knowledge. Current practices rely on an imprecise substitute called response evaluation criteria in solid tumors (RECIST). Although… ▽ More

    Submitted 23 January, 2020; originally announced January 2020.

  46. arXiv:1912.07538  [pdf, other

    cs.CV cs.CL cs.LG

    Towards Causal VQA: Revealing and Reducing Spurious Correlations by Invariant and Covariant Semantic Editing

    Authors: Vedika Agarwal, Rakshith Shetty, Mario Fritz

    Abstract: Despite significant success in Visual Question Answering (VQA), VQA models have been shown to be notoriously brittle to linguistic variations in the questions. Due to deficiencies in models and datasets, today's models often rely on correlations rather than predictions that are causal w.r.t. data. In this paper, we propose a novel way to analyze and measure the robustness of the state of the art m… ▽ More

    Submitted 29 May, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: 16 pages

  47. arXiv:1906.03087  [pdf, other

    q-bio.GN cs.LG

    Unsupervised Representation Learning of DNA Sequences

    Authors: Vishal Agarwal, N Jayanth Kumar Reddy, Ashish Anand

    Abstract: Recently several deep learning models have been used for DNA sequence based classification tasks. Often such tasks require long and variable length DNA sequences in the input. In this work, we use a sequence-to-sequence autoencoder model to learn a latent representation of a fixed dimension for long and variable length DNA sequences in an unsupervised manner. We evaluate both quantitatively and qu… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

    Comments: Accepted at 2019 ICML Workshop on Computational Biology

  48. arXiv:1901.08978  [pdf, other

    math.OC

    Reinforcement Learning for Multi-Objective and Constrained Markov Decision Processes

    Authors: Ather Gattami, Qinbo Bai, Vaneet Agarwal

    Abstract: In this paper, we consider the problem of optimization and learning for constrained and multi-objective Markov decision processes, for both discounted rewards and expected average rewards. We formulate the problems as zero-sum games where one player (the agent) solves a Markov decision problem and its opponent solves a bandit optimization problem, which we here call Markov-Bandit games. We extend… ▽ More

    Submitted 4 March, 2021; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1901.07839

  49. arXiv:1811.04346  [pdf, other

    cs.CV

    Deep Face Quality Assessment

    Authors: Vishal Agarwal

    Abstract: Face image quality is an important factor in facial recognition systems as its verification and recognition accuracy is highly dependent on the quality of image presented. Rejecting low quality images can significantly increase the accuracy of any facial recognition system. In this project, a simple approach is presented to train a deep convolutional neural network to perform end-to-end face image… ▽ More

    Submitted 10 November, 2018; originally announced November 2018.

    Comments: Course project report

  50. arXiv:1805.02173  [pdf, other

    cs.CV

    An Interval Type-2 Fuzzy Approach to Automatic PDF Generation for Histogram Specification

    Authors: Vishal Agarwal, Diwanshu Jain, A. Vamshi Krishna Reddy, Frank Chung-Hoon Rhee

    Abstract: Image enhancement plays an important role in several application in the field of computer vision and image processing. Histogram specification (HS) is one of the most widely used techniques for contrast enhancement of an image, which requires an appropriate probability density function for the transformation. In this paper, we propose a fuzzy method to find a suitable PDF automatically for histogr… ▽ More

    Submitted 6 May, 2018; originally announced May 2018.