Skip to main content

Showing 1–23 of 23 results for author: Madan, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05132  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.RO

    3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination

    Authors: Jianing Yang, Xuweiyi Chen, Nikhil Madaan, Madhavan Iyengar, Shengyi Qian, David F. Fouhey, Joyce Chai

    Abstract: The integration of language and 3D perception is crucial for develo** embodied agents and robots that comprehend and interact with the physical world. While large language models (LLMs) have demonstrated impressive language understanding and generation capabilities, their adaptation to 3D environments (3D-LLMs) remains in its early stages. A primary challenge is the absence of large-scale datase… ▽ More

    Submitted 12 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Project website: https://3d-grand.github.io

  2. arXiv:2405.03770  [pdf, other

    cs.CV

    Foundation Models for Video Understanding: A Survey

    Authors: Neelu Madan, Andreas Moegelmose, Rajat Modi, Yogesh S. Rawat, Thomas B. Moeslund

    Abstract: Video Foundation Models (ViFMs) aim to learn a general-purpose representation for various video understanding tasks. Leveraging large-scale datasets and powerful models, ViFMs achieve this by capturing robust and generic features from video data. This survey analyzes over 200 video foundational models, offering a comprehensive overview of benchmarks and evaluation metrics across 14 distinct video… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  3. arXiv:2403.06009  [pdf, other

    cs.LG

    Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

    Authors: Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy , et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we presen… ▽ More

    Submitted 13 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  4. arXiv:2403.00826  [pdf, other

    cs.CL cs.CR cs.LG

    LLMGuard: Guarding Against Unsafe LLM Behavior

    Authors: Shubh Goyal, Medha Hira, Shubham Mishra, Sukriti Goyal, Arnav Goel, Niharika Dadu, Kirushikesh DB, Sameep Mehta, Nishtha Madaan

    Abstract: Although the rise of Large Language Models (LLMs) in enterprise settings brings new opportunities and capabilities, it also brings challenges, such as the risk of generating inappropriate, biased, or misleading content that violates regulations and can have legal concerns. To alleviate this, we present "LLMGuard", a tool that monitors user interactions with an LLM application and flags content aga… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

    Comments: accepted in demonstration track of AAAI-24

  5. arXiv:2312.13616  [pdf, other

    cs.LG cs.AI

    Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion

    Authors: Nishtha Madaan, Srikanta Bedathur

    Abstract: Generating counterfactual explanations is one of the most effective approaches for uncovering the inner workings of black-box neural network models and building user trust. While remarkable strides have been made in generative modeling using diffusion models in domains like vision, their utility in generating counterfactual explanations in structured modalities remains unexplored. In this paper, w… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 13 pages

  6. arXiv:2310.11594  [pdf, other

    cs.LG cs.AI

    Adversarial Robustness Unhardening via Backdoor Attacks in Federated Learning

    Authors: Tae** Kim, Jiarui Li, Shubhranshu Singh, Nikhil Madaan, Carlee Joe-Wong

    Abstract: In today's data-driven landscape, the delicate equilibrium between safeguarding user privacy and unleashing data potential stands as a paramount concern. Federated learning, which enables collaborative model training without necessitating data sharing, has emerged as a privacy-centric solution. This decentralized approach brings forth security challenges, notably poisoning and backdoor attacks whe… ▽ More

    Submitted 20 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: 8 pages, 6 main pages of text, 4 figures, 2 tables. Made for a Neurips workshop on backdoor attacks

  7. arXiv:2309.12311  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.RO

    LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent

    Authors: Jianing Yang, Xuweiyi Chen, Shengyi Qian, Nikhil Madaan, Madhavan Iyengar, David F. Fouhey, Joyce Chai

    Abstract: 3D visual grounding is a critical skill for household robots, enabling them to navigate, manipulate objects, and answer questions based on their environment. While existing approaches often rely on extensive labeled data or exhibit limitations in handling complex language queries, we propose LLM-Grounder, a novel zero-shot, open-vocabulary, Large Language Model (LLM)-based 3D visual grounding pipe… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Project website: https://chat-with-nerf.github.io/

  8. arXiv:2308.16572  [pdf, other

    cs.CV cs.AI cs.LG

    CL-MAE: Curriculum-Learned Masked Autoencoders

    Authors: Neelu Madan, Nicolae-Catalin Ristea, Kamal Nasrollahi, Thomas B. Moeslund, Radu Tudor Ionescu

    Abstract: Masked image modeling has been demonstrated as a powerful pretext task for generating robust representations that can be effectively generalized across multiple downstream tasks. Typically, this approach involves randomly masking patches (tokens) in input images, with the masking strategy remaining unchanged during training. In this paper, we propose a curriculum learning approach that updates the… ▽ More

    Submitted 28 February, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted at WACV 2024

  9. arXiv:2308.07973  [pdf, other

    cs.CL

    "Beware of deception": Detecting Half-Truth and Debunking it through Controlled Claim Editing

    Authors: Sandeep Singamsetty, Nishtha Madaan, Sameep Mehta, Varad Bhatnagar, Pushpak Bhattacharyya

    Abstract: The prevalence of half-truths, which are statements containing some truth but that are ultimately deceptive, has risen with the increasing use of the internet. To help combat this problem, we have created a comprehensive pipeline consisting of a half-truth detection model and a claim editing model. Our approach utilizes the T5 model for controlled claim editing; "controlled" here means precise adj… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  10. arXiv:2211.04250  [pdf, other

    cs.LG cs.AI cs.CL

    DetAIL : A Tool to Automatically Detect and Analyze Drift In Language

    Authors: Nishtha Madaan, Adithya Manjunatha, Hrithik Nambiar, Aviral Kumar Goel, Harivansh Kumar, Diptikalyan Saha, Srikanta Bedathur

    Abstract: Machine learning and deep learning-based decision making has become part of today's software. The goal of this work is to ensure that machine learning and deep learning-based systems are as trusted as traditional software. Traditional software is made dependable by following rigorous practice like static analysis, testing, debugging, verifying, and repairing throughout the development and maintena… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  11. Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection

    Authors: Neelu Madan, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: Anomaly detection has recently gained increasing attention in the field of computer vision, likely due to its broad set of applications ranging from product fault detection on industrial production lines and impending event detection in video surveillance to finding lesions in medical scans. Regardless of the domain, anomaly detection is typically framed as a one-class classification task, where t… ▽ More

    Submitted 5 October, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence

  12. arXiv:2209.08412  [pdf, other

    cs.LG cs.CR

    Characterizing Internal Evasion Attacks in Federated Learning

    Authors: Tae** Kim, Shubhranshu Singh, Nikhil Madaan, Carlee Joe-Wong

    Abstract: Federated learning allows for clients in a distributed system to jointly train a machine learning model. However, clients' models are vulnerable to attacks during the training and testing phases. In this paper, we address the issue of adversarial clients performing "internal evasion attacks": crafting evasion attacks at test time to deceive other clients. For example, adversaries may aim to deceiv… ▽ More

    Submitted 20 October, 2023; v1 submitted 17 September, 2022; originally announced September 2022.

    Comments: 16 pages, 8 figures (14 images if counting sub-figures separately), Camera ready version for AISTATS 2023, longer version of paper submitted to CrossFL 2022 poster workshop, code available at (https://github.com/tj-kim/pFedDef_v1)

  13. arXiv:2206.10429  [pdf, other

    cs.CL cs.LG

    Plug and Play Counterfactual Text Generation for Model Robustness

    Authors: Nishtha Madaan, Srikanta Bedathur, Diptikalyan Saha

    Abstract: Generating counterfactual test-cases is an important backbone for testing NLP models and making them as robust and reliable as traditional software. In generating the test-cases, a desired property is the ability to control the test-case generation in a flexible manner to test for a large variety of failure cases and to explain and repair them in a targeted manner. In this direction, significant p… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  14. arXiv:2206.08081  [pdf, other

    cs.CL cs.LG

    TransDrift: Modeling Word-Embedding Drift using Transformer

    Authors: Nishtha Madaan, Prateek Chaudhury, Nishant Kumar, Srikanta Bedathur

    Abstract: In modern NLP applications, word embeddings are a crucial backbone that can be readily shared across a number of tasks. However as the text distributions change and word semantics evolve over time, the downstream applications using the embeddings can suffer if the word representations do not conform to the data drift. Thus, maintaining word embeddings to be consistent with the underlying data dist… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 10 pages

  15. arXiv:2203.13834  [pdf, other

    cs.CV cs.AI cs.LG

    A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration

    Authors: Ramya Hebbalaguppe, Jatin Prakash, Neelabh Madan, Chetan Arora

    Abstract: Deep Neural Networks ( DNN s) are known to make overconfident mistakes, which makes their use problematic in safety-critical applications. State-of-the-art ( SOTA ) calibration techniques improve on the confidence of predicted labels alone and leave the confidence of non-max classes (e.g. top-2, top-5) uncalibrated. Such calibration is not suitable for label refinement using post-processing. Furth… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted in IEEE Computer Vision and Pattern Recognition 2022

  16. arXiv:2111.09099  [pdf, other

    cs.CV cs.LG

    Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection

    Authors: Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: Anomaly detection is commonly pursued as a one-class classification problem, where models can only learn from normal training samples, while being evaluated on both normal and abnormal test samples. Among the successful approaches for anomaly detection, a distinguished category of methods relies on predicting masked information (e.g. patches, future frames, etc.) and leveraging the reconstruction… ▽ More

    Submitted 14 March, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: Accepted at CVPR 2022. Paper + supplementary (14 pages, 9 figures)

  17. arXiv:2012.04698  [pdf, other

    cs.CL cs.AI cs.LG

    Generate Your Counterfactuals: Towards Controlled Counterfactual Generation for Text

    Authors: Nishtha Madaan, Inkit Padhi, Naveen Panwar, Diptikalyan Saha

    Abstract: Machine Learning has seen tremendous growth recently, which has led to larger adoption of ML systems for educational assessments, credit risk, healthcare, employment, criminal justice, to name a few. The trustworthiness of ML and NLP systems is a crucial aspect and requires a guarantee that the decisions they make are fair and robust. Aligned with this, we propose a framework GYC, to generate a se… ▽ More

    Submitted 17 March, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted at AAAI Conference on Artificial Intelligence (AAAI 2021)

  18. arXiv:2007.08028  [pdf

    q-bio.QM cs.CV cs.LG eess.IV

    Predicting Clinical Outcomes in COVID-19 using Radiomics and Deep Learning on Chest Radiographs: A Multi-Institutional Study

    Authors: Joseph Bae, Saarthak Kapse, Gagandeep Singh, Rishabh Gattu, Syed Ali, Neal Shah, Colin Marshall, Jonathan Pierce, Tej Phatak, Amit Gupta, Jeremy Green, Nikhil Madan, Prateek Prasanna

    Abstract: We predict mechanical ventilation requirement and mortality using computational modeling of chest radiographs (CXRs) for coronavirus disease 2019 (COVID-19) patients. This two-center, retrospective study analyzed 530 deidentified CXRs from 515 COVID-19 patients treated at Stony Brook University Hospital and Newark Beth Israel Medical Center between March and August 2020. DL and machine learning cl… ▽ More

    Submitted 1 July, 2021; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: Joseph Bae and Saarthak Kapse have contributed equally to this work

    ACM Class: J.3; I.2.6

  19. arXiv:2001.06693  [pdf, other

    cs.CL cs.AI cs.LG

    Fair Transfer of Multiple Style Attributes in Text

    Authors: Karan Dabas, Nishtha Madan, Vijay Arya, Sameep Mehta, Gautam Singh, Tanmoy Chakraborty

    Abstract: To preserve anonymity and obfuscate their identity on online platforms users may morph their text and portray themselves as a different gender or demographic. Similarly, a chatbot may need to customize its communication style to improve engagement with its audience. This manner of changing the style of written text has gained significant attention in recent years. Yet these past research works lar… ▽ More

    Submitted 18 January, 2020; originally announced January 2020.

  20. arXiv:1807.10615  [pdf, other

    cs.CL cs.AI

    Judging a Book by its Description : Analyzing Gender Stereotypes in the Man Bookers Prize Winning Fiction

    Authors: Nishtha Madaan, Sameep Mehta, Shravika Mittal, Ashima Suvarna

    Abstract: The presence of gender stereotypes in many aspects of society is a well-known phenomenon. In this paper, we focus on studying and quantifying such stereotypes and bias in the Man Bookers Prize winning fiction. We consider 275 books shortlisted for Man Bookers Prize between 1969 and 2017. The gender bias is analyzed by semantic modeling of book descriptions on Goodreads. This reveals the pervasiven… ▽ More

    Submitted 25 July, 2018; originally announced July 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1710.04117

  21. arXiv:1804.03839  [pdf, other

    cs.CL cs.CY

    Generating Clues for Gender based Occupation De-biasing in Text

    Authors: Nishtha Madaan, Gautam Singh, Sameep Mehta, Aditya Chetan, Brihi Joshi

    Abstract: Vast availability of text data has enabled widespread training and use of AI systems that not only learn and predict attributes from the text but also generate text automatically. However, these AI models also learn gender, racial and ethnic biases present in the training data. In this paper, we present the first system that discovers the possibility that a given text portrays a gender stereotype… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

  22. arXiv:1710.04142  [pdf, other

    cs.CY cs.CL

    Bollywood Movie Corpus for Text, Images and Videos

    Authors: Nishtha Madaan, Sameep Mehta, Mayank Saxena, Aditi Aggarwal, Taneea S Agrawaal, Vrinda Malhotra

    Abstract: In past few years, several data-sets have been released for text and images. We present an approach to create the data-set for use in detecting and removing gender bias from text. We also include a set of challenges we have faced while creating this corpora. In this work, we have worked with movie data from Wikipedia plots and movie trailers from YouTube. Our Bollywood Movie corpus contains 4000 m… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

  23. arXiv:1710.04117  [pdf, other

    cs.SI cs.CY

    Analyzing Gender Stereoty** in Bollywood Movies

    Authors: Nishtha Madaan, Sameep Mehta, Taneea S Agrawaal, Vrinda Malhotra, Aditi Aggarwal, Mayank Saxena

    Abstract: The presence of gender stereotypes in many aspects of society is a well-known phenomenon. In this paper, we focus on studying such stereotypes and bias in Hindi movie industry (Bollywood). We analyze movie plots and posters for all movies released since 1970. The gender bias is detected by semantic modeling of plots at inter-sentence and intra-sentence level. Different features like occupation, in… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.