Showing 1–12 of 12 results for author: Madhavan, R

  1. arXiv:2405.18626  [pdf, other]

    cs.LG cs.AI

    Causal Contextual Bandits with Adaptive Context

    Authors: Rahul Madhavan, Aurghya Maiti, Gaurav Sinha, Siddharth Barman

    Abstract: We study a variant of causal contextual bandits where the context is chosen based on an initial intervention chosen by the learner. At the beginning of each round, the learner selects an initial action, depending on which a stochastic context is revealed by the environment. Following this, the learner then selects a final action and receives a reward. Given $T$ rounds of interactions with the envi…

    Submitted 2 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Reinforcement Learning Conference (RLC) 2024, 10 pages (31 pages including appendix), 8 plots. arXiv admin note: text overlap with arXiv:2111.00886

  2. arXiv:2401.15229  [pdf, other]

    cs.CY

    Evolving AI Risk Management: A Maturity Model based on the NIST AI Risk Management Framework

    Authors: Ravit Dotan, Borhane Blili-Hamelin, Ravi Madhavan, Jeanna Matthews, Joshua Scarpino

    Abstract: Researchers, government bodies, and organizations have been repeatedly calling for a shift in the responsible AI community from general principles to tangible and operationalizable practices in mitigating the potential sociotechnical harms of AI. Frameworks like the NIST AI RMF embody an emerging consensus on recommended practices in operationalizing sociotechnical harm mitigation. However, privat…

    Submitted 13 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  3. arXiv:2311.11229  [pdf, other]

    cs.CL

    Causal ATE Mitigates Unintended Bias in Controlled Text Generation

    Authors: Rahul Madhavan, Kahini Wadhawan

    Abstract: We study attribute control in language models through the method of Causal Average Treatment Effect (Causal ATE). Existing methods for the attribute control task in Language Models (LMs) check for the co-occurrence of words in a sentence with the attribute of interest, and control for them. However, spurious correlation of the words with the attribute in the training dataset can cause models to h…

    Submitted 16 February, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

    Comments: 12 pages, 5 figures

  4. CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation

    Authors: Rahul Madhavan, Rishabh Garg, Kahini Wadhawan, Sameep Mehta

    Abstract: We propose a method to control the attributes of Language Models (LMs) for the text generation task using Causal Average Treatment Effect (ATE) scores and counterfactual augmentation. We explore this method, in the context of LM detoxification, and propose the Causally Fair Language (CFL) architecture for detoxifying pre-trained LMs in a plug-and-play manner. Our architecture is based on a Structu…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 19 pages, 10 figures. Findings of ACL 2023

    Journal ref: Findings of the Association for Computational Linguistics: ACL 2023

  5. arXiv:2305.04638  [pdf, other]

    cs.LG cs.AI

    Learning Good Interventions in Causal Graphs via Covering

    Authors: Ayush Sawarni, Rahul Madhavan, Gaurav Sinha, Siddharth Barman

    Abstract: We study the causal bandit problem that entails identifying a near-optimal intervention from a specified set $A$ of (possibly non-atomic) interventions over a given causal graph. Here, an optimal intervention in $A$ is one that maximizes the expected value for a designated reward variable in the graph, and we use the standard notion of simple regret to quantify near optimality. Considering Berno…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 26 pages

  6. arXiv:2203.03541  [pdf, other]

    cs.CL cs.AI

    Fairness for Text Classification Tasks with Identity Information Data Augmentation Methods

    Authors: Mohit Wadhwa, Mohan Bhambhani, Ashvini Jindal, Uma Sawant, Ramanujam Madhavan

    Abstract: Counterfactual fairness methods address the question: How would the prediction change if the sensitive identity attributes referenced in the text instance were different? These methods are entirely based on generating counterfactuals for the given training and test set instances. Counterfactual instances are commonly prepared by replacing sensitive identity terms, i.e., the identity terms present…

    Submitted 4 February, 2022; originally announced March 2022.

  7. arXiv:2111.00886  [pdf, other]

    cs.LG cs.AI

    Intervention Efficient Algorithm for Two-Stage Causal MDPs

    Authors: Rahul Madhavan, Aurghya Maiti, Gaurav Sinha, Siddharth Barman

    Abstract: We study Markov Decision Processes (MDP) wherein states correspond to causal graphs that stochastically generate rewards. In this setup, the learner's goal is to identify atomic interventions that lead to high rewards by intervening on variables at each state. Generalizing the recent causal-bandit framework, the current work develops (simple) regret minimization guarantees for two-stage causal MDP…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 29 pages

  8. arXiv:2106.13849  [pdf, other]

    cs.CV eess.IV

    A CNN Segmentation-Based Approach to Object Detection and Tracking in Ultrasound Scans with Application to the Vagus Nerve Detection

    Authors: Abdullah F. Al-Battal, Yan Gong, Lu Xu, Timothy Morton, Chen Du, Yifeng Bu, Imanuel R Lerman, Radhika Madhavan, Truong Q. Nguyen

    Abstract: Ultrasound scanning is essential in several medical diagnostic and therapeutic applications. It is used to visualize and analyze anatomical features and structures that influence treatment plans. However, it is labor intensive, and its effectiveness is operator dependent. Real-time accurate and robust automatic detection and tracking of anatomical structures while scanning would significantly…

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: 7 pages, 4 figures, submitted to the IEEE EMBC 2021 conference

  9. arXiv:2104.07361  [pdf, other]

    cs.LG

    Scale Invariant Monte Carlo under Linear Function Approximation with Curvature based step-size

    Authors: Rahul Madhavan, Hemanta Makwana

    Abstract: We study the feature-scaled version of the Monte Carlo algorithm with linear function approximation. This algorithm converges to a scale-invariant solution, which is not unduly affected by states having feature vectors with large norms. The usual versions of the MCMC algorithm, obtained by minimizing the least-squares criterion, do not produce solutions that give equal importance to all states irr…

    Submitted 29 May, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: 42 pages, 9 figures (9 pages main body with 5 figures)

  10. arXiv:2010.10737  [pdf, other]

    cs.LG stat.ML

    Directed Graph Representation through Vector Cross Product

    Authors: Ramanujam Madhavan, Mohit Wadhwa

    Abstract: Graph embedding methods embed the nodes in a graph in low dimensional vector space while preserving graph topology to carry out the downstream tasks such as link prediction, node recommendation and clustering. These tasks depend on a similarity measure such as cosine similarity and Euclidean distance between a pair of embeddings that are symmetric in nature and hence do not hold good for directed…

    Submitted 20 October, 2020; originally announced October 2020.

  11. arXiv:2002.12143  [pdf, other]

    cs.LG stat.ML

    Fairness-Aware Learning with Prejudice Free Representations

    Authors: Ramanujam Madhavan, Mohit Wadhwa

    Abstract: Machine learning models are extensively being used to make decisions that have a significant impact on human life. These models are trained over historical data that may contain information about sensitive attributes such as race, sex, religion, etc. The presence of such sensitive attributes can impact certain population subgroups unfairly. It is straightforward to remove sensitive features from t…

    Submitted 26 February, 2020; originally announced February 2020.

  12. arXiv:1704.05729  [pdf, other]

    q-fin.GN

    A generalized Bayesian framework for the analysis of subscription based businesses

    Authors: Rahul Madhavan, Ankit Baraskar

    Abstract: We have created a framework for analyzing subscription based businesses in terms of a unified metric which we call SCV (single customer value). The major advance in this paper is to model customer churn as an exponential decay variable, which directly follows from experimental data relating to subscription based businesses. This Bayesian probabilistic model was used to compute an expected value fo…

    Submitted 12 April, 2017; originally announced April 2017.

    Comments: 12 pages, 4 figures, Atidiv Research