Skip to main content

Showing 1–23 of 23 results for author: Chowdhury, S B R

.
  1. arXiv:2406.17740  [pdf, other

    cs.LG cs.AI cs.CV

    Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning

    Authors: Arijit Sehanobish, Avinava Dubey, Krzysztof Choromanski, Somnath Basu Roy Chowdhury, Deepali Jain, Vikas Sindhwani, Snigdha Chaturvedi

    Abstract: Recent efforts to scale Transformer models have demonstrated rapid progress across a wide range of tasks (Wei et al., 2022). However, fine-tuning these models for downstream tasks is expensive due to their large parameter counts. Parameter-efficient fine-tuning (PEFT) approaches have emerged as a viable alternative by allowing us to fine-tune models by updating only a small number of parameters. I… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Work in progress

  2. arXiv:2406.16257  [pdf, other

    cs.LG

    Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning

    Authors: Somnath Basu Roy Chowdhury, Krzysztof Choromanski, Arijit Sehanobish, Avinava Dubey, Snigdha Chaturvedi

    Abstract: Machine unlearning is the process of efficiently removing the influence of a training data instance from a trained machine learning model without retraining it from scratch. A popular subclass of unlearning approaches is exact machine unlearning, which focuses on techniques that explicitly guarantee the removal of the influence of a data instance from a model. Exact unlearning approaches use a mac… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  3. arXiv:2406.15881  [pdf, other

    cs.LG cs.AI

    Fast Tree-Field Integrators: From Low Displacement Rank to Topological Transformers

    Authors: Krzysztof Choromanski, Arijit Sehanobish, Somnath Basu Roy Chowdhury, Han Lin, Avinava Dubey, Tamas Sarlos, Snigdha Chaturvedi

    Abstract: We present a new class of fast polylog-linear algorithms based on the theory of structured matrices (in particular low displacement rank) for integrating tensor fields defined on weighted trees. Several applications of the resulting fast tree-field integrators (FTFIs) are presented, including (a) approximation of graph metrics with tree metrics, (b) graph classification, (c) modeling on meshes, an… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Preprint. Comments welcome

  4. arXiv:2406.11107  [pdf, other

    cs.CL

    Exploring Safety-Utility Trade-Offs in Personalized Language Models

    Authors: Anvesh Rao Vij**i, Somnath Basu Roy Chowdhury, Snigdha Chaturvedi

    Abstract: As large language models (LLMs) become increasingly integrated into daily applications, it is essential to ensure they operate fairly across diverse user demographics. In this work, we show that LLMs suffer from personalization bias, where their performance is impacted when they are personalized to a user's identity. We quantify personalization bias by evaluating the performance of LLMs along two… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  5. arXiv:2404.03570  [pdf, other

    cs.RO

    Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity

    Authors: Jake Varley, Sumeet Singh, Deepali Jain, Krzysztof Choromanski, Andy Zeng, Somnath Basu Roy Chowdhury, Avinava Dubey, Vikas Sindhwani

    Abstract: We present an embodied AI system which receives open-ended natural language instructions from a human, and controls two arms to collaboratively accomplish potentially long-horizon tasks over a large workspace. Our system is modular: it deploys state of the art Large Language Models for task planning,Vision-Language models for semantic perception, and Point Cloud transformers for gras**. With sem… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  6. arXiv:2401.08047  [pdf, other

    cs.CL cs.LG

    Incremental Extractive Opinion Summarization Using Cover Trees

    Authors: Somnath Basu Roy Chowdhury, Nicholas Monath, Avinava Dubey, Manzil Zaheer, Andrew McCallum, Amr Ahmed, Snigdha Chaturvedi

    Abstract: Extractive opinion summarization involves automatically producing a summary of text about an entity (e.g., a product's reviews) by extracting representative sentences that capture prevalent opinions in the review set. Typically, in online marketplaces user reviews accumulate over time, and opinion summaries need to be updated periodically to provide customers with up-to-date information. In this w… ▽ More

    Submitted 12 April, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted at TMLR

  7. arXiv:2312.00194  [pdf

    cs.LG cs.CL

    Robust Concept Erasure via Kernelized Rate-Distortion Maximization

    Authors: Somnath Basu Roy Chowdhury, Nicholas Monath, Avinava Dubey, Amr Ahmed, Snigdha Chaturvedi

    Abstract: Distributed representations provide a vector space that captures meaningful relationships between data instances. The distributed nature of these representations, however, entangles together multiple attributes or concepts of data instances (e.g., the topic or sentiment of a text, characteristics of the author (age, gender, etc), etc). Recent work has proposed the task of concept erasure, in which… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  8. arXiv:2310.11401  [pdf, other

    cs.LG

    Enhancing Group Fairness in Online Settings Using Oblique Decision Forests

    Authors: Somnath Basu Roy Chowdhury, Nicholas Monath, Ahmad Beirami, Rahul Kidambi, Avinava Dubey, Amr Ahmed, Snigdha Chaturvedi

    Abstract: Fairness, especially group fairness, is an important consideration in the context of machine learning systems. The most commonly adopted group fairness-enhancing techniques are in-processing methods that rely on a mixture of a fairness objective (e.g., demographic parity) and a task-specific objective (e.g., cross-entropy) during the training process. However, when data arrives in an online fashio… ▽ More

    Submitted 27 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 (Spotlight)

  9. arXiv:2302.00942  [pdf, other

    cs.LG

    Efficient Graph Field Integrators Meet Point Clouds

    Authors: Krzysztof Choromanski, Arijit Sehanobish, Han Lin, Yunfan Zhao, Eli Berger, Tetiana Parshakova, Alvin Pan, David Watkins, Tianyi Zhang, Valerii Likhosherstov, Somnath Basu Roy Chowdhury, Avinava Dubey, Deepali Jain, Tamas Sarlos, Snigdha Chaturvedi, Adrian Weller

    Abstract: We present two new classes of algorithms for efficient field integration on graphs encoding point clouds. The first class, SeparatorFactorization(SF), leverages the bounded genus of point cloud mesh graphs, while the second class, RFDiffusion(RFD), uses popular epsilon-nearest-neighbor graph representations for point clouds. Both can be viewed as providing the functionality of Fast Multipole Metho… ▽ More

    Submitted 4 October, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Journal ref: ICML 2023

  10. arXiv:2209.07496  [pdf, other

    cs.CL

    Unsupervised Opinion Summarization Using Approximate Geodesics

    Authors: Somnath Basu Roy Chowdhury, Nicholas Monath, Avinava Dubey, Amr Ahmed, Snigdha Chaturvedi

    Abstract: Opinion summarization is the task of creating summaries capturing popular opinions from user reviews. In this paper, we introduce Geodesic Summarizer (GeoSumm), a novel system to perform unsupervised extractive opinion summarization. GeoSumm involves an encoder-decoder based representation learning model, that generates representations of text as a distribution over latent semantic units. GeoSumm… ▽ More

    Submitted 20 November, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Findings of EMNLP 2023

  11. arXiv:2208.12212  [pdf, other

    cs.LG cs.CY

    Sustaining Fairness via Incremental Learning

    Authors: Somnath Basu Roy Chowdhury, Snigdha Chaturvedi

    Abstract: Machine learning systems are often deployed for making critical decisions like credit lending, hiring, etc. While making decisions, such systems often encode the user's demographic information (like gender, age) in their intermediate representations. This can lead to decisions that are biased towards specific demographics. Prior work has focused on debiasing intermediate representations to ensure… ▽ More

    Submitted 23 January, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: Accepted at AAAI 2023

  12. arXiv:2203.10254  [pdf, other

    cs.CL

    Read Top News First: A Document Reordering Approach for Multi-Document News Summarization

    Authors: Chao Zhao, Tenghao Huang, Somnath Basu Roy Chowdhury, Muthu Kumar Chandrasekaran, Kathleen McKeown, Snigdha Chaturvedi

    Abstract: A common method for extractive multi-document news summarization is to re-formulate it as a single-document summarization problem by concatenating all documents as a single meta-document. However, this method neglects the relative importance of documents. We propose a simple approach to reorder the documents according to their relative importance before concatenating and summarizing them. The reor… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: Accepted at Findings of ACL 2022

  13. arXiv:2203.07921  [pdf, other

    cs.CL

    Unsupervised Extractive Opinion Summarization Using Sparse Coding

    Authors: Somnath Basu Roy Chowdhury, Chao Zhao, Snigdha Chaturvedi

    Abstract: Opinion summarization is the task of automatically generating summaries that encapsulate information from multiple user reviews. We present Semantic Autoencoder (SemAE) to perform extractive opinion summarization in an unsupervised manner. SemAE uses dictionary learning to implicitly capture semantic information from the review and learns a latent representation of each sentence over semantic unit… ▽ More

    Submitted 18 May, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022

  14. arXiv:2202.00035  [pdf, other

    cs.LG cs.CL

    Learning Fair Representations via Rate-Distortion Maximization

    Authors: Somnath Basu Roy Chowdhury, Snigdha Chaturvedi

    Abstract: Text representations learned by machine learning models often encode undesirable demographic information of the user. Predictive models based on these representations can rely on such information, resulting in biased decisions. We present a novel debiasing technique, Fairness-aware Rate Maximization (FaRM), that removes protected information by making representations of instances belonging to the… ▽ More

    Submitted 25 August, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: Accepted at TACL

  15. arXiv:2109.08613  [pdf, other

    cs.CL

    Adversarial Scrubbing of Demographic Information for Text Classification

    Authors: Somnath Basu Roy Chowdhury, Sayan Ghosh, Yiyuan Li, Junier B. Oliva, Shashank Srivastava, Snigdha Chaturvedi

    Abstract: Contextual representations learned by language models can often encode undesirable attributes, like demographic associations of the users, while being trained for an unrelated target task. We aim to scrub such undesirable attributes and learn fair representations while maintaining performance on the target task. In this paper, we present an adversarial learning framework "Adversarial Scrubber" (AD… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021

  16. arXiv:2109.08588  [pdf, other

    cs.CL

    Does Commonsense help in detecting Sarcasm?

    Authors: Somnath Basu Roy Chowdhury, Snigdha Chaturvedi

    Abstract: Sarcasm detection is important for several NLP tasks such as sentiment identification in product reviews, user feedback, and online forums. It is a challenging task requiring a deep understanding of language, context, and world knowledge. In this paper, we investigate whether incorporating commonsense knowledge helps in sarcasm detection. For this, we incorporate commonsense knowledge into the pre… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted at Insights from Negative Results in NLP Workshop, EMNLP 2021

  17. arXiv:2104.07064  [pdf, other

    cs.CL

    Is Everything in Order? A Simple Way to Order Sentences

    Authors: Somnath Basu Roy Chowdhury, Faeze Brahman, Snigdha Chaturvedi

    Abstract: The task of organizing a shuffled set of sentences into a coherent text has been used to evaluate a machine's understanding of causal and temporal relations. We formulate the sentence ordering task as a conditional text-to-marker generation problem. We present Reorder-BART (Re-BART) that leverages a pre-trained Transformer-based model to identify a coherent order for a given set of shuffled senten… ▽ More

    Submitted 17 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted at EMNLP 2021

  18. arXiv:1802.05934  [pdf, other

    cs.CL

    Instance-based Inductive Deep Transfer Learning by Cross-Dataset Querying with Locality Sensitive Hashing

    Authors: Somnath Basu Roy Chowdhury, K M Annervaz, Ambedkar Dukkipati

    Abstract: Supervised learning models are typically trained on a single dataset and the performance of these models rely heavily on the size of the dataset, i.e., amount of data available with the ground truth. Learning algorithms try to generalize solely based on the data that is presented with during the training. In this work, we propose an inductive transfer learning method that can augment learning mode… ▽ More

    Submitted 16 February, 2018; originally announced February 2018.

  19. arXiv:1802.05930  [pdf, other

    cs.CL

    Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

    Authors: K M Annervaz, Somnath Basu Roy Chowdhury, Ambedkar Dukkipati

    Abstract: Machine Learning has been the quintessential solution for many AI problems, but learning is still heavily dependent on the specific training data. Some learning models can be incorporated with a prior knowledge in the Bayesian set up, but these learning models do not have the ability to access any organised world knowledge on demand. In this work, we propose to enhance learning models with world k… ▽ More

    Submitted 20 May, 2018; v1 submitted 16 February, 2018; originally announced February 2018.

    Comments: Accepted at NAACL 2018

  20. arXiv:1711.02831  [pdf, other

    cs.CV cs.AI

    SIMILARnet: Simultaneous Intelligent Localization and Recognition Network

    Authors: Arna Ghosh, Biswarup Bhattacharya, Somnath Basu Roy Chowdhury

    Abstract: Global Average Pooling (GAP) [4] has been used previously to generate class activation for image classification tasks. The motivation behind SIMILARnet comes from the fact that the convolutional filters possess position information of the essential features and hence, combination of the feature maps could help us locate the class instances in an image. We propose a biologically inspired model that… ▽ More

    Submitted 8 November, 2017; originally announced November 2017.

    Comments: 5 pages; 2 figures; 2 tables; All authors have equal contribution

  21. arXiv:1706.09243  [pdf, other

    cs.CY

    Location Optimization of ATM Networks

    Authors: Somnath Basu Roy Chowdhury, Biswarup Bhattacharya, Sumit Agarwal

    Abstract: ATMs enable the public to perform financial transactions. Banks try to strategically position their ATMs in order to maximize transactions and revenue. In this paper, we introduce a model which provides a score to an ATM location, which serves as an indicator of its relative likelihood of transactions. In order to efficiently capture the spatially dynamic features, we utilize two concurrent predic… ▽ More

    Submitted 26 June, 2017; originally announced June 2017.

    Comments: 5 pages; 2 figures; 2 tables

  22. arXiv:1611.08789  [pdf, other

    cs.CV cs.AI

    Handwriting Profiling using Generative Adversarial Networks

    Authors: Arna Ghosh, Biswarup Bhattacharya, Somnath Basu Roy Chowdhury

    Abstract: Handwriting is a skill learned by humans from a very early age. The ability to develop one's own unique handwriting as well as mimic another person's handwriting is a task learned by the brain with practice. This paper deals with this very problem where an intelligent system tries to learn the handwriting of an entity using Generative Adversarial Networks (GANs). We propose a modified architecture… ▽ More

    Submitted 27 November, 2016; originally announced November 2016.

    Comments: 2 pages; 2 figures; Accepted at The Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17 Student Abstract and Poster Program), San Francisco, USA; All authors have equal contribution

  23. arXiv:1611.08788  [pdf, other

    cs.CV cs.AI

    SAD-GAN: Synthetic Autonomous Driving using Generative Adversarial Networks

    Authors: Arna Ghosh, Biswarup Bhattacharya, Somnath Basu Roy Chowdhury

    Abstract: Autonomous driving is one of the most recent topics of interest which is aimed at replicating human driving behavior kee** in mind the safety issues. We approach the problem of learning synthetic driving using generative neural networks. The main idea is to make a controller trainer network using images plus key press data to mimic human learning. We used the architecture of a stable GAN to make… ▽ More

    Submitted 27 November, 2016; originally announced November 2016.

    Comments: 5 pages; 4 figures; Accepted at the Deep Learning for Action and Interaction Workshop, 30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain; All authors have equal contribution