Skip to main content

Showing 1–50 of 86 results for author: Ahuja, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09579  [pdf, other

    cs.HC

    Hovering Over the Key to Text Input in XR

    Authors: Mar Gonzalez-Franco, Diar Abdlkarim, Arpit Bhatia, Stuart Macgregor, Jason Alexander Fotso-Puepi, Eric J Gonzalez, Hasti Seifi, Massimiliano Di Luca, Karan Ahuja

    Abstract: Virtual, Mixed, and Augmented Reality (XR) technologies hold immense potential for transforming productivity beyond PC. Therefore there is a critical need for improved text input solutions for XR. However, achieving efficient text input in these environments remains a significant challenge. This paper examines the current landscape of XR text input techniques, focusing on the importance of keyboar… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2405.18359  [pdf, other

    cs.CL cs.AI cs.LG

    Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs

    Authors: Somnath Kumar, Vaibhav Balloli, Mercy Ranjit, Kabir Ahuja, Tanuja Ganu, Sunayana Sitaram, Kalika Bali, Akshay Nambi

    Abstract: Large language models (LLMs) are at the forefront of transforming numerous domains globally. However, their inclusivity and effectiveness remain limited for non-Latin scripts and low-resource languages. This paper tackles the imperative challenge of enhancing the multilingual performance of LLMs without extensive training or fine-tuning. Through systematic investigation and evaluation of diverse l… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Report number: MSR-TR-VeLLM-01

  3. arXiv:2405.11722  [pdf, ps, other

    cs.RO cs.AI

    AI Algorithm for Predicting and Optimizing Trajectory of UAV Swarm

    Authors: Amit Raj, Kapil Ahuja, Yann Busnel

    Abstract: This paper explores the application of Artificial Intelligence (AI) techniques for generating the trajectories of fleets of Unmanned Aerial Vehicles (UAVs). The two main challenges addressed include accurately predicting the paths of UAVs and efficiently avoiding collisions between them. Firstly, the paper systematically applies a diverse set of activation functions to a Feedforward Neural Network… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 24 Pages, 9 Tables, 6 Figures

    ACM Class: I.2.1; I.6.3

  4. arXiv:2405.01600  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Learning Descriptor Hybridization with Feature Reduction for Accurate Cervical Cancer Colposcopy Image Classification

    Authors: Saurabh Saini, Kapil Ahuja, Siddartha Chennareddy, Karthik Boddupalli

    Abstract: Cervical cancer stands as a predominant cause of female mortality, underscoring the need for regular screenings to enable early diagnosis and preemptive treatment of pre-cancerous conditions. The transformation zone in the cervix, where cellular differentiation occurs, plays a critical role in the detection of abnormalities. Colposcopy has emerged as a pivotal tool in cervical cancer prevention si… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 7 Pages double column, 5 figures, and 5 tables

    ACM Class: I.2.1; I.5.2

  5. arXiv:2404.16367  [pdf, other

    cs.CL cs.LG

    Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically

    Authors: Kabir Ahuja, Vidhisha Balachandran, Madhur Panwar, Tianxing He, Noah A. Smith, Navin Goyal, Yulia Tsvetkov

    Abstract: Transformers trained on natural language data have been shown to learn its hierarchical structure and generalize to sentences with unseen syntactic structures without explicitly encoding any structural bias. In this work, we investigate sources of inductive bias in transformer models and their training that could cause such generalization behavior to emerge. We extensively experiment with transfor… ▽ More

    Submitted 31 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Code now available: https://github.com/kabirahuja2431/transformers-hg

  6. arXiv:2404.13274  [pdf, other

    cs.HC cs.AI

    Augmented Object Intelligence: Making the Analog World Interactable with XR-Objects

    Authors: Mustafa Doga Dogan, Eric J. Gonzalez, Andrea Colaco, Karan Ahuja, Ruofei Du, Johnny Lee, Mar Gonzalez-Franco, David Kim

    Abstract: Seamless integration of physical objects as interactive digital entities remains a challenge for spatial computing. This paper introduces Augmented Object Intelligence (AOI), a novel XR interaction paradigm designed to blur the lines between digital and physical by equip** real-world objects with the ability to interact as if they were digital, where every object has the potential to serve as a… ▽ More

    Submitted 22 April, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  7. arXiv:2404.05579  [pdf, other

    cs.LG cs.CV

    Robust Data Pruning: Uncovering and Overcoming Implicit Bias

    Authors: Artem Vysogorets, Kartik Ahuja, Julia Kempe

    Abstract: In the era of exceptionally data-hungry models, careful selection of the training data is essential to mitigate the extensive costs of deep learning. Data pruning offers a solution by removing redundant or uninformative samples from the dataset, which yields faster convergence and improved neural scaling laws. However, little is known about its impact on classification bias of the trained models.… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  8. arXiv:2403.11009  [pdf, other

    cs.CL cs.AI

    DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

    Authors: Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos

    Abstract: Language technologies should be judged on their usefulness in real-world use cases. An often overlooked aspect in natural language processing (NLP) research and evaluation is language variation in the form of non-standard dialects or language varieties (hereafter, varieties). Most NLP benchmarks are limited to standard language varieties. To fill this gap, we propose DIALECTBENCH, the first-ever l… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Equal contribution: Fahim Faisal, Orevaoghene Ahia

  9. arXiv:2403.00153  [pdf, other

    cs.HC cs.CV

    Practical and Rich User Digitization

    Authors: Karan Ahuja

    Abstract: A long-standing vision in computer science has been to evolve computing devices into proactive assistants that enhance our productivity, health and wellness, and many other facets of our lives. User digitization is crucial in achieving this vision as it allows computers to intimately understand their users, capturing activity, pose, routine, and behavior. Today's consumer devices - like smartphone… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: PhD thesis

  10. arXiv:2402.07519  [pdf, other

    cs.CL cs.CY

    MAFIA: Multi-Adapter Fused Inclusive LanguAge Models

    Authors: Prachi Jain, Ashutosh Sathe, Varun Gumma, Kabir Ahuja, Sunayana Sitaram

    Abstract: Pretrained Language Models (PLMs) are widely used in NLP for various tasks. Recent studies have identified various biases that such models exhibit and have proposed methods to correct these biases. However, most of the works address a limited set of bias dimensions independently such as gender, race, or religion. Moreover, the methods typically involve finetuning the full model to maintain the per… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024

  11. arXiv:2402.04875  [pdf, other

    cs.LG cs.CL stat.ML

    On Provable Length and Compositional Generalization

    Authors: Kartik Ahuja, Amin Mansouri

    Abstract: Out-of-distribution generalization capabilities of sequence-to-sequence models can be studied from the lens of two crucial forms of generalization: length generalization -- the ability to generalize to longer sequences than ones seen during training, and compositional generalization: the ability to generalize to token combinations not seen during training. In this work, we provide first provable g… ▽ More

    Submitted 7 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  12. arXiv:2312.14920  [pdf, ps, other

    cs.LG cs.AI

    A Novel Sampled Clustering Algorithm for Rice Phenotypic Data

    Authors: Mithun Singh, Kapil Ahuja, Milind B. Ratnaparkhe

    Abstract: Phenotypic (or Physical) characteristics of plant species are commonly used to perform clustering. In one of our recent works (Shastri et al. (2021)), we used a probabilistically sampled (using pivotal sampling) and spectrally clustered algorithm to group soybean species. These techniques were used to obtain highly accurate clusterings at a reduced cost. In this work, we extend the earlier algorit… ▽ More

    Submitted 12 May, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 31 Pages, 3 Figures, 7 Tables

    MSC Class: 68T01; 68T10 ACM Class: I.2.1; I.5.3

  13. arXiv:2310.02854  [pdf, other

    cs.LG stat.ML

    Multi-Domain Causal Representation Learning via Weak Distributional Invariances

    Authors: Kartik Ahuja, Amin Mansouri, Yixin Wang

    Abstract: Causal representation learning has emerged as the center of action in causal machine learning research. In particular, multi-domain datasets present a natural opportunity for showcasing the advantages of causal representation learning over standard unsupervised representation learning. While recent works have taken crucial steps towards learning causal representations, they often lack applicabilit… ▽ More

    Submitted 11 December, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

  14. arXiv:2309.09888  [pdf, other

    cs.LG cs.AI stat.ML

    Context is Environment

    Authors: Sharut Gupta, Stefanie Jegelka, David Lopez-Paz, Kartik Ahuja

    Abstract: Two lines of work are taking the central stage in AI research. On the one hand, the community is making increasing efforts to build models that discard spurious correlations and generalize better in novel test environments. Unfortunately, the bitter lesson so far is that no proposal convincingly outperforms a simple empirical risk minimization baseline. On the other hand, large language models (LL… ▽ More

    Submitted 20 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 41 Pages, 4 Figures

  15. arXiv:2307.01503  [pdf, other

    cs.CL

    On Evaluating and Mitigating Gender Biases in Multilingual Settings

    Authors: Aniket Vashishtha, Kabir Ahuja, Sunayana Sitaram

    Abstract: While understanding and removing gender biases in language models has been a long-standing problem in Natural Language Processing, prior research work has primarily been limited to English. In this work, we investigate some of the challenges with evaluating and mitigating biases in multilingual settings which stem from a lack of existing benchmarks and resources for bias evaluation beyond English… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Accepted in ACL 2023 Findings

  16. arXiv:2306.16334  [pdf, other

    cs.LG cs.AI

    On the Identifiability of Quantized Factors

    Authors: Vitória Barin-Pacela, Kartik Ahuja, Simon Lacoste-Julien, Pascal Vincent

    Abstract: Disentanglement aims to recover meaningful latent ground-truth factors from the observed distribution solely, and is formalized through the theory of identifiability. The identifiability of independent latent factors is proven to be impossible in the unsupervised i.i.d. setting under a general nonlinear map from factors to observations. In this work, however, we demonstrate that it is possible to… ▽ More

    Submitted 12 March, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Appears in: 3rd Conference on Causal Learning and Reasoning (CLeaR 2024). 39 pages

  17. arXiv:2306.04891  [pdf, other

    cs.LG cs.CL

    In-Context Learning through the Bayesian Prism

    Authors: Madhur Panwar, Kabir Ahuja, Navin Goyal

    Abstract: In-context learning (ICL) is one of the surprising and useful features of large language models and subject of intense research. Recently, stylized meta-learning-like ICL setups have been devised that train transformers on sequences of input-output pairs $(x, f(x))$. The function $f$ comes from a function class and generalization is checked by evaluating on sequences generated from unseen function… ▽ More

    Submitted 14 April, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: ICLR 2024

  18. arXiv:2305.17740  [pdf, ps, other

    cs.CL cs.AI

    Breaking Language Barriers with a LEAP: Learning Strategies for Polyglot LLMs

    Authors: Akshay Nambi, Vaibhav Balloli, Mercy Ranjit, Tanuja Ganu, Kabir Ahuja, Sunayana Sitaram, Kalika Bali

    Abstract: Large language models (LLMs) are at the forefront of transforming numerous domains globally. However, their inclusivity and effectiveness remain limited for non-Latin scripts and low-resource languages. This paper tackles the imperative challenge of enhancing the multilingual performance of LLMs, specifically focusing on Generative models. Through systematic investigation and evaluation of diverse… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  19. arXiv:2305.16704  [pdf, other

    cs.LG stat.ML

    A Closer Look at In-Context Learning under Distribution Shifts

    Authors: Kartik Ahuja, David Lopez-Paz

    Abstract: In-context learning, a capability that enables a model to learn from input examples on the fly without necessitating weight updates, is a defining characteristic of large language models. In this work, we follow the setting proposed in (Garg et al., 2022) to better understand the generality and limitations of in-context learning from the lens of the simple yet fundamental task of linear regression… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  20. IMUPoser: Full-Body Pose Estimation using IMUs in Phones, Watches, and Earbuds

    Authors: Vimal Mollyn, Riku Arakawa, Mayank Goel, Chris Harrison, Karan Ahuja

    Abstract: Tracking body pose on-the-go could have powerful uses in fitness, mobile gaming, context-aware virtual assistants, and rehabilitation. However, users are unlikely to buy and wear special suits or sensor arrays to achieve this end. Instead, in this work, we explore the feasibility of estimating body pose using IMUs already in devices that many users own -- namely smartphones, smartwatches, and earb… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  21. arXiv:2303.12528  [pdf, other

    cs.CL

    MEGA: Multilingual Evaluation of Generative AI

    Authors: Kabir Ahuja, Harshita Diddee, Rishav Hada, Millicent Ochieng, Krithika Ramesh, Prachi Jain, Akshay Nambi, Tanuja Ganu, Sameer Segal, Maxamed Axmed, Kalika Bali, Sunayana Sitaram

    Abstract: Generative AI models have shown impressive performance on many Natural Language Processing tasks such as language understanding, reasoning, and language generation. An important question being asked by the AI community today is about the capabilities and limits of these models, and it is clear that evaluating generative AI is very challenging. Most studies on generative LLMs have been restricted t… ▽ More

    Submitted 22 October, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: EMNLP 2023

  22. arXiv:2302.10503  [pdf, other

    cs.AI

    Reusable Slotwise Mechanisms

    Authors: Trang Nguyen, Amin Mansouri, Kanika Madan, Khuong Nguyen, Kartik Ahuja, Dianbo Liu, Yoshua Bengio

    Abstract: Agents with the ability to comprehend and reason about the dynamics of objects would be expected to exhibit improved robustness and generalization in novel scenarios. However, achieving this capability necessitates not only an effective scene representation but also an understanding of the mechanisms governing interactions among object subsets. Recent studies have made significant progress in repr… ▽ More

    Submitted 27 October, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  23. arXiv:2212.10445  [pdf, other

    cs.LG cs.AI cs.CV

    Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization

    Authors: Alexandre Ramé, Kartik Ahuja, Jianyu Zhang, Matthieu Cord, Léon Bottou, David Lopez-Paz

    Abstract: Foundation models are redefining how AI systems are built. Practitioners now follow a standard procedure to build their machine learning solutions: from a pre-trained foundation model, they fine-tune the weights on the target task of interest. So, the Internet is swarmed by a handful of foundation models fine-tuned on many diverse tasks: these individual fine-tunings exist in isolation without ben… ▽ More

    Submitted 9 August, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 24 pages, 10 tables, 21 figures

  24. arXiv:2211.08583  [pdf, other

    cs.LG cs.AI

    Empirical Study on Optimizer Selection for Out-of-Distribution Generalization

    Authors: Hiroki Naganuma, Kartik Ahuja, Shiro Takagi, Tetsuya Motokawa, Rio Yokota, Kohta Ishikawa, Ikuro Sato, Ioannis Mitliagkas

    Abstract: Modern deep learning systems do not generalize well when the test data distribution is slightly different to the training data distribution. While much promising work has been accomplished to address this fragility, a systematic study of the role of optimizers and their out-of-distribution generalization performance has not been undertaken. In this study, we examine the performance of popular firs… ▽ More

    Submitted 5 June, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: Accepted to TMLR

  25. arXiv:2211.00184  [pdf, other

    cs.LG

    FL Games: A Federated Learning Framework for Distribution Shifts

    Authors: Sharut Gupta, Kartik Ahuja, Mohammad Havaei, Niladri Chatterjee, Yoshua Bengio

    Abstract: Federated learning aims to train predictive models for data that is distributed across clients, under the orchestration of a server. However, participating clients typically each hold data from a different distribution, which can yield to catastrophic generalization on data from a different client, which represents a new domain. In this work, we argue that in order to generalize better across non-… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: Accepted as ORAL at NeurIPS Workshop on Federated Learning: Recent Advances and New Challenges. arXiv admin note: text overlap with arXiv:2205.11101

  26. arXiv:2210.12265  [pdf, other

    cs.CL

    On the Calibration of Massively Multilingual Language Models

    Authors: Kabir Ahuja, Sunayana Sitaram, Sandipan Dandapat, Monojit Choudhury

    Abstract: Massively Multilingual Language Models (MMLMs) have recently gained popularity due to their surprising effectiveness in cross-lingual transfer. While there has been much work in evaluating these models for their performance on a variety of tasks and languages, little attention has been paid on how well calibrated these models are with respect to the confidence in their predictions. We first invest… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  27. arXiv:2209.11924  [pdf, other

    stat.ML cs.LG

    Interventional Causal Representation Learning

    Authors: Kartik Ahuja, Divyat Mahajan, Yixin Wang, Yoshua Bengio

    Abstract: Causal representation learning seeks to extract high-level latent factors from low-level sensory data. Most existing methods rely on observational data and structural assumptions (e.g., conditional independence) to identify the latent factors. However, interventional data is prevalent across applications. Can interventional data facilitate causal representation learning? We explore this question i… ▽ More

    Submitted 22 February, 2024; v1 submitted 24 September, 2022; originally announced September 2022.

  28. arXiv:2206.01101  [pdf, other

    cs.LG stat.ML

    Weakly Supervised Representation Learning with Sparse Perturbations

    Authors: Kartik Ahuja, Jason Hartford, Yoshua Bengio

    Abstract: The theory of representation learning aims to build methods that provably invert the data generating process with minimal domain knowledge or any source of supervision. Most prior approaches require strong distributional assumptions on the latent variables and weak supervision (auxiliary information such as timestamps) to provide provable identification guarantees. In this work, we show that if on… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

  29. arXiv:2205.11672  [pdf, other

    stat.ML cs.LG

    Why does Throwing Away Data Improve Worst-Group Error?

    Authors: Kamalika Chaudhuri, Kartik Ahuja, Martin Arjovsky, David Lopez-Paz

    Abstract: When facing data with imbalanced classes or groups, practitioners follow an intriguing strategy to achieve best results. They throw away examples until the classes or groups are balanced in size, and then perform empirical risk minimization on the reduced training set. This opposes common wisdom in learning theory, where the expected error is supposed to decrease as the dataset grows in size. In t… ▽ More

    Submitted 21 February, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

  30. arXiv:2205.11101  [pdf, other

    cs.LG cs.AI

    FL Games: A federated learning framework for distribution shifts

    Authors: Sharut Gupta, Kartik Ahuja, Mohammad Havaei, Niladri Chatterjee, Yoshua Bengio

    Abstract: Federated learning aims to train predictive models for data that is distributed across clients, under the orchestration of a server. However, participating clients typically each hold data from a different distribution, whereby predictive models with strong in-distribution generalization can fail catastrophically on unseen domains. In this work, we argue that in order to generalize better across n… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  31. arXiv:2205.06356  [pdf, other

    cs.CL

    Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages

    Authors: Kabir Ahuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

    Abstract: Although recent Massively Multilingual Language Models (MMLMs) like mBERT and XLMR support around 100 languages, most existing multilingual NLP benchmarks provide evaluation data in only a handful of these languages with little linguistic diversity. We argue that this makes the existing practices in multilingual evaluation unreliable and does not provide a full picture of the performance of MMLMs… ▽ More

    Submitted 14 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: NLP Power! Workshop, ACL 2022

  32. arXiv:2205.06350  [pdf, other

    cs.CL

    On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data

    Authors: Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat

    Abstract: Borrowing ideas from {\em Production functions} in micro-economics, in this paper we introduce a framework to systematically evaluate the performance and cost trade-offs between machine-translated and manually-created labelled data for task-specific fine-tuning of massively multilingual language models. We illustrate the effectiveness of our framework through a case-study on the TyDIQA-GoldP datas… ▽ More

    Submitted 14 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  33. arXiv:2205.06130  [pdf, other

    cs.CL

    Multi Task Learning For Zero Shot Performance Prediction of Multilingual Models

    Authors: Kabir Ahuja, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury

    Abstract: Massively Multilingual Transformer based Language Models have been observed to be surprisingly effective on zero-shot transfer across languages, though the performance varies from language to language depending on the pivot language(s) used for fine-tuning. In this work, we build upon some of the existing techniques for predicting the zero-shot performance on a task, by modeling it as a multi-task… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: ACL 2022

  34. arXiv:2204.04606  [pdf, other

    cs.LG cs.AI stat.ML

    Towards efficient representation identification in supervised learning

    Authors: Kartik Ahuja, Divyat Mahajan, Vasilis Syrgkanis, Ioannis Mitliagkas

    Abstract: Humans have a remarkable ability to disentangle complex sensory inputs (e.g., image, text) into simple factors of variation (e.g., shape, color) without much supervision. This ability has inspired many works that attempt to solve the following question: how do we invert the data generation process to extract those factors with minimal or no supervision? Several works in the literature on non-linea… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: Proceedings of the First Conference on Causal Learning and Reasoning

  35. arXiv:2204.02790  [pdf, other

    cs.CY cs.CL

    Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic?

    Authors: Ishani Mondal, Kabir Ahuja, Mohit Jain, Jacki O Neil, Kalika Bali, Monojit Choudhury

    Abstract: The COVID-19 pandemic has brought out both the best and worst of language technology (LT). On one hand, conversational agents for information dissemination and basic diagnosis have seen widespread use, and arguably, had an important role in combating the pandemic. On the other hand, it has also become clear that such technologies are readily available for a handful of languages, and the vast major… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: Under Revision

  36. arXiv:2203.09978  [pdf, other

    cs.LG stat.ML

    WOODS: Benchmarks for Out-of-Distribution Generalization in Time Series

    Authors: Jean-Christophe Gagnon-Audet, Kartik Ahuja, Mohammad-Javad Darvishi-Bayazi, Pooneh Mousavi, Guillaume Dumas, Irina Rish

    Abstract: Machine learning models often fail to generalize well under distributional shifts. Understanding and overcoming these failures have led to a research field of Out-of-Distribution (OOD) generalization. Despite being extensively studied for static computer vision tasks, OOD generalization has been underexplored for time series tasks. To shine light on this gap, we present WOODS: eight challenging op… ▽ More

    Submitted 6 April, 2023; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: 47 pages, 21 figures

  37. Testing Deep Learning Models: A First Comparative Study of Multiple Testing Techniques

    Authors: Mohit Kumar Ahuja, Arnaud Gotlieb, Helge Spieker

    Abstract: Deep Learning (DL) has revolutionized the capabilities of vision-based systems (VBS) in critical applications such as autonomous driving, robotic surgery, critical infrastructure surveillance, air and maritime traffic control, etc. By analyzing images, voice, videos, or any type of complex signals, DL has considerably increased the situation awareness of these systems. At the same time, while rely… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: Artificial Intelligence in Software Testing 2022 workshop @ ICST 2022

    Journal ref: Artificial Intelligence in Software Testing @ 2022 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW)

  38. arXiv:2201.12143  [pdf, other

    cs.LG cs.AI

    Locally Invariant Explanations: Towards Stable and Unidirectional Explanations through Local Invariant Learning

    Authors: Amit Dhurandhar, Karthikeyan Ramamurthy, Kartik Ahuja, Vijay Arya

    Abstract: Locally interpretable model agnostic explanations (LIME) method is one of the most popular methods used to explain black-box models at a per example level. Although many variants have been proposed, few provide a simple way to produce high fidelity explanations that are also stable and intuitive. In this work, we provide a novel perspective by proposing a model agnostic local explanation method in… ▽ More

    Submitted 3 October, 2023; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: Accepted to NeurIPS 2023

  39. arXiv:2110.15796  [pdf, other

    cs.LG cs.AI stat.ML

    Properties from Mechanisms: An Equivariance Perspective on Identifiable Representation Learning

    Authors: Kartik Ahuja, Jason Hartford, Yoshua Bengio

    Abstract: A key goal of unsupervised representation learning is "inverting" a data generating process to recover its latent properties. Existing work that provably achieves this goal relies on strong assumptions on relationships between the latent variables (e.g., independence conditional on auxiliary information). In this paper, we take a very different perspective on the problem and ask, "Can we instead i… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

  40. arXiv:2110.11418  [pdf, ps, other

    cs.CR

    SABMIS: Sparse approximation based blind multi-image steganography scheme

    Authors: Rohit Agrawal, Kapil Ahuja, Marc C. Steinbach, Thomas Wick

    Abstract: We hide grayscale secret images into a grayscale cover image, which is considered to be a challenging steganography problem. Our goal is to develop a steganography scheme with enhanced embedding capacity while preserving the visual quality of the stego-image as well as the extracted secret image, and ensuring that the stego-image is resistant to steganographic attacks. The novel embedding rule of… ▽ More

    Submitted 4 September, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: 37 Pages, 20 Figures, and 12 Tables

    MSC Class: 90-08 ACM Class: I.4.9

  41. arXiv:2110.07750  [pdf, other

    cs.HC

    Is that a Duiker or Dik Dik Next to the Giraffe? Impacts of Uncertainty on Classification Efficiency in Citizen Science

    Authors: Vinod Kumar Ahuja, Holly K. Rosser, Andrea Grover

    Abstract: Quality control is an ongoing concern in citizen science that is often managed by replication to consensus in online tasks such as image classification. Numerous factors can lead to disagreement, including image quality problems, interface specifics, and the complexity of the content itself. We conducted trace ethnography with statistical and qualitative analyses of six Snapshot Safari projects to… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

  42. arXiv:2108.10262  [pdf, other

    cs.LG

    Cube Sampled K-Prototype Clustering for Featured Data

    Authors: Seemandhar Jain, Aditya A. Shastri, Kapil Ahuja, Yann Busnel, Navneet Pratap Singh

    Abstract: Clustering large amount of data is becoming increasingly important in the current times. Due to the large sizes of data, clustering algorithm often take too much time. Sampling this data before clustering is commonly used to reduce this time. In this work, we propose a probabilistic sampling technique called cube sampling along with K-Prototype clustering. Cube sampling is used because of its accu… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: 5 Pages, 2 Columns, 5 Tables, 2 Figures

    MSC Class: 68T09 ACM Class: I.2.1

  43. arXiv:2106.11560  [pdf, other

    cs.LG

    Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge

    Authors: Abhin Shah, Karthikeyan Shanmugam, Kartik Ahuja

    Abstract: Treatment effect estimation from observational data is a fundamental problem in causal inference. There are two very different schools of thought that have tackled this problem. On one hand, Pearlian framework commonly assumes structural knowledge (provided by an expert) in form of directed acyclic graphs and provides graphical criteria such as back-door criterion to identify valid adjustment sets… ▽ More

    Submitted 25 February, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

  44. arXiv:2106.06607  [pdf, other

    cs.LG stat.ML

    Invariance Principle Meets Information Bottleneck for Out-of-Distribution Generalization

    Authors: Kartik Ahuja, Ethan Caballero, Dinghuai Zhang, Jean-Christophe Gagnon-Audet, Yoshua Bengio, Ioannis Mitliagkas, Irina Rish

    Abstract: The invariance principle from causality is at the heart of notable approaches such as invariant risk minimization (IRM) that seek to address out-of-distribution (OOD) generalization failures. Despite the promising theory, invariance principle-based approaches fail in common classification tasks, where invariant (causal) features capture all the information about the label. Are these failures due t… ▽ More

    Submitted 20 November, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

  45. arXiv:2106.02890  [pdf, other

    cs.LG stat.ML

    Can Subnetwork Structure be the Key to Out-of-Distribution Generalization?

    Authors: Dinghuai Zhang, Kartik Ahuja, Yilun Xu, Yisen Wang, Aaron Courville

    Abstract: Can models with particular structure avoid being biased towards spurious correlation in out-of-distribution (OOD) generalization? Peters et al. (2016) provides a positive answer for linear cases. In this paper, we use a functional modular probing method to analyze deep model structures under OOD setting. We demonstrate that even in biased models (which focus on spurious correlation) there still ex… ▽ More

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: Accepted to ICML2021 as long talk

  46. arXiv:2106.02266  [pdf, other

    cs.LG cs.AI

    SAND-mask: An Enhanced Gradient Masking Strategy for the Discovery of Invariances in Domain Generalization

    Authors: Soroosh Shahtalebi, Jean-Christophe Gagnon-Audet, Touraj Laleh, Mojtaba Faramarzi, Kartik Ahuja, Irina Rish

    Abstract: A major bottleneck in the real-world applications of machine learning models is their failure in generalizing to unseen domains whose data distribution is not i.i.d to the training domains. This failure often stems from learning non-generalizable features in the training domains that are spuriously correlated with the label of data. To address this shortcoming, there has been a growing surge of in… ▽ More

    Submitted 25 September, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

  47. arXiv:2103.07788  [pdf, ps, other

    cs.LG stat.ML

    Treatment Effect Estimation using Invariant Risk Minimization

    Authors: Abhin Shah, Kartik Ahuja, Karthikeyan Shanmugam, Dennis Wei, Kush Varshney, Amit Dhurandhar

    Abstract: Inferring causal individual treatment effect (ITE) from observational data is a challenging problem whose difficulty is exacerbated by the presence of treatment assignment bias. In this work, we propose a new way to estimate the ITE using the domain generalization framework of invariant risk minimization (IRM). IRM uses data from multiple domains, learns predictors that do not exploit spurious dom… ▽ More

    Submitted 13 March, 2021; originally announced March 2021.

  48. arXiv:2103.03180  [pdf, ps, other

    cs.DC cs.SI

    A Critical Note on Social Cloud

    Authors: Pramod C. Mane, Kapil Ahuja, Pradeep Singh

    Abstract: The idea of a social cloud has emerged as a resource sharing paradigm in a social network context. Undoubtedly, state-of-the-art social cloud systems demonstrate the potential of the social cloud acting as complementary to other computing paradigms such as the cloud, grid, peer-to-peer and volunteer computing. However, in this note, we have done a critical survey of the social cloud literature and… ▽ More

    Submitted 29 January, 2021; originally announced March 2021.

    Comments: 4 pages

  49. arXiv:2102.01071  [pdf, other

    cs.GT cs.DC econ.GN

    Resource Availability in the Social Cloud: An Economics Perspective

    Authors: Pramod C. Mane, Nagarajan Krishnamurthy, Kapil Ahuja

    Abstract: This paper focuses on social cloud formation, where agents are involved in a closeness-based conditional resource sharing and build their resource sharing network themselves. The objectives of this paper are: (1) to investigate the impact of agents' decisions of link addition and deletion on their local and global resource availability, (2) to analyze spillover effects in terms of the impact of li… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.

    Comments: 11 pages, 10 figures

  50. arXiv:2101.00690  [pdf, other

    cs.MM math.OC

    CSIS: compressed sensing-based enhanced-embedding capacity image steganography scheme

    Authors: Rohit Agrawal, Kapil Ahuja

    Abstract: Image steganography plays a vital role in securing secret data by embedding it in the cover images. Usually, these images are communicated in a compressed format. Existing techniques achieve this but have low embedding capacity. Enhancing this capacity causes a deterioration in the visual quality of the stego-image. Hence, our goal here is to enhance the embedding capacity while preserving the vis… ▽ More

    Submitted 3 January, 2021; originally announced January 2021.

    Comments: 12 pages double-column, 7 tables, and 11 figures

    ACM Class: I.4.2; I.4.5; I.4.9; E.3