
Showing 1–16 of 16 results for author: Sachdeva, R

Searching in archive cs.
  1. arXiv:2401.10224

    cs.CV

    The Manga Whisperer: Automatically Generating Transcriptions for Comics

    Authors: Ragav Sachdeva, Andrew Zisserman

    Abstract: In the past few decades, Japanese comics, commonly referred to as Manga, have transcended both cultural and linguistic boundaries to become a true worldwide sensation. Yet, the inherent reliance on visual cues and illustration within manga renders it largely inaccessible to individuals with visual impairments. In this work, we seek to address this substantial barrier, with the aim of ensuring that…

    Submitted 21 March, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted at CVPR'24

  2. arXiv:2309.07822

    cs.CL

    CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

    Authors: Rachneet Sachdeva, Martin Tutek, Iryna Gurevych

    Abstract: In recent years, large language models (LLMs) have shown remarkable capabilities at scale, particularly at generating text conditioned on a prompt. In our work, we investigate the use of LLMs to augment training data of small language models (SLMs) with automatically generated counterfactual (CF) instances -- i.e. minimally altered inputs -- in order to improve out-of-domain (OOD) performance of S…

    Submitted 13 February, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted to EACL 2024 main conference

  3. arXiv:2309.01809

    cs.CL

    Are Emergent Abilities in Large Language Models just In-Context Learning?

    Authors: Sheng Lu, Irina Bigoulaeva, Rachneet Sachdeva, Harish Tayyar Madabushi, Iryna Gurevych

    Abstract: Large language models have exhibited emergent abilities, demonstrating exceptional performance across diverse tasks for which they were not explicitly trained, including those that require complex reasoning abilities. The emergence of such abilities carries profound implications for the future direction of research in NLP, especially as the deployment of such models becomes more prevalent. However…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Code available at https://github.com/UKPLab/on-emergence and data available at https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/3931

  4. arXiv:2308.10417

    cs.CV

    The Change You Want to See (Now in 3D)

    Authors: Ragav Sachdeva, Andrew Zisserman

    Abstract: The goal of this paper is to detect what has changed, if anything, between two "in the wild" images of the same 3D scene acquired from different camera positions and at different temporal instances. The open-set nature of this problem, occlusions/dis-occlusions due to the shift in viewpoint, and the lack of suitable training datasets present substantial challenges in devising a solution. To ad…

    Submitted 11 September, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

  5. arXiv:2303.18120

    cs.CL

    UKP-SQuARE v3: A Platform for Multi-Agent QA Research

    Authors: Haritz Puerto, Tim Baumgärtner, Rachneet Sachdeva, Haishuo Fang, Hao Zhang, Sewin Tariverdian, Kexin Wang, Iryna Gurevych

    Abstract: The continuous development of Question Answering (QA) datasets has drawn the research community's attention toward multi-domain models. A popular approach is to use multi-dataset models, which are models trained on multiple datasets to learn their regularities and prevent overfitting to a single dataset. However, with the proliferation of QA models in online repositories such as GitHub or Hugging…

    Submitted 17 May, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: ACL 2023 Demo Paper

  6. arXiv:2210.17301

    cs.CL

    Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5

    Authors: Irina Bigoulaeva, Rachneet Sachdeva, Harish Tayyar Madabushi, Aline Villavicencio, Iryna Gurevych

    Abstract: We compare sequential fine-tuning with a model for multi-task learning in the context where we are interested in boosting performance on two tasks, one of which depends on the other. We test these models on the FigLang2022 shared task which requires participants to predict language inference labels on figurative language along with corresponding textual explanations of the inference predictions. O…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Accepted for publication in the Proceedings of the Second Workshop on Figurative Language Processing (colocated with EMNLP 2022). Code and models at https://github.com/Rachneet/cross-task-figurative-explanations

  7. arXiv:2209.14341

    cs.CV

    The Change You Want to See

    Authors: Ragav Sachdeva, Andrew Zisserman

    Abstract: We live in a dynamic world where things change all the time. Given two images of the same scene, being able to automatically detect the changes in them has practical applications in a variety of domains. In this paper, we tackle the change detection problem with the goal of detecting "object-level" changes in an image pair despite differences in their viewpoint and illumination. To this end, we ma…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: Paper accepted at WACV 2023

  8. arXiv:2208.09316

    cs.CL

    UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA

    Authors: Rachneet Sachdeva, Haritz Puerto, Tim Baumgärtner, Sewin Tariverdian, Hao Zhang, Kexin Wang, Hossain Shaikh Saadi, Leonardo F. R. Ribeiro, Iryna Gurevych

    Abstract: Question Answering (QA) systems are increasingly deployed in applications where they support real-world decisions. However, state-of-the-art models rely on deep neural networks, which are difficult to interpret by humans. Inherently interpretable models or post hoc explainability methods can help users to comprehend how a model arrives at its prediction and, if successful, increase their trust in…

    Submitted 20 October, 2022; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: Accepted at AACL 2022 as Demo Paper

  9. arXiv:2203.13693

    cs.CL cs.IR

    UKP-SQUARE: An Online Platform for Question Answering Research

    Authors: Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych

    Abstract: Recent advances in NLP and information retrieval have given rise to a diverse set of question answering tasks that are of different formats (e.g., extractive, abstractive), require different model architectures (e.g., generative, discriminative), and setups (e.g., with or without retrieval). Despite having a large number of powerful, specialized QA pipelines (which we refer to as Skills) that cons…

    Submitted 28 March, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 Demo Track

  10. arXiv:2109.12109

    cs.RO cs.AI cs.CV cs.LG

    Autonomy and Perception for Space Mining

    Authors: Ragav Sachdeva, Ravi Hammond, James Bockman, Alec Arthur, Brandon Smart, Dustin Craggs, Anh-Dzung Doan, Thomas Rowntree, Elijah Schutz, Adrian Orenstein, Andy Yu, Tat-Jun Chin, Ian Reid

    Abstract: Future Moon bases will likely be constructed using resources mined from the surface of the Moon. The difficulty of maintaining a human workforce on the Moon and communications lag with Earth means that mining will need to be conducted using collaborative robots with a high degree of autonomy. In this paper, we describe our solution for Phase 2 of the NASA Space Robotics Challenge, which provided a…

    Submitted 13 April, 2022; v1 submitted 26 September, 2021; originally announced September 2021.

    Comments: This paper describes our 3rd place and innovation award winning solution to the NASA Space Robotics Challenge Phase 2

  11. arXiv:2103.11395

    cs.CV cs.LG

    ScanMix: Learning from Severe Label Noise via Semantic Clustering and Semi-Supervised Learning

    Authors: Ragav Sachdeva, Filipe R Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: We propose a new training algorithm, ScanMix, that explores semantic clustering and semi-supervised learning (SSL) to allow superior robustness to severe label noise and competitive robustness to non-severe label noise problems, in comparison to the state of the art (SOTA) methods. ScanMix is based on the expectation maximisation framework, where the E-step estimates the latent variable to cluster…

    Submitted 16 October, 2022; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: Paper accepted at Pattern Recognition

  12. LongReMix: Robust Learning with High Confidence Samples in a Noisy Label Environment

    Authors: Filipe R. Cordeiro, Ragav Sachdeva, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: Deep neural network models are robust to a limited amount of label noise, but their ability to memorise noisy labels in high noise rate problems is still an open issue. The most competitive noisy-label learning algorithms rely on a 2-stage process comprising an unsupervised learning to classify training samples as clean or noisy, followed by a semi-supervised learning that minimises the empirical…

    Submitted 4 September, 2022; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: Published at Pattern Recognition 2022

  13. arXiv:2011.05704

    cs.LG cs.CV

    EvidentialMix: Learning with Combined Open-set and Closed-set Noisy Labels

    Authors: Ragav Sachdeva, Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: The efficacy of deep learning depends on large-scale data sets that have been carefully curated with reliable data acquisition and annotation processes. However, acquiring such large-scale data sets with precise annotations is very expensive and time-consuming, and the cheap alternatives often yield data sets that have noisy labels. The field has addressed this problem by focusing on training mode…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Paper accepted at WACV'21: Winter Conference on Applications of Computer Vision

  14. arXiv:2007.14987

    cs.RO cs.CL cs.HC

    Presentation and Analysis of a Multimodal Dataset for Grounded Language Learning

    Authors: Patrick Jenkins, Rishabh Sachdeva, Gaoussou Youssouf Kebe, Padraig Higgins, Kasra Darvish, Edward Raff, Don Engel, John Winder, Francis Ferraro, Cynthia Matuszek

    Abstract: Grounded language acquisition -- learning how language-based interactions refer to the world around them -- is a major area of research in robotics, NLP, and HCI. In practice the data used for learning consists almost entirely of textual descriptions, which tend to be cleaner, clearer, and more grammatical than actual human interactions. In this work, we present the Grounded Language Dataset (GoLD)…

    Submitted 28 September, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: 11 pages, 6 figures

  15. arXiv:2004.12045

    cs.NE

    The Dynamic Travelling Thief Problem: Benchmarks and Performance of Evolutionary Algorithms

    Authors: Ragav Sachdeva, Frank Neumann, Markus Wagner

    Abstract: Many real-world optimisation problems involve dynamic and stochastic components. While problems with multiple interacting components are omnipresent in inherently dynamic domains like supply-chain optimisation and logistics, most research on dynamic problems focuses on single-component problems. With this article, we define a number of scenarios based on the Travelling Thief Problem to enable rese…

    Submitted 14 September, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: Accepted for publication and presentation at ICONIP 2020, https://iconip2020.apnns.org/

  16. arXiv:2001.09955

    cs.CY

    The Effects of Gender Signals and Performance in Online Product Reviews

    Authors: Sandipan Sikdar, Rachneet Singh Sachdeva, Johannes Wachs, Florian Lemmerich, Markus Strohmaier

    Abstract: This work quantifies the effects of signaling and performing gender on the success of reviews written on the popular Amazon shopping platform. Highly rated reviews play an important role in e-commerce since they are prominently displayed below products. Differences in how gender-signaling and gender-performing review authors are received can lead to important biases in what content and perspective…

    Submitted 28 January, 2020; v1 submitted 27 January, 2020; originally announced January 2020.