Skip to main content

Showing 1–8 of 8 results for author: Gambardella, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02356  [pdf, other

    cs.LG cs.AI cs.CL

    Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks

    Authors: Andrew Gambardella, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: The ability (and inability) of large language models (LLMs) to perform arithmetic tasks has been the subject of much theoretical and practical debate. We show that LLMs are frequently able to correctly and confidently predict the first digit of n-digit by m-digit multiplication tasks without using chain of thought reasoning, despite these tasks require compounding operations to solve. Simultaneous… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

  2. arXiv:2402.05741  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Real-World Robot Applications of Foundation Models: A Review

    Authors: Kento Kawaharazuka, Tatsuya Matsushima, Andrew Gambardella, Jiaxian Guo, Chris Paxton, Andy Zeng

    Abstract: Recent developments in foundation models, like Large Language Models (LLMs) and Vision-Language Models (VLMs), trained on extensive data, facilitate flexible application across different tasks and modalities. Their impact spans various fields, including healthcare, education, and robotics. This paper provides an overview of the practical application of foundation models in real-world robotics, wit… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  3. arXiv:2210.14602  [pdf, other

    cs.SD eess.AS stat.AP

    Efficient Data Mosaicing with Simulation-based Inference

    Authors: Andrew Gambardella, Youngjun Choi, Doyo Choi, **joon Lee

    Abstract: We introduce an efficient algorithm for general data mosaicing, based on the simulation-based inference paradigm. Our algorithm takes as input a target datum, source data, and partitions of the target and source data into fragments, learning distributions over averages of fragments of the source data such that samples from those distributions approximate fragments of the target datum. We utilize a… ▽ More

    Submitted 1 February, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

  4. arXiv:2110.02483  [pdf, other

    stat.ML cs.CR cs.LG stat.AP

    Detecting and Quantifying Malicious Activity with Simulation-based Inference

    Authors: Andrew Gambardella, Bogdan State, Naeemullah Khan, Leo Tsourides, Philip H. S. Torr, Atılım Güneş Baydin

    Abstract: We propose the use of probabilistic programming techniques to tackle the malicious user identification problem in a recommendation algorithm. Probabilistic programming provides numerous advantages over other techniques, including but not limited to providing a disentangled representation of how malicious users acted under a structured model, as well as allowing for the quantification of damage cau… ▽ More

    Submitted 7 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Short version, appeared at ICML workshop on Socially Responsible Machine Learning 2021

  5. arXiv:2005.07062  [pdf, other

    cs.LG stat.AP stat.ML

    Simulation-Based Inference for Global Health Decisions

    Authors: Christian Schroeder de Witt, Bradley Gram-Hansen, Nantas Nardelli, Andrew Gambardella, Rob Zinkov, Puneet Dokania, N. Siddharth, Ana Belen Espinosa-Gonzalez, Ara Darzi, Philip Torr, Atılım Güneş Baydin

    Abstract: The COVID-19 pandemic has highlighted the importance of in-silico epidemiological modelling in predicting the dynamics of infectious diseases to inform health policy and decision makers about suitable prevention and containment strategies. Work in this setting involves solving challenging inference and control problems in individual-based models of ever increasing complexity. Here we discuss recen… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Journal ref: ICML Workshop on Machine Learning for Global Health, Thirty-Seventh International Conference on Machine Learning (ICML 2020)

  6. arXiv:1911.13270  [pdf, other

    cs.LG cs.CV stat.ML

    Transflow Learning: Repurposing Flow Models Without Retraining

    Authors: Andrew Gambardella, Atılım Güneş Baydin, Philip H. S. Torr

    Abstract: It is well known that deep generative models have a rich latent space, and that it is possible to smoothly manipulate their outputs by traversing this latent space. Recently, architectures have emerged that allow for more complex manipulations, such as making an image look as though it were from a different class, or painted in a certain style. These methods typically require large amounts of trai… ▽ More

    Submitted 5 December, 2019; v1 submitted 29 November, 2019; originally announced November 2019.

  7. arXiv:1904.01033  [pdf, other

    cs.LG stat.ML

    Multitask Soft Option Learning

    Authors: Maximilian Igl, Andrew Gambardella, **ke He, Nantas Nardelli, N. Siddharth, Wendelin Böhmer, Shimon Whiteson

    Abstract: We present Multitask Soft Option Learning(MSOL), a hierarchical multitask framework based on Planning as Inference. MSOL extends the concept of options, using separate variational posteriors for each task, regularized by a shared prior. This ''soft'' version of options avoids several instabilities during training in a multitask setting, and provides a natural way to learn both intra-option policie… ▽ More

    Submitted 21 June, 2020; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: Published at UAI 2020

  8. arXiv:1902.11162  [pdf

    cs.DL

    The FAIR Funder pilot programme to make it easy for funders to require and for grantees to produce FAIR Data

    Authors: P. Wittenburg, H. Pergl Sustkova, A. Montesanti, S. M. Bloemers, S. H. de Waard, M. A. Musen, J. B. Graybeal, K. M. Hettne, A. Jacobsen, R. Pergl, R. W. W. Hooft, C. Staiger, C. W. G. van Gelder, S. L. Knijnenburg, A. C. van Arkel, B. Meerman, M. D. Wilkinson, S-A Sansone, P. Rocca-Serra, P. McQuilton, A. N. Gonzalez-Beltran, G. J. C. Aben, P. Henning, S. Alencar, C. Ribeiro , et al. (35 additional authors not shown)

    Abstract: There is a growing acknowledgement in the scientific community of the importance of making experimental data machine findable, accessible, interoperable, and reusable (FAIR). Recognizing that high quality metadata are essential to make datasets FAIR, members of the GO FAIR Initiative and the Research Data Alliance (RDA) have initiated a series of workshops to encourage the creation of Metadata for… ▽ More

    Submitted 6 March, 2019; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: This is a pre-print of the FAIR Funders pilot, an outcome of the first Metadata for Machines workshop, see: https://www.go-fair.org/resources/go-fair-workshop-series/metadata-for-machines-workshops/. Corresponding author: E. A Schultes, ORCID 0000-0001-8888-635X