Showing 1–8 of 8 results for author: Levinboim, T

  1. arXiv:2308.06912  [pdf, other]

    cs.LG cs.CL

    CausalLM is not optimal for in-context learning

    Authors: Nan Ding, Tomer Levinboim, Jialin Wu, Sebastian Goodman, Radu Soricut

    Abstract: Recent empirical evidence indicates that transformer based in-context learning performs better when using a prefix language model (prefixLM), in which in-context samples can all attend to each other, compared to causal language models (causalLM), which use auto-regressive attention that prohibits in-context samples to attend to future samples. While this result is intuitive, it is not understood f…

    Submitted 20 February, 2024; v1 submitted 13 August, 2023; originally announced August 2023.

    Comments: ICLR 2024 conference paper. Code available at: https://github.com/google-research/causallm_icl

  2. arXiv:2211.12624  [pdf, other]

    cs.LG cs.AI

    Improving Robust Generalization by Direct PAC-Bayesian Bound Minimization

    Authors: Zifan Wang, Nan Ding, Tomer Levinboim, Xi Chen, Radu Soricut

    Abstract: Recent research in robust optimization has shown an overfitting-like phenomenon in which models trained against adversarial attacks exhibit higher robustness on the training set compared to the test set. Although previous work provided theoretical explanations for this phenomenon using a robust PAC-Bayesian bound over the adversarial test error, related algorithmic derivations are at best only loo…

    Submitted 22 November, 2022; originally announced November 2022.

  3. arXiv:2203.05126  [pdf, other]

    cs.LG

    PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks

    Authors: Nan Ding, Xi Chen, Tomer Levinboim, Beer Changpinyo, Radu Soricut

    Abstract: With the increasing abundance of pretrained models in recent years, the problem of selecting the best pretrained checkpoint for a particular downstream classification task has been gaining increased attention. Although several methods have recently been proposed to tackle the selection problem (e.g. LEEP, H-score), these methods resort to applying heuristics that are not well motivated by learning…

    Submitted 19 July, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: European Conference on Computer Vision 2022 (oral)

  4. arXiv:2105.14099  [pdf, other]

    cs.LG stat.ML

    Bridging the Gap Between Practice and PAC-Bayes Theory in Few-Shot Meta-Learning

    Authors: Nan Ding, Xi Chen, Tomer Levinboim, Sebastian Goodman, Radu Soricut

    Abstract: Despite recent advances in its theoretical understanding, there still remains a significant gap in the ability of existing PAC-Bayesian theories on meta-learning to explain performance improvements in the few-shot learning setting, where the number of training examples in the target tasks is severely limited. This gap originates from an assumption in the existing theories which supposes that the n…

    Submitted 25 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: Neural Information Processing Systems 2021

  5. arXiv:2010.06150  [pdf, other]

    cs.CL cs.LG

    Improving Text Generation Evaluation with Batch Centering and Tempered Word Mover Distance

    Authors: Xi Chen, Nan Ding, Tomer Levinboim, Radu Soricut

    Abstract: Recent advances in automatic evaluation metrics for text have shown that deep contextualized word representations, such as those generated by BERT encoders, are helpful for designing metrics that correlate well with human judgements. At the same time, it has been argued that contextualized word representations exhibit sub-optimal statistical properties for encoding the true similarity between word…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020 Eval4NLP Workshop

  6. arXiv:1911.09753  [pdf, other]

    cs.CV cs.CL

    Reinforcing an Image Caption Generator Using Off-Line Human Feedback

    Authors: Paul Hongsuck Seo, Piyush Sharma, Tomer Levinboim, Bohyung Han, Radu Soricut

    Abstract: Human ratings are currently the most accurate way to assess the quality of an image captioning model, yet most often the only used outcome of an expensive human rating evaluation is a few overall statistics over the evaluation dataset. In this paper, we show that the signal from instance-level human caption ratings can be leveraged to improve captioning models, even when the amount of caption rati…

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: AAAI 2020

  7. arXiv:1909.03396  [pdf, other]

    cs.CL cs.CV

    Quality Estimation for Image Captions Based on Large-scale Human Evaluations

    Authors: Tomer Levinboim, Ashish V. Thapliyal, Piyush Sharma, Radu Soricut

    Abstract: Automatic image captioning has improved significantly over the last few years, but the problem is far from being solved, with state of the art models still often producing low quality captions when used in the wild. In this paper, we focus on the task of Quality Estimation (QE) for image captions, which attempts to model the caption quality from a human perspective and without access to ground-tru…

    Submitted 1 June, 2021; v1 submitted 8 September, 2019; originally announced September 2019.

    Comments: 10 pages, 6 figures, 3 tables. Accepted to NAACL 2021. https://www.aclweb.org/anthology/2021.naacl-main.253/

  8. arXiv:1906.08876  [pdf, other]

    cs.CL cs.CV

    Informative Image Captioning with External Sources of Information

    Authors: Sanqiang Zhao, Piyush Sharma, Tomer Levinboim, Radu Soricut

    Abstract: An image caption should fluently present the essential information in a given image, including informative, fine-grained entity mentions and the manner in which these entities interact. However, current captioning models are usually trained to generate captions that only contain common object names, thus falling short on an important "informativeness" dimension. We present a mechanism for integrat…

    Submitted 20 June, 2019; originally announced June 2019.