Skip to main content

Showing 1–2 of 2 results for author: Alonso, V A

.
  1. arXiv:2311.04046  [pdf, other

    cs.LG cs.CL

    Reinforcement Learning Fine-tuning of Language Models is Biased Towards More Extractable Features

    Authors: Diogo Cruz, Edoardo Pona, Alex Holness-Tofts, Elias Schmied, Víctor Abia Alonso, Charlie Griffin, Bogdan-Ionut Cirstea

    Abstract: Many capable large language models (LLMs) are developed via self-supervised pre-training followed by a reinforcement-learning fine-tuning phase, often based on human or AI feedback. During this stage, models may be guided by their inductive biases to rely on simpler features which may be easier to extract, at a cost to robustness and generalisation. We investigate whether principles governing indu… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  2. Discovery of TOI-1260d and the characterisation of the multi-planet system

    Authors: Kristine W. F. Lam, J. Cabrera, M. J. Hooton, Y. Alibert, A. Bonfanti, M. Beck, A. Deline, H. -G. Florén, A. E. Simon, L. Fossati, C. M. Persson, M. Fridlund, S. Salmon, S. Hoyer, H. P. Osborn, T . G. Wilson, I. Y. Georgieva, Gr. Nowak, R. Luque, J. A. Egger, V. Adibekyan R. Alonso, G. Anglada Escudé, T. Bárczy, D. Barrado, S. C. C. Barros , et al. (61 additional authors not shown)

    Abstract: We report the discovery of a third planet transiting the star TOI-1260, previously known to host two transiting sub-Neptune planets with orbital periods of 3.127 and 7.493 days, respectively. The nature of the third transiting planet with a 16.6-day orbit is supported by ground-based follow-up observations, including time-series photometry, high-angular resolution images, spectroscopy, and archiva… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 18 pages, 10 figures, accepted for publication in MNRAS