Showing 1–10 of 10 results for author: Balaguer, J

  1. arXiv:2406.15310  [pdf, other]

    hep-th

    Massive IIA flux compactifications with dynamical open strings

    Authors: Juan Ramón Balaguer, Valentina Bevilacqua, Giuseppe Dibitetto, Jose J. Fernández-Melgarejo, Giuseppe Sudano

    Abstract: We consider massive type IIA compactifications down to 4 dimensions in presence of O6 planes and D6 branes parallel to them, in order to preserve half-maximal supersymmetry in 4D. The dynamics of open strings living on the spacetime filling branes is taken into account, in the gauged supergravity description, by adding extra vector multiplets and embedding tensor components. The scalar potential g…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 39 pages

  2. arXiv:2404.15059  [pdf]

    cs.AI cs.CY cs.GT

    Using deep reinforcement learning to promote sustainable human behaviour on a common pool resource problem

    Authors: Raphael Koster, Miruna Pîslar, Andrea Tacchetti, Jan Balaguer, Leqi Liu, Romuald Elie, Oliver P. Hauser, Karl Tuyls, Matt Botvinick, Christopher Summerfield

    Abstract: A canonical social dilemma arises when finite resources are allocated to a group of people, who can choose to either reciprocate with interest, or keep the proceeds for themselves. What resource allocation mechanisms will encourage levels of reciprocation that sustain the commons? Here, in an iterated multiplayer trust game, we use deep reinforcement learning (RL) to design an allocation mechanism…

    Submitted 23 April, 2024; originally announced April 2024.

  3. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  4. Open Strings in IIB Orientifold Reductions

    Authors: Juan R. Balaguer, Giuseppe Dibitetto, Jose J. Fernandez-Melgarejo, Alejandro Ruiperez

    Abstract: We consider type IIB compactifications on a general 4D group manifold with different types of possible spacetime filling O-planes and the corresponding D-branes parallel to them. Once fluxes allowed by the associated orientifold projection are included, a 6D $\mathcal{N}=(1,1)$ gauged supergravity is obtained. In this paper we show how the consistent coupling to dynamical open strings living on th…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 45 pages, 1 figure

  5. arXiv:2211.15006  [pdf, other]

    cs.LG cs.CL

    Fine-tuning language models to find agreement among humans with diverse preferences

    Authors: Michiel A. Bakker, Martin J. Chadwick, Hannah R. Sheahan, Michael Henry Tessler, Lucy Campbell-Gillingham, Jan Balaguer, Nat McAleese, Amelia Glaese, John Aslanides, Matthew M. Botvinick, Christopher Summerfield

    Abstract: Recent work in large language modeling (LLMs) has used fine-tuning to align outputs with the preferences of a prototypical user. This work assumes that human preferences are static and homogeneous across individuals, so that aligning to a single "generic" user will confer more general alignment. Here, we embrace the heterogeneity of human preferences to consider a different challenge: how might…

    Submitted 27 November, 2022; originally announced November 2022.

  6. arXiv:2202.10135  [pdf, other]

    cs.MA cs.AI cs.LG econ.GN

    The Good Shepherd: An Oracle Agent for Mechanism Design

    Authors: Jan Balaguer, Raphael Koster, Christopher Summerfield, Andrea Tacchetti

    Abstract: From social networks to traffic routing, artificial learning agents are playing a central role in modern institutions. We must therefore understand how to leverage these systems to foster outcomes and behaviors that align with our own values and aspirations. While multiagent learning has received considerable attention in recent years, artificial agents have been primarily evaluated when interacti…

    Submitted 21 February, 2022; originally announced February 2022.

  7. arXiv:2202.10122  [pdf, other]

    cs.MA cs.AI cs.LG econ.GN

    HCMD-zero: Learning Value Aligned Mechanisms from Data

    Authors: Jan Balaguer, Raphael Koster, Ari Weinstein, Lucy Campbell-Gillingham, Christopher Summerfield, Matthew Botvinick, Andrea Tacchetti

    Abstract: Artificial learning agents are mediating a larger and larger number of interactions among humans, firms, and organizations, and the intersection between mechanism design and machine learning has been heavily investigated in recent years. However, mechanism design methods often make strong assumptions on how participants behave (e.g. rationality), on the kind of knowledge designers have access to a…

    Submitted 20 May, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

  8. arXiv:2201.11441  [pdf]

    cs.AI cs.HC cs.MA econ.GN

    Human-centered mechanism design with Democratic AI

    Authors: Raphael Koster, Jan Balaguer, Andrea Tacchetti, Ari Weinstein, Tina Zhu, Oliver Hauser, Duncan Williams, Lucy Campbell-Gillingham, Phoebe Thacker, Matthew Botvinick, Christopher Summerfield

    Abstract: Building artificial intelligence (AI) that aligns with human values is an unsolved problem. Here, we developed a human-in-the-loop research pipeline called Democratic AI, in which reinforcement learning is used to design a social mechanism that humans prefer by majority. A large group of humans played an online investment game that involved deciding whether to keep a monetary endowment or to share…

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: 18 pages, 4 figures, 54 pages including supplemental materials

  9. arXiv:2107.05407  [pdf, other]

    cs.LG cs.AI cs.CC

    PonderNet: Learning to Ponder

    Authors: Andrea Banino, Jan Balaguer, Charles Blundell

    Abstract: In standard neural networks the amount of computation used grows with the size of the inputs, but not with the complexity of the problem being learnt. To overcome this limitation we introduce PonderNet, a new algorithm that learns to adapt the amount of computation based on the complexity of the problem at hand. PonderNet learns end-to-end the number of computational steps to achieve an effective…

    Submitted 2 September, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: 16 pages, 2 figures, 2 tables, 8th ICML Workshop on Automated Machine Learning (2021)

  10. New IIB intersecting brane solutions yielding supersymmetric AdS$_3$ vacua

    Authors: Juan R. Balaguer, Giuseppe Dibitetto, Jose J. Fernandez-Melgarejo

    Abstract: We consider genuine type IIB string theory (supersymmetric) brane intersections that preserve $(1+1)$D Lorentz symmetry. We provide the full supergravity solutions in their analytic form and discuss their physical properties. The Ansatz for the spacetime dependence of the different brane warp factors goes beyond the harmonic superposition principle. By studying the associated near-horizon geometry…

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: 18 pages