Skip to main content

Showing 1–12 of 12 results for author: Keller, O

.
  1. arXiv:2405.14655  [pdf, other

    cs.LG

    Multi-turn Reinforcement Learning from Preference Human Feedback

    Authors: Lior Shani, Aviv Rosenberg, Asaf Cassel, Oran Lang, Daniele Calandriello, Avital Zipori, Hila Noga, Orgad Keller, Bilal Piot, Idan Szpektor, Avinatan Hassidim, Yossi Matias, Rémi Munos

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become the standard approach for aligning Large Language Models (LLMs) with human preferences, allowing LLMs to demonstrate remarkable abilities in various tasks. Existing methods work by emulating the preferences at the single decision (turn) level, limiting their capabilities in settings that require planning or multi-turn interactions to ach… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2306.00186  [pdf, other

    cs.CL

    Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback

    Authors: Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor

    Abstract: Despite the seeming success of contemporary grounded text generation systems, they often tend to generate factually inconsistent text with respect to their input. This phenomenon is emphasized in tasks like summarization, in which the generated summaries should be corroborated by their source article. In this work, we leverage recent progress on textual entailment models to directly address this p… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: ACL 2023

  4. arXiv:2208.02294  [pdf, other

    cs.CL cs.LG

    Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning

    Authors: Deborah Cohen, Moonkyung Ryu, Yinlam Chow, Orgad Keller, Ido Greenberg, Avinatan Hassidim, Michael Fink, Yossi Matias, Idan Szpektor, Craig Boutilier, Gal Elidan

    Abstract: Despite recent advances in natural language understanding and generation, and decades of research on the development of conversational bots, building automated agents that can carry on rich open-ended conversations with humans "in the wild" remains a formidable challenge. In this work we develop a real-time, open-ended dialogue system that uses reinforcement learning (RL) to power a bot's conversa… ▽ More

    Submitted 25 July, 2022; originally announced August 2022.

  5. arXiv:2206.14796  [pdf, other

    cs.CL cs.AI cs.LG

    On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method

    Authors: Zorik Gekhman, Nadav Oved, Orgad Keller, Idan Szpektor, Roi Reichart

    Abstract: Most works on modeling the conversation history in Conversational Question Answering (CQA) report a single main result on a common CQA benchmark. While existing models show impressive results on CQA leaderboards, it remains unclear whether they are robust to shifts in setting (sometimes to more realistic ones), training data size (e.g. from large to small sets) and domain. In this work, we design… ▽ More

    Submitted 28 December, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: Accepted for publication at TACL in December 2022. First two authors contributed equally to this work. Our code and data will be released at: https://github.com/zorikg/MarCQAp

  6. arXiv:2010.02592  [pdf, other

    cs.CL cs.AI cs.LG

    Semantically Driven Sentence Fusion: Modeling and Evaluation

    Authors: Eyal Ben-David, Orgad Keller, Eric Malmi, Idan Szpektor, Roi Reichart

    Abstract: Sentence fusion is the task of joining related sentences into coherent text. Current training and evaluation schemes for this task are based on single reference ground-truths and do not account for valid fusion variants. We show that this hinders models from robustly capturing the semantic relationship between input sentences. To alleviate this, we present an approach in which ground-truth solutio… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: This paper was accepted to Findings of EMNLP 2020

  7. arXiv:1708.04862  [pdf, other

    cs.DS

    New Approximations for Coalitional Manipulation in General Scoring Rules

    Authors: Orgad Keller, Avinatan Hassidim, Noam Hazon

    Abstract: We study the problem of coalitional manipulation---where $k$ manipulators try to manipulate an election on $m$ candidates---under general scoring rules, with a focus on the Borda protocol. We do so both in the weighted and unweighted settings. We focus on minimizing the maximum score obtainable by a non-preferred candidate. In the strongest, most general setting, we provide an algorithm for any… ▽ More

    Submitted 16 August, 2017; originally announced August 2017.

  8. arXiv:1307.2415  [pdf, ps, other

    cs.DS

    Finding the Minimum-Weight k-Path

    Authors: Avinatan Hassidim, Orgad Keller, Moshe Lewenstein, Liam Roditty

    Abstract: Given a weighted $n$-vertex graph $G$ with integer edge-weights taken from a range $[-M,M]$, we show that the minimum-weight simple path visiting $k$ vertices can be found in time $\tilde{O}(2^k \poly(k) M n^ω) = O^*(2^k M)$. If the weights are reals in $[1,M]$, we provide a $(1+\varepsilon)$-approximation which has a running time of $\tilde{O}(2^k \poly(k) n^ω(\log\log M + 1/\varepsilon))$. For t… ▽ More

    Submitted 9 July, 2013; originally announced July 2013.

    Comments: To appear at WADS 2013

  9. First Quantized Theory of the Photon

    Authors: Zhi-Yong Wang, Cai-Dong Xiong, Ole Keller

    Abstract: In near-field optics and optical tunneling theory, photon wave mechanics, i.e., the first quantized theory of the photon, allows us to address the spatial field localization problem in a flexible manner which links smoothly to classical electromagnetics. In this letter, photon wave mechanics is developed in a rigorous and unified way, based on which field quantization is obtained in a new way.

    Submitted 9 January, 2007; v1 submitted 11 May, 2006; originally announced May 2006.

    Comments: Detailed version by adding the last paragraph as well as Appendix A and B

    Journal ref: Chin. Phys. Lett. 24, 418 (2007)

  10. arXiv:quant-ph/0511270  [pdf

    quant-ph

    Photon Position Operator and Localization of Photons inside a Waveguide

    Authors: Zhi-Yong Wang, Cai-Dong Xiong, Ole Keller

    Abstract: In this article, we show that in the level of quantum mechanics, a photon position operator with commuting components can be obtained in a more natural way; in the level of quantum field theory, the photon position operator corresponds to the center of the photon number. It is most interesting for us to show that, a photon inside a waveguide can be localized in the same sense that a massive part… ▽ More

    Submitted 30 November, 2005; originally announced November 2005.

    Comments: 22 pages, 1 figure

  11. Photon Wave Mechanics

    Authors: Zhi-Yong Wang, Cai-Dong Xiong, Ole Keller

    Abstract: In contrast to wave functions in nonrelativistic quantum mechanics interpreted as probability amplitudes, wave functions in relativistic quantum mechanics have generalized meanings such as charge-density amplitudes, energy-density amplitudes as well as particle-number density amplitudes, etc. Applying electromagnetic field intensities we construct a photon wave function, it corresponds to the (1… ▽ More

    Submitted 13 March, 2009; v1 submitted 18 November, 2005; originally announced November 2005.

    Comments: Revised version based on Chin. Phys. Lett. 24, 418 (2007)

    Journal ref: Chin. Phys. Lett. 24, 418 (2007)

  12. arXiv:cond-mat/9910148  [pdf, ps, other

    cond-mat.mes-hall physics.optics

    Local-field study of phase conjugation in metallic quantum wells with probe fields of both propagating and evanescent character

    Authors: Torsten Andersen, Ole Keller

    Abstract: The phase conjugated response from nonmagnetic multi-level metallic quantum wells is analyzed and an essentially complete analytical solution is presented and discussed. The description is based on a semi-classical local-field theory for degenerate four-wave mixing in mesoscopic interaction volumes of condensed media developed by the present authors [T. Andersen and O. Keller, Phys. Scripta 58,… ▽ More

    Submitted 10 October, 1999; originally announced October 1999.

    Comments: 18 pages, 12 figures, 3 tables, accepted for publication in The Physical Review B. Figs. 3-6 reduced quality due to size limit

    Journal ref: Physical Review B, vol. 60, pp. 17046-17063 (1999).