Showing 1–5 of 5 results for author: Arkhangorodsky, A

  1. arXiv:2404.18796  [pdf, other]

    cs.CL cs.AI

    Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

    Authors: Pat Verga, Sebastian Hofstatter, Sophia Althammer, Yixuan Su, Aleksandra Piktus, Arkady Arkhangorodsky, Minjie Xu, Naomi White, Patrick Lewis

    Abstract: As Large Language Models (LLMs) have become more advanced, they have outpaced our abilities to accurately evaluate their quality. Not only is finding data to adequately probe particular model properties difficult, but evaluating the correctness of a model's freeform generation alone is a challenge. To address this, many evaluations now rely on using LLMs themselves as judges to score the quality o…

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  2. arXiv:2109.09597  [pdf, other]

    cs.CL cs.AI cs.GT

    Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play

    Authors: Arkady Arkhangorodsky, Scot Fang, Victoria Knight, Ajay Nagesh, Maria Ryskina, Kevin Knight

    Abstract: Task-oriented dialog systems are often trained on human/human dialogs, such as collected from Wizard-of-Oz interfaces. However, human/human corpora are frequently too small for supervised training to be effective. This paper investigates two approaches to training agent-bots and user-bots through self-play, in which they autonomously explore an API environment, discovering communication strategies…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 4 pages, 5 figures

  3. arXiv:2109.09577  [pdf, other]

    cs.CL cs.AI

    MeetDot: Videoconferencing with Live Translation Captions

    Authors: Arkady Arkhangorodsky, Christopher Chu, Scot Fang, Yiqi Huang, Denglin Jiang, Ajay Nagesh, Boliang Zhang, Kevin Knight

    Abstract: We present MeetDot, a videoconferencing system with live translation captions overlaid on screen. The system aims to facilitate conversation between people who speak different languages, thereby reducing communication barriers between multilingual participants. Currently, our system supports speech and captions in 4 languages and combines automatic speech recognition (ASR) and machine translation…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 7 pages, 4 figures, Accepted as EMNLP 2021 demo paper

  4. arXiv:2010.04747  [pdf, other]

    cs.CL

    MEEP: An Open-Source Platform for Human-Human Dialog Collection and End-to-End Agent Training

    Authors: Arkady Arkhangorodsky, Amittai Axelrod, Christopher Chu, Scot Fang, Yiqi Huang, Ajay Nagesh, Xing Shi, Boliang Zhang, Kevin Knight

    Abstract: We create a new task-oriented dialog platform (MEEP) where agents are given considerable freedom in terms of utterances and API calls, but are constrained to work within a push-button environment. We include facilities for collecting human-human dialog corpora, and for training automatic agents in an end-to-end fashion. We demonstrate MEEP with a dialog assistant that lets users specify trip desti…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: 10 pages

  5. arXiv:2004.08752  [pdf, other]

    cs.RO

    Zeus: A System Description of the Two-Time Winner of the Collegiate SAE AutoDrive Competition

    Authors: Keenan Burnett, Jingxing Qian, Xintong Du, Linqiao Liu, David J. Yoon, Tianchang Shen, Susan Sun, Sepehr Samavi, Michael J. Sorocky, Mollie Bianchi, Kaicheng Zhang, Arkady Arkhangorodsky, Quinlan Sykora, Shichen Lu, Yizhou Huang, Angela P. Schoellig, Timothy D. Barfoot

    Abstract: The SAE AutoDrive Challenge is a three-year collegiate competition to develop a self-driving car by 2020. The second year of the competition was held in June 2019 at MCity, a mock town built for self-driving car testing at the University of Michigan. Teams were required to autonomously navigate a series of intersections while handling pedestrians, traffic lights, and traffic signs. Zeus is aUToron…

    Submitted 18 April, 2020; originally announced April 2020.

    Comments: Submitted to the Journal of Field Robotics