Skip to main content

Showing 1–7 of 7 results for author: Alon, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09824  [pdf, other

    cs.CL

    Impact of Preference Noise on the Alignment Performance of Generative Language Models

    Authors: Yang Gao, Dana Alon, Donald Metzler

    Abstract: A key requirement in develo** Generative Language Models (GLMs) is to have their values aligned with human values. Preference-based alignment is a widely used paradigm for this purpose, in which preferences over generation pairs are first elicited from human annotators or AI systems, and then fed into some alignment techniques, e.g., Direct Preference Optimization. However, a substantial percent… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2404.05530  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data

    Authors: Tim Baumgärtner, Yang Gao, Dana Alon, Donald Metzler

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is a popular method for aligning Language Models (LM) with human values and preferences. RLHF requires a large number of preference pairs as training data, which are often used in both the Supervised Fine-Tuning and Reward Model training, and therefore publicly available datasets are commonly used. In this work, we study to what extent a malicious… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  3. arXiv:2311.17946  [pdf, other

    cs.CV cs.AI cs.CL

    DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback

    Authors: Jiao Sun, Deqing Fu, Yushi Hu, Su Wang, Royi Rassin, Da-Cheng Juan, Dana Alon, Charles Herrmann, Sjoerd van Steenkiste, Ranjay Krishna, Cyrus Rashtchian

    Abstract: Despite their wide-spread success, Text-to-Image models (T2I) still struggle to produce images that are both aesthetically pleasing and faithful to the user's input text. We introduce DreamSync, a model-agnostic training algorithm by design that improves T2I models to be faithful to the text input. DreamSync builds off a recent insight from TIFA's evaluation framework -- that large vision-language… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  4. iMagLS: Interaural Level Difference with Magnitude Least-Squares Loss for Optimized First-Order Head-Related Transfer Function

    Authors: Or Berebi, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely

    Abstract: Binaural reproduction for headphone-based listening is an active research area due to its widespread use in evolving technologies such as augmented and virtual reality (AR and VR). On the one hand, these applications demand high quality spatial audio perception to preserve the sense of immersion. On the other hand, recording devices may only have a few microphones, leading to low-order representat… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 3 pages, 2 figures, Forum Acusticum 2023

  5. arXiv:2310.14408  [pdf, other

    cs.IR

    PaRaDe: Passage Ranking using Demonstrations with Large Language Models

    Authors: Andrew Drozdov, Honglei Zhuang, Zhuyun Dai, Zhen Qin, Razieh Rahimi, Xuanhui Wang, Dana Alon, Mohit Iyyer, Andrew McCallum, Donald Metzler, Kai Hui

    Abstract: Recent studies show that large language models (LLMs) can be instructed to effectively perform zero-shot passage re-ranking, in which the results of a first stage retrieval method, such as BM25, are rated and reordered to improve relevance. In this work, we improve LLM-based re-ranking by algorithmically selecting few-shot demonstrations to include in the prompt. Our analysis investigates the cond… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  6. arXiv:2309.10539  [pdf, other

    cs.CL cs.AI

    OpenMSD: Towards Multilingual Scientific Documents Similarity Measurement

    Authors: Yang Gao, Ji Ma, Ivan Korotkov, Keith Hall, Dana Alon, Don Metzler

    Abstract: We develop and evaluate multilingual scientific documents similarity measurement models in this work. Such models can be used to find related works in different languages, which can help multilingual researchers find and explore papers more efficiently. We propose the first multilingual scientific documents dataset, Open-access Multilingual Scientific Documents (OpenMSD), which has 74M papers in 1… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Scripts for constructing the OpenMSD dataset is available at: https://github.com/google-research/google-research/tree/master/OpenMSD

  7. arXiv:2304.11517  [pdf, other

    cs.LG cs.AI

    LayerNAS: Neural Architecture Search in Polynomial Complexity

    Authors: Yicheng Fan, Dana Alon, **gyue Shen, Daiyi Peng, Keshav Kumar, Yun Long, Xin Wang, Fotis Iliopoulos, Da-Cheng Juan, Erik Vee

    Abstract: Neural Architecture Search (NAS) has become a popular method for discovering effective model architectures, especially for target hardware. As such, NAS methods that find optimal architectures under constraints are essential. In our paper, we propose LayerNAS to address the challenge of multi-objective NAS by transforming it into a combinatorial optimization problem, which effectively constrains t… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.