Skip to main content

Showing 1–7 of 7 results for author: Ryan, M J

.
  1. arXiv:2406.11695  [pdf, other

    cs.CL cs.AI cs.LG

    Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs

    Authors: Krista Opsahl-Ong, Michael J Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab

    Abstract: Language Model Programs, i.e. sophisticated pipelines of modular language model (LM) calls, are increasingly advancing NLP tasks, but they require crafting prompts that are jointly effective for all modules. We study prompt optimization for LM programs, i.e. how to update these prompts to maximize a downstream metric without access to module-level labels or gradients. To make this tractable, we fa… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Krista and Michael contributed equally to this work

  2. arXiv:2402.15018  [pdf, other

    cs.CL cs.CY cs.LG

    Unintended Impacts of LLM Alignment on Global Representation

    Authors: Michael J. Ryan, William Held, Diyi Yang

    Abstract: Before being deployed for user-facing applications, developers align Large Language Models (LLMs) to user preferences through a variety of procedures, such as Reinforcement Learning From Human Feedback (RLHF) and Direct Preference Optimization (DPO). Current evaluations of these procedures focus on benchmarks of instruction following, reasoning, and truthfulness. However, human preferences are not… ▽ More

    Submitted 6 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024 main conference

  3. arXiv:2305.15678  [pdf, other

    cs.CL cs.AI

    Revisiting non-English Text Simplification: A Unified Multilingual Benchmark

    Authors: Michael J. Ryan, Tarek Naous, Wei Xu

    Abstract: Recent advancements in high-quality, large-scale English resources have pushed the frontier of English Automatic Text Simplification (ATS) research. However, less work has been done on multilingual text simplification due to the lack of a diverse evaluation benchmark that covers complex-simple sentence pairs in many languages. This paper introduces the MultiSim benchmark, a collection of 27 resour… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023 main conference

  4. arXiv:2305.14463  [pdf, other

    cs.CL cs.AI cs.LG

    ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment

    Authors: Tarek Naous, Michael J. Ryan, Anton Lavrouk, Mohit Chandra, Wei Xu

    Abstract: We present a comprehensive evaluation of large language models for multilingual readability assessment. Existing evaluation resources lack domain and language diversity, limiting the ability for cross-domain and cross-lingual analyses. This paper introduces ReadMe++, a multilingual multi-domain dataset with human annotations of 9757 sentences in Arabic, English, French, Hindi, and Russian, collect… ▽ More

    Submitted 8 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  5. arXiv:2305.14456  [pdf, other

    cs.CL cs.AI cs.LG

    Having Beer after Prayer? Measuring Cultural Bias in Large Language Models

    Authors: Tarek Naous, Michael J. Ryan, Alan Ritter, Wei Xu

    Abstract: As the reach of large language models (LMs) expands globally, their ability to cater to diverse cultural contexts becomes crucial. Despite advancements in multilingual capabilities, models are not designed with appropriate cultural nuances. In this paper, we show that multilingual and Arabic monolingual LMs exhibit bias towards entities associated with Western culture. We introduce CAMeL, a novel… ▽ More

    Submitted 20 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  6. arXiv:2204.01711  [pdf, other

    eess.IV cs.CV

    Single Image Internal Distribution Measurement Using Non-Local Variational Autoencoder

    Authors: Yeahia Sarker, Abdullah-Al-Zubaer Imran, Md Hafiz Ahamed, Ripon K. Chakrabortty, Michael J. Ryan, Sajal K. Das

    Abstract: Deep learning-based super-resolution methods have shown great promise, especially for single image super-resolution (SISR) tasks. Despite the performance gain, these methods are limited due to their reliance on copious data for model training. In addition, supervised SISR solutions rely on local neighbourhood information focusing only on the feature learning processes for the reconstruction of low… ▽ More

    Submitted 2 April, 2022; originally announced April 2022.

    Comments: A Preprint Version

  7. arXiv:0912.4300  [pdf

    physics.ins-det hep-ex

    Radiation hardness qualification of PbWO4 scintillation crystals for the CMS Electromagnetic Calorimeter

    Authors: The CMS Electromagnetic Calorimeter Group, P. Adzic, N. Almeida, D. Andelin, I. Anicin, Z. Antunovic, R. Arcidiacono, M. W. Arenton, E. Auffray, S. Argiro, A. Askew, S. Baccaro, S. Baffioni, M. Balazs, D. Bandurin, D. Barney, L. M. Barone, A. Bartoloni, C. Baty, S. Beauceron, K. W. Bell, C. Bernet, M. Besancon, B. Betev, R. Beuselinck , et al. (245 additional authors not shown)

    Abstract: Ensuring the radiation hardness of PbWO4 crystals was one of the main priorities during the construction of the electromagnetic calorimeter of the CMS experiment at CERN. The production on an industrial scale of radiation hard crystals and their certification over a period of several years represented a difficult challenge both for CMS and for the crystal suppliers. The present article reviews t… ▽ More

    Submitted 21 December, 2009; originally announced December 2009.

    Comments: 24 pages, 19 figures, available on CMS information server at http://cms.cern.ch/iCMS/

    Report number: CMS Note 2009/016

    Journal ref: JINST 5:P03010,2010