Skip to main content

Showing 1–7 of 7 results for author: O'Neill, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.00343  [pdf, other

    cs.CL

    Beyond Metrics: Evaluating LLMs' Effectiveness in Culturally Nuanced, Low-Resource Real-World Scenarios

    Authors: Millicent Ochieng, Varun Gumma, Sunayana Sitaram, **dong Wang, Vishrav Chaudhary, Keshet Ronen, Kalika Bali, Jacki O'Neill

    Abstract: The deployment of Large Language Models (LLMs) in real-world applications presents both opportunities and challenges, particularly in multilingual and code-mixed communication settings. This research evaluates the performance of seven leading LLMs in sentiment analysis on a dataset derived from multilingual and code-mixed WhatsApp chats, including Swahili, English and Sheng. Our evaluation include… ▽ More

    Submitted 13 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  2. arXiv:2403.15412  [pdf, other

    cs.CY cs.AI cs.CL

    Towards Measuring and Modeling "Culture" in LLMs: A Survey

    Authors: Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury

    Abstract: We present a survey of more than 90 recent papers that aim to study cultural representation and inclusion in large language models (LLMs). We observe that none of the studies explicitly define "culture, which is a complex, multifaceted concept; instead, they probe the models on some specially designed datasets which represent certain aspects of "culture". We call these aspects the proxies of cultu… ▽ More

    Submitted 19 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  3. arXiv:2204.11660  [pdf, ps, other

    cs.CL cs.AI cs.LG

    A Survey on Word Meta-Embedding Learning

    Authors: Danushka Bollegala, James O'Neill

    Abstract: Meta-embedding (ME) learning is an emerging approach that attempts to learn more accurate word embeddings given existing (source) word embeddings as the sole input. Due to their ability to incorporate semantics from multiple source embeddings in a compact manner with superior performance, ME learning has gained popularity among practitioners in NLP. To the best of our knowledge, there exist no… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI-2022)

    Journal ref: Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI-2022)

  4. arXiv:2111.14655  [pdf, other

    cs.LG cs.AI cs.DC

    FedHM: Efficient Federated Learning for Heterogeneous Models via Low-rank Factorization

    Authors: Dezhong Yao, Wanning Pan, Michael J O'Neill, Yutong Dai, Yao Wan, Hai **, Lichao Sun

    Abstract: One underlying assumption of recent federated learning (FL) paradigms is that all local models usually share the same network architecture and size, which becomes impractical for devices with different hardware resources. A scalable federated learning framework should address the heterogeneity that clients have different computing capacities and communication capabilities. To this end, this paper… ▽ More

    Submitted 26 May, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

  5. arXiv:2104.06893  [pdf, other

    cs.CL cs.AI cs.LG

    I Wish I Would Have Loved This One, But I Didn't -- A Multilingual Dataset for Counterfactual Detection in Product Reviews

    Authors: James O'Neill, Polina Rozenshtein, Ryuichi Kiryo, Motoko Kubota, Danushka Bollegala

    Abstract: Counterfactual statements describe events that did not or cannot take place. We consider the problem of counterfactual detection (CFD) in product reviews. For this purpose, we annotate a multilingual CFD dataset from Amazon product reviews covering counterfactual statements written in English, German, and Japanese languages. The dataset is unique as it contains counterfactuals in multiple language… ▽ More

    Submitted 15 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted to EMNLP 2021

  6. arXiv:2005.05754  [pdf, other

    cs.IR cs.CL

    Do not let the history haunt you -- Mitigating Compounding Errors in Conversational Question Answering

    Authors: Angrosh Mandya, James O'Neill, Danushka Bollegala, Frans Coenen

    Abstract: The Conversational Question Answering (CoQA) task involves answering a sequence of inter-related conversational questions about a contextual paragraph. Although existing approaches employ human-written ground-truth answers for answering conversational questions at test time, in a realistic scenario, the CoQA model will not have any access to ground-truth answers for the previous questions, compell… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

  7. arXiv:1710.01823  [pdf, other

    cs.AI

    Automatic Taxonomy Generation - A Use-Case in the Legal Domain

    Authors: Cécile Robin, James O'Neill, Paul Buitelaar

    Abstract: A key challenge in the legal domain is the adaptation and representation of the legal knowledge expressed through texts, in order for legal practitioners and researchers to access this information easier and faster to help with compliance related issues. One way to approach this goal is in the form of a taxonomy of legal concepts. While this task usually requires a manual construction of terms and… ▽ More

    Submitted 4 October, 2017; originally announced October 2017.

    Comments: 9 pages