
Showing 1–6 of 6 results for author: Peng, A Y

  1. arXiv:2402.02636  [pdf, other]

    cs.CL cs.AI cs.IT cs.LG

    Can Large Language Models Learn Independent Causal Mechanisms?

    Authors: Gaël Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael Witbrock, Gillian Dobbie

    Abstract: Despite impressive performance on language modelling and complex reasoning tasks, Large Language Models (LLMs) fall short on the same tasks in uncommon settings or with distribution shifts, exhibiting some lack of generalisation ability. This issue has usually been alleviated by feeding more training data into the LLM. However, this method is brittle, as the scope of tasks may not be readily predi…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 17 pages, 8 pages for the main paper and 9 pages for references and appendices, 12 figures

    ACM Class: I.2.3; I.2.6; I.2.7; G.3

  2. arXiv:2310.09430  [pdf, ps, other]

    cs.CL cs.AI

    Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning

    Authors: Qiming Bao, Gael Gendron, Alex Yuxuan Peng, Wanjun Zhong, Neset Tan, Yang Chen, Michael Witbrock, Jiamou Liu

    Abstract: Large language models (LLMs), such as LLaMA, Alpaca, Vicuna, GPT-3.5 and GPT-4, have advanced the performance of AI systems on various natural language processing tasks to human-like levels. However, their generalisation and robustness when performing logical reasoning has not been sufficiently assessed. To comprehensively evaluate this ability, we develop three new logical reasoning datasets name…

    Submitted 30 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: The short version (v3) was accepted for oral presentation at the first LLM@IJCAI 2023 non-archival symposium; the full version is under review

  3. arXiv:2309.10444  [pdf, other]

    cs.AI cs.CL

    Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language Models

    Authors: Qiming Bao, Juho Leinonen, Alex Yuxuan Peng, Wanjun Zhong, Gaël Gendron, Timothy Pistotti, Alice Huang, Paul Denny, Michael Witbrock, Jiamou Liu

    Abstract: Large language models exhibit superior capabilities in processing and understanding language, yet their applications in educational contexts remain underexplored. Learnersourcing enhances learning by engaging students in creating their own educational content. When learnersourcing multiple-choice questions, creating explanations for the solution of a question is a crucial step; it helps other stud…

    Submitted 10 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: The short version (v4) was accepted as a non-archival workshop paper at AGI@ICLR 2024; the full version is under review

  4. arXiv:2305.12599  [pdf, other]

    cs.CL cs.AI

    Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning

    Authors: Qiming Bao, Alex Yuxuan Peng, Zhenyun Deng, Wanjun Zhong, Gael Gendron, Timothy Pistotti, Neset Tan, Nathan Young, Yang Chen, Yonghua Zhu, Paul Denny, Michael Witbrock, Jiamou Liu

    Abstract: Combining large language models with logical reasoning enhances their capacity to address problems in a robust and reliable manner. Nevertheless, the intricate nature of logical reasoning poses challenges when gathering reliable data from the web to build comprehensive training datasets, subsequently affecting performance on downstream tasks. To address this, we introduce a novel logic-driven data…

    Submitted 6 June, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: 21 pages, 8 figures, Findings of ACL 2024

  5. arXiv:2303.07585  [pdf, other]

    cs.CL

    Input-length-shortening and text generation via attention values

    Authors: Neşet Özkan Tan, Alex Yuxuan Peng, Joshua Bensemann, Qiming Bao, Tim Hartill, Mark Gahegan, Michael Witbrock

    Abstract: Identifying words that impact a task's performance more than others is a challenge in natural language processing. Transformers models have recently addressed this issue by incorporating an attention mechanism that assigns greater attention (i.e., relevance) scores to some words than others. Because of the attention mechanism's high computational cost, transformer models usually have an input-leng…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 7 pages, 4 figures. AAAI23-EMC2

  6. arXiv:2207.14000  [pdf, other]

    cs.CL cs.AI cs.LG cs.LO

    Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation

    Authors: Qiming Bao, Alex Yuxuan Peng, Tim Hartill, Neset Tan, Zhenyun Deng, Michael Witbrock, Jiamou Liu

    Abstract: Combining deep learning with symbolic logic reasoning aims to capitalize on the success of both fields and is drawing increasing attention. Inspired by DeepLogic, an end-to-end model trained to perform inference on logic programs, we introduce IMA-GloVe-GA, an iterative neural inference network for multi-step reasoning expressed in natural language. In our model, reasoning is performed using an it…

    Submitted 30 March, 2024; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: 10 pages, 3 figures, The 2nd International Joint Conference on Learning & Reasoning and 16th International Workshop on Neural-Symbolic Learning and Reasoning (IJCLR-NeSy 2022)