Skip to main content

Showing 1–10 of 10 results for author: She, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.13923  [pdf, other

    cs.CL

    Why Not Transform Chat Large Language Models to Non-English?

    Authors: Xiang Geng, Ming Zhu, Jiahuan Li, Zhejian Lai, Wei Zou, Shuaijie She, Jiaxin Guo, Xiaofeng Zhao, Yinglu Li, Yuang Li, Chang Su, Yanqing Zhao, Xinglin Lyu, Min Zhang, Jiajun Chen, Hao Yang, Shujian Huang

    Abstract: The scarcity of non-English data limits the development of non-English large language models (LLMs). Transforming English-centric LLMs to non-English has been identified as an effective and resource-efficient method. Previous works start from base LLMs and perform knowledge distillation (KD) with data generated by stronger LLMs, e.g. GPT-4. Compared to base LLMs, chat LLMs are further optimized fo… ▽ More

    Submitted 31 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2401.07817  [pdf, other

    cs.CL

    Question Translation Training for Better Multilingual Reasoning

    Authors: Wenhao Zhu, Shujian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch

    Abstract: Large language models show compelling performance on reasoning tasks but they tend to perform much worse in languages other than English. This is unsurprising given that their training data largely consists of English text and instructions. A typical solution is to translate instruction data into all languages of interest, and then train on the resulting multilingual data, which is called translat… ▽ More

    Submitted 29 June, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted to Findings of ACL 2024

  3. arXiv:2401.06838  [pdf, other

    cs.CL

    MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization

    Authors: Shuaijie She, Wei Zou, Shujian Huang, Wenhao Zhu, Xiang Liu, Xiang Geng, Jiajun Chen

    Abstract: Though reasoning abilities are considered language-agnostic, existing LLMs exhibit inconsistent reasoning abilities across different languages, e.g., reasoning in the dominant language like English is superior to other languages due to the imbalance of multilingual training data. To enhance reasoning abilities in non-dominant languages, we propose a Multilingual-Alignment-as-Preference Optimizatio… ▽ More

    Submitted 13 April, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: The project is available at https://github.com/NJUNLP/MAPO

  4. arXiv:2311.07194  [pdf, other

    cs.CL

    Exploring the Factual Consistency in Dialogue Comprehension of Large Language Models

    Authors: Shuaijie She, Shujian Huang, Xingyun Wang, Yanke Zhou, Jiajun Chen

    Abstract: LLMs (Large Language Models) usually interact with users in the form of dialogue and generate responses following their instructions, which naturally require dialogue comprehension abilities. However, dialogue comprehension is a general language ability which is hard to be evaluated directly. In this work, we propose to perform the evaluation focusing on the factual consistency issue with the help… ▽ More

    Submitted 1 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted at NAACL2024 Main

  5. ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning

    Authors: **gyuan Selena She, Christopher Potts, Samuel R. Bowman, Atticus Geiger

    Abstract: A number of recent benchmarks seek to assess how well models handle natural language negation. However, these benchmarks lack the controlled example paradigms that would allow us to infer whether a model had learned how negation morphemes semantically scope. To fill these analytical gaps, we present the Scoped Negation NLI (ScoNe-NLI) benchmark, which contains contrast sets of six examples with up… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  6. arXiv:2212.01611  [pdf, other

    cs.CL

    CoP: Factual Inconsistency Detection by Controlling the Preference

    Authors: Shuaijie She, Xiang Geng, Shujian Huang, Jiajun Chen

    Abstract: Abstractive summarization is the process of generating a summary given a document as input. Although significant progress has been made, the factual inconsistency between the document and the generated summary still limits its practical applications. Previous work found that the probabilities assigned by the generation model reflect its preferences for the generated summary, including the preferen… ▽ More

    Submitted 30 March, 2023; v1 submitted 3 December, 2022; originally announced December 2022.

    Comments: Accepted to AAAI2023 regular paper

  7. arXiv:2212.01488  [pdf

    cs.CL cs.AI

    Event knowledge in large language models: the gap between the impossible and the unlikely

    Authors: Carina Kauf, Anna A. Ivanova, Giulia Rambelli, Emmanuele Chersoni, **gyuan Selena She, Zawad Chowdhury, Evelina Fedorenko, Alessandro Lenci

    Abstract: Word co-occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse semantic tasks requiring world knowledge. An important but understudied question about LLMs' semantic abilities is whether they acquire generalized knowledge of co… ▽ More

    Submitted 26 October, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: The two lead authors have contributed equally to this work

  8. arXiv:2209.11633  [pdf, ps, other

    cs.SE

    Formal Semantics of the CDL Language

    Authors: Thorsten Berger, Steven She

    Abstract: We reverse-engineer a formal semantics of the Component Definition Language (CDL), which is part of the highly configurable, embedded operating system eCos. This work provides the basis for an analysis and comparison of the two variability-modeling languages Kconfig and CDL. The semantics given in this document are based on analyzing the CDL documentation, inspecting the source code of the toolcha… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: Technical Note, Department of Computer Science, University of Leipzig, Germany

  9. arXiv:2209.04916  [pdf, ps, other

    cs.SE

    Formal Semantics of the Kconfig Language

    Authors: Steven She, Thorsten Berger

    Abstract: The Kconfig language defines a set of symbols that are assigned a value in a configuration. We describe the semantics of the Kconfig language according to the behavior exhibited in the xconfig configurator. We assume an abstract syntax representation for concepts in the Kconfig language and delegate the details of the translation from concrete to abstract syntaxes to a later document.

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: Technical Note, Department of Electrical and Computer Engineering, University of Waterloo, Canada

  10. arXiv:1710.10523  [pdf, other

    cs.RO

    Autonomous Mobile Robot Navigation in Uneven and Unstructured Indoor Environments

    Authors: Chaoqun Wang, Lili Meng, Sizhen She, Ian M. Mitchell, Teng Li, Frederick Tung, Weiwei Wan, Max. Q. -H. Meng, Clarence W. de Silva

    Abstract: Robots are increasingly operating in indoor environments designed for and shared with people. However, robots working safely and autonomously in uneven and unstructured environments still face great challenges. Many modern indoor environments are designed with wheelchair accessibility in mind. This presents an opportunity for wheeled robots to navigate through sloped areas while avoiding staircase… ▽ More

    Submitted 28 October, 2017; originally announced October 2017.

    Comments: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)