Skip to main content

Showing 1–9 of 9 results for author: Bian, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.03199  [pdf, other

    cs.LG

    Boosting MLPs with a Coarsening Strategy for Long-Term Time Series Forecasting

    Authors: Nannan Bian, Minhong Zhu, Li Chen, Weiran Cai

    Abstract: Deep learning methods have been exerting their strengths in long-term time series forecasting. However, they often struggle to strike a balance between expressive power and computational efficiency. Resorting to multi-layer perceptrons (MLPs) provides a compromising solution, yet they suffer from two critical problems caused by the intrinsic point-wise map** mode, in terms of deficient contextua… ▽ More

    Submitted 20 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  2. arXiv:2402.14355  [pdf, other

    cs.CL

    Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models?

    Authors: Ning Bian, Xianpei Han, Hongyu Lin, Yaojie Lu, Ben He, Le Sun

    Abstract: Building machines with commonsense has been a longstanding challenge in NLP due to the reporting bias of commonsense rules and the exposure bias of rule-based commonsense reasoning. In contrast, humans convey and pass down commonsense implicitly through stories. This paper investigates the inherent commonsense ability of large language models (LLMs) expressed through storytelling. We systematicall… ▽ More

    Submitted 4 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024

  3. arXiv:2305.04812  [pdf, other

    cs.CL

    Influence of External Information on Large Language Models Mirrors Social Cognitive Patterns

    Authors: Ning Bian, Hongyu Lin, Peilin Liu, Yaojie Lu, Chunkang Zhang, Ben He, Xianpei Han, Le Sun

    Abstract: Social cognitive theory explains how people learn and acquire knowledge through observing others. Recent years have witnessed the rapid development of large language models (LLMs), which suggests their potential significance as agents in the society. LLMs, as AI agents, can observe external information, which shapes their cognition and behaviors. However, the extent to which external information i… ▽ More

    Submitted 20 October, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

  4. arXiv:2303.16421  [pdf, other

    cs.CL

    ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models

    Authors: Ning Bian, Xianpei Han, Le Sun, Hongyu Lin, Yaojie Lu, Ben He, Shanshan Jiang, Bin Dong

    Abstract: Large language models (LLMs) have made significant progress in NLP. However, their ability to memorize, represent, and leverage commonsense knowledge has been a well-known pain point. In this paper, we specifically focus on ChatGPT, a widely used and easily accessible LLM, and ask the following questions: (1) Can ChatGPT effectively answer commonsense questions? (2) Is ChatGPT aware of the underly… ▽ More

    Submitted 19 April, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: Accepted by LREC-COLING 2024

  5. arXiv:2107.08582  [pdf, other

    cs.CL

    Bridging the Gap between Language Model and Reading Comprehension: Unsupervised MRC via Self-Supervision

    Authors: Ning Bian, Xianpei Han, Bo Chen, Hongyu Lin, Ben He, Le Sun

    Abstract: Despite recent success in machine reading comprehension (MRC), learning high-quality MRC models still requires large-scale labeled training data, even using strong pre-trained language models (PLMs). The pre-training tasks for PLMs are not question-answering or MRC-based tasks, making existing PLMs unable to be directly used for unsupervised MRC. Specifically, MRC aims to spot an accurate answer s… ▽ More

    Submitted 18 July, 2021; originally announced July 2021.

  6. arXiv:2101.00760  [pdf, other

    cs.CL

    Benchmarking Knowledge-Enhanced Commonsense Question Answering via Knowledge-to-Text Transformation

    Authors: Ning Bian, Xianpei Han, Bo Chen, Le Sun

    Abstract: A fundamental ability of humans is to utilize commonsense knowledge in language understanding and question answering. In recent years, many knowledge-enhanced Commonsense Question Answering (CQA) approaches have been proposed. However, it remains unclear: (1) How far can we get by exploiting external knowledge for CQA? (2) How much potential of knowledge has been exploited in current CQA models? (… ▽ More

    Submitted 4 January, 2021; v1 submitted 3 January, 2021; originally announced January 2021.

    Comments: Accepted to AAAI2021

  7. arXiv:2012.04334  [pdf, other

    cs.CL

    From Bag of Sentences to Document: Distantly Supervised Relation Extraction via Machine Reading Comprehension

    Authors: Lingyong Yan, Xianpei Han, Le Sun, Fangchao Liu, Ning Bian

    Abstract: Distant supervision (DS) is a promising approach for relation extraction but often suffers from the noisy label problem. Traditional DS methods usually represent an entity pair as a bag of sentences and denoise labels using multi-instance learning techniques. The bag-based paradigm, however, fails to leverage the inter-sentence-level and the entity-level evidence for relation extraction, and their… ▽ More

    Submitted 8 December, 2020; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 12 pages, 3 figures

  8. arXiv:1907.06566  [pdf, other

    eess.IV cs.LG stat.ML

    Improved Hybrid Layered Image Compression using Deep Learning and Traditional Codecs

    Authors: Haisheng Fu, Feng Liang, Bo Lei, Nai Bian, Qian zhang, Mohammad Akbari, Jie Liang, Chengjie Tu

    Abstract: Recently deep learning-based methods have been applied in image compression and achieved many promising results. In this paper, we propose an improved hybrid layered image compression framework by combining deep learning and the traditional image codecs. At the encoder, we first use a convolutional neural network (CNN) to obtain a compact representation of the input image, which is losslessly enco… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: Submitted to Signal Processing: Image Communication

    Report number: 1907.06566

    Journal ref: Volume 82, March 2020, 115774

  9. arXiv:1907.01714  [pdf, other

    cs.CV

    A Deep Image Compression Framework for Face Recognition

    Authors: Nai Bian, Feng Liang, Haisheng Fu, Bo Lei

    Abstract: Face recognition technology has advanced rapidly and has been widely used in various applications. Due to the extremely huge amount of data of face images and the large computing resources required correspondingly in large-scale face recognition tasks, there is a requirement for a face image compression approach that is highly suitable for face recognition tasks. In this paper, we propose a deep c… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.