Skip to main content

Showing 1–15 of 15 results for author: An, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18192  [pdf, other

    cs.CL cs.AI

    Methodology of Adapting Large English Language Models for Specific Cultural Contexts

    Authors: Wen**g Zhang, Siqi Xiao, Xuejiao Lei, Ning Wang, Huazheng Zhang, Meijuan An, Bikun Yang, Zhaoxiang Liu, Kai Wang, Shiguo Lian

    Abstract: The rapid growth of large language models(LLMs) has emerged as a prominent trend in the field of artificial intelligence. However, current state-of-the-art LLMs are predominantly based on English. They encounter limitations when directly applied to tasks in specific cultural domains, due to deficiencies in domain-specific knowledge and misunderstandings caused by differences in cultural values. To… ▽ More

    Submitted 26 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: 11 pages, 2 figures

  2. arXiv:2406.11244  [pdf, other

    cs.LG cs.AI

    SpoT-Mamba: Learning Long-Range Dependency on Spatio-Temporal Graphs with Selective State Spaces

    Authors: **hyeok Choi, Heehyeon Kim, Minhyeong An, Joyce Jiyoung Whang

    Abstract: Spatio-temporal graph (STG) forecasting is a critical task with extensive applications in the real world, including traffic and weather forecasting. Although several recent methods have been proposed to model complex dynamics in STGs, addressing long-range spatio-temporal dependencies remains a significant challenge, leading to limited performance gains. Inspired by a recently proposed state space… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures, 3 tables. Spatio-Temporal Reasoning and Learning (STRL) Workshop at the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

  3. arXiv:2406.10311  [pdf, other

    cs.CL cs.AI

    CHiSafetyBench: A Chinese Hierarchical Safety Benchmark for Large Language Models

    Authors: Wen**g Zhang, Xuejiao Lei, Zhaoxiang Liu, Meijuan An, Bikun Yang, KaiKai Zhao, Kai Wang, Shiguo Lian

    Abstract: With the profound development of large language models(LLMs), their safety concerns have garnered increasing attention. However, there is a scarcity of Chinese safety benchmarks for LLMs, and the existing safety taxonomies are inadequate, lacking comprehensive safety detection capabilities in authentic Chinese scenarios. In this work, we introduce CHiSafetyBench, a dedicated safety benchmark for e… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 13 pages, 3 figures

  4. arXiv:2405.12486  [pdf, other

    cs.IR cs.AI

    Time Matters: Enhancing Pre-trained News Recommendation Models with Robust User Dwell Time Injection

    Authors: Hao Jiang, Chuanzhen Li, Mingxiao An

    Abstract: Large Language Models (LLMs) have revolutionized text comprehension, leading to State-of-the-Art (SOTA) news recommendation models that utilize LLMs for in-depth news understanding. Despite this, accurately modeling user preferences remains challenging due to the inherent uncertainty of click behaviors. Techniques like multi-head attention in Transformers seek to alleviate this by capturing intera… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 10 pages,5 figures

  5. arXiv:2404.16660  [pdf, other

    cs.HC cs.AI cs.LG

    Benchmarking Mobile Device Control Agents across Diverse Configurations

    Authors: Juyong Lee, Taywon Min, Minyong An, Changyeon Kim, Kimin Lee

    Abstract: Develo** autonomous agents for mobile devices can significantly enhance user interactions by offering increased efficiency and accessibility. However, despite the growing interest in mobile device control agents, the absence of a commonly adopted benchmark makes it challenging to quantify scientific progress in this area. In this work, we introduce B-MoCA: a novel benchmark designed specifically… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted (Spotlight) to ICLR 2024 Workshop on Generative Models for Decision Making. Project website: https://b-moca.github.io

  6. arXiv:2404.01863  [pdf, other

    cs.LG cs.AI

    Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models

    Authors: Kyuyoung Kim, Jongheon Jeong, Minyong An, Mohammad Ghavamzadeh, Krishnamurthy Dvijotham, **woo Shin, Kimin Lee

    Abstract: Fine-tuning text-to-image models with reward functions trained on human feedback data has proven effective for aligning model behavior with human intent. However, excessive optimization with such reward models, which serve as mere proxy objectives, can compromise the performance of fine-tuned models, a phenomenon known as reward overoptimization. To investigate this issue in depth, we introduce th… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: ICLR 2024

  7. Blockchain technology research and application: a systematic literature review and future trends

    Authors: Min An, Qiyuan Fan, Hao Yu, Haiyang Zhao

    Abstract: Blockchain, as the basis for cryptocurrencies, has received extensive attentions recently. Blockchain serves as an immutable distributed ledger technology which allows transactions to be carried out credibly in a decentralized environment. Blockchain-based applications are springing up, covering numerous fields including financial services, reputation system and Internet of Things (IoT), and so on… ▽ More

    Submitted 26 June, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Report number: 2972-3841

    Journal ref: Journal of Data Science and Intelligent Systems 3 (2023) 1-13

  8. arXiv:2306.10073  [pdf, other

    cs.CY cs.AI cs.CL cs.SE

    Thrilled by Your Progress! Large Language Models (GPT-4) No Longer Struggle to Pass Assessments in Higher Education Programming Courses

    Authors: Jaromir Savelka, Arav Agarwal, Marshall An, Chris Bogart, Majd Sakr

    Abstract: This paper studies recent developments in large language models' (LLM) abilities to pass assessments in introductory and intermediate Python programming courses at the postsecondary level. The emergence of ChatGPT resulted in heated debates of its potential uses (e.g., exercise generation, code explanation) as well as misuses in programming classes (e.g., cheating). Recent studies show that while… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Journal ref: ICER '23: Proceedings of the 2023 ACM Conference on International Computing Education Research - Volume 1. August 2023. Pages 78 - 92

  9. arXiv:2305.13788  [pdf, other

    cs.CL cs.AI

    Can Large Language Models Capture Dissenting Human Voices?

    Authors: Noah Lee, Na Min An, James Thorne

    Abstract: Large language models (LLMs) have shown impressive achievements in solving a broad range of tasks. Augmented by instruction fine-tuning, LLMs have also been shown to generalize in zero-shot settings as well. However, whether LLMs closely align with the human disagreement distribution has not been well-studied, especially within the scope of natural language inference (NLI). In this paper, we evalu… ▽ More

    Submitted 27 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: To appear at EMNLP 2023

  10. arXiv:2210.05391  [pdf, other

    cs.CV

    PP-StructureV2: A Stronger Document Analysis System

    Authors: Chenxia Li, Ruoyu Guo, Jun Zhou, Mengtao An, Yuning Du, Lingfeng Zhu, Yi Liu, Xiaoguang Hu, Dianhai Yu

    Abstract: A large amount of document data exists in unstructured form such as raw images without any text information. Designing a practical document image analysis system is a meaningful but challenging task. In previous work, we proposed an intelligent document analysis system PP-Structure. In order to further upgrade the function and performance of PP-Structure, we propose PP-StructureV2 in this work, wh… ▽ More

    Submitted 13 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

  11. arXiv:2207.06529  [pdf, other

    stat.ML cs.LG

    Estimating Classification Confidence Using Kernel Densities

    Authors: Peter Salamon, David Salamon, V. Adrian Cantu, Michelle An, Tyler Perry, Robert A. Edwards, Anca M. Segall

    Abstract: This paper investigates the post-hoc calibration of confidence for "exploratory" machine learning classification problems. The difficulty in these problems stems from the continuing desire to push the boundaries of which categories have enough examples to generalize from when curating datasets, and confusion regarding the validity of those categories. We argue that for such problems the "one-versu… ▽ More

    Submitted 14 September, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

  12. arXiv:2009.14025   

    cs.LG

    Machine-Learning Approach to Analyze the Status of Forklift Vehicles with Irregular Movement in a Shipyard

    Authors: Hyeonju Lee, Jongho Lee, Minji An, Gunil Park, Sungchul Choi

    Abstract: In large shipyards, the management of equipment, which are used for building a variety of ships, is critical. Because orders vary year to year, shipyard managers are required to determine methods to make the most of their limited resources. A particular difficulty that arises because of the nature and size of shipyards is the management of moving vehicles. In recent years, shipbuilding companies h… ▽ More

    Submitted 12 October, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: I withdraw this paper because an error in the experiment has been found

  13. arXiv:1907.05576  [pdf, other

    cs.CL cs.IR cs.LG

    Neural News Recommendation with Attentive Multi-View Learning

    Authors: Chuhan Wu, Fangzhao Wu, Mingxiao An, Jianqiang Huang, Yongfeng Huang, Xing Xie

    Abstract: Personalized news recommendation is very important for online news platforms to help users find interested news and improve user experience. News and user representation learning is critical for news recommendation. Existing news recommendation methods usually learn these representations based on single news information, e.g., title, which may be insufficient. In this paper we propose a neural new… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

  14. NPA: Neural News Recommendation with Personalized Attention

    Authors: Chuhan Wu, Fangzhao Wu, Mingxiao An, Jianqiang Huang, Yongfeng Huang, Xing Xie

    Abstract: News recommendation is very important to help users find interested news and alleviate information overload. Different users usually have different interests and the same user may have various interests. Thus, different users may click the same news article with attention on different aspects. In this paper, we propose a neural news recommendation model with personalized attention (NPA). The core… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

  15. arXiv:1811.03821  [pdf, other

    cs.LG stat.ML

    Skeptical Deep Learning with Distribution Correction

    Authors: Mingxiao An, Yongzhou Chen, Qi Liu, Chuanren Liu, Guangyi Lv, Fangzhao Wu, Jianhui Ma

    Abstract: Recently deep neural networks have been successfully used for various classification tasks, especially for problems with massive perfectly labeled training data. However, it is often costly to have large-scale credible labels in real-world applications. One solution is to make supervised learning robust with imperfectly labeled input. In this paper, we develop a distribution correction approach th… ▽ More

    Submitted 13 January, 2019; v1 submitted 9 November, 2018; originally announced November 2018.