Skip to main content

Showing 1–12 of 12 results for author: Kuo, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.05134  [pdf, other

    cs.CY cs.AI cs.LG

    Enhancing Deep Knowledge Tracing via Diffusion Models for Personalized Adaptive Learning

    Authors: Ming Kuo, Shouvon Sarker, Lijun Qian, Yujian Fu, Xiangfang Li, Xishuang Dong

    Abstract: In contrast to pedagogies like evidence-based teaching, personalized adaptive learning (PAL) distinguishes itself by closely monitoring the progress of individual students and tailoring the learning path to their unique knowledge and requirements. A crucial technique for effective PAL implementation is knowledge tracing, which models students' evolving knowledge to predict their future performance… ▽ More

    Submitted 24 April, 2024; originally announced May 2024.

  2. arXiv:2404.02936  [pdf, other

    cs.CL cs.LG

    Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models

    Authors: **gyang Zhang, **gwei Sun, Eric Yeats, Yang Ouyang, Martin Kuo, Jianyi Zhang, Hao Frank Yang, Hai Li

    Abstract: The problem of pre-training data detection for large language models (LLMs) has received growing attention due to its implications in critical issues like copyright violation and test data contamination. Despite improved performance, existing methods (including the state-of-the-art, Min-K%) are mostly developed upon simple heuristics and lack solid, reasonable foundations. In this work, we propose… ▽ More

    Submitted 23 May, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Project page and code is available at https://zjysteven.github.io/mink-plus-plus/

  3. arXiv:2311.04799  [pdf, other

    cs.CL cs.AI

    DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining

    Authors: Martin Kuo, Jianyi Zhang, Yiran Chen

    Abstract: Building on the cost-efficient pretraining advancements brought about by Crammed BERT, we enhance its performance and interpretability further by introducing a novel pretrained model Dependency Agreement Crammed BERT (DACBERT) and its two-stage pretraining framework - Dependency Agreement Pretraining. This framework, grounded by linguistic theories, seamlessly weaves syntax and semantic informatio… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  4. arXiv:2308.15118  [pdf, other

    cs.CL

    Large Language Models on the Chessboard: A Study on ChatGPT's Formal Language Comprehension and Complex Reasoning Skills

    Authors: Mu-Tien Kuo, Chih-Chung Hsueh, Richard Tzong-Han Tsai

    Abstract: While large language models have made strides in natural language processing, their proficiency in complex reasoning tasks requiring formal language comprehension, such as chess, remains less investigated. This paper probes the performance of ChatGPT, a sophisticated language model by OpenAI in tackling such complex reasoning tasks, using chess as a case study. Through robust metrics examining bot… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  5. arXiv:2305.05644  [pdf, other

    cs.CL cs.DC eess.SY

    Towards Building the Federated GPT: Federated Instruction Tuning

    Authors: Jianyi Zhang, Saeed Vahidian, Martin Kuo, Chunyuan Li, Ruiyi Zhang, Tong Yu, Yufan Zhou, Guoyin Wang, Yiran Chen

    Abstract: While "instruction-tuned" generative large language models (LLMs) have demonstrated an impressive ability to generalize to new tasks, the training phases heavily rely on large amounts of diverse and high-quality instruction data (such as ChatGPT and GPT-4). Unfortunately, acquiring high-quality data, especially when it comes to human-written data, can pose significant challenges both in terms of c… ▽ More

    Submitted 29 January, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Project page: https://github.com/JayZhang42/FederatedGPT-Shepherd

  6. arXiv:2112.11700  [pdf, other

    cs.CV

    Adaptive Contrast for Image Regression in Computer-Aided Disease Assessment

    Authors: Weihang Dai, Xiaomeng Li, Wan Hang Keith Chiu, Michael D. Kuo, Kwang-Ting Cheng

    Abstract: Image regression tasks for medical applications, such as bone mineral density (BMD) estimation and left-ventricular ejection fraction (LVEF) prediction, play an important role in computer-aided disease assessment. Most deep regression methods train the neural network with a single regression loss function like MSE or L1 loss. In this paper, we propose the first contrastive learning framework for d… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: Accepted in IEEE Transactions on Medical Imaging

  7. Secure Links: Secure-by-Design Communications in IEC 61499 Industrial Control Applications

    Authors: Awais Tanveer, Roopak Sinha, Matthew M. Y. Kuo

    Abstract: Increasing automation and external connectivity in industrial control systems (ICS) demand a greater emphasis on software-level communication security. In this article, we propose a secure-by-design development method for building ICS applications, where requirements from security standards like ISA/IEC 62443 are fulfilled by design-time abstractions called secure links. Proposed as an extension t… ▽ More

    Submitted 24 July, 2021; originally announced July 2021.

    Comments: Journal paper, 11 pages, 10 figures, 3 tables

    Journal ref: IEEE Transactions on Industrial Informatics 17(6)(2021), pp.3992-4002

  8. arXiv:2009.07406  [pdf, other

    cs.CL cs.AI

    Tag and Correct: Question aware Open Information Extraction with Two-stage Decoding

    Authors: Martin Kuo, Yaobo Liang, Lei Ji, Nan Duan, Linjun Shou, Ming Gong, Peng Chen

    Abstract: Question Aware Open Information Extraction (Question aware Open IE) takes question and passage as inputs, outputting an answer tuple which contains a subject, a predicate, and one or more arguments. Each field of answer is a natural language word sequence and is extracted from the passage. The semi-structured answer has two advantages which are more readable and falsifiable compared to span answer… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

    Comments: 11 pages, 1 figure, 4 tables

    MSC Class: 68T50; 68T01

  9. arXiv:2008.09394  [pdf, other

    cs.CL

    A Variational Approach to Unsupervised Sentiment Analysis

    Authors: Ziqian Zeng, Wenxuan Zhou, Xin Liu, Zizheng Lin, Yangqin Song, Michael David Kuo, Wan Hang Keith Chiu

    Abstract: In this paper, we propose a variational approach to unsupervised sentiment analysis. Instead of using ground truth provided by domain experts, we use target-opinion word pairs as a supervision signal. For example, in a document snippet "the room is big," (room, big) is a target-opinion word pair. These word pairs can be extracted by using dependency parsers and simple rules. Our objective function… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1904.05055

  10. arXiv:1912.12418  [pdf

    cs.LG cs.AI stat.ML

    Measuring group-separability in geometrical space for evaluation of pattern recognition and embedding algorithms

    Authors: A. Acevedo, S. Ciucci, MJ. Kuo, C. Duran, CV. Cannistraci

    Abstract: Evaluating data separation in a geometrical space is fundamental for pattern recognition. A plethora of dimensionality reduction (DR) algorithms have been developed in order to reveal the emergence of geometrical patterns in a low dimensional visible representation space, in which high-dimensional samples similarities are approximated by geometrical distances. However, statistical measures to eval… ▽ More

    Submitted 28 December, 2019; originally announced December 2019.

  11. arXiv:1906.10284  [pdf, other

    cs.CV

    Appearance and Shape from Water Reflection

    Authors: Ryo Kawahara, Meng-Yu Jennifer Kuo, Shohei Nobuhara, Ko Nishino

    Abstract: This paper introduces single-image geometric and appearance reconstruction from water reflection photography, i.e., images capturing direct and water-reflected real-world scenes. Water reflection offers an additional viewpoint to the direct sight, collectively forming a stereo pair. The water-reflected scene, however, includes internally scattered and reflected environmental illumination in additi… ▽ More

    Submitted 7 January, 2020; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: WACV 2020

  12. arXiv:0712.2587  [pdf, ps, other

    cs.IT

    Maximum-Likelihood Priority-First Search Decodable Codes for Combined Channel Estimation and Error Protection

    Authors: Chia-Lung Wu, Po-Ning Chen, Yunghsiang S. Han, Ming-Hsin Kuo

    Abstract: The code that combines channel estimation and error protection has received general attention recently, and has been considered a promising methodology to compensate multi-path fading effect. It has been shown by simulations that such code design can considerably improve the system performance over the conventional design with separate channel estimation and error protection modules under the sa… ▽ More

    Submitted 17 December, 2007; originally announced December 2007.

    Comments: 13 figures, 2 tables