Showing 1–32 of 32 results for author: Guan, N

  1. arXiv:2406.11010  [pdf, other]

    cs.LG cs.GT

    WeShap: Weak Supervision Source Evaluation with Shapley Values

    Authors: Naiqing Guan, Nick Koudas

    Abstract: Efficient data annotation stands as a significant bottleneck in training contemporary machine learning models. The Programmatic Weak Supervision (PWS) pipeline presents a solution by utilizing multiple weak supervision sources to automatically label data, thereby expediting the annotation process. Given the varied contributions of these weak supervision sources to the accuracy of PWS, it is impera…

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2405.19694  [pdf, other]

    cs.AI

    Grade Like a Human: Rethinking Automated Assessment with Large Language Models

    Authors: Wenjing Xie, Juxin Niu, Chun Jason Xue, Nan Guan

    Abstract: While large language models (LLMs) have been used for automated grading, they have not yet achieved the same level of performance as humans, especially when it comes to grading complex questions. Existing research on this topic focuses on a particular step in the grading procedure: grading using predefined rubrics. However, grading is a multifaceted procedure that encompasses other crucial steps,…

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.17372  [pdf, other]

    cs.AI cs.LG cs.RO

    BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction

    Authors: Zikang Zhou, Haibo Hu, Xinhong Chen, Jianping Wang, Nan Guan, Kui Wu, Yung-Hui Li, Yu-Kai Huang, Chun Jason Xue

    Abstract: Simulating realistic interactions among traffic agents is crucial for efficiently validating the safety of autonomous driving systems. Existing leading simulators primarily use an encoder-decoder structure to encode the historical trajectories for future simulation. However, such a paradigm complicates the model architecture, and the manual separation of history and future trajectories leads to lo…

    Submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2405.15198  [pdf, other]

    cs.CL

    RAEE: A Training-Free Retrieval-Augmented Early Exiting Framework for Efficient Inference

    Authors: Lianming Huang, Shangyu Wu, Yufei Cui, Ying Xiong, Xue Liu, Tei-Wei Kuo, Nan Guan, Chun Jason Xue

    Abstract: Deploying large language model inference remains challenging due to its high computational overhead. Early exiting accelerates model inference by adaptively reducing the number of inference layers. Existing methods require training internal classifiers to determine whether to exit at each intermediate layer. However, such classifier-based early exiting frameworks require significant effort to de…

    Submitted 24 May, 2024; originally announced May 2024.

  5. arXiv:2405.08197  [pdf, other]

    cs.CV

    IHC Matters: Incorporating IHC analysis to H&E Whole Slide Image Analysis for Improved Cancer Grading via Two-stage Multimodal Bilinear Pooling Fusion

    Authors: Jun Wang, Yu Mao, Yufei Cui, Nan Guan, Chun Jason Xue

    Abstract: Immunohistochemistry (IHC) plays a crucial role in pathology as it detects the over-expression of protein in tissue samples. However, there are still few machine learning model studies on IHC's impact on accurate cancer grading. We discovered that IHC and H&E possess distinct advantages and disadvantages while possessing certain complementary qualities. Building on this observation, we develope…

    Submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2404.15096  [pdf, other]

    cs.RO cs.LG

    Impedance Matching: Enabling an RL-Based Running Jump in a Quadruped Robot

    Authors: Neil Guan, Shangqun Yu, Shifan Zhu, Donghyun Kim

    Abstract: Replicating the remarkable athleticism seen in animals has long been a challenge in robotics control. Although Reinforcement Learning (RL) has demonstrated significant progress in dynamic legged locomotion control, the substantial sim-to-real gap often hinders the real-world demonstration of truly dynamic movements. We propose a new framework to mitigate this gap through frequency-domain analysis-…

    Submitted 29 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted by Ubiquitous Robots 2024

  7. arXiv:2404.11161  [pdf, other]

    cs.CV cs.LG

    Pre-processing matters: A segment search method for WSI classification

    Authors: Jun Wang, Yufei Cui, Yu Mao, Nan Guan, Chun Jason Xue

    Abstract: Pre-processing for whole slide images can affect classification performance both in the training and inference stages. Our study analyzes the impact of pre-processing parameters on inference and training across single- and multiple-domain datasets. However, searching for an optimal parameter set is time-consuming. To overcome this, we propose a novel Similarity-based Simulated Annealing approach f…

    Submitted 17 April, 2024; originally announced April 2024.

  8. arXiv:2403.01384  [pdf, other]

    cs.LG cs.AI cs.CL

    On the Compressibility of Quantized Large Language Models

    Authors: Yu Mao, Weilan Wang, Hongchao Du, Nan Guan, Chun Jason Xue

    Abstract: Deploying Large Language Models (LLMs) on edge or mobile devices offers significant benefits, such as enhanced data privacy and real-time processing capabilities. However, it also faces critical challenges due to the substantial memory requirement of LLMs. Quantization is an effective way of reducing the model size while maintaining good performance. However, even after quantization, LLMs may stil…

    Submitted 5 May, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  9. arXiv:2402.06056  [pdf, other]

    cs.LG cs.DB

    ActiveDP: Bridging Active Learning and Data Programming

    Authors: Naiqing Guan, Nick Koudas

    Abstract: Modern machine learning models require large labelled datasets to achieve good performance, but manually labelling large datasets is expensive and time-consuming. The data programming paradigm enables users to label large datasets efficiently but produces noisy labels, which deteriorates the downstream model's performance. The active learning paradigm, on the other hand, can acquire accurate label…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: accepted by EDBT 2024 research track

  10. arXiv:2311.00739  [pdf, other]

    cs.CL cs.DB cs.LG

    Can Large Language Models Design Accurate Label Functions?

    Authors: Naiqing Guan, Kaiwen Chen, Nick Koudas

    Abstract: Programmatic weak supervision methodologies facilitate the expedited labeling of extensive datasets through the use of label functions (LFs) that encapsulate heuristic data sources. Nonetheless, the creation of precise LFs necessitates domain expertise and substantial endeavors. Recent advances in pre-trained language models (PLMs) have exhibited substantial potential across diverse tasks. However…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 9 pages, submitted to VLDB 2024

    ACM Class: H.2.8; I.5.4

  11. arXiv:2310.15471  [pdf, other]

    cs.DC

    Multi-Path Bound for DAG Tasks

    Authors: Qingqiang He, Nan Guan, Shuai Zhao, Mingsong Lv

    Abstract: This paper studies the response time bound of a DAG (directed acyclic graph) task. Recently, the idea of using multiple paths to bound the response time of a DAG task, instead of using a single longest path in previous results, was proposed and leads to the so-called multi-path bound. Multi-path bounds can greatly reduce the response time bound and significantly improve the schedulability of DAG t…

    Submitted 23 October, 2023; originally announced October 2023.

  12. arXiv:2309.09276  [pdf, other]

    cs.CV cs.LG

    MVP: Meta Visual Prompt Tuning for Few-Shot Remote Sensing Image Scene Classification

    Authors: Junjie Zhu, Yiying Li, Chunping Qiu, Ke Yang, Naiyang Guan, Xiaodong Yi

    Abstract: Vision Transformer (ViT) models have recently emerged as powerful and versatile models for various visual tasks. Recently, a work called PMF has achieved promising results in few-shot image classification by utilizing pre-trained vision transformer models. However, PMF employs full fine-tuning for learning the downstream tasks, leading to significant overfitting and storage issues, especially in t…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: SUBMIT TO IEEE TRANSACTIONS

  13. arXiv:2309.04806  [pdf, other]

    cs.CV cs.AI

    Timely Fusion of Surround Radar/Lidar for Object Detection in Autonomous Driving Systems

    Authors: Wenjing Xie, Tao Hu, Neiwen Ling, Guoliang Xing, Chun Jason Xue, Nan Guan

    Abstract: Fusing Radar and Lidar sensor data can fully utilize their complementary advantages and provide a more accurate reconstruction of the surroundings for autonomous driving systems. Surround Radar/Lidar can provide 360-degree view sampling at minimal cost, making them promising sensing hardware solutions for autonomous driving systems. However, due to intrinsic physical constraints, the rotating…

    Submitted 27 May, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

  14. arXiv:2307.13401  [pdf, other]

    cs.DC

    Longer Is Shorter: Making Long Paths to Improve the Worst-Case Response Time of DAG Tasks

    Authors: Qingqiang He, Nan Guan, Mingsong Lv

    Abstract: DAG (directed acyclic graph) tasks are widely used to model parallel real-time workloads. The real-time performance of a DAG task depends not only on its total workload but also on its graph structure. Intuitively, with the same total workload, a DAG task with looser precedence constraints tends to have better real-time performance in terms of worst-case response time. However, this paper shows that…

    Submitted 25 July, 2023; originally announced July 2023.

  15. arXiv:2307.04339  [pdf, other]

    cs.DC cs.AI

    Miriam: Exploiting Elastic Kernels for Real-time Multi-DNN Inference on Edge GPU

    Authors: Zhihe Zhao, Neiwen Ling, Nan Guan, Guoliang Xing

    Abstract: Many applications, such as autonomous driving and augmented reality, require the concurrent running of multiple deep neural networks (DNNs) that pose different levels of real-time performance requirements. However, coordinating multiple DNN tasks with varying levels of criticality on edge GPUs remains an area of limited study. Unlike server-level GPUs, edge GPUs are resource-limited and lack hardwa…

    Submitted 10 July, 2023; originally announced July 2023.

  16. arXiv:2307.01515  [pdf, other]

    cs.CV

    LPN: Language-guided Prototypical Network for few-shot classification

    Authors: Kaihui Cheng, Chule Yang, Xiao Liu, Naiyang Guan, Zhiyuan Wang

    Abstract: Few-shot classification aims to adapt to new tasks with limited labeled examples. To fully use the accessible data, recent methods explore suitable measures for the similarity between the query and support images and better high-dimensional features with meta-training and pre-training strategies. However, the potential of multi-modality information has barely been explored, which may bring promisi…

    Submitted 21 October, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

  17. arXiv:2304.13639  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    PVP: Pre-trained Visual Parameter-Efficient Tuning

    Authors: Zhao Song, Ke Yang, Naiyang Guan, Junjie Zhu, Peng Qiao, Qingyong Hu

    Abstract: Large-scale pre-trained transformers have demonstrated remarkable success in various computer vision tasks. However, it is still highly challenging to fully fine-tune these models for downstream tasks due to their high computational and storage costs. Recently, Parameter-Efficient Tuning (PETuning) techniques, e.g., Visual Prompt Tuning (VPT) and Low-Rank Adaptation (LoRA), have significantly redu…

    Submitted 26 April, 2023; originally announced April 2023.

  18. arXiv:2211.13724  [pdf, other]

    cs.LG cs.CV

    Estimating Regression Predictive Distributions with Sample Networks

    Authors: Ali Harakeh, Jordan Hu, Naiqing Guan, Steven L. Waslander, Liam Paull

    Abstract: Estimating the uncertainty in deep neural network predictions is crucial for many real-world applications. A common approach to model uncertainty is to choose a parametric distribution and fit the data to it using maximum likelihood estimation. The chosen parametric form can be a poor fit to the data-generating distribution, resulting in unreliable uncertainty estimates. In this work, we propose S…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted for publication in AAAI 2023. Example code at: https://samplenet.github.io/

  19. arXiv:2211.08800  [pdf, other]

    cs.DC

    Bounding the Response Time of DAG Tasks Using Long Paths

    Authors: Qingqiang He, Nan Guan, Mingsong Lv, Xu Jiang, Wanli Chang

    Abstract: In 1969, Graham developed a well-known response time bound for a DAG task using the total workload and the longest path of the DAG, which has been widely applied to solve many scheduling and analysis problems of DAG-based task systems. This paper presents a new response time bound for a DAG task using the total workload and the lengths of multiple long paths of the DAG, instead of the longest path…

    Submitted 17 November, 2022; v1 submitted 16 November, 2022; originally announced November 2022.

  20. arXiv:2204.12586  [pdf]

    q-bio.BM cs.LG

    Enhanced compound-protein binding affinity prediction by representing protein multimodal information via a coevolutionary strategy

    Authors: Binjie Guo, Hanyu Zheng, Haohan Jiang, Xiaodan Li, Naiyu Guan, Yanming Zuo, Yicheng Zhang, Hengfu Yang, Xuhua Wang

    Abstract: Due to the lack of a method to efficiently represent the multimodal information of a protein, including its structure and sequence information, predicting compound-protein binding affinity (CPA) still suffers from low accuracy when applying machine learning methods. To overcome this limitation, in a novel end-to-end architecture (named FeatNN), we develop a coevolutionary strategy to jointly repre…

    Submitted 23 November, 2022; v1 submitted 29 March, 2022; originally announced April 2022.

    Comments: 53 pages, 14 figures, 3 tables

  21. arXiv:2201.05752  [pdf, other]

    cs.LG cs.PL

    Moses: Efficient Exploitation of Cross-device Transferable Features for Tensor Program Optimization

    Authors: Zhihe Zhao, Xian Shuai, Yang Bai, Neiwen Ling, Nan Guan, Zhenyu Yan, Guoliang Xing

    Abstract: Achieving efficient execution of machine learning models has attracted significant attention recently. To generate tensor programs efficiently, a key component of DNN compilers is the cost model that can predict the performance of each configuration on specific devices. However, due to the rapid emergence of hardware platforms, it is increasingly labor-intensive to train domain-specific predictors…

    Submitted 14 January, 2022; originally announced January 2022.

  22. arXiv:2105.01892  [pdf, other]

    cs.AR cs.PF

    TENET: A Framework for Modeling Tensor Dataflow Based on Relation-centric Notation

    Authors: Liqiang Lu, Naiqing Guan, Yuyue Wang, Liancheng Jia, Zizhang Luo, Jieming Yin, Jason Cong, Yun Liang

    Abstract: Accelerating tensor applications on spatial architectures provides high performance and energy efficiency, but requires accurate performance models for evaluating various dataflow alternatives. Such modeling relies on the notation of tensor dataflow and the formulation of performance metrics. Recently proposed compute-centric and data-centric notations describe the dataflow using imperative directiv…

    Submitted 5 May, 2021; originally announced May 2021.

  23. arXiv:2011.06762  [pdf, other]

    cs.DC

    Schedulability Bounds for Parallel Real-Time Tasks under Global Rate-Monotonic Scheduling

    Authors: Xu Jiang, Nan Guan, Maolin Yang, Yue Tang, Wang Yi

    Abstract: Schedulability bounds not only serve as efficient tests to decide schedulability of real-time task systems but also reveal insights about the worst-case performance of scheduling algorithms. Different from sequential real-time task systems for which utilization is a suitable metric to develop schedulability bounds, schedulability of parallel real-time tasks depends on not only utilization but also…

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: 11 pages

  24. arXiv:2007.00706  [pdf, other]

    cs.OS

    DPCP-p: A Distributed Locking Protocol for Parallel Real-Time Tasks

    Authors: Maolin Yang, Zewei Chen, Xu Jiang, Nan Guan, Hang Lei

    Abstract: Real-time scheduling and locking protocols are fundamental facilities for constructing time-critical systems. For parallel real-time tasks, predictable locking protocols are required when concurrent sub-jobs require mutually exclusive access to shared resources. This paper studies, for the first time, the distributed synchronization framework of parallel real-time tasks, where both tasks and global resources ar…

    Submitted 1 July, 2020; originally announced July 2020.

  25. arXiv:2003.08233  [pdf, other]

    cs.DC

    On the Analysis of Parallel Real-Time Tasks with Spin Locks

    Authors: Xu Jiang, Nan Guan, He Du, Weichen Liu, Wang Yi

    Abstract: A locking protocol is an essential component in the resource management of real-time systems; it coordinates mutually exclusive accesses to shared resources from different tasks. Although the design and analysis of locking protocols have been intensively studied for sequential real-time tasks, there has been little work on this topic for parallel real-time tasks. In this paper, we study the analysis…

    Submitted 18 March, 2020; originally announced March 2020.

  26. arXiv:1906.00495  [pdf, other]

    cs.LG cs.CV stat.ML

    Truncated Cauchy Non-negative Matrix Factorization

    Authors: Naiyang Guan, Tongliang Liu, Yangmuzi Zhang, Dacheng Tao, Larry S. Davis

    Abstract: Non-negative matrix factorization (NMF) minimizes the Euclidean distance between the data matrix and its low rank approximation, and it fails when applied to corrupted data because the loss function is sensitive to outliers. In this paper, we propose a Truncated CauchyNMF loss that handles outliers by truncating large errors, and develop a Truncated CauchyNMF to robustly learn the subspace on noisy…

    Submitted 2 June, 2019; originally announced June 2019.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE T-PAMI), vol. 41, no. 1, pp. 246-259, Jan. 2019

  27. arXiv:1809.07689  [pdf, other]

    cs.DC

    Response Time Bounds for Typed DAG Parallel Tasks on Heterogeneous Multi-cores

    Authors: Meiling Han, Nan Guan, Jinghao Sun, Qingqiang He, Qingxu Deng, Weichen Liu

    Abstract: Heterogeneous multi-cores utilize the strength of different architectures for executing particular types of workload, and usually offer higher performance and energy efficiency. In this paper, we study the worst-case response time (WCRT) analysis of typed scheduling of parallel DAG tasks on heterogeneous multi-cores, where the workload of each vertex in the DAG is only allowed to execute on…

    Submitted 7 August, 2018; originally announced September 2018.

  28. arXiv:1711.00100  [pdf, ps, other]

    cs.DC

    Utilization-Based Scheduling of Flexible Mixed-Criticality Real-Time Tasks

    Authors: Gang Chen, Nan Guan, Di Liu, Qingqiang He, Kai Huang, Todor Stefanov, Wang Yi

    Abstract: Mixed-criticality models are an emerging paradigm for the design of real-time systems because of their significantly improved resource efficiency. However, formal mixed-criticality models have traditionally been characterized by two impractical assumptions: once any high-criticality task overruns, all low-criticality tasks are suspended and all other high-criticality tas…

    Submitted 29 September, 2017; originally announced November 2017.

    Comments: This paper was submitted to IEEE Transactions on Computers (TC) on 9 September 2016

  29. arXiv:1705.03245  [pdf, other]

    cs.DC

    Semi-Federated Scheduling of Parallel Real-Time Tasks on Multiprocessors

    Authors: Xu Jiang, Nan Guan, Xiang Long, Wang Yi

    Abstract: Federated scheduling is a promising approach to schedule parallel real-time tasks on multi-cores, where each heavy task exclusively executes on a number of dedicated processors, while light tasks are treated as sequential sporadic tasks and share the remaining processors. However, federated scheduling suffers resource waste since a heavy task with processing capacity requirement $x + ε$ (where…

    Submitted 9 May, 2017; originally announced May 2017.

  30. EDF-VD Scheduling of Mixed-Criticality Systems with Degraded Quality Guarantees

    Authors: Di Liu, Jelena Spasic, Gang Chen, Nan Guan, Songran Liu, Todor Stefanov, Wang Yi

    Abstract: This paper studies real-time scheduling of mixed-criticality systems where low-criticality tasks are still guaranteed some service in the high-criticality mode, with reduced execution budgets. First, we present a utilization-based schedulability test for such systems under EDF-VD scheduling. Second, we quantify the suboptimality of EDF-VD (with our test condition) in terms of speedup factors. In g…

    Submitted 4 May, 2016; originally announced May 2016.

  31. arXiv:1207.3438  [pdf, ps, other]

    stat.ML cs.LG math.NA

    MahNMF: Manhattan Non-negative Matrix Factorization

    Authors: Naiyang Guan, Dacheng Tao, Zhigang Luo, John Shawe-Taylor

    Abstract: Non-negative matrix factorization (NMF) approximates a non-negative matrix $X$ by a product of two non-negative low-rank factor matrices $W$ and $H$. NMF and its extensions minimize either the Kullback-Leibler divergence or the Euclidean distance between $X$ and $W^T H$ to model the Poisson noise or the Gaussian noise. In practice, when the noise distribution is heavy tailed, they cannot perform w…

    Submitted 14 July, 2012; originally announced July 2012.

    Comments: 43 pages, 20 figures, 2 tables, submission to Journal of Machine Learning Research

    MSC Class: 65K10

    ACM Class: I.2.4; I.2.10; I.4.6; I.4.8; I.5.3; I.5.4; G.1.6

  32. Viscosity and dilepton production of a chemically equilibrating quark-gluon plasma at finite baryon density

    Authors: N. N. Guan, Z. J. He, J. L. Long, X. Z. Cai, Y. G. Ma, J. W. Li, W. Q. Shen

    Abstract: By considering the effect of shear viscosity we have investigated the evolution of a chemically equilibrating quark-gluon plasma at finite baryon density. Based on the evolution of the system we have performed a complete calculation for the dilepton production from the following processes: $q\bar{q}{\to}l\bar{l}$, $q\bar{q}{\to}gl\bar{l}$, Compton-like scattering ($qg{\to}ql\bar{l}$,…

    Submitted 2 September, 2009; originally announced September 2009.

    Comments: 9 pages, 8 figures

    Journal ref: Phys.Rev.C80:014908,2009