Showing 1–50 of 122 results for author: Yin, C

  1. arXiv:2407.02730

    cs.CV cs.AI

    MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context

    Authors: Zishan Gu, Changchang Yin, Fenglin Liu, ** Zhang

    Abstract: Large Vision Language Models (LVLMs) have recently achieved superior performance in various tasks on natural image and text data, which has inspired a large number of studies on LVLM fine-tuning and training. Despite their advancements, there has been scant research on the robustness of these models against hallucination when fine-tuned on smaller datasets. In this study, we introduce a new benchmar…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.08445

    eess.AS cs.LG cs.SD

    SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models

    Authors: Chun Yin, Tai-Shih Chi, Yu Tsao, Hsin-Min Wang

    Abstract: Representations from pre-trained speech foundation models (SFMs) have shown impressive performance in many downstream tasks. However, the potential benefits of incorporating pre-trained SFM representations into speaker voice similarity assessment have not been thoroughly investigated. In this paper, we propose SVSNet+, a model that integrates pre-trained SFM representations to improve performance…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024

  3. arXiv:2406.01026

    cs.CL

    Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors

    Authors: Mengge Xue, Zhenyu Hu, Liqun Liu, Kuo Liao, Shuang Li, Honglin Han, Meng Zhao, Chengguo Yin

    Abstract: Multiple-Choice Questions (MCQs) constitute a critical area of research in the study of Large Language Models (LLMs). Previous works have investigated the selection bias problem in MCQs within few-shot scenarios, in which the LLM's performance may be influenced by the presentation of answer choices, leaving the selection bias during Supervised Fine-Tuning (SFT) unexplored. In this paper, we reveal…

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 Main

    Journal ref: ACL 2024

  4. arXiv:2405.19763

    cs.CL

    Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding

    Authors: Kuo Liao, Shuang Li, Meng Zhao, Liqun Liu, Mengge Xue, Zhenyu Hu, Honglin Han, Chengguo Yin

    Abstract: Recent strides in large language models (LLMs) have yielded remarkable performance, leveraging reinforcement learning from human feedback (RLHF) to significantly enhance generation and alignment capabilities. However, RLHF encounters numerous challenges, including the objective mismatch issue, leading to suboptimal performance in Natural Language Understanding (NLU) tasks. To address this limitati…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted at ACL 2024 Main

  5. arXiv:2405.18203

    cs.CL

    IAPT: Instruction-Aware Prompt Tuning for Large Language Models

    Authors: Wei Zhu, Aaron Xuxiang Tian, Congrui Yin, Yuan Ni, Xiaoling Wang, Guotong Xie

    Abstract: Soft prompt tuning is a widely studied parameter-efficient fine-tuning method. However, it has a clear drawback: many soft tokens must be inserted into the input sequences to guarantee downstream performance. As a result, soft prompt tuning is less considered than Low-rank adaptation (LoRA) in the large language modeling (LLM) era. In this work, we propose a novel prompt tuning method, Instruction…

    Submitted 7 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL-2024

  6. arXiv:2405.11640

    cs.AI cs.CL cs.CV

    Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical Reasoning

    Authors: Zishan Gu, Fenglin Liu, Changchang Yin, ** Zhang

    Abstract: The adoption of large language models (LLMs) in healthcare has attracted significant research interest. However, their performance in healthcare remains under-investigated and potentially limited, due to i) they lack rich domain-specific knowledge and medical reasoning skills; and ii) most state-of-the-art LLMs are unimodal, text-only models that cannot directly process multimodal inputs. To this…

    Submitted 19 May, 2024; originally announced May 2024.

  7. arXiv:2405.11597

    cs.CL cs.AI

    Language Reconstruction with Brain Predictive Coding from fMRI Data

    Authors: Congchi Yin, Ziyi Ye, Piji Li

    Abstract: Many recent studies have shown that the perception of speech can be decoded from brain signals and subsequently reconstructed as continuous language. However, there is a lack of neurological basis for how the semantic information embedded within brain signals can be used more effectively to guide language reconstruction. The theory of predictive coding suggests that the human brain naturally engages i…

    Submitted 19 May, 2024; originally announced May 2024.

  8. arXiv:2405.03943

    cs.LG cs.AI

    Predictive Modeling with Temporal Graphical Representation on Electronic Health Records

    Authors: Jiayuan Chen, Changchang Yin, Yuanlong Wang, ** Zhang

    Abstract: Deep learning-based predictive models, leveraging Electronic Health Records (EHR), are receiving increasing attention in healthcare. An effective representation of a patient's EHR should hierarchically encompass both the temporal relationships between historical visits and medical events, and the inherent structural information within these elements. Existing patient representation methods can be…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: IJCAI 2024 main track

  9. arXiv:2403.20296

    cs.IR

    Aiming at the Target: Filter Collaborative Information for Cross-Domain Recommendation

    Authors: Hanyu Li, Weizhi Ma, Peijie Sun, Jiayu Li, Cunxiang Yin, Yancheng He, Guoqiang Xu, Min Zhang, Shao** Ma

    Abstract: Cross-domain recommender (CDR) systems aim to enhance the performance of the target domain by utilizing data from other related domains. However, irrelevant information from the source domain may instead degrade target domain performance, which is known as the negative transfer problem. There have been some attempts to address this problem, mostly by designing adaptive representations for overlapp…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted by SIGIR 2024

  10. arXiv:2403.16702

    cs.CL cs.IR cs.SE

    ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search

    Authors: Zehan Li, Jianfei Zhang, Chuantao Yin, Yuanxin Ouyang, Wenge Rong

    Abstract: Retrieval-based code question answering seeks to match user queries in natural language to relevant code snippets. Previous approaches typically rely on pretraining models using crafted bi-modal and uni-modal datasets to align text and code representations. In this paper, we introduce ProCQA, a large-scale programming question answering dataset extracted from the StackOverflow community, offering…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

  11. arXiv:2402.01330

    cs.NI

    Video Semantic Communication with Major Object Extraction and Contextual Video Encoding

    Authors: Haopeng Li, Haonan Tong, Sihua Wang, Nuocheng Yang, Zhaohui Yang, Changchuan Yin

    Abstract: This paper studies an end-to-end video semantic communication system for massive communication. In the considered system, the transmitter must continuously send the video to the receiver to facilitate character reconstruction in immersive applications, such as interactive video conference. However, transmitting the original video information with substantial amounts of data poses a challenge to th…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 6 pages, 9 figures, accepted by IEEE WCNC wksp 2024

  12. arXiv:2401.12079

    cs.MA cs.LG

    Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Trajectory Design for 3D UAV Tracking

    Authors: Yujiao Zhu, Mingzhe Chen, Sihua Wang, Ye Hu, Yuchen Liu, Changchuan Yin

    Abstract: In this paper, the problem of using one active unmanned aerial vehicle (UAV) and four passive UAVs to localize a 3D target UAV in real time is investigated. In the considered model, each passive UAV receives reflection signals from the target UAV, which are initially transmitted by the active UAV. The received reflection signals allow each passive UAV to estimate the signal transmission distance w…

    Submitted 22 January, 2024; originally announced January 2024.

  13. arXiv:2401.10153

    cs.NI cs.CV

    Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving

    Authors: Jie Lv, Haonan Tong, Qiang Pan, Zhilong Zhang, Xinxin He, Tao Luo, Changchuan Yin

    Abstract: This article studies the problem of image segmentation-based semantic communication in autonomous driving. In real traffic scenes, detecting the key objects (e.g., vehicles, pedestrians and obstacles) is more crucial than that of other objects to guarantee driving safety. Therefore, we propose a vehicular image segmentation-oriented semantic communication system, termed VIS-SemCom, where image seg…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 10 pages, 8 figures

  14. arXiv:2401.07329

    cs.NE

    Attention-based UNet enabled Lightweight Image Semantic Communication System over Internet of Things

    Authors: Guoxin Ma, Haonan Tong, Nuocheng Yang, Changchuan Yin

    Abstract: This paper studies the problem of the lightweight image semantic communication system that is deployed on Internet of Things (IoT) devices. In the considered system model, devices must use semantic communication techniques to support user behavior recognition in ultimate video service with high data transmission efficiency. However, it is computationally expensive for IoT devices to deploy semanti…

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 6 pages, 6 figures, accepted by IEEE WCNC 2024

  15. arXiv:2401.00137

    cs.CR cs.CV

    SSL-OTA: Unveiling Backdoor Threats in Self-Supervised Learning for Object Detection

    Authors: Qiannan Wang, Changchun Yin, Lu Zhou, Liming Fang

    Abstract: The extensive adoption of self-supervised learning (SSL) has led to an increased security threat from backdoor attacks. While existing research has mainly focused on backdoor attacks in image classification, there has been limited exploration of their implications for object detection. Object detection plays a critical role in security-sensitive applications, such as autonomous driving, where backd…

    Submitted 12 June, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: 10 pages, 4 figures

  16. arXiv:2312.13311

    cs.LG eess.IV

    Unlocking Deep Learning: A BP-Free Approach for Parallel Block-Wise Training of Neural Networks

    Authors: Anzhe Cheng, Zhenkun Wang, Chenzhong Yin, Mingxi Cheng, Heng **, Xiongye Xiao, Shahin Nazarian, Paul Bogdan

    Abstract: Backpropagation (BP) has been a successful optimization technique for deep learning models. However, its limitations, such as backward- and update-locking, and its biological implausibility, hinder the concurrent updating of layers and do not mimic the local learning processes observed in the human brain. To address these issues, recent research has suggested using local error signals to asynchron…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  17. arXiv:2312.12723

    cs.CV

    Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering

    Authors: Chengxiang Yin, Zheng** Che, Kun Wu, Zhiyuan Xu, Jian Tang

    Abstract: Visual Question Answering (VQA) has emerged as one of the most challenging tasks in artificial intelligence due to its multi-modal nature. However, most existing VQA methods are incapable of handling Knowledge-based Visual Question Answering (KB-VQA), which requires external knowledge beyond visible contents to answer questions about a given image. To address this issue, we propose a novel framewo…

    Submitted 19 December, 2023; originally announced December 2023.

  18. arXiv:2312.12721

    cs.CV

    Cross-Modal Reasoning with Event Correlation for Video Question Answering

    Authors: Chengxiang Yin, Zheng** Che, Kun Wu, Zhiyuan Xu, Qinru Qiu, Jian Tang

    Abstract: Video Question Answering (VideoQA) is a very attractive and challenging research direction aiming to understand complex semantics of heterogeneous data from two domains, i.e., the spatio-temporal video content and the word sequence in question. Although various attention mechanisms have been utilized to manage contextualized representations by modeling intra- and inter-modal relationships of the t…

    Submitted 19 December, 2023; originally announced December 2023.

  19. arXiv:2312.12667

    cs.CR cs.AI cs.LG

    Discovering Malicious Signatures in Software from Structural Interactions

    Authors: Chenzhong Yin, Hantang Zhang, Mingxi Cheng, Xiongye Xiao, Xinghe Chen, Xin Ren, Paul Bogdan

    Abstract: Malware represents a significant security concern in today's digital landscape, as it can destroy or disable operating systems, steal sensitive user information, and occupy valuable disk space. However, current malware detection methods, such as static-based and dynamic-based approaches, struggle to identify newly developed ("zero-day") malware and are limited by customized virtual machine (VM) e…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: ICASSP 2024, Accepted

  20. arXiv:2312.10987

    cs.CL

    Cross-Subject Data Splitting for Brain-to-Text Decoding

    Authors: Congchi Yin, Qian Yu, Zhiwei Fang, Jie He, Chang** Peng, Zhangang Lin, **g** Shao, Piji Li

    Abstract: Recent major milestones have successfully decoded non-invasive brain signals (e.g. functional Magnetic Resonance Imaging (fMRI) and electroencephalogram (EEG)) into natural language. Despite the progress in model design, how to split the datasets for training, validating, and testing still remains a matter of debate. Most prior research applied subject-specific data splitting, where the d…

    Submitted 14 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  21. arXiv:2312.06330

    cs.CV cs.AI cs.RO eess.IV

    Navigating Open Set Scenarios for Skeleton-based Action Recognition

    Authors: Kunyu Peng, Cheng Yin, Junwei Zheng, Rui** Liu, David Schneider, Jiaming Zhang, Kailun Yang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg

    Abstract: In real-world scenarios, human actions often fall outside the distribution of training data, making it crucial for models to recognize known actions and reject unknown ones. However, using pure skeleton data in such open-set conditions poses challenges due to the lack of visual background cues and the distinct sparse structure of body pose sequences. In this paper, we tackle the unexplored Open-Se…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024. The benchmark, code, and models will be released at https://github.com/KPeng9510/OS-SAR

  22. arXiv:2311.08945

    math.OC cs.DC cs.LG

    A Single-Loop Algorithm for Decentralized Bilevel Optimization

    Authors: Youran Dong, Shiqian Ma, Junfeng Yang, Chao Yin

    Abstract: Bilevel optimization has gained significant attention in recent years due to its broad applications in machine learning. This paper focuses on bilevel optimization in decentralized networks and proposes a novel single-loop algorithm for solving decentralized bilevel optimization with a strongly convex lower-level problem. Our approach is a fully single-loop method that approximates the hypergradie…

    Submitted 23 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  23. arXiv:2311.04014

    cs.AI math.OC

    A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems

    Authors: Cheng Yin, Yi Chen

    Abstract: This paper introduces a novel operator, termed the Y operator, to elevate control performance in Actor-Critic (AC) based reinforcement learning for systems governed by stochastic differential equations (SDEs). The Y operator ingeniously integrates the stochasticity of a class of child-mother systems into the Critic network's loss function, yielding substantial advancements in the control performance…

    Submitted 1 January, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 15 pages, 2 figures

  24. arXiv:2310.07885

    cs.LG cs.AI

    Leader-Follower Neural Networks with Local Error Signals Inspired by Complex Collectives

    Authors: Chenzhong Yin, Mingxi Cheng, Xiongye Xiao, Xinghe Chen, Shahin Nazarian, Andrei Irimia, Paul Bogdan

    Abstract: The collective behavior of a network with heterogeneous, resource-limited information processing units (e.g., group of fish, flock of birds, or network of neurons) demonstrates high self-organization and complexity. These emergent properties arise from simple interaction rules where certain individuals can exhibit leadership-like behavior and influence the collective activity of the group. Motivat…

    Submitted 11 October, 2023; originally announced October 2023.

  25. arXiv:2310.00626

    cs.CV cs.CR

    GhostEncoder: Stealthy Backdoor Attacks with Dynamic Triggers to Pre-trained Encoders in Self-supervised Learning

    Authors: Qiannan Wang, Changchun Yin, Zhe Liu, Liming Fang, Run Wang, Chenhao Lin

    Abstract: Within the realm of computer vision, self-supervised learning (SSL) pertains to training pre-trained image encoders utilizing a substantial quantity of unlabeled images. Pre-trained image encoders can serve as feature extractors, facilitating the construction of downstream classifiers for various tasks. However, the use of SSL has led to an increase in security research related to various backdoor…

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: 24 pages, 8 figures

  26. arXiv:2309.12368

    cs.HC cs.AI cs.LG

    Rethinking Human-AI Collaboration in Complex Medical Decision Making: A Case Study in Sepsis Diagnosis

    Authors: Shao Zhang, Jianing Yu, Xuhai Xu, Changchang Yin, Yuxuan Lu, Bingsheng Yao, Melanie Tory, Lace M. Padilla, Jeffrey Caterino, ** Zhang, Dakuo Wang

    Abstract: Today's AI systems for medical decision support often succeed on benchmark datasets in research papers but fail in real-world deployment. This work focuses on the decision making of sepsis, an acute life-threatening systematic infection that requires an early diagnosis with high uncertainty from the clinician. Our aim is to explore the design requirements for AI systems that can support clinical e…

    Submitted 26 February, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: Accepted by CHI'24

    MSC Class: 68U35

    ACM Class: H.5.2; I.2.1

  27. arXiv:2309.10305

    cs.CL

    Baichuan 2: Open Large-scale Language Models

    Authors: Aiyuan Yang, Bin Xiao, Bingning Wang, Borong Zhang, Ce Bian, Chao Yin, Chenxu Lv, Da Pan, Dian Wang, Dong Yan, Fan Yang, Fei Deng, Feng Wang, Feng Liu, Guangwei Ai, Guosheng Dong, Haizhou Zhao, Hang Xu, Haoze Sun, Hongda Zhang, Hui Liu, Jiaming Ji, Jian Xie, JunTao Dai, Kun Fang, et al. (30 additional authors not shown)

    Abstract: Large language models (LLMs) have demonstrated remarkable performance on a variety of natural language tasks based on just a few examples of natural language instructions, reducing the need for extensive feature engineering. However, most powerful LLMs are closed-source or limited in their capability for languages other than English. In this technical report, we present Baichuan 2, a series of lar…

    Submitted 20 September, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Baichuan 2 technical report. Github: https://github.com/baichuan-inc/Baichuan2

  28. arXiv:2308.13259

    cs.CL cs.AI

    Knowledge-Driven CoT: Exploring Faithful Reasoning in LLMs for Knowledge-intensive Question Answering

    Authors: Keheng Wang, Feiyu Duan, Sirui Wang, Peiguang Li, Yunsen Xian, Chuantao Yin, Wenge Rong, Zhang Xiong

    Abstract: Equipped with Chain-of-Thought (CoT), Large language models (LLMs) have shown impressive reasoning ability in various downstream tasks. Even so, suffering from hallucinations and the inability to access external knowledge, LLMs often come with incorrect or unfaithful intermediate reasoning steps, especially in the context of answering knowledge-intensive tasks such as KBQA. To alleviate this issue…

    Submitted 28 October, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

  29. arXiv:2308.04673

    cs.CR cs.AI

    SSL-Auth: An Authentication Framework by Fragile Watermarking for Pre-trained Encoders in Self-supervised Learning

    Authors: Xiaobei Li, Changchun Yin, Liyue Zhu, Xiaogang Xu, Liming Fang, Run Wang, Chenhao Lin

    Abstract: Self-supervised learning (SSL), a paradigm harnessing unlabeled datasets to train robust encoders, has recently witnessed substantial success. These encoders serve as pivotal feature extractors for downstream tasks, demanding significant computational resources. Nevertheless, recent studies have shed light on vulnerabilities in pre-trained encoders, including backdoor and adversarial threats. Safe…

    Submitted 6 December, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

  30. arXiv:2308.00531

    cs.NI

    Adaptive Bitrate Video Semantic Communication over Wireless Networks

    Authors: Wentao Gong, Haonan Tong, Sihua Wang, Zhaohui Yang, Xinxin He, Changchuan Yin

    Abstract: This paper investigates the adaptive bitrate (ABR) video semantic communication over wireless networks. In the considered model, video sensing devices must transmit video semantic information to an edge server, to facilitate ubiquitous video sensing services such as road environment monitoring at the edge server in autonomous driving scenario. However, due to the varying wireless network condition…

    Submitted 1 August, 2023; originally announced August 2023.

  31. arXiv:2305.18514

    quant-ph cond-mat.stat-mech cs.CC math-ph

    Polynomial-time classical sampling of high-temperature quantum Gibbs states

    Authors: Chao Yin, Andrew Lucas

    Abstract: The computational complexity of simulating quantum many-body systems generally scales exponentially with the number of particles. This enormous computational cost prohibits first principles simulations of many important problems throughout science, ranging from simulating quantum chemistry to discovering the thermodynamic phase diagram of quantum materials or high-density neutron stars. We present…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 4+4 pages; 0+1 figure

  32. arXiv:2305.11916

    cs.CL

    F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks

    Authors: Xiangxiang Gao, Wei Zhu, Jiasheng Gao, Congrui Yin

    Abstract: Computational complexity and overthinking problems have become the bottlenecks for pre-training language models (PLMs) with millions or even trillions of parameters. A Flexible-Patience-Based Early Exiting method (F-PABEE) has been proposed to alleviate the problems mentioned above for single-label classification (SLC) and multi-label classification (MLC) tasks. F-PABEE makes predictions at the cl…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted by ICASSP 2023

  33. arXiv:2305.08353

    cs.DS cs.LG

    Fast and Efficient Matching Algorithm with Deadline Instances

    Authors: Zhao Song, Weixin Wang, Chenbo Yin, Junze Yin

    Abstract: The online weighted matching problem is a fundamental problem in machine learning due to its numerous applications. Despite many efforts in this area, existing algorithms are either too slow or don't take $\mathrm{deadline}$ (the longest time a node can be matched) into account. In this paper, we introduce a market model with $\mathrm{deadline}$ first. Next, we present our two optimized algorithms…

    Submitted 12 February, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

  34. arXiv:2303.07537

    cs.LG q-bio.QM

    Fractional dynamics foster deep learning of COPD stage prediction

    Authors: Chenzhong Yin, Mihai Udrescu, Gaurav Gupta, Mingxi Cheng, Andrei Lihu, Lucretia Udrescu, Paul Bogdan, David M Mannino, Stefan Mihaicuta

    Abstract: Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death worldwide. Current COPD diagnosis (i.e., spirometry) could be unreliable because the test depends on an adequate effort from the tester and testee. Moreover, the early diagnosis of COPD is challenging. We address COPD detection by constructing two novel physiological signals datasets (4432 records from 54 patients i…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Published in Advanced Science

  35. arXiv:2303.06962

    cs.IT eess.SP

    A Novel Two-Layer Codebook Based Near-Field Beam Training for Intelligent Reflecting Surface

    Authors: Tao Wang, Jie Lv, Haonan Tong, Changsheng You, Changchuan Yin

    Abstract: In this paper, we study the codebook-based near-field beam training for intelligent reflecting surfaces (IRSs) aided wireless system. In the considered model, the near-field beam training is critical to focus signals at the location of user equipment (UE) to obtain prominent IRS array gain. However, existing codebook schemes cannot achieve low training overhead and high receiving power simultaneou…

    Submitted 18 April, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: 6 pages, 4 figures

  36. arXiv:2303.06747

    cs.CV eess.IV

    Raising The Limit Of Image Rescaling Using Auxiliary Encoding

    Authors: Chenzhong Yin, Zhihong Pan, Xin Zhou, Le Kang, Paul Bogdan

    Abstract: Normalizing flow models using invertible neural networks (INN) have been widely investigated for successful generative image super-resolution (SR) by learning the transformation between the normal distribution of latent variable $z$ and the conditional distribution of high-resolution (HR) images given a low-resolution (LR) input. Recently, image rescaling models like IRN utilize the bidirectional n…

    Submitted 12 March, 2023; originally announced March 2023.

  37. arXiv:2303.02304

    cs.LG

    Coupled Multiwavelet Neural Operator Learning for Coupled Partial Differential Equations

    Authors: Xiongye Xiao, Defu Cao, Ruochen Yang, Gaurav Gupta, Gengshuo Liu, Chenzhong Yin, Radu Balan, Paul Bogdan

    Abstract: Coupled partial differential equations (PDEs) are key tasks in modeling the complex dynamics of many physical processes. Recently, neural operators have shown the ability to solve PDEs by learning the integral kernel directly in Fourier/Wavelet space, so the difficulty for solving the coupled PDEs depends on dealing with the coupled mappings between the functions. Towards this end, we propose a \t…

    Submitted 8 December, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023

  38. CTRLStruct: Dialogue Structure Learning for Open-Domain Response Generation

    Authors: Congchi Yin, Piji Li, Zhaochun Ren

    Abstract: Dialogue structure discovery is essential in dialogue generation. Well-structured topic flow can leverage background information and predict future topics to help generate controllable and explainable responses. However, most previous work focused on dialogue structure learning in task-oriented dialogue other than open-domain dialogue which is more complicated and challenging. In this paper, we pr…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 12 pages, to be published in The Web Conference 2023

  39. arXiv:2302.14648

    cs.IT cs.AI cs.LG

    Digital Over-the-Air Federated Learning in Multi-Antenna Systems

    Authors: Sihua Wang, Mingzhe Chen, Cong Shen, Changchuan Yin, Christopher G. Brinton

    Abstract: In this paper, the performance optimization of federated learning (FL), when deployed over a realistic wireless multiple-input multiple-output (MIMO) communication system with digital modulation and over-the-air computation (AirComp) is studied. In particular, a MIMO system is considered in which edge devices transmit their local FL models (trained using their locally collected data) to a paramete…

    Submitted 25 April, 2024; v1 submitted 4 February, 2023; originally announced February 2023.

  40. arXiv:2302.05406

    cs.CL

    Adversarial Transformer Language Models for Contextual Commonsense Inference

    Authors: Pedro Colon-Hernandez, Henry Lieberman, Yida Xin, Claire Yin, Cynthia Breazeal, Peter Chin

    Abstract: Contextualized or discourse aware commonsense inference is the task of generating coherent commonsense assertions (i.e., facts) from a given story, and a particular sentence from that story. Some problems with the task are: lack of controllability for topics of the inferred facts; lack of commonsense knowledge during training; and, possibly, hallucinated or false facts. In this work, we utilize a…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Submitted to Semantic Web Journal special edition. https://semantic-web-journal.org/content/adversarial-transformer-language-models-contextual-commonsense-inference-1

  41. arXiv:2301.12833

    cs.IT

    Sum-Rate Maximization for Active RIS-Aided Downlink RSMA System

    Authors: Xinhao Li, Tao Wang, Haonan Tong, Zhaohui Yang, Yijie Mao, Changchuan Yin

    Abstract: In this paper, the problem of sum-rate maximization for an active reconfigurable intelligent surface (RIS) assisted downlink rate-splitting multiple access (RSMA) transmission system is studied. In the considered model, the active RIS is deployed to overcome severe power attenuation, which is caused by the cumulative product of RIS incidence path loss and the reflection path loss. Since the active…

    Submitted 30 January, 2023; originally announced January 2023.

  42. arXiv:2211.15118

    cs.DS cs.LG

    A Faster $k$-means++ Algorithm

    Authors: Jiehao Liang, Somdeb Sarkhel, Zhao Song, Chenbo Yin, Junze Yin, Danyang Zhuo

    Abstract: $k$-means++ is an important algorithm for choosing initial cluster centers for the $k$-means clustering algorithm. In this work, we present a new algorithm that can solve the $k$-means++ problem with nearly optimal running time. Given $n$ data points in $\mathbb{R}^d$, the current state-of-the-art algorithm runs in $\widetilde{O}(k )$ iterations, and each iteration takes $\widetilde{O}(nd k)…

    Submitted 13 February, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

  43. arXiv:2210.16097  [pdf, other]

    eess.IV cs.CV

    cRedAnno+: Annotation Exploitation in Self-Explanatory Lung Nodule Diagnosis

    Authors: Jiahao Lu, Chong Yin, Kenny Erleben, Michael Bachmann Nielsen, Sune Darkner

    Abstract: Recently, attempts have been made to reduce annotation requirements in feature-based self-explanatory models for lung nodule diagnosis. As a representative, cRedAnno achieves competitive performance with considerably reduced annotation needs by introducing self-supervised contrastive learning to do unsupervised feature extraction. However, it exhibits unstable performance under scarce annotation c…

    Submitted 7 November, 2022; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: 5 pages, 5 figures, 2 tables. arXiv admin note: text overlap with arXiv:2206.13608

  44. arXiv:2210.05321  [pdf, other]

    cs.NI

    Image Segmentation Semantic Communication over Internet of Vehicles

    Authors: Qiang Pan, Haonan Tong, Jie Lv, Tao Luo, Zhilong Zhang, Changchuan Yin, Jianfeng Li

    Abstract: In this paper, the problem of semantic-based efficient image transmission is studied over the Internet of Vehicles (IoV). In the considered model, a vehicle shares a massive amount of visual data perceived by its visual sensors to assist other vehicles in making driving decisions. However, it is hard to maintain highly reliable visual data transmission due to the limited spectrum resources. To tackl…

    Submitted 11 October, 2022; originally announced October 2022.

  45. arXiv:2209.10200  [pdf, other]

    cs.LG

    Performance Optimization for Variable Bitwidth Federated Learning in Wireless Networks

    Authors: Sihua Wang, Mingzhe Chen, Christopher G. Brinton, Changchuan Yin, Walid Saad, Shuguang Cui

    Abstract: This paper considers improving wireless communication and computation efficiency in federated learning (FL) via model quantization. In the proposed bitwidth FL scheme, edge devices train and transmit quantized versions of their local FL model parameters to a coordinating server, which aggregates them into a quantized global model and synchronizes the devices. The goal is to jointly determine the b…

    Submitted 10 July, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

  46. arXiv:2209.04589  [pdf, other]

    cs.IR cs.AI

    Causal Intervention for Fairness in Multi-behavior Recommendation

    Authors: Xi Wang, Wenjie Wang, Fuli Feng, Wenge Rong, Chuantao Yin, Zhang Xiong

    Abstract: Recommender systems usually learn user interests from various user behaviors, including clicks and post-click behaviors (e.g., like and favorite). However, these behaviors inevitably exhibit popularity bias, leading to some unfairness issues: 1) for items with similar quality, more popular ones get more exposure; and 2) even worse, popular items with lower quality might receive more exposure…

    Submitted 17 April, 2024; v1 submitted 10 September, 2022; originally announced September 2022.

    Comments: This paper is accepted by IEEE Transactions on Computational Social Systems

  47. arXiv:2209.01963  [pdf, other]

    cs.IR

    Modeling User Repeat Consumption Behavior for Online Novel Recommendation

    Authors: Yuncong Li, Cunxiang Yin, Yancheng He, Guoqiang Xu, Jing Cai, Leeven Luo, Sheng-hua Zhong

    Abstract: Given a user's historical interaction sequence, online novel recommendation suggests the next novel the user may be interested in. Online novel recommendation is important but underexplored. In this paper, we concentrate on recommending online novels to new users of an online novel reading platform, whose first visits to the platform occurred in the last seven days. We have two observations about…

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: RecSys 2022

  48. arXiv:2209.01882  [pdf, other]

    cs.CL cs.AI cs.CR

    PromptAttack: Prompt-based Attack for Language Models via Gradient Search

    Authors: Yundi Shi, Piji Li, Changchun Yin, Zhaoyang Han, Lu Zhou, Zhe Liu

    Abstract: As the pre-trained language models (PLMs) continue to grow, so do the hardware and data requirements for fine-tuning PLMs. Therefore, the researchers have come up with a lighter method called \textit{Prompt Learning}. However, during the investigations, we observe that the prompt learning methods are vulnerable and can easily be attacked by some illegally constructed prompts, resulting in classifi…

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: 12 pages

  49. arXiv:2207.12303  [pdf, other]

    cs.LG cs.AI cs.CV

    Continual Few-Shot Learning with Adversarial Class Storage

    Authors: Kun Wu, Chengxiang Yin, Jian Tang, Zhiyuan Xu, Yanzhi Wang, Dejun Yang

    Abstract: Humans have a remarkable ability to quickly and effectively learn new concepts in a continuous manner without forgetting old knowledge. Though deep learning has achieved tremendous success on various computer vision tasks, it faces challenges in achieving such human-level intelligence. In this paper, we define a new problem called continual few-shot learning, in which tasks arrive sequentially and…

    Submitted 9 July, 2022; originally announced July 2022.

    Comments: 9 pages

  50. arXiv:2206.13608  [pdf, other]

    eess.IV cs.CV cs.LG

    Reducing Annotation Need in Self-Explanatory Models for Lung Nodule Diagnosis

    Authors: Jiahao Lu, Chong Yin, Oswin Krause, Kenny Erleben, Michael Bachmann Nielsen, Sune Darkner

    Abstract: Feature-based self-explanatory methods explain their classification in terms of human-understandable features. In the medical imaging community, this semantic matching of clinical knowledge adds significantly to the trustworthiness of the AI. However, the cost of additional annotation of features remains a pressing issue. We address this problem by proposing cRedAnno, a data-/annotation-efficient…

    Submitted 25 July, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: 10 pages, 4 figures, 2 tables