Skip to main content

Showing 1–50 of 156 results for author: Yao, B

.
  1. arXiv:2407.00840  [pdf, other

    cs.LG

    MUSE-Net: Missingness-aware mUlti-branching Self-attention Encoder for Irregular Longitudinal Electronic Health Records

    Authors: Zekai Wang, Tieming Liu, Bing Yao

    Abstract: The era of big data has made vast amounts of clinical data readily available, particularly in the form of electronic health records (EHRs), which provides unprecedented opportunities for develo** data-driven diagnostic tools to enhance clinical decision making. However, the application of EHRs in data-driven modeling faces challenges such as irregularly spaced multi-variate time series, issues o… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.19749  [pdf, other

    eess.IV cs.CV

    SPIRONet: Spatial-Frequency Learning and Topological Channel Interaction Network for Vessel Segmentation

    Authors: De-Xing Huang, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Zhen-Qiu Feng, Mei-Jiang Gui, Hao Li, Tian-Yu Xiang, Bo-Xian Yao, Zeng-Guang Hou

    Abstract: Automatic vessel segmentation is paramount for develo** next-generation interventional navigation systems. However, current approaches suffer from suboptimal segmentation performances due to significant challenges in intraoperative images (i.e., low signal-to-noise ratio, small or slender vessels, and strong interference). In this paper, a novel spatial-frequency learning and topological channel… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  3. arXiv:2406.19205  [pdf, other

    eess.SP

    Coordinated RSMA for Integrated Sensing and Communication in Emergency UAV Systems

    Authors: Binghan Yao, Ruoguang Li, Yingyang Chen, Li Wang

    Abstract: Recently, unmanned aerial vehicle (UAV)-enabled integrated sensing and communication (ISAC) is emerging as a promising technique for achieving robust and rapid emergency response capabilities. Such a novel framework offers high-quality and cost-efficient C\&S services due to the intrinsic flexibility and mobility of UAVs. In parallel, rate-splitting multiple access (RSMA) is able to achieve a tail… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.17578  [pdf, other

    eess.IV

    Sparse-view Signal-domain Photoacoustic Tomography Reconstruction Method Based on Neural Representation

    Authors: Bowei Yao, Yi Zeng, Haizhao Dai, Qing Wu, Youshen Xiao, Fei Gao, Yuyao Zhang, **gyi Yu, Xiran Cai

    Abstract: Photoacoustic tomography is a hybrid biomedical technology, which combines the advantages of acoustic and optical imaging. However, for the conventional image reconstruction method, the image quality is affected obviously by artifacts under the condition of sparse sampling. in this paper, a novel model-based sparse reconstruction method via implicit neural representation was proposed for improving… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  5. arXiv:2405.13803  [pdf, other

    cs.HC cs.CL

    Sunnie: An Anthropomorphic LLM-Based Conversational Agent for Mental Well-Being Activity Recommendation

    Authors: Siyi Wu, Feixue Han, Bingsheng Yao, Tianyi Xie, Xuan Zhao, Dakuo Wang

    Abstract: A longstanding challenge in mental well-being support is the reluctance of people to adopt psychologically beneficial activities, often due to lack of motivation, low perceived trustworthiness, and limited personalization of recommendations. Chatbots have shown promise in promoting positive mental health practices, yet their rigid interaction flows and less human-like conversational experiences pr… ▽ More

    Submitted 13 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: In Submission

  6. arXiv:2404.13409  [pdf, other

    cs.HC

    "I Wish There Were an AI": Challenges and AI Potential in Cancer Patient-Provider Communication

    Authors: Ziqi Yang, Xuhai Xu, Bingsheng Yao, Jiachen Li, Jennifer Bagdasarian, Guodong Gao, Dakuo Wang

    Abstract: Patient-provider communication has been crucial to cancer patients' survival after their cancer treatments. However, the research community and patients themselves often overlook the communication challenges after cancer treatments as they are overshadowed by the severity of the patient's illness and the variety and rarity of the cancer disease itself. Meanwhile, the recent technical advances in A… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 18 pages, 2 figures, submission to CSCW'24

  7. arXiv:2404.05012  [pdf, other

    cs.AI cs.CL

    Towards Reliable and Empathetic Depression-Diagnosis-Oriented Chats

    Authors: Kunyao Lan, Cong Ming, Binwei Yao, Lu Chen, Mengyue Wu

    Abstract: Chatbots can serve as a viable tool for preliminary depression diagnosis via interactive conversations with potential patients. Nevertheless, the blend of task-oriented and chit-chat in diagnosis-related dialogues necessitates professional expertise and empathy. Such unique requirements challenge traditional dialogue frameworks geared towards single optimization goals. To address this, we propose… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  8. arXiv:2403.16398  [pdf, other

    cs.LG cs.AI

    Rethinking the Representation in Federated Unsupervised Learning with Non-IID Data

    Authors: Xinting Liao, Weiming Liu, Chaochao Chen, Pengyang Zhou, Fengyuan Yu, Huabin Zhu, Binhui Yao, Tao Wang, Xiaolin Zheng, Yanchao Tan

    Abstract: Federated learning achieves effective performance in modeling decentralized data. In practice, client data are not well-labeled, which makes it potential for federated unsupervised learning (FUSL) with non-IID data. However, the performance of existing FUSL methods suffers from insufficient representations, i.e., (1) representation collapse entanglement among local and global models, and (2) incon… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  9. arXiv:2403.14870  [pdf, other

    cs.CV cs.CL cs.LG

    VidLA: Video-Language Alignment at Scale

    Authors: Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi

    Abstract: In this paper, we propose VidLA, an approach for video-language alignment at scale. There are two major limitations of previous video-language alignment approaches. First, they do not capture both short-range and long-range temporal dependencies and typically employ complex hierarchical deep network architectures that are hard to integrate with existing pretrained image-text foundation models. To… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  10. arXiv:2403.08154  [pdf, other

    cs.LG eess.SP

    The Effect of Different Optimization Strategies to Physics-Constrained Deep Learning for Soil Moisture Estimation

    Authors: Jianxin Xie, Bing Yao, Zheyu Jiang

    Abstract: Soil moisture is a key hydrological parameter that has significant importance to human society and the environment. Accurate modeling and monitoring of soil moisture in crop fields, especially in the root zone (top 100 cm of soil), is essential for improving agricultural production and crop yield with the help of precision irrigation and farming tools. Realizing the full sensor data potential depe… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  11. arXiv:2403.07228  [pdf, other

    eess.SP

    Physics-constrained Active Learning for Soil Moisture Estimation and Optimal Sensor Placement

    Authors: Jianxin Xie, Bing Yao, Zheyu Jiang

    Abstract: Soil moisture is a crucial hydrological state variable that has significant importance to the global environment and agriculture. Precise monitoring of soil moisture in crop fields is critical to reducing agricultural drought and improving crop yield. In-situ soil moisture sensors, which are buried at pre-determined depths and distributed across the field, are promising solutions for monitoring so… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  12. arXiv:2403.01273  [pdf, other

    cs.LG cs.AI cs.CL

    NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention

    Authors: Tianyi Zhang, Jonah Wonkyu Yi, Bowen Yao, Zhaozhuo Xu, Anshumali Shrivastava

    Abstract: Large language model inference on Central Processing Units (CPU) is challenging due to the vast quantities of expensive Multiply-Add (MAD) matrix operations in the attention computations. In this paper, we argue that there is a rare gem in modern CPUs, Single-Instruction-Multiple-Data (SIMD) registers, which allow for ultra-low-latency lookups in batch. We leverage this unique capability of CPUs t… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  13. arXiv:2402.01994  [pdf, ps, other

    cs.HC cs.AI cs.CR

    Human-Centered Privacy Research in the Age of Large Language Models

    Authors: Tianshi Li, Sauvik Das, Hao-** Lee, Dakuo Wang, Bingsheng Yao, Zhi** Zhang

    Abstract: The emergence of large language models (LLMs), and their increased use in user-facing systems, has led to substantial privacy concerns. To date, research on these privacy concerns has been model-centered: exploring how LLMs lead to privacy risks like memorization, or can be used to infer personal characteristics about people from their content. We argue that there is a need for more research focus… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 4 pages, CHI EA'24

  14. arXiv:2401.13804  [pdf, other

    cs.HC cs.CY

    Exploring Parent's Needs for Children-Centered AI to Support Preschoolers' Storytelling and Reading Activities

    Authors: Yuling Sun, Jiali Liu, Bingsheng Yao, Jiaju Chen, Dakuo Wang, Xiaojuan Ma, Yuxuan Lu, Ying Xu, Liang He

    Abstract: Interactive storytelling is vital for preschooler development. While children's interactive partners have traditionally been their parents and teachers, recent advances in artificial intelligence (AI) have sparked a surge of AI-based storytelling technologies. As these technologies become increasingly ubiquitous in preschoolers' lives, questions arise regarding how they function in practical story… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  15. arXiv:2401.13799  [pdf, other

    cs.CY cs.HC

    Who Changed the Destiny of Rural Students, and How?: Unpacking ICT-Mediated Remote Education in Rural China

    Authors: Yuling Sun, Xiuqi Zhu, Xiaomu Zhou, Bingsheng Yao, Kai Zhang, Dakuo Wang, Jiaju Chen, Liang He

    Abstract: The proliferation of Information and Communication Technologies (ICTs) has shown great promise in addressing educational challenges facing rural areas. However, the complex rural context poses significant challenges to the effective utilization of these technologies. This paper examines the empirical integration of live-streaming-based remote classrooms (LSRC) through a qualitative study in rural… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: In submission

  16. arXiv:2401.11876  [pdf, other

    cs.SE cs.RO

    First-principles Based 3D Virtual Simulation Testing for Discovering SOTIF Corner Cases of Autonomous Driving

    Authors: Lehang Li, Haokuan Wu, Botao Yao, Tianyu He, Shuohan Huang, Chuanyi Liu

    Abstract: 3D virtual simulation, which generates diversified test scenarios and tests full-stack of Autonomous Driving Systems (ADSes) modules dynamically as a whole, is a promising approach for Safety of The Intended Functionality (SOTIF) ADS testing. However, as different configurations of a test scenario will affect the sensor perceptions and environment interaction, e.g. light pulses emitted by the LiDA… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 11 pages, 10 figures

  17. arXiv:2401.11480  [pdf

    physics.med-ph

    Organ-Level Radiation Doses from CT Scans for 10,000 Chinese Subjects Undergoing Physical Examinations: The Feasibility of AI-Based Multi-organ CT Image Segmentation and Near Real-time Monte Carlo Dose Computing

    Authors: Zirui Ye, Bei Yao, Haoran Zheng, Li Tao, Ripeng Wang, Yang Lu, Yankui Chang, Xi Pei, Zhi Chen, Xie George Xu

    Abstract: Considering the increasing trend of physical examinations in China, the escalating frequency of Computed Tomography (CT) scans has amplified concerns regarding population radiation exposure and its consequent risks. The challenges mainly manifest in two aspects: one is the rapid construction of patient-specific human phantoms, and the other is the fast Monte Carlo (MC) simulation of radiation dose… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 19 pages, 10 figures

  18. arXiv:2312.14478  [pdf, other

    cs.LG

    Federated Learning via Input-Output Collaborative Distillation

    Authors: Xuan Gong, Shanglin Li, Yuxiang Bao, Barry Yao, Yawen Huang, Ziyan Wu, Baochang Zhang, Yefeng Zheng, David Doermann

    Abstract: Federated learning (FL) is a machine learning paradigm in which distributed local nodes collaboratively train a central model without sharing individually held private data. Existing FL methods either iteratively share local model parameters or deploy co-distillation. However, the former is highly susceptible to private data leakage, and the latter design relies on the prerequisites of task-releva… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

  19. arXiv:2312.00029  [pdf, other

    cs.CR cs.AI cs.CL

    Bergeron: Combating Adversarial Attacks through a Conscience-Based Alignment Framework

    Authors: Matthew Pisano, Peter Ly, Abraham Sanders, Bingsheng Yao, Dakuo Wang, Tomek Strzalkowski, Mei Si

    Abstract: Research into AI alignment has grown considerably since the recent introduction of increasingly capable Large Language Models (LLMs). Unfortunately, modern methods of alignment still fail to fully prevent harmful responses when models are deliberately attacked. These attacks can trick seemingly aligned models into giving manufacturing instructions for dangerous materials, inciting violence, or rec… ▽ More

    Submitted 15 March, 2024; v1 submitted 16 November, 2023; originally announced December 2023.

  20. arXiv:2311.09825  [pdf, other

    cs.CL

    Human Still Wins over LLM: An Empirical Study of Active Learning on Domain-Specific Annotation Tasks

    Authors: Yuxuan Lu, Bingsheng Yao, Shao Zhang, Yun Wang, Peng Zhang, Tun Lu, Toby Jia-Jun Li, Dakuo Wang

    Abstract: Large Language Models (LLMs) have demonstrated considerable advances, and several claims have been made about their exceeding human performance. However, in real-world tasks, domain knowledge is often required. Low-resource learning methods like Active Learning (AL) have been proposed to tackle the cost of domain expert annotation, raising this question: Can LLMs surpass compact models trained wit… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  21. arXiv:2311.09782  [pdf, other

    cs.CL

    More Samples or More Prompts? Exploring Effective In-Context Sampling for LLM Few-Shot Prompt Engineering

    Authors: Bingsheng Yao, Guiming Chen, Ruishi Zou, Yuxuan Lu, Jiachen Li, Shao Zhang, Yisi Sang, Sijia Liu, James Hendler, Dakuo Wang

    Abstract: While most existing works on LLM prompting techniques focus only on how to select a better set of data samples inside one single prompt input (In-Context Learning or ICL), why can not we design and leverage multiple prompts together to further improve the LLM's performance? In this work, we propose In-Context Sampling (ICS), a low-resource LLM prompting technique to produce confident predictions b… ▽ More

    Submitted 2 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted at NAACL 2024 Findings

  22. arXiv:2311.09756  [pdf, other

    cs.CL

    FairytaleCQA: Integrating a Commonsense Knowledge Graph into Children's Storybook Narratives

    Authors: Jiaju Chen, Yuxuan Lu, Shao Zhang, Bingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, Yuling Sun

    Abstract: AI models (including LLM) often rely on narrative question-answering (QA) datasets to provide customized QA functionalities to support downstream children education applications; however, existing datasets only include QA pairs that are grounded within the given storybook content, but children can learn more when teachers refer the storybook content to real-world knowledge (e.g., commonsense knowl… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  23. arXiv:2310.17245  [pdf, other

    cs.LG cs.AI

    CROP: Conservative Reward for Model-based Offline Policy Optimization

    Authors: Hao Li, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu, Zhen-Qiu Feng, Xiao-Yin Liu, Mei-Jiang Gui, Tian-Yu Xiang, De-Xing Huang, Bo-Xian Yao, Zeng-Guang Hou

    Abstract: Offline reinforcement learning (RL) aims to optimize policy using collected data without online interactions. Model-based approaches are particularly appealing for addressing offline RL challenges due to their capability to mitigate the limitations of offline data through data generation using models. Prior research has demonstrated that introducing conservatism into the model or Q-function during… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  24. arXiv:2310.15077  [pdf, other

    cs.CL

    'Don't Get Too Technical with Me': A Discourse Structure-Based Framework for Science Journalism

    Authors: Ronald Cardenas, Bingsheng Yao, Dakuo Wang, Yufang Hou

    Abstract: Science journalism refers to the task of reporting technical findings of a scientific paper as a less technical news article to the general public audience. We aim to design an automated system to support this real-world task (i.e., automatic science journalism) by 1) introducing a newly-constructed and real-world dataset (SciTechNews), with tuples of a publicly-available scientific paper, its cor… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  25. arXiv:2310.05853  [pdf, other

    cs.HC

    "Mango Mango, How to Let The Lettuce Dry Without A Spinner?'': Exploring User Perceptions of Using An LLM-Based Conversational Assistant Toward Cooking Partner

    Authors: Szeyi Chan, Jiachen Li, Bingsheng Yao, Amama Mahmood, Chien-Ming Huang, Holly Jimison, Elizabeth D Mynatt, Dakuo Wang

    Abstract: The rapid advancement of the Large Language Model (LLM) has created numerous potentials for integration with conversational assistants (CAs) assisting people in their daily tasks, particularly due to their extensive flexibility. However, users' real-world experiences interacting with these assistants remain unexplored. In this research, we chose cooking, a complex daily task, as a scenario to inve… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Under submission to CHI2024

  26. arXiv:2309.13879  [pdf, other

    cs.HC

    LLM-Powered Conversational Voice Assistants: Interaction Patterns, Opportunities, Challenges, and Design Guidelines

    Authors: Amama Mahmood, Junxiang Wang, Bingsheng Yao, Dakuo Wang, Chien-Ming Huang

    Abstract: Conventional Voice Assistants (VAs) rely on traditional language models to discern user intent and respond to their queries, leading to interactions that often lack a broader contextual understanding, an area in which Large Language Models (LLMs) excel. However, current LLMs are largely designed for text-based interactions, thus making it unclear how user interactions will evolve if their modality… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  27. arXiv:2309.12368  [pdf, other

    cs.HC cs.AI cs.LG

    Rethinking Human-AI Collaboration in Complex Medical Decision Making: A Case Study in Sepsis Diagnosis

    Authors: Shao Zhang, Jianing Yu, Xuhai Xu, Changchang Yin, Yuxuan Lu, Bingsheng Yao, Melanie Tory, Lace M. Padilla, Jeffrey Caterino, ** Zhang, Dakuo Wang

    Abstract: Today's AI systems for medical decision support often succeed on benchmark datasets in research papers but fail in real-world deployment. This work focuses on the decision making of sepsis, an acute life-threatening systematic infection that requires an early diagnosis with high uncertainty from the clinician. Our aim is to explore the design requirements for AI systems that can support clinical e… ▽ More

    Submitted 26 February, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: Accepted by CHI'24

    MSC Class: 68U35 ACM Class: H.5.2; I.2.1

  28. arXiv:2309.11653  [pdf, other

    cs.HC cs.AI cs.CR

    "It's a Fair Game", or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents

    Authors: Zhi** Zhang, Michelle Jia, Hao-** Lee, Bingsheng Yao, Sauvik Das, Ada Lerner, Dakuo Wang, Tianshi Li

    Abstract: The widespread use of Large Language Model (LLM)-based conversational agents (CAs), especially in high-stakes domains, raises many privacy concerns. Building ethical LLM-based CAs that respect user privacy requires an in-depth understanding of the privacy risks that concern users the most. However, existing research, primarily model-centered, does not provide insight into users' perspectives. To b… ▽ More

    Submitted 1 April, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: 26 pages, 5 figures

  29. arXiv:2309.09357  [pdf, other

    cs.CL cs.AI cs.HC

    Talk2Care: Facilitating Asynchronous Patient-Provider Communication with Large-Language-Model

    Authors: Ziqi Yang, Xuhai Xu, Bingsheng Yao, Shao Zhang, Ethan Rogers, Stephen Intille, Nawar Shara, Guodong Gordon Gao, Dakuo Wang

    Abstract: Despite the plethora of telehealth applications to assist home-based older adults and healthcare providers, basic messaging and phone calls are still the most common communication methods, which suffer from limited availability, information loss, and process inefficiencies. One promising solution to facilitate patient-provider communication is to leverage large language models (LLMs) with their po… ▽ More

    Submitted 3 February, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: Under submission to IMWUT'23, 26 pages

    MSC Class: 68U35 ACM Class: H.5.2; I.2.7

  30. arXiv:2309.00221  [pdf, other

    quant-ph physics.optics

    A multinode quantum network over a metropolitan area

    Authors: Jian-Long Liu, Xi-Yu Luo, Yong Yu, Chao-Yang Wang, Bin Wang, Yi Hu, Jun Li, Ming-Yang Zheng, Bo Yao, Zi Yan, Da Teng, **-Wei Jiang, Xiao-Bing Liu, Xiu-** Xie, Jun Zhang, Qing-He Mao, Xiao Jiang, Qiang Zhang, Xiao-Hui Bao, Jian-Wei Pan

    Abstract: Towards realizing the future quantum internet, a pivotal milestone entails the transition from two-node proof-of-principle experiments conducted in laboratories to comprehensive, multi-node setups on large scales. Here, we report on the debut implementation of a multi-node entanglement-based quantum network over a metropolitan area. We equipped three quantum nodes with atomic quantum memories and… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: 21 pages in total, 4 figures and 1 table in the main text, 5 figures and 8 tables in the supplementary material

  31. arXiv:2307.15868  [pdf, ps, other

    math.OC cs.LG

    Faster Stochastic Algorithms for Minimax Optimization under Polyak--Łojasiewicz Conditions

    Authors: Lesi Chen, Boyuan Yao, Luo Luo

    Abstract: This paper considers stochastic first-order algorithms for minimax optimization under Polyak--Łojasiewicz (PL) conditions. We propose SPIDER-GDA for solving the finite-sum problem of the form $\min_x \max_y f(x,y)\triangleq \frac{1}{n} \sum_{i=1}^n f_i(x,y)$, where the objective function $f(x,y)$ is $μ_x$-PL in $x$ and $μ_y$-PL in $y$; and each $f_i(x,y)$ is $L$-smooth. We prove SPIDER-GDA could f… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: published in NeurIPS 2022; fix a mistake in the proof of Thm. 4.1 and polish the writing

  32. Mental-LLM: Leveraging Large Language Models for Mental Health Prediction via Online Text Data

    Authors: Xuhai Xu, Bingsheng Yao, Yuanzhe Dong, Saadia Gabriel, Hong Yu, James Hendler, Marzyeh Ghassemi, Anind K. Dey, Dakuo Wang

    Abstract: Advances in large language models (LLMs) have empowered a variety of applications. However, there is still a significant gap in research when it comes to understanding and enhancing the capabilities of LLMs in the field of mental health. In this work, we present a comprehensive evaluation of multiple LLMs on various mental health prediction tasks via online text data, including Alpaca, Alpaca-LoRA… ▽ More

    Submitted 28 January, 2024; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: Published at Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT) 2024

    MSC Class: 68U35 ACM Class: H.5.2; I.2.m

  33. arXiv:2306.15096  [pdf, other

    eess.SP

    Automated Identication of Atrial Fibrillation from Single-lead ECGs Using Multi-branching ResNet

    Authors: Jianxin Xie, Stavros Stavrakis, Bing Yao

    Abstract: Atrial fibrillation (AF) is the most common cardiac arrhythmia, which is clinically identified with irregular and rapid heartbeat rhythm. AF puts a patient at risk of forming blood clots, which can eventually lead to heart failure, stroke, or even sudden death. It is of critical importance to develop an advanced analytical model that can effectively interpret the electrocardiography (ECG) signals… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  34. arXiv:2306.08126  [pdf, other

    cs.CL cs.AI

    PersonaPKT: Building Personalized Dialogue Agents via Parameter-efficient Knowledge Transfer

    Authors: Xu Han, Bin Guo, Yoon Jung, Benjamin Yao, Yu Zhang, Xiaohu Liu, Chenlei Guo

    Abstract: Personalized dialogue agents (DAs) powered by large pre-trained language models (PLMs) often rely on explicit persona descriptions to maintain personality consistency. However, such descriptions may not always be available or may pose privacy concerns. To tackle this bottleneck, we introduce PersonaPKT, a lightweight transfer learning approach that can build persona-consistent dialogue models with… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 10 pages, 3 figures, accepted to SustaiNLP 2023

  35. arXiv:2306.02120  [pdf, other

    cond-mat.mes-hall physics.optics

    Giant Enhancement of Magnonic Frequency Combs by Exceptional Points

    Authors: Congyi Wang, **wei Rao, Zhijian Chen, Kaixin Zhao, Liaoxin Sun, Bimu Yao, Tao Yu, Yi-Pu Wang, Wei Lu

    Abstract: With their incomparable time-frequency accuracy, frequency combs have significantly advanced precision spectroscopy, ultra-sensitive detection, and atomic clocks. Traditional methods to create photonic, phononic, and magnonic frequency combs hinge on material nonlinearities which are often weak, necessitating high power densities to surpass their initiation thresholds, which subsequently limits th… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: 7 pages, 4 figures

  36. Reducing Communication for Split Learning by Randomized Top-k Sparsification

    Authors: Fei Zheng, Chaochao Chen, Lingjuan Lyu, Binhui Yao

    Abstract: Split learning is a simple solution for Vertical Federated Learning (VFL), which has drawn substantial attention in both research and application due to its simplicity and efficiency. However, communication efficiency is still a crucial issue for split learning. In this paper, we investigate multiple communication reduction methods for split learning, including cut layer size reduction, top-k spar… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted by IJCAI 2023

    Journal ref: IJCAI 2023

  37. arXiv:2305.16163  [pdf, other

    cs.IR cs.AI

    PPGenCDR: A Stable and Robust Framework for Privacy-Preserving Cross-Domain Recommendation

    Authors: Xinting Liao, Weiming Liu, Xiaolin Zheng, Binhui Yao, Chaochao Chen

    Abstract: Privacy-preserving cross-domain recommendation (PPCDR) refers to preserving the privacy of users when transferring the knowledge from source domain to target domain for better performance, which is vital for the long-term development of recommender systems. Existing work on cross-domain recommendation (CDR) reaches advanced and satisfying recommendation performance, but mostly neglects preserving… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: To be appear in AAAI2023

  38. arXiv:2305.14725  [pdf, other

    cs.CL

    AMELI: Enhancing Multimodal Entity Linking with Fine-Grained Attributes

    Authors: Barry Menglong Yao, Yu Chen, Qifan Wang, Sijia Wang, Minqian Liu, Zhiyang Xu, Licheng Yu, Lifu Huang

    Abstract: We propose attribute-aware multimodal entity linking, where the input is a mention described with a text and image, and the goal is to predict the corresponding target entity from a multimodal knowledge base (KB) where each entity is also described with a text description, a visual image and a set of attributes and values. To support this research, we construct AMELI, a large-scale dataset consist… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 12 pages, 4 figures

    ACM Class: I.2.7

  39. arXiv:2305.14328  [pdf, other

    cs.CL

    Benchmarking LLM-based Machine Translation on Cultural Awareness

    Authors: Binwei Yao, Ming Jiang, Diyi Yang, Junjie Hu

    Abstract: Translating cultural-specific content is crucial for effective cross-cultural communication. However, many MT systems still struggle to translate sentences containing cultural-specific entities accurately and understandably. Recent advancements in in-context learning utilize lightweight prompts to guide large language models (LLMs) in machine translation tasks. Nevertheless, the effectiveness of t… ▽ More

    Submitted 22 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  40. arXiv:2305.12710  [pdf, other

    cs.CL

    Beyond Labels: Empowering Human Annotators with Natural Language Explanations through a Novel Active-Learning Architecture

    Authors: Bingsheng Yao, Ishan **dal, Lucian Popa, Yannis Katsis, Sayan Ghosh, Lihong He, Yuxuan Lu, Shashank Srivastava, Yunyao Li, James Hendler, Dakuo Wang

    Abstract: Real-world domain experts (e.g., doctors) rarely annotate only a decision label in their day-to-day workflow without providing explanations. Yet, existing low-resource learning techniques, such as Active Learning (AL), that aim to support human annotators mostly focus on the label while neglecting the natural language explanation of a data point. This work proposes a novel AL architecture to suppo… ▽ More

    Submitted 23 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023 Findings

  41. arXiv:2305.03117  [pdf, other

    cs.CL

    Are Human Explanations Always Helpful? Towards Objective Evaluation of Human Natural Language Explanations

    Authors: Bingsheng Yao, Prithviraj Sen, Lucian Popa, James Hendler, Dakuo Wang

    Abstract: Human-annotated labels and explanations are critical for training explainable NLP models. However, unlike human-annotated labels whose quality is easier to calibrate (e.g., with a majority vote), human-crafted free-form explanations can be quite subjective. Before blindly using them as ground truth to train ML models, a vital question needs to be asked: How do we evaluate a human-annotated explana… ▽ More

    Submitted 22 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL2023

  42. arXiv:2305.01810  [pdf, other

    cs.CL cs.AI

    KEPLET: Knowledge-Enhanced Pretrained Language Model with Topic Entity Awareness

    Authors: Yichuan Li, Jialong Han, Kyumin Lee, Chengyuan Ma, Benjamin Yao, Derek Liu

    Abstract: In recent years, Pre-trained Language Models (PLMs) have shown their superiority by pre-training on unstructured text corpus and then fine-tuning on downstream tasks. On entity-rich textual resources like Wikipedia, Knowledge-Enhanced PLMs (KEPLMs) incorporate the interactions between tokens and mentioned entities in pre-training, and are thus more effective on entity-centric tasks such as entity… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  43. arXiv:2303.14274  [pdf, ps, other

    math.LO

    Set Theory with Urelements

    Authors: Bokai Yao

    Abstract: This dissertation aims to provide a comprehensive account of set theory with urelements. In Chapter 1, I present mathematical and philosophical motivations for studying urelement set theory and lay out the necessary technical preliminaries. Chapter 2 is devoted to the axiomatization of urelement set theory, where I introduce a hierarchy of axioms and discuss how ZFC with urelements should be axiom… ▽ More

    Submitted 18 June, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2212.13627. Definition 15 in the previous versions is flawed, which is fixed in this version

    MSC Class: 03E25 03E30 03E35 03E40 03E55 03E65 03E70

  44. arXiv:2303.10925  [pdf, other

    quant-ph cond-mat.other

    Meter-scale strong coupling between magnons and photons

    Authors: **wei Rao, C. Y. Wang, Bimu Yao, Z. J. Chen, K. X. Zhao, Wei Lu

    Abstract: We experimentally realize a meter-scale strong coupling effect between magnons and photons at room temperature, with a coherent coupling of 20 m and a dissipative coupling of 7.6 m. To this end, we integrate a saturable gain into a microwave cavity and then couple this active cavity to a magnon mode via a long coaxial cable. The gain compensates for the cavity dissipation, but preserves the cavity… ▽ More

    Submitted 9 August, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  45. Time series anomaly detection with reconstruction-based state-space models

    Authors: Fan Wang, Keli Wang, Boyu Yao

    Abstract: Recent advances in digitization have led to the availability of multivariate time series data in various domains, enabling real-time monitoring of operations. Identifying abnormal data patterns and detecting potential failures in these scenarios are important yet rather challenging. In this work, we propose a novel unsupervised anomaly detection method for time series data. The proposed framework… ▽ More

    Submitted 9 October, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  46. arXiv:2302.08904  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci nlin.AO physics.optics

    Coherent Microwave Emission of a Gain-Driven Polariton

    Authors: Bimu Yao, Y. S. Gui, J. W. Rao, Y. H. Zhang, Wei Lu, C. -M. Hu

    Abstract: By develo** a gain-embedded cavity magnonics platform, we create gain-driven polariton (GDP) that is activated by an amplified electromagnetic field. Distinct effects of gain-driven light-matter interaction, such as polariton auto-oscillations, polariton phase singularity, self-selection of a polariton bright mode, and gain-induced magnon-photon synchronization, are theoretically studied and exp… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 6 pages, 4 figures

  47. arXiv:2302.08665  [pdf, other

    cond-mat.mes-hall physics.app-ph quant-ph

    Control of the magnon-polariton hybridization with a microwave pump

    Authors: C. Zhang, **wei Rao, C. Y. Wang, Z. J. Chen, K. X. Zhao, Bimu Yao, Xu-Guang Xu, Wei Lu

    Abstract: Pump-induced magnon modes (PIMs) are recently discovered elementary excitations in ferrimagnets that offer significant tunability to spin dynamics. Here, we investigate the coupling between a PIM and cavity magnon polaritons (CMPs) by driving a cavity magnonic system away from equilibrium with a microwave pump. In our experiment, the Walker mode simultaneously couples with the PIM and cavity photo… ▽ More

    Submitted 5 August, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

  48. arXiv:2302.02599  [pdf, other

    cs.LG cs.AI cs.DC

    Colossal-Auto: Unified Automation of Parallelization and Activation Checkpoint for Large-scale Models

    Authors: Yuliang Liu, Shenggui Li, Jiarui Fang, Yanjun Shao, Boyuan Yao, Yang You

    Abstract: In recent years, large-scale models have demonstrated state-of-the-art performance across various domains. However, training such models requires various techniques to address the problem of limited computing power and memory on devices such as GPUs. Some commonly used techniques include pipeline parallelism, tensor parallelism, and activation checkpointing. While existing works have focused on fi… ▽ More

    Submitted 21 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  49. arXiv:2301.07950  [pdf

    physics.optics

    A Monolithic Graphene-Functionalized Microlaser for Multispecies Gas Detection

    Authors: Yanhong Guo, Zhaoyu Li, Ning An, Yongzheng Guo, Yuchen Wang, Yusen Yuan, Hao Zhang, Teng Tan, Caihao Wu, Bo Peng, Giancarlo Soavi, Yunjiang Rao, Baicheng Yao

    Abstract: Optical microcavity enhanced light-matter interaction offers a powerful tool to develop fast and precise sensing techniques, spurring applications in the detection of biochemical targets ranging from cells, nanoparticles, and large molecules. However, the intrinsic inertness of such pristine microresonators limits their spread in new fields such as gas detection. Here, a functionalized microlaser… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Journal ref: Advanced Materials 34 (2022) 2207777

  50. arXiv:2212.13627  [pdf, ps, other

    math.LO

    Forcing with Urelements

    Authors: Bokai Yao

    Abstract: I first isolate a hierarchy of axioms over ZFC that allows a proper class of urelements. The Collection Principle and Reflection Principle hold precisely when the urelements are arranged in a specific manner. I then turn to forcing with urelements. A new forcing machinery with urelements is proposed to address a problem with the existing approach regarding the property of fullness. Every new forci… ▽ More

    Submitted 21 August, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    MSC Class: 03E30; 03E40