Showing 1–50 of 78 results for author: Mei, Y

Searching in archive cs.
  1. arXiv:2407.00674  [pdf, other]

    cs.MA cs.GR cs.RO

    Emergent Crowd Grouping via Heuristic Self-Organization

    Authors: Xiao-Cheng Liao, Wei-Neng Chen, Xiang-Ling Chen, Yi Mei

    Abstract: Modeling crowds has many important applications in games and computer animation. Inspired by the emergent following effect in real-life crowd scenarios, in this work, we develop a method for implicitly grouping moving agents. We achieve this by analyzing local information around each agent and rotating its preferred velocity accordingly. Each agent could automatically form an implicit group with i…

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.16578  [pdf, other]

    cs.RO cs.AI

    QuadrupedGPT: Towards a Versatile Quadruped Agent in Open-ended Worlds

    Authors: Ye Wang, Yuting Mei, Sipeng Zheng, Qin Jin

    Abstract: While pets offer companionship, their limited intelligence restricts advanced reasoning and autonomous interaction with humans. Considering this, we propose QuadrupedGPT, a versatile agent designed to master a broad range of complex tasks with agility comparable to that of a pet. To achieve this goal, the primary challenges include: i) effectively leveraging multimodal observations for decision-ma…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Under review

  3. arXiv:2406.16301  [pdf, other]

    cs.CV cs.AI cs.MM

    UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos

    Authors: Yuting Mei, Linli Yao, Qin Jin

    Abstract: With the surge in the amount of video data, video summarization techniques, including visual-modal (VM) and textual-modal (TM) summarization, are attracting more and more attention. However, unimodal summarization inevitably loses the rich semantics of the video. In this paper, we focus on a more comprehensive video summarization task named Bimodal Semantic Summarization of Videos (BiSSV). Specifica…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Accepted by ACM International Conference on Multimedia Retrieval (ICMR'24)

    Journal ref: Proceedings of the 2024 International Conference on Multimedia Retrieval, May 2024, Pages 1034-1042

  4. arXiv:2406.11844  [pdf]

    cs.CY cs.AI

    Prompting the E-Brushes: Users as Authors in Generative AI

    Authors: Yiyang Mei

    Abstract: Since its introduction in 2022, Generative AI has significantly impacted the art world, from winning state art fairs to creating complex videos from simple prompts. Amid this renaissance, a pivotal issue emerges: should users of Generative AI be recognized as authors eligible for copyright protection? The Copyright Office, in its March 2023 Guidance, argues against this notion. By comparing the pr…

    Submitted 24 March, 2024; originally announced June 2024.

    Journal ref: International Journal of Law, Ethics, and Technology 2024

  5. arXiv:2406.10373  [pdf, other]

    cs.CV cs.GR

    Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections

    Authors: Jiacong Xu, Yiqun Mei, Vishal M. Patel

    Abstract: Photographs captured in unstructured tourist environments frequently exhibit variable appearances and transient occlusions, challenging accurate scene reconstruction and inducing artifacts in novel view synthesis. Although prior approaches have integrated the Neural Radiance Field (NeRF) with additional learnable modules to handle the dynamic appearances and eliminate transient objects, their exte…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 15 pages, 7 figures

  6. arXiv:2406.01566  [pdf, other]

    cs.DC cs.CL cs.LG

    Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs

    Authors: Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak

    Abstract: This paper introduces Helix, a distributed system for high-throughput, low-latency large language model (LLM) serving on heterogeneous GPU clusters. A key idea behind Helix is to formulate inference computation of LLMs over heterogeneous GPUs and network connections as a max-flow problem for a directed, weighted graph, whose nodes represent GPU instances and edges capture both GPU and network hete…

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2405.14268  [pdf, other]

    cs.NE cs.AI

    Multi-Representation Genetic Programming: A Case Study on Tree-based and Linear Representations

    Authors: Zhixing Huang, Yi Mei, Fangfang Zhang, Mengjie Zhang, Wolfgang Banzhaf

    Abstract: Existing genetic programming (GP) methods are typically designed based on a certain representation, such as tree-based or linear representations. These representations show various pros and cons in different domains. However, due to the complicated relationships among representation and fitness landscapes of GP, it is hard to intuitively determine which GP representation is the most suitable for s…

    Submitted 23 May, 2024; originally announced May 2024.

  8. arXiv:2405.04759  [pdf, ps, other]

    cs.CV cs.LG

    Multi-Label Out-of-Distribution Detection with Spectral Normalized Joint Energy

    Authors: Yihan Mei, Xinyu Wang, Dell Zhang, Xiaoling Wang

    Abstract: In today's interconnected world, achieving reliable out-of-distribution (OOD) detection poses a significant challenge for machine learning models. While numerous studies have introduced improved approaches for multi-class OOD detection tasks, the investigation into multi-label OOD detection tasks has been notably limited. We introduce Spectral Normalized Joint Energy (SNoJoE), a method that consol…

    Submitted 12 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  9. arXiv:2405.04652  [pdf, ps, other]

    cs.HC

    AffirmativeAI: Towards LGBTQ+ Friendly Audit Frameworks for Large Language Models

    Authors: Yinru Long, Zilin Ma, Yiyang Mei, Zhaoyuan Su

    Abstract: LGBTQ+ community face disproportionate mental health challenges, including higher rates of depression, anxiety, and suicidal ideation. Research has shown that LGBTQ+ people have been using large language model-based chatbots, such as ChatGPT, for their mental health needs. Despite the potential for immediate support and anonymity these chatbots offer, concerns regarding their capacity to provide e…

    Submitted 7 May, 2024; originally announced May 2024.

  10. arXiv:2404.10252  [pdf, other]

    cs.NE

    Learning from Offline and Online Experiences: A Hybrid Adaptive Operator Selection Framework

    Authors: Jiyuan Pei, Jialin Liu, Yi Mei

    Abstract: In many practical applications, usually, similar optimisation problems or scenarios repeatedly appear. Learning from previous problem-solving experiences can help adjust algorithm components of meta-heuristics, e.g., adaptively selecting promising search operators, to achieve better optimisation performance. However, those experiences obtained from previously solved problems, namely offline experi…

    Submitted 15 April, 2024; originally announced April 2024.

  11. arXiv:2403.17328  [pdf, other]

    cs.AI cs.NE

    Learning Traffic Signal Control via Genetic Programming

    Authors: Xiao-Cheng Liao, Yi Mei, Mengjie Zhang

    Abstract: The control of traffic signals is crucial for improving transportation efficiency. Recently, learning-based methods, especially Deep Reinforcement Learning (DRL), garnered substantial success in the quest for more efficient traffic signal control strategies. However, the design of rewards in DRL highly demands domain knowledge to converge to an effective policy, and the final policy also presents…

    Submitted 25 March, 2024; originally announced March 2024.

  12. arXiv:2403.15872  [pdf, other]

    cs.CL

    RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts

    Authors: Hongzheng Li, Ruojin Wang, Ge Shi, Xing Lv, Lei Lei, Chong Feng, Fang Liu, Jinkun Lin, Yangguang Mei, Lingnan Xu

    Abstract: Move structures have been studied in English for Specific Purposes (ESP) and English for Academic Purposes (EAP) for decades. However, there are few move annotation corpora for Research Article (RA) abstracts. In this paper, we introduce RAAMove, a comprehensive multi-domain corpus dedicated to the annotation of move structures in RA abstracts. The primary objective of RAAMove is to facilitate mov…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  13. arXiv:2403.09632  [pdf, other]

    cs.CV

    Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image

    Authors: Yiqun Mei, Yu Zeng, He Zhang, Zhixin Shu, Xuaner Zhang, Sai Bi, Jianming Zhang, HyunJoon Jung, Vishal M. Patel

    Abstract: At the core of portrait photography is the search for ideal lighting and viewpoint. The process often requires advanced knowledge in photography and an elaborate studio setup. In this work, we propose Holo-Relighting, a volumetric relighting method that is capable of synthesizing novel viewpoints, and novel lighting from a single image. Holo-Relighting leverages the pretrained 3D GAN (EG3D) to rec…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: CVPR2024

  14. arXiv:2402.13777  [pdf, other]

    cs.LG cs.AI

    Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions

    Authors: Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, Vaneet Aggarwal

    Abstract: Deep generative models (DGMs) have demonstrated great success across various domains, particularly in generating texts, images, and videos using models trained from offline data. Similarly, data-driven decision-making and robotic control also necessitate learning a generator function from the offline data to serve as the strategy or policy. In this case, applying deep generative models in offline…

    Submitted 25 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: We restructured the paper and added more discussion

  15. Evaluating the Experience of LGBTQ+ People Using Large Language Model Based Chatbots for Mental Health Support

    Authors: Zilin Ma, Yiyang Mei, Yinru Long, Zhaoyuan Su, Krzysztof Z. Gajos

    Abstract: LGBTQ+ individuals are increasingly turning to chatbots powered by large language models (LLMs) to meet their mental health needs. However, little research has explored whether these chatbots can adequately and safely provide tailored support for this demographic. We interviewed 18 LGBTQ+ and 13 non-LGBTQ+ participants about their experiences with LLM-based chatbots for mental health needs. LGBTQ+…

    Submitted 14 February, 2024; originally announced February 2024.

  16. arXiv:2402.01439  [pdf, other]

    cs.LG cs.AI q-bio.BM q-bio.QM

    From Words to Molecules: A Survey of Large Language Models in Chemistry

    Authors: Chang Liao, Yemin Yu, Yu Mei, Ying Wei

    Abstract: In recent years, Large Language Models (LLMs) have achieved significant success in natural language processing (NLP) and various interdisciplinary areas. However, applying LLMs to chemistry is a complex task that requires specialized domain knowledge. This paper provides a thorough exploration of the nuanced methodologies employed in integrating LLMs into the field of chemistry, delving into the c…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Submitted to IJCAI 2024 survey track

  17. arXiv:2402.00404  [pdf, other]

    cs.NE

    Improving Critical Node Detection Using Neural Network-based Initialization in a Genetic Algorithm

    Authors: Chanjuan Liu, Shike Ge, Zhihan Chen, Wenbin Pei, Enqiang Zhu, Yi Mei, Hisao Ishibuchi

    Abstract: The Critical Node Problem (CNP) is concerned with identifying the critical nodes in a complex network. These nodes play a significant role in maintaining the connectivity of the network, and removing them can negatively impact network performance. CNP has been studied extensively due to its numerous real-world applications. Among the different versions of CNP, CNP-1a has gained the most popularity…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 14 pages, 13 figures

  18. arXiv:2401.15279  [pdf, other]

    cs.GR cs.HC

    FabHacks: Transform Everyday Objects into Functional Fixtures

    Authors: Yuxuan Mei, Benjamin Jones, Dan Cascaval, Jennifer Mankoff, Etienne Vouga, Adriana Schulz

    Abstract: Storage, organizing, and decorating are an important part of home design. While one can buy commercial items for many of these tasks, this can be costly, and re-use is more sustainable. An alternative is a "home hack", a functional assembly that can be constructed from existing household items. However, coming up with such hacks requires combining objects to make a physically valid design, which m…

    Submitted 26 January, 2024; originally announced January 2024.

  19. arXiv:2401.14544  [pdf, other]

    cs.LG math.FA math.PR

    Bayesian Optimization through Gaussian Cox Process Models for Spatio-temporal Data

    Authors: Yongsheng Mei, Mahdi Imani, Tian Lan

    Abstract: Bayesian optimization (BO) has established itself as a leading strategy for efficiently optimizing expensive-to-evaluate functions. Existing BO methods mostly rely on Gaussian process (GP) surrogate models and are not applicable to (doubly-stochastic) Gaussian Cox processes, where the observation process is modulated by a latent intensity function modeled as a GP. In this paper, we propose a novel…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 2024 International Conference on Learning Representations (ICLR)

  20. arXiv:2401.06979  [pdf, ps, other]

    cs.AI cs.LG

    Distance-aware Attention Reshaping: Enhance Generalization of Neural Solver for Large-scale Vehicle Routing Problems

    Authors: Yang Wang, Ya-Hui Jia, Wei-Neng Chen, Yi Mei

    Abstract: Neural solvers based on attention mechanism have demonstrated remarkable effectiveness in solving vehicle routing problems. However, in the generalization process from small scale to large scale, we find a phenomenon of the dispersion of attention scores in existing neural solvers, which leads to poor performance. To address this issue, this paper proposes a distance-aware attention reshaping meth…

    Submitted 13 January, 2024; originally announced January 2024.

  21. arXiv:2401.06377  [pdf, other]

    cs.RO

    Design and Nonlinear Modeling of a Modular Cable Driven Soft Robotic Arm

    Authors: Xinda Qi, Yu Mei, Dong Chen, Zhaojian Li, Xiaobo Tan

    Abstract: We propose a novel multi-section cable-driven soft robotic arm inspired by octopus tentacles along with a new modeling approach. Each section of the modular manipulator is made of a soft tubing backbone, a soft silicon arm body, and two rigid endcaps, which connect adjacent sections and decouple the actuation cables of different sections. The soft robotic arm is made with casting after the rigid e…

    Submitted 15 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: The paper has been accepted by IEEE Transactions on Mechatronics

  22. arXiv:2401.00283  [pdf, other]

    cs.IT eess.SP

    Near-Space Communications: the Last Piece of 6G Space-Air-Ground-Sea Integrated Network Puzzle

    Authors: Hongshan Liu, Tong Qin, Zhen Gao, Tianqi Mao, Keke Ying, Ziwei Wan, Li Qiao, Rui Na, Zhongxiang Li, Chun Hu, Yikun Mei, Tuan Li, Guanghui Wen, Lei Chen, Zhonghuai Wu, Ruiqi Liu, Gaojie Chen, Shuo Wang, Dezhi Zheng

    Abstract: This article presents a comprehensive study on the emerging near-space communications (NS-COM) within the context of space-air-ground-sea integrated network (SAGSIN). Specifically, we firstly explore the recent technical developments of NS-COM, followed by the discussions about motivations behind integrating NS-COM into SAGSIN. To further demonstrate the necessity of NS-COM, a comparative analysis…

    Submitted 4 March, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 28 pages, 8 figures, 2 tables

  23. XC-NAS: A New Cellular Encoding Approach for Neural Architecture Search of Multi-path Convolutional Neural Networks

    Authors: Trevor Londt, Xiaoying Gao, Peter Andreae, Yi Mei

    Abstract: Convolutional Neural Networks (CNNs) continue to achieve great success in classification tasks as innovative techniques and complex multi-path architecture topologies are introduced. Neural Architecture Search (NAS) aims to automate the design of these complex architectures, reducing the need for costly manual design work by human experts. Cellular Encoding (CE) is an evolutionary computation tech…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Australasian Joint Conference on Artificial Intelligence 2023

  24. arXiv:2312.07696  [pdf, ps, other]

    cs.CR cs.AI

    Real-time Network Intrusion Detection via Decision Transformers

    Authors: Jingdi Chen, Hanhan Zhou, Yongsheng Mei, Gina Adam, Nathaniel D. Bastian, Tian Lan

    Abstract: Many cybersecurity problems that require real-time decision-making based on temporal observations can be abstracted as a sequence modeling problem, e.g., network intrusion detection from a sequence of arriving packets. Existing approaches like reinforcement learning may not be suitable for such cybersecurity decision problems, since the Markovian property may not necessarily hold and the underlyin…

    Submitted 16 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

  25. arXiv:2312.04948  [pdf, other]

    cs.CV astro-ph.GA cs.LG

    Scientific Preparation for CSST: Classification of Galaxy and Nebula/Star Cluster Based on Deep Learning

    Authors: Yuquan Zhang, Zhong Cao, Feng Wang, Lam, Man I, Hui Deng, Ying Mei, Lei Tan

    Abstract: The Chinese Space Station Telescope (abbreviated as CSST) is a future advanced space telescope. Real-time identification of galaxy and nebula/star cluster (abbreviated as NSC) images is of great value during CSST survey. While recent research on celestial object recognition has progressed, the rapid and efficient identification of high-resolution local celestial images remains challenging. In this…

    Submitted 8 December, 2023; originally announced December 2023.

  26. arXiv:2312.01728  [pdf, other]

    cs.LG

    ImputeFormer: Low Rankness-Induced Transformers for Generalizable Spatiotemporal Imputation

    Authors: Tong Nie, Guoyang Qin, Wei Ma, Yuewen Mei, Jian Sun

    Abstract: Missing data is a pervasive issue in both scientific and engineering tasks, especially for the modeling of spatiotemporal data. This problem attracts many studies to contribute to data-driven solutions. Existing imputation solutions mainly include low-rank models and deep learning models. The former assumes general structural priors but has limited model capacity. The latter possesses salient feat…

    Submitted 28 May, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted by KDD'24 (Research Track)

  27. arXiv:2311.15920  [pdf, other]

    cs.AI

    A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning

    Authors: Jianxiong Li, Shichao Lin, Tianyu Shi, Chujie Tian, Yu Mei, Jian Song, Xianyuan Zhan, Ruimin Li

    Abstract: The optimization of traffic signal control (TSC) is critical for an efficient transportation system. In recent years, reinforcement learning (RL) techniques have emerged as a popular approach for TSC and show promising results for highly adaptive control. However, existing RL-based methods suffer from notably poor real-world applicability and hardly have any successful deployments. The reasons for…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 15 pages, 6 figures

  28. arXiv:2311.09611  [pdf, other]

    cs.HC

    DeltaLCA: Comparative Life-Cycle Assessment for Electronics Design

    Authors: Zhihan Zhang, Felix Hähnlein, Yuxuan Mei, Zachary Englhardt, Shwetak Patel, Adriana Schulz, Vikram Iyer

    Abstract: Reducing the environmental footprint of electronics and computing devices requires new tools that empower designers to make informed decisions about sustainability during the design process itself. This is not possible with current tools for life cycle assessment (LCA) which require substantial domain expertise and time to evaluate the numerous chips and other components that make up a device. We…

    Submitted 16 November, 2023; originally announced November 2023.

  29. arXiv:2311.06770  [pdf, other]

    cs.IT eess.SP

    Compressive Sensing-Based Grant-Free Massive Access for 6G Massive Communication

    Authors: Zhen Gao, Malong Ke, Yikun Mei, Li Qiao, Sheng Chen, Derrick Wing Kwan Ng, H. Vincent Poor

    Abstract: The advent of the sixth-generation (6G) of wireless communications has given rise to the necessity to connect vast quantities of heterogeneous wireless devices, which requires advanced system capabilities far beyond existing network architectures. In particular, such massive communication has been recognized as a prime driver that can empower the 6G vision of future ubiquitous connectivity, suppor…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted by IEEE IoT Journal

  30. arXiv:2310.16310  [pdf, other]

    cs.LG

    Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification

    Authors: Zichong Li, Qunzhi Xu, Zhenghao Xu, Yajun Mei, Tuo Zhao, Hongyuan Zha

    Abstract: Spatio-temporal point processes (STPPs) are potent mathematical tools for modeling and predicting events with both temporal and spatial features. Despite their versatility, most existing methods for learning STPPs either assume a restricted form of the spatio-temporal distribution, or suffer from inaccurate approximations of the intractable integral in the likelihood training objective. These issu…

    Submitted 24 October, 2023; originally announced October 2023.

  31. arXiv:2308.16818  [pdf, other]

    cs.LG cs.AI

    Irregular Traffic Time Series Forecasting Based on Asynchronous Spatio-Temporal Graph Convolutional Network

    Authors: Weijia Zhang, Le Zhang, Jindong Han, Hao Liu, Jingbo Zhou, Yu Mei, Hui Xiong

    Abstract: Accurate traffic forecasting at intersections governed by intelligent traffic signals is critical for the advancement of an effective intelligent traffic signal control system. However, due to the irregular traffic time series produced by intelligent intersections, the traffic forecasting task becomes much more intractable and imposes three major new challenges: 1) asynchronous spatial dependency,…

    Submitted 1 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  32. arXiv:2307.15810  [pdf]

    cs.HC

    Understanding the Benefits and Challenges of Using Large Language Model-based Conversational Agents for Mental Well-being Support

    Authors: Zilin Ma, Yiyang Mei, Zhaoyuan Su

    Abstract: Conversational agents powered by large language models (LLM) have increasingly been utilized in the realm of mental well-being support. However, the implications and outcomes associated with their usage in such a critical field remain somewhat ambiguous and unexplored. We conducted a qualitative analysis of 120 posts, encompassing 2917 user comments, drawn from the most popular subreddit focused o…

    Submitted 28 July, 2023; originally announced July 2023.

  33. arXiv:2307.10120  [pdf, other]

    quant-ph cs.LG

    Quarl: A Learning-Based Quantum Circuit Optimizer

    Authors: Zikun Li, Jinjun Peng, Yixuan Mei, Sina Lin, Yi Wu, Oded Padon, Zhihao Jia

    Abstract: Optimizing quantum circuits is challenging due to the very large search space of functionally equivalent circuits and the necessity of applying transformations that temporarily decrease performance to achieve a final performance improvement. This paper presents Quarl, a learning-based quantum circuit optimizer. Applying reinforcement learning (RL) to quantum circuit optimization raises two main ch…

    Submitted 17 July, 2023; originally announced July 2023.

  34. arXiv:2307.01482  [pdf, other]

    cs.LG

    Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale

    Authors: Tong Nie, Guoyang Qin, Lijun Sun, Wei Ma, Yu Mei, Jian Sun

    Abstract: Spatiotemporal urban data (STUD) displays complex correlational patterns. Extensive advanced techniques have been designed to capture these patterns for effective forecasting. However, because STUD is often massive in scale, practitioners need to strike a balance between effectiveness and efficiency by choosing computationally efficient models. An alternative paradigm called MLP-Mixer has the pote…

    Submitted 7 February, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

  35. arXiv:2306.15334  [pdf, other]

    cs.CL

    Understanding Client Reactions in Online Mental Health Counseling

    Authors: Anqi Li, Lizhi Ma, Yaling Mei, Hongliang He, Shuai Zhang, Huachuan Qiu, Zhenzhong Lan

    Abstract: Communication success relies heavily on reading participants' reactions. Such feedback is especially important for mental health counselors, who must carefully consider the client's progress and adjust their approach accordingly. However, previous NLP research on counseling has mainly focused on studying counselors' intervention strategies rather than their clients' reactions to the intervention.…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Accept to ACL 2023, oral. For code and data, see https://github.com/dll-wu/Client-React

  36. arXiv:2306.00187  [pdf, other]

    cs.MA

    AccMER: Accelerating Multi-Agent Experience Replay with Cache Locality-aware Prioritization

    Authors: Kailash Gogineni, Yongsheng Mei, Peng Wei, Tian Lan, Guru Venkataramani

    Abstract: Multi-Agent Experience Replay (MER) is a key component of off-policy reinforcement learning (RL) algorithms. By remembering and reusing experiences from the past, experience replay significantly improves the stability of RL algorithms and their learning efficiency. In many scenarios, multiple agents interact in a shared environment during online training under centralized training and decentralize…

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted to ASAP'23

  37. arXiv:2305.02805  [pdf, other]

    cs.AI cs.NE

    Local Optima Correlation Assisted Adaptive Operator Selection

    Authors: Jiyuan Pei, Hao Tong, Jialin Liu, Yi Mei, Xin Yao

    Abstract: For solving combinatorial optimisation problems with metaheuristics, different search operators are applied for sampling new solutions in the neighbourhood of a given solution. It is important to understand the relationship between operators for various purposes, e.g., adaptively deciding when to use which operator to find optimal solutions efficiently. However, it is difficult to theoretically an…

    Submitted 3 May, 2023; originally announced May 2023.

  38. arXiv:2303.12950  [pdf, other]

    cs.CV cs.GR

    LightPainter: Interactive Portrait Relighting with Freehand Scribble

    Authors: Yiqun Mei, He Zhang, Xuaner Zhang, Jianming Zhang, Zhixin Shu, Yilin Wang, Zijun Wei, Shi Yan, HyunJoon Jung, Vishal M. Patel

    Abstract: Recent portrait relighting methods have achieved realistic results of portrait lighting effects given a desired lighting representation such as an environment map. However, these methods are not intuitive for user interaction and lack precise lighting control. We introduce LightPainter, a scribble-based relighting system that allows users to interactively manipulate portrait lighting effect with e…

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: CVPR2023

  39. arXiv:2302.10418  [pdf, other]

    cs.LG cs.AI cs.MA

    MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

    Authors: Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

    Abstract: Experience replay is crucial for off-policy reinforcement learning (RL) methods. By remembering and reusing the experiences from past different policies, experience replay significantly improves the training efficiency and stability of RL algorithms. Many decision-making problems in practice naturally involve multiple agents and require multi-agent reinforcement learning (MARL) under centralized t…

    Submitted 27 February, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: The 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023). arXiv admin note: text overlap with arXiv:2302.05593

  40. arXiv:2302.05593  [pdf, other]

    cs.LG

    ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning

    Authors: Yongsheng Mei, Hanhan Zhou, Tian Lan

    Abstract: Value function factorization methods have become a dominant approach for cooperative multiagent reinforcement learning under a centralized training and decentralized execution paradigm. By factorizing the optimal joint action-value function using a monotonic mixing function of agents' utilities, these algorithms ensure the consistency between joint and local action selections for decentralized dec…

    Submitted 10 February, 2023; originally announced February 2023.

  41. arXiv:2302.02521  [pdf, other]

    cs.CV cs.LG

    Exploiting Partial Common Information Microstructure for Multi-Modal Brain Tumor Segmentation

    Authors: Yongsheng Mei, Guru Venkataramani, Tian Lan

    Abstract: Learning with multiple modalities is crucial for automated brain tumor segmentation from magnetic resonance imaging data. Explicitly optimizing the common information shared among all modalities (e.g., by maximizing the total correlation) has been shown to achieve better feature representations and thus enhance the segmentation performance. However, existing approaches are oblivious to partial com…

    Submitted 14 July, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: 2023 ICML Workshop on Machine Learning for Multimodal Healthcare Data (ML4MHD)

  42. arXiv:2301.01333  [pdf]

    cs.LG cs.PF

    oneDNN Graph Compiler: A Hybrid Approach for High-Performance Deep Learning Compilation

    Authors: Jianhui Li, Zhennan Qin, Yijie Mei, Jingze Cui, Yunfei Song, Ciyong Chen, Yifei Zhang, Longsheng Du, Xianhang Cheng, Baihui Jin, Yan Zhang, Jason Ye, Eric Lin, Dan Lavery

    Abstract: With the rapid development of deep learning models and hardware support for dense computing, the deep learning workload characteristics changed significantly from a few hot spots on compute-intensive operations to a broad range of operations scattered across the models. Accelerating a few compute-intensive operations using the expert-tuned implementation of primitives does not fully exploit the pe…

    Submitted 11 March, 2024; v1 submitted 3 January, 2023; originally announced January 2023.

    Comments: 10 pages excluding reference, 9 figures, 1 table

  43. arXiv:2212.01259  [pdf, other]

    stat.ML cs.LG

    Covariance Estimators for the ROOT-SGD Algorithm in Online Learning

    Authors: Yiling Luo, Xiaoming Huo, Yajun Mei

    Abstract: Online learning naturally arises in many statistical and machine learning problems. The most widely used methods in online learning are stochastic first-order algorithms. Among this family of algorithms, there is a recently developed algorithm, Recursive One-Over-T SGD (ROOT-SGD). ROOT-SGD is advantageous in that it converges at a non-asymptotically fast rate, and its estimator further converges t…

    Submitted 2 December, 2022; originally announced December 2022.

  44. arXiv:2211.17207  [pdf, other]

    cs.AR

    Canal: A Flexible Interconnect Generator for Coarse-Grained Reconfigurable Arrays

    Authors: Jackson Melchert, Keyi Zhang, Yuchen Mei, Mark Horowitz, Christopher Torng, Priyanka Raina

    Abstract: The architecture of a coarse-grained reconfigurable array (CGRA) interconnect has a significant effect on not only the flexibility of the resulting accelerator, but also its power, performance, and area. Design decisions that have complex trade-offs need to be explored to maintain efficiency and performance across a variety of evolving applications. This paper presents Canal, a Python-embedded dom…

    Submitted 30 November, 2022; originally announced November 2022.

    Comments: Preprint version

  45. arXiv:2211.13182  [pdf, other]

    cs.AR

    Cascade: An Application Pipelining Toolkit for Coarse-Grained Reconfigurable Arrays

    Authors: Jackson Melchert, Yuchen Mei, Kalhan Koul, Qiaoyi Liu, Mark Horowitz, Priyanka Raina

    Abstract: While coarse-grained reconfigurable arrays (CGRAs) have emerged as promising programmable accelerator architectures, pipelining applications running on CGRAs is required to ensure high maximum clock frequencies. Current CGRA compilers either lack pipelining techniques resulting in low performance or perform exhaustive pipelining resulting in high energy and resource consumption. We introduce Casca…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Preprint version

  46. arXiv:2210.06635  [pdf, other]

    math.OC cs.LG

    A Bayesian Optimization Framework for Finding Local Optima in Expensive Multi-Modal Functions

    Authors: Yongsheng Mei, Tian Lan, Mahdi Imani, Suresh Subramaniam

    Abstract: Bayesian optimization (BO) is a popular global optimization scheme for sample-efficient optimization in domains with expensive function evaluations. The existing BO techniques are capable of finding a single global optimum solution. However, finding a set of global and local optimum solutions is crucial in a wide range of real-world problems, as implementing some of the optimal solutions might not…

    Submitted 5 August, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: European Conference on Artificial Intelligence (ECAI) 2023

  47. arXiv:2209.04846  [pdf, ps, other]

    cs.IT

    Joint Activity Detection and Channel Estimation for Massive IoT Access Based on Millimeter-Wave/Terahertz Multi-Panel Massive MIMO

    Authors: Hanlin Xiu, Zhen Gao, Anwen Liao, Yikun Mei, Dezhi Zheng, Shufeng Tan, Marco Di Renzo, Lajos Hanzo

    Abstract: The multi-panel array, as a state-of-the-art antenna-in-package technology, is very suitable for millimeter-wave (mmWave)/terahertz (THz) systems, due to its low-cost deployment and scalable configuration. But in the context of nonuniform array structures it leads to intractable signal processing. Based on such an array structure at the base station, this paper investigates a joint active user det…

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: accepted by IEEE Transactions on Vehicular Technology

  48. arXiv:2208.05045  [pdf, other]

    cs.LG stat.AP stat.ME

    Adaptive Resources Allocation CUSUM for Binomial Count Data Monitoring with Application to COVID-19 Hotspot Detection

    Authors: Jiuyun Hu, Yajun Mei, Sarah Holte, Hao Yan

    Abstract: In this paper, we present an efficient statistical method (denoted as "Adaptive Resources Allocation CUSUM") to robustly and efficiently detect the hotspot with limited sampling resources. Our main idea is to combine the multi-arm bandit (MAB) and change-point detection methods to balance the exploration and exploitation of resource allocation for hotspot detection. Further, a Bayesian weighted up…

    Submitted 17 August, 2022; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted in Journal of Applied Statistics

  49. arXiv:2207.01983  [pdf, ps, other]

    cs.IT eess.SP

    Massive Access in Extra Large-Scale MIMO with Mixed-ADC over Near Field Channels

    Authors: Yikun Mei, Zhen Gao, De Mi, Mingyu Zhou, Dezhi Zheng, Michail Matthaiou, Pei Xiao, Robert Schober

    Abstract: Massive connectivity for extra large-scale multi-input multi-output (XL-MIMO) systems is a challenging issue due to the near-field access channels and the prohibitive cost. In this paper, we propose an uplink grant-free massive access scheme for XL-MIMO systems, in which a mixed-analog-to-digital converters (ADC) architecture is adopted to strike the right balance between access performance and po…

    Submitted 3 April, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE TVT

  50. The Directional Bias Helps Stochastic Gradient Descent to Generalize in Kernel Regression Models

    Authors: Yiling Luo, Xiaoming Huo, Yajun Mei

    Abstract: We study the Stochastic Gradient Descent (SGD) algorithm in nonparametric statistics: kernel regression in particular. The directional bias property of SGD, which is known in the linear regression setting, is generalized to the kernel regression. More specifically, we prove that SGD with moderate and annealing step-size converges along the direction of the eigenvector that corresponds to the large…

    Submitted 29 April, 2022; originally announced May 2022.