Showing 101–150 of 1,405 results for author: He, Z

  1. arXiv:2404.05220  [pdf, other]

    cs.CV

    StylizedGS: Controllable Stylization for 3D Gaussian Splatting

    Authors: Dingxi Zhang, Zhuoxun Chen, Yu-Jie Yuan, Fang-Lue Zhang, Zhenliang He, Shiguang Shan, Lin Gao

    Abstract: With the rapid development of XR, 3D generation and editing are becoming increasingly important, and stylization is an important tool for 3D appearance editing. It can achieve consistent 3D artistic stylization given a single reference style image and is thus a user-friendly way of editing. However, recent NeRF-based 3D stylization methods face efficiency issues that affect the actual user e…

    Submitted 8 April, 2024; originally announced April 2024.

  2. arXiv:2404.04946  [pdf, other]

    cs.CV

    AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment

    Authors: Yuanfeng Xu, Yuhao Chen, Zhongzhan Huang, Zijian He, Guangrun Wang, Philip Torr, Liang Lin

    Abstract: Recent video editing advancements rely on accurate pose sequences to animate subjects. However, these efforts are not suitable for cross-species animation due to pose misalignment between species (for example, the pose of a cat differs greatly from that of a pig due to differences in body structure). In this paper, we present AnimateZoo, a zero-shot diffusion-based video generator to address this…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Technical report, 15 pages

  3. arXiv:2404.04483  [pdf]

    eess.IV cs.CV

    FastHDRNet: A new efficient method for SDR-to-HDR Translation

    Authors: Siyuan Tian, Hao Wang, Yiren Rong, Junhao Wang, Renjie Dai, Zhengxiao He

    Abstract: Modern displays can render video content with a high dynamic range (HDR) and an extensive color gamut. However, the majority of available resources are still in standard dynamic range (SDR). Therefore, we need to identify an effective methodology for this objective. The existing deep neural network (DNN) based SDR-to-HDR conversion methods outperform conventional me…

    Submitted 11 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: 16 pages, 4 figures

  4. arXiv:2404.04355  [pdf, other]

    math.OC eess.SY

    Gray-Box Nonlinear Feedback Optimization

    Authors: Zhiyu He, Saverio Bolognani, Michael Muehlebach, Florian Dörfler

    Abstract: Feedback optimization enables autonomous optimality seeking of a dynamical system through its closed-loop interconnection with iterative optimization algorithms. Among various iteration structures, model-based approaches require the input-output sensitivity of the system to construct gradients, whereas model-free approaches bypass this need by estimating gradients from real-time evaluations of the…

    Submitted 5 April, 2024; originally announced April 2024.

  5. arXiv:2404.01637  [pdf, other]

    physics.soc-ph

    The dual role of constructive agents in public goods games: limited alone, amplifying cooperation with destructive agents

    Authors: Yuting Dong, Zhixue He, Chen Shen, Lei Shi, Jun Tanimoto

    Abstract: Recent studies have revealed a paradoxical phenomenon in public goods games, wherein destructive agents, harming both cooperators and defectors, can unexpectedly bolster cooperation. Building upon this intriguing premise, our paper introduces a novel concept: constructive agents, which confer additional benefits to both cooperators and defectors. We investigate the impact of these agents on cooper…

    Submitted 2 April, 2024; originally announced April 2024.

  6. arXiv:2404.01617  [pdf, other]

    cs.NI cs.LG cs.MM

    LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models

    Authors: Zhiyuan He, Aashish Gottipati, Lili Qiu, Francis Y. Yan, Xufang Luo, Kenuo Xu, Yuqing Yang

    Abstract: We present LLM-ABR, the first system that utilizes the generative capabilities of large language models (LLMs) to autonomously design adaptive bitrate (ABR) algorithms tailored for diverse network characteristics. Operating within a reinforcement learning framework, LLM-ABR empowers LLMs to design key components such as states and neural network architectures. We evaluate LLM-ABR across diverse ne…

    Submitted 1 April, 2024; originally announced April 2024.

  7. arXiv:2404.01153  [pdf, other]

    stat.ML cs.DC cs.LG math.ST stat.ME

    TransFusion: Covariate-Shift Robust Transfer Learning for High-Dimensional Regression

    Authors: Zelin He, Ying Sun, Jingyuan Liu, Runze Li

    Abstract: The main challenge that sets transfer learning apart from traditional supervised learning is the distribution shift, reflected as the shift between the source and target models and that between the marginal covariate distributions. In this work, we tackle model shifts in the presence of covariate shifts in the high-dimensional regression setting. Specifically, we propose a two-step method with a n…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted by the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

  8. arXiv:2404.01008  [pdf, other]

    cs.IR

    EEG-SVRec: An EEG Dataset with User Multidimensional Affective Engagement Labels in Short Video Recommendation

    Authors: Shaorun Zhang, Zhiyu He, Ziyi Ye, Peijie Sun, Qingyao Ai, Min Zhang, Yiqun Liu

    Abstract: In recent years, short video platforms have gained widespread popularity, making the quality of video recommendations crucial for retaining users. Existing recommendation systems primarily rely on behavioral data, which faces limitations when inferring user preferences due to issues such as data sparsity and noise from accidental interactions or personal habits. To address these challenges and pro…

    Submitted 1 April, 2024; originally announced April 2024.

  9. arXiv:2404.00579  [pdf, other]

    cs.IR cs.AI

    A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys)

    Authors: Yashar Deldjoo, Zhankui He, Julian McAuley, Anton Korikov, Scott Sanner, Arnau Ramisa, René Vidal, Maheswaran Sathiamoorthy, Atoosa Kasirzadeh, Silvia Milano

    Abstract: Traditional recommender systems (RS) have used user-item rating histories as their primary data source, with collaborative filtering being one of the principal methods. However, generative models have recently developed abilities to model and sample from complex data distributions, including not only user-item interaction histories but also text, images, and videos - unlocking this rich data for n…

    Submitted 31 March, 2024; originally announced April 2024.

  10. arXiv:2404.00481  [pdf, other]

    stat.ML cs.LG eess.SY

    Convolutional Bayesian Filtering

    Authors: Wenhan Cao, Shiqi Liu, Chang Liu, Zeyu He, Stephen S.-T. Yau, Shengbo Eben Li

    Abstract: Bayesian filtering serves as the mainstream framework of state estimation in dynamic systems. Its standard version applies the total probability rule and Bayes' law alternately, where how to define and compute the conditional probability is critical to state distribution inference. Previously, the conditional probability has been assumed to be exactly known, which represents a measure of the occurrence proba…

    Submitted 30 March, 2024; originally announced April 2024.

  11. arXiv:2403.20328  [pdf, other]

    cs.RO cs.LG

    Learning Visual Quadrupedal Loco-Manipulation from Demonstrations

    Authors: Zhengmao He, Kun Lei, Yanjie Ze, Koushil Sreenath, Zhongyu Li, Huazhe Xu

    Abstract: Quadruped robots are progressively being integrated into human environments. Despite the growing locomotion capabilities of quadrupedal robots, their interaction with objects in realistic scenes is still limited. While additional robotic arms on quadrupedal robots enable manipulating objects, they are sometimes redundant given that a quadruped robot is essentially a mobile unit equipped with four…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Project website: https://zhengmaohe.github.io/leg-manip

  12. arXiv:2403.20014  [pdf, other]

    cs.DB cs.AI cs.CL

    PURPLE: Making a Large Language Model a Better SQL Writer

    Authors: Tonghui Ren, Yuankai Fan, Zhenying He, Ren Huang, Jiaqi Dai, Can Huang, Yinan Jing, Kai Zhang, Yifan Yang, X. Sean Wang

    Abstract: Large Language Model (LLM) techniques play an increasingly important role in Natural Language to SQL (NL2SQL) translation. LLMs trained on extensive corpora have strong natural language understanding and basic SQL generation abilities without additional tuning specific to NL2SQL tasks. Existing LLM-based NL2SQL approaches try to improve the translation by enhancing the LLMs with an emphasis on us…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 12 pages, accepted by ICDE 2024 (40th IEEE International Conference on Data Engineering)

  13. arXiv:2403.19955  [pdf, ps, other]

    cs.IT

    Joint Training and Reflection Pattern Optimization for Non-Ideal RIS-Aided Multiuser Systems

    Authors: Zhenyao He, Jindan Xu, Hong Shen, Wei Xu, Chau Yuen, Marco Di Renzo

    Abstract: Reconfigurable intelligent surface (RIS) is a promising technique to improve the performance of future wireless communication systems at low energy consumption. To reap the potential benefits of RIS-aided beamforming, it is vital to enhance the accuracy of channel estimation. In this paper, we consider an RIS-aided multiuser system with non-ideal reflecting elements, each of which has a phase-depe…

    Submitted 28 March, 2024; originally announced March 2024.

  14. arXiv:2403.19834  [pdf, other]

    math.OC

    Online Feedback Optimization over Networks: A Distributed Model-free Approach

    Authors: Wenbin Wang, Zhiyu He, Giuseppe Belgioioso, Saverio Bolognani, Florian Dörfler

    Abstract: Online feedback optimization (OFO) enables optimal steady-state operations of a physical system by employing an iterative optimization algorithm as a dynamic feedback controller. When the plant consists of several interconnected sub-systems, centralized implementations become impractical due to the heavy computational burden and the need to pre-compute system-wide sensitivities, which may not be e…

    Submitted 28 March, 2024; originally announced March 2024.

  15. arXiv:2403.19708  [pdf, other]

    cs.CL cs.LG

    Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention

    Authors: Bin Gao, Zhuomin He, Puru Sharma, Qingxuan Kang, Djordje Jevdjic, Junbo Deng, Xingkun Yang, Zhou Yu, Pengfei Zuo

    Abstract: Interacting with humans through multi-turn conversations is a fundamental feature of large language models (LLMs). However, existing LLM serving engines executing multi-turn conversations are inefficient due to the need to repeatedly compute the key-value (KV) caches of historical tokens, incurring high serving costs. To address the problem, this paper proposes CachedAttention, a new attention mec…

    Submitted 30 June, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted to USENIX Annual Technical Conference (ATC) 2024

  16. arXiv:2403.19242  [pdf, other]

    cs.CV

    RTracker: Recoverable Tracking via PN Tree Structured Memory

    Authors: Yuqing Huang, Xin Li, Zikun Zhou, Yaowei Wang, Zhenyu He, Ming-Hsuan Yang

    Abstract: Existing tracking methods mainly focus on learning better target representation or developing more robust prediction models to improve tracking performance. While tracking performance has significantly improved, the target loss issue occurs frequently due to tracking failures, complete occlusion, or out-of-view situations. However, considerably less attention is paid to the self-recovery issue of…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  17. arXiv:2403.16252  [pdf, other]

    cs.RO eess.SY

    Legged Robot State Estimation within Non-inertial Environments

    Authors: Zijian He, Sangli Teng, Tzu-Yuan Lin, Maani Ghaffari, Yan Gu

    Abstract: This paper investigates the robot state estimation problem within a non-inertial environment. The proposed state estimation approach relaxes the common assumption of static ground in the system modeling. The process and measurement models explicitly treat the movement of the non-inertial environment without requiring knowledge of its motion in the inertial frame or relying on GPS or sensing envir…

    Submitted 24 March, 2024; originally announced March 2024.

  18. arXiv:2403.16034  [pdf, other]

    cs.CV

    V2X-Real: a Large-Scale Dataset for Vehicle-to-Everything Cooperative Perception

    Authors: Hao Xiang, Zhaoliang Zheng, Xin Xia, Runsheng Xu, Letian Gao, Zewei Zhou, Xu Han, Xinkai Ji, Mingxi Li, Zonglin Meng, Li Jin, Mingyue Lei, Zhaoyang Ma, Zihang He, Haoxuan Ma, Yunshuang Yuan, Yingqian Zhao, Jiaqi Ma

    Abstract: Recent advancements in Vehicle-to-Everything (V2X) technologies have enabled autonomous vehicles to share sensing information to see through occlusions, greatly boosting the perception capability. However, there are no real-world datasets to facilitate the real V2X cooperative perception research -- existing datasets either only support Vehicle-to-Infrastructure cooperation or Vehicle-to-Vehicle c…

    Submitted 24 March, 2024; originally announced March 2024.

  19. arXiv:2403.16015  [pdf, other]

    cs.RO

    MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment

    Authors: Ziyan Xiong, Bo Chen, Shiyu Huang, Wei-Wei Tu, Zhaofeng He, Yang Gao

    Abstract: The advent of deep reinforcement learning (DRL) has significantly advanced the field of robotics, particularly in the control and coordination of quadruped robots. However, the complexity of real-world tasks often necessitates the deployment of multi-robot systems capable of sophisticated interaction and collaboration. To address this need, we introduce the Multi-agent Quadruped Environment (MQE),…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Open-source code is available at https://github.com/ziyanx02/multiagent-quadruped-environment

  20. arXiv:2403.15866  [pdf, ps, other]

    math.AP

    Existence and multiplicity of solutions for the logarithmic Schrödinger equation with a potential on lattice graphs

    Authors: Zhentao He, Chao Ji

    Abstract: In this paper, we consider the existence and multiplicity of solutions for the logarithmic Schrödinger equation on lattice graphs $\mathbb{Z}^N$: $$ -\Delta u + V(x) u = u \log u^2, \quad x \in \mathbb{Z}^N. $$ When the potential $V$ is coercive, we obtain infinitely many solutions by adapting some arguments of the Fountain theorem. In the cases of periodic potential, asymptotically periodic potential and bo…

    Submitted 23 March, 2024; originally announced March 2024.

  21. arXiv:2403.14280  [pdf, other]

    cs.CR

    Large Language Models for Blockchain Security: A Systematic Literature Review

    Authors: Zheyuan He, Zihao Li, Sen Yang, Ao Qiao, Xiaosong Zhang, Xiapu Luo, Ting Chen

    Abstract: Large Language Models (LLMs) have emerged as powerful tools across various domains within cyber security. Notably, recent studies are increasingly exploring LLMs applied to the context of blockchain security (BS). However, there remains a gap in a comprehensive understanding regarding the full scope of applications, impacts, and potential constraints of LLMs on blockchain security. To fill this ga…

    Submitted 11 May, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  22. arXiv:2403.14188  [pdf, other]

    cond-mat.dis-nn cs.AI cs.CR

    Quantum-activated neural reservoirs on-chip open up large hardware security models for resilient authentication

    Authors: Zhao He, Maxim S. Elizarov, Ning Li, Fei Xiang, Andrea Fratalocchi

    Abstract: Quantum artificial intelligence is a frontier of artificial intelligence research, pioneering quantum AI-powered circuits to address problems beyond the reach of deep learning with classical architectures. This work implements a large-scale quantum-activated recurrent neural network possessing more than 3 trillion hardware nodes/cm$^2$, originating from repeatable atomic-scale nucleation dynamics…

    Submitted 21 March, 2024; originally announced March 2024.

  23. arXiv:2403.13565  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    AdaTrans: Feature-wise and Sample-wise Adaptive Transfer Learning for High-dimensional Regression

    Authors: Zelin He, Ying Sun, Jingyuan Liu, Runze Li

    Abstract: We consider the transfer learning problem in the high dimensional setting, where the feature dimension is larger than the sample size. To learn transferable information, which may vary across features or the source samples, we propose an adaptive transfer learning method that can detect and aggregate the feature-wise (F-AdaTrans) or sample-wise (S-AdaTrans) transferable structures. We achieve this…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Technical Report

  24. arXiv:2403.13491  [pdf, other]

    cs.DB

    Distance Comparison Operators for Approximate Nearest Neighbor Search: Exploration and Benchmark

    Authors: Zeyu Wang, Haoran Xiong, Zhenying He, Peng Wang, Wei Wang

    Abstract: Approximate nearest neighbor search (ANNS) on high-dimensional vectors has become a fundamental and essential component in various machine learning tasks. Prior research has shown that the distance comparison operation is the bottleneck of ANNS, which determines the query and indexing performance. To overcome this challenge, some novel methods have been proposed recently. The basic idea is to esti…

    Submitted 20 March, 2024; originally announced March 2024.

  25. arXiv:2403.12350  [pdf, other]

    cs.LG

    Friendly Sharpness-Aware Minimization

    Authors: Tao Li, Pan Zhou, Zhengbao He, Xinwen Cheng, Xiaolin Huang

    Abstract: Sharpness-Aware Minimization (SAM) has been instrumental in improving deep neural network training by minimizing both training loss and loss sharpness. Despite the practical success, the mechanisms behind SAM's generalization enhancements remain elusive, limiting its progress in deep learning optimization. In this work, we investigate SAM's core components for generalization improvement and introd…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  26. arXiv:2403.11374  [pdf, other]

    math.NA math.ST

    Quasi-Monte Carlo and importance sampling methods for Bayesian inverse problems

    Authors: Zhijian He, Hejin Wang, Xiaoqun Wang

    Abstract: Importance Sampling (IS), an effective variance reduction strategy in Monte Carlo (MC) simulation, is frequently utilized for Bayesian inference and other statistical challenges. Quasi-Monte Carlo (QMC) replaces the random samples in MC with low discrepancy points and has the potential to substantially enhance error rates. In this paper, we integrate IS with a randomly shifted rank-1 lattice rule,…

    Submitted 17 March, 2024; originally announced March 2024.

    MSC Class: 35R60; 62F15; 65C05; 65N21

  27. arXiv:2403.11101  [pdf, other]

    cs.CV

    Hierarchical Generative Network for Face Morphing Attacks

    Authors: Zuyuan He, Zongyong Deng, Qiaoyun He, Qijun Zhao

    Abstract: Face morphing attacks circumvent face recognition systems (FRSs) by creating a morphed image that contains multiple identities. However, existing face morphing attack methods either sacrifice image quality or compromise the identity preservation capability. Consequently, these attacks fail to bypass FRSs verification well while still managing to deceive human observers. These methods typically rel…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by FG2024

  28. arXiv:2403.10894  [pdf, other]

    cs.CL

    Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization

    Authors: Zihan Wang, Jiayu Xiao, Mengxiang Li, Zhongjiang He, Yongxiang Li, Chao Wang, Shuangyong Song

    Abstract: In our dynamic world where data arrives in a continuous stream, continual learning enables us to incrementally add new tasks/domains without the need to retrain from scratch. A major challenge in continual learning of language models is catastrophic forgetting, the tendency of models to forget knowledge from previously trained tasks/domains when training on new ones. This paper studies dialog gener…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 12 pages, 2 figures

  29. arXiv:2403.10798  [pdf, other]

    cs.CV

    Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval

    Authors: Shichao Kan, Yuhai Deng, Yixiong Liang, Lihui Cen, Zhe Qu, Yigang Cen, Zhihai He

    Abstract: The task of searching for visual objects in a large image dataset is difficult because it requires efficient matching and accurate localization of objects that can vary in size. Although the segment anything model (SAM) offers a potential solution for extracting object spatial context, learning embeddings for local objects remains a challenging problem. This paper presents a novel unsupervised dee…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 13 pages, 10 figures

  30. arXiv:2403.09738  [pdf, other]

    cs.CL cs.AI cs.IR

    Evaluating Large Language Models as Generative User Simulators for Conversational Recommendation

    Authors: Se-eun Yoon, Zhankui He, Jessica Maria Echterhoff, Julian McAuley

    Abstract: Synthetic users are cost-effective proxies for real users in the evaluation of conversational recommender systems. Large language models show promise in simulating human-like behavior, raising the question of their ability to represent a diverse population of users. We introduce a new protocol to measure the degree to which language models can accurately emulate human behavior in conversational re…

    Submitted 25 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: NAACL 2024

  31. arXiv:2403.07591  [pdf, other]

    cs.LG

    Robustifying and Boosting Training-Free Neural Architecture Search

    Authors: Zhenfeng He, Yao Shu, Zhongxiang Dai, Bryan Kian Hsiang Low

    Abstract: Neural architecture search (NAS) has become a key component of AutoML and a standard tool to automate the design of deep neural networks. Recently, training-free NAS as an emerging paradigm has successfully reduced the search costs of standard training-based NAS by estimating the true architecture performance with only training-free metrics. Nevertheless, the estimation ability of these metrics ty…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by ICLR 2024. Code available at https://github.com/hzf1174/RoBoT

  32. arXiv:2403.06470  [pdf, other]

    cs.CV

    3D-aware Image Generation and Editing with Multi-modal Conditions

    Authors: Bo Li, Yi-ke Li, Zhi-fen He, Bin Liu, Yun-Kun Lai

    Abstract: 3D-consistent image generation from a single 2D semantic label is an important and challenging research topic in computer graphics and computer vision. Although some related works have made great progress in this field, most of the existing methods suffer from poor disentanglement performance of shape and appearance, and lack multi-modal control. In this paper, we propose a novel end-to-end 3D-awa…

    Submitted 11 March, 2024; originally announced March 2024.

  33. arXiv:2403.06463  [pdf, other]

    eess.SY

    A prediction-based forward-looking vehicle dispatching strategy for dynamic ride-pooling

    Authors: Xiaolei Wang, Chen Yang, Yuzhen Feng, Luohan Hu, Zhengbing He

    Abstract: For on-demand dynamic ride-pooling services, e.g., Uber Pool and Didi Pinche, a well-designed vehicle dispatching strategy is crucial for platform profitability and passenger experience. Most existing dispatching strategies overlook incoming pairing opportunities and therefore suffer from short-sightedness. In this paper, we propose a forward-looking vehicle dispatching strategy, which first…

    Submitted 11 March, 2024; originally announced March 2024.

  34. arXiv:2403.06447  [pdf, other]

    cs.IR cs.AI

    CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation

    Authors: Junda Wu, Cheng-Chun Chang, Tong Yu, Zhankui He, Jianing Wang, Yupeng Hou, Julian McAuley

    Abstract: Long-tail recommendation is a challenging task for traditional recommender systems, due to data sparsity and data imbalance issues. The recent development of large language models (LLMs) has shown their abilities in complex reasoning, which can help to deduce users' preferences based on very few previous interactions. However, since most LLM-based systems rely on items' semantic meaning as the…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 11 pages

  35. arXiv:2403.06214  [pdf, other]

    quant-ph

    Distributed quantum architecture search

    Authors: Haozhen Situ, Zhimin He, Shenggen Zheng, Lvzhou Li

    Abstract: Variational quantum algorithms, inspired by neural networks, have become a novel approach in quantum computing. However, designing efficient parameterized quantum circuits remains a challenge. Quantum architecture search tackles this by adjusting circuit structures along with gate parameters to automatically discover high-performance circuit structures. In this study, we propose an end-to-end dist…

    Submitted 10 March, 2024; originally announced March 2024.

  36. arXiv:2403.04407  [pdf, ps, other]

    math.NA

    Unbiased Markov chain quasi-Monte Carlo for Gibbs samplers

    Authors: Jiarui Du, Zhijian He

    Abstract: In statistical analysis, Monte Carlo (MC) stands as a classical numerical integration method. When encountering challenging sampling problems, Markov chain Monte Carlo (MCMC) is a commonly employed method. However, the MCMC estimator is biased after a fixed number of iterations. Unbiased MCMC, an advancement achieved through coupling techniques, addresses this bias issue in MCMC. It allows us to run…

    Submitted 31 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  37. arXiv:2403.04085  [pdf, other]

    cs.CL cs.CY

    Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations

    Authors: Abhishek Anand, Negar Mokhberian, Prathyusha Naresh Kumar, Anweasha Saha, Zihao He, Ashwin Rao, Fred Morstatter, Kristina Lerman

    Abstract: Researchers have raised awareness about the harms of aggregating labels especially in subjective tasks that naturally contain disagreements among human annotators. In this work we show that models that are only provided aggregated labels show low confidence on high-disagreement data instances. While previous studies consider such instances as mislabeled, we argue that the reason the high-disagreem…

    Submitted 6 March, 2024; originally announced March 2024.

  38. arXiv:2403.03952  [pdf, other]

    cs.IR

    Bridging Language and Items for Retrieval and Recommendation

    Authors: Yupeng Hou, Jiacheng Li, Zhankui He, An Yan, Xiusi Chen, Julian McAuley

    Abstract: This paper introduces BLaIR, a series of pretrained sentence embedding models specialized for recommendation scenarios. BLaIR is trained to learn correlations between item metadata and potential natural language context, which is useful for retrieving and recommending items. To pretrain BLaIR, we collect Amazon Reviews 2023, a new dataset comprising over 570 million reviews and 48 million items fr…

    Submitted 6 March, 2024; originally announced March 2024.

  39. arXiv:2403.03405  [pdf, other]

    cs.CV

    Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation

    Authors: Liuyi Wang, Zongtao He, Ronghao Dang, Huiyi Chen, Chengju Liu, Qijun Chen

    Abstract: Vision-and-Language Navigation (VLN) has gained significant research interest in recent years due to its potential applications in real-world scenarios. However, existing VLN methods struggle with the issue of spurious associations, resulting in poor generalization with a significant performance gap between seen and unseen environments. In this paper, we tackle this challenge by proposing a unifie…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 16 pages

  40. arXiv:2403.02959  [pdf, other]

    cs.CL cs.AI

    SimuCourt: Building Judicial Decision-Making Agents with Real-world Judgement Documents

    Authors: Zhitao He, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao

    Abstract: With the development of deep learning, natural language processing technology has effectively improved the efficiency of various aspects of the traditional judicial industry. However, most current efforts focus solely on individual judicial stages, overlooking cross-stage collaboration. As the autonomous agents powered by large language models are becoming increasingly smart and able to make comple…

    Submitted 5 March, 2024; originally announced March 2024.

  41. arXiv:2403.02893  [pdf, other]

    cs.CL cs.AI

    Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning

    Authors: Zhitao He, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, Jun Zhao

    Abstract: Event Causality Identification (ECI) refers to the detection of causal relations between events in texts. However, most existing studies focus on sentence-level ECI with high-resource languages, leaving more challenging document-level ECI (DECI) with low-resource languages under-explored. In this paper, we propose a Heterogeneous Graph Interaction Model with Multi-granularity Contrastive Transfer…

    Submitted 22 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  42. arXiv:2403.02714  [pdf, other]

    cs.CV

    DomainVerse: A Benchmark Towards Real-World Distribution Shifts For Tuning-Free Adaptive Domain Generalization

    Authors: Feng Hou, Jin Yuan, Ying Yang, Yang Liu, Yang Zhang, Cheng Zhong, Zhongchao Shi, Jianping Fan, Yong Rui, Zhiqiang He

    Abstract: Traditional cross-domain tasks, including domain adaptation and domain generalization, rely heavily on training models on source-domain data. With the recent advance of vision-language models (VLMs), viewed as natural source models, the cross-domain task changes to directly adapt the pre-trained source model to arbitrary target domains equipped with prior domain knowledge, and we name this task Ada…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Currently in review for ICML 2024

  43. arXiv:2403.02707  [pdf, other]

    cs.CV cs.MM

    Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation

    Authors: Gang Liu, Hongyang Li, Zerui He, Shenjun Zhong

    Abstract: Leveraging pre-trained visual language models has become a widely adopted approach for improving performance in downstream visual question answering (VQA) applications. However, in the specialized field of medical VQA, the scarcity of available data poses a significant barrier to achieving reliable model generalization. Numerous methods have been proposed to enhance model generalization, addressin…

    Submitted 5 March, 2024; originally announced March 2024.

  44. arXiv:2403.02604  [pdf, other]

    cs.RO

    UniDoorManip: Learning Universal Door Manipulation Policy Over Large-scale and Diverse Door Manipulation Environments

    Authors: Yu Li, Xiaojie Zhang, Ruihai Wu, Zilong Zhang, Yiran Geng, Hao Dong, Zhaofeng He

    Abstract: Learning a universal manipulation policy encompassing doors with diverse categories, geometries and mechanisms is crucial for future embodied agents to effectively work in complex and broad real-world scenarios. Due to the limited datasets and unrealistic simulation environments, previous works fail to achieve good performance across various doors. In this work, we build a novel door manipulation…

    Submitted 12 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Project page https://unidoormanip.github.io/

  45. arXiv:2403.02132  [pdf, other]

    cs.CV

    UB-FineNet: Urban Building Fine-grained Classification Network for Open-access Satellite Images

    Authors: Zhiyi He, Wei Yao, Jie Shao, Puzuo Wang

    Abstract: Fine classification of city-scale buildings from satellite remote sensing imagery is a crucial research area with significant implications for urban planning, infrastructure development, and population distribution analysis. However, the task faces big challenges due to low-resolution overhead images acquired from high altitude space-borne platforms and the long-tail sample distribution of fine-gr…

    Submitted 4 March, 2024; originally announced March 2024.

  46. arXiv:2403.01988  [pdf, other]

    cs.CL

    FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs

    Authors: Xuannan Liu, Peipei Li, Huaibo Huang, Zekun Li, Xing Cui, Jiahao Liang, Lixiong Qin, Weihong Deng, Zhaofeng He

    Abstract: The massive generation of multimodal fake news exhibits substantial distribution discrepancies, prompting the need for generalized detectors. However, the insulated nature of training within specific domains restricts the capability of classical detectors to obtain open-world facts. In this paper, we propose FakeNewsGPT4, a novel framework that augments Large Vision-Language Models (LVLMs) with fo…

    Submitted 4 March, 2024; originally announced March 2024.

  47. arXiv:2403.01153  [pdf, other]

    eess.SP

    Transfer Learning-Enhanced Instantaneous Multi-Person Indoor Localization by CSI

    Authors: Zhiyuan He, Ke Deng, Jiangchao Gong, Yi Zhou, Desheng Wang

    Abstract: Passive indoor localization, integral to smart buildings, emergency response, and indoor navigation, has traditionally been limited by a focus on single-target localization and reliance on multi-packet CSI. We introduce a novel Multi-target loss, notably enhancing multi-person localization. Utilizing this loss function, our instantaneous CSI-ResNet achieves an impressive 99.21% accuracy at 0.6m pr…

    Submitted 2 March, 2024; originally announced March 2024.

  48. arXiv:2403.00835  [pdf, other]

    cs.CL cs.AI

    CLLMs: Consistency Large Language Models

    Authors: Siqi Kou, Lanxiang Hu, Zhezhi He, Zhijie Deng, Hao Zhang

    Abstract: Parallel decoding methods such as Jacobi decoding show promise for more efficient LLM inference as it breaks the sequential nature of the LLM decoding process and transforms it into parallelizable computation. However, in practice, it achieves little speedup compared to traditional autoregressive (AR) decoding, primarily because Jacobi decoding seldom accurately predicts more than one token in a s…

    Submitted 13 June, 2024; v1 submitted 28 February, 2024; originally announced March 2024.

    Comments: In the proceedings of the 41st International Conference on Machine Learning (ICML) 2024

  49. arXiv:2403.00811  [pdf, other]

    cs.AI cs.CL

    Cognitive Bias in High-Stakes Decision-Making with LLMs

    Authors: Jessica Echterhoff, Yao Liu, Abeer Alessa, Julian McAuley, Zexue He

    Abstract: Large language models (LLMs) offer significant potential as tools to support an expanding range of decision-making tasks. However, given their training on human (created) data, LLMs can both inherit societal biases against protected groups and be subject to cognitive bias. Such human-like bias can impede fair and explainable decisions made with LLM assistance. Our work introduces BiasBuste…

    Submitted 24 February, 2024; originally announced March 2024.

  50. arXiv:2403.00591  [pdf, other]

    cs.CV

    Learning Causal Features for Incremental Object Detection

    Authors: Zhenwei He, Lei Zhang

    Abstract: Object detection fixes its recognizable categories during the training phase, so it cannot cover all objects of interest for users. To satisfy this practical necessity, the incremental learning ability of the detector becomes a critical factor for real-world applications. Unfortunately, neural networks unavoidably encounter the catastrophic forgetting problem when they are applied to a new task. To…

    Submitted 1 March, 2024; originally announced March 2024.