Skip to main content

Showing 1–50 of 241 results for author: Niu, Y

Searching in archive cs. Search in all archives.
.
  1. Embracing Federated Learning: Enabling Weak Client Participation via Partial Model Training

    Authors: Sunwoo Lee, Tuo Zhang, Saurav Prakash, Yue Niu, Salman Avestimehr

    Abstract: In Federated Learning (FL), clients may have weak devices that cannot train the full model or even hold it in their memory space. To implement large-scale FL applications, thus, it is crucial to develop a distributed learning method that enables the participation of such weak clients. We propose EmbracingFL, a general FL framework that allows all available clients to join the distributed training… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Journal ref: IEEE Transactions on Mobile Computing, Early Access, (2024)

  2. arXiv:2406.12793  [pdf, other

    cs.CL

    ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

    Authors: Team GLM, :, Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, **g Zhang, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang , et al. (32 additional authors not shown)

    Abstract: We introduce ChatGLM, an evolving family of large language models that we have been develo** over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.10923  [pdf, other

    cs.CV cs.CL cs.LG

    Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies

    Authors: Hung-Ting Su, Chun-Tong Chao, Ya-Ching Hsu, Xudong Lin, Yulei Niu, Hung-Yi Lee, Winston H. Hsu

    Abstract: Large Language Models (LLMs) have demonstrated effectiveness not only in language tasks but also in video reasoning. This paper introduces a novel dataset, Tropes in Movies (TiM), designed as a testbed for exploring two critical yet previously overlooked video reasoning skills: (1) Abstract Perception: understanding and tokenizing abstract concepts in videos, and (2) Long-range Compositional Reaso… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Project page: https://ander1119.github.io/TiM

  4. arXiv:2406.10667  [pdf, other

    cs.LG

    UniZero: Generalized and Efficient Planning with Scalable Latent World Models

    Authors: Yuan Pu, Yazhe Niu, Jiyuan Ren, Zhenjie Yang, Hongsheng Li, Yu Liu

    Abstract: Learning predictive world models is essential for enhancing the planning capabilities of reinforcement learning agents. Notably, the MuZero-style algorithms, based on the value equivalence principle and Monte Carlo Tree Search (MCTS), have achieved superhuman performance in various domains. However, in environments that require capturing long-term dependencies, MuZero's performance deteriorates ra… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 32 pages, 16 figures

  5. arXiv:2405.18405  [pdf, other

    cs.CV cs.AI

    WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization

    Authors: Jiawei Ma, Yulei Niu, Shiyuan Huang, Guangxing Han, Shih-Fu Chang

    Abstract: Language has been useful in extending the vision encoder to data from diverse distributions without empirical discovery in training domains. However, as the image description is mostly at coarse-grained level and ignores visual details, the resulted embeddings are still ineffective in overcoming complexity of domains at inference time. We present a self-supervision framework WIDIn, Wording Images… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  6. arXiv:2405.15377   

    cs.RO

    Dynamic Planning for Sequential Whole-body Mobile Manipulation

    Authors: Zhitian Li, Yida Niu, Yao Su, Hangxin Liu, Ziyuan Jiao

    Abstract: The dynamic Sequential Mobile Manipulation Planning (SMMP) framework is essential for the safe and robust operation of mobile manipulators in dynamic environments. Previous research has primarily focused on either motion-level or task-level dynamic planning, with limitations in handling state changes that have long-term effects or in generating responsive motions for diverse tasks, respectively. T… ▽ More

    Submitted 20 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: technical issue, withdraw all the versions

  7. arXiv:2405.15269  [pdf, other

    cs.CV cs.LG

    BDetCLIP: Multimodal Prompting Contrastive Test-Time Backdoor Detection

    Authors: Yuwei Niu, Shuo He, Qi Wei, Feng Liu, Lei Feng

    Abstract: Multimodal contrastive learning methods (e.g., CLIP) have shown impressive zero-shot classification performance due to their strong ability to joint representation learning for visual and textual modalities. However, recent research revealed that multimodal contrastive learning on poisoned pre-training data with a small proportion of maliciously backdoored data can induce backdoored CLIP that coul… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  8. arXiv:2405.08816  [pdf, other

    cs.CV cs.RO

    The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

    Authors: Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang , et al. (66 additional authors not shown)

    Abstract: In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that c… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICRA 2024; 32 pages, 24 figures, 5 tables; Code at https://robodrive-24.github.io/

  9. arXiv:2405.04844  [pdf, ps, other

    cs.IR

    Full Stage Learning to Rank: A Unified Framework for Multi-Stage Systems

    Authors: Kai Zheng, Haijun Zhao, Rui Huang, Beichuan Zhang, Na Mou, Yanan Niu, Yang Song, Hongning Wang, Kun Gai

    Abstract: The Probability Ranking Principle (PRP) has been considered as the foundational standard in the design of information retrieval (IR) systems. The principle requires an IR module's returned list of results to be ranked with respect to the underlying user interests, so as to maximize the results' utility. Nevertheless, we point out that it is inappropriate to indiscriminately apply PRP through eve… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted by WWW 2024

  10. arXiv:2404.18369  [pdf, other

    quant-ph cs.ET

    A SAT Scalpel for Lattice Surgery: Representation and Synthesis of Subroutines for Surface-Code Fault-Tolerant Quantum Computing

    Authors: Daniel Bochen Tan, Murphy Yuezhen Niu, Craig Gidney

    Abstract: Quantum error correction is necessary for large-scale quantum computing. A promising quantum error correcting code is the surface code. For this code, fault-tolerant quantum computing (FTQC) can be performed via lattice surgery, i.e., splitting and merging patches of code. Given the frequent use of certain lattice-surgery subroutines (LaS), it becomes crucial to optimize their design in order to m… ▽ More

    Submitted 17 May, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: To appear in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA)

  11. arXiv:2404.17462  [pdf, other

    cs.NI

    Integrated Sensing and Communication Channel Modeling: A Survey

    Authors: Zhiqing Wei, **zhu Jia, Yangyang Niu, Lin Wang, Huici Wu, Heng Yang, Zhiyong Feng

    Abstract: Integrated sensing and communication (ISAC) is expected to play a crucial role in the sixth-generation (6G) mobile communication systems, offering potential applications in the scenarios of intelligent transportation, smart factories, etc. The performance of radar sensing in ISAC systems is closely related to the characteristics of radar sensing and communication channels. Therefore, ISAC channel… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  12. arXiv:2404.16364  [pdf, other

    cs.AI

    ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze

    Authors: Chunyu Xuan, Yazhe Niu, Yuan Pu, Shuai Hu, Yu Liu, **g Yang

    Abstract: Monte Carlo Tree Search (MCTS)-based algorithms, such as MuZero and its derivatives, have achieved widespread success in various decision-making domains. These algorithms employ the reanalyze process to enhance sample efficiency from stale data, albeit at the expense of significant wall-clock time consumption. To address this issue, we propose a general approach named ReZero to boost tree search o… ▽ More

    Submitted 28 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  13. arXiv:2404.09520  [pdf, other

    cs.IR

    UniSAR: Modeling User Transition Behaviors between Search and Recommendation

    Authors: Teng Shi, Zihua Si, Jun Xu, Xiao Zhang, Xiaoxue Zang, Kai Zheng, Dewei Leng, Yanan Niu, Yang Song

    Abstract: Nowadays, many platforms provide users with both search and recommendation services as important tools for accessing information. The phenomenon has led to a correlation between user search and recommendation behaviors, providing an opportunity to model user interests in a fine-grained way. Existing approaches either model user search and recommendation behaviors separately or overlook the differe… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGIR 2024

  14. arXiv:2404.05280  [pdf, other

    cs.CV

    MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues

    Authors: Xiahan Chen, Mingjian Chen, Sanli Tang, Yi Niu, Jiang Zhu

    Abstract: 3D object detection based on roadside cameras is an additional way for autonomous driving to alleviate the challenges of occlusion and short perception range from vehicle cameras. Previous methods for roadside 3D object detection mainly focus on modeling the depth or height of objects, neglecting the stationary of cameras and the characteristic of inter-frame consistency. In this work, we propose… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  15. arXiv:2404.00934  [pdf, other

    cs.CL

    ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback

    Authors: Zhenyu Hou, Yilin Niu, Zhengxiao Du, Xiaohan Zhang, Xiao Liu, Aohan Zeng, Qinkai Zheng, Minlie Huang, Hongning Wang, Jie Tang, Yuxiao Dong

    Abstract: ChatGLM is a free-to-use AI service powered by the ChatGLM family of large language models (LLMs). In this paper, we present the ChatGLM-RLHF pipeline -- a reinforcement learning from human feedback (RLHF) system -- designed to enhance ChatGLM's alignment with human preferences. ChatGLM-RLHF encompasses three major components: the collection of human preference data, the training of the reward mod… ▽ More

    Submitted 3 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  16. arXiv:2404.00212  [pdf, other

    cs.PL

    Cost-sensitive computational adequacy of higher-order recursion in synthetic domain theory

    Authors: Yue Niu, Jonathan Sterling, Robert Harper

    Abstract: We study a cost-aware programming language for higher-order recursion dubbed $\textbf{PCF}_\mathsf{cost}$ in the setting of synthetic domain theory (SDT). Our main contribution relates the denotational cost semantics of $\textbf{PCF}_\mathsf{cost}$ to its computational cost semantics, a new kind of dynamic semantics for program execution that serves as a mathematically natural alternative to opera… ▽ More

    Submitted 10 June, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: Accepted at MFPS '24

  17. arXiv:2403.18600  [pdf, other

    cs.CV cs.AI cs.RO

    RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos

    Authors: Ali Zare, Yulei Niu, Hammad Ayyubi, Shih-fu Chang

    Abstract: Procedure Planning in instructional videos entails generating a sequence of action steps based on visual observations of the initial and target states. Despite the rapid progress in this task, there remain several critical challenges to be solved: (1) Adaptive procedures: Prior works hold an unrealistic assumption that the number of action steps is known and fixed, leading to non-generalizable mod… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 23 pages, 6 figures, 12 tables

  18. arXiv:2403.18197  [pdf, other

    cs.RO

    LocoMan: Advancing Versatile Quadrupedal Dexterity with Lightweight Loco-Manipulators

    Authors: Changyi Lin, Xingyu Liu, Yuxiang Yang, Yaru Niu, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots, Ding Zhao

    Abstract: Quadrupedal robots have emerged as versatile agents capable of locomoting and manipulating in complex environments. Traditional designs typically rely on the robot's inherent body parts or incorporate top-mounted arms for manipulation tasks. However, these configurations may limit the robot's operational dexterity, efficiency and adaptability, particularly in cluttered or constrained spaces. In th… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Project page: https://linchangyi1.github.io/LocoMan

  19. arXiv:2403.16189  [pdf, other

    cs.NI

    Interference Management for Integrated Sensing and Communication Systems: A Survey

    Authors: Yangyang Niu, Zhiqing Wei, Lin Wang, Huici Wu, Zhiyong Feng

    Abstract: Emerging applications such as autonomous driving and Internet of things (IoT) services put forward the demand for simutaneous sensing and communication functions in the same system. Integrated sensing and communication (ISAC) has the potential to meet the demands of ubiquitous communication and high-precision sensing due to the advantages of spectrum and hardware resource sharing, as well as the m… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  20. arXiv:2403.16107  [pdf, other

    cs.HC

    Designing Upper-Body Gesture Interaction with and for People with Spinal Muscular Atrophy in VR

    Authors: **gze Tian, Yingna Wang, Keye Yu, Liyi Xu, Junan Xie, Franklin Mingzhe Li, Yafeng Niu, Mingming Fan

    Abstract: Recent research proposed gaze-assisted gestures to enhance interaction within virtual reality (VR), providing opportunities for people with motor impairments to experience VR. Compared to people with other motor impairments, those with Spinal Muscular Atrophy (SMA) exhibit enhanced distal limb mobility, providing them with more design space. However, it remains unknown what gaze-assisted upper-bod… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11--16, 2024, Honolulu, HI, USA

  21. arXiv:2403.10995  [pdf, other

    cs.LG cs.AI cs.CR cs.SI

    Edge Private Graph Neural Networks with Singular Value Perturbation

    Authors: Tingting Tang, Yue Niu, Salman Avestimehr, Murali Annavaram

    Abstract: Graph neural networks (GNNs) play a key role in learning representations from graph-structured data and are demonstrated to be useful in many applications. However, the GNN training pipeline has been shown to be vulnerable to node feature leakage and edge extraction attacks. This paper investigates a scenario where an attacker aims to recover private edge information from a trained GNN model. Prev… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Accepted at Privacy Enhancing Technologies Symposium (PETS) 2024

  22. arXiv:2403.10361  [pdf, other

    cs.CR

    Unveiling Wash Trading in Popular NFT Markets

    Authors: Yuanzheng Niu, Xiaoqi Li, Hongli Peng, Wenkai Li

    Abstract: As emerging digital assets, NFTs are susceptible to anomalous trading behaviors due to the lack of stringent regulatory mechanisms, potentially causing economic losses. In this paper, we conduct the first systematic analysis of four non-fungible tokens (NFT) markets. Specifically, we analyze more than 25 million transactions within these markets, to explore the evolution of wash trade activities.… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted by WWW 2024

  23. arXiv:2403.08994  [pdf, other

    cs.CL

    Ethos: Rectifying Language Models in Orthogonal Parameter Space

    Authors: Lei Gao, Yue Niu, Tingting Tang, Salman Avestimehr, Murali Annavaram

    Abstract: Language models (LMs) have greatly propelled the research on natural language processing. However, LMs also raise concerns regarding the generation of biased or toxic content and the potential disclosure of private information from the training dataset. In this work, we present a new efficient approach, Ethos, that rectifies LMs to mitigate toxicity and bias in outputs and avoid privacy leakage. E… ▽ More

    Submitted 1 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  24. arXiv:2403.02352  [pdf, other

    cs.LG cs.AI

    ATP: Enabling Fast LLM Serving via Attention on Top Principal Keys

    Authors: Yue Niu, Saurav Prakash, Salman Avestimehr

    Abstract: We propose a new attention mechanism with linear complexity, ATP, that fixates \textbf{A}ttention on \textbf{T}op \textbf{P}rincipal keys, rather than on each individual token. Particularly, ATP is driven by an important observation that input sequences are typically low-rank, i.e., input sequences can be represented by a few principal bases. Therefore, instead of directly iterating over all the i… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 10 pages, 7 figures, 8 tables

  25. arXiv:2403.01599  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos

    Authors: Yulei Niu, Wenliang Guo, Long Chen, Xudong Lin, Shih-Fu Chang

    Abstract: We study the problem of procedure planning in instructional videos, which aims to make a goal-oriented sequence of action steps given partial visual state observations. The motivation of this problem is to learn a structured and plannable state and action space. Recent works succeeded in sequence modeling of steps with only sequence-level annotations accessible during training, which overlooked th… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: Accepted by ICLR 2024

  26. arXiv:2402.14399  [pdf, other

    cs.IR cs.AI

    Ensure Timeliness and Accuracy: A Novel Sliding Window Data Stream Paradigm for Live Streaming Recommendation

    Authors: Fengqi Liang, Baigong Zheng, Liqin Zhao, Guorui Zhou, Qian Wang, Yanan Niu

    Abstract: Live streaming recommender system is specifically designed to recommend real-time live streaming of interest to users. Due to the dynamic changes of live content, improving the timeliness of the live streaming recommender system is a critical problem. Intuitively, the timeliness of the data determines the upper bound of the timeliness that models can learn. However, none of the previous works addr… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  27. arXiv:2402.03047  [pdf, other

    cs.CV cs.LG

    PFDM: Parser-Free Virtual Try-on via Diffusion Model

    Authors: Yunfang Niu, Dong Yi, Lingxiang Wu, Zhiwei Liu, Pengxiang Cai, **qiao Wang

    Abstract: Virtual try-on can significantly improve the garment shop** experiences in both online and in-store scenarios, attracting broad interest in computer vision. However, to achieve high-fidelity try-on performance, most state-of-the-art methods still rely on accurate segmentation masks, which are often produced by near-perfect parsers or manual labeling. To overcome the bottleneck, we propose a pars… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE ICASSP 2024

  28. arXiv:2402.00856  [pdf, other

    cs.CL

    Towards Efficient Exact Optimization of Language Model Alignment

    Authors: Haozhe Ji, Cheng Lu, Yilin Niu, Pei Ke, Hongning Wang, Jun Zhu, Jie Tang, Minlie Huang

    Abstract: The alignment of language models with human preferences is vital for their application in real-world tasks. The problem is formulated as optimizing the model's policy to maximize the expected reward that reflects human preferences with minimal deviation from the initial policy. While considered as a straightforward solution, reinforcement learning (RL) suffers from high variance in policy updates,… ▽ More

    Submitted 5 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 24 pages, 9 figures

    Journal ref: Forty-first International Conference on Machine Learning (ICML 2024)

  29. arXiv:2401.09424  [pdf, other

    physics.ao-ph cs.AI cs.CV cs.LG

    Precipitation Prediction Using an Ensemble of Lightweight Learners

    Authors: Xinzhe Li, Sun Rui, Yiming Niu, Yao Liu

    Abstract: Precipitation prediction plays a crucial role in modern agriculture and industry. However, it poses significant challenges due to the diverse patterns and dynamics in time and space, as well as the scarcity of high precipitation events. To address this challenge, we propose an ensemble learning framework that leverages multiple learners to capture the diverse patterns of precipitation distributi… ▽ More

    Submitted 29 November, 2023; originally announced January 2024.

  30. LabelCraft: Empowering Short Video Recommendations with Automated Label Crafting

    Authors: Yimeng Bai, Yang Zhang, **g Lu, Jianxin Chang, Xiaoxue Zang, Yanan Niu, Yang Song, Fuli Feng

    Abstract: Short video recommendations often face limitations due to the quality of user feedback, which may not accurately depict user interests. To tackle this challenge, a new task has emerged: generating more dependable labels from original feedback. Existing label generation methods rely on manual rules, demanding substantial human effort and potentially misaligning with the desired objectives of the pl… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted by WSDM'24

    ACM Class: H.3.3; H.3.5

  31. arXiv:2312.10885   

    cs.IR

    A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm

    Authors: Yong Niu, Xing Xing, Zhichun Jia, Ruidi Liu, Mindong Xin

    Abstract: Sequential recommendation aims to infer user preferences from historical interaction sequences and predict the next item that users may be interested in the future. The current mainstream design approach is to represent items as fixed vectors, capturing the underlying relationships between items and user preferences based on the order of interactions. However, relying on a single fixed-item embedd… ▽ More

    Submitted 20 December, 2023; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: This paper needs to be further modified, including the ablation experiment, model framework and other information in Chapter 5. There are some inaccuracies in the presentation of this paper. Two datasets are used instead of three, and there are many inaccuracies in the presentation, which need to be further corrected

  32. arXiv:2312.07685  [pdf, other

    cs.LG cs.AI

    A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

    Authors: Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang

    Abstract: Offline-to-online Reinforcement Learning (O2O RL) aims to improve the performance of offline pretrained policy using only a few online samples. Built on offline RL algorithms, most O2O methods focus on the balance between RL objective and pessimism, or the utilization of offline and online samples. In this paper, from a novel perspective, we systematically study the challenges that remain in O2O R… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

  33. arXiv:2312.05264  [pdf, other

    cs.CR cs.LG

    All Rivers Run to the Sea: Private Learning with Asymmetric Flows

    Authors: Yue Niu, Ramy E. Ali, Saurav Prakash, Salman Avestimehr

    Abstract: Data privacy is of great concern in cloud machine-learning service platforms, when sensitive data are exposed to service providers. While private computing environments (e.g., secure enclaves), and cryptographic approaches (e.g., homomorphic encryption) provide strong privacy protection, their computing performance still falls short compared to cloud GPUs. To achieve privacy protection with high c… ▽ More

    Submitted 29 March, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Camera-ready for CVPR 2024

  34. Throughput Maximization for Intelligent Refracting Surface Assisted mmWave High-Speed Train Communications

    Authors: **g Li, Yong Niu, Hao Wu, Bo Ai, Ruisi He, Ning Wang, Sheng Chen

    Abstract: With the increasing demands from passengers for data-intensive services, millimeter-wave (mmWave) communication is considered as an effective technique to release the transmission pressure on high speed train (HST) networks. However, mmWave signals ncounter severe losses when passing through the carriage, which decreases the quality of services on board. In this paper, we investigate an intelligen… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 13 pages, 7 figures, IEEE Internet of Things Journal

  35. arXiv:2311.10747  [pdf, other

    cs.RO cs.AI cs.LG

    Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving

    Authors: Haohong Lin, Wenhao Ding, Zuxin Liu, Yaru Niu, Jiacheng Zhu, Yuming Niu, Ding Zhao

    Abstract: In the domain of autonomous driving, the offline Reinforcement Learning~(RL) approaches exhibit notable efficacy in addressing sequential decision-making problems from offline datasets. However, maintaining safety in diverse safety-critical scenarios remains a significant challenge due to long-tailed and unforeseen scenarios absent from offline datasets. In this paper, we introduce the saFety-awar… ▽ More

    Submitted 12 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

  36. arXiv:2311.10041  [pdf, other

    cs.RO

    Interpretable Reinforcement Learning for Robotics and Continuous Control

    Authors: Rohan Paleja, Letian Chen, Yaru Niu, Andrew Silva, Zhaoxin Li, Songan Zhang, Chace Ritchie, Sugju Choi, Kimberlee Chestnut Chang, Hongtei Eric Tseng, Yan Wang, Subramanya Nageshrao, Matthew Gombolay

    Abstract: Interpretability in machine learning is critical for the safe deployment of learned policies across legally-regulated and safety-critical domains. While gradient-based approaches in reinforcement learning have achieved tremendous success in learning policies for continuous control problems such as robotics and autonomous driving, the lack of interpretability is a fundamental barrier to adoption. W… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2202.02352

  37. arXiv:2311.09271  [pdf, other

    cs.HC cs.AI

    An Empathetic User-Centric Chatbot for Emotional Support

    Authors: Yanting Pan, Yixuan Tang, Yuchen Niu

    Abstract: This paper explores the intersection of Otome Culture and artificial intelligence, particularly focusing on how Otome-oriented games fulfill the emotional needs of young women. These games, which are deeply rooted in a subcultural understanding of love, provide players with feelings of satisfaction, companionship, and protection through carefully crafted narrative structures and character developm… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  38. arXiv:2311.08302  [pdf, other

    cs.IR

    Inverse Learning with Extremely Sparse Feedback for Recommendation

    Authors: Guanyu Lin, Chen Gao, Yu Zheng, Yinfeng Li, Jianxin Chang, Yanan Niu, Yang Song, Kun Gai, Zhiheng Li, Depeng **, Yong Li

    Abstract: Modern personalized recommendation services often rely on user feedback, either explicit or implicit, to improve the quality of services. Explicit feedback refers to behaviors like ratings, while implicit feedback refers to behaviors like user clicks. However, in the scenario of full-screen video viewing experiences like Tiktok and Reels, the click action is absent, resulting in unclear feedback f… ▽ More

    Submitted 20 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: WSDM 2024

  39. arXiv:2311.08272  [pdf, other

    cs.IR cs.LG

    Mixed Attention Network for Cross-domain Sequential Recommendation

    Authors: Guanyu Lin, Chen Gao, Yu Zheng, Jianxin Chang, Yanan Niu, Yang Song, Kun Gai, Zhiheng Li, Depeng **, Yong Li, Meng Wang

    Abstract: In modern recommender systems, sequential recommendation leverages chronological user behaviors to make effective next-item suggestions, which suffers from data sparsity issues, especially for new users. One promising line of work is the cross-domain recommendation, which trains models with data across multiple domains to improve the performance in data-scarce domains. Recent proposed cross-domain… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: WSDM 2024

  40. Sum Rate Maximization under AoI Constraints for RIS-Assisted mmWave Communications

    Authors: Ziqi Guo, Yong Niu, Shiwen Mao, Changming Zhang, Ning Wang, Zhangdui Zhong, Bo Ai

    Abstract: The concept of age of information (AoI) has been proposed to quantify information freshness, which is crucial for time-sensitive applications. However, in millimeter wave (mmWave) communication systems, the link blockage caused by obstacles and the severe path loss greatly impair the freshness of information received by the user equipments (UEs). In this paper, we focus on reconfigurable intellige… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  41. arXiv:2311.06497  [pdf, other

    cs.CV cs.AI

    DRUformer: Enhancing the driving scene Important object detection with driving relationship self-understanding

    Authors: Yingjie Niu, Ming Ding, Keisuke Fujii, Kento Ohtani, Alexander Carballo, Kazuya Takeda

    Abstract: Traffic accidents frequently lead to fatal injuries, contributing to over 50 million deaths until 2023. To mitigate driving hazards and ensure personal safety, it is crucial to assist vehicles in anticipating important objects during travel. Previous research on important object detection primarily assessed the importance of individual participants, treating them as independent entities and freque… ▽ More

    Submitted 13 December, 2023; v1 submitted 11 November, 2023; originally announced November 2023.

  42. arXiv:2310.13065  [pdf, other

    cs.RO cs.AI cs.LG

    Creative Robot Tool Use with Large Language Models

    Authors: Mengdi Xu, Peide Huang, Wenhao Yu, Shiqi Liu, Xilun Zhang, Yaru Niu, Tingnan Zhang, Fei Xia, Jie Tan, Ding Zhao

    Abstract: Tool use is a hallmark of advanced intelligence, exemplified in both animal behavior and robotic capabilities. This paper investigates the feasibility of imbuing robots with the ability to creatively use tools in tasks that involve implicit physical constraints and long-term planning. Leveraging Large Language Models (LLMs), we develop RoboTool, a system that accepts natural language instructions… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 19 pages, 14 figures, 2 tables

  43. arXiv:2310.12429  [pdf, other

    cs.IT eess.SP

    Reconfigurable Intelligent Surface Assisted High-Speed Train Communications: Coverage Performance Analysis and Placement Optimization

    Authors: Changzhu Liu, Ruisi He, Yong Niu, Zhu Han, Bo Ai, Meilin Gao, Zhangfeng Ma, Gongpu Wang, Zhangdui Zhong

    Abstract: Reconfigurable intelligent surface (RIS) emerges as an efficient and promising technology for the next wireless generation networks and has attracted a lot of attention owing to the capability of extending wireless coverage by reflecting signals toward targeted receivers. In this paper, we consider a RIS-assisted high-speed train (HST) communication system to enhance wireless coverage and improve… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 14 figures, accepted by IEEE Transactions on Vehicular Technology

  44. arXiv:2310.10902  [pdf, other

    cs.AR eess.SP

    Reuse Kernels or Activations? A Flexible Dataflow for Low-latency Spectral CNN Acceleration

    Authors: Yue Niu, Rajgopal Kannan, Ajitesh Srivastava, Viktor Prasanna

    Abstract: Spectral-domain CNNs have been shown to be more efficient than traditional spatial CNNs in terms of reducing computation complexity. However they come with a `kernel explosion' problem that, even after compression (pruning), imposes a high memory burden and off-chip bandwidth requirement for kernel access. This creates a performance gap between the potential acceleration offered by compression and… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 11 pages, 11 figures Accepted to ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA) 2020

  45. arXiv:2310.08348  [pdf, other

    cs.LG

    LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

    Authors: Yazhe Niu, Yuan Pu, Zhenjie Yang, Xueyan Li, Tong Zhou, Jiyuan Ren, Shuai Hu, Hongsheng Li, Yu Liu

    Abstract: Building agents based on tree-search planning capabilities with learned models has achieved remarkable success in classic decision-making problems, such as Go and Atari. However, it has been deemed challenging or even infeasible to extend Monte Carlo Tree Search (MCTS) based algorithms to diverse real-world applications, especially when these environments involve complex action spaces and signific… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Spotlight

  46. arXiv:2310.05900  [pdf, other

    quant-ph cs.LG

    Learning to Decode the Surface Code with a Recurrent, Transformer-Based Neural Network

    Authors: Johannes Bausch, Andrew W Senior, Francisco J H Heras, Thomas Edlich, Alex Davies, Michael Newman, Cody Jones, Kevin Satzinger, Murphy Yuezhen Niu, Sam Blackwell, George Holland, Dvir Kafri, Juan Atalaya, Craig Gidney, Demis Hassabis, Sergio Boixo, Hartmut Neven, Pushmeet Kohli

    Abstract: Quantum error-correction is a prerequisite for reliable quantum computation. Towards this goal, we present a recurrent, transformer-based neural network which learns to decode the surface code, the leading quantum error-correction code. Our decoder outperforms state-of-the-art algorithmic decoders on real-world data from Google's Sycamore quantum processor for distance 3 and 5 surface codes. On di… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    MSC Class: 81P73; 68T07 ACM Class: I.2.0; J.2

  47. arXiv:2310.00871  [pdf, other

    cs.RO cs.LG

    COMPOSER: Scalable and Robust Modular Policies for Snake Robots

    Authors: Yuyou Zhang, Yaru Niu, Xingyu Liu, Ding Zhao

    Abstract: Snake robots have showcased remarkable compliance and adaptability in their interaction with environments, mirroring the traits of their natural counterparts. While their hyper-redundant and high-dimensional characteristics add to this adaptability, they also pose great challenges to robot control. Instead of perceiving the hyper-redundancy and flexibility of snake robots as mere challenges, there… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: 7 pages, 5 figures

  48. arXiv:2309.16873  [pdf, other

    cs.RO

    Predicting Object Interactions with Behavior Primitives: An Application in Stowing Tasks

    Authors: Haonan Chen, Yilong Niu, Kaiwen Hong, Shui**g Liu, Yixuan Wang, Yunzhu Li, Katherine Driggs-Campbell

    Abstract: Stowing, the task of placing objects in cluttered shelves or bins, is a common task in warehouse and manufacturing operations. However, this task is still predominantly carried out by human workers as stowing is challenging to automate due to the complex multi-object interactions and long-horizon nature of the task. Previous works typically involve extensive data collection and costly human labeli… ▽ More

    Submitted 3 November, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Project Page: https://haonan16.github.io/stow_page/ 16 pages, 9 figures, Accepted for an oral presentation at CoRL 2023

  49. arXiv:2309.10528  [pdf, other

    cs.CV

    Retinex-guided Channel-grou** based Patch Swap for Arbitrary Style Transfer

    Authors: Chang Liu, Yi Niu, Mingming Ma, Fu Li, Guangming Shi

    Abstract: The basic principle of the patch-matching based style transfer is to substitute the patches of the content image feature maps by the closest patches from the style image feature maps. Since the finite features harvested from one single aesthetic style image are inadequate to represent the rich textures of the content natural image, existing techniques treat the full-channel style feature patches a… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  50. arXiv:2309.10479  [pdf, other

    cs.CV

    RECALL+: Adversarial Web-based Replay for Continual Learning in Semantic Segmentation

    Authors: Chang Liu, Giulia Rizzoli, Francesco Barbato, Andrea Maracani, Marco Toldo, Umberto Michieli, Yi Niu, Pietro Zanuttigh

    Abstract: Catastrophic forgetting of previous knowledge is a critical issue in continual learning typically handled through various regularization strategies. However, existing methods struggle especially when several incremental steps are performed. In this paper, we extend our previous approach (RECALL) and tackle forgetting by exploiting unsupervised web-crawled data to retrieve examples of old classes f… ▽ More

    Submitted 16 February, 2024; v1 submitted 19 September, 2023; originally announced September 2023.