Skip to main content

Showing 1–50 of 106 results for author: Cai, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18045  [pdf, other

    cs.CL cs.AI

    PharmGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry

    Authors: Linqing Chen, Weilei Wang, Zilong Bai, Peng Xu, Yan Fang, Jie Fang, Wentao Wu, Lizhi Zhou, Ruiji Zhang, Yubin Xia, Chaobo Xu, Ran Hu, Licong Xu, Qijun Cai, Haoran Hua, **g Sun, ** Liu, Tian Qiu, Haowen Liu, Meng Hu, Xiuwen Li, Fei Gao, Yufu Wang, Lin Tie, Chaochao Wang , et al. (9 additional authors not shown)

    Abstract: Large language models (LLMs) have revolutionized Natural Language Processing (NLP) by by minimizing the need for complex feature engineering. However, the application of LLMs in specialized domains like biopharmaceuticals and chemistry remains largely unexplored. These fields are characterized by intricate terminologies, specialized knowledge, and a high demand for precision areas where general pu… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.16422  [pdf, other

    cs.CV cs.AI

    Exploring Cross-Domain Few-Shot Classification via Frequency-Aware Prompting

    Authors: Tiange Zhang, Qing Cai, Feng Gao, Lin Qi, Junyu Dong

    Abstract: Cross-Domain Few-Shot Learning has witnessed great stride with the development of meta-learning. However, most existing methods pay more attention to learning domain-adaptive inductive bias (meta-knowledge) through feature-wise manipulation or task diversity improvement while neglecting the phenomenon that deep networks tend to rely more on high-frequency cues to make the classification decision,… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.14015  [pdf, other

    cs.LG

    CohortNet: Empowering Cohort Discovery for Interpretable Healthcare Analytics

    Authors: Qingpeng Cai, Kai** Zheng, H. V. Jagadish, Beng Chin Ooi, James Yip

    Abstract: Cohort studies are of significant importance in the field of healthcare analysis. However, existing methods typically involve manual, labor-intensive, and expert-driven pattern definitions or rely on simplistic clustering techniques that lack medical relevance. Automating cohort studies with interpretable patterns has great potential to facilitate healthcare analysis but remains an unmet need in p… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures

  4. Modeling User Retention through Generative Flow Networks

    Authors: Ziru Liu, Shuchang Liu, Bin Yang, Zhenghai Xue, Qingpeng Cai, Xiangyu Zhao, Zijian Zhang, Lantao Hu, Han Li, Peng Jiang

    Abstract: Recommender systems aim to fulfill the user's daily demands. While most existing research focuses on maximizing the user's engagement with the system, it has recently been pointed out that how frequently the users come back for the service also reflects the quality and stability of recommendations. However, optimizing this user retention behavior is non-trivial and poses several challenges includi… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: KDD-ADS 2024

  5. arXiv:2406.02213  [pdf, other

    cs.LG

    Rectifying Reinforcement Learning for Reward Matching

    Authors: Haoran He, Emmanuel Bengio, Qingpeng Cai, Ling Pan

    Abstract: The Generative Flow Network (GFlowNet) is a probabilistic framework in which an agent learns a stochastic policy and flow functions to sample objects with probability proportional to an unnormalized reward function. GFlowNets share a strong resemblance to reinforcement learning (RL), that typically aims to maximize reward, due to their sequential decision-making processes. Recent works have studie… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2406.01901  [pdf, other

    cs.LG

    Bifurcated Generative Flow Networks

    Authors: Chunhui Li, Cheng-Hao Liu, Dianbo Liu, Qingpeng Cai, Ling Pan

    Abstract: Generative Flow Networks (GFlowNets), a new family of probabilistic samplers, have recently emerged as a promising framework for learning stochastic policies that generate high-quality and diverse objects proportionally to their rewards. However, existing GFlowNets often suffer from low data efficiency due to the direct parameterization of edge flows or reliance on backward policies that may strug… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2405.09788  [pdf, ps, other

    cs.CY

    Synthesizing Proteins on the Graphics Card. Protein Folding and the Limits of Critical AI Studies

    Authors: Fabian Offert, Paul Kim, Qiaoyu Cai

    Abstract: This paper investigates the application of the transformer architecture in protein folding, as exemplified by DeepMind's AlphaFold project, and its implications for the understanding of large language models as models of language. The prevailing discourse often assumes a ready-made analogy between proteins -- encoded as sequences of amino acids -- and natural language -- encoded as sequences of di… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  8. M3oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework

    Authors: Zijian Zhang, Shuchang Liu, Jiaao Yu, Qingpeng Cai, Xiangyu Zhao, Chunxu Zhang, Ziru Liu, Qidong Liu, Hongwei Zhao, Lantao Hu, Peng Jiang, Kun Gai

    Abstract: Multi-domain recommendation and multi-task recommendation have demonstrated their effectiveness in leveraging common information from different domains and objectives for comprehensive user modeling. Nonetheless, the practical recommendation usually faces multiple domains and tasks simultaneously, which cannot be well-addressed by current methods. To this end, we introduce M3oE, an adaptive Multi-… ▽ More

    Submitted 12 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  9. arXiv:2404.18255  [pdf, other

    cs.CL cs.AI

    PatentGPT: A Large Language Model for Intellectual Property

    Authors: Zilong Bai, Ruiji Zhang, Linqing Chen, Qijun Cai, Yuan Zhong, Cong Wang, Yan Fang, Jie Fang, **g Sun, Weikuan Wang, Lizhi Zhou, Haoran Hua, Tian Qiu, Chaochao Wang, Cheng Sun, Jian** Lu, Yixin Wang, Yubin Xia, Meng Hu, Haowen Liu, Peng Xu, Licong Xu, Fu Bian, Xiaolong Gu, Lisha Zhang , et al. (2 additional authors not shown)

    Abstract: In recent years, large language models(LLMs) have attracted significant attention due to their exceptional performance across a multitude of natural language process tasks, and have been widely applied in various fields. However, the application of large language models in the Intellectual Property (IP) domain is challenging due to the strong need for specialized knowledge, privacy protection, pro… ▽ More

    Submitted 4 June, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: 19 pages, 9 figures

    ACM Class: I.2.7

  10. arXiv:2404.14219  [pdf, other

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra , et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset… ▽ More

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  11. Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention

    Authors: Ziru Liu, Shuchang Liu, Zijian Zhang, Qingpeng Cai, Xiangyu Zhao, Kesen Zhao, Lantao Hu, Peng Jiang, Kun Gai

    Abstract: In the landscape of Recommender System (RS) applications, reinforcement learning (RL) has recently emerged as a powerful tool, primarily due to its proficiency in optimizing long-term rewards. Nevertheless, it suffers from instability in the learning process, stemming from the intricate interactions among bootstrap**, off-policy training, and function approximation. Moreover, in multi-reward rec… ▽ More

    Submitted 10 June, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: SIGIR 2024

  12. arXiv:2404.01150  [pdf, other

    cs.RO

    Visual-inertial state estimation based on Chebyshev polynomial optimization

    Authors: Hongyu Zhang, Maoran Zhu, Qi Cai, Yuanxin Wu

    Abstract: This paper proposes an innovative state estimation method for visual-inertial fusion based on Chebyshev polynomial optimization. Specifically, the pose is modeled as a Chebyshev polynomial of a certain order, and its time derivatives are used to calculate linear acceleration and angular velocity, which, along with inertial measurements, constitute dynamic constraints. This is coupled with a visual… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  13. arXiv:2403.17870  [pdf, other

    cs.CV cs.MM

    Boosting Diffusion Models with Moving Average Sampling in Frequency Domain

    Authors: Yurui Qian, Qi Cai, Yingwei Pan, Yehao Li, Ting Yao, Qibin Sun, Tao Mei

    Abstract: Diffusion models have recently brought a powerful revolution in image generation. Despite showing impressive generative capabilities, most of these models rely on the current sample to denoise the next one, possibly resulting in denoising instability. In this paper, we reinterpret the iterative denoising process as model optimization and leverage a moving average mechanism to ensemble all the prio… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  14. arXiv:2403.16209  [pdf

    cs.CV cs.AI

    Image Captioning in news report scenario

    Authors: Tianrui Liu, Qi Cai, Changxin Xu, Bo Hong, Jize Xiong, Yuxin Qiao, Tsungwei Yang

    Abstract: Image captioning strives to generate pertinent captions for specified images, situating itself at the crossroads of Computer Vision (CV) and Natural Language Processing (NLP). This endeavor is of paramount importance with far-reaching applications in recommendation systems, news outlets, social media, and beyond. Particularly within the realm of news reporting, captions are expected to encompass d… ▽ More

    Submitted 1 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures

  15. arXiv:2403.16206  [pdf

    cs.AI

    Rumor Detection with a novel graph neural network approach

    Authors: Tianrui Liu, Qi Cai, Changxin Xu, Bo Hong, Fanghao Ni, Yuxin Qiao, Tsungwei Yang

    Abstract: The wide spread of rumors on social media has caused a negative impact on people's daily life, leading to potential panic, fear, and mental health problems for the public. How to debunk rumors as early as possible remains a challenging problem. Existing studies mainly leverage information propagation structure to detect rumors, while very few works focus on correlation among users that they may co… ▽ More

    Submitted 1 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: 10 pages, 5 figures

  16. arXiv:2403.07279  [pdf, other

    cs.CL

    A Survey of Explainable Knowledge Tracing

    Authors: Yanhong Bai, Jiabao Zhao, Tingjiang Wei, Qing Cai, Liang He

    Abstract: With the long term accumulation of high quality educational data, artificial intelligence has shown excellent performance in knowledge tracing. However, due to the lack of interpretability and transparency of some algorithms, this approach will result in reduced stakeholder trust and a decreased acceptance of intelligent decisions. Therefore, algorithms need to achieve high accuracy, and users nee… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  17. arXiv:2403.04444  [pdf, other

    cs.CV

    Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser

    Authors: Qingyuan Cai, Xuecai Hu, Saihui Hou, Li Yao, Yongzhen Huang

    Abstract: Recently, diffusion-based methods for monocular 3D human pose estimation have achieved state-of-the-art (SOTA) performance by directly regressing the 3D joint coordinates from the 2D pose sequence. Although some methods decompose the task into bone length and bone direction prediction based on the human anatomical skeleton to explicitly incorporate more human body prior constraints, the performanc… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted by AAAI24

  18. arXiv:2403.01895  [pdf, other

    cs.LG cs.AI

    Unsupervised Distance Metric Learning for Anomaly Detection Over Multivariate Time Series

    Authors: Hanyang Yuan, Qinglin Cai, Keting Yin

    Abstract: Distance-based time series anomaly detection methods are prevalent due to their relative non-parametric nature and interpretability. However, the commonly used Euclidean distance is sensitive to noise. While existing works have explored dynamic time war** (DTW) for its robustness, they only support supervised tasks over multivariate time series (MTS), leaving a scarcity of unsupervised methods.… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  19. Future Impact Decomposition in Request-level Recommendations

    Authors: Xiaobei Wang, Shuchang Liu, Xueliang Wang, Qingpeng Cai, Lantao Hu, Han Li, Peng Jiang, Kun Gai, Guangming Xie

    Abstract: In recommender systems, reinforcement learning solutions have shown promising results in optimizing the interaction sequence between users and the system over the long-term performance. For practical reasons, the policy's actions are typically designed as recommending a list of items to handle users' frequent and continuous browsing requests more efficiently. In this list-wise recommendation scena… ▽ More

    Submitted 18 June, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 12 pages, 8 figures

    ACM Class: H.3.3

  20. arXiv:2401.13357  [pdf, other

    cs.CV

    Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry

    Authors: Qi Cai, Xinrui Li, Yuanxin Wu

    Abstract: How to efficiently and accurately handle image matching outliers is a critical issue in two-view relative estimation. The prevailing RANSAC method necessitates that the minimal point pairs be inliers. This paper introduces a linear relative pose estimation algorithm for n $( n \geq 6$) point pairs, which is founded on the recent pose-only imaging geometry to filter out outliers by proper reweighti… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  21. arXiv:2401.12435  [pdf, ps, other

    cs.AI cs.LG math.AP

    Quantitative Analysis of Molecular Transport in the Extracellular Space Using Physics-Informed Neural Network

    Authors: Jiayi Xie, Hongfeng Li, ** Cheng, Qingrui Cai, Hanbo Tan, Lingyun Zu, Xiaobo Qu, Hongbin Han

    Abstract: The brain extracellular space (ECS), an irregular, extremely tortuous nanoscale space located between cells or between cells and blood vessels, is crucial for nerve cell survival. It plays a pivotal role in high-level brain functions such as memory, emotion, and sensation. However, the specific form of molecular transport within the ECS remain elusive. To address this challenge, this paper propose… ▽ More

    Submitted 23 January, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  22. arXiv:2311.13752  [pdf, other

    cs.CV cs.AI

    3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology

    Authors: Asma Ben Abacha, Alberto Santamaria-Pang, Ho Hin Lee, Jameson Merkow, Qin Cai, Surya Teja Devarakonda, Abdullah Islam, Julia Gong, Matthew P. Lungren, Thomas Lin, Noel C Codella, Ivan Tarapov

    Abstract: The increasing use of medical imaging in healthcare settings presents a significant challenge due to the increasing workload for radiologists, yet it also offers opportunity for enhancing healthcare outcomes if effectively leveraged. 3D image retrieval holds potential to reduce radiologist workloads by enabling clinicians to efficiently search through diagnostically similar or otherwise relevant c… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  23. arXiv:2311.02551  [pdf

    eess.SY cs.GT cs.LG

    High-dimensional Bid Learning for Energy Storage Bidding in Energy Markets

    Authors: **yu Liu, Hongye Guo, Qinghu Tang, En Lu, Qiuna Cai, Qixin Chen

    Abstract: With the growing penetration of renewable energy resource, electricity market prices have exhibited greater volatility. Therefore, it is important for Energy Storage Systems(ESSs) to leverage the multidimensional nature of energy market bids to maximize profitability. However, current learning methods cannot fully utilize the high-dimensional price-quantity bids in the energy markets. To address t… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures, Accepted by the 15th International Conference on Applied Energy (ICAE2023)

  24. arXiv:2310.03984  [pdf, other

    cs.IR cs.LG

    AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement

    Authors: Zhenghai Xue, Qingpeng Cai, Tianyou Zuo, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, Bo An

    Abstract: Growing attention has been paid to Reinforcement Learning (RL) algorithms when optimizing long-term user engagement in sequential recommendation tasks. One challenge in large-scale online recommendation systems is the constant and complicated changes in users' behavior patterns, such as interaction rates and retention tendencies. When formulated as a Markov Decision Process (MDP), the dynamics and… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: Preprint. Under Review

  25. arXiv:2309.16140  [pdf, other

    cs.MM cs.CV

    CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting

    Authors: Shaoxiang Guo, Qing Cai, Lin Qi, Junyu Dong

    Abstract: Contrastive Language-Image Pre-training (CLIP) starts to emerge in many computer vision tasks and has achieved promising performance. However, it remains underexplored whether CLIP can be generalized to 3D hand pose estimation, as bridging text prompts with pose-aware features presents significant challenges due to the discrete nature of joint positions in 3D space. In this paper, we make one of t… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted In Proceedings of the 31st ACM International Conference on Multimedia (MM' 23)

  26. arXiv:2309.12645  [pdf, other

    cs.IR

    KuaiSim: A Comprehensive Simulator for Recommender Systems

    Authors: Kesen Zhao, Shuchang Liu, Qingpeng Cai, Xiangyu Zhao, Ziru Liu, Dong Zheng, Peng Jiang, Kun Gai

    Abstract: Reinforcement Learning (RL)-based recommender systems (RSs) have garnered considerable attention due to their ability to learn optimal recommendation policies and maximize long-term user rewards. However, deploying RL models directly in online environments and generating authentic data through A/B tests can pose challenges and require substantial resources. Simulators offer an alternative approach… ▽ More

    Submitted 19 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

  27. arXiv:2309.09224  [pdf, other

    cs.RO

    CapsuleBot: A Novel Compact Hybrid Aerial-Ground Robot with Two Actuated-wheel-rotors

    Authors: Zhi Zheng, Qifeng Cai, Xinhang Xu, Muqing Cao, Huan Yu, Jihao Li, Guodong Lu, ** Wang

    Abstract: This paper presents the design, modeling, and experimental validation of CapsuleBot, a compact hybrid aerial-ground vehicle designed for long-term covert reconnaissance. CapsuleBot combines the manoeuvrability of bicopter in the air with the energy efficiency and noise reduction of ground vehicles on the ground. To accomplish this, a structure named actuated-wheel-rotor has been designed, utilizin… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 7 pages, 10 figures, submitted to 2024 IEEE International Conference on Robotics and Automation (ICRA). This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  28. arXiv:2308.06212  [pdf, other

    cs.IR cs.CL

    A Large Language Model Enhanced Conversational Recommender System

    Authors: Yue Feng, Shuchang Liu, Zhenghai Xue, Qingpeng Cai, Lantao Hu, Peng Jiang, Kun Gai, Fei Sun

    Abstract: Conversational recommender systems (CRSs) aim to recommend high-quality items to users through a dialogue interface. It usually contains multiple sub-tasks, such as user preference elicitation, recommendation, explanation, and item information search. To develop effective CRSs, there are some challenges: 1) how to properly manage sub-tasks; 2) how to effectively solve different sub-tasks; and 3) h… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

  29. arXiv:2306.03552  [pdf, other

    cs.LG cs.AI

    State Regularized Policy Optimization on Data with Dynamics Shift

    Authors: Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An

    Abstract: In many real-world scenarios, Reinforcement Learning (RL) algorithms are trained on data with dynamics shift, i.e., with different underlying environment dynamics. A majority of current methods address such issue by training context encoders to identify environment parameters. Data with dynamics shift are separated according to their environment parameters to train the corresponding policy. Howeve… ▽ More

    Submitted 21 February, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPS 2023

  30. Generative Flow Network for Listwise Recommendation

    Authors: Shuchang Liu, Qingpeng Cai, Zhankui He, Bowen Sun, Julian McAuley, Dong Zheng, Peng Jiang, Kun Gai

    Abstract: Personalized recommender systems fulfill the daily demands of customers and boost online businesses. The goal is to learn a policy that can generate a list of items that matches the user's demand or interest. While most existing methods learn a pointwise scoring model that predicts the ranking score of each individual item, recent research shows that the listwise approach can further improve the r… ▽ More

    Submitted 9 June, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: 11 pages, 5 figures, 7 tables

    Journal ref: KDD '23, August 6-10, 2023, Long Beach, CA, USA

  31. arXiv:2303.00668  [pdf, other

    cs.RO

    Roller-Quadrotor: A Novel Hybrid Terrestrial/Aerial Quadrotor with Unicycle-Driven and Rotor-Assisted Turning

    Authors: Zhi Zheng, ** Wang, Yuze Wu, Qifeng Cai, Huan Yu, Ruibin Zhang, Jie Tu, Jun Meng, Guodong Lu, Fei Gao

    Abstract: The Roller-Quadrotor is a novel quadrotor that combines the maneuverability of aerial drones with the endurance of ground vehicles. This work focuses on the design, modeling, and experimental validation of the Roller-Quadrotor. Flight capabilities are achieved through a quadrotor configuration, with four thrust-providing actuators. Additionally, rolling motion is facilitated by a unicycle-driven a… ▽ More

    Submitted 26 June, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 8 pages, 10 figures, accepted by 2023 IEEE/RSJ International Conference on Intelligent Robots(IROS). This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  32. Exploration and Regularization of the Latent Action Space in Recommendation

    Authors: Shuchang Liu, Qingpeng Cai, Bowen Sun, Yuhao Wang, Ji Jiang, Dong Zheng, Kun Gai, Peng Jiang, Xiangyu Zhao, Yongfeng Zhang

    Abstract: In recommender systems, reinforcement learning solutions have effectively boosted recommendation performance because of their ability to capture long-term user-system interaction. However, the action space of the recommendation policy is a list of items, which could be extremely large with a dynamic candidate item pool. To overcome this challenge, we propose a hyper-actor and critic learning frame… ▽ More

    Submitted 7 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: Proceedings of the ACM Web Conference 2023 (WWW '23), May 1--5, 2023, Austin, TX, USA

  33. Multi-Task Recommendations with Reinforcement Learning

    Authors: Ziru Liu, Jiejie Tian, Qingpeng Cai, Xiangyu Zhao, **gtong Gao, Shuchang Liu, Dayou Chen, Tonghao He, Dong Zheng, Peng Jiang, Kun Gai

    Abstract: In recent years, Multi-task Learning (MTL) has yielded immense success in Recommender System (RS) applications. However, current MTL-based recommendation models tend to disregard the session-wise patterns of user-item interactions because they are predominantly constructed based on item-wise datasets. Moreover, balancing multiple objectives has always been a challenge in this field, which is typic… ▽ More

    Submitted 9 March, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: TheWebConf2023

  34. arXiv:2302.01724  [pdf, other

    cs.LG cs.IR

    Reinforcing User Retention in a Billion Scale Short Video Recommender System

    Authors: Qingpeng Cai, Shuchang Liu, Xueliang Wang, Tianyou Zuo, Wentao Xie, Bin Yang, Dong Zheng, Peng Jiang, Kun Gai

    Abstract: Recently, short video platforms have achieved rapid user growth by recommending interesting content to users. The objective of the recommendation is to optimize user retention, thereby driving the growth of DAU (Daily Active Users). Retention is a long-term feedback after multiple interactions of users and the system, and it is hard to decompose retention reward to each item or a list of items. Th… ▽ More

    Submitted 12 February, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Journal ref: The Web Conference 2023 Industry Track

  35. arXiv:2302.01680  [pdf, other

    cs.LG cs.IR

    Two-Stage Constrained Actor-Critic for Short Video Recommendation

    Authors: Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai

    Abstract: The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users sequentially interact with the system and provide complex and multi-faceted responses, including watch time and various types of interactions with multiple videos. One the one hand, the platforms aims at optimizing the users' cumulative wa… ▽ More

    Submitted 9 January, 2024; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Code Available at https://github.com/AIDefender/TSCAC. arXiv admin note: substantial text overlap with arXiv:2205.13248

    Journal ref: The Web Conference 2023

  36. arXiv:2212.14852  [pdf, other

    cs.LG

    An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models

    Authors: Yufeng Zhang, Boyi Liu, Qi Cai, Lingxiao Wang, Zhaoran Wang

    Abstract: With the attention mechanism, transformers achieve significant empirical successes. Despite the intuitive understanding that transformers perform relational inference over long sequences to produce desirable representations, we lack a rigorous theory on how the attention mechanism achieves it. In particular, several intriguing questions remain open: (a) What makes a desirable representation? (b) H… ▽ More

    Submitted 31 March, 2024; v1 submitted 30 December, 2022; originally announced December 2022.

    Comments: 85 pages, 7 figures, add acknowledgement

  37. arXiv:2212.02779  [pdf, other

    cs.IR cs.AI cs.LG

    PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement

    Authors: Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An

    Abstract: Current advances in recommender systems have been remarkably successful in optimizing immediate engagement. However, long-term user engagement, a more desirable performance metric, remains difficult to improve. Meanwhile, recent reinforcement learning (RL) algorithms have shown their effectiveness in a variety of long-term goal optimization tasks. For this reason, RL is widely considered as a prom… ▽ More

    Submitted 2 June, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

  38. arXiv:2211.14052  [pdf, other

    cs.CV

    Towards Hard-pose Virtual Try-on via 3D-aware Global Correspondence Learning

    Authors: Zaiyu Huang, Hanhui Li, Zhenyu Xie, Michael Kampffmeyer, Qingling Cai, Xiaodan Liang

    Abstract: In this paper, we target image-based person-to-person virtual try-on in the presence of diverse poses and large viewpoint variations. Existing methods are restricted in this setting as they estimate garment war** flows mainly based on 2D poses and appearance, which omits the geometric prior of the 3D human body shape. Moreover, current garment war** methods are confined to localized regions, w… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  39. arXiv:2211.08248  [pdf, other

    cs.CV

    3D Cascade RCNN: High Quality Object Detection in Point Clouds

    Authors: Qi Cai, Yingwei Pan, Ting Yao, Tao Mei

    Abstract: Recent progress on 2D object detection has featured Cascade RCNN, which capitalizes on a sequence of cascade detectors to progressively improve proposal quality, towards high-quality object detection. However, there has not been evidence in support of building such cascade structures for 3D object detection, a challenging detection scenario with highly sparse LiDAR point clouds. In this work, we p… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: IEEE Transactions on Image Processing (TIP) 2022. The source code is publicly available at \url{https://github.com/caiqi/Cascasde-3D}

  40. arXiv:2211.02806  [pdf

    cs.AI eess.SY

    Modified EDAS Method Based on Cumulative Prospect Theory for Multiple Attributes Group Decision Making with Interval-valued Intuitionistic Fuzzy Information

    Authors: **g Wang, Qiang Cai, Guiwu Wei, Ningna Liao

    Abstract: The Interval-valued intuitionistic fuzzy sets (IVIFSs) based on the intuitionistic fuzzy sets combines the classical decision method is in its research and application is attracting attention. After comparative analysis, there are multiple classical methods with IVIFSs information have been applied into many practical issues. In this paper, we extended the classical EDAS method based on cumulative… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 48 pages

    MSC Class: 91B06 ACM Class: F.2.2

  41. arXiv:2210.06906  [pdf, other

    cs.CV

    Hierarchical and Progressive Image Matting

    Authors: Yu Qiao, Yuhao Liu, Ziqi Wei, Yuxin Wang, Qiang Cai, Guofeng Zhang, Xin Yang

    Abstract: Most matting researches resort to advanced semantics to achieve high-quality alpha mattes, and direct low-level features combination is usually explored to complement alpha details. However, we argue that appearance-agnostic integration can only provide biased foreground details and alpha mattes require different-level feature aggregation for better pixel-wise opacity perception. In this paper, we… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 23 pages, 11 Figures, ACM TOMM accepted

  42. arXiv:2209.12807  [pdf, other

    cs.LG cs.CV

    Out-of-Distribution Detection with Hilbert-Schmidt Independence Optimization

    Authors: **gyang Lin, Yu Wang, Qi Cai, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei

    Abstract: Outlier detection tasks have been playing a critical role in AI safety. There has been a great challenge to deal with this task. Observations show that deep neural network classifiers usually tend to incorrectly classify out-of-distribution (OOD) inputs into in-distribution classes with high confidence. Existing works attempt to solve the problem by explicitly imposing uncertainty on classifiers w… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: Source code is available at \url{https://github.com/jylins/hood}

  43. arXiv:2209.11433  [pdf, other

    eess.AS cs.SD

    The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022

    Authors: Qutang Cai, Guoqiang Hong, Zhijian Ye, Ximin Li, Haizhou Li

    Abstract: This technical report describes our system for track 1, 2 and 4 of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22). By combining several ResNet variants, our submission for track 1 attained a minDCF of 0:090 with EER 1:401%. By further incorporating three fine-tuned pre-trained models, our submission for track 2 achieved a minDCF of 0:072 with EER 1:119%. For track 4, our system consis… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: System description of VoxSRC 2022: track 1, 2 and 4

  44. arXiv:2206.06289  [pdf, other

    cs.CV cs.LG cs.MM cs.RO

    Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation

    Authors: Yingwei Pan, Yehao Li, Yiheng Zhang, Qi Cai, Fuchen Long, Zhaofan Qiu, Ting Yao, Tao Mei

    Abstract: This paper presents an overview and comparative analysis of our systems designed for the following two tracks in SAPIEN ManiSkill Challenge 2021: No Interaction Track: The No Interaction track targets for learning policies from pre-collected demonstration trajectories. We investigate both imitation learning-based approach, i.e., imitating the observed behavior using classical supervised learning… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Top-performing systems for both no interaction and no restriction tracks in SAPIEN ManiSkill Challenge 2021. The source code and model are publicly available at: https://github.com/caiqi/Silver-Bullet-3D/

  45. arXiv:2206.02620  [pdf, other

    cs.IR cs.LG

    ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor

    Authors: Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Kun Gai, Bo An

    Abstract: Long-term engagement is preferred over immediate engagement in sequential recommendation as it directly affects product operational metrics such as daily active users (DAUs) and dwell time. Meanwhile, reinforcement learning (RL) is widely regarded as a promising framework for optimizing long-term engagement in sequential recommendation. However, due to expensive online interactions, it is very dif… ▽ More

    Submitted 16 June, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: Accpetd by ICLR 2023

  46. arXiv:2205.13476  [pdf, ps, other

    cs.LG cs.AI eess.SY stat.ML

    Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency

    Authors: Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang

    Abstract: Reinforcement learning in partially observed Markov decision processes (POMDPs) faces two challenges. (i) It often takes the full history to predict the future, which induces a sample complexity that scales exponentially with the horizon. (ii) The observation and state spaces are often continuous, which induces a sample complexity that scales exponentially with the extrinsic dimension. Addressing… ▽ More

    Submitted 31 March, 2024; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Accepted by ICLR 2022

  47. arXiv:2205.13248  [pdf, other

    cs.LG cs.IR

    Constrained Reinforcement Learning for Short Video Recommendation

    Authors: Qingpeng Cai, Ruohan Zhan, Chi Zhang, Jie Zheng, Guangwei Ding, **hua Gong, Dong Zheng, Peng Jiang

    Abstract: The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users provide complex and multi-faceted responses towards recommendations, including watch time and various types of interactions with videos. As a result, established recommendation algorithms that concern a single objective are not adequate to… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

  48. arXiv:2204.09787  [pdf, ps, other

    cs.LG math.OC stat.ML

    Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency

    Authors: Qi Cai, Zhuoran Yang, Zhaoran Wang

    Abstract: We study reinforcement learning for partially observed Markov decision processes (POMDPs) with infinite observation and state spaces, which remains less investigated theoretically. To this end, we make the first attempt at bridging partial observability and function approximation for a class of POMDPs with a linear structure. In detail, we propose a reinforcement learning algorithm (Optimistic Exp… ▽ More

    Submitted 31 March, 2024; v1 submitted 20 April, 2022; originally announced April 2022.

  49. HIPA: Hierarchical Patch Transformer for Single Image Super Resolution

    Authors: Qing Cai, Yiming Qian, **xing Li, Jun Lv, Yee-Hong Yang, Feng Wu, David Zhang

    Abstract: Transformer-based architectures start to emerge in single image super resolution (SISR) and have achieved promising performance. Most existing Vision Transformers divide images into the same number of patches with a fixed size, which may not be optimal for restoring patches with different levels of texture richness. This paper presents HIPA, a novel Transformer architecture that progressively reco… ▽ More

    Submitted 6 June, 2023; v1 submitted 19 March, 2022; originally announced March 2022.

  50. arXiv:2203.02533  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation

    Authors: Wenqiao Zhang, Lei Zhu, James Hallinan, Andrew Makmur, Shengyu Zhang, Qingpeng Cai, Beng Chin Ooi

    Abstract: In this paper, we propose a novel semi-supervised learning (SSL) framework named BoostMIS that combines adaptive pseudo labeling and informative active annotation to unleash the potential of medical image SSL models: (1) BoostMIS can adaptively leverage the cluster assumption and consistency regularization of the unlabeled data according to the current learning status. This strategy can adaptively… ▽ More

    Submitted 21 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: 11 pages

    Journal ref: CVPR 2022