Skip to main content

Showing 1–46 of 46 results for author: Lyu, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01300  [pdf, other

    cs.CL cs.AI cs.LG

    Collaborative Performance Prediction for Large Language Models

    Authors: Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma

    Abstract: Comprehensively understanding and accurately predicting the performance of large language models across diverse downstream tasks has emerged as a pivotal challenge in NLP research. The pioneering scaling law on downstream works demonstrated intrinsic similarities within model families and utilized such similarities for performance prediction. However, they tend to overlook the similarities between… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.02609  [pdf, other

    cs.LG cs.AI

    Less is More: Pseudo-Label Filtering for Continual Test-Time Adaptation

    Authors: Jiayao Tan, Fan Lyu, Chenggong Ni, Tingliang Feng, Fuyuan Hu, Zhang Zhang, Shaochuang Zhao, Liang Wang

    Abstract: Continual Test-Time Adaptation (CTTA) aims to adapt a pre-trained model to a sequence of target domains during the test phase without accessing the source data. To adapt to unlabeled data from unknown domains, existing methods rely on constructing pseudo-labels for all samples and updating the model through self-training. However, these pseudo-labels often involve noise, leading to insufficient ad… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.03335 by other authors

  3. arXiv:2405.17054  [pdf, other

    cs.LG

    Improving Data-aware and Parameter-aware Robustness for Continual Learning

    Authors: Hanxi Xiao, Fan Lyu

    Abstract: The goal of Continual Learning (CL) task is to continuously learn multiple new tasks sequentially while achieving a balance between the plasticity and stability of new and old knowledge. This paper analyzes that this insufficiency arises from the ineffective handling of outliers, leading to abnormal gradients and unexpected model updates. To address this issue, we enhance the data-aware and parame… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2405.14602  [pdf, other

    cs.LG

    Controllable Continual Test-Time Adaptation

    Authors: Ziqi Shi, Fan Lyu, Ye Liu, Fanhua Shang, Fuyuan Hu, Wei Feng, Zhang Zhang, Liang Wang

    Abstract: Continual Test-Time Adaptation (CTTA) is an emerging and challenging task where a model trained in a source domain must adapt to continuously changing conditions during testing, without access to the original source data. CTTA is prone to error accumulation due to uncontrollable domain shifts, leading to blurred decision boundaries between categories. Existing CTTA methods primarily focus on suppr… ▽ More

    Submitted 28 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2405.09133  [pdf, other

    cs.LG

    Overcoming Domain Drift in Online Continual Learning

    Authors: Fan Lyu, Daofeng Liu, Linglan Zhao, Zhang Zhang, Fanhua Shang, Fuyuan Hu, Wei Feng, Liang Wang

    Abstract: Online Continual Learning (OCL) empowers machine learning models to acquire new knowledge online across a sequence of tasks. However, OCL faces a significant challenge: catastrophic forgetting, wherein the model learned in previous tasks is substantially overwritten upon encountering new tasks, leading to a biased forgetting of prior knowledge. Moreover, the continual doman drift in sequential lea… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  6. arXiv:2404.07200  [pdf, other

    cs.LG

    Toward a Better Understanding of Fourier Neural Operators: Analysis and Improvement from a Spectral Perspective

    Authors: Shaoxiang Qin, Fuyuan Lyu, Wenhui Peng, Dingyang Geng, Ju Wang, Nai** Gao, Xue Liu, Liangzhu Leon Wang

    Abstract: In solving partial differential equations (PDEs), Fourier Neural Operators (FNOs) have exhibited notable effectiveness compared to Convolutional Neural Networks (CNNs). This paper presents clear empirical evidence through spectral analysis to elucidate the superiority of FNO over CNNs: FNO is significantly more capable of learning low-frequencies. This empirical evidence also unveils FNO's distinc… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  7. arXiv:2403.17442  [pdf, other

    cs.IR

    Touch the Core: Exploring Task Dependence Among Hybrid Targets for Recommendation

    Authors: Xing Tang, Yang Qiao, Fuyuan Lyu, Dugang Liu, Xiuqiang He

    Abstract: As user behaviors become complicated on business platforms, online recommendations focus more on how to touch the core conversions, which are highly related to the interests of platforms. These core conversions are usually continuous targets, such as \textit{watch time}, \textit{revenue}, and so on, whose predictions can be enhanced by previous discrete conversion actions. Therefore, multi-task le… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  8. arXiv:2403.12559  [pdf, other

    cs.CV cs.LG

    Confidence Self-Calibration for Multi-Label Class-Incremental Learning

    Authors: Kaile Du, Yifan Zhou, Fan Lyu, Yuyang Li, Chen Lu, Guangcan Liu

    Abstract: The partial label challenge in Multi-Label Class-Incremental Learning (MLCIL) arises when only the new classes are labeled during training, while past and future labels remain unavailable. This issue leads to a proliferation of false-positive errors due to erroneously high confidence multi-label predictions, exacerbating catastrophic forgetting within the disjoint label space. In this paper, we ai… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  9. arXiv:2402.18609  [pdf, other

    cs.LG cs.AI

    ICE-SEARCH: A Language Model-Driven Feature Selection Approach

    Authors: Tianze Yang, Tianyi Yang, Fuyuan Lyu, Shaoshan Liu, Xue, Liu

    Abstract: This study unveils the In-Context Evolutionary Search (ICE-SEARCH) method, which is among the first works that melds large language models (LLMs) with evolutionary algorithms for feature selection (FS) tasks and demonstrates its effectiveness in Medical Predictive Analytics (MPA) applications. ICE-SEARCH harnesses the crossover and mutation capabilities inherent in LLMs within an evolutionary fram… ▽ More

    Submitted 8 May, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  10. arXiv:2402.08182  [pdf, other

    cs.LG stat.ML

    Variational Continual Test-Time Adaptation

    Authors: Fan Lyu, Kaile Du, Yuyang Li, Hanyu Zhao, Zhang Zhang, Guangcan Liu, Liang Wang

    Abstract: The prior drift is crucial in Continual Test-Time Adaptation (CTTA) methods that only use unlabeled test data, as it can cause significant error propagation. In this paper, we introduce VCoTTA, a variational Bayesian approach to measure uncertainties in CTTA. At the source stage, we transform a pre-trained deterministic model into a Bayesian Neural Network (BNN) via a variational warm-up strategy,… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  11. arXiv:2401.01054  [pdf, other

    cs.LG cs.AI

    Elastic Multi-Gradient Descent for Parallel Continual Learning

    Authors: Fan Lyu, Wei Feng, Yuepan Li, Qing Sun, Fanhua Shang, Liang Wan, Liang Wang

    Abstract: The goal of Continual Learning (CL) is to continuously learn from new data streams and accomplish the corresponding tasks. Previously studied CL assumes that data are given in sequence nose-to-tail for different tasks, thus indeed belonging to Serial Continual Learning (SCL). This paper studies the novel paradigm of Parallel Continual Learning (PCL) in dynamic multi-task scenarios, where a diverse… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Submited to IEEE TPAMI

  12. arXiv:2311.03526  [pdf, other

    cs.IR

    Towards Automated Negative Sampling in Implicit Recommendation

    Authors: Fuyuan Lyu, Yaochen Hu, Xing Tang, Yingxue Zhang, Ruiming Tang, Xue Liu

    Abstract: Negative sampling methods are vital in implicit recommendation models as they allow us to obtain negative instances from massive unlabeled data. Most existing approaches focus on sampling hard negative samples in various ways. These studies are orthogonal to the recommendation model and implicit datasets. However, such an idea contradicts the common belief in AutoML that the model and dataset shou… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  13. arXiv:2310.20490  [pdf, other

    cs.CV cs.LG

    Long-Tailed Learning as Multi-Objective Optimization

    Authors: Weiqi Li, Fan Lyu, Fanhua Shang, Liang Wan, Wei Feng

    Abstract: Real-world data is extremely imbalanced and presents a long-tailed distribution, resulting in models that are biased towards classes with sufficient samples and perform poorly on rare classes. Recent methods propose to rebalance classes but they undertake the seesaw dilemma (what is increasing performance on tail classes may decrease that of head classes, and vice versa). In this paper, we argue t… ▽ More

    Submitted 1 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: In submission

  14. arXiv:2310.20268  [pdf, other

    cs.CV cs.AI

    Constructing Sample-to-Class Graph for Few-Shot Class-Incremental Learning

    Authors: Fuyuan Hu, Jian Zhang, Fan Lyu, Linyan Li, Fenglei Xu

    Abstract: Few-shot class-incremental learning (FSCIL) aims to build machine learning model that can continually learn new concepts from a few data samples, without forgetting knowledge of old classes. The challenges of FSCIL lies in the limited data of new classes, which not only lead to significant overfitting issues but also exacerbates the notorious catastrophic forgetting problems. As proved in early… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  15. arXiv:2310.19113  [pdf, other

    cs.CV cs.AI eess.SP

    Dynamic V2X Autonomous Perception from Road-to-Vehicle Vision

    Authors: Jiayao Tan, Fan Lyu, Linyan Li, Fuyuan Hu, Tingliang Feng, Fenglei Xu, Rui Yao

    Abstract: Vehicle-to-everything (V2X) perception is an innovative technology that enhances vehicle perception accuracy, thereby elevating the security and reliability of autonomous systems. However, existing V2X perception methods focus on static scenes from mainly vehicle-based vision, which is constrained by sensor capabilities and communication loads. To adapt V2X perception models to dynamic scenes, we… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

  16. arXiv:2310.15342  [pdf, other

    cs.LG cs.IR

    Towards Hybrid-grained Feature Interaction Selection for Deep Sparse Network

    Authors: Fuyuan Lyu, Xing Tang, Dugang Liu, Chen Ma, Weihong Luo, Liang Chen, Xiuqiang He, Xue Liu

    Abstract: Deep sparse networks are widely investigated as a neural network architecture for prediction tasks with high-dimensional sparse features, with which feature interaction selection is a critical component. While previous methods primarily focus on how to search feature interaction in a coarse-grained space, less attention has been given to a finer granularity. In this work, we introduce a hybrid-gra… ▽ More

    Submitted 30 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 poster

  17. arXiv:2306.13382  [pdf, other

    cs.IR

    OptMSM: Optimizing Multi-Scenario Modeling for Click-Through Rate Prediction

    Authors: Xing Tang, Yang Qiao, Yuwen Fu, Fuyuan Lyu, Dugang Liu, Xiuqiang He

    Abstract: A large-scale industrial recommendation platform typically consists of multiple associated scenarios, requiring a unified click-through rate (CTR) prediction model to serve them simultaneously. Existing approaches for multi-scenario CTR prediction generally consist of two main modules: i) a scenario-aware learning module that learns a set of multi-functional representations with scenario-shared an… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: Accepted by ECML-PKDD 2023 Applied Data Science Track

  18. arXiv:2306.00315  [pdf, other

    cs.LG cs.IR

    Explicit Feature Interaction-aware Uplift Network for Online Marketing

    Authors: Dugang Liu, Xing Tang, Han Gao, Fuyuan Lyu, Xiuqiang He

    Abstract: As a key component in online marketing, uplift modeling aims to accurately capture the degree to which different treatments motivate different users, such as coupons or discounts, also known as the estimation of individual treatment effect (ITE). In an actual business scenario, the options for treatment may be numerous and complex, and there may be correlations between different treatments. In add… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted by SIGKDD 2023 Applied Data Science Track

  19. arXiv:2303.13862  [pdf, other

    cs.CV

    Two-level Graph Network for Few-Shot Class-Incremental Learning

    Authors: Hao Chen, Linyan Li, Fan Lyu, Fuyuan Hu, Fenglei Xu

    Abstract: Few-shot class-incremental learning (FSCIL) aims to design machine learning algorithms that can continually learn new concepts from a few data points, without forgetting knowledge of old classes. The difficulty lies in that limited data from new classes not only lead to significant overfitting issues but also exacerbates the notorious catastrophic forgetting problems. However, existing FSCIL metho… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2203.06953 by other authors

  20. arXiv:2303.02954  [pdf, other

    cs.LG cs.CV

    Centroid Distance Distillation for Effective Rehearsal in Continual Learning

    Authors: Daofeng Liu, Fan Lyu, Linyan Li, Fuyuan Hu

    Abstract: Rehearsal, retraining on a stored small data subset of old tasks, has been proven effective in solving catastrophic forgetting in continual learning. However, due to the sampled data may have a large bias towards the original dataset, retraining them is susceptible to driving continual domain drift of old tasks in feature space, resulting in forgetting. In this paper, we focus on tackling the cont… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  21. arXiv:2302.02241  [pdf, other

    cs.IR

    Feature Representation Learning for Click-through Rate Prediction: A Review and New Perspectives

    Authors: Fuyuan Lyu, Xing Tang, Dugang Liu, Haolun Wu, Chen Ma, Xiuqiang He, Xue Liu

    Abstract: Representation learning has been a critical topic in machine learning. In Click-through Rate Prediction, most features are represented as embedding vectors and learned simultaneously with other parameters in the model. With the development of CTR models, feature representation learning has become a trending topic and has been extensively studied by both industrial and academic researchers in recen… ▽ More

    Submitted 4 February, 2023; originally announced February 2023.

    Comments: Submitted to IJCAI 2023 Survey Track

  22. arXiv:2301.10909  [pdf, other

    cs.IR

    Optimizing Feature Set for Click-Through Rate Prediction

    Authors: Fuyuan Lyu, Xing Tang, Dugang Liu, Liang Chen, Xiuqiang He, Xue Liu

    Abstract: Click-through prediction (CTR) models transform features into latent vectors and enumerate possible feature interactions to improve performance based on the input feature set. Therefore, when selecting an optimal feature set, we should consider the influence of both feature and its interaction. However, most previous works focus on either feature field selection or only select feature interaction… ▽ More

    Submitted 26 March, 2024; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Accepted by WWW 2023 Research Tracks

  23. arXiv:2212.14464  [pdf, other

    cs.IR

    Result Diversification in Search and Recommendation: A Survey

    Authors: Haolun Wu, Yansen Zhang, Chen Ma, Fuyuan Lyu, Bowei He, Bhaskar Mitra, Xue Liu

    Abstract: Diversifying return results is an important research topic in retrieval systems in order to satisfy both the various interests of customers and the equal market exposure of providers. There has been growing attention on diversity-aware research during recent years, accompanied by a proliferation of literature on methods to promote diversity in search and recommendation. However, diversity-aware st… ▽ More

    Submitted 18 February, 2024; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: 20 pages

  24. arXiv:2211.14763  [pdf, other

    cs.CV cs.AI

    Multi-Label Continual Learning using Augmented Graph Convolutional Network

    Authors: Kaile Du, Fan Lyu, Linyan Li, Fuyuan Hu, Wei Feng, Fenglei Xu, Xuefeng Xi, Han**g Cheng

    Abstract: Multi-Label Continual Learning (MLCL) builds a class-incremental framework in a sequential multi-label image recognition data stream. The critical challenges of MLCL are the construction of label relationships on past-missing and future-missing partial labels of training data and the catastrophic forgetting on old classes, resulting in poor generalization. To solve the problems, the study proposes… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

  25. arXiv:2210.10581  [pdf, other

    cs.CL cs.CR

    CEntRE: A paragraph-level Chinese dataset for Relation Extraction among Enterprises

    Authors: Peipei Liu, Hong Li, Zhiyu Wang, Yimo Ren, Jie Liu, Fei Lyu, Hongsong Zhu, Limin Sun

    Abstract: Enterprise relation extraction aims to detect pairs of enterprise entities and identify the business relations between them from unstructured or semi-structured text data, and it is crucial for several real-world applications such as risk analysis, rating research and supply chain security. However, previous work mainly focuses on getting attribute information about enterprises like personnel and… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

  26. arXiv:2209.12241  [pdf, other

    cs.LG

    Exploring Example Influence in Continual Learning

    Authors: Qing Sun, Fan Lyu, Fanhua Shang, Wei Feng, Liang Wan

    Abstract: Continual Learning (CL) sequentially learns new tasks like human beings, with the goal to achieve better Stability (S, remembering past tasks) and Plasticity (P, adapting to new tasks). Due to the fact that past training data is not available, it is valuable to explore the influence difference on S and P among training examples, which may improve the learning pattern towards better SP. Inspired by… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

    Comments: Accepted at NeurIPS 2022

  27. OptEmbed: Learning Optimal Embedding Table for Click-through Rate Prediction

    Authors: Fuyuan Lyu, Xing Tang, Hong Zhu, Huifeng Guo, Yingxue Zhang, Ruiming Tang, Xue Liu

    Abstract: Learning embedding table plays a fundamental role in Click-through rate(CTR) prediction from the view of the model performance and memory usage. The embedding table is a two-dimensional tensor, with its axes indicating the number of feature values and the embedding dimension, respectively. To learn an efficient and effective embedding table, recent works either assign various embedding dimensions… ▽ More

    Submitted 6 September, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted by CIKM 2022 Research Track

  28. arXiv:2207.07840  [pdf, other

    cs.LG cs.AI

    Class-Incremental Lifelong Learning in Multi-Label Classification

    Authors: Kaile Du, Linyan Li, Fan Lyu, Fuyuan Hu, Fenglei Xu

    Abstract: Existing class-incremental lifelong learning studies only the data is with single-label, which limits its adaptation to multi-label data. This paper studies Lifelong Multi-Label (LML) classification, which builds an online class-incremental classifier in a sequential multi-label classification data stream. Training on the data with Partial Labels in LML classification may result in more serious Ca… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.05534

  29. arXiv:2203.10480  [pdf, other

    cs.LG

    Encoder-Decoder Architecture for Supervised Dynamic Graph Learning: A Survey

    Authors: Yuecai Zhu, Fuyuan Lyu, Chengming Hu, Xi Chen, Xue Liu

    Abstract: In recent years, the prevalent online services generate a sheer volume of user activity data. Service providers collect these data in order to perform client behavior analysis, and offer better and more customized services. Majority of these data can be modeled and stored as graph, such as the social graph in Facebook, user-video interaction graph in Youtube. These graphs need to evolve over time… ▽ More

    Submitted 27 March, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: Optimize title for better visibility

  30. arXiv:2203.05534  [pdf, other

    cs.CV cs.AI

    AGCN: Augmented Graph Convolutional Network for Lifelong Multi-label Image Recognition

    Authors: Kaile Du, Fan Lyu, Fuyuan Hu, Linyan Li, Wei Feng, Fenglei Xu, Qiming Fu

    Abstract: The Lifelong Multi-Label (LML) image recognition builds an online class-incremental classifier in a sequential multi-label image recognition data stream. The key challenges of LML image recognition are the construction of label relationships on Partial Labels of training data and the Catastrophic Forgetting on old classes, resulting in poor generalization. To solve the problems, the study proposes… ▽ More

    Submitted 10 March, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: Accpted in ICME 2022

  31. arXiv:2110.11159  [pdf, other

    cs.CV cs.LG cs.MM

    Each Attribute Matters: Contrastive Attention for Sentence-based Image Editing

    Authors: Liuqing Zhao, Fan Lyu, Fuyuan Hu, Kaizhu Huang, Fenglei Xu, Linyan Li

    Abstract: Sentence-based Image Editing (SIE) aims to deploy natural language to edit an image. Offering potentials to reduce expensive manual editing, SIE has attracted much interest recently. However, existing methods can hardly produce accurate editing and even lead to failures in attribute editing when the query sentence is with multiple editable attributes. To cope with this problem, by focusing on enha… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: Accepted by BMVC 2021

  32. arXiv:2108.01265  [pdf, other

    cs.LG cs.IR

    Memorize, Factorize, or be Naïve: Learning Optimal Feature Interaction Methods for CTR Prediction

    Authors: Fuyuan Lyu, Xing Tang, Huifeng Guo, Ruiming Tang, Xiuqiang He, Rui Zhang, Xue Liu

    Abstract: Click-through rate prediction is one of the core tasks in commercial recommender systems. It aims to predict the probability of a user clicking a particular item given user and item features. As feature interactions bring in non-linearity, they are widely adopted to improve the performance of CTR prediction models. Therefore, effectively modelling feature interactions has attracted much attention… ▽ More

    Submitted 24 November, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Published in ICDE 2022

  33. Disentangling Semantic-to-visual Confusion for Zero-shot Learning

    Authors: Zihan Ye, Fuyuan Hu, Fan Lyu, Linyan Li, Kaizhu Huang

    Abstract: Using generative models to synthesize visual features from semantic distribution is one of the most popular solutions to ZSL image classification in recent years. The triplet loss (TL) is popularly used to generate realistic visual distributions from semantics by automatically searching discriminative representations. However, the traditional TL cannot search reliable unseen disentangled represent… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted by IEEE TRANSACTIONS ON MULTIMEDIA (TMM) in 2021

  34. arXiv:2102.06528  [pdf, other

    cs.SI

    A Tale of Two Countries: A Longitudinal Cross-Country Study of Mobile Users' Reactions to the COVID-19 Pandemic Through the Lens of App Popularity

    Authors: Liu Wang, Haoyu Wang, Yi Wang, Gareth Tyson, Fei Lyu

    Abstract: The ongoing COVID-19 pandemic has profoundly impacted people's life around the world, including how they interact with mobile technologies. In this paper, we seek to develop an understanding of how the dynamic trajectory of a pandemic shapes mobile phone users' experiences. Through the lens of app popularity, we approach this goal from a cross-country perspective. We compile a dataset consisting o… ▽ More

    Submitted 30 March, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

  35. arXiv:2012.13662  [pdf, other

    cs.CV

    Coarse to Fine: Multi-label Image Classification with Global/Local Attention

    Authors: Fan Lyu, Fuyuan Hu, Victor S. Sheng, Zhengtian Wu, Qiming Fu, Baochuan Fu

    Abstract: In our daily life, the scenes around us are always with multiple labels especially in a smart city, i.e., recognizing the information of city operation to response and control. Great efforts have been made by using Deep Neural Networks to recognize multi-label images. Since multi-label image classification is very complicated, people seek to use the attention mechanism to guide the classification… ▽ More

    Submitted 25 December, 2020; originally announced December 2020.

    Comments: Accepted by IEEE International Smart Cities Conference 2018

  36. arXiv:2012.07236  [pdf, other

    cs.LG cs.AI cs.CV

    Multi-Domain Multi-Task Rehearsal for Lifelong Learning

    Authors: Fan Lyu, Shuai Wang, Wei Feng, Zihan Ye, Fuyuan Hu, Song Wang

    Abstract: Rehearsal, seeking to remind the model by storing old knowledge in lifelong learning, is one of the most effective ways to mitigate catastrophic forgetting, i.e., biased forgetting of previous knowledge when moving to new tasks. However, the old tasks of the most previous rehearsal-based methods suffer from the unpredictable domain shift when training the new task. This is because these methods al… ▽ More

    Submitted 13 December, 2020; originally announced December 2020.

    Comments: Accepted by AAAI 2021

  37. arXiv:2010.01471  [pdf, ps, other

    cs.LG eess.SP

    Deep Reinforcement Learning for Delay-Oriented IoT Task Scheduling in Space-Air-Ground Integrated Network

    Authors: Conghao Zhou, Wen Wu, Hongli He, Peng Yang, Feng Lyu, Nan Cheng, Xuemin, Shen

    Abstract: In this paper, we investigate a computing task scheduling problem in space-air-ground integrated network (SAGIN) for delay-oriented Internet of Things (IoT) services. In the considered scenario, an unmanned aerial vehicle (UAV) collects computing tasks from IoT devices and then makes online offloading decisions, in which the tasks can be processed at the UAV or offloaded to the nearby base station… ▽ More

    Submitted 3 October, 2020; originally announced October 2020.

    Comments: 14 pages, 8 figures

  38. Efficient Hybrid Beamforming with Anti-Blockage Design for High-Speed Railway Communications

    Authors: Meilin Gao, Bo Ai, Yong Niu, Wen Wu, Peng Yang, Feng Lyu, Xuemin, Shen

    Abstract: Future railway is expected to accommodate both train operation services and passenger broadband services. The millimeter wave (mmWave) communication is a promising technology in providing multi-gigabit data rates to onboard users. However, mmWave communications suffer from severe propagation attenuation and vulnerability to blockage, which can be very challenging in high-speed railway (HSR) scenar… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: 11 Pages, 9 Figures

    Journal ref: IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020

  39. arXiv:2005.09153  [pdf, other

    cs.CV

    Associating Multi-Scale Receptive Fields for Fine-grained Recognition

    Authors: Zihan Ye, Fuyuan Hu, Yin Liu, Fan Lyu, Pengqing Liu

    Abstract: Extracting and fusing part features have become the key of fined-grained image recognition. Recently, Non-local (NL) module has shown excellent improvement in image recognition. However, it lacks the mechanism to model the interactions between multi-scale part features, which is vital for fine-grained recognition. In this paper, we propose a novel cross-layer non-local (CNL) module to associate mu… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: Accepted by ICIP2020

  40. arXiv:2005.09034  [pdf, other

    cs.CV

    Cross-filter compression for CNN inference acceleration

    Authors: Fuyuan Lyu, Shien Zhu, Weichen Liu

    Abstract: Convolution neural network demonstrates great capability for multiple tasks, such as image classification and many others. However, much resource is required to train a network. Hence much effort has been made to accelerate neural network by reducing precision of weights, activation, and gradient. However, these filter-wise quantification methods exist a natural upper limit, caused by the size of… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

  41. arXiv:2003.10663  [pdf, ps, other

    cs.CV

    Modeling Cross-view Interaction Consistency for Paired Egocentric Interaction Recognition

    Authors: Zhongguo Li, Fan Lyu, Wei Feng, Song Wang

    Abstract: With the development of Augmented Reality (AR), egocentric action recognition (EAR) plays important role in accurately understanding demands from the user. However, EAR is designed to help recognize human-machine interaction in single egocentric view, thus difficult to capture interactions between two face-to-face AR users. Paired egocentric interaction recognition (PEIR) is the task to collaborat… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: ICME2020

  42. arXiv:2003.08027  [pdf, other

    cs.CV

    MUTATT: Visual-Textual Mutual Guidance for Referring Expression Comprehension

    Authors: Shuai Wang, Fan Lyu, Wei Feng, Song Wang

    Abstract: Referring expression comprehension (REC) aims to localize a text-related region in a given image by a referring expression in natural language. Existing methods focus on how to build convincing visual and language representations independently, which may significantly isolate visual and language information. In this paper, we argue that for REC the referring expression and the target region are se… ▽ More

    Submitted 20 March, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: 6 pages, Accepted by ICME-2020

  43. Multi-Drone 3D Trajectory Planning and Scheduling in Drone Assisted Radio Access Networks

    Authors: Weisen Shi, Junlng Li, Nan Cheng, Feng Lyu, Shan Zhang, Haibo Zhou, Xuemin Shen

    Abstract: Drone base station (DBS) is a promising technique to extend wireless connections for uncovered users of terrestrial radio access networks (RAN). To improve user fairness and network performance, in this paper, we design 3D trajectories of multiple DBSs in the drone assisted radio access networks (DA-RAN) where DBSs fly over associated areas of interests (AoIs) and relay communications between the… ▽ More

    Submitted 22 October, 2019; v1 submitted 29 May, 2019; originally announced June 2019.

    Comments: Published on IEEE TVT

    Journal ref: IEEE Transactions on Vehicular Technology, vol. 68, no. 8, pp. 8145-8158, Aug. 2019

  44. 3D Multi-Drone-Cell Trajectory Design for Efficient IoT Data Collection

    Authors: Weisen Shi, Junling Li, Nan Cheng, Feng Lyu, Yanpeng Dai, Haibo Zhou, Xuemin, Shen

    Abstract: Drone cell (DC) is an emerging technique to offer flexible and cost-effective wireless connections to collect Internet-of-things (IoT) data in uncovered areas of terrestrial networks. The flying trajectory of DC significantly impacts the data collection performance. However, designing the trajectory is a challenging issue due to the complicated 3D mobility of DC, unique DC-to-ground (D2G) channel… ▽ More

    Submitted 29 May, 2019; originally announced June 2019.

    Comments: Accepted by IEEE ICC'19

    Journal ref: ICC 2019 - 2019 IEEE International Conference on Communications (ICC)

  45. arXiv:1904.06996  [pdf, other

    cs.CV

    SR-GAN: Semantic Rectifying Generative Adversarial Network for Zero-shot Learning

    Authors: Zihan Ye, Fan Lyu, Linyan Li, Qiming Fu, **chang Ren, Fuyuan Hu

    Abstract: The existing Zero-Shot learning (ZSL) methods may suffer from the vague class attributes that are highly overlapped for different classes. Unlike these methods that ignore the discrimination among classes, in this paper, we propose to classify unseen image by rectifying the semantic space guided by the visual space. First, we pre-train a Semantic Rectifying Network (SRN) to rectify semantic space… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: ICME 2019 Oral

  46. arXiv:1805.09583  [pdf, other

    cs.NI eess.SY

    Vehicular Communication Networks in Automated Driving Era

    Authors: Shan Zhang, Jiayin Chen, Feng Lyu, Nan Cheng, Weisen Shi, Xuemin, Shen

    Abstract: Embedded with advanced sensors, cameras and processors, the emerging automated driving vehicles are capable of sensing the environment and conducting automobile operation, paving the way to modern intelligent transportation systems (ITS) with high safety and efficiency. On the other hand, vehicular communication networks (VCNs) connect vehicles, infrastructures, clouds, and all other devices with… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

    Comments: 15 pages, 5 figures, IEEE Communications Magazine