Skip to main content

Showing 1–8 of 8 results for author: Mou, N

.
  1. arXiv:2405.04844  [pdf, ps, other

    cs.IR

    Full Stage Learning to Rank: A Unified Framework for Multi-Stage Systems

    Authors: Kai Zheng, Haijun Zhao, Rui Huang, Beichuan Zhang, Na Mou, Yanan Niu, Yang Song, Hongning Wang, Kun Gai

    Abstract: The Probability Ranking Principle (PRP) has been considered as the foundational standard in the design of information retrieval (IR) systems. The principle requires an IR module's returned list of results to be ranked with respect to the underlying user interests, so as to maximize the results' utility. Nevertheless, we point out that it is inappropriate to indiscriminately apply PRP through eve… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted by WWW 2024

  2. arXiv:2404.06078  [pdf, other

    cs.IR

    End-to-end training of Multimodal Model and ranking Model

    Authors: Xiuqi Deng, Lu Xu, Xiyao Li, **kai Yu, Erpeng Xue, Zhongyuan Wang, Di Zhang, Zhaojie Liu, Guorui Zhou, Yang Song, Na Mou, Shen Jiang, Han Li

    Abstract: Traditional recommender systems heavily rely on ID features, which often encounter challenges related to cold-start and generalization. Modeling pre-extracted content features can mitigate these issues, but is still a suboptimal solution due to the discrepancies between training tasks and model parameters. End-to-end training presents a promising solution for these problems, yet most of the existi… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 9 pages, 8 figures

  3. arXiv:2403.10073  [pdf, other

    cs.CV

    Revisiting Adversarial Training under Long-Tailed Distributions

    Authors: Xinli Yue, Ning** Mou, Qian Wang, Lingchen Zhao

    Abstract: Deep neural networks are vulnerable to adversarial attacks, often leading to erroneous outputs. Adversarial training has been recognized as one of the most effective methods to counter such attacks. However, existing adversarial training techniques have predominantly been tested on balanced datasets, whereas real-world data often exhibit a long-tailed distribution, casting doubt on the efficacy of… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  4. arXiv:2308.09395  [pdf, other

    cs.IR

    SHARK: A Lightweight Model Compression Approach for Large-scale Recommender Systems

    Authors: Beichuan Zhang, Chenggen Sun, Jianchao Tan, Xinjun Cai, Jun Zhao, Mengqi Miao, Kang Yin, Chengru Song, Na Mou, Yang Song

    Abstract: Increasing the size of embedding layers has shown to be effective in improving the performance of recommendation models, yet gradually causing their sizes to exceed terabytes in industrial recommender systems, and hence the increase of computing and storage costs. To save resources while maintaining model performances, we propose SHARK, the model compression practice we have summarized in the reco… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: accepted by cikm 2023

  5. arXiv:2306.04095  [pdf, other

    cs.IR

    PANE-GNN: Unifying Positive and Negative Edges in Graph Neural Networks for Recommendation

    Authors: Ziyang Liu, Chaokun Wang, **gcao Xu, Cheng Wu, Kai Zheng, Yang Song, Na Mou, Kun Gai

    Abstract: Recommender systems play a crucial role in addressing the issue of information overload by delivering personalized recommendations to users. In recent years, there has been a growing interest in leveraging graph neural networks (GNNs) for recommender systems, capitalizing on advancements in graph representation learning. These GNN-based models primarily focus on analyzing users' positive feedback… ▽ More

    Submitted 7 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

  6. arXiv:2212.11211  [pdf, other

    cs.CV

    Land Cover and Land Use Detection using Semi-Supervised Learning

    Authors: Fahmida Tasnim Lisa, Md. Zarif Hossain, Sharmin Naj Mou, Shahriar Ivan, Md. Hasanul Kabir

    Abstract: Semi-supervised learning (SSL) has made significant strides in the field of remote sensing. Finding a large number of labeled datasets for SSL methods is uncommon, and manually labeling datasets is expensive and time-consuming. Furthermore, accurately identifying remote sensing satellite images is more complicated than it is for conventional images. Class-imbalanced datasets are another prevalent… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

  7. arXiv:2011.05625  [pdf, other

    cs.IR stat.ML

    CAN: Feature Co-Action for Click-Through Rate Prediction

    Authors: Weijie Bian, Kailun Wu, Lejian Ren, Qi Pi, Yu**g Zhang, Can Xiao, Xiang-Rong Sheng, Yong-Nan Zhu, Zhangming Chan, Na Mou, Xinchen Luo, Shiming Xiang, Guorui Zhou, Xiaoqiang Zhu, Hongbo Deng

    Abstract: Feature interaction has been recognized as an important problem in machine learning, which is also very essential for click-through rate (CTR) prediction tasks. In recent years, Deep Neural Networks (DNNs) can automatically learn implicit nonlinear interactions from original sparse features, and therefore have been widely used in industrial CTR prediction tasks. However, the implicit feature inter… ▽ More

    Submitted 7 December, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: WSDM 2022

    MSC Class: Machine Learning (stat.ML); Information Retrieval (cs.IR); Machine Learning (cs.LG) ACM Class: I.2.6

  8. arXiv:1809.03672  [pdf, other

    stat.ML cs.IR cs.LG

    Deep Interest Evolution Network for Click-Through Rate Prediction

    Authors: Guorui Zhou, Na Mou, Ying Fan, Qi Pi, Weijie Bian, Chang Zhou, Xiaoqiang Zhu, Kun Gai

    Abstract: Click-through rate~(CTR) prediction, whose goal is to estimate the probability of the user clicks, has become one of the core tasks in advertising systems. For CTR prediction model, it is necessary to capture the latent user interest behind the user behavior data. Besides, considering the changing of the external environment and the internal cognition, user interest evolves over time dynamically.… ▽ More

    Submitted 16 November, 2018; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: 9 pages. Accepted by AAAI 2019

    ACM Class: I.2.6