Skip to main content

Showing 1–32 of 32 results for author: Qin, Z

Searching in archive stat. Search in all archives.
.
  1. Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I

    Authors: Harrie Oosterhuis, Rolf Jagerman, Zhen Qin, Xuanhui Wang, Michael Bendersky

    Abstract: The traditional evaluation of information retrieval (IR) systems is generally very costly as it requires manual relevance annotation from human experts. Recent advancements in generative artificial intelligence -- specifically large language models (LLMs) -- can generate relevance annotations at an enormous scale with relatively small computational costs. Potentially, this could alleviate the cost… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: KDD '24

  2. arXiv:2405.16219  [pdf, other

    cs.LG stat.ML

    Deep Causal Generative Models with Property Control

    Authors: Qilong Zhao, Shiyu Wang, Guangji Bai, Bo Pan, Zhaohui Qin, Liang Zhao

    Abstract: Generating data with properties of interest by external users while following the right causation among its intrinsic factors is important yet has not been well addressed jointly. This is due to the long-lasting challenge of jointly identifying key latent variables, their causal relations, and their correlation with properties of interest, as well as how to leverage their discoveries toward causal… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 13 pages, 6 figures

  3. arXiv:2401.02592  [pdf, other

    stat.ML cs.LG eess.SP math.OC

    Guaranteed Nonconvex Factorization Approach for Tensor Train Recovery

    Authors: Zhen Qin, Michael B. Wakin, Zhihui Zhu

    Abstract: In this paper, we provide the first convergence guarantee for the factorization approach. Specifically, to avoid the scaling ambiguity and to facilitate theoretical analysis, we optimize over the so-called left-orthogonal TT format which enforces orthonormality among most of the factors. To ensure the orthonormal structure, we utilize the Riemannian gradient descent (RGD) for optimizing those fact… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  4. arXiv:2312.16439  [pdf, other

    stat.ME

    Inferring the Effect of a Confounded Treatment by Calibrating Resistant Population's Variance

    Authors: Zikun Qin, Bikram Karmakar

    Abstract: In a general set-up that allows unmeasured confounding, we show that the conditional average treatment effect on the treated can be identified as one of two possible values. Unlike existing causal inference methods, we do not require an exogenous source of variability in the treatment, e.g., an instrument or another outcome unaffected by the treatment. Instead, we require (a) a nondeterministic tr… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  5. arXiv:2310.15976  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Convergence of Sign-based Random Reshuffling Algorithms for Nonconvex Optimization

    Authors: Zhen Qin, Zhishuai Liu, Pan Xu

    Abstract: signSGD is popular in nonconvex optimization due to its communication efficiency. Yet, existing analyses of signSGD rely on assuming that data are sampled with replacement in each iteration, contradicting the practical implementation where data are randomly reshuffled and sequentially fed into the algorithm. We bridge this gap by proving the first convergence result of signSGD with random reshuffl… ▽ More

    Submitted 27 December, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 44 pages, 4 figures

  6. arXiv:2209.10675  [pdf, other

    math.OC cs.LG eess.IV stat.ML

    A Validation Approach to Over-parameterized Matrix and Image Recovery

    Authors: Lijun Ding, Zhen Qin, Liwei Jiang, **xin Zhou, Zhihui Zhu

    Abstract: In this paper, we study the problem of recovering a low-rank matrix from a number of noisy random linear measurements. We consider the setting where the rank of the ground-truth matrix is unknown a prior and use an overspecified factored representation of the matrix variable, where the global optimal solutions overfit and do not correspond to the underlying ground-truth. We then solve the associat… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 29 pages and 9 figures

  7. arXiv:2106.10471  [pdf, ps, other

    cs.LG stat.ML

    Neural Network Classifier as Mutual Information Evaluator

    Authors: Zhenyue Qin, Dongwoo Kim, Tom Gedeon

    Abstract: Cross-entropy loss with softmax output is a standard choice to train neural network classifiers. We give a new view of neural network classifiers with softmax and cross-entropy as mutual information evaluators. We show that when the dataset is balanced, training a neural network with cross-entropy maximises the mutual information between inputs and labels through a variational form of mutual infor… ▽ More

    Submitted 14 August, 2021; v1 submitted 19 June, 2021; originally announced June 2021.

    Comments: ICML Workshop 2021

  8. arXiv:2010.00163  [pdf, other

    cs.LG stat.ML

    Bayesian Meta-reinforcement Learning for Traffic Signal Control

    Authors: Yayi Zou, Zhiwei Qin

    Abstract: In recent years, there has been increasing amount of interest around meta reinforcement learning methods for traffic signal control, which have achieved better performance compared with traditional control methods. However, previous methods lack robustness in adaptation and stability in training process in complex situations, which largely limits its application in real-world traffic signal contro… ▽ More

    Submitted 22 October, 2021; v1 submitted 30 September, 2020; originally announced October 2020.

  9. arXiv:2008.06767  [pdf, other

    cs.LG stat.ML

    Heterogeneous Federated Learning

    Authors: Fuxun Yu, Weishan Zhang, Zhuwei Qin, Zirui Xu, Di Wang, Chenchen Liu, Zhi Tian, Xiang Chen

    Abstract: Federated learning learns from scattered data by fusing collaborative models from local nodes. However, due to chaotic information distribution, the model fusion may suffer from structural misalignment with regard to unmatched parameters. In this work, we propose a novel federated learning framework to resolve this issue by establishing a firm structure-information alignment across collaborative m… ▽ More

    Submitted 19 March, 2022; v1 submitted 15 August, 2020; originally announced August 2020.

    Comments: Full version [Fed2: Feature-Aligned Federated Learning] accepted in KDD'2021

  10. arXiv:2007.07204  [pdf, other

    cs.IR cs.LG stat.ML

    Sampler Design for Implicit Feedback Data by Noisy-label Robust Learning

    Authors: Wenhui Yu, Zheng Qin

    Abstract: Implicit feedback data is extensively explored in recommendation as it is easy to collect and generally applicable. However, predicting users' preference on implicit feedback data is a challenging task since we can only observe positive (voted) samples and unvoted samples. It is difficult to distinguish between the negative samples and unlabeled positive samples from the unvoted ones. Existing wor… ▽ More

    Submitted 28 June, 2020; originally announced July 2020.

    Comments: SIGIR 2020 paper

  11. arXiv:2007.07085  [pdf, other

    cs.IR cs.LG stat.ML

    Semi-supervised Collaborative Filtering by Text-enhanced Domain Adaptation

    Authors: Wenhui Yu, Xiao Lin, Junfeng Ge, Wenwu Ou, Zheng Qin

    Abstract: Data sparsity is an inherent challenge in the recommender systems, where most of the data is collected from the implicit feedbacks of users. This causes two difficulties in designing effective algorithms: first, the majority of users only have a few interactions with the system and there is no enough data for learning; second, there are no negative samples in the implicit feedbacks and it is a com… ▽ More

    Submitted 28 June, 2020; originally announced July 2020.

    Comments: KDD 2020 paper

  12. arXiv:2006.15516  [pdf, other

    cs.LG cs.IR stat.ML

    Graph Convolutional Network for Recommendation with Low-pass Collaborative Filters

    Authors: Wenhui Yu, Zheng Qin

    Abstract: \textbf{G}raph \textbf{C}onvolutional \textbf{N}etwork (\textbf{GCN}) is widely used in graph data learning tasks such as recommendation. However, when facing a large graph, the graph convolution is very computationally expensive thus is simplified in all existing GCNs, yet is seriously impaired due to the oversimplification. To address this gap, we leverage the \textit{original graph convolution}… ▽ More

    Submitted 18 January, 2021; v1 submitted 28 June, 2020; originally announced June 2020.

    Comments: ICML 2020 paper

  13. Interpretable Deep Graph Generation with Node-Edge Co-Disentanglement

    Authors: Xiaojie Guo, Liang Zhao, Zhao Qin, Lingfei Wu, Amarda Shehu, Yanfang Ye

    Abstract: Disentangled representation learning has recently attracted a significant amount of attention, particularly in the field of image representation learning. However, learning the disentangled representations behind a graph remains largely unexplored, especially for the attributed graph with both node and edge features. Disentanglement learning for graph generation has substantial new challenges incl… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: This paper has been accepted by KDD 2020

  14. arXiv:2006.03860  [pdf, other

    stat.ML cs.LG

    Do RNN and LSTM have Long Memory?

    Authors: **gyu Zhao, Feiqing Huang, Jia Lv, Yanjie Duan, Zhen Qin, Guodong Li, Guangjian Tian

    Abstract: The LSTM network was proposed to overcome the difficulty in learning long-term dependence, and has made significant advancements in applications. With its success and drawbacks in mind, this paper raises the question - do RNN and LSTM have long memory? We answer it partially by proving that RNN and LSTM do not have long memory from a statistical perspective. A new definition for long memory networ… ▽ More

    Submitted 10 June, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

    Comments: Accepted by ICML 2020. Added references, experiments and acknowledgements

  15. Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation

    Authors: Mengyue Yang, Qingyang Li, Zhiwei Qin, Jie** Ye

    Abstract: Contextual multi-armed bandit (MAB) achieves cutting-edge performance on a variety of problems. When it comes to real-world scenarios such as recommendation system and online advertising, however, it is essential to consider the resource consumption of exploration. In practice, there is typically non-zero cost associated with executing a recommendation (arm) in the environment, and hence, the poli… ▽ More

    Submitted 6 April, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: Accepted for publication at WWW (The Web Conference) 2020

  16. arXiv:1911.11260  [pdf, other

    cs.LG cs.AI stat.ML

    Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem

    Authors: John Holler, Risto Vuorio, Zhiwei Qin, Xiaocheng Tang, Yan Jiao, Tiancheng **, Satinder Singh, Chenxi Wang, Jie** Ye

    Abstract: Order dispatching and driver repositioning (also known as fleet management) in the face of spatially and temporally varying supply and demand are central to a ride-sharing platform marketplace. Hand-crafting heuristic solutions that account for the dynamics in these resource allocation problems is difficult, and may be better handled by an end-to-end machine learning method. Previous works have ex… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: ICDM 2019 Short Paper

  17. arXiv:1911.10688  [pdf, other

    cs.LG cs.CV stat.ML

    Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator

    Authors: Zhenyue Qin, Dongwoo Kim, Tom Gedeon

    Abstract: Mutual information is widely applied to learn latent representations of observations, whilst its implication in classification neural networks remain to be better explained. We show that optimising the parameters of classification neural networks with softmax cross-entropy is equivalent to maximising the mutual information between inputs and labels under the balanced data assumption. Through exper… ▽ More

    Submitted 17 September, 2020; v1 submitted 24 November, 2019; originally announced November 2019.

  18. arXiv:1910.02629   

    cs.LG cs.CV stat.ML

    Softmax Is Not an Artificial Trick: An Information-Theoretic View of Softmax in Neural Networks

    Authors: Zhenyue Qin, Dongwoo Kim

    Abstract: Despite great popularity of applying softmax to map the non-normalised outputs of a neural network to a probability distribution over predicting classes, this normalised exponential transformation still seems to be artificial. A theoretic framework that incorporates softmax as an intrinsic component is still lacking. In this paper, we view neural networks embedding softmax from an information-theo… ▽ More

    Submitted 15 October, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Withdrawn due to Zhenyue Qin uploading the manuscript without consent of the other authors

  19. arXiv:1908.10506  [pdf, other

    cs.LG stat.ME stat.ML

    Similarity Kernel and Clustering via Random Projection Forests

    Authors: Donghui Yan, Songxiang Gu, Ying Xu, Zhiwei Qin

    Abstract: Similarity plays a fundamental role in many areas, including data mining, machine learning, statistics and various applied domains. Inspired by the success of ensemble methods and the flexibility of trees, we propose to learn a similarity kernel called rpf-kernel through random projection forests (rpForests). Our theoretical analysis reveals a highly desirable property of rpf-kernel: far-away (dis… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: 22 pages, 5 figures

  20. arXiv:1907.06584  [pdf, other

    cs.LG cs.AI stat.ML

    Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation

    Authors: Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei Qin, Yi** Meng, Jie** Ye

    Abstract: Reinforcement learning aims at searching the best policy model for decision making, and has been shown powerful for sequential recommendations. The training of the policy by reinforcement learning, however, is placed in an environment. In many real-world applications, however, the policy training in the real environment can cause an unbearable cost, due to the exploration in the environment. Envir… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

    Comments: Appears in KDD 2019

  21. arXiv:1907.00700  [pdf, other

    cs.LG stat.ML

    An Improvement of PAA on Trend-Based Approximation for Time Series

    Authors: Chunkai Zhang, Yingyang Chen, Ao Yin, Zhen Qin, Xing Zhang, Keli Zhang, Zoe L. Jiang

    Abstract: Piecewise Aggregate Approximation (PAA) is a competitive basic dimension reduction method for high-dimensional time series mining. When deployed, however, the limitations are obvious that some important information will be missed, especially the trend. In this paper, we propose two new approaches for time series that utilize approximate trend feature information. Our first method is based on relat… ▽ More

    Submitted 28 June, 2019; originally announced July 2019.

  22. arXiv:1905.04270  [pdf, other

    cs.LG cs.CV stat.ML

    Interpreting and Evaluating Neural Network Robustness

    Authors: Fuxun Yu, Zhuwei Qin, Chenchen Liu, Liang Zhao, Yanzhi Wang, Xiang Chen

    Abstract: Recently, adversarial deception becomes one of the most considerable threats to deep neural networks. However, compared to extensive research in new designs of various adversarial attacks and defenses, the neural networks' intrinsic robustness property is still lack of thorough investigation. This work aims to qualitatively interpret the adversarial attack and defense mechanism through loss visual… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

    Comments: Accepted in IJCAI'19

  23. arXiv:1901.00456  [pdf, ps, other

    stat.ME cs.LG stat.ML

    Cost-sensitive Selection of Variables by Ensemble of Model Sequences

    Authors: Donghui Yan, Zhiwei Qin, Songxiang Gu, Hai** Xu, Ming Shao

    Abstract: Many applications require the collection of data on different variables or measurements over many system performance metrics. We term those broadly as measures or variables. Often data collection along each measure incurs a cost, thus it is desirable to consider the cost of measures in modeling. This is a fairly new class of problems in the area of cost-sensitive learning. A few attempts have been… ▽ More

    Submitted 28 November, 2021; v1 submitted 2 January, 2019; originally announced January 2019.

    Comments: 27 pages, 13 figures

  24. arXiv:1811.04345  [pdf, other

    cs.LG cs.AI stat.ML

    Optimizing Taxi Carpool Policies via Reinforcement Learning and Spatio-Temporal Mining

    Authors: Ishan **dal, Zhiwei Qin, Xuewen Chen, Matthew Nokleby, Jie** Ye

    Abstract: In this paper, we develop a reinforcement learning (RL) based system to learn an effective policy for carpooling that maximizes transportation efficiency so that fewer cars are required to fulfill the given amount of trip demand. For this purpose, first, we develop a deep neural network model, called ST-NN (Spatio-Temporal Neural Network), to predict taxi trip time from the raw GPS trip data. Seco… ▽ More

    Submitted 10 November, 2018; originally announced November 2018.

    Comments: Accepted at IEEE International Conference on Big Data 2018. arXiv admin note: text overlap with arXiv:1710.04350

  25. arXiv:1810.07322  [pdf, other

    cs.LG cs.CV stat.ML

    Functionality-Oriented Convolutional Filter Pruning

    Authors: Zhuwei Qin, Fuxun Yu, Chenchen Liu, Xiang Chen

    Abstract: The sophisticated structure of Convolutional Neural Network (CNN) allows for outstanding performance, but at the cost of intensive computation. As significant redundancies inevitably present in such a structure, many works have been proposed to prune the convolutional filters for computation cost reduction. Although extremely effective, most works are based only on quantitative characteristics of… ▽ More

    Submitted 11 September, 2019; v1 submitted 12 October, 2018; originally announced October 2018.

  26. arXiv:1809.05822  [pdf, other

    cs.IR cs.LG stat.ML

    Aesthetic-based Clothing Recommendation

    Authors: Wenhui Yu, Huidi Zhang, Xiangnan He, Xu Chen, Li Xiong, Zheng Qin

    Abstract: Recently, product images have gained increasing attention in clothing recommendation since the visual appearance of clothing products has a significant impact on consumers' decision. Most existing methods rely on conventional features to represent an image, such as the visual features extracted by convolutional neural networks (CNN features) and the scale-invariant feature transform algorithm (SIF… ▽ More

    Submitted 16 September, 2018; originally announced September 2018.

    Comments: WWW 2018

  27. One-sample aggregate data meta-analysis of medians

    Authors: Sean McGrath, XiaoFei Zhao, Zhi Zhen Qin, Russell Steele, Andrea Benedetti

    Abstract: An aggregate data meta-analysis is a statistical method that pools the summary statistics of several selected studies to estimate the outcome of interest. When considering a continuous outcome, typically each study must report the same measure of the outcome variable and its spread (e.g., the sample mean and its standard error). However, some studies may instead report the median along with variou… ▽ More

    Submitted 15 December, 2017; v1 submitted 9 September, 2017; originally announced September 2017.

    Journal ref: Stat. Med. 38 (2019) 969-984

  28. arXiv:1605.04034  [pdf, other

    cs.LG stat.ML

    Transfer Hashing with Privileged Information

    Authors: Joey Tianyi Zhou, Xinxing Xu, Sinno Jialin Pan, Ivor W. Tsang, Zheng Qin, Rick Siow Mong Goh

    Abstract: Most existing learning to hash methods assume that there are sufficient data, either labeled or unlabeled, on the domain of interest (i.e., the target domain) for training. However, this assumption cannot be satisfied in some real-world applications. To address this data sparsity issue in hashing, inspired by transfer learning, we propose a new framework named Transfer Hashing with Privileged Info… ▽ More

    Submitted 12 May, 2016; originally announced May 2016.

    Comments: Accepted by IJCAI-2016

  29. arXiv:1411.4286  [pdf, other

    stat.ML cs.LG

    HIPAD - A Hybrid Interior-Point Alternating Direction algorithm for knowledge-based SVM and feature selection

    Authors: Zhiwei Qin, Xiaocheng Tang, Ioannis Akrotirianakis, Amit Chakraborty

    Abstract: We consider classification tasks in the regime of scarce labeled training data in high dimensional feature space, where specific expert knowledge is also available. We propose a new hybrid optimization algorithm that solves the elastic-net support vector machine (SVM) through an alternating direction method of multipliers in the first phase, followed by an interior-point method for the classical S… ▽ More

    Submitted 16 November, 2014; originally announced November 2014.

    Comments: Proceedings of 8th Learning and Intelligent OptimizatioN (LION8) Conference, 2014

  30. arXiv:1402.3740  [pdf, ps, other

    math.OC stat.ME

    Joint Variable Selection for Data Envelopement Analysis via Group Sparsity

    Authors: Zhiwei Qin, Irene Song

    Abstract: This study develops a data-driven group variable selection method for data envelopment analysis (DEA), a non-parametric linear programming approach to the estimation of production frontiers. The proposed method extends the group Lasso (least absolute shrinkage and selection operator) designed for variable selection on (often predefined) groups of variables in linear regression models to DEA models… ▽ More

    Submitted 15 February, 2014; originally announced February 2014.

  31. Robust Low-rank Tensor Recovery: Models and Algorithms

    Authors: Donald Goldfarb, Zhiwei Qin

    Abstract: Robust tensor recovery plays an instrumental role in robustifying tensor decompositions for multilinear data analysis against outliers, gross corruptions and missing values and has a diverse array of applications. In this paper, we study the problem of robust low-rank tensor recovery in a convex optimization framework, drawing upon recent advances in robust Principal Component Analysis and tensor… ▽ More

    Submitted 24 November, 2013; originally announced November 2013.

    Comments: appearing in SIAM Journal on Matrix Analysis and Applications

  32. arXiv:1105.0728  [pdf, other

    math.OC cs.AI stat.ML

    Structured Sparsity via Alternating Direction Methods

    Authors: Zhiwei Qin, Donald Goldfarb

    Abstract: We consider a class of sparse learning problems in high dimensional feature space regularized by a structured sparsity-inducing norm which incorporates prior knowledge of the group structure of the features. Such problems often pose a considerable challenge to optimization algorithms due to the non-smoothness and non-separability of the regularization term. In this paper, we focus on two commonly… ▽ More

    Submitted 14 December, 2011; v1 submitted 3 May, 2011; originally announced May 2011.

    Journal ref: Journal of Machine Learning Research 13 (2012) 1435-1468