Skip to main content

Showing 1–7 of 7 results for author: Qi, Q

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.18577  [pdf, other

    math.OC cs.LG stat.ML

    Single-loop Stochastic Algorithms for Difference of Max-Structured Weakly Convex Functions

    Authors: Quanqi Hu, Qi Qi, Zhaosong Lu, Tianbao Yang

    Abstract: In this paper, we study a class of non-smooth non-convex problems in the form of $\min_{x}[\max_{y\in Y}φ(x, y) - \max_{z\in Z}ψ(x, z)]$, where both $Φ(x) = \max_{y\in Y}φ(x, y)$ and $Ψ(x)=\max_{z\in Z}ψ(x, z)$ are weakly convex functions, and $φ(x, y), ψ(x, z)$ are strongly concave functions in terms of $y$ and $z$, respectively. It covers two families of problems that have been studied but are m… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2404.17380  [pdf, other

    stat.ME stat.AP

    Correspondence analysis: handling cell-wise outliers via the reconstitution algorithm

    Authors: Qianqian Qi, David J. Hessen, Aike N. Vonk, Peter G. M. van der Heijden

    Abstract: Correspondence analysis (CA) is a popular technique to visualize the relationship between two categorical variables. CA uses the data from a two-way contingency table and is affected by the presence of outliers. The supplementary points method is a popular method to handle outliers. Its disadvantage is that the information from entire rows or columns is removed. However, outliers can be caused by… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  3. arXiv:2012.06951  [pdf, other

    cs.LG cs.CV math.OC stat.ML

    Attentional-Biased Stochastic Gradient Descent

    Authors: Qi Qi, Yi Xu, Rong **, Wotao Yin, Tianbao Yang

    Abstract: In this paper, we present a simple yet effective provable method (named ABSGD) for addressing the data imbalance or label noise problem in deep learning. Our method is a simple modification to momentum SGD where we assign an individual importance weight to each sample in the mini-batch. The individual-level weight of sampled data is systematically proportional to the exponential of a scaled loss v… ▽ More

    Submitted 8 June, 2023; v1 submitted 12 December, 2020; originally announced December 2020.

    Comments: 29 pages

    Journal ref: Transanctions on Machine Learning Research, 2023

  4. arXiv:2009.06548  [pdf, other

    cs.LG stat.ML

    Variance-Reduced Off-Policy Memory-Efficient Policy Search

    Authors: Daoming Lyu, Qi Qi, Mohammad Ghavamzadeh, Hengshuai Yao, Tianbao Yang, Bo Liu

    Abstract: Off-policy policy optimization is a challenging problem in reinforcement learning (RL). The algorithms designed for this problem often suffer from high variance in their estimators, which results in poor sample efficiency, and have issues with convergence. A few variance-reduced on-policy policy gradient algorithms have been recently proposed that use methods from stochastic optimization to reduce… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

  5. arXiv:2006.10138  [pdf, other

    cs.LG cs.CV stat.ML

    An Online Method for A Class of Distributionally Robust Optimization with Non-Convex Objectives

    Authors: Qi Qi, Zhishuai Guo, Yi Xu, Rong **, Tianbao Yang

    Abstract: In this paper, we propose a practical online method for solving a class of distributionally robust optimization (DRO) with non-convex objectives, which has important applications in machine learning for improving the robustness of neural networks. In the literature, most methods for solving DRO are based on stochastic primal-dual methods. However, primal-dual methods for DRO suffer from several dr… ▽ More

    Submitted 12 November, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: 25 pages, 9 figures

  6. arXiv:1912.11194  [pdf, other

    cs.LG cs.CV stat.ML

    A Simple and Effective Framework for Pairwise Deep Metric Learning

    Authors: Qi Qi, Yan Yan, Xiaoyu Wang, Tianbao Yang

    Abstract: Deep metric learning (DML) has received much attention in deep learning due to its wide applications in computer vision. Previous studies have focused on designing complicated losses and hard example mining methods, which are mostly heuristic and lack of theoretical understanding. In this paper, we cast DML as a simple pairwise binary classification problem that classifies a pair of examples as si… ▽ More

    Submitted 18 June, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

    Comments: 16 pages, 5 figures

  7. arXiv:1811.11829  [pdf, other

    math.OC stat.ML

    Stochastic Optimization for DC Functions and Non-smooth Non-convex Regularizers with Non-asymptotic Convergence

    Authors: Yi Xu, Qi Qi, Qihang Lin, Rong **, Tianbao Yang

    Abstract: Difference of convex (DC) functions cover a broad family of non-convex and possibly non-smooth and non-differentiable functions, and have wide applications in machine learning and statistics. Although deterministic algorithms for DC functions have been extensively studied, stochastic optimization that is more suitable for learning with big data remains under-explored. In this paper, we propose new… ▽ More

    Submitted 4 February, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: In the revised version, we present some improved complexity results for non-smooth and non-convex regularizers and for functions with known Hölder continuity parameter $ν\in(0,1]$ by a simple change of an algorithmic parameter