Skip to main content

Showing 1–14 of 14 results for author: Mao, Q

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.15403  [pdf, other

    cs.LG stat.ML

    Fine-Grained Dynamic Framework for Bias-Variance Joint Optimization on Data Missing Not at Random

    Authors: Mingming Ha, Xuewen Tao, Wenfang Lin, Qionxu Ma, Wujiang Xu, Linxun Chen

    Abstract: In most practical applications such as recommendation systems, display advertising, and so forth, the collected data often contains missing values and those missing values are generally missing-not-at-random, which deteriorates the prediction performance of models. Some existing estimators and regularizers attempt to achieve unbiased estimation to improve the predictive performance. However, varia… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2402.14145  [pdf, other

    stat.ML cs.LG stat.ME

    Multiply Robust Estimation for Local Distribution Shifts with Multiple Domains

    Authors: Steven Wilkins-Reeves, Xu Chen, Qi Ma, Christine Agarwal, Aude Hofleitner

    Abstract: Distribution shifts are ubiquitous in real-world machine learning applications, posing a challenge to the generalization of models trained on one data distribution to another. We focus on scenarios where data distributions vary across multiple segments of the entire population and only make local assumptions about the differences between training and test (deployment) distributions within each seg… ▽ More

    Submitted 3 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures

  3. arXiv:2309.12658  [pdf, other

    cs.LG stat.ML

    Neural Operator Variational Inference based on Regularized Stein Discrepancy for Deep Gaussian Processes

    Authors: Jian Xu, Shian Du, Junmei Yang, Qianli Ma, Delu Zeng

    Abstract: Deep Gaussian Process (DGP) models offer a powerful nonparametric approach for Bayesian inference, but exact inference is typically intractable, motivating the use of various approximations. However, existing approaches, such as mean-field Gaussian assumptions, limit the expressiveness and efficacy of DGP models, while stochastic approximation can be computationally expensive. To tackle these chal… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  4. arXiv:2201.07945  [pdf, other

    stat.AP

    A Guideline for the Statistical Analysis of Compositional Data in Immunology

    Authors: **kyung Yoo, Zequn Sun, Michael Greenacre, Qin Ma, Dongjun Chung, Young Min Kim

    Abstract: The study of immune cellular composition has been of great scientific interest in immunology because of the generation of multiple large-scale data. From the statistical point of view, such immune cellular data should be treated as compositional. In compositional data, each element is positive, and all the elements sum to a constant, which can be set to one in general. Standard statistical methods… ▽ More

    Submitted 21 April, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

  5. arXiv:2009.08868  [pdf

    q-bio.BM cs.LG stat.ML

    Review of Machine-Learning Methods for RNA Secondary Structure Prediction

    Authors: Qi Zhao, Zheng Zhao, Xiaoya Fan, Zhengwei Yuan, Qian Mao, Yudong Yao

    Abstract: Secondary structure plays an important role in determining the function of non-coding RNAs. Hence, identifying RNA secondary structures is of great value to research. Computational prediction is a mainstream approach for predicting RNA secondary structure. Unfortunately, even though new methods have been proposed over the past 40 years, the performance of computational prediction methods has stagn… ▽ More

    Submitted 31 August, 2020; originally announced September 2020.

    Comments: 25 pages, 5 figures, 1 table

    MSC Class: I.2.0 General

  6. arXiv:2006.11383  [pdf, other

    stat.ML cs.LG

    A Non-Iterative Quantile Change Detection Method in Mixture Model with Heavy-Tailed Components

    Authors: Yuantong Li, Qi Ma, Sujit K. Ghosh

    Abstract: Estimating parameters of mixture model has wide applications ranging from classification problems to estimating of complex distributions. Most of the current literature on estimating the parameters of the mixture densities are based on iterative Expectation Maximization (EM) type algorithms which require the use of either taking expectations over the latent label variables or generating samples fr… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

  7. arXiv:2006.03750  [pdf, other

    cs.LG stat.ML

    Learning to Solve Combinatorial Optimization Problems on Real-World Graphs in Linear Time

    Authors: Iddo Drori, Anant Kharkar, William R. Sickinger, Brandon Kates, Qiang Ma, Suwen Ge, Eden Dolev, Brenda Dietrich, David P. Williamson, Madeleine Udell

    Abstract: Combinatorial optimization algorithms for graph problems are usually designed afresh for each new problem with careful attention by an expert to the problem structure. In this work, we develop a new framework to solve any combinatorial optimization problem over graphs that can be formulated as a single player game defined by states, actions, and rewards, including minimum spanning tree, shortest p… ▽ More

    Submitted 11 June, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

  8. arXiv:1911.04936  [pdf, other

    cs.LG stat.ML

    Combinatorial Optimization by Graph Pointer Networks and Hierarchical Reinforcement Learning

    Authors: Qiang Ma, Suwen Ge, Danyang He, Darshan Thaker, Iddo Drori

    Abstract: In this work, we introduce Graph Pointer Networks (GPNs) trained using reinforcement learning (RL) for tackling the traveling salesman problem (TSP). GPNs build upon Pointer Networks by introducing a graph embedding layer on the input, which captures relationships between nodes. Furthermore, to approximate solutions to constrained combinatorial optimization problems such as the TSP with time windo… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.

  9. arXiv:1910.10202  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Complex Transformer: A Framework for Modeling Complex-Valued Sequence

    Authors: Muqiao Yang, Martin Q. Ma, Dongyu Li, Yao-Hung Hubert Tsai, Ruslan Salakhutdinov

    Abstract: While deep learning has received a surge of interest in a variety of fields in recent years, major deep learning models barely use complex numbers. However, speech, signal and audio data are naturally complex-valued after Fourier Transform, and studies have shown a potentially richer representation of complex nets. In this paper, we propose a Complex Transformer, which incorporates the transformer… ▽ More

    Submitted 6 August, 2021; v1 submitted 22 October, 2019; originally announced October 2019.

  10. arXiv:1906.01004  [pdf, other

    cs.LG cs.CV stat.ML

    Frontal Low-rank Random Tensors for Fine-grained Action Segmentation

    Authors: Yan Zhang, Krikamol Muandet, Qianli Ma, Heiko Neumann, Siyu Tang

    Abstract: Fine-grained action segmentation in long untrimmed videos is an important task for many applications such as surveillance, robotics, and human-computer interaction. To understand subtle and precise actions within a long time period, second-order information (e.g. feature covariance) or higher is reported to be effective in the literature. However, extracting such high-order information is consider… ▽ More

    Submitted 6 April, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 19 pages (4 pages appendix), 3 figures. Revised theories and models, new experiments

  11. arXiv:1905.01620  [pdf

    cs.LG stat.ML

    Maximal Margin Distribution Support Vector Regression with coupled Constraints-based Convex Optimization

    Authors: Gaoyang Li, **yu Yang, Chunguo Wu, Qin Ma

    Abstract: Support vector regression (SVR) is one of the most popular machine learning algorithms aiming to generate the optimal regression curve through maximizing the minimal margin of selected training samples, i.e., support vectors. Recent researchers reveal that maximizing the margin distribution of whole training dataset rather than the minimal margin of a few support vectors, is prone to achieve bette… ▽ More

    Submitted 5 May, 2019; originally announced May 2019.

  12. arXiv:1512.02752  [pdf, other

    cs.AI cs.LG stat.ML

    A Novel Regularized Principal Graph Learning Framework on Explicit Graph Representation

    Authors: Qi Mao, Li Wang, Ivor W. Tsang, Yijun Sun

    Abstract: Many scientific datasets are of high dimension, and the analysis usually requires visual manipulation by retaining the most important structures of data. Principal curve is a widely used approach for this purpose. However, many existing methods work only for data with structures that are not self-intersected, which is quite restrictive for real applications. A few methods can overcome the above pr… ▽ More

    Submitted 17 January, 2016; v1 submitted 8 December, 2015; originally announced December 2015.

  13. arXiv:1206.6475  [pdf

    cs.LG stat.ML

    A Split-Merge Framework for Comparing Clusterings

    Authors: Qiaoliang Xiang, Qi Mao, Kian Ming Chai, Hai Leong Chieu, Ivor Tsang, Zhendong Zhao

    Abstract: Clustering evaluation measures are frequently used to evaluate the performance of algorithms. However, most measures are not properly normalized and ignore some information in the inherent structure of clusterings. We model the relation between two clusterings as a bipartite graph and propose a general component-based decomposition formula based on the components of the graph. Most existing measur… ▽ More

    Submitted 4 September, 2012; v1 submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  14. arXiv:1203.3495  [pdf

    cs.LG stat.ML

    Parameter-Free Spectral Kernel Learning

    Authors: Qi Mao, Ivor W. Tsang

    Abstract: Due to the growing ubiquity of unlabeled data, learning with unlabeled data is attracting increasing attention in machine learning. In this paper, we propose a novel semi-supervised kernel learning method which can seamlessly combine manifold structure of unlabeled data and Regularized Least-Squares (RLS) to learn a new kernel. Interestingly, the new kernel matrix can be obtained analytically with… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-350-357