Showing 1–21 of 21 results for author: Yuan, Z

Searching in archive stat.
  1. arXiv:2306.03065  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    LibAUC: A Deep Learning Library for X-Risk Optimization

    Authors: Zhuoning Yuan, Dixian Zhu, Zi-Hao Qiu, Gang Li, Xuanhui Wang, Tianbao Yang

    Abstract: This paper introduces the award-winning deep learning (DL) library called LibAUC for implementing state-of-the-art algorithms towards optimizing a family of risk functions named X-risks. X-risks refer to a family of compositional functions in which the loss function of each data point is defined in a way that contrasts the data point with a large number of others. They have broad applications in A…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by KDD2023

  2. arXiv:2305.11965  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization

    Authors: Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang

    Abstract: In this paper, we aim to optimize a contrastive loss with individualized temperatures in a principled and systematic manner for self-supervised learning. The common practice of using a global temperature parameter $τ$ ignores the fact that "not all semantics are created equal", meaning that different anchor data may have different numbers of samples with similar semantics, especially when data ex…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 33 pages, 11 figures, accepted by ICML2023

  3. arXiv:2209.01805  [pdf, other]

    econ.EM q-fin.RM stat.ME stat.ML

    Robust Causal Learning for the Estimation of Average Treatment Effects

    Authors: Yiyan Huang, Cheuk Hang Leung, Xing Yan, Qi Wu, Shumin Ma, Zhiri Yuan, Dongdong Wang, Zhixiang Huang

    Abstract: Many practical decision-making problems in economics and healthcare seek to estimate the average treatment effect (ATE) from observational data. The Double/Debiased Machine Learning (DML) is one of the prevalent methods to estimate ATE in the observational study. However, the DML estimators can suffer an error-compounding issue and even give an extreme estimate when the propensity scores are missp…

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: This paper was accepted and will be published at The 2022 International Joint Conference on Neural Networks (IJCNN2022). arXiv admin note: substantial text overlap with arXiv:2103.11869

  4. arXiv:2208.11481  [pdf, ps, other]

    math.ST stat.ME

    An Improved Bernstein-type Inequality for C-Mixing-type Processes and Its Application to Kernel Smoothing

    Authors: Zihao Yuan, Martin Spindler

    Abstract: There are many processes, particularly dynamic systems, that cannot be described as strong mixing processes. \citet{maume2006exponential} introduced a new mixing coefficient called C-mixing, which includes a large class of dynamic systems. Based on this, \citet{hang2017bernstein} obtained a Bernstein-type inequality for a geometric C-mixing process, which, modulo a logarithmic factor and some cons…

    Submitted 7 October, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

  5. arXiv:2208.11433  [pdf, other]

    math.ST stat.ME

    Bernstein-type Inequalities and Nonparametric Estimation under Near-Epoch Dependence

    Authors: Zihao Yuan, Martin Spindler

    Abstract: The major contributions of this paper lie in two aspects. Firstly, we focus on deriving Bernstein-type inequalities for both geometric and algebraic irregularly-spaced NED random fields, which contain time series as a special case. Furthermore, by introducing the idea of "effective dimension" to the index set of the random field, our results reflect that the sharpness of the inequalities is only associated…

    Submitted 17 October, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

  6. arXiv:2202.12387  [pdf, other]

    cs.LG cs.CV math.OC stat.ML

    Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance

    Authors: Zhuoning Yuan, Yuexin Wu, Zi-Hao Qiu, Xianzhi Du, Lijun Zhang, Denny Zhou, Tianbao Yang

    Abstract: In this paper, we study contrastive learning from an optimization perspective, aiming to analyze and address a fundamental issue of existing contrastive learning methods that either rely on a large batch size or a large dictionary of feature vectors. We consider a global objective for contrastive learning, which contrasts each positive pair with all negative pairs for an anchor point. From the opt…

    Submitted 20 September, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted by ICML2022

  7. arXiv:2102.04635  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Federated Deep AUC Maximization for Heterogeneous Data with a Constant Communication Complexity

    Authors: Zhuoning Yuan, Zhishuai Guo, Yi Xu, Yiming Ying, Tianbao Yang

    Abstract: Deep AUC (area under the ROC curve) Maximization (DAM) has attracted much attention recently due to its great potential for imbalanced data classification. However, the research on Federated Deep AUC Maximization (FDAM) is still limited. Compared with standard federated learning (FL) approaches that focus on decomposable minimization objectives, FDAM is more complicated due to its minimization obj…

    Submitted 13 September, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted by ICML2021. Code is available at https://github.com/Optimization-AI/ICML2021_FedDeepAUC_CODASCA, which is part of our open-source library LibAUC (www.libauc.org)

    Journal ref: International Conference on Machine Learning (ICML 2021)

  8. arXiv:2012.03173  [pdf, other]

    cs.LG cs.CV math.OC stat.ML

    Large-scale Robust Deep AUC Maximization: A New Surrogate Loss and Empirical Studies on Medical Image Classification

    Authors: Zhuoning Yuan, Yan Yan, Milan Sonka, Tianbao Yang

    Abstract: Deep AUC Maximization (DAM) is a new paradigm for learning a deep neural network by maximizing the AUC score of the model on a dataset. Most previous works of AUC maximization focus on the perspective of optimization by designing efficient stochastic algorithms, and studies on generalization performance of large-scale DAM on difficult tasks are missing. In this work, we aim to make DAM more practi…

    Submitted 7 September, 2021; v1 submitted 5 December, 2020; originally announced December 2020.

    Comments: Accepted by ICCV2021

    Journal ref: International Conference on Computer Vision (ICCV2021)

  9. arXiv:2009.08868  [pdf]

    q-bio.BM cs.LG stat.ML

    Review of Machine-Learning Methods for RNA Secondary Structure Prediction

    Authors: Qi Zhao, Zheng Zhao, Xiaoya Fan, Zhengwei Yuan, Qian Mao, Yudong Yao

    Abstract: Secondary structure plays an important role in determining the function of non-coding RNAs. Hence, identifying RNA secondary structures is of great value to research. Computational prediction is a mainstream approach for predicting RNA secondary structure. Unfortunately, even though new methods have been proposed over the past 40 years, the performance of computational prediction methods has stagn…

    Submitted 31 August, 2020; originally announced September 2020.

    Comments: 25 pages, 5 figures, 1 table

    MSC Class: I.2.0 General

  10. arXiv:2009.07022  [pdf, other]

    cs.LG cs.CL cs.DB cs.IR stat.ML

    The Devil is the Classifier: Investigating Long Tail Relation Classification with Decoupling Analysis

    Authors: Haiyang Yu, Ningyu Zhang, Shumin Deng, Zonggang Yuan, Yantao Jia, Huajun Chen

    Abstract: Long-tailed relation classification is a challenging problem as the head classes may dominate the training phase, thereby leading to the deterioration of the tail performance. Existing solutions usually address this issue via class-balancing strategies, e.g., data re-sampling and loss re-weighting, but all these methods adhere to the schema of entangling learning of the representation and classifi…

    Submitted 15 September, 2020; originally announced September 2020.

  11. arXiv:2006.06889  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Fast Objective & Duality Gap Convergence for Non-Convex Strongly-Concave Min-Max Problems with PL Condition

    Authors: Zhishuai Guo, Yan Yan, Zhuoning Yuan, Tianbao Yang

    Abstract: This paper focuses on stochastic methods for solving smooth non-convex strongly-concave min-max problems, which have received increasing attention due to their potential applications in deep learning (e.g., deep AUC maximization, distributionally robust optimization). However, most of the existing algorithms are slow in practice, and their analysis revolves around the convergence to a nearly stati…

    Submitted 17 April, 2023; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted by Journal of Machine Learning Research

  12. arXiv:2005.08419  [pdf]

    cs.LG eess.IV physics.geo-ph stat.ML

    Hybrid-DNNs: Hybrid Deep Neural Networks for Mixed Inputs

    Authors: Zhenyu Yuan, Yuxin Jiang, Jingjing Li, Handong Huang

    Abstract: Rapid development of big data and high-performance computing have encouraged explosive studies of deep learning in geoscience. However, most studies only take single-type data as input, frittering away invaluable multisource, multi-scale information. We develop a general architecture of hybrid deep neural networks (HDNNs) to support mixed inputs. Regarding as a combination of feature learning and…

    Submitted 17 May, 2020; originally announced May 2020.

  13. arXiv:2005.02426  [pdf, ps, other]

    cs.DC cs.LG math.OC stat.ML

    Communication-Efficient Distributed Stochastic AUC Maximization with Deep Neural Networks

    Authors: Zhishuai Guo, Mingrui Liu, Zhuoning Yuan, Li Shen, Wei Liu, Tianbao Yang

    Abstract: In this paper, we study distributed algorithms for large-scale AUC maximization with a deep neural network as a predictive model. Although distributed learning techniques have been investigated extensively in deep learning, they are not directly applicable to stochastic AUC maximization with deep neural networks due to its striking differences from standard loss minimization problems (e.g., cross-…

    Submitted 8 October, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Journal ref: 37th International Conference on Machine Learning, 2020

  14. arXiv:2003.13373  [pdf, other]

    stat.ME astro-ph.GA astro-ph.IM

    A flexible method for estimating luminosity functions via Kernel Density Estimation

    Authors: Zunli Yuan, Matt J. Jarvis, Jiancheng Wang

    Abstract: We propose a flexible method for estimating luminosity functions (LFs) based on kernel density estimation (KDE), the most popular nonparametric density estimation approach developed in modern statistics, to overcome issues surrounding binning of LFs. One challenge in applying KDE to LFs is how to treat the boundary bias problem, since astronomical surveys usually obtain truncated samples predomina…

    Submitted 30 April, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: 23 pages, accepted for publication in The Astrophysical Journal Supplement Series

    Journal ref: 2020, ApJS, 248, 1

  15. arXiv:1909.11591  [pdf, other]

    cs.LG cs.AI cs.LO eess.SY stat.ML

    Modular Deep Reinforcement Learning with Temporal Logic Specifications

    Authors: Lim Zun Yuan, Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

    Abstract: We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward is highly sparse but encompasses a high-level temporal structure. We represent this temporal structure by a finite-state machine and construct an on-the-fly synchronised product with the MDP and the finite machine. The temp…

    Submitted 22 November, 2019; v1 submitted 23 September, 2019; originally announced September 2019.

    Comments: arXiv admin note: text overlap with arXiv:1902.00778

  16. arXiv:1908.10831  [pdf, other]

    cs.LG math.OC stat.ML

    Stochastic AUC Maximization with Deep Neural Networks

    Authors: Mingrui Liu, Zhuoning Yuan, Yiming Ying, Tianbao Yang

    Abstract: Stochastic AUC maximization has garnered an increasing interest due to better fit to imbalanced data classification. However, existing works are limited to stochastic AUC maximization with a linear predictive model, which restricts its predictive power when dealing with extremely complex data. In this paper, we consider stochastic AUC maximization problem with a deep neural network as the predicti…

    Submitted 29 June, 2020; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: Accepted by ICLR 2020

  17. arXiv:1905.10115  [pdf, ps, other]

    cs.LG stat.ML

    Multi-Kernel Correntropy for Robust Learning

    Authors: Badong Chen, Yuqing Xie, Xin Wang, Zejian Yuan, Pengju Ren, Jing Qin

    Abstract: As a novel similarity measure that is defined as the expectation of a kernel function between two random variables, correntropy has been successfully applied in robust machine learning and signal processing to combat large outliers. The kernel function in correntropy is usually a zero-mean Gaussian kernel. In a recent work, the concept of mixture correntropy (MC) was proposed to improve the learni…

    Submitted 5 September, 2021; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: 12 pages, 5 figures

  18. arXiv:1812.03934  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Stagewise Training Accelerates Convergence of Testing Error Over SGD

    Authors: Zhuoning Yuan, Yan Yan, Rong Jin, Tianbao Yang

    Abstract: Stagewise training strategy is widely used for learning neural networks, which runs a stochastic algorithm (e.g., SGD) starting with a relatively large step size (aka learning rate) and geometrically decreasing the step size after a number of iterations. It has been observed that the stagewise SGD has much faster convergence than the vanilla SGD with a polynomially decaying step size in terms of b…

    Submitted 2 February, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: More experiments on deep learning are added to verify the assumptions

  19. arXiv:1810.11536  [pdf, other]

    cs.LG cs.GR stat.ML

    Automatic Graphics Program Generation using Attention-Based Hierarchical Decoder

    Authors: Zhihao Zhu, Zhan Xue, Zejian Yuan

    Abstract: Recent progress on deep learning has made it possible to automatically transform the screenshot of Graphic User Interface (GUI) into code by using the encoder-decoder framework. While the commonly adopted image encoder (e.g., CNN network), might be capable of extracting image features to the desired level, interpreting these abstract image features into hundreds of tokens of code puts a particular…

    Submitted 26 October, 2018; originally announced October 2018.

    Comments: Asian Conference on Computer Vision

  20. arXiv:1808.06296  [pdf, ps, other]

    math.OC stat.ML

    Universal Stagewise Learning for Non-Convex Problems with Convergence on Averaged Solutions

    Authors: Zaiyi Chen, Zhuoning Yuan, Jinfeng Yi, Bowen Zhou, Enhong Chen, Tianbao Yang

    Abstract: Although stochastic gradient descent (SGD) method and its variants (e.g., stochastic momentum methods, AdaGrad) are the choice of algorithms for solving non-convex problems (especially deep learning), there still remain big gaps between the theory and the practice with many questions unresolved. For example, there is still a lack of theories of convergence for SGD and its variants that use stagewi…

    Submitted 5 March, 2019; v1 submitted 19 August, 2018; originally announced August 2018.

    Comments: added more experimental results

  21. arXiv:1806.00069  [pdf, ps, other]

    cs.AI cs.LG stat.ML

    Explaining Explanations: An Overview of Interpretability of Machine Learning

    Authors: Leilani H. Gilpin, David Bau, Ben Z. Yuan, Ayesha Bajwa, Michael Specter, Lalana Kagal

    Abstract: There has recently been a surge of work in explanatory artificial intelligence (XAI). This research area tackles the important problem that complex machines and algorithms often cannot provide insights into their behavior and thought processes. XAI allows users and parts of the internal system to be more transparent, providing explanations of their decisions in some level of detail. These explanat…

    Submitted 3 February, 2019; v1 submitted 31 May, 2018; originally announced June 2018.

    Comments: The 5th IEEE International Conference on Data Science and Advanced Analytics (DSAA 2018). [Research Track]