Skip to main content

Showing 1–18 of 18 results for author: Feng, Q

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.02689  [pdf, ps, other

    cs.LG cs.DC math.OC stat.ML

    Accelerating Distributed Optimization: A Primal-Dual Perspective on Local Steps

    Authors: Junchi Yang, Murat Yildirim, Qiu Feng

    Abstract: In distributed machine learning, efficient training across multiple agents with different data distributions poses significant challenges. Even with a centralized coordinator, current algorithms that achieve optimal communication complexity typically require either large minibatches or compromise on gradient complexity. In this work, we tackle both centralized and decentralized settings across str… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.06802  [pdf, other

    stat.ML cs.LG

    Satisficing Exploration in Bandit Optimization

    Authors: Qing Feng, Tianyi Ma, Ruihao Zhu

    Abstract: Motivated by the concept of satisficing in decision-making, we consider the problem of satisficing exploration in bandit optimization. In this setting, the learner aims at selecting satisficing arms (arms with mean reward exceeding a certain threshold value) as frequently as possible. The performance is measured by satisficing regret, which is the cumulative deficit of the chosen arm's mean reward… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2405.07839  [pdf, other

    cs.LG cs.AI stat.ML

    Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

    Authors: Haoyang Zheng, Hengrong Du, Qi Feng, Wei Deng, Guang Lin

    Abstract: Replica exchange stochastic gradient Langevin dynamics (reSGLD) is an effective sampler for non-convex learning in large-scale datasets. However, the simulation may encounter stagnation issues when the high-temperature chain delves too deeply into the distribution tails. To tackle this issue, we propose reflected reSGLD (r2SGLD): an algorithm tailored for constrained non-convex exploration by util… ▽ More

    Submitted 3 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 28 pages, 13 figures

  4. arXiv:2402.01036  [pdf, other

    math.PR cs.LG stat.ML

    Fisher information dissipation for time inhomogeneous stochastic differential equations

    Authors: Qi Feng, Xinzhe Zuo, Wuchen Li

    Abstract: We provide a Lyapunov convergence analysis for time-inhomogeneous variable coefficient stochastic differential equations (SDEs). Three typical examples include overdamped, irreversible drift, and underdamped Langevin dynamics. We first formula the probability transition equation of Langevin dynamics as a modified gradient flow of the Kullback-Leibler divergence in the probability space with respec… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 9 figures, 36 pages

  5. arXiv:2401.03228  [pdf, other

    stat.ML cs.LG

    Reflected Schrödinger Bridge for Constrained Generative Modeling

    Authors: Wei Deng, Yu Chen, Nicole Tianjiao Yang, Hengrong Du, Qi Feng, Ricky T. Q. Chen

    Abstract: Diffusion models have become the go-to method for large-scale generative models in real-world applications. These applications often involve data distributions confined within bounded domains, typically requiring ad-hoc thresholding techniques for boundary enforcement. Reflected diffusion models (Lou23) aim to enhance generalizability by generating the data distribution through a backward process… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  6. arXiv:2312.17210  [pdf, other

    stat.ML cs.AI cs.LG

    Continual Learning via Sequential Function-Space Variational Inference

    Authors: Tim G. J. Rudner, Freddie Bickford Smith, Qixuan Feng, Yee Whye Teh, Yarin Gal

    Abstract: Sequential Bayesian inference over predictive functions is a natural framework for continual learning from streams of data. However, applying it to neural networks has proved challenging in practice. Addressing the drawbacks of existing techniques, we propose an optimization objective derived by formulating continual learning as sequential function-space variational inference. In contrast to exist… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Published in Proceedings of the 39th International Conference on Machine Learning (ICML 2022)

  7. arXiv:2211.12717  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Benchmarking Bayesian Deep Learning on Diabetic Retinopathy Detection Tasks

    Authors: Neil Band, Tim G. J. Rudner, Qixuan Feng, Angelos Filos, Zachary Nado, Michael W. Dusenberry, Ghassen Jerfel, Dustin Tran, Yarin Gal

    Abstract: Bayesian deep learning seeks to equip deep neural networks with the ability to precisely quantify their predictive uncertainty, and has promised to make deep learning more reliable for safety-critical real-world applications. Yet, existing Bayesian deep learning methods fall short of this promise; new methods continue to be evaluated on unrealistic test beds that do not reflect the complexities of… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Published in Neural Information Processing Systems (NeurIPS) 2021 Datasets and Benchmarks Track Proceedings. First two authors contributed equally. Code available at https://rebrand.ly/retina-benchmark

  8. arXiv:2211.10837  [pdf, other

    cs.LG stat.CO

    Non-reversible Parallel Tempering for Deep Posterior Approximation

    Authors: Wei Deng, Qian Zhang, Qi Feng, Faming Liang, Guang Lin

    Abstract: Parallel tempering (PT), also known as replica exchange, is the go-to workhorse for simulations of multi-modal distributions. The key to the success of PT is to adopt efficient swap schemes. The popular deterministic even-odd (DEO) scheme exploits the non-reversibility property and has successfully reduced the communication cost from $O(P^2)$ to $O(P)$ given sufficiently many $P$ chains. However,… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: Accepted by AAAI 2023

  9. arXiv:2211.01798  [pdf, other

    stat.ML cs.LG

    Phase Transitions in Learning and Earning under Price Protection Guarantee

    Authors: Qing Feng, Ruihao Zhu, Stefanus Jasin

    Abstract: Motivated by the prevalence of ``price protection guarantee", which allows a customer who purchased a product in the past to receive a refund from the seller during the so-called price protection period (typically defined as a certain time window after the purchase date) in case the seller decides to lower the price, we study the impact of such policy on the design of online learning algorithm for… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  10. arXiv:2203.01900  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Sparse Bayesian Optimization

    Authors: Sulin Liu, Qing Feng, David Eriksson, Benjamin Letham, Eytan Bakshy

    Abstract: Bayesian optimization (BO) is a powerful approach to sample-efficient optimization of black-box objective functions. However, the application of BO to areas such as recommendation systems often requires taking the interpretability and simplicity of the configurations into consideration, a setting that has not been previously studied in the BO literature. To make BO useful for this setting, we pres… ▽ More

    Submitted 3 March, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

  11. arXiv:2010.01084  [pdf, other

    stat.ML cs.LG math.PR stat.CO stat.ME

    Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction

    Authors: Wei Deng, Qi Feng, Georgios Karagiannis, Guang Lin, Faming Liang

    Abstract: Replica exchange stochastic gradient Langevin dynamics (reSGLD) has shown promise in accelerating the convergence in non-convex learning; however, an excessively large correction for avoiding biases from noisy energy estimators has limited the potential of the acceleration. To address this issue, we study the variance reduction for noisy energy estimators, which promotes much more effective swaps.… ▽ More

    Submitted 18 March, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Accepted by ICLR 2021

  12. arXiv:2008.05367  [pdf, other

    stat.ML cs.LG math.PR stat.ME

    Non-convex Learning via Replica Exchange Stochastic Gradient MCMC

    Authors: Wei Deng, Qi Feng, Liyao Gao, Faming Liang, Guang Lin

    Abstract: Replica exchange Monte Carlo (reMC), also known as parallel tempering, is an important technique for accelerating the convergence of the conventional Markov Chain Monte Carlo (MCMC) algorithms. However, such a method requires the evaluation of the energy function based on the full dataset and is not scalable to big data. The naïve implementation of reMC in mini-batch settings introduces large bias… ▽ More

    Submitted 22 March, 2021; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: Accepted by ICML 2020

  13. arXiv:1807.09741  [pdf, other

    cs.LG stat.ML

    PADME: A Deep Learning-based Framework for Drug-Target Interaction Prediction

    Authors: Qingyuan Feng, Evgenia Dueva, Artem Cherkasov, Martin Ester

    Abstract: In silico drug-target interaction (DTI) prediction is an important and challenging problem in biomedical research with a huge potential benefit to the pharmaceutical industry and patients. Most existing methods for DTI prediction including deep learning models generally have binary endpoints, which could be an oversimplification of the problem, and those methods are typically unable to handle cold… ▽ More

    Submitted 21 August, 2019; v1 submitted 25 July, 2018; originally announced July 2018.

  14. arXiv:1704.02060  [pdf, ps, other

    stat.ML

    Angle-Based Joint and Individual Variation Explained

    Authors: Qing Feng, Meilei Jiang, Jan Hannig, J. S. Marron

    Abstract: Integrative analysis of disparate data blocks measured on a common set of experimental subjects is a major challenge in modern data analysis. This data structure naturally motivates the simultaneous exploration of the joint and individual variation within each data block resulting in new insights. For instance, there is a strong desire to integrate the multiple genomic data sets in The Cancer Geno… ▽ More

    Submitted 18 March, 2018; v1 submitted 6 April, 2017; originally announced April 2017.

    Comments: arXiv admin note: text overlap with arXiv:1512.04060

  15. arXiv:1612.08082  [pdf, other

    stat.ML cs.LG

    Constructing Effective Personalized Policies Using Counterfactual Inference from Biased Data Sets with Many Features

    Authors: Onur Atan, William R. Zame, Qiaojun Feng, Mihaela van der Schaar

    Abstract: This paper proposes a novel approach for constructing effective personalized policies when the observed data lacks counter-factual information, is biased and possesses many features. The approach is applicable in a wide variety of settings from healthcare to advertising to education to finance. These settings have in common that the decision maker can observe, for each previous instance, an array… ▽ More

    Submitted 10 July, 2018; v1 submitted 23 December, 2016; originally announced December 2016.

  16. arXiv:1610.08899  [pdf, other

    stat.ME

    Estimation of heterogeneous individual treatment effects with endogenous treatments

    Authors: Qian Feng, Quang Vuong, Haiqing Xu

    Abstract: This paper estimates individual treatment effects in a triangular model with binary--valued endogenous treatments. Following the identification strategy established in Vuong and Xu (2014), we propose a two--stage estimation approach. First, we estimate the counterfactual outcome and hence the individual treatment effect (ITE) for every observational unit in the sample. Second, we estimate the dens… ▽ More

    Submitted 27 October, 2016; originally announced October 2016.

    Comments: 32 pages, 12 figures, 6 tables, Texas Econometrics Camp 2015 and 2016 CEME conference

  17. arXiv:1601.01986  [pdf, ps, other

    stat.ME

    A Note on Automatic Data Transformation

    Authors: Qing Feng, Jan Hannig, J. S. Marron

    Abstract: Modern data analysis frequently involves variables with highly non-Gaussian marginal distributions. However, commonly used analysis methods are most effective with roughly Gaussian data. This paper introduces an automatic transformation that improves the closeness of distributions to normality. For each variable, a new family of parametrizations of the shifted logarithm transformation is proposed,… ▽ More

    Submitted 8 January, 2016; originally announced January 2016.

  18. arXiv:1512.04060  [pdf, ps, other

    stat.ME

    Non-iterative Joint and Individual Variation Explained

    Authors: Qing Feng, Jan Hannig, J. S. Marron

    Abstract: Integrative analysis of disparate data blocks measured on a common set of experimental subjects is one major challenge in modern data analysis. This data structure naturally motivates the simultaneous exploration of the joint and individual variation within each data block resulting in new insights. For instance, there is a strong desire to integrate the multiple genomic data sets in The Cancer Ge… ▽ More

    Submitted 25 April, 2016; v1 submitted 13 December, 2015; originally announced December 2015.