Showing 1–50 of 66 results for author: Huang, L

  1. arXiv:2406.11092  [pdf, other

    cs.LG math.NA stat.ML

    Guaranteed Sampling Flexibility for Low-tubal-rank Tensor Completion

    Authors: Bowen Su, Juntao You, HanQin Cai, Longxiu Huang

    Abstract: While Bernoulli sampling is extensively studied in tensor completion, t-CUR sampling approximates low-tubal-rank tensors via lateral and horizontal subtensors. However, both methods lack sufficient flexibility for diverse practical applications. To address this, we introduce Tensor Cross-Concentrated Sampling (t-CCS), a novel and straightforward sampling model that advances the matrix cross-concen… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2406.07409  [pdf, other

    stat.ML cs.IT cs.LG eess.SP math.OC

    Accelerating Ill-conditioned Hankel Matrix Recovery via Structured Newton-like Descent

    Authors: HanQin Cai, Longxiu Huang, Xiliang Lu, Juntao You

    Abstract: This paper studies the robust Hankel recovery problem, which simultaneously removes the sparse outliers and fulfills missing entries from the partial observation. We propose a novel non-convex algorithm, coined Hankel Structured Newton-Like Descent (HSNLD), to tackle the robust Hankel recovery problem. HSNLD is highly efficient with linear convergence, and its convergence rate is independent of th… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    MSC Class: 15A29; 15A83; 47B35; 90C17; 90C26; 90C53

  3. arXiv:2406.05822  [pdf, other

    cs.LG stat.ML

    Symmetric Matrix Completion with ReLU Sampling

    Authors: Huikang Liu, Peng Wang, Longxiu Huang, Qing Qu, Laura Balzano

    Abstract: We study the problem of symmetric positive semi-definite low-rank matrix completion (MC) with deterministic entry-dependent sampling. In particular, we consider rectified linear unit (ReLU) sampling, where only positive entries are observed, as well as a generalization to threshold-based sampling. We first empirically demonstrate that the landscape of this MC problem is not globally benign: Gradie… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 39 pages, 9 figures; This work has been accepted for publication in the Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  4. arXiv:2406.00539  [pdf, other

    cs.LG stat.ML

    CONFINE: Conformal Prediction for Interpretable Neural Networks

    Authors: Linhui Huang, Sayeri Lala, Niraj K. Jha

    Abstract: Deep neural networks exhibit remarkable performance, yet their black-box nature limits their utility in fields like healthcare where interpretability is crucial. Existing explainability approaches often sacrifice accuracy and lack quantifiable measures of prediction uncertainty. In this study, we introduce Conformal Prediction for Interpretable Neural Networks (CONFINE), a versatile framework that… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2403.02625  [pdf, ps, other

    stat.ME

    Determining the Number of Common Functional Factors with Twice Cross-Validation

    Authors: Hui Jiang, Lei Huang, Shengfan Wu

    Abstract: The semiparametric factor model serves as a vital tool to describe the dependence patterns in the data. It recognizes that the common features observed in the data are actually explained by functions of specific exogenous variables. Unlike traditional factor models, where the focus is on selecting the number of factors, our objective here is to identify the appropriate number of common functions, a… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  6. arXiv:2402.18149  [pdf, ps, other

    cs.LG stat.ML

    Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation

    Authors: Tonghe Zhang, Yu Chen, Longbo Huang

    Abstract: This work pioneers regret analysis of risk-sensitive reinforcement learning in partially observable environments with hindsight observation, addressing a gap in theoretical exploration. We introduce a novel formulation that integrates hindsight observations into a Partially Observable Markov Decision Process (POMDP) framework, where the goal is to optimize accumulated reward under the entropic ris… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 38 pages

  7. arXiv:2402.03447  [pdf, other

    stat.ML cs.LG stat.ME

    Challenges in Variable Importance Ranking Under Correlation

    Authors: Annie Liang, Thomas Jemielita, Andy Liaw, Vladimir Svetnik, Lingkang Huang, Richard Baumgartner, Jason M. Klusowski

    Abstract: Variable importance plays a pivotal role in interpretable machine learning as it helps measure the impact of factors on the output of the prediction model. Model agnostic methods based on the generation of "null" features via permutation (or related approaches) can be applied. Such analysis is often utilized in pharmaceutical applications due to its ability to interpret black-box models, including… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  8. arXiv:2401.15566  [pdf, other

    stat.ML cs.IT cs.LG math.OC

    On the Robustness of Cross-Concentrated Sampling for Matrix Completion

    Authors: HanQin Cai, Longxiu Huang, Chandra Kundu, Bowen Su

    Abstract: Matrix completion is one of the crucial tools in modern data science research. Recently, a novel sampling model for matrix completion coined cross-concentrated sampling (CCS) has caught much attention. However, the robustness of the CCS model against sparse outliers remains unclear in the existing studies. In this paper, we aim to answer this question by exploring a novel Robust CCS Completion pro… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 58th Annual Conference of Information Sciences and Systems

  9. arXiv:2310.13969  [pdf, ps, other

    stat.ML cs.LG

    Distributed Linear Regression with Compositional Covariates

    Authors: Yue Chao, Lei Huang, Xuejun Ma

    Abstract: With the availability of extraordinarily huge data sets, solving the problems of distributed statistical methodology and computing for such data sets has become increasingly crucial in the big data area. In this paper, we focus on the distributed sparse penalized linear log-contrast model in massive compositional data. In particular, two distributed optimization techniques under centralized and de… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 35 pages, 2 figures

    MSC Class: 62-08 ACM Class: G.3

  10. arXiv:2307.11214  [pdf, other

    cs.LG stat.AP

    FairMobi-Net: A Fairness-aware Deep Learning Model for Urban Mobility Flow Generation

    Authors: Zhewei Liu, Lipai Huang, Chao Fan, Ali Mostafavi

    Abstract: Generating realistic human flows across regions is essential for our understanding of urban structures and population activity patterns, enabling important applications in the fields of urban planning and management. However, a notable shortcoming of most existing mobility generation methodologies is neglect of prediction fairness, which can result in underestimation of mobility flows across regio… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  11. arXiv:2303.01131  [pdf

    stat.AP

    Association Among Gender, Age, and Region in Taiwan's First Ten Thousand COVID-19 Cases: A Log-linear-model Analysis

    Authors: Tai-Cheng Hung, Li-Shan Huang

    Abstract: Objectives: We explore the association between age, gender, and region among Taiwan's 11290 local Covid-19 cases from January 22, 2020 to June 11, 2021. Methods: Using open data from Taiwan's CDC, we organize them into a three-dimensional contingency table. The groups are gender, age 0-29, 30-59, and 60+ years old, and two classifications for region: (1) 7 commonly-defined regions, (2) 12 groups s… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 19 pages, 4 tables, 2 figures

    MSC Class: 62P10 (Primary); 62J12 (Secondary)

  12. arXiv:2210.08228  [pdf, other

    math.ST stat.ME

    Nonparametric Estimation of Mediation Effects with A General Treatment

    Authors: Lukang Huang, Wei Huang, Oliver Linton, Zheng Zhang

    Abstract: To investigate causal mechanisms, causal mediation analysis decomposes the total treatment effect into the natural direct and indirect effects. This paper examines the estimation of the direct and indirect effects in a general treatment effect model, where the treatment can be binary, multi-valued, continuous, or a mixture. We propose generalized weighting estimators with weights estimated by solv… ▽ More

    Submitted 22 January, 2024; v1 submitted 15 October, 2022; originally announced October 2022.

  13. arXiv:2210.05122  [pdf, other

    cond-mat.stat-mech stat.AP

    Universal cover-time distribution of heterogeneous random walks

    Authors: Jia-Qi Dong, Wen-Hui Han, Yisen Wang, Xiao-Song Chen, Liang Huang

    Abstract: The cover-time problem, i.e., time to visit every site in a system, is one of the key issues of random walks with wide applications in natural, social, and engineered systems. Addressing the full distribution of cover times for random walk on complex structures has been a long-standing challenge and has attracted persistent efforts. Yet, the known results are essentially limited to homogeneous sys… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: 12 pages, 6 figures

  14. arXiv:2208.04298  [pdf, other

    cs.CV cs.AI cs.GR stat.AP

    Gaze Estimation Approach Using Deep Differential Residual Network

    Authors: Longzhao Huang, Yujie Li, Xu Wang, Haoyu Wang, Ahmed Bouridane, Ahmad Chaddad

    Abstract: Gaze estimation, which is a method to determine where a person is looking at given the person's full face, is a valuable clue for understanding human intention. Similarly to other domains of computer vision, deep learning (DL) methods have gained recognition in the gaze estimation domain. However, there are still gaze calibration problems in the gaze estimation domain, thus preventing existing met… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Journal ref: Sensors 2022, 22(14), 5462

  15. arXiv:2203.14702  [pdf, other

    cs.CV cs.LG stat.ML

    Bi-level Doubly Variational Learning for Energy-based Latent Variable Models

    Authors: Ge Kan, Jinhu Lü, Tian Wang, Baochang Zhang, Aichun Zhu, Lei Huang, Guodong Guo, Hichem Snoussi

    Abstract: Energy-based latent variable models (EBLVMs) are more expressive than conventional energy-based models. However, its potential on visual tasks are limited by its training process based on maximum likelihood estimate that requires sampling from two intractable distributions. In this paper, we propose Bi-level doubly variational learning (BiDVL), which is based on a new bi-level optimization framewo… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  16. arXiv:2201.13324  [pdf, other

    cs.LG cs.IR stat.ML

    Guided Semi-Supervised Non-negative Matrix Factorization on Legal Documents

    Authors: Pengyu Li, Christine Tseng, Yaxuan Zheng, Joyce A. Chew, Longxiu Huang, Benjamin Jarman, Deanna Needell

    Abstract: Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform classification and topic modeling tasks; however, most methods that can perform both do not allow for guidance of the topics or features. In this paper, we… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: 14 pages, 4 figures

  17. arXiv:2110.15263  [pdf, other

    cs.LG cs.CG cs.DS econ.EM stat.ML

    Coresets for Time Series Clustering

    Authors: Lingxiao Huang, K. Sudhir, Nisheeth K. Vishnoi

    Abstract: We study the problem of constructing coresets for clustering problems with time series data. This problem has gained importance across many fields including biology, medicine, and economics due to the proliferation of sensors facilitating real-time measurement and rapid drop in storage costs. In particular, we consider the setting where the time series data on $N$ entities is generated from a Gaus… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: Full version of a paper appearing in NeurIPS 2021

  18. arXiv:2110.14446  [pdf, other

    cs.LG cs.SI stat.ML

    Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

    Authors: Derek Lim, Felix Hohne, Xiuyu Li, Sijia Linda Huang, Vaishnavi Gupta, Omkar Bhalerao, Ser-Nam Lim

    Abstract: Many widely used datasets for graph machine learning tasks have generally been homophilous, where nodes with similar labels connect to each other. Recently, new Graph Neural Networks (GNNs) have been developed that move beyond the homophily regime; however, their evaluation has often been conducted on small graphs with limited application domains. We collect and introduce diverse non-homophilous d… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021

  19. arXiv:2110.13400   

    cs.LG stat.ML

    Scale-Free Adversarial Multi-Armed Bandit with Arbitrary Feedback Delays

    Authors: Jiatai Huang, Yan Dai, Longbo Huang

    Abstract: We consider the Scale-Free Adversarial Multi-Armed Bandit (MAB) problem with unrestricted feedback delays. In contrast to the standard assumption that all losses are $[0,1]$-bounded, in our setting, losses can fall in a general bounded interval $[-L, L]$, unknown to the agent beforehand. Furthermore, the feedback of each arm pull can experience arbitrary delays. We propose a novel approach named S… ▽ More

    Submitted 25 January, 2023; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Preliminary work, merged to arXiv:2301.10500

  20. arXiv:2110.05636  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    CAPITAL: Optimal Subgroup Identification via Constrained Policy Tree Search

    Authors: Hengrui Cai, Wenbin Lu, Rachel Marceau West, Devan V. Mehrotra, Lingkang Huang

    Abstract: Personalized medicine, a paradigm of medicine tailored to a patient's characteristics, is an increasingly attractive field in health care. An important goal of personalized medicine is to identify a subgroup of patients, based on baseline covariates, that benefits more from the targeted treatment than other comparative treatments. Most of the current subgroup identification methods only focus on o… ▽ More

    Submitted 28 January, 2023; v1 submitted 11 October, 2021; originally announced October 2021.

  21. arXiv:2109.14079  [pdf, other

    cs.IT math.NA stat.CO

    Robust recovery of bandlimited graph signals via randomized dynamical sampling

    Authors: Longxiu Huang, Deanna Needell, Sui Tang

    Abstract: Heat diffusion processes have found wide applications in modelling dynamical systems over graphs. In this paper, we consider the recovery of a $k$-bandlimited graph signal that is an initial signal of a heat diffusion process from its space-time samples. We propose three random space-time sampling regimes, termed dynamical sampling techniques, that consist in selecting a small subset of space-time… ▽ More

    Submitted 3 October, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: corrected mistakes in plotting. arXiv admin note: text overlap with arXiv:1511.05118 by other authors

    MSC Class: 94A20; 94A12

  22. arXiv:2107.04061  [pdf, other

    cs.LG cs.AI stat.ML

    Scaling Gaussian Processes with Derivative Information Using Variational Inference

    Authors: Misha Padidar, Xinran Zhu, Leo Huang, Jacob R. Gardner, David Bindel

    Abstract: Gaussian processes with derivative information are useful in many settings where derivative information is available, including numerous Bayesian optimization and regression tasks that arise in the natural sciences. Incorporating derivative observations, however, comes with a dominating $O(N^3D^3)$ computational cost when training on $N$ points in $D$ input dimensions. This is intractable for even… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  23. arXiv:2106.08943   

    cs.LG stat.ML

    Banker Online Mirror Descent

    Authors: Jiatai Huang, Longbo Huang

    Abstract: We propose Banker-OMD, a novel framework generalizing the classical Online Mirror Descent (OMD) technique in online learning algorithm design. Banker-OMD allows algorithms to robustly handle delayed feedback, and offers a general methodology for achieving $\tilde{O}(\sqrt{T} + \sqrt{D})$-style regret bounds in various delayed-feedback online learning tasks, where $T$ is the time horizon length and… ▽ More

    Submitted 25 January, 2023; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Preliminary work, merged to arXiv:2301.10500

  24. arXiv:2012.07048  [pdf, other

    cs.LG stat.ML

    Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback

    Authors: Siwei Wang, Haoyun Wang, Longbo Huang

    Abstract: We study the multi-armed bandit (MAB) problem with composite and anonymous feedback. In this model, the reward of pulling an arm spreads over a period of time (we call this period as reward interval) and the player receives partial rewards of the action, convoluted with rewards from pulling other arms, successively. Existing results on this model require prior knowledge about the reward interval s… ▽ More

    Submitted 15 December, 2020; v1 submitted 13 December, 2020; originally announced December 2020.

  25. arXiv:2011.00981  [pdf, other

    cs.LG cs.CG cs.DS econ.EM stat.ML

    Coresets for Regressions with Panel Data

    Authors: Lingxiao Huang, K. Sudhir, Nisheeth K. Vishnoi

    Abstract: This paper introduces the problem of coresets for regression problems to panel data settings. We first define coresets for several variants of regression problems with panel data and then present efficient algorithms to construct coresets of size that depend polynomially on 1/$\varepsilon$ (where $\varepsilon$ is the error parameter) and the number of regression parameters - independent of the num… ▽ More

    Submitted 2 November, 2020; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: This is a Full version of a paper to appear in NeurIPS 2020. The code can be found in https://github.com/huanglx12/Coresets-for-regressions-with-panel-data

  26. arXiv:2010.07422  [pdf, other

    stat.ML cs.AI cs.IT cs.LG math.NA math.OC

    Rapid Robust Principal Component Analysis: CUR Accelerated Inexact Low Rank Estimation

    Authors: HanQin Cai, Keaton Hamm, Longxiu Huang, Jiaqi Li, Tao Wang

    Abstract: Robust principal component analysis (RPCA) is a widely used tool for dimension reduction. In this work, we propose a novel non-convex algorithm, coined Iterated Robust CUR (IRCUR), for solving RPCA problems, which dramatically improves the computational efficiency in comparison with the existing algorithms. IRCUR achieves this acceleration by employing CUR decomposition when updating the low rank… ▽ More

    Submitted 7 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Journal ref: IEEE Signal Processing Letters, 28 (2021): 116-120

  27. arXiv:2009.13333  [pdf, other

    cs.LG cs.CV stat.ML

    Group Whitening: Balancing Learning Efficiency and Representational Capacity

    Authors: Lei Huang, Yi Zhou, Li Liu, Fan Zhu, Ling Shao

    Abstract: Batch normalization (BN) is an important technique commonly incorporated into deep learning models to perform standardization within mini-batches. The merits of BN in improving a model's learning efficiency can be further amplified by applying whitening, while its drawbacks in estimating population statistics for inference can be avoided through group normalization (GN). This paper proposes group… ▽ More

    Submitted 6 April, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: V4: camera version of CVPR 2021. Code available at: https://github.com/huangleiBuaa/GroupWhitening

  28. arXiv:2009.12836  [pdf, other

    cs.LG cs.CV stat.ML

    Normalization Techniques in Training DNNs: Methodology, Analysis and Application

    Authors: Lei Huang, Jie Qin, Yi Zhou, Fan Zhu, Li Liu, Ling Shao

    Abstract: Normalization techniques are essential for accelerating the training and improving the generalization of deep neural networks (DNNs), and have successfully been used in various applications. This paper reviews and comments on the past, present and future of normalization methods in the context of DNN training. We provide a unified picture of the main motivation behind different approaches from the… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

    Comments: 20 pages

  29. arXiv:2009.09074  [pdf, other

    cs.DL cs.IR cs.LG stat.ML

    COVID-19 Literature Topic-Based Search via Hierarchical NMF

    Authors: Rachel Grotheer, Yihuan Huang, Pengyu Li, Elizaveta Rebrova, Deanna Needell, Longxiu Huang, Alona Kryshchenko, Xia Li, Kyung Ha, Oleksandr Kryshchenko

    Abstract: A dataset of COVID-19-related scientific literature is compiled, combining the articles from several online libraries and selecting those with open access and full text available. Then, hierarchical nonnegative matrix factorization is used to organize literature related to the novel coronavirus into a tree structure that allows researchers to search for relevant literature based on detected topics… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

  30. arXiv:2007.13040  [pdf, other

    cs.LG stat.ML

    Improving Generalization in Meta-learning via Task Augmentation

    Authors: Huaxiu Yao, Longkai Huang, Linjun Zhang, Ying Wei, Li Tian, James Zou, Junzhou Huang, Zhenhui Li

    Abstract: Meta-learning has proven to be a powerful paradigm for transferring the knowledge from previous tasks to facilitate the learning of a novel task. Current dominant algorithms train a well-generalized model initialization which is adapted to each task via the support set. The crux lies in optimizing the generalization capability of the initialization, which is measured by the performance of the adap… ▽ More

    Submitted 9 June, 2021; v1 submitted 25 July, 2020; originally announced July 2020.

    Comments: Accepted by ICML 2021

  31. arXiv:2007.04873  [pdf, other

    cs.LG cs.CV stat.ML

    Invertible Zero-Shot Recognition Flows

    Authors: Yuming Shen, Jie Qin, Lei Huang

    Abstract: Deep generative models have been successfully applied to Zero-Shot Learning (ZSL) recently. However, the underlying drawbacks of GANs and VAEs (e.g., the hardness of training with ZSL-oriented regularizers and the limited generation quality) hinder the existing generative ZSL models from fully bypassing the seen-unseen bias. To tackle the above limitations, for the first time, this work incorporat… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: ECCV2020

  32. arXiv:2007.00784  [pdf, other

    cs.LG cs.DC stat.ML

    Convolutional Neural Network Training with Distributed K-FAC

    Authors: J. Gregory Pauloski, Zhao Zhang, Lei Huang, Weijia Xu, Ian T. Foster

    Abstract: Training neural networks with many processors can reduce time-to-solution; however, it is challenging to maintain convergence and efficiency at large scales. The Kronecker-factored Approximate Curvature (K-FAC) was recently proposed as an approximation of the Fisher Information Matrix that can be used in natural gradient optimizers. We investigate here a scalable K-FAC design and its applicability… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: To be published in the proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC20)

  33. arXiv:2006.12772  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Combinatorial Pure Exploration of Dueling Bandit

    Authors: Wei Chen, Yihan Du, Longbo Huang, Haoyu Zhao

    Abstract: In this paper, we study combinatorial pure exploration for dueling bandits (CPE-DB): we have multiple candidates for multiple positions as modeled by a bipartite graph, and in each round we sample a duel of two candidates on one position and observe who wins in the duel, with the goal of finding the best candidate-position matching with high probability after multiple rounds of samples. CPE-DB is… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

    Comments: Accepted to ICML 2020

  34. arXiv:2006.10254  [pdf, other

    stat.ML cs.LG math.DG

    Neural Manifold Ordinary Differential Equations

    Authors: Aaron Lou, Derek Lim, Isay Katsman, Leo Huang, Qingxuan Jiang, Ser-Nam Lim, Christopher De Sa

    Abstract: To better conform to data geometry, recent deep generative modelling techniques adapt Euclidean constructions to non-Euclidean spaces. In this paper, we study normalizing flows on manifolds. Previous work has developed flow models for specific cases; however, these advancements hand craft layers on a manifold-by-manifold basis, restricting generality and inducing cumbersome design constraints. We… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: Submitted to NeurIPS 2020

  35. arXiv:2006.06555  [pdf, ps, other

    cs.LG cs.MA stat.ML

    Multi-Agent Reinforcement Learning in Stochastic Networked Systems

    Authors: Yiheng Lin, Guannan Qu, Longbo Huang, Adam Wierman

    Abstract: We study multi-agent reinforcement learning (MARL) in a stochastic network of agents. The objective is to find localized policies that maximize the (discounted) global reward. In general, scalability is a challenge in this setting because the size of the global state/action space can be exponential in the number of agents. Scalable algorithms are only known in cases where dependencies are static,… ▽ More

    Submitted 1 November, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

  36. arXiv:2006.06193  [pdf, other

    cs.LG stat.ML

    Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework

    Authors: Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li

    Abstract: Exploration is essential for reinforcement learning (RL). To face the challenges of exploration, we consider a reward-free RL framework that completely separates exploration from exploitation and brings new challenges for exploration algorithms. In the exploration phase, the agent learns an exploratory policy by interacting with a reward-free environment and collects a dataset of transitions by ex… ▽ More

    Submitted 10 December, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted by AAAI-21

  37. arXiv:2006.04778  [pdf, other

    cs.LG cs.AI cs.CY cs.DS stat.ML

    Fair Classification with Noisy Protected Attributes: A Framework with Provable Guarantees

    Authors: L. Elisa Celis, Lingxiao Huang, Vijay Keswani, Nisheeth K. Vishnoi

    Abstract: We present an optimization framework for learning a fair classifier in the presence of noisy perturbations in the protected attributes. Compared to prior work, our framework can be employed with a very general class of linear and linear-fractional fairness constraints, can handle multiple, non-binary protected attributes, and outputs a classifier that comes with provable guarantees on both accurac… ▽ More

    Submitted 16 February, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

  38. arXiv:2006.01424  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Image Super-Resolution with Cross-Scale Non-Local Attention and Exhaustive Self-Exemplars Mining

    Authors: Yiqun Mei, Yuchen Fan, Yuqian Zhou, Lichao Huang, Thomas S. Huang, Humphrey Shi

    Abstract: Deep convolution-based single image super-resolution (SISR) networks embrace the benefits of learning from large-scale external image resources for local recovery, yet most existing works have ignored the long-range feature-wise similarities in natural images. Some recent works have successfully leveraged this intrinsic feature correlation by exploring non-local attention modules. However, none of… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: CVPR2020

  39. arXiv:2006.00978  [pdf, ps, other

    cs.LG stat.ML

    On the Number of Linear Regions of Convolutional Neural Networks

    Authors: H. Xiong, L. Huang, M. Yu, L. Liu, F. Zhu, L. Shao

    Abstract: One fundamental problem in deep learning is understanding the outstanding performance of deep Neural Networks (NNs) in practice. One explanation for the superiority of NNs is that they can realize a large class of complicated functions, i.e., they have powerful expressivity. The expressivity of a ReLU NN can be quantified by the maximal number of linear regions it can separate its input space into… ▽ More

    Submitted 27 June, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: International Conference on Machine Learning (ICML) 2020

  40. arXiv:2002.10319  [pdf, other

    cs.LG cs.CV stat.ML

    Self-Adaptive Training: beyond Empirical Risk Minimization

    Authors: Lang Huang, Chao Zhang, Hongyang Zhang

    Abstract: We propose self-adaptive training---a new training algorithm that dynamically corrects problematic training labels by model predictions without incurring extra computational cost---to improve generalization of deep learning for potentially corrupted training data. This problem is crucial towards robustly learning from data that are corrupted by, e.g., label noises and out-of-distribution samples.… ▽ More

    Submitted 30 September, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: To appear in NeurIPS 2020

  41. arXiv:2002.02090  [pdf, other

    cs.LG cs.DC stat.ML

    Faster On-Device Training Using New Federated Momentum Algorithm

    Authors: Zhouyuan Huo, Qian Yang, Bin Gu, Lawrence Carin, Heng Huang

    Abstract: Mobile crowdsensing has gained significant attention in recent years and has become a critical paradigm for emerging Internet of Things applications. The sensing devices continuously generate a significant quantity of data, which provide tremendous opportunities to develop innovative intelligent applications. To utilize these data to train machine learning models while not compromising user privac… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

  42. arXiv:2002.00401  [pdf

    stat.ML cs.LG

    Provable Noisy Sparse Subspace Clustering using Greedy Neighbor Selection: A Coherence-Based Perspective

    Authors: Jwo-Yuh Wu, Wen-Hsuan Li, Liang-Chi Huang, Yen-Ping Lin, Chun-Hung Liu, Rung-Hung Gau

    Abstract: Sparse subspace clustering (SSC) using greedy-based neighbor selection, such as matching pursuit (MP) and orthogonal matching pursuit (OMP), has been known as a popular computationally-efficient alternative to the conventional L1-minimization based methods. Under deterministic bounded noise corruption, in this paper we derive coherence-based sufficient conditions guaranteeing correct neighbor iden… ▽ More

    Submitted 2 February, 2020; originally announced February 2020.

  43. arXiv:1911.04207  [pdf, other

    cs.LG stat.ML

    Multi-Path Policy Optimization

    Authors: Ling Pan, Qingpeng Cai, Longbo Huang

    Abstract: Recent years have witnessed a tremendous improvement in deep reinforcement learning. However, a challenging problem is that an agent may suffer from inefficient exploration, particularly for on-policy methods. Previous exploration methods either rely on complex structure to estimate the novelty of states, or incur sensitive hyper-parameters causing instability. We propose an efficient exploration…

    Submitted 14 February, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: AAMAS-2020

  44. arXiv:1911.00741  [pdf, other]

    stat.ME

    Yakovlev Promotion Time Cure Model with Local Polynomial Estimation

    Authors: Li-Hsiang Lin, Li-Shan Huang

    Abstract: In modeling survival data with a cure fraction, flexible modeling of covariate effects on the probability of cure has important medical implications, which aids investigators in identifying better treatments to cure. This paper studies a semiparametric form of the Yakovlev promotion time cure model that allows for nonlinear effects of a continuous covariate. We adopt the local polynomial approach…

    Submitted 4 November, 2019; v1 submitted 2 November, 2019; originally announced November 2019.

    Comments: 26 pages, 4 figures

    MSC Class: 62N99; 62G08

  45. arXiv:1910.09734  [pdf, other]

    cs.LG cs.CV stat.ML

    Single and Union Non-parallel Support Vector Machine Frameworks

    Authors: Chun-Na Li, Yuan-Hai Shao, Huajun Wang, Yu-Ting Zhao, Ling-Wei Huang, Naihua Xiu, Nai-Yang Deng

    Abstract: Considering the classification problem, we summarize the nonparallel support vector machines with nonparallel hyperplanes into two types of frameworks. The first type constructs the hyperplanes separately. It solves a series of small optimization problems to obtain a series of hyperplanes, but it is hard to measure the loss of each sample. The other type constructs all the hyperplanes simultaneousl…

    Submitted 25 June, 2021; v1 submitted 21 October, 2019; originally announced October 2019.

  46. arXiv:1909.03276  [pdf, other]

    cs.LG cs.AI cs.IR stat.ML

    Adaptive Factorization Network: Learning Adaptive-Order Feature Interactions

    Authors: Weiyu Cheng, Yanyan Shen, Linpeng Huang

    Abstract: Various factorization-based methods have been proposed to leverage second-order, or higher-order cross features for boosting the performance of predictive models. They generally enumerate all the cross features under a predefined maximum order, and then identify useful feature interactions through model training, which suffer from two drawbacks. First, they have to make a trade-off between the exp…

    Submitted 23 June, 2020; v1 submitted 7 September, 2019; originally announced September 2019.

    Comments: Accepted by AAAI'20

  47. arXiv:1906.08484  [pdf, other]

    cs.DS cs.CG cs.LG stat.ML

    Coresets for Clustering with Fairness Constraints

    Authors: Lingxiao Huang, Shaofeng H. -C. Jiang, Nisheeth K. Vishnoi

    Abstract: In a recent work, [19] studied the following "fair" variants of classical clustering problems such as $k$-means and $k$-median: given a set of $n$ data points in $\mathbb{R}^d$ and a binary type associated to each data point, the goal is to cluster the points while ensuring that the proportion of each type in each cluster is roughly the same as its underlying proportion. Subsequent work has focuse…

    Submitted 17 December, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

  48. arXiv:1903.09296  [pdf]

    cs.LG stat.ML

    Patient Clustering Improves Efficiency of Federated Machine Learning to predict mortality and hospital stay time using distributed Electronic Medical Records

    Authors: Li Huang, Dianbo Liu

    Abstract: Electronic medical records (EMRs) support the development of machine learning algorithms for predicting disease incidence, patient response to treatment, and other healthcare events. But so far most algorithms have been centralized, taking little account of the decentralized, non-identically independently distributed (non-IID), and privacy-sensitive characteristics of EMRs that can complicate da…

    Submitted 21 March, 2019; originally announced March 2019.

  49. arXiv:1903.05926  [pdf, other]

    cs.LG cs.AI stat.ML

    Reinforcement Learning with Dynamic Boltzmann Softmax Updates

    Authors: Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang, Tie-Yan Liu

    Abstract: Value function estimation is an important task in reinforcement learning, i.e., prediction. The Boltzmann softmax operator is a natural value estimator and can provide several benefits. However, it does not satisfy the non-expansion property, and its direct use may fail to converge even in value iteration. In this paper, we propose to update the value function with dynamic Boltzmann softmax (DBS)…

    Submitted 8 September, 2019; v1 submitted 14 March, 2019; originally announced March 2019.

  50. arXiv:1902.07823  [pdf, other]

    cs.LG cs.AI cs.CY cs.DS stat.ML

    Stable and Fair Classification

    Authors: Lingxiao Huang, Nisheeth K. Vishnoi

    Abstract: Fair classification has been a topic of intense study in machine learning, and several algorithms have been proposed towards this important task. However, in a recent study, Friedler et al. observed that fair classification algorithms may not be stable with respect to variations in the training dataset -- a crucial consideration in several real-world applications. Motivated by their work, we study…

    Submitted 9 September, 2020; v1 submitted 20 February, 2019; originally announced February 2019.