Skip to main content

Showing 1–50 of 65 results for author: Xu, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.04137  [pdf, other

    cs.LG math.ST stat.ML

    Optimal Batched Linear Bandits

    Authors: Xuanfei Ren, Tianyuan **, Pan Xu

    Abstract: We introduce the E$^4$ algorithm for the batched linear bandit problem, incorporating an Explore-Estimate-Eliminate-Exploit framework. With a proper choice of exploration rate, we prove E$^4$ achieves the finite-time minimax optimal regret with only $O(\log\log T)$ batches, and the asymptotically optimal regret with only $3$ batches as $T\rightarrow\infty$, where $T$ is the time horizon. We furthe… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 26 pages, 6 figures, 4 tables. To appear in the proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  2. arXiv:2404.10728  [pdf, other

    cs.LG stat.ML

    Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning

    Authors: Hao-Lun Hsu, Weixin Wang, Miroslav Pajic, Pan Xu

    Abstract: We present the first study on provably efficient randomized exploration in cooperative multi-agent reinforcement learning (MARL). We propose a unified algorithm framework for randomized exploration in parallel Markov Decision Processes (MDPs), and two Thompson Sampling (TS)-type algorithms, CoopTS-PHE and CoopTS-LMC, incorporating the perturbed-history exploration (PHE) strategy and the Langevin M… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 80 pages, 14 figures, 1 table. Hao-Lun Hsu and Weixin Wang contributed equally to this work

  3. arXiv:2403.09621  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning

    Authors: Zhishuai Liu, Pan Xu

    Abstract: Distributionally robust offline reinforcement learning (RL), which seeks robust policy training against environment perturbation by modeling dynamics uncertainty, calls for function approximations when facing large state-action spaces. However, the consideration of dynamics uncertainty introduces essential nonlinearity and computational burden, posing unique challenges for analyzing and practicall… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 53 pages, 1 figure, 1 table

  4. arXiv:2312.15549  [pdf, other

    cs.LG cs.MA math.ST stat.ML

    Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

    Authors: Tianyuan **, Hao-Lun Hsu, William Chang, Pan Xu

    Abstract: We study the multi-agent multi-armed bandit (MAMAB) problem, where $m$ agents are factored into $ρ$ overlap** groups. Each group represents a hyperedge, forming a hypergraph over the agents. At each round of interaction, the learner pulls a joint arm (composed of individual arms for each agent) and receives a reward according to the hypergraph structure. Specifically, we assume there is a local… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 22 pages, 7 figures, 2 tables. To appear in the proceedings of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI'2024)

  5. arXiv:2310.15976  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Convergence of Sign-based Random Reshuffling Algorithms for Nonconvex Optimization

    Authors: Zhen Qin, Zhishuai Liu, Pan Xu

    Abstract: signSGD is popular in nonconvex optimization due to its communication efficiency. Yet, existing analyses of signSGD rely on assuming that data are sampled with replacement in each iteration, contradicting the practical implementation where data are randomly reshuffled and sequentially fed into the algorithm. We bridge this gap by proving the first convergence result of signSGD with random reshuffl… ▽ More

    Submitted 27 December, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 44 pages, 4 figures

  6. arXiv:2310.14129  [pdf, other

    cs.LG stat.ML

    Optimal Batched Best Arm Identification

    Authors: Tianyuan **, Yu Yang, **g Tang, Xiaokui Xiao, Pan Xu

    Abstract: We study the batched best arm identification (BBAI) problem, where the learner's goal is to identify the best arm while switching the policy as less as possible. In particular, we aim to find the best arm with probability $1-δ$ for some small constant $δ>0$ while minimizing both the sample complexity (total number of arm pulls) and the batch complexity (total number of batches). We propose the thr… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 32 pages, 1 figure, 3 tables

  7. arXiv:2307.01928  [pdf, other

    cs.RO cs.AI stat.AP

    Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners

    Authors: Allen Z. Ren, Anushri Dixit, Alexandra Bodrova, Sumeet Singh, Stephen Tu, Noah Brown, Peng Xu, Leila Takayama, Fei Xia, Jake Varley, Zhenjia Xu, Dorsa Sadigh, Andy Zeng, Anirudha Majumdar

    Abstract: Large language models (LLMs) exhibit a wide range of promising capabilities -- from step-by-step planning to commonsense reasoning -- that may provide utility for robots, but remain prone to confidently hallucinated predictions. In this work, we present KnowNo, which is a framework for measuring and aligning the uncertainty of LLM-based planners such that they know when they don't know and ask for… ▽ More

    Submitted 4 September, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: Conference on Robot Learning (CoRL) 2023, Oral Presentation

  8. arXiv:2211.12631  [pdf, other

    stat.ML cs.LG

    A Generic Approach for Reproducible Model Distillation

    Authors: Yunzhe Zhou, Peiru Xu, Giles Hooker

    Abstract: Model distillation has been a popular method for producing interpretable machine learning. It uses an interpretable "student" model to mimic the predictions made by the black box "teacher" model. However, when the student model is sensitive to the variability of the data sets used for training even when kee** the teacher fixed, the corresponded interpretation is not reliable. Existing strategies… ▽ More

    Submitted 27 April, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: 31 pages, 8 figures

  9. arXiv:2206.11254  [pdf, other

    cs.LG stat.ML

    Langevin Monte Carlo for Contextual Bandits

    Authors: Pan Xu, Hongkai Zheng, Eric Mazumdar, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: We study the efficiency of Thompson sampling for contextual bandits. Existing Thompson sampling-based algorithms need to construct a Laplace approximation (i.e., a Gaussian distribution) of the posterior distribution, which is inefficient to sample in high dimensional applications for general covariance matrices. Moreover, the Gaussian approximation may not be a good surrogate for the posterior di… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 21 pages, 3 figures, 2 tables. To appear in the proceedings of the 39th International Conference on Machine Learning (ICML2022)

  10. arXiv:2206.03520  [pdf, ps, other

    stat.ML cs.LG

    Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits

    Authors: Tianyuan **, Pan Xu, Xiaokui Xiao, Anima Anandkumar

    Abstract: We study the regret of Thompson sampling (TS) algorithms for exponential family bandits, where the reward distribution is from a one-dimensional exponential family, which covers many common reward distributions including Bernoulli, Gaussian, Gamma, Exponential, etc. We propose a Thompson sampling algorithm, termed ExpTS, which uses a novel sampling distribution to avoid the under-estimation of the… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: 49 pages

  11. arXiv:2202.09459  [pdf, other

    cs.LG cs.HC stat.ML

    Interactive Visual Pattern Search on Graph Data via Graph Representation Learning

    Authors: Huan Song, Zeng Dai, Panpan Xu, Liu Ren

    Abstract: Graphs are a ubiquitous data structure to model processes and relations in a wide range of domains. Examples include control-flow graphs in programs and semantic scene graphs in images. Identifying subgraph patterns in graphs is an important approach to understanding their structural properties. We propose a visual analytics system GraphQ to support human-in-the-loop, example-based, subgraph patte… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

    Comments: IEEE Transactions on Visualization and Computer Graphics. Published version: https://ieeexplore.ieee.org/abstract/document/9552902

  12. arXiv:2106.10983  [pdf, other

    stat.ME

    A generalized EMS algorithm for model selection with incomplete data

    Authors: **-Feng Xu, Lai-Xu Shang, Man-Lai Tang, Na Shan, Guoliang Tian

    Abstract: Recently, a so-called E-MS algorithm was developed for model selection in the presence of missing data. Specifically, it performs the Expectation step (E step) and Model Selection step (MS step) alternately to find the minimum point of the observed generalized information criteria (GIC). In practice, it could be numerically infeasible to perform the MS-step for high dimensional settings. In this p… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  13. arXiv:2106.07454  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning

    Authors: Minghan Yang, Dong Xu, Qiwen Cui, Zaiwen Wen, Pengxiang Xu

    Abstract: In this paper, a novel second-order method called NG+ is proposed. By following the rule ``the shape of the gradient equals the shape of the parameter", we define a generalized fisher information matrix (GFIM) using the products of gradients in the matrix form rather than the traditional vectorization. Then, our generalized natural gradient direction is simply the inverse of the GFIM multiplies th… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

  14. arXiv:2103.00373  [pdf, other

    stat.ML cs.DC cs.LG

    Communication-efficient Byzantine-robust distributed learning with statistical guarantee

    Authors: Xingcai Zhou, Le Chang, Pengfei Xu, Shaogao Lv

    Abstract: Communication efficiency and robustness are two major issues in modern distributed learning framework. This is due to the practical situations where some computing nodes may have limited communication power or may behave adversarial behaviors. To address the two issues simultaneously, this paper develops two communication-efficient and robust distributed learning algorithms for convex problems. Ou… ▽ More

    Submitted 27 February, 2021; originally announced March 2021.

    Comments: 34 pages

  15. arXiv:2101.01163  [pdf, other

    cs.LG stat.ML

    SmartDeal: Re-Modeling Deep Network Weights for Efficient Inference and Training

    Authors: Xiaohan Chen, Yang Zhao, Yue Wang, Pengfei Xu, Haoran You, Chaojian Li, Yonggan Fu, Yingyan Lin, Zhangyang Wang

    Abstract: The record-breaking performance of deep neural networks (DNNs) comes with heavy parameterization, leading to external dynamic random-access memory (DRAM) for storage. The prohibitive energy of DRAM accesses makes it non-trivial to deploy DNN on resource-constrained devices, calling for minimizing the weight and data movements to improve the energy efficiency. We present SmartDeal (SD), an algorith… ▽ More

    Submitted 21 December, 2021; v1 submitted 4 January, 2021; originally announced January 2021.

    Comments: Accepted to IEEE Transactions on Neural Networks and Learning Systems arXiv admin note: substantial text overlap with arXiv:2005.03403

  16. arXiv:2012.01780  [pdf, other

    cs.LG stat.ML

    Neural Contextual Bandits with Deep Representation and Shallow Exploration

    Authors: Pan Xu, Zheng Wen, Handong Zhao, Quanquan Gu

    Abstract: We study a general class of contextual bandits, where each context-action pair is associated with a raw feature vector, but the reward generating function is unknown. We propose a novel learning algorithm that transforms the raw feature vector using the last hidden layer of a deep ReLU neural network (deep representation learning), and uses an upper confidence bound (UCB) approach to explore in th… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: 28 pages, 1 figure, 1 table

  17. arXiv:2010.09597  [pdf, other

    cs.LG math.ST stat.ML

    Faster Convergence of Stochastic Gradient Langevin Dynamics for Non-Log-Concave Sampling

    Authors: Difan Zou, Pan Xu, Quanquan Gu

    Abstract: We provide a new convergence analysis of stochastic gradient Langevin dynamics (SGLD) for sampling from a class of distributions that can be non-log-concave. At the core of our approach is a novel conductance analysis of SGLD using an auxiliary time-reversible Markov Chain. Under certain conditions on the target distribution, we prove that $\tilde O(d^4ε^{-2})$ stochastic gradient evaluations suff… ▽ More

    Submitted 23 February, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 44 pages, 1 figure

  18. arXiv:2009.06114  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Towards the Quantification of Safety Risks in Deep Neural Networks

    Authors: Peipei Xu, Wenjie Ruan, Xiaowei Huang

    Abstract: Safety concerns on the deep neural networks (DNNs) have been raised when they are applied to critical sectors. In this paper, we define safety risks by requesting the alignment of the network's decision with human perception. To enable a general methodology for quantifying safety risks, we define a generic safety property and instantiate it to express various safety risks. For the quantification o… ▽ More

    Submitted 13 September, 2020; originally announced September 2020.

    Comments: 19 pages, 10 figures

  19. arXiv:2006.05924  [pdf, ps, other

    math.OC stat.ML

    Sketchy Empirical Natural Gradient Methods for Deep Learning

    Authors: Minghan Yang, Dong Xu, Zaiwen Wen, Mengyun Chen, Pengxiang Xu

    Abstract: In this paper, we develop an efficient sketchy empirical natural gradient method (SENG) for large-scale deep learning problems. The empirical Fisher information matrix is usually low-rank since the sampling is only practical on a small amount of data at each iteration. Although the corresponding natural gradient direction lies in a small subspace, both the computational cost and memory requirement… ▽ More

    Submitted 25 March, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

  20. arXiv:2005.07606   

    cs.LG stat.ML

    Initializing Perturbations in Multiple Directions for Fast Adversarial Training

    Authors: Xunguang Wang, Ship Peng Xu, Eric Ke Wang

    Abstract: Recent developments in the filed of Deep Learning have demonstrated that Deep Neural Networks(DNNs) are vulnerable to adversarial examples. Specifically, in image classification, an adversarial example can fool the well trained deep neural networks by adding barely imperceptible perturbations to clean images. Adversarial Training, one of the most direct and effective methods, minimizes the losses… ▽ More

    Submitted 25 January, 2021; v1 submitted 15 May, 2020; originally announced May 2020.

    Comments: has no contribution

  21. arXiv:2005.01350  [pdf, ps, other

    cs.LG math.OC stat.ML

    A Finite Time Analysis of Two Time-Scale Actor Critic Methods

    Authors: Yue Wu, Weitong Zhang, Pan Xu, Quanquan Gu

    Abstract: Actor-critic (AC) methods have exhibited great empirical success compared with other reinforcement learning algorithms, where the actor uses the policy gradient to improve the learning policy and the critic uses temporal difference learning to estimate the policy gradient. Under the two time-scale learning rate schedule, the asymptotic convergence of AC has been well studied in the literature. How… ▽ More

    Submitted 10 October, 2022; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 39 pages. In NeurIPS 2020

  22. arXiv:2003.06959  [pdf, other

    cs.LG cs.GR stat.ML

    PFPN: Continuous Control of Physically Simulated Characters using Particle Filtering Policy Network

    Authors: Pei Xu, Ioannis Karamouzas

    Abstract: Data-driven methods for physics-based character control using reinforcement learning have been successfully applied to generate high-quality motions. However, existing approaches typically rely on Gaussian distributions to represent the action policy, which can prematurely commit to suboptimal actions when solving high-dimensional continuous control problems for highly-articulated characters. In t… ▽ More

    Submitted 1 October, 2021; v1 submitted 15 March, 2020; originally announced March 2020.

    Comments: Motion, Interaction and Games (MIG '21)

  23. arXiv:2003.01803  [pdf, other

    cs.LG math.ST stat.ML

    MOTS: Minimax Optimal Thompson Sampling

    Authors: Tianyuan **, Pan Xu, Jieming Shi, Xiaokui Xiao, Quanquan Gu

    Abstract: Thompson sampling is one of the most widely used algorithms for many online decision problems, due to its simplicity in implementation and superior empirical performance over other state-of-the-art methods. Despite its popularity and empirical success, it has remained an open problem whether Thompson sampling can match the minimax lower bound $Ω(\sqrt{KT})$ for $K$-armed bandit problems, where… ▽ More

    Submitted 1 October, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 27 pages, 1 table, 2 figures. This version improves the presentation in V2

  24. arXiv:2002.12169  [pdf, other

    cs.LG cs.CV stat.ML

    Multi-source Domain Adaptation in the Deep Learning Era: A Systematic Survey

    Authors: Sicheng Zhao, Bo Li, Colorado Reed, Pengfei Xu, Kurt Keutzer

    Abstract: In many practical applications, it is often difficult and expensive to obtain enough large-scale labeled data to train deep neural networks to their full capability. Therefore, transferring the learned knowledge from a separate, labeled source domain to an unlabeled or sparsely labeled target domain becomes an appealing alternative. However, direct transfer often results in significant performance… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

  25. arXiv:2002.09174  [pdf, other

    cs.LG stat.ML

    Double Explore-then-Commit: Asymptotic Optimality and Beyond

    Authors: Tianyuan **, Pan Xu, Xiaokui Xiao, Quanquan Gu

    Abstract: We study the multi-armed bandit problem with subgaussian rewards. The explore-then-commit (ETC) strategy, which consists of an exploration phase followed by an exploitation phase, is one of the most widely used algorithms in a variety of online decision applications. Nevertheless, it has been shown in Garivier et al. (2016) that ETC is suboptimal in the asymptotic sense as the horizon grows, and t… ▽ More

    Submitted 19 November, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: 46 pages. This version improves the presentation, and adds new algorithms and theoretical results: an anytime algorithm with asymptotic optimality guarantee, and an extension to K-armed bandits

  26. arXiv:2001.10133  [pdf, ps, other

    cs.LG stat.ML

    COKE: Communication-Censored Decentralized Kernel Learning

    Authors: ** Xu, Yue Wang, Xiang Chen, Zhi Tian

    Abstract: This paper studies the decentralized optimization and learning problem where multiple interconnected agents aim to learn an optimal decision function defined over a reproducing kernel Hilbert space by jointly minimizing a global objective function, with access to their own locally observed dataset. As a non-parametric approach, kernel learning faces a major challenge in distributed implementation:… ▽ More

    Submitted 29 June, 2021; v1 submitted 27 January, 2020; originally announced January 2020.

  27. arXiv:2001.00705  [pdf, other

    cs.LG stat.ML

    Fractional Skip**: Towards Finer-Grained Dynamic CNN Inference

    Authors: Jianghao Shen, Yonggan Fu, Yue Wang, Pengfei Xu, Zhangyang Wang, Yingyan Lin

    Abstract: While increasingly deep networks are still in general desired for achieving state-of-the-art performance, for many specific inputs a simpler network might already suffice. Existing works exploited this observation by learning to skip convolutional layers in an input-dependent manner. However, we argue their binary decision scheme, i.e., either fully executing or completely bypassing one layer for… ▽ More

    Submitted 2 January, 2020; originally announced January 2020.

  28. arXiv:1912.04511  [pdf, other

    cs.LG math.OC stat.ML

    A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation

    Authors: Pan Xu, Quanquan Gu

    Abstract: Q-learning with neural network function approximation (neural Q-learning for short) is among the most prevalent deep reinforcement learning algorithms. Despite its empirical success, the non-asymptotic convergence rate of neural Q-learning remains virtually unknown. In this paper, we present a finite-time analysis of a neural Q-learning algorithm, where the data are generated from a Markov decisio… ▽ More

    Submitted 3 March, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: 22 pages, 1 table. This version simplifies the proof and improves the presentation

  29. arXiv:1912.01211  [pdf, other

    cs.LG stat.ML

    Rank Aggregation via Heterogeneous Thurstone Preference Models

    Authors: Tao **, Pan Xu, Quanquan Gu, Farzad Farnoud

    Abstract: We propose the Heterogeneous Thurstone Model (HTM) for aggregating ranked data, which can take the accuracy levels of different users into account. By allowing different noise distributions, the proposed HTM model maintains the generality of Thurstone's original framework, and as such, also extends the Bradley-Terry-Luce (BTL) model for pairwise comparisons to heterogeneous populations of users. U… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: 36 pages, 2 figures, 8 tables. In AAAI 2020

  30. arXiv:1911.11554  [pdf, other

    cs.LG cs.CV stat.ML

    Multi-source Distilling Domain Adaptation

    Authors: Sicheng Zhao, Guangzhi Wang, Shanghang Zhang, Yang Gu, Yaxian Li, Zhichao Song, Pengfei Xu, Runbo Hu, Hua Chai, Kurt Keutzer

    Abstract: Deep neural networks suffer from performance decay when there is domain shift between the labeled source domain and unlabeled target domain, which motivates the research on domain adaptation (DA). Conventional DA methods usually assume that the labeled data is sampled from a single source distribution. However, in practice, labeled data may be collected from multiple sources, while naive applicati… ▽ More

    Submitted 7 February, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI 2020

  31. Akaike's Bayesian information criterion (ABIC) or not ABIC for geophysical inversion

    Authors: Peiliang Xu

    Abstract: Akaike's Bayesian information criterion (ABIC) has been widely used in geophysical inversion and beyond. However, little has been done to investigate its statistical aspects. We present an alternative derivation of the marginal distribution of measurements, whose maximization directly leads to the invention of ABIC by Akaike. We show that ABIC is to statistically estimate the variance of measureme… ▽ More

    Submitted 15 November, 2019; originally announced November 2019.

    Comments: 7

    Journal ref: Journal of the Franklin Institute, 2021, 358, 4077-4102

  32. Fault Detection and Identification using Bayesian Recurrent Neural Networks

    Authors: Weike Sun, Antonio R. C. Paiva, Peng Xu, Anantha Sundaram, Richard D. Braatz

    Abstract: In processing and manufacturing industries, there has been a large push to produce higher quality products and ensure maximum efficiency of processes. This requires approaches to effectively detect and resolve disturbances to ensure optimal operations. While the control system can compensate for many types of disturbances, there are changes to the process which it still cannot handle adequately. I… ▽ More

    Submitted 26 June, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: 43 pages, 23 figures. Accepted for publication in Computers & Chemical Engineering

    MSC Class: 68T05

  33. arXiv:1910.13349  [pdf, other

    cs.LG stat.ML

    E2-Train: Training State-of-the-art CNNs with Over 80% Energy Savings

    Authors: Yue Wang, Ziyu Jiang, Xiaohan Chen, Pengfei Xu, Yang Zhao, Yingyan Lin, Zhangyang Wang

    Abstract: Convolutional neural networks (CNNs) have been increasingly deployed to edge devices. Hence, many efforts have been made towards efficient CNN inference in resource-constrained platforms. This paper attempts to explore an orthogonal direction: how to conduct more energy-efficient training of CNNs, so as to enable on-device training. We strive to reduce the energy cost during training, by drop**… ▽ More

    Submitted 5 December, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

  34. arXiv:1909.12974  [pdf, ps, other

    econ.EM stat.ME

    Monotonicity-Constrained Nonparametric Estimation and Inference for First-Price Auctions

    Authors: Jun Ma, Vadim Marmer, Artyom Shneyerov, Pai Xu

    Abstract: We propose a new nonparametric estimator for first-price auctions with independent private values that imposes the monotonicity constraint on the estimated inverse bidding strategy. We show that our estimator has a smaller asymptotic variance than that of Guerre, Perrigne and Vuong's (2000) estimator. In addition to establishing pointwise asymptotic normality of our estimator, we provide a bootstr… ▽ More

    Submitted 27 September, 2019; originally announced September 2019.

  35. arXiv:1909.11957  [pdf, other

    cs.LG stat.ML

    Drawing Early-Bird Tickets: Towards More Efficient Training of Deep Networks

    Authors: Haoran You, Chaojian Li, Pengfei Xu, Yonggan Fu, Yue Wang, Xiaohan Chen, Richard G. Baraniuk, Zhangyang Wang, Yingyan Lin

    Abstract: (Frankle & Carbin, 2019) shows that there exist winning tickets (small but critical subnetworks) for dense, randomly initialized networks, that can be trained alone to achieve comparable accuracies to the latter in a similar number of iterations. However, the identification of these winning tickets still requires the costly train-prune-retrain process, limiting their practical benefits. In this pa… ▽ More

    Submitted 16 February, 2022; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Accepted as ICLR2020 Spotlight

  36. arXiv:1909.08610  [pdf, other

    cs.LG math.OC stat.ML

    Sample Efficient Policy Gradient Methods with Recursive Variance Reduction

    Authors: Pan Xu, Felicia Gao, Quanquan Gu

    Abstract: Improving the sample efficiency in reinforcement learning has been a long-standing research problem. In this work, we aim to reduce the sample complexity of existing policy gradient methods. We propose a novel policy gradient algorithm called SRVR-PG, which only requires $O(1/ε^{3/2})$ episodes to find an $ε$-approximate stationary point of the nonconcave performance function $J(\boldsymbolθ)$ (i.… ▽ More

    Submitted 1 August, 2021; v1 submitted 18 September, 2019; originally announced September 2019.

    Comments: 23 pages, 2 figures, 3 tables. In ICLR 2020

  37. arXiv:1908.04771  [pdf

    cs.LG cs.AI stat.ML

    Multi-View Fuzzy Clustering with The Alternative Learning between Shared Hidden Space and Partition

    Authors: Zhaohong Deng, Chen Cui, Peng Xu, Ling Liang, Haoran Chen, Te Zhang, Shitong Wang

    Abstract: As the multi-view data grows in the real world, multi-view clus-tering has become a prominent technique in data mining, pattern recognition, and machine learning. How to exploit the relation-ship between different views effectively using the characteristic of multi-view data has become a crucial challenge. Aiming at this, a hidden space sharing multi-view fuzzy clustering (HSS-MVFC) method is prop… ▽ More

    Submitted 12 August, 2019; originally announced August 2019.

    Comments: This paper has been submitted to IEEE Transactions on Cybnetics in Apr. 8th 2019

  38. arXiv:1908.04766  [pdf

    cs.LG cs.AI stat.ML

    Multi-view Clustering with the Cooperation of Visible and Hidden Views

    Authors: Zhaohong Deng, Ruixiu Liu, Te Zhang, Peng Xu, Kup-Sze Choi, Bin Qin, Shitong Wang

    Abstract: Multi-view data are becoming common in real-world modeling tasks and many multi-view data clustering algorithms have thus been proposed. The existing algorithms usually focus on the cooperation of different views in the original space but neglect the influence of the hidden information among these different visible views, or they only consider the hidden information between the views. The algorith… ▽ More

    Submitted 12 August, 2019; originally announced August 2019.

    Comments: This paper has been submitted to IEEE TKDE in Jun. 2019

  39. arXiv:1907.09728  [pdf, other

    cs.LG cs.HC stat.ML

    Interpretable and Steerable Sequence Learning via Prototypes

    Authors: Yao Ming, Panpan Xu, Huamin Qu, Liu Ren

    Abstract: One of the major challenges in machine learning nowadays is to provide predictions with not only high accuracy but also user-friendly explanations. Although in recent years we have witnessed increasingly popular use of deep neural networks for sequence modeling, it is still challenging to explain the rationales behind the model outputs, which is essential for building trust and supporting the doma… ▽ More

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: Accepted as a full paper at KDD 2019 on May 8, 2019

    Journal ref: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD 2019

  40. arXiv:1905.12615  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    An Improved Convergence Analysis of Stochastic Variance-Reduced Policy Gradient

    Authors: Pan Xu, Felicia Gao, Quanquan Gu

    Abstract: We revisit the stochastic variance-reduced policy gradient (SVRPG) method proposed by Papini et al. (2018) for reinforcement learning. We provide an improved convergence analysis of SVRPG and show that it can find an $ε$-approximate stationary point of the performance function within $O(1/ε^{5/3})$ trajectories. This sample complexity improves upon the best known result $O(1/ε^2)$ by a factor of… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: 10 pages, 2 figures, 1 table. To appear in the proceedings of the 35th International Conference on Uncertainty in Artificial Intelligence

  41. arXiv:1905.11978  [pdf, other

    cs.LG cs.CL stat.ML

    Better Long-Range Dependency By Bootstrap** A Mutual Information Regularizer

    Authors: Yanshuai Cao, Peng Xu

    Abstract: In this work, we develop a novel regularizer to improve the learning of long-range dependency of sequence data. Applied on language modelling, our regularizer expresses the inductive bias that sequence variables should have high mutual information even though the model might not see abundant observations for complex long-range dependency. We show how the `next sentence prediction (classification)'… ▽ More

    Submitted 22 February, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Camera-ready for AISTATS 2020

  42. arXiv:1905.10594  [pdf

    cs.LG stat.ML

    Multi-view Information-theoretic Co-clustering for Co-occurrence Data

    Authors: Peng Xu, Zhaohong Deng, Kup-Sze Choi, Longbing Cao, Shitong Wang

    Abstract: Multi-view clustering has received much attention recently. Most of the existing multi-view clustering methods only focus on one-sided clustering. As the co-occurring data elements involve the counts of sample-feature co-occurrences, it is more efficient to conduct two-sided clustering along the samples and features simultaneously. To take advantage of two-sided clustering for the co-occurrences i… ▽ More

    Submitted 25 May, 2019; originally announced May 2019.

    Journal ref: AAAI 2019

  43. arXiv:1905.08924  [pdf

    cs.LG cs.CV stat.ML

    Joint Information Preservation for Heterogeneous Domain Adaptation

    Authors: Peng Xu, Zhaohong Deng, Kup-Sze Choi, Jun Wang, Shitong Wang

    Abstract: Domain adaptation aims to assist the modeling tasks of the target domain with knowledge of the source domain. The two domains often lie in different feature spaces due to diverse data collection methods, which leads to the more challenging task of heterogeneous domain adaptation (HDA). A core issue of HDA is how to preserve the information of the original data during adaptation. In this paper, we… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

  44. Concise Fuzzy System Modeling Integrating Soft Subspace Clustering and Sparse Learning

    Authors: Peng Xu, Zhaohong Deng, Chen Cui, Te Zhang, Kup-Sze Choi, Gu Suhang, Jun Wang, ShiTong Wang

    Abstract: The superior interpretability and uncertainty modeling ability of Takagi-Sugeno-Kang fuzzy system (TSK FS) make it possible to describe complex nonlinear systems intuitively and efficiently. However, classical TSK FS usually adopts the whole feature space of the data for model construction, which can result in lengthy rules for high-dimensional data and lead to degeneration in interpretability. Fu… ▽ More

    Submitted 24 April, 2019; originally announced April 2019.

  45. arXiv:1812.06371  [pdf, other

    cs.LG cs.CR stat.ML

    Trust Region Based Adversarial Attack on Neural Networks

    Authors: Zhewei Yao, Amir Gholami, Peng Xu, Kurt Keutzer, Michael Mahoney

    Abstract: Deep Neural Networks are quite vulnerable to adversarial perturbations. Current state-of-the-art adversarial attack methods typically require very time consuming hyper-parameter tuning, or require many iterations to solve an optimization based adversarial attack. To address this problem, we present a new family of trust region based adversarial attacks, with the goal of computing adversarial pertu… ▽ More

    Submitted 15 December, 2018; originally announced December 2018.

    Journal ref: CVPR 2019

  46. arXiv:1811.11989  [pdf, other

    math.OC cs.LG stat.ML

    Sample Efficient Stochastic Variance-Reduced Cubic Regularization Method

    Authors: Dongruo Zhou, Pan Xu, Quanquan Gu

    Abstract: We propose a sample efficient stochastic variance-reduced cubic regularization (Lite-SVRC) algorithm for finding the local minimum efficiently in nonconvex optimization. The proposed algorithm achieves a lower sample complexity of Hessian matrix computation than existing cubic regularization based methods. At the heart of our analysis is the choice of a constant batch size of Hessian matrix comput… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: 24 pages, 2 figures, 1 table. The first version of this paper was submitted to UAI 2018 on March 9, 2018. This is the second version with improved presentation and additional baselines in the experiments, and was submitted to NeurIPS 2018 on May 18, 2018

  47. arXiv:1811.11733  [pdf, other

    stat.CO stat.ME

    orthoDr: Semiparametric Dimension Reduction via Orthogonality Constrained Optimization

    Authors: Ruoqing Zhu, Jiyang Zhang, Ruilin Zhao, Peng Xu, Wenzhuo Zhou, Xin Zhang

    Abstract: orthoDr is a package in R that solves dimension reduction problems using orthogonality constrained optimization approach. The package serves as a unified framework for many regression and survival analysis dimension reduction models that utilize semiparametric estimating equations. The main computational machinery of orthoDr is a first-order algorithm developed by \cite{wen2013feasible} for optimi… ▽ More

    Submitted 4 July, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

  48. arXiv:1811.02629  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko , et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem… ▽ More

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  49. arXiv:1807.01349  [pdf, other

    cs.LG stat.ML

    Anomaly Detection for Skin Disease Images Using Variational Autoencoder

    Authors: Yuchen Lu, Peng Xu

    Abstract: In this paper, we demonstrate the potential of applying Variational Autoencoder (VAE) [10] for anomaly detection in skin disease images. VAE is a class of deep generative models which is trained by maximizing the evidence lower bound of data distribution [10]. When trained on only normal data, the resulting model is able to perform efficient inference and to determine if a test image is normal or… ▽ More

    Submitted 24 July, 2018; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: 8 pages, 2 figures, submitted to ISIC Skin Image Analysis Workshop and Challenge at MICCAI 2018

  50. arXiv:1806.08782  [pdf, other

    cs.LG stat.ML

    Finding Local Minima via Stochastic Nested Variance Reduction

    Authors: Dongruo Zhou, Pan Xu, Quanquan Gu

    Abstract: We propose two algorithms that can find local minima faster than the state-of-the-art algorithms in both finite-sum and general stochastic nonconvex optimization. At the core of the proposed algorithms is $\text{One-epoch-SNVRG}^+$ using stochastic nested variance reduction (Zhou et al., 2018a), which outperforms the state-of-the-art variance reduction algorithms such as SCSG (Lei et al., 2017). I… ▽ More

    Submitted 22 June, 2018; originally announced June 2018.

    Comments: 37 pages, 4 figures, 1 table