Showing 1–50 of 58 results for author: Chen, N

Searching in archive stat.
  1. arXiv:2407.00271  [pdf, other]

    math.DS physics.data-an stat.ML

    Minimum Reduced-Order Models via Causal Inference

    Authors: Nan Chen, Honghu Liu

    Abstract: Enhancing the sparsity of data-driven reduced-order models (ROMs) has gained increasing attention in recent years. In this work, we analyze an efficient approach to identifying skillful ROMs with a sparse structure using an information-theoretic indicator called causation entropy. The causation entropy quantifies in a statistical way the additional contribution of each term to the underlying dynam…

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2402.12710  [pdf, other]

    stat.ME cs.LG stat.ML

    Integrating Active Learning in Causal Inference with Interference: A Novel Approach in Online Experiments

    Authors: Hongtao Zhu, Sizhe Zhang, Yang Su, Zhenyu Zhao, Nan Chen

    Abstract: In the domain of causal inference research, the prevalent potential outcomes framework, notably the Rubin Causal Model (RCM), often overlooks individual interference and assumes independent treatment effects. This assumption, however, is frequently misaligned with the intricate realities of real-world scenarios, where interference is not merely a possibility but a common occurrence. Our research e…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: conference paper

  3. arXiv:2401.03281  [pdf, other]

    physics.ao-ph stat.AP

    Statistical Response of ENSO Complexity to Initial Condition and Model Parameter Perturbations

    Authors: Marios Andreou, Nan Chen

    Abstract: Studying the response of a climate system to perturbations has practical significance. Standard methods in computing the trajectory-wise deviation caused by perturbations may suffer from the chaotic nature that makes the model error dominate the true response after a short lead time. Statistical response, which computes the return described by the statistics, provides a systematic way of reaching…

    Submitted 8 May, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: This is the first revision version. 53 pages, 11 figures, 2 tables (1 in main text and 1 in the appendix), typeset in LaTeX. Submitted for peer-review to AMS' Journal of Climate (JCLI). For more info see https://sites.google.com/wisc.edu/mariosandreou/cv-publications/statistical-response-enso-complexity

    MSC Class: 86A05; 86A08; 86A10 (Primary) 94-08; 94-10 (Secondary)

  4. arXiv:2311.05819  [pdf, other]

    stat.ME

    A flexible framework for synthesizing human activity patterns with application to sequential categorical data

    Authors: Zuofu Huang, Julian Wolfson, Jayne A. Fulkerson, Ryan Demmer, Helen N. Chen

    Abstract: The ability to synthesize realistic data in a parametrizable way is valuable for a number of reasons, including privacy, missing data imputation, and evaluating the performance of statistical and computational methods. When the underlying data generating process is complex, data synthesis requires approaches that balance realism and simplicity. In this paper, we address the problem of synthesizing…

    Submitted 9 November, 2023; originally announced November 2023.

  5. arXiv:2310.17063  [pdf, other]

    stat.CO stat.ML

    Coreset Markov Chain Monte Carlo

    Authors: Naitong Chen, Trevor Campbell

    Abstract: A Bayesian coreset is a small, weighted subset of data that replaces the full dataset during inference in order to reduce computational cost. However, state of the art methods for tuning coreset weights are expensive, require nontrivial user input, and impose constraints on the model. In this work, we propose a new method -- Coreset MCMC -- that simulates a Markov chain targeting the coreset poste…

    Submitted 8 March, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  6. arXiv:2306.16578  [pdf, other]

    cs.LG math.ST stat.ML

    Allocating Divisible Resources on Arms with Unknown and Random Rewards

    Authors: Ningyuan Chen, Wenhao Li

    Abstract: We consider a decision maker allocating one unit of renewable and divisible resource in each period on a number of arms. The arms have unknown and random rewards whose means are proportional to the allocated resource and whose variances are proportional to an order $b$ of the allocated resource. In particular, if the decision maker allocates resource $A_i$ to arm $i$ in a period, then the reward…

    Submitted 2 November, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  7. arXiv:2211.11028  [pdf, other]

    stat.ML cs.HC cs.LG

    Algorithmic Decision-Making Safeguarded by Human Knowledge

    Authors: Ningyuan Chen, Ming Hu, Wenhao Li

    Abstract: Commercial AI solutions provide analysts and managers with data-driven business intelligence for a wide range of decisions, such as demand forecasting and pricing. However, human analysts may have their own insights and experiences about the decision-making that is at odds with the algorithmic recommendation. In view of such a conflict, we provide a general analytical framework to study the augmen…

    Submitted 20 November, 2022; originally announced November 2022.

  8. arXiv:2209.04942  [pdf, other]

    stat.ML cs.LG stat.ME

    Learning Consumer Preferences from Bundle Sales Data

    Authors: Ningyuan Chen, Setareh Farajollahzadeh, Guan Wang

    Abstract: Product bundling is a common selling mechanism used in online retailing. To set profitable bundle prices, the seller needs to learn consumer preferences from the transaction data. When customers purchase bundles or multiple products, classical methods such as discrete choice models cannot be used to estimate customers' valuations. In this paper, we propose an approach to learn the distribution of…

    Submitted 11 September, 2022; originally announced September 2022.

  9. arXiv:2206.02111  [pdf, other]

    eess.SY stat.AP

    LASSO-Based Multiple-Line Outage Identification In Partially Observable Power Systems

    Authors: Xiaozhou Yang, Nan Chen

    Abstract: Phasor measurement units (PMUs) create ample real-time monitoring opportunities for modern power systems. Among them, line outage detection and identification remains a crucial but challenging task. Current works on outage identification succeed in full PMU deployment and single-line outages. Performance however degrades for multiple-line outage with partial system observability. We propose a nove…

    Submitted 5 June, 2022; originally announced June 2022.

    Comments: 9 pages, 6 figures

  10. arXiv:2205.07475  [pdf, other]

    stat.ML cs.LG stat.CO

    MixFlows: principled variational inference via mixed flows

    Authors: Zuheng Xu, Naitong Chen, Trevor Campbell

    Abstract: This work presents mixed variational flows (MixFlows), a new variational family that consists of a mixture of repeated applications of a map to an initial reference distribution. First, we provide efficient algorithms for i.i.d. sampling, density evaluation, and unbiased ELBO estimation. We then show that MixFlows have MCMC-like convergence guarantees when the flow map is ergodic and measure-prese…

    Submitted 1 June, 2023; v1 submitted 16 May, 2022; originally announced May 2022.

  11. arXiv:2205.03623  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Determination of class-specific variables in nonparametric multiple-class classification

    Authors: Wan-Ping Nicole Chen, Yuan-chin Ivan Chang

    Abstract: As technology has advanced, collecting data via automatic collection devices has become popular; thus, we commonly face data sets with lengthy variables, especially when these data sets are collected without specific research goals beforehand. It has been pointed out in the literature that the difficulty of high-dimensional classification problems is intrinsically caused by too many noise variables useless…

    Submitted 7 May, 2022; originally announced May 2022.

  12. arXiv:2203.16749  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping

    Authors: Yuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel Bacchiani

    Abstract: Neural vocoder using denoising diffusion probabilistic model (DDPM) has been improved by adaptation of the diffusion noise distribution to given acoustic features. In this study, we propose SpecGrad that adapts the diffusion noise so that its time-varying spectral envelope becomes close to the conditioning log-mel spectrogram. This adaptation by time-varying filtering improves the sound quality es…

    Submitted 4 August, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted to Interspeech 2022

  13. arXiv:2203.05723  [pdf, other]

    stat.ML cs.LG stat.CO

    Bayesian inference via sparse Hamiltonian flows

    Authors: Naitong Chen, Zuheng Xu, Trevor Campbell

    Abstract: A Bayesian coreset is a small, weighted subset of data that replaces the full dataset during Bayesian inference, with the goal of reducing computational cost. Although past work has shown empirically that there often exists a coreset with low inferential error, efficiently constructing such a coreset remains a challenge. Current methods tend to be slow, require a secondary inference step after cor…

    Submitted 12 January, 2023; v1 submitted 10 March, 2022; originally announced March 2022.

  14. arXiv:2203.02657  [pdf, ps, other]

    physics.ao-ph stat.AP

    Quantifying the Predictability of ENSO Complexity Using a Statistically Accurate Multiscale Stochastic Model and Information Theory

    Authors: Xianghui Fang, Nan Chen

    Abstract: An information-theoretic framework is developed to assess the predictability of ENSO complexity, which is a central problem in contemporary meteorology with large societal impacts. The information theory advances a unique way to quantify the forecast uncertainty and makes it possible to distinguish the predictability limit of different ENSO events. One key step in applying the framework to compute the inform…

    Submitted 31 July, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: 19 figures

  15. arXiv:2201.01628  [pdf, other]

    cs.LG stat.ML

    Bridging Adversarial and Nonstationary Multi-armed Bandit

    Authors: Ningyuan Chen, Shuoguang Yang, Hailun Zhang

    Abstract: In the multi-armed bandit framework, there are two formulations that are commonly employed to handle time-varying reward distributions: adversarial bandit and nonstationary bandit. Although their oracles, algorithms, and regret analysis differ significantly, we provide a unified formulation in this paper that smoothly bridges the two as special cases. The formulation uses an oracle that takes the…

    Submitted 25 November, 2023; v1 submitted 5 January, 2022; originally announced January 2022.

  16. arXiv:2107.06754  [pdf, other]

    eess.SY stat.AP

    Dynamic Power Systems Line Outage Detection Using Particle Filter and Partially Observed States

    Authors: Xiaozhou Yang, Nan Chen, Chao Zhai

    Abstract: Real-time transmission line outage detection is difficult because of partial phasor measurement unit (PMU) deployment and varying outage signal strength. Existing detection approaches focus on monitoring PMU-measured nodal algebraic states, i.e., voltage phase angle and magnitude. The success of such approaches, however, is largely predicated on strong outage signals and the presence of PMUs in th…

    Submitted 27 October, 2021; v1 submitted 14 July, 2021; originally announced July 2021.

    Comments: Under review for IEEE Transactions on Power Systems; 9 pages, 7 figures

  17. arXiv:2106.03790  [pdf, ps, other]

    cs.LG stat.ML

    Multi-armed Bandit Requiring Monotone Arm Sequences

    Authors: Ningyuan Chen

    Abstract: In many online learning or multi-armed bandit problems, the taken actions or pulled arms are ordinal and required to be monotone over time. Examples include dynamic pricing, in which the firms use markup pricing policies to please early adopters and deter strategic waiting, and clinical trials, in which the dose allocation usually follows the dose escalation principle to prevent dose limiting toxi…

    Submitted 6 October, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

  18. arXiv:2009.08265  [pdf, other]

    cs.LG stat.ML

    Dimension Reduction in Contextual Online Learning via Nonparametric Variable Selection

    Authors: Wenhao Li, Ningyuan Chen, L. Jeff Hong

    Abstract: We consider a contextual online learning (multi-armed bandit) problem with high-dimensional covariate $\mathbf{x}$ and decision $\mathbf{y}$. The reward function to learn, $f(\mathbf{x},\mathbf{y})$, does not have a particular parametric form. The literature has shown that the optimal regret is $\tilde{O}(T^{(d_x+d_y+1)/(d_x+d_y+2)})$, where $d_x$ and $d_y$ are the dimensions of $\mathbf x$ and…

    Submitted 2 October, 2022; v1 submitted 17 September, 2020; originally announced September 2020.

  19. arXiv:2009.00713  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    WaveGrad: Estimating Gradients for Waveform Generation

    Authors: Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, William Chan

    Abstract: This paper introduces WaveGrad, a conditional model for waveform generation which estimates gradients of the data density. The model is built on prior work on score matching and diffusion probabilistic models. It starts from a Gaussian white noise signal and iteratively refines the signal via a gradient-based sampler conditioned on the mel-spectrogram. WaveGrad offers a natural way to trade infere…

    Submitted 9 October, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

  20. arXiv:2007.03792  [pdf, other]

    stat.ME stat.AP

    Statistical design considerations for trials that study multiple indications

    Authors: Alexander M. Kaizer, Joseph S. Koopmeiners, Nan Chen, Brian P. Hobbs

    Abstract: Breakthroughs in cancer biology have defined new research programs emphasizing the development of therapies that target specific pathways in tumor cells. Innovations in clinical trial design have followed with master protocols defined by inclusive eligibility criteria and evaluations of multiple therapies and/or histologies. Consequently, characterization of subpopulation heterogeneity has become…

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: 27 pages, 10 figures

  21. arXiv:2005.08520  [pdf, other]

    cs.LG cs.CL stat.ML

    Robust Training of Vector Quantized Bottleneck Models

    Authors: Adrian Łańcucki, Jan Chorowski, Guillaume Sanchez, Ricard Marxer, Nanxin Chen, Hans J. G. A. Dolfing, Sameer Khurana, Tanel Alumäe, Antoine Laurent

    Abstract: In this paper we demonstrate methods for reliable and efficient training of discrete representation using Vector-Quantized Variational Auto-Encoder models (VQ-VAEs). Discrete latent variable models have been shown to learn nontrivial representations of speech, applicable to unsupervised voice conversion and reaching state-of-the-art performance on unit discovery tasks. For unsupervised representat…

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: Published at IJCNN 2020

  22. arXiv:2005.08088  [pdf, ps, other]

    cs.LG stat.ML

    Learning and Optimization with Seasonal Patterns

    Authors: Ningyuan Chen, Chun Wang, Longlin Wang

    Abstract: A standard assumption adopted in the multi-armed bandit (MAB) framework is that the mean rewards are constant over time. This assumption can be restrictive in the business world as decision-makers often face an evolving environment where the mean rewards are time-varying. In this paper, we consider a non-stationary MAB model with $K$ arms whose mean rewards vary over time in a periodic manner. The…

    Submitted 22 August, 2021; v1 submitted 16 May, 2020; originally announced May 2020.

  23. arXiv:2002.05039  [pdf, ps, other]

    eess.AS cs.LG cs.SD stat.ML

    x-vectors meet emotions: A study on dependencies between emotion and speaker recognition

    Authors: Raghavendra Pappagari, Tianzi Wang, Jesus Villalba, Nanxin Chen, Najim Dehak

    Abstract: In this work, we explore the dependencies between speaker recognition and emotion recognition. We first show that knowledge learned for speaker recognition can be reused for emotion recognition through transfer learning. Then, we show the effect of emotion on speaker recognition. For emotion recognition, we show that using a simple linear model is enough to obtain good performance on the features…

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

  24. arXiv:2002.04881  [pdf, other]

    stat.ML cs.LG

    Learning Flat Latent Manifolds with VAEs

    Authors: Nutan Chen, Alexej Klushyn, Francesco Ferroni, Justin Bayer, Patrick van der Smagt

    Abstract: Measuring the similarity between data points often requires domain knowledge, which can in parts be compensated by relying on unsupervised methods such as latent-variable models, where similarity/distance is estimated in a more compact latent space. Prevalent is the use of the Euclidean metric, which has the drawback of ignoring information about similarity of data stored in the decoder, as captur…

    Submitted 12 August, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Thirty-seventh International Conference on Machine Learning (ICML) 2020

    Journal ref: International Conference on Machine Learning 2020

  25. arXiv:2001.11019  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Improving Language Identification for Multilingual Speakers

    Authors: Andrew Titus, Jan Silovsky, Nanxin Chen, Roger Hsiao, Mary Young, Arnab Ghoshal

    Abstract: Spoken language identification (LID) technologies have improved in recent years from discriminating largely distinct languages to discriminating highly similar languages or even dialects of the same language. One aspect that has been mostly neglected, however, is discrimination of languages for multilingual speakers, despite being a primary target audience of many systems that utilize LID technolo…

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: 5 pages, 2 figures. Submitted to ICASSP 2020

  26. arXiv:2001.09390  [pdf, ps, other]

    cs.LG stat.ML

    Regime Switching Bandits

    Authors: Xiang Zhou, Yi Xiong, Ningyuan Chen, Xuefeng Gao

    Abstract: We study a multi-armed bandit problem where the rewards exhibit regime switching. Specifically, the distributions of the random rewards generated from all arms are modulated by a common underlying state modeled as a finite-state Markov chain. The agent does not observe the underlying state and has to learn the transition matrix and the reward distributions. We propose a learning algorithm for this…

    Submitted 1 February, 2021; v1 submitted 25 January, 2020; originally announced January 2020.

  27. Phase I analysis of hidden operating status for wind turbine

    Authors: Yuchen Shi, Nan Chen

    Abstract: Data-driven methods based on Supervisory Control and Data Acquisition (SCADA) become a recent trend for wind turbine condition monitoring. However, SCADA data are known to be of low quality due to low sampling frequency and complex turbine working dynamics. In this work, we focus on the phase I analysis of SCADA data to better understand turbines' operating status. As one of the most important cha…

    Submitted 8 July, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

  28. arXiv:1912.04006  [pdf, other]

    stat.ME eess.SP

    Conditional Kernel Density Estimation Considering Autocorrelation for Renewable Energy Probabilistic Modeling

    Authors: Yuchen Shi, Nan Chen

    Abstract: Renewable energy is essential for energy security and global warming mitigation. However, power generation from renewable energy sources is uncertain due to volatile weather conditions and complex equipment operations. To improve equipment's operation efficiency, it is important to understand and characterize the uncertainty in renewable power generation. In this paper, we proposed a conditional k…

    Submitted 8 July, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

  29. arXiv:1911.04908  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Listen and Fill in the Missing Letters: Non-Autoregressive Transformer for Speech Recognition

    Authors: Nanxin Chen, Shinji Watanabe, Jesús Villalba, Najim Dehak

    Abstract: Recently, very deep transformers have outperformed conventional bi-directional long short-term memory networks by a large margin in speech recognition. However, to put them into production usage, inference computation cost is still a serious concern in real scenarios. In this paper, we study two different non-autoregressive transformer structures for automatic speech recognition (ASR): A-CMLM and A-FM…

    Submitted 6 April, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

  30. A Control Chart Approach to Power System Line Outage Detection Under Transient Dynamics

    Authors: Xiaozhou Yang, Nan Chen, Chao Zhai

    Abstract: Online transmission line outage detection over the entire network enables timely corrective action to be taken, which prevents a local event from cascading into a large scale blackout. Line outage detection aims to detect an outage as soon as possible after it happened. Traditional methods either do not consider the transient dynamics following an outage or require a full Phasor Measurement Unit (…

    Submitted 22 May, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: 9 pages, 8 figures, under review for IEEE Transactions on Power Systems

  31. arXiv:1909.05659  [pdf, other]

    cs.RO cs.CV cs.LG stat.ML

    Estimating Fingertip Forces, Torques, and Local Curvatures from Fingernail Images

    Authors: Nutan Chen, Göran Westling, Benoni B. Edin, Patrick van der Smagt

    Abstract: The study of dexterous manipulation has provided important insights into human sensorimotor control as well as inspiration for manipulation strategies in robotic hands. Previous work focused on experimental environments with restrictions. Here we describe a method using the deformation and color distribution of the fingernail and its surrounding skin, to estimate the fingertip forces, torques and co…

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: Robotica

  32. arXiv:1908.08750  [pdf, other]

    stat.ML cs.LG

    Increasing the Generalisation Capacity of Conditional VAEs

    Authors: Alexej Klushyn, Nutan Chen, Botond Cseke, Justin Bayer, Patrick van der Smagt

    Abstract: We address the problem of one-to-many mappings in supervised learning, where a single instance has many different solutions of possibly equal cost. The framework of conditional variational autoencoders describes a class of methods to tackle such structured-prediction tasks by means of latent variables. We propose to incentivise informative latent representations for increasing the generalisation c…

    Submitted 10 September, 2019; v1 submitted 23 August, 2019; originally announced August 2019.

  33. arXiv:1908.01109  [pdf, other]

    cs.LG econ.EM stat.ML

    The Use of Binary Choice Forests to Model and Estimate Discrete Choices

    Authors: Ningyuan Chen, Guillermo Gallego, Zhuodong Tang

    Abstract: Problem definition. In retailing, discrete choice models (DCMs) are commonly used to capture the choice behavior of customers when offered an assortment of products. When estimating DCMs using transaction data, flexible models (such as machine learning models or nonparametric models) are typically not interpretable and hard to estimate, while tractable models (such as the multinomial logit model)…

    Submitted 17 April, 2024; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: 61 pages, 10 figures, 30 tables

  34. arXiv:1908.00618  [pdf, other]

    stat.CO

    Analyzing Basket Trials under Multisource Exchangeability Assumptions

    Authors: Michael J. Kane, Nan Chen, Alexander M. Kaizer, Xun Jiang, H. Amy Xia, Brian P. Hobbs

    Abstract: Basket designs are prospective clinical trials that are devised with the hypothesis that the presence of selected molecular features determines a patient's subsequent response to a particular "targeted" treatment strategy. Basket trials are designed to enroll multiple clinical subpopulations to which it is assumed that the therapy in question offers beneficial efficacy in the presence of the target…

    Submitted 1 August, 2019; originally announced August 2019.

    Comments: 18 pages, 4 figures, 3 tables, submitted to the Journal of Open Source Software

    MSC Class: 62-04

    ACM Class: G.3

  35. arXiv:1907.06550  [pdf, other]

    stat.ML cs.LG

    A Dimension-free Algorithm for Contextual Continuum-armed Bandits

    Authors: Wenhao Li, Ningyuan Chen, L. Jeff Hong

    Abstract: In contextual continuum-armed bandits, the contexts $x$ and the arms $y$ are both continuous and drawn from high-dimensional spaces. The payoff function to learn $f(x,y)$ does not have a particular parametric form. The literature has shown that for Lipschitz-continuous functions, the optimal regret is $\tilde{O}(T^{\frac{d_x+d_y+1}{d_x+d_y+2}})$, where $d_x$ and $d_y$ are the dimensions of context…

    Submitted 3 October, 2022; v1 submitted 15 July, 2019; originally announced July 2019.

  36. arXiv:1905.10626  [pdf, other]

    cs.LG cs.CR stat.ML

    Rethinking Softmax Cross-Entropy Loss for Adversarial Robustness

    Authors: Tianyu Pang, Kun Xu, Yinpeng Dong, Chao Du, Ning Chen, Jun Zhu

    Abstract: Previous work shows that adversarially robust generalization requires larger sample complexity, and the same dataset, e.g., CIFAR-10, which enables good standard accuracy may not suffice to train robust models. Since collecting new training data could be costly, we focus on better utilizing the given data by inducing the regions with high sample density in the feature space, which could lead to lo…

    Submitted 20 February, 2020; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: ICLR 2020

  37. arXiv:1905.04982  [pdf, other]

    stat.ML cs.LG

    Learning Hierarchical Priors in VAEs

    Authors: Alexej Klushyn, Nutan Chen, Richard Kurle, Botond Cseke, Patrick van der Smagt

    Abstract: We propose to learn a hierarchical prior in the context of variational autoencoders to avoid the over-regularisation resulting from a standard normal prior distribution. To incentivise an informative latent representation of the data, we formulate the learning problem as a constrained optimisation problem by extending the Taming VAEs framework to two-level hierarchical models. We introduce a graph…

    Submitted 5 October, 2019; v1 submitted 13 May, 2019; originally announced May 2019.

    Comments: Published at NeurIPS 2019 (spotlight)

  38. arXiv:1903.08766  [pdf, other]

    stat.AP

    A Method for Measuring Network Effects of One-to-One Communication Features in Online A/B Tests

    Authors: Guillaume Saint-Jacques, James Eric Sorenson, Nanyu Chen, Ya Xu

    Abstract: A/B testing is an important decision-making tool in product development because it can provide an accurate estimate of the average treatment effect of a new feature, which allows developers to understand the business impact of new changes to products or algorithms. However, an important assumption of A/B testing, the Stable Unit Treatment Value Assumption (SUTVA), is not always a valid assumption to…

    Submitted 20 March, 2019; originally announced March 2019.

  39. arXiv:1901.08846  [pdf, other]

    cs.LG stat.ML

    Improving Adversarial Robustness via Promoting Ensemble Diversity

    Authors: Tianyu Pang, Kun Xu, Chao Du, Ning Chen, Jun Zhu

    Abstract: Though deep neural networks have achieved significant progress on various tasks, often enhanced by model ensemble, existing high-performance models can be vulnerable to adversarial attacks. Many efforts have been devoted to enhancing the robustness of individual networks and then constructing a straightforward ensemble, e.g., by directly averaging the outputs, which ignores the interaction among n…

    Submitted 29 May, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: ICML 2019

  40. arXiv:1901.01000  [pdf, ps, other]

    stat.ML cs.LG

    Fast Multi-Class Probabilistic Classifier by Sparse Non-parametric Density Estimation

    Authors: Wan-Ping Nicole Chen, Yuan-chin Ivan Chang

    Abstract: Model interpretation is essential in many application scenarios, and building a classification model that is easy to interpret may provide useful information for further studies and improvement. It is common to encounter a lengthy set of variables in modern data analysis, especially when data are collected in some automatic ways. These kinds of datasets may not be collected with a sp…

    Submitted 4 January, 2019; originally announced January 2019.

  41. arXiv:1812.09234  [pdf, ps, other]

    cs.LG econ.EM stat.ML

    A Primal-dual Learning Algorithm for Personalized Dynamic Pricing with an Inventory Constraint

    Authors: Ningyuan Chen, Guillermo Gallego

    Abstract: We consider the problem of a firm seeking to use personalized pricing to sell an exogenously given stock of a product over a finite selling horizon to different consumer types. We assume that the type of an arriving consumer can be observed but the demand function associated with each type is initially unknown. The firm sets personalized prices dynamically for each type and attempts to maximize th…

    Submitted 7 October, 2021; v1 submitted 20 December, 2018; originally announced December 2018.

  42. arXiv:1812.08284  [pdf, other]

    stat.ML cs.LG

    Fast Approximate Geodesics for Deep Generative Models

    Authors: Nutan Chen, Francesco Ferroni, Alexej Klushyn, Alexandros Paraschos, Justin Bayer, Patrick van der Smagt

    Abstract: The length of the geodesic between two data points along a Riemannian manifold, induced by a deep generative model, yields a principled measure of similarity. Current approaches are limited to low-dimensional latent spaces, due to the computational complexity of solving a non-convex optimisation problem. We propose finding shortest paths in a finite graph of samples from the aggregate approximate…

    Submitted 23 May, 2019; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: 28th International Conference on Artificial Neural Networks, 2019

    Journal ref: 28th International Conference on Artificial Neural Networks, 2019

  43. False Discovery Rate Controlled Heterogeneous Treatment Effect Detection for Online Controlled Experiments

    Authors: Yuxiang Xie, Nanyu Chen, Xiaolin Shi

    Abstract: Online controlled experiments (a.k.a. A/B testing) have been used as the mantra for data-driven decision making on feature changing and product shipping in many Internet companies. However, it is still a great challenge to systematically measure how every code or feature change impacts millions of users with great heterogeneity (e.g. countries, ages, devices). The most commonly used A/B testing fr…

    Submitted 14 August, 2018; originally announced August 2018.

    MSC Class: 62

    Journal ref: Yuxiang Xie, Nanyu Chen, and Xiaolin Shi. 2018. KDD '18 Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining Pages 876-885

  44. arXiv:1808.02026  [pdf, other]

    stat.ML cs.LG

    Active Learning based on Data Uncertainty and Model Sensitivity

    Authors: Nutan Chen, Alexej Klushyn, Alexandros Paraschos, Djalel Benbouzid, Patrick van der Smagt

    Abstract: Robots can rapidly acquire new skills from demonstrations. However, during generalisation of skills or transitioning across fundamentally different skills, it is unclear whether the robot has the necessary knowledge to perform the task. Failing to detect missing information often leads to abrupt movements or to collisions with the environment. Active learning can quantify the uncertainty of perfor…

    Submitted 6 August, 2018; originally announced August 2018.

    Comments: Published at the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems

  45. arXiv:1808.00114  [pdf]

    stat.AP

    Automatic Detection and Diagnosis of Biased Online Experiments

    Authors: Nanyu Chen, Min Liu, Ya Xu

    Abstract: We have seen a massive growth of online experiments at LinkedIn, and in industry at large. It is now more important than ever to create an intelligent A/B platform that can truly democratize A/B testing by allowing everyone to make quality decisions, regardless of their skillset. With the tremendous knowledge base created around experimentation, we are able to mine through historical data, and dis…

    Submitted 31 July, 2018; originally announced August 2018.

  46. arXiv:1805.01136  [pdf, other]

    cs.LG stat.ML

    Nonparametric Pricing Analytics with Customer Covariates

    Authors: Ningyuan Chen, Guillermo Gallego

    Abstract: Personalized pricing analytics is becoming an essential tool in retailing. Upon observing the personalized information of each arriving customer, the firm needs to set a price accordingly based on the covariates such as income, education background, past purchasing history to extract more revenue. For new entrants of the business, the lack of historical data may severely limit the power and profit…

    Submitted 15 February, 2020; v1 submitted 3 May, 2018; originally announced May 2018.

  47. arXiv:1803.10366  [pdf, other]

    cs.LG cs.DS stat.ML

    Smoothed Online Convex Optimization in High Dimensions via Online Balanced Descent

    Authors: Niangjun Chen, Gautam Goel, Adam Wierman

    Abstract: We study Smoothed Online Convex Optimization, a version of online convex optimization where the learner incurs a penalty for changing her actions between rounds. Given a $\Omega(\sqrt{d})$ lower bound on the competitive ratio of any online algorithm, where $d$ is the dimension of the action space, we ask under what conditions this bound can be beaten. We introduce a novel algorithmic framework for this…

    Submitted 8 July, 2018; v1 submitted 27 March, 2018; originally announced March 2018.

  48. arXiv:1711.04425  [pdf, other]

    stat.ML

    Message Passing Stein Variational Gradient Descent

    Authors: Jingwei Zhuo, Chang Liu, Jiaxin Shi, Jun Zhu, Ning Chen, Bo Zhang

    Abstract: Stein variational gradient descent (SVGD) is a recently proposed particle-based Bayesian inference method, which has attracted a lot of interest due to its remarkable approximation ability and particle efficiency compared to traditional variational inference and Markov Chain Monte Carlo methods. However, we observed that particles of SVGD tend to collapse to modes of the target distribution, and t…

    Submitted 8 June, 2018; v1 submitted 13 November, 2017; originally announced November 2017.

    Comments: To appear in the Proceedings of the 35th International Conference on Machine Learning (ICML 2018)

  49. arXiv:1711.01204  [pdf, other]

    stat.ML cs.LG

    Metrics for Deep Generative Models

    Authors: Nutan Chen, Alexej Klushyn, Richard Kurle, Xueyan Jiang, Justin Bayer, Patrick van der Smagt

    Abstract: Neural samplers such as variational autoencoders (VAEs) or generative adversarial networks (GANs) approximate distributions by transforming samples from a simple random source---the latent space---to samples from a more complex distribution represented by a dataset. While the manifold hypothesis implies that the density induced by a dataset contains large regions of low density, the training crite…

    Submitted 8 February, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Published on the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), 2018

    Journal ref: The 21st International Conference on Artificial Intelligence and Statistics, 2018

  50. arXiv:1709.05562  [pdf, other]

    stat.ME math.DS math.PR

    Efficient Statistically Accurate Algorithms for the Fokker-Planck Equation in Large Dimensions

    Authors: Nan Chen, Andrew J. Majda

    Abstract: Solving the Fokker-Planck equation for high-dimensional complex turbulent dynamical systems is an important and practical issue. However, most traditional methods suffer from the curse of dimensionality and have difficulties in capturing the fat-tailed, highly intermittent probability density functions (PDFs) of complex systems in turbulence, neuroscience and excitable media. In this article, effic…

    Submitted 16 September, 2017; originally announced September 2017.

    MSC Class: 35Q84; 37F99; 76F55; 65C05