Skip to main content

Showing 1–18 of 18 results for author: Lai, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.00317  [pdf, other

    stat.ML cs.LG stat.ME

    Combining Experimental and Historical Data for Policy Evaluation

    Authors: Ting Li, Chengchun Shi, Qianglin Wen, Yang Sui, Yongli Qin, Chunbo Lai, Hongtu Zhu

    Abstract: This paper studies policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data integration methods that linearly integrate base policy value estimators constructed based on the experimental and historical data, with weights optimized to min… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  2. arXiv:2405.14822  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher

    Authors: Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon

    Abstract: To accelerate sampling, diffusion models (DMs) are often distilled into generators that directly map noise to data in a single step. In this approach, the resolution of the generator is fundamentally limited by that of the teacher DM. To overcome this limitation, we propose Progressive Growing of Diffusion Autoencoder (PaGoDA), a technique to progressively grow the resolution of the generator beyo… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2310.02279  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion

    Authors: Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Yutong He, Yuki Mitsufuji, Stefano Ermon

    Abstract: Consistency Models (CM) (Song et al., 2023) accelerate score-based diffusion model sampling at the cost of sample quality but lack a natural way to trade-off quality for speed. To address this limitation, we propose Consistency Trajectory Model (CTM), a generalization encompassing CM and score-based models as special cases. CTM trains a single neural network that can -- in a single forward pass --… ▽ More

    Submitted 30 March, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: International Conference on Learning Representations

  4. arXiv:2303.14366  [pdf, other

    stat.ML cs.LG

    Hybrid Fuzzy-Crisp Clustering Algorithm: Theory and Experiments

    Authors: Akira R. Kinjo, Daphne Teck Ching Lai

    Abstract: With the membership function being strictly positive, the conventional fuzzy c-means clustering method sometimes causes imbalanced influence when clusters of vastly different sizes exist. That is, an outstandingly large cluster drags to its center all the other clusters, however far they are separated. To solve this problem, we propose a hybrid fuzzy-crisp clustering algorithm based on a target fu… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 41 pages, 13 figures, 10 tables

  5. arXiv:2110.08005  [pdf, other

    stat.ME stat.AP

    Spatially Adaptive Calibrations of AirBox PM$_{2.5}$ Data

    Authors: ShengLi Tzeng, Chi-Wei Lai, Hsin-Cheng Huang

    Abstract: Two networks are available to monitor PM$_{2.5}$ in Taiwan, including the Taiwan Air Quality Monitoring Network (TAQMN) and the AirBox network. The TAQMN, managed by Taiwan's Environmental Protection Administration (EPA), provides high-quality PM$_{2.5}$ measurements at $77$ monitoring stations. More recently, the AirBox network was launched, consisting of low-cost, small internet-of-things (IoT)… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: 21 pages, 9 figures

    MSC Class: 62P12

  6. arXiv:2009.14373  [pdf, other

    cs.LG stat.ML

    Facilitate the Parametric Dimension Reduction by Gradient Clip**

    Authors: Chien-Hsun Lai, Yu-Shuen Wang

    Abstract: We extend a well-known dimension reduction method, t-distributed stochastic neighbor embedding (t-SNE), from non-parametric to parametric by training neural networks. The main advantage of a parametric technique is the generalization of handling new data, which is particularly beneficial for streaming data exploration. However, training a neural network to optimize the t-SNE objective function fre… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

  7. arXiv:2006.05534  [pdf, other

    cs.LG stat.ML

    Novelty Detection via Robust Variational Autoencoding

    Authors: Chieh-Hsin Lai, Dongmian Zou, Gilad Lerman

    Abstract: We propose a new method for novelty detection that can tolerate high corruption of the training points, whereas previous works assumed either no or very low corruption. Our method trains a robust variational autoencoder (VAE), which aims to generate a model for the uncorrupted training points. To gain robustness to high corruption, we incorporate the following four changes to the common VAE: 1. Ex… ▽ More

    Submitted 1 March, 2023; v1 submitted 9 June, 2020; originally announced June 2020.

  8. arXiv:2003.09077  [pdf, other

    cs.LG eess.SP math.NA math.OC stat.ML

    Inverse Problems, Deep Learning, and Symmetry Breaking

    Authors: Kshitij Tayal, Chieh-Hsin Lai, Vipin Kumar, Ju Sun

    Abstract: In many physical systems, inputs related by intrinsic system symmetries are mapped to the same output. When inverting such systems, i.e., solving the associated inverse problems, there is no unique solution. This causes fundamental difficulties for deploying the emerging end-to-end deep learning approach. Using the generalized phase retrieval problem as an illustrative example, we show that carefu… ▽ More

    Submitted 19 March, 2020; originally announced March 2020.

  9. arXiv:2003.06686  [pdf, other

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0

    Authors: Zack Hodari, Catherine Lai, Simon King

    Abstract: In English, prosody adds a broad range of information to segment sequences, from information structure (e.g. contrast) to stylistic variation (e.g. expression of emotion). However, when learning to control prosody in text-to-speech voices, it is not clear what exactly the control is modifying. Existing research on discrete representation learning for prosody has demonstrated high naturalness, but… ▽ More

    Submitted 14 March, 2020; originally announced March 2020.

    Comments: Published to the 10th ISCA International Conference on Speech Prosody (SP2020)

  10. arXiv:1909.05176  [pdf, other

    cs.LG cs.NE nlin.AO nlin.CD stat.ML

    Optimal Machine Intelligence at the Edge of Chaos

    Authors: Ling Feng, Lin Zhang, Choy Heng Lai

    Abstract: It has long been suggested that the biological brain operates at some critical point between two different phases, possibly order and chaos. Despite many indirect empirical evidence from the brain and analytical indication on simple neural networks, the foundation of this hypothesis on generic non-linear systems remains unclear. Here we develop a general theory that reveals the exact edge of chaos… ▽ More

    Submitted 29 October, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

  11. arXiv:1904.00152  [pdf, other

    cs.LG cs.CV stat.ML

    Robust Subspace Recovery Layer for Unsupervised Anomaly Detection

    Authors: Chieh-Hsin Lai, Dongmian Zou, Gilad Lerman

    Abstract: We propose a neural network for unsupervised anomaly detection with a novel robust subspace recovery layer (RSR layer). This layer seeks to extract the underlying subspace from a latent representation of the given data and removes outliers that lie away from this subspace. It is used within an autoencoder. The encoder maps the data into a latent space, from which the RSR layer extracts the subspac… ▽ More

    Submitted 24 December, 2019; v1 submitted 30 March, 2019; originally announced April 2019.

    Comments: This work is on the ICLR 2020 conference

    Journal ref: Eighth International Conference on Learning Representations (ICLR), 2020, https://openreview.net/pdf?id=rylb3eBtwr

  12. arXiv:1812.07135  [pdf, other

    cs.SI cs.LG stat.ML

    Globalness Detection in Online Social Network

    Authors: Yu-Cheng Lin, Chun-Ming Lai, S. Felix Wu, George A. Barnett

    Abstract: Classification problems have made significant progress due to the maturity of artificial intelligence (AI). However, differentiating items from categories without noticeable boundaries is still a huge challenge for machines -- which is also crucial for machines to be intelligent. In order to study the fuzzy concept on classification, we define and propose a globalness detection with the four-sta… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

    Comments: 6 pages, to be appeared in IEEE International Conference on Semantic Computing (ICSC2019)

  13. arXiv:1810.13048  [pdf, other

    eess.AS cs.CL cs.SD stat.ML

    Attentive Filtering Networks for Audio Replay Attack Detection

    Authors: Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King

    Abstract: An attacker may use a variety of techniques to fool an automatic speaker verification system into accepting them as a genuine user. Anti-spoofing methods meanwhile aim to make the system robust against such attacks. The ASVspoof 2017 Challenge focused specifically on replay attacks, with the intention of measuring the limits of replay attack detection as well as develo** countermeasures against… ▽ More

    Submitted 30 October, 2018; originally announced October 2018.

    Comments: Submitted to ICASSP 2019

  14. arXiv:1805.07427  [pdf, other

    stat.CO

    Method G: Uncertainty Quantification for Distributed Data Problems using Generalized Fiducial Inference

    Authors: Randy C. S. Lai, J. Hannig, Thomas C. M. Lee

    Abstract: It is not unusual for a data analyst to encounter data sets distributed across several computers. This can happen for reasons such as privacy concerns, efficiency of likelihood evaluations, or just the sheer size of the whole data set. This presents new challenges to statisticians as even computing simple summary statistics such as the median becomes computationally challenging. Furthermore, if ot… ▽ More

    Submitted 18 May, 2018; originally announced May 2018.

  15. Uncertainty Quantification for High Dimensional Sparse Nonparametric Additive Models

    Authors: Qi Gao, Randy C. S. Lai, Thomas C. M. Lee, Yao Li

    Abstract: Statistical inference in high dimensional settings has recently attracted enormous attention within the literature. However, most published work focuses on the parametric linear regression problem. This paper considers an important extension of this problem: statistical inference for high dimensional sparse nonparametric additive models. To be more precise, this paper develops a methodology for co… ▽ More

    Submitted 13 November, 2019; v1 submitted 23 September, 2017; originally announced September 2017.

    Journal ref: 2019, Technometrics

  16. arXiv:1708.04929  [pdf, other

    stat.ME

    Covariance Estimation via Fiducial Inference

    Authors: W. Jenny Shi, Jan Hannig, Randy C. S. Lai, Thomas C. M. Lee

    Abstract: As a classical problem, covariance estimation has drawn much attention from the statistical community for decades. Much work has been done under the frequentist and the Bayesian frameworks. Aiming to quantify the uncertainty of the estimators without having to choose a prior, we have developed a fiducial approach to the estimation of covariance matrix. Built upon the Fiducial Berstein-von Mises Th… ▽ More

    Submitted 16 August, 2017; originally announced August 2017.

    Comments: 31 pages with 5 figures, including appendix; 1 supplementary document with 5 figures

    MSC Class: 62J10; 62E20; 62F25; 62F12

  17. arXiv:1501.00599  [pdf, ps, other

    math.ST stat.OT

    On testing More IFRA Ordering-II

    Authors: Muhyiddin Izadi, Baha-Eldin Khaledi, Chin-Diew Lai

    Abstract: Suppose F and G are two life distribution functions. It is said that F is more IFRA than G (written by F<_* G) if G^(-1) F(x) is starshaped on (0,infty). In this paper, the problem of testing H_0:F=_* G against H_1:F<_* G and F \neq_* G is considered in both cases when G is known and when G is unknown. We propose a new test based on U-statistics and obtain the asymptotic distribution of the test s… ▽ More

    Submitted 3 January, 2015; originally announced January 2015.

  18. arXiv:1304.7847  [pdf, ps, other

    stat.ME

    Generalized Fiducial Inference for Ultrahigh Dimensional Regression

    Authors: Randy C. S. Lai, Jan Hannig, Thomas C. M. Lee

    Abstract: In recent years the ultrahigh dimensional linear regression problem has attracted enormous attentions from the research community. Under the sparsity assumption most of the published work is devoted to the selection and estimation of the significant predictor variables. This paper studies a different but fundamentally important aspect of this problem: uncertainty quantification for parameter estim… ▽ More

    Submitted 29 April, 2013; originally announced April 2013.