
Showing 1–47 of 47 results for author: Chang, W

Searching in archive stat.
  1. arXiv:2312.15549  [pdf, other]

    cs.LG cs.MA math.ST stat.ML

    Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

    Authors: Tianyuan Jin, Hao-Lun Hsu, William Chang, Pan Xu

    Abstract: We study the multi-agent multi-armed bandit (MAMAB) problem, where $m$ agents are factored into $ρ$ overlapping groups. Each group represents a hyperedge, forming a hypergraph over the agents. At each round of interaction, the learner pulls a joint arm (composed of individual arms for each agent) and receives a reward according to the hypergraph structure. Specifically, we assume there is a local…

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 22 pages, 7 figures, 2 tables. To appear in the proceedings of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI'2024)

  2. arXiv:2311.06210  [pdf, other]

    cs.LG cs.MA stat.ML

    Optimal Cooperative Multiplayer Learning Bandits with Noisy Rewards and No Communication

    Authors: William Chang, Yuanhao Lu

    Abstract: We consider a cooperative multiplayer bandit learning problem where the players are only allowed to agree on a strategy beforehand, but cannot communicate during the learning process. In this problem, each player simultaneously selects an action. Based on the actions selected by all players, the team of players receives a reward. The actions of all the players are commonly observed. However, each…

    Submitted 10 November, 2023; originally announced November 2023.

  3. arXiv:2306.02451  [pdf, other]

    cs.LG cs.AI stat.ML

    For SALE: State-Action Representation Learning for Deep Reinforcement Learning

    Authors: Scott Fujimoto, Wei-Di Chang, Edward J. Smith, Shixiang Shane Gu, Doina Precup, David Meger

    Abstract: In the field of reinforcement learning (RL), representation learning is a proven tool for complex image-based tasks, but is often overlooked for environments with low-level states, such as physical control problems. This paper introduces SALE, a novel approach for learning embeddings that model the nuanced interaction between state and action, enabling effective representation learning from low-le…

    Submitted 5 November, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  4. arXiv:2305.17380  [pdf, ps, other]

    cs.LG stat.ML

    No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions

    Authors: Tiancheng Jin, Junyan Liu, Chloé Rouyer, William Chang, Chen-Yu Wei, Haipeng Luo

    Abstract: Existing online learning algorithms for adversarial Markov Decision Processes achieve ${O}(\sqrt{T})$ regret after $T$ rounds of interactions even if the loss functions are chosen arbitrarily by an adversary, with the caveat that the transition function has to be fixed. This is because it has been shown that adversarial transition functions make no-regret learning impossible. Despite such impossib…

    Submitted 26 October, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Update the camera-ready version for NeurIPS 2023

    ACM Class: I.2.6

  5. arXiv:2211.12200  [pdf, other]

    stat.AP stat.ME

    Fast Computer Model Calibration using Annealed and Transformed Variational Inference

    Authors: Dongkyu Derek Cho, Won Chang, Jaewoo Park

    Abstract: Computer models play a crucial role in numerous scientific and engineering domains. To ensure the accuracy of simulations, it is essential to properly calibrate the input parameters of these models through statistical inference. While Bayesian inference is the standard approach for this task, employing Markov Chain Monte Carlo methods often encounters computational hurdles due to the costly evalua…

    Submitted 5 March, 2024; v1 submitted 22 November, 2022; originally announced November 2022.

  6. arXiv:2210.09560  [pdf, other]

    stat.ME

    A Bayesian Convolutional Neural Network-based Generalized Linear Model

    Authors: Yeseul Jeon, Won Chang, Seonghyun Jeong, Sanghoon Han, Jaewoo Park

    Abstract: Convolutional neural networks (CNNs) provide flexible function approximations for a wide variety of applications when the input variables are in the form of images or spatial data. Although CNNs often outperform traditional statistical models in prediction accuracy, statistical inference, such as estimating the effects of covariates and quantifying the prediction uncertainty, is not trivial due to…

    Submitted 22 May, 2024; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 25 pages, 7 figures

  7. arXiv:2207.06587  [pdf, other]

    stat.ME stat.AP

    A Spatio-Temporal Dirichlet Process Mixture Model for Coronavirus Disease-19

    Authors: Jaewoo Park, Seorim Yi, Won Chang, Jorge Mateu

    Abstract: Understanding the spatio-temporal patterns of the coronavirus disease 2019 (COVID-19) is essential to construct public health interventions. Spatially referenced data can provide richer opportunities to understand the mechanism of the disease spread compared to the more often encountered aggregated count data. We propose a spatio-temporal Dirichlet process mixture model to analyze confirmed cases…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: 26 pages, 10 figures

  8. arXiv:2110.13006  [pdf, other]

    stat.ML cs.LG

    Gradient-based Quadratic Multiform Separation

    Authors: Wen-Teng Chang

    Abstract: Classification, as a supervised learning concept, is an important topic in machine learning. It aims at categorizing a set of data into classes. There are several commonly used classification methods nowadays, such as k-nearest neighbors, random forest, and support vector machine. Each of them has its own pros and cons, and none of them is invincible for all kinds of problems. In this thesis, we fo…

    Submitted 26 October, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: 47 pages, 11 figures

  9. arXiv:2110.10604  [pdf, other]

    stat.AP

    Bayesian Model Calibration and Sensitivity Analysis for Oscillating Biological Experiments

    Authors: Youngdeok Hwang, Hang J. Kim, Won Chang, Christian Hong, Steven N. MacEachern

    Abstract: Understanding the oscillating behaviors that govern organisms' internal biological processes requires interdisciplinary efforts combining both biological and computer experiments, as the latter can complement the former by simulating perturbed conditions with higher resolution. Harmonizing the two types of experiment, however, poses significant statistical challenges due to identifiability issues,…

    Submitted 28 November, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: manuscript 33 pages, appendix 6 pages

    MSC Class: 62P10 (Primary); 62-08 (Secondary)

  10. arXiv:2110.00685  [pdf, other]

    cs.LG cs.AI cs.IR stat.ML

    Fast Multi-Resolution Transformer Fine-tuning for Extreme Multi-label Text Classification

    Authors: Jiong Zhang, Wei-cheng Chang, Hsiang-fu Yu, Inderjit S. Dhillon

    Abstract: Extreme multi-label text classification (XMC) seeks to find relevant labels from an extremely large label collection for a given text input. Many real-world applications can be formulated as XMC problems, such as recommendation systems, document tagging and semantic search. Recently, transformer-based XMC methods, such as X-Transformer and LightXML, have shown significant improvement over other XMC…

    Submitted 28 October, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

  11. arXiv:2109.00539  [pdf, other]

    stat.ME cs.LG

    Spatially and Robustly Hybrid Mixture Regression Model for Inference of Spatial Dependence

    Authors: Wennan Chang, Pengtao Dang, Changlin Wan, Xiaoyu Lu, Yue Fang, Tong Zhao, Yong Zang, Bo Li, Chi Zhang, Sha Cao

    Abstract: In this paper, we propose a Spatial Robust Mixture Regression model to investigate the relationship between a response variable and a set of explanatory variables over the spatial domain, assuming that the relationships may exhibit complex spatially dynamic patterns that cannot be captured by constant regression coefficients. Our method integrates the robust finite mixture Gaussian regression mode…

    Submitted 28 September, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: Accepted by ICDM IEEE 2021

  12. arXiv:2106.12751  [pdf, other]

    stat.ML cs.LG

    Label Disentanglement in Partition-based Extreme Multilabel Classification

    Authors: Xuanqing Liu, Wei-Cheng Chang, Hsiang-Fu Yu, Cho-Jui Hsieh, Inderjit S. Dhillon

    Abstract: Partition-based methods are increasingly used in extreme multi-label classification (XMC) problems due to their scalability to large output spaces (e.g., millions or more). However, existing methods partition the large label space into mutually exclusive clusters, which is sub-optimal when labels have multi-modality and rich semantics. For instance, the label "Apple" can be the fruit or the brand…

    Submitted 23 June, 2021; originally announced June 2021.

  13. arXiv:2102.02999  [pdf, other]

    stat.AP stat.ME

    An Interaction Neyman-Scott Point Process Model for Coronavirus Disease-19

    Authors: J. Park, W. Chang, B. Choi

    Abstract: With rapid transmission, the coronavirus disease 2019 (COVID-19) has led to over 2 million deaths worldwide, posing significant societal challenges. Understanding the spatial patterns of patient visits and detecting the local spreading events are crucial to controlling disease outbreaks. We analyze highly detailed COVID-19 contact tracing data collected from Seoul, which provides a unique opportun…

    Submitted 5 February, 2021; originally announced February 2021.

  14. arXiv:2101.06813  [pdf, other]

    cs.LG cs.AI stat.AP

    Fast and accurate learned multiresolution dynamical downscaling for precipitation

    Authors: Jiali Wang, Zhengchun Liu, Ian Foster, Won Chang, Rajkumar Kettimuthu, Rao Kotamarthi

    Abstract: This study develops a neural network-based approach for emulating high-resolution modeled precipitation data with comparable statistical properties but at greatly reduced computational cost. The key idea is to use a combination of low- and high-resolution simulations to train a neural network to map from the former to the latter. Specifically, we define two types of CNNs, one that stacks variables…

    Submitted 17 January, 2021; originally announced January 2021.

  15. arXiv:2008.13066  [pdf, other]

    stat.ML cs.LG stat.ME

    Computer Model Calibration with Time Series Data using Deep Learning and Quantile Regression

    Authors: Saumya Bhatnagar, Won Chang, Seonjin Kim, Jiali Wang

    Abstract: Computer models play a key role in many scientific and engineering problems. One major source of uncertainty in computer model experiments is input parameter uncertainty. Computer model calibration is a formal statistical procedure to infer input parameters by combining information from model runs and observational data. The existing standard calibration framework suffers from inferential issues wh…

    Submitted 8 September, 2020; v1 submitted 29 August, 2020; originally announced August 2020.

  16. arXiv:2007.15821  [pdf, other]

    cs.LG cs.CG stat.ML

    Geometric All-Way Boolean Tensor Decomposition

    Authors: Changlin Wan, Wennan Chang, Tong Zhao, Sha Cao, Chi Zhang

    Abstract: Boolean tensors have been broadly utilized in representing high dimensional logical data collected on spatial, temporal and/or other relational domains. Boolean Tensor Decomposition (BTD) factorizes a binary tensor into the Boolean sum of multiple rank-1 tensors, which is an NP-hard problem. Existing BTD methods have been limited by their high computational cost, in applications to large scale or hi…

    Submitted 26 October, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020

  17. arXiv:2007.15816  [pdf, other]

    cs.LG stat.ML

    Denoising individual bias for a fairer binary submatrix detection

    Authors: Changlin Wan, Wennan Chang, Tong Zhao, Sha Cao, Chi Zhang

    Abstract: Low rank representation of a binary matrix is powerful in disentangling sparse individual-attribute associations, and has found wide application. Existing binary matrix factorization (BMF) or co-clustering (CC) methods often assume i.i.d. background noise. However, this assumption could be easily violated in real data, where heterogeneous row- or column-wise probability of binary entries results…

    Submitted 9 August, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Accepted at CIKM 2020

  18. arXiv:2007.09720  [pdf, ps, other]

    stat.ME cs.LG

    Supervised clustering of high dimensional data using regularized mixture modeling

    Authors: Wennan Chang, Changlin Wan, Yong Zang, Chi Zhang, Sha Cao

    Abstract: Identifying relationships between molecular variations and their clinical presentations has been challenged by the heterogeneous causes of a disease. It is imperative to unveil the relationship between the high dimensional molecular manifestations and the clinical presentations, while taking into account the possible heterogeneity of the study subjects. We proposed a novel supervised clustering al…

    Submitted 19 July, 2020; originally announced July 2020.

  19. arXiv:2007.03074  [pdf, other]

    stat.ML cs.CV cs.LG

    Kernel Stein Generative Modeling

    Authors: Wei-Cheng Chang, Chun-Liang Li, Youssef Mroueh, Yiming Yang

    Abstract: We are interested in gradient-based Explicit Generative Modeling where samples can be derived from iterative gradient updates based on an estimate of the score function of the data distribution. Recent advances in Stochastic Gradient Langevin Dynamics (SGLD) demonstrate impressive results with energy-based models on high-dimensional and complex data distributions. Stein Variational Gradient Desce…

    Submitted 6 July, 2020; originally announced July 2020.

  20. arXiv:2005.11599  [pdf, other]

    stat.ME cs.LG

    Component-wise Adaptive Trimming For Robust Mixture Regression

    Authors: Wennan Chang, Xinyu Zhou, Yong Zang, Chi Zhang, Sha Cao

    Abstract: Parameter estimation of mixture regression model using the expectation maximization (EM) algorithm is highly sensitive to outliers. Here we propose a fast and efficient robust mixture regression algorithm, called Component-wise Adaptive Trimming (CAT) method. We consider simultaneous outlier detection and robust parameter estimation to minimize the effect of outlier contamination. Robust mixture r…

    Submitted 19 April, 2021; v1 submitted 23 May, 2020; originally announced May 2020.

  21. arXiv:2004.11934  [pdf, other]

    cs.LG stat.ML

    Correlation-aware Unsupervised Change-point Detection via Graph Neural Networks

    Authors: Ruohong Zhang, Yu Hao, Donghan Yu, Wei-Cheng Chang, Guokun Lai, Yiming Yang

    Abstract: Change-point detection (CPD) aims to detect abrupt changes over time series data. Intuitively, effective CPD over multivariate time series should require explicit modeling of the dependencies across input variables. However, existing CPD methods either ignore the dependency structures entirely or rely on the (unrealistic) assumption that the correlation structures are static over time. In this pap…

    Submitted 13 September, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: Accepted for publication in the International Conference on Neural Information Processing (ICONIP) 2020. The original paper is 12 pages; an additional appendix is available on arXiv.

    MSC Class: I.2.6

    Journal ref: ICONIP 2020: Neural Information Processing

  22. arXiv:2002.03932  [pdf, other]

    cs.LG cs.CL cs.IR stat.ML

    Pre-training Tasks for Embedding-based Large-scale Retrieval

    Authors: Wei-Cheng Chang, Felix X. Yu, Yin-Wen Chang, Yiming Yang, Sanjiv Kumar

    Abstract: We consider the large-scale query-document retrieval problem: given a query (e.g., a question), return the set of relevant documents (e.g., paragraphs containing the answer) from a large document corpus. This problem is often solved in two steps. The retrieval phase first reduces the solution space, returning a subset of candidate documents. The scoring phase then re-ranks the documents. Criticall…

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: Accepted by ICLR 2020

  23. arXiv:1911.09816  [pdf, other]

    eess.IV cs.CV stat.AP

    Two-stage dimension reduction for noisy high-dimensional images and application to Cryogenic Electron Microscopy

    Authors: Szu-Chi Chung, Shao-Hsuan Wang, Po-Yao Niu, Su-Yun Huang, Wei-Hau Chang, I-Ping Tu

    Abstract: Principal component analysis (PCA) is arguably the most widely used dimension-reduction method for vector-type data. When applied to a sample of images, PCA requires vectorization of the image data, which in turn entails solving an eigenvalue problem for the sample covariance matrix. We propose herein a two-stage dimension reduction (2SDR) method for image reconstruction from high-dimensional nois…

    Submitted 27 February, 2021; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: 29 pages, 8 figures and 3 tables

    Journal ref: Annals of Mathematical Sciences and Applications. Volume 5, Number 2, 283-316, 2020

  24. arXiv:1910.10479  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    XL-Editor: Post-editing Sentences with XLNet

    Authors: Yong-Siang Shih, Wei-Cheng Chang, Yiming Yang

    Abstract: While neural sequence generation models achieve initial success for many NLP applications, the canonical decoding procedure with left-to-right generation order (i.e., autoregressive) in one pass cannot reflect the true nature of a human revising a sentence to obtain a refined result. In this work, we propose XL-Editor, a novel training framework that enables state-of-the-art generalized autoregress…

    Submitted 19 October, 2019; originally announced October 2019.

    Comments: Under review

  25. arXiv:1910.09745  [pdf, other]

    cs.LG cs.NE stat.ML

    Vanishing Nodes: Another Phenomenon That Makes Training Deep Neural Networks Difficult

    Authors: Wen-Yu Chang, Tsung-Nan Lin

    Abstract: It is well known that the problem of vanishing/exploding gradients is a challenge when training deep networks. In this paper, we describe another phenomenon, called vanishing nodes, that also increases the difficulty of training deep neural networks. As the depth of a neural network increases, the network's hidden nodes have more highly correlated behavior. This results in great similarities betwe…

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: 16 pages, 9 figures and 2 tables

  26. arXiv:1910.04500  [pdf, other]

    cs.LG eess.AS stat.ML

    Orthogonality Constrained Multi-Head Attention For Keyword Spotting

    Authors: Mingu Lee, Jinkyu Lee, Hye Jin Jang, Byeonggeun Kim, Wonil Chang, Kyuwoong Hwang

    Abstract: Multi-head attention mechanism is capable of learning various representations from sequential data while paying attention to different subsequences, e.g., word-pieces or syllables in a spoken word. From the subsequences, it retrieves richer information than a single-head attention which only summarizes the whole sequence into one context vector. However, a naive use of the multi-head attention doe…

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: Accepted to ASRU 2019

  27. arXiv:1909.03991  [pdf, other]

    cs.LG cs.CG stat.ML

    Fast And Efficient Boolean Matrix Factorization By Geometric Segmentation

    Authors: Changlin Wan, Wennan Chang, Tong Zhao, Mengya Li, Sha Cao, Chi Zhang

    Abstract: Boolean matrices have been used to represent digital information in many fields, including bank transactions, crime records, natural language processing, protein-protein interaction, etc. Boolean matrix factorization (BMF) aims to find an approximation of a binary matrix as the Boolean product of two low rank Boolean matrices, which could generate vast amount of information for the patterns of relatio…

    Submitted 10 February, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted at AAAI 2020

  28. arXiv:1908.02612  [pdf, ps, other]

    eess.AS cs.LG cs.SD stat.ML

    An End-to-End Text-independent Speaker Verification Framework with a Keyword Adversarial Network

    Authors: Sungrack Yun, Janghoon Cho, Jungyun Eum, Wonil Chang, Kyuwoong Hwang

    Abstract: This paper presents an end-to-end text-independent speaker verification framework by jointly considering the speaker embedding (SE) network and automatic speech recognition (ASR) network. The SE network learns to output an embedding vector which distinguishes the speaker characteristics of the input utterance, while the ASR network learns to recognize the phonetic context of the input. In training…

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: To appear in INTERSPEECH 2019

  29. arXiv:1907.13554  [pdf, other]

    stat.ME stat.AP stat.CO

    Ice Model Calibration Using Semi-continuous Spatial Data

    Authors: Won Chang, Bledar A. Konomi, Georgios Karagiannis, Yawen Guan, Murali Haran

    Abstract: Rapid changes in Earth's cryosphere caused by human activity can lead to significant environmental impacts. Computer models provide a useful tool for understanding the behavior and projecting the future of Arctic and Antarctic ice sheets. However, these models are typically subject to large parametric uncertainties due to poorly constrained model input parameters that govern the behavior of simula…

    Submitted 31 July, 2019; originally announced July 2019.

  30. arXiv:1906.03950  [pdf, other]

    cs.LG cs.AI stat.ML

    Domain-Specific Batch Normalization for Unsupervised Domain Adaptation

    Authors: Woong-Gi Chang, Tackgeun You, Seonguk Seo, Suha Kwak, Bohyung Han

    Abstract: We propose a novel unsupervised domain adaptation framework based on domain-specific batch normalization in deep neural networks. We aim to adapt to both domains by specializing batch normalization layers in convolutional neural networks while allowing them to share all other model parameters, which is realized by a two-stage algorithm. In the first stage, we estimate pseudo-labels for the example…

    Submitted 27 May, 2019; originally announced June 2019.

  31. arXiv:1905.06942  [pdf, other]

    cs.IT stat.ML

    Random Sampling for Distributed Coded Matrix Multiplication

    Authors: Wei-Ting Chang, Ravi Tandon

    Abstract: Matrix multiplication is a fundamental building block for large scale computations arising in various applications, including machine learning. There has been significant recent interest in using coding to speed up distributed matrix multiplication in a way that is robust to stragglers (i.e., machines that may perform slower computations). In many scenarios, instead of exact computation, approximate matr…

    Submitted 16 May, 2019; originally announced May 2019.

  32. arXiv:1905.02331  [pdf, other]

    cs.LG cs.AI cs.IR stat.ML

    Taming Pretrained Transformers for Extreme Multi-label Text Classification

    Authors: Wei-Cheng Chang, Hsiang-Fu Yu, Kai Zhong, Yiming Yang, Inderjit Dhillon

    Abstract: We consider the extreme multi-label text classification (XMC) problem: given an input text, return the most relevant labels from a large label collection. For example, the input text could be a product description on Amazon.com and the labels could be product categories. XMC is an important yet challenging problem in the NLP community. Recently, deep pretrained transformer models have achieved sta…

    Submitted 23 June, 2020; v1 submitted 6 May, 2019; originally announced May 2019.

    Comments: KDD 2020 Applied Data Track

  33. arXiv:1902.10214  [pdf, other]

    stat.ML cs.AI cs.LG

    Implicit Kernel Learning

    Authors: Chun-Liang Li, Wei-Cheng Chang, Youssef Mroueh, Yiming Yang, Barnabás Póczos

    Abstract: Kernels are powerful and versatile tools in machine learning and statistics. Although the notion of universal kernels and characteristic kernels has been studied, kernel selection still greatly influences the empirical performance. While learning the kernel in a data driven way has been investigated, in this paper we explore learning the spectral distribution of kernel via implicit generative mode…

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: In the Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019)

  34. arXiv:1901.06077  [pdf, other]

    stat.ML cs.LG

    Kernel Change-point Detection with Auxiliary Deep Generative Models

    Authors: Wei-Cheng Chang, Chun-Liang Li, Yiming Yang, Barnabás Póczos

    Abstract: Detecting the emergence of abrupt property changes in time series is a challenging problem. Kernel two-sample tests have been studied for this task, as they make fewer assumptions on the distributions than traditional parametric approaches. However, selecting kernels is non-trivial in practice. Although kernel selection for two-sample tests has been studied, the insufficient samples in change point det…

    Submitted 17 January, 2019; originally announced January 2019.

    Comments: To appear in ICLR 2019

  35. arXiv:1811.09734  [pdf]

    stat.AP stat.ME

    A Regularized Spatial Market Segmentation Method with Dirichlet Process Gaussian Mixture Prior

    Authors: Won Chang, Sunghoon Kim, Heewon Chae

    Abstract: Spatially referenced data are increasingly available thanks to the development of modern GPS technology. They also provide rich opportunities for spatial analytics in the field of marketing science. Our main interest is to propose a new efficient statistical framework to conduct spatial segmentation analysis for restaurants located in a metropolitan area in the U.S. The spatial segmentation proble…

    Submitted 23 November, 2018; originally announced November 2018.

  36. arXiv:1810.06608  [pdf, other]

    stat.AP stat.ME

    Computer model calibration based on image warping metrics: an application for sea ice deformation

    Authors: Yawen Guan, Christian Sampson, J. Derek Tucker, Won Chang, Anirban Mondal, Murali Haran, Deborah Sulsky

    Abstract: Arctic sea ice plays an important role in the global climate. Sea ice models governed by physical equations have been used to simulate the state of the ice including characteristics such as ice thickness, concentration, and motion. More recent models also attempt to capture features such as fractures or leads in the ice. These simulated features can be partially misaligned or misshapen when compar…

    Submitted 24 January, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

  37. arXiv:1807.03933  [pdf]

    cs.LG cs.IT stat.ML

    Instance-based entropy fuzzy support vector machine for imbalanced data

    Authors: Poongjin Cho, Minhyuk Lee, Woojin Chang

    Abstract: Imbalanced classification has been a major challenge for machine learning because many standard classifiers mainly focus on balanced datasets and tend to have biased results towards the majority class. We modify entropy fuzzy support vector machine (EFSVM) and introduce instance-based entropy fuzzy support vector machine (IEFSVM). Both EFSVM and IEFSVM use the entropy information of k-nearest neig…

    Submitted 10 July, 2018; originally announced July 2018.

  38. arXiv:1712.04075  [pdf, other]

    stat.AP physics.geo-ph

    Diagnosing added value of convection-permitting regional models using precipitation event identification and tracking

    Authors: Won Chang, Jiali Wang, Julian Marohnic, Rao Kotamarthi, Elisabeth J. Moyer

    Abstract: Dynamical downscaling with high-resolution regional climate models may offer the possibility of realistically reproducing precipitation and weather events in climate simulations. As resolutions fall to order kilometers, the use of explicit rather than parametrized convection may offer even greater fidelity. However, these increased model resolutions both allow and require increasingly complex diag…

    Submitted 11 December, 2017; originally announced December 2017.

  39. arXiv:1706.00476  [pdf, other]

    math.OC cs.LG stat.ML

    The Mixing method: low-rank coordinate descent for semidefinite programming with diagonal constraints

    Authors: Po-Wei Wang, Wei-Cheng Chang, J. Zico Kolter

    Abstract: In this paper, we propose a low-rank coordinate descent approach to structured semidefinite programming with diagonal constraints. The approach, which we call the Mixing method, is extremely simple to implement, has no free parameters, and typically attains an order of magnitude or better improvement in optimization performance over the current state of the art. We show that the method is strictly…

    Submitted 4 July, 2018; v1 submitted 1 June, 2017; originally announced June 2017.

  40. arXiv:1705.08584  [pdf, other]

    cs.LG cs.AI stat.ML

    MMD GAN: Towards Deeper Understanding of Moment Matching Network

    Authors: Chun-Liang Li, Wei-Cheng Chang, Yu Cheng, Yiming Yang, Barnabás Póczos

    Abstract: Generative moment matching network (GMMN) is a deep generative model that differs from Generative Adversarial Network (GAN) by replacing the discriminator in GAN with a two-sample test based on kernel maximum mean discrepancy (MMD). Although some theoretical guarantees of MMD have been studied, the empirical performance of GMMN is still not as competitive as that of GAN on challenging and large be…

    Submitted 27 November, 2017; v1 submitted 23 May, 2017; originally announced May 2017.

    Comments: In the Proceedings of Thirty-first Annual Conference on Neural Information Processing Systems (NIPS 2017)

  41. arXiv:1705.08525  [pdf, other]

    cs.LG stat.ML

    Data-driven Random Fourier Features using Stein Effect

    Authors: Wei-Cheng Chang, Chun-Liang Li, Yiming Yang, Barnabas Poczos

    Abstract: Large-scale kernel approximation is an important problem in machine learning research. Approaches using random Fourier features have become increasingly popular [Rahimi and Recht, 2007], where kernel approximation is treated as empirical mean estimation via Monte Carlo (MC) or Quasi-Monte Carlo (QMC) integration [Yang et al., 2014]. A limitation of the current approaches is that all the features r…

    Submitted 23 May, 2017; originally announced May 2017.

    Comments: To appear in International Joint Conference on Artificial Intelligence (IJCAI), 2017

  42. Changes in Spatio-temporal Precipitation Patterns in Changing Climate Conditions

    Authors: Won Chang, Michael L. Stein, Jiali Wang, V. Rao Kotamarthi, Elisabeth J. Moyer

    Abstract: Climate models robustly imply that some significant change in precipitation patterns will occur. Models consistently project that the intensity of individual precipitation events increases by approximately 6-7%/K, following the increase in atmospheric water content, but that total precipitation increases by a lesser amount (1-2 %/K in the global average in transient runs). Some other aspect of pre…

    Submitted 24 May, 2016; v1 submitted 6 January, 2016; originally announced January 2016.

    Comments: This work has been submitted for publication. Copyright in this work may be transferred without further notice, and this version may no longer be accessible

  43. arXiv:1510.01676  [pdf, other]

    stat.AP

    Improving Ice Sheet Model Calibration Using Paleoclimate and Modern Data

    Authors: Won Chang, Murali Haran, Patrick Applegate, David Pollard

    Abstract: Human-induced climate change may cause significant ice volume loss from the West Antarctic Ice Sheet (WAIS). Projections of ice volume change from ice-sheet models and corresponding future sea-level rise have large uncertainties due to poorly constrained input parameters. In most future applications to date, model calibration has utilized only modern or recent (decadal) observations, leaving input…

    Submitted 24 August, 2016; v1 submitted 6 October, 2015; originally announced October 2015.

    Journal ref: The Annals of Applied Statistics, 10 (4), 2274-2302 (2016)

  44. Calibrating an ice sheet model using high-dimensional binary spatial data

    Authors: Won Chang, Murali Haran, Patrick Applegate, David Pollard

    Abstract: Rapid retreat of ice in the Amundsen Sea sector of West Antarctica may cause drastic sea level rise, posing significant risks to populations in low-lying coastal regions. Calibration of computer models representing the behavior of the West Antarctic Ice Sheet is key for informative projections of future sea level rise. However, both the relevant observations and the model output are high-dimension…

    Submitted 20 May, 2016; v1 submitted 8 January, 2015; originally announced January 2015.

    Journal ref: Journal of the American Statistical Association (2016), Volume 111, Issue 513, 57-72

  45. arXiv:1308.0049  [pdf, other]

    stat.ME stat.CO

    A composite likelihood approach to computer model calibration using high-dimensional spatial data

    Authors: Won Chang, Murali Haran, Roman Olson, Klaus Keller

    Abstract: Computer models are used to model complex processes in various disciplines. Often, a key source of uncertainty in the behavior of complex computer models is uncertainty due to unknown model input parameters. Statistical computer model calibration is the process of inferring model parameter values, along with associated uncertainties, from observations of the physical process and from model outputs…

    Submitted 31 July, 2013; originally announced August 2013.

  46. arXiv:1303.1382  [pdf, ps, other]

    stat.AP stat.ME

    Fast dimension-reduced climate model calibration and the effect of data aggregation

    Authors: Won Chang, Murali Haran, Roman Olson, Klaus Keller

    Abstract: How will the climate system respond to anthropogenic forcings? One approach to this question relies on climate model projections. Current climate projections are considerably uncertain. Characterizing and, if possible, reducing this uncertainty is an area of ongoing research. We consider the problem of making projections of the North Atlantic meridional overturning circulation (AMOC). Uncertaintie…

    Submitted 31 July, 2014; v1 submitted 6 March, 2013; originally announced March 2013.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS733 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS733

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 2, 649-673

  47. $γ$-SUP: A clustering algorithm for cryo-electron microscopy images of asymmetric particles

    Authors: Ting-Li Chen, Dai-Ni Hsieh, Hung Hung, I-Ping Tu, Pei-Shien Wu, Yi-Ming Wu, Wei-Hau Chang, Su-Yun Huang

    Abstract: Cryo-electron microscopy (cryo-EM) has recently emerged as a powerful tool for obtaining three-dimensional (3D) structures of biological macromolecules in native states. A minimum cryo-EM image data set for deriving a meaningful reconstruction is comprised of thousands of randomly orientated projections of identical particles photographed with a small number of electrons. The computation of 3D str…

    Submitted 25 April, 2014; v1 submitted 9 May, 2012; originally announced May 2012.

    Comments: Published at http://dx.doi.org/10.1214/13-AOAS680 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS680

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 1, 259-285