Showing 1–39 of 39 results for author: Xiao, H

Searching in archive stat.
  1. arXiv:2312.08583  [pdf, other]

    cs.CL stat.ML

    ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks

    Authors: Xiaoxia Wu, Haojun Xia, Stephen Youn, Zhen Zheng, Shiyang Chen, Arash Bakhtiari, Michael Wyatt, Reza Yazdani Aminabadi, Yuxiong He, Olatunji Ruwase, Leon Song, Zhewei Yao

    Abstract: This study examines 4-bit quantization methods like GPTQ in large language models (LLMs), highlighting GPTQ's overfitting and limited enhancement in Zero-Shot tasks. While prior works merely focusing on zero-shot measurement, we extend task scope to more generative categories such as code generation and abstractive summarization, in which we found that INT4 quantization can significantly underperf…

    Submitted 18 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

  2. arXiv:2310.15932  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    Online Robust Mean Estimation

    Authors: Daniel M. Kane, Ilias Diakonikolas, Hanshen Xiao, Sihan Liu

    Abstract: We study the problem of high-dimensional robust mean estimation in an online setting. Specifically, we consider a scenario where $n$ sensors are measuring some common, ongoing phenomenon. At each time step $t=1,2,\ldots,T$, the $i^{th}$ sensor reports its readings $x^{(i)}_t$ for that time step. The algorithm must then commit to its estimate $μ_t$ for the true mean value of the process at time…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: To appear in SODA2024

  3. Beyond expected values: Making environmental decisions using value of information analysis when measurement outcome matters

    Authors: Morenikeji D. Akinlotan, David J. Warne, Kate J. Helmstedt, Sarah A. Vollert, Iadine Chadès, Ryan F. Heneghan, Hui Xiao, Matthew P. Adams

    Abstract: In ecological and environmental contexts, management actions must sometimes be chosen urgently. Value of information (VoI) analysis provides a quantitative toolkit for projecting the improved management outcomes expected after making additional measurements. However, traditional VoI analysis reports metrics as expected values (i.e. risk-neutral). This can be problematic because expected values hid…

    Submitted 14 March, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: 53 pages, 3 figures

    Journal ref: Ecological Indicators 160 (2024) 111828

  4. arXiv:2304.09981  [pdf, other]

    stat.ME cs.LG q-bio.QM

    Interpretable (not just posthoc-explainable) heterogeneous survivor bias-corrected treatment effects for assignment of postdischarge interventions to prevent readmissions

    Authors: Hongjing Xia, Joshua C. Chang, Sarah Nowak, Sonya Mahajan, Rohit Mahajan, Ted L. Chang, Carson C. Chow

    Abstract: We used survival analysis to quantify the impact of postdischarge evaluation and management (E/M) services in preventing hospital readmission or death. Our approach avoids a specific pitfall of applying machine learning to this problem, which is an inflated estimate of the effect of interventions, due to survivors bias -- where the magnitude of inflation may be conditional on heterogeneous confoun…

    Submitted 3 August, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: Submitted

    Journal ref: PMLR 219:884-905, 2023

  5. arXiv:2303.05223  [pdf, other]

    stat.ME

    LEAP: The latent exchangeability prior for borrowing information from historical data

    Authors: Ethan M. Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, H. Amy Xia, Joseph G. Ibrahim

    Abstract: It is becoming increasingly popular to elicit informative priors on the basis of historical data. Popular existing priors, including the power prior, commensurate prior, and robust meta-analytic prior provide blanket discounting. Thus, if only a subset of participants in the historical data are exchangeable with the current data, these priors may not be appropriate. In order to combat this issue,…

    Submitted 9 March, 2023; originally announced March 2023.

  6. arXiv:2208.12814  [pdf, other]

    cs.CY cs.AI cs.LG stat.AP

    Interpretable (not just posthoc-explainable) medical claims modeling for discharge placement to prevent avoidable all-cause readmissions or death

    Authors: Joshua C. Chang, Ted L. Chang, Carson C. Chow, Rohit Mahajan, Sonya Mahajan, Joe Maisog, Shashaank Vattikuti, Hongjing Xia

    Abstract: We developed an inherently interpretable multilevel Bayesian framework for representing variation in regression coefficients that mimics the piecewise linearity of ReLU-activated deep neural networks. We used the framework to formulate a survival model for using medical claims to predict hospital readmission and death that focuses on discharge placement, adjusting for confounding in estimating cau…

    Submitted 29 January, 2023; v1 submitted 28 August, 2022; originally announced August 2022.

    Comments: In review

  7. arXiv:2110.00928  [pdf, other]

    stat.ME

    Multi-linear Tensor Autoregressive Models

    Authors: Zebang Li, Han Xiao

    Abstract: Contemporary time series analysis has seen more and more tensor type data, from many fields. For example, stocks can be grouped according to Size, Book-to-Market ratio, and Operating Profitability, leading to a 3-way tensor observation at each month. We propose an autoregressive model for the tensor-valued time series, with autoregressive terms depending on multi-linear coefficient matrices. Compa…

    Submitted 3 October, 2021; originally announced October 2021.

  8. arXiv:2110.00174  [pdf, other]

    cs.LG stat.ML

    Empirical Quantitative Analysis of COVID-19 Forecasting Models

    Authors: Yun Zhao, Yuqing Wang, Junfeng Liu, Haotian Xia, Zhenni Xu, Qinghang Hong, Zhiyang Zhou, Linda Petzold

    Abstract: COVID-19 has been a public health emergency of international concern since early 2020. Reliable forecasting is critical to diminish the impact of this disease. To date, a large number of different forecasting models have been proposed, mainly including statistical models, compartmental models, and deep learning models. However, due to various uncertain factors across different regions such as econ…

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: ICDM workshop 2021

  9. arXiv:2108.09431  [pdf, other]

    stat.ME math.ST

    Equivariant Variance Estimation for Multiple Change-point Model

    Authors: Ning Hao, Yue Selena Niu, Han Xiao

    Abstract: The variance of noise plays an important role in many change-point detection procedures and the associated inferences. Most commonly used variance estimators require strong assumptions on the true mean structure or normality of the error distribution, which may not hold in applications. More importantly, the qualities of these estimators have not been discussed systematically in the literature. In…

    Submitted 15 November, 2023; v1 submitted 21 August, 2021; originally announced August 2021.

    Comments: 44 pages

  10. arXiv:2107.11136  [pdf, other]

    cs.LG cs.CR stat.ML

    High Dimensional Differentially Private Stochastic Optimization with Heavy-tailed Data

    Authors: Lijie Hu, Shuo Ni, Hanshen Xiao, Di Wang

    Abstract: As one of the most fundamental problems in machine learning, statistics and differential privacy, Differentially Private Stochastic Convex Optimization (DP-SCO) has been extensively studied in recent years. However, most of the previous work can only handle either regular data distribution or irregular data in the low dimensional space case. To better understand the challenges arising from irregul…

    Submitted 9 August, 2021; v1 submitted 23 July, 2021; originally announced July 2021.

  11. arXiv:2106.00612  [pdf, other]

    eess.SP stat.AP

    Weak target detection with multi-bit quantization in colocated MIMO radar

    Authors: Hang Xiao, Shixing Yang, Wei Yi

    Abstract: We consider the weak target detection problem with unknown parameter in colocated multiple-input multiple-output (MIMO) radar. To cope with the sheer amount of data for large-size systems, a multi-bit quantizer is utilized in the sampling process. As a low-complexity alternative to classic generalized likelihood ratio test (GLRT) for quantized data, we propose the multi-bit detector on Rao test wi…

    Submitted 5 September, 2021; v1 submitted 29 May, 2021; originally announced June 2021.

    Comments: 6 pages, 3 figures, conference

  12. arXiv:2105.05532  [pdf, other]

    stat.ME econ.EM

    Generalized Autoregressive Moving Average Models with GARCH Errors

    Authors: Tingguo Zheng, Han Xiao, Rong Chen

    Abstract: One of the important and widely used classes of models for non-Gaussian time series is the generalized autoregressive moving average models (GARMA), which specifies an ARMA structure for the conditional mean process of the underlying time series. However, in many applications one often encounters conditional heteroskedasticity. In this paper we propose a new class of models, referred to as GARMA-GA…

    Submitted 12 May, 2021; originally announced May 2021.

  13. arXiv:2105.00866  [pdf]

    cs.LG stat.AP

    Causal Discovery of Flight Service Process Based on Event Sequence

    Authors: Zhiwei Xing, Lin Zhang, Huan Xia, Qian Luo, Zhao-xin Chen

    Abstract: The development of the civil aviation industry has continuously increased the requirements for the efficiency of airport ground support services. In the existing ground support research, there has not yet been a process model that directly obtains support from the ground support log to study the causal relationship between service nodes and flight delays. Most ground support studies mainly use mac…

    Submitted 28 April, 2021; originally announced May 2021.

  14. arXiv:2011.04418  [pdf, other]

    astro-ph.HE cs.LG gr-qc stat.ML

    Improved deep learning techniques in gravitational-wave data analysis

    Authors: Heming Xia, Lijing Shao, Junjie Zhao, Zhoujian Cao

    Abstract: In recent years, convolutional neural network (CNN) and other deep learning models have been gradually introduced into the area of gravitational-wave (GW) data processing. Compared with the traditional matched-filtering techniques, CNN has significant advantages in efficiency in GW signal detection tasks. In addition, matched-filtering techniques are based on the template bank of the existing theo…

    Submitted 23 December, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: 13 pages, 11 figures; accepted by PRD

    Journal ref: Phys. Rev. D 103, 024040 (2021)

  15. arXiv:2010.11082  [pdf, ps, other]

    cs.LG cs.CR stat.ML

    On Differentially Private Stochastic Convex Optimization with Heavy-tailed Data

    Authors: Di Wang, Hanshen Xiao, Srini Devadas, Jinhui Xu

    Abstract: In this paper, we consider the problem of designing Differentially Private (DP) algorithms for Stochastic Convex Optimization (SCO) on heavy-tailed data. The irregularity of such data violates some key assumptions used in almost all existing DP-SCO and DP-ERM methods, resulting in failure to provide the DP guarantees. To better understand this type of challenges, we provide in this paper a compreh…

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: Published in ICML 2020

  16. arXiv:2009.07875  [pdf, other]

    stat.ME

    A Survival Mediation Model with Bayesian Model Averaging

    Authors: Jie Zhou, Xun Jiang, H. Amy Xia, Peng Wei, Brian P. Hobbs

    Abstract: Determining the extent to which a patient is benefiting from cancer therapy is challenging. Criteria for quantifying the extent of "tumor response" observed within a few cycles of treatment have been established for various types of solid as well as hematologic malignancies. These measures comprise the primary endpoints of phase II trials. Regulatory approvals of new cancer therapies, however, are…

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: 25 pages, 3 figures and 3 tables in the main manuscript. Supplementary materials included

  17. arXiv:1912.02955  [pdf, other]

    stat.ME stat.ML

    Hybrid Kronecker Product Decomposition and Approximation

    Authors: Chencheng Cai, Rong Chen, Han Xiao

    Abstract: Discovering the underlying low dimensional structure of high dimensional data has attracted a significant amount of researches recently and has shown to have a wide range of applications. As an effective dimension reduction tool, singular value decomposition is often used to analyze high dimensional matrices, which are traditionally assumed to have a low rank matrix approximation. In this paper, w…

    Submitted 5 December, 2019; originally announced December 2019.

  18. arXiv:1912.02392  [pdf, other]

    math.ST stat.ME stat.ML

    KoPA: Automated Kronecker Product Approximation

    Authors: Chencheng Cai, Rong Chen, Han Xiao

    Abstract: We consider the problem of matrix approximation and denoising induced by the Kronecker product decomposition. Specifically, we propose to approximate a given matrix by the sum of a few Kronecker products of matrices, which we refer to as the Kronecker product approximation (KoPA). Because the Kronecker product is an extension of the outer product from vectors to matrices, KoPA extends the low rank…

    Submitted 26 August, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

  19. arXiv:1911.11774  [pdf, other]

    stat.ML cs.LG stat.ME

    Matrix Completion using Kronecker Product Approximation

    Authors: Chencheng Cai, Rong Chen, Han Xiao

    Abstract: A matrix completion problem is to recover the missing entries in a partially observed matrix. Most of the existing matrix completion methods assume a low rank structure of the underlying complete matrix. In this paper, we introduce an alternative and more general form of the underlying complete matrix, which assumes a low Kronecker rank instead of a low regular rank, but includes the latter as a s…

    Submitted 13 November, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

  20. arXiv:1911.06683  [pdf, other]

    physics.comp-ph stat.ME

    Enforcing Boundary Conditions on Physical Fields in Bayesian Inversion

    Authors: Carlos A. Michelén Ströfer, Xinlei Zhang, Heng Xiao, Olivier Coutier-Delgosha

    Abstract: Inverse problems in computational mechanics consist of inferring physical fields that are latent in the model describing some observable fields. For instance, an inverse problem of interest is inferring the Reynolds stress field in the Navier--Stokes equations describing mean fluid velocity and pressure. The physical nature of the latent fields means they have their own set of physical constra…

    Submitted 15 November, 2019; originally announced November 2019.

  21. arXiv:1911.06671  [pdf, other]

    physics.comp-ph stat.ML

    Enforcing Deterministic Constraints on Generative Adversarial Networks for Emulating Physical Systems

    Authors: Zeng Yang, Jin-Long Wu, Heng Xiao

    Abstract: Generative adversarial networks (GANs) were initially proposed to generate images by learning from a large number of samples. Recently, GANs have been used to emulate complex physical systems such as turbulent flows. However, a critical question must be answered before GANs can be considered trusted emulators for physical systems: do GANs-generated samples conform to the various physical constrain…

    Submitted 21 November, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

  22. arXiv:1909.00225  [pdf, other]

    stat.OT math.CO

    Statistical Robust Chinese Remainder Theorem for Multiple Numbers

    Authors: Hanshen Xiao, Nan Du, Zhikang T. Wang, Guoqiang Xiao

    Abstract: Generalized Chinese Remainder Theorem (CRT) is a well-known approach to solve ambiguity resolution related problems. In this paper, we study the robust CRT reconstruction for multiple numbers from a view of statistics. To the best of our knowledge, it is the first rigorous analysis on the underlying statistical model of CRT-based multiple parameter estimation. To address the problem, two novel app…

    Submitted 31 August, 2019; originally announced September 2019.

  23. arXiv:1908.00618  [pdf, other]

    stat.CO

    Analyzing Basket Trials under Multisource Exchangeability Assumptions

    Authors: Michael J. Kane, Nan Chen, Alexander M. Kaizer, Xun Jiang, H. Amy Xia, Brian P. Hobbs

    Abstract: Basket designs are prospective clinical trials that are devised with the hypothesis that the presence of selected molecular features determine a patient's subsequent response to a particular "targeted" treatment strategy. Basket trials are designed to enroll multiple clinical subpopulations to which it is assumed that the therapy in question offers beneficial efficacy in the presence of the target…

    Submitted 1 August, 2019; originally announced August 2019.

    Comments: 18 pages, 4 figures, 3 tables, submitted to the Journal of Open Source Software

    MSC Class: 62-04 ACM Class: G.3

  24. arXiv:1907.05496  [pdf]

    cs.LG stat.ML

    Online Learning to Estimate Warfarin Dose with Contextual Linear Bandits

    Authors: Hai Xiao

    Abstract: Warfarin is one of the most commonly used oral blood anticoagulant agent in the world, the proper dose of Warfarin is difficult to establish not only because it is substantially variant among patients, but also adverse even severe consequences of taking an incorrect dose. Typical practice is to prescribe an initial dose, then doctor closely monitor patient response and adjust accordingly to the co…

    Submitted 11 July, 2019; originally announced July 2019.

  25. arXiv:1906.08113  [pdf, other]

    cs.LG stat.ML

    Wasserstein Adversarial Imitation Learning

    Authors: Huang Xiao, Michael Herman, Joerg Wagner, Sebastian Ziesche, Jalal Etesami, Thai Hong Linh

    Abstract: Imitation Learning describes the problem of recovering an expert policy from demonstrations. While inverse reinforcement learning approaches are known to be very sample-efficient in terms of expert demonstrations, they usually require problem-dependent reward functions or a (task-)specific reward-function regularization. In this paper, we show a natural connection between inverse reinforcement lea…

    Submitted 19 June, 2019; originally announced June 2019.

  26. arXiv:1905.06841  [pdf, other]

    physics.comp-ph physics.flu-dyn stat.ML

    Enforcing Statistical Constraints in Generative Adversarial Networks for Modeling Chaotic Dynamical Systems

    Authors: Jin-Long Wu, Karthik Kashinath, Adrian Albert, Dragos Chirila, Prabhat, Heng Xiao

    Abstract: Simulating complex physical systems often involves solving partial differential equations (PDEs) with some closures due to the presence of multi-scale physics that cannot be fully resolved. Therefore, reliable and accurate closure models for unresolved physics remains an important requirement for many computational physics problems, e.g., turbulence simulation. Recently, several researchers have a…

    Submitted 13 May, 2019; originally announced May 2019.

  27. arXiv:1904.10639  [pdf, other]

    math.OC stat.ME

    Efficient Simulation Budget Allocation for Subset Selection Using Regression Metamodels

    Authors: Fei Gao, Zhongshun Shi, Siyang Gao, Hui Xiao

    Abstract: This research considers the ranking and selection (R&S) problem of selecting the optimal subset from a finite set of alternative designs. Given the total simulation budget constraint, we aim to maximize the probability of correctly selecting the top-m designs. In order to improve the selection efficiency, we incorporate the information from across the domain into regression metamodels. In this res…

    Submitted 24 April, 2019; originally announced April 2019.

  28. arXiv:1904.08249  [pdf, other]

    cs.LG stat.ML

    Bonsai -- Diverse and Shallow Trees for Extreme Multi-label Classification

    Authors: Sujay Khandagale, Han Xiao, Rohit Babbar

    Abstract: Extreme multi-label classification (XMC) refers to supervised multi-label learning involving hundreds of thousand or even millions of labels. In this paper, we develop a suite of algorithms, called Bonsai, which generalizes the notion of label representation in XMC, and partitions the labels in the representation space to learn shallow trees. We show three concrete realizations of this label repre…

    Submitted 10 August, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

  29. arXiv:1902.09347  [pdf, other]

    cs.LG cs.IR stat.ML

    Efficient Path Prediction for Semi-Supervised and Weakly Supervised Hierarchical Text Classification

    Authors: Huiru Xiao, Xin Liu, Yangqiu Song

    Abstract: Hierarchical text classification has many real-world applications. However, labeling a large number of documents is costly. In practice, we can use semi-supervised learning or weakly supervised learning (e.g., dataless classification) to reduce the labeling cost. In this paper, we propose a path cost-sensitive learning algorithm to utilize the structural information and further make use of unlabel…

    Submitted 25 February, 2019; originally announced February 2019.

    Comments: Accepted by 2019 World Wide Web Conference (WWW19)

  30. arXiv:1812.08916  [pdf, other]

    stat.ME

    Autoregressive Models for Matrix-Valued Time Series

    Authors: Rong Chen, Han Xiao, Dan Yang

    Abstract: In finance, economics and many other fields, observations in a matrix form are often generated over time. For example, a set of key economic indicators are regularly reported in different countries every quarter. The observations at each quarter neatly form a matrix and are observed over many consecutive quarters. Dynamic transport networks with observations generated on the edges can be formed as…

    Submitted 24 July, 2019; v1 submitted 20 December, 2018; originally announced December 2018.

    MSC Class: 62M10; 62H99

  31. arXiv:1812.04808  [pdf, other]

    stat.ML cs.LG stat.ME

    Kernel Treelets

    Authors: Hedi Xia, Hector D. Ceniceros

    Abstract: A new method for hierarchical clustering is presented. It combines treelets, a particular multiscale decomposition of data, with a projection on a reproducing kernel Hilbert space. The proposed approach, called kernel treelets (KT), effectively substitutes the correlation coefficient matrix used in treelets with a symmetric, positive semi-definite matrix efficiently constructed from a kernel funct…

    Submitted 11 December, 2018; originally announced December 2018.

  32. arXiv:1812.02598  [pdf]

    stat.ML cs.LG stat.AP

    Finding the needle in high-dimensional haystack: A tutorial on canonical correlation analysis

    Authors: Hao-Ting Wang, Jonathan Smallwood, Janaina Mourao-Miranda, Cedric Huchuan Xia, Theodore D. Satterthwaite, Danielle S. Bassett, Danilo Bzdok

    Abstract: Since the beginning of the 21st century, the size, breadth, and granularity of data in biology and medicine has grown rapidly. In the example of neuroscience, studies with thousands of subjects are becoming more common, which provide extensive phenotyping on the behavioral, neural, and genomic level with hundreds of variables. The complexity of such big data repositories offer new opportunities an…

    Submitted 6 December, 2018; originally announced December 2018.

  33. arXiv:1811.11339  [pdf, other]

    stat.ML cs.LG

    Statistical Robust Chinese Remainder Theorem for Multiple Numbers: Wrapped Gaussian Mixture Model

    Authors: Nan Du, Zhikang Wang, Hanshen Xiao

    Abstract: Generalized Chinese Remainder Theorem (CRT) has been shown to be a powerful approach to solve the ambiguity resolution problem. However, with its close relationship to number theory, study in this area is mainly from a coding theory perspective under deterministic conditions. Nevertheless, it can be proved that even with the best deterministic condition known, the probability of success in robust…

    Submitted 27 November, 2018; originally announced November 2018.

  34. arXiv:1808.07449  [pdf, other]

    stat.ME

    Robust Spatial Extent Inference with a Semiparametric Bootstrap Joint Testing Procedure

    Authors: Simon N. Vandekar, Theodore D. Satterthwaite, Cedric H. Xia, Kosha Ruparel, Ruben C. Gur, Raquel E. Gur, Russell T. Shinohara

    Abstract: Spatial extent inference (SEI) is widely used across neuroimaging modalities to study brain-phenotype associations that inform our understanding of disease. Recent studies have shown that Gaussian random field (GRF) based tools can have inflated family-wise error rates (FWERs). This has led to fervent discussion as to which preprocessing steps are necessary to control the FWER using GRF-based SEI.…

    Submitted 22 August, 2018; originally announced August 2018.

  35. arXiv:1807.09751  [pdf, other]

    cs.IR cs.LG stat.ML

    Multi-Perspective Neural Architecture for Recommendation System

    Authors: Han Xiao, Yidong Chen, Xiaodong Shi

    Abstract: Currently, there starts a research trend to leverage neural architecture for recommendation systems. Though several deep recommender models are proposed, most methods are too simple to characterize users' complex preference. In this paper, for a fine-grain analysis, users' ratings are explained from multiple perspectives, based on which, we propose our neural architecture. Specifically, our model…

    Submitted 12 July, 2018; originally announced July 2018.

  36. arXiv:1804.07933  [pdf, other]

    cs.LG cs.CR cs.GT stat.ML

    Is feature selection secure against training data poisoning?

    Authors: Huang Xiao, Battista Biggio, Gavin Brown, Giorgio Fumera, Claudia Eckert, Fabio Roli

    Abstract: Learning in adversarial settings is becoming an important task for application domains where attackers may inject malicious data into the training set to subvert normal operation of data-driven technologies. Feature selection has been widely used in machine learning for security applications to improve generalization and computational efficiency, although it is not clear whether its use may be ben…

    Submitted 21 April, 2018; originally announced April 2018.

    Journal ref: Proc. of the 32nd ICML, Lille, France, 2015. JMLR: W&CP vol. 37

  37. arXiv:1801.02901  [pdf, other]

    cs.LG stat.ML

    Convexification of Neural Graph

    Authors: Han Xiao

    Abstract: Traditionally, most complex intelligence architectures are extremely non-convex, which could not be well performed by convex optimization. However, this paper decomposes complex structures into three types of nodes: operators, algorithms and functions. Iteratively, propagating from node to node along edge, we prove that "regarding the tree-structured neural graph, it is nearly convex in each varia…

    Submitted 13 January, 2018; v1 submitted 9 January, 2018; originally announced January 2018.

  38. arXiv:1711.01790  [pdf, ps, other]

    cs.LG cs.IT eess.SP stat.ML

    Simultaneous Block-Sparse Signal Recovery Using Pattern-Coupled Sparse Bayesian Learning

    Authors: Hang Xiao, Zhengli Xing, Linxiao Yang, Jun Fang, Yanlun Wu

    Abstract: In this paper, we consider the block-sparse signals recovery problem in the context of multiple measurement vectors (MMV) with common row sparsity patterns. We develop a new method for recovery of common row sparsity MMV signals, where a pattern-coupled hierarchical Gaussian prior model is introduced to characterize both the block-sparsity of the coefficients and the statistical dependency between…

    Submitted 6 November, 2017; originally announced November 2017.

  39. arXiv:1708.07747  [pdf, ps, other]

    cs.LG cs.CV stat.ML

    Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

    Authors: Han Xiao, Kashif Rasul, Roland Vollgraf

    Abstract: We present Fashion-MNIST, a new dataset comprising of 28x28 grayscale images of 70,000 fashion products from 10 categories, with 7,000 images per category. The training set has 60,000 images and the test set has 10,000 images. Fashion-MNIST is intended to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms, as it shares the same image s…

    Submitted 15 September, 2017; v1 submitted 25 August, 2017; originally announced August 2017.

    Comments: Dataset is freely available at https://github.com/zalandoresearch/fashion-mnist Benchmark is available at http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/