Skip to main content

Showing 1–38 of 38 results for author: Fang, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.04865  [pdf, other

    cs.LG cs.CV stat.ML

    On the Learnability of Out-of-distribution Detection

    Authors: Zhen Fang, Yixuan Li, Feng Liu, Bo Han, Jie Lu

    Abstract: Supervised learning aims to train a classifier under the assumption that training and test data are from the same distribution. To ease the above assumption, researchers have studied a more realistic setting: out-of-distribution (OOD) detection, where test data may come from classes that are unknown during training (i.e., OOD data). Due to the unavailability and diversity of OOD data, good general… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by JMLR in 7th of April, 2024. This is a journal extension of the previous NeurIPS 2022 Outstanding Paper "Is Out-of-distribution Detection Learnable?" [arXiv:2210.14707]

  2. arXiv:2402.03502  [pdf, other

    cs.LG stat.ML

    How Does Unlabeled Data Provably Help Out-of-Distribution Detection?

    Authors: Xuefeng Du, Zhen Fang, Ilias Diakonikolas, Yixuan Li

    Abstract: Using unlabeled data to regularize the machine learning models has demonstrated promise for improving safety and reliability in detecting out-of-distribution (OOD) data. Harnessing the power of unlabeled in-the-wild data is non-trivial due to the heterogeneity of both in-distribution (ID) and OOD data. This lack of a clean set of OOD samples poses significant challenges in learning an optimal OOD… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  3. arXiv:2401.11093  [pdf, other

    stat.AP

    Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding

    Authors: Haisheng Fu, Feng Liang, Jie Liang, Zhenman Fang, Guohe Zhang, **gning Han

    Abstract: Recent advancements in deep learning-based image compression are notable. However, prevalent schemes that employ a serial context-adaptive entropy model to enhance rate-distortion (R-D) performance are markedly slow. Furthermore, the complexities of the encoding and decoding networks are substantially high, rendering them unsuitable for some practical applications. In this paper, we propose two te… ▽ More

    Submitted 21 March, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted by DCC2024

  4. arXiv:2311.00836  [pdf, ps, other

    math.OC eess.SP math.PR stat.CO

    Effective filtering approach for joint parameter-state estimation in SDEs via Rao-Blackwellization and modularization

    Authors: Zhou Fang, Ankit Gupta, Mustafa Khammash

    Abstract: Stochastic filtering is a vibrant area of research in both control theory and statistics, with broad applications in many scientific fields. Despite its extensive historical development, there still lacks an effective method for joint parameter-state estimation in SDEs. The state-of-the-art particle filtering methods suffer from either sample degeneracy or information loss, with both issues stemmi… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures

    MSC Class: 62M20; 62F15; 65C05; 92-08; 93E11

  5. arXiv:2308.03666  [pdf, other

    stat.ML cs.LG

    Bridging Trustworthiness and Open-World Learning: An Exploratory Neural Approach for Enhancing Interpretability, Generalization, and Robustness

    Authors: Shide Du, Zihan Fang, Shiyang Lan, Yanchao Tan, Manuel Günther, Shi** Wang, Wenzhong Guo

    Abstract: As researchers strive to narrow the gap between machine intelligence and human through the development of artificial intelligence technologies, it is imperative that we recognize the critical importance of trustworthiness in open-world, which has become ubiquitous in all aspects of daily life for everyone. However, several challenges may create a crisis of trust in current artificial intelligence… ▽ More

    Submitted 18 October, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  6. arXiv:2305.09869  [pdf, ps, other

    cs.LG cs.SI stat.ML

    A Signed Subgraph Encoding Approach via Linear Optimization for Link Sign Prediction

    Authors: Zhihong Fang, Shaolin Tan, Yaonan Wang

    Abstract: In this paper, we consider the problem of inferring the sign of a link based on limited sign data in signed networks. Regarding this link sign prediction problem, SDGNN (Signed Directed Graph Neural Networks) provides the best prediction performance currently to the best of our knowledge. In this paper, we propose a different link sign prediction architecture call SELO (Subgraph Encoding via Linea… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  7. Investigating the spatial heterogeneity of factors influencing speeding-related crash severities using correlated random parameter order models with heterogeneity-in-means

    Authors: Renteng Yuan, Qiaojun Xiang, Zhiheng Fang, Xin Gu

    Abstract: Speeding has been acknowledged as a critical determinant in increasing the risk of crashes and their resulting injury severities. This paper demonstrates that severe speeding-related crashes within the state of Pennsylvania have a spatial clustering trend, where four crash datasets are extracted from four hotspot districts. Two log-likelihood ratio (LR) tests were conducted to determine whether sp… ▽ More

    Submitted 5 July, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

  8. arXiv:2302.00841  [pdf, other

    stat.AP stat.ME

    Longitudinal Canonical Correlation Analysis

    Authors: Seonjoo Lee, Jongwoo Choi, Zhiqian Fang, F. DuBois Bowman

    Abstract: This paper considers canonical correlation analysis for two longitudinal variables that are possibly sampled at different time resolutions with irregular grids. We modeled trajectories of the multivariate variables using random effects and found the most correlated sets of linear combinations in the latent space. Our numerical simulations showed that the longitudinal canonical correlation analysis… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: 24 pages, 16 figures

    MSC Class: 62H20

  9. arXiv:2210.14707  [pdf, other

    cs.LG stat.ML

    Is Out-of-Distribution Detection Learnable?

    Authors: Zhen Fang, Yixuan Li, Jie Lu, Jiahua Dong, Bo Han, Feng Liu

    Abstract: Supervised learning aims to train a classifier under the assumption that training and test data are from the same distribution. To ease the above assumption, researchers have studied a more realistic setting: out-of-distribution (OOD) detection, where test data may come from classes that are unknown during training (i.e., OOD data). Due to the unavailability and diversity of OOD data, good general… ▽ More

    Submitted 23 February, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022 Outstanding Paper

  10. arXiv:2207.05067  [pdf, other

    cs.AI cs.LG stat.ML

    On the Representation of Causal Background Knowledge and its Applications in Causal Inference

    Authors: Zhuangyan Fang, Ruiqi Zhao, Yue Liu, Yangbo He

    Abstract: Causal background knowledge about the existence or the absence of causal edges and paths is frequently encountered in observational studies. The shared directed edges and links of a subclass of Markov equivalent DAGs refined due to background knowledge can be represented by a causal maximally partially directed acyclic graph (MPDAG). In this paper, we first provide a sound and complete graphical c… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

  11. arXiv:2203.17262  [pdf

    stat.OT

    Length L-function for Network-Constrained Point Data

    Authors: Zidong Fang, Ci Song, Hua Shu, Jie Chen, Tianyu Liu, Xi Wang, Xiao Chen, Tao Pei

    Abstract: Network constrained points are referred to as points restricted to road networks, such as taxi pick up and drop off locations. A significant pattern of network constrained points is referred to as an aggregation; e.g., the aggregation of pick up points may indicate a high taxi demand in a particular area. Although the network K function using the shortest path network distance has been proposed to… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  12. arXiv:2108.09042  [pdf

    cs.CG stat.ME

    Identifying Aggregation Artery Architecture of constrained Origin-Destination flows using Manhattan L-function

    Authors: Zidong Fang, Hua Shu, Ci Song, Jie Chen, Tianyu Liu, Xiaohan Liu, Tao Pei

    Abstract: The movement of humans and goods in cities can be represented by constrained flow, which is defined as the movement of objects between origin and destination in road networks. Flow aggregation, namely origins and destinations aggregated simultaneously, is one of the most common patterns, say the aggregated origin-to-destination flows between two transport hubs may indicate the great traffic demand… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: 29 pages, 12 figures

  13. arXiv:2108.00511  [pdf, ps, other

    econ.EM stat.CO

    Implementing an Improved Test of Matrix Rank in Stata

    Authors: Qihui Chen, Zheng Fang, Xun Huang

    Abstract: We develop a Stata command, bootranktest, for implementing the matrix rank test of Chen and Fang (2019) in linear instrumental variable regression models. Existing rank tests employ critical values that may be too small, and hence may not even be first order valid in the sense that they may fail to control the Type I error. By appealing to the bootstrap, they devise a test that overcomes the defic… ▽ More

    Submitted 1 August, 2021; originally announced August 2021.

  14. arXiv:2107.12494  [pdf, ps, other

    econ.EM math.ST stat.ME

    A Unifying Framework for Testing Shape Restrictions

    Authors: Zheng Fang

    Abstract: This paper makes the following original contributions. First, we develop a unifying framework for testing shape restrictions based on the Wald principle. The test has asymptotic uniform size control and is uniformly consistent. Second, we examine the applicability and usefulness of some prominent shape enforcing operators in implementing our framework. In particular, in stark contrast to its use i… ▽ More

    Submitted 1 August, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

  15. A Local Method for Identifying Causal Relations under Markov Equivalence

    Authors: Zhuangyan Fang, Yue Liu, Zhi Geng, Shengyu Zhu, Yangbo He

    Abstract: Causality is important for designing interpretable and robust methods in artificial intelligence research. We propose a local approach to identify whether a variable is a cause of a given target under the framework of causal graphical models of directed acyclic graphs (DAGs). In general, the causal relation between two variables may not be identifiable from observational data as many causal DAGs e… ▽ More

    Submitted 5 March, 2022; v1 submitted 25 February, 2021; originally announced February 2021.

  16. arXiv:2012.03612  [pdf, ps, other

    cs.LG cs.AI cs.DS stat.ML

    LCS Graph Kernel Based on Wasserstein Distance in Longest Common Subsequence Metric Space

    Authors: Jianming Huang, Zhongxi Fang, Hiroyuki Kasai

    Abstract: For graph learning tasks, many existing methods utilize a message-passing mechanism where vertex features are updated iteratively by aggregation of neighbor information. This strategy provides an efficient means for graph features extraction, but obtained features after many iterations might contain too much information from other vertices, and tend to be similar to each other. This makes their re… ▽ More

    Submitted 29 October, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Journal ref: Signal Processing, Vol.189, 2021

  17. arXiv:2011.08810  [pdf, other

    stat.AP stat.ML

    Data Driven Reaction Mechanism Estimation via Transient Kinetics and Machine Learning

    Authors: M. Ross Kunz, Adam Yonge, Zongtang Fang, Andrew J. Medford, Denis Constales, Gregory Yablonsky, Rebecca Fushimi

    Abstract: Understanding the set of elementary steps and kinetics in each reaction is extremely valuable to make informed decisions about creating the next generation of catalytic materials. With physical and mechanistic complexity of industrial catalysts, it is critical to obtain kinetic information through experimental methods. As such, this work details a methodology based on the combination of transient… ▽ More

    Submitted 21 April, 2021; v1 submitted 17 November, 2020; originally announced November 2020.

  18. arXiv:2008.11682  [pdf, other

    stat.ME math.OC q-bio.QM

    Stochastic filters based on hybrid approximations of multiscale stochastic reaction networks

    Authors: Zhou Fang, Ankit Gupta, Mustafa Khammash

    Abstract: We consider the problem of estimating the dynamic latent states of an intracellular multiscale stochastic reaction network from time-course measurements of fluorescent reporters. We first prove that accurate solutions to the filtering problem can be constructed by solving the filtering problem for a reduced model that represents the dynamics as a hybrid process. The model reduction is based on exp… ▽ More

    Submitted 8 September, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: 6 pages, 1 figure. Accepted to CDC 2020

    MSC Class: 60J22; 62M20; 65C05; 92-08; 93E11

  19. arXiv:2008.01454  [pdf, other

    cs.LG stat.ML

    Learning from a Complementary-label Source Domain: Theory and Algorithms

    Authors: Yiyang Zhang, Feng Liu, Zhen Fang, Bo Yuan, Guangquan Zhang, Jie Lu

    Abstract: In unsupervised domain adaptation (UDA), a classifier for the target domain is trained with massive true-label data from the source domain and unlabeled data from the target domain. However, collecting fully-true-label data in the source domain is high-cost and sometimes impossible. Compared to the true labels, a complementary label specifies a class that a pattern does not belong to, hence collec… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: arXiv admin note: text overlap with arXiv:2007.14612

  20. arXiv:2007.14612  [pdf, other

    cs.LG cs.CV stat.ML

    Clarinet: A One-step Approach Towards Budget-friendly Unsupervised Domain Adaptation

    Authors: Yiyang Zhang, Feng Liu, Zhen Fang, Bo Yuan, Guangquan Zhang, Jie Lu

    Abstract: In unsupervised domain adaptation (UDA), classifiers for the target domain are trained with massive true-label data from the source domain and unlabeled data from the target domain. However, it may be difficult to collect fully-true-label data in a source domain given a limited budget. To mitigate this problem, we consider a novel problem setting where the classifier for the target domain has to b… ▽ More

    Submitted 4 March, 2021; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: This paper has been accepted by IJCAI-PRICAI 2020. Yiyang Zhang, Feng Liu and Zhen Fang equally contribute to this paper

  21. arXiv:2007.14285  [pdf, ps, other

    cs.LG math.FA stat.ML

    Theory of Deep Convolutional Neural Networks II: Spherical Analysis

    Authors: Zhiying Fang, Han Feng, Shuo Huang, Ding-Xuan Zhou

    Abstract: Deep learning based on deep neural networks of various structures and architectures has been powerful in many practical applications, but it lacks enough theoretical verifications. In this paper, we consider a family of deep convolutional neural networks applied to approximate functions on the unit sphere $\mathbb{S}^{d-1}$ of $\mathbb{R}^d$. Our analysis presents rates of uniform approximation wh… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

  22. arXiv:2006.13022  [pdf, other

    cs.LG cs.CV stat.ML

    Bridging the Theoretical Bound and Deep Algorithms for Open Set Domain Adaptation

    Authors: Li Zhong, Zhen Fang, Feng Liu, Bo Yuan, Guangquan Zhang, Jie Lu

    Abstract: In the unsupervised open set domain adaptation (UOSDA), the target domain contains unknown classes that are not observed in the source domain. Researchers in this area aim to train a classifier to accurately: 1) recognize unknown target data (data with unknown classes) and, 2) classify other target data. To achieve this aim, a previous study has proven an upper bound of the target-domain risk, and… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

  23. arXiv:2006.05691  [pdf, other

    cs.LG stat.ML

    On Low Rank Directed Acyclic Graphs and Causal Structure Learning

    Authors: Zhuangyan Fang, Shengyu Zhu, Jiji Zhang, Yue Liu, Zhitang Chen, Yangbo He

    Abstract: Despite several advances in recent years, learning causal structures represented by directed acyclic graphs (DAGs) remains a challenging task in high dimensional settings when the graphs to be learned are not sparse. In this paper, we propose to exploit a low rank assumption regarding the (weighted) adjacency matrix of a DAG causal model to help address this problem. We utilize existing low rank t… ▽ More

    Submitted 15 May, 2023; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: This paper has been accepted by the IEEE Transactions on Neural Networks and Learning Systems

  24. arXiv:1912.01198  [pdf, other

    cs.LG stat.ML

    Towards Understanding the Spectral Bias of Deep Learning

    Authors: Yuan Cao, Zhiying Fang, Yue Wu, Ding-Xuan Zhou, Quanquan Gu

    Abstract: An intriguing phenomenon observed during training neural networks is the spectral bias, which states that neural networks are biased towards learning less complex functions. The priority of learning functions with low complexity might be at the core of explaining generalization ability of neural network, and certain efforts have been made to provide theoretical explanation for spectral bias. Howev… ▽ More

    Submitted 5 October, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: 29 pages, 7 figures. This version adds more experimental results

  25. arXiv:1911.07420  [pdf, other

    cs.LG stat.ML

    A Graph Autoencoder Approach to Causal Structure Learning

    Authors: Ignavier Ng, Shengyu Zhu, Zhitang Chen, Zhuangyan Fang

    Abstract: Causal structure learning has been a challenging task in the past decades and several mainstream approaches such as constraint- and score-based methods have been studied with theoretical guarantees. Recently, a new approach has transformed the combinatorial structure learning problem into a continuous one and then solved it using gradient-based optimization methods. Following the recent state-of-t… ▽ More

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019 Workshop "Do the right thing": machine learning and causal inference for improved decision making

  26. arXiv:1910.08527  [pdf, other

    cs.LG stat.ME stat.ML

    Masked Gradient-Based Causal Structure Learning

    Authors: Ignavier Ng, Shengyu Zhu, Zhuangyan Fang, Haoyang Li, Zhitang Chen, Jun Wang

    Abstract: This paper studies the problem of learning causal structures from observational data. We reformulate the Structural Equation Model (SEM) with additive noises in a form parameterized by binary graph adjacency matrix and show that, if the original SEM is identifiable, then the binary adjacency matrix can be identified up to super-graphs of the true causal graph under mild conditions. We then utilize… ▽ More

    Submitted 10 January, 2022; v1 submitted 18 October, 2019; originally announced October 2019.

    Comments: Accepted to SDM 2022

  27. arXiv:1910.07689  [pdf, ps, other

    econ.EM math.ST stat.ME

    A Projection Framework for Testing Shape Restrictions That Form Convex Cones

    Authors: Zheng Fang, Juwon Seo

    Abstract: This paper develops a uniformly valid and asymptotically nonconservative test based on projection for a class of shape restrictions. The key insight we exploit is that these restrictions form convex cones, a simple and yet elegant structure that has been barely harnessed in the literature. Based on a monotonicity property afforded by such a geometric structure, we construct a bootstrap procedure t… ▽ More

    Submitted 20 September, 2021; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: This version contains the following sections omitted from the published version: i) discussions of the examples in the main text, ii) proofs for Appendix C (in the online appendix), and iii) the complete set of simulation results. A previous version of this paper was circulated under the title "A General Framework for Inference on Shape Restrictions."

  28. arXiv:1907.08375  [pdf, other

    cs.LG stat.ML

    Open Set Domain Adaptation: Theoretical Bound and Algorithm

    Authors: Zhen Fang, Jie Lu, Feng Liu, Junyu Xuan, Guangquan Zhang

    Abstract: The aim of unsupervised domain adaptation is to leverage the knowledge in a labeled (source) domain to improve a model's learning performance with an unlabeled (target) domain -- the basic strategy being to mitigate the effects of discrepancies between the two distributions. Most existing algorithms can only handle unsupervised closed set domain adaptation (UCSDA), i.e., where the source and targe… ▽ More

    Submitted 7 October, 2020; v1 submitted 19 July, 2019; originally announced July 2019.

    Comments: This paper has been accepted by IEEE-TNNLS

  29. arXiv:1906.10305  [pdf, other

    math.ST stat.ME

    Refinements of the Kiefer-Wolfowitz Theorem and a Test of Concavity

    Authors: Zheng Fang

    Abstract: This paper studies estimation of and inference on a distribution function $F$ that is concave on the nonnegative half line and admits a density function $f$ with potentially unbounded support. When $F$ is strictly concave, we show that the supremum distance between the Grenander distribution estimator and the empirical distribution may still be of order $O(n^{-2/3}(\log n)^{2/3})$ almost surely, w… ▽ More

    Submitted 9 November, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: Forthcoming in Electronic Journal of Statistics. Compared to the journal version, the difference is that this version contains additional simulation results, collected in Appendix C

  30. arXiv:1901.04598  [pdf, other

    stat.CO physics.data-an physics.geo-ph

    Precision Annealing Monte Carlo Methods for Statistical Data Assimilation: Metropolis-Hastings Procedures

    Authors: Adrian S. Wong, Kangbo Hao, Zheng Fang, Henry D. I. Abarbanel

    Abstract: Statistical Data Assimilation (SDA) is the transfer of information from field or laboratory observations to a user selected model of the dynamical system producing those observations. The data is noisy and the model has errors; the information transfer addresses properties of the conditional probability distribution of the states of the model conditioned on the observations. The quantities of inte… ▽ More

    Submitted 14 January, 2019; originally announced January 2019.

  31. arXiv:1812.02337  [pdf, ps, other

    econ.EM math.ST stat.ME

    Improved Inference on the Rank of a Matrix

    Authors: Qihui Chen, Zheng Fang

    Abstract: This paper develops a general framework for conducting inference on the rank of an unknown matrix $Π_0$. A defining feature of our setup is the null hypothesis of the form $\mathrm H_0: \mathrm{rank}(Π_0)\le r$. The problem is of first order importance because the previous literature focuses on $\mathrm H_0': \mathrm{rank}(Π_0)= r$ by implicitly assuming away $\mathrm{rank}(Π_0)<r$, which may lead… ▽ More

    Submitted 25 March, 2019; v1 submitted 5 December, 2018; originally announced December 2018.

  32. arXiv:1808.04447  [pdf, other

    cs.CV cs.LG stat.ML

    Deep Learning Super-Resolution Enables Rapid Simultaneous Morphological and Quantitative Magnetic Resonance Imaging

    Authors: Akshay Chaudhari, Zhongnan Fang, ** Hyung Lee, Garry Gold, Brian Hargreaves

    Abstract: Obtaining magnetic resonance images (MRI) with high resolution and generating quantitative image-based biomarkers for assessing tissue biochemistry is crucial in clinical and research applications. How- ever, acquiring quantitative biomarkers requires high signal-to-noise ratio (SNR), which is at odds with high-resolution in MRI, especially in a single rapid sequence. In this paper, we demonstrate… ▽ More

    Submitted 7 August, 2018; originally announced August 2018.

    Comments: Accepted for the Machine Learning for Medical Image Reconstruction Workshop at MICCAI 2018

  33. arXiv:1805.12507  [pdf, other

    stat.ML cs.LG

    Efficacy of regularized multi-task learning based on SVM models

    Authors: Shaohan Chen, Zhou Fang, Sijie Lu, Chuanhou Gao

    Abstract: This paper investigates the efficacy of a regularized multi-task learning (MTL) framework based on SVM (M-SVM) to answer whether MTL always provides reliable results and how MTL outperforms independent learning. We first find that M-SVM is Bayes risk consistent in the limit of large sample size. This implies that despite the task dissimilarities, M-SVM always produces a reliable decision rule for… ▽ More

    Submitted 20 February, 2022; v1 submitted 31 May, 2018; originally announced May 2018.

    Comments: 12 pages, 4 figures

  34. arXiv:1206.2716  [pdf, other

    stat.ME

    Semiparametric Mixed Model for Evaluating Pathway-Environment Interaction

    Authors: Zaili Fang, Inyoung Kim, Jeesun Jung

    Abstract: A biological pathway represents a set of genes that serves a particular cellular or a physiological function. The genes within the same pathway are expected to function together and hence may interact with each other. It is also known that many genes, and so pathways, interact with other environmental variables. However, no formal procedure has yet been developed to evaluate the pathway-environmen… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

  35. arXiv:1206.2715  [pdf, other

    stat.ME

    A Graphical View of Bayesian Variable Selection

    Authors: Zaili Fang, Inyoung Kim

    Abstract: In recent years, Ising prior with the network information for the "in" or "out" binary random variable in Bayesian variable selections has received more and more attentions. In this paper, we discover that even without the informative prior a Bayesian variable selection problem itself can be considered as a complete graph and described by a Ising model with random interactions. There are many adva… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

  36. arXiv:1206.2696  [pdf, other

    stat.ME

    Flexible Variable Selection for Recovering Sparsity in Nonadditive Nonparametric Models

    Authors: Zaili Fang, Inyoung Kim, Patrick Schaumont

    Abstract: Variable selection for recovering sparsity in nonadditive nonparametric models has been challenging. This problem becomes even more difficult due to complications in modeling unknown interaction terms among high dimensional variables. There is currently no variable selection method to overcome these limitations. Hence, in this paper we propose a variable selection approach that is developed by con… ▽ More

    Submitted 12 June, 2012; originally announced June 2012.

  37. arXiv:1111.4416  [pdf, other

    stat.ME

    Sparse Group Selection Through Co-Adaptive Penalties

    Authors: Zhou Fang

    Abstract: Recent work has focused on the problem of conducting linear regression when the number of covariates is very large, potentially greater than the sample size. To facilitate this, one useful tool is to assume that the model can be well approximated by a fit involving only a small number of covariates -- a so called sparsity assumption, which leads to the Lasso and other methods. In many situations,… ▽ More

    Submitted 18 November, 2011; originally announced November 2011.

  38. arXiv:1006.2940  [pdf, other

    stat.ME stat.CO stat.ML

    LASSO ISOtone for High Dimensional Additive Isotonic Regression

    Authors: Zhou Fang, Nicolai Meinshausen

    Abstract: Additive isotonic regression attempts to determine the relationship between a multi-dimensional observation variable and a response, under the constraint that the estimate is the additive sum of univariate component effects that are monotonically increasing. In this article, we present a new method for such regression called LASSO Isotone (LISO). LISO adapts ideas from sparse linear modelling to a… ▽ More

    Submitted 15 June, 2010; originally announced June 2010.