Skip to main content

Showing 1–21 of 21 results for author: Zhe, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.02746  [pdf, other

    cs.LG stat.ML

    Standard Gaussian Process Can Be Excellent for High-Dimensional Bayesian Optimization

    Authors: Zhitong Xu, Shandian Zhe

    Abstract: There has been a long-standing and widespread belief that Bayesian Optimization (BO) with standard Gaussian process (GP), referred to as standard BO, is ineffective in high-dimensional optimization problems. While this belief sounds reasonable, strong empirical evidence is lacking. In this paper, we systematically investigated BO with standard GP regression across a variety of synthetic and real-w… ▽ More

    Submitted 15 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  2. arXiv:2311.04829  [pdf, other

    cs.LG stat.ML

    Functional Bayesian Tucker Decomposition for Continuous-indexed Tensor Data

    Authors: Shikai Fang, Xin Yu, Zheng Wang, Shibo Li, Mike Kirby, Shandian Zhe

    Abstract: Tucker decomposition is a powerful tensor model to handle multi-aspect data. It demonstrates the low-rank property by decomposing the grid-structured data as interactions between a core tensor and a set of object representations (factors). A fundamental assumption of such decomposition is that there are finite objects in each aspect or mode, corresponding to discrete indexes of data entries. Howev… ▽ More

    Submitted 18 March, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Journal ref: The Twelfth International Conference on Learning Representations (ICLR 2024)

  3. arXiv:2310.19666  [pdf, other

    cs.LG stat.ML

    Dynamic Tensor Decomposition via Neural Diffusion-Reaction Processes

    Authors: Zheng Wang, Shikai Fang, Shibo Li, Shandian Zhe

    Abstract: Tensor decomposition is an important tool for multiway data analysis. In practice, the data is often sparse yet associated with rich temporal information. Existing methods, however, often under-use the time information and ignore the structural knowledge within the sparsely observed tensor entries. To overcome these limitations and to better capture the underlying temporal structure, we propose Dy… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  4. arXiv:2310.05387  [pdf, other

    cs.LG stat.ML

    Equation Discovery with Bayesian Spike-and-Slab Priors and Efficient Kernels

    Authors: Da Long, Wei W. Xing, Aditi S. Krishnapriyan, Robert M. Kirby, Shandian Zhe, Michael W. Mahoney

    Abstract: Discovering governing equations from data is important to many scientific and engineering applications. Despite promising successes, existing methods are still challenged by data sparsity and noise issues, both of which are ubiquitous in practice. Moreover, state-of-the-art methods lack uncertainty quantification and/or are costly in training. To overcome these limitations, we propose a novel equa… ▽ More

    Submitted 21 April, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

  5. arXiv:2308.14906  [pdf, other

    cs.LG stat.ML

    BayOTIDE: Bayesian Online Multivariate Time series Imputation with functional decomposition

    Authors: Shikai Fang, Qingsong Wen, Yingtao Luo, Shandian Zhe, Liang Sun

    Abstract: In real-world scenarios like traffic and energy, massive time-series data with missing values and noises are widely observed, even sampled irregularly. While many imputation methods have been proposed, most of them work with a local horizon, which means models are trained by splitting the long sequence into batches of fit-sized patches. This local horizon can make models ignore global trends or pe… ▽ More

    Submitted 30 May, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted by The 41st International Conference on Machine Learning (ICML 2024)

  6. arXiv:2210.08140  [pdf, other

    stat.ML cs.LG

    A Kernel Approach for PDE Discovery and Operator Learning

    Authors: Da Long, Nicole Mrvaljevic, Shandian Zhe, Bamdad Hosseini

    Abstract: This article presents a three-step framework for learning and solving partial differential equations (PDEs) using kernel methods. Given a training set consisting of pairs of noisy PDE solutions and source/boundary terms on a mesh, kernel smoothing is utilized to denoise the data and approximate derivatives of the solution. This information is then used in a kernel regression model to learn the alg… ▽ More

    Submitted 30 March, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

  7. arXiv:2207.03639  [pdf, other

    cs.LG stat.ML

    Nonparametric Embeddings of Sparse High-Order Interaction Events

    Authors: Zheng Wang, Yiming Xu, Conor Tillinghast, Shibo Li, Akil Narayan, Shandian Zhe

    Abstract: High-order interaction events are common in real-world applications. Learning embeddings that encode the complex relationships of the participants from these events is of great importance in knowledge mining and predictive tasks. Despite the success of existing approaches, e.g. Poisson tensor factorization, they ignore the sparse structure underlying the data, namely the occurred interactions are… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: 9 pages, ICML 2022

  8. arXiv:2110.10082  [pdf, other

    stat.ML cs.LG

    Nonparametric Sparse Tensor Factorization with Hierarchical Gamma Processes

    Authors: Conor Tillinghast, Zheng Wang, Shandian Zhe

    Abstract: We propose a nonparametric factorization approach for sparsely observed tensors. The sparsity does not mean zero-valued entries are massive or dominated. Rather, it implies the observed entries are very few, and even fewer with the growth of the tensor; this is ubiquitous in practice. Compared with the existent works, our model not only leverages the structural information underlying the observed… ▽ More

    Submitted 3 November, 2021; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: 15 pages, 4 figures

  9. arXiv:2106.09884  [pdf, other

    cs.LG stat.ML

    Batch Multi-Fidelity Bayesian Optimization with Deep Auto-Regressive Networks

    Authors: Shibo Li, Robert M. Kirby, Shandian Zhe

    Abstract: Bayesian optimization (BO) is a powerful approach for optimizing black-box, expensive-to-evaluate functions. To enable a flexible trade-off between the cost and accuracy, many applications allow the function to be evaluated at different fidelities. In order to reduce the optimization cost while maximizing the benefit-cost ratio, in this paper, we propose Batch Multi-fidelity Bayesian Optimization… ▽ More

    Submitted 25 October, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

  10. arXiv:2007.07367  [pdf, other

    cs.LG stat.ML

    Streaming Probabilistic Deep Tensor Factorization

    Authors: Shikai Fang, Zheng Wang, Zhimeng Pan, Ji Liu, Shandian Zhe

    Abstract: Despite the success of existing tensor factorization methods, most of them conduct a multilinear decomposition, and rarely exploit powerful modeling frameworks, like deep neural networks, to capture a variety of complicated interactions in data. More important, for highly expressive, deep factorization, we lack an effective approach to handle streaming data, which are ubiquitous in real-world appl… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  11. arXiv:2007.03117  [pdf, ps, other

    cs.LG stat.ML

    Multi-Fidelity Bayesian Optimization via Deep Neural Networks

    Authors: Shibo Li, Wei Xing, Mike Kirby, Shandian Zhe

    Abstract: Bayesian optimization (BO) is a popular framework to optimize black-box functions. In many applications, the objective function can be evaluated at multiple fidelities to enable a trade-off between the cost and accuracy. To reduce the optimization cost, many multi-fidelity BO methods have been proposed. Despite their success, these methods either ignore or over-simplify the strong, complex correla… ▽ More

    Submitted 10 December, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

  12. arXiv:2006.04976  [pdf, other

    stat.ML cs.LG

    Physics Informed Deep Kernel Learning

    Authors: Zheng Wang, Wei Xing, Robert Kirby, Shandian Zhe

    Abstract: Deep kernel learning is a promising combination of deep neural networks and nonparametric function learning. However, as a data driven approach, the performance of deep kernel learning can still be restricted by scarce or insufficient data, especially in extrapolation tasks. To address these limitations, we propose Physics Informed Deep Kernel Learning (PI-DKL) that exploits physics knowledge repr… ▽ More

    Submitted 18 January, 2022; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 8 pages, 5 figures, AISTATS

  13. arXiv:2006.04972  [pdf, other

    stat.ML cs.LG physics.comp-ph

    Multi-Fidelity High-Order Gaussian Processes for Physical Simulation

    Authors: Zheng Wang, Wei Xing, Robert Kirby, Shandian Zhe

    Abstract: The key task of physical simulation is to solve partial differential equations (PDEs) on discretized domains, which is known to be costly. In particular, high-fidelity solutions are much more expensive than low-fidelity ones. To reduce the cost, we consider novel Gaussian process (GP) models that leverage simulation examples of different fidelities to predict high-dimensional PDE solution outputs.… ▽ More

    Submitted 8 June, 2020; originally announced June 2020.

  14. arXiv:2003.11489  [pdf, ps, other

    cs.LG stat.ML

    Scalable Variational Gaussian Process Regression Networks

    Authors: Shibo Li, Wei Xing, Mike Kirby, Shandian Zhe

    Abstract: Gaussian process regression networks (GPRN) are powerful Bayesian models for multi-output regression, but their inference is intractable. To address this issue, existing methods use a fully factorized structure (or a mixture of such structures) over all the outputs and latent functions for posterior approximation, which, however, can miss the strong posterior dependencies among the latent variable… ▽ More

    Submitted 18 May, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

  15. Macroscopic Traffic Flow Modeling with Physics Regularized Gaussian Process: A New Insight into Machine Learning Applications

    Authors: Yun Yuan, Xianfeng Terry Yang, Zhao Zhang, Shandian Zhe

    Abstract: Despite the wide implementation of machine learning (ML) techniques in traffic flow modeling recently, those data-driven approaches often fall short of accuracy in the cases with a small or noisy dataset. To address this issue, this study presents a new modeling framework, named physics regularized machine learning (PRML), to encode classical traffic flow models (referred as physical models) into… ▽ More

    Submitted 6 February, 2020; originally announced February 2020.

    Comments: 30 pages, 13 figures

    Journal ref: Transp Res B: Methodol, 146, 88-110 (2021)

  16. arXiv:1910.12360  [pdf, ps, other

    stat.ML cs.LG

    Conditional Expectation Propagation

    Authors: Zheng Wang, Shandian Zhe

    Abstract: Expectation propagation (EP) is a powerful approximate inference algorithm. However, a critical barrier in applying EP is that the moment matching in message updates can be intractable. Handcrafting approximations is usually tricky, and lacks generalizability. Importance sampling is very expensive. While Laplace propagation provides a good solution, it has to run numerical optimizations to find La… ▽ More

    Submitted 8 November, 2019; v1 submitted 27 October, 2019; originally announced October 2019.

    Comments: 10 pages, 5 figures, UAI 2019

  17. arXiv:1712.05134  [pdf, other

    cs.LG stat.ML

    Learning Compact Recurrent Neural Networks with Block-Term Tensor Decomposition

    Authors: **mian Ye, Linnan Wang, Guangxi Li, Di Chen, Shandian Zhe, Xinqi Chu, Zenglin Xu

    Abstract: Recurrent Neural Networks (RNNs) are powerful sequence modeling tools. However, when dealing with high dimensional inputs, the training of RNNs becomes computational expensive due to the large number of model parameters. This hinders RNNs from solving many important computer vision tasks, such as Action Recognition in Videos and Image Captioning. To overcome this problem, we propose a compact and… ▽ More

    Submitted 11 May, 2018; v1 submitted 14 December, 2017; originally announced December 2017.

    Comments: CVPR2018

  18. arXiv:1704.06735  [pdf, ps, other

    stat.ML

    Asynchronous Distributed Variational Gaussian Processes for Regression

    Authors: Hao Peng, Shandian Zhe, Yuan Qi

    Abstract: Gaussian processes (GPs) are powerful non-parametric function estimators. However, their applications are largely limited by the expensive computational cost of the inference procedures. Existing stochastic or distributed synchronous variational inferences, although have alleviated this issue by scaling up GPs to millions of samples, are still far from satisfactory for real-world large application… ▽ More

    Submitted 12 June, 2017; v1 submitted 21 April, 2017; originally announced April 2017.

    Comments: International Conference on Machine Learning 2017

  19. arXiv:1604.07928  [pdf, ps, other

    cs.LG cs.AI cs.DC stat.ML

    Distributed Flexible Nonlinear Tensor Factorization

    Authors: Shandian Zhe, Kai Zhang, Pengyuan Wang, Kuang-chih Lee, Zenglin Xu, Yuan Qi, Zoubin Ghahramani

    Abstract: Tensor factorization is a powerful tool to analyse multi-way data. Compared with traditional multi-linear methods, nonlinear tensor factorization models are capable of capturing more complex relationships in the data. However, they are computationally expensive and may suffer severe learning bias in case of extreme data sparsity. To overcome these limitations, in this paper we propose a distribute… ▽ More

    Submitted 21 May, 2016; v1 submitted 27 April, 2016; originally announced April 2016.

    Comments: Gaussian process, tensor factorization, multidimensional arrays, large scale, spark, map-reduce

    ACM Class: I.5.1; I.5.4

  20. arXiv:1311.2663  [pdf, ps, other

    cs.LG cs.DC stat.ML

    DinTucker: Scaling up Gaussian process models on multidimensional arrays with billions of elements

    Authors: Shandian Zhe, Yuan Qi, Youngja Park, Ian Molloy, Suresh Chari

    Abstract: Infinite Tucker Decomposition (InfTucker) and random function prior models, as nonparametric Bayesian models on infinite exchangeable arrays, are more powerful models than widely-used multilinear factorization methods including Tucker and PARAFAC decomposition, (partly) due to their capability of modeling nonlinear relationships between array elements. Despite their great predictive performance an… ▽ More

    Submitted 1 February, 2014; v1 submitted 11 November, 2013; originally announced November 2013.

  21. arXiv:1304.7284  [pdf, other

    cs.LG cs.CE stat.ML

    Supervised Heterogeneous Multiview Learning for Joint Association Study and Disease Diagnosis

    Authors: Shandian Zhe, Zenglin Xu, Yuan Qi

    Abstract: Given genetic variations and various phenotypical traits, such as Magnetic Resonance Imaging (MRI) features, we consider two important and related tasks in biomedical research: i)to select genetic and phenotypical markers for disease diagnosis and ii) to identify associations between genetic and phenotypical data. These two tasks are tightly coupled because underlying associations between genetic… ▽ More

    Submitted 16 October, 2013; v1 submitted 26 April, 2013; originally announced April 2013.