Skip to main content

Showing 1–31 of 31 results for author: Luo, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.08281  [pdf, other

    cs.LG stat.ML

    Conformal Load Prediction with Transductive Graph Autoencoders

    Authors: Rui Luo, Nicolo Colombo

    Abstract: Predicting edge weights on graphs has various applications, from transportation systems to social networks. This paper describes a Graph Neural Network (GNN) approach for edge weight prediction with guaranteed coverage. We leverage conformal prediction to calibrate the GNN outputs and produce valid prediction intervals. We handle data heteroscedasticity through error reweighting and Conformalized… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2404.03828  [pdf, other

    cs.LG cs.AI stat.ML

    Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

    Authors: Jerry Yao-Chieh Hu, Pei-Hsuan Chang, Robin Luo, Hong-Yu Chen, Weijian Li, Wei-Po Wang, Han Liu

    Abstract: We introduce an Outlier-Efficient Modern Hopfield Model (termed $\mathrm{OutEffHop}$) and use it to address the outlier inefficiency problem of {training} gigantic transformer-based models. Our main contribution is a novel associative memory model facilitating \textit{outlier-efficient} associative memory retrievals. Interestingly, this memory model manifests a model-based interpretation of an out… ▽ More

    Submitted 26 June, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted at ICML 2024; v2 updated to camera-ready version; Code available at https://github.com/MAGICS-LAB/OutEffHop; Models are on Hugging Face: https://huggingface.co/collections/magicslabnu/outeffhop-6610fcede8d2cda23009a98f

  3. arXiv:2308.06769  [pdf, other

    stat.ML cs.SI eess.SP

    Fréchet Statistics Based Change Point Detection in Multivariate Hawkes Process

    Authors: Rui Luo, Vikram Krishnamurthy

    Abstract: This paper proposes a new approach for change point detection in causal networks of multivariate Hawkes processes using Frechet statistics. Our method splits the point process into overlap** windows, estimates kernel matrices in each window, and reconstructs the signed Laplacians by treating the kernel matrices as the adjacency matrices of the causal network. We demonstrate the effectiveness of… ▽ More

    Submitted 15 August, 2023; v1 submitted 13 August, 2023; originally announced August 2023.

  4. arXiv:2203.16666  [pdf, other

    cs.NI stat.AP

    Hawkes Process Modeling of Block Arrivals in Bitcoin Blockchain

    Authors: Rui Luo, Vikram Krishnamurthy, Erik Blasch

    Abstract: The paper constructs a multi-variate Hawkes process model of Bitcoin block arrivals and price jumps. Hawkes processes are selfexciting point processes that can capture the self- and cross-excitation effects of block mining and Bitcoin price volatility. We use publicly available blockchain datasets to estimate the model parameters via maximum likelihood estimation. The results show that Bitcoin pri… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

  5. arXiv:2109.12727  [pdf, other

    cs.SI stat.ML

    Anomalous Edge Detection in Edge Exchangeable Social Network Models

    Authors: Rui Luo, Buddhika Nettasinghe, Vikram Krishnamurthy

    Abstract: This paper studies detecting anomalous edges in directed graphs that model social networks. We exploit edge exchangeability as a criterion for distinguishing anomalous edges from normal edges. Then we present an anomaly detector based on conformal prediction theory; this detector has a guaranteed upper bound for false positive rate. In numerical experiments, we show that the proposed algorithm ach… ▽ More

    Submitted 21 August, 2023; v1 submitted 26 September, 2021; originally announced September 2021.

  6. arXiv:2008.09643  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Privacy Preserving Recalibration under Domain Shift

    Authors: Rachel Luo, Shengjia Zhao, Jiaming Song, Jonathan Kuck, Stefano Ermon, Silvio Savarese

    Abstract: Classifiers deployed in high-stakes real-world applications must output calibrated confidence scores, i.e. their predicted probabilities should reflect empirical frequencies. Recalibration algorithms can greatly improve a model's probability estimates; however, existing algorithms are not applicable in real-world situations where the test data follows a different distribution from the training dat… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

  7. arXiv:2007.04785  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Accuracy Prediction with Non-neural Model for Neural Architecture Search

    Authors: Renqian Luo, Xu Tan, Rui Wang, Tao Qin, Enhong Chen, Tie-Yan Liu

    Abstract: Neural architecture search (NAS) with an accuracy predictor that predicts the accuracy of candidate architectures has drawn increasing attention due to its simplicity and effectiveness. Previous works usually employ neural network-based predictors which require more delicate design and are easy to overfit. Considering that most architectures are represented as sequences of discrete symbols which a… ▽ More

    Submitted 19 July, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Code is available at https://github.com/renqianluo/GBDT-NAS

  8. arXiv:2007.00295  [pdf, ps, other

    cs.LG stat.ML

    Belief Propagation Neural Networks

    Authors: Jonathan Kuck, Shuvam Chakraborty, Hao Tang, Rachel Luo, Jiaming Song, Ashish Sabharwal, Stefano Ermon

    Abstract: Learned neural solvers have successfully been used to solve combinatorial optimization and decision problems. More general counting variants of these problems, however, are still largely solved with hand-crafted solvers. To bridge this gap, we introduce belief propagation neural networks (BPNNs), a class of parameterized operators that operate on factor graphs and generalize Belief Propagation (BP… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  9. arXiv:2006.05620  [pdf, other

    cs.LG stat.ML

    Exploring the Vulnerability of Deep Neural Networks: A Study of Parameter Corruption

    Authors: Xu Sun, Zhiyuan Zhang, Xuancheng Ren, Ruixuan Luo, Liangyou Li

    Abstract: We argue that the vulnerability of model parameters is of crucial value to the study of model robustness and generalization but little research has been devoted to understanding this matter. In this work, we propose an indicator to measure the robustness of neural network parameters by exploiting their vulnerability via parameter corruption. The proposed indicator describes the maximum loss variat… ▽ More

    Submitted 10 December, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: Accepted by AAAI 2021

  10. arXiv:2002.10389  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Neural Architecture Search

    Authors: Renqian Luo, Xu Tan, Rui Wang, Tao Qin, Enhong Chen, Tie-Yan Liu

    Abstract: Neural architecture search (NAS) relies on a good controller to generate better architectures or predict the accuracy of given architectures. However, training the controller requires both abundant and high-quality pairs of architectures and their accuracy, while it is costly to evaluate an architecture and obtain its accuracy. In this paper, we propose SemiNAS, a semi-supervised NAS approach that… ▽ More

    Submitted 3 November, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: NeurIPS 2020

  11. arXiv:1911.06191  [pdf, other

    cs.CL cs.LG stat.ML

    Microsoft Research Asia's Systems for WMT19

    Authors: Yingce Xia, Xu Tan, Fei Tian, Fei Gao, Weicong Chen, Yang Fan, Linyuan Gong, Yichong Leng, Renqian Luo, Yiren Wang, Lijun Wu, **hua Zhu, Tao Qin, Tie-Yan Liu

    Abstract: We Microsoft Research Asia made submissions to 11 language directions in the WMT19 news translation tasks. We won the first place for 8 of the 11 directions and the second place for the other three. Our basic systems are built on Transformer, back translation and knowledge distillation. We integrate several of our rececent techniques to enhance the baseline systems: multi-agent dual learning (MADL… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: Accepted to "Fourth Conference on Machine Translation (WMT19)"

  12. arXiv:1910.12249  [pdf, other

    cs.LG stat.ML

    An Adaptive and Momental Bound Method for Stochastic Learning

    Authors: Jianbang Ding, Xuancheng Ren, Ruixuan Luo, Xu Sun

    Abstract: Training deep neural networks requires intricate initialization and careful selection of learning rates. The emergence of stochastic gradient optimization methods that use adaptive learning rates based on squared past gradients, e.g., AdaGrad, AdaDelta, and Adam, eases the job slightly. However, such methods have also been proven problematic in recent studies with their own pitfalls including non-… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

  13. arXiv:1909.10815  [pdf, other

    cs.LG cs.NE stat.ML

    Balanced One-shot Neural Architecture Optimization

    Authors: Renqian Luo, Tao Qin, Enhong Chen

    Abstract: The ability to rank candidate architectures is the key to the performance of neural architecture search~(NAS). One-shot NAS is proposed to reduce the expense but shows inferior performance against conventional NAS and is not adequately stable. We investigate into this and find that the ranking correlation between architectures under one-shot training and the ones under stand-alone full training is… ▽ More

    Submitted 31 March, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: Code and model checkpoints are publicly available at https://github.com/renqianluo/NAO_pytorch

  14. arXiv:1908.03595  [pdf, other

    cs.LG stat.ML

    Adaptive Ensemble of Classifiers with Regularization for Imbalanced Data Classification

    Authors: Chen Wang, Chengyuan Deng, Zhoulu Yu, Dafeng Hui, Xiaofeng Gong, Ruisen Luo

    Abstract: The dynamic ensemble selection of classifiers is an effective approach for processing label-imbalanced data classifications. However, such a technique is prone to overfitting, owing to the lack of regularization methods and the dependence of the aforementioned technique on local geometry. In this study, focusing on binary imbalanced data classification, a novel dynamic ensemble method, namely adap… ▽ More

    Submitted 5 November, 2020; v1 submitted 9 August, 2019; originally announced August 2019.

    Comments: Major revision; Change of authors due to contributions

  15. arXiv:1907.13196  [pdf, other

    cs.LG cs.AI stat.ML

    Wasserstein Robust Reinforcement Learning

    Authors: Mohammed Amin Abdullah, Hang Ren, Haitham Bou Ammar, Vladimir Milenkovic, Rui Luo, Mingtian Zhang, Jun Wang

    Abstract: Reinforcement learning algorithms, though successful, tend to over-fit to training environments hampering their application to the real-world. This paper proposes $\text{W}\text{R}^{2}\text{L}$ -- a robust reinforcement learning algorithm with significant robust performance on low and high-dimensional control tasks. Our method formalises robust reinforcement learning as a novel min-max game with a… ▽ More

    Submitted 16 September, 2019; v1 submitted 30 July, 2019; originally announced July 2019.

  16. arXiv:1907.04536  [pdf

    cs.LG cs.SD eess.AS stat.ML

    Multi-layer Attention Mechanism for Speech Keyword Recognition

    Authors: Ruisen Luo, Tianran Sun, Chen Wang, Miao Du, Zuodong Tang, Kai Zhou, Xiaofeng Gong, Xiaomei Yang

    Abstract: As an important part of speech recognition technology, automatic speech keyword recognition has been intensively studied in recent years. Such technology becomes especially pivotal under situations with limited infrastructures and computational resources, such as voice command recognition in vehicles and robot interaction. At present, the mainstream methods in automatic speech keyword recognition… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

  17. arXiv:1905.12569  [pdf, other

    stat.ML cs.LG

    Replica-exchange Nosé-Hoover dynamics for Bayesian learning on large datasets

    Authors: Rui Luo, Qiang Zhang, Yaodong Yang, Jun Wang

    Abstract: In this paper, we present a new practical method for Bayesian learning that can rapidly draw representative samples from complex posterior distributions with multiple isolated modes in the presence of mini-batch noise. This is achieved by simulating a collection of replicas in parallel with different temperatures and periodically swap** them. When evolving the replicas' states, the Nosé-Hoover d… ▽ More

    Submitted 21 February, 2021; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2020

  18. arXiv:1904.00204  [pdf, other

    stat.ME

    Combining Smoothing Spline with Conditional Gaussian Graphical Model for Density and Graph Estimation

    Authors: Runfei Luo, Anna Liu, Yuedong Wang

    Abstract: Multivariate density estimation and graphical models play important roles in statistical learning. The estimated density can be used to construct a graphical model that reveals conditional relationships whereas a graphical structure can be used to build models for density estimation. Our goal is to construct a consolidated framework that can perform both density and graph estimation. Denote… ▽ More

    Submitted 30 March, 2019; originally announced April 2019.

  19. arXiv:1901.09207  [pdf, other

    cs.LG cs.AI stat.ML

    Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning

    Authors: Ying Wen, Yaodong Yang, Rui Luo, Jun Wang, Wei Pan

    Abstract: Humans are capable of attributing latent mental contents such as beliefs or intentions to others. The social skill is critical in daily life for reasoning about the potential consequences of others' behaviors so as to plan ahead. It is known that humans use such reasoning ability recursively by considering what others believe about their own beliefs. In this paper, we start from level-$1$ recursio… ▽ More

    Submitted 1 March, 2019; v1 submitted 26 January, 2019; originally announced January 2019.

    Comments: ICLR 2019

  20. arXiv:1812.01181  [pdf, other

    stat.ML cs.LG

    Parallel-tempered Stochastic Gradient Hamiltonian Monte Carlo for Approximate Multimodal Posterior Sampling

    Authors: Rui Luo, Qiang Zhang, Yuanyuan Liu

    Abstract: We propose a new sampler that integrates the protocol of parallel tempering with the Nosé-Hoover (NH) dynamics. The proposed method can efficiently draw representative samples from complex posterior distributions with multiple isolated modes in the presence of noise arising from stochastic gradient. It potentially facilitates deep Bayesian learning on large datasets where complex multimodal poster… ▽ More

    Submitted 7 December, 2018; v1 submitted 3 December, 2018; originally announced December 2018.

  21. arXiv:1811.03711  [pdf, other

    cs.LG q-fin.ST stat.ML

    Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series

    Authors: Qiang Zhang, Rui Luo, Yaodong Yang, Yuanyuan Liu

    Abstract: Volatility is a quantity of measurement for the price movements of stocks or options which indicates the uncertainty within financial markets. As an indicator of the level of risk or the degree of variation, volatility is important to analyse the financial market, and it is taken into consideration in various decision-making processes in financial activities. On the other hand, recent advancement… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: NIPS 2018, Workshop on Challenges and Opportunities for AI in Financial Services

  22. arXiv:1808.07233  [pdf, other

    cs.LG stat.ML

    Neural Architecture Optimization

    Authors: Renqian Luo, Fei Tian, Tao Qin, Enhong Chen, Tie-Yan Liu

    Abstract: Automatic neural architecture design has shown its potential in discovering powerful neural network architectures. Existing methods, no matter based on reinforcement learning or evolutionary algorithms (EA), conduct architecture search in a discrete space, which is highly inefficient. In this paper, we propose a simple and efficient method to automatic neural architecture design based on continuou… ▽ More

    Submitted 4 September, 2019; v1 submitted 22 August, 2018; originally announced August 2018.

    Comments: NeurIPS 2018. Code available at: https://github.com/renqianluo/NAO

  23. arXiv:1808.03679  [pdf

    physics.ins-det cs.LG stat.ML

    Machine Learning Promoting Extreme Simplification of Spectroscopy Equipment

    Authors: Jianchao Lee, Qiannan Duan, Sifan Bi, Ruen Luo, Yachao Lian, Hanqiang Liu, Ruixing Tian, Jiayuan Chen, Guodong Ma, **hong Gao, Zhaoyi Xu

    Abstract: The spectroscopy measurement is one of main pathways for exploring and understanding the nature. Today, it seems that racing artificial intelligence will remould its styles. The algorithms contained in huge neural networks are capable of substituting many of expensive and complex components of spectrum instruments. In this work, we presented a smart machine learning strategy on the measurement of… ▽ More

    Submitted 13 September, 2019; v1 submitted 5 August, 2018; originally announced August 2018.

    Comments: This is the second version. On pages 7 through 8, we have added a new case about the spectral properties of mixtures. Specifically, paragraph 1 on page 8 and Fig.7 is added

  24. arXiv:1803.00204  [pdf, other

    cs.LG cs.AI math.NA stat.ML

    Scalar Quantization as Sparse Least Square Optimization

    Authors: Chen Wang, Xiaomei Yang, Shaomin Fei, Kai Zhou, Xiaofeng Gong, Miao Du, Ruisen Luo

    Abstract: Quantization can be used to form new vectors/matrices with shared values close to the original. In recent years, the popularity of scalar quantization for value-sharing applications has been soaring as it has been found huge utilities in reducing the complexity of neural networks. Existing clustering-based quantization techniques, while being well-developed, have multiple drawbacks including the d… ▽ More

    Submitted 5 November, 2019; v1 submitted 28 February, 2018; originally announced March 2018.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019

  25. arXiv:1712.00504  [pdf, other

    cs.LG cs.CE q-fin.ST stat.ML

    A Neural Stochastic Volatility Model

    Authors: Rui Luo, Weinan Zhang, Xiaojun Xu, Jun Wang

    Abstract: In this paper, we show that the recent integration of statistical models with deep recurrent neural networks provides a new way of formulating volatility (the degree of variation of time series) models that have been widely used in time series analysis and prediction in finance. The model comprises a pair of complementary stochastic recurrent neural networks: the generative network models the join… ▽ More

    Submitted 4 December, 2018; v1 submitted 30 November, 2017; originally announced December 2017.

  26. arXiv:1711.11511  [pdf, other

    stat.ML

    Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning

    Authors: Rui Luo, Jianhong Wang, Yaodong Yang, Zhanxing Zhu, Jun Wang

    Abstract: We propose a new sampling method, the thermostat-assisted continuously-tempered Hamiltonian Monte Carlo, for Bayesian learning on large datasets and multimodal distributions. It simulates the Nosé-Hoover dynamics of a continuously-tempered Hamiltonian system built on the distribution of interest. A significant advantage of this method is that it is not only able to efficiently draw representative… ▽ More

    Submitted 28 January, 2019; v1 submitted 30 November, 2017; originally announced November 2017.

  27. arXiv:1706.05446  [pdf, other

    stat.ML stat.AP

    Adversarial Variational Bayes Methods for Tweedie Compound Poisson Mixed Models

    Authors: Yaodong Yang, Rui Luo, Yuanyuan Liu

    Abstract: The Tweedie Compound Poisson-Gamma model is routinely used for modeling non-negative continuous data with a discrete probability mass at zero. Mixed models with random effects account for the covariance structure related to the grou** hierarchy in the data. An important application of Tweedie mixed models is pricing the insurance policies, e.g. car insurance. However, the intractable likelihood… ▽ More

    Submitted 3 February, 2019; v1 submitted 16 June, 2017; originally announced June 2017.

    Comments: ICASSP 2019

  28. arXiv:1508.01113  [pdf, ps, other

    stat.ME

    Sparse Fisher's discriminant analysis with thresholded linear constraints

    Authors: Ruiyan Luo, Xin Qi

    Abstract: Various regularized linear discriminant analysis (LDA) methods have been proposed to address the problems of the classic methods in high-dimensional settings. Asymptotic optimality has been established for some of these methods in high dimension when there are only two classes. A major difficulty in proving asymptotic optimality for multiclass classification is that the classification boundary is… ▽ More

    Submitted 5 August, 2015; originally announced August 2015.

  29. arXiv:1508.01105  [pdf, other

    stat.ME

    Signal extraction approach for sparse multivariate response regression

    Authors: Ruiyan Luo, Xin Qi

    Abstract: In this paper, we consider multivariate response regression models with high dimensional predictor variables. One way to model the correlation among the response variables is through the low rank decomposition of the coefficient matrix, which has been considered by several papers for the high dimensional predictors. However, all these papers focus on the singular value decomposition of the coeffic… ▽ More

    Submitted 5 August, 2015; originally announced August 2015.

    Comments: 28 pages, 4 figures

  30. Bayesian hierarchical modeling for signaling pathway inference from single cell interventional data

    Authors: Ruiyan Luo, Hongyu Zhao

    Abstract: Recent technological advances have made it possible to simultaneously measure multiple protein activities at the single cell level. With such data collected under different stimulatory or inhibitory conditions, it is possible to infer the causal relationships among proteins from single cell interventional data. In this article we propose a Bayesian hierarchical modeling framework to infer the sign… ▽ More

    Submitted 3 August, 2011; originally announced August 2011.

    Comments: Published in at http://dx.doi.org/10.1214/10-AOAS425 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS425

    Journal ref: Annals of Applied Statistics 2011, Vol. 5, No. 2A, 725-745

  31. arXiv:0906.1094  [pdf, ps, other

    stat.AP q-bio.GN

    Modeling substitution and indel processes for AFLP marker evolution and phylogenetic inference

    Authors: Ruiyan Luo, Bret Larget

    Abstract: The amplified fragment length polymorphism (AFLP) method produces anonymous genetic markers from throughout a genome. We extend the nucleotide substitution model of AFLP evolution to additionally include insertion and deletion processes. The new Sub-ID model relaxes the common assumption that markers are independent and homologous. We build a Markov chain Monte Carlo methodology tailored for the… ▽ More

    Submitted 5 June, 2009; originally announced June 2009.

    Comments: Published in at http://dx.doi.org/10.1214/08-AOAS212 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS212

    Journal ref: Annals of Applied Statistics 2009, Vol. 3, No. 1, 222-248