Skip to main content

Showing 1–18 of 18 results for author: Lai, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.11451  [pdf, ps, other

    math.NA cs.AI math.AP stat.ML

    Error Analysis of Three-Layer Neural Network Trained with PGD for Deep Ritz Method

    Authors: Yuling Jiao, Yanming Lai, Yang Wang

    Abstract: Machine learning is a rapidly advancing field with diverse applications across various domains. One prominent area of research is the utilization of deep learning techniques for solving partial differential equations(PDEs). In this work, we specifically focus on employing a three-layer tanh neural network within the framework of the deep Ritz method(DRM) to solve second-order elliptic equations wi… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    MSC Class: 65N12; 65N15; 68T07; 62G05; 35J25

  2. arXiv:2404.05678  [pdf, other

    stat.ML cs.CY cs.LG

    Flexible Fairness Learning via Inverse Conditional Permutation

    Authors: Yuheng Lai, Leying Guan

    Abstract: Equalized odds, as a popular notion of algorithmic fairness, aims to ensure that sensitive variables, such as race and gender, do not unfairly influence the algorithm prediction when conditioning on the true outcome. Despite rapid advancements, most of the current research focuses on the violation of equalized odds caused by one sensitive attribute, leaving the challenge of simultaneously accounti… ▽ More

    Submitted 9 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  3. arXiv:2404.02538  [pdf, other

    stat.ML cs.LG

    Convergence Analysis of Flow Matching in Latent Space with Transformers

    Authors: Yuling Jiao, Yanming Lai, Yang Wang, Bokai Yan

    Abstract: We present theoretical convergence guarantees for ODE-based generative models, specifically flow matching. We use a pre-trained autoencoder network to map high-dimensional original inputs to a low-dimensional latent space, where a transformer network is trained to predict the velocity field of the transformation from a standard normal distribution to the target latent distribution. Our error analy… ▽ More

    Submitted 28 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2104.12953  [pdf, other

    cs.LG cs.AI stat.ML

    Exploring Uncertainty in Deep Learning for Construction of Prediction Intervals

    Authors: Yuandu Lai, Yucheng Shi, Yahong Han, Yunfeng Shao, Meiyu Qi, Bingshuai Li

    Abstract: Deep learning has achieved impressive performance on many tasks in recent years. However, it has been found that it is still not enough for deep neural networks to provide only point estimates. For high-risk tasks, we need to assess the reliability of the model predictions. This requires us to quantify the uncertainty of model prediction and construct prediction intervals. In this paper, We explor… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

  5. Anticipating synchronization with machine learning

    Authors: Huawei Fan, Ling-Wei Kong, Ying-Cheng Lai, Xingang Wang

    Abstract: In applications of dynamical systems, situations can arise where it is desired to predict the onset of synchronization as it can lead to characteristic and significant changes in the system performance and behaviors, for better or worse. In experimental and real settings, the system equations are often unknown, raising the need to develop a prediction framework that is model free and fully data dr… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

    Comments: 13 pages; 12 figures

    Journal ref: Phys. Rev. Research 3, 023237 (2021)

  6. arXiv:2006.08361  [pdf, other

    cs.CY stat.ML

    An Unsupervised Machine Learning Approach to Assess the ZIP Code Level Impact of COVID-19 in NYC

    Authors: Fadoua Khmaissia, Pegah Sagheb Haghighi, Aarthe Jayaprakash, Zhenwei Wu, Sokratis Papadopoulos, Yuan Lai, Freddy T. Nguyen

    Abstract: New York City has been recognized as the world's epicenter of the novel Coronavirus pandemic. To identify the key inherent factors that are highly correlated to the Increase Rate of COVID-19 new cases in NYC, we propose an unsupervised machine learning framework. Based on the assumption that ZIP code areas with similar demographic, socioeconomic, and mobility patterns are likely to experience simi… ▽ More

    Submitted 18 September, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Presented at ICML 2020 Workshop on the Healthcare Systems, Population Health, and the Role of Health-Tech

  7. arXiv:2004.01258  [pdf, other

    cs.LG nlin.AO stat.ML

    Long-term prediction of chaotic systems with recurrent neural networks

    Authors: Huawei Fan, Junjie Jiang, Chun Zhang, Xingang Wang, Ying-Cheng Lai

    Abstract: Reservoir computing systems, a class of recurrent neural networks, have recently been exploited for model-free, data-based prediction of the state evolution of a variety of chaotic dynamical systems. The prediction horizon demonstrated has been about half dozen Lyapunov time. Is it possible to significantly extend the prediction time beyond what has been achieved so far? We articulate a scheme inc… ▽ More

    Submitted 6 March, 2020; originally announced April 2020.

    Comments: 10 pages, 8 figures

  8. arXiv:1912.05796  [pdf, other

    cs.LG cs.AI stat.ML

    Automatic Layout Generation with Applications in Machine Learning Engine Evaluation

    Authors: Haoyu Yang, Wen Chen, Piyush Pathak, Frank Gennari, Ya-Chieh Lai, Bei Yu

    Abstract: Machine learning-based lithography hotspot detection has been deeply studied recently, from varies feature extraction techniques to efficient learning models. It has been observed that such machine learning-based frameworks are providing satisfactory metal layer hotspot prediction results on known public metal layer benchmarks. In this work, we seek to evaluate how these machine learning-based hot… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: 6 pages, submitted to 1st ACM/IEEE Workshop on Machine Learning for CAD (MLCAD) for review

  9. Ensemble Quantile Classifier

    Authors: Yuanhao Lai, Ian McLeod

    Abstract: Both the median-based classifier and the quantile-based classifier are useful for discriminating high-dimensional data with heavy-tailed or skewed inputs. But these methods are restricted as they assign equal weight to each variable in an unregularized way. The ensemble quantile classifier is a more flexible regularized classifier that provides better performance with high-dimensional data, asymme… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Journal ref: Computational Statistics and Data Analysis (2019) 106849

  10. arXiv:1910.04426  [pdf, other

    cs.LG nlin.CD physics.data-an stat.ML

    Model-free prediction of spatiotemporal dynamical systems with recurrent neural networks: Role of network spectral radius

    Authors: Junjie Jiang, Ying-Cheng Lai

    Abstract: A common difficulty in applications of machine learning is the lack of any general principle for guiding the choices of key parameters of the underlying neural network. Focusing on a class of recurrent neural networks - reservoir computing systems that have recently been exploited for model-free prediction of nonlinear dynamical systems, we uncover a surprising phenomenon: the emergence of an inte… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: 15 pages, 13 figures

  11. arXiv:1810.08765  [pdf, other

    cs.IR cs.LG stat.ML

    Attribute-aware Collaborative Filtering: Survey and Classification

    Authors: Wen-Hao Chen, Chin-Chi Hsu, Yi-An Lai, Vincent Liu, Mi-Yen Yeh, Shou-De Lin

    Abstract: Attribute-aware CF models aims at rating prediction given not only the historical rating from users to items, but also the information associated with users (e.g. age), items (e.g. price), or even ratings (e.g. rating time). This paper surveys works in the past decade develo** attribute-aware CF systems, and discovered that mathematically they can be classified into four different categories. We… ▽ More

    Submitted 20 October, 2018; originally announced October 2018.

  12. arXiv:1807.10693  [pdf, ps, other

    cs.LG stat.ML

    Infinite Mixture of Inverted Dirichlet Distributions

    Authors: Zhanyu Ma, Yu** Lai

    Abstract: In this work, we develop a novel Bayesian estimation method for the Dirichlet process (DP) mixture of the inverted Dirichlet distributions, which has been shown to be very flexible for modeling vectors with positive elements. The recently proposed extended variational inference (EVI) framework is adopted to derive an analytically tractable solution. The convergency of the proposed algorithm is the… ▽ More

    Submitted 2 February, 2020; v1 submitted 27 July, 2018; originally announced July 2018.

    Comments: Technical Report of ongoing work

  13. arXiv:1806.05424  [pdf, other

    stat.AP stat.CO

    Sequential Bayesian inference for spatio-temporal models of temperature and humidity data

    Authors: Yingying Lai, Andrew Golightly, Richard Boys

    Abstract: We develop a spatio-temporal model to forecast sensor output at five locations in North East England. The signal is described using coupled dynamic linear models, with spatial effects specified by a Gaussian process. Data streams are analysed using a stochastic algorithm which sequentially approximates the parameter posterior through a series of reweighting and resampling steps. An iterated batch… ▽ More

    Submitted 14 June, 2018; originally announced June 2018.

    Comments: 25 pages

  14. arXiv:1709.00944  [pdf

    cs.SD cs.MM eess.AS stat.ML

    Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks

    Authors: Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Yu Tsao, Hsiu-Wen Chang, Hsin-Min Wang

    Abstract: Speech enhancement (SE) aims to reduce noise in speech signals. Most SE techniques focus only on addressing audio information. In this work, inspired by multimodal learning, which utilizes data from different modalities, and the recent success of convolutional neural networks (CNNs) in SE, we propose an audio-visual deep CNNs (AVDCNN) SE model, which incorporates audio and visual streams into a un… ▽ More

    Submitted 18 April, 2022; v1 submitted 1 September, 2017; originally announced September 2017.

    Comments: This paper is the same as arXiv:1703.10893v6. Apologies for the inconvenience. arXiv admin note: text overlap with arXiv:1703.10893

  15. arXiv:1703.10893  [pdf

    cs.SD cs.MM stat.ML

    Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks

    Authors: Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Yu Tsao, Hsiu-Wen Chang, Hsin-Min Wang

    Abstract: Speech enhancement (SE) aims to reduce noise in speech signals. Most SE techniques focus only on addressing audio information. In this work, inspired by multimodal learning, which utilizes data from different modalities, and the recent success of convolutional neural networks (CNNs) in SE, we propose an audio-visual deep CNNs (AVDCNN) SE model, which incorporates audio and visual streams into a un… ▽ More

    Submitted 24 January, 2018; v1 submitted 30 March, 2017; originally announced March 2017.

    Comments: To appear in IEEE Transactions on Emerging Topics in Computational Intelligence. Some audio samples can be reached in this link: https://sites.google.com/view/avse2017

  16. arXiv:1603.07439  [pdf, ps, other

    cs.CR physics.data-an physics.soc-ph stat.AP

    Spatiotemporal patterns and predictability of cyberattacks

    Authors: Yu-Zhong Chen, Zi-Gang Huang, Shouhuai Xu, Ying-Cheng Lai

    Abstract: A relatively unexplored issue in cybersecurity science and engineering is whether there exist intrinsic patterns of cyberattacks. Conventional wisdom favors absence of such patterns due to the overwhelming complexity of the modern cyberspace. Surprisingly, through a detailed analysis of an extensive data set that records the time-dependent frequencies of attacks over a relatively wide range of con… ▽ More

    Submitted 24 March, 2016; originally announced March 2016.

    Journal ref: PLoS One 10(5): e0124472 (2015)

  17. arXiv:1008.0901  [pdf, ps, other

    stat.AP nlin.AO physics.soc-ph

    Convergence to global consensus in opinion dynamics under a nonlinear voter model

    Authors: Han-Xin Yang, Wen-Xu Wang, Ying-Cheng Lai, Bing-Hong Wang

    Abstract: We propose a nonlinear voter model to study the emergence of global consensus in opinion dynamics. In our model, agent $i$ agrees with one of binary opinions with the probability that is a power function of the number of agents holding this opinion among agent $i$ and its nearest neighbors, where an adjustable parameter $α$ controls the effect of herd behavior on consensus. We find that there exis… ▽ More

    Submitted 28 December, 2011; v1 submitted 4 August, 2010; originally announced August 2010.

    Journal ref: Physics Letters A 376 (2012) 282-285

  18. arXiv:0804.4361  [pdf, ps, other

    stat.ME math.ST

    Improving Coverage Accuracy of Block Bootstrap Confidence Intervals

    Authors: Stephen M. S. Lee, P. Y. Lai

    Abstract: The block bootstrap confidence interval based on dependent data can outperform the computationally more convenient normal approximation only with non-trivial Studentization which, in the case of complicated statistics, calls for highly specialist treatment. We propose two different approaches to improving the accuracy of the block bootstrap confidence interval under very general conditions. The… ▽ More

    Submitted 28 April, 2008; originally announced April 2008.

    Report number: Research Report No. 435. Department of Statistics and Actuarial Science, The University of Hong Kong