Showing 1–13 of 13 results for author: Lu, K

Searching in archive stat.
  1. arXiv:2406.01252  [pdf, other]

    cs.CL cs.AI stat.ML

    Towards Scalable Automated Alignment of LLMs: A Survey

    Authors: Boxi Cao, Keming Lu, Xinyu Lu, Jiawei Chen, Mengjie Ren, Hao Xiang, Peilin Liu, Yaojie Lu, Ben He, Xianpei Han, Le Sun, Hongyu Lin, Bowen Yu

    Abstract: Alignment is the most critical step in building large language models (LLMs) that meet human needs. With the rapid development of LLMs gradually surpassing human capabilities, traditional alignment methods based on human-annotation are increasingly unable to meet the scalability demands. Therefore, there is an urgent need to explore new sources of automated alignment signals and technical approach…

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2405.00626  [pdf, other]

    stat.ME

    SARMA: Scalable Low-Rank High-Dimensional Autoregressive Moving Averages via Tensor Decomposition

    Authors: Feiqing Huang, Kexin Lu, Yao Zheng

    Abstract: Existing models for high-dimensional time series are overwhelmingly developed within the finite-order vector autoregressive (VAR) framework, whereas the more flexible vector autoregressive moving averages (VARMA) have been much less considered. This paper introduces a high-dimensional model for capturing VARMA dynamics, namely the Scalable ARMA (SARMA) model, by combining novel reparameterization…

    Submitted 1 May, 2024; originally announced May 2024.

  3. arXiv:2312.00346  [pdf, other]

    stat.ME

    Supervised Factor Modeling for High-Dimensional Linear Time Series

    Authors: Feiqing Huang, Kexin Lu, Guodong Li

    Abstract: Motivated by Tucker tensor decomposition, this paper imposes low-rank structures to the column and row spaces of coefficient matrices in a multivariate infinite-order vector autoregression (VAR), which leads to a supervised factor model with two factor modelings being conducted to responses and predictors simultaneously. Interestingly, the stationarity condition implies an intrinsic weak group spa…

    Submitted 30 November, 2023; originally announced December 2023.

  4. arXiv:2303.02896  [pdf, other]

    stat.ME

    HAR-Ito models and high-dimensional HAR modeling for high-frequency data

    Authors: Huiling Yuan, Kexin Lu, Yifeng Guo, Guodong Li

    Abstract: It is an important task to model realized volatilities for high-frequency data in finance and economics and, as arguably the most popular model, the heterogeneous autoregressive (HAR) model has dominated the applications in this area. However, this model suffers from three drawbacks: (i.) its heterogeneous volatility components are linear combinations of daily realized volatilities with fixed weig…

    Submitted 6 March, 2023; originally announced March 2023.

  5. arXiv:2011.14542  [pdf, other]

    math.ST math.PR stat.ME

    Calibration for multivariate Lévy-driven Ornstein-Uhlenbeck processes with applications to weak subordination

    Authors: Kevin W. Lu

    Abstract: Consider a multivariate Lévy-driven Ornstein-Uhlenbeck process where the stationary distribution or background driving Lévy process is from a parametric family. We derive the likelihood function assuming that the innovation term is absolutely continuous. Two examples are studied in detail: the process where the stationary distribution or background driving Lévy process is given by a weak variance…

    Submitted 31 August, 2021; v1 submitted 29 November, 2020; originally announced November 2020.

    MSC Class: 62M05; 60G51; 60G10

  6. arXiv:2009.03506  [pdf]

    cs.LG stat.ML

    High-throughput relation extraction algorithm development associating knowledge articles and electronic health records

    Authors: Yucong Lin, Keming Lu, Yulin Chen, Chuan Hong, Sheng Yu

    Abstract: Objective: Medical relations are the core components of medical knowledge graphs that are needed for healthcare artificial intelligence. However, the requirement of expert annotation by conventional algorithm development processes creates a major bottleneck for mining new relations. In this paper, we present Hi-RES, a framework for high-throughput relation extraction algorithm development. We also…

    Submitted 7 September, 2020; originally announced September 2020.

  7. arXiv:1912.01188  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Adaptive Online Planning for Continual Lifelong Learning

    Authors: Kevin Lu, Igor Mordatch, Pieter Abbeel

    Abstract: We study learning control in an online reset-free lifelong learning scenario, where mistakes can compound catastrophically into the future and the underlying dynamics of the environment may change. Traditional model-free policy learning methods have achieved successes in difficult tasks due to their broad flexibility, but struggle in this setting, as they can activate failure modes early in their…

    Submitted 27 June, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: Originally published in NeurIPS Deep RL 2019

  8. arXiv:1904.01509  [pdf, other]

    cs.LG cs.CV cs.GR eess.IV stat.ML

    FEAFA: A Well-Annotated Dataset for Facial Expression Analysis and 3D Facial Animation

    Authors: Yanfu Yan, Ke Lu, Jian Xue, Pengcheng Gao, Jiayi Lyu

    Abstract: Facial expression analysis based on machine learning requires large number of well-annotated data to reflect different changes in facial motion. Publicly available datasets truly help to accelerate research in this area by providing a benchmark resource, but all of these datasets, to the best of our knowledge, are limited to rough annotations for action units, including only their absence, presenc…

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: 9 pages, 7 figures

    Journal ref: 2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

  9. arXiv:1903.01610  [pdf, other]

    cs.LG cs.CR stat.ML

    Adversarial Examples on Graph Data: Deep Insights into Attack and Defense

    Authors: Huijun Wu, Chen Wang, Yuriy Tyshetskiy, Andrew Docherty, Kai Lu, Liming Zhu

    Abstract: Graph deep learning models, such as graph convolutional networks (GCN) achieve remarkable performance for tasks on graph data. Similar to other types of deep models, graph deep learning models often suffer from adversarial attacks. However, compared with non-graph data, the discrete features, graph connections and different definitions of imperceptible perturbations bring unique challenges and opp…

    Submitted 22 May, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

    Comments: to appear in IJCAI'19

  10. arXiv:1801.08852  [pdf, other]

    stat.ME q-fin.MF

    Calibration for Weak Variance-Alpha-Gamma Processes

    Authors: Boris Buchmann, Kevin W. Lu, Dilip B. Madan

    Abstract: The weak variance-alpha-gamma process is a multivariate Lévy process constructed by weakly subordinating Brownian motion, possibly with correlated components with an alpha-gamma subordinator. It generalises the variance-alpha-gamma process of Semeraro constructed by traditional subordination. We compare three calibration methods for the weak variance-alpha-gamma process, method of moments, maximum…

    Submitted 27 July, 2018; v1 submitted 26 January, 2018; originally announced January 2018.

    MSC Class: 60G51; 62F10; 60E10

  11. arXiv:1508.00298  [pdf, other]

    stat.ME

    Two-Way Partial AUC and Its Properties

    Authors: Hanfang Yang, Kun Lu, Xiang Lyu, Feifang Hu

    Abstract: When people evaluate the performance of a diagnostic test, it is important to control both True Positive Rate (TPR) and False Positive Rate (FPR). In the literature, most researchers propose the partial area under the ROC curve (pAUC) with restrictions on FPR to assess a binary classification system, which is named as FPR pAUC. It could be artificially designed to measure the area controlled by TP…

    Submitted 20 June, 2017; v1 submitted 2 August, 2015; originally announced August 2015.

  12. arXiv:1403.0829  [pdf]

    cs.CV cs.LG stat.ML

    Multiview Hessian regularized logistic regression for action recognition

    Authors: W. Liu, H. Liu, D. Tao, Y. Wang, Ke Lu

    Abstract: With the rapid development of social media sharing, people often need to manage the growing volume of multimedia data such as large scale video classification and annotation, especially to organize those videos containing human activities. Recently, manifold regularized semi-supervised learning (SSL), which explores the intrinsic data probability distribution and then improves the generalization a…

    Submitted 2 March, 2014; originally announced March 2014.

    Comments: 13 pages, 2 figures, submitted to signal processing

  13. arXiv:1312.6182  [pdf]

    cs.MS cs.LG math.NA stat.ML

    Large-Scale Paralleled Sparse Principal Component Analysis

    Authors: W. Liu, H. Zhang, D. Tao, Y. Wang, K. Lu

    Abstract: Principal component analysis (PCA) is a statistical technique commonly used in multivariate data analysis. However, PCA can be difficult to interpret and explain since the principal components (PCs) are linear combinations of the original variables. Sparse PCA (SPCA) aims to balance statistical fidelity and interpretability by approximating sparse PCs whose projections capture the maximal variance…

    Submitted 20 December, 2013; originally announced December 2013.

    Comments: submitted to Multimedia Tools and Applications