Skip to main content

Showing 1–12 of 12 results for author: Ju, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2312.14162  [pdf

    stat.AP

    Forecasting and Analysis of CSI 300 Daily Index and S&P 500 Index Based on ARMA and GARCH Models

    Authors: Ningyi Li, Chennan Ju, Dexiang Su, Shuyan Wang, Xing Tong

    Abstract: In this paper, the ARMA(0,6)-GARCH(1,1) and ARMA(2,6)-eGARCH(1,1) models are constructed by applying ARMA and GARCH models to daily data of the CSI 300 and S&P 500 indices from 2018 to 2021, and the forecasts for the next 7 steps and the corresponding VaR and ES are calculated. After testing the sensitivity of the models, the two index stocks are compared and the corresponding conclusions are pres… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  2. arXiv:2008.06853  [pdf, other

    cs.LG math.NA stat.ML

    Survey: Geometric Foundations of Data Reduction

    Authors: Ce Ju

    Abstract: This survey is written in summer, 2016. The purpose of this survey is to briefly introduce nonlinear dimensionality reduction (NLDR) in data reduction. The first two NLDR were respectively published in Science in 2000 in which they solve the similar reduction problem of high-dimensional data endowed with the intrinsic nonlinear structure. The intrinsic nonlinear structure is always interpreted as… ▽ More

    Submitted 20 March, 2022; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: 78 pages, Suvery

    ACM Class: I.2

  3. arXiv:2007.01587  [pdf, other

    cs.CR cs.LG stat.ML

    Privacy Threats Against Federated Matrix Factorization

    Authors: Dashan Gao, Ben Tan, Ce Ju, Vincent W. Zheng, Qiang Yang

    Abstract: Matrix Factorization has been very successful in practical recommendation applications and e-commerce. Due to data shortage and stringent regulations, it can be hard to collect sufficient data to build performant recommender systems for a single company. Federated learning provides the possibility to bridge the data silos and build machine learning models without compromising privacy and security.… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: 6 pages, 2 figures, 1 table, Accepted for Workshop on Federated Learning for Data Privacy and Confidentiality in Conjunction with IJCAI 2020 (FL-IJCAI'20)

  4. arXiv:2006.11601  [pdf, other

    cs.LG cs.CR cs.DC stat.ML

    Rethinking Privacy Preserving Deep Learning: How to Evaluate and Thwart Privacy Attacks

    Authors: Lixin Fan, Kam Woh Ng, Ce Ju, Tianyu Zhang, Chang Liu, Chee Seng Chan, Qiang Yang

    Abstract: This paper investigates capabilities of Privacy-Preserving Deep Learning (PPDL) mechanisms against various forms of privacy attacks. First, we propose to quantitatively measure the trade-off between model accuracy and privacy losses incurred by reconstruction, tracing and membership attacks. Second, we formulate reconstruction attacks as solving a noisy system of linear equations, and prove that a… ▽ More

    Submitted 23 June, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: under review, 36 pages (updated Eq. 3 and Fig. 8)

  5. arXiv:1905.08513  [pdf, other

    cs.LG cs.AI stat.ML

    Stochastic Inverse Reinforcement Learning

    Authors: Ce Ju

    Abstract: The goal of the inverse reinforcement learning (IRL) problem is to recover the reward functions from expert demonstrations. However, the IRL problem like any ill-posed inverse problem suffers the congenital defect that the policy may be optimal for many reward functions, and expert demonstrations may be optimal for many policies. In this work, we generalize the IRL problem to a well-posed expectat… ▽ More

    Submitted 23 September, 2022; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: 8+2 pages, 5 figures, Under Review

    ACM Class: I.2.6

  6. arXiv:1806.06784  [pdf, other

    stat.ME stat.CO stat.ML

    Robust inference on the average treatment effect using the outcome highly adaptive lasso

    Authors: Cheng Ju, David Benkeser, Mark J. van der Laan

    Abstract: Many estimators of the average effect of a treatment on an outcome require estimation of the propensity score, the outcome regression, or both. It is often beneficial to utilize flexible techniques such as semiparametric regression or machine learning to estimate these quantities. However, optimal estimation of these regressions does not necessarily lead to optimal estimation of the average treatm… ▽ More

    Submitted 12 May, 2019; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: The first two authors contributed equally to this work

  7. arXiv:1804.00102  [pdf, other

    stat.ME math.ST stat.ML

    Collaborative targeted inference from continuously indexed nuisance parameter estimators

    Authors: Cheng Ju, Antoine Chambaz, Mark J. van der Laan

    Abstract: We wish to infer the value of a parameter at a law from which we sample independent observations. The parameter is smooth and we can define two variation-independent features of the law, its $Q$- and $G$-components, such that estimating them consistently at a fast enough product of rates allows to build a confidence interval (CI) with a given asymptotic level from a plain targeted minimum loss est… ▽ More

    Submitted 5 April, 2018; v1 submitted 30 March, 2018; originally announced April 2018.

    Comments: 38 pages

  8. arXiv:1707.05861  [pdf, other

    stat.ME stat.CO stat.ML

    On Adaptive Propensity Score Truncation in Causal Inference

    Authors: Cheng Ju, Joshua Schwab, Mark J. van der Laan

    Abstract: The positivity assumption, or the experimental treatment assignment (ETA) assumption, is important for identifiability in causal inference. Even if the positivity assumption holds, practical violations of this assumption may jeopardize the finite sample performance of the causal estimator. One of the consequences of practical violations of the positivity assumption is extreme values in the estimat… ▽ More

    Submitted 18 July, 2017; originally announced July 2017.

  9. arXiv:1706.10029  [pdf, other

    stat.ME stat.CO stat.ML

    Collaborative-controlled LASSO for Constructing Propensity Score-based Estimators in High-Dimensional Data

    Authors: Cheng Ju, Richard Wyss, Jessica M. Franklin, Sebastian Schneeweiss, Jenny Häggström, Mark J. van der Laan

    Abstract: Propensity score (PS) based estimators are increasingly used for causal inference in observational studies. However, model selection for PS estimation in high-dimensional data has received little attention. In these settings, PS models have traditionally been selected based on the goodness-of-fit for the treatment mechanism itself, without consideration of the causal parameter of interest. Collabo… ▽ More

    Submitted 30 June, 2017; originally announced June 2017.

  10. arXiv:1704.01664  [pdf, other

    stat.ML cs.CV cs.LG stat.ME

    The Relative Performance of Ensemble Methods with Deep Convolutional Neural Networks for Image Classification

    Authors: Cheng Ju, Aurélien Bibaut, Mark J. van der Laan

    Abstract: Artificial neural networks have been successfully applied to a variety of machine learning tasks, including image recognition, semantic segmentation, and machine translation. However, few studies fully investigated ensembles of artificial neural networks. In this work, we investigated multiple widely used ensemble methods, including unweighted averaging, majority voting, the Bayes Optimal Classifi… ▽ More

    Submitted 5 April, 2017; originally announced April 2017.

  11. arXiv:1703.02237  [pdf, other

    stat.CO stat.ME

    Scalable Collaborative Targeted Learning for High-Dimensional Data

    Authors: Cheng Ju, Susan Gruber, Samuel D. Lendle, Antoine Chambaz, Jessica M. Franklin, Richard Wyss, Sebastian Schneeweiss, Mark J. van der Laan

    Abstract: Robust inference of a low-dimensional parameter in a large semi-parametric model relies on external estimators of infinite-dimensional features of the distribution of the data. Typically, only one of the latter is optimized for the sake of constructing a well behaved estimator of the low-dimensional parameter of interest. Optimizing more than one of them for the sake of achieving a better bias-var… ▽ More

    Submitted 7 March, 2017; originally announced March 2017.

  12. arXiv:1703.02236  [pdf, other

    stat.AP stat.ML

    Propensity score prediction for electronic healthcare databases using Super Learner and High-dimensional Propensity Score Methods

    Authors: Cheng Ju, Mary Combs, Samuel D Lendle, Jessica M Franklin, Richard Wyss, Sebastian Schneeweiss, Mark J. van der Laan

    Abstract: The optimal learner for prediction modeling varies depending on the underlying data-generating distribution. Super Learner (SL) is a generic ensemble learning algorithm that uses cross-validation to select among a "library" of candidate prediction models. The SL is not restricted to a single prediction model, but uses the strengths of a variety of learning algorithms to adapt to different database… ▽ More

    Submitted 14 March, 2017; v1 submitted 7 March, 2017; originally announced March 2017.