Showing 1–10 of 10 results for author: Wan, J

  1. arXiv:2404.09703

    cs.LG stat.ML

    AI Competitions and Benchmarks: Dataset Development

    Authors: Romain Egele, Julio C. S. Jacques Junior, Jan N. van Rijn, Isabelle Guyon, Xavier Baró, Albert Clapés, Prasanna Balaprakash, Sergio Escalera, Thomas Moeslund, Jun Wan

    Abstract: Machine learning is now used in many applications thanks to its ability to predict, generate, or discover patterns from large quantities of data. However, the process of collecting and transforming data for practical use is intricate. Even in today's digital era, where substantial data is generated daily, it is uncommon for it to be readily usable; most often, it necessitates meticulous manual dat…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Preprint version of the 3rd Chapter of the book: Competitions and Benchmarks, the science behind the contests (https://sites.google.com/chalearn.org/book/home)

  2. arXiv:2403.06238

    stat.ME

    Quantifying the Uncertainty of Imputed Demographic Disparity Estimates: The Dual-Bootstrap

    Authors: Benjamin Lu, Jia Wan, Derek Ouyang, Jacob Goldin, Daniel E. Ho

    Abstract: Measuring average differences in an outcome across racial or ethnic groups is a crucial first step for equity assessments, but researchers often lack access to data on individuals' races and ethnicities to calculate them. A common solution is to impute the missing race or ethnicity labels using proxies, then use those imputations to estimate the disparity. Conventional standard errors mischaracter…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 31 pages; 7 figures; CRIW Race, Ethnicity, and Economic Statistics for the 21st Century, Spring 2024

  3. arXiv:2312.16607

    eess.IV cs.CV stat.ML

    A Polarization and Radiomics Feature Fusion Network for the Classification of Hepatocellular Carcinoma and Intrahepatic Cholangiocarcinoma

    Authors: Jia Dong, Yao Yao, Liyan Lin, Yang Dong, Jiachen Wan, Ran Peng, Chao Li, Hui Ma

    Abstract: Classifying hepatocellular carcinoma (HCC) and intrahepatic cholangiocarcinoma (ICC) is a critical step in treatment selection and prognosis evaluation for patients with liver diseases. Traditional histopathological diagnosis poses challenges in this context. In this study, we introduce a novel polarization and radiomics feature fusion network, which combines polarization features obtained from Mu…

    Submitted 27 December, 2023; originally announced December 2023.

  4. arXiv:2311.02146

    stat.ML cs.LG math.OC

    Bayesian Optimization of Function Networks with Partial Evaluations

    Authors: Poompol Buathong, Jiayue Wan, Raul Astudillo, Samuel Daulton, Maximilian Balandat, Peter I. Frazier

    Abstract: Bayesian optimization is a powerful framework for optimizing functions that are expensive or time-consuming to evaluate. Recent work has considered Bayesian optimization of function networks (BOFN), where the objective function is given by a network of functions, each taking as input the output of previous nodes in the network as well as additional parameters. Leveraging this network structure has…

    Submitted 12 June, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 34 pages, 15 figures, 3 tables

  5. arXiv:2111.07517

    stat.AP physics.soc-ph q-bio.QM stat.ME

    Correlation Improves Group Testing: Capturing the Dilution Effect

    Authors: Jiayue Wan, Yujia Zhang, Peter I. Frazier

    Abstract: Population-wide screening to identify and isolate infectious individuals is a powerful tool for controlling COVID-19 and other infectious diseases. Group testing can enable such screening despite limited testing resources. Samples' viral loads are often positively correlated, either because prevalence and sample collection are both correlated with geography, or through intentional enhancement, e.g…

    Submitted 5 November, 2023; v1 submitted 14 November, 2021; originally announced November 2021.

    Comments: 66 pages, 10 figures, 15 tables

  6. arXiv:2102.03497

    cs.LG stat.ML

    Weight Rescaling: Effective and Robust Regularization for Deep Neural Networks with Batch Normalization

    Authors: Ziquan Liu, Yufei Cui, Jia Wan, Yu Mao, Antoni B. Chan

    Abstract: Weight decay is often used to ensure good generalization in the training practice of deep neural networks with batch normalization (BN-DNNs), where some convolution layers are invariant to weight rescaling due to the normalization. In this paper, we demonstrate that the practical usage of weight decay still has some unsolved problems in spite of existing theoretical work on explaining the effect o…

    Submitted 17 June, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Preprint

  7. arXiv:2008.06642

    q-bio.PE math.OC stat.ME

    Group Testing Enables Asymptomatic Screening for COVID-19 Mitigation: Feasibility and Optimal Pool Size Selection with Dilution Effects

    Authors: Yifan Lin, Yuxuan Ren, Jingyuan Wan, Massey Cashore, Jiayue Wan, Yujia Zhang, Peter Frazier, Enlu Zhou

    Abstract: Repeated asymptomatic screening for SARS-CoV-2 promises to control spread of the virus but would require too many resources to implement at scale. Group testing is promising for screening more people with fewer test resources: multiple samples tested together in one pool can be excluded with one negative test result. Existing approaches to group testing design for SARS-CoV-2 asymptomatic screening…

    Submitted 16 November, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

  8. arXiv:1909.11532

    q-fin.CP cs.LG stat.ML

    Deep Neural Network Framework Based on Backward Stochastic Differential Equations for Pricing and Hedging American Options in High Dimensions

    Authors: Yangang Chen, Justin W. L. Wan

    Abstract: We propose a deep neural network framework for computing prices and deltas of American options in high dimensions. The architecture of the framework is a sequence of neural networks, where each network learns the difference of the price functions between adjacent timesteps. We introduce the least squares residual of the associated backward stochastic differential equation as the loss function. Our…

    Submitted 25 September, 2019; originally announced September 2019.

    Comments: 35 pages, 11 figures, 15 tables

  9. arXiv:1811.07143

    cs.LG q-bio.QM stat.ML

    High Quality Prediction of Protein Q8 Secondary Structure by Diverse Neural Network Architectures

    Authors: Iddo Drori, Isht Dwivedi, Pranav Shrestha, Jeffrey Wan, Yueqi Wang, Yunchu He, Anthony Mazza, Hugh Krogh-Freeman, Dimitri Leggas, Kendal Sandridge, Linyong Nan, Kaveri Thakoor, Chinmay Joshi, Sonam Goenka, Chen Keasar, Itsik Pe'er

    Abstract: We tackle the problem of protein secondary structure prediction using a common task framework. This led to the introduction of multiple ideas for neural architectures based on state of the art building blocks, used in this task for the first time. We take a principled machine learning approach, which provides genuine, unbiased performance measures, correcting longstanding errors in the applicatio…

    Submitted 17 November, 2018; originally announced November 2018.

    Comments: NIPS 2018 Workshop on Machine Learning for Molecules and Materials, 10 pages

  10. arXiv:1104.1671

    stat.AP

    Density-based Monte Carlo filter and its applications in estimation of unobservable variables and pharmacokinetic parameters

    Authors: Guanghui Huang, Jianping Wan, Hui Chen

    Abstract: Nonlinear stochastic differential equation models with unobservable variables are now widely used in the analysis of PK/PD data. The unobservable variables are often estimated with extended Kalman filter (EKF), and the unknown pharmacokinetic parameters are usually estimated by maximum likelihood estimator. However, EKF is inadequate for nonlinear PK/PD models, and MLE is known to be biased downwa…

    Submitted 5 March, 2012; v1 submitted 9 April, 2011; originally announced April 2011.

    Comments: 15 pages, 1 figure, 2 tables