Skip to main content

Showing 1–50 of 138 results for author: Song, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.13944  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Generalization error of min-norm interpolators in transfer learning

    Authors: Yanke Song, Sohom Bhattacharya, Pragya Sur

    Abstract: This paper establishes the generalization error of pooled min-$\ell_2$-norm interpolation in transfer learning where data from diverse distributions are available. Min-norm interpolators emerge naturally as implicit regularized limits of modern machine learning algorithms. Previous work characterized their out-of-distribution risk when samples from the test distribution are unavailable during trai… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 53 pages, 2 figures

  2. arXiv:2406.11828  [pdf, other

    cs.LG stat.ML

    Learning sum of diverse features: computational hardness and efficient gradient-based training for ridge combinations

    Authors: Kazusato Oko, Yu** Song, Taiji Suzuki, Denny Wu

    Abstract: We study the computational and sample complexity of learning a target function $f_*:\mathbb{R}^d\to\mathbb{R}$ with additive structure, that is, $f_*(x) = \frac{1}{\sqrt{M}}\sum_{m=1}^M f_m(\langle x, v_m\rangle)$, where $f_1,f_2,...,f_M:\mathbb{R}\to\mathbb{R}$ are nonlinear link functions of single-index models (ridge functions) with diverse and near-orthogonal index features $\{v_m\}_{m=1}^M$,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: COLT 2024

  3. arXiv:2406.11184  [pdf, other

    stat.ME math.ST

    HEDE: Heritability estimation in high dimensions by Ensembling Debiased Estimators

    Authors: Yanke Song, Xihong Lin, Pragya Sur

    Abstract: Estimating heritability remains a significant challenge in statistical genetics. Diverse approaches have emerged over the years that are broadly categorized as either random effects or fixed effects heritability methods. In this work, we focus on the latter. We propose HEDE, an ensemble approach to estimate heritability or the signal-to-noise ratio in high-dimensional linear models where the sampl… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 58 pages, 7 figures

  4. arXiv:2406.06149  [pdf, other

    cs.LG stat.ML

    Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations

    Authors: Yujee Song, Donghyun Lee, Rui Meng, Won Hwa Kim

    Abstract: A Marked Temporal Point Process (MTPP) is a stochastic process whose realization is a set of event-time data. MTPP is often used to understand complex dynamics of asynchronous temporal events such as money transaction, social media, healthcare, etc. Recent studies have utilized deep neural networks to capture complex temporal dependencies of events and generate embedding that aptly represent the o… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 8 figures, The Twelfth International Conference on Learning Representations (ICLR 2024)

  5. arXiv:2406.00396  [pdf, other

    cs.LG cond-mat.stat-mech cs.AI stat.ML

    Stochastic Restarting to Overcome Overfitting in Neural Networks with Noisy Labels

    Authors: Youngkyoung Bae, Yeongwoo Song, Hawoong Jeong

    Abstract: Despite its prevalence, giving up and starting over may seem wasteful in many situations such as searching for a target or training deep neural networks (DNNs). Our study, though, demonstrates that restarting from a checkpoint can significantly improve generalization performance when training DNNs with noisy labels. In the presence of noisy labels, DNNs initially learn the general patterns of the… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 21 pages, 10 figures

  6. arXiv:2405.07220  [pdf, other

    cs.LG cs.AI stat.ML

    On Discovery of Local Independence over Continuous Variables via Neural Contextual Decomposition

    Authors: Inwoo Hwang, Yunhyeok Kwak, Yeon-Ji Song, Byoung-Tak Zhang, Sanghack Lee

    Abstract: Conditional independence provides a way to understand causal relationships among the variables of interest. An underlying system may exhibit more fine-grained causal relationships especially between a variable and its parents, which will be called the local independence relationships. One of the most widely studied local relationships is Context-Specific Independence (CSI), which holds in a specif… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Conference on Causal Learning and Reasoning (CLeaR), 2023

  7. arXiv:2402.13259  [pdf, other

    stat.ME cs.CE math.NA math.PR

    Fast Discrete-Event Simulation of Markovian Queueing Networks through Euler Approximation

    Authors: L. Jeff Hong, Yingda Song, Tan Wang

    Abstract: The efficient management of large-scale queueing networks is critical for a variety of sectors, including healthcare, logistics, and customer service, where system performance has profound implications for operational effectiveness and cost management. To address this key challenge, our paper introduces simulation techniques tailored for complex, large-scale Markovian queueing networks. We develop… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  8. arXiv:2401.12824  [pdf, other

    cs.LG stat.ML

    MAPPING: Debiasing Graph Neural Networks for Fair Node Classification with Limited Sensitive Information Leakage

    Authors: Ying Song, Balaji Palanisamy

    Abstract: Despite remarkable success in diverse web-based applications, Graph Neural Networks(GNNs) inherit and further exacerbate historical discrimination and social stereotypes, which critically hinder their deployments in high-stake domains such as online clinical diagnosis, financial crediting, etc. However, current fairness research that primarily craft on i.i.d data, cannot be trivially replicated to… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Finished May last year. Remember to submit all papers to arXiv early without compromising the principles of conferences

  9. arXiv:2311.08384  [pdf, other

    cs.LG cs.AI stat.ML

    Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees

    Authors: Yifei Zhou, Ayush Sekhari, Yuda Song, Wen Sun

    Abstract: Hybrid RL is the setting where an RL agent has access to both offline data and online data by interacting with the real-world environment. In this work, we propose a new hybrid RL algorithm that combines an on-policy actor-critic method with offline data. On-policy methods such as policy gradient and natural policy gradient (NPG) have shown to be more robust to model misspecification, though somet… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: The first two authors contributed equally

  10. arXiv:2310.04367  [pdf

    stat.ML cs.LG

    A Marketplace Price Anomaly Detection System at Scale

    Authors: Akshit Sarpal, Qiwen Kang, Fang** Huang, Yang Song, Lijie Wan

    Abstract: Online marketplaces execute large volume of price updates that are initiated by individual marketplace sellers each day on the platform. This price democratization comes with increasing challenges with data quality. Lack of centralized guardrails that are available for a traditional online retailer causes a higher likelihood for inaccurate prices to get published on the website, leading to poor cu… ▽ More

    Submitted 9 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 10 pages, 4 figures, 7 tables

  11. arXiv:2310.02216  [pdf, other

    stat.AP

    Efficient stochastic generators with spherical harmonic transformation for high-resolution global climate simulations from CESM2-LENS2

    Authors: Yan Song, Zubair Khalid, Marc G. Genton

    Abstract: Earth system models (ESMs) are fundamental for understanding Earth's complex climate system. However, the computational demands and storage requirements of ESM simulations limit their utility. For the newly published CESM2-LENS2 data, which suffer from this issue, we propose a novel stochastic generator (SG) as a practical complement to the CESM2, capable of rapidly producing emulations closely mi… ▽ More

    Submitted 24 May, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

  12. arXiv:2307.00190  [pdf

    stat.AP

    Estimands in Real-World Evidence Studies

    Authors: Jie Chen, Daniel Scharfstein, Hongwei Wang, Binbing Yu, Yang Song, Weili He, John Scott, Xiwu Lin, Hana Lee

    Abstract: A Real-World Evidence (RWE) Scientific Working Group (SWG) of the American Statistical Association Biopharmaceutical Section (ASA BIOP) has been reviewing statistical considerations for the generation of RWE to support regulatory decision-making. As part of the effort, the working group is addressing estimands in RWE studies. Constructing the right estimand -- the target of estimation -- which ref… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

  13. arXiv:2305.07813  [pdf, other

    stat.ME stat.CO

    Fast robust location and scatter estimation: a depth-based method

    Authors: Maoyu Zhang, Yan Song, Wenlin Dai

    Abstract: The minimum covariance determinant (MCD) estimator is ubiquitous in multivariate analysis, the critical step of which is to select a subset of a given size with the lowest sample covariance determinant. The concentration step (C-step) is a common tool for subset-seeking; however, it becomes computationally demanding for high-dimensional data. To alleviate the challenge, we propose a depth-based al… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  14. arXiv:2305.01188  [pdf, other

    stat.AP stat.ME

    Advancing inverse scattering with surrogate modeling and Bayesian inference for functional inputs

    Authors: Chih-Li Sung, Yao Song, Ying Hung

    Abstract: Inverse scattering aims to infer information about a hidden object by using the received scattered waves and training data collected from forward mathematical models. Recent advances in computing have led to increasing attention towards functional inverse inference, which can reveal more detailed properties of a hidden object. However, rigorous studies on functional inverse, including the reconstr… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  15. arXiv:2304.09868  [pdf, other

    cs.LG cs.AI stat.ML

    Accelerate Support Vector Clustering via Spectrum-Preserving Data Compression

    Authors: Yuxuan Song, Yongyu Wang

    Abstract: This paper proposes a novel framework for accelerating support vector clustering. The proposed method first computes much smaller compressed data sets while preserving the key cluster properties of the original data sets based on a novel spectral data compression approach. Then, the resultant spectrally-compressed data sets are leveraged for the development of fast and high quality algorithm for s… ▽ More

    Submitted 14 May, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

  16. arXiv:2304.09132  [pdf, other

    stat.ME

    Independence testing for inhomogeneous random graphs

    Authors: Yukun Song, Carey E. Priebe, Minh Tang

    Abstract: Testing for independence between graphs is a problem that arises naturally in social network analysis and neuroscience. In this paper, we address independence testing for inhomogeneous Erdős-Rényi random graphs on the same vertex set. We first formulate a notion of pairwise correlations between the edges of these graphs and derive a necessary condition for their detectability. We next show that th… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 24 pages, 2 figures

  17. arXiv:2303.01469  [pdf, other

    cs.LG cs.CV stat.ML

    Consistency Models

    Authors: Yang Song, Prafulla Dhariwal, Mark Chen, Ilya Sutskever

    Abstract: Diffusion models have significantly advanced the fields of image, audio, and video generation, but they depend on an iterative sampling process that causes slow generation. To overcome this limitation, we propose consistency models, a new family of models that generate high quality samples by directly map** noise to data. They support fast one-step generation by design, while still allowing mult… ▽ More

    Submitted 31 May, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: ICML 2023

  18. Impact of Event Encoding and Dissimilarity Measures on Traffic Crash Characterization Based on Sequence of Events

    Authors: Yu Song, Madhav V. Chitturi, David A. Noyce

    Abstract: Crash sequence analysis has been shown in prior studies to be useful for characterizing crashes and identifying safety countermeasures. Sequence analysis is highly domain-specific, but its various techniques have not been evaluated for adaptation to crash sequences. This paper evaluates the impact of encoding and dissimilarity measures on crash sequence analysis and clustering. Sequence data of in… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  19. arXiv:2302.01269  [pdf, other

    stat.ME math.ST

    Adjusting for Incomplete Baseline Covariates in Randomized Controlled Trials: A Cross-World Imputation Framework

    Authors: Yilin Song, James P. Hughes, Ting Ye

    Abstract: In randomized controlled trials, adjusting for baseline covariates is often applied to improve the precision of treatment effect estimation. However, missingness in covariates is common. Recently, Zhao & Ding (2022) studied two simple strategies, the single imputation method and missingness indicator method (MIM), to deal with missing covariates, and showed that both methods can provide efficiency… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

  20. arXiv:2212.01168  [pdf, other

    cs.LG cs.AI physics.comp-ph stat.ML

    Towards Cross Domain Generalization of Hamiltonian Representation via Meta Learning

    Authors: Yeongwoo Song, Hawoong Jeong

    Abstract: Recent advances in deep learning for physics have focused on discovering shared representations of target systems by incorporating physics priors or inductive biases into neural networks. While effective, these methods are limited to the system domain, where the type of system remains consistent and thus cannot ensure the adaptation to new, or unseen physical systems governed by different laws. Fo… ▽ More

    Submitted 27 April, 2024; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Conference paper at ICLR 2024

  21. arXiv:2210.16976  [pdf, other

    cs.LG stat.ML

    Representation Learning for General-sum Low-rank Markov Games

    Authors: Chengzhuo Ni, Yuda Song, Xuezhou Zhang, Chi **, Mengdi Wang

    Abstract: We study multi-agent general-sum Markov games with nonlinear function approximation. We focus on low-rank Markov games whose transition matrix admits a hidden low-rank structure on top of an unknown non-linear representation. The goal is to design an algorithm that (1) finds an $\varepsilon$-equilibrium policy sample efficiently without prior knowledge of the environment or the representation, and… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

  22. Consistent Covariance estimation for stratum imbalances under minimization method for covariate-adaptive randomization

    Authors: Zixuan Zhao, Yanglei Song, Wenyu Jiang, Dongsheng Tu

    Abstract: Pocock and Simon's minimization method is a popular approach for covariate-adaptive randomization in clinical trials. Valid statistical inference with data collected under the minimization method requires the knowledge of the limiting covariance matrix of within-stratum imbalances, whose existence is only recently established. In this work, we propose a bootstrap-based estimator for this limit and… ▽ More

    Submitted 26 December, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: 29 pages, peer reviewed version, will appear in Scandinavian Journal of Statistics

  23. Intersection Two-Vehicle Crash Scenario Specification for Automated Vehicle Safety Evaluation Using Sequence Analysis and Bayesian Networks

    Authors: Yu Song, Madhav V. Chitturi, David A. Noyce

    Abstract: This paper develops a test scenario specification procedure using crash sequence analysis and Bayesian network modeling. Intersection two-vehicle crash data was obtained from the 2016 to 2018 National Highway Traffic Safety Administration Crash Report Sampling System database. Vehicles involved in the crashes are specifically renumbered based on their initial positions and trajectories. Crash sequ… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

  24. arXiv:2207.12804  [pdf, other

    stat.ME

    Large-Scale Low-Rank Gaussian Process Prediction with Support Points

    Authors: Yan Song, Wenlin Dai, Marc G. Genton

    Abstract: Low-rank approximation is a popular strategy to tackle the "big n problem" associated with large-scale Gaussian process regressions. Basis functions for develo** low-rank structures are crucial and should be carefully specified. Predictive processes simplify the problem by inducing basis functions with a covariance function and a set of knots. The existing literature suggests certain practical i… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  25. Covariate Adjustment in Randomized Clinical Trials with Missing Covariate and Outcome Data

    Authors: Chia-Rui Chang, Yue Song, Fan Li, Rui Wang

    Abstract: When analyzing data from randomized clinical trials, covariate adjustment can be used to account for chance imbalance in baseline covariates and to increase precision of the treatment effect estimate. A practical barrier to covariate adjustment is the presence of missing data. In this paper, in the light of recent theoretical advancement, we first review several covariate adjustment methods with i… ▽ More

    Submitted 16 May, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

  26. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  27. arXiv:2202.11735  [pdf, other

    stat.ML cs.LG math.ST

    Truncated LinUCB for Stochastic Linear Bandits

    Authors: Yanglei Song, Meng zhou

    Abstract: This paper considers contextual bandits with a finite number of arms, where the contexts are independent and identically distributed $d$-dimensional random vectors, and the expected rewards are linear in both the arm parameters and contexts. The LinUCB algorithm, which is near minimax optimal for related linear bandits, is shown to have a cumulative regret that is suboptimal in both the dimension… ▽ More

    Submitted 17 November, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: A typo corrected: in Lemma 34(ii), it should be \|x\| instead of \|x\|^2. Thus, in the proof of Lemma 3, exp(-r^2) should be exp(-r), which, however, does not affect other parts

  28. arXiv:2112.10992  [pdf, other

    cs.CV stat.ML

    Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition

    Authors: Xiangbo Shu, Jiawen Yang, Rui Yan, Yan Song

    Abstract: This work focuses on the task of elderly activity recognition, which is a challenging task due to the existence of individual actions and human-object interactions in elderly activities. Thus, we attempt to effectively aggregate the discriminative information of actions and interactions from both RGB videos and skeleton sequences by attentively fusing multi-modal features. Recently, some nonlinear… ▽ More

    Submitted 24 April, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

  29. arXiv:2111.11010  [pdf, other

    cs.LG stat.ML

    Density Ratio Estimation via Infinitesimal Classification

    Authors: Kristy Choi, Chenlin Meng, Yang Song, Stefano Ermon

    Abstract: Density ratio estimation (DRE) is a fundamental machine learning technique for comparing two probability distributions. However, existing methods struggle in high-dimensional settings, as it is difficult to accurately compare probability distributions based on finite samples. In this work we propose DRE-\infty, a divide-and-conquer approach to reduce DRE to a series of easier subproblems. Inspired… ▽ More

    Submitted 12 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: First two authors contributed equally

  30. arXiv:2111.08005  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Solving Inverse Problems in Medical Imaging with Score-Based Generative Models

    Authors: Yang Song, Liyue Shen, Lei Xing, Stefano Ermon

    Abstract: Reconstructing medical images from partial measurements is an important inverse problem in Computed Tomography (CT) and Magnetic Resonance Imaging (MRI). Existing solutions based on machine learning typically train a model to directly map measurements to medical images, leveraging a training dataset of paired images and measurements. These measurements are typically synthesized from images using a… ▽ More

    Submitted 15 June, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: Published at ICLR 2022

  31. arXiv:2111.07067  [pdf, other

    stat.ME

    Interquantile Shrinkage in Spatial Quantile Autoregressive Regression models

    Authors: ** Dong, Jiawei Hou, Yunquan Song

    Abstract: Spatial dependent data frequently occur in many fields such as spatial econometrics and epidemiology. To deal with the dependence of variables and estimate quantile-specific effects by covariates, spatial quantile autoregressive models (SQAR models) are introduced. Conventional quantile regression only focuses on the fitting models but ignores the examination of multiple conditional quantile funct… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

  32. arXiv:2111.04726  [pdf, other

    cs.LG stat.ML

    Estimating High Order Gradients of the Data Distribution by Denoising

    Authors: Chenlin Meng, Yang Song, Wenzhe Li, Stefano Ermon

    Abstract: The first order derivative of a data density can be estimated efficiently by denoising score matching, and has become an important component in many applications, such as image generation and audio synthesis. Higher order derivatives provide additional local information about the data distribution and enable new applications. Although they can be estimated via automatic differentiation of a learne… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021

  33. arXiv:2110.00473  [pdf, other

    stat.ML cs.CV cs.LG

    Score-Based Generative Classifiers

    Authors: Roland S. Zimmermann, Lukas Schott, Yang Song, Benjamin A. Dunn, David A. Klindt

    Abstract: The tremendous success of generative models in recent years raises the question whether they can also be used to perform classification. Generative models have been used as adversarially robust classifiers on simple datasets such as MNIST, but this robustness has not been observed on more complex datasets like CIFAR-10. Additionally, on natural image datasets, previous results have suggested a tra… ▽ More

    Submitted 11 December, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: published at https://dgms-and-applications.github.io/2021/ project website https://zimmerrol.github.io/SBGC/

  34. arXiv:2109.15261  [pdf, other

    stat.ME math.PR math.ST q-bio.QM

    A simple and flexible test of sample exchangeability with applications to statistical genomics

    Authors: Alan J. Aw, Jeffrey P. Spence, Yun S. Song

    Abstract: In scientific studies involving analyses of multivariate data, basic but important questions often arise for the researcher: Is the sample exchangeable, meaning that the joint distribution of the sample is invariant to the ordering of the units? Are the features independent of one another, or perhaps the features can be grouped so that the groups are mutually independent? In statistical genomics,… ▽ More

    Submitted 30 August, 2023; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: 24 pages. Supplementary Information file (38 pages, contains mathematical proofs) is available at https://github.com/songlab-cal/flinty/

    MSC Class: 62G10; 62H15; 62P10 ACM Class: G.3

  35. arXiv:2107.03502  [pdf, other

    cs.LG stat.ML

    CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation

    Authors: Yusuke Tashiro, Jiaming Song, Yang Song, Stefano Ermon

    Abstract: The imputation of missing values in time series has many applications in healthcare and finance. While autoregressive models are natural candidates for time series imputation, score-based diffusion models have recently outperformed existing counterparts including autoregressive models in many tasks such as image generation and audio synthesis, and would be promising for time series imputation. In… ▽ More

    Submitted 27 October, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021

  36. arXiv:2106.13097  [pdf, other

    cs.LG stat.ML

    Understanding the Spread of COVID-19 Epidemic: A Spatio-Temporal Point Process View

    Authors: Shuang Li, Lu Wang, Xinyun Chen, Yixiang Fang, Yan Song

    Abstract: Since the first coronavirus case was identified in the U.S. on Jan. 21, more than 1 million people in the U.S. have confirmed cases of COVID-19. This infectious respiratory disease has spread rapidly across more than 3000 counties and 50 states in the U.S. and have exhibited evolutionary clustering and complex triggering patterns. It is essential to understand the complex spacetime intertwined pro… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

  37. arXiv:2105.10590  [pdf, other

    stat.ML cs.LG q-bio.BM q-bio.QM

    Parallelizing Contextual Bandits

    Authors: Jeffrey Chan, Aldo Pacchiano, Nilesh Tripuraneni, Yun S. Song, Peter Bartlett, Michael I. Jordan

    Abstract: Standard approaches to decision-making under uncertainty focus on sequential exploration of the space of decisions. However, \textit{simultaneously} proposing a batch of decisions, which leverages available resources for parallel experimentation, has the potential to rapidly accelerate exploration. We present a family of (parallel) contextual bandit algorithms applicable to problems with bounded e… ▽ More

    Submitted 5 February, 2023; v1 submitted 21 May, 2021; originally announced May 2021.

  38. arXiv:2104.10029  [pdf, other

    cs.CV eess.IV stat.AP

    Multiple Sclerosis Lesion Analysis in Brain Magnetic Resonance Images: Techniques and Clinical Applications

    Authors: Yang Ma, Chaoyi Zhang, Mariano Cabezas, Yang Song, Zihao Tang, Dongnan Liu, Weidong Cai, Michael Barnett, Chenyu Wang

    Abstract: Multiple sclerosis (MS) is a chronic inflammatory and degenerative disease of the central nervous system, characterized by the appearance of focal lesions in the white and gray matter that topographically correlate with an individual patient's neurological symptoms and signs. Magnetic resonance imaging (MRI) provides detailed in-vivo structural information, permitting the quantification and catego… ▽ More

    Submitted 27 January, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted to appear in IEEE Journal of Biomedical And Health Informatics

  39. Automated Vehicle Crash Sequences: Patterns and Potential Uses in Safety Testing

    Authors: Yu Song, Madhav V. Chitturi, David A. Noyce

    Abstract: With safety being one of the primary motivations for develo** automated vehicles (AVs), extensive field and simulation tests are being carried out to ensure AVs can operate safely on roadways. Since 2014, the California DMV has been collecting AV collision and disengagement reports, which are valuable data sources for studying AV crash patterns. In this study, crash sequence data extracted from… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Journal ref: Accident Analysis & Prevention, 153, p.106017 (2021)

  40. arXiv:2102.05291  [pdf, other

    cs.LG cs.AI stat.ML

    Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels

    Authors: Zhaowei Zhu, Yiwen Song, Yang Liu

    Abstract: The label noise transition matrix, characterizing the probabilities of a training instance being wrongly annotated, is crucial to designing popular solutions to learning with noisy labels. Existing works heavily rely on finding "anchor points" or their approximates, defined as instances belonging to a particular class almost surely. Nonetheless, finding anchor points remains a non-trivial task, an… ▽ More

    Submitted 13 July, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: ICML 2021

  41. arXiv:2102.03450  [pdf, other

    cs.LG stat.ML

    Wasserstein Graph Neural Networks for Graphs with Missing Attributes

    Authors: Zhixian Chen, Tengfei Ma, Yangqiu Song, Yang Wang

    Abstract: Missing node attributes is a common problem in real-world graphs. Graph neural networks have been demonstrated power in graph representation learning while their performance is affected by the completeness of graph information. Most of them are not specified for missing-attribute graphs and fail to leverage incomplete attribute information effectively. In this paper, we propose an innovative node… ▽ More

    Submitted 16 February, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

  42. arXiv:2101.09258  [pdf, other

    stat.ML cs.LG

    Maximum Likelihood Training of Score-Based Diffusion Models

    Authors: Yang Song, Conor Durkan, Iain Murray, Stefano Ermon

    Abstract: Score-based diffusion models synthesize samples by reversing a stochastic process that diffuses data to noise, and are trained by minimizing a weighted combination of score matching losses. The log-likelihood of score-based diffusion models can be tractably computed through a connection to continuous normalizing flows, but log-likelihood is not directly optimized by the weighted combination of sco… ▽ More

    Submitted 20 October, 2021; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: NeurIPS 2021 (Spotlight)

  43. arXiv:2101.03288  [pdf, other

    cs.LG stat.ML

    How to Train Your Energy-Based Models

    Authors: Yang Song, Diederik P. Kingma

    Abstract: Energy-Based Models (EBMs), also known as non-normalized probabilistic models, specify probability density or mass functions up to an unknown normalizing constant. Unlike most other probabilistic models, EBMs do not place a restriction on the tractability of the normalizing constant, thus are more flexible to parameterize and can model a more expressive family of probability distributions. However… ▽ More

    Submitted 17 February, 2021; v1 submitted 8 January, 2021; originally announced January 2021.

  44. arXiv:2101.03098  [pdf, other

    math.OC stat.AP

    Optimization Models for Integrated Biorefinery Operations

    Authors: Berkay Gulcan, Sandra D. Eksioglu, Yongjia Song, Mohammad Roni, Qiushi Chen

    Abstract: Variations of physical and chemical characteristics of biomass lead to an uneven flow of biomass in a biorefinery, which reduces equipment utilization and increases operational costs. Uncertainty of biomass supply and high processing costs increase the risk of investing in the US's cellulosic biofuel industry. We propose a stochastic programming model to streamline processes within a biorefinery.… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

  45. arXiv:2012.08125  [pdf, other

    cs.LG stat.ML

    Learning Energy-Based Models by Diffusion Recovery Likelihood

    Authors: Ruiqi Gao, Yang Song, Ben Poole, Ying Nian Wu, Diederik P. Kingma

    Abstract: While energy-based models (EBMs) exhibit a number of desirable properties, training and sampling on high-dimensional datasets remains challenging. Inspired by recent progress on diffusion probabilistic models, we present a diffusion recovery likelihood method to tractably learn and sample from a sequence of EBMs trained on increasingly noisy versions of a dataset. Each EBM is trained with recovery… ▽ More

    Submitted 27 March, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

  46. arXiv:2012.03761  [pdf, ps, other

    math.OC cs.LG stat.CO stat.ML

    Adaptive Sequential SAA for Solving Two-stage Stochastic Linear Programs

    Authors: Raghu Pasupathy, Yongjia Song

    Abstract: We present adaptive sequential SAA (sample average approximation) algorithms to solve large-scale two-stage stochastic linear programs. The iterative algorithm framework we propose is organized into \emph{outer} and \emph{inner} iterations as follows: during each outer iteration, a sample-path problem is implicitly generated using a sample of observations or ``scenarios," and solved only \emph{imp… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  47. arXiv:2011.13456  [pdf, other

    cs.LG stat.ML

    Score-Based Generative Modeling through Stochastic Differential Equations

    Authors: Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, Ben Poole

    Abstract: Creating noise from data is easy; creating data from noise is generative modeling. We present a stochastic differential equation (SDE) that smoothly transforms a complex data distribution to a known prior distribution by slowly injecting noise, and a corresponding reverse-time SDE that transforms the prior distribution back into the data distribution by slowly removing the noise. Crucially, the re… ▽ More

    Submitted 10 February, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: ICLR 2021 (Oral)

  48. arXiv:2010.12810  [pdf, other

    cs.LG stat.ML

    Autoregressive Score Matching

    Authors: Chenlin Meng, Lantao Yu, Yang Song, Jiaming Song, Stefano Ermon

    Abstract: Autoregressive models use chain rule to define a joint probability distribution as a product of conditionals. These conditionals need to be normalized, imposing constraints on the functional families that can be used. To increase flexibility, we propose autoregressive conditional score models (AR-CSM) where we parameterize the joint distribution in terms of the derivatives of univariate log-condit… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020

  49. arXiv:2010.09808  [pdf, other

    cs.LG cs.AI stat.ML

    Imitation with Neural Density Models

    Authors: Kuno Kim, Akshat **dal, Yang Song, Jiaming Song, Yanan Sui, Stefano Ermon

    Abstract: We propose a new framework for Imitation Learning (IL) via density estimation of the expert's occupancy measure followed by Maximum Occupancy Entropy Reinforcement Learning (RL) using the density as a reward. Our approach maximizes a non-adversarial model-free RL objective that provably lower bounds reverse Kullback-Leibler divergence between occupancy measures of the expert and imitator. We prese… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

  50. arXiv:2009.11409  [pdf, other

    stat.AP

    Bayesian Hierarchical Models for High-Dimensional Mediation Analysis with Coordinated Selection of Correlated Mediators

    Authors: Yanyi Song, Xiang Zhou, Jian Kang, Max T. Aung, Min Zhang, Wei Zhao, Belinda L. Needham, Sharon L. R. Kardia, Yongmei Liu, John D. Meeker, Jennifer A. Smith, Bhramar Mukherjee

    Abstract: We consider Bayesian high-dimensional mediation analysis to identify among a large set of correlated potential mediators the active ones that mediate the effect from an exposure variable to an outcome of interest. Correlations among mediators are commonly observed in modern data analysis; examples include the activated voxels within connected regions in brain image data, regulatory signals driven… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.