
Showing 1–50 of 225 results for author: Chen, T

Searching in archive stat.

  1. arXiv:2406.12525  [pdf, other]

    cs.SI physics.soc-ph stat.AP

    Anatomy of Elite and Mass Polarization in Social Networks

    Authors: Ali Salloum, Ted Hsuan Yun Chen, Mikko Kivelä

    Abstract: Existing methods for quantifying polarization in social networks typically report a single value describing the amount of polarization in a social system. While this approach can be used to confirm the observation that many societies have witnessed an increase in political polarization in recent years, it misses the complexities that could be used to understand the reasons behind this phenomenon.…

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.10148  [pdf, other]

    math.OC cs.LG stat.ML

    A Primal-Dual-Assisted Penalty Approach to Bilevel Optimization with Coupled Constraints

    Authors: Liuyuan Jiang, Quan Xiao, Victor M. Tenorio, Fernando Real-Rojas, Antonio Marques, Tianyi Chen

    Abstract: Interest in bilevel optimization has grown in recent years, partially due to its applications to tackle challenging machine-learning problems. Several exciting recent works have been centered around developing efficient gradient-based algorithms that can solve bilevel optimization problems with provable guarantees. However, the existing literature mainly focuses on bilevel problems either without…

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2406.04713  [pdf, other]

    cs.LG cond-mat.mtrl-sci cs.AI physics.comp-ph stat.ML

    FlowMM: Generating Materials with Riemannian Flow Matching

    Authors: Benjamin Kurt Miller, Ricky T. Q. Chen, Anuroop Sriram, Brandon M Wood

    Abstract: Crystalline materials are a fundamental component in next-generation technologies, yet modeling their distribution presents unique computational challenges. Of the plausible arrangements of atoms in a periodic lattice only a vanishingly small percentage are thermodynamically stable, which is a key indicator of the materials that can be experimentally realized. Two fundamental tasks in this area ar…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: https://github.com/facebookresearch/flowmm

    Journal ref: ICML 2024

  4. arXiv:2406.00288  [pdf, other]

    cs.LG stat.ML

    Neural Optimal Transport with Lagrangian Costs

    Authors: Aram-Alexandre Pooladian, Carles Domingo-Enrich, Ricky T. Q. Chen, Brandon Amos

    Abstract: We investigate the optimal transport problem between probability measures when the underlying cost function is understood to satisfy a least action principle, also known as a Lagrangian cost. These generalizations are useful when connecting observations from a physical system where the transport dynamics are influenced by the geometry of the system, such as obstacles (e.g., incorporating barrier f…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: UAI 2024

  5. arXiv:2405.16381  [pdf, other]

    cs.LG cs.AI stat.ML

    Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups

    Authors: Yuchen Zhu, Tianrong Chen, Lingkai Kong, Evangelos A. Theodorou, Molei Tao

    Abstract: The generative modeling of data on manifold is an important task, for which diffusion models in flat spaces typically need nontrivial adaptations. This article demonstrates how a technique called `trivialization' can transfer the effectiveness of diffusion models in Euclidean spaces to Lie groups. In particular, an auxiliary momentum variable was algorithmically introduced to help transport the po…

    Submitted 25 May, 2024; originally announced May 2024.

  6. arXiv:2405.15920  [pdf, other]

    cs.LG stat.ML

    SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

    Authors: Shuai Zhang, Heshan Devaka Fernando, Miao Liu, Keerthiram Murugesan, Songtao Lu, Pin-Yu Chen, Tianyi Chen, Meng Wang

    Abstract: This paper studies the transfer reinforcement learning (RL) problem where multiple RL problems have different reward functions but share the same underlying transition dynamics. In this setting, the Q-function of each RL problem (task) can be decomposed into a successor feature (SF) and a reward mapping: the former characterizes the transition dynamics, and the latter characterizes the task-specif…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.16173

  7. arXiv:2405.11111  [pdf, other]

    stat.ME

    Euclidean mirrors and first-order changepoints in network time series

    Authors: Tianyi Chen, Zachary Lubberts, Avanti Athreya, Youngser Park, Carey E. Priebe

    Abstract: We describe a model for a network time series whose evolution is governed by an underlying stochastic process, known as the latent position process, in which network evolution can be represented in Euclidean space by a curve, called the Euclidean mirror. We define the notion of a first-order changepoint for a time series of networks, and construct a family of latent position process networks with…

    Submitted 17 May, 2024; originally announced May 2024.

  8. arXiv:2405.07098  [pdf, other]

    cs.LG cs.AI math-ph math.OC stat.ML

    Interpretable global minima of deep ReLU neural networks on sequentially separable data

    Authors: Thomas Chen, Patricia Muñoz Ewald

    Abstract: We explicitly construct zero loss neural network classifiers. We write the weight matrices and bias vectors in terms of cumulative parameters, which determine truncation maps acting recursively on input space. The configurations for the training data considered are (i) sufficiently small, well separated clusters corresponding to each class, and (ii) equivalence classes which are sequentially linea…

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: AMS Latex, 22 pages, 3 figures

    MSC Class: 57R70; 62M45

  9. arXiv:2404.06336  [pdf, other]

    quant-ph cs.LG stat.ML

    Quantum State Generation with Structure-Preserving Diffusion Model

    Authors: Yuchen Zhu, Tianrong Chen, Evangelos A. Theodorou, Xie Chen, Molei Tao

    Abstract: This article considers the generative modeling of the (mixed) states of quantum systems, and an approach based on denoising diffusion model is proposed. The key contribution is an algorithmic innovation that respects the physical nature of quantum states. More precisely, the commonly used density matrix representation of mixed-state has to be complex-valued Hermitian, positive semi-definite, and t…

    Submitted 25 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  10. arXiv:2403.02060  [pdf, other]

    stat.ME

    Expectile Periodograms

    Authors: Tianbo Chen

    Abstract: In this paper, we introduce a periodogram-like function, called expectile periodograms, for detecting and estimating hidden periodicity from observations with asymmetrically distributed noise. The expectile periodograms are constructed from trigonometric expectile regression where a specially designed objective function is used to substitute the squared $l_2$ norm that leads to the ordinary period…

    Submitted 4 March, 2024; originally announced March 2024.

  11. arXiv:2402.06886  [pdf, other]

    cs.LG math.OC stat.ML

    Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

    Authors: Han Shen, Zhuoran Yang, Tianyi Chen

    Abstract: Bilevel optimization has been recently applied to many machine learning tasks. However, their applications have been restricted to the supervised learning setting, where static objective functions with benign structures are considered. But bilevel problems such as incentive design, inverse reinforcement learning (RL), and RL from human feedback (RLHF) are often modeled as dynamic objective functio…

    Submitted 31 May, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Shorter version accepted to ICML 2024

  12. arXiv:2401.06980  [pdf, other]

    cs.CL cs.LG stat.ML

    Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization

    Authors: A F M Saif, Xiaodong Cui, Han Shen, Songtao Lu, Brian Kingsbury, Tianyi Chen

    Abstract: In this paper, we present a novel bilevel optimization-based training approach to training acoustic models for automatic speech recognition (ASR) tasks that we term bi-level joint unsupervised and supervised training (BL-JUST). BL-JUST employs a lower and upper level optimization with an unsupervised loss and a supervised loss respectively, leveraging recent advances in penalty-based bilevel op…

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted in ICASSP-2024 conference

  13. arXiv:2401.03482  [pdf, other]

    cs.LG stat.ML

    Uncertainty Quantification on Clinical Trial Outcome Prediction

    Authors: Tianyi Chen, Yingzhou Lu, Nan Hao, Capucine Van Rechem, Jintai Chen, Tianfan Fu

    Abstract: The importance of uncertainty quantification is increasingly recognized in the diverse field of machine learning. Accurately assessing model prediction uncertainty can help provide deeper understanding and confidence for researchers and practitioners. This is especially critical in medical diagnosis and drug discovery areas, where reliable predictions directly impact research quality and patient h…

    Submitted 18 June, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  14. arXiv:2401.03228  [pdf, other]

    stat.ML cs.LG

    Reflected Schrödinger Bridge for Constrained Generative Modeling

    Authors: Wei Deng, Yu Chen, Nicole Tianjiao Yang, Hengrong Du, Qi Feng, Ricky T. Q. Chen

    Abstract: Diffusion models have become the go-to method for large-scale generative models in real-world applications. These applications often involve data distributions confined within bounded domains, typically requiring ad-hoc thresholding techniques for boundary enforcement. Reflected diffusion models (Lou23) aim to enhance generalizability by generating the data distribution through a backward process…

    Submitted 6 January, 2024; originally announced January 2024.

  15. arXiv:2312.13331  [pdf, other]

    stat.ME stat.AP

    A Bayesian Spatial Berkson error approach to estimate small area opioid mortality rates accounting for population-at-risk uncertainty

    Authors: Emily N Peterson, Rachel C. Nethery, Jarvis T. Chen, Loni P. Tabb, Brent A. Coull, Frederic B. Piel, Lance A Waller

    Abstract: Monitoring small-area geographical population trends in opioid mortality has large scale implications to informing preventative resource allocation. A common approach to obtain small area estimates of opioid mortality is to use a standard disease mapping approach in which population-at-risk estimates are treated as fixed and known. Assuming fixed populations ignores the uncertainty surrounding sma…

    Submitted 20 December, 2023; originally announced December 2023.

  16. arXiv:2312.05250  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    TaskMet: Task-Driven Metric Learning for Model Learning

    Authors: Dishank Bansal, Ricky T. Q. Chen, Mustafa Mukadam, Brandon Amos

    Abstract: Deep learning models are often deployed in downstream tasks that the training procedure may not be aware of. For example, models solely trained to achieve accurate predictions may struggle to perform well on downstream tasks because seemingly small prediction errors may incur drastic task errors. The standard end-to-end learning approach is to make the task loss differentiable or to introduce a di…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  17. arXiv:2312.02027  [pdf, other]

    math.OC cs.LG math.NA math.PR stat.ML

    Stochastic Optimal Control Matching

    Authors: Carles Domingo-Enrich, Jiequn Han, Brandon Amos, Joan Bruna, Ricky T. Q. Chen

    Abstract: Stochastic optimal control, which has the goal of driving the behavior of noisy systems, is broadly applicable in science, engineering and artificial intelligence. Our work introduces Stochastic Optimal Control Matching (SOCM), a novel Iterative Diffusion Optimization (IDO) technique for stochastic optimal control that stems from the same philosophy as the conditional score matching loss for diffu…

    Submitted 28 June, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  18. arXiv:2312.01260  [pdf, other]

    cs.LG cs.CR stat.ML

    Rethinking PGD Attack: Is Sign Function Necessary?

    Authors: Junjie Yang, Tianlong Chen, Xuxi Chen, Zhangyang Wang, Yingbin Liang

    Abstract: Neural networks have demonstrated success in various domains, yet their performance can be significantly degraded by even a small input perturbation. Consequently, the construction of such perturbations, known as adversarial attacks, has gained significant attention, many of which fall within "white-box" scenarios where we have full access to the neural network. Existing attack algorithms, such as…

    Submitted 20 May, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  19. arXiv:2311.16375  [pdf, other]

    stat.ME q-bio.QM stat.AP

    Testing for a difference in means of a single feature after clustering

    Authors: Yiqun T. Chen, Lucy L. Gao

    Abstract: For many applications, it is critical to interpret and validate groups of observations obtained via clustering. A common validation approach involves testing differences in feature means between observations in two estimated clusters. In this setting, classical hypothesis tests lead to an inflated Type I error rate. To overcome this problem, we propose a new test for the difference in means in a s…

    Submitted 27 November, 2023; originally announced November 2023.

    MSC Class: 62H30; 62H15; 62P10

  20. arXiv:2311.15487  [pdf, ps, other]

    cs.LG cs.AI math-ph math.OC stat.ML

    Global $\mathcal{L}^2$ minimization at uniform exponential rate via geometrically adapted gradient descent in Deep Learning

    Authors: Thomas Chen

    Abstract: We consider the scenario of supervised learning in Deep Learning (DL) networks, and exploit the arbitrariness of choice in the Riemannian metric relative to which the gradient descent flow can be defined (a general fact of differential geometry). In the standard approach to DL, the gradient flow on the space of parameters (weights and biases) is defined with respect to the Euclidean metric. Here i…

    Submitted 10 April, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: AMS Latex, 20 pages. Significantly edited and extended, abstract changed

    MSC Class: 57R70; 62M45

  21. arXiv:2311.13443  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Guided Flows for Generative Modeling and Decision Making

    Authors: Qinqing Zheng, Matt Le, Neta Shaul, Yaron Lipman, Aditya Grover, Ricky T. Q. Chen

    Abstract: Classifier-free guidance is a key component for enhancing the performance of conditional generative models across diverse tasks. While it has previously demonstrated remarkable improvements for the sample quality, it has only been exclusively employed for diffusion models. In this paper, we integrate classifier-free guidance into Flow Matching (FM) models, an alternative simulation-free approach t…

    Submitted 7 December, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

  22. arXiv:2311.11913  [pdf, other]

    cs.LG q-fin.CP stat.ML

    Deep Calibration of Market Simulations using Neural Density Estimators and Embedding Networks

    Authors: Namid R. Stillman, Rory Baggott, Justin Lyon, Jianfei Zhang, Dingqiu Zhu, Tao Chen, Perukrishnen Vytelingum

    Abstract: The ability to construct a realistic simulator of financial exchanges, including reproducing the dynamics of the limit order book, can give insight into many counterfactual scenarios, such as a flash crash, a margin call, or changes in macroeconomic outlook. In recent years, agent-based models have been developed that reproduce many features of an exchange, as summarised by a set of stylised facts…

    Submitted 27 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 4th ACM International Conference on AI in Finance (ICAIF 2023)

  23. arXiv:2311.07065  [pdf, ps, other]

    cs.LG cs.AI math-ph math.OC stat.ML

    Non-approximability of constructive global $\mathcal{L}^2$ minimizers by gradient descent in Deep Learning

    Authors: Thomas Chen, Patricia Muñoz Ewald

    Abstract: We analyze geometric aspects of the gradient descent algorithm in Deep Learning (DL) networks. In particular, we prove that the globally minimizing weights and biases for the $\mathcal{L}^2$ cost obtained constructively in [Chen-Munoz Ewald 2023] for underparametrized ReLU DL networks can generically not be approximated via the gradient descent flow. We therefore conclude that the method introduce…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: AMS Latex, 7 pages

    MSC Class: 57R70; 62M45

  24. arXiv:2311.06978  [pdf, other]

    cs.LG cs.CV stat.ML

    Augmented Bridge Matching

    Authors: Valentin De Bortoli, Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou, Weilie Nie

    Abstract: Flow and bridge matching are a novel class of processes which encompass diffusion models. One of the main aspect of their increased flexibility is that these models can interpolate between arbitrary data distributions i.e. they generalize beyond generative modeling and can be applied to learning stochastic (and deterministic) processes of arbitrary transfer tasks between two given distributions. I…

    Submitted 12 November, 2023; originally announced November 2023.

  25. arXiv:2311.04691  [pdf]

    stat.AP cs.CE

    Sustainable Collaborative Strategy in Pharmaceutical Refrigerated Logistics Routing Problem

    Authors: Tingting Chen, Feng Chu, Jiantong Zhang, Jiaqing Sun

    Abstract: The rapid growth of pharmaceutical refrigerated logistics poses sustainability challenges, including elevated costs, energy consumption, and resource inefficiency. Collaborating multiple depots can enhance logistics efficiency when standalone distribution centers have limited transport resources, i.e., refrigerated vehicles. However, the sustainable benefits and performance across different strate…

    Submitted 8 November, 2023; originally announced November 2023.

  26. arXiv:2310.08649  [pdf, other]

    math.NA cs.LG stat.ML

    Time-vectorized numerical integration for systems of ODEs

    Authors: Mark C. Messner, Tianchen Hu, Tianju Chen

    Abstract: Stiff systems of ordinary differential equations (ODEs) and sparse training data are common in scientific problems. This paper describes efficient, implicit, vectorized methods for integrating stiff systems of ordinary differential equations through time and calculating parameter gradients with the adjoint method. The main innovation is to vectorize the problem both over the number of independent…

    Submitted 12 October, 2023; originally announced October 2023.

  27. arXiv:2310.02679  [pdf, other]

    cs.LG cs.AI stat.CO stat.ME stat.ML

    Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory optimization

    Authors: Dinghuai Zhang, Ricky T. Q. Chen, Cheng-Hao Liu, Aaron Courville, Yoshua Bengio

    Abstract: We tackle the problem of sampling from intractable high-dimensional density functions, a fundamental task that often appears in machine learning and statistics. We extend recent sampling-based approaches that leverage controlled stochastic processes to model approximate samples from these target densities. The main drawback of these approaches is that the training objective requires full trajector…

    Submitted 9 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024

  28. arXiv:2310.02233  [pdf, other]

    stat.ML cs.LG math.OC

    Generalized Schrödinger Bridge Matching

    Authors: Guan-Horng Liu, Yaron Lipman, Maximilian Nickel, Brian Karrer, Evangelos A. Theodorou, Ricky T. Q. Chen

    Abstract: Modern distribution matching algorithms for training diffusion or flow models directly prescribe the time evolution of the marginal distributions between two boundary distributions. In this work, we consider a generalized distribution matching setup, where these marginals are only implicitly described as a solution to some task-specific objective function. The problem setup, known as the Generaliz…

    Submitted 18 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Camera Ready

  29. arXiv:2310.01236  [pdf, other]

    stat.ML cs.CV cs.LG

    Mirror Diffusion Models for Constrained and Watermarked Generation

    Authors: Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou, Molei Tao

    Abstract: Modern successes of diffusion models in learning complex, high-dimensional data distributions are attributed, in part, to their capability to construct diffusion processes with analytic transition kernels and score functions. The tractability results in a simulation-free framework with stable regression losses, from which reversed, generative processes can be learned at scale. However, when data i…

    Submitted 29 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: submitted to NeurIPS on 5/18 but did not arxiv per NeurIPS policy, accepted on 9/22

  30. arXiv:2309.10639  [pdf, ps, other]

    cs.LG cs.AI math-ph math.OC stat.ML

    Geometric structure of Deep Learning networks and construction of global ${\mathcal L}^2$ minimizers

    Authors: Thomas Chen, Patricia Muñoz Ewald

    Abstract: In this paper, we explicitly determine local and global minimizers of the $\mathcal{L}^2$ cost function in underparametrized Deep Learning (DL) networks; our main goal is to shed light on their geometric structure and properties. We accomplish this by a direct construction, without invoking the gradient descent flow at any point of this work. We specifically consider $L$ hidden layers, a ReLU ramp…

    Submitted 14 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: AMS Latex, 22 pages. Typos corrected, slightly extended

    MSC Class: 57R70; 62M45

  31. arXiv:2309.10370  [pdf, ps, other]

    cs.LG cs.AI math-ph math.OC stat.ML

    Geometric structure of shallow neural networks and constructive ${\mathcal L}^2$ cost minimization

    Authors: Thomas Chen, Patricia Muñoz Ewald

    Abstract: In this paper, we approach the problem of cost (loss) minimization in underparametrized shallow neural networks through the explicit construction of upper bounds, without any use of gradient descent. A key focus is on elucidating the geometric structure of approximate and precise minimizers. We consider shallow neural networks with one hidden layer, a ReLU activation function, an ${\mathcal L}^2$…

    Submitted 17 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: AMS Latex, 28 pages. Exposition has been streamlined

    MSC Class: 57R70; 62M45

  32. arXiv:2309.08165  [pdf, other]

    cs.LG cs.AI stat.ME

    To Predict or to Reject: Causal Effect Estimation with Uncertainty on Networked Data

    Authors: Hechuan Wen, Tong Chen, Li Kheng Chai, Shazia Sadiq, Kai Zheng, Hongzhi Yin

    Abstract: Due to the imbalanced nature of networked observational data, the causal effect predictions for some individuals can severely violate the positivity/overlap assumption, rendering unreliable estimations. Nevertheless, this potential risk of individual-level treatment effect estimation on networked data has been largely under-explored. To create a more trustworthy causal effect estimator, we propose…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted by ICDM'23

  33. arXiv:2309.07867  [pdf, other]

    cs.LG cs.AI stat.CO stat.ME stat.ML

    Beta Diffusion

    Authors: Mingyuan Zhou, Tianqi Chen, Zhendong Wang, Huangjie Zheng

    Abstract: We introduce beta diffusion, a novel generative modeling method that integrates demasking and denoising to generate data within bounded ranges. Using scaled and shifted beta distributions, beta diffusion utilizes multiplicative transitions over time to create both forward and reverse diffusion processes, maintaining beta distributions in both the forward marginals and the reverse conditionals, giv…

    Submitted 24 December, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: NeurIPS 2023

  34. arXiv:2308.08177  [pdf]

    stat.AP

    Developing a Conceptual Tribal Crash Safety Dashboard: Data-Driven Strategies for Identifying High-Risk Areas and Enhancing Tribal Safety Programs

    Authors: Tianyi Chen, Haotian Shi, Steven T. Parker, Glenn Vorhes, David A Noyce, Bin Ran

    Abstract: Tribal lands in the United States have consistently exhibited higher crash rates and injury severities compared to other regions. To address this issue, effective data-driven safety analysis methods are essential for resource allocation and tribal safety program development. This study outlines the minimum data requirements and presents a generic tribal crash dashboard prototype to enable feasible…

    Submitted 16 August, 2023; originally announced August 2023.

  35. arXiv:2306.17361  [pdf, other]

    cs.LG cs.AI stat.AP stat.ME stat.ML

    iSCAN: Identifying Causal Mechanism Shifts among Nonlinear Additive Noise Models

    Authors: Tianyu Chen, Kevin Bello, Bryon Aragam, Pradeep Ravikumar

    Abstract: Structural causal models (SCMs) are widely used in various disciplines to represent causal relationships among variables in complex systems. Unfortunately, the underlying causal structure is often unknown, and estimating it from data remains a challenging task. In many situations, however, the end goal is to localize the changes (shifts) in the causal mechanisms between related datasets instead of…

    Submitted 12 January, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 36 pages, 18 figures. Published at NeurIPS 2023

  36. arXiv:2306.13271  [pdf, other]

    cs.LG stat.ML

    Variational Counterfactual Prediction under Runtime Domain Corruption

    Authors: Hechuan Wen, Tong Chen, Li Kheng Chai, Shazia Sadiq, Junbin Gao, Hongzhi Yin

    Abstract: To date, various neural methods have been proposed for causal effect estimation based on observational data, where a default assumption is the same distribution and availability of variables at both training and inference (i.e., runtime) stages. However, distribution shift (i.e., domain shift) could happen during runtime, and bigger challenges arise from the impaired accessibility of variables. Th…

    Submitted 22 June, 2023; originally announced June 2023.

  37. arXiv:2306.06626  [pdf, other]

    cs.LG stat.ML

    On Kinetic Optimal Probability Paths for Generative Models

    Authors: Neta Shaul, Ricky T. Q. Chen, Maximilian Nickel, Matt Le, Yaron Lipman

    Abstract: Recent successful generative models are trained by fitting a neural network to an a-priori defined tractable probability density path taking noise to training examples. In this paper we investigate the space of Gaussian probability paths, which includes diffusion paths as an instance, and look for an optimal member in some useful sense. In particular, minimizing the Kinetic Energy (KE) of a path i…

    Submitted 11 June, 2023; originally announced June 2023.

  38. arXiv:2305.18375  [pdf, other]

    cs.LG stat.ME stat.ML

    Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling

    Authors: Tianqi Chen, Mingyuan Zhou

    Abstract: Learning to denoise has emerged as a prominent paradigm to design state-of-the-art deep generative models for natural images. How to use it to model the distributions of both continuous real-valued data and categorical data has been well studied in recently proposed diffusion models. However, it is found in this paper to have limited ability in modeling some other types of data, such as count and…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  39. arXiv:2303.04871  [pdf, other]

    stat.AP

    Discovering a change point and piecewise linear structure in a time series of organoid networks via the iso-mirror

    Authors: Tianyi Chen, Youngser Park, Ali Saad-Eldin, Zachary Lubberts, Avanti Athreya, Benjamin D. Pedigo, Joshua T. Vogelstein, Francesca Puppo, Gabriel A. Silva, Alysson R. Muotri, Weiwei Yang, Christopher M. White, Carey E. Priebe

    Abstract: Recent advancements have been made in the development of cell-based in-vitro neuronal networks, or organoids. In order to better understand the network structure of these organoids, a super-selective algorithm has been proposed for inferring the effective connectivity networks from multi-electrode array data. In this paper, we apply a novel statistical method called spectral mirror estimation to t…

    Submitted 12 April, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

  40. arXiv:2303.01751  [pdf, other]

    stat.ML cs.LG

    Deep Momentum Multi-Marginal Schrödinger Bridge

    Authors: Tianrong Chen, Guan-Horng Liu, Molei Tao, Evangelos A. Theodorou

    Abstract: It is a crucial challenge to reconstruct population dynamics using unlabeled samples from distributions at coarse time intervals. Recent approaches such as flow-based models or Schrödinger Bridge (SB) models have demonstrated appealing performance, yet the inferred sample trajectories either fail to account for the underlying stochasticity or are $\underline{D}$eep $\underline{M}$omentum Multi-Mar…

    Submitted 5 October, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

  41. arXiv:2303.00039  [pdf, other]

    cs.LG stat.ML

    M-L2O: Towards Generalizable Learning-to-Optimize by Test-Time Fast Self-Adaptation

    Authors: Junjie Yang, Xuxi Chen, Tianlong Chen, Zhangyang Wang, Yingbin Liang

    Abstract: Learning to Optimize (L2O) has drawn increasing attention as it often remarkably accelerates the optimization procedure of complex tasks by "overfitting" specific task type, leading to enhanced performance compared to analytical optimizers. Generally, L2O develops a parameterized optimization method (i.e., "optimizer") by learning from solving sample problems. This data-driven procedure yields L…

    Submitted 28 February, 2023; originally announced March 2023.

    Comments: This paper is accepted in ICLR 2023

  42. arXiv:2302.11085  [pdf, other]

    cs.LG stat.ML

    Learning to Generalize Provably in Learning to Optimize

    Authors: Junjie Yang, Tianlong Chen, Mingkang Zhu, Fengxiang He, Dacheng Tao, Yingbin Liang, Zhangyang Wang

    Abstract: Learning to optimize (L2O) has gained increasing popularity, which automates the design of optimizers by data-driven approaches. However, current L2O methods often suffer from poor generalization performance in at least two folds: (i) applying the L2O-learned optimizer to unseen optimizees, in terms of lowering their loss function values (optimizer generalization, or "generalizable learning of op…

    Submitted 28 March, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: This paper is accepted in AISTATS 2023

  43. arXiv:2302.10324  [pdf, other]

    stat.AP stat.ME

    Bayesian subtyping for multi-state brain functional connectome with application on adolescent brain cognition

    Authors: Tianqi Chen, Chichun Tan, Hongyu Zhao, Todd Constable, Sarah Yip, Yize Zhao

    Abstract: Converging evidence indicates that the heterogeneity of cognitive profiles may arise through detectable alternations in brain functions. Particularly, brain functional connectivity, measured under resting and cognitive states, characterizes the unique neuronal interconnections across large-scale brain networks. Despite an unprecedented opportunity to uncover neurobiological subtypes through cluste…

    Submitted 20 February, 2023; originally announced February 2023.

  44. arXiv:2302.05793  [pdf, other]

    cs.LG cs.AI stat.CO stat.ML

    Distributional GFlowNets with Quantile Flows

    Authors: Dinghuai Zhang, Ling Pan, Ricky T. Q. Chen, Aaron Courville, Yoshua Bengio

    Abstract: Generative Flow Networks (GFlowNets) are a new family of probabilistic samplers where an agent learns a stochastic policy for generating complex combinatorial structure through a series of decision-making steps. Despite being inspired from reinforcement learning, the current GFlowNet framework is relatively limited in its applicability and cannot handle stochasticity in the reward function. In thi…

    Submitted 17 February, 2024; v1 submitted 11 February, 2023; originally announced February 2023.

    Comments: Accepted by TMLR

  45. arXiv:2302.05185  [pdf, other]

    cs.LG math.OC stat.ML

    On Penalty-based Bilevel Gradient Descent Method

    Authors: Han Shen, Quan Xiao, Tianyi Chen

    Abstract: Bilevel optimization enjoys a wide range of applications in hyper-parameter optimization, meta-learning and reinforcement learning. However, bilevel optimization problems are difficult to solve. Recent progress on scalable bilevel algorithms mainly focuses on bilevel optimization problems where the lower-level objective is either strongly convex or unconstrained. In this work, we tackle the bileve…

    Submitted 12 September, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: Improved Section 4 by removing a critical assumption; Added Section 5 and citations

  46. arXiv:2302.03660  [pdf, other]

    cs.LG cs.AI stat.ML

    Flow Matching on General Geometries

    Authors: Ricky T. Q. Chen, Yaron Lipman

    Abstract: We propose Riemannian Flow Matching (RFM), a simple yet powerful framework for training continuous normalizing flows on manifolds. Existing methods for generative modeling on manifolds either require expensive simulation, are inherently unable to scale to high dimensions, or use approximations for limiting quantities that result in biased training objectives. Riemannian Flow Matching bypasses thes…

    Submitted 26 February, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

    Journal ref: ICLR 2024

  47. arXiv:2302.03157  [pdf, other]

    stat.ME math.OC stat.ML

    A distribution-free mixed-integer optimization approach to hierarchical modelling of clustered and longitudinal data

    Authors: Madhav Sankaranarayanan, Intekhab Hossain, Tom Chen

    Abstract: Recent advancements in Mixed Integer Optimization (MIO) algorithms, paired with hardware enhancements, have led to significant speedups in resolving MIO problems. These strategies have been utilized for optimal subset selection, specifically for choosing $k$ features out of $p$ in linear regression given $n$ observations. In this paper, we broaden this method to facilitate cluster-aware regression…

    Submitted 25 March, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

  48. arXiv:2212.14384  [pdf]

    astro-ph.IM astro-ph.SR physics.data-an physics.space-ph stat.CO

    Towards data-driven modeling and real-time prediction of solar flares and coronal mass ejections

    Authors: M. Rempel, Y. Fan, M. Dikpati, A. Malanushenko, M. D. Kazachenko, M. C. M. Cheung, G. Chintzoglou, X. Sun, G. H. Fisher, T. Y. Chen

    Abstract: Modeling of transient events in the solar atmosphere requires the confluence of 3 critical elements: (1) model sophistication, (2) data availability, and (3) data assimilation. This white paper describes required advances that will enable statistical flare and CME forecasting (e.g. eruption probability and timing, estimation of strength, and CME details, such as speed and magnetic field orientatio…

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: Heliophysics 2050 White Paper

  49. arXiv:2212.13659  [pdf, other]

    cs.LG stat.ML

    Latent Discretization for Continuous-time Sequence Compression

    Authors: Ricky T. Q. Chen, Matthew Le, Matthew Muckley, Maximilian Nickel, Karen Ullrich

    Abstract: Neural compression offers a domain-agnostic approach to creating codecs for lossy or lossless compression via deep generative models. For sequence compression, however, most deep sequence models have costs that scale with the sequence length rather than the sequence complexity. In this work, we instead treat data sequences as observations from an underlying continuous-time process and learn how to…

    Submitted 27 December, 2022; originally announced December 2022.

  50. arXiv:2211.10926  [pdf, other]

    stat.AP physics.soc-ph q-bio.PE

    Unraveling implicit human behavioral effects on dynamic characteristics of Covid-19 daily infection rates in Taiwan

    Authors: Ting-Li Chen, Elizabeth P. Chou, Min-Yi Chen, Hsieh Fushing

    Abstract: We study Covid-19 spreading dynamics underlying 84 curves of daily Covid-19 infection rates pertaining to 84 districts belonging to the largest seven cities in Taiwan during her pristine surge period. Our computational developments begin with selecting and extracting 18 features from each smoothed district-specific curve. This step of computing effort allows unstructured data to be converted into…

    Submitted 20 November, 2022; originally announced November 2022.