Skip to main content

Showing 1–13 of 13 results for author: Sui, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.00317  [pdf, other

    stat.ML cs.LG stat.ME

    Combining Experimental and Historical Data for Policy Evaluation

    Authors: Ting Li, Chengchun Shi, Qianglin Wen, Yang Sui, Yongli Qin, Chunbo Lai, Hongtu Zhu

    Abstract: This paper studies policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data integration methods that linearly integrate base policy value estimators constructed based on the experimental and historical data, with weights optimized to min… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  2. arXiv:2404.17489  [pdf, other

    cs.LG cs.AI stat.ML

    Tabular Data Contrastive Learning via Class-Conditioned and Feature-Correlation Based Augmentation

    Authors: Wei Cui, Rasa Hosseinzadeh, Junwei Ma, Tongzi Wu, Yi Sui, Keyvan Golestan

    Abstract: Contrastive learning is a model pre-training technique by first creating similar views of the original data, and then encouraging the data and its corresponding views to be close in the embedding space. Contrastive learning has witnessed success in image and natural language data, thanks to the domain-specific augmentation techniques that are both intuitive and effective. Nonetheless, in tabular d… ▽ More

    Submitted 30 April, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: 14 pages, 4 algorithms, 3 figures, 5 tables

  3. arXiv:2401.13744  [pdf, other

    cs.LG cs.HC stat.ML

    Conformal Prediction Sets Improve Human Decision Making

    Authors: Jesse C. Cresswell, Yi Sui, Bhargava Kumar, Noël Vouitsis

    Abstract: In response to everyday queries, humans explicitly signal uncertainty and offer alternative answers when they are unsure. Machine learning models that output calibrated prediction sets through conformal prediction mimic this human behaviour; larger sets signal greater uncertainty while providing alternatives. In this work, we study the usefulness of conformal prediction sets as an aid for human de… ▽ More

    Submitted 9 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Published at ICML 2024. Code available at https://github.com/layer6ai-labs/hitl-conformal-prediction

  4. arXiv:2401.02650  [pdf, other

    cs.LG stat.ML

    Improving sample efficiency of high dimensional Bayesian optimization with MCMC

    Authors: Zeji Yi, Yunyue Wei, Chu Xin Cheng, Kaibo He, Yanan Sui

    Abstract: Sequential optimization methods are often confronted with the curse of dimensionality in high-dimensional spaces. Current approaches under the Gaussian process framework are still burdened by the computational complexity of tracking Gaussian process posteriors and need to partition the optimization problem into small regions to ensure exploration or assume an underlying low-dimensional structure.… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  5. arXiv:2306.04675  [pdf, other

    cs.LG cs.CV stat.ML

    Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

    Authors: George Stein, Jesse C. Cresswell, Rasa Hosseinzadeh, Yi Sui, Brendan Leigh Ross, Valentin Villecroze, Zhaoyan Liu, Anthony L. Caterini, J. Eric T. Taylor, Gabriel Loaiza-Ganem

    Abstract: We systematically study a wide variety of generative models spanning semantically-diverse image datasets to understand and improve the feature extractors and metrics used to evaluate them. Using best practices in psychophysics, we measure human perception of image realism for generated samples by conducting the largest experiment evaluating generative models to date, and find that no existing metr… ▽ More

    Submitted 30 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. 53 pages, 29 figures, 12 tables. Code at https://github.com/layer6ai-labs/dgm-eval, reviews at https://openreview.net/forum?id=08zf7kTOoh

    Journal ref: Thirty-seventh Conference on Neural Information Processing Systems (2023)

  6. arXiv:2206.12280  [pdf, other

    stat.ME

    Bayesian Circular Lattice Filters for Computationally Efficient Estimation of Multivariate Time-Varying Autoregressive Models

    Authors: Yuelei Sui, Scott H. Holan, Wen-Hsi Yang

    Abstract: Nonstationary time series data exist in various scientific disciplines, including environmental science, biology, signal processing, econometrics, among others. Many Bayesian models have been developed to handle nonstationary time series. The time-varying vector autoregressive (TV-VAR) model is a well-established model for multivariate nonstationary time series. Nevertheless, in most cases, the la… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

  7. arXiv:2102.12769  [pdf, other

    cs.LG stat.ML

    No-Regret Reinforcement Learning with Heavy-Tailed Rewards

    Authors: Vincent Zhuang, Yanan Sui

    Abstract: Reinforcement learning algorithms typically assume rewards to be sampled from light-tailed distributions, such as Gaussian or bounded. However, a wide variety of real-world systems generate rewards that follow heavy-tailed distributions. We consider such scenarios in the setting of undiscounted reinforcement learning. By constructing a lower bound, we show that the difficulty of learning heavy-tai… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: AISTATS 21

  8. arXiv:2102.06790  [pdf, other

    cs.LG cs.AI stat.ML

    A Unified Lottery Ticket Hypothesis for Graph Neural Networks

    Authors: Tianlong Chen, Yongduo Sui, Xuxi Chen, Aston Zhang, Zhangyang Wang

    Abstract: With graphs rapidly growing in size and deeper graph neural networks (GNNs) emerging, the training and inference of GNNs become increasingly expensive. Existing network weight pruning algorithms cannot address the main space and computational bottleneck in GNNs, caused by the size and connectivity of the graph. To this end, this paper first presents a unified GNN sparsification (UGS) framework tha… ▽ More

    Submitted 7 June, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

  9. arXiv:2010.09808  [pdf, other

    cs.LG cs.AI stat.ML

    Imitation with Neural Density Models

    Authors: Kuno Kim, Akshat **dal, Yang Song, Jiaming Song, Yanan Sui, Stefano Ermon

    Abstract: We propose a new framework for Imitation Learning (IL) via density estimation of the expert's occupancy measure followed by Maximum Occupancy Entropy Reinforcement Learning (RL) using the density as a reward. Our approach maximizes a non-adversarial model-free RL objective that provably lower bounds reverse Kullback-Leibler divergence between occupancy measures of the expert and imitator. We prese… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

  10. arXiv:2003.13413  [pdf, other

    cs.LG stat.ML

    Secure Metric Learning via Differential Pairwise Privacy

    Authors: **g Li, Yuangang Pan, Yulei Sui, Ivor W. Tsang

    Abstract: Distance Metric Learning (DML) has drawn much attention over the last two decades. A number of previous works have shown that it performs well in measuring the similarities of individuals given a set of correctly labeled pairwise data by domain experts. These important and precisely-labeled pairwise data are often highly sensitive in real world (e.g., patients similarity). This paper studies, for… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

  11. arXiv:1908.01289  [pdf, other

    cs.LG cs.AI stat.ML

    Dueling Posterior Sampling for Preference-Based Reinforcement Learning

    Authors: Ellen R. Novoseller, Yibing Wei, Yanan Sui, Yisong Yue, Joel W. Burdick

    Abstract: In preference-based reinforcement learning (RL), an agent interacts with the environment while receiving preferences instead of absolute feedback. While there is increasing research activity in preference-based RL, the design of formal frameworks that admit tractable theoretical analysis remains an open challenge. Building upon ideas from preference-based bandit learning and posterior sampling in… ▽ More

    Submitted 29 June, 2020; v1 submitted 4 August, 2019; originally announced August 2019.

    Comments: To appear in Conference on Uncertainty in Artificial Intelligence (UAI), 2020. 9 pages before references and appendix; 51 pages total; 7 figures; 4 tables. This replacement incorporates reviewer comments, and in comparison to version 1, extends the theoretical and empirical analyses and adds mathematical detail. Code: https://github.com/ernovoseller/DuelingPosteriorSampling

  12. arXiv:1806.07555  [pdf, other

    cs.LG stat.ML

    Stagewise Safe Bayesian Optimization with Gaussian Processes

    Authors: Yanan Sui, Vincent Zhuang, Joel W. Burdick, Yisong Yue

    Abstract: Enforcing safety is a key aspect of many problems pertaining to sequential decision making under uncertainty, which require the decisions made at every step to be both informative of the optimal decision and also safe. For example, we value both efficacy and comfort in medical therapy, and efficiency and safety in robotic control. We consider this problem of optimizing an unknown utility function… ▽ More

    Submitted 26 January, 2020; v1 submitted 20 June, 2018; originally announced June 2018.

    Comments: International Conference on Machine Learning (ICML) 2018

  13. arXiv:1711.07894  [pdf, other

    stat.ML cs.AI q-bio.NC

    Quantifying Performance of Bipedal Standing with Multi-channel EMG

    Authors: Yanan Sui, Kun ho Kim, Joel W. Burdick

    Abstract: Spinal cord stimulation has enabled humans with motor complete spinal cord injury (SCI) to independently stand and recover some lost autonomic function. Quantifying the quality of bipedal standing under spinal stimulation is important for spinal rehabilitation therapies and for new strategies that seek to combine spinal stimulation and rehabilitative robots (such as exoskeletons) in real time feed… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

    Journal ref: IROS 2017