Skip to main content

Showing 1–19 of 19 results for author: Tsay, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13433  [pdf, other

    cs.LG cs.AI

    Certificates of Differential Privacy and Unlearning for Gradient-Based Training

    Authors: Matthew Wicker, Philip Sosnin, Adrianna Janik, Mark N. Müller, Adrian Weller, Calvin Tsay

    Abstract: Proper data stewardship requires that model owners protect the privacy of individuals' data used during training. Whether through anonymization with differential privacy or the use of unlearning in non-anonymized settings, the gold-standard techniques for providing privacy guarantees can come with significant performance penalties or be too weak to provide practical assurances. In part, this is du… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 15 pages, 14 figures

  2. arXiv:2406.05670  [pdf, other

    cs.LG cs.CR cs.CV

    Certified Robustness to Data Poisoning in Gradient-Based Training

    Authors: Philip Sosnin, Mark N. Müller, Maximilian Baader, Calvin Tsay, Matthew Wicker

    Abstract: Modern machine learning pipelines leverage large amounts of public data, making it infeasible to guarantee data quality and leaving models open to poisoning and backdoor attacks. However, provably bounding model behavior under such attacks remains an open problem. In this work, we address this challenge and develop the first framework providing provable guarantees on the behavior of models trained… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 15 pages, 5 figures

  3. arXiv:2406.02352  [pdf, other

    cs.LG

    System-Aware Neural ODE Processes for Few-Shot Bayesian Optimization

    Authors: Jixiang Qing, Becky D Langdon, Robert M Lee, Behrang Shafei, Mark van der Wilk, Calvin Tsay, Ruth Misener

    Abstract: We consider the problem of optimizing initial conditions and timing in dynamical systems governed by unknown ordinary differential equations (ODEs), where evaluating different initial conditions is costly and there are constraints on observation times. To identify the optimal conditions within several trials, we introduce a few-shot Bayesian Optimization (BO) framework based on the system's prior… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  4. arXiv:2402.08406  [pdf, other

    cs.LG

    Transition Constrained Bayesian Optimization via Markov Decision Processes

    Authors: Jose Pablo Folch, Calvin Tsay, Robert M Lee, Behrang Shafei, Weronika Ormaniec, Andreas Krause, Mark van der Wilk, Ruth Misener, Mojmír Mutný

    Abstract: Bayesian optimization is a methodology to optimize black-box functions. Traditionally, it focuses on the setting where you can arbitrarily query the search space. However, many real-life problems do not offer this flexibility; in particular, the search space of the next query may depend on previous ones. Example challenges arise in the physical sciences in the form of local movement constraints, r… ▽ More

    Submitted 29 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 10 pages main, 32 pages total, 16 figures, 2 tables, preprint

  5. arXiv:2401.16373  [pdf, other

    cs.LG math.OC

    Bayesian optimization as a flexible and efficient design framework for sustainable process systems

    Authors: Joel A. Paulson, Calvin Tsay

    Abstract: Bayesian optimization (BO) is a powerful technology for optimizing noisy expensive-to-evaluate black-box functions, with a broad range of real-world applications in science, engineering, economics, manufacturing, and beyond. In this paper, we provide an overview of recent developments, challenges, and opportunities in BO for design of next-generation process systems. After describing several motiv… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 16 pages, 1 figure, 1 table

  6. arXiv:2312.01228  [pdf, other

    math.OC cs.NE

    Mixed-Integer Optimisation of Graph Neural Networks for Computer-Aided Molecular Design

    Authors: Tom McDonald, Calvin Tsay, Artur M. Schweidtmann, Neil Yorke-Smith

    Abstract: ReLU neural networks have been modelled as constraints in mixed integer linear programming (MILP), enabling surrogate-based optimisation in various domains and efficient solution of machine learning certification problems. However, previous works are mostly limited to MLPs. Graph neural networks (GNNs) can learn from non-euclidean data structures such as molecular structures efficiently and are th… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    MSC Class: 90C11 ACM Class: G.1.6; I.2.6; J.2

  7. arXiv:2312.00622  [pdf, other

    cs.LG math.OC stat.ME

    Practical Path-based Bayesian Optimization

    Authors: Jose Pablo Folch, James Odgers, Shiqiang Zhang, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener

    Abstract: There has been a surge in interest in data-driven experimental design with applications to chemical engineering and drug manufacturing. Bayesian optimization (BO) has proven to be adaptable to such cases, since we can model the reactions of interest as expensive black-box functions. Sometimes, the cost of this black-box functions can be separated into two parts: (a) the cost of the experiment itse… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 main pages, 12 with references and appendix. 4 figures, 2 tables. To appear in NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World

    Journal ref: NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World

  8. arXiv:2305.00241  [pdf, other

    math.OC cs.LG

    When Deep Learning Meets Polyhedral Theory: A Survey

    Authors: Joey Huchette, Gonzalo Muñoz, Thiago Serra, Calvin Tsay

    Abstract: In the past decade, deep learning became the prevalent methodology for predictive modeling thanks to the remarkable accuracy of deep neural networks in tasks such as computer vision and natural language processing. Meanwhile, the structure of neural networks converged back to simpler representations based on piecewise constant and piecewise linear functions such as the Rectified Linear Unit (ReLU)… ▽ More

    Submitted 31 August, 2023; v1 submitted 29 April, 2023; originally announced May 2023.

  9. arXiv:2302.10344  [pdf, other

    math.OC cs.LG

    Model-based feature selection for neural networks: A mixed-integer programming approach

    Authors: Shudian Zhao, Calvin Tsay, Jan Kronqvist

    Abstract: In this work, we develop a novel input feature selection framework for ReLU-based deep neural networks (DNNs), which builds upon a mixed-integer optimization approach. While the method is generally applicable to various classification tasks, we focus on finding input features for image classification for clarity of presentation. The idea is to use a trained DNN, or an ensemble of trained DNNs, to… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 15 pages, 3 figures, 5 tables

  10. arXiv:2302.01727  [pdf, other

    cs.LG math.OC

    Distributional constrained reinforcement learning for supply chain optimization

    Authors: Jaime Sabal Bermúdez, Antonio del Rio Chanona, Calvin Tsay

    Abstract: This work studies reinforcement learning (RL) in the context of multi-period supply chains subject to constraints, e.g., on production and inventory. We introduce Distributional Constrained Policy Optimization (DCPO), a novel approach for reliable constraint satisfaction in RL. Our approach is based on Constrained Policy Optimization (CPO), which is subject to approximation errors that in practice… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: 6 pages, 4 figures

  11. arXiv:2211.06149  [pdf, other

    cs.LG cs.CE stat.ML

    Combining Multi-Fidelity Modelling and Asynchronous Batch Bayesian Optimization

    Authors: Jose Pablo Folch, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener

    Abstract: Bayesian Optimization is a useful tool for experiment design. Unfortunately, the classical, sequential setting of Bayesian Optimization does not translate well into laboratory experiments, for instance battery design, where measurements may come from different sources and their evaluations may require significant waiting times. Multi-fidelity Bayesian Optimization addresses the setting with measur… ▽ More

    Submitted 23 February, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 19 pages in main paper / 28 with references and appendix, 7 figures, 2 tables, accepted into Computers and Chemical Engineering

  12. arXiv:2207.00879  [pdf, other

    stat.ML cs.AI cs.LG math.OC

    Tree ensemble kernels for Bayesian optimization with known constraints over mixed-feature spaces

    Authors: Alexander Thebelt, Calvin Tsay, Robert M. Lee, Nathan Sudermann-Merx, David Walz, Behrang Shafei, Ruth Misener

    Abstract: Tree ensembles can be well-suited for black-box optimization tasks such as algorithm tuning and neural architecture search, as they achieve good predictive performance with little or no manual tuning, naturally handle discrete feature spaces, and are relatively insensitive to outliers in the training data. Two well-known challenges in using tree ensembles for black-box optimization are (i) effecti… ▽ More

    Submitted 30 December, 2022; v1 submitted 2 July, 2022; originally announced July 2022.

    Comments: 27 pages, 9 figures, 4 tables

  13. arXiv:2202.05198  [pdf, other

    math.OC cs.LG

    P-split formulations: A class of intermediate formulations between big-M and convex hull for disjunctive constraints

    Authors: Jan Kronqvist, Ruth Misener, Calvin Tsay

    Abstract: We develop a class of mixed-integer formulations for disjunctive constraints intermediate to the big-M and convex hull formulations in terms of relaxation strength. The main idea is to capture the best of both the big-M and convex hull formulations: a computationally light formulation with a tight relaxation. The "P-split" formulations are based on a lifted transformation that splits convex additi… ▽ More

    Submitted 27 May, 2024; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: 29 pages, 6 figures

  14. arXiv:2202.02414  [pdf, other

    stat.ML cs.AI cs.LG math.OC

    OMLT: Optimization & Machine Learning Toolkit

    Authors: Francesco Ceccon, Jordan Jalving, Joshua Haddad, Alexander Thebelt, Calvin Tsay, Carl D. Laird, Ruth Misener

    Abstract: The optimization and machine learning toolkit (OMLT) is an open-source software package incorporating neural network and gradient-boosted tree surrogate models, which have been trained using machine learning, into larger optimization problems. We discuss the advances in optimization technology that made OMLT possible and show how OMLT seamlessly integrates with the algebraic modeling language Pyom… ▽ More

    Submitted 15 November, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 8 pages, 1 figure

  15. arXiv:2202.00060  [pdf, other

    cs.LG math.OC

    SnAKe: Bayesian Optimization with Pathwise Exploration

    Authors: Jose Pablo Folch, Shiqiang Zhang, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener

    Abstract: Bayesian Optimization is a very effective tool for optimizing expensive black-box functions. Inspired by applications develo** and characterizing reaction chemistry using droplet microfluidic reactors, we consider a novel setting where the expense of evaluating the function can increase significantly when making large input changes between iterations. We further assume we are working asynchronou… ▽ More

    Submitted 11 January, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: 10 main pages, 39 with appendix, 30 figures, 10 tables. Final camera-ready version for NeurIPS, with supplementary material included

  16. arXiv:2201.10035  [pdf, other

    stat.ML cs.AI cs.LG math.OC

    Maximizing information from chemical engineering data sets: Applications to machine learning

    Authors: Alexander Thebelt, Johannes Wiebe, Jan Kronqvist, Calvin Tsay, Ruth Misener

    Abstract: It is well-documented how artificial intelligence can have (and already is having) a big impact on chemical engineering. But classical machine learning approaches may be weak for many chemical engineering applications. This review discusses how challenging data characteristics arise in chemical engineering applications. We identify four characteristics of data arising in chemical engineering appli… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 34 pages, 3 figures, 1 table

  17. arXiv:2111.03140  [pdf, other

    stat.ML cs.AI cs.LG math.OC

    Multi-Objective Constrained Optimization for Energy Applications via Tree Ensembles

    Authors: Alexander Thebelt, Calvin Tsay, Robert M. Lee, Nathan Sudermann-Merx, David Walz, Tom Tranter, Ruth Misener

    Abstract: Energy systems optimization problems are complex due to strongly non-linear system behavior and multiple competing objectives, e.g. economic gain vs. environmental impact. Moreover, a large number of input variables and different variable types, e.g. continuous and categorical, are challenges commonly present in real-world applications. In some cases, proposed optimal solutions need to obey explic… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: 36 pages, 8 figures, 5 tables

  18. arXiv:2102.04373  [pdf, other

    math.OC cs.LG stat.ML

    Partition-based formulations for mixed-integer optimization of trained ReLU neural networks

    Authors: Calvin Tsay, Jan Kronqvist, Alexander Thebelt, Ruth Misener

    Abstract: This paper introduces a class of mixed-integer formulations for trained ReLU neural networks. The approach balances model size and tightness by partitioning node inputs into a number of groups and forming the convex hull over the partitions via disjunctive programming. At one extreme, one partition per input recovers the convex hull of a node, i.e., the tightest possible formulation for each node.… ▽ More

    Submitted 20 October, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: Conference on Neural Information Processing Systems (NeurIPS) 2021

  19. arXiv:2101.12708  [pdf, other

    math.OC cs.LG

    Between steps: Intermediate relaxations between big-M and convex hull formulations

    Authors: Jan Kronqvist, Ruth Misener, Calvin Tsay

    Abstract: This work develops a class of relaxations in between the big-M and convex hull formulations of disjunctions, drawing advantages from both. The proposed "P-split" formulations split convex additively separable constraints into P partitions and form the convex hull of the partitioned disjuncts. Parameter P represents the trade-off of model size vs. relaxation strength. We examine the novel formulati… ▽ More

    Submitted 29 January, 2021; originally announced January 2021.

    Comments: 16 pages