-
On Profinite Quandles
Authors:
On Profinite Quandles Alexander W. Byard,
Brian Cai,
Nathan P. Jones,
Lucy H. Vuong,
David N. Yetter
Abstract:
We undertake the study of profinite quandles. We provide several constructions of profinite quandles from profinite groups, and from other profinite quandle. We characterize which subquandles of profinite quandles are again profinite. Finally, we provide a characterization of algebraically connected profinite quandles in terms of the profinite completion of their inner automorphism groups…
▽ More
We undertake the study of profinite quandles. We provide several constructions of profinite quandles from profinite groups, and from other profinite quandle. We characterize which subquandles of profinite quandles are again profinite. Finally, we provide a characterization of algebraically connected profinite quandles in terms of the profinite completion of their inner automorphism groups $\widehat{\Inn(Q)}$. It is anticipated that the results herein will find applications to the étale homotopy theory of number fields.
△ Less
Submitted 22 April, 2024;
originally announced June 2024.
-
Pivoting through the chiral-clock family
Authors:
Nick G. Jones,
Abhishodh Prakash,
Paul Fendley
Abstract:
The Onsager algebra, invented to solve the two-dimensional Ising model, can be used to construct conserved charges for a family of integrable $N$-state chiral clock models. We show how it naturally gives rise to a "pivot" procedure for this family of chiral Hamiltonians. These Hamiltonians have an anti-unitary CPT symmetry that when combined with the usual $\mathbb{Z}_N$ clock symmetry gives a non…
▽ More
The Onsager algebra, invented to solve the two-dimensional Ising model, can be used to construct conserved charges for a family of integrable $N$-state chiral clock models. We show how it naturally gives rise to a "pivot" procedure for this family of chiral Hamiltonians. These Hamiltonians have an anti-unitary CPT symmetry that when combined with the usual $\mathbb{Z}_N$ clock symmetry gives a non-abelian dihedral symmetry group $D_{2N}$. We show that this symmetry gives rise to symmetry-protected topological (SPT) order in this family for all even $N$, and representation-SPT (RSPT) physics for all odd $N$. The simplest such example is a next-nearest-neighbour chain generalising the spin-1/2 cluster model, an SPT phase of matter. We derive a matrix-product state representation of its fixed-point ground state along with the ensuing entanglement spectrum and symmetry fractionalisation. We analyse a rich phase diagram combining this model with the Onsager-integrable chiral Potts chain, and find trivial, symmetry-breaking and (R)SPT orders, as well as extended gapless regions. For odd $N$, the phase transitions are "unnecessarily" critical from the SPT point of view.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals
Authors:
Ruiyi Wang,
Stephanie Milani,
Jamie C. Chiu,
Jiayin Zhi,
Shaun M. Eack,
Travis Labrum,
Samuel M. Murphy,
Nev Jones,
Kate Hardy,
Hong Shen,
Fei Fang,
Zhiyu Zoey Chen
Abstract:
Mental illness remains one of the most critical public health issues. Despite its importance, many mental health professionals highlight a disconnect between their training and actual real-world patient practice. To help bridge this gap, we propose PATIENT-Ψ, a novel patient simulation framework for cognitive behavior therapy (CBT) training. To build PATIENT-Ψ, we construct diverse patient cogniti…
▽ More
Mental illness remains one of the most critical public health issues. Despite its importance, many mental health professionals highlight a disconnect between their training and actual real-world patient practice. To help bridge this gap, we propose PATIENT-Ψ, a novel patient simulation framework for cognitive behavior therapy (CBT) training. To build PATIENT-Ψ, we construct diverse patient cognitive models based on CBT principles and use large language models (LLMs) programmed with these cognitive models to act as a simulated therapy patient. We propose an interactive training scheme, PATIENT-Ψ-TRAINER, for mental health trainees to practice a key skill in CBT -- formulating the cognitive model of the patient -- through role-playing a therapy session with PATIENT-Ψ. To evaluate PATIENT-Ψ, we conducted a comprehensive user study of 13 mental health trainees and 20 experts. The results demonstrate that practice using PATIENT-Ψ-TRAINER enhances the perceived skill acquisition and confidence of the trainees beyond existing forms of training such as textbooks, videos, and role-play with non-patients. Based on the experts' perceptions, PATIENT-Ψ is perceived to be closer to real patient interactions than GPT-4, and PATIENT-Ψ-TRAINER holds strong promise to improve trainee competencies. Our code and data are released at \url{https://github.com/ruiyiw/patient-psi}.
△ Less
Submitted 18 June, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
On identifying the non-linear dynamics of a hovercraft using an end-to-end deep learning approach
Authors:
Roland Schwan,
Nicolaj Schmid,
Etienne Chassaing,
Karim Samaha,
Colin N. Jones
Abstract:
We present the identification of the non-linear dynamics of a novel hovercraft design, employing end-to-end deep learning techniques. Our experimental setup consists of a hovercraft propelled by racing drone propellers mounted on a lightweight foam base, allowing it to float and be controlled freely on an air hockey table. We learn parametrized physics-inspired non-linear models directly from data…
▽ More
We present the identification of the non-linear dynamics of a novel hovercraft design, employing end-to-end deep learning techniques. Our experimental setup consists of a hovercraft propelled by racing drone propellers mounted on a lightweight foam base, allowing it to float and be controlled freely on an air hockey table. We learn parametrized physics-inspired non-linear models directly from data trajectories, leveraging gradient-based optimization techniques prevalent in machine learning research. The chosen model structure allows us to control the position of the hovercraft precisely on the air hockey table. We then analyze the prediction performance and demonstrate the closed-loop control performance on the real system.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Classical origins of Landau-incompatible transitions
Authors:
Abhishodh Prakash,
Nick G. Jones
Abstract:
Continuous phase transitions where symmetry is spontaneously broken are ubiquitous in physics and often found between `Landau-compatible' phases where residual symmetries of one phase are a subset of the other. However, continuous `deconfined quantum critical' transitions between Landau-incompatible symmetry-breaking phases are known to exist in certain quantum systems, often with anomalous micros…
▽ More
Continuous phase transitions where symmetry is spontaneously broken are ubiquitous in physics and often found between `Landau-compatible' phases where residual symmetries of one phase are a subset of the other. However, continuous `deconfined quantum critical' transitions between Landau-incompatible symmetry-breaking phases are known to exist in certain quantum systems, often with anomalous microscopic symmetries. In this paper, we investigate the need for such special conditions. We show that Landau-incompatible transitions can be found in a family of well-known classical statistical mechanical models with anomaly-free on-site microscopic symmetries, introduced by José, Kadanoff, Kirkpatick and Nelson (Phys. Rev. B 16, 1217). The models are labeled by a positive integer $Q$ and constructed by a deformation of the 2d classical XY model, defined on any lattice, with an on-site potential that preserves a discrete $Q$-fold spin rotation and reflection symmetry. For a range of temperatures, even $Q$ models exhibit two Landau-incompatible partial symmetry-breaking phases and a direct transition between them for $Q \ge 4$. Characteristic features of Landau-incompatible transitions are easily seen, such as enhanced symmetries and melting of charged defects. For odd $Q$, and corresponding temperature ranges, two regions of a single partial symmetry-breaking phase are obtained, split by a stable `unnecessary critical' line. We present quantum models with anomaly-free symmetries that also exhibit similar phase diagrams.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
A Cyclic Spectroscopy Scintillation Study of PSR B1937+21 I. Demonstration of Improved Scintillometry
Authors:
Jacob E. Turner,
Timothy Dolch,
James M. Cordes,
Stella K. Ocker,
Daniel R. Stinebring,
Shami Chatterjee,
Maura A. McLaughlin,
Victoria E. Catlett,
Cody Jessup,
Nathaniel Jones,
Christopher Scheithauer
Abstract:
We use cyclic spectroscopy to perform high frequency-resolution analyses of multi-hour baseband Arecibo observations of the millisecond pulsar PSR B1937+21. This technique allows for the examination of scintillation features in far greater detail than is otherwise possible under most pulsar timing array observing setups. We measure scintillation bandwidths and timescales in each of eight subbands…
▽ More
We use cyclic spectroscopy to perform high frequency-resolution analyses of multi-hour baseband Arecibo observations of the millisecond pulsar PSR B1937+21. This technique allows for the examination of scintillation features in far greater detail than is otherwise possible under most pulsar timing array observing setups. We measure scintillation bandwidths and timescales in each of eight subbands across a 200 MHz observing band in each observation. Through these measurements we obtain intra-epoch estimates of the frequency scalings for scintillation bandwidth and timescale.Thanks to our high frequency resolution and the narrow scintles of this pulsar, we resolve scintillation arcs in the secondary spectra due to the increased Nyquist limit, which would not have been resolved at the same observing frequency with a traditional filterbank spectrum using NANOGrav's current time and frequency resolutions, and the frequency-dependent evolution of scintillation arc features within individual observations. We observe the dimming of prominent arc features at higher frequencies, possibly due to a combination of decreasing flux density and the frequency dependence of the plasma refractive index of the interstellar medium. We also find agreement with arc curvature frequency dependence predicted by Stinebring et al. (2001) in some epochs. Thanks to the frequency resolution improvement provided by cyclic spectroscopy, these results show strong promise for future such analyses with millisecond pulsars, particularly for pulsar timing arrays, where such techniques can allow for detailed studies of the interstellar medium in highly scattered pulsars without sacrificing the timing resolution that is crucial to their gravitational wave detection efforts.
△ Less
Submitted 21 June, 2024; v1 submitted 21 April, 2024;
originally announced April 2024.
-
Optimal Slicing and Scheduling with Service Guarantees in Multi-Hop Wireless Networks
Authors:
Nicholas Jones,
Eytan Modiano
Abstract:
We analyze the problem of scheduling in wireless networks to meet end-to-end service guarantees. Using network slicing to decouple the queueing dynamics between flows, we show that the network's ability to meet hard throughput and deadline requirements is largely influenced by the scheduling policy. We characterize the feasible throughput/deadline region for a flow under a fixed route and set of s…
▽ More
We analyze the problem of scheduling in wireless networks to meet end-to-end service guarantees. Using network slicing to decouple the queueing dynamics between flows, we show that the network's ability to meet hard throughput and deadline requirements is largely influenced by the scheduling policy. We characterize the feasible throughput/deadline region for a flow under a fixed route and set of slices, and find throughput- and deadline-optimal policies for a solitary flow. We formulate the feasibility problem for multiple flows in a general topology as a mixed-integer program, and show that it grows exponentially in the size of the network. Drawing on results from the solitary flow setting, we show that scheduling links in a regular fashion leads to smaller delay, and we derive tighter upper bounds on end-to-end delay for regular schedules. Finally, we design a polynomial-time algorithm that returns an (almost) regular schedule, optimized to meet service guarantees for all flows.
△ Less
Submitted 15 April, 2024; v1 submitted 12 April, 2024;
originally announced April 2024.
-
Using Flexibility Envelopes for the Demand-Side Hierarchical Optimization of District Heating Networks
Authors:
Audrey Blizard,
Colin N. Jones,
Stephanie Stockar
Abstract:
The demand-side control of district heating networks is notoriously challenging due to the large number of connected users and the high number of states to be considered. To overcome these challenges, this paper presents a hierarchical optimization scheme using the flexibility in heating demand provided by the users to improve the performance of the network. This hierarchical scheme relies on a lo…
▽ More
The demand-side control of district heating networks is notoriously challenging due to the large number of connected users and the high number of states to be considered. To overcome these challenges, this paper presents a hierarchical optimization scheme using the flexibility in heating demand provided by the users to improve the performance of the network. This hierarchical scheme relies on a low level controller to calculate the costs for a subsystem over a given set of potential pressure drops for that subsystem. The high level controller then uses these calculated costs to determine the optimal set of pressure drops for every subgraph of the partitioned network. The proposed hierarchical optimization scheme is demonstrated on a representative 20 user district heating network, resulting in a 67\% reduction in bypass mass flow while ensuring all network users stay within 2 \degree C of their desired nominal temperatures.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
RNA Dynamics from Experimental and Computational Approaches
Authors:
Giovanni Bussi,
Massimiliano Bonomi,
Paraskevi Gkeka,
Michael Sattler,
Hashim M. Al-Hashimi,
Pascal Auffinger,
Maria Duca,
Yann Foricher,
Danny Incarnato,
Alisha N. Jones,
Serdal Kirmizialtin,
Miroslav Krepl,
Modesto Orozco,
Giulia Palermo,
Samuela Pasquali,
Loïc Salmon,
Harald Schwalbe,
Eric Westhof,
Martin Zacharias
Abstract:
Ribonucleic acids (RNA) are unique in that they can store genetic information, replicate and perform catalysis. Importantly, RNA molecules are highly dynamic, and thus determining the ensemble of conformations that they populate is crucial not only to elucidate their biological functions, but also for their potential use as therapeutic targets. Computational and experimental techniques provide com…
▽ More
Ribonucleic acids (RNA) are unique in that they can store genetic information, replicate and perform catalysis. Importantly, RNA molecules are highly dynamic, and thus determining the ensemble of conformations that they populate is crucial not only to elucidate their biological functions, but also for their potential use as therapeutic targets. Computational and experimental techniques provide complementary views on RNA dynamics, and their integration is fundamental to improve the accuracy of computations and the resolution of experiments. Recent exciting developments in this field, were discussed at the CECAM workshop ``RNA dynamics from experimental and computational approaches'', in Paris, June 26-28, 2023. This report outlines key `take-home' messages that emerged during this workshop from the presentations and discussions.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
The Promises and Pitfalls of Using Language Models to Measure Instruction Quality in Education
Authors:
Paiheng Xu,
**g Liu,
Nathan Jones,
Julie Cohen,
Wei Ai
Abstract:
Assessing instruction quality is a fundamental component of any improvement efforts in the education system. However, traditional manual assessments are expensive, subjective, and heavily dependent on observers' expertise and idiosyncratic factors, preventing teachers from getting timely and frequent feedback. Different from prior research that mostly focuses on low-inference instructional practic…
▽ More
Assessing instruction quality is a fundamental component of any improvement efforts in the education system. However, traditional manual assessments are expensive, subjective, and heavily dependent on observers' expertise and idiosyncratic factors, preventing teachers from getting timely and frequent feedback. Different from prior research that mostly focuses on low-inference instructional practices on a singular basis, this paper presents the first study that leverages Natural Language Processing (NLP) techniques to assess multiple high-inference instructional practices in two distinct educational settings: in-person K-12 classrooms and simulated performance tasks for pre-service teachers. This is also the first study that applies NLP to measure a teaching practice that is widely acknowledged to be particularly effective for students with special needs. We confront two challenges inherent in NLP-based instructional analysis, including noisy and long input data and highly skewed distributions of human ratings. Our results suggest that pretrained Language Models (PLMs) demonstrate performances comparable to the agreement level of human raters for variables that are more discrete and require lower inference, but their efficacy diminishes with more complex teaching practices. Interestingly, using only teachers' utterances as input yields strong results for student-centered variables, alleviating common concerns over the difficulty of collecting and transcribing high-quality student speech data in in-person teaching settings. Our findings highlight both the potential and the limitations of current NLP techniques in the education domain, opening avenues for further exploration.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Revealing the EuCd_{2}As_{2} Semiconducting Band Gap via n-type La-Do**
Authors:
Ryan A. Nelson,
Jesaiah King,
Shuyu Cheng,
Archibald J. Williams,
Christopher Jozwiak,
Aaron Bostwick,
Eli Rotenberg,
Souvik Sasmal,
I-Hsuan Kao,
Aalok Tiwari,
Natalie R. Jones,
Chuting Cai,
Emma Martin,
Andrei Dolocan,
Li Shi,
Roland Kawakami,
Joseph P. Heremans,
Jyoti Katoch,
Joshua E. Goldberger
Abstract:
EuCd_{2}As_{2} has attracted considerable interest as one of the few magnetic Weyl semimetal candidate materials, although recently there have been emerging reports that claim it to have a semiconducting electronic structure. To resolve this debate, we established the growth of n-type EuCd_{2}As_{2} crystals, to directly visualize the nature of the conduction band using angle resolve photoemission…
▽ More
EuCd_{2}As_{2} has attracted considerable interest as one of the few magnetic Weyl semimetal candidate materials, although recently there have been emerging reports that claim it to have a semiconducting electronic structure. To resolve this debate, we established the growth of n-type EuCd_{2}As_{2} crystals, to directly visualize the nature of the conduction band using angle resolve photoemission spectroscopy (ARPES). We show that La-do** leads to n-type transport signatures in both the thermopower and Hall effect measurements, in crystals with do** levels at 2 - 6 x 10^{17} e^{-} cm^{-3}. Both p-type and n-type doped samples exhibit antiferromagnetic ordering at 9 K. ARPES experiments at 6 K clearly show the presence of the conduction band minimum at 0.8 eV above the valence band maximum, which is further corroborated by the observation of a 0.71 - 0.72 eV band gap in room temperature diffuse reflectance absorbance measurements. Together these findings unambiguously show that EuCd_{2}As_{2} is indeed a semiconductor with a substantial band gap and not a topological semimetal.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
A joint optimization approach of parameterized quantum circuits with a tensor network
Authors:
Clara Ferreira Cores,
Kaur Kristjuhan,
Mark Nicholas Jones
Abstract:
Despite the advantage quantum computers are expected to deliver when performing simulations compared to their classical counterparts, the current noisy intermediate-scale quantum (NISQ) devices remain limited in their capabilities. The training of parameterized quantum circuits (PQCs) remains a significant practical challenge, exacerbated by the requirement of shallow circuit depth necessary for t…
▽ More
Despite the advantage quantum computers are expected to deliver when performing simulations compared to their classical counterparts, the current noisy intermediate-scale quantum (NISQ) devices remain limited in their capabilities. The training of parameterized quantum circuits (PQCs) remains a significant practical challenge, exacerbated by the requirement of shallow circuit depth necessary for their hardware implementation. Hybrid methods employing classical computers alongside quantum devices, such as the Variational Quantum Eigensolver (VQE), have proven useful for analyzing the capabilities of NISQ devices to solve relevant optimization problems. Still, in the simulation of complex structures involving the many-body problem in quantum mechanics, major issues remain about the representation of the system and obtaining results which clearly outperform classical computational devices. In this research contribution we propose the use of parameterized Tensor Networks (TNs) to attempt an improved performance of the VQE algorithm. A joint approach is presented where the Hamiltonian of a system is encapsulated into a Matrix Product Operator (MPO) within a parameterized unitary TN hereby splitting up the optimization task between the TN and the VQE. We show that the hybrid TN-VQE implementation improves the convergence of the algorithm in comparison to optimizing randomly-initialized quantum circuits via VQE.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
Principled Preferential Bayesian Optimization
Authors:
Wenjie Xu,
Wenbin Wang,
Yuning Jiang,
Bratislav Svetozarevic,
Colin N. Jones
Abstract:
We study the problem of preferential Bayesian optimization (BO), where we aim to optimize a black-box function with only preference feedback over a pair of candidate solutions. Inspired by the likelihood ratio idea, we construct a confidence set of the black-box function using only the preference feedback. An optimistic algorithm with an efficient computational method is then developed to solve th…
▽ More
We study the problem of preferential Bayesian optimization (BO), where we aim to optimize a black-box function with only preference feedback over a pair of candidate solutions. Inspired by the likelihood ratio idea, we construct a confidence set of the black-box function using only the preference feedback. An optimistic algorithm with an efficient computational method is then developed to solve the problem, which enjoys an information-theoretic bound on the total cumulative regret, a first-of-its-kind for preferential BO. This bound further allows us to design a scheme to report an estimated best solution, with a guaranteed convergence rate. Experimental results on sampled instances from Gaussian processes, standard test functions, and a thermal comfort optimization problem all show that our method stably achieves better or competitive performance as compared to the existing state-of-the-art heuristics, which, however, do not have theoretical guarantees on regret bounds or convergence.
△ Less
Submitted 29 May, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.
-
Ensuring Data Privacy in AC Optimal Power Flow with a Distributed Co-Simulation Framework
Authors:
Xinliang Dai,
Alexander Kocher,
Jovana Kovačević,
Burak Dindar,
Yuning Jiang,
Colin N. Jones,
Hüseyin Çakmak,
Veit Hagenmeyer
Abstract:
During the energy transition, the significance of collaborative management among institutions is rising, confronting challenges posed by data privacy concerns. Prevailing research on distributed approaches, as an alternative to centralized management, often lacks numerical convergence guarantees or is limited to single-machine numerical simulation. To address this, we present a distributed approac…
▽ More
During the energy transition, the significance of collaborative management among institutions is rising, confronting challenges posed by data privacy concerns. Prevailing research on distributed approaches, as an alternative to centralized management, often lacks numerical convergence guarantees or is limited to single-machine numerical simulation. To address this, we present a distributed approach for solving AC Optimal Power Flow (OPF) problems within a geographically distributed environment. This involves integrating the energy system Co-Simulation (eCoSim) module in the eASiMOV framework with the convergence-guaranteed distributed optimization algorithm, i.e., the Augmented Lagrangian based Alternating Direction Inexact Newton method (ALADIN). Comprehensive evaluations across multiple system scenarios reveal a marginal performance slowdown compared to the centralized approach and the distributed approach executed on single machines -- a justified trade-off for enhanced data privacy. This investigation serves as empirical validation of the successful execution of distributed AC OPF within a geographically distributed environment, highlighting potential directions for future research.
△ Less
Submitted 15 March, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Real-Time Coordination of Integrated Transmission and Distribution Systems: Flexibility Modeling and Distributed NMPC Scheduling
Authors:
Xinliang Dai,
Yi Guo,
Yuning Jiang,
Colin N. Jones,
Gabriela Hug,
Veit Hagenmeyer
Abstract:
This paper proposes a real-time distributed operational architecture to efficiently coordinate intergrated transmission and distribution systems (ITD). At the distribution system level, the distribution system operator (DSO) computes the aggregated flexibility of all controllable devices by power-energy envelopes and provides them to the transmission system operators. At the transmission system le…
▽ More
This paper proposes a real-time distributed operational architecture to efficiently coordinate intergrated transmission and distribution systems (ITD). At the distribution system level, the distribution system operator (DSO) computes the aggregated flexibility of all controllable devices by power-energy envelopes and provides them to the transmission system operators. At the transmission system level, a distributed nonlinear MPC approach is proposed to coordinate the economic dispatch of multiple TSOs, considering the aggregated flexibility of all distribution systems. The subproblems of the proposed approach are associated with different TSOs and individual time periods. In addition, the aggregated flexibility of controllable devices in distribution networks is encapsulated, re-calculated, and communicated through the power-energy envelopes, facilitating a reduction in computational complexity and eliminating redundant information exchanges between TSOs and DSOs, thereby enhancing privacy and security. The framework's effectiveness and applicability in real-world scenarios are validated through simulated operational scenarios on a summer day in Germany, highlighting its robustness in the face of significant prediction mismatches due to severe weather conditions.
△ Less
Submitted 20 February, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
SIMBa: System Identification Methods leveraging Backpropagation
Authors:
Loris Di Natale,
Muhammad Zakwan,
Philipp Heer,
Giancarlo Ferrari-Trecate,
Colin N. Jones
Abstract:
This manuscript details and extends the SIMBa toolbox (System Identification Methods leveraging Backpropagation) presented in previous work, which uses well-established Machine Learning tools for discrete-time linear multi-step-ahead state-space System Identification (SI). SIMBa leverages linear-matrix-inequality-based free parametrizations of Schur matrices to guarantee the stability of the ident…
▽ More
This manuscript details and extends the SIMBa toolbox (System Identification Methods leveraging Backpropagation) presented in previous work, which uses well-established Machine Learning tools for discrete-time linear multi-step-ahead state-space System Identification (SI). SIMBa leverages linear-matrix-inequality-based free parametrizations of Schur matrices to guarantee the stability of the identified model by design. In this paper, backed up by novel free parametrizations of Schur matrices, we extend the toolbox to show how SIMBa can incorporate known sparsity patterns or true values of the state-space matrices to identify without jeopardizing stability.
We extensively investigate SIMBa's behavior when identifying diverse systems with various properties from both simulated and real-world data. Overall, we find it consistently outperforms traditional stable subspace identification methods, and sometimes significantly, especially when enforcing desired model properties. These results hint at the potential of SIMBa to pave the way for generic structured nonlinear SI. The toolbox is open-sourced on https://github.com/Cemempamoi/simba.
△ Less
Submitted 17 May, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
Risk-aware Scheduling and Dispatch of Flexibility Events in Buildings
Authors:
Paul Scharnhorst,
Baptiste Schubnel,
Rafael E. Carrillo,
Pierre-Jean Alet,
Colin N. Jones
Abstract:
Residential and commercial buildings, equipped with systems such as heat pumps, hot water tanks, or stationary energy storage, have a large potential to offer their consumption flexibility as grid services. In this work, we leverage this flexibility to react to consumption requests related to maximizing self-consumption and reducing peak loads. We present a general characterization of consumption…
▽ More
Residential and commercial buildings, equipped with systems such as heat pumps, hot water tanks, or stationary energy storage, have a large potential to offer their consumption flexibility as grid services. In this work, we leverage this flexibility to react to consumption requests related to maximizing self-consumption and reducing peak loads. We present a general characterization of consumption flexibility in the form of flexibility envelopes and discuss a data-driven battery model formulation for modeling individual buildings. These models are used to predict the available consumption flexibility while incorporating a description of uncertainty and being risk-aware with a pre-defined risk level. A Mixed-integer Linear Program (MILP) is formulated to schedule the activation of the buildings in order to best respond to an external consumption request. An aggregated consumption request is dispatched to the active individual buildings by an algorithm, based on the previously determined schedule. The effectiveness of the approach is demonstrated by coordinating up to 500 simulated buildings using the Energym Python library and observing about 1.5 times peak power reduction in comparison with a baseline approach while maintaining comfort more robustly. We demonstrate the scalability of the approach, with solving times being approximately linear in the number of considered assets in the scheduling problem.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
Stable Linear Subspace Identification: A Machine Learning Approach
Authors:
Loris Di Natale,
Muhammad Zakwan,
Bratislav Svetozarevic,
Philipp Heer,
Giancarlo Ferrari-Trecate,
Colin N. Jones
Abstract:
Machine Learning (ML) and linear System Identification (SI) have been historically developed independently. In this paper, we leverage well-established ML tools - especially the automatic differentiation framework - to introduce SIMBa, a family of discrete linear multi-step-ahead state-space SI methods using backpropagation. SIMBa relies on a novel Linear-Matrix-Inequality-based free parametrizati…
▽ More
Machine Learning (ML) and linear System Identification (SI) have been historically developed independently. In this paper, we leverage well-established ML tools - especially the automatic differentiation framework - to introduce SIMBa, a family of discrete linear multi-step-ahead state-space SI methods using backpropagation. SIMBa relies on a novel Linear-Matrix-Inequality-based free parametrization of Schur matrices to ensure the stability of the identified model.
We show how SIMBa generally outperforms traditional linear state-space SI methods, and sometimes significantly, although at the price of a higher computational burden. This performance gap is particularly remarkable compared to other SI methods with stability guarantees, where the gain is frequently above 25% in our investigations, hinting at SIMBa's ability to simultaneously achieve state-of-the-art fitting performance and enforce stability. Interestingly, these observations hold for a wide variety of input-output systems and on both simulated and real-world data, showcasing the flexibility of the proposed approach. We postulate that this new SI paradigm presents a great extension potential to identify structured nonlinear models from data, and we hence open-source SIMBa on https://github.com/Cemempamoi/simba.
△ Less
Submitted 26 March, 2024; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Multi-Agent Bayesian Optimization with Coupled Black-Box and Affine Constraints
Authors:
Wenjie Xu,
Yuning Jiang,
Bratislav Svetozarevic,
Colin N. Jones
Abstract:
This paper studies the problem of distributed multi-agent Bayesian optimization with both coupled black-box constraints and known affine constraints. A primal-dual distributed algorithm is proposed that achieves similar regret/violation bounds as those in the single-agent case for the black-box objective and constraint functions. Additionally, the algorithm guarantees an $\mathcal{O}(N\sqrt{T})$ b…
▽ More
This paper studies the problem of distributed multi-agent Bayesian optimization with both coupled black-box constraints and known affine constraints. A primal-dual distributed algorithm is proposed that achieves similar regret/violation bounds as those in the single-agent case for the black-box objective and constraint functions. Additionally, the algorithm guarantees an $\mathcal{O}(N\sqrt{T})$ bound on the cumulative violation for the known affine constraints, where $N$ is the number of agents. Hence, it is ensured that the average of the samples satisfies the affine constraints up to the error $\mathcal{O}({N}/{\sqrt{T}})$. Furthermore, we characterize certain conditions under which our algorithm can bound a stronger metric of cumulative violation and provide best-iterate convergence without affine constraint. The method is then applied to both sampled instances from Gaussian processes and a real-world optimal power allocation problem for wireless communication; the results show that our method simultaneously provides close-to-optimal performance and maintains minor violations on average, corroborating our theoretical analysis.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Data-driven adaptive building thermal controller tuning with constraints: A primal-dual contextual Bayesian optimization approach
Authors:
Wenjie Xu,
Bratislav Svetozarevic,
Loris Di Natale,
Philipp Heer,
Colin N Jones
Abstract:
We study the problem of tuning the parameters of a room temperature controller to minimize its energy consumption, subject to the constraint that the daily cumulative thermal discomfort of the occupants is below a given threshold. We formulate it as an online constrained black-box optimization problem where, on each day, we observe some relevant environmental context and adaptively select the cont…
▽ More
We study the problem of tuning the parameters of a room temperature controller to minimize its energy consumption, subject to the constraint that the daily cumulative thermal discomfort of the occupants is below a given threshold. We formulate it as an online constrained black-box optimization problem where, on each day, we observe some relevant environmental context and adaptively select the controller parameters. In this paper, we propose to use a data-driven Primal-Dual Contextual Bayesian Optimization (PDCBO) approach to solve this problem. In a simulation case study on a single room, we apply our algorithm to tune the parameters of a Proportional Integral (PI) heating controller and the pre-heating time. Our results show that PDCBO can save up to 4.7% energy consumption compared to other state-of-the-art Bayesian optimization-based methods while kee** the daily thermal discomfort below the given tolerable threshold on average. Additionally, PDCBO can automatically track time-varying tolerable thresholds while existing methods fail to do so. We then study an alternative constrained tuning problem where we aim to minimize the thermal discomfort with a given energy budget. With this formulation, PDCBO reduces the average discomfort by up to 63% compared to state-of-the-art safe optimization methods while kee** the average daily energy consumption below the required threshold.
△ Less
Submitted 1 October, 2023;
originally announced October 2023.
-
Efficient Recursive Data-enabled Predictive Control (Extended Version)
Authors:
Jicheng Shi,
Yingzhao Lian,
Colin N. Jones
Abstract:
In the field of model predictive control, Data-enabled Predictive Control (DeePC) offers direct predictive control, bypassing traditional modeling. However, challenges emerge with increased computational demand due to recursive data updates. This paper introduces a novel recursive updating algorithm for DeePC. It emphasizes the use of Singular Value Decomposition (SVD) for efficient low-dimensiona…
▽ More
In the field of model predictive control, Data-enabled Predictive Control (DeePC) offers direct predictive control, bypassing traditional modeling. However, challenges emerge with increased computational demand due to recursive data updates. This paper introduces a novel recursive updating algorithm for DeePC. It emphasizes the use of Singular Value Decomposition (SVD) for efficient low-dimensional transformations of DeePC in its general form, as well as a fast SVD update scheme. Importantly, our proposed algorithm is highly flexible due to its reliance on the general form of DeePC, which is demonstrated to encompass various data-driven methods that utilize Pseudoinverse and Hankel matrices. This is exemplified through a comparison to Subspace Predictive Control, where the algorithm achieves asymptotically consistent prediction for stochastic linear time-invariant systems. Our proposed methodologies' efficacy is validated through simulation studies.
△ Less
Submitted 24 March, 2024; v1 submitted 24 September, 2023;
originally announced September 2023.
-
A Generalized Stop** Criterion for Real-Time MPC with Guaranteed Stability
Authors:
Kristína Fedorová,
Yuning Jiang,
Juraj Oravec,
Colin N. Jones,
Michal Kvasnica
Abstract:
Most of the real-time implementations of the stabilizing optimal control actions suffer from the necessity to provide high computational effort. This paper presents a cutting-edge approach for real-time evaluation of linear-quadratic model predictive control (MPC) that employs a novel generalized stop** criterion, achieving asymptotic stability in the presence of input constraints. The proposed…
▽ More
Most of the real-time implementations of the stabilizing optimal control actions suffer from the necessity to provide high computational effort. This paper presents a cutting-edge approach for real-time evaluation of linear-quadratic model predictive control (MPC) that employs a novel generalized stop** criterion, achieving asymptotic stability in the presence of input constraints. The proposed method evaluates a fixed number of iterations independent of the initial condition, eliminating the necessity for computationally expensive methods. We demonstrate the effectiveness of the introduced technique by its implementation of two widely-used first-order optimization methods: the projected gradient descent method (PGDM) and the alternating directions method of multipliers (ADMM). The numerical simulation confirmed a significantly reduced number of iterations, resulting in suboptimality rates of less than 2\,\%, while the effort reductions exceeded 80\,\%. These results nominate the proposed criterion for an efficient real-time implementation method of MPC controllers.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Data-Driven Robust Control Using Prediction Error Bounds Based on Perturbation Analysis
Authors:
Baiwei Guo,
Yuning Jiang,
Colin N. Jones,
Giancarlo Ferrari-Trecate
Abstract:
For linear systems, many data-driven control methods rely on the behavioral framework, using historical data of the system to predict the future trajectories. However, measurement noise introduces errors in predictions. When the noise is bounded, we propose a method for designing historical experiments that enable the computation of an upper bound on the prediction error. This approach allows us t…
▽ More
For linear systems, many data-driven control methods rely on the behavioral framework, using historical data of the system to predict the future trajectories. However, measurement noise introduces errors in predictions. When the noise is bounded, we propose a method for designing historical experiments that enable the computation of an upper bound on the prediction error. This approach allows us to formulate a minimax control problem where robust constraint satisfaction is enforced. We derive an upper bound on the suboptimality gap of the resulting control input sequence compared to optimal control utilizing accurate measurements. As demonstrated in numerical experiments, the solution derived by our method can achieve constraint satisfaction and a small suboptimality gap despite the measurement noise.
△ Less
Submitted 27 August, 2023;
originally announced August 2023.
-
Advancing Distributed AC Optimal Power Flow for Integrated Transmission-Distribution Systems
Authors:
Xinliang Dai,
Junyi Zhai,
Yuning Jiang,
Yi Guo,
Colin N. Jones,
Veit Hagenmeyer
Abstract:
This paper introduces a distributed operational solution for coordinating integrated transmission-distribution (ITD) systems regarding data privacy. To tackle the nonconvex challenges of AC optimal power flow (OPF) problems, our research proposes an enhanced version of the Augmented Lagrangian based Alternating Direction Inexact Newton method (ALADIN). This proposed framework incorporates a second…
▽ More
This paper introduces a distributed operational solution for coordinating integrated transmission-distribution (ITD) systems regarding data privacy. To tackle the nonconvex challenges of AC optimal power flow (OPF) problems, our research proposes an enhanced version of the Augmented Lagrangian based Alternating Direction Inexact Newton method (ALADIN). This proposed framework incorporates a second-order correction strategy and convexification, thereby enhancing numerical robustness and computational efficiency. The theoretical studies demonstrate that the proposed distributed algorithm operates the ITD systems with a local quadratic convergence guarantee. Extensive simulations on various ITD configurations highlight the superior performance of our distributed approach in terms of convergence speed, computational efficiency, scalability, and adaptability.
△ Less
Submitted 30 January, 2024; v1 submitted 25 August, 2023;
originally announced August 2023.
-
The valence and Rydberg states of difluoromethane: A combined experimental vacuum ultraviolet spectrum absorption and theoretical study by ab initio configuration interaction and density functional computations
Authors:
Michael H. Palmer,
Søren Vrønning Hoffmann,
Nykola C. Jones,
Marcello Coreno,
Monica de Simone,
Cesare Grazioli
Abstract:
A new synchrotron study for CH$_2$F$_2$ from has been combined with earlier data. The onset of absorption, band I and also band IV, is resolved into broad vibrational peaks, which contrast with the continuous absorption previously claimed. A new theoretical analysis, using a combination of time dependent density functional theory (TDDFT) calculations and complete active space self-consistent field…
▽ More
A new synchrotron study for CH$_2$F$_2$ from has been combined with earlier data. The onset of absorption, band I and also band IV, is resolved into broad vibrational peaks, which contrast with the continuous absorption previously claimed. A new theoretical analysis, using a combination of time dependent density functional theory (TDDFT) calculations and complete active space self-consistent field, leads to a major new interpretation. Adiabatic excitation energies (AEEs) and vertical excitation energies, evaluated by these methods, are used to interpret the spectra in unprecedented detail using theoretical vibronic analysis. This includes both Franck-Condon (FC) and Herzberg-Teller (HT) effects on cold and hot bands. These results lead to the re-assignment of several known excited states and the identification of new ones. The lowest calculated AEE sequence for singlet states is 1$^1$B$_1$ $\sim$ 1$^1$A$_2$ < 2$^1$B$_1$ < 1$^1$A$_1$ < 2$^1$A$_1$ < 1$^1$B$_2$ < 3$^1$A$_1$ < 3$^1$B$_1$. These, together with calculated higher energy states, give a satisfactory account of the principal maxima observed in the VUV spectrum. Basis sets up to quadruple zeta valence with extensive polarization are used. The diffuse functions within this type of basis generate both valence and low-lying Rydberg excited states. The optimum position for the site of further diffuse functions in the calculations of Rydberg states is shown to lie on the H-atoms. The routine choice on the F-atoms is shown to be inadequate for both CHF$_3$ and CH$_2$F$_2$. The lowest excitation energy region has mixed valence and Rydberg character. TDDFT calculations show that the unusual structure of the onset arises from the near degeneracy of 1$^1$B$_1$ and 1$^1$A$_2$ valence states, which mix in symmetric and antisymmetric combinations.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Adaptive Data-Driven Prediction in a Building Control Hierarchy: A Case Study of Demand Response in Switzerland
Authors:
Jicheng Shi,
Yingzhao Lian,
Christophe Salzmann,
Colin N. Jones
Abstract:
By providing various services, such as Demand Response (DR), buildings can play a crucial role in the energy market due to their significant energy consumption. However, effectively commissioning buildings for such desired functionalities requires significant expert knowledge and design effort, considering the variations in building dynamics and intended use. In this study, we introduce an adaptiv…
▽ More
By providing various services, such as Demand Response (DR), buildings can play a crucial role in the energy market due to their significant energy consumption. However, effectively commissioning buildings for such desired functionalities requires significant expert knowledge and design effort, considering the variations in building dynamics and intended use. In this study, we introduce an adaptive data-driven prediction scheme based on Willems' Fundamental Lemma within the building control hierarchy. This scheme offers a versatile, flexible, and user-friendly interface for diverse prediction and control objectives. We provide an easy-to-use tuning process and an adaptive update pipeline for the scheme, both validated through extensive prediction tests. We evaluate the proposed scheme by coordinating a building and an energy storage system to provide Secondary Frequency Control (SFC) in a Swiss DR program. Specifically, we integrate the scheme into a three-layer hierarchical SFC control framework, and each layer of this hierarchy is designed to achieve distinct operational goals. Apart from its flexibility, our approach significantly improves cost efficiency, resulting in a 28.74% reduction in operating costs compared to a conventional control scheme, as demonstrated by a 52-day experiment in an actual building. Our findings emphasize the potential of the proposed scheme to reduce the commissioning costs of advanced building control strategies and to facilitate the adoption of new techniques in building control.
△ Less
Submitted 29 December, 2023; v1 submitted 17 July, 2023;
originally announced July 2023.
-
Hypergraph-Based Fast Distributed AC Power Flow Optimization
Authors:
Xinliang Dai,
Yingzhao Lian,
Yuning Jiang,
Colin N. Jones,
Veit Hagenmeyer
Abstract:
This paper presents a novel distributed approach for solving AC power flow (PF) problems. The optimization problem is reformulated into a distributed form using a communication structure corresponding to a hypergraph, by which complex relationships between subgrids can be expressed as hyperedges. Then, a hypergraph-based distributed sequential quadratic programming (HDQ) approach is proposed to ha…
▽ More
This paper presents a novel distributed approach for solving AC power flow (PF) problems. The optimization problem is reformulated into a distributed form using a communication structure corresponding to a hypergraph, by which complex relationships between subgrids can be expressed as hyperedges. Then, a hypergraph-based distributed sequential quadratic programming (HDQ) approach is proposed to handle the reformulated problems, and the hypergraph-based distributed sequential quadratic programming (HDSQP) is used as the inner algorithm to solve the corresponding QP subproblems, which are respectively condensed using Schur complements with respect to coupling variables defined by hyperedges. Furthermore, we rigorously establish the convergence guarantee of the proposed algorithm with a locally quadratic rate and the one-step convergence of the inner algorithm when using the Levenberg-Marquardt regularization. Our analysis also demonstrates that the computational complexity of the proposed algorithm is much lower than the state-of-art distributed algorithm. We implement the proposed algorithm in an open-source toolbox, i.e., rapidPF, and conduct numerical tests that validate the proof and demonstrate the great potential of the proposed distributed algorithm in terms of communication effort and computational speed.
△ Less
Submitted 14 July, 2023; v1 submitted 13 July, 2023;
originally announced July 2023.
-
Bayesian Optimization of Expensive Nested Grey-Box Functions
Authors:
Wenjie Xu,
Yuning Jiang,
Bratislav Svetozarevic,
Colin N. Jones
Abstract:
We consider the problem of optimizing a grey-box objective function, i.e., nested function composed of both black-box and white-box functions. A general formulation for such grey-box problems is given, which covers the existing grey-box optimization formulations as special cases. We then design an optimism-driven algorithm to solve it. Under certain regularity assumptions, our algorithm achieves s…
▽ More
We consider the problem of optimizing a grey-box objective function, i.e., nested function composed of both black-box and white-box functions. A general formulation for such grey-box problems is given, which covers the existing grey-box optimization formulations as special cases. We then design an optimism-driven algorithm to solve it. Under certain regularity assumptions, our algorithm achieves similar regret bound as that for the standard black-box Bayesian optimization algorithm, up to a constant multiplicative term depending on the Lipschitz constants of the functions considered. We further extend our method to the constrained case and discuss special cases. For the commonly used kernel functions, the regret bounds allow us to derive a convergence rate to the optimal solution. Experimental results show that our grey-box optimization method empirically improves the speed of finding the global optimal solution significantly, as compared to the standard black-box optimization algorithm.
△ Less
Submitted 2 August, 2023; v1 submitted 8 June, 2023;
originally announced June 2023.
-
A comparison of methods to eliminate regularization weight tuning from data-enabled predictive control
Authors:
Manuel Koch,
Colin N. Jones
Abstract:
Data-enabled predictive control (DeePC) is a recently established form of Model Predictive Control (MPC), based on behavioral systems theory. While eliminating the need to explicitly identify a model, it requires an additional regularization with a corresponding weight to function well with noisy data. The tuning of this weight is non-trivial and has a significant impact on performance. In this pa…
▽ More
Data-enabled predictive control (DeePC) is a recently established form of Model Predictive Control (MPC), based on behavioral systems theory. While eliminating the need to explicitly identify a model, it requires an additional regularization with a corresponding weight to function well with noisy data. The tuning of this weight is non-trivial and has a significant impact on performance. In this paper, we compare three reformulations of DeePC that either eliminate the regularization, or simplify the tuning to a trivial point. A building simulation study shows a comparable performance for all three reformulations of DeePC. However, a conventional MPC with a black-box model slightly outperforms them, while solving much faster, and yielding smoother optimal trajectories. Two of the DeePC variants also show sensitivity to an unobserved biased input noise, which is not present in the conventional MPC.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
SOStab: a Matlab Toolbox for Transient Stability Analysis
Authors:
Stéphane Drobot,
Matteo Tacchi,
Carmen Cardozo,
Colin N. Jones
Abstract:
This paper presents a new Matlab toolbox, aimed at facilitating the use of polynomial optimization for stability analysis of nonlinear systems. In the past decade several decisive contributions made it possible to recast this type of problems as convex optimization ones that are tractable in modest dimensions. However, available software requires their user to be fluent in Sum-of-Squares programmi…
▽ More
This paper presents a new Matlab toolbox, aimed at facilitating the use of polynomial optimization for stability analysis of nonlinear systems. In the past decade several decisive contributions made it possible to recast this type of problems as convex optimization ones that are tractable in modest dimensions. However, available software requires their user to be fluent in Sum-of-Squares programming, preventing them from being more widely explored by practitioners. To address this issue, SOStab entirely automates the writing and solving of optimization problems, and directly outputs relevant data for the user, while requiring minimal input. In particular, no specific knowledge of optimization is needed for implementation. The toolbox allows a user to obtain outer and inner approximates of the \ac{roa} of the operating point of different grid connected devices such as synchronous machines and power converters.
△ Less
Submitted 2 April, 2024; v1 submitted 18 April, 2023;
originally announced April 2023.
-
Primal-Dual Contextual Bayesian Optimization for Control System Online Optimization with Time-Average Constraints
Authors:
Wenjie Xu,
Yuning Jiang,
Bratislav Svetozarevic,
Colin N. Jones
Abstract:
This paper studies the problem of online performance optimization of constrained closed-loop control systems, where both the objective and the constraints are unknown black-box functions affected by exogenous time-varying contextual disturbances. A primal-dual contextual Bayesian optimization algorithm is proposed that achieves sublinear cumulative regret with respect to the dynamic optimal soluti…
▽ More
This paper studies the problem of online performance optimization of constrained closed-loop control systems, where both the objective and the constraints are unknown black-box functions affected by exogenous time-varying contextual disturbances. A primal-dual contextual Bayesian optimization algorithm is proposed that achieves sublinear cumulative regret with respect to the dynamic optimal solution under certain regularity conditions. Furthermore, the algorithm achieves zero time-average constraint violation, ensuring that the average value of the constraint function satisfies the desired constraint. The method is applied to both sampled instances from Gaussian processes and a continuous stirred tank reactor parameter tuning problem; simulation results show that the method simultaneously provides close-to-optimal performance and maintains constraint feasibility on average. This contrasts current state-of-the-art methods, which either suffer from large cumulative regret or severe constraint violations for the case studies presented.
△ Less
Submitted 20 September, 2023; v1 submitted 12 April, 2023;
originally announced April 2023.
-
Locally imprimitive points on elliptic curves
Authors:
Nathan Jones,
Francesco Pappalardi,
Peter Stevenhagen
Abstract:
Under GRH, any element in the multiplicative group of a number field $K$ that is globally primitive (i.e., not a perfect power in $K^*$) is a primitive root modulo a set of primes of $K$ of positive density. For elliptic curves $E/K$ that are known to have infinitely many primes $\mathfrak p$ of cyclic reduction, possibly under GRH, a globally primitive point $P\in E(K)$ may fail to generate any o…
▽ More
Under GRH, any element in the multiplicative group of a number field $K$ that is globally primitive (i.e., not a perfect power in $K^*$) is a primitive root modulo a set of primes of $K$ of positive density. For elliptic curves $E/K$ that are known to have infinitely many primes $\mathfrak p$ of cyclic reduction, possibly under GRH, a globally primitive point $P\in E(K)$ may fail to generate any of the point groups $E(k_{\mathfrak p})$. We describe this phenomenon in terms of an associated Galois representation $ρ_{E/K, P}:G_K\to\mathrm{GL}_3(\hat{\mathbf Z})$, and use it to construct non-trivial examples of global points on elliptic curves that are locally imprimitive.
△ Less
Submitted 8 April, 2023;
originally announced April 2023.
-
PIQP: A Proximal Interior-Point Quadratic Programming Solver
Authors:
Roland Schwan,
Yuning Jiang,
Daniel Kuhn,
Colin N. Jones
Abstract:
This paper presents PIQP, a high-performance toolkit for solving generic sparse quadratic programs (QP). Combining an infeasible Interior Point Method (IPM) with the Proximal Method of Multipliers (PMM), the algorithm can handle ill-conditioned convex QP problems without the need for linear independence of the constraints. The open-source implementation is written in C++ with interfaces to C, Pyth…
▽ More
This paper presents PIQP, a high-performance toolkit for solving generic sparse quadratic programs (QP). Combining an infeasible Interior Point Method (IPM) with the Proximal Method of Multipliers (PMM), the algorithm can handle ill-conditioned convex QP problems without the need for linear independence of the constraints. The open-source implementation is written in C++ with interfaces to C, Python, Matlab, and R leveraging the Eigen3 library. The method uses a pivoting-free factorization routine and allocation-free updates of the problem data, making the solver suitable for embedded applications. The solver is evaluated on the Maros-Mészáros problem set and optimal control problems, demonstrating state-of-the-art performance for both small and large-scale problems, outperforming commercial and open-source solvers.
△ Less
Submitted 15 September, 2023; v1 submitted 1 April, 2023;
originally announced April 2023.
-
Understanding Frontline Workers' and Unhoused Individuals' Perspectives on AI Used in Homeless Services
Authors:
Tzu-Sheng Kuo,
Hong Shen,
Jisoo Geum,
Nev Jones,
Jason I. Hong,
Haiyi Zhu,
Kenneth Holstein
Abstract:
Recent years have seen growing adoption of AI-based decision-support systems (ADS) in homeless services, yet we know little about stakeholder desires and concerns surrounding their use. In this work, we aim to understand impacted stakeholders' perspectives on a deployed ADS that prioritizes scarce housing resources. We employed AI lifecycle comicboarding, an adapted version of the comicboarding me…
▽ More
Recent years have seen growing adoption of AI-based decision-support systems (ADS) in homeless services, yet we know little about stakeholder desires and concerns surrounding their use. In this work, we aim to understand impacted stakeholders' perspectives on a deployed ADS that prioritizes scarce housing resources. We employed AI lifecycle comicboarding, an adapted version of the comicboarding method, to elicit stakeholder feedback and design ideas across various components of an AI system's design. We elicited feedback from county workers who operate the ADS daily, service providers whose work is directly impacted by the ADS, and unhoused individuals in the region. Our participants shared concerns and design suggestions around the AI system's overall objective, specific model design choices, dataset selection, and use in deployment. Our findings demonstrate that stakeholders, even without AI knowledge, can provide specific and critical feedback on an AI system's design and deployment, if empowered to do so.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Physically Consistent Multiple-Step Data-Driven Predictions Using Physics-based Filters
Authors:
Yingzhao Lian,
Jicheng Shi,
Colin N. Jones
Abstract:
(Extended Version) Data-driven control can facilitate the rapid development of controllers, offering an alternative to conventional approaches. In order to maintain consistency between any known underlying physical laws and a data-driven decision-making process, preprocessing of raw data is necessary to account for measurement noise and any inconsistencies it may introduce. In this paper, we prese…
▽ More
(Extended Version) Data-driven control can facilitate the rapid development of controllers, offering an alternative to conventional approaches. In order to maintain consistency between any known underlying physical laws and a data-driven decision-making process, preprocessing of raw data is necessary to account for measurement noise and any inconsistencies it may introduce. In this paper, we present a physics-based filter to achieve this and demonstrate its effectiveness through practical applications, using real-world datasets collected in a building on the Ecole Polytechnique Federale de Lausanne (EPFL) campus. Two distinct use cases are explored: indoor temperature control and demand response bidding.
△ Less
Submitted 2 June, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
Resource efficient method for representation and measurement of constrained electronic structure states with a quantum computer
Authors:
Kaur Kristjuhan,
Mark Nicholas Jones
Abstract:
We present a novel method for improving the quantum simulation of the ground state energy of molecules. We perform a pre-processing step classically, which reduces the dimensionality of the problem by generating a custom map** which excludes states which violate problem constraints. Subsequently, a specialized measurement scheme is used to extract the expectation value of the problem Hamiltonian…
▽ More
We present a novel method for improving the quantum simulation of the ground state energy of molecules. We perform a pre-processing step classically, which reduces the dimensionality of the problem by generating a custom map** which excludes states which violate problem constraints. Subsequently, a specialized measurement scheme is used to extract the expectation value of the problem Hamiltonian through this map**. We demonstrate that this method reduces the amount of quantum resources needed to run a Variational Quantum Eigensolver (VQE) algorithm without making any approximations to the physics of the quantum chemistry problem.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
Comparison of behavioral systems theory and conventional linear models for predicting building zone temperature in long-term in situ measurements
Authors:
Manuel Koch,
Colin N. Jones
Abstract:
The potential of Model Predictive Control in buildings has been shown many times, being successfully used to achieve various goals, such as minimizing energy consumption or maximizing thermal comfort. However, mass deployment has thus far failed, in part because of the high engineering cost of obtaining and maintaining a sufficiently accurate model. This can be addressed by using adaptive data-dri…
▽ More
The potential of Model Predictive Control in buildings has been shown many times, being successfully used to achieve various goals, such as minimizing energy consumption or maximizing thermal comfort. However, mass deployment has thus far failed, in part because of the high engineering cost of obtaining and maintaining a sufficiently accurate model. This can be addressed by using adaptive data-driven approaches. The idea of using behavioral systems theory for this purpose has recently found traction in the academic community. In this study, we compare variations thereof with different amounts of data used, different regularization weights, and different methods of data selection. Autoregressive models with exogenous inputs (ARX) are used as a well-established reference. All methods are evaluated by performing iterative system identification on two long-term data sets from real occupied buildings, neither of which include artificial excitation for the purpose of system identification. We find that: (1) Sufficient prediction accuracy is achieved with all methods. (2) The ARX models perform slightly better, while having the additional advantages of fewer tuning parameters and faster computation. (3) Adaptive and non-adaptive schemes perform similarly. (4) The regularization weights of the behavioral systems theory methods show the expected trade-off characteristic with an optimal middle value. (5) Using the most recent data yields better performance than selecting data with similar weather as the day to be predicted. (6) More data improves the model performance.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Violation-Aware Contextual Bayesian Optimization for Controller Performance Optimization with Unmodeled Constraints
Authors:
Wenjie Xu,
Colin N Jones,
Bratislav Svetozarevic,
Christopher R. Laughman,
Ankush Chakrabarty
Abstract:
We study the problem of performance optimization of closed-loop control systems with unmodeled dynamics. Bayesian optimization (BO) has been demonstrated to be effective for improving closed-loop performance by automatically tuning controller gains or reference setpoints in a model-free manner. However, BO methods have rarely been tested on dynamical systems with unmodeled constraints and time-var…
▽ More
We study the problem of performance optimization of closed-loop control systems with unmodeled dynamics. Bayesian optimization (BO) has been demonstrated to be effective for improving closed-loop performance by automatically tuning controller gains or reference setpoints in a model-free manner. However, BO methods have rarely been tested on dynamical systems with unmodeled constraints and time-varying ambient conditions. In this paper, we propose a violation-aware contextual BO algorithm (VACBO) that optimizes closed-loop performance while simultaneously learning constraint-feasible solutions under time-varying ambient conditions. Unlike classical constrained BO methods which allow unlimited constraint violations, or 'safe' BO algorithms that are conservative and try to operate with near-zero violations, we allow budgeted constraint violations to improve constraint learning and accelerate optimization. We demonstrate the effectiveness of our proposed VACBO method for energy minimization of industrial vapor compression systems under time-varying ambient temperature and humidity.
△ Less
Submitted 28 January, 2023;
originally announced January 2023.
-
CM elliptic curves and vertically entangled 2-adic groups
Authors:
Nathan Jones
Abstract:
Consider the elliptic curve $E$ given by the Weierstrass equation $y^2 = x^3 - 11x - 14$, which has complex multiplication by the order of conductor $2$ inside $\mathbb{Z}[i]$. It was recently observed in a paper of Daniels and Lozano-Robledo that, for each $n \geq 2$, $\mathbb{Q}(μ_{2^{n+1}}) \subseteq \mathbb{Q}(E[2^n])$. In this note, we prove that this (a priori surprising) ``tower of vertical…
▽ More
Consider the elliptic curve $E$ given by the Weierstrass equation $y^2 = x^3 - 11x - 14$, which has complex multiplication by the order of conductor $2$ inside $\mathbb{Z}[i]$. It was recently observed in a paper of Daniels and Lozano-Robledo that, for each $n \geq 2$, $\mathbb{Q}(μ_{2^{n+1}}) \subseteq \mathbb{Q}(E[2^n])$. In this note, we prove that this (a priori surprising) ``tower of vertical entanglements'' is actually more a feature than a bug: it holds for any elliptic curve $E$ over $\mathbb{Q}$ with complex multiplication by any order of even discriminant.
△ Less
Submitted 3 January, 2023;
originally announced January 2023.
-
Towards Scalable Physically Consistent Neural Networks: an Application to Data-driven Multi-zone Thermal Building Models
Authors:
Loris Di Natale,
Bratislav Svetozarevic,
Philipp Heer,
Colin Neil Jones
Abstract:
With more and more data being collected, data-driven modeling methods have been gaining in popularity in recent years. While physically sound, classical gray-box models are often cumbersome to identify and scale, and their accuracy might be hindered by their limited expressiveness. On the other hand, classical black-box methods, typically relying on Neural Networks (NNs) nowadays, often achieve im…
▽ More
With more and more data being collected, data-driven modeling methods have been gaining in popularity in recent years. While physically sound, classical gray-box models are often cumbersome to identify and scale, and their accuracy might be hindered by their limited expressiveness. On the other hand, classical black-box methods, typically relying on Neural Networks (NNs) nowadays, often achieve impressive performance, even at scale, by deriving statistical patterns from data. However, they remain completely oblivious to the underlying physical laws, which may lead to potentially catastrophic failures if decisions for real-world physical systems are based on them. Physically Consistent Neural Networks (PCNNs) were recently developed to address these aforementioned issues, ensuring physical consistency while still leveraging NNs to attain state-of-the-art accuracy.
In this work, we scale PCNNs to model building temperature dynamics and propose a thorough comparison with classical gray-box and black-box methods. More precisely, we design three distinct PCNN extensions, thereby exemplifying the modularity and flexibility of the architecture, and formally prove their physical consistency. In the presented case study, PCNNs are shown to achieve state-of-the-art accuracy, even outperforming classical NN-based models despite their constrained structure. Our investigations furthermore provide a clear illustration of NNs achieving seemingly good performance while remaining completely physics-agnostic, which can be misleading in practice. While this performance comes at the cost of computational complexity, PCNNs on the other hand show accuracy improvements of 17-35% compared to all other physically consistent methods, paving the way for scalable physically consistent models with state-of-the-art performance.
△ Less
Submitted 4 April, 2023; v1 submitted 23 December, 2022;
originally announced December 2022.
-
Minimizing Age of Information in Spatially Distributed Random Access Wireless Networks
Authors:
Nicholas Jones,
Eytan Modiano
Abstract:
We analyze Age of Information (AoI) in wireless networks where nodes use a spatially adaptive random access scheme to send status updates to a central base station. We show that the set of achievable AoI in this setting is convex, and design policies to minimize weighted sum, min-max, and proportionally fair AoI by setting transmission probabilities as a function of node locations. We show that un…
▽ More
We analyze Age of Information (AoI) in wireless networks where nodes use a spatially adaptive random access scheme to send status updates to a central base station. We show that the set of achievable AoI in this setting is convex, and design policies to minimize weighted sum, min-max, and proportionally fair AoI by setting transmission probabilities as a function of node locations. We show that under the capture model, when the spatial topology of the network is considered, AoI can be significantly improved, and we obtain tight performance bounds on weighted sum and min-max AoI. Finally, we design a policy where each node sets its transmission probability based only on its own distance from the base station, when it does not know the positions of other nodes, and show that it converges to the optimal proportionally fair policy as the size of the network goes to infinity.
△ Less
Submitted 4 January, 2023; v1 submitted 7 December, 2022;
originally announced December 2022.
-
Fresh-CSMA: A Distributed Protocol for Minimizing Age of Information
Authors:
Vishrant Tripathi,
Nicholas Jones,
Eytan Modiano
Abstract:
We consider the design of distributed scheduling algorithms that minimize age of information in single-hop wireless networks. The centralized max-weight policy is known to be nearly optimal in this setting; hence, our goal is to design a distributed CSMA scheme that can mimic its performance. To that end, we propose a distributed protocol called Fresh-CSMA and show that in an idealized setting, Fr…
▽ More
We consider the design of distributed scheduling algorithms that minimize age of information in single-hop wireless networks. The centralized max-weight policy is known to be nearly optimal in this setting; hence, our goal is to design a distributed CSMA scheme that can mimic its performance. To that end, we propose a distributed protocol called Fresh-CSMA and show that in an idealized setting, Fresh-CSMA can match the scheduling decisions of the max-weight policy with high probability in each frame, and also match the theoretical performance guarantees of the max-weight policy over the entire time horizon. We then consider a more realistic setting and study the impact of protocol parameters on the probability of collisions and the overhead caused by the distributed nature of the protocol. We also consider the monitoring of Markov sources and extend our approach to CSMA protocols that incorporate Age of Incorrect Information (AoII) instead of AoI. Finally, we provide simulations that support our theoretical results and show that the performance gap between the ideal and realistic versions of Fresh-CSMA is small.
△ Less
Submitted 12 July, 2023; v1 submitted 6 December, 2022;
originally announced December 2022.
-
Computationally Efficient Reinforcement Learning: Targeted Exploration leveraging Simple Rules
Authors:
Loris Di Natale,
Bratislav Svetozarevic,
Philipp Heer,
Colin N. Jones
Abstract:
Model-free Reinforcement Learning (RL) generally suffers from poor sample complexity, mostly due to the need to exhaustively explore the state-action space to find well-performing policies. On the other hand, we postulate that expert knowledge of the system often allows us to design simple rules we expect good policies to follow at all times. In this work, we hence propose a simple yet effective m…
▽ More
Model-free Reinforcement Learning (RL) generally suffers from poor sample complexity, mostly due to the need to exhaustively explore the state-action space to find well-performing policies. On the other hand, we postulate that expert knowledge of the system often allows us to design simple rules we expect good policies to follow at all times. In this work, we hence propose a simple yet effective modification of continuous actor-critic frameworks to incorporate such rules and avoid regions of the state-action space that are known to be suboptimal, thereby significantly accelerating the convergence of RL agents. Concretely, we saturate the actions chosen by the agent if they do not comply with our intuition and, critically, modify the gradient update step of the policy to ensure the learning process is not affected by the saturation step. On a room temperature control case study, it allows agents to converge to well-performing policies up to 6-7x faster than classical agents without computational overhead and while retaining good final performance.
△ Less
Submitted 12 September, 2023; v1 submitted 29 November, 2022;
originally announced November 2022.
-
Bulk-boundary correspondence and singularity-filling in long-range free-fermion chains
Authors:
Nick G. Jones,
Ryan Thorngren,
Ruben Verresen
Abstract:
The bulk-boundary correspondence relates topologically-protected edge modes to bulk topological invariants, and is well-understood for short-range free-fermion chains. Although case studies have considered long-range Hamiltonians whose couplings decay with a power-law exponent $α$, there has been no systematic study for a free-fermion symmetry class. We introduce a technique for solving gapped, tr…
▽ More
The bulk-boundary correspondence relates topologically-protected edge modes to bulk topological invariants, and is well-understood for short-range free-fermion chains. Although case studies have considered long-range Hamiltonians whose couplings decay with a power-law exponent $α$, there has been no systematic study for a free-fermion symmetry class. We introduce a technique for solving gapped, translationally invariant models in the 1D BDI and AIII symmetry classes with $α>1$, linking together the quantized winding invariant, bulk topological string-order parameters and a complete solution of the edge modes. The physics of these chains is elucidated by studying a complex function determined by the couplings of the Hamiltonian: in contrast to the short-range case where edge modes are associated to roots of this function, we find that they are now associated to singularities. A remarkable consequence is that the finite-size splitting of the edge modes depends on the topological winding number, which can be used as a probe of the latter. We furthermore generalise these results by (i) identifying a family of BDI chains with $α<1$ where our results still hold, and (ii) showing that gapless symmetry-protected topological chains can have topological invariants and edge modes when $α-1$ exceeds the dynamical critical exponent.
△ Less
Submitted 14 April, 2023; v1 submitted 28 November, 2022;
originally announced November 2022.
-
CONFIG: Constrained Efficient Global Optimization for Closed-Loop Control System Optimization with Unmodeled Constraints
Authors:
Wenjie Xu,
Yuning Jiang,
Bratislav Svetozarevic,
Colin N. Jones
Abstract:
In this paper, the CONFIG algorithm, a simple and provably efficient constrained global optimization algorithm, is applied to optimize the closed-loop control performance of an unknown system with unmodeled constraints. Existing Gaussian process based closed-loop optimization methods, either can only guarantee local convergence (e.g., SafeOPT), or have no known optimality guarantee (e.g., constrai…
▽ More
In this paper, the CONFIG algorithm, a simple and provably efficient constrained global optimization algorithm, is applied to optimize the closed-loop control performance of an unknown system with unmodeled constraints. Existing Gaussian process based closed-loop optimization methods, either can only guarantee local convergence (e.g., SafeOPT), or have no known optimality guarantee (e.g., constrained expected improvement) at all, whereas the recently introduced CONFIG algorithm has been proven to enjoy a theoretical global optimality guarantee. In this study, we demonstrate the effectiveness of CONFIG algorithm in the applications. The algorithm is first applied to an artificial numerical benchmark problem to corroborate its effectiveness. It is then applied to a classical constrained steady-state optimization problem of a continuous stirred-tank reactor. Simulation results show that our CONFIG algorithm can achieve performance competitive with the popular CEI (Constrained Expected Improvement) algorithm, which has no known optimality guarantee. As such, the CONFIG algorithm offers a new tool, with both a provable global optimality guarantee and competitive empirical performance, to optimize the closed-loop control performance for a system with soft unmodeled constraints. Last, but not least, the open-source code is available as a python package to facilitate future applications.
△ Less
Submitted 18 December, 2022; v1 submitted 21 November, 2022;
originally announced November 2022.
-
Physically Consistent Neural ODEs for Learning Multi-Physics Systems
Authors:
Muhammad Zakwan,
Loris Di Natale,
Bratislav Svetozarevic,
Philipp Heer,
Colin N. Jones,
Giancarlo Ferrari Trecate
Abstract:
Despite the immense success of neural networks in modeling system dynamics from data, they often remain physics-agnostic black boxes. In the particular case of physical systems, they might consequently make physically inconsistent predictions, which makes them unreliable in practice. In this paper, we leverage the framework of Irreversible port-Hamiltonian Systems (IPHS), which can describe most m…
▽ More
Despite the immense success of neural networks in modeling system dynamics from data, they often remain physics-agnostic black boxes. In the particular case of physical systems, they might consequently make physically inconsistent predictions, which makes them unreliable in practice. In this paper, we leverage the framework of Irreversible port-Hamiltonian Systems (IPHS), which can describe most multi-physics systems, and rely on Neural Ordinary Differential Equations (NODEs) to learn their parameters from data. Since IPHS models are consistent with the first and second principles of thermodynamics by design, so are the proposed Physically Consistent NODEs (PC-NODEs). Furthermore, the NODE training procedure allows us to seamlessly incorporate prior knowledge of the system properties in the learned dynamics. We demonstrate the effectiveness of the proposed method by learning the thermodynamics of a building from the real-world measurements and the dynamics of a simulated gas-piston system. Thanks to the modularity and flexibility of the IPHS framework, PC-NODEs can be extended to learn physically consistent models of multi-physics distributed systems.
△ Less
Submitted 11 November, 2022;
originally announced November 2022.
-
Constrained Efficient Global Optimization of Expensive Black-box Functions
Authors:
Wenjie Xu,
Yuning Jiang,
Bratislav Svetozarevic,
Colin N. Jones
Abstract:
We study the problem of constrained efficient global optimization, where both the objective and constraints are expensive black-box functions that can be learned with Gaussian processes. We propose CONFIG (CONstrained efFIcient Global Optimization), a simple and effective algorithm to solve it. Under certain regularity assumptions, we show that our algorithm enjoys the same cumulative regret bound…
▽ More
We study the problem of constrained efficient global optimization, where both the objective and constraints are expensive black-box functions that can be learned with Gaussian processes. We propose CONFIG (CONstrained efFIcient Global Optimization), a simple and effective algorithm to solve it. Under certain regularity assumptions, we show that our algorithm enjoys the same cumulative regret bound as that in the unconstrained case and similar cumulative constraint violation upper bounds. For commonly used Matern and Squared Exponential kernels, our bounds are sublinear and allow us to derive a convergence rate to the optimal solution of the original constrained problem. In addition, our method naturally provides a scheme to declare infeasibility when the original black-box optimization problem is infeasible. Numerical experiments on sampled instances from the Gaussian process, artificial numerical problems, and a black-box building controller tuning problem all demonstrate the competitive performance of our algorithm. Compared to the other state-of-the-art methods, our algorithm significantly improves the theoretical guarantees, while achieving competitive empirical performance.
△ Less
Submitted 26 April, 2023; v1 submitted 31 October, 2022;
originally announced November 2022.
-
Distributed data-driven predictive control for cooperatively smoothing mixed traffic flow
Authors:
Jiawei Wang,
Yingzhao Lian,
Yuning Jiang,
Qing Xu,
Keqiang Li,
Colin N. Jones
Abstract:
Cooperative control of connected and automated vehicles (CAVs) promises smoother traffic flow. In mixed traffic, where human-driven vehicles with unknown dynamics coexist, data-driven predictive control techniques allow for CAV safe and optimal control with measurable traffic data. However, the centralized control setting in most existing strategies limits their scalability for large-scale mixed t…
▽ More
Cooperative control of connected and automated vehicles (CAVs) promises smoother traffic flow. In mixed traffic, where human-driven vehicles with unknown dynamics coexist, data-driven predictive control techniques allow for CAV safe and optimal control with measurable traffic data. However, the centralized control setting in most existing strategies limits their scalability for large-scale mixed traffic flow. To address this problem, this paper proposes a cooperative DeeP-LCC (Data-EnablEd Predictive Leading Cruise Control) formulation and its distributed implementation algorithm. In cooperative DeeP-LCC, the traffic system is naturally partitioned into multiple subsystems with one single CAV, which collects local trajectory data for subsystem behavior predictions based on the Willems' fundamental lemma. Meanwhile, the cross-subsystem interaction is formulated as a coupling constraint. Then, we employ the Alternating Direction Method of Multipliers (ADMM) to design the distributed DeeP-LCC algorithm. This algorithm achieves both computation and communication efficiency, as well as trajectory data privacy, through parallel calculation. Our simulations on different traffic scales verify the real-time wave-dampening potential of distributed DeeP-LCC, which can reduce fuel consumption by over 31.84% in a large-scale traffic system of 100 vehicles with only 5%-20% CAVs.
△ Less
Submitted 28 April, 2023; v1 submitted 24 October, 2022;
originally announced October 2022.
-
Uncertainty-aware Flexibility Envelope Prediction in Buildings with Controller-agnostic Battery Models
Authors:
Paul Scharnhorst,
Baptiste Schubnel,
Rafael E. Carrillo,
Pierre-Jean Alet,
Colin N. Jones
Abstract:
Buildings are a promising source of flexibility for the application of demand response. In this work, we introduce a novel battery model formulation to capture the state evolution of a single building. Being fully data-driven, the battery model identification requires one dataset from a period of nominal controller operation, and one from a period with flexibility requests, without making any assu…
▽ More
Buildings are a promising source of flexibility for the application of demand response. In this work, we introduce a novel battery model formulation to capture the state evolution of a single building. Being fully data-driven, the battery model identification requires one dataset from a period of nominal controller operation, and one from a period with flexibility requests, without making any assumptions on the underlying controller structure. We consider parameter uncertainty in the model formulation and show how to use risk measures to encode risk preferences of the user in robust uncertainty sets. Finally, we demonstrate the uncertainty-aware prediction of flexibility envelopes for a building simulation model from the Python library Energym.
△ Less
Submitted 7 October, 2022;
originally announced October 2022.
-
Lower Bounds on the Worst-Case Complexity of Efficient Global Optimization
Authors:
Wenjie Xu,
Yuning Jiang,
Emilio T. Maddalena,
Colin N. Jones
Abstract:
Efficient global optimization is a widely used method for optimizing expensive black-box functions such as tuning hyperparameter, and designing new material, etc. Despite its popularity, less attention has been paid to analyzing the inherent hardness of the problem although, given its extensive use, it is important to understand the fundamental limits of efficient global optimization algorithms. I…
▽ More
Efficient global optimization is a widely used method for optimizing expensive black-box functions such as tuning hyperparameter, and designing new material, etc. Despite its popularity, less attention has been paid to analyzing the inherent hardness of the problem although, given its extensive use, it is important to understand the fundamental limits of efficient global optimization algorithms. In this paper, we study the worst-case complexity of the efficient global optimization problem and, in contrast to existing kernel-specific results, we derive a unified lower bound for the complexity of efficient global optimization in terms of the metric entropy of a ball in its corresponding reproducing kernel Hilbert space~(RKHS). Specifically, we show that if there exists a deterministic algorithm that achieves suboptimality gap smaller than $ε$ for any function $f\in S$ in $T$ function evaluations, it is necessary that $T$ is at least $Ω\left(\frac{\log\mathcal{N}(S(\mathcal{X}), 4ε,\|\cdot\|_\infty)}{\log(\frac{R}ε)}\right)$, where $\mathcal{N}(\cdot,\cdot,\cdot)$ is the covering number, $S$ is the ball centered at $0$ with radius $R$ in the RKHS and $S(\mathcal{X})$ is the restriction of $S$ over the feasible set $\mathcal{X}$. Moreover, we show that this lower bound nearly matches the upper bound attained by non-adaptive search algorithms for the commonly used squared exponential kernel and the Matérn kernel with a large smoothness parameter $ν$, up to a replacement of $d/2$ by $d$ and a logarithmic term $\log\frac{R}ε$. That is to say, our lower bound is nearly optimal for these kernels.
△ Less
Submitted 20 September, 2022;
originally announced September 2022.