-
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Authors:
Seungone Kim,
Juyoung Suk,
Ji Yong Cho,
Shayne Longpre,
Chaeeun Kim,
Dongkeun Yoon,
Guijin Son,
Yejin Cho,
Sheikh Shafayat,
Jinheon Baek,
Sue Hyun Park,
Hyeonbin Hwang,
Jinkyung Jo,
Hyowon Cho,
Haebin Shin,
Seongyun Lee,
Hanseok Oh,
Noah Lee,
Namgyu Ho,
Se June Joo,
Miyoung Ko,
Yoonjoo Lee,
Hyungjoo Chae,
Jamin Shin,
Joel Jang, et al. (7 additional authors not shown)
Abstract:
As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on specific capabilities such as instruction following, leading to coverage bias. To overcome these limitations, we introduce the BiGGen Bench, a principled generation benchmark designed to thoroughly evaluate nine distinct capabilities of LMs across 77 diverse tasks. A key feature of the BiGGen Bench is its use of instance-specific evaluation criteria, closely mirroring the nuanced discernment of human evaluation. We apply this benchmark to assess 103 frontier LMs using five evaluator LMs. Our code, data, and evaluation results are all publicly available at https://github.com/prometheus-eval/prometheus-eval/tree/main/BiGGen-Bench.
Submitted 9 June, 2024;
originally announced June 2024.
-
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Authors:
Seungone Kim,
Juyoung Suk,
Shayne Longpre,
Bill Yuchen Lin,
Jamin Shin,
Sean Welleck,
Graham Neubig,
Moontae Lee,
Kyungjae Lee,
Minjoon Seo
Abstract:
Proprietary LMs such as GPT-4 are often employed to assess the quality of responses from various LMs. However, concerns including transparency, controllability, and affordability strongly motivate the development of open-source LMs specialized in evaluations. On the other hand, existing open evaluator LMs exhibit critical shortcomings: 1) they issue scores that significantly diverge from those assigned by humans, and 2) they lack the flexibility to perform both direct assessment and pairwise ranking, the two most prevalent forms of assessment. Additionally, they do not possess the ability to evaluate based on custom evaluation criteria, focusing instead on general attributes like helpfulness and harmlessness. To address these issues, we introduce Prometheus 2, a more powerful evaluator LM than its predecessor that closely mirrors human and GPT-4 judgements. Moreover, it is capable of processing both direct assessment and pairwise ranking formats grouped with user-defined evaluation criteria. On four direct assessment benchmarks and four pairwise ranking benchmarks, Prometheus 2 scores the highest correlation and agreement with humans and proprietary LM judges among all tested open evaluator LMs. Our models, code, and data are all publicly available at https://github.com/prometheus-eval/prometheus-eval.
Submitted 2 May, 2024;
originally announced May 2024.
-
Optimal and Adaptive Non-Stationary Dueling Bandits Under a Generalized Borda Criterion
Authors:
Joe Suk,
Arpit Agarwal
Abstract:
In dueling bandits, the learner receives preference feedback between arms, and the regret of an arm is defined in terms of its suboptimality to a winner arm. The more challenging and practically motivated non-stationary variant of dueling bandits, where preferences change over time, has been the focus of several recent works (Saha and Gupta, 2022; Buening and Saha, 2023; Suk and Agarwal, 2023). The goal is to design algorithms without foreknowledge of the amount of change.
The bulk of known results here studies the Condorcet winner setting, where an arm preferred over any other exists at all times. Yet, such a winner may not exist and, to contrast, the Borda version of this problem (which is always well-defined) has received little attention. In this work, we establish the first optimal and adaptive Borda dynamic regret upper bound, which highlights fundamental differences in the learnability of severe non-stationarity between Condorcet vs. Borda regret objectives in dueling bandits.
Surprisingly, our techniques for non-stationary Borda dueling bandits also yield improved rates within the Condorcet winner setting, and reveal new preference models where tighter notions of non-stationarity are adaptively learnable. This is accomplished through a novel generalized Borda score framework which unites the Borda and Condorcet problems, thus allowing reduction of Condorcet regret to a Borda-like task. Such a generalization was not previously known and is likely to be of independent interest.
Submitted 19 March, 2024;
originally announced March 2024.
-
LaB-GATr: geometric algebra transformers for large biomedical surface and volume meshes
Authors:
Julian Suk,
Baris Imre,
Jelmer M. Wolterink
Abstract:
Many anatomical structures can be described by surface or volume meshes. Machine learning is a promising tool to extract information from these 3D models. However, high-fidelity meshes often contain hundreds of thousands of vertices, which creates unique challenges in building deep neural network architectures. Furthermore, patient-specific meshes may not be canonically aligned, which limits the generalisation of machine learning algorithms. We propose LaB-GATr, a transformer neural network with geometric tokenisation that can effectively learn with large-scale (bio-)medical surface and volume meshes through sequence compression and interpolation. Our method extends the recently proposed geometric algebra transformer (GATr) and thus respects all Euclidean symmetries, i.e. rotation, translation and reflection, effectively mitigating the problem of canonical alignment between patients. LaB-GATr achieves state-of-the-art results on three tasks in cardiovascular hemodynamics modelling and neurodevelopmental phenotype prediction, featuring meshes of up to 200,000 vertices. Our results demonstrate that LaB-GATr is a powerful architecture for learning with high-fidelity meshes which has the potential to enable interesting downstream applications. Our implementation is publicly available.
Submitted 12 March, 2024;
originally announced March 2024.
-
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
Authors:
Eunsu Kim,
Juyoung Suk,
Philhoon Oh,
Haneul Yoo,
James Thorne,
Alice Oh
Abstract:
Despite the rapid development of large language models (LLMs) for the Korean language, there remains an obvious lack of benchmark datasets that test the requisite Korean cultural and linguistic knowledge. Because many existing Korean benchmark datasets are derived from the English counterparts through translation, they often overlook the different cultural contexts. For the few benchmark datasets that are sourced from Korean data capturing cultural knowledge, only narrow tasks such as bias and hate speech detection are offered. To address this gap, we introduce a benchmark of Cultural and Linguistic Intelligence in Korean (CLIcK), a dataset comprising 1,995 QA pairs. CLIcK sources its data from official Korean exams and textbooks, partitioning the questions into eleven categories under the two main categories of language and culture. For each instance in CLIcK, we provide fine-grained annotation of which cultural and linguistic knowledge is required to answer the question correctly. Using CLIcK, we test 13 language models to assess their performance. Our evaluation uncovers insights into their performances across the categories, as well as the diverse factors affecting their comprehension. CLIcK offers the first large-scale comprehensive Korean-centric analysis of LLMs' proficiency in Korean culture and language.
Submitted 15 March, 2024; v1 submitted 10 March, 2024;
originally announced March 2024.
-
SIRE: scale-invariant, rotation-equivariant estimation of artery orientations using graph neural networks
Authors:
Dieuwertje Alblas,
Julian Suk,
Christoph Brune,
Kak Khee Yeung,
Jelmer M. Wolterink
Abstract:
Blood vessel orientation as visualized in 3D medical images is an important descriptor of its geometry that can be used for centerline extraction and subsequent segmentation and visualization. Arteries appear at many scales and levels of tortuosity, and determining their exact orientation is challenging. Recent works have used 3D convolutional neural networks (CNNs) for this purpose, but CNNs are sensitive to varying vessel sizes and orientations. We present SIRE: a scale-invariant, rotation-equivariant estimator for local vessel orientation. SIRE is modular and can generalise due to symmetry preservation.
SIRE consists of a gauge equivariant mesh CNN (GEM-CNN) operating on multiple nested spherical meshes with different sizes in parallel. The features on each mesh are a projection of image intensities within the corresponding sphere. These features are intrinsic to the sphere and, in combination with the GEM-CNN, lead to SO(3)-equivariance. Approximate scale invariance is achieved by weight sharing and use of a symmetric maximum function to combine multi-scale predictions. Hence, SIRE can be trained with arbitrarily oriented vessels with varying radii to generalise to vessels with a wide range of calibres and tortuosity.
We demonstrate the efficacy of SIRE using three datasets containing vessels of varying scales: the vascular model repository (VMR), the ASOCA coronary artery set, and a set of abdominal aortic aneurysms (AAAs). We embed SIRE in a centerline tracker which accurately tracks AAAs, regardless of the data SIRE is trained with. Moreover, SIRE can be used to track coronary arteries, even when trained only with AAAs.
In conclusion, by incorporating SO(3) and scale symmetries, SIRE can determine the orientations of vessels outside of the training domain, forming a robust and data-efficient solution to geometric analysis of blood vessels in 3D medical images.
Submitted 9 November, 2023;
originally announced November 2023.
-
Tracking Most Significant Shifts in Nonparametric Contextual Bandits
Authors:
Joe Suk,
Samory Kpotufe
Abstract:
We study nonparametric contextual bandits where Lipschitz mean reward functions may change over time. We first establish the minimax dynamic regret rate in this less understood setting in terms of number of changes $L$ and total-variation $V$, both capturing all changes in distribution over context space, and argue that state-of-the-art procedures are suboptimal in this setting.
Next, we tend to the question of adaptivity for this setting, i.e. achieving the minimax rate without knowledge of $L$ or $V$. Quite importantly, we posit that the bandit problem, viewed locally at a given context $X_t$, should not be affected by reward changes in other parts of context space $\cal X$. We therefore propose a notion of change, which we term experienced significant shifts, that better accounts for locality, and thus counts considerably fewer changes than $L$ and $V$. Furthermore, similar to recent work on non-stationary MAB (Suk & Kpotufe, 2022), experienced significant shifts only count the most significant changes in mean rewards, e.g., severe best-arm changes relevant to observed contexts.
Our main result is to show that this more tolerant notion of change can in fact be adapted to.
Submitted 18 November, 2023; v1 submitted 11 July, 2023;
originally announced July 2023.
-
Generative modeling of living cells with SO(3)-equivariant implicit neural representations
Authors:
David Wiesner,
Julian Suk,
Sven Dummer,
Tereza Nečasová,
Vladimír Ulman,
David Svoboda,
Jelmer M. Wolterink
Abstract:
Data-driven cell tracking and segmentation methods in biomedical imaging require diverse and information-rich training data. In cases where the number of training samples is limited, synthetic computer-generated data sets can be used to improve these methods. This requires the synthesis of cell shapes as well as corresponding microscopy images using generative models. To synthesize realistic living cell shapes, the shape representation used by the generative model should be able to accurately represent fine details and changes in topology, which are common in cells. These requirements are not met by 3D voxel masks, which are restricted in resolution, and polygon meshes, which do not easily model processes like cell growth and mitosis. In this work, we propose to represent living cell shapes as level sets of signed distance functions (SDFs) which are estimated by neural networks. We optimize a fully-connected neural network to provide an implicit representation of the SDF value at any point in a 3D+time domain, conditioned on a learned latent code that is disentangled from the rotation of the cell shape. We demonstrate the effectiveness of this approach on cells that exhibit rapid deformations (Platynereis dumerilii), cells that grow and divide (C. elegans), and cells that have growing and branching filopodial protrusions (A549 human lung carcinoma cells). A quantitative evaluation using shape features and Dice similarity coefficients of real and synthetic cell shapes shows that our model can generate topologically plausible complex cell shapes in 3D+time with high similarity to real living cell shapes. Finally, we show how microscopy images of living cells that correspond to our generated cell shapes can be synthesized using an image-to-image model.
Submitted 12 October, 2023; v1 submitted 18 April, 2023;
originally announced April 2023.
-
SE(3) symmetry lets graph neural networks learn arterial velocity estimation from small datasets
Authors:
Julian Suk,
Christoph Brune,
Jelmer M. Wolterink
Abstract:
Hemodynamic velocity fields in coronary arteries could be the basis of valuable biomarkers for diagnosis, prognosis and treatment planning in cardiovascular disease. Velocity fields are typically obtained from patient-specific 3D artery models via computational fluid dynamics (CFD). However, CFD simulation requires meticulous setup by experts and is time-intensive, which hinders large-scale acceptance in clinical practice. To address this, we propose graph neural networks (GNN) as an efficient black-box surrogate method to estimate 3D velocity fields mapped to the vertices of tetrahedral meshes of the artery lumen. We train these GNNs on synthetic artery models and CFD-based ground truth velocity fields. Once the GNN is trained, velocity estimates in a new and unseen artery can be obtained with 36-fold speed-up compared to CFD. We demonstrate how to construct an SE(3)-equivariant GNN that is independent of the spatial orientation of the input mesh and show how this reduces the necessary amount of training data compared to a baseline neural network.
Submitted 4 August, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
-
When Can We Track Significant Preference Shifts in Dueling Bandits?
Authors:
Joe Suk,
Arpit Agarwal
Abstract:
The $K$-armed dueling bandits problem, where the feedback is in the form of noisy pairwise preferences, has been widely studied due to its applications in information retrieval, recommendation systems, etc. Motivated by concerns that user preferences/tastes can evolve over time, we consider the problem of dueling bandits with distribution shifts. Specifically, we study the recent notion of significant shifts (Suk and Kpotufe, 2022), and ask whether one can design an adaptive algorithm for the dueling problem with $O(\sqrt{K\tilde{L}T})$ dynamic regret, where $\tilde{L}$ is the (unknown) number of significant shifts in preferences. We show that the answer to this question depends on the properties of underlying preference distributions.
Firstly, we give an impossibility result that rules out any algorithm with $O(\sqrt{K\tilde{L}T})$ dynamic regret under the well-studied Condorcet and SST classes of preference distributions. Secondly, we show that $\text{SST} \cap \text{STI}$ is the largest amongst popular classes of preference distributions where it is possible to design such an algorithm. Overall, our results provide an almost complete resolution of the above question for the hierarchy of distribution classes.
Submitted 24 January, 2024; v1 submitted 13 February, 2023;
originally announced February 2023.
-
Mesh Neural Networks for SE(3)-Equivariant Hemodynamics Estimation on the Artery Wall
Authors:
Julian Suk,
Pim de Haan,
Phillip Lippe,
Christoph Brune,
Jelmer M. Wolterink
Abstract:
Computational fluid dynamics (CFD) is a valuable asset for patient-specific cardiovascular-disease diagnosis and prognosis, but its high computational demands hamper its adoption in practice. Machine-learning methods that estimate blood flow in individual patients could accelerate or replace CFD simulation to overcome these limitations. In this work, we consider the estimation of vector-valued quantities on the wall of three-dimensional geometric artery models. We employ group equivariant graph convolution in an end-to-end SE(3)-equivariant neural network that operates directly on triangular surface meshes and makes efficient use of training data. We run experiments on a large dataset of synthetic coronary arteries and find that our method estimates directional wall shear stress (WSS) with an approximation error of 7.6% and normalised mean absolute error (NMAE) of 0.4% while up to two orders of magnitude faster than CFD. Furthermore, we show that our method is powerful enough to accurately predict transient, vector-valued WSS over the cardiac cycle while conditioned on a range of different inflow boundary conditions. These results demonstrate the potential of our proposed method as a plugin replacement for CFD in the personalised prediction of hemodynamic vector and scalar fields.
Submitted 14 June, 2024; v1 submitted 9 December, 2022;
originally announced December 2022.
-
Implicit Neural Representations for Generative Modeling of Living Cell Shapes
Authors:
David Wiesner,
Julian Suk,
Sven Dummer,
David Svoboda,
Jelmer M. Wolterink
Abstract:
Methods allowing the synthesis of realistic cell shapes could help generate training data sets to improve cell tracking and segmentation in biomedical images. Deep generative models for cell shape synthesis require a light-weight and flexible representation of the cell shape. However, commonly used voxel-based representations are unsuitable for high-resolution shape synthesis, and polygon meshes have limitations when modeling topology changes such as cell growth or mitosis. In this work, we propose to use level sets of signed distance functions (SDFs) to represent cell shapes. We optimize a neural network as an implicit neural representation of the SDF value at any point in a 3D+time domain. The model is conditioned on a latent code, thus allowing the synthesis of new and unseen shape sequences. We validate our approach quantitatively and qualitatively on C. elegans cells that grow and divide, and lung cancer cells with growing complex filopodial protrusions. Our results show that shape descriptors of synthetic cells resemble those of real cells, and that our model is able to generate topologically plausible sequences of complex cell shapes in 3D+time.
Submitted 6 October, 2022; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Tracking Most Significant Arm Switches in Bandits
Authors:
Joe Suk,
Samory Kpotufe
Abstract:
In bandits with distribution shifts, one aims to automatically adapt to unknown changes in reward distribution, and restart exploration when necessary. While this problem has been studied for many years, a recent breakthrough of Auer et al. (2018, 2019) provides the first adaptive procedure to guarantee an optimal (dynamic) regret $\sqrt{LT}$, for $T$ rounds, and an unknown number $L$ of changes. However, while this rate is tight in the worst case, it remained open whether faster rates are possible, without prior knowledge, if few changes in distribution are actually severe.
To resolve this question, we propose a new notion of significant shift, which only counts very severe changes that clearly necessitate a restart: roughly, these are changes involving not only best arm switches, but also involving large aggregate differences in reward over time. Thus, our resulting procedure adaptively achieves rates always faster (sometimes significantly) than $O(\sqrt{ST})$, where $S\ll L$ only counts best arm switches, while at the same time, always faster than the optimal $O(V^{\frac{1}{3}}T^{\frac{2}{3}})$ when expressed in terms of total variation $V$ (which aggregates differences over time). Our results are expressed in enough generality to also capture non-stochastic adversarial settings.
Submitted 16 June, 2022; v1 submitted 27 December, 2021;
originally announced December 2021.
-
Mesh convolutional neural networks for wall shear stress estimation in 3D artery models
Authors:
Julian Suk,
Pim de Haan,
Phillip Lippe,
Christoph Brune,
Jelmer M. Wolterink
Abstract:
Computational fluid dynamics (CFD) is a valuable tool for personalised, non-invasive evaluation of hemodynamics in arteries, but its complexity and time-consuming nature prohibit large-scale use in practice. Recently, the use of deep learning for rapid estimation of CFD parameters like wall shear stress (WSS) on surface meshes has been investigated. However, existing approaches typically depend on a hand-crafted re-parametrisation of the surface mesh to match convolutional neural network architectures. In this work, we propose to instead use mesh convolutional neural networks that directly operate on the same finite-element surface mesh as used in CFD. We train and evaluate our method on two datasets of synthetic coronary artery models with and without bifurcation, using a ground truth obtained from CFD simulation. We show that our flexible deep learning model can accurately predict 3D WSS vectors on this surface mesh. Our method processes new meshes in less than 5 [s], consistently achieves a normalised mean absolute error of $\leq$ 1.6 [%], and peaks at 90.5 [%] median approximation accuracy over the held-out test set, comparing favourably to previously published work. This demonstrates the feasibility of CFD surrogate modelling using mesh convolutional neural networks for hemodynamic parameter estimation in artery models.
Submitted 20 January, 2022; v1 submitted 10 September, 2021;
originally announced September 2021.
-
Self-Tuning Bandits over Unknown Covariate-Shifts
Authors:
Joseph Suk,
Samory Kpotufe
Abstract:
Bandits with covariates, a.k.a. contextual bandits, address situations where optimal actions (or arms) at a given time $t$, depend on a context $x_t$, e.g., a new patient's medical history, a consumer's past purchases. While it is understood that the distribution of contexts might change over time, e.g., due to seasonalities, or deployment to new environments, the bulk of studies concern the most adversarial such changes, resulting in regret bounds that are often worst-case in nature.
Covariate-shift, on the other hand, has been considered in classification as a middle-ground formalism that can capture mild to relatively severe changes in distributions. We consider nonparametric bandits under such middle-ground scenarios, and derive new regret bounds that tightly capture a continuum of changes in context distribution. Furthermore, we show that these rates can be adaptively attained without knowledge of the time of shift or the amount of shift.
Submitted 20 February, 2021; v1 submitted 16 July, 2020;
originally announced July 2020.
-
Practicable Simulation-Free Model Order Reduction by Nonlinear Moment Matching
Authors:
Maria Cruz Varona,
Raphael Gebhart,
Julian Suk,
Boris Lohmann
Abstract:
In this paper, a practicable simulation-free model order reduction method by nonlinear moment matching is developed. Based on the steady-state interpretation of linear moment matching, we comprehensively explain the extension of this reduction concept to nonlinear systems presented in [1], provide some new insights and propose some simplifications to achieve a feasible and numerically efficient nonlinear model reduction algorithm. This algorithm relies on the solution of nonlinear systems of equations rather than on the expensive simulation of the original model or the difficult solution of a nonlinear partial differential equation.
Submitted 30 January, 2019;
originally announced January 2019.
-
Factorizations of $k$-Nonnegative Matrices
Authors:
Sunita Chepuri,
Neeraja Kulkarni,
Joe Suk,
Ewin Tang
Abstract:
A matrix is $k$-nonnegative if all its minors of size $k$ or less are nonnegative. We give a parametrized set of generators and relations for the semigroup of $k$-nonnegative $n\times n$ invertible matrices in two special cases: when $k = n-1$ and when $k = n-2$, restricted to unitriangular matrices. For these two cases, we prove that the set of $k$-nonnegative matrices can be partitioned into cells based on their factorizations into generators, generalizing the notion of Bruhat cells from totally nonnegative matrices. Like Bruhat cells, these cells are homeomorphic to open balls and have a topological structure that neatly relates closure of cells to subwords of factorizations. In the case of $(n-2)$-nonnegative unitriangular matrices, we show the cells form a Bruhat-like CW-complex.
Submitted 30 October, 2017;
originally announced October 2017.
-
Dihedral Sieving Phenomena
Authors:
Sujit Rao,
Joe Suk
Abstract:
Cyclic sieving is a well-known phenomenon where certain interesting polynomials, especially $q$-analogues, have useful interpretations related to actions and representations of the cyclic group. We propose a definition of sieving for an arbitrary group $G$ and study it for the dihedral group $I_2(n)$ of order $2n$. This requires understanding the generators of the representation ring of the dihedral group. For $n$ odd, we exhibit several instances of dihedral sieving which involve the generalized Fibonomial coefficients, recently studied by Amdeberhan, Chen, Moll, and Sagan. We also exhibit an instance of dihedral sieving involving Garsia and Haiman's $(q,t)$-Catalan numbers.
Submitted 8 March, 2019; v1 submitted 17 October, 2017;
originally announced October 2017.
-
Utility Max-Min Fair Link Adaptation in IEEE 802.11ac Downlink Multi-User
Authors:
Ali A. Khavasi,
Mojtaba Aajami,
Hae-Ryeon Park,
Jung-Bong Suk
Abstract:
In this letter, we propose a novel model and corresponding algorithms to address optimal utility max-min fair link adaptation in the Downlink Multi-User (DL-MU) feature of the emerging IEEE 802.11ac WLAN standard. We first propose a simple yet accurate model to formulate the max-min fair link adaptation problem. This model guarantees the minimum utility gain of each receiver according to its requirements. In the second step, we show that the optimal solution of the proposed model can be obtained in polynomial time, and we then propose and analyze the solution algorithms. The simulation results demonstrate that the proposed utility-aware link adaptation approach achieves significant improvements in max-min fairness and utility gain compared to utility-oblivious schemes.
Submitted 24 March, 2014;
originally announced March 2014.
-
Graphene films with large domain size by a two-step chemical vapor deposition process
Authors:
Xuesong Li,
Carl W. Magnuson,
Archana Venugopal,
**ho An,
Ji Won Suk,
Boyang Han,
Mark Borysiak,
Weiwei Cai,
Aruna Velamakanni,
Yanwu Zhu,
Lianfeng Fu,
Eric M. Vogel,
Edgar Voelkl,
Luigi Colombo,
Rodney S. Ruoff
Abstract:
The fundamental properties of graphene are making it an attractive material for a wide variety of applications. Various techniques have been developed to produce graphene, and recently we discovered the synthesis of large area graphene by chemical vapor deposition (CVD) of methane on Cu foils. We also showed that graphene growth on Cu is a surface-mediated process and that the films were polycrystalline with domains having an area of tens of square microns. In this paper we report on the effect of growth parameters, such as temperature and the methane flow rate and partial pressure, on the growth rate, domain size, and surface coverage of graphene as determined by Raman spectroscopy and by transmission and scanning electron microscopy. Based on the results, we developed a two-step CVD process to synthesize graphene films with domains having an area of hundreds of square microns. Scanning electron microscopy and Raman spectroscopy clearly show an increase in domain size when the growth parameters are changed. Transmission electron microscopy further shows that the domains are crystallographically rotated with respect to each other, with a range of angles from about 13 degrees to nearly 30 degrees. Electrical transport measurements performed on back-gated FETs show that, overall, films with larger domains tend to have higher carrier mobility, up to about 16,000 cm$^2$ V$^{-1}$ s$^{-1}$ at room temperature.
Submitted 22 October, 2010;
originally announced October 2010.
-
Domain (Grain) Boundaries and Evidence of Twin Like Structures in CVD Grown Graphene
Authors:
**ho An,
Edgar Voelkl,
Jiwon Suk,
Xuesong Li,
Carl W. Magnuson,
Lianfeng Fu,
Peter Tiemeijer,
Maarten Bischoff,
Bert Freitag,
Elmira Popova,
Rodney S. Ruoff
Abstract:
Understanding and engineering the domain boundaries in chemical vapor deposited (CVD) monolayer graphene will be critical for improving its properties. In this study, a combination of transmission electron microscopy (TEM) techniques including selected area electron diffraction (SAED), high resolution transmission electron microscopy (HRTEM), and dark field (DF) TEM was used to study the boundary orientation angle distribution and the nature of the carbon bonds at the domain boundaries. This report provides an important first step towards a fundamental understanding of these domain boundaries. The results show that, for the graphene grown in this study, the 46 measured misorientation angles all lie between 11 and 30 degrees, with the exception of one at 7 degrees. HRTEM images show the presence of adsorbates in almost all of the boundary areas. When a boundary was imaged, defects (dangling bonds) were seen at the boundaries that likely contribute to adsorbates binding there. DF-TEM images also showed the presence of a 'twin like' boundary.
Submitted 19 October, 2010;
originally announced October 2010.