Skip to main content

Showing 1–21 of 21 results for author: Suk, J

.
  1. arXiv:2406.05761  [pdf, other

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Gui** Son, Ye** Cho, Sheikh Shafayat, **heon Baek, Sue Hyun Park, Hyeonbin Hwang, **kyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang , et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  2. arXiv:2405.01535  [pdf, other

    cs.CL

    Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

    Authors: Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo

    Abstract: Proprietary LMs such as GPT-4 are often employed to assess the quality of responses from various LMs. However, concerns including transparency, controllability, and affordability strongly motivate the development of open-source LMs specialized in evaluations. On the other hand, existing open evaluator LMs exhibit critical shortcomings: 1) they issue scores that significantly diverge from those ass… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Work in Progress

  3. arXiv:2403.12950  [pdf, other

    cs.LG stat.ML

    Optimal and Adaptive Non-Stationary Dueling Bandits Under a Generalized Borda Criterion

    Authors: Joe Suk, Arpit Agarwal

    Abstract: In dueling bandits, the learner receives preference feedback between arms, and the regret of an arm is defined in terms of its suboptimality to a winner arm. The more challenging and practically motivated non-stationary variant of dueling bandits, where preferences change over time, has been the focus of several recent works (Saha and Gupta, 2022; Buening and Saha, 2023; Suk and Agarwal, 2023). Th… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2403.07536  [pdf, ps, other

    cs.CV cs.LG

    LaB-GATr: geometric algebra transformers for large biomedical surface and volume meshes

    Authors: Julian Suk, Baris Imre, Jelmer M. Wolterink

    Abstract: Many anatomical structures can be described by surface or volume meshes. Machine learning is a promising tool to extract information from these 3D models. However, high-fidelity meshes often contain hundreds of thousands of vertices, which creates unique challenges in building deep neural network architectures. Furthermore, patient-specific meshes may not be canonically aligned which limits the ge… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  5. arXiv:2403.06412  [pdf, other

    cs.CL

    CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

    Authors: Eunsu Kim, Juyoung Suk, Philhoon Oh, Haneul Yoo, James Thorne, Alice Oh

    Abstract: Despite the rapid development of large language models (LLMs) for the Korean language, there remains an obvious lack of benchmark datasets that test the requisite Korean cultural and linguistic knowledge. Because many existing Korean benchmark datasets are derived from the English counterparts through translation, they often overlook the different cultural contexts. For the few benchmark datasets… ▽ More

    Submitted 15 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  6. arXiv:2311.05400  [pdf, other

    cs.CV cs.LG

    SIRE: scale-invariant, rotation-equivariant estimation of artery orientations using graph neural networks

    Authors: Dieuwertje Alblas, Julian Suk, Christoph Brune, Kak Khee Yeung, Jelmer M. Wolterink

    Abstract: Blood vessel orientation as visualized in 3D medical images is an important descriptor of its geometry that can be used for centerline extraction and subsequent segmentation and visualization. Arteries appear at many scales and levels of tortuosity, and determining their exact orientation is challenging. Recent works have used 3D convolutional neural networks (CNNs) for this purpose, but CNNs are… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Submitted to Medical Image Analysis

  7. arXiv:2307.05341  [pdf, other

    stat.ML cs.LG

    Tracking Most Significant Shifts in Nonparametric Contextual Bandits

    Authors: Joe Suk, Samory Kpotufe

    Abstract: We study nonparametric contextual bandits where Lipschitz mean reward functions may change over time. We first establish the minimax dynamic regret rate in this less understood setting in terms of number of changes $L$ and total-variation $V$, both capturing all changes in distribution over context space, and argue that state-of-the-art procedures are suboptimal in this setting. Next, we tend to… ▽ More

    Submitted 18 November, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

  8. arXiv:2304.08960  [pdf, other

    cs.CV cs.LG eess.IV q-bio.QM

    Generative modeling of living cells with SO(3)-equivariant implicit neural representations

    Authors: David Wiesner, Julian Suk, Sven Dummer, Tereza Nečasová, Vladimír Ulman, David Svoboda, Jelmer M. Wolterink

    Abstract: Data-driven cell tracking and segmentation methods in biomedical imaging require diverse and information-rich training data. In cases where the number of training samples is limited, synthetic computer-generated data sets can be used to improve these methods. This requires the synthesis of cell shapes as well as corresponding microscopy images using generative models. To synthesize realistic livin… ▽ More

    Submitted 12 October, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: Medical Image Analysis (MedIA) 2023 (Accepted)

  9. arXiv:2302.08780  [pdf, ps, other

    cs.LG math.GR physics.flu-dyn

    SE(3) symmetry lets graph neural networks learn arterial velocity estimation from small datasets

    Authors: Julian Suk, Christoph Brune, Jelmer M. Wolterink

    Abstract: Hemodynamic velocity fields in coronary arteries could be the basis of valuable biomarkers for diagnosis, prognosis and treatment planning in cardiovascular disease. Velocity fields are typically obtained from patient-specific 3D artery models via computational fluid dynamics (CFD). However, CFD simulation requires meticulous setup by experts and is time-intensive, which hinders large-scale accept… ▽ More

    Submitted 4 August, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: First published in "12th International Conference on Functional Imaging and Modeling of the Heart" (FIMH), pp 445-454, 2023 by Springer Nature

  10. arXiv:2302.06595  [pdf, other

    cs.LG stat.ML

    When Can We Track Significant Preference Shifts in Dueling Bandits?

    Authors: Joe Suk, Arpit Agarwal

    Abstract: The $K$-armed dueling bandits problem, where the feedback is in the form of noisy pairwise preferences, has been widely studied due its applications in information retrieval, recommendation systems, etc. Motivated by concerns that user preferences/tastes can evolve over time, we consider the problem of dueling bandits with distribution shifts. Specifically, we study the recent notion of significan… ▽ More

    Submitted 24 January, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

  11. arXiv:2212.05023  [pdf, other

    cs.LG cs.CV math.GR physics.flu-dyn

    Mesh Neural Networks for SE(3)-Equivariant Hemodynamics Estimation on the Artery Wall

    Authors: Julian Suk, Pim de Haan, Phillip Lippe, Christoph Brune, Jelmer M. Wolterink

    Abstract: Computational fluid dynamics (CFD) is a valuable asset for patient-specific cardiovascular-disease diagnosis and prognosis, but its high computational demands hamper its adoption in practice. Machine-learning methods that estimate blood flow in individual patients could accelerate or replace CFD simulation to overcome these limitations. In this work, we consider the estimation of vector-valued qua… ▽ More

    Submitted 14 June, 2024; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: Published in "Computers in Biology and Medicine"

  12. Implicit Neural Representations for Generative Modeling of Living Cell Shapes

    Authors: David Wiesner, Julian Suk, Sven Dummer, David Svoboda, Jelmer M. Wolterink

    Abstract: Methods allowing the synthesis of realistic cell shapes could help generate training data sets to improve cell tracking and segmentation in biomedical images. Deep generative models for cell shape synthesis require a light-weight and flexible representation of the cell shape. However, commonly used voxel-based representations are unsuitable for high-resolution shape synthesis, and polygon meshes h… ▽ More

    Submitted 6 October, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: MICCAI 2022

    Journal ref: Medical Image Computing and Computer Assisted Intervention - MICCAI 2022

  13. arXiv:2112.13838  [pdf, other

    cs.LG stat.ML

    Tracking Most Significant Arm Switches in Bandits

    Authors: Joe Suk, Samory Kpotufe

    Abstract: In bandit with distribution shifts, one aims to automatically adapt to unknown changes in reward distribution, and restart exploration when necessary. While this problem has been studied for many years, a recent breakthrough of Auer et al. (2018, 2019) provides the first adaptive procedure to guarantee an optimal (dynamic) regret $\sqrt{LT}$, for $T$ rounds, and an unknown number $L$ of changes. H… ▽ More

    Submitted 16 June, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

  14. arXiv:2109.04797  [pdf, other

    cs.LG cs.CV physics.flu-dyn

    Mesh convolutional neural networks for wall shear stress estimation in 3D artery models

    Authors: Julian Suk, Pim de Haan, Phillip Lippe, Christoph Brune, Jelmer M. Wolterink

    Abstract: Computational fluid dynamics (CFD) is a valuable tool for personalised, non-invasive evaluation of hemodynamics in arteries, but its complexity and time-consuming nature prohibit large-scale use in practice. Recently, the use of deep learning for rapid estimation of CFD parameters like wall shear stress (WSS) on surface meshes has been investigated. However, existing approaches typically depend on… ▽ More

    Submitted 20 January, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: (MICCAI 2021) Workshop on Statistical Atlases and Computational Modelling of the Heart (STACOM). The final authenticated version is available on SpringerLink

  15. arXiv:2007.08584  [pdf, other

    stat.ML cs.LG

    Self-Tuning Bandits over Unknown Covariate-Shifts

    Authors: Joseph Suk, Samory Kpotufe

    Abstract: Bandits with covariates, a.k.a. contextual bandits, address situations where optimal actions (or arms) at a given time $t$, depend on a context $x_t$, e.g., a new patient's medical history, a consumer's past purchases. While it is understood that the distribution of contexts might change over time, e.g., due to seasonalities, or deployment to new environments, the bulk of studies concern the most… ▽ More

    Submitted 20 February, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

  16. arXiv:1901.10750  [pdf, ps, other

    eess.SY math.DS math.NA

    Practicable Simulation-Free Model Order Reduction by Nonlinear Moment Matching

    Authors: Maria Cruz Varona, Raphael Gebhart, Julian Suk, Boris Lohmann

    Abstract: In this paper, a practicable simulation-free model order reduction method by nonlinear moment matching is developed. Based on the steady-state interpretation of linear moment matching, we comprehensively explain the extension of this reduction concept to nonlinear systems presented in [1], provide some new insights and propose some simplifications to achieve a feasible and numerically efficient no… ▽ More

    Submitted 30 January, 2019; originally announced January 2019.

    Comments: 7 pages, 3 figures; submitted to ECC 2019

  17. arXiv:1710.10867  [pdf, ps, other

    math.CO math.RA

    Factorizations of $k$-Nonnegative Matrices

    Authors: Sunita Chepuri, Neeraja Kulkarni, Joe Suk, Ewin Tang

    Abstract: A matrix is $k$-nonnegative if all its minors of size $k$ or less are nonnegative. We give a parametrized set of generators and relations for the semigroup of $k$-nonnegative $n\times n$ invertible matrices in two special cases: when $k = n-1$ and when $k = n-2$, restricted to unitriangular matrices. For these two cases, we prove that the set of $k$-nonnegative matrices can be partitioned into cel… ▽ More

    Submitted 30 October, 2017; originally announced October 2017.

  18. Dihedral Sieving Phenomena

    Authors: Sujit Rao, Joe Suk

    Abstract: Cyclic sieving is a well-known phenomenon where certain interesting polynomials, especially $q$-analogues, have useful interpretations related to actions and representations of the cyclic group. We propose a definition of sieving for an arbitrary group $G$ and study it for the dihedral group $I_2(n)$ of order $2n$. This requires understanding the generators of the representation ring of the dihedr… ▽ More

    Submitted 8 March, 2019; v1 submitted 17 October, 2017; originally announced October 2017.

    Comments: 10 pages

  19. Utility Max-Min Fair Link Adaptation in IEEE 802.11ac Downlink Multi-User

    Authors: Ali A. Khavasi, Mojtaba Aajami, Hae-Ryeon Park, Jung-Bong Suk

    Abstract: In this letter, we propose a novel model and corresponding algorithms to address the optimal utility max-min fair link adaptation in Downlink Multi-User (DL-MU) feature of the emerging IEEE 802.11ac WLAN standard. Herein, we first propose a simple yet accurate model to formulate the max-min fair link adaptation problem. Furthermore, this model guarantees the minimum utility gain of each receiver a… ▽ More

    Submitted 24 March, 2014; originally announced March 2014.

    Comments: Has been accepted in IEEE Communications Letters

  20. arXiv:1010.4731  [pdf

    cond-mat.mtrl-sci

    Graphene films with large domain size by a two-step chemical vapor deposition process

    Authors: Xuesong Li, Carl W. Magnuson, Archana Venugopal, **ho An, Ji Won Suk, Boyang Han, Mark Borysiak, Weiwei Cai, Aruna Velamakanni, Yanwu Zhu, Lianfeng Fu, Eric M. Vogel, Edgar Voelkl, Luigi Colombo, Rodney S. Ruoff

    Abstract: The fundamental properties of graphene are making it an attractive material for a wide variety of applications. Various techniques have been developed to produce graphene and recently we discovered the synthesis of large area graphene by chemical vapor deposition (CVD) of methane on Cu foils. We also showed that graphene growth on Cu is a surface-mediated process and the films were polycrystalline… ▽ More

    Submitted 22 October, 2010; originally announced October 2010.

    Comments: 13 pages, 1 table, 7 figures

  21. arXiv:1010.3905  [pdf

    cond-mat.mtrl-sci

    Domain (Grain) Boundaries and Evidence of Twin Like Structures in CVD Grown Graphene

    Authors: **ho An, Edgar Voelkl, Jiwon Suk, Xuesong Li, Carl W. Magnuson, Lianfeng Fu, Peter Tiemeijer, Maarten Bischoff, Bert Freitag, Elmira Popova, Rodney S. Ruoff

    Abstract: Understanding and engineering the domain boundaries in chemically vapor deposited (CVD) monolayer graphene will be critical for improving its properties. In this study, a combination of transmission electron microscopy (TEM) techniques including selected area electron diffraction (SAED), high resolution transmission electron microscopy (HRTEM), and dark field (DF) TEM was used to study the boundar… ▽ More

    Submitted 19 October, 2010; originally announced October 2010.