-
Improved limit on neutrinoless double beta decay of $^{100}$Mo from AMoRE-I
Authors:
A. Agrawal,
V. V. Alenkov,
P. Aryal,
J. Beyer,
B. Bhandari,
R. S. Boiko,
K. Boonin,
O. Buzanov,
C. R. Byeon,
N. Chanthima,
M. K. Cheoun,
J. S. Choe,
Seonho Choi,
S. Choudhury,
J. S. Chung,
F. A. Danevich,
M. Djamal,
D. Drung,
C. Enss,
A. Fleischmann,
A. M. Gangapshev,
L. Gastaldo,
Y. M. Gavrilyuk,
A. M. Gezhaev,
O. Gileva
, et al. (83 additional authors not shown)
Abstract:
AMoRE searches for the signature of neutrinoless double beta decay of $^{100}$Mo with a 100 kg sample of enriched $^{100}$Mo. Scintillating molybdate crystals coupled with a metallic magnetic calorimeter operate at milli-Kelvin temperatures to measure the energy of electrons emitted in the decay. As a demonstration of the full-scale AMoRE, we conducted AMoRE-I, a pre-experiment with 18 molybdate crystals, at the Yangyang Underground Laboratory for over two years. The exposure was 8.02 kg$\cdot$year (or 3.89 kg$_{\mathrm{^{100}Mo}}\cdot$year) and the total background rate near the Q-value was 0.025 $\pm$ 0.002 counts/keV/kg/year. We observed no indication of $0νββ$ decay and report a new lower limit of the half-life of $^{100}$Mo $0νββ$ decay as $ T^{0ν}_{1/2}>3.0\times10^{24}~\mathrm{years}$ at 90\% confidence level. The effective Majorana mass limit range is $m_{ββ}<$(210--610) meV using nuclear matrix elements estimated in the framework of different models, including the recent shell model calculations.
Submitted 8 July, 2024;
originally announced July 2024.
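Note on the AMoRE-I entry above: the half-life limit translates into a range of mass limits rather than a single value because the conversion depends on the nuclear matrix element. A minimal sketch of the standard light-neutrino-exchange relation follows; the symbols $G^{0ν}$ (phase-space factor), $g_A$ (axial coupling), $M^{0ν}$ (nuclear matrix element) and $m_e$ (electron mass) are not in the abstract and are introduced here as assumptions, and some conventions absorb $g_A^4$ into $G^{0ν}$.

```latex
\left[ T^{0\nu}_{1/2} \right]^{-1}
  = G^{0\nu} \, g_A^{4} \, \left| M^{0\nu} \right|^{2}
    \left( \frac{m_{\beta\beta}}{m_e} \right)^{2}
\quad\Longrightarrow\quad
m_{\beta\beta} < \frac{m_e}{g_A^{2} \left| M^{0\nu} \right|
    \sqrt{G^{0\nu} \, T^{0\nu}_{1/2,\,\mathrm{limit}}}}
```

For a fixed half-life limit, a larger $|M^{0ν}|$ gives a tighter mass bound, so the spread of matrix-element calculations sets the width of the quoted range.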
-
Balancing Operator's Risk Averseness in Model Predictive Control of a Reservoir System
Authors:
Ja-Ho Koo,
Edo Abraham,
Andreja Jonoski,
Dimitri P. Solomatine
Abstract:
Model Predictive Control (MPC) is an optimal control strategy suited for flood control of water resources infrastructure. Despite many studies on reservoir flood control and their theoretical contribution, optimisation methodologies have not been widely applied in real-time operation due to disparities between research assumptions and practical requirements. First, tacit objectives such as minimising the magnitude and frequency of changes in the existing outflow schedule are considered important in practice, but these are nonlinear and challenging to formulate to suit all conditions. Incorporating these objectives transforms the problem into a multi-objective nonlinear optimisation problem that is difficult to solve online. Second, it is reasonable to assume that the weights and parameters are not stationary because the preference varies depending on the state of the system. To overcome these limitations, we propose a framework that converts the original intractable problem into parameterized linear MPC problems with dynamic optimisation of weights and parameters. This is done by introducing a model-based learning concept under the assumption of the dynamic nature of the operator's preference. We refer to this framework as Parameterised Dynamic MPC (PD-MPC). The effectiveness of this framework is demonstrated through a numerical experiment for the Daecheong multipurpose reservoir in South Korea. We find that PD-MPC outperforms `standard' MPC-based designs without a dynamic optimisation process under the same uncertain inflows.
Submitted 5 July, 2024;
originally announced July 2024.
-
Effective Heterogeneous Federated Learning via Efficient Hypernetwork-based Weight Generation
Authors:
Yu** Shin,
Kichang Lee,
Sungmin Lee,
You Rim Choi,
Hyung-Sin Kim,
JeongGil Ko
Abstract:
While federated learning leverages distributed client resources, it faces challenges due to heterogeneous client capabilities. This necessitates allocating models suited to clients' resources and careful parameter aggregation to accommodate this heterogeneity. We propose HypeMeFed, a novel federated learning framework for supporting client heterogeneity by combining a multi-exit network architecture with hypernetwork-based model weight generation. This approach aligns the feature spaces of heterogeneous model layers and resolves per-layer information disparity during weight aggregation. To practically realize HypeMeFed, we also propose a low-rank factorization approach to minimize computation and memory overhead associated with hypernetworks. Our evaluations on a real-world heterogeneous device testbed indicate that HypeMeFed enhances accuracy by 5.12% over FedAvg, reduces the hypernetwork memory requirements by 98.22%, and accelerates its operations by 1.86 times compared to a naive hypernetwork approach. These results demonstrate HypeMeFed's effectiveness in leveraging and engaging heterogeneous clients for federated learning.
Submitted 3 July, 2024;
originally announced July 2024.
-
Thermodynamic relation on higher-dimensional black hole with arbitrary cosmological constant
Authors:
Junbeom Ko,
Bogeun Gwak
Abstract:
We investigated the Goon--Penco (GP) relation on higher-dimensional Reissner-Nordström black holes with an arbitrary cosmological constant. It was found that the GP relation retained its form in four- and higher-dimensional spacetimes. Thus, the reactions in the black holes are universal with respect to the dimensionality. Furthermore, the GP relation was found to be universal on any state of the black hole including near-extremal and near-Nariai cases. Thus, this study showed that the GP relation was prevalent for higher-dimensional Reissner-Nordström black holes with an arbitrary cosmological constant regardless of the initial state.
Submitted 15 June, 2024;
originally announced June 2024.
-
Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses
Authors:
Seungwoo Yoo,
Juil Koo,
Kyeongmin Yeo,
Minhyuk Sung
Abstract:
We propose a novel method for learning representations of poses for 3D deformable objects, which specializes in 1) disentangling pose information from the object's identity, 2) facilitating the learning of pose variations, and 3) transferring pose information to other object identities. Based on these properties, our method enables the generation of 3D deformable objects with diversity in both identities and poses, using variations of a single object. It does not require explicit shape parameterization such as skeletons or joints, point-level or shape-level correspondence supervision, or variations of the target object for pose transfer. To achieve pose disentanglement, compactness for generative models, and transferability, we first design the pose extractor to represent the pose as a keypoint-based hybrid representation and the pose applier to learn an implicit deformation field. To better distill pose information from the object's geometry, we propose the implicit pose applier to output an intrinsic mesh property, the face Jacobian. Once the extracted pose information is transferred to the target object, the pose applier is fine-tuned in a self-supervised manner to better describe the target object's shapes with pose variations. The extracted poses are also used to train a cascaded diffusion model to enable the generation of novel poses. Our experiments with the DeformThings4D and Human datasets demonstrate state-of-the-art performance in pose transfer and the ability to generate diverse deformed shapes with various objects and poses.
Submitted 14 June, 2024;
originally announced June 2024.
-
Projected background and sensitivity of AMoRE-II
Authors:
A. Agrawal,
V. V. Alenkov,
P. Aryal,
J. Beyer,
B. Bhandari,
R. S. Boiko,
K. Boonin,
O. Buzanov,
C. R. Byeon,
N. Chanthima,
M. K. Cheoun,
J. S. Choe,
Seonho Choi,
S. Choudhury,
J. S. Chung,
F. A. Danevich,
M. Djamal,
D. Drung,
C. Enss,
A. Fleischmann,
A. M. Gangapshev,
L. Gastaldo,
Y. M. Gavrilyuk,
A. M. Gezhaev,
O. Gileva
, et al. (81 additional authors not shown)
Abstract:
AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located approximately 1000 meters deep in Jeongseon, Korea. The goal of AMoRE-II is to reach up to $T^{0νββ}_{1/2}$ $\sim$ 6 $\times$ 10$^{26}$ years, corresponding to an effective Majorana mass of 15--29 meV, covering all the inverted mass hierarchy regions. To achieve this, the background level of the experimental configurations and possible background sources of gamma and beta events should be well understood. We have intensively performed Monte Carlo simulations using the GEANT4 toolkit in all the experimental configurations with potential sources. We report the estimated background level that meets the 10$^{-4}$ counts/(keV$\cdot$kg$\cdot$yr) requirement for AMoRE-II in the region of interest (ROI) and show the projected half-life sensitivity based on the simulation study.
Submitted 13 June, 2024;
originally announced June 2024.
-
Exclusion of the Cosmological Triangle in Reactor-Based Search for Axion-Like Particles
Authors:
Byung Ju Park,
Jae ** Choi,
Eunju Jeon,
**yu Kim,
Kyungwon Kim,
Sung Hyun Kim,
Sun Kee Kim,
Yeongduk Kim,
Young Ju Ko,
Byoung-Cheol Koh,
Chang Hyon Ha,
Seo Hyun Lee,
In Soo Lee,
Hyunseok Lee,
Hyun Su Lee,
Jaison Lee,
Yoomin Oh,
Doo** Kim
Abstract:
We report new constraints on axion-like particles (ALPs) using data corresponding to a sodium iodide target exposure of 3063 kg$\cdot$days from the neutrino elastic scattering observation with NaI (NEON) experiment. A 16.7 kg thallium-doped sodium iodide target was located 23.7 meters from a 2.8 GW thermal power nuclear reactor. We searched for ALPs produced by high-flux photons by comparing the energy spectra of data collected during reactor-on (1596 kg$\cdot$days exposure) and reactor-off (1467 kg$\cdot$days exposure) periods. No signal consistent with ALP interaction was identified, allowing us to set exclusion limits at the 95\% confidence level. Our limits cover previously unexplored regions for both photon couplings (${g_{aγ}}$) and electron couplings (${g_{ae}}$) for axion masses around 1 MeV/c$^2$. Notably, the NEON data exclude, for the first time, the region within the "cosmological triangle" left unconstrained by laboratory-based searches for photon couplings. The observed 95\% confidence level limits reach as low as ${g_{aγ}}$ of 4.33$\times$ 10$^{-8}$ GeV$^{-1}$ and ${g_{ae}}$ of 1.10$\times$ 10$^{-9}$ for axion masses of 1.7 MeV/c$^2$ and 1.0 MeV/c$^2$, respectively.
Submitted 11 June, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
-
Demystifying SGD with Doubly Stochastic Gradients
Authors:
Kyurae Kim,
Joohwan Ko,
Yi-An Ma,
Jacob R. Gardner
Abstract:
Optimization objectives in the form of a sum of intractable expectations are rising in importance (e.g., diffusion models, variational autoencoders, and many more), a setting also known as "finite sum with infinite data." For these problems, a popular strategy is to employ SGD with doubly stochastic gradients (doubly SGD): the expectations are estimated using the gradient estimator of each component, while the sum is estimated by subsampling over these estimators. Despite its popularity, little is known about the convergence properties of doubly SGD, except under strong assumptions such as bounded variance. In this work, we establish the convergence of doubly SGD with independent minibatching and random reshuffling under general conditions, which encompass dependent component gradient estimators. In particular, for dependent estimators, our analysis allows fine-grained analysis of the effect of correlations. As a result, under a per-iteration computational budget of $b \times m$, where $b$ is the minibatch size and $m$ is the number of Monte Carlo samples, our analysis suggests where one should invest most of the budget in general. Furthermore, we prove that random reshuffling (RR) improves the complexity dependence on the subsampling noise.
Submitted 2 June, 2024;
originally announced June 2024.
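Note on the doubly SGD entry above: the two layers of randomness described in the abstract can be illustrated with a toy problem. The sketch below is not the authors' code; the objective $F(x) = \frac{1}{n}\sum_i \mathbb{E}_z[\tfrac{1}{2}(x - a_i - z)^2]$ with $z \sim \mathcal{N}(0,1)$, the function name `doubly_sgd`, and the step size `lr0 / (t + 1)` are all assumptions chosen for illustration. It covers only independent minibatching, not random reshuffling.

```python
import random


def doubly_sgd(targets, b, m, steps, lr0, seed=0):
    """Toy doubly SGD on F(x) = (1/n) sum_i E_z[0.5 * (x - a_i - z)^2].

    targets: the values a_i (hypothetical data for this sketch)
    b: minibatch size (subsampling over components)
    m: Monte Carlo samples per selected component
    """
    rng = random.Random(seed)
    n = len(targets)
    x = 0.0
    for t in range(steps):
        # Outer randomness: pick b of the n components without replacement.
        batch = rng.sample(range(n), b)
        grad = 0.0
        for i in batch:
            # Inner randomness: Monte Carlo estimate of the gradient of
            # E_z[0.5 * (x - a_i - z)^2], which is x - a_i in expectation.
            grad += sum(x - targets[i] - rng.gauss(0.0, 1.0) for _ in range(m)) / m
        grad /= b
        # Decreasing step size; each iteration costs b * m gradient samples.
        x -= lr0 / (t + 1) * grad
    return x


if __name__ == "__main__":
    print(doubly_sgd([1.0, 2.0, 3.0, 4.0], b=2, m=4, steps=5000, lr0=1.0))
```

The minimizer of this toy objective is the mean of the `targets`, so the iterate should settle near 2.5 for the usage shown. Varying `b` and `m` at a fixed product `b * m` is one way to experiment with the budget trade-off the abstract refers to.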
-
F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting
Authors:
Xiangyu Sun,
Joo Chan Lee,
Daniel Rho,
Jong Hwan Ko,
Usman Ali,
Eunbyung Park
Abstract:
The neural radiance field (NeRF) has made significant strides in representing 3D scenes and synthesizing novel views. Despite its advancements, the high computational costs of NeRF have posed challenges for its deployment in resource-constrained environments and real-time applications. As an alternative to NeRF-like neural rendering methods, 3D Gaussian Splatting (3DGS) offers rapid rendering speeds while maintaining excellent image quality. However, as it represents objects and scenes using a myriad of Gaussians, it requires substantial storage to achieve high-quality representation. To mitigate the storage overhead, we propose Factorized 3D Gaussian Splatting (F-3DGS), a novel approach that drastically reduces storage requirements while preserving image quality. Inspired by classical matrix and tensor factorization techniques, our method represents and approximates dense clusters of Gaussians with significantly fewer Gaussians through efficient factorization. We aim to efficiently represent dense 3D Gaussians by approximating them with a limited amount of information for each axis and their combinations. This method allows us to encode a substantially large number of Gaussians along with their essential attributes -- such as color, scale, and rotation -- necessary for rendering using a relatively small number of elements. Extensive experimental results demonstrate that F-3DGS achieves a significant reduction in storage costs while maintaining comparable quality in rendered images.
Submitted 28 May, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
Charge transfer and Spin-Valley locking in 4Hb-TaS$_{2}$
Authors:
Avior Almoalem,
Roni Gofman,
Yuval Nitzav,
Ilay Mangel,
Irena Feldman,
Jahyun Koo,
Federico Mazzola,
Jun Fujii,
Ivana Vobornik,
J. Sanchez-Barriga,
Oliver J. Clark,
Nicholas Clark Plumb,
Ming Shi,
Binghai Yan,
Amit Kanigel
Abstract:
4Hb-TaS$_2$ is a superconductor that exhibits unique characteristics such as time-reversal symmetry breaking, hidden magnetic memory, and topological edge modes. It is a naturally occurring heterostructure comprising alternating layers of 1H-TaS$_2$ and 1T-TaS$_2$. The former is a well-known superconductor, while the latter is a correlated insulator with a possible non-trivial magnetic ground state. In this study, we use angle-resolved photoemission spectroscopy to investigate the normal state electronic structure of this unconventional superconductor. Our findings reveal that the band structure of 4Hb-TaS$_2$ fundamentally differs from that of its constituent materials. Specifically, we observe a significant charge transfer from the 1T layers to the 1H layers that drives the 1T layers away from half-filling. In addition, we find a substantial reduction in inter-layer coupling in 4Hb-TaS$_2$ compared to the coupling in 2H-TaS$_2$ that results in a pronounced spin-valley locking within 4Hb-TaS$_2$.
Submitted 26 May, 2024;
originally announced May 2024.
-
Unveiling Key Aspects of Fine-Tuning in Sentence Embeddings: A Representation Rank Analysis
Authors:
Euna Jung,
Jaeill Kim,
Jungmin Ko,
**woo Park,
Wonjong Rhee
Abstract:
The latest advancements in unsupervised learning of sentence embeddings predominantly involve employing contrastive learning-based (CL-based) fine-tuning over pre-trained language models. In this study, we analyze the latest sentence embedding methods by adopting representation rank as the primary tool of analysis. We first define Phase 1 and Phase 2 of fine-tuning based on when representation rank peaks. Utilizing these phases, we conduct a thorough analysis and obtain essential findings across key aspects, including alignment and uniformity, linguistic abilities, and correlation between performance and rank. For instance, we find that the dynamics of the key aspects can undergo significant changes as fine-tuning transitions from Phase 1 to Phase 2. Based on these findings, we experiment with a rank reduction (RR) strategy that facilitates rapid and stable fine-tuning of the latest CL-based methods. Through empirical investigations, we showcase the efficacy of RR in enhancing the performance and stability of five state-of-the-art sentence embedding methods.
Submitted 18 May, 2024;
originally announced May 2024.
-
Formation channels of the diffuse lights in the groups and clusters over time
Authors:
Kyungwon Chun,
Jihye Shin,
Jongwan Ko,
Rory Smith,
Jaewon Yoo
Abstract:
We explore the formation of the intragroup light (IGL) and intracluster light (ICL), representing diffuse lights within groups and clusters, since $z=1.5$. For this, we perform multi-resolution cosmological N-body simulations using the ``galaxy replacement technique'' (GRT) and identify the progenitors in which the diffuse light stars existed when they fell into the groups or clusters. Our findings reveal that typical progenitors contributing to diffuse lights enter the host halo with the massive galaxies containing a stellar mass of $10 < \log M_{\rm{gal}}~[M_{\odot}]< 11$, regardless of the mass or dynamical state of the host halos at $z=0$. In cases where the host halos are dynamically unrelaxed or more massive, diffuse lights from massive progenitors with $\log M_{\rm{gal}}~[M_{\odot}]> 11$ are more prominent, with over half of them already pre-processed before entering the host halo. Additionally, we find that the main formation mechanism of diffuse lights is the stripping process of satellites, and a substantial fraction ($40-45\%$) of diffuse light stars is linked to the merger tree of the BCG. Remarkably, all trends persist for groups and clusters at higher redshifts. The fraction of diffuse lights in the host halos with a similar mass decreases as the redshift increases, but they are already substantial at $z=1.5$ ($\sim10\%$). However, it is crucial to acknowledge that detection limits related to the observable radius and faint-end surface brightness may obscure numerous diffuse light stars and even alter the main formation channel of diffuse lights.
Submitted 13 May, 2024;
originally announced May 2024.
-
New observational recipes for measuring dynamical state of galaxy clusters
Authors:
Hyowon Kim,
Rory Smith,
Jongwan Ko,
Jong-Ho Shinn,
Kyungwon Chun,
Jihye Shin,
Jaewon Yoo
Abstract:
During cluster assembly, a cluster's virialization process leaves behind signatures that can provide information on its dynamical state. However, no clear consensus yet exists on the best way to achieve this. Therefore, we attempt to derive improved recipes for classifying the dynamical state of clusters in observations using cosmological simulations. Cluster halos and their subhalos, with masses down to $10^{14}M_{\odot} h^{-1}$ and $10^{10}M_{\odot} h^{-1}$, respectively, are used to calculate five independent dynamical state indicators. We experiment with recipes by combining two to four indicators for detecting specific merger stages like recent and ancient mergers. These recipes are made by plotting merging clusters and a control sample of relaxed clusters in a multi-indicator parameter space, and then applying a rotation matrix method to derive the best way to separate mergers from the control sample. The success of the recipe is quantified using the success rate and the overlap percentage of the merger and control histograms along the newly rotated $x$-axis. This provides us with recipes using different numbers of combined indicators and for different merger stages. Among the recipes, the stellar mass gap and center offset are the first and second most dominant of the indicators, and using more indicators improves the effectiveness of the recipe. When applied to observations, our results show good agreement with literature values of cluster dynamical state.
Submitted 10 May, 2024;
originally announced May 2024.
-
A Lock-free Binary Trie
Authors:
Jeremy Ko
Abstract:
A binary trie is a sequential data structure for a dynamic set on the universe $\{0,\dots,u-1\}$ supporting Search with $O(1)$ worst-case step complexity, and Insert, Delete, and Predecessor operations with $O(\log u)$ worst-case step complexity.
We give a wait-free implementation of a relaxed binary trie, using read, write, CAS, and ($\log u$)-bit AND operations. It supports all operations with the same worst-case step complexity as the sequential binary trie. However, Predecessor operations may not return a key when there are concurrent update operations. We use this as a component of a lock-free, linearizable implementation of a binary trie. It supports Search with $O(1)$ worst-case step complexity and Insert, Delete and Predecessor with $O(c^2 + \log u)$ amortized step complexity, where $c$ is a measure of the contention.
A lock-free binary trie is challenging to implement compared to many other lock-free data structures because, to ensure the correctness of Predecessor operations, Insert and Delete operations perform a non-constant number of modifications to the binary trie in the worst case.
Submitted 9 May, 2024;
originally announced May 2024.
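Note on the binary trie entry above: the sequential structure the abstract starts from can be sketched as a perfect binary tree of bits over the universe $\{0,\dots,u-1\}$. This is a minimal single-threaded illustration only, not the wait-free or lock-free implementation from the paper; the class name `BinaryTrie`, the array layout (root at index 1, leaf for key `x` at index `u + x`), and the assumption that `u` is a power of two are choices made for this sketch.

```python
class BinaryTrie:
    """Sequential binary trie over {0, ..., u - 1}, with u = 2 ** log_u.

    bits[node] is True iff some key is stored in the subtree of node.
    """

    def __init__(self, log_u):
        self.u = 1 << log_u
        self.bits = [False] * (2 * self.u)

    def search(self, x):
        # O(1): just read the leaf bit.
        return self.bits[self.u + x]

    def insert(self, x):
        # O(log u): set bits on the leaf-to-root path; stop early once an
        # ancestor is already set, since all of its ancestors are set too.
        node = self.u + x
        while node >= 1 and not self.bits[node]:
            self.bits[node] = True
            node //= 2

    def delete(self, x):
        # O(log u): clear the leaf, then clear each ancestor whose other
        # child subtree is also empty.
        node = self.u + x
        if not self.bits[node]:
            return
        self.bits[node] = False
        while node > 1 and not self.bits[node ^ 1]:
            node //= 2
            self.bits[node] = False

    def predecessor(self, y):
        # O(log u): largest key strictly less than y, or None.
        node = self.u + y
        # Climb until we are a right child whose left sibling is non-empty.
        while node > 1:
            if node % 2 == 1 and self.bits[node - 1]:
                node -= 1
                break
            node //= 2
        else:
            return None
        # Descend to the rightmost key in that sibling subtree.
        while node < self.u:
            node = 2 * node + 1 if self.bits[2 * node + 1] else 2 * node
        return node - self.u


if __name__ == "__main__":
    trie = BinaryTrie(4)  # universe {0, ..., 15}
    for key in (3, 7, 12):
        trie.insert(key)
    print(trie.search(7), trie.predecessor(8), trie.predecessor(3))
```

Insert and Delete each touch up to $\log u + 1$ nodes here, which is the non-constant number of modifications the abstract identifies as the difficulty for a lock-free version.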
-
Semantic Line Combination Detector
Authors:
**won Ko,
Dongkwon **,
Chang-Su Kim
Abstract:
A novel algorithm, called semantic line combination detector (SLCD), to find an optimal combination of semantic lines is proposed in this paper. It processes all lines in each line combination at once to assess the overall harmony of the lines. First, we generate various line combinations from reliable lines. Second, we estimate the score of each line combination and determine the best one. Experimental results demonstrate that the proposed SLCD outperforms existing semantic line detectors on various datasets. Moreover, it is shown that SLCD can be applied effectively to three vision tasks: vanishing point detection, symmetry axis detection, and composition-based image retrieval. Our code is available at https://github.com/**won-Ko/SLCD.
Submitted 1 May, 2024; v1 submitted 28 April, 2024;
originally announced April 2024.
-
Dynamic Global Feedback Stabilization: why do the twist?
Authors:
Mohamed-Ali Belabbas,
Jehyung Ko
Abstract:
We investigate global dynamic feedback stabilization from a topological viewpoint. In particular, we consider the general case of dynamic feedback systems, whereby the total space (which includes the state space of the system and of the controller) is a fibre bundle, and derive conditions on the topology of the bundle that are necessary for various notions of global stabilization to hold. This point of view highlights the importance of distinguishing trivial bundles and twisted bundles in the study of global dynamic feedback stabilization, as we show that dynamic feedback defined on a twisted bundle can stabilize systems that dynamic feedback on trivial bundles cannot.
Submitted 28 April, 2024;
originally announced April 2024.
-
GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting
Authors:
Kyusun Cho,
Joungbin Lee,
Heeji Yoon,
Yeobin Hong,
Jaehoon Ko,
Sangjun Ahn,
Seungryong Kim
Abstract:
We propose GaussianTalker, a novel framework for real-time generation of pose-controllable talking heads. It leverages the fast rendering capabilities of 3D Gaussian Splatting (3DGS) while addressing the challenges of directly controlling 3DGS with speech audio. GaussianTalker constructs a canonical 3DGS representation of the head and deforms it in sync with the audio. A key insight is to encode the 3D Gaussian attributes into a shared implicit feature representation, where they are merged with audio features to manipulate each Gaussian attribute. This design exploits the spatial-aware features and enforces interactions between neighboring points. The feature embeddings are then fed to a spatial-audio attention module, which predicts frame-wise offsets for the attributes of each Gaussian. It is more stable than previous concatenation or multiplication approaches for manipulating the numerous Gaussians and their intricate parameters. Experimental results showcase GaussianTalker's superiority in facial fidelity, lip synchronization accuracy, and rendering speed compared to previous methods. Specifically, GaussianTalker achieves a remarkable rendering speed of up to 120 FPS, surpassing previous benchmarks. Our code is made available at https://github.com/KU-CVLAB/GaussianTalker/ .
Submitted 25 April, 2024; v1 submitted 24 April, 2024;
originally announced April 2024.
-
ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images
Authors:
**seo Jeong,
Junseo Koo,
Qimeng Zhang,
Gunhee Kim
Abstract:
Existing NeRF-based inverse rendering methods suppose that scenes are exclusively illuminated by distant light sources, neglecting the potential influence of emissive sources within a scene. In this work, we confront this limitation using LDR multi-view images captured with emissive sources turned on and off. Two key issues must be addressed: 1) ambiguity arising from the limited dynamic range along with unknown lighting details, and 2) the expensive computational cost in volume rendering to backtrace the paths leading to final object colors. We present a novel approach, ESR-NeRF, leveraging neural networks as learnable functions to represent ray-traced fields. By training networks to satisfy light transport segments, we regulate outgoing radiances, progressively identifying emissive sources while being aware of reflection areas. The results on scenes encompassing emissive sources with various properties demonstrate the superiority of ESR-NeRF in qualitative and quantitative ways. Our approach also extends its applicability to the scenes devoid of emissive sources, achieving lower CD metrics on the DTU dataset.
Submitted 6 June, 2024; v1 submitted 24 April, 2024;
originally announced April 2024.
-
An Optimal MPC Algorithm for Subunit-Monge Matrix Multiplication, with Applications to LIS
Authors:
Jaehyun Koo
Abstract:
We present an $O(1)$-round fully-scalable deterministic massively parallel algorithm for computing the min-plus matrix multiplication of unit-Monge matrices. We use this to derive a $O(\log n)$-round fully-scalable massively parallel algorithm for solving the exact longest increasing subsequence (LIS) problem. For a fully-scalable MPC regime, this result substantially improves the previously known algorithm of $O(\log^4 n)$-round complexity, and matches the best algorithm for computing the $(1+ε)$-approximation of LIS.
Submitted 20 April, 2024;
originally announced April 2024.
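For readers unfamiliar with the LIS problem named in the entry above, a minimal sequential sketch may help. It uses the classical patience-sorting technique with binary search, which runs in O(n log n) time on a single machine; it is not the massively parallel algorithm of the paper. The function name `lis_length` is a hypothetical choice for illustration.

```python
from bisect import bisect_left


def lis_length(seq):
    """Return the length of the longest strictly increasing subsequence.

    Classical sequential technique (patience sorting), shown only to
    illustrate the problem; it is not the MPC algorithm from the paper.
    """
    # tails[k] holds the smallest possible last element of an
    # increasing subsequence of length k + 1 seen so far.
    tails = []
    for x in seq:
        pos = bisect_left(tails, x)
        if pos == len(tails):
            tails.append(x)   # x extends the longest subsequence found so far
        else:
            tails[pos] = x    # x gives a smaller tail for length pos + 1
    return len(tails)
```

For example, `lis_length([3, 1, 4, 1, 5, 9, 2, 6])` finds a subsequence such as 1, 4, 5, 9.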
-
Anarchy in the APSP: Algorithm and Hardness for Incorrect Implementation of Floyd-Warshall
Authors:
Jaehyun Koo
Abstract:
The celebrated Floyd-Warshall algorithm efficiently computes the all-pairs shortest path, and its simplicity made it a staple in computer science classes. Frequently, students discover a variant of this Floyd-Warshall algorithm by mixing up the loop order, ending up with the incorrect APSP matrix. This paper considers a computational problem of computing this incorrect APSP matrix. We will propose efficient algorithms for this problem and prove that this incorrect variant is APSP-complete.
Submitted 11 April, 2024;
originally announced April 2024.
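The loop-order mix-up described in the entry above can be illustrated with a small sketch. It assumes the incorrect variant places the intermediate vertex `k` in the innermost loop (the `i, j, k` order); the abstract does not say which orderings the paper studies, so this is one plausible instance. The function names and the example graph `W` are hypothetical.

```python
import math

INF = math.inf


def floyd_warshall(w):
    """Correct order: the intermediate vertex k is the outermost loop."""
    n = len(w)
    d = [row[:] for row in w]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    return d


def floyd_warshall_ijk(w):
    """Mixed-up order (assumed variant): k is the innermost loop."""
    n = len(w)
    d = [row[:] for row in w]
    for i in range(n):
        for j in range(n):
            for k in range(n):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    return d


# Directed path 0 -> 2 -> 3 -> 1, every edge of weight 1.
W = [
    [0,   INF, 1,   INF],
    [INF, 0,   INF, INF],
    [INF, INF, 0,   1],
    [INF, 1,   INF, 0],
]
```

On this graph the correct order finds the distance 3 from vertex 0 to vertex 1, while the `i, j, k` variant finalizes the pair (0, 1) before the distance from 0 to 3 is known and leaves it at infinity.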
-
Upgrade of NaI(Tl) crystal encapsulation for the NEON experiment
Authors:
J. J. Choi,
E. J. Jeon,
J. Y. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
Y. D. Kim,
Y. J. Ko,
B. C. Koh,
C. Ha,
B. J. Park,
S. H. Lee,
I. S. Lee,
H. Lee,
H. S. Lee,
J. Lee,
Y. M. Oh
Abstract:
The Neutrino Elastic-scattering Observation with NaI(Tl) experiment (NEON) aims to detect coherent elastic neutrino-nucleus scattering (CE$ν$NS) in a NaI(Tl) crystal using reactor electron antineutrinos at the Hanbit nuclear power plant complex. A total of 13.3 kg of NaI(Tl) crystals were initially installed in December 2020 at the tendon gallery, 23.7$\pm$0.3\,m away from the reactor core, which operates at a thermal power of 2.8\,GW. Initial engineering operation was performed from May 2021 to March 2022 and revealed unexpected photomultiplier-induced noise and a decreased light yield, both caused by leakage of liquid scintillator into the detector due to weak detector encapsulation. We upgraded the detector encapsulation design to prevent the leakage of the liquid scintillator. Meanwhile, two small-sized detectors were replaced with larger ones, resulting in a total mass of 16.7\,kg. With this new design implementation, the detector system has been operating stably since April 2022 for over a year without detector gain drop. In this paper, we present an improved crystal encapsulation design and stability of the NEON experiment.
Submitted 28 June, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Shell-type Tidal Features Are More Frequently Detected in Slowly Rotating Early-type Galaxies than Stream- and Tail-type Features
Authors:
Yongmin Yoon,
Jongwan Ko,
Haeun Chung,
Woowon Byun,
Kyungwon Chun
Abstract:
To enhance our understanding of the impact of galaxy mergers on the kinematics of early-type galaxies (ETGs), we examine differences in specific stellar angular momentum within the half-light radius ($λ_{R_e}$) among ETGs with different types of tidal features and those without such features. This is accomplished by categorizing tidal features, which serve as direct evidence of recent mergers, into shells, streams, and tails, through deep images from the DESI Legacy Survey, and by using MaNGA data for the analysis of the kinematics of 1244 ETGs at $z<0.055$. We find that ETGs with tidal features typically have reduced $λ_{R_e}$ values that are lower by 0.12 dex than ETGs without tidal features. ETGs with shells contribute most to the reduction in $λ_{R_e}$. Consequently, nearly half of ETGs with shells are classified as slow rotators, a fraction that is more than twice as high as that of ETGs with tails or streams, and over three times higher than that of ETGs without tidal features. These trends generally remain valid even when ETGs are divided into several mass bins. Our findings support the idea that radial mergers, which are more effective at reducing $λ_{R_e}$ than circular mergers, are more closely associated with the formation of shells rather than streams or tails. The detection of shells in slightly more massive ETGs compared to streams and tails may be attributed to the fact that massive satellite galaxies are more likely to be accreted through radial orbits, due to the nature of dynamical friction.
Submitted 4 April, 2024;
originally announced April 2024.
-
HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud
Authors:
Wencan Cheng,
Hao Tang,
Luc Van Gool,
Jong Hwan Ko
Abstract:
Extracting keypoint locations from input hand frames, known as 3D hand pose estimation, is a critical task in various human-computer interaction applications. Essentially, the 3D hand pose estimation can be regarded as a 3D point subset generative problem conditioned on input frames. Thanks to the recent significant progress on diffusion-based generative models, hand pose estimation can also benefit from the diffusion model to estimate keypoint locations with high quality. However, directly deploying the existing diffusion models to solve hand pose estimation is non-trivial, since they cannot achieve the complex permutation mapping and precise localization. Based on this motivation, this paper proposes HandDiff, a diffusion-based hand pose estimation model that iteratively denoises accurate hand pose conditioned on hand-shaped image-point clouds. In order to recover keypoint permutation and accurate location, we further introduce joint-wise condition and local detail condition. Experimental results demonstrate that the proposed HandDiff significantly outperforms the existing approaches on four challenging hand pose benchmark datasets. Codes and pre-trained models are publicly available at https://github.com/cwc1260/HandDiff.
Submitted 3 April, 2024;
originally announced April 2024.
-
SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Authors:
Junghyun Koo,
Gordon Wichern,
Francois G. Germain,
Sameer Khurana,
Jonathan Le Roux
Abstract:
We introduce Self-Monitored Inference-Time INtervention (SMITIN), an approach for controlling an autoregressive generative music transformer using classifier probes. These simple logistic regression probes are trained on the output of each attention head in the transformer using a small dataset of audio examples both exhibiting and missing a specific musical trait (e.g., the presence/absence of drums, or real/synthetic music). We then steer the attention heads in the probe direction, ensuring the generative model output captures the desired musical trait. Additionally, we monitor the probe output to avoid adding an excessive amount of intervention into the autoregressive generation, which could lead to temporally incoherent music. We validate our results objectively and subjectively for both audio continuation and text-to-music applications, demonstrating the ability to add controls to large generative models for which retraining or even fine-tuning is impractical for most musicians.
Audio samples of the proposed intervention approach are available on our demo page http://tinyurl.com/smitin .
Submitted 2 April, 2024;
originally announced April 2024.
-
Prompt Learning via Meta-Regularization
Authors:
Jinyoung Park,
Juyeon Ko,
Hyunwoo J. Kim
Abstract:
Pre-trained vision-language models have shown impressive success on various computer vision tasks with their zero-shot generalizability. Recently, prompt learning approaches have been explored to efficiently and effectively adapt the vision-language models to a variety of downstream tasks. However, most existing prompt learning methods suffer from task overfitting since the general knowledge of the pre-trained vision language models is forgotten while the prompts are finetuned on a small data set from a specific target task. To address this issue, we propose a Prompt Meta-Regularization (ProMetaR) to improve the generalizability of prompt learning for vision-language models. Specifically, ProMetaR meta-learns both the regularizer and the soft prompts to harness the task-specific knowledge from the downstream tasks and task-agnostic general knowledge from the vision-language models. Further, ProMetaR augments the task to generate multiple virtual tasks to alleviate the meta-overfitting. In addition, we provide the analysis to comprehend how ProMetaR improves the generalizability of prompt tuning in the perspective of the gradient alignment. Our extensive experiments demonstrate that our ProMetaR improves the generalizability of conventional prompt learning methods under base-to-base/base-to-new and domain generalization settings. The code of ProMetaR is available at https://github.com/mlvlab/ProMetaR.
Submitted 31 March, 2024;
originally announced April 2024.
-
Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior
Authors:
Jaehoon Ko,
Kyusun Cho,
Joungbin Lee,
Heeji Yoon,
Sangmin Lee,
Sangjun Ahn,
Seungryong Kim
Abstract:
Recent methods for audio-driven talking head synthesis often optimize neural radiance fields (NeRF) on a monocular talking portrait video, leveraging its capability to render high-fidelity and 3D-consistent novel-view frames. However, they often struggle to reconstruct complete face geometry due to the absence of comprehensive 3D information in the input monocular videos. In this paper, we introduce a novel audio-driven talking head synthesis framework, called Talk3D, that can faithfully reconstruct plausible facial geometries by effectively adopting a pre-trained 3D-aware generative prior. Given the personalized 3D generative model, we present a novel audio-guided attention U-Net architecture that predicts the dynamic face variations in the NeRF space driven by audio. Furthermore, our model is modulated by audio-unrelated conditioning tokens, which effectively disentangle variations unrelated to audio features. Compared to existing methods, our method excels in generating realistic facial geometries even under extreme head poses. We also conduct extensive experiments showing our approach surpasses state-of-the-art benchmarks in terms of both quantitative and qualitative evaluations.
Submitted 29 March, 2024;
originally announced March 2024.
-
PropTest: Automatic Property Testing for Improved Visual Programming
Authors:
Jaywon Koo,
Ziyan Yang,
Paola Cascante-Bonilla,
Baishakhi Ray,
Vicente Ordonez
Abstract:
Visual Programming has emerged as an alternative to end-to-end black-box visual reasoning models. This type of method leverages Large Language Models (LLMs) to decompose a problem and generate the source code for an executable computer program. This strategy has the advantage of offering an interpretable reasoning path and does not require finetuning a model with task-specific data. We propose PropTest, a general strategy that improves visual programming by further using an LLM to generate code that tests for visual properties in an initial round of proposed solutions. In particular, our method tests for data-type consistency, as well as syntactic and semantic properties in the generated solutions. Our proposed solution outperforms baselines and achieves comparable results to state-of-the-art methods while using smaller and publicly available LLMs (CodeLlama-7B and WizardCoder-15B). This is demonstrated across different benchmarks on visual question answering and referring expression comprehension, showing the efficacy of our approach in enhancing the performance and generalization of visual reasoning tasks. Specifically, PropTest improves ViperGPT by obtaining 48.66% accuracy (+8.3%) on the A-OKVQA benchmark and 52.8% (+3.3%) on the RefCOCO+ benchmark using CodeLlama-7B.
Submitted 25 March, 2024;
originally announced March 2024.
-
SyncTweedies: A General Generative Framework Based on Synchronized Diffusions
Authors:
Jaihoon Kim,
Juil Koo,
Kyeongmin Yeo,
Minhyuk Sung
Abstract:
We introduce a general framework for generating diverse visual content, including ambiguous images, panorama images, mesh textures, and Gaussian splat textures, by synchronizing multiple diffusion processes. We present an exhaustive investigation into all possible scenarios for synchronizing multiple diffusion processes through a canonical space and analyze their characteristics across applications. In doing so, we reveal a previously unexplored case: averaging the outputs of Tweedie's formula while conducting denoising in multiple instance spaces. This case also provides the best quality with the widest applicability to downstream tasks. We name this case SyncTweedies. In our experiments generating the aforementioned visual content, we demonstrate the superior quality of generation by SyncTweedies compared to other synchronization methods as well as optimization-based and iterative-update-based methods.
Submitted 20 June, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
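As background for the entry above, Tweedie's formula in the diffusion setting gives an estimate of the clean sample from a noisy one. The notation below is the standard DDPM convention and is an assumption here, not taken from the abstract: $x_t$ is the noisy sample at step $t$, $\bar{\alpha}_t$ is the cumulative noise schedule, and $\epsilon_\theta$ is the learned noise predictor.

$$\hat{x}_0(x_t, t) = \frac{x_t - \sqrt{1-\bar{\alpha}_t}\,\epsilon_\theta(x_t, t)}{\sqrt{\bar{\alpha}_t}}$$

Under this reading, the case singled out in the abstract averages these $\hat{x}_0$ estimates across the synchronized processes, while each process continues denoising in its own instance space.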
-
Theoretical investigation of the vertical dielectric screening dependence on defects for few-layered van der Waals materials
Authors:
Amit Singh,
Seunghan Lee,
Hyeonhu Bae,
Jahyun Koo,
Li Yang,
Hoonkyung Lee
Abstract:
First-principles calculations were employed to analyze the effects induced by vacancies of molybdenum (Mo) and sulfur (S) on the dielectric properties of few-layered MoS2. We explored the combined effects of vacancies and dipole interactions on the dielectric properties of few-layered MoS2. In the presence of dielectric screening, we investigated uniformly distributed Mo and S vacancies, and then considered the case of concentrated vacancies. Our results show that the dielectric screening depends strongly on the distribution of vacancies, owing to the polarization induced by the vacancies, and on the interlayer distances. This conclusion was validated for a wide range of wide-gap semiconductors with different positions and distributions of vacancies, providing an effective and reliable method for calculating and predicting electrostatic screening of dimensionally reduced materials. We further provided a method for engineering the dielectric constant by changing the interlayer distance and tuning the number and distribution of vacancies in few-layered van der Waals materials for their application in nanodevices and supercapacitors.
Submitted 17 March, 2024;
originally announced March 2024.
-
A multiscale cavity method for sublinear-rank symmetric matrix factorization
Authors:
Jean Barbier,
Justin Ko,
Anas A. Rahman
Abstract:
We consider a statistical model for symmetric matrix factorization with additive Gaussian noise in the high-dimensional regime where the rank $M$ of the signal matrix to infer scales with its size $N$ as $M = o(N^{1/10})$. Allowing for an $N$-dependent rank offers new challenges and requires new methods. Working in the Bayesian-optimal setting, we show that whenever the signal has i.i.d. entries, the limiting mutual information between signal and data is given by a variational formula involving a rank-one replica symmetric potential. In other words, from the information-theoretic perspective, the case of a (slowly) growing rank is the same as when $M = 1$ (namely, the standard spiked Wigner model). The proof is primarily based on a novel multiscale cavity method allowing for growing rank along with some information-theoretic identities on worst noise for the Gaussian vector channel. We believe that the cavity method developed here will play a role in the analysis of a broader class of inference and spin models where the degrees of freedom are large arrays instead of vectors.
Submitted 11 March, 2024;
originally announced March 2024.
-
Fundamental limits of Non-Linear Low-Rank Matrix Estimation
Authors:
Pierre Mergny,
Justin Ko,
Florent Krzakala,
Lenka Zdeborová
Abstract:
We consider the task of estimating a low-rank matrix from non-linear and noisy observations. We prove a strong universality result showing that Bayes-optimal performances are characterized by an equivalent Gaussian model with an effective prior, whose parameters are entirely determined by an expansion of the non-linear function. In particular, we show that to reconstruct the signal accurately, one requires a signal-to-noise ratio growing as $N^{\frac 12 (1-1/k_F)}$, where $k_F$ is the first non-zero Fisher information coefficient of the function. We provide asymptotic characterization for the minimal achievable mean squared error (MMSE) and an approximate message-passing algorithm that reaches the MMSE under conditions analogous to the linear version of the problem. We also provide asymptotic errors achieved by methods such as principal component analysis combined with Bayesian denoising, and compare them with Bayes-optimal MMSE.
Submitted 7 March, 2024;
originally announced March 2024.
-
An Adaptable, Safe, and Portable Robot-Assisted Feeding System
Authors:
Ethan Kroll Gordon,
Rajat Kumar Jenamani,
Amal Nanavati,
Ziang Liu,
Haya Bolotski,
Raida Karim,
Daniel Stabile,
Atharva Kashyap,
Bernie Hao Zhu,
Xilai Dai,
Tyler Schrenk,
Jonathan Ko,
Taylor Kessler Faulkner,
Tapomayukh Bhattacharjee,
Siddhartha Srinivasa
Abstract:
We demonstrate a robot-assisted feeding system that enables people with mobility impairments to feed themselves. Our system design embodies Safety, Portability, and User Control, with comprehensive full-stack safety checks, the ability to be mounted on and powered by any powered wheelchair, and a custom web-app allowing care-recipients to leverage their own assistive devices for robot control. For bite acquisition, we leverage multi-modal online learning to tractably adapt to unseen food types. For bite transfer, we leverage real-time mouth perception and interaction-aware control. Co-designed with community researchers, our system has been validated through multiple end-user studies.
Submitted 6 March, 2024;
originally announced March 2024.
-
Spectral Phase Transition and Optimal PCA in Block-Structured Spiked models
Authors:
Pierre Mergny,
Justin Ko,
Florent Krzakala
Abstract:
We discuss the inhomogeneous spiked Wigner model, a theoretical framework recently introduced to study structured noise in various learning scenarios, through the prism of random matrix theory, with a specific focus on its spectral properties. Our primary objective is to find an optimal spectral method and to extend the celebrated Baik-Ben Arous-Péché (BBP) phase transition criterion -- well-known in the homogeneous case -- to our inhomogeneous, block-structured, Wigner model. We provide a thorough rigorous analysis of a transformed matrix and show that the transition for the appearance of 1) an outlier outside the bulk of the limiting spectral distribution and 2) a positive overlap between the associated eigenvector and the signal, occurs precisely at the optimal threshold, making the proposed spectral method optimal within the class of iterative methods for the inhomogeneous Wigner problem.
Submitted 6 March, 2024;
originally announced March 2024.
-
Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset
Authors:
Abbi Ward,
Jimmy Li,
Julie Wang,
Sriram Lakshminarasimhan,
Ashley Carrick,
Bilson Campana,
Jay Hartford,
Pradeep Kumar S,
Tiya Tiyasirichokchai,
Sunny Virmani,
Renee Wong,
Yossi Matias,
Greg S. Corrado,
Dale R. Webster,
Dawn Siegel,
Steven Lin,
Justin Ko,
Alan Karthikesalingam,
Christopher Semturs,
Pooja Rao
Abstract:
Background: Health datasets from clinical sources do not reflect the breadth and diversity of disease in the real world, impacting research, medical education, and artificial intelligence (AI) tool development. Dermatology is a suitable area to develop and test a new and scalable method to create representative health datasets.
Methods: We used Google Search advertisements to invite contributions to an open access dataset of images of dermatology conditions, demographic and symptom information. With informed contributor consent, we describe and release this dataset containing 10,408 images from 5,033 contributions from internet users in the United States over 8 months starting March 2023. The dataset includes dermatologist condition labels as well as estimated Fitzpatrick Skin Type (eFST) and Monk Skin Tone (eMST) labels for the images.
Results: We received a median of 22 submissions/day (IQR 14-30). Female (66.72%) and younger (52% < age 40) contributors had a higher representation in the dataset compared to the US population, and 32.6% of contributors reported a non-White racial or ethnic identity. Over 97.5% of contributions were genuine images of skin conditions. Dermatologist confidence in assigning a differential diagnosis increased with the number of available variables, and showed a weaker correlation with image sharpness (Spearman's P values <0.001 and 0.01 respectively). Most contributions were short-duration (54% with onset < 7 days ago ) and 89% were allergic, infectious, or inflammatory conditions. eFST and eMST distributions reflected the geographical origin of the dataset. The dataset is available at github.com/google-research-datasets/scin .
Conclusion: Search ads are effective at crowdsourcing images of health conditions. The SCIN dataset bridges important gaps in the availability of representative images of common skin conditions.
Submitted 28 February, 2024;
originally announced February 2024.
-
Continuous Memory Representation for Anomaly Detection
Authors:
Joo Chan Lee,
Taejune Kim,
Eunbyung Park,
Simon S. Woo,
Jong Hwan Ko
Abstract:
There have been significant advancements in anomaly detection in an unsupervised manner, where only normal images are available for training. Several recent methods aim to detect anomalies based on a memory, comparing or reconstructing the input with directly stored normal features (or trained features with normal images). However, such memory-based approaches operate on a discrete feature space implemented by the nearest neighbor or attention mechanism, suffering from poor generalization or an identity shortcut issue outputting the same as input, respectively. Furthermore, the majority of existing methods are designed to detect single-class anomalies, resulting in unsatisfactory performance when presented with multiple classes of objects. To tackle all of the above challenges, we propose CRAD, a novel anomaly detection method for representing normal features within a "continuous" memory, enabled by transforming spatial features into coordinates and mapping them to continuous grids. Furthermore, we carefully design the grids tailored for anomaly detection, representing both local and global normal features and fusing them effectively. Our extensive experiments demonstrate that CRAD successfully generalizes the normal features and mitigates the identity shortcut. In addition, CRAD effectively handles diverse classes in a single model thanks to the high-granularity continuous representation. In an evaluation using the MVTec AD dataset, CRAD significantly outperforms the previous state-of-the-art method by reducing 65.0% of the error for multi-class unified anomaly detection. The project page is available at https://tae-mo.github.io/crad/.
Submitted 10 March, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
Spatial Distribution of Intracluster Light versus Dark Matter in Horizon Run 5
Authors:
Jaewon Yoo,
Changbom Park,
Cristiano G. Sabiu,
Ankit Singh,
Jongwan Ko,
Jaehyun Lee,
Christophe Pichon,
M. James Jee,
Brad K. Gibson,
Owain Snaith,
Juhan Kim,
Jihye Shin,
Yonghwi Kim,
Hyowon Kim
Abstract:
One intriguing approach for studying the dynamical evolution of galaxy clusters is to compare the spatial distributions among various components, such as dark matter, member galaxies, gas, and intracluster light (ICL). Utilizing the recently introduced Weighted Overlap Coefficient (WOC) \citep{2022ApJS..261...28Y}, we analyze the spatial distributions of components within 174 galaxy clusters ($M_{\rm tot}> 5 \times 10^{13} M_{\odot}$, $z=0.625$) at varying dynamical states in the cosmological hydrodynamical simulation Horizon Run 5. We observe that the distributions of gas and the combination of ICL with the brightest cluster galaxy (BCG) closely resemble the dark matter distribution, particularly in more relaxed clusters, characterized by the half-mass epoch. The similarity in spatial distribution between dark matter and BCG+ICL mimics the changes in the dynamical state of clusters during a major merger. Notably, at redshifts $>$ 1, BCG+ICL traced dark matter more accurately than the gas. Additionally, we examined the one-dimensional radial profiles of each component, which show that the BCG+ICL is a sensitive component revealing the dynamical state of clusters. We propose a new method that can approximately recover the dark matter profile by scaling the BCG+ICL radial profile. Furthermore, we find a recipe for tracing dark matter in unrelaxed clusters by including the most massive satellite galaxies together with BCG+ICL distribution. Combining the BCG+ICL and the gas distribution enhances the dark matter tracing ability. Our results imply that the BCG+ICL distribution is an effective tracer for the dark matter distribution, and the similarity of spatial distribution may be a useful probe of the dynamical state of a cluster.
Submitted 27 February, 2024;
originally announced February 2024.
-
Waveform Simulation for Scintillation Characteristics of NaI(Tl) Crystal
Authors:
J. J. Choi,
C. Ha,
E. J. Jeon,
K. W. Kim,
S. K. Kim,
Y. D. Kim,
Y. J. Ko,
B. C. Koh,
H. S. Lee,
S. H. Lee,
S. M. Lee,
B. J. Park,
G. H. Yu
Abstract:
The lowering of the energy threshold in the NaI detector is crucial not only for comprehensive validation of DAMA/LIBRA but also for exploring new possibilities in the search for low-mass dark matter and observing coherent elastic scattering between neutrino and nucleus. Alongside hardware enhancements, extensive efforts have focused on refining event selection to discern noise, achieved through parameter development and the application of machine learning. Acquiring pure, unbiased datasets is crucial in this endeavor, for which a waveform simulation was developed. The simulation data were compared with the experimental data using several pulse shape discrimination parameters to test its performance in describing the experimental data. Additionally, we present the outcomes of multi-variable machine learning trained with simulation data as a scintillation signal sample. The distributions of outcomes for experimental and simulation data show a good agreement. As an application of the waveform simulation, we validate the trigger efficiency alongside estimations derived from the minimally biased measurement data.
Submitted 17 June, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis
Authors:
Juyeon Ko,
Inho Kong,
Dogyun Park,
Hyunwoo J. Kim
Abstract:
Semantic image synthesis (SIS) is a task to generate realistic images corresponding to semantic maps (labels). However, in real-world applications, SIS often encounters noisy user inputs. To address this, we propose Stochastic Conditional Diffusion Model (SCDM), which is a robust conditional diffusion model that features novel forward and generation processes tailored for SIS with noisy labels. It enhances robustness by stochastically perturbing the semantic label maps through Label Diffusion, which diffuses the labels with discrete diffusion. Through the diffusion of labels, the noisy and clean semantic maps become similar as the timestep increases, eventually becoming identical at $t=T$. This facilitates the generation of an image close to a clean image, enabling robust generation. Furthermore, we propose a class-wise noise schedule to differentially diffuse the labels depending on the class. We demonstrate that the proposed method generates high-quality samples through extensive experiments and analyses on benchmark datasets, including a novel experimental setup simulating human errors during real-world applications. Code is available at https://github.com/mlvlab/SCDM.
Submitted 3 June, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
Closing the AI generalization gap by adjusting for dermatology condition distribution differences across clinical settings
Authors:
Rajeev V. Rikhye,
Aaron Loh,
Grace Eunhae Hong,
Preeti Singh,
Margaret Ann Smith,
Vijaytha Muralidharan,
Doris Wong,
Rory Sayres,
Michelle Phung,
Nicolas Betancourt,
Bradley Fong,
Rachna Sahasrabudhe,
Khoban Nasim,
Alec Eschholz,
Basil Mustafa,
Jan Freyberg,
Terry Spitz,
Yossi Matias,
Greg S. Corrado,
Katherine Chou,
Dale R. Webster,
Peggy Bui,
Yuan Liu,
Yun Liu,
Justin Ko
, et al. (1 additional author not shown)
Abstract:
Recently, there has been great progress in the ability of artificial intelligence (AI) algorithms to classify dermatological conditions from clinical photographs. However, little is known about the robustness of these algorithms in real-world settings where several factors can lead to a loss of generalizability. Understanding and overcoming these limitations will permit the development of generalizable AI that can aid in the diagnosis of skin conditions across a variety of clinical settings. In this retrospective study, we demonstrate that differences in skin condition distribution, rather than in demographics or image capture mode, are the main source of errors when an AI algorithm is evaluated on data from a previously unseen source. We demonstrate a series of steps to close this generalization gap, requiring progressively more information about the new source, ranging from the condition distribution to training data enriched for data less frequently seen during training. Our results also suggest comparable performance from end-to-end fine tuning versus fine tuning solely the classification layer on top of a frozen embedding model. Our approach can inform the adaptation of AI algorithms to new settings, based on the information and resources available.
Submitted 23 February, 2024;
originally announced February 2024.
-
Measurements of low-energy nuclear recoil quenching factors for Na and I recoils in the NaI(Tl) scintillator
Authors:
S. H. Lee,
H. W. Joo,
H. J. Kim,
K. W. Kim,
S. K. Kim,
Y. D. Kim,
Y. J. Ko,
H. S. Lee,
J. Y. Lee,
H. S. Park,
Y. S. Yoon
Abstract:
Elastic scattering off nuclei in target detectors, involving interactions with dark matter and coherent elastic neutrino nuclear recoil (CE$ν$NS), results in the deposition of low energy within the nuclei, dissipating rapidly through a combination of heat and ionization. The primary energy loss mechanism for nuclear recoil is heat, leading to consistently smaller measurable scintillation signals compared to electron recoils of the same energy. The nuclear recoil quenching factor (QF), representing the ratio of scintillation light yield produced by nuclear recoil to that of electron recoil at the same energy, is a critical parameter for understanding dark matter and neutrino interactions with nuclei. The low energy QF of NaI(Tl) crystals, commonly employed in dark matter searches and CE$ν$NS measurements, is of substantial importance. Previous low energy QF measurements were constrained by contamination from photomultiplier tube (PMT)-induced noise, resulting in an observed light yield of approximately 15 photoelectrons per keVee (kilo-electron-volt electron-equivalent energy) and nuclear recoil energy above 5 keVnr (kilo-electron-volt nuclear recoil energy). Through enhanced crystal encapsulation, an increased light yield of around 26 photoelectrons per keVee is achieved. This improvement enables the measurement of the nuclear recoil QF for sodium nuclei at an energy of 3.8 $\pm$ 0.6 keVnr with a QF of 11.2 $\pm$ 1.7%. Furthermore, a reevaluation of previously reported QF results is conducted, incorporating enhancements in low energy events based on waveform simulation. The outcomes are generally consistent with various recent QF measurements for sodium and iodine.
Submitted 8 July, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
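The quenching-factor definition in this abstract can be turned into a small worked calculation. The sketch below assumes the usual reading of the definition, namely that the electron-equivalent energy is the nuclear recoil energy multiplied by the QF, and it ignores the quoted uncertainties. The function names are hypothetical and are not taken from the paper; the input values (3.8 keVnr, QF of 11.2%, about 26 photoelectrons per keVee) are the ones quoted in the abstract.

```python
def electron_equivalent_energy(recoil_kev_nr: float, quenching_factor: float) -> float:
    """Convert a nuclear recoil energy (keVnr) to electron-equivalent energy (keVee).

    Assumes E_ee = QF * E_nr, i.e. the QF is the ratio of light yields
    at the same deposited energy.
    """
    return recoil_kev_nr * quenching_factor


def expected_photoelectrons(energy_kev_ee: float, light_yield_pe_per_kev_ee: float) -> float:
    """Mean number of photoelectrons for a given electron-equivalent energy."""
    return energy_kev_ee * light_yield_pe_per_kev_ee


# Central values quoted in the abstract (uncertainties are not propagated here).
e_ee = electron_equivalent_energy(3.8, 0.112)   # sodium recoil at 3.8 keVnr, QF = 11.2%
n_pe = expected_photoelectrons(e_ee, 26.0)      # light yield of about 26 PE/keVee

print(f"electron-equivalent energy: {e_ee:.3f} keVee")
print(f"expected photoelectrons:    {n_pe:.1f}")
```

Under these assumptions a 3.8 keVnr sodium recoil corresponds to well under 1 keVee and only about ten photoelectrons, which illustrates why the light yield matters for reaching such low recoil energies.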
-
Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields
Authors:
Seungtae Nam,
Daniel Rho,
Jong Hwan Ko,
Eunbyung Park
Abstract:
Despite the remarkable achievements of neural radiance fields (NeRF) in representing 3D scenes and generating novel view images, the aliasing issue, rendering "jaggies" or "blurry" images at varying camera distances, remains unresolved in most existing approaches. The recently proposed mip-NeRF has addressed this challenge by rendering conical frustums instead of rays. However, it relies on MLP architecture to represent the radiance fields, missing out on the fast training speed offered by the latest grid-based methods. In this work, we present mip-Grid, a novel approach that integrates anti-aliasing techniques into grid-based representations for radiance fields, mitigating the aliasing artifacts while enjoying fast training time. The proposed method generates multi-scale grids by applying simple convolution operations over a shared grid representation and uses the scale-aware coordinate to retrieve features at different scales from the generated multi-scale grids. To test the effectiveness, we integrated the proposed method into the two recent representative grid-based methods, TensoRF and K-Planes. Experimental results demonstrate that mip-Grid greatly improves the rendering performance of both methods and even outperforms mip-NeRF on multi-scale datasets while achieving significantly faster training time. For code and demo videos, please see https://stnamjef.github.io/mipgrid.github.io/.
Submitted 21 February, 2024;
originally announced February 2024.
-
DistiLLM: Towards Streamlined Distillation for Large Language Models
Authors:
Jongwoo Ko,
Sungnyun Kim,
Tianyi Chen,
Se-Young Yun
Abstract:
Knowledge distillation (KD) is widely used for compressing a teacher model to a smaller student model, reducing its inference cost and memory footprint while preserving model capabilities. However, current KD methods for auto-regressive sequence models (e.g., large language models) suffer from missing a standardized objective function. Moreover, the recent use of student-generated outputs to address training-inference mismatches has significantly escalated computational costs. To tackle these issues, we introduce DistiLLM, a more effective and efficient KD framework for auto-regressive language models. DistiLLM comprises two components: (1) a novel skew Kullback-Leibler divergence loss, where we unveil and leverage its theoretical properties, and (2) an adaptive off-policy approach designed to enhance the efficiency in utilizing student-generated outputs. Extensive experiments, including instruction-following tasks, demonstrate the effectiveness of DistiLLM in building high-performing student models while achieving up to 4.3$\times$ speedup compared to recent KD methods.
Submitted 3 July, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
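The abstract names a skew Kullback-Leibler divergence loss without giving its form. As a rough illustration only, the sketch below assumes the common skew construction in which the teacher distribution p is compared against a mixture of p and the student distribution q, that is KL(p || alpha * p + (1 - alpha) * q). The function names and the default alpha are hypothetical, and the exact loss used in DistiLLM may differ.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions given as lists of probabilities."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def skew_kl(p, q, alpha=0.1):
    """Skew KL divergence, assumed here to be KL(p || alpha * p + (1 - alpha) * q).

    Mixing a little of p into the second argument keeps the divergence finite
    even where q assigns zero probability to an outcome that p supports.
    """
    mixture = [alpha * pi + (1 - alpha) * qi for pi, qi in zip(p, q)]
    return kl_divergence(p, mixture)


teacher = [0.5, 0.5]
student = [1.0, 0.0]  # plain KL(teacher || student) would be infinite here

print(skew_kl(teacher, student, alpha=0.1))
```

With alpha set to 0 this reduces to the ordinary KL divergence, and for alpha between 0 and 1 convexity of KL in its second argument bounds it by (1 - alpha) times the ordinary value.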
-
A Gated MLP Architecture for Learning Topological Dependencies in Spatio-Temporal Graphs
Authors:
Yun Young Choi,
Minho Lee,
Sun Woo Park,
Seunghwan Lee,
Joohwan Ko
Abstract:
Graph Neural Networks (GNNs) and Transformer have been increasingly adopted to learn the complex vector representations of spatio-temporal graphs, capturing intricate spatio-temporal dependencies crucial for applications such as traffic datasets. Although many existing methods utilize multi-head attention mechanisms and message-passing neural networks (MPNNs) to capture both spatial and temporal relations, these approaches encode temporal and spatial relations independently, and reflect the graph's topological characteristics in a limited manner. In this work, we introduce the Cycle to Mixer (Cy2Mixer), a novel spatio-temporal GNN based on topological non-trivial invariants of spatio-temporal graphs with gated multi-layer perceptrons (gMLP). The Cy2Mixer is composed of three blocks based on MLPs: A message-passing block for encapsulating spatial information, a cycle message-passing block for enriching topological information through cyclic subgraphs, and a temporal block for capturing temporal properties. We bolster the effectiveness of Cy2Mixer with mathematical evidence emphasizing that our cycle message-passing block is capable of offering differentiated information to the deep learning model compared to the message-passing block. Furthermore, empirical evaluations substantiate the efficacy of the Cy2Mixer, demonstrating state-of-the-art performances across various traffic benchmark datasets.
Submitted 29 January, 2024;
originally announced January 2024.
-
νOscillation: a software package for computation and simulation of neutrino propagation and interaction
Authors:
Seonghyeok Jang,
Eunju Jeon,
Eunil Won,
Young Ju Ko,
Kyungmin Lee
Abstract:
The behavior of neutrinos is the only phenomenon that cannot be explained by the standard model of particle physics. Because of these mysterious neutrino interactions observed in nature, at present, there is growing interest in this field and ongoing or planned neutrino experiments are seeking solutions to this mystery very actively. The design of neutrino experiments and the analysis of neutrino data rely on precise computations of neutrino oscillations and scattering processes in general. Motivated by this, we developed a software package that calculates neutrino production and oscillation in nuclear reactors, neutrino-electron scattering of solar neutrinos, and the oscillation of neutrinos from radioactive isotopes for the search of sterile neutrinos. This software package is validated by reproducing the result of calculations and observations in other publications. We also demonstrate the feasibility of this package by calculating the sensitivity of a liquid scintillator detector, currently in planning, to the sterile neutrinos. This work is expected to be used in designs of future neutrino experiments.
Submitted 18 June, 2024; v1 submitted 23 January, 2024;
originally announced January 2024.
-
Provably Scalable Black-Box Variational Inference with Structured Variational Families
Authors:
Joohwan Ko,
Kyurae Kim,
Woo Chang Kim,
Jacob R. Gardner
Abstract:
Variational families with full-rank covariance approximations are known not to work well in black-box variational inference (BBVI), both empirically and theoretically. In fact, recent computational complexity results for BBVI have established that full-rank variational families scale poorly with the dimensionality of the problem compared to e.g. mean-field families. This is particularly critical to hierarchical Bayesian models with local variables; their dimensionality increases with the size of the datasets. Consequently, one gets an iteration complexity with an explicit $\mathcal{O}(N^2)$ dependence on the dataset size $N$. In this paper, we explore a theoretical middle ground between mean-field variational families and full-rank families: structured variational families. We rigorously prove that certain scale matrix structures can achieve a better iteration complexity of $\mathcal{O}(N)$, implying better scaling with respect to $N$. We empirically verify our theoretical results on large-scale hierarchical models.
Submitted 1 June, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
Improving Local Training in Federated Learning via Temperature Scaling
Authors:
Kichang Lee,
Songkuk Kim,
JeongGil Ko
Abstract:
Federated learning is inherently hampered by data heterogeneity: non-i.i.d. training data over local clients. We propose a novel model training approach for federated learning, FLex&Chill, which exploits the Logit Chilling method. Through extensive evaluations, we demonstrate that, in the presence of non-i.i.d. data characteristics inherent in federated learning systems, this approach can expedite model convergence and improve inference accuracy. Quantitatively, from our experiments, we observe up to 6X improvement in the global federated learning model convergence time, and up to 3.37% improvement in inference accuracy.
Submitted 26 June, 2024; v1 submitted 18 January, 2024;
originally announced January 2024.
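The abstract does not define the Logit Chilling method. Going by the title, a plausible reading is that local training applies a softmax temperature below 1 to the logits; that reading is an assumption here, not a statement of the authors' procedure. The sketch below only shows generic temperature scaling of a softmax, with a hypothetical function name.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Softmax over logits divided by a temperature.

    A temperature below 1 sharpens the distribution and a temperature above 1
    flattens it. Treating "chilling" as temperature < 1 is an assumption.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    shift = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - shift) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.0]
print(softmax_with_temperature(logits, temperature=1.0))
print(softmax_with_temperature(logits, temperature=0.5))  # sharper distribution
```

Lowering the temperature concentrates probability mass on the largest logit, which changes the gradients seen during training; how this interacts with non-i.i.d. client data is what the paper evaluates.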
-
Integrating Graceful Degradation and Recovery through Requirement-driven Adaptation
Authors:
Simon Chu,
Justin Koe,
David Garlan,
Eunsuk Kang
Abstract:
Cyber-physical systems (CPS) are subject to environmental uncertainties such as adverse operating conditions, malicious attacks, and hardware degradation. These uncertainties may lead to failures that put the system in a sub-optimal or unsafe state. Systems that are resilient to such uncertainties rely on two types of operations: (1) graceful degradation, to ensure that the system maintains an acceptable level of safety during unexpected environmental conditions and (2) recovery, to facilitate the resumption of normal system functions. Typically, mechanisms for degradation and recovery are developed independently from each other, and later integrated into a system, requiring the designer to develop an additional, ad-hoc logic for activating and coordinating between the two operations. In this paper, we propose a self-adaptation approach for improving system resiliency through automated triggering and coordination of graceful degradation and recovery. The key idea behind our approach is to treat degradation and recovery as requirement-driven adaptation tasks: Degradation can be thought of as temporarily weakening original (i.e., ideal) system requirements to be achieved by the system, and recovery as strengthening the weakened requirements when the environment returns within an expected operating boundary. Furthermore, by treating weakening and strengthening as dual operations, we argue that a single requirement-based adaptation method is sufficient to enable coordination between degradation and recovery. Given system requirements specified in signal temporal logic (STL), we propose a run-time adaptation framework that performs degradation and recovery in response to environmental changes. We describe a prototype implementation of our framework and demonstrate the feasibility of the proposed approach using a case study in unmanned underwater vehicles.
Submitted 8 April, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
Background study of the AMoRE-pilot experiment
Authors:
A. Agrawal,
V. V. Alenkov,
P. Aryal,
J. Beyer,
B. Bhandari,
R. S. Boiko,
K. Boonin,
O. Buzanov,
C. R. Byeon,
N. Chanthima,
M. K. Cheoun,
J. S. Choe,
Seonho Choi,
S. Choudhury,
J. S. Chung,
F. A. Danevich,
M. Djamal,
D. Drung,
C. Enss,
A. Fleischmann,
A. M. Gangapshev,
L. Gastaldo,
Yu. M. Gavrilyuk,
A. M. Gezhaev,
O. Gileva
, et al. (83 additional authors not shown)
Abstract:
We report a study on the background of the Advanced Molybdenum-Based Rare process Experiment (AMoRE), a search for neutrinoless double beta decay ($0νββ$) of $^{100}$Mo. The pilot stage of the experiment was conducted using $\sim$1.9 kg of CaMoO$_4$ crystals at the Yangyang Underground Laboratory, South Korea, from 2015 to 2018. We compared the measured $β/γ$ energy spectra in three experimental configurations with the results of Monte Carlo simulations and identified the background sources in each configuration. We replaced several detector components and enhanced the neutron shielding to lower the background level between configurations. A limit on the half-life of $0νββ$ decay of $^{100}$Mo was found at $T_{1/2}^{0ν} \ge 3.0\times 10^{23}$ years at 90\% confidence level, based on the measured background and its modeling. Further reductions of the background rate in AMoRE-I and AMoRE-II are discussed.
Submitted 7 April, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
Nonproportionality of NaI(Tl) Scintillation Detector for Dark Matter Search Experiments
Authors:
S. M. Lee,
G. Adhikari,
N. Carlin,
J. Y. Cho,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. França,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
S. W. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim
, et al. (37 additional authors not shown)
Abstract:
We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced by decays supported by both long and short-lived isotopes. Analyzing peaks from decays supported only by short-lived isotopes presented a unique challenge due to their limited statistics and overlapping energies, which was overcome by long-term data collection and a time-dependent analysis. A key achievement is the direct measurement of the 0.87 keV light yield, resulting from the cascade following electron capture decay of $^{22}$Na from internal contamination. This measurement, previously accessible only indirectly, deepens our understanding of NaI(Tl) scintillator behavior in the region of interest for dark matter searches. This study holds substantial implications for background modeling and the interpretation of dark matter signals in NaI(Tl) experiments.
Submitted 10 May, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
Transfer-Learning-Based Autotuning Using Gaussian Copula
Authors:
Thomas Randall,
Jaehoon Koo,
Brice Videau,
Michael Kruse,
Xingfu Wu,
Paul Hovland,
Mary Hall,
Rong Ge,
Prasanna Balaprakash
Abstract:
As diverse high-performance computing (HPC) systems are built, many opportunities arise for applications to solve larger problems than ever before. Given the significantly increased complexity of these HPC systems and application tuning, empirical performance tuning, such as autotuning, has emerged as a promising approach in recent years. Despite its effectiveness, autotuning is often a computationally expensive approach. Transfer learning (TL)-based autotuning seeks to address this issue by leveraging the data from prior tuning. Current TL methods for autotuning spend significant time modeling the relationship between parameter configurations and performance, which is ineffective for few-shot (that is, few empirical evaluations) tuning on new tasks. We introduce the first generative TL-based autotuning approach based on the Gaussian copula (GC) to model the high-performing regions of the search space from prior data and then generate high-performing configurations for new tasks. This allows a sampling-based approach that maximizes few-shot performance and provides the first probabilistic estimation of the few-shot budget for effective TL-based autotuning. We compare our generative TL approach with state-of-the-art autotuning techniques on several benchmarks. We find that the GC is capable of achieving 64.37% of peak few-shot performance in its first evaluation. Furthermore, the GC model can determine a few-shot transfer budget that yields up to 33.39$\times$ speedup, a dramatic improvement over the 20.58$\times$ speedup using prior techniques.
Submitted 9 January, 2024;
originally announced January 2024.