-
Collision of two spinning billiard balls and the role of table
Authors:
Hyeong-Chan Kim
Abstract:
We study the collision dynamics of a spinning cue ball approaching a static object ball with equal mass on a plane, common in billiards. While typical collisions in billiards are nearly perfectly elastic, with a restitution coefficient close to 1 and low friction, we explore three deviations from ideal elastic collisions: The non-elastic nature, the friction effects between the balls during collis…
▽ More
We study the collision dynamics of a spinning cue ball approaching a static object ball with equal mass on a plane, common in billiards. While typical collisions in billiards are nearly perfectly elastic, with a restitution coefficient close to 1 and low friction, we explore three deviations from ideal elastic collisions: The non-elastic nature, the friction effects between the balls during collision, the friction between the ball and the table. We describe the detailed collision outcomes, emphasizing the importance of considering frictions. We reveal that friction, both between the balls and with the table, significantly influences the post-collision motions, deviating from the expectations of a purely elastic collision. The insights gained contribute to a better understanding of ball dynamics, impacting strategies and gameplay in billiards.
△ Less
Submitted 18 January, 2024;
originally announced February 2024.
-
Revisiting thermodynamics in (LiF, NaF, KF, CrF2)-CrF3 by first-principles calculations and CALPHAD modeling
Authors:
Rushi Gong,
Shun-Li Shang,
Yi Wang,
Jorge Paz Soldan Palma,
Hojong Kim,
Zi-Kui Liu
Abstract:
The thermodynamic description of the (LiF, NaF, KF, CrF2)-CrF3 systems has been revisited, aiming for a better understanding of the effects of Cr on the FLiNaK molten salt. First-principles calculations based on density functional theory (DFT) were performed to determine the electronic and structural properties of each compound, including the formation enthalpy, volume, and bulk modulus. DFT-based…
▽ More
The thermodynamic description of the (LiF, NaF, KF, CrF2)-CrF3 systems has been revisited, aiming for a better understanding of the effects of Cr on the FLiNaK molten salt. First-principles calculations based on density functional theory (DFT) were performed to determine the electronic and structural properties of each compound, including the formation enthalpy, volume, and bulk modulus. DFT-based phonon calculations were carried out to determine the thermodynamic properties of compounds, for example, enthalpy, entropy, and heat capacity as functions of temperature. Phonon-based thermodynamic properties show a good agreement with experimental data of binary compounds LiF, NaF, KF, CrF3, and CrF2, establishing a solid foundation to determine thermodynamic properties of ternary compounds as well as to verify results estimated by the Neumann-Kopp rule. Additionally, DFT-based ab initio molecular dynamics (AIMD) simulations were employed to predict the mixing enthalpies of liquid salts. Using DFT-based results and experimental data in the literature, the (LiF, NaF, KF, CrF2)-CrF3 system has been remodeled in terms of the CALculation of PHAse Diagrams (CALPHAD) approach using the modified quasichemical model with quadruplet approximation (MQMQA) for liquid. Calculated phase stability in the present work shows an excellent agreement with experiments, indicating the effectiveness of combining DFT-based total energy, phonon, and AIMD calculations, and CALPHAD modeling to provide the thermodynamic description in complex molten salt systems.
△ Less
Submitted 28 February, 2024; v1 submitted 19 February, 2024;
originally announced February 2024.
-
Holographic dual effective field theory for an SYK model
Authors:
Yoon-Seok Choun,
Hyeon Jung Kim,
Ki-Seok Kim
Abstract:
We derive an emergent holographic dual description for an SYK model, where the renormalization group (RG) flows of collective bi-local fields appear manifestly in the bulk effective action with an emergent extradimension. This holographic dual effective field theory reproduces $1/N$ quantum corrections given by the Schwarzian action when we take the UV limit in the bulk effective action. Going int…
▽ More
We derive an emergent holographic dual description for an SYK model, where the renormalization group (RG) flows of collective bi-local fields appear manifestly in the bulk effective action with an emergent extradimension. This holographic dual effective field theory reproduces $1/N$ quantum corrections given by the Schwarzian action when we take the UV limit in the bulk effective action. Going into the IR regime in the extradimension, we observe that the field theoretic $1/N$, $1/N^{2}$, ... quantum corrections are resummed in the all-loop order and reorganized to form a holographic dual effective field theory in a large $N$ fashion living on the one-higher dimensional spacetime. Taking the large $N$ limit in the holographic dual effective field theory, we obtain nonlinearly coupled second-order bulk differential equations of motion for the three bi-local order-parameter fields of fermion self-energy, Green's function, and polarization function. Here, both UV and IR boundary conditions are derived self-consistently from the boundary effective action. We solve these highly intertwined nonlinear differential equations based on the so called matching method. Our ansatz for the bi-local order-parameter fields coincide with the conformally invariant solution of the field theoretic large $N$ limit in the UV limit, but their overall coefficients RG-flow along the extradimensional space, respectively, reflecting effects of higher-order quantum corrections. As a result, we find an insulating behavior, where the self-energy diverges at IR. To confirm this insulating physics, we investigate thermodynamics. We obtain an effective free energy functional in terms of such bi-local dual order-parameter fields, which satisfy the Hamilton-Jacobi equation of the holographic dual effective field theory. ...
△ Less
Submitted 21 June, 2024; v1 submitted 19 February, 2024;
originally announced February 2024.
-
Zagier-Hoffman's conjectures in positive characteristic II
Authors:
Bo-Hae Im,
Ho** Kim,
Khac Nhuan Le,
Tuan Ngo Dac,
Lan Huong Pham
Abstract:
Zagier-Hoffman's conjectures predict the dimension and a basis for the $\mathbb Q$-vector spaces spanned by $N$th cyclotomic multiple zeta values (MZV's) of fixed weight where $N$ is a natural number.
For $N=1$ (MZV's case), half of these conjectures have been solved by the work of Terasoma, Deligne-Goncharov and Brown with the help of Zagier's identity. The other half are completely open. For…
▽ More
Zagier-Hoffman's conjectures predict the dimension and a basis for the $\mathbb Q$-vector spaces spanned by $N$th cyclotomic multiple zeta values (MZV's) of fixed weight where $N$ is a natural number.
For $N=1$ (MZV's case), half of these conjectures have been solved by the work of Terasoma, Deligne-Goncharov and Brown with the help of Zagier's identity. The other half are completely open. For $N=2$ (alternating MZV's case) and $N=3,4,8$, Deligne-Goncharov and Deligne solved the same half of these conjectures for $N$th-cyclotomic MZV's. For other values of $N$, no sharp upper bound on the dimension is known.
In this paper we completely establish, for all $N$, Zagier-Hoffman's conjectures for $N$th cyclotomic multiple zeta values in positive characteristic. By working with the tower of all cyclotomic extensions, we present a proof that is uniform on $N$ and give an effective algorithm to express any cyclotomic multiple zeta value in the chosen basis. This generalizes all previous work on these conjectures for MZV's and alternating MZV's in positive characteristic.
△ Less
Submitted 18 February, 2024;
originally announced February 2024.
-
Can Separators Improve Chain-of-Thought Prompting?
Authors:
Yoonjeong Park,
Hyun** Kim,
Chanyeol Choi,
Junseong Kim,
Jy-yong Sohn
Abstract:
Chain-of-thought (CoT) prompting is a simple and effective method for improving the reasoning capabilities of Large language models (LLMs). The basic idea of CoT is to let LLMs break down their thought processes step-by-step by putting exemplars in the input prompt. However, the densely structured prompt exemplars of CoT may cause the cognitive overload of LLMs. Inspired by human cognition, we int…
▽ More
Chain-of-thought (CoT) prompting is a simple and effective method for improving the reasoning capabilities of Large language models (LLMs). The basic idea of CoT is to let LLMs break down their thought processes step-by-step by putting exemplars in the input prompt. However, the densely structured prompt exemplars of CoT may cause the cognitive overload of LLMs. Inspired by human cognition, we introduce CoT-Sep, a novel method that strategically employs separators at the end of each exemplar in CoT prompting. These separators are designed to help the LLMs understand their thought processes better while reasoning. It turns out that CoT-Sep significantly improves the LLMs' performances on complex reasoning tasks (e.g., GSM-8K, AQuA, CSQA), compared with the vanilla CoT, which does not use separators. We also study the effects of the type and the location of separators tested on multiple LLMs, including GPT-3.5-Turbo, GPT-4, and LLaMA-2 7B. Interestingly, the type/location of separators should be chosen appropriately to boost the reasoning capability of CoT.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference
Authors:
Taesu Kim,
Jongho Lee,
Daehyun Ahn,
Sarang Kim,
Jiwoong Choi,
Minkyu Kim,
Hyungjun Kim
Abstract:
We introduce QUICK, a group of novel optimized CUDA kernels for the efficient inference of quantized Large Language Models (LLMs). QUICK addresses the shared memory bank-conflict problem of state-of-the-art mixed precision matrix multiplication kernels. Our method interleaves the quantized weight matrices of LLMs offline to skip the shared memory write-back after the dequantization. We demonstrate…
▽ More
We introduce QUICK, a group of novel optimized CUDA kernels for the efficient inference of quantized Large Language Models (LLMs). QUICK addresses the shared memory bank-conflict problem of state-of-the-art mixed precision matrix multiplication kernels. Our method interleaves the quantized weight matrices of LLMs offline to skip the shared memory write-back after the dequantization. We demonstrate up to 1.91x speedup over existing kernels of AutoAWQ on larger batches and up to 1.94x throughput gain on representative LLM models on various NVIDIA GPU devices.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Authors:
Jisu Nam,
Heesu Kim,
DongJae Lee,
Siyoon **,
Seungryong Kim,
Seunggyu Chang
Abstract:
The objective of text-to-image (T2I) personalization is to customize a diffusion model to a user-provided reference concept, generating diverse images of the concept aligned with the target prompts. Conventional methods representing the reference concepts using unique text embeddings often fail to accurately mimic the appearance of the reference. To address this, one solution may be explicitly con…
▽ More
The objective of text-to-image (T2I) personalization is to customize a diffusion model to a user-provided reference concept, generating diverse images of the concept aligned with the target prompts. Conventional methods representing the reference concepts using unique text embeddings often fail to accurately mimic the appearance of the reference. To address this, one solution may be explicitly conditioning the reference images into the target denoising process, known as key-value replacement. However, prior works are constrained to local editing since they disrupt the structure path of the pre-trained T2I model. To overcome this, we propose a novel plug-in method, called DreamMatcher, which reformulates T2I personalization as semantic matching. Specifically, DreamMatcher replaces the target values with reference values aligned by semantic matching, while leaving the structure path unchanged to preserve the versatile capability of pre-trained T2I models for generating diverse structures. We also introduce a semantic-consistent masking strategy to isolate the personalized concept from irrelevant regions introduced by the target prompts. Compatible with existing T2I models, DreamMatcher shows significant improvements in complex scenarios. Intensive analyses demonstrate the effectiveness of our approach.
△ Less
Submitted 23 April, 2024; v1 submitted 15 February, 2024;
originally announced February 2024.
-
Approximating maximum independent set on Rydberg atom arrays using local detunings
Authors:
Hyeonjun Yeo,
Ha Eum Kim,
Kabgyun Jeong
Abstract:
Rydberg atom arrays are among the most promising quantum simulating platforms due to their scalability and long coherence time. From the perspective of combinatorial optimization, they are intrinsic solver for the maximum independent set problem because of the resemblance between the Rydberg Hamiltonian and the cost function of the maximum independent set problem. In this paper, we suggest a strat…
▽ More
Rydberg atom arrays are among the most promising quantum simulating platforms due to their scalability and long coherence time. From the perspective of combinatorial optimization, they are intrinsic solver for the maximum independent set problem because of the resemblance between the Rydberg Hamiltonian and the cost function of the maximum independent set problem. In this paper, we suggest a strategy to approximate maximum independent sets by adjusting local detunings on the Rydberg Hamiltonian according to each vertex's vertex support, which is a quantity that represents connectivity between vertices. By doing so, we explicitly reflect on the Rydberg Hamiltonian the potential probability that each vertex will be included in maximum independent sets. Our strategy reduces an error rate three times for the checkerboard graphs with defects when the adiabaticity is enough. Our strategy also decreases the error rate for random graphs of density 3.0, even when the adiabaticity is relatively insufficient. Moreover, we harness our strategy to raise the fidelity between the evolved quantum state and a 2D cat state on a square lattice, showing that our strategy helps to prepare a quantum many-body ground state.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
High-precision and low-noise dielectric tensor tomography using a micro-electromechanical system mirror
Authors:
Juheon Lee,
Byung Gyu Chae,
Hyuneui Kim,
MinSung Yoon,
Herve Hugonnet,
YongKeun Park
Abstract:
Dielectric tensor tomography is an imaging technique for map** three-dimensional distributions of dielectric properties in transparent materials. This work introduces an enhanced illumination strategy employing a micro-electromechanical system mirror to achieve high precision and reduced noise in imaging. This illumination approach allows for precise manipulation of light, significantly improvin…
▽ More
Dielectric tensor tomography is an imaging technique for map** three-dimensional distributions of dielectric properties in transparent materials. This work introduces an enhanced illumination strategy employing a micro-electromechanical system mirror to achieve high precision and reduced noise in imaging. This illumination approach allows for precise manipulation of light, significantly improving the accuracy of angle control and minimizing diffraction noise compared to traditional beam steering approaches. Our experiments have successfully reconstructed the dielectric properties of liquid crystal droplets, which are known for their anisotropic structures, while demonstrating a notable reduction in background noise of the imag-es. Additionally, the technique has been applied to more complex samples, revealing its capability to achieve a high signal-to-noise ratio. This development represents a significant step forward in the field of birefringence imaging, offering a powerful tool for detailed study of materials with anisotropic properties.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
Authors:
Jiwon Song,
Kyungseok Oh,
Taesu Kim,
Hyungjun Kim,
Yulhwa Kim,
Jae-Joon Kim
Abstract:
Large language models (LLMs) have proven to be highly effective across various natural language processing tasks. However, their large number of parameters poses significant challenges for practical deployment. Pruning, a technique aimed at reducing the size and complexity of LLMs, offers a potential solution by removing redundant components from the network. Despite the promise of pruning, existi…
▽ More
Large language models (LLMs) have proven to be highly effective across various natural language processing tasks. However, their large number of parameters poses significant challenges for practical deployment. Pruning, a technique aimed at reducing the size and complexity of LLMs, offers a potential solution by removing redundant components from the network. Despite the promise of pruning, existing methods often struggle to achieve substantial end-to-end LLM inference speedup. In this paper, we introduce SLEB, a novel approach designed to streamline LLMs by eliminating redundant transformer blocks. We choose the transformer block as the fundamental unit for pruning, because LLMs exhibit block-level redundancy with high similarity between the outputs of neighboring blocks. This choice allows us to effectively enhance the processing speed of LLMs. Our experimental results demonstrate that SLEB outperforms previous LLM pruning methods in accelerating LLM inference while also maintaining superior perplexity and accuracy, making SLEB as a promising technique for enhancing the efficiency of LLMs. The code is available at: https://github.com/jiwonsong-dev/SLEB.
△ Less
Submitted 11 June, 2024; v1 submitted 14 February, 2024;
originally announced February 2024.
-
Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers
Authors:
Junhan Kim,
Kyungphil Park,
Chungman Lee,
Ho-young Kim,
Joonyoung Kim,
Yongkweon Jeon
Abstract:
With the increasing complexity of generative AI models, post-training quantization (PTQ) has emerged as a promising solution for deploying hyper-scale models on edge devices such as mobile devices and TVs. Existing PTQ schemes, however, consume considerable time and resources, which could be a bottleneck in real situations where frequent model updates and multiple hyper-parameter tunings are requi…
▽ More
With the increasing complexity of generative AI models, post-training quantization (PTQ) has emerged as a promising solution for deploying hyper-scale models on edge devices such as mobile devices and TVs. Existing PTQ schemes, however, consume considerable time and resources, which could be a bottleneck in real situations where frequent model updates and multiple hyper-parameter tunings are required. As a cost-effective alternative, one-shot PTQ schemes have been proposed. Still, the performance is somewhat limited because they cannot consider the inter-layer dependency within the attention module, which is a very important feature of Transformers. In this paper, we thus propose a novel PTQ algorithm that balances accuracy and efficiency. The key idea of the proposed algorithm called aespa is to perform quantization layer-wise for efficiency while considering cross-layer dependency to preserve the attention score. Through extensive experiments on various language models and complexity analysis, we demonstrate that aespa is accurate and efficient in quantizing Transformer models.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
AINeedsPlanner: A Workbook to Support Effective Collaboration Between AI Experts and Clients
Authors:
Dae Hyun Kim,
Hyungyu Shin,
Shakhnozakhon Yadgarova,
**ho Son,
Hariharan Subramonyam,
Juho Kim
Abstract:
Clients often partner with AI experts to develop AI applications tailored to their needs. In these partnerships, careful planning and clear communication are critical, as inaccurate or incomplete specifications can result in misaligned model characteristics, expensive reworks, and potential friction between collaborators. Unfortunately, given the complexity of requirements ranging from functionali…
▽ More
Clients often partner with AI experts to develop AI applications tailored to their needs. In these partnerships, careful planning and clear communication are critical, as inaccurate or incomplete specifications can result in misaligned model characteristics, expensive reworks, and potential friction between collaborators. Unfortunately, given the complexity of requirements ranging from functionality, data, and governance, effective guidelines for collaborative specification of requirements in client-AI expert collaborations are missing. In this work, we introduce AINeedsPlanner, a workbook that AI experts and clients can use to facilitate effective interchange and clear specifications. The workbook is based on (1) an interview of 10 completed AI application project teams, which identifies and characterizes steps in AI application planning and (2) a study with 12 AI experts, which defines a taxonomy of AI experts' information needs and dimensions that affect the information needs. Finally, we demonstrate the workbook's utility with two case studies in real-world settings.
△ Less
Submitted 26 May, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
High-resolution spectroscopy of proximity superconductivity in finite-size quantized surface states
Authors:
Lucas Schneider,
Christian von Bredow,
Howon Kim,
Khai That Ton,
Torben Hänke,
Jens Wiebe,
Roland Wiesendanger
Abstract:
Adding superconducting (SC) electron pairing via the proximity effect to pristinely non-superconducting materials can lead to a variety of interesting physical phenomena. Particular interest has recently focused on inducing SC into two-dimensional surface states (SSs), potentially also combined with non-trivial topology. We study the mechanism of proximity-induced SC into the Shockley-type SSs of…
▽ More
Adding superconducting (SC) electron pairing via the proximity effect to pristinely non-superconducting materials can lead to a variety of interesting physical phenomena. Particular interest has recently focused on inducing SC into two-dimensional surface states (SSs), potentially also combined with non-trivial topology. We study the mechanism of proximity-induced SC into the Shockley-type SSs of the noble metals Ag(111) and Cu(111) grown on the elemental SC Nb(110) using scanning tunneling spectroscopy. The tunneling spectra exhibit an intriguing multitude of sharp states at low energies. Their appearance can be explained by Andreev bound states (ABS) formed by the weakly proximitized SSs subject to lateral finite-size confinement. We study systematically how the proximity gap in the bulk states of both Ag(111) and Cu(111) persists up to island thicknesses of several times the bulk coherence length of Nb. We find that even for thick islands, the SSs acquire a gap, with the gap size for Cu being consistently larger than for Ag. Based on this, we argue that the SC in the SS is not provided through direct overlap of the SS wavefunction with the SC host but can be understood to be mediated by step edges inducing electronic coupling to the bulk. Our work provides important input for the microscopic understanding of induced superconductivity in heterostructures and its spectral manifestation. Moreover, it lays the foundation for more complex SC heterostructures based on noble metals.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
DeepPolar: Inventing Nonlinear Large-Kernel Polar Codes via Deep Learning
Authors:
S Ashwin Hebbar,
Sravan Kumar Ankireddy,
Hyeji Kim,
Sewoong Oh,
Pramod Viswanath
Abstract:
Progress in designing channel codes has been driven by human ingenuity and, fittingly, has been sporadic. Polar codes, developed on the foundation of Arikan's polarization kernel, represent the latest breakthrough in coding theory and have emerged as the state-of-the-art error-correction code for short-to-medium block length regimes. In an effort to automate the invention of good channel codes, es…
▽ More
Progress in designing channel codes has been driven by human ingenuity and, fittingly, has been sporadic. Polar codes, developed on the foundation of Arikan's polarization kernel, represent the latest breakthrough in coding theory and have emerged as the state-of-the-art error-correction code for short-to-medium block length regimes. In an effort to automate the invention of good channel codes, especially in this regime, we explore a novel, non-linear generalization of Polar codes, which we call DeepPolar codes. DeepPolar codes extend the conventional Polar coding framework by utilizing a larger kernel size and parameterizing these kernels and matched decoders through neural networks. Our results demonstrate that these data-driven codes effectively leverage the benefits of a larger kernel size, resulting in enhanced reliability when compared to both existing neural codes and conventional Polar codes.
△ Less
Submitted 4 June, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
A demonstration of the effect of fringe-rate filtering in the Hydrogen Epoch of Reionization Array delay power spectrum pipeline
Authors:
Hugh Garsden,
Philip Bull,
Mike Wilensky,
Zuhra Abdurashidova,
Tyrone Adams,
James E. Aguirre,
Paul Alexander,
Zaki S. Ali,
Rushelle Baartman,
Yanga Balfour,
Adam P. Beardsley,
Lindsay M. Berkhout,
Gianni Bernardi,
Tashalee S. Billings,
Judd D. Bowman,
Richard F. Bradley,
Jacob Burba,
Steven Carey,
Chris L. Carilli,
Kai-Feng Chen,
Carina Cheng,
Samir Choudhuri,
David R. DeBoer,
Eloy de Lera Acedo,
Matt Dexter
, et al. (72 additional authors not shown)
Abstract:
Radio interferometers targeting the 21cm brightness temperature fluctuations at high redshift are subject to systematic effects that operate over a range of different timescales. These can be isolated by designing appropriate Fourier filters that operate in fringe-rate (FR) space, the Fourier pair of local sidereal time (LST). Applications of FR filtering include separating effects that are correl…
▽ More
Radio interferometers targeting the 21cm brightness temperature fluctuations at high redshift are subject to systematic effects that operate over a range of different timescales. These can be isolated by designing appropriate Fourier filters that operate in fringe-rate (FR) space, the Fourier pair of local sidereal time (LST). Applications of FR filtering include separating effects that are correlated with the rotating sky vs. those relative to the ground, down-weighting emission in the primary beam sidelobes, and suppressing noise. FR filtering causes the noise contributions to the visibility data to become correlated in time however, making interpretation of subsequent averaging and error estimation steps more subtle. In this paper, we describe fringe rate filters that are implemented using discrete prolate spheroidal sequences, and designed for two different purposes -- beam sidelobe/horizon suppression (the `mainlobe' filter), and ground-locked systematics removal (the `notch' filter). We apply these to simulated data, and study how their properties affect visibilities and power spectra generated from the simulations. Included is an introduction to fringe-rate filtering and a demonstration of fringe-rate filters applied to simple situations to aid understanding.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Collective biphoton temporal waveform of photon-pair generated from Doppler-broadened atomic ensemble
Authors:
Heewoo Kim,
Hansol Jeong,
Han Seb Moon
Abstract:
Photonic quantum states generated from atomic ensembles will play important roles in future quantum networks and long-distance quantum communication because their advantages, such as universal identity and narrow spectral bandwidth, are essential for quantum nodes and quantum repeaters based on atomic ensembles. In this study of the biphoton temporal waveform (BTW) of the photon pairs generated fr…
▽ More
Photonic quantum states generated from atomic ensembles will play important roles in future quantum networks and long-distance quantum communication because their advantages, such as universal identity and narrow spectral bandwidth, are essential for quantum nodes and quantum repeaters based on atomic ensembles. In this study of the biphoton temporal waveform (BTW) of the photon pairs generated from a cascade-type two-photon-transition, we report the collectively coherent superposition of biphoton wavefunction emitted from different velocity classes in a Doppler-broadened cascade-type atomic ensemble. We experimentally demonstrate that the three times difference of temporal width of both BTWs varies dependent on the wavelengths of the signal and idler photons from both 6S_{1/2}-6P_{3/2}-6D_{5/2} and -8S_{1/2} transitions of Cs, corresponding to the idler and signal wavelengths of 852 nm-917 nm and 852 nm-795 nm, respectively. Our results help understand the characteristics of biphoton sources from a warm atomic ensemble and can be applied to long-distance quantum networks and practical quantum repeaters based on atom-photon interactions.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
LLaVA-Docent: Instruction Tuning with Multimodal Large Language Model to Support Art Appreciation Education
Authors:
Unggi Lee,
Minji Jeon,
Yunseo Lee,
Gyuri Byun,
Yoorim Son,
Jaeyoon Shin,
Hongkyu Ko,
Hyeoncheol Kim
Abstract:
Art appreciation is vital in nurturing critical thinking and emotional intelligence among learners. However, traditional art appreciation education has often been hindered by limited access to art resources, especially for disadvantaged students, and an imbalanced emphasis on STEM subjects in mainstream education. In response to these challenges, recent technological advancements have paved the wa…
▽ More
Art appreciation is vital in nurturing critical thinking and emotional intelligence among learners. However, traditional art appreciation education has often been hindered by limited access to art resources, especially for disadvantaged students, and an imbalanced emphasis on STEM subjects in mainstream education. In response to these challenges, recent technological advancements have paved the way for innovative solutions. This study explores the application of multi-modal large language models (MLLMs) in art appreciation education, focusing on develo** LLaVA-Docent, a model that leverages these advancements. Our approach involved a comprehensive literature review and consultations with experts in the field, leading to develo** a robust data framework. Utilizing this framework, we generated a virtual dialogue dataset that was leveraged by GPT-4. This dataset was instrumental in training the MLLM, named LLaVA-Docent. Six researchers conducted quantitative and qualitative evaluations of LLaVA-Docent to assess its effectiveness, benchmarking it against the GPT-4 model in a few-shot setting. The evaluation process revealed distinct strengths and weaknesses of the LLaVA-Docent model. Our findings highlight the efficacy of LLaVA-Docent in enhancing the accessibility and engagement of art appreciation education. By harnessing the potential of MLLMs, this study makes a significant contribution to the field of art education, proposing a novel methodology that reimagines the way art appreciation is taught and experienced.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
The AGORA High-resolution Galaxy Simulations Comparison Project IV: Halo and Galaxy Mass Assembly in a Cosmological Zoom-in Simulation at $z\le2$
Authors:
Santi Roca-Fàbrega,
Ji-hoon Kim,
Joel R. Primack,
Minyong Jung,
Anna Genina,
Loic Hausammann,
Hyeonyong Kim,
Alessandro Lupi,
Kentaro Nagamine,
Johnny W. Powell,
Yves Revaz,
Ikkoh Shimizu,
Clayton Strawn,
Héctor Velázquez,
Tom Abel,
Daniel Ceverino,
Bili Dong,
Thomas R. Quinn,
Eun-** Shin,
Alvaro Segovia-Otero,
Oscar Agertz,
Kirk S. S. Barrow,
Corentin Cadiou,
Avishai Dekel,
Cameron Hummels
, et al. (3 additional authors not shown)
Abstract:
In this fourth paper from the AGORA Collaboration, we study the evolution down to redshift $z=2$ and below of a set of cosmological zoom-in simulations of a Milky Way mass galaxy by eight of the leading hydrodynamic simulation codes. We also compare this CosmoRun suite of simulations with dark matter-only simulations by the same eight codes. We analyze general properties of the halo and galaxy at…
▽ More
In this fourth paper from the AGORA Collaboration, we study the evolution down to redshift $z=2$ and below of a set of cosmological zoom-in simulations of a Milky Way mass galaxy by eight of the leading hydrodynamic simulation codes. We also compare this CosmoRun suite of simulations with dark matter-only simulations by the same eight codes. We analyze general properties of the halo and galaxy at $z=4$ and 3, and before the last major merger, focusing on the formation of well-defined rotationally-supported disks, the mass-metallicity relation, the specific star formation rate, the gas metallicity gradients, and the non-axisymmetric structures in the stellar disks. Codes generally converge well to the stellar-to-halo mass ratios predicted by semi-analytic models at $z\sim$2. We see that almost all the hydro codes develop rotationally-supported structures at low redshifts. Most agree within 0.5 dex with the observed MZR at high and intermediate redshifts, and reproduce the gas metallicity gradients obtained from analytical models and low-redshift observations. We confirm that the inter-code differences in the halo assembly history reported in the first paper of the collaboration also exist in CosmoRun, making the code-to-code comparison more difficult. We show that such differences are mainly due to variations in code-dependent parameters that control the time-step** strategy of the gravity solver. We find that variations in the early stellar feedback can also result in differences in the timing of the low-redshift mergers. All the simulation data down to $z=2$ and the auxiliary data will be made publicly available.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
Hybrid Neural Representations for Spherical Data
Authors:
Hyomin Kim,
Yunhui Jang,
Jaeho Lee,
Sungsoo Ahn
Abstract:
In this paper, we study hybrid neural representations for spherical data, a domain of increasing relevance in scientific research. In particular, our work focuses on weather and climate data as well as comic microwave background (CMB) data. Although previous studies have delved into coordinate-based neural representations for spherical signals, they often fail to capture the intricate details of h…
▽ More
In this paper, we study hybrid neural representations for spherical data, a domain of increasing relevance in scientific research. In particular, our work focuses on weather and climate data as well as comic microwave background (CMB) data. Although previous studies have delved into coordinate-based neural representations for spherical signals, they often fail to capture the intricate details of highly nonlinear signals. To address this limitation, we introduce a novel approach named Hybrid Neural Representations for Spherical data (HNeR-S). Our main idea is to use spherical feature-grids to obtain positional features which are combined with a multilayer perception to predict the target signal. We consider feature-grids with equirectangular and hierarchical equal area isolatitude pixelization structures that align with weather data and CMB data, respectively. We extensively verify the effectiveness of our HNeR-S for regression, super-resolution, temporal interpolation, and compression tasks.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Genetic-guided GFlowNets for Sample Efficient Molecular Optimization
Authors:
Hyeonah Kim,
Minsu Kim,
Sanghyeok Choi,
**kyoo Park
Abstract:
The challenge of discovering new molecules with desired properties is crucial in domains like drug discovery and material design. Recent advances in deep learning-based generative methods have shown promise but face the issue of sample efficiency due to the computational expense of evaluating the reward function. This paper proposes a novel algorithm for sample-efficient molecular optimization by…
▽ More
The challenge of discovering new molecules with desired properties is crucial in domains like drug discovery and material design. Recent advances in deep learning-based generative methods have shown promise but face the issue of sample efficiency due to the computational expense of evaluating the reward function. This paper proposes a novel algorithm for sample-efficient molecular optimization by distilling a powerful genetic algorithm into deep generative policy using GFlowNets training, the off-policy method for amortized inference. This approach enables the deep generative policy to learn from domain knowledge, which has been explicitly integrated into the genetic algorithm. Our method achieves state-of-the-art performance in the official molecular optimization benchmark, significantly outperforming previous methods. It also demonstrates effectiveness in designing inhibitors against SARS-CoV-2 with substantially fewer reward calls.
△ Less
Submitted 25 May, 2024; v1 submitted 4 February, 2024;
originally announced February 2024.
-
Unified Speech-Text Pretraining for Spoken Dialog Modeling
Authors:
Heeseung Kim,
Soonshin Seo,
Kyeongseok Jeong,
Ohsung Kwon,
Jungwhan Kim,
Jaehong Lee,
Eunwoo Song,
Myungwoo Oh,
Sungroh Yoon,
Kang Min Yoo
Abstract:
While recent work shows promising results in expanding the capabilities of large language models (LLM) to directly understand and synthesize speech, an LLM-based strategy for modeling spoken dialogs remains elusive and calls for further investigation. This work proposes an extensive speech-text LLM framework, named the Unified Spoken Dialog Model (USDM), to generate coherent spoken responses with…
▽ More
While recent work shows promising results in expanding the capabilities of large language models (LLM) to directly understand and synthesize speech, an LLM-based strategy for modeling spoken dialogs remains elusive and calls for further investigation. This work proposes an extensive speech-text LLM framework, named the Unified Spoken Dialog Model (USDM), to generate coherent spoken responses with organic prosodic features relevant to the given input speech without relying on automatic speech recognition (ASR) or text-to-speech (TTS) solutions. Our approach employs a multi-step speech-text inference scheme that leverages chain-of-reasoning capabilities exhibited by the underlying LLM. We also propose a generalized speech-text pretraining scheme that helps with capturing cross-modal semantics. Automatic and human evaluations show that the proposed approach is effective in generating natural-sounding spoken responses, outperforming both prior and cascaded baselines. Detailed comparative studies reveal that, despite the cascaded approach being stronger in individual components, the joint speech-text modeling improves robustness against recognition errors and speech quality. Demo is available at https://unifiedsdm.github.io.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
The AGORA High-resolution Galaxy Simulations Comparison Project. V: Satellite Galaxy Populations In A Cosmological Zoom-in Simulation of A Milky Way-mass Halo
Authors:
Minyong Jung,
Santi Roca-Fàbrega,
Ji-hoon Kim,
Anna Genina,
Loic Hausammann,
Hyeonyong Kim,
Alessandro Lupi,
Kentaro Nagamine,
Johnny W. Powell,
Yves Revaz,
Ikkoh Shimizu,
Héctor Velázquez,
Daniel Ceverino,
Joel R. Primack,
Thomas R. Quinn,
Clayton Strawn,
Tom Abel,
Avishai Dekel,
Bili Dong,
Boon Kiat Oh,
Romain Teyssier
Abstract:
We analyze and compare the satellite halo populations at $z\sim2$ in the high-resolution cosmological zoom-in simulations of a $10^{12}\,{\rm M}_{\odot}$ target halo ($z=0$ mass) carried out on eight widely-used astrophysical simulation codes ({\sc Art-I}, {\sc Enzo}, {\sc Ramses}, {\sc Changa}, {\sc Gadget-3}, {\sc Gear}, {\sc Arepo-t}, and {\sc Gizmo}) for the {\it AGORA} High-resolution Galaxy…
▽ More
We analyze and compare the satellite halo populations at $z\sim2$ in the high-resolution cosmological zoom-in simulations of a $10^{12}\,{\rm M}_{\odot}$ target halo ($z=0$ mass) carried out on eight widely-used astrophysical simulation codes ({\sc Art-I}, {\sc Enzo}, {\sc Ramses}, {\sc Changa}, {\sc Gadget-3}, {\sc Gear}, {\sc Arepo-t}, and {\sc Gizmo}) for the {\it AGORA} High-resolution Galaxy Simulations Comparison Project. We use slightly different redshift epochs near $z=2$ for each code (hereafter ``$z\sim2$') at which the eight simulations are in the same stage in the target halo's merger history. After identifying the matched pairs of halos between the {\it CosmoRun} simulations and the DMO simulations, we discover that each {\it CosmoRun} halo tends to be less massive than its DMO counterpart. When we consider only the halos containing stellar particles at $z\sim2$, the number of satellite {\it galaxies} is significantly fewer than that of dark matter halos in all participating {\it AGORA} simulations, and is comparable to the number of present-day satellites near the Milky Way or M31. The so-called ``missing satellite problem' is fully resolved across all participating codes simply by implementing the common baryonic physics adopted in {\it AGORA} and the stellar feedback prescription commonly used in each code, with sufficient numerical resolution ($\lesssim100$ proper pc at $z=2$). We also compare other properties such as the stellar mass$-$halo mass relation and the mass$-$metallicity relation. Our work highlights the value of comparison studies such as {\it AGORA}, where outstanding problems in galaxy formation theory are studied simultaneously on multiple numerical platforms.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
The AGORA High-resolution Galaxy Simulations Comparison Project. VI. Similarities and Differences in the Circumgalactic Medium
Authors:
Clayton Strawn,
Santi Roca-Fàbrega,
Joel R. Primack,
Ji-hoon Kim,
Anna Genina,
Loic Hausammann,
Hyeonyong Kim,
Alessandro Lupi,
Kentaro Nagamine,
Johnny W. Powell,
Yves Revaz,
Ikkoh Shimizu,
Héctor Velázquez,
Tom Abel,
Daniel Ceverino,
Bili Dong,
Minyong Jung,
Thomas R. Quinn,
Eun-** Shin,
Kirk S. S. Barrow,
Avishai Dekel,
Boon Kiat Oh,
Nir Mandelker,
Romain Teyssier,
Cameron Hummels
, et al. (4 additional authors not shown)
Abstract:
We analyze the circumgalactic medium (CGM) for eight commonly-used cosmological codes in the AGORA collaboration. The codes are calibrated to use identical initial conditions, cosmology, heating and cooling, and star formation thresholds, but each evolves with its own unique code architecture and stellar feedback implementation. Here, we analyze the results of these simulations in terms of the str…
▽ More
We analyze the circumgalactic medium (CGM) for eight commonly-used cosmological codes in the AGORA collaboration. The codes are calibrated to use identical initial conditions, cosmology, heating and cooling, and star formation thresholds, but each evolves with its own unique code architecture and stellar feedback implementation. Here, we analyze the results of these simulations in terms of the structure, composition, and phase dynamics of the CGM. We show properties such as metal distribution, ionization levels, and kinematics are effective tracers of the effects of the different code feedback and implementation methods, and as such they can be highly divergent between simulations. This is merely a fiducial set of models, against which we will in the future compare multiple feedback recipes for each code. Nevertheless, we find that the large parameter space these simulations establish can help disentangle the different variables that affect observable quantities in the CGM, e.g., showing that abundances for ions with higher ionization energy are more strongly determined by the simulation's metallicity, while abundances for ions with lower ionization energy are more strongly determined by the gas density and temperature.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Giant piezoelectricity in group IV monochalcogenides with ferroelectric AA layer stacking
Authors:
Seungjun Lee,
Hyeong-Ryul Kim,
Wei Jiang,
Young-Kyun Kwon,
Tony Low
Abstract:
The piezoelectricity of group IV monochalcogenides (MXs, with M = Ge, Sn and X = S, Se) has attracted much attention due to their substantially higher piezoelectric coefficients compared to other 2D materials. However, with increasing layer number, their piezoelectricity rapidly disappears due to the antiferroelectric stacking order, severely limiting their practical applications. Using first-prin…
▽ More
The piezoelectricity of group IV monochalcogenides (MXs, with M = Ge, Sn and X = S, Se) has attracted much attention due to their substantially higher piezoelectric coefficients compared to other 2D materials. However, with increasing layer number, their piezoelectricity rapidly disappears due to the antiferroelectric stacking order, severely limiting their practical applications. Using first-principles calculations, we investigated the piezoelectricity of MXs with the ferroelectric AA stacking configuration, which has recently been stabilized in experiments. We found that AA-stacked MXs have a ferroelectric ground state with the smallest lattice constant among other stacking configurations, resulting in a giant piezoelectric coefficient, which is the first demonstration of a strategy where the piezoelectric coefficients can increase with the number of layers. This can be attributed to a strong negative correlation between the lattice constant along the armchair direction and the piezoelectric coefficient, and spontaneous compressive strain stabilized in ferroelectric AA stacking configuration.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains
Authors:
Ashok Vardhan Makkuva,
Marco Bondaschi,
Adway Girish,
Alliot Nagle,
Martin Jaggi,
Hyeji Kim,
Michael Gastpar
Abstract:
In recent years, attention-based transformers have achieved tremendous success across a variety of disciplines including natural languages. A key ingredient behind their success is the generative pretraining procedure, during which these models are trained on a large text corpus in an auto-regressive manner. To shed light on this phenomenon, we propose a new framework that allows both theory and s…
▽ More
In recent years, attention-based transformers have achieved tremendous success across a variety of disciplines including natural languages. A key ingredient behind their success is the generative pretraining procedure, during which these models are trained on a large text corpus in an auto-regressive manner. To shed light on this phenomenon, we propose a new framework that allows both theory and systematic experiments to study the sequential modeling capabilities of transformers through the lens of Markov chains. Inspired by the Markovianity of natural languages, we model the data as a Markovian source and utilize this framework to systematically study the interplay between the data-distributional properties, the transformer architecture, the learnt distribution, and the final model performance. In particular, we theoretically characterize the loss landscape of single-layer transformers and show the existence of global minima and bad local minima contingent upon the specific data characteristics and the transformer architecture. Backed by experiments, we demonstrate that our theoretical findings are in congruence with the empirical results. We further investigate these findings in the broader context of higher order Markov chains and deeper architectures, and outline open problems in this arena. Code is available at \url{https://github.com/Bond1995/Markov}.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Saturation of fishbone instability through zonal flows driven by energetic particle transport in tokamak plasmas
Authors:
G. Brochard,
C. Liu,
X. Wei,
W. Heidbrink,
Z. Lin,
M. V. Falessi,
F. Zonca,
Z. Qiu,
N. Gorelenkov,
C. Chrystal,
X. Du,
J. Bao,
A. R. Polevoi,
M. Schneider,
S. H. Kim,
S. D. Pinches,
P. Liu,
J. H. Nicolau,
H. Lütjens,
the ISEP group
Abstract:
Gyrokinetic and kinetic-MHD simulations are performed for the fishbone instability in the DIII-D discharge #178631, chosen for validation of first-principles simulations to predict the energetic particle (EP) transport in an ITER prefusion baseline scenario. Fishbone modes are found to generate zonal flows, which dominate the fishbone saturation. The underlying mechanisms of the two-way fishbone-z…
▽ More
Gyrokinetic and kinetic-MHD simulations are performed for the fishbone instability in the DIII-D discharge #178631, chosen for validation of first-principles simulations to predict the energetic particle (EP) transport in an ITER prefusion baseline scenario. Fishbone modes are found to generate zonal flows, which dominate the fishbone saturation. The underlying mechanisms of the two-way fishbone-zonal flows nonlinear interplay are discussed in details. Numerical and analytical analyses identify the fishbone-induced EP redistribution as the dominant generation mechanism for zonal flows. The zonal flows modify the nonlinear dynamics of phase space zonal structures, which reduces the amount of EPs able to resonate with the mode, leading to an early fishbone saturation. Simulation results including zonal flows agree quantitatively with DIII-D experimental measurements of the fishbone saturation amplitude and EP transport, supporting this novel saturation mechanism by self-generated zonal flows. Moreover, the wave-particle mode-locking mechanism is shown to determine quantitatively the fishbone frequency down-chir**, as evident in GTC simulation results in agreement with predictions from analytical theory. Finally, the fishbone-induced zonal flows are possibly responsible for the formation of an ion-ITB in the DIII-D discharge. Based on the low EP transport and the large zonal flow shearing rates associated with the fishbone instability in gyrokinetic simulations of the ITER scenario, it is conjectured that high performance scenarios could be designed in ITER burning plasmas through fishbone-induced ITBs.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Knowledge Acquisition and Integration with Expert-in-the-loop
Authors:
Sajjadur Rahman,
Frederick Choi,
Hannah Kim,
Dan Zhang,
Estevam Hruschka
Abstract:
Constructing and serving knowledge graphs (KGs) is an iterative and human-centered process involving on-demand programming and analysis. In this paper, we present Kyurem, a programmable and interactive widget library that facilitates human-in-the-loop knowledge acquisition and integration to enable continuous curation a knowledge graph (KG). Kyurem provides a seamless environment within computatio…
▽ More
Constructing and serving knowledge graphs (KGs) is an iterative and human-centered process involving on-demand programming and analysis. In this paper, we present Kyurem, a programmable and interactive widget library that facilitates human-in-the-loop knowledge acquisition and integration to enable continuous curation a knowledge graph (KG). Kyurem provides a seamless environment within computational notebooks where data scientists explore a KG to identify opportunities for acquiring new knowledge and verify recommendations provided by AI agents for integrating the acquired knowledge in the KG. We refined Kyurem through participatory design and conducted case studies in a real-world setting for evaluation. The case-studies show that introduction of Kyurem within an existing HR knowledge graph construction and serving platform improved the user experience of the experts and helped eradicate inefficiencies related to knowledge acquisition and integration tasks
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models
Authors:
Anthony Sicilia,
Hyunwoo Kim,
Khyathi Raghavi Chandu,
Malihe Alikhani,
Jack Hessel
Abstract:
Effective interlocutors account for the uncertain goals, beliefs, and emotions of others. But even the best human conversationalist cannot perfectly anticipate the trajectory of a dialogue. How well can language models represent inherent uncertainty in conversations? We propose FortUne Dial, an expansion of the long-standing "conversation forecasting" task: instead of just accuracy, evaluation is…
▽ More
Effective interlocutors account for the uncertain goals, beliefs, and emotions of others. But even the best human conversationalist cannot perfectly anticipate the trajectory of a dialogue. How well can language models represent inherent uncertainty in conversations? We propose FortUne Dial, an expansion of the long-standing "conversation forecasting" task: instead of just accuracy, evaluation is conducted with uncertainty-aware metrics, effectively enabling abstention on individual instances. We study two ways in which language models potentially represent outcome uncertainty (internally, using scores and directly, using tokens) and propose fine-tuning strategies to improve calibration of both representations. Experiments on eight difficult negotiation corpora demonstrate that our proposed fine-tuning strategies (a traditional supervision strategy and an off-policy reinforcement learning strategy) can calibrate smaller open-source models to compete with pre-trained models 10x their size.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
UniHENN: Designing More Versatile Homomorphic Encryption-based CNNs without im2col
Authors:
Hyunmin Choi,
Jihun Kim,
Seungho Kim,
Seonhye Park,
Jeongyong Park,
Wonbin Choi,
Hyoungshick Kim
Abstract:
Homomorphic encryption enables computations on encrypted data without decryption, which is crucial for privacy-preserving cloud services. However, deploying convolutional neural networks (CNNs) with homomorphic encryption encounters significant challenges, particularly in converting input data into a two-dimensional matrix for convolution, typically achieved using the im2col technique. While effic…
▽ More
Homomorphic encryption enables computations on encrypted data without decryption, which is crucial for privacy-preserving cloud services. However, deploying convolutional neural networks (CNNs) with homomorphic encryption encounters significant challenges, particularly in converting input data into a two-dimensional matrix for convolution, typically achieved using the im2col technique. While efficient, this method limits the variety of deployable CNN models due to compatibility constraints with the encrypted data structure. UniHENN, a homomorphic encryption-based CNN architecture, eliminates the need for im2col, ensuring compatibility with a diverse range of CNN models using homomorphic encryption. Our experiments demonstrate that UniHENN surpasses the leading 2D CNN inference architecture, PyCrCNN, in inference time, as evidenced by its performance on the LeNet-1 dataset, where it averages 30.090 seconds--significantly faster than PyCrCNN's 794.064 seconds. Furthermore, UniHENN outperforms TenSEAL, which employs im2col, in processing concurrent images, an essential feature for high-demand cloud applications. The versatility of UniHENN is proven across various CNN architectures, including 1D and six different 2D CNNs, highlighting its flexibility and efficiency. These qualities establish UniHENN as a promising solution for privacy-preserving, cloud-based CNN services, addressing the increasing demand for scalable, secure, and efficient deep learning in cloud computing environments.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Retrieval-Augmented Score Distillation for Text-to-3D Generation
Authors:
Junyoung Seo,
Susung Hong,
Wooseok Jang,
Inès Hyeonsu Kim,
Minseop Kwak,
Doyup Lee,
Seungryong Kim
Abstract:
Text-to-3D generation has achieved significant success by incorporating powerful 2D diffusion models, but insufficient 3D prior knowledge also leads to the inconsistency of 3D geometry. Recently, since large-scale multi-view datasets have been released, fine-tuning the diffusion model on the multi-view datasets becomes a mainstream to solve the 3D inconsistency problem. However, it has confronted…
▽ More
Text-to-3D generation has achieved significant success by incorporating powerful 2D diffusion models, but insufficient 3D prior knowledge also leads to the inconsistency of 3D geometry. Recently, since large-scale multi-view datasets have been released, fine-tuning the diffusion model on the multi-view datasets becomes a mainstream to solve the 3D inconsistency problem. However, it has confronted with fundamental difficulties regarding the limited quality and diversity of 3D data, compared with 2D data. To sidestep these trade-offs, we explore a retrieval-augmented approach tailored for score distillation, dubbed ReDream. We postulate that both expressiveness of 2D diffusion models and geometric consistency of 3D assets can be fully leveraged by employing the semantically relevant assets directly within the optimization process. To this end, we introduce novel framework for retrieval-based quality enhancement in text-to-3D generation. We leverage the retrieved asset to incorporate its geometric prior in the variational objective and adapt the diffusion model's 2D prior toward view consistency, achieving drastic improvements in both geometry and fidelity of generated scenes. We conduct extensive experiments to demonstrate that ReDream exhibits superior quality with increased geometric consistency. Project page is available at https://ku-cvlab.github.io/ReDream/.
△ Less
Submitted 2 May, 2024; v1 submitted 5 February, 2024;
originally announced February 2024.
-
Minimal grid diagrams of the prime knots with crossing number 13 and arc index 13
Authors:
Hwa Jeong Lee,
Yoonsang Lee,
Chanmin Lee,
Yeseo Park,
Hun Kim,
Gyo Taek **
Abstract:
We give a list of minimal grid diagrams of the 13 crossing prime nonalternating knots which have arc index 13. There are 9,988 prime knots with crossing number 13. Among them 4,878 are alternating and have arc index 15. Among the other nonalternating knots, 49, 399, 1,412 and 3,250 have arc index 10, 11, 12, and 13, respectively. We used the Dowker-Thistlethwaite code of the 3,250 knots provided b…
▽ More
We give a list of minimal grid diagrams of the 13 crossing prime nonalternating knots which have arc index 13. There are 9,988 prime knots with crossing number 13. Among them 4,878 are alternating and have arc index 15. Among the other nonalternating knots, 49, 399, 1,412 and 3,250 have arc index 10, 11, 12, and 13, respectively. We used the Dowker-Thistlethwaite code of the 3,250 knots provided by the program Knotscape to generate spanning trees of the corresponding knot diagrams to obtain minimal arc presentations in the form of grid diagrams.
△ Less
Submitted 4 February, 2024;
originally announced February 2024.
-
Search for a heavy neutral lepton that mixes predominantly with the tau neutrino
Authors:
Belle Collaboration,
M. Nayak,
S. Dey,
A. Soffer,
I. Adachi,
H. Aihara,
S. Al Said,
D. M. Asner,
H. Atmacan,
R. Ayad,
V. Babu,
Sw. Banerjee,
M. Bauer,
P. Behera,
K. Belous,
M. Bessner,
V. Bhardwaj,
B. Bhuyan,
T. Bilka,
D. Biswas,
A. Bobrov,
D. Bodrov,
M. Bračko,
P. Branchini,
T. E. Browder
, et al. (143 additional authors not shown)
Abstract:
We report a search for a heavy neutral lepton (HNL) that mixes predominantly with $ν_τ$. The search utilizes data collected with the Belle detector at the KEKB asymmetric energy $e^+ e^-$ collider. The data sample was collected at and just below the center-of-mass energies of the $Υ(4S)$ and $Υ(5S)$ resonances and has an integrated luminosity of $915~\textrm{fb}^{-1}$, corresponding to…
▽ More
We report a search for a heavy neutral lepton (HNL) that mixes predominantly with $ν_τ$. The search utilizes data collected with the Belle detector at the KEKB asymmetric energy $e^+ e^-$ collider. The data sample was collected at and just below the center-of-mass energies of the $Υ(4S)$ and $Υ(5S)$ resonances and has an integrated luminosity of $915~\textrm{fb}^{-1}$, corresponding to $(836\pm 12)\times 10^6$ $e^+e^\toτ^+τ^-$ events. We search for production of the HNL (denoted $N$) in the decay $τ^-\to π^- N$ followed by its decay via $N \to μ^+μ^- ν_τ$. The search focuses on the parameter-space region in which the HNL is long lived, so that the $μ^+μ^-$ originate from a common vertex that is significantly displaced from the collision point of the KEKB beams. Consistent with the expected background yield, one event is observed in the data sample after application of all the event-selection criteria. We report limits on the mixing parameter of the HNL with the $τ$ neutrino as a function of the HNL mass.
△ Less
Submitted 14 June, 2024; v1 submitted 4 February, 2024;
originally announced February 2024.
-
Breaking MLPerf Training: A Case Study on Optimizing BERT
Authors:
Yongdeok Kim,
Jaehyung Ahn,
Myeongwoo Kim,
Changin Choi,
Heejae Kim,
Narankhuu Tuvshinjargal,
Seungwon Lee,
Yanzi Zhang,
Yuan Pei,
Xiongzhan Linghu,
**gkun Ma,
Lin Chen,
Yuehua Dai,
Sungjoo Yoo
Abstract:
Speeding up the large-scale distributed training is challenging in that it requires improving various components of training including load balancing, communication, optimizers, etc. We present novel approaches for fast large-scale training of BERT model which individually ameliorates each component thereby leading to a new level of BERT training performance. Load balancing is imperative in distri…
▽ More
Speeding up the large-scale distributed training is challenging in that it requires improving various components of training including load balancing, communication, optimizers, etc. We present novel approaches for fast large-scale training of BERT model which individually ameliorates each component thereby leading to a new level of BERT training performance. Load balancing is imperative in distributed BERT training since its training datasets are characterized by samples with various lengths. Communication cost, which is proportional to the scale of distributed training, needs to be hidden by useful computation. In addition, the optimizers, e.g., ADAM, LAMB, etc., need to be carefully re-evaluated in the context of large-scale distributed training. We propose two new ideas, (1) local presorting based on dataset stratification for load balancing and (2) bucket-wise gradient clip** before allreduce which allows us to benefit from the overlap of gradient computation and synchronization as well as the fast training of gradient clip** before allreduce. We also re-evaluate existing optimizers via hyperparameter optimization and utilize ADAM, which also contributes to fast training via larger batches than existing methods. Our proposed methods, all combined, give the fastest MLPerf BERT training of 25.1 (22.3) seconds on 1,024 NVIDIA A100 GPUs, which is 1.33x (1.13x) and 1.57x faster than the other top two (one) submissions to MLPerf v1.1 (v2.0). Our implementation and evaluation results are available at MLPerf v1.1~v2.1.
△ Less
Submitted 4 February, 2024;
originally announced February 2024.
-
Interference-Aware Emergent Random Access Protocol for Downlink LEO Satellite Networks
Authors:
Chang-Yong Lim,
Jihong Park,
**ho Choi,
Ju-Hyung Lee,
Daesub Oh,
Heewook Kim
Abstract:
In this article, we propose a multi-agent deep reinforcement learning (MADRL) framework to train a multiple access protocol for downlink low earth orbit (LEO) satellite networks. By improving the existing learned protocol, emergent random access channel (eRACH), our proposed method, coined centralized and compressed emergent signaling for eRACH (Ce2RACH), can mitigate inter-satellite interference…
▽ More
In this article, we propose a multi-agent deep reinforcement learning (MADRL) framework to train a multiple access protocol for downlink low earth orbit (LEO) satellite networks. By improving the existing learned protocol, emergent random access channel (eRACH), our proposed method, coined centralized and compressed emergent signaling for eRACH (Ce2RACH), can mitigate inter-satellite interference by exchanging additional signaling messages jointly learned through the MADRL training process. Simulations demonstrate that Ce2RACH achieves up to 36.65% higher network throughput compared to eRACH, while the cost of signaling messages increase linearly with the number of users.
△ Less
Submitted 4 February, 2024;
originally announced February 2024.
-
EBV: Electronic Bee-Veterinarian for Principled Mining and Forecasting of Honeybee Time Series
Authors:
Mst. Shamima Hossain,
Christos Faloutsos,
Boris Baer,
Hyoseung Kim,
Vassilis J. Tsotras
Abstract:
Honeybees are vital for pollination and food production. Among many factors, extreme temperature (e.g., due to climate change) is particularly dangerous for bee health. Anticipating such extremities would allow beekeepers to take early preventive action. Thus, given sensor (temperature) time series data from beehives, how can we find patterns and do forecasting? Forecasting is crucial as it helps…
▽ More
Honeybees are vital for pollination and food production. Among many factors, extreme temperature (e.g., due to climate change) is particularly dangerous for bee health. Anticipating such extremities would allow beekeepers to take early preventive action. Thus, given sensor (temperature) time series data from beehives, how can we find patterns and do forecasting? Forecasting is crucial as it helps spot unexpected behavior and thus issue warnings to the beekeepers. In that case, what are the right models for forecasting? ARIMA, RNNs, or something else?
We propose the EBV (Electronic Bee-Veterinarian) method, which has the following desirable properties: (i) principled: it is based on a) diffusion equations from physics and b) control theory for feedback-loop controllers; (ii) effective: it works well on multiple, real-world time sequences, (iii) explainable: it needs only a handful of parameters (e.g., bee strength) that beekeepers can easily understand and trust, and (iv) scalable: it performs linearly in time. We applied our method to multiple real-world time sequences, and found that it yields accurate forecasting (up to 49% improvement in RMSE compared to baselines), and segmentation. Specifically, discontinuities detected by EBV mostly coincide with domain expert's opinions, showcasing our approach's potential and practical feasibility. Moreover, EBV is scalable and fast, taking about 20 minutes on a stock laptop for reconstructing two months of sensor data.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Amorphous Boron Nitride as a Diffusion Barrier to Cu Atoms
Authors:
Onurcan Kaya,
Hyeongjoon Kim,
Byeongkyu Kim,
Luigi Colombo,
Hyeon-** Shin,
Ivan Cole,
Hyeon Suk Shin,
Stephan Roche
Abstract:
This study focuses on amorphous boron nitride ($\rm α$-BN) as a novel diffusion barrier for advanced semiconductor technology, particularly addressing the critical challenge of copper diffusion in back-end-of-logic (BEOL) interconnects. Owing to its ultralow dielectric constant and robust barrier properties, $\rm α$-BN is examined as an alternative to conventional low-k dielectrics. The investigat…
▽ More
This study focuses on amorphous boron nitride ($\rm α$-BN) as a novel diffusion barrier for advanced semiconductor technology, particularly addressing the critical challenge of copper diffusion in back-end-of-logic (BEOL) interconnects. Owing to its ultralow dielectric constant and robust barrier properties, $\rm α$-BN is examined as an alternative to conventional low-k dielectrics. The investigation primarily employs theoretical modeling, using a Gaussian Approximation Potential, to simulate and understand the atomic-level interactions and barrier mechanisms of $\rm α$-BN. This machine learning-based approach allows for realistic simulations of its amorphous structure, enabling the exploration of the impact of different film morphologies on barrier efficacy. Complementing the theoretical study, experimental analyses are conducted on Plasma-Enhanced Chemical Vapor Deposition (PECVD) grown $\rm α$-BN films, evaluating their effectiveness in preventing copper diffusion in silicon-based substrates. The results from both the theoretical and experimental investigations highlight the potential of $\rm α$-BN as a highly effective diffusion barrier, suitable for integration in nanoelectronics. This research not only proposes $\rm α$-BN as a promising candidate for BEOL interconnects but also demonstrates the synergy of advanced computational models and experimental methods in material innovation for semiconductor applications.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Large Language Models as Hyper-Heuristics for Combinatorial Optimization
Authors:
Haoran Ye,
Jiarui Wang,
Zhiguang Cao,
Federico Berto,
Chuanbo Hua,
Haeyeon Kim,
**kyoo Park,
Guojie Song
Abstract:
The omnipresence of NP-hard combinatorial optimization problems (COPs) compels domain experts to engage in trial-and-error heuristic design. The long-standing endeavor of design automation has gained new momentum with the rise of large language models (LLMs). This paper introduces Language Hyper-Heuristics (LHHs), an emerging variant of Hyper-Heuristics that leverages LLMs for heuristic generation…
▽ More
The omnipresence of NP-hard combinatorial optimization problems (COPs) compels domain experts to engage in trial-and-error heuristic design. The long-standing endeavor of design automation has gained new momentum with the rise of large language models (LLMs). This paper introduces Language Hyper-Heuristics (LHHs), an emerging variant of Hyper-Heuristics that leverages LLMs for heuristic generation, featuring minimal manual intervention and open-ended heuristic spaces. To empower LHHs, we present Reflective Evolution (ReEvo), a novel integration of evolutionary search for efficiently exploring the heuristic space, and LLM reflections to provide verbal gradients within the space. Across five heterogeneous algorithmic types, six different COPs, and both white-box and black-box views of COPs, ReEvo yields state-of-the-art and competitive meta-heuristics, evolutionary algorithms, heuristics, and neural solvers, while being more sample-efficient than prior LHHs. Our code is available: https://github.com/ai4co/LLM-as-HH.
△ Less
Submitted 20 May, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions
Authors:
Hansung Kim,
Siddharth H. Nair,
Francesco Borrelli
Abstract:
We propose a hierarchical architecture designed for scalable real-time Model Predictive Control (MPC) in complex, multi-modal traffic scenarios. This architecture comprises two key components: 1) RAID-Net, a novel attention-based Recurrent Neural Network that predicts relevant interactions along the MPC prediction horizon between the autonomous vehicle and the surrounding vehicles using Lagrangian…
▽ More
We propose a hierarchical architecture designed for scalable real-time Model Predictive Control (MPC) in complex, multi-modal traffic scenarios. This architecture comprises two key components: 1) RAID-Net, a novel attention-based Recurrent Neural Network that predicts relevant interactions along the MPC prediction horizon between the autonomous vehicle and the surrounding vehicles using Lagrangian duality, and 2) a reduced Stochastic MPC problem that eliminates irrelevant collision avoidance constraints, enhancing computational efficiency. Our approach is demonstrated in a simulated traffic intersection with interactive surrounding vehicles, showcasing a 12x speed-up in solving the motion planning problem. A video demonstrating the proposed architecture in multiple complex traffic scenarios can be found here: https://youtu.be/-pRiOnPb9_c. GitHub: https://github.com/MPC-Berkeley/hmpc_raidnet
△ Less
Submitted 2 June, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Axion Dark Matter from Cosmic String Network
Authors:
Heejoo Kim,
Junghyeon Park,
Minho Son
Abstract:
We perform the lattice simulation to estimate the axion dark matter abundance radiated from the global cosmic strings in the post-inflationary scenario. The independent numerical confirmation on the recently observed logarithmic growth in both the number of strings per Hubble patch and the spectral index of the power law scaling for the axion spectrum is reported. These logarithmic scalings are ch…
▽ More
We perform the lattice simulation to estimate the axion dark matter abundance radiated from the global cosmic strings in the post-inflationary scenario. The independent numerical confirmation on the recently observed logarithmic growth in both the number of strings per Hubble patch and the spectral index of the power law scaling for the axion spectrum is reported. These logarithmic scalings are checked against two different prescriptions for generating initial random field configurations, namely fat-string type and thermal phase transition. We discuss a possible strong correlation between the axion spectrum and the string evolutions with different initial conditions to support the insensitivity of scaling behaviors against different initial data and we provide a qualitative understanding of it. The impact of various combinations of the power law of the axion spectrum, nonlinearities around the QCD scale, and average inter-string distances on the axion abundance is discussed. Additionally, we introduce a new novel string identification method, based on the tetrahedralization of the space, which guarantees the connectedness of the strings and provides a convenient way of assigning the core location. Finally we derive the lower bound on the axion mass.
△ Less
Submitted 8 February, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Legged Robot State Estimation With Invariant Extended Kalman Filter Using Neural Measurement Network
Authors:
Donghoon Youm,
Hyunsik Oh,
Suyoung Choi,
Hyeongjun Kim,
Jemin Hwangbo
Abstract:
This paper introduces a novel proprioceptive state estimator for legged robots that combines model-based filters and deep neural networks. Recent studies have shown that neural networks such as multi-layer perceptron or recurrent neural networks can estimate the robot states, including contact probability and linear velocity. Inspired by this, we develop a state estimation framework that integrate…
▽ More
This paper introduces a novel proprioceptive state estimator for legged robots that combines model-based filters and deep neural networks. Recent studies have shown that neural networks such as multi-layer perceptron or recurrent neural networks can estimate the robot states, including contact probability and linear velocity. Inspired by this, we develop a state estimation framework that integrates a neural measurement network (NMN) with an invariant extended Kalman filter. We show that our framework improves estimation performance in various terrains. Existing studies that combine model-based filters and learning-based approaches typically use real-world data. However, our approach relies solely on simulation data, as it allows us to easily obtain extensive data. This difference leads to a gap between the learning and the inference domain, commonly referred to as a sim-to-real gap. We address this challenge by adapting existing learning techniques and regularization. To validate our proposed method, we conduct experiments using a quadruped robot on four types of terrain: \textit{flat}, \textit{debris}, \textit{soft}, and \textit{slippery}. We observe that our approach significantly reduces position drift compared to the existing model-based state estimator.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Securing Cloud-Based Internet of Things: Challenges and Mitigations
Authors:
Nivedita Singh,
Rajkumar Buyya,
Hyoungshich Kim
Abstract:
The Internet of Things (IoT) has seen remarkable advancements in recent years, leading to a paradigm shift in the digital landscape. However, these technological strides have also brought new challenges, particularly in terms of cybersecurity. IoT devices are inherently connected to the internet, which makes them more vulnerable to attack. In addition, IoT services often handle sensitive user data…
▽ More
The Internet of Things (IoT) has seen remarkable advancements in recent years, leading to a paradigm shift in the digital landscape. However, these technological strides have also brought new challenges, particularly in terms of cybersecurity. IoT devices are inherently connected to the internet, which makes them more vulnerable to attack. In addition, IoT services often handle sensitive user data, which could be misused by malicious actors or unauthorized service providers. As more mainstream service providers emerge without uniform regulations, these security risks are expected to escalate exponentially. The task of maintaining the security of IoT devices while they interact with cloud services is also challenging. Newer IoT services, especially those developed and deployed via Platform-as-a-Service (PaaS) and Infrastructure-as-a-Service (IaaS) models, pose additional security threats. Although IoT devices are becoming more affordable and ubiquitous, their growing complexity could expose users to heightened security and privacy risks. This paper highlights these pressing security concerns associated with the widespread adoption of IoT devices and services. We propose potential solutions to bridge the existing security gaps and expect future challenges. Our approach entails a comprehensive exploration of the key security challenges that IoT services are currently facing. We also suggest proactive strategies to mitigate these risks, strengthening the overall security of IoT devices and services.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Erie: A Declarative Grammar for Data Sonification
Authors:
Hyeok Kim,
Yea-Seul Kim,
Jessica Hullman
Abstract:
Data sonification-map** data variables to auditory variables, such as pitch or volume-is used for data accessibility, scientific exploration, and data-driven art (e.g., museum exhibitions) among others. While a substantial amount of research has been made on effective and intuitive sonification design, software support is not commensurate, limiting researchers from fully exploring its capabiliti…
▽ More
Data sonification-map** data variables to auditory variables, such as pitch or volume-is used for data accessibility, scientific exploration, and data-driven art (e.g., museum exhibitions) among others. While a substantial amount of research has been made on effective and intuitive sonification design, software support is not commensurate, limiting researchers from fully exploring its capabilities. We contribute Erie, a declarative grammar for data sonification, that enables abstractly expressing auditory map**s. Erie supports specifying extensible tone designs (e.g., periodic wave, sampling, frequency/amplitude modulation synthesizers), various encoding channels, auditory legends, and composition options like sequencing and overlaying. Using standard Web Audio and Web Speech APIs, we provide an Erie compiler for web environments. We demonstrate the expressiveness and feasibility of Erie by replicating research prototypes presented by prior work and provide a sonification design gallery. We discuss future steps to extend Erie toward other audio computing environments and support interactive data sonification.
△ Less
Submitted 8 February, 2024; v1 submitted 31 January, 2024;
originally announced February 2024.
-
Bayesian Optimization with Noise-Free Observations: Improved Regret Bounds via Random Exploration
Authors:
Hwanwoo Kim,
Daniel Sanz-Alonso
Abstract:
This paper studies Bayesian optimization with noise-free observations. We introduce new algorithms rooted in scattered data approximation that rely on a random exploration step to ensure that the fill-distance of query points decays at a near-optimal rate. Our algorithms retain the ease of implementation of the classical GP-UCB algorithm and satisfy cumulative regret bounds that nearly match those…
▽ More
This paper studies Bayesian optimization with noise-free observations. We introduce new algorithms rooted in scattered data approximation that rely on a random exploration step to ensure that the fill-distance of query points decays at a near-optimal rate. Our algorithms retain the ease of implementation of the classical GP-UCB algorithm and satisfy cumulative regret bounds that nearly match those conjectured in arXiv:2002.05096, hence solving a COLT open problem. Furthermore, the new algorithms outperform GP-UCB and other popular Bayesian optimization strategies in several examples.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
Characterization of Magnetic Labyrinthine Structures through Junctions and Terminals Detection Using Template Matching and CNN
Authors:
Vinícius Yu Okubo,
Kotaro Shimizu,
B. S. Shivaram,
Hae Yong Kim
Abstract:
Defects influence diverse properties of materials, sha** their structural, mechanical, and electronic characteristics. Among a variety of materials exhibiting unique defects, magnets exhibit diverse nano- to micro-scale defects and have been intensively studied in materials science. Specifically, defects in magnetic labyrinthine patterns, called junctions and terminals, serve as the canonical ta…
▽ More
Defects influence diverse properties of materials, sha** their structural, mechanical, and electronic characteristics. Among a variety of materials exhibiting unique defects, magnets exhibit diverse nano- to micro-scale defects and have been intensively studied in materials science. Specifically, defects in magnetic labyrinthine patterns, called junctions and terminals, serve as the canonical targets of the research. While detecting and characterizing such defects is crucial for understanding magnets, systematically investigating large-scale images containing over a thousand closely packed junctions and terminals remains a formidable challenge. This study introduces a new technique called TM-CNN (Template Matching - Convolutional Neural Network) designed to detect a multitude of small objects in images, such as the defects in magnetic labyrinthine patterns. TM-CNN was used to identify 641,649 such structures in 444 experimental images, and the results were explored to deepen understanding of magnetic materials. It employs a two-stage detection approach combining template matching, used in initial detection, with a convolutional neural network, used to eliminate incorrect identifications. To train a CNN classifier, it is necessary to annotate a large number of training images.This difficulty prevents the use of CNN in many practical applications. TM-CNN significantly reduces the manual workload for creating training images by automatically making most of the annotations and leaving only a small number of corrections to human reviewers. In testing, TM-CNN achieved an impressive F1 score of 0.991, far outperforming traditional template matching and CNN-based object detection algorithms.
△ Less
Submitted 16 May, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
Unleashing the Power of Preemptive Priority-based Scheduling for Real-Time GPU Tasks
Authors:
Yidi Wang,
Cong Liu,
Daniel Wong,
Hyoseung Kim
Abstract:
Scheduling real-time tasks that utilize GPUs with analyzable guarantees poses a significant challenge due to the intricate interaction between CPU and GPU resources, as well as the complex GPU hardware and software stack. While much research has been conducted in the real-time research community, several limitations persist, including the absence or limited availability of preemption, extended blo…
▽ More
Scheduling real-time tasks that utilize GPUs with analyzable guarantees poses a significant challenge due to the intricate interaction between CPU and GPU resources, as well as the complex GPU hardware and software stack. While much research has been conducted in the real-time research community, several limitations persist, including the absence or limited availability of preemption, extended blocking times, and/or the need for extensive modifications to program code. In this paper, we propose two novel techniques, namely the kernel thread and IOCTL-based approaches, to enable preemptive priority-based scheduling for real-time GPU tasks. Our approaches exert control over GPU context scheduling at the device driver level and enable preemptive GPU scheduling based on task priorities. The kernel thread-based approach achieves this without requiring modifications to user-level programs, while the IOCTL-based approach needs only a single macro at the boundaries of GPU access segments. In addition, we provide a comprehensive response time analysis that takes into account overlaps between different task segments, mitigating pessimism in worst-case estimates. Through empirical evaluations and case studies, we demonstrate the effectiveness of the proposed approaches in improving taskset schedulability and timeliness of real-time tasks. The results highlight significant improvements over prior work, with up to 40\% higher schedulability, while also achieving predictable worst-case behavior on Nvidia Jetson embedded platforms.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
ReTaSA: A Nonparametric Functional Estimation Approach for Addressing Continuous Target Shift
Authors:
Hwanwoo Kim,
Xin Zhang,
Jiwei Zhao,
Qinglong Tian
Abstract:
The presence of distribution shifts poses a significant challenge for deploying modern machine learning models in real-world applications. This work focuses on the target shift problem in a regression setting (Zhang et al., 2013; Nguyen et al., 2016). More specifically, the target variable y (also known as the response variable), which is continuous, has different marginal distributions in the tra…
▽ More
The presence of distribution shifts poses a significant challenge for deploying modern machine learning models in real-world applications. This work focuses on the target shift problem in a regression setting (Zhang et al., 2013; Nguyen et al., 2016). More specifically, the target variable y (also known as the response variable), which is continuous, has different marginal distributions in the training source and testing domain, while the conditional distribution of features x given y remains the same. While most literature focuses on classification tasks with finite target space, the regression problem has an infinite dimensional target space, which makes many of the existing methods inapplicable. In this work, we show that the continuous target shift problem can be addressed by estimating the importance weight function from an ill-posed integral equation. We propose a nonparametric regularized approach named ReTaSA to solve the ill-posed integral equation and provide theoretical justification for the estimated importance weight function. The effectiveness of the proposed method has been demonstrated with extensive numerical studies on synthetic and real-world datasets.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Iterative assembly of $^{171}$Yb atom arrays with cavity-enhanced optical lattices
Authors:
M. A. Norcia,
H. Kim,
W. B. Cairncross,
M. Stone,
A. Ryou,
M. Jaffe,
M. O. Brown,
K. Barnes,
P. Battaglino,
T. C. Bohdanowicz,
A. Brown,
K. Cassella,
C. -A. Chen,
R. Coxe,
D. Crow,
J. Epstein,
C. Griger,
E. Halperin,
F. Hummel,
A. M. W. Jones,
J. M. Kindem,
J. King,
K. Kotru,
J. Lauigan,
M. Li
, et al. (25 additional authors not shown)
Abstract:
Assembling and maintaining large arrays of individually addressable atoms is a key requirement for continued scaling of neutral-atom-based quantum computers and simulators. In this work, we demonstrate a new paradigm for assembly of atomic arrays, based on a synergistic combination of optical tweezers and cavity-enhanced optical lattices, and the incremental filling of a target array from a repeti…
▽ More
Assembling and maintaining large arrays of individually addressable atoms is a key requirement for continued scaling of neutral-atom-based quantum computers and simulators. In this work, we demonstrate a new paradigm for assembly of atomic arrays, based on a synergistic combination of optical tweezers and cavity-enhanced optical lattices, and the incremental filling of a target array from a repetitively filled reservoir. In this protocol, the tweezers provide microscopic rearrangement of atoms, while the cavity-enhanced lattices enable the creation of large numbers of optical traps with sufficient depth for rapid low-loss imaging of atoms. We apply this protocol to demonstrate near-deterministic filling (99% per-site occupancy) of 1225-site arrays of optical traps. Because the reservoir is repeatedly filled with fresh atoms, the array can be maintained in a filled state indefinitely. We anticipate that this protocol will be compatible with mid-circuit reloading of atoms into a quantum processor, which will be a key capability for running large-scale error-corrected quantum computations whose durations exceed the lifetime of a single atom in the system.
△ Less
Submitted 18 June, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
Meta-Learning for Neural Network-based Temporal Point Processes
Authors:
Yoshiaki Takimoto,
Yusuke Tanaka,
Tomoharu Iwata,
Maya Okawa,
Hideaki Kim,
Hiroyuki Toda,
Takeshi Kurashima
Abstract:
Human activities generate various event sequences such as taxi trip records, bike-sharing pick-ups, crime occurrence, and infectious disease transmission. The point process is widely used in many applications to predict such events related to human activities. However, point processes present two problems in predicting events related to human activities. First, recent high-performance point proces…
▽ More
Human activities generate various event sequences such as taxi trip records, bike-sharing pick-ups, crime occurrence, and infectious disease transmission. The point process is widely used in many applications to predict such events related to human activities. However, point processes present two problems in predicting events related to human activities. First, recent high-performance point process models require the input of sufficient numbers of events collected over a long period (i.e., long sequences) for training, which are often unavailable in realistic situations. Second, the long-term predictions required in real-world applications are difficult. To tackle these problems, we propose a novel meta-learning approach for periodicity-aware prediction of future events given short sequences. The proposed method first embeds short sequences into hidden representations (i.e., task representations) via recurrent neural networks for creating predictions from short sequences. It then models the intensity of the point process by monotonic neural networks (MNNs), with the input being the task representations. We transfer the prior knowledge learned from related tasks and can improve event prediction given short sequences of target tasks. We design the MNNs to explicitly take temporal periodic patterns into account, contributing to improved long-term prediction performance. Experiments on multiple real-world datasets demonstrate that the proposed method has higher prediction performance than existing alternatives.
△ Less
Submitted 28 January, 2024;
originally announced January 2024.
-
Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data
Authors:
Young-Jae Park,
Minseok Seo,
Doyi Kim,
Hyeri Kim,
Sanghoon Choi,
Beomkyu Choi,
Jeongwon Ryu,
Sohee Son,
Hae-Gon Jeon,
Yeji Choi
Abstract:
In the face of escalating climate changes, typhoon intensities and their ensuing damage have surged. Accurate trajectory prediction is crucial for effective damage control. Traditional physics-based models, while comprehensive, are computationally intensive and rely heavily on the expertise of forecasters. Contemporary data-driven methods often rely on reanalysis data, which can be considered to b…
▽ More
In the face of escalating climate changes, typhoon intensities and their ensuing damage have surged. Accurate trajectory prediction is crucial for effective damage control. Traditional physics-based models, while comprehensive, are computationally intensive and rely heavily on the expertise of forecasters. Contemporary data-driven methods often rely on reanalysis data, which can be considered to be the closest to the true representation of weather conditions. However, reanalysis data is not produced in real-time and requires time for adjustment because prediction models are calibrated with observational data. This reanalysis data, such as ERA5, falls short in challenging real-world situations. Optimal preparedness necessitates predictions at least 72 hours in advance, beyond the capabilities of standard physics models. In response to these constraints, we present an approach that harnesses real-time Unified Model (UM) data, sidestep** the limitations of reanalysis data. Our model provides predictions at 6-hour intervals for up to 72 hours in advance and outperforms both state-of-the-art data-driven methods and numerical weather prediction models. In line with our efforts to mitigate adversities inflicted by \rthree{typhoons}, we release our preprocessed \textit{PHYSICS TRACK} dataset, which includes ERA5 reanalysis data, typhoon best-track, and UM forecast data.
△ Less
Submitted 28 January, 2024;
originally announced January 2024.
-
Asymptotic Midpoint Mixup for Margin Balancing and Moderate Broadening
Authors:
Hoyong Kim,
Semi Lee,
Kangil Kim
Abstract:
In the feature space, the collapse between features invokes critical problems in representation learning by remaining the features undistinguished. Interpolation-based augmentation methods such as mixup have shown their effectiveness in relieving the collapse problem between different classes, called inter-class collapse. However, intra-class collapse raised in coarse-to-fine transfer learning has…
▽ More
In the feature space, the collapse between features invokes critical problems in representation learning by remaining the features undistinguished. Interpolation-based augmentation methods such as mixup have shown their effectiveness in relieving the collapse problem between different classes, called inter-class collapse. However, intra-class collapse raised in coarse-to-fine transfer learning has not been discussed in the augmentation approach. To address them, we propose a better feature augmentation method, asymptotic midpoint mixup. The method generates augmented features by interpolation but gradually moves them toward the midpoint of inter-class feature pairs. As a result, the method induces two effects: 1) balancing the margin for all classes and 2) only moderately broadening the margin until it holds maximal confidence. We empirically analyze the collapse effects by measuring alignment and uniformity with visualizing representations. Then, we validate the intra-class collapse effects in coarse-to-fine transfer learning and the inter-class collapse effects in imbalanced learning on long-tailed datasets. In both tasks, our method shows better performance than other augmentation methods.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.