-
Generic magnetic field dependence of thermal conductivity in magnetic insulators via hybridization of acoustic phonons and spin-flip excitations
Authors:
Christopher A. Pocs,
Ian A. Leahy,
Jie Xing,
Eun Sang Choi,
Athena S. Sefat,
Michael Hermele,
Minhyea Lee
Abstract:
Magnetic insulators provide excellent playgrounds to realize a range of exciting spin models, some of which predict exotic spin ground states, and thermal transport properties have been taking center stage in probing the spin excitations. Despite the fact that acoustic phonons make the major contribution to heat conduction in a crystalline system, their interplay with magnetic excitations is often…
▽ More
Magnetic insulators provide excellent playgrounds to realize a range of exciting spin models, some of which predict exotic spin ground states, and thermal transport properties have been taking center stage in probing the spin excitations. Despite the fact that acoustic phonons make the major contribution to heat conduction in a crystalline system, their interplay with magnetic excitations is often viewed as peripheral to the physics of interest, for instance as an inconvenient source of scattering or decoherence. Here, we present a comprehensive study on the longitudinal magneto-thermal transport in a paramagnetic effective spin-1/2 magnetic insulator CsYbSe$_2$. We introduce a minimal model requiring only Zeeman splitting and magnetoelastic coupling, and use it to argue that hybridized excitations -- formed from acoustic phonons and localized spin-flip-excitations across the Zeeman gap of the crystal electric field ground doublet -- are responsible for a striking non-monotonic field dependence of longitudinal thermal conductivity. Beyond highlighting a starring role for phonons, our results raise the prospect of universal magneto-thermal transport phenomena in magnetic insulators that originate from simple features shared across many systems.
△ Less
Submitted 26 January, 2024; v1 submitted 2 January, 2024;
originally announced January 2024.
-
OGHReS: Star formation in the Outer Galaxy ($\ell = 250^\circ$-$280^\circ$)
Authors:
J. S. Urquhart,
C. König,
D. Colombo,
A. Karska,
F. Wyrowski,
K. M. Menten,
T. J. T. Moore,
J. Brand,
D. Elia,
A. Giannetti,
S. Leurini,
M. Figueira,
M. -Y. Lee,
M. Dumke
Abstract:
We have used data from the Outer Galaxy High-Resolution Survey (OGHReS) to refine the velocities, distances, and physical properties of a large sample of 3584 clumps detected in far infrared/submillimetre emission in the HiGAL survey located in the $\ell = 250^\circ-280^\circ$ region of the Galactic plane. Using $^{12}$CO and $^{13}$CO spectra, we have determined reliable velocities to 3412 clumps…
▽ More
We have used data from the Outer Galaxy High-Resolution Survey (OGHReS) to refine the velocities, distances, and physical properties of a large sample of 3584 clumps detected in far infrared/submillimetre emission in the HiGAL survey located in the $\ell = 250^\circ-280^\circ$ region of the Galactic plane. Using $^{12}$CO and $^{13}$CO spectra, we have determined reliable velocities to 3412 clumps (95% of the sample). In comparison to the velocities from the HiGAL catalogue, we find good agreement for 80% of the sample (within 5 km/s). Using the higher resolution and sensitivity of OGHReS has allowed us to correct the velocity for 632 clumps and provide velocities for 687 clumps for which no velocity had been previously allocated. The velocities are used with a rotation curve to refine the distances to the clumps and to calculate the clumps' properties using a distance-dependent gas-to-dust ratio. We have determined reliable physical parameters for 3200 outer Galaxy dense clumps (~90% of the HiGAL sources in the region). We find a trend of decreasing luminosity-to-mass ratio with increasing Galactocentric distance, suggesting the star formation efficiency is lower in the outer Galaxy or that it is resulting in more lower mass stars than in the inner Galaxy. We also find a similar surface density for protostellar clumps located in the inner and outer Galaxy, revealing that the surface density requirements for star formation are the same across the Galactic disc.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
The structure of the stellar halo of the Andromeda galaxy explored with the NB515 for Subaru/HSC. I.: New Insights on the stellar halo up to 120 kpc
Authors:
Itsuki Ogami,
Mikito Tanaka,
Yutaka Komiyama,
Masashi Chiba,
Puragra Guhathakurta,
Evan N. Kirby,
Rosemary F. G. Wyse,
Carrie Filion,
Karoline M. Gilbert,
Ivanna Escala,
Masao Mori,
Takanobu Kirihara,
Masayuki Tanaka,
Miho N. Ishigaki,
Kohei Hayashi,
Myun Gyoon Lee,
Sanjib Sharma,
Jason S. Kalirai,
Robert H. Lupton
Abstract:
We analyse the M31 halo and its substructure within a projected radius of 120 kpc using a combination of Subaru/HSC NB515 and CFHT/MegaCam g- & i-bands. We succeed in separating M31's halo stars from foreground contamination with $\sim$ 90 \% accuracy by using the surface gravity sensitive NB515 filter. Based on the selected M31 halo stars, we discover three new substructures, which associate with…
▽ More
We analyse the M31 halo and its substructure within a projected radius of 120 kpc using a combination of Subaru/HSC NB515 and CFHT/MegaCam g- & i-bands. We succeed in separating M31's halo stars from foreground contamination with $\sim$ 90 \% accuracy by using the surface gravity sensitive NB515 filter. Based on the selected M31 halo stars, we discover three new substructures, which associate with the Giant Southern Stream (GSS) based on their photometric metallicity estimates. We also produce the distance and photometric metallicity estimates for the known substructures. While these quantities for the GSS are reproduced in our study, we find that the North-Western stream shows a steeper distance gradient than found in an earlier study, suggesting that it is likely to have formed in an orbit closer to the Milky Way. For two streams in the eastern halo (Stream C and D), we identify distance gradients that had not been resolved. Finally, we investigate the global halo photometric metallicity distribution and surface brightness profile using the NB515-selected halo stars. We find that the surface brightness of the metal-poor and metal-rich halo populations, and the all population can be fitted to a power-law profile with an index of $α= -1.65 \pm 0.02$, $-2.82\pm0.01$, and $-2.44\pm0.01$, respectively. In contrast to the relative smoothness of the halo profile, its photometric metallicity distribution appears to be spatially non-uniform with nonmonotonic trends with radius, suggesting that the halo population had insufficient time to dynamically homogenize the accreted populations.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
Noise-free Optimization in Early Training Steps for Image Super-Resolution
Authors:
MinKyu Lee,
Jae-Pil Heo
Abstract:
Recent deep-learning-based single image super-resolution (SISR) methods have shown impressive performance whereas typical methods train their networks by minimizing the pixel-wise distance with respect to a given high-resolution (HR) image. However, despite the basic training scheme being the predominant choice, its use in the context of ill-posed inverse problems has not been thoroughly investiga…
▽ More
Recent deep-learning-based single image super-resolution (SISR) methods have shown impressive performance whereas typical methods train their networks by minimizing the pixel-wise distance with respect to a given high-resolution (HR) image. However, despite the basic training scheme being the predominant choice, its use in the context of ill-posed inverse problems has not been thoroughly investigated. In this work, we aim to provide a better comprehension of the underlying constituent by decomposing target HR images into two subcomponents: (1) the optimal centroid which is the expectation over multiple potential HR images, and (2) the inherent noise defined as the residual between the HR image and the centroid. Our findings show that the current training scheme cannot capture the ill-posed nature of SISR and becomes vulnerable to the inherent noise term, especially during early training steps. To tackle this issue, we propose a novel optimization method that can effectively remove the inherent noise term in the early steps of vanilla training by estimating the optimal centroid and directly optimizing toward the estimation. Experimental results show that the proposed method can effectively enhance the stability of vanilla training, leading to overall performance gain. Codes are available at github.com/2minkyulee/ECO.
△ Less
Submitted 29 December, 2023;
originally announced December 2023.
-
Industrial Internet of Things Intelligence Empowering Smart Manufacturing: A Literature Review
Authors:
Yujiao Hu,
Qingmin Jia,
Yuao Yao,
Yong Lee,
Mengjie Lee,
Chenyi Wang,
Xiaomao Zhou,
Renchao Xie,
F. Richard Yu
Abstract:
The fiercely competitive business environment and increasingly personalized customization needs are driving the digital transformation and upgrading of the manufacturing industry. IIoT intelligence, which can provide innovative and efficient solutions for various aspects of the manufacturing value chain, illuminates the path of transformation for the manufacturing industry. It's time to provide a…
▽ More
The fiercely competitive business environment and increasingly personalized customization needs are driving the digital transformation and upgrading of the manufacturing industry. IIoT intelligence, which can provide innovative and efficient solutions for various aspects of the manufacturing value chain, illuminates the path of transformation for the manufacturing industry. It's time to provide a systematic vision of IIoT intelligence. However, existing surveys often focus on specific areas of IIoT intelligence, leading researchers and readers to have biases in their understanding of IIoT intelligence, that is, believing that research in one direction is the most important for the development of IIoT intelligence, while ignoring contributions from other directions. Therefore, this paper provides a comprehensive overview of IIoT intelligence. We first conduct an in-depth analysis of the inevitability of manufacturing transformation and study the successful experiences from the practices of Chinese enterprises. Then we give our definition of IIoT intelligence and demonstrate the value of IIoT intelligence for industries in fucntions, operations, deployments, and application. Afterwards, we propose a hierarchical development architecture for IIoT intelligence, which consists of five layers. The practical values of technical upgrades at each layer are illustrated by a close look on lighthouse factories. Following that, we identify seven kinds of technologies that accelerate the transformation of manufacturing, and clarify their contributions. The ethical implications and environmental impacts of adopting IIoT intelligence in manufacturing are analyzed as well. Finally, we explore the open challenges and development trends from four aspects to inspire future researches.
△ Less
Submitted 21 February, 2024; v1 submitted 2 December, 2023;
originally announced December 2023.
-
An endpoint estimate for product singular integral operators on stratified Lie groups
Authors:
Michael G. Cowling,
Ming-Yi Lee,
Ji Li,
Jill Pipher
Abstract:
We establish hyperweak boundedness of area functions, square functions, maximal operators and Calderón--Zygmund operators on products of two stratified Lie groups.
We establish hyperweak boundedness of area functions, square functions, maximal operators and Calderón--Zygmund operators on products of two stratified Lie groups.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
Track reconstruction for the COMET Phase-II experiment with ACTS
Authors:
Amaia Razquin,
MyeongJae Lee
Abstract:
An implementation of A Common Tracking Software (ACTS) toolkit for signal electron reconstruction for the COMET muon to electron conversion experiment is discussed. The COMET experiment in J-PARC, Japan, will search for neutrinoless conversion of muons into electrons in the field of an aluminium nucleus, a lepton flavour violating process, aiming target sensitivity of $10^{-17}$. To achieve its sc…
▽ More
An implementation of A Common Tracking Software (ACTS) toolkit for signal electron reconstruction for the COMET muon to electron conversion experiment is discussed. The COMET experiment in J-PARC, Japan, will search for neutrinoless conversion of muons into electrons in the field of an aluminium nucleus, a lepton flavour violating process, aiming target sensitivity of $10^{-17}$. To achieve its scientific goals, the experiment requires a reconstructed momentum resolution of lower than 150 keV/c. For the first time by applying ACTS to signal events in the 100 MeV energy range with multiple-turn trajectories in the presence of background events, it is found that the reconstruction efficiency is around 14\% with no fake reconstructed events. The implementation details, performance, and issues of ACTS in the context of COMET are presented.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
Search for the $e^+e^-\toη_{b}(1S)ω$ and $e^+e^-\toχ_{b0}(1P)ω$ processes at $\sqrt{s}=10.745\,\mathrm{GeV}$
Authors:
Belle II Collaboration,
I. Adachi,
L. Aggarwal,
H. Ahmed,
H. Aihara,
N. Akopov,
A. Aloisio,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
V. Babu,
H. Bae,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
M. Barrett,
J. Baudot,
M. Bauer,
A. Baur,
A. Beaubien,
F. Becherer,
J. Becker
, et al. (397 additional authors not shown)
Abstract:
We search for the $e^+e^-\toη_b(1S)ω$ and $e^+e^-\toχ_{b0}(1P)ω$ processes at a center-of-mass energy of 10.745 GeV, which is close to the peak of the $Υ(10753)$ state. We use data collected by the Belle II experiment during a special run, corresponding to an integrated luminosity of $9.8\,\mathrm{fb}^{-1}$. We reconstruct $ω\toπ^+π^-π^0$ decays and use the $ω$ meson's recoil mass to search for th…
▽ More
We search for the $e^+e^-\toη_b(1S)ω$ and $e^+e^-\toχ_{b0}(1P)ω$ processes at a center-of-mass energy of 10.745 GeV, which is close to the peak of the $Υ(10753)$ state. We use data collected by the Belle II experiment during a special run, corresponding to an integrated luminosity of $9.8\,\mathrm{fb}^{-1}$. We reconstruct $ω\toπ^+π^-π^0$ decays and use the $ω$ meson's recoil mass to search for the signals. We do not find evidence for either process, and set upper limits on the corresponding Born-level cross sections of 2.5 pb and 7.8 pb, respectively, at the 90% confidence level. The $χ_{b0}(1P)ω$ limit is the result of a combination of this analysis and a previous search using full reconstruction.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Application of AI in Nutrition
Authors:
Ritu Ramakrishnan,
Tianxiang Xing,
Tianfeng Chen,
Ming-Hao Lee,
**zhu Gao
Abstract:
In healthcare, artificial intelligence (AI) has been changing the way doctors and health experts take care of people. This paper will cover how AI is making major changes in the health care system, especially with nutrition. Various machine learning and deep learning algorithms have been developed to extract valuable information from healthcare data which help doctors, nutritionists, and health ex…
▽ More
In healthcare, artificial intelligence (AI) has been changing the way doctors and health experts take care of people. This paper will cover how AI is making major changes in the health care system, especially with nutrition. Various machine learning and deep learning algorithms have been developed to extract valuable information from healthcare data which help doctors, nutritionists, and health experts to make better decisions and make our lifestyle healthy. This paper provides an overview of the current state of AI applications in healthcare with a focus on the utilization of AI-driven recommender systems in nutrition. It will discuss the positive outcomes and challenges that arise when AI is used in this field. This paper addresses the challenges to develop AI recommender systems in healthcare, providing a well-rounded perspective on the complexities. Real-world examples and research findings are presented to underscore the tangible and significant impact AI recommender systems have in the field of healthcare, particularly in nutrition. The ongoing efforts of applying AI in nutrition lay the groundwork for a future where personalized recommendations play a pivotal role in guiding individuals toward healthier lifestyles.
△ Less
Submitted 17 December, 2023;
originally announced December 2023.
-
Rigidity of area non-increasing maps
Authors:
Man-Chun Lee,
Luen-Fai Tam,
**gbo Wan
Abstract:
In this work, we consider the area non-increasing map between manifolds with positive curvature. By exploring the strong maximum principle along the graphical mean curvature flow, we show that an area non-increasing map between certain positively curved manifolds is either homotopy trivial, Riemannian submersion, local isometry or isometric immersion. This implies that an area non-increasing self…
▽ More
In this work, we consider the area non-increasing map between manifolds with positive curvature. By exploring the strong maximum principle along the graphical mean curvature flow, we show that an area non-increasing map between certain positively curved manifolds is either homotopy trivial, Riemannian submersion, local isometry or isometric immersion. This implies that an area non-increasing self map of $\mathbb{CP}^n$, $n\ge 2$ is either homotopically trivial or is an isometry. This confirms a speculation of Tsai-Tsui-Wang. We also use Brendle's sphere Theorem and mean curvature flow coupled with Ricci flow to establish related results on manifolds with positive $1$-isotropic curvature.
△ Less
Submitted 24 February, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Predicted Multiple Walker Breakdowns for Current-Driven Domain-Wall Motion in Antiferromagnets
Authors:
Mu-Kun Lee,
Rubén M. Otxoa,
Masahito Mochizuki
Abstract:
We theoretically discover possible emergence of reentrant Walker breakdowns for current-driven domain walls in layered antiferromagnets in striking contrast to the unique Walker breakdown in ferromagnets. We reveal that the Lorentz contraction of domain-wall width in antiferromagnets gives rise to nonlinear current-dependence of the wall velocity and the predicted multiple Walker breakdowns. The d…
▽ More
We theoretically discover possible emergence of reentrant Walker breakdowns for current-driven domain walls in layered antiferromagnets in striking contrast to the unique Walker breakdown in ferromagnets. We reveal that the Lorentz contraction of domain-wall width in antiferromagnets gives rise to nonlinear current-dependence of the wall velocity and the predicted multiple Walker breakdowns. The dominant efficiency of the current-induced staggered spin-orbit torque over the spin-transfer torque to drive the domain-wall motion is also demonstrated. These findings are expected to be observed in synthetic antiferromagnets experimentally and provide an important contribution to the growing research field of antiferromagnetic spintronics.
△ Less
Submitted 22 April, 2024; v1 submitted 16 December, 2023;
originally announced December 2023.
-
SeiT++: Masked Token Modeling Improves Storage-efficient Training
Authors:
Minhyun Lee,
Song Park,
Byeongho Heo,
Dongyoon Han,
Hyunjung Shim
Abstract:
Recent advancements in Deep Neural Network (DNN) models have significantly improved performance across computer vision tasks. However, achieving highly generalizable and high-performing vision models requires expansive datasets, resulting in significant storage requirements. This storage challenge is a critical bottleneck for scaling up models. A recent breakthrough by SeiT proposed the use of Vec…
▽ More
Recent advancements in Deep Neural Network (DNN) models have significantly improved performance across computer vision tasks. However, achieving highly generalizable and high-performing vision models requires expansive datasets, resulting in significant storage requirements. This storage challenge is a critical bottleneck for scaling up models. A recent breakthrough by SeiT proposed the use of Vector-Quantized (VQ) feature vectors (i.e., tokens) as network inputs for vision classification. This approach achieved 90% of the performance of a model trained on full-pixel images with only 1% of the storage. While SeiT needs labeled data, its potential in scenarios beyond fully supervised learning remains largely untapped. In this paper, we extend SeiT by integrating Masked Token Modeling (MTM) for self-supervised pre-training. Recognizing that self-supervised approaches often demand more data due to the lack of labels, we introduce TokenAdapt and ColorAdapt. These methods facilitate comprehensive token-friendly data augmentation, effectively addressing the increased data requirements of self-supervised learning. We evaluate our approach across various scenarios, including storage-efficient ImageNet-1k classification, fine-grained classification, ADE-20k semantic segmentation, and robustness benchmarks. Experimental results demonstrate consistent performance improvement in diverse experiments, validating the effectiveness of our method. Code is available at https://github.com/naver-ai/tokenadapt.
△ Less
Submitted 2 April, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
CNC-Net: Self-Supervised Learning for CNC Machining Operations
Authors:
Mohsen Yavartanoo,
Sangmin Hong,
Reyhaneh Neshatavar,
Kyoung Mu Lee
Abstract:
CNC manufacturing is a process that employs computer numerical control (CNC) machines to govern the movements of various industrial tools and machinery, encompassing equipment ranging from grinders and lathes to mills and CNC routers. However, the reliance on manual CNC programming has become a bottleneck, and the requirement for expert knowledge can result in significant costs. Therefore, we intr…
▽ More
CNC manufacturing is a process that employs computer numerical control (CNC) machines to govern the movements of various industrial tools and machinery, encompassing equipment ranging from grinders and lathes to mills and CNC routers. However, the reliance on manual CNC programming has become a bottleneck, and the requirement for expert knowledge can result in significant costs. Therefore, we introduce a pioneering approach named CNC-Net, representing the use of deep neural networks (DNNs) to simulate CNC machines and grasp intricate operations when supplied with raw materials. CNC-Net constitutes a self-supervised framework that exclusively takes an input 3D model and subsequently generates the essential operation parameters required by the CNC machine to construct the object. Our method has the potential to transformative automation in manufacturing by offering a cost-effective alternative to the high costs of manual CNC programming while maintaining exceptional precision in 3D object production. Our experiments underscore the effectiveness of our CNC-Net in constructing the desired 3D objects through the utilization of CNC operations. Notably, it excels in preserving finer local details, exhibiting a marked enhancement in precision compared to the state-of-the-art 3D CAD reconstruction approaches.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Identified charged-hadron production in $p$$+$Al, $^3$He$+$Au, and Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and in U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
J. Alexander,
M. Alfred,
V. Andrieux,
K. Aoki,
N. Apadula,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
X. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
V. Baublis
, et al. (456 additional authors not shown)
Abstract:
The PHENIX experiment has performed a systematic study of identified charged-hadron ($π^\pm$, $K^\pm$, $p$, $\bar{p}$) production at midrapidity in $p$$+$Al, $^3$He$+$Au, Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV. Identified charged-hadron invariant transverse-momentum ($p_T$) and transverse-mass ($m_T$) spectra are presented and interprete…
▽ More
The PHENIX experiment has performed a systematic study of identified charged-hadron ($π^\pm$, $K^\pm$, $p$, $\bar{p}$) production at midrapidity in $p$$+$Al, $^3$He$+$Au, Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV. Identified charged-hadron invariant transverse-momentum ($p_T$) and transverse-mass ($m_T$) spectra are presented and interpreted in terms of radially expanding thermalized systems. The particle ratios of $K/π$ and $p/π$ have been measured in different centrality ranges of large (Cu$+$Au, U$+$U) and small ($p$$+$Al, $^3$He$+$Au) collision systems. The values of $K/π$ ratios measured in all considered collision systems were found to be consistent with those measured in $p$$+$$p$ collisions. However the values of $p/π$ ratios measured in large collision systems reach the values of $\approx0.6$, which is $\approx2$ times larger than in $p$$+$$p$ collisions. These results can be qualitatively understood in terms of the baryon enhancement expected from hadronization by recombination. Identified charged-hadron nuclear-modification factors ($R_{AB}$) are also presented. Enhancement of proton $R_{AB}$ values over meson $R_{AB}$ values was observed in central $^3$He$+$Au, Cu$+$Au, and U$+$U collisions. The proton $R_{AB}$ values measured in $p$$+$Al collision system were found to be consistent with $R_{AB}$ values of $φ$, $π^\pm$, $K^\pm$, and $π^0$ mesons, which may indicate that the size of the system produced in $p$$+$Al collisions is too small for recombination to cause a noticeable increase in proton production.
△ Less
Submitted 22 May, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Gap Theorem on manifolds with small curvature concentration
Authors:
Pak-Yeung Chan,
Man-Chun Lee
Abstract:
In this work, we show that complete non-compact manifolds with non-negative Ricci curvature, Euclidean volume growth and sufficiently small curvature concentration are necessarily flat Euclidean space.
In this work, we show that complete non-compact manifolds with non-negative Ricci curvature, Euclidean volume growth and sufficiently small curvature concentration are necessarily flat Euclidean space.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
A Galactic Eclipse: The Small Magellanic Cloud is Forming Stars in Two, Superimposed Systems
Authors:
Claire E. Murray,
Sten Hasselquist,
Joshua E. G. Peek,
Christina Willecke Lindberg,
Andres Almeida,
Yumi Choi,
Jessica E. M. Craig,
Helga Denes,
John M. Dickey,
Enrico M. Di Teodoro,
Christoph Federrath,
Isabella A. Gerrard,
Steven J. Gibson,
Denis Leahy,
Min-Young Lee,
Callum Lynn,
Yik Ki Ma,
Antoine Marchal,
N. M. McClure-Griffiths,
David Nidever,
Hiep Nguyen,
Nickolas M. **el,
Elizabeth Tarantino,
Lucero Uscanga,
Jacco Th. van Loon
Abstract:
The structure and dynamics of the star-forming disk of the Small Magellanic Cloud (SMC) have long confounded us. The SMC is widely used as a prototype for galactic physics at low metallicity, and yet we fundamentally lack an understanding of the structure of its interstellar medium (ISM). In this work, we present a new model for the SMC by comparing the kinematics of young, massive stars with the…
▽ More
The structure and dynamics of the star-forming disk of the Small Magellanic Cloud (SMC) have long confounded us. The SMC is widely used as a prototype for galactic physics at low metallicity, and yet we fundamentally lack an understanding of the structure of its interstellar medium (ISM). In this work, we present a new model for the SMC by comparing the kinematics of young, massive stars with the structure of the ISM traced by high-resolution observations of neutral atomic hydrogen (HI) from the Galactic Australian Square Kilometer Array Pathfinder survey (GASKAP-HI). Specifically, we identify thousands of young, massive stars with precise radial velocity constraints from the Gaia and APOGEE surveys and match these stars to the ISM structures in which they likely formed. By comparing the average dust extinction towards these stars, we find evidence that the SMC is composed of two structures with distinct stellar and gaseous chemical compositions. We construct a simple model that successfully reproduces the observations and shows that the ISM of the SMC is arranged into two, superimposed, star-forming systems with similar gas mass separated by ~5 kpc along the line of sight.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
VGF: Value-Guided Fuzzing -- Fuzzing Hardware as Hardware
Authors:
Ruochen Dai,
Michael Lee,
Patrick Hoey,
Weimin Fu,
Tuba Yavuz,
Xiaolong Guo,
Shuo Wang,
Dean Sullivan,
Orlando Arias
Abstract:
As the complexity of logic designs increase, new avenues for testing digital hardware becomes necessary. Fuzz Testing (fuzzing) has recently received attention as a potential candidate for input vector generation on hardware designs. Using this technique, a fuzzer is used to generate an input to a logic design. Using a simulation engine, the logic design is given the generated stimulus and some me…
▽ More
As the complexity of logic designs increase, new avenues for testing digital hardware becomes necessary. Fuzz Testing (fuzzing) has recently received attention as a potential candidate for input vector generation on hardware designs. Using this technique, a fuzzer is used to generate an input to a logic design. Using a simulation engine, the logic design is given the generated stimulus and some metric of feedback is given to the fuzzer to aid in the input mutation. However, much like software fuzzing, hardware fuzzing uses code coverage as a metric to find new possible fuzzing paths. Unfortunately, as we show in this work, this coverage metric falls short of generic on some hardware designs where designers have taken a more direct approach at expressing a particular microarchitecture, or implementation, of the desired hardware.
With this work, we introduce a new coverage metric which employs not code coverage, but state coverage internal to a design. By observing changes in signals within the logic circuit under testing, we are able to explore the state space of the design and provide feedback to a fuzzer engine for input generation. Our approach, Value-Guided Fuzzing (VGF), provides a generic metric of coverage which can be applied to any design regardless of its implementation. In this paper, we introduce our state-based VGF metric as well as a sample implementation which can be used with any VPI, DPI, VHPI, or FLI compliant simulator, making it completely HDL agnostic. We demonstrate the generality of VGF and show how our sample implementation is capable of finding bugs considerably faster than previous approaches.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
MMM: Generative Masked Motion Model
Authors:
Ekkasit Pinyoanuntapong,
Pu Wang,
Minwoo Lee,
Chen Chen
Abstract:
Recent advances in text-to-motion generation using diffusion and autoregressive models have shown promising results. However, these models often suffer from a trade-off between real-time performance, high fidelity, and motion editability. To address this gap, we introduce MMM, a novel yet simple motion generation paradigm based on Masked Motion Model. MMM consists of two key components: (1) a moti…
▽ More
Recent advances in text-to-motion generation using diffusion and autoregressive models have shown promising results. However, these models often suffer from a trade-off between real-time performance, high fidelity, and motion editability. To address this gap, we introduce MMM, a novel yet simple motion generation paradigm based on Masked Motion Model. MMM consists of two key components: (1) a motion tokenizer that transforms 3D human motion into a sequence of discrete tokens in latent space, and (2) a conditional masked motion transformer that learns to predict randomly masked motion tokens, conditioned on the pre-computed text tokens. By attending to motion and text tokens in all directions, MMM explicitly captures inherent dependency among motion tokens and semantic map** between motion and text tokens. During inference, this allows parallel and iterative decoding of multiple motion tokens that are highly consistent with fine-grained text descriptions, therefore simultaneously achieving high-fidelity and high-speed motion generation. In addition, MMM has innate motion editability. By simply placing mask tokens in the place that needs editing, MMM automatically fills the gaps while guaranteeing smooth transitions between editing and non-editing parts. Extensive experiments on the HumanML3D and KIT-ML datasets demonstrate that MMM surpasses current leading methods in generating high-quality motion (evidenced by superior FID scores of 0.08 and 0.429), while offering advanced editing features such as body-part modification, motion in-betweening, and the synthesis of long motion sequences. In addition, MMM is two orders of magnitude faster on a single mid-range GPU than editable motion diffusion models. Our project page is available at \url{https://exitudio.github.io/MMM-page}.
△ Less
Submitted 27 March, 2024; v1 submitted 6 December, 2023;
originally announced December 2023.
-
Improving Bias Mitigation through Bias Experts in Natural Language Understanding
Authors:
Eo** Jeon,
Mingyu Lee,
Juhyeong Park,
Yeachan Kim,
Wing-Lam Mok,
SangKeun Lee
Abstract:
Biases in the dataset often enable the model to achieve high performance on in-distribution data, while poorly performing on out-of-distribution data. To mitigate the detrimental effect of the bias on the networks, previous works have proposed debiasing methods that down-weight the biased examples identified by an auxiliary model, which is trained with explicit bias labels. However, finding a type…
▽ More
Biases in the dataset often enable the model to achieve high performance on in-distribution data, while poorly performing on out-of-distribution data. To mitigate the detrimental effect of the bias on the networks, previous works have proposed debiasing methods that down-weight the biased examples identified by an auxiliary model, which is trained with explicit bias labels. However, finding a type of bias in datasets is a costly process. Therefore, recent studies have attempted to make the auxiliary model biased without the guidance (or annotation) of bias labels, by constraining the model's training environment or the capability of the model itself. Despite the promising debiasing results of recent works, the multi-class learning objective, which has been naively used to train the auxiliary model, may harm the bias mitigation effect due to its regularization effect and competitive nature across classes. As an alternative, we propose a new debiasing framework that introduces binary classifiers between the auxiliary model and the main model, coined bias experts. Specifically, each bias expert is trained on a binary classification task derived from the multi-class classification task via the One-vs-Rest approach. Experimental results demonstrate that our proposed strategy improves the bias identification ability of the auxiliary model. Consequently, our debiased model consistently outperforms the state-of-the-art on various challenge datasets.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Authors:
Kibeom Kim,
Kisung Shin,
Min Whoo Lee,
Moonhoen Lee,
Minsu Lee,
Byoung-Tak Zhang
Abstract:
Interactive visual navigation tasks, which involve following instructions to reach and interact with specific targets, are challenging not only because successful experiences are very rare but also because the complex visual inputs require a substantial number of samples. Previous methods for these tasks often rely on intricately designed dense rewards or the use of expensive expert data for imita…
▽ More
Interactive visual navigation tasks, which involve following instructions to reach and interact with specific targets, are challenging not only because successful experiences are very rare but also because the complex visual inputs require a substantial number of samples. Previous methods for these tasks often rely on intricately designed dense rewards or the use of expensive expert data for imitation learning. To tackle these challenges, we propose a novel approach, Visual Hindsight Self-Imitation Learning (VHS) for enhancing sample efficiency through hindsight goal re-labeling and self-imitation. We also introduce a prototypical goal embedding method derived from experienced goal observations, that is particularly effective in vision-based and partially observable environments. This embedding technique allows the agent to visually reinterpret its unsuccessful attempts, enabling vision-based goal re-labeling and self-imitation from enhanced successful experiences. Experimental results show that VHS outperforms existing techniques in interactive visual navigation tasks, confirming its superior performance and sample efficiency.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Projection Regret: Reducing Background Bias for Novelty Detection via Diffusion Models
Authors:
Sungik Choi,
Hankook Lee,
Honglak Lee,
Moontae Lee
Abstract:
Novelty detection is a fundamental task of machine learning which aims to detect abnormal ($\textit{i.e.}$ out-of-distribution (OOD)) samples. Since diffusion models have recently emerged as the de facto standard generative framework with surprising generation results, novelty detection via diffusion models has also gained much attention. Recent methods have mainly utilized the reconstruction prop…
▽ More
Novelty detection is a fundamental task of machine learning which aims to detect abnormal ($\textit{i.e.}$ out-of-distribution (OOD)) samples. Since diffusion models have recently emerged as the de facto standard generative framework with surprising generation results, novelty detection via diffusion models has also gained much attention. Recent methods have mainly utilized the reconstruction property of in-distribution samples. However, they often suffer from detecting OOD samples that share similar background information to the in-distribution data. Based on our observation that diffusion models can \emph{project} any sample to an in-distribution sample with similar background information, we propose \emph{Projection Regret (PR)}, an efficient novelty detection method that mitigates the bias of non-semantic information. To be specific, PR computes the perceptual distance between the test image and its diffusion-based projection to detect abnormality. Since the perceptual distance often fails to capture semantic changes when the background information is dominant, we cancel out the background bias by comparing it against recursive projections. Extensive experiments demonstrate that PR outperforms the prior art of generative-model-based novelty detection methods by a significant margin.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Practical Path-based Bayesian Optimization
Authors:
Jose Pablo Folch,
James Odgers,
Shiqiang Zhang,
Robert M Lee,
Behrang Shafei,
David Walz,
Calvin Tsay,
Mark van der Wilk,
Ruth Misener
Abstract:
There has been a surge in interest in data-driven experimental design with applications to chemical engineering and drug manufacturing. Bayesian optimization (BO) has proven to be adaptable to such cases, since we can model the reactions of interest as expensive black-box functions. Sometimes, the cost of this black-box functions can be separated into two parts: (a) the cost of the experiment itse…
▽ More
There has been a surge in interest in data-driven experimental design with applications to chemical engineering and drug manufacturing. Bayesian optimization (BO) has proven to be adaptable to such cases, since we can model the reactions of interest as expensive black-box functions. Sometimes, the cost of this black-box functions can be separated into two parts: (a) the cost of the experiment itself, and (b) the cost of changing the input parameters. In this short paper, we extend the SnAKe algorithm to deal with both types of costs simultaneously. We further propose extensions to the case of a maximum allowable input change, as well as to the multi-objective setting.
△ Less
Submitted 1 December, 2023;
originally announced December 2023.
-
Synchronizing Vision and Language: Bidirectional Token-Masking AutoEncoder for Referring Image Segmentation
Authors:
Minhyeok Lee,
Dogyoon Lee,
Jungho Lee,
Suhwan Cho,
Heeseung Choi,
Ig-Jae Kim,
Sangyoun Lee
Abstract:
Referring Image Segmentation (RIS) aims to segment target objects expressed in natural language within a scene at the pixel level. Various recent RIS models have achieved state-of-the-art performance by generating contextual tokens to model multimodal features from pretrained encoders and effectively fusing them using transformer-based cross-modal attention. While these methods match language feat…
▽ More
Referring Image Segmentation (RIS) aims to segment target objects expressed in natural language within a scene at the pixel level. Various recent RIS models have achieved state-of-the-art performance by generating contextual tokens to model multimodal features from pretrained encoders and effectively fusing them using transformer-based cross-modal attention. While these methods match language features with image features to effectively identify likely target objects, they often struggle to correctly understand contextual information in complex and ambiguous sentences and scenes. To address this issue, we propose a novel bidirectional token-masking autoencoder (BTMAE) inspired by the masked autoencoder (MAE). The proposed model learns the context of image-to-language and language-to-image by reconstructing missing features in both image and language features at the token level. In other words, this approach involves mutually complementing across the features of images and language, with a focus on enabling the network to understand interconnected deep contextual information between the two modalities. This learning method enhances the robustness of RIS performance in complex sentences and scenes. Our BTMAE achieves state-of-the-art performance on three popular datasets, and we demonstrate the effectiveness of the proposed method through various ablation studies.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
A superconducting tensor detector for mid-frequency gravitational waves: its multi-channel nature and main astrophysical targets
Authors:
Yeong-Bok Bae,
Chan Park,
Edwin J. Son,
Sang-Hyeon Ahn,
Minjoong Jeong,
Gungwon Kang,
Chunglee Kim,
Dong Lak Kim,
Jaewan Kim,
Whansun Kim,
Hyung Mok Lee,
Yong-Ho Lee,
Ronald S. Norton,
John J. Oh,
Sang Hoon Oh,
Ho Jung Paik
Abstract:
Mid-frequency band gravitational-wave detectors will be complementary for the existing Earth-based detectors (sensitive above 10 Hz or so) and the future space-based detectors such as LISA, which will be sensitive below around 10 mHz. A ground-based superconducting omnidirectional gravitational radiation observatory (SOGRO) has recently been proposed along with several design variations for the fr…
▽ More
Mid-frequency band gravitational-wave detectors will be complementary for the existing Earth-based detectors (sensitive above 10 Hz or so) and the future space-based detectors such as LISA, which will be sensitive below around 10 mHz. A ground-based superconducting omnidirectional gravitational radiation observatory (SOGRO) has recently been proposed along with several design variations for the frequency band of 0.1 to 10 Hz. For three conceptual designs of SOGRO (e.g., pSOGRO, SOGRO and aSOGRO), we examine their multi-channel natures, sensitivities and science cases. One of the key characteristics of the SOGRO concept is its six detection channels. The response functions of each channel are calculated for all possible gravitational wave polarizations including scalar and vector modes. Combining these response functions, we also confirm the omnidirectional nature of SOGRO. Hence, even a single SOGRO detector will be able to determine the position of a source and polarizations of gravitational waves, if detected. Taking into account SOGRO's sensitivity and technical requirements, two main targets are most plausible: gravitational waves from compact binaries and stochastic backgrounds. Based on assumptions we consider in this work, detection rates for intermediate-mass binary black holes (in the mass range of hundreds up to $10^{4}$ $M_\odot$) are expected to be $0.0014-2.5 \,\, {\rm yr}^{-1}$. In order to detect stochastic gravitational wave background, multiple detectors are required. Two aSOGRO detector networks may be able to put limits on the stochastic background beyond the indirect limit from cosmological observations.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
An ALMA Spectroscopic Survey of the Brightest Submillimeter Galaxies in the SCUBA-2-COSMOS Field (AS2COSPEC): Physical Properties of z=2-5 Ultra- and Hyperluminous Infrared Galaxies
Authors:
Cheng-Lin Liao,
Chian-Chou Chen,
Wei-Hao Wang,
Ian Smail,
Yi** Ao,
Scott C. Chapman,
Ugne Dudzeviciute,
Marta Frias Castillo,
Minju M. Lee,
Stephen Serjeant,
A. Mark Swinbank,
Dominic J. Taylor,
Hideki Umehata,
Yinghe Zhao
Abstract:
We report physical properties of the brightest ($S_{870\,μ\rm m}=12.4$-$19.2\,$mJy) and not strongly lensed 18 870$\,μ$m selected dusty star-forming galaxies (DSFGs), also known as submillimeter galaxies (SMGs), in the COSMOS field. This sample is part of an ALMA band$\,$3 spectroscopic survey (AS2COSPEC), and spectroscopic redshifts are measured in 17 of them at $z=2$-$5$. We perform spectral ene…
▽ More
We report physical properties of the brightest ($S_{870\,μ\rm m}=12.4$-$19.2\,$mJy) and not strongly lensed 18 870$\,μ$m selected dusty star-forming galaxies (DSFGs), also known as submillimeter galaxies (SMGs), in the COSMOS field. This sample is part of an ALMA band$\,$3 spectroscopic survey (AS2COSPEC), and spectroscopic redshifts are measured in 17 of them at $z=2$-$5$. We perform spectral energy distribution analyses and deduce a median total infrared luminosity of $L_{\rm IR}=(1.3\pm0.1)\times10^{13}\,L_{\odot}$, infrared-based star-formation rate of ${\rm SFR}_{\rm IR}=1390\pm150~M_{\odot}\,\rm yr^{-1}$, stellar mass of $M_\ast=(1.4\pm0.6)\times10^{11}\,M_\odot$, dust mass of $M_{\rm dust}=(3.7\pm0.5)\times10^9\,M_\odot$, and molecular gas mass of $M_{\rm gas}= (α_{\rm CO}/0.8)(1.2\pm0.1)\times10^{11}\,M_\odot$, suggesting that they are one of the most massive, ISM-enriched, and actively star-forming systems at $z=2$-$5$. In addition, compared to less massive and less active galaxies at similar epochs, SMGs have comparable gas fractions; however, they have much shorter depletion time, possibly caused by more active dynamical interactions. We determine a median dust emissivity index of $β=2.1\pm0.1$ for our sample, and by combining our results with those from other DSFG samples, we find no correlation of $β$ with redshift or infrared luminosity, indicating similar dust grain compositions across cosmic time for infrared luminous galaxies. We also find that AS2COSPEC SMGs have one of the highest dust-to-stellar mass ratios, with a median of $0.02\pm0.01$, significantly higher than model predictions, possibly due to too strong of a AGN feedback implemented in the model. Finally, our complete and uniform survey enables us to put constraints on the most massive end of the dust and molecular gas mass functions.
△ Less
Submitted 31 January, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
JWST and ALMA discern the assembly of structural and obscured components in a high-redshift starburst galaxy
Authors:
Zhaoxuan Liu,
John D. Silverman,
Emanuele Daddi,
Annagrazia Puglisi,
Alvio Renzini,
Boris S. Kalita,
Jeyhan S. Kartaltepe,
Daichi Kashino,
Giulia Rodighiero,
Wiphu Rujopakarn,
Tomoko L. Suzuki,
Takumi S. Tanaka,
Francesco Valentino,
Irham Taufik Andika,
Caitlin M. Casey,
Andreas Faisst,
Maximilien Franco,
Ghassem Gozaliasl,
Steven Gillman,
Christopher C. Hayward,
Anton M. Koekemoer,
Vasily Kokorev,
Erini Lambrides,
Minju M. Lee,
Georgios E. Magdis
, et al. (5 additional authors not shown)
Abstract:
We present observations and analysis of the starburst, PACS-819, at z=1.45 ($M_*=10^{10.7}$ M$_{ \odot}$), using high-resolution ($0^{\prime \prime}.1$; 0.8 kpc) ALMA and multi-wavelength JWST images from the COSMOS-Web program. Dissimilar to HST/ACS images in the rest-frame UV, the redder NIRCam and MIRI images reveal a smooth central mass concentration and spiral-like features, atypical for such…
▽ More
We present observations and analysis of the starburst, PACS-819, at z=1.45 ($M_*=10^{10.7}$ M$_{ \odot}$), using high-resolution ($0^{\prime \prime}.1$; 0.8 kpc) ALMA and multi-wavelength JWST images from the COSMOS-Web program. Dissimilar to HST/ACS images in the rest-frame UV, the redder NIRCam and MIRI images reveal a smooth central mass concentration and spiral-like features, atypical for such an intense starburst. Through dynamical modeling of the CO J=5--4 emission with ALMA, PACS-819 is rotation-dominated thus has a disk-like nature. However, kinematic anomalies in CO and asymmetric features in the bluer JWST bands (e.g., F150W) support a more disturbed nature likely due to interactions. The JWST imaging further enables us to map the distribution of stellar mass and dust attenuation, thus clarifying the relationships between different structural components, not discernable in the previous HST images. The CO J = 5 -- 4 and FIR dust continuum emission are co-spatial with a heavily-obscured starbursting core (<1 kpc) which is partially surrounded by much less obscured star-forming structures including a prominent arc, possibly a tidally-distorted dwarf galaxy, and a clump, either a sign of an ongoing violent disk instability or a recently accreted low-mass satellite. With spatially-resolved maps, we find a high molecular gas fraction in the central area reaching $\sim3$ ($M_{\text{gas}}$/$M_*$) and short depletion times ($M_{\text{gas}}/SFR\sim$ 120 Myrs) across the entire system. These observations provide insights into the complex nature of starbursts in the distant universe and underscore the wealth of complementary information from high-resolution observations with both ALMA and JWST.
△ Less
Submitted 10 May, 2024; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images
Authors:
Jaeyoung Chung,
Jeongtaek Oh,
Kyoung Mu Lee
Abstract:
In this paper, we present a method to optimize Gaussian splatting with a limited number of images while avoiding overfitting. Representing a 3D scene by combining numerous Gaussian splats has yielded outstanding visual quality. However, it tends to overfit the training views when only a small number of images are available. To address this issue, we introduce a dense depth map as a geometry guide…
▽ More
In this paper, we present a method to optimize Gaussian splatting with a limited number of images while avoiding overfitting. Representing a 3D scene by combining numerous Gaussian splats has yielded outstanding visual quality. However, it tends to overfit the training views when only a small number of images are available. To address this issue, we introduce a dense depth map as a geometry guide to mitigate overfitting. We obtained the depth map using a pre-trained monocular depth estimation model and aligning the scale and offset using sparse COLMAP feature points. The adjusted depth aids in the color-based optimization of 3D Gaussian splatting, mitigating floating artifacts, and ensuring adherence to geometric constraints. We verify the proposed method on the NeRF-LLFF dataset with varying numbers of few images. Our approach demonstrates robust geometry compared to the original method that relies solely on images. Project page: robot0321.github.io/DepthRegGS
△ Less
Submitted 4 January, 2024; v1 submitted 22 November, 2023;
originally announced November 2023.
-
LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes
Authors:
Jaeyoung Chung,
Suyoung Lee,
Hyeong** Nam,
Jaerin Lee,
Kyoung Mu Lee
Abstract:
With the widespread usage of VR devices and contents, demands for 3D scene generation techniques become more popular. Existing 3D scene generation models, however, limit the target scene to specific domain, primarily due to their training strategies using 3D scan dataset that is far from the real-world. To address such limitation, we propose LucidDreamer, a domain-free scene generation pipeline by…
▽ More
With the widespread usage of VR devices and contents, demands for 3D scene generation techniques become more popular. Existing 3D scene generation models, however, limit the target scene to specific domain, primarily due to their training strategies using 3D scan dataset that is far from the real-world. To address such limitation, we propose LucidDreamer, a domain-free scene generation pipeline by fully leveraging the power of existing large-scale diffusion-based generative model. Our LucidDreamer has two alternate steps: Dreaming and Alignment. First, to generate multi-view consistent images from inputs, we set the point cloud as a geometrical guideline for each image generation. Specifically, we project a portion of point cloud to the desired view and provide the projection as a guidance for inpainting using the generative model. The inpainted images are lifted to 3D space with estimated depth maps, composing a new points. Second, to aggregate the new points into the 3D scene, we propose an aligning algorithm which harmoniously integrates the portions of newly generated 3D scenes. The finally obtained 3D scene serves as initial points for optimizing Gaussian splats. LucidDreamer produces Gaussian splats that are highly-detailed compared to the previous 3D scene generation methods, with no constraint on domain of the target scene. Project page: https://luciddreamer-cvlab.github.io/
△ Less
Submitted 23 November, 2023; v1 submitted 22 November, 2023;
originally announced November 2023.
-
One-ninth magnetization plateau stabilized by spin entanglement in a kagome antiferromagnet
Authors:
Sungmin Jeon,
Dirk Wulferding,
Youngsu Choi,
Seungyeol Lee,
Kiwan Nam,
Kee Hoon Kim,
Minseong Lee,
Tae-Hwan Jang,
Jae-Hoon Park,
Suheon Lee,
Sungkyun Choi,
Chanhyeon Lee,
Hiroyuki Nojiri,
Kwang-Yong Choi
Abstract:
The spin-1/2 antiferromagnetic Heisenberg model on a Kagome lattice is geometrically frustrated, which is expected to promote the formation of many-body quantum entangled states. The most sought-after among these is the quantum spin liquid phase, but magnetic analogs of liquid, solid, and supersolid phases may also occur, producing fractional plateaus in the magnetization. Here, we investigate the…
▽ More
The spin-1/2 antiferromagnetic Heisenberg model on a Kagome lattice is geometrically frustrated, which is expected to promote the formation of many-body quantum entangled states. The most sought-after among these is the quantum spin liquid phase, but magnetic analogs of liquid, solid, and supersolid phases may also occur, producing fractional plateaus in the magnetization. Here, we investigate the experimental realization of these predicted phases in the Kagome material YCu3(OD)6+xBr3-x (x=0.5). By combining thermodynamic and Raman spectroscopic techniques, we provide evidence for fractionalized spinon excitations and observe the emergence of a 1/9 magnetization plateau. These observations establish YCu3(OD)6+xBr3-x as a model material for exploring the 1/9 plateau phase.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
Kondo screening in a Majorana metal
Authors:
S. Lee,
Y. S. Choi,
S. -H. Do,
W. Lee,
C. H. Lee,
M. Lee,
M. Vojta,
C. N. Wang,
H. Luetkens,
Z. Guguchia,
K. -Y. Choi
Abstract:
Kondo impurities provide a nontrivial probe to unravel the character of the excitations of a quantum spin liquid. In the S=1/2 Kitaev model on the honeycomb lattice, Kondo impurities embedded in the spin-liquid host can be screened by itinerant Majorana fermions via gauge-flux binding. Here, we report experimental signatures of metallic-like Kondo screening at intermediate temperatures in the Kita…
▽ More
Kondo impurities provide a nontrivial probe to unravel the character of the excitations of a quantum spin liquid. In the S=1/2 Kitaev model on the honeycomb lattice, Kondo impurities embedded in the spin-liquid host can be screened by itinerant Majorana fermions via gauge-flux binding. Here, we report experimental signatures of metallic-like Kondo screening at intermediate temperatures in the Kitaev honeycomb material α-RuCl3 with dilute Cr3+ (S=3/2) impurities. The static magnetic susceptibility, the muon Knight shift, and the muon spin-relaxation rate all feature logarithmic divergences, a hallmark of a metallic Kondo effect. Concurrently, the linear coefficient of the magnetic specific heat is large in the same temperature regime, indicating the presence of a host Majorana metal. This observation opens new avenues for exploring uncharted Kondo physics in insulating quantum magnets.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
How Far Can We Extract Diverse Perspectives from Large Language Models?
Authors:
Shirley Anugrah Hayati,
Minhwa Lee,
Dheeraj Rajagopal,
Dongyeop Kang
Abstract:
Collecting diverse human opinions is costly and challenging. This leads to a recent trend in collaborative efforts between humans and Large Language Models (LLMs) for generating diverse data, offering potential scalable and efficient solutions. However, the extent of LLMs' capability to generate diverse perspectives on subjective topics remains an unexplored question. In this study, we investigate…
▽ More
Collecting diverse human opinions is costly and challenging. This leads to a recent trend in collaborative efforts between humans and Large Language Models (LLMs) for generating diverse data, offering potential scalable and efficient solutions. However, the extent of LLMs' capability to generate diverse perspectives on subjective topics remains an unexplored question. In this study, we investigate LLMs' capacity for generating diverse perspectives and rationales on subjective topics, such as social norms and argumentative texts. We formulate a new problem of maximum diversity extraction from LLMs. Motivated by how humans develop their opinions through their values, we propose a criteria-based prompting technique to ground diverse opinions. To see how far we can extract diverse perspectives from LLMs, or called diversity coverage, we employ a step-by-step recall prompting for generating more outputs from the model in an iterative manner. As we apply our methods to various tasks, indeed we find that LLMs can generate diverse opinions according to the degree of task subjectivity
△ Less
Submitted 18 February, 2024; v1 submitted 16 November, 2023;
originally announced November 2023.
-
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments
Authors:
Bangzhao Shu,
Lechen Zhang,
Minje Choi,
Lavinia Dunagan,
Lajanugen Logeswaran,
Moontae Lee,
Dallas Card,
David Jurgens
Abstract:
The versatility of Large Language Models (LLMs) on natural language understanding tasks has made them popular for research in social sciences. To properly understand the properties and innate personas of LLMs, researchers have performed studies that involve using prompts in the form of questions that ask LLMs about particular opinions. In this study, we take a cautionary step back and examine whet…
▽ More
The versatility of Large Language Models (LLMs) on natural language understanding tasks has made them popular for research in social sciences. To properly understand the properties and innate personas of LLMs, researchers have performed studies that involve using prompts in the form of questions that ask LLMs about particular opinions. In this study, we take a cautionary step back and examine whether the current format of prompting LLMs elicits responses in a consistent and robust manner. We first construct a dataset that contains 693 questions encompassing 39 different instruments of persona measurement on 115 persona axes. Additionally, we design a set of prompts containing minor variations and examine LLMs' capabilities to generate answers, as well as prompt variations to examine their consistency with respect to content-level variations such as switching the order of response options or negating the statement. Our experiments on 17 different LLMs reveal that even simple perturbations significantly downgrade a model's question-answering ability, and that most LLMs have low negation consistency. Our results suggest that the currently widespread practice of prompting is insufficient to accurately and reliably capture model perceptions, and we therefore discuss potential alternatives to improve these issues.
△ Less
Submitted 1 April, 2024; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Code Models are Zero-shot Precondition Reasoners
Authors:
Lajanugen Logeswaran,
Sungryull Sohn,
Yiwei Lyu,
Anthony Zhe Liu,
Dong-Ki Kim,
Dongsub Shim,
Moontae Lee,
Honglak Lee
Abstract:
One of the fundamental skills required for an agent acting in an environment to complete tasks is the ability to understand what actions are plausible at any given point. This work explores a novel use of code representations to reason about action preconditions for sequential decision making tasks. Code representations offer the flexibility to model procedural activities and associated constraint…
▽ More
One of the fundamental skills required for an agent acting in an environment to complete tasks is the ability to understand what actions are plausible at any given point. This work explores a novel use of code representations to reason about action preconditions for sequential decision making tasks. Code representations offer the flexibility to model procedural activities and associated constraints as well as the ability to execute and verify constraint satisfaction. Leveraging code representations, we extract action preconditions from demonstration trajectories in a zero-shot manner using pre-trained code models. Given these extracted preconditions, we propose a precondition-aware action sampling strategy that ensures actions predicted by a policy are consistent with preconditions. We demonstrate that the proposed approach enhances the performance of few-shot policy learning approaches across task-oriented dialog and embodied textworld benchmarks.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
Two-Stage Predict+Optimize for Mixed Integer Linear Programs with Unknown Parameters in Constraints
Authors:
Xinyi Hu,
Jasper C. H. Lee,
Jimmy H. M. Lee
Abstract:
Consider the setting of constrained optimization, with some parameters unknown at solving time and requiring prediction from relevant features. Predict+Optimize is a recent framework for end-to-end training supervised learning models for such predictions, incorporating information about the optimization problem in the training process in order to yield better predictions in terms of the quality of…
▽ More
Consider the setting of constrained optimization, with some parameters unknown at solving time and requiring prediction from relevant features. Predict+Optimize is a recent framework for end-to-end training supervised learning models for such predictions, incorporating information about the optimization problem in the training process in order to yield better predictions in terms of the quality of the predicted solution under the true parameters. Almost all prior works have focused on the special case where the unknowns appear only in the optimization objective and not the constraints. Hu et al.~proposed the first adaptation of Predict+Optimize to handle unknowns appearing in constraints, but the framework has somewhat ad-hoc elements, and they provided a training algorithm only for covering and packing linear programs. In this work, we give a new \emph{simpler} and \emph{more powerful} framework called \emph{Two-Stage Predict+Optimize}, which we believe should be the canonical framework for the Predict+Optimize setting. We also give a training algorithm usable for all mixed integer linear programs, vastly generalizing the applicability of the framework. Experimental results demonstrate the superior prediction performance of our training framework over all classical and state-of-the-art methods.
△ Less
Submitted 14 November, 2023;
originally announced November 2023.
-
Towards Robotic Tree Manipulation: Leveraging Graph Representations
Authors:
Chung Hee Kim,
Moonyoung Lee,
Oliver Kroemer,
George Kantor
Abstract:
There is growing interest in automating agricultural tasks that require intricate and precise interaction with specialty crops, such as trees and vines. However, develo** robotic solutions for crop manipulation remains a difficult challenge due to complexities involved in modeling their deformable behavior. In this study, we present a framework for learning the deformation behavior of tree-like…
▽ More
There is growing interest in automating agricultural tasks that require intricate and precise interaction with specialty crops, such as trees and vines. However, develo** robotic solutions for crop manipulation remains a difficult challenge due to complexities involved in modeling their deformable behavior. In this study, we present a framework for learning the deformation behavior of tree-like crops under contact interaction. Our proposed method involves encoding the state of a spring-damper modeled tree crop as a graph. This representation allows us to employ graph networks to learn both a forward model for predicting resulting deformations, and a contact policy for inferring actions to manipulate tree crops. We conduct a comprehensive set of experiments in a simulated environment and demonstrate generalizability of our method on previously unseen trees. Videos can be found on the project website: https://kantor-lab.github.io/tree_gnn
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
First Measurement of $R(X_{τ/\ell})$ as an Inclusive Test of the $b \to c τν$ Anomaly
Authors:
Belle II Collaboration,
I. Adachi,
K. Adamczyk,
L. Aggarwal,
H. Ahmed,
H. Aihara,
N. Akopov,
A. Aloisio,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
V. Babu,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
S. Bansal,
M. Barrett,
J. Baudot,
A. Baur,
A. Beaubien,
F. Becherer,
J. Becker,
J. V. Bennett
, et al. (368 additional authors not shown)
Abstract:
We measure the tau-to-light-lepton ratio of inclusive $B$-meson branching fractions $R(X_{τ/\ell}) \equiv \mathcal{B}(B\to X τν)/\mathcal{B}(B \to X \ell ν)$, where $\ell$ indicates an electron or muon, and thereby test the universality of charged-current weak interactions. We select events that have one fully reconstructed $B$ meson and a charged lepton candidate from $189~\mathrm{fb}^{-1}$ of el…
▽ More
We measure the tau-to-light-lepton ratio of inclusive $B$-meson branching fractions $R(X_{τ/\ell}) \equiv \mathcal{B}(B\to X τν)/\mathcal{B}(B \to X \ell ν)$, where $\ell$ indicates an electron or muon, and thereby test the universality of charged-current weak interactions. We select events that have one fully reconstructed $B$ meson and a charged lepton candidate from $189~\mathrm{fb}^{-1}$ of electron-positron collision data collected with the Belle II detector. We find $R(X_{τ/\ell}) = 0.228 \pm 0.016~(\mathrm{stat}) \pm 0.036~(\mathrm{syst})$, in agreement with standard-model expectations. This is the first direct measurement of $R(X_{τ/\ell})$.
△ Less
Submitted 29 May, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Alpha backgrounds in NaI(Tl) crystals of COSINE-100
Authors:
G. Adhikari,
N. Carlin,
D. F. F. S. Cavalcante,
J. Y. Cho,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. Franca,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
S. W. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim
, et al. (38 additional authors not shown)
Abstract:
COSINE-100 is a dark matter direct detection experiment with 106 kg NaI(Tl) as the target material. 210Pb and daughter isotopes are a dominant background in the WIMP region of interest and are detected via beta decay and alpha decay. Analysis of the alpha channel complements the background model as observed in the beta/gamma channel. We present the measurement of the quenching factors and Monte Ca…
▽ More
COSINE-100 is a dark matter direct detection experiment with 106 kg NaI(Tl) as the target material. 210Pb and daughter isotopes are a dominant background in the WIMP region of interest and are detected via beta decay and alpha decay. Analysis of the alpha channel complements the background model as observed in the beta/gamma channel. We present the measurement of the quenching factors and Monte Carlo simulation results and activity quantification of the alpha decay components of the COSINE-100 NaI(Tl) crystals. The data strongly indicate that the alpha decays probabilistically undergo two possible quenching factors but require further investigation. The fitted results are consistent with independent measurements and improve the overall understanding of the COSINE-100 backgrounds. Furthermore, the half-life of 216Po has been measured to be 143.4 +/- 1.2 ms, which is consistent with and more precise than recent measurements.
△ Less
Submitted 30 January, 2024; v1 submitted 8 November, 2023;
originally announced November 2023.
-
Cosmic Vine: A z=3.44 large-scale structure hosting massive quiescent galaxies
Authors:
Shuowen **,
Nikolaj B. Sillassen,
Georgios E. Magdis,
Malte Brinch,
Marko Shuntov,
Gabriel Brammer,
Raphael Gobat,
Francesco Valentino,
Adam C. Carnall,
Minju Lee,
Aswin P. Vijayan,
Steven Gillman,
Vasily Kokorev,
Aurélien Le Bail,
Thomas R. Greve,
Bitten Gullberg,
Katriona M. L. Gould,
Sune Toft
Abstract:
We report the discovery of a large-scale structure at z=3.44 revealed by JWST data in the Extended Groth Strip (EGS) field. This structure, called the Cosmic Vine, consists of 20 galaxies with spectroscopic redshifts at 3.43<z<3.45 and six galaxy overdensities ($4-7σ$) with consistent photometric redshifts, making up a vine-like structure extending over a ~4x0.2 pMpc^2 area. The two most massive g…
▽ More
We report the discovery of a large-scale structure at z=3.44 revealed by JWST data in the Extended Groth Strip (EGS) field. This structure, called the Cosmic Vine, consists of 20 galaxies with spectroscopic redshifts at 3.43<z<3.45 and six galaxy overdensities ($4-7σ$) with consistent photometric redshifts, making up a vine-like structure extending over a ~4x0.2 pMpc^2 area. The two most massive galaxies ($M_*\approx10^{10.9}~M_\odot$) of the Cosmic Vine are found to be quiescent with bulge-dominated morphologies ($B/T>70\%$). Comparisons with simulations suggest that the Cosmic Vine would form a cluster with halo mass $M_{\rm halo}>10^{14}M_\odot$ at z=0, and the two massive galaxies are likely forming the brightest cluster galaxies (BCGs). The results unambiguously reveal that massive quiescent galaxies can form in growing large-scale structures at z>3, thus disfavoring the environmental quenching mechanisms that require a virialized cluster core. Instead, as suggested by the interacting and bulge-dominated morphologies, the two galaxies are likely quenched by merger-triggered starburst or active galactic nucleus (AGN) feedback before falling into a cluster core. Moreover, we found that the observed specific star formation rates of massive quiescent galaxies in z>3 dense environments are one to two orders of magnitude lower than that of the BCGs in the TNG300 simulation. This discrepancy potentially poses a challenge to the models of massive cluster galaxy formation. Future studies comparing a large sample with dedicated cluster simulations are required to solve the problem.
△ Less
Submitted 18 February, 2024; v1 submitted 8 November, 2023;
originally announced November 2023.
-
Towards Autonomous Crop Monitoring: Inserting Sensors in Cluttered Environments
Authors:
Moonyoung Lee,
Aaron Berger,
Dominic Guri,
Kevin Zhang,
Lisa Coffee,
George Kantor,
Oliver Kroemer
Abstract:
We present a contact-based phenoty** robot platform that can autonomously insert nitrate sensors into cornstalks to proactively monitor macronutrient levels in crops. This task is challenging because inserting such sensors requires sub-centimeter precision in an environment which contains high levels of clutter, lighting variation, and occlusion. To address these challenges, we develop a robust…
▽ More
We present a contact-based phenoty** robot platform that can autonomously insert nitrate sensors into cornstalks to proactively monitor macronutrient levels in crops. This task is challenging because inserting such sensors requires sub-centimeter precision in an environment which contains high levels of clutter, lighting variation, and occlusion. To address these challenges, we develop a robust perception-action pipeline to detect and grasp stalks, and create a custom robot gripper which mechanically aligns the sensor before inserting it into the stalk. Through experimental validation on 48 unique stalks in a cornfield in Iowa, we demonstrate our platform's capability of detecting a stalk with 94% success, gras** a stalk with 90% success, and inserting a sensor with 60% success. In addition to develo** an autonomous phenoty** research platform, we share key challenges and insights obtained from deployment in the field. Our research platform is open-sourced, with additional information available at https://kantor-lab.github.io/cornbot.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Visual-information-driven model for crowd simulation using temporal convolutional network
Authors:
Xuanwen Liang,
Eric Wai Ming Lee
Abstract:
Crowd simulations play a pivotal role in building design, influencing both user experience and public safety. While traditional knowledge-driven models have their merits, data-driven crowd simulation models promise to bring a new dimension of realism to these simulations. However, most of the existing data-driven models are designed for specific geometries, leading to poor adaptability and applica…
▽ More
Crowd simulations play a pivotal role in building design, influencing both user experience and public safety. While traditional knowledge-driven models have their merits, data-driven crowd simulation models promise to bring a new dimension of realism to these simulations. However, most of the existing data-driven models are designed for specific geometries, leading to poor adaptability and applicability. A promising strategy for enhancing the adaptability and realism of data-driven crowd simulation models is to incorporate visual information, including the scenario geometry and pedestrian locomotion. Consequently, this paper proposes a novel visual-information-driven (VID) crowd simulation model. The VID model predicts the pedestrian velocity at the next time step based on the prior social-visual information and motion data of an individual. A radar-geometry-locomotion method is established to extract the visual information of pedestrians. Moreover, a temporal convolutional network (TCN)-based deep learning model, named social-visual TCN, is developed for velocity prediction. The VID model is tested on three public pedestrian motion datasets with distinct geometries, i.e., corridor, corner, and T-junction. Both qualitative and quantitative metrics are employed to evaluate the VID model, and the results highlight the improved adaptability of the model across all three geometric scenarios. Overall, the proposed method demonstrates effectiveness in enhancing the adaptability of data-driven crowd models.
△ Less
Submitted 9 April, 2024; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Calderón-Zygmund estimates for nonlinear equations of differential forms with BMO coefficients
Authors:
Mikyoung Lee,
Jihoon Ok,
Juncheol Pyo
Abstract:
We obtain $L^q$-regularity estimates for weak solutions to $p$-Laplacian type equations of differential forms. In particular, we prove local Calderón-Zygmund type estimates for equations with discontinuous coefficients satisfying the bounded mean oscillation (BMO) condition.
We obtain $L^q$-regularity estimates for weak solutions to $p$-Laplacian type equations of differential forms. In particular, we prove local Calderón-Zygmund type estimates for equations with discontinuous coefficients satisfying the bounded mean oscillation (BMO) condition.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Neural Collage Transfer: Artistic Reconstruction via Material Manipulation
Authors:
Ganghun Lee,
Minji Kim,
Yunsu Lee,
Minsu Lee,
Byoung-Tak Zhang
Abstract:
Collage is a creative art form that uses diverse material scraps as a base unit to compose a single image. Although pixel-wise generation techniques can reproduce a target image in collage style, it is not a suitable method due to the solid stroke-by-stroke nature of the collage form. While some previous works for stroke-based rendering produced decent sketches and paintings, collages have receive…
▽ More
Collage is a creative art form that uses diverse material scraps as a base unit to compose a single image. Although pixel-wise generation techniques can reproduce a target image in collage style, it is not a suitable method due to the solid stroke-by-stroke nature of the collage form. While some previous works for stroke-based rendering produced decent sketches and paintings, collages have received much less attention in research despite their popularity as a style. In this paper, we propose a method for learning to make collages via reinforcement learning without the need for demonstrations or collage artwork data. We design the collage Markov Decision Process (MDP), which allows the agent to handle various materials and propose a model-based soft actor-critic to mitigate the agent's training burden derived from the sophisticated dynamics of collage. Moreover, we devise additional techniques such as active material selection and complexity-based multi-scale collage to handle target images at any size and enhance the results' aesthetics by placing relatively more scraps in areas of high complexity. Experimental results show that the trained agent appropriately selected and pasted materials to regenerate the target image into a collage and obtained a higher evaluation score on content and style than pixel-wise generation methods. Code is available at https://github.com/northadventure/CollageRL.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
Quatro++: Robust Global Registration Exploiting Ground Segmentation for Loop Closing in LiDAR SLAM
Authors:
Hyungtae Lim,
Beomsoo Kim,
Daebeom Kim,
Eungchang Mason Lee,
Hyun Myung
Abstract:
Global registration is a fundamental task that estimates the relative pose between two viewpoints of 3D point clouds. However, there are two issues that degrade the performance of global registration in LiDAR SLAM: one is the sparsity issue and the other is degeneracy. The sparsity issue is caused by the sparse characteristics of the 3D point cloud measurements in a mechanically spinning LiDAR sen…
▽ More
Global registration is a fundamental task that estimates the relative pose between two viewpoints of 3D point clouds. However, there are two issues that degrade the performance of global registration in LiDAR SLAM: one is the sparsity issue and the other is degeneracy. The sparsity issue is caused by the sparse characteristics of the 3D point cloud measurements in a mechanically spinning LiDAR sensor. The degeneracy issue sometimes occurs because the outlier-rejection methods reject too many correspondences, leaving less than three inliers. These two issues have become more severe as the pose discrepancy between the two viewpoints of 3D point clouds becomes greater. To tackle these problems, we propose a robust global registration framework, called \textit{Quatro++}. Extending our previous work that solely focused on the global registration itself, we address the robust global registration in terms of the loop closing in LiDAR SLAM. To this end, ground segmentation is exploited to achieve robust global registration. Through the experiments, we demonstrate that our proposed method shows a higher success rate than the state-of-the-art global registration methods, overcoming the sparsity and degeneracy issues. In addition, we show that ground segmentation significantly helps to increase the success rate for the ground vehicles. Finally, we apply our proposed method to the loop closing module in LiDAR SLAM and confirm that the quality of the loop constraints is improved, showing more precise map** results. Therefore, the experimental evidence corroborated the suitability of our method as an initial alignment in the loop closing. Our code is available at https://quatro-plusplus.github.io.
△ Less
Submitted 21 January, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving
Authors:
Zhan Ling,
Yunhao Fang,
Xuanlin Li,
Tongzhou Mu,
Mingu Lee,
Reza Pourreza,
Roland Memisevic,
Hao Su
Abstract:
Large Language Models (LLMs) have achieved tremendous progress, yet they still often struggle with challenging reasoning problems. Current approaches address this challenge by sampling or searching detailed and low-level reasoning chains. However, these methods are still limited in their exploration capabilities, making it challenging for correct solutions to stand out in the huge solution space.…
▽ More
Large Language Models (LLMs) have achieved tremendous progress, yet they still often struggle with challenging reasoning problems. Current approaches address this challenge by sampling or searching detailed and low-level reasoning chains. However, these methods are still limited in their exploration capabilities, making it challenging for correct solutions to stand out in the huge solution space. In this work, we unleash LLMs' creative potential for exploring multiple diverse problem solving strategies by framing an LLM as a hierarchical policy via in-context learning. This policy comprises of a visionary leader that proposes multiple diverse high-level problem-solving tactics as hints, accompanied by a follower that executes detailed problem-solving processes following each of the high-level instruction. The follower uses each of the leader's directives as a guide and samples multiple reasoning chains to tackle the problem, generating a solution group for each leader proposal. Additionally, we propose an effective and efficient tournament-based approach to select among these explored solution groups to reach the final answer. Our approach produces meaningful and inspiring hints, enhances problem-solving strategy exploration, and improves the final answer accuracy on challenging problems in the MATH dataset. Code will be released at https://github.com/lz1oceani/LLM-As-Hierarchical-Policy.
△ Less
Submitted 5 December, 2023; v1 submitted 1 November, 2023;
originally announced November 2023.
-
DEIMOS spectroscopy of $z=6$ protocluster candidate in COSMOS -- A massive protocluster embedded in a large scale structure?
Authors:
Malte Brinch,
Thomas R. Greve,
David B. Sanders,
Conor J. R. McPartland,
Nima Chartab,
Steven Gillman,
Aswin P. Vijayan,
Minju M. Lee,
Gabriel Brammer,
Caitlin M. Casey,
Olivier Ilbert,
Shuowen **,
Georgios Magdis,
H. J. McCracken,
Nikolaj B. Sillassen,
Sune Toft,
Jorge A. Zavala
Abstract:
We present the results of our Keck/DEIMOS spectroscopic follow-up of candidate galaxies of i-band-dropout protocluster candidate galaxies at $z\sim6$ in the COSMOS field. We securely detect Lyman-$α$ emission lines in 14 of the 30 objects targeted, 10 of them being at $z=6$ with a signal-to-noise ratio of $5-20$, the remaining galaxies are either non-detections or interlopers with redshift too dif…
▽ More
We present the results of our Keck/DEIMOS spectroscopic follow-up of candidate galaxies of i-band-dropout protocluster candidate galaxies at $z\sim6$ in the COSMOS field. We securely detect Lyman-$α$ emission lines in 14 of the 30 objects targeted, 10 of them being at $z=6$ with a signal-to-noise ratio of $5-20$, the remaining galaxies are either non-detections or interlopers with redshift too different from $z=6$ to be part of the protocluster. The 10 galaxies at $z\approx6$ make the protocluster one of the riches at $z>5$. The emission lines exhibit asymmetric profiles with high skewness values ranging from 2.87 to 31.75, with a median of 7.37. This asymmetry is consistent with them being Ly$α$, resulting in a redshift range of $z=5.85-6.08$. Using the spectroscopic redshifts, we re-calculate the overdensity map for the COSMOS field and find the galaxies to be in a significant overdensity at the $4σ$ level, with a peak overdensity of $δ=11.8$ (compared to the previous value of $δ=9.2$). The protocluster galaxies have stellar masses derived from Bagpipes SED fits of $10^{8.29}-10^{10.28} \rm \,M_{\rm \odot}$ and star formation rates of $2-39\,\rm M_{\rm \odot}\rm\,yr^{-1}$, placing them on the main sequence at this epoch. Using a stellar-to-halo-mass relationship, we estimate the dark matter halo mass of the most massive halo in the protocluster to be $\sim 10^{12}\rm M_{\rm \odot}$. By comparison with halo mass evolution tracks from simulations, the protocluster is expected to evolve into a Virgo- or Coma-like cluster in the present day.
△ Less
Submitted 18 December, 2023; v1 submitted 1 November, 2023;
originally announced November 2023.
-
High dust content of a quiescent galaxy at z~2 revealed by deep ALMA observation
Authors:
Minju M. Lee,
Charles C. Steidel,
Gabriel Brammer,
Natascha Förster-Schreiber,
Alvio Renzini,
Daizhong Liu,
Rodrigo Herrera-Camus,
Thorsten Naab,
Sedona H. Price,
Hannah Übler,
Sebastián Arriagada,
Georgios Magdis
Abstract:
We report the detection of cold dust in an apparently quiescent massive galaxy ($\log({M_{\star}/M_{\odot}})\approx11$) at $z\sim2$ (G4). The source is identified as a serendipitous 2 mm continuum source in a deep ALMA observation within the field of Q2343-BX610, a $z=2.21$ massive star-forming disk galaxy. Available multi-band photometry of G4 suggests redshift of $z\sim2$ and a low specific star…
▽ More
We report the detection of cold dust in an apparently quiescent massive galaxy ($\log({M_{\star}/M_{\odot}})\approx11$) at $z\sim2$ (G4). The source is identified as a serendipitous 2 mm continuum source in a deep ALMA observation within the field of Q2343-BX610, a $z=2.21$ massive star-forming disk galaxy. Available multi-band photometry of G4 suggests redshift of $z\sim2$ and a low specific star-formation rate (sSFR), $\log(SFR/M_{\star}) [yr^{-1}] \approx -10.2$, corresponding to $\approx1.2$ dex below the $z=2$ main sequence (MS). G4 appears to be a peculiar dust-rich quiescent galaxy for its stellar mass ($\log({M_{\rm dust}/M_{\star}}) = -2.71 \pm 0.26$), with its estimated mass-weighted age ($\sim$ 1-2 Gyr). We compile $z\gtrsim1$ quiescent galaxies in the literature and discuss their age-$Δ$MS and $\log({M_{\rm dust}/M_{\star}})$-age relations to investigate passive evolution and dust depletion scale. A long dust depletion time and its morphology suggest morphological quenching along with less efficient feedback that could have acted on G4. The estimated dust yield for G4 further supports this idea, requiring efficient survival of dust and/or grain growth, and rejuvenation (or additional accretion). Follow-up observations probing the stellar light and cold dust peak are necessary to understand the implication of these findings in the broader context of galaxy evolutionary studies and quenching in the early universe.
△ Less
Submitted 31 October, 2023;
originally announced November 2023.
-
Gravitational-wave Electromagnetic Counterpart Korean Observatory (GECKO): GECKO Follow-up Observation of GW190425
Authors:
Gregory S. H. Paek,
Myungshin Im,
Joonho Kim,
Gu Lim,
Bomi Park,
Changsu Choi,
Sophia Kim,
Claudio Barbieri,
Om Sharan Salafia,
Insu Paek,
Suhyun Shin,
**guk Seo,
Hyung Mok Lee,
Chung-Uk Lee,
Seung-Lee Kim,
Hyun-Il Sung
Abstract:
One of the keys to the success of multimessenger astronomy is the rapid identification of the electromagnetic wave counterpart, kilonova (KN), of the gravitational-wave (GW) event. Despite its importance, it is hard to find a KN associated with a GW event, due to a poorly constrained GW localization map and numerous signals that could be confused as a KN. Here, we present the Gravitational-wave El…
▽ More
One of the keys to the success of multimessenger astronomy is the rapid identification of the electromagnetic wave counterpart, kilonova (KN), of the gravitational-wave (GW) event. Despite its importance, it is hard to find a KN associated with a GW event, due to a poorly constrained GW localization map and numerous signals that could be confused as a KN. Here, we present the Gravitational-wave Electromagnetic wave Counterpart Korean Observatory (GECKO) project, the GECKO observation of GW190425, and prospects of GECKO in the fourth observing run (O4) of the GW detectors. We outline our follow-up observation strategies during O3. In particular, we describe our galaxy-targeted observation criteria that prioritize based on galaxy properties. Armed with this strategy, we performed an optical and/or near-infrared follow-up observation of GW190425, the first binary neutron star merger event during the O3 run. Despite a vast localization area of 7460 deg^2, we observed 621 host galaxy candidates, corresponding to 29.5% of the scores we assigned, with most of them observed within the first 3 days of the GW event. Ten transients were discovered during this search, including a new transient with a host galaxy. No plausible KN was found, but we were still able to constrain the properties of potential KNe using upper limits. The GECKO observation demonstrates that GECKO can possibly uncover a GW170817-like KN at a distance less than 200 Mpc if the localization area is of the order of hundreds of square degrees, providing a bright prospect for the identification of GW electromagnetic wave counterparts during the O4 run.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning
Authors:
Zheyuan Zhang,
Shane Storks,
Fengyuan Hu,
Sungryull Sohn,
Moontae Lee,
Honglak Lee,
Joyce Chai
Abstract:
Pre-trained language models (PLMs) have shown impressive performance in various language tasks. However, they are prone to spurious correlations, and often generate illusory information. In real-world applications, PLMs should justify decisions with formalized, coherent reasoning chains, but this challenge remains under-explored. Cognitive psychology theorizes that humans are capable of utilizing…
▽ More
Pre-trained language models (PLMs) have shown impressive performance in various language tasks. However, they are prone to spurious correlations, and often generate illusory information. In real-world applications, PLMs should justify decisions with formalized, coherent reasoning chains, but this challenge remains under-explored. Cognitive psychology theorizes that humans are capable of utilizing fast and intuitive heuristic thinking to make decisions based on past experience, then rationalizing the decisions through slower and deliberative analytic reasoning. We incorporate these interlinked dual processes in fine-tuning and in-context learning with PLMs, applying them to two language understanding tasks that require coherent physical commonsense reasoning. We show that our proposed Heuristic-Analytic Reasoning (HAR) strategies drastically improve the coherence of rationalizations for model decisions, yielding state-of-the-art results on Tiered Reasoning for Intuitive Physics (TRIP). We also find that this improved coherence is a direct result of more faithful attention to relevant language context in each step of reasoning. Our findings suggest that human-like reasoning strategies can effectively improve the coherence and reliability of PLM reasoning.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Peccei-Quinn Inflation at the Pole and Axion Kinetic Misalignment
Authors:
Hyun Min Lee,
Adriana G. Menkara,
Myeong-Jung Seong,
Jun-Ho Song
Abstract:
We propose a minimal extension of the Standard Model with the Peccei-Quinn (PQ) scalar field and explain the relic density of the QCD axion through the kinetic misalignment with a relatively small axion decay constant. To this purpose, we consider a slow-roll inflation from the radial component of the PQ field with the PQ conserving potential near the pole of its kinetic term and investigate the p…
▽ More
We propose a minimal extension of the Standard Model with the Peccei-Quinn (PQ) scalar field and explain the relic density of the QCD axion through the kinetic misalignment with a relatively small axion decay constant. To this purpose, we consider a slow-roll inflation from the radial component of the PQ field with the PQ conserving potential near the pole of its kinetic term and investigate the post-inflationary dynamics of the PQ field for reheating. The angular mode of the PQ field, identified with the QCD axion, receives a nonzero velocity during inflation due to the PQ violating potential, evolving with an approximately conserved Noether PQ charge. We determine the reheating temperature from the perturbative decays and scattering processes of the inflaton and obtain dark radiation from the axions produced from the inflaton scattering at a testable level in the future Cosmic Microwave Background experiments. We show the correlation between the reheating temperature, the initial velocity of the axion and the axion decay constant, realizing the axion kinetic misalignment for the correct relic density.
△ Less
Submitted 1 May, 2024; v1 submitted 26 October, 2023;
originally announced October 2023.
-
DeepVox and SAVE-CT: a contrast- and dose-independent 3D deep learning approach for thoracic aorta segmentation and aneurysm prediction using computed tomography scans
Authors:
Matheus del-Valle,
Lariza Laura de Oliveira,
Henrique Cursino Vieira,
Henrique Min Ho Lee,
Lucas Lembrança Pinheiro,
Maria Fernanda Portugal,
Newton Shydeo Brandão Miyoshi,
Nelson Wolosker
Abstract:
Thoracic aortic aneurysm (TAA) is a fatal disease which potentially leads to dissection or rupture through progressive enlargement of the aorta. It is usually asymptomatic and screening recommendation are limited. The gold-standard evaluation is performed by computed tomography angiography (CTA) and radiologists time-consuming assessment. Scans for other indications could help on this screening, h…
▽ More
Thoracic aortic aneurysm (TAA) is a fatal disease which potentially leads to dissection or rupture through progressive enlargement of the aorta. It is usually asymptomatic and screening recommendation are limited. The gold-standard evaluation is performed by computed tomography angiography (CTA) and radiologists time-consuming assessment. Scans for other indications could help on this screening, however if acquired without contrast enhancement or with low dose protocol, it can make the clinical evaluation difficult, besides increasing the scans quantity for the radiologists. In this study, it was selected 587 unique CT scans including control and TAA patients, acquired with low and standard dose protocols, with or without contrast enhancement. A novel segmentation model, DeepVox, exhibited dice score coefficients of 0.932 and 0.897 for development and test sets, respectively, with faster training speed in comparison to models reported in the literature. The novel TAA classification model, SAVE-CT, presented accuracies of 0.930 and 0.922 for development and test sets, respectively, using only the binary segmentation mask from DeepVox as input, without hand-engineered features. These two models together are a potential approach for TAA screening, as they can handle variable number of slices as input, handling thoracic and thoracoabdominal sequences, in a fully automated contrast- and dose-independent evaluation. This may assist to decrease TAA mortality and prioritize the evaluation queue of patients for radiologists.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.