-
Comparison of optical spectra between asteroids Ryugu and Bennu: II. High-precision analysis for space weathering trends
Authors:
K. Yumoto,
E. Tatsumi,
T. Kouyama,
D. R. Golish,
Y. Cho,
T. Morota,
S. Kameda,
H. Sato,
B. Rizk,
D. N. DellaGiustina,
Y. Yokota,
H. Suzuki,
J. de León,
H. Campins,
J. Licandro,
M. Popescu,
J. L. Rizos,
R. Honda,
M. Yamada,
N. Sakatani,
C. Honda,
M. Matsuoka,
M. Hayakawa,
H. Sawada,
K. Ogawa
, et al. (3 additional authors not shown)
Abstract:
The influence of space weathering on the observed spectra of C-complex asteroids remains uncertain. This has long hindered our understanding of their composition through telescope observations. Multi-band imaging of Ryugu by ONC-T on Hayabusa2 and that of Bennu by MapCam on OSIRIS-REx found opposite spectral trends of space weathering; Ryugu darkened/reddened while Bennu brightened/blued. How the…
▽ More
The influence of space weathering on the observed spectra of C-complex asteroids remains uncertain. This has long hindered our understanding of their composition through telescope observations. Multi-band imaging of Ryugu by ONC-T on Hayabusa2 and that of Bennu by MapCam on OSIRIS-REx found opposite spectral trends of space weathering; Ryugu darkened/reddened while Bennu brightened/blued. How the spectra of Ryugu and Bennu evolved relative to each other would place a constraint for understanding their origins and evolutions. In this study, we compared the space weathering trends on Ryugu and Bennu by applying the results of cross calibration between ONC-T and MapCam. We show that the average Bennu surface is brighter by 18.0 $\pm$ 1.5% at 550 nm and bluer by 0.18 $\pm$ 0.03 $μ$m$^{-1}$ (480-850 nm slope) than Ryugu. The spectral slopes of surface materials are more uniform on Bennu than on Ryugu at spatial scales $\gtrsim$1 m, but Bennu is more heterogeneous at $\lesssim$1 m. This suggests that lateral mixing due to resurfacing may have been more efficient on Bennu. The reflectance-spectral slope distributions of craters on Ryugu and Bennu appeared to follow two trend lines with an offset before cross calibration, but they converged to a single straight trend without a bend after cross calibration. We show that the spectra of the freshest craters on Ryugu and Bennu are indistinguishable within the uncertainty of cross calibration. These results suggest that Ryugu and Bennu initially had similar spectra before space weathering and that they evolved in completely opposite directions along the same trend line, subsequently evolving into asteroids with different disk-averaged spectra. These findings further suggest that space weathering likely expanded the spectral slope variation of C-complex asteroids, implying that they may have formed from materials with more uniform spectral slopes.
△ Less
Submitted 7 July, 2024;
originally announced July 2024.
-
Vortex confinement through an unquantized magnetic flux
Authors:
Geunyong Kim,
**young Yun,
**ho Yang,
Ilkyu Yang,
Dirk Wulferding,
Roman Movshovich,
Gil Young Cho,
Ki-Seok Kim,
Garam Hahn,
Jeehoon Kim
Abstract:
Geometrically confined superconductors often experience a breakdown in the quantization of magnetic flux owing to the incomplete screening of the supercurrent against the field penetration. In this study, we report that the confinement of a magnetic field occurs regardless of the dimensionality of the system, extending even to 1D linear potential systems. By utilizing a vector-field magnetic force…
▽ More
Geometrically confined superconductors often experience a breakdown in the quantization of magnetic flux owing to the incomplete screening of the supercurrent against the field penetration. In this study, we report that the confinement of a magnetic field occurs regardless of the dimensionality of the system, extending even to 1D linear potential systems. By utilizing a vector-field magnetic force microscope, we successfully create a vortex-antivortex pair connected by a 1D unquantized magnetic flux in ultra-thin superconducting films. Through an investigation of the manipulation and thermal behavior of the vortex pair, we uncover a long-range interaction mediated by the unquantized magnetic flux. These findings suggest a universal phenomenon of unquantized magnetic flux formation, independent of the geometry of the system. Our results present an experimental route for probing the impact of confinement on superconducting properties and order parameters in unconventional superconductors characterized by extremely low dimensionality.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation
Authors:
Takyoung Kim,
Kyungjae Lee,
Young Rok Jang,
Ji Yong Cho,
Gangwoo Kim,
Minseok Cho,
Moontae Lee
Abstract:
Interactions with billion-scale large language models typically yield long-form responses due to their extensive parametric capacities, along with retrieval-augmented features. While detailed responses provide insightful viewpoint of a specific subject, they frequently generate redundant and less engaging content that does not meet user interests. In this work, we focus on the role of query outlin…
▽ More
Interactions with billion-scale large language models typically yield long-form responses due to their extensive parametric capacities, along with retrieval-augmented features. While detailed responses provide insightful viewpoint of a specific subject, they frequently generate redundant and less engaging content that does not meet user interests. In this work, we focus on the role of query outlining (i.e., selected sequence of queries) in scenarios that users request a specific range of information, namely coverage-conditioned ($C^2$) scenarios. For simulating $C^2$ scenarios, we construct QTree, 10K sets of information-seeking queries decomposed with various perspectives on certain topics. By utilizing QTree, we train QPlanner, a 7B language model generating customized query outlines that follow coverage-conditioned queries. We analyze the effectiveness of generated outlines through automatic and human evaluation, targeting on retrieval-augmented generation (RAG). Moreover, the experimental results demonstrate that QPlanner with alignment training can further provide outlines satisfying diverse user interests. Our resources are available at https://github.com/youngerous/qtree.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
The global dynamics for the Maxwell-Dirac system
Authors:
Yonggeun Cho,
Kiyeon Lee
Abstract:
In this paper, we study the (1+3) dimensional massive Maxwell-Dirac system in the context of global existence and asymptotic behavior of solutions under the Lorenz gauge condition, as well as the modified and linear scattering phenomena for the Dirac spinor and the electromagnetic potential, respectively. We employ a vector fields energy method combined with a detailed analysis of the space-time r…
▽ More
In this paper, we study the (1+3) dimensional massive Maxwell-Dirac system in the context of global existence and asymptotic behavior of solutions under the Lorenz gauge condition, as well as the modified and linear scattering phenomena for the Dirac spinor and the electromagnetic potential, respectively. We employ a vector fields energy method combined with a detailed analysis of the space-time resonance argument. This approach allows us to establish decay estimates and energy bounds crucial for proving the main theorems. Especially, we provide the explicit phase correction arising from the strong nonlinear resonances.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Quantum Multi-Agent Reinforcement Learning for Cooperative Mobile Access in Space-Air-Ground Integrated Networks
Authors:
Gyu Seon Kim,
Yeryeong Cho,
Jaehyun Chung,
Soohyun Park,
Soyi Jung,
Zhu Han,
Joongheon Kim
Abstract:
Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance unmanned aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for prov…
▽ More
Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance unmanned aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for providing cooperatively global access sustainability and energy efficiency. However, as the number of CubeSats and HALE-UAVs, increases, the scheduling dimension of each ground station (GS) increases. As a result, each GS can fall into the curse of dimensionality, and this challenge becomes one major hurdle for efficient global access. Therefore, this paper provides a quantum multi-agent reinforcement Learning (QMARL)-based method for scheduling between GSs and CubeSats/HALE-UAVs in order to improve global access availability and energy efficiency. The main reason why the QMARL-based scheduler can be beneficial is that the algorithm facilitates a logarithmic-scale reduction in scheduling action dimensions, which is one critical feature as the number of CubeSats and HALE-UAVs expands. Additionally, individual GSs have different traffic demands depending on their locations and characteristics, thus it is essential to provide differentiated access services. The superiority of the proposed scheduler is validated through data-intensive experiments in realistic CubeSat/HALE-UAV settings.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Simulating nonlinear optical processes on a superconducting quantum device
Authors:
Yuan Shi,
Bram Evert,
Amy F. Brown,
Vinay Tripathi,
Eyob A. Sete,
Vasily Geyko,
Yu** Cho,
Jonathan L DuBois,
Daniel Lidar,
Ilon Joseph,
Matt Reagor
Abstract:
Simulating plasma physics on quantum computers is difficult, because most problems of interest are nonlinear, but quantum computers are not naturally suitable for nonlinear operations. In weakly nonlinear regimes, plasma problems can be modeled as wave-wave interactions. In this paper, we develop a quantization approach to convert nonlinear wave-wave interaction problems to Hamiltonian simulation…
▽ More
Simulating plasma physics on quantum computers is difficult, because most problems of interest are nonlinear, but quantum computers are not naturally suitable for nonlinear operations. In weakly nonlinear regimes, plasma problems can be modeled as wave-wave interactions. In this paper, we develop a quantization approach to convert nonlinear wave-wave interaction problems to Hamiltonian simulation problems. We demonstrate our approach using two qubits on a superconducting device. Unlike a photonic device, a superconducting device does not naturally have the desired interactions in its native Hamiltonian. Nevertheless, Hamiltonian simulations can still be performed by decomposing required unitary operations into native gates. To improve experimental results, we employ a range of error mitigation techniques. Apart from readout error mitigation, we use randomized compilation to transform undiagnosed coherent errors into well-behaved stochastic Pauli channels. Moreover, to compensate for stochastic noise, we rescale exponentially decaying probability amplitudes using rates measured from cycle benchmarking. We carefully consider how different choices of product-formula algorithms affect the overall error and show how a trade-off can be made to best utilize limited quantum resources. This study provides a point example of how plasma problems may be solved on near-term quantum computing platforms.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
LLM-Mediated Domain-Specific Voice Agents: The Case of TextileBot
Authors:
Shu Zhong,
Elia Gatti,
James Hardwick,
Miriam Ribul,
Youngjun Cho,
Marianna Obrist
Abstract:
Develo** domain-specific conversational agents (CAs) has been challenged by the need for extensive domain-focused data. Recent advancements in Large Language Models (LLMs) make them a viable option as a knowledge backbone. LLMs behaviour can be enhanced through prompting, instructing them to perform downstream tasks in a zero-shot fashion (i.e. without training). To this end, we incorporated str…
▽ More
Develo** domain-specific conversational agents (CAs) has been challenged by the need for extensive domain-focused data. Recent advancements in Large Language Models (LLMs) make them a viable option as a knowledge backbone. LLMs behaviour can be enhanced through prompting, instructing them to perform downstream tasks in a zero-shot fashion (i.e. without training). To this end, we incorporated structural knowledge into prompts and used prompted LLMs to build domain-specific voice-based CAs. We demonstrate this approach for the specific domain of textile circularity in form of the design, development, and evaluation of TextileBot. We present the design and development of the voice agent TextileBot and also the insights from an in-person user study (N=30) evaluating three variations of TextileBots. We analyse the human-agent interactions, combining quantitative and qualitative methods. Our results suggest that participants engaged in multi-turn conversations, and their perceptions of the three variation agents and respective interactions varied demonstrating the effectiveness of our prompt-based LLM approach. We discuss the dynamics of these interactions and their implications for designing future voice-based CAs. The results show that our method's potential for building domain-specific CAs. Furthermore, most participants engaged in multi-turn conversations, and their perceptions of the three voice agents and respective interactions varied demonstrating the effectiveness of our prompt-based LLM approach. We discuss the dynamics of these interactions and their implications for designing future voice-based CAs.
△ Less
Submitted 15 June, 2024;
originally announced June 2024.
-
Exploring Human-AI Perception Alignment in Sensory Experiences: Do LLMs Understand Textile Hand?
Authors:
Shu Zhong,
Elia Gatti,
Youngjun Cho,
Marianna Obrist
Abstract:
Aligning large language models (LLMs) behaviour with human intent is critical for future AI. An important yet often overlooked aspect of this alignment is the perceptual alignment. Perceptual modalities like touch are more multifaceted and nuanced compared to other sensory modalities such as vision. This work investigates how well LLMs align with human touch experiences using the "textile hand" ta…
▽ More
Aligning large language models (LLMs) behaviour with human intent is critical for future AI. An important yet often overlooked aspect of this alignment is the perceptual alignment. Perceptual modalities like touch are more multifaceted and nuanced compared to other sensory modalities such as vision. This work investigates how well LLMs align with human touch experiences using the "textile hand" task. We created a "Guess What Textile" interaction in which participants were given two textile samples -- a target and a reference -- to handle. Without seeing them, participants described the differences between them to the LLM. Using these descriptions, the LLM attempted to identify the target textile by assessing similarity within its high-dimensional embedding space. Our results suggest that a degree of perceptual alignment exists, however varies significantly among different textile samples. For example, LLM predictions are well aligned for silk satin, but not for cotton denim. Moreover, participants didn't perceive their textile experiences closely matched by the LLM predictions. This is only the first exploration into perceptual alignment around touch, exemplified through textile hand. We discuss possible sources of this alignment variance, and how better human-AI perceptual alignment can benefit future everyday tasks.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Authors:
Seungone Kim,
Juyoung Suk,
Ji Yong Cho,
Shayne Longpre,
Chaeeun Kim,
Dongkeun Yoon,
Gui** Son,
Ye** Cho,
Sheikh Shafayat,
**heon Baek,
Sue Hyun Park,
Hyeonbin Hwang,
**kyung Jo,
Hyowon Cho,
Haebin Shin,
Seongyun Lee,
Hanseok Oh,
Noah Lee,
Namgyu Ho,
Se June Joo,
Miyoung Ko,
Yoonjoo Lee,
Hyungjoo Chae,
Jamin Shin,
Joel Jang
, et al. (7 additional authors not shown)
Abstract:
As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec…
▽ More
As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on specific capabilities such as instruction following, leading to coverage bias. To overcome these limitations, we introduce the BiGGen Bench, a principled generation benchmark designed to thoroughly evaluate nine distinct capabilities of LMs across 77 diverse tasks. A key feature of the BiGGen Bench is its use of instance-specific evaluation criteria, closely mirroring the nuanced discernment of human evaluation. We apply this benchmark to assess 103 frontier LMs using five evaluator LMs. Our code, data, and evaluation results are all publicly available at https://github.com/prometheus-eval/prometheus-eval/tree/main/BiGGen-Bench.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
The Geometry of Categorical and Hierarchical Concepts in Large Language Models
Authors:
Kiho Park,
Yo Joong Choe,
Yibo Jiang,
Victor Veitch
Abstract:
Understanding how semantic meaning is encoded in the representation spaces of large language models is a fundamental problem in interpretability. In this paper, we study the two foundational questions in this area. First, how are categorical concepts, such as {'mammal', 'bird', 'reptile', 'fish'}, represented? Second, how are hierarchical relations between concepts encoded? For example, how is the…
▽ More
Understanding how semantic meaning is encoded in the representation spaces of large language models is a fundamental problem in interpretability. In this paper, we study the two foundational questions in this area. First, how are categorical concepts, such as {'mammal', 'bird', 'reptile', 'fish'}, represented? Second, how are hierarchical relations between concepts encoded? For example, how is the fact that 'dog' is a kind of 'mammal' encoded? We show how to extend the linear representation hypothesis to answer these questions. We find a remarkably simple structure: simple categorical concepts are represented as simplices, hierarchically related concepts are orthogonal in a sense we make precise, and (in consequence) complex concepts are represented as polytopes constructed from direct sums of simplices, reflecting the hierarchical structure. We validate these theoretical results on the Gemma large language model, estimating representations for 957 hierarchically related concepts using data from WordNet.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Early-time small-scale structures in hot-exoplanet atmosphere simulations
Authors:
J. W. Skinner,
J. Y-K. Cho
Abstract:
We report on the critical influence of small-scale flow structures (e.g., fronts, vortices, and waves) that immediately arise in hot-exoplanet atmosphere simulations initialized with a resting state. A hot, 1:1 spin-orbit synchronized Jupiter is used here as a clear example; but, the phenomenon is generic and important for any type of hot synchronized planet--gaseous, oceanic, or telluric. When th…
▽ More
We report on the critical influence of small-scale flow structures (e.g., fronts, vortices, and waves) that immediately arise in hot-exoplanet atmosphere simulations initialized with a resting state. A hot, 1:1 spin-orbit synchronized Jupiter is used here as a clear example; but, the phenomenon is generic and important for any type of hot synchronized planet--gaseous, oceanic, or telluric. When the early-time structures are not captured in simulations (due to, e.g., poor resolution and/or too much dissipation), the flow behavior is markedly different at later times--in an observationally significant way; for example, the flow at large-scale is smoother and much less dynamic. This results in the temperature field, and its corresponding thermal flux, to be incorrectly predicted in numerical simulations, even when the quantities are spatially averaged.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models
Authors:
Taehyun Kim,
Kwanseok Choi,
Youngmock Cho,
Jaehoon Cho,
Hyuk-Jae Lee,
Jaewoong Sim
Abstract:
Mixture-of-Experts (MoE) large language models (LLM) have memory requirements that often exceed the GPU memory capacity, requiring costly parameter movement from secondary memories to the GPU for expert computation. In this work, we present Mixture of Near-Data Experts (MoNDE), a near-data computing solution that efficiently enables MoE LLM inference. MoNDE reduces the volume of MoE parameter move…
▽ More
Mixture-of-Experts (MoE) large language models (LLM) have memory requirements that often exceed the GPU memory capacity, requiring costly parameter movement from secondary memories to the GPU for expert computation. In this work, we present Mixture of Near-Data Experts (MoNDE), a near-data computing solution that efficiently enables MoE LLM inference. MoNDE reduces the volume of MoE parameter movement by transferring only the $\textit{hot}$ experts to the GPU, while computing the remaining $\textit{cold}$ experts inside the host memory device. By replacing the transfers of massive expert parameters with the ones of small activations, MoNDE enables far more communication-efficient MoE inference, thereby resulting in substantial speedups over the existing parameter offloading frameworks for both encoder and decoder operations.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Gemini & Physical World: Large Language Models Can Estimate the Intensity of Earthquake Shaking from Multi-Modal Social Media Posts
Authors:
S. Mostafa Mousavi,
Marc Stogaitis,
Ta**der Gadh,
Richard M Allen,
Alexei Barski,
Robert Bosch,
Patrick Robertson,
Nivetha Thiruverahan,
Youngmin Cho,
Aman Raj
Abstract:
This paper presents a novel approach to extract scientifically valuable information about Earth's physical phenomena from unconventional sources, such as multi-modal social media posts. Employing a state-of-the-art large language model (LLM), Gemini 1.5 Pro (Reid et al. 2024), we estimate earthquake ground shaking intensity from these unstructured posts. The model's output, in the form of Modified…
▽ More
This paper presents a novel approach to extract scientifically valuable information about Earth's physical phenomena from unconventional sources, such as multi-modal social media posts. Employing a state-of-the-art large language model (LLM), Gemini 1.5 Pro (Reid et al. 2024), we estimate earthquake ground shaking intensity from these unstructured posts. The model's output, in the form of Modified Mercalli Intensity (MMI) values, aligns well with independent observational data. Furthermore, our results suggest that LLMs, trained on vast internet data, may have developed a unique understanding of physical phenomena. Specifically, Google's Gemini models demonstrate a simplified understanding of the general relationship between earthquake magnitude, distance, and MMI intensity, accurately describing observational data even though it's not identical to established models. These findings raise intriguing questions about the extent to which Gemini's training has led to a broader understanding of the physical world and its phenomena. The ability of Generative AI models like Gemini to generate results consistent with established scientific knowledge highlights their potential to augment our understanding of complex physical phenomena like earthquakes. The flexible and effective approach proposed in this study holds immense potential for enriching our understanding of the impact of physical phenomena and improving resilience during natural disasters. This research is a significant step toward harnessing the power of social media and AI for natural disaster mitigation, opening new avenues for understanding the emerging capabilities of Generative AI and LLMs for scientific applications.
△ Less
Submitted 14 June, 2024; v1 submitted 28 May, 2024;
originally announced May 2024.
-
Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation
Authors:
JuneHyoung Kwon,
Eunju Lee,
Yunsung Cho,
YoungBin Kim
Abstract:
Weakly supervised semantic segmentation (WSSS) employing weak forms of labels has been actively studied to alleviate the annotation cost of acquiring pixel-level labels. However, classifiers trained on biased datasets tend to exploit shortcut features and make predictions based on spurious correlations between certain backgrounds and objects, leading to a poor generalization performance. In this p…
▽ More
Weakly supervised semantic segmentation (WSSS) employing weak forms of labels has been actively studied to alleviate the annotation cost of acquiring pixel-level labels. However, classifiers trained on biased datasets tend to exploit shortcut features and make predictions based on spurious correlations between certain backgrounds and objects, leading to a poor generalization performance. In this paper, we propose shortcut mitigating augmentation (SMA) for WSSS, which generates synthetic representations of object-background combinations not seen in the training data to reduce the use of shortcut features. Our approach disentangles the object-relevant and background features. We then shuffle and combine the disentangled representations to create synthetic features of diverse object-background combinations. SMA-trained classifier depends less on contexts and focuses more on the target object when making predictions. In addition, we analyzed the behavior of the classifier on shortcut usage after applying our augmentation using an attribution method-based metric. The proposed method achieved the improved performance of semantic segmentation result on PASCAL VOC 2012 and MS COCO 2014 datasets.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Improving Health Professionals' Onboarding with AI and XAI for Trustworthy Human-AI Collaborative Decision Making
Authors:
Min Hun Lee,
Silvana Xin Yi Choo,
Shamala D/O Thilarajah
Abstract:
With advanced AI/ML, there has been growing research on explainable AI (XAI) and studies on how humans interact with AI and XAI for effective human-AI collaborative decision-making. However, we still have a lack of understanding of how AI systems and XAI should be first presented to users without technical backgrounds. In this paper, we present the findings of semi-structured interviews with healt…
▽ More
With advanced AI/ML, there has been growing research on explainable AI (XAI) and studies on how humans interact with AI and XAI for effective human-AI collaborative decision-making. However, we still have a lack of understanding of how AI systems and XAI should be first presented to users without technical backgrounds. In this paper, we present the findings of semi-structured interviews with health professionals (n=12) and students (n=4) majoring in medicine and health to study how to improve onboarding with AI and XAI. For the interviews, we built upon human-AI interaction guidelines to create onboarding materials of an AI system for stroke rehabilitation assessment and AI explanations and introduce them to the participants. Our findings reveal that beyond presenting traditional performance metrics on AI, participants desired benchmark information, the practical benefits of AI, and interaction trials to better contextualize AI performance, and refine the objectives and performance of AI. Based on these findings, we highlight directions for improving onboarding with AI and XAI and human-AI collaborative decision-making.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments
Authors:
Jooyong Park,
Jungwoo Lee,
Euncheol Choi,
Younggun Cho
Abstract:
In urban environments for delivery robots, particularly in areas such as campuses and towns, many custom features defy standard road semantic categorizations. Addressing this challenge, our paper introduces a method leveraging Salient Object Detection (SOD) to extract these unique features, employing them as pivotal factors for enhanced robot loop closure and localization. Traditional geometric fe…
▽ More
In urban environments for delivery robots, particularly in areas such as campuses and towns, many custom features defy standard road semantic categorizations. Addressing this challenge, our paper introduces a method leveraging Salient Object Detection (SOD) to extract these unique features, employing them as pivotal factors for enhanced robot loop closure and localization. Traditional geometric feature-based localization is hampered by fluctuating illumination and appearance changes. Our preference for SOD over semantic segmentation sidesteps the intricacies of classifying a myriad of non-standardized urban features. To achieve consistent ground features, the Motion Compensate IPM (MC-IPM) technique is implemented, capitalizing on motion for distortion compensation and subsequently selecting the most pertinent salient ground features through moment computations. For thorough evaluation, we validated the saliency detection and localization performances to the real urban scenarios. Project page: https://sites.google.com/view/salient-ground-feature/home.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Discursive objection strategies in online comments: Develo** a classification schema and validating its training
Authors:
Ashley L. Shea,
Aspen K. B. Omapang,
Ji Yong Cho,
Miryam Y. Ginsparg,
Natalie Bazarova,
Winice Hui,
René F. Kizilcec,
Chau Tong,
Drew Margolin
Abstract:
Most Americans agree that misinformation, hate speech and harassment are harmful and inadequately curbed on social media through current moderation practices. In this paper, we aim to understand the discursive strategies employed by people in response to harmful speech in news comments. We conducted a content analysis of more than 6500 comment replies to trending news videos on YouTube and Twitter…
▽ More
Most Americans agree that misinformation, hate speech and harassment are harmful and inadequately curbed on social media through current moderation practices. In this paper, we aim to understand the discursive strategies employed by people in response to harmful speech in news comments. We conducted a content analysis of more than 6500 comment replies to trending news videos on YouTube and Twitter and identified seven distinct discursive objection strategies (Study 1). We examined the frequency of each strategy's occurrence from the 6500 comment replies, as well as from a second sample of 2004 replies (Study 2). Together, these studies show that people deploy a diversity of discursive strategies when objecting to speech, and reputational attacks are the most common. The resulting classification scheme accounts for different theoretical approaches for expressing objections and offers a comprehensive perspective on grassroots efforts aimed at stop** offensive or problematic speech on campus.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
A Personalizable Controller for the Walking Assistive omNi-Directional Exo-Robot (WANDER)
Authors:
A. Fortuna,
M. Lorenzini,
M. Leonori,
JM. Gandarias,
P. Balatti,
Y. Cho,
E. De Momi,
A. Ajoudani
Abstract:
Preserving and encouraging mobility in the elderly and adults with chronic conditions is of paramount importance. However, existing walking aids are either inadequate to provide sufficient support to users' stability or too bulky and poorly maneuverable to be used outside hospital environments. In addition, they all lack adaptability to individual requirements. To address these challenges, this pa…
▽ More
Preserving and encouraging mobility in the elderly and adults with chronic conditions is of paramount importance. However, existing walking aids are either inadequate to provide sufficient support to users' stability or too bulky and poorly maneuverable to be used outside hospital environments. In addition, they all lack adaptability to individual requirements. To address these challenges, this paper introduces WANDER, a novel Walking Assistive omNi-Directional Exo-Robot. It consists of an omnidirectional platform and a robust aluminum structure mounted on top of it, which provides partial body weight support. A comfortable and minimally restrictive coupling interface embedded with a force/torque sensor allows to detect users' intentions, which are translated into command velocities by means of a variable admittance controller. An optimization technique based on users' preferences, i.e., Preference-Based Optimization (PBO) guides the choice of the admittance parameters (i.e., virtual mass and dam**) to better fit subject-specific needs and characteristics. Experiments with twelve healthy subjects exhibited a significant decrease in energy consumption and jerk when using WANDER with PBO parameters as well as improved user performance and comfort. The great interpersonal variability in the optimized parameters highlights the importance of personalized control settings when walking with an assistive device, aiming to enhance users' comfort and mobility while ensuring reliable physical support.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Unicorn: U-Net for Sea Ice Forecasting with Convolutional Neural Ordinary Differential Equations
Authors:
Jaesung Park,
Sungchul Hong,
Yoonseo Cho,
Jong-June Jeon
Abstract:
Sea ice at the North Pole is vital to global climate dynamics. However, accurately forecasting sea ice poses a significant challenge due to the intricate interaction among multiple variables. Leveraging the capability to integrate multiple inputs and powerful performances seamlessly, many studies have turned to neural networks for sea ice forecasting. This paper introduces a novel deep architectur…
▽ More
Sea ice at the North Pole is vital to global climate dynamics. However, accurately forecasting sea ice poses a significant challenge due to the intricate interaction among multiple variables. Leveraging the capability to integrate multiple inputs and powerful performances seamlessly, many studies have turned to neural networks for sea ice forecasting. This paper introduces a novel deep architecture named Unicorn, designed to forecast weekly sea ice. Our model integrates multiple time series images within its architecture to enhance its forecasting performance. Moreover, we incorporate a bottleneck layer within the U-Net architecture, serving as neural ordinary differential equations with convolution operations, to capture the spatiotemporal dynamics of latent variables. Through real data analysis with datasets spanning from 1998 to 2021, our proposed model demonstrates significant improvements over state-of-the-art models in the sea ice concentration forecasting task. It achieves an average MAE improvement of 12% compared to benchmark models. Additionally, our method outperforms existing approaches in sea ice extent forecasting, achieving a classification performance improvement of approximately 18%. These experimental results show the superiority of our proposed model.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Mesh-based Photorealistic and Real-time 3D Map** for Robust Visual Perception of Autonomous Underwater Vehicle
Authors:
Jungwoo Lee,
Younggun Cho
Abstract:
This paper proposes a photorealistic real-time dense 3D map** system that utilizes a learning-based image enhancement method and mesh-based map representation. Due to the characteristics of the underwater environment, where problems such as hazing and low contrast occur, it is hard to apply conventional simultaneous localization and map** (SLAM) methods. Furthermore, for sensitive tasks like i…
▽ More
This paper proposes a photorealistic real-time dense 3D map** system that utilizes a learning-based image enhancement method and mesh-based map representation. Due to the characteristics of the underwater environment, where problems such as hazing and low contrast occur, it is hard to apply conventional simultaneous localization and map** (SLAM) methods. Furthermore, for sensitive tasks like inspecting cracks, photorealistic map** is very important. However, the behavior of Autonomous Underwater Vehicle (AUV) is computationally constrained. In this paper, we utilize a neural network-based image enhancement method to improve pose estimation and map** quality and apply a sliding window-based mesh expansion method to enable lightweight, fast, and photorealistic map**. To validate our results, we utilize real-world and indoor synthetic datasets. We performed qualitative validation with the real-world dataset and quantitative validation by modeling images from the indoor synthetic dataset as underwater scenes.
△ Less
Submitted 28 April, 2024;
originally announced April 2024.
-
Spontaneous emission decay and excitation in photonic temporal crystals
Authors:
Jagang Park,
Kyungmin Lee,
Ruo-Yang Zhang,
Hee-Chul Park,
Jung-Wan Ryu,
Gil Young Cho,
Min Yeul Lee,
Zhaoqing Zhang,
Namkyoo Park,
Wonju Jeon,
Jonghwa Shin,
C. T. Chan,
Bumki Min
Abstract:
Over the last few decades, the prominent strategies for controlling spontaneous emission has been the use of resonant or space-periodic photonic structures. This approach, initially articulated by Purcell and later expanded upon by Yablonovitch in the context of photonic crystals, leverages the spatial surroundings to modify the spontaneous emission decay rate of atoms or quantum emitters. However…
▽ More
Over the last few decades, the prominent strategies for controlling spontaneous emission has been the use of resonant or space-periodic photonic structures. This approach, initially articulated by Purcell and later expanded upon by Yablonovitch in the context of photonic crystals, leverages the spatial surroundings to modify the spontaneous emission decay rate of atoms or quantum emitters. However, the rise of time-varying photonics has compelled a reevaluation of the spontaneous emission process within dynamically changing environments, especially concerning photonic temporal crystals where optical properties undergo time-periodic modulation. Here, we apply classical light-matter interaction theory along with Floquet analysis to reveal a substantial enhancement in the spontaneous emission decay rate at the momentum gap frequency in photonic temporal crystals. This enhancement is attributed to time-periodicity-induced loss and gain mechanisms, as well as the non-orthogonality of Floquet eigenstates that are inherent to photonic temporal crystals. Intriguingly, our findings also suggest that photonic temporal crystals enable the spontaneous excitation of an atom from its ground state to an excited state, accompanied by the concurrent emission of a photon.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
Automatic Quantification of Serial PET/CT Images for Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-Aware Segmentation Network
Authors:
Xin Tie,
Muheon Shin,
Changhee Lee,
Scott B. Perlman,
Zachary Huemann,
Amy J. Weisman,
Sharon M. Castellino,
Kara M. Kelly,
Kathleen M. McCarten,
Adina L. Alazraki,
Junjie Hu,
Steve Y. Cho,
Tyler J. Bradshaw
Abstract:
$\textbf{Purpose}$: Automatic quantification of longitudinal changes in PET scans for lymphoma patients has proven challenging, as residual disease in interim-therapy scans is often subtle and difficult to detect. Our goal was to develop a longitudinally-aware segmentation network (LAS-Net) that can quantify serial PET/CT images for pediatric Hodgkin lymphoma patients. $\textbf{Materials and Metho…
▽ More
$\textbf{Purpose}$: Automatic quantification of longitudinal changes in PET scans for lymphoma patients has proven challenging, as residual disease in interim-therapy scans is often subtle and difficult to detect. Our goal was to develop a longitudinally-aware segmentation network (LAS-Net) that can quantify serial PET/CT images for pediatric Hodgkin lymphoma patients. $\textbf{Materials and Methods}$: This retrospective study included baseline (PET1) and interim (PET2) PET/CT images from 297 patients enrolled in two Children's Oncology Group clinical trials (AHOD1331 and AHOD0831). LAS-Net incorporates longitudinal cross-attention, allowing relevant features from PET1 to inform the analysis of PET2. Model performance was evaluated using Dice coefficients for PET1 and detection F1 scores for PET2. Additionally, we extracted and compared quantitative PET metrics, including metabolic tumor volume (MTV) and total lesion glycolysis (TLG) in PET1, as well as qPET and $Δ$SUVmax in PET2, against physician measurements. We quantified their agreement using Spearman's $ρ$ correlations and employed bootstrap resampling for statistical analysis. $\textbf{Results}$: LAS-Net detected residual lymphoma in PET2 with an F1 score of 0.606 (precision/recall: 0.615/0.600), outperforming all comparator methods (P<0.01). For baseline segmentation, LAS-Net achieved a mean Dice score of 0.772. In PET quantification, LAS-Net's measurements of qPET, $Δ$SUVmax, MTV and TLG were strongly correlated with physician measurements, with Spearman's $ρ$ of 0.78, 0.80, 0.93 and 0.96, respectively. The performance remained high, with a slight decrease, in an external testing cohort. $\textbf{Conclusion}$: LAS-Net achieved high performance in quantifying PET metrics across serial scans, highlighting the value of longitudinal awareness in evaluating multi-time-point imaging datasets.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Coarse-grained quantum state tomography with optimal POVM construction
Authors:
Donghun Jung,
Young-Wook Cho,
Yosep Kim,
Junghyun Lee
Abstract:
Constructing an integrated large-scale qubit system of realistic size requires addressing the challenge of physical crowding among qubits. This constraint poses an issue of coarse-grained (CG) measurement, wherein information from the multi-qubit system is collectively gathered. In this work, we introduce a novel approach to reconstruct the target density matrix from a comprehensive set of Positiv…
▽ More
Constructing an integrated large-scale qubit system of realistic size requires addressing the challenge of physical crowding among qubits. This constraint poses an issue of coarse-grained (CG) measurement, wherein information from the multi-qubit system is collectively gathered. In this work, we introduce a novel approach to reconstruct the target density matrix from a comprehensive set of Positive Operator-Valued Measures (POVM) using a Parameterized Quantum Circuit (PQC) under the constraint of CG measurement. We improve the robustness and stability of CG quantum state tomography (QST) by optimizing the POVM set to achieve a generalized symmetric informationally complete (GSIC) POVM through maximization of the von Neumann entropy. This optimized construction of CG-POVMs is scalable to an N-qubit system. We further discuss a more efficient construction of N-qubit CG-QST without exponential increases in two-qubit gates or circuit depth per measurement. Our scheme offers a viable pathway towards a detector-efficient large-scale solid-state embedded qubit platform by reconstructing crucial quantum information from collective measurements.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Indexing Analytics to Instances: How Integrating a Dashboard can Support Design Education
Authors:
Ajit Jain,
Andruid Kerne,
Nic Lupfer,
Gabriel Britain,
Aaron Perrine,
Yoonsuck Choe,
John Keyser,
Ruihong Huang,
**sil Seo,
Annie Sungkajun,
Robert Lightfoot,
Timothy McGuire
Abstract:
We investigate how to use AI-based analytics to support design education. The analytics at hand measure multiscale design, that is, students' use of space and scale to visually and conceptually organize their design work. With the goal of making the analytics intelligible to instructors, we developed a research artifact integrating a design analytics dashboard with design instances, and the design…
▽ More
We investigate how to use AI-based analytics to support design education. The analytics at hand measure multiscale design, that is, students' use of space and scale to visually and conceptually organize their design work. With the goal of making the analytics intelligible to instructors, we developed a research artifact integrating a design analytics dashboard with design instances, and the design environment that students use to create them. We theorize about how Suchman's notion of mutual intelligibility requires contextualized investigation of AI in order to develop findings about how analytics work for people. We studied the research artifact in 5 situated course contexts, in 3 departments. A total of 236 students used the multiscale design environment. The 9 instructors who taught those students experienced the analytics via the new research artifact.
We derive findings from a qualitative analysis of interviews with instructors regarding their experiences. Instructors reflected on how the analytics and their presentation in the dashboard have the potential to affect design education. We develop research implications addressing: (1) how indexing design analytics in the dashboard to actual design work instances helps design instructors reflect on what they mean and, more broadly, is a technique for how AI-based design analytics can support instructors' assessment and feedback experiences in situated course contexts; and (2) how multiscale design analytics, in particular, have the potential to support design education. By indexing, we mean linking which provides context, here connecting the numbers of the analytics with visually annotated design work instances.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
On Finite Presentability of Subsemigroups of the Monogenic Free Inverse Semigroup
Authors:
Yung Won Cho,
Nik Ruskuc
Abstract:
The monogenic free inverse semigroup $FI_1$ is not finitely presented as a semigroup due to the classic result by Schein (1975). We extend this result and prove that a finitely generated subsemigroup of $FI_1$ is finitely presented if and only if it contains only finitely many idempotents. As a consequence, we derive that an inverse subsemigroup of $FI_1$ is finitely presented as a semigroup if an…
▽ More
The monogenic free inverse semigroup $FI_1$ is not finitely presented as a semigroup due to the classic result by Schein (1975). We extend this result and prove that a finitely generated subsemigroup of $FI_1$ is finitely presented if and only if it contains only finitely many idempotents. As a consequence, we derive that an inverse subsemigroup of $FI_1$ is finitely presented as a semigroup if and only if it is a finite semilattice.
△ Less
Submitted 12 June, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
Modeling Kinematic Uncertainty of Tendon-Driven Continuum Robots via Mixture Density Networks
Authors:
Jordan Thompson,
Brian Y. Cho,
Daniel S. Brown,
Alan Kuntz
Abstract:
Tendon-driven continuum robot kinematic models are frequently computationally expensive, inaccurate due to unmodeled effects, or both. In particular, unmodeled effects produce uncertainties that arise during the robot's operation that lead to variability in the resulting geometry. We propose a novel solution to these issues through the development of a Gaussian mixture kinematic model. We train a…
▽ More
Tendon-driven continuum robot kinematic models are frequently computationally expensive, inaccurate due to unmodeled effects, or both. In particular, unmodeled effects produce uncertainties that arise during the robot's operation that lead to variability in the resulting geometry. We propose a novel solution to these issues through the development of a Gaussian mixture kinematic model. We train a mixture density network to output a Gaussian mixture model representation of the robot geometry given the current tendon displacements. This model computes a probability distribution that is more representative of the true distribution of geometries at a given configuration than a model that outputs a single geometry, while also reducing the computation time. We demonstrate one use of this model through a trajectory optimization method that explicitly reasons about the workspace uncertainty to minimize the probability of collision.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Accounting for Hysteresis in the Forward Kinematics of Nonlinearly-Routed Tendon-Driven Continuum Robots via a Learned Deep Decoder Network
Authors:
Brian Y. Cho,
Daniel S. Esser,
Jordan Thompson,
Bao Thach,
Robert J. Webster III,
Alan Kuntz
Abstract:
Tendon-driven continuum robots have been gaining popularity in medical applications due to their ability to curve around complex anatomical structures, potentially reducing the invasiveness of surgery. However, accurate modeling is required to plan and control the movements of these flexible robots. Physics-based models have limitations due to unmodeled effects, leading to mismatches between model…
▽ More
Tendon-driven continuum robots have been gaining popularity in medical applications due to their ability to curve around complex anatomical structures, potentially reducing the invasiveness of surgery. However, accurate modeling is required to plan and control the movements of these flexible robots. Physics-based models have limitations due to unmodeled effects, leading to mismatches between model prediction and actual robot shape. Recently proposed learning-based methods have been shown to overcome some of these limitations but do not account for hysteresis, a significant source of error for these robots. To overcome these challenges, we propose a novel deep decoder neural network that predicts the complete shape of tendon-driven robots using point clouds as the shape representation, conditioned on prior configurations to account for hysteresis. We evaluate our method on a physical tendon-driven robot and show that our network model accurately predicts the robot's shape, significantly outperforming a state-of-the-art physics-based model and a learning-based model that does not account for hysteresis.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seong** Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…
▽ More
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in develo** their sovereign LLMs.
△ Less
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Statistical Analysis by Semiparametric Additive Regression and LSTM-FCN Based Hierarchical Classification for Computer Vision Quantification of Parkinsonian Bradykinesia
Authors:
Youngseo Cho,
In Hee Kwak,
Dohyeon Kim,
**hee Na,
Hanjoo Sung,
Jeongjae Lee,
Young Eun Kim,
Hyeo-il Ma
Abstract:
Bradykinesia, characterized by involuntary slowing or decrement of movement, is a fundamental symptom of Parkinson's Disease (PD) and is vital for its clinical diagnosis. Despite various methodologies explored to quantify bradykinesia, computer vision-based approaches have shown promising results. However, these methods often fall short in adequately addressing key bradykinesia characteristics in…
▽ More
Bradykinesia, characterized by involuntary slowing or decrement of movement, is a fundamental symptom of Parkinson's Disease (PD) and is vital for its clinical diagnosis. Despite various methodologies explored to quantify bradykinesia, computer vision-based approaches have shown promising results. However, these methods often fall short in adequately addressing key bradykinesia characteristics in repetitive limb movements: "occasional arrest" and "decrement in amplitude."
This research advances vision-based quantification of bradykinesia by introducing nuanced numerical analysis to capture decrement in amplitudes and employing a simple deep learning technique, LSTM-FCN, for precise classification of occasional arrests. Our approach structures the classification process hierarchically, tailoring it to the unique dynamics of bradykinesia in PD.
Statistical analysis of the extracted features, including those representing arrest and fatigue, has demonstrated their statistical significance in most cases. This finding underscores the importance of considering "occasional arrest" and "decrement in amplitude" in bradykinesia quantification of limb movement. Our enhanced diagnostic tool has been rigorously tested on an extensive dataset comprising 1396 motion videos from 310 PD patients, achieving an accuracy of 80.3%. The results confirm the robustness and reliability of our method.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Non-Abelian Fractional Quantum Anomalous Hall States and First Landau Level Physics in Second Moiré Band of Twisted Bilayer MoTe2
Authors:
Cheong-Eung Ahn,
Wonjun Lee,
Kunihiro Yananose,
Youngwook Kim,
Gil Young Cho
Abstract:
Utilizing the realistic continuum description of twisted bilayer MoTe2 and many-body exact diagonalization calculation, we establish that the second moiré band of twisted bilayer MoTe2, at a small twist angle of approximately 2°, serves as an optimal platform for achieving the long-sought non-Abelian fractional quantum anomalous Hall states without the need for external magnetic fields. Across a w…
▽ More
Utilizing the realistic continuum description of twisted bilayer MoTe2 and many-body exact diagonalization calculation, we establish that the second moiré band of twisted bilayer MoTe2, at a small twist angle of approximately 2°, serves as an optimal platform for achieving the long-sought non-Abelian fractional quantum anomalous Hall states without the need for external magnetic fields. Across a wide parameter range, our exact diagonalization calculations reveal that the half-filled second moiré band demonstrates the ground state degeneracy and spectral flows, which are consistent with the pfaffian state in the first Landau level. We further elucidate that the emergence of the non-Abelian state is deeply connected to the remarkable similarity between the second moiré band and the first Landau level. Essentially, the band not only exhibits characteristics akin to the first Landau level, $\frac{1}{2π}\int_\mathrm{BZ}\mathrm{d}^2\mathbf{k}\:\mathrm{tr}\:η(\mathbf{k}) \approx 3$ where $η_{ab}(\mathbf{k})$ is the Fubini-Study metric of the band, but also that its projected Coulomb interaction closely mirrors the Haldane pseudopotentials of the first Landau level. Motivated from this observation, we introduce a novel metric of "first Landau level"-ness of a band, which quantitatively measures the alignment of the projected Coulomb interaction with the Haldane pseudopotentials in Landau levels. This metric is then compared with the global phase diagram of the half-filled second moiré band, revealing its utility in predicting the parameter region of the non-Abelian state. In addition, we uncover that the first and third moiré bands closely resemble the lowest and second Landau levels, revealing a remarkable sequential equivalence between the moiré bands and Landau levels. We finally discuss the potential implications on experiments.
△ Less
Submitted 5 July, 2024; v1 submitted 28 March, 2024;
originally announced March 2024.
-
Expanding Density-Correlation Machine Learning Representations for Anisotropic Coarse-Grained Particles
Authors:
Arthur Y. Lin,
Kevin K. Huguenin-Dumittan,
Yong-Cheol Cho,
Jigyasa Nigam,
Rose K. Cersonsky
Abstract:
Physics-based, atom-centered machine learning (ML) representations have been instrumental to the effective integration of ML within the atomistic simulation community. Many of these representations build off the idea of atoms as having spherical, or isotropic, interactions. In many communities, there is often a need to represent groups of atoms, either to increase the computational efficiency of s…
▽ More
Physics-based, atom-centered machine learning (ML) representations have been instrumental to the effective integration of ML within the atomistic simulation community. Many of these representations build off the idea of atoms as having spherical, or isotropic, interactions. In many communities, there is often a need to represent groups of atoms, either to increase the computational efficiency of simulation via coarse-graining or to understand molecular influences on system behavior. In such cases, atom-centered representations will have limited utility, as groups of atoms may not be well-approximated as spheres. In this work, we extend the popular Smooth Overlap of Atomic Positions (SOAP) ML representation for systems consisting of non-spherical anisotropic particles or clusters of atoms. We show the power of this anisotropic extension of SOAP, which we deem \AniSOAP, in accurately characterizing liquid crystal systems and predicting the energetics of Gay-Berne ellipsoids and coarse-grained benzene crystals. With our study of these prototypical anisotropic systems, we derive fundamental insights into how molecular shape influences mesoscale behavior and explain how to reincorporate important atom-atom interactions typically not captured by coarse-grained models. Moving forward, we propose \AniSOAP as a flexible, unified framework for coarse-graining in complex, multiscale simulation.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
Achieving Optical Refractive Index of 10-Plus by Colloidal Self-Assembly
Authors:
NaYeoun Kim,
Ji-Hyeok Huh,
YongDeok Cho,
Sung Hun Park,
Hyeon Ho Kim,
Kyung Hun Rho,
Jaewon Lee,
Seungwoo Lee
Abstract:
This study demonstrates the developments of self-assembled optical metasurfaces to overcome inherent limitations in polarization density (P) within natural materials, which hinder achieving high refractive indices (n) at optical frequencies. The Maxwellian macroscopic description establishes a link between P and n, revealing a static limit in natural materials, restricting n to approximately 4.0 a…
▽ More
This study demonstrates the developments of self-assembled optical metasurfaces to overcome inherent limitations in polarization density (P) within natural materials, which hinder achieving high refractive indices (n) at optical frequencies. The Maxwellian macroscopic description establishes a link between P and n, revealing a static limit in natural materials, restricting n to approximately 4.0 at optical frequencies. Optical metasurfaces, utilizing metallic colloids on a deep-subwavelength scale, offer a solution by unnaturally enhancing n through electric dipolar (ED) resonances. Self-assembly enables the creation of nanometer-scale metallic gaps between metallic nanoparticles (NPs), paving the way for achieving exceptionally high n at optical frequencies. This study focuses on assembling polyhedral gold (Au) NPs into a closely packed monolayer by rationally designing the polymeric ligand to balance attractive and repulsive forces, in that polymeric brush-mediated self-assembly of the close-packed Au NP monolayer is robustly achieved over a large-area. The resulting monolayer of Au nanospheres (NSs), nanooctahedras (NOs), and nanocubes (NCs) exhibits high macroscopic integrity and crystallinity, sufficiently enough for pushing n to record-high regimes. The study underlies the significance of capacitive coupling in achieving an unnaturally high n and explores fine-tuning Au NC size to optimize this coupling. The achieved n of 10.12 at optical frequencies stands as a benchmark, highlighting the potential of polyhedral Au NPs in advancing optical metasurfaces.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
ReFeree: Radar-based efficient global descriptor using a Feature and Free space for Place Recognition
Authors:
Byunghee Choi,
Hogyun Kim,
Younggun Cho
Abstract:
Radar is highlighted for robust sensing capabilities in adverse weather conditions (e.g. dense fog, heavy rain, or snowfall). In addition, Radar can cover wide areas and penetrate small particles. Despite these advantages, Radar-based place recognition remains in the early stages compared to other sensors due to its unique characteristics such as low resolution, and significant noise. In this pape…
▽ More
Radar is highlighted for robust sensing capabilities in adverse weather conditions (e.g. dense fog, heavy rain, or snowfall). In addition, Radar can cover wide areas and penetrate small particles. Despite these advantages, Radar-based place recognition remains in the early stages compared to other sensors due to its unique characteristics such as low resolution, and significant noise. In this paper, we propose a Radarbased place recognition utilizing a descriptor called ReFeree using a feature and free space. Unlike traditional methods, we overwhelmingly summarize the Radar image. Despite being lightweight, it contains semi-metric information and is also outstanding from the perspective of place recognition performance. For concrete validation, we test a single session from the MulRan dataset and a multi-session from the Oxford Offroad Radar, Oxford Radar RobotCar, and the Boreas dataset.
△ Less
Submitted 6 May, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects
Authors:
Yoonyoung Cho,
Junhyek Han,
Yoontae Cho,
Beomjoon Kim
Abstract:
Nonprehensile manipulation is essential for manipulating objects that are too thin, large, or otherwise ungraspable in the wild. To sidestep the difficulty of contact modeling in conventional modeling-based approaches, reinforcement learning (RL) has recently emerged as a promising alternative. However, previous RL approaches either lack the ability to generalize over diverse object shapes, or use…
▽ More
Nonprehensile manipulation is essential for manipulating objects that are too thin, large, or otherwise ungraspable in the wild. To sidestep the difficulty of contact modeling in conventional modeling-based approaches, reinforcement learning (RL) has recently emerged as a promising alternative. However, previous RL approaches either lack the ability to generalize over diverse object shapes, or use simple action primitives that limit the diversity of robot motions. Furthermore, using RL over diverse object geometry is challenging due to the high cost of training a policy that takes in high-dimensional sensory inputs. We propose a novel contact-based object representation and pretraining pipeline to tackle this. To enable massively parallel training, we leverage a lightweight patch-based transformer architecture for our encoder that processes point clouds, thus scaling our training across thousands of environments. Compared to learning from scratch, or other shape representation baselines, our representation facilitates both time- and data-efficient learning. We validate the efficacy of our overall system by zero-shot transferring the trained policy to novel real-world objects. Code and videos are available at https://sites.google.com/view/contact-non-prehensile.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Electroweak Monopole-Antimonopole Pair Production at LHC
Authors:
Petr Benes,
Filip Blaschke,
Y. M. Cho
Abstract:
One of the urgent issues in high energy physics is the experimental confirmation of the electroweak monopole predicted by the standard model, and currently MoEDAL at LHC is actively searching for the monopole. However, the present LHC cannot produce the monopole if the mass is bigger than 7 TeV, while the monopole mass is expected to be around $M_W/α\simeq 11~\text{TeV}$. In this paper we discuss…
▽ More
One of the urgent issues in high energy physics is the experimental confirmation of the electroweak monopole predicted by the standard model, and currently MoEDAL at LHC is actively searching for the monopole. However, the present LHC cannot produce the monopole if the mass is bigger than 7 TeV, while the monopole mass is expected to be around $M_W/α\simeq 11~\text{TeV}$. In this paper we discuss how LHC could circumbent this energy constraint and produce the monopole even when the mass is bigger than 7 TeV, based on the following ideas. First, in the topological production of the monopole the baby monopole mass at creation could be considerably smaller than the adolescent mass. Second, the binding energy of the monopole-antimonopole pair could effectively reduce the mass of the bound state. We discuss how these ideas can actually be realized at LHC to produce the monopole pairs. In particular, we argue that LHC could produce the baby electroweak monopoles whose mass could be around 5.3 TeV, smaller than the adolescent monopole mass around 11.0 TeV. Moreover, we show that LHC could produce the monopolium bound state with mass around 2.5 TeV, even when the total mass of the monopole-antimonopole pair is around 10.6 TeV. Our analysis could play an important role for MoEDAL experiment.
△ Less
Submitted 5 April, 2024; v1 submitted 15 March, 2024;
originally announced March 2024.
-
Robust Chemiresistive Behavior in Conductive Polymer/MOF Composites
Authors:
Heejung Roh,
Dong-Ha Kim,
Yeongsu Cho,
Young-Moo Jo,
Jesús A. del Alamo,
Heather J. Kulik,
Mircea Dincă,
Aristide Gumyusenge
Abstract:
Metal-organic frameworks (MOFs) are promising materials for gas sensing but are often limited to single-use detection. We demonstrate a hybridization strategy synergistically deploying conductive MOFs (cMOFs) and conductive polymers (cPs) as two complementary mixed ionic-electronic conductors in high-performing stand-alone chemiresistors. Our work presents significant improvement in i) sensor reco…
▽ More
Metal-organic frameworks (MOFs) are promising materials for gas sensing but are often limited to single-use detection. We demonstrate a hybridization strategy synergistically deploying conductive MOFs (cMOFs) and conductive polymers (cPs) as two complementary mixed ionic-electronic conductors in high-performing stand-alone chemiresistors. Our work presents significant improvement in i) sensor recovery kinetics, ii) cycling stability, and iii) dynamic range at room temperature. We demonstrate the effect of hybridization across well-studied cMOFs based on 2,3,6,7,10,11-hexahydroxytriphenylene (HHTP) and 2,3,6,7,10,11-hexaiminotripphenylene (HITP) ligands with varied metal nodes (Co, Cu, Ni). We conduct a comprehensive mechanistic study to relate energy band alignments at the heterojunctions between the MOFs and the polymer with sensing thermodynamics and binding kinetics. Our findings reveal that hole enrichment of the cMOF component upon hybridization leads to selective enhancement in desorption kinetics, enabling significantly improved sensor recovery at room temperature, and thus long-term response retention. This mechanism was further supported by density functional theory calculations on sorbate-analyte interactions. We also find that alloying cPs and cMOFs enables facile thin film co-processing and device integration, potentially unlocking the use of these hybrid conductors in diverse electronic applications.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Separable Physics-informed Neural Networks for Solving the BGK Model of the Boltzmann Equation
Authors:
Jaemin Oh,
Seung Yeon Cho,
Seok-Bae Yun,
Eunbyung Park,
Youngjoon Hong
Abstract:
In this study, we introduce a method based on Separable Physics-Informed Neural Networks (SPINNs) for effectively solving the BGK model of the Boltzmann equation. While the mesh-free nature of PINNs offers significant advantages in handling high-dimensional partial differential equations (PDEs), challenges arise when applying quadrature rules for accurate integral evaluation in the BGK operator, w…
▽ More
In this study, we introduce a method based on Separable Physics-Informed Neural Networks (SPINNs) for effectively solving the BGK model of the Boltzmann equation. While the mesh-free nature of PINNs offers significant advantages in handling high-dimensional partial differential equations (PDEs), challenges arise when applying quadrature rules for accurate integral evaluation in the BGK operator, which can compromise the mesh-free benefit and increase computational costs. To address this, we leverage the canonical polyadic decomposition structure of SPINNs and the linear nature of moment calculation, achieving a substantial reduction in computational expense for quadrature rule application. The multi-scale nature of the particle density function poses difficulties in precisely approximating macroscopic moments using neural networks. To improve SPINN training, we introduce the integration of Gaussian functions into SPINNs, coupled with a relative loss approach. This modification enables SPINNs to decay as rapidly as Maxwellian distributions, thereby enhancing the accuracy of macroscopic moment approximations. The relative loss design further ensures that both large and small-scale features are effectively captured by the SPINNs. The efficacy of our approach is demonstrated through a series of five numerical experiments, including the solution to a challenging 3D Riemann problem. These results highlight the potential of our novel method in efficiently and accurately addressing complex challenges in computational physics.
△ Less
Submitted 10 March, 2024;
originally announced March 2024.
-
DeepVM: Integrating Spot and On-Demand VMs for Cost-Efficient Deep Learning Clusters in the Cloud
Authors:
Yoochan Kim,
Kihyun Kim,
Yonghyeon Cho,
**woo Kim,
Awais Khan,
Ki-Dong Kang,
Baik-Song An,
Myung-Hoon Cha,
Hong-Yeon Kim,
Youngjae Kim
Abstract:
Distributed Deep Learning (DDL), as a paradigm, dictates the use of GPU-based clusters as the optimal infrastructure for training large-scale Deep Neural Networks (DNNs). However, the high cost of such resources makes them inaccessible to many users. Public cloud services, particularly Spot Virtual Machines (VMs), offer a cost-effective alternative, but their unpredictable availability poses a sig…
▽ More
Distributed Deep Learning (DDL), as a paradigm, dictates the use of GPU-based clusters as the optimal infrastructure for training large-scale Deep Neural Networks (DNNs). However, the high cost of such resources makes them inaccessible to many users. Public cloud services, particularly Spot Virtual Machines (VMs), offer a cost-effective alternative, but their unpredictable availability poses a significant challenge to the crucial checkpointing process in DDL. To address this, we introduce DeepVM, a novel solution that recommends cost-effective cluster configurations by intelligently balancing the use of Spot and On-Demand VMs. DeepVM leverages a four-stage process that analyzes instance performance using the FLOPP (FLoating-point Operations Per Price) metric, performs architecture-level analysis with linear programming, and identifies the optimal configuration for the user-specific needs. Extensive simulations and real-world deployments in the AWS environment demonstrate that DeepVM consistently outperforms other policies, reducing training costs and overall makespan. By enabling cost-effective checkpointing with Spot VMs, DeepVM opens up DDL to a wider range of users and facilitates a more efficient training of complex DNNs.
△ Less
Submitted 14 March, 2024; v1 submitted 9 March, 2024;
originally announced March 2024.
-
Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices
Authors:
Younghan Lee,
Sohee Jun,
Yungi Cho,
Woorim Han,
Hyungon Moon,
Yunheung Paek
Abstract:
With growing popularity, deep learning (DL) models are becoming larger-scale, and only the companies with vast training datasets and immense computing power can manage their business serving such large models. Most of those DL models are proprietary to the companies who thus strive to keep their private models safe from the model extraction attack (MEA), whose aim is to steal the model by training…
▽ More
With growing popularity, deep learning (DL) models are becoming larger-scale, and only the companies with vast training datasets and immense computing power can manage their business serving such large models. Most of those DL models are proprietary to the companies who thus strive to keep their private models safe from the model extraction attack (MEA), whose aim is to steal the model by training surrogate models. Nowadays, companies are inclined to offload the models from central servers to edge/endpoint devices. As revealed in the latest studies, adversaries exploit this opportunity as new attack vectors to launch side-channel attack (SCA) on the device running victim model and obtain various pieces of the model information, such as the model architecture (MA) and image dimension (ID). Our work provides a comprehensive understanding of such a relationship for the first time and would benefit future MEA studies in both offensive and defensive sides in that they may learn which pieces of information exposed by SCA are more important than the others. Our analysis additionally reveals that by gras** the victim model information from SCA, MEA can get highly effective and successful even without any prior knowledge of the model. Finally, to evince the practicality of our analysis results, we empirically apply SCA, and subsequently, carry out MEA under realistic threat assumptions. The results show up to 5.8 times better performance than when the adversary has no model information about the victim model.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models
Authors:
Younghan Lee,
Yungi Cho,
Woorim Han,
Ho Bae,
Yunheung Paek
Abstract:
Federated Learning (FL) thrives in training a global model with numerous clients by only sharing the parameters of their local models trained with their private training datasets. Therefore, without revealing the private dataset, the clients can obtain a deep learning (DL) model with high performance. However, recent research proposed poisoning attacks that cause a catastrophic loss in the accurac…
▽ More
Federated Learning (FL) thrives in training a global model with numerous clients by only sharing the parameters of their local models trained with their private training datasets. Therefore, without revealing the private dataset, the clients can obtain a deep learning (DL) model with high performance. However, recent research proposed poisoning attacks that cause a catastrophic loss in the accuracy of the global model when adversaries, posed as benign clients, are present in a group of clients. Therefore, recent studies suggested byzantine-robust FL methods that allow the server to train an accurate global model even with the adversaries present in the system. However, many existing methods require the knowledge of the number of malicious clients or the auxiliary (clean) dataset or the effectiveness reportedly decreased hugely when the private dataset was non-independently and identically distributed (non-IID). In this work, we propose FLGuard, a novel byzantine-robust FL method that detects malicious clients and discards malicious local updates by utilizing the contrastive learning technique, which showed a tremendous improvement as a self-supervised learning method. With contrastive models, we design FLGuard as an ensemble scheme to maximize the defensive capability. We evaluate FLGuard extensively under various poisoning attacks and compare the accuracy of the global model with existing byzantine-robust FL methods. FLGuard outperforms the state-of-the-art defense methods in most cases and shows drastic improvement, especially in non-IID settings. https://github.com/201younghanlee/FLGuard
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
MoEDAL search in the CMS beam pipe for magnetic monopoles produced via the Schwinger effect
Authors:
B. Acharya,
J. Alexandre,
P. Benes,
B. Bergmann,
S. Bertolucci,
A. Bevan,
R. Brancaccio,
H. Branzas,
P. Burian,
M. Campbell,
S. Cecchini,
Y. M. Cho,
M. de Montigny,
A. De Roeck,
J. R. Ellis,
M. Fairbairn,
D. Felea,
M. Frank,
O. Gould,
J. Hays,
A. M. Hirt,
D. L. -J. Ho,
P. Q. Hung,
J. Janecek,
M. Kalliokoski
, et al. (41 additional authors not shown)
Abstract:
We report on a search for magnetic monopoles (MMs) produced in ultraperipheral Pb--Pb collisions during Run-1 of the LHC. The beam pipe surrounding the interaction region of the CMS experiment was exposed to 174.29 $\mathrmμ$b$^{-1}$ of Pb--Pb collisions at 2.76 TeV center-of-mass energy per collision in December 2011. It was scanned by the MoEDAL experiment using a SQUID magnetometer to search fo…
▽ More
We report on a search for magnetic monopoles (MMs) produced in ultraperipheral Pb--Pb collisions during Run-1 of the LHC. The beam pipe surrounding the interaction region of the CMS experiment was exposed to 174.29 $\mathrmμ$b$^{-1}$ of Pb--Pb collisions at 2.76 TeV center-of-mass energy per collision in December 2011. It was scanned by the MoEDAL experiment using a SQUID magnetometer to search for trapped MMs. No MM signal was observed. The two distinctive features of this search are the use of a trap** volume very close to the collision point and ultra-high magnetic fields generated during the heavy-ion run that could produce MMs via the Schwinger effect. These two advantages allowed setting the first reliable, world-leading mass limits on MMs with high magnetic charge. In particular, the established limits are the strongest available in the range between 2 and 45 Dirac units, excluding MMs with masses of up to 80 GeV at 95% confidence level.
△ Less
Submitted 23 February, 2024;
originally announced February 2024.
-
Studying Differential Mental Health Expressions in India
Authors:
Khushi Shelat,
Sunny Rai,
Devansh R Jain,
Kishen Sivabalan,
Young Min Cho,
Maitreyi Redkar,
Samindara Sawant,
Sharath Chandra Guntuku
Abstract:
Psychosocial stressors and the symptomatology of mental disorders vary across cultures. However, current understandings of mental health expressions on social media are predominantly derived from studies in WEIRD (Western, Educated, Industrialized, Rich, and Democratic) contexts. In this paper, we analyze mental health posts on Reddit made by individuals in India, to identify variations in online…
▽ More
Psychosocial stressors and the symptomatology of mental disorders vary across cultures. However, current understandings of mental health expressions on social media are predominantly derived from studies in WEIRD (Western, Educated, Industrialized, Rich, and Democratic) contexts. In this paper, we analyze mental health posts on Reddit made by individuals in India, to identify variations in online depression language specific to the Indian context compared to users from the Rest of the World (ROW). Unlike in Western samples, we observe that mental health discussions in India additionally express sadness, use negation, are present-focused, and are related to work and achievement. Illness is uniquely correlated to India, indicating the association between depression and physical health in Indian patients. Two clinical psychologists validated the findings from social media posts and found 95% of the top 20 topics associated with mental health discussions as prevalent in Indians. Significant linguistic variations in online mental health-related language in India compared to ROW, emphasize the importance of develo** precision-targeted interventions that are culturally appropriate.
△ Less
Submitted 16 June, 2024; v1 submitted 18 February, 2024;
originally announced February 2024.
-
Enumeration of multiplex juggling card sequences using generalized q-derivatives
Authors:
Yumin Cho,
Jaehyun Kim,
Jang Soo Kim,
Nakyung Lee
Abstract:
In 2019, Butler, Choi, Kim, and Seo introduced a new type of juggling card that represents multiplex juggling patterns in a natural bijective way. They conjectured a formula for the generating function for the number of multiplex juggling cards with capacity 2. In this paper we prove their conjecture. More generally, we find an explicit formula for the generating function with any capacity. We als…
▽ More
In 2019, Butler, Choi, Kim, and Seo introduced a new type of juggling card that represents multiplex juggling patterns in a natural bijective way. They conjectured a formula for the generating function for the number of multiplex juggling cards with capacity 2. In this paper we prove their conjecture. More generally, we find an explicit formula for the generating function with any capacity. We also find an expression for the generating function for multiplex juggling card sequences by introducing a generalization of the q-derivative operator. As a consequence, we show that this generating function is a rational function.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Combining Evidence Across Filtrations Using Adjusters
Authors:
Yo Joong Choe,
Aaditya Ramdas
Abstract:
In anytime-valid sequential inference, it is known that any admissible procedure must be based on e-processes, which are composite generalizations of test martingales that quantify the accumulated evidence against a composite null hypothesis at any arbitrary stop** time. This paper studies methods for combining e-processes constructed using different information sets (filtrations) for the same n…
▽ More
In anytime-valid sequential inference, it is known that any admissible procedure must be based on e-processes, which are composite generalizations of test martingales that quantify the accumulated evidence against a composite null hypothesis at any arbitrary stop** time. This paper studies methods for combining e-processes constructed using different information sets (filtrations) for the same null. Although e-processes constructed in the same filtration can be combined effortlessly (e.g., by averaging), e-processes constructed in different filtrations cannot, because their validity in a coarser filtration does not translate to validity in a finer filtration. This issue arises in exchangeability tests, independence tests, and tests for comparing forecasts with lags. We first establish that a class of functions called adjusters allows us to lift e-processes from a coarser filtration into any finer filtration. We then introduce a characterization theorem for adjusters, formalizing a sense in which using adjusters is necessary. There are two major implications. First, if we have a powerful e-process in a coarsened filtration, then we readily have a powerful e-process in the original filtration. Second, when we coarsen the filtration to construct an e-process, there is an asymptotically logarithmic cost of recovering anytime-validity in the original filtration.
△ Less
Submitted 28 May, 2024; v1 submitted 14 February, 2024;
originally announced February 2024.
-
Pretraining Vision-Language Model for Difference Visual Question Answering in Longitudinal Chest X-rays
Authors:
Yeongjae Cho,
Taehee Kim,
Heejun Shin,
Sungzoon Cho,
Dongmyung Shin
Abstract:
Difference visual question answering (diff-VQA) is a challenging task that requires answering complex questions based on differences between a pair of images. This task is particularly important in reading chest X-ray images because radiologists often compare multiple images of the same patient taken at different times to track disease progression and changes in its severity in their clinical prac…
▽ More
Difference visual question answering (diff-VQA) is a challenging task that requires answering complex questions based on differences between a pair of images. This task is particularly important in reading chest X-ray images because radiologists often compare multiple images of the same patient taken at different times to track disease progression and changes in its severity in their clinical practice. However, previous works focused on designing specific network architectures for the diff-VQA task, missing opportunities to enhance the model's performance using a pretrained vision-language model (VLM). Here, we introduce a novel VLM called PLURAL, which is pretrained on natural and longitudinal chest X-ray data for the diff-VQA task. The model is developed using a step-by-step approach, starting with being pretrained on natural images and texts, followed by being trained using longitudinal chest X-ray data. The longitudinal data consist of pairs of X-ray images, along with question-answer sets and radiologist's reports that describe the changes in lung abnormalities and diseases over time. Our experimental results show that the PLURAL model outperforms state-of-the-art methods not only in diff-VQA for longitudinal X-rays but also in conventional VQA for a single X-ray image. Through extensive experiments, we demonstrate the effectiveness of the proposed VLM architecture and pretraining method in improving the model's performance.
△ Less
Submitted 17 June, 2024; v1 submitted 14 February, 2024;
originally announced February 2024.
-
A Benchmark Dataset for Tornado Detection and Prediction using Full-Resolution Polarimetric Weather Radar Data
Authors:
Mark S. Veillette,
James M. Kurdzo,
Phillip M. Stepanian,
John Y. N. Cho,
Siddharth Samsi,
Joseph McDonald
Abstract:
Weather radar is the primary tool used by forecasters to detect and warn for tornadoes in near-real time. In order to assist forecasters in warning the public, several algorithms have been developed to automatically detect tornadic signatures in weather radar observations. Recently, Machine Learning (ML) algorithms, which learn directly from large amounts of labeled data, have been shown to be hig…
▽ More
Weather radar is the primary tool used by forecasters to detect and warn for tornadoes in near-real time. In order to assist forecasters in warning the public, several algorithms have been developed to automatically detect tornadic signatures in weather radar observations. Recently, Machine Learning (ML) algorithms, which learn directly from large amounts of labeled data, have been shown to be highly effective for this purpose. Since tornadoes are extremely rare events within the corpus of all available radar observations, the selection and design of training datasets for ML applications is critical for the performance, robustness, and ultimate acceptance of ML algorithms. This study introduces a new benchmark dataset, TorNet to support development of ML algorithms in tornado detection and prediction. TorNet contains full-resolution, polarimetric, Level-II WSR-88D data sampled from 10 years of reported storm events. A number of ML baselines for tornado detection are developed and compared, including a novel deep learning (DL) architecture capable of processing raw radar imagery without the need for manual feature extraction required for existing ML algorithms. Despite not benefiting from manual feature engineering or other preprocessing, the DL model shows increased detection performance compared to non-DL and operational baselines. The TorNet dataset, as well as source code and model weights of the DL baseline trained in this work, are made freely available.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
Open-source data pipeline for street-view images: a case study on community mobility during COVID-19 pandemic
Authors:
Matthew Martell,
Nick Terry,
Ribhu Sengupta,
Chris Salazar,
Nicole A. Errett,
Scott B. Miles,
Joseph Wartman,
Youngjun Choe
Abstract:
Street View Images (SVI) are a common source of valuable data for researchers. Researchers have used SVI data for estimating pedestrian volumes, demographic surveillance, and to better understand built and natural environments in cityscapes. However, the most common source of publicly available SVI data is Google Street View. Google Street View images are collected infrequently, making temporal an…
▽ More
Street View Images (SVI) are a common source of valuable data for researchers. Researchers have used SVI data for estimating pedestrian volumes, demographic surveillance, and to better understand built and natural environments in cityscapes. However, the most common source of publicly available SVI data is Google Street View. Google Street View images are collected infrequently, making temporal analysis challenging, especially in low population density areas. Our main contribution is the development of an open-source data pipeline for processing 360-degree video recorded from a car-mounted camera. The video data is used to generate SVIs, which then can be used as an input for temporal analysis. We demonstrate the use of the pipeline by collecting a SVI dataset over a 38-month longitudinal survey of Seattle, WA, USA during the COVID-19 pandemic. The output of our pipeline is validated through statistical analyses of pedestrian traffic in the images. We confirm known results in the literature and provide new insights into outdoor pedestrian traffic patterns. This study demonstrates the feasibility and value of collecting and using SVI for research purposes beyond what is possible with currently available SVI data. Limitations and future improvements on the data pipeline and case study are also discussed.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
New Liouville-type theorem for the stationary tropical climate model
Authors:
Youseung Cho,
Hyun** In,
Minsuk Yang
Abstract:
We study the Liouville-type theorem for smooth solutions to the steady 3D tropical climate model. We prove the Liouville-type theorem if a smooth solution satisfies a certain growth condition in terms of $L^p$-norm on annuli, which improves the previous results, Theorem 1.1 (Math. Methods Appl. Sci. 44, 2021) by Ding and Wu, and Theorem 1.1 and Theorem 1.2 (Appl. Math. Lett. 138, 2023) by Yuan and…
▽ More
We study the Liouville-type theorem for smooth solutions to the steady 3D tropical climate model. We prove the Liouville-type theorem if a smooth solution satisfies a certain growth condition in terms of $L^p$-norm on annuli, which improves the previous results, Theorem 1.1 (Math. Methods Appl. Sci. 44, 2021) by Ding and Wu, and Theorem 1.1 and Theorem 1.2 (Appl. Math. Lett. 138, 2023) by Yuan and Wang.
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
Data-Driven Characterization of Latent Dynamics on Quantum Testbeds
Authors:
Sohail Reddy,
Stefanie Guenther,
Yu** Cho
Abstract:
This paper presents a data-driven approach to learn latent dynamics in superconducting quantum computing hardware. To this end, we augment the dynamical equation of quantum systems described by the Lindblad master equation with a parameterized source term that is trained from experimental data to capture unknown system dynamics, such as environmental interactions and system noise. We consider a st…
▽ More
This paper presents a data-driven approach to learn latent dynamics in superconducting quantum computing hardware. To this end, we augment the dynamical equation of quantum systems described by the Lindblad master equation with a parameterized source term that is trained from experimental data to capture unknown system dynamics, such as environmental interactions and system noise. We consider a structure preserving augmentation that learns and distinguishes unitary from dissipative latent dynamics parameterized by a basis of linear operators, as well as an augmentation given by a nonlinear feed-forward neural network. Numerical results are presented using data from two different quantum processing units (QPU) at Lawrence Livermore National Laboratory's Quantum Device and Integration Testbed. We demonstrate that our interpretable, structure preserving, and nonlinear models are able to improve the prediction accuracy of the Lindblad master equation and accurately model the latent dynamics of the QPUs.
△ Less
Submitted 1 February, 2024; v1 submitted 18 January, 2024;
originally announced January 2024.
-
Two Types of Gluons in QCD: Re-interpretation of ALEPH and CMS Gluon Jet Data
Authors:
Y. M. Cho
Abstract:
The Abelian decomposition of QCD tells that there are two types of gluons in QCD, the color neutral neurons and colored chromons, which behave differently. This implies that QCD has two types of gluon jets, the neuron jet and chromon jet. One quarter of the gluon jets is made of the neuron jets which have sharper jet radius and smaller particle multiplicity, while three quarters of them are made o…
▽ More
The Abelian decomposition of QCD tells that there are two types of gluons in QCD, the color neutral neurons and colored chromons, which behave differently. This implies that QCD has two types of gluon jets, the neuron jet and chromon jet. One quarter of the gluon jets is made of the neuron jets which have sharper jet radius and smaller particle multiplicity, while three quarters of them are made of chromon jets which have the broader jet radius and larger particle multiplicity. Moreover, the neuron jet has a distinct color flow which forms an ideal color dipole pattern, while the chromon jets have distorted dipole pattern. In this paper we provide circumferential evidences of the existence of two types of gluon jets from the existing ALEPH data on $e \bar e \rightarrow Z \rightarrow b \bar b g$ decay and the CMS data on Pb-Pb heavy ion collision.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.