Search | arXiv e-print repository

High-resolution open-vocabulary object 6D pose estimation

Authors: Jaime Corsetti, Davide Boscaini, Francesco Giuliari, Changjae Oh, Andrea Cavallaro, Fabio Poiesi

Abstract: The generalisation to unseen objects in the 6D pose estimation task is very challenging. While Vision-Language Models (VLMs) enable using natural language descriptions to support 6D pose estimation of unseen objects, these solutions underperform compared to model-based methods. In this work we present Horyon, an open-vocabulary VLM-based architecture that addresses relative pose estimation between… ▽ More The generalisation to unseen objects in the 6D pose estimation task is very challenging. While Vision-Language Models (VLMs) enable using natural language descriptions to support 6D pose estimation of unseen objects, these solutions underperform compared to model-based methods. In this work we present Horyon, an open-vocabulary VLM-based architecture that addresses relative pose estimation between two scenes of an unseen object, described by a textual prompt only. We use the textual prompt to identify the unseen object in the scenes and then obtain high-resolution multi-scale features. These features are used to extract cross-scene matches for registration. We evaluate our model on a benchmark with a large variety of unseen objects across four datasets, namely REAL275, Toyota-Light, Linemod, and YCB-Video. Our method achieves state-of-the-art performance on all datasets, outperforming by 12.6 in Average Recall the previous best-performing approach. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: Technical report. Extension of CVPR paper "Open-vocabulary object 6D pose estimation". Project page: https://jcorsetti.github.io/oryon

arXiv:2406.08086 [pdf, other]

Classical simulability of constant-depth linear-optical circuits with noise

Authors: Changhun Oh

Abstract: Noise is one of the main obstacles to realizing quantum devices that achieve a quantum computational advantage. A possible approach to minimize the noise effect is to employ shallow-depth quantum circuits since noise typically accumulates as circuit depth grows. In this work, we investigate the complexity of shallow-depth linear-optical circuits under the effects of photon loss and partial disting… ▽ More Noise is one of the main obstacles to realizing quantum devices that achieve a quantum computational advantage. A possible approach to minimize the noise effect is to employ shallow-depth quantum circuits since noise typically accumulates as circuit depth grows. In this work, we investigate the complexity of shallow-depth linear-optical circuits under the effects of photon loss and partial distinguishability. By establishing a correspondence between a linear-optical circuit and a bipartite graph, we show that the effects of photon loss and partial distinguishability are equivalent to removing the corresponding vertices. Using this correspondence and percolation theory, we prove that for constant-depth linear-optical circuits with single photons, there is a threshold of loss (noise) rate above which the linear-optical systems can be decomposed into smaller systems with high probability, which enables us to simulate the systems efficiently. Consequently, our result implies that even in shallow-depth circuits where noise is not accumulated enough, its effect may be sufficiently significant to make them efficiently simulable using classical algorithms due to its entanglement structure constituted by shallow-depth circuits. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 9 pages, 3 figures

arXiv:2406.05939 [pdf, other]

Revisiting the magnetic responses of bilayer graphene from the perspective of the quantum distance

Authors: Chang-geun Oh, Jun-Won Rhim, Bohm-Jung Yang

Abstract: We study the influence of the quantum geometry on the magnetic responses of quadratic band crossing semimetals. More explicitly, we examine the Landau levels, quantum Hall effect, and magnetic susceptibility of a general two-band Hamiltonian that has fixed isotropic quadratic band dispersion but with tunable quantum geometry, in which the interband coupling is fully characterized by the maximum qu… ▽ More We study the influence of the quantum geometry on the magnetic responses of quadratic band crossing semimetals. More explicitly, we examine the Landau levels, quantum Hall effect, and magnetic susceptibility of a general two-band Hamiltonian that has fixed isotropic quadratic band dispersion but with tunable quantum geometry, in which the interband coupling is fully characterized by the maximum quantum distance $d_\mathrm{max}$. By continuously tuning $d_\mathrm{max}$ in the range of $0\leq d_\mathrm{max}\leq 1$, we investigate how the magnetic properties of the free electron model with $d_\mathrm{max}=0$ evolve into those of the bilayer graphene with $d_\mathrm{max}=1$. We demonstrate that despite sharing the same energy dispersion $ε(p) =\pm\frac{p^2}{2m}$, the charge carriers in the free electron model and bilayer graphene exhibit entirely distinct Landau levels and quantum Hall responses due to the nontrivial quantum geometry of the wave functions. △ Less

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2405.04356 [pdf, other]

Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation

Authors: Jihyun Kim, Changjae Oh, Hoseok Do, Soohyun Kim, Kwanghoon Sohn

Abstract: We present a new multi-modal face image generation method that converts a text prompt and a visual input, such as a semantic mask or scribble map, into a photo-realistic face image. To do this, we combine the strengths of Generative Adversarial networks (GANs) and diffusion models (DMs) by employing the multi-modal features in the DM into the latent space of the pre-trained GANs. We present a simp… ▽ More We present a new multi-modal face image generation method that converts a text prompt and a visual input, such as a semantic mask or scribble map, into a photo-realistic face image. To do this, we combine the strengths of Generative Adversarial networks (GANs) and diffusion models (DMs) by employing the multi-modal features in the DM into the latent space of the pre-trained GANs. We present a simple map** and a style modulation network to link two models and convert meaningful representations in feature maps and attention maps into latent codes. With GAN inversion, the estimated latent codes can be used to generate 2D or 3D-aware facial images. We further present a multi-step training strategy that reflects textual and structural representations into the generated image. Our proposed network produces realistic 2D, multi-view, and stylized face images, which align well with inputs. We validate our method by using pre-trained 2D and 3D GANs, and our results outperform existing methods. Our project page is available at https://github.com/1211sh/Diffusion-driven_GAN-Inversion/. △ Less

Submitted 7 May, 2024; originally announced May 2024.

Comments: Accepted by CVPR 2024

arXiv:2405.01786 [pdf, ps, other]

On computational complexity and average-case hardness of shallow-depth boson sampling

Authors: Byeongseon Go, Changhun Oh, Hyunseok Jeong

Abstract: Boson sampling, a computational task believed to be classically hard to simulate, is expected to hold promise for demonstrating quantum computational advantage using near-term quantum devices. However, noise in experimental implementations poses a significant challenge, potentially rendering boson sampling classically simulable and compromising its classical intractability. Numerous studies have p… ▽ More Boson sampling, a computational task believed to be classically hard to simulate, is expected to hold promise for demonstrating quantum computational advantage using near-term quantum devices. However, noise in experimental implementations poses a significant challenge, potentially rendering boson sampling classically simulable and compromising its classical intractability. Numerous studies have proposed classical algorithms under various noise models that can efficiently simulate boson sampling as noise rates increase with circuit depth. To address this issue particularly related to circuit depth, we explore the viability of achieving quantum computational advantage through boson sampling with shallow-depth linear optical circuits. Specifically, as the average-case hardness of estimating output probabilities of boson sampling is a crucial ingredient in demonstrating its classical intractability, we make progress on establishing the average-case hardness confined to logarithmic-depth regimes. We also obtain the average-case hardness for logarithmic-depth Fock-state boson sampling subject to lossy environments and for the logarithmic-depth Gaussian boson sampling. By providing complexity-theoretical backgrounds for the classical simulation hardness of logarithmic-depth boson sampling, we expect that our findings will mark a crucial step towards a more noise-tolerant demonstration of quantum advantage with shallow-depth boson sampling. △ Less

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2405.01353 [pdf, other]

Sparse multi-view hand-object reconstruction for unseen environments

Authors: Yik Lung Pang, Changjae Oh, Andrea Cavallaro

Abstract: Recent works in hand-object reconstruction mainly focus on the single-view and dense multi-view settings. On the one hand, single-view methods can leverage learned shape priors to generalise to unseen objects but are prone to inaccuracies due to occlusions. On the other hand, dense multi-view methods are very accurate but cannot easily adapt to unseen objects without further data collection. In co… ▽ More Recent works in hand-object reconstruction mainly focus on the single-view and dense multi-view settings. On the one hand, single-view methods can leverage learned shape priors to generalise to unseen objects but are prone to inaccuracies due to occlusions. On the other hand, dense multi-view methods are very accurate but cannot easily adapt to unseen objects without further data collection. In contrast, sparse multi-view methods can take advantage of the additional views to tackle occlusion, while kee** the computational cost low compared to dense multi-view methods. In this paper, we consider the problem of hand-object reconstruction with unseen objects in the sparse multi-view setting. Given multiple RGB images of the hand and object captured at the same time, our model SVHO combines the predictions from each view into a unified reconstruction without optimisation across views. We train our model on a synthetic hand-object dataset and evaluate directly on a real world recorded hand-object dataset with unseen objects. We show that while reconstruction of unseen hands and objects from RGB is challenging, additional views can help improve the reconstruction quality. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: Camera-ready version. Paper accepted to CVPRW 2024. 8 pages, 7 figures, 1 table

arXiv:2404.17206 [pdf]

Rytov Approximation of Vectorial Waves by Modifying Scattering Matrixes: Precise Reconstruction of Dielectric Tensor Tomography

Authors: ChulMin Oh, Herve Hugonnet, Juheon Lee, YongKeun Park

Abstract: Analyzing 3D anisotropic materials presents significant challenges, especially when assessing 3D orientations, material distributions, and anisotropies through scattered light, due to the inherently vectorial nature of light-matter interactions. In this study, we formulate a scattering theory based on the Rytov approximation, commonly employed in scalar wave tomography, tailored to accommodate vec… ▽ More Analyzing 3D anisotropic materials presents significant challenges, especially when assessing 3D orientations, material distributions, and anisotropies through scattered light, due to the inherently vectorial nature of light-matter interactions. In this study, we formulate a scattering theory based on the Rytov approximation, commonly employed in scalar wave tomography, tailored to accommodate vector waves by modifying the scattering matrix. Using this formulation, we investigate the intricate 3D structure of liquid crystals with multiple topological defects exploiting dielectric tensor tomography. By leveraging dielectric tensor tomography, we successfully visualize these topological defects in three dimensions, a task that conventional 2D imaging techniques fail to achieve. △ Less

Submitted 26 April, 2024; originally announced April 2024.

arXiv:2404.11104 [pdf, other]

Object Remover Performance Evaluation Methods using Class-wise Object Removal Images

Authors: Changsuk Oh, Dongseok Shim, Taekbeom Lee, H. ** Kim

Abstract: Object removal refers to the process of erasing designated objects from an image while preserving the overall appearance, and it is one area where image inpainting is widely used in real-world applications. The performance of an object remover is quantitatively evaluated by measuring the quality of object removal results, similar to how the performance of an image inpainter is gauged. Current work… ▽ More Object removal refers to the process of erasing designated objects from an image while preserving the overall appearance, and it is one area where image inpainting is widely used in real-world applications. The performance of an object remover is quantitatively evaluated by measuring the quality of object removal results, similar to how the performance of an image inpainter is gauged. Current works reporting quantitative performance evaluations utilize original images as references. In this letter, to validate the current evaluation methods cannot properly evaluate the performance of an object remover, we create a dataset with object removal ground truth and compare the evaluations made by the current methods using original images to those utilizing object removal ground truth images. The disparities between two evaluation sets validate that the current methods are not suitable for measuring the performance of an object remover. Additionally, we propose new evaluation methods tailored to gauge the performance of an object remover. The proposed methods evaluate the performance through class-wise object removal results and utilize images without the target class objects as a comparison set. We confirm that the proposed methods can make judgments consistent with human evaluators in the COCO dataset, and that they can produce measurements aligning with those using object removal ground truth in the self-acquired dataset. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.04048 [pdf, other]

Revealing the Boundary between Quantum Mechanics and Classical Model by EPR-Steering Inequality

Authors: Ruo-Chen Wang, Zhuo-Chen Li, Xing-Yan Fan, Xiang-Ru Xie, Hong-Hao Wei, Choo Hiap Oh, **g-Ling Chen

Abstract: In quantum information, the Werner state is a benchmark to test the boundary between quantum mechanics and classical models. There have been three well-known critical values for the two-qubit Werner state, i.e., $V_{\rm c}^{\rm E}=1/3$ characterizing the boundary between entanglement and separable model, $V_{\rm c}^{\rm B}=1/K_G(3)$ characterizing the boundary between Bell's nonlocality and the lo… ▽ More In quantum information, the Werner state is a benchmark to test the boundary between quantum mechanics and classical models. There have been three well-known critical values for the two-qubit Werner state, i.e., $V_{\rm c}^{\rm E}=1/3$ characterizing the boundary between entanglement and separable model, $V_{\rm c}^{\rm B}=1/K_G(3)$ characterizing the boundary between Bell's nonlocality and the local-hidden-variable model, while $V_{\rm c}^{\rm S}=1/2$ characterizing the boundary between Einstein-Podolsky-Rosen (EPR) steering and the local-hidden-state model. So far, the problem of $V_{\rm c}^{\rm E}=1/3$ has been completely solved by an inequality involving in the positive-partial-transpose criterion, while how to reveal the other two critical values by the inequality approach are still open. In this work, we focus on EPR steering, which is a form of quantum nonlocality intermediate between entanglement and Bell's nonlocality. By proposing the optimal $N$-setting linear EPR-steering inequalities, we have successfully obtained the desired value $V_{\rm c}^{\rm S}=1/2$ for the two-qubit Werner state, thus resolving the long-standing problem. △ Less

Submitted 9 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

Comments: Main text: 6 pages, 2 figures; SM: 8 pages, 5 figures. Adding some references

arXiv:2403.18431 [pdf, ps, other]

$l^2$ decoupling theorem for surfaces in $\mathbb{R}^3$

Authors: Larry Guth, Dominique Maldague, Changkeun Oh

Abstract: We identify a new way to divide the $δ$-neighborhood of surfaces $\mathcal{M}\subset\mathbb{R}^3$ into a finitely-overlap** collection of rectangular boxes $S$. We obtain a sharp $(l^2,L^p)$ decoupling estimate using this decomposition, for the sharp range of exponents $2\leq p\leq 4$. Our decoupling inequality leads to new exponential sum estimates where the frequencies lie on surfaces which do… ▽ More We identify a new way to divide the $δ$-neighborhood of surfaces $\mathcal{M}\subset\mathbb{R}^3$ into a finitely-overlap** collection of rectangular boxes $S$. We obtain a sharp $(l^2,L^p)$ decoupling estimate using this decomposition, for the sharp range of exponents $2\leq p\leq 4$. Our decoupling inequality leads to new exponential sum estimates where the frequencies lie on surfaces which do not contain a line. △ Less

Submitted 27 March, 2024; originally announced March 2024.

Comments: 34 pages

arXiv:2403.16158 [pdf, other]

Korean Bio-Medical Corpus (KBMC) for Medical Named Entity Recognition

Authors: Sungjoo Byun, Jiseung Hong, Sumin Park, Dongjun Jang, Jean Seo, Minseok Kim, Chaeyoung Oh, Hyopil Shin

Abstract: Named Entity Recognition (NER) plays a pivotal role in medical Natural Language Processing (NLP). Yet, there has not been an open-source medical NER dataset specifically for the Korean language. To address this, we utilized ChatGPT to assist in constructing the KBMC (Korean Bio-Medical Corpus), which we are now presenting to the public. With the KBMC dataset, we noticed an impressive 20% increase… ▽ More Named Entity Recognition (NER) plays a pivotal role in medical Natural Language Processing (NLP). Yet, there has not been an open-source medical NER dataset specifically for the Korean language. To address this, we utilized ChatGPT to assist in constructing the KBMC (Korean Bio-Medical Corpus), which we are now presenting to the public. With the KBMC dataset, we noticed an impressive 20% increase in medical NER performance compared to models trained on general Korean NER datasets. This research underscores the significant benefits and importance of using specialized tools and datasets, like ChatGPT, to enhance language processing in specialized fields such as healthcare. △ Less

Submitted 24 March, 2024; originally announced March 2024.

Journal ref: LREC-COLING 2024

arXiv:2403.14353 [pdf, other]

DaCapo: Accelerating Continuous Learning in Autonomous Systems for Video Analytics

Authors: Yoonsung Kim, Changhun Oh, **woo Hwang, Wonung Kim, Seongryong Oh, Yubin Lee, Hardik Sharma, Amir Yazdanbakhsh, Jongse Park

Abstract: Deep neural network (DNN) video analytics is crucial for autonomous systems such as self-driving vehicles, unmanned aerial vehicles (UAVs), and security robots. However, real-world deployment faces challenges due to their limited computational resources and battery power. To tackle these challenges, continuous learning exploits a lightweight "student" model at deployment (inference), leverages a l… ▽ More Deep neural network (DNN) video analytics is crucial for autonomous systems such as self-driving vehicles, unmanned aerial vehicles (UAVs), and security robots. However, real-world deployment faces challenges due to their limited computational resources and battery power. To tackle these challenges, continuous learning exploits a lightweight "student" model at deployment (inference), leverages a larger "teacher" model for labeling sampled data (labeling), and continuously retrains the student model to adapt to changing scenarios (retraining). This paper highlights the limitations in state-of-the-art continuous learning systems: (1) they focus on computations for retraining, while overlooking the compute needs for inference and labeling, (2) they rely on power-hungry GPUs, unsuitable for battery-operated autonomous systems, and (3) they are located on a remote centralized server, intended for multi-tenant scenarios, again unsuitable for autonomous systems due to privacy, network availability, and latency concerns. We propose a hardware-algorithm co-designed solution for continuous learning, DaCapo, that enables autonomous systems to perform concurrent executions of inference, labeling, and training in a performant and energy-efficient manner. DaCapo comprises (1) a spatially-partitionable and precision-flexible accelerator enabling parallel execution of kernels on sub-accelerators at their respective precisions, and (2) a spatiotemporal resource allocation algorithm that strategically navigates the resource-accuracy tradeoff space, facilitating optimal decisions for resource allocation to achieve maximal accuracy. Our evaluation shows that DaCapo achieves 6.5% and 5.5% higher accuracy than a state-of-the-art GPU-based continuous learning systems, Ekya and EOMU, respectively, while consuming 254x less power. △ Less

Submitted 28 April, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

arXiv:2402.18809 [pdf, other]

Entanglement-enabled advantage for learning a bosonic random displacement channel

Authors: Changhun Oh, Senrui Chen, Yat Wong, Sisi Zhou, Hsin-Yuan Huang, Jens A. H. Nielsen, Zheng-Hao Liu, Jonas S. Neergaard-Nielsen, Ulrik L. Andersen, Liang Jiang, John Preskill

Abstract: We show that quantum entanglement can provide an exponential advantage in learning properties of a bosonic continuous-variable (CV) system. The task we consider is estimating a probabilistic mixture of displacement operators acting on $n$ bosonic modes, called a random displacement channel. We prove that if the $n$ modes are not entangled with an ancillary quantum memory, then the channel must be… ▽ More We show that quantum entanglement can provide an exponential advantage in learning properties of a bosonic continuous-variable (CV) system. The task we consider is estimating a probabilistic mixture of displacement operators acting on $n$ bosonic modes, called a random displacement channel. We prove that if the $n$ modes are not entangled with an ancillary quantum memory, then the channel must be sampled a number of times exponential in $n$ in order to estimate its characteristic function to reasonable precision; this lower bound on sample complexity applies even if the channel inputs and measurements performed on channel outputs are chosen adaptively. On the other hand, we present a simple entanglement-assisted scheme that only requires a number of samples independent of $n$, given a sufficient amount of squeezing. This establishes an exponential separation in sample complexity. We then analyze the effect of photon loss and show that the entanglement-assisted scheme is still significantly more efficient than any lossless entanglement-free scheme under mild experimental conditions. Our work illuminates the role of entanglement in learning continuous-variable systems and points toward experimentally feasible demonstrations of provable entanglement-enabled advantage using CV quantum platforms. △ Less

Submitted 28 February, 2024; originally announced February 2024.

Comments: 7+26 pages, 3+5 figures

arXiv:2402.18548 [pdf, other]

Universal Spreading of Conditional Mutual Information in Noisy Random Circuits

Authors: Su-un Lee, Changhun Oh, Yat Wong, Senrui Chen, Liang Jiang

Abstract: We study the evolution of conditional mutual information in generic open quantum systems, focusing on one-dimensional random circuits with interspersed local noise. Unlike in noiseless circuits, where conditional mutual information spreads linearly while being bounded by the lightcone, we find that noisy random circuits with an error rate $p$ exhibit superlinear propagation of conditional mutual i… ▽ More We study the evolution of conditional mutual information in generic open quantum systems, focusing on one-dimensional random circuits with interspersed local noise. Unlike in noiseless circuits, where conditional mutual information spreads linearly while being bounded by the lightcone, we find that noisy random circuits with an error rate $p$ exhibit superlinear propagation of conditional mutual information, which diverges far beyond the lightcone at a critical circuit depth $t_c \propto p^{-1}$. We demonstrate that the underlying mechanism for such rapid spreading is the combined effect of local noise and a scrambling unitary, which selectively removes short-range correlations while preserving long-range correlations. To analytically capture the dynamics of conditional mutual information in noisy random circuits, we introduce a coarse-graining method, and we validate our theoretical results through numerical simulations. Furthermore, we identify a universal scaling law governing the spreading of conditional mutual information. △ Less

Submitted 4 May, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

arXiv:2402.15265 [pdf, other]

CloChat: Understanding How People Customize, Interact, and Experience Personas in Large Language Models

Authors: Juhye Ha, Hyeon Jeon, DaEun Han, **wook Seo, Changhoon Oh

Abstract: Large language models (LLMs) have facilitated significant strides in generating conversational agents, enabling seamless, contextually relevant dialogues across diverse topics. However, the existing LLM-driven conversational agents have fixed personalities and functionalities, limiting their adaptability to individual user needs. Creating personalized agent personas with distinct expertise or trai… ▽ More Large language models (LLMs) have facilitated significant strides in generating conversational agents, enabling seamless, contextually relevant dialogues across diverse topics. However, the existing LLM-driven conversational agents have fixed personalities and functionalities, limiting their adaptability to individual user needs. Creating personalized agent personas with distinct expertise or traits can address this issue. Nonetheless, we lack knowledge of how people customize and interact with agent personas. In this research, we investigated how users customize agent personas and their impact on interaction quality, diversity, and dynamics. To this end, we developed CloChat, an interface supporting easy and accurate customization of agent personas in LLMs. We conducted a study comparing how participants interact with CloChat and ChatGPT. The results indicate that participants formed emotional bonds with the customized agents, engaged in more dynamic dialogues, and showed interest in sustaining interactions. These findings contribute to design implications for future systems with conversational agents using LLMs. △ Less

Submitted 23 February, 2024; originally announced February 2024.

Comments: In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '24)

arXiv:2402.14279 [pdf, other]

Mitigating the Linguistic Gap with Phonemic Representations for Robust Multilingual Language Understanding

Authors: Haeji Jung, Changdae Oh, Jooeon Kang, Jimin Sohn, Kyungwoo Song, **kyu Kim, David R. Mortensen

Abstract: Approaches to improving multilingual language understanding often require multiple languages during the training phase, rely on complicated training techniques, and -- importantly -- struggle with significant performance gaps between high-resource and low-resource languages. We hypothesize that the performance gaps between languages are affected by linguistic gaps between those languages and provi… ▽ More Approaches to improving multilingual language understanding often require multiple languages during the training phase, rely on complicated training techniques, and -- importantly -- struggle with significant performance gaps between high-resource and low-resource languages. We hypothesize that the performance gaps between languages are affected by linguistic gaps between those languages and provide a novel solution for robust multilingual language modeling by employing phonemic representations (specifically, using phonemes as input tokens to LMs rather than subwords). We present quantitative evidence from three cross-lingual tasks that demonstrate the effectiveness of phonemic representation, which is further justified by a theoretical analysis of the cross-lingual performance gap. △ Less

Submitted 21 February, 2024; originally announced February 2024.

arXiv:2401.15996 [pdf, other]

doi 10.1145/3613904.3642767

AccessLens: Auto-detecting Inaccessibility of Everyday Objects

Authors: Nahyun Kwon, Qian Lu, Muhammad Hasham Qazi, Joanne Liu, Changhoon Oh, Shu Kong, Jeeeun Kim

Abstract: In our increasingly diverse society, everyday physical interfaces often present barriers, impacting individuals across various contexts. This oversight, from small cabinet knobs to identical wall switches that can pose different contextual challenges, highlights an imperative need for solutions. Leveraging low-cost 3D-printed augmentations such as knob magnifiers and tactile labels seems promising… ▽ More In our increasingly diverse society, everyday physical interfaces often present barriers, impacting individuals across various contexts. This oversight, from small cabinet knobs to identical wall switches that can pose different contextual challenges, highlights an imperative need for solutions. Leveraging low-cost 3D-printed augmentations such as knob magnifiers and tactile labels seems promising, yet the process of discovering unrecognized barriers remains challenging because disability is context-dependent. We introduce AccessLens, an end-to-end system designed to identify inaccessible interfaces in daily objects, and recommend 3D-printable augmentations for accessibility enhancement. Our approach involves training a detector using the novel AccessDB dataset designed to automatically recognize 21 distinct Inaccessibility Classes (e.g., bar-small and round-rotate) within 6 common object categories (e.g., handle and knob). AccessMeta serves as a robust way to build a comprehensive dictionary linking these accessibility classes to open-source 3D augmentation designs. Experiments demonstrate our detector's performance in detecting inaccessible objects. △ Less

Submitted 23 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

Comments: CHI2024

arXiv:2312.00690 [pdf, other]

Open-vocabulary object 6D pose estimation

Authors: Jaime Corsetti, Davide Boscaini, Changjae Oh, Andrea Cavallaro, Fabio Poiesi

Abstract: We introduce the new setting of open-vocabulary object 6D pose estimation, in which a textual prompt is used to specify the object of interest. In contrast to existing approaches, in our setting (i) the object of interest is specified solely through the textual prompt, (ii) no object model (e.g., CAD or video sequence) is required at inference, and (iii) the object is imaged from two RGBD viewpoin… ▽ More We introduce the new setting of open-vocabulary object 6D pose estimation, in which a textual prompt is used to specify the object of interest. In contrast to existing approaches, in our setting (i) the object of interest is specified solely through the textual prompt, (ii) no object model (e.g., CAD or video sequence) is required at inference, and (iii) the object is imaged from two RGBD viewpoints of different scenes. To operate in this setting, we introduce a novel approach that leverages a Vision-Language Model to segment the object of interest from the scenes and to estimate its relative 6D pose. The key of our approach is a carefully devised strategy to fuse object-level information provided by the prompt with local image features, resulting in a feature space that can generalize to novel concepts. We validate our approach on a new benchmark based on two popular datasets, REAL275 and Toyota-Light, which collectively encompass 34 object instances appearing in four thousand image pairs. The results demonstrate that our approach outperforms both a well-established hand-crafted method and a recent deep learning-based baseline in estimating the relative 6D pose of objects in different scenes. Code and dataset are available at https://jcorsetti.github.io/oryon. △ Less

Submitted 25 June, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

Comments: Camera ready version (CVPR 2024, poster highlight). New Oryon version: arXiv:2406.16384

arXiv:2311.13784 [pdf, other]

DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for Korean NLP

Authors: Dongjun Jang, Sangah Lee, Sungjoo Byun, **woong Kim, Jean Seo, Minseok Kim, Soyeon Kim, Chaeyoung Oh, Jaeyoon Kim, Hyemi Jo, Hyopil Shin

Abstract: This paper presents the DaG LLM (David and Goliath Large Language Model), a language model specialized for Korean and fine-tuned through Instruction Tuning across 41 tasks within 13 distinct categories. This paper presents the DaG LLM (David and Goliath Large Language Model), a language model specialized for Korean and fine-tuned through Instruction Tuning across 41 tasks within 13 distinct categories. △ Less

Submitted 22 November, 2023; originally announced November 2023.

arXiv:2311.01723 [pdf, other]

Towards Calibrated Robust Fine-Tuning of Vision-Language Models

Authors: Changdae Oh, Hyesu Lim, Mijoo Kim, Dongyoon Han, Sangdoo Yun, Jaegul Choo, Alexander Hauptmann, Zhi-Qi Cheng, Kyungwoo Song

Abstract: Improving out-of-distribution (OOD) generalization through in-distribution (ID) adaptation is a primary goal of robust fine-tuning methods beyond the naive fine-tuning approach. However, despite decent OOD generalization performance from recent robust fine-tuning methods, OOD confidence calibration for reliable machine learning has not been fully addressed. This work proposes a robust fine-tuning… ▽ More Improving out-of-distribution (OOD) generalization through in-distribution (ID) adaptation is a primary goal of robust fine-tuning methods beyond the naive fine-tuning approach. However, despite decent OOD generalization performance from recent robust fine-tuning methods, OOD confidence calibration for reliable machine learning has not been fully addressed. This work proposes a robust fine-tuning method that improves both OOD accuracy and calibration error in Vision Language Models (VLMs). Firstly, we show that both types of errors have a shared upper bound consisting of two terms of ID data: 1) calibration error and 2) the smallest singular value of the input covariance matrix. Based on this insight, we design a novel framework that conducts fine-tuning with a constrained multimodal contrastive loss enforcing a larger smallest singular value, which is further aided by the self-distillation of a moving averaged model to achieve well-calibrated prediction. Starting from an empirical validation of our theoretical statements, we provide extensive experimental results on ImageNet distribution shift benchmarks that demonstrate the effectiveness of our method. △ Less

Submitted 27 May, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

Comments: Presented at the NeurIPS 2023 Workshop on Distribution Shifts (DistShift)

arXiv:2310.09614 [pdf, other]

Bridging the Divide: Unraveling the Knowledge Gap in Data Visualization Research and Practice

Authors: Nam Wook Kim, Grace Myers, **han Choi, Yoonsuh Cho, Changhoon Oh, Yea-Seul Kim

Abstract: Empirical research on perception and cognition has laid the foundation for visualization design, often yielding useful design guidelines for practitioners. However, it remains uncertain how well practitioners stay informed about such crucial visualization design knowledge. In this paper, we employed a mixed-method approach to explore the knowledge gap between visualization research and real-world… ▽ More Empirical research on perception and cognition has laid the foundation for visualization design, often yielding useful design guidelines for practitioners. However, it remains uncertain how well practitioners stay informed about such crucial visualization design knowledge. In this paper, we employed a mixed-method approach to explore the knowledge gap between visualization research and real-world design guidelines. We initially collected existing design guidelines from various sources and empirical studies from diverse publishing venues, analyzing their alignment and uncovering missing links and inconsistent knowledge. Subsequently, we conducted surveys and interviews with practitioners and researchers to gain further insights into their experiences and attitudes towards design guidelines and empirical studies, and their views on the knowledge gap between research and practice. Our findings highlight the similarities and differences in their perspectives and propose strategies to bridge the divide in visualization design knowledge. △ Less

Submitted 30 January, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

Comments: 15 pages, 5 figures

arXiv:2310.09015 [pdf, other]

doi 10.1038/s41586-023-06479-6

Precessing jet nozzle connecting to a spinning black hole in M87

Authors: Yuzhu Cui, Kazuhiro Hada, Tomohisa Kawashima, Motoki Kino, Weikang Lin, Yosuke Mizuno, Hyunwook Ro, Mareki Honma, Kunwoo Yi, **tao Yu, Jongho Park, Wu Jiang, Zhiqiang Shen, Evgeniya Kravchenko, Juan-Carlos Algaba, Xiaopeng Cheng, Ilje Cho, Gabriele Giovannini, Marcello Giroletti, Taehyun Jung, Ru-Sen Lu, Kotaro Niinuma, Junghwan Oh, Ken Ohsuga, Satoko Sawada-Satoh , et al. (54 additional authors not shown)

Abstract: The nearby radio galaxy M87 offers a unique opportunity to explore the connections between the central supermassive black hole and relativistic jets. Previous studies of the inner region of M87 revealed a wide opening angle for the jet originating near the black hole. The Event Horizon Telescope resolved the central radio source and found an asymmetric ring structure consistent with expectations f… ▽ More The nearby radio galaxy M87 offers a unique opportunity to explore the connections between the central supermassive black hole and relativistic jets. Previous studies of the inner region of M87 revealed a wide opening angle for the jet originating near the black hole. The Event Horizon Telescope resolved the central radio source and found an asymmetric ring structure consistent with expectations from General Relativity. With a baseline of 17 years of observations, there was a shift in the jet's transverse position, possibly arising from an eight to ten-year quasi-periodicity. However, the origin of this sideways shift remains unclear. Here we report an analysis of radio observations over 22 years that suggests a period of about 11 years in the position angle variation of the jet. We infer that we are seeing a spinning black hole that induces the Lense-Thirring precession of a misaligned accretion disk. Similar jet precession may commonly occur in other active galactic nuclei but has been challenging to detect owing to the small magnitude and long period of the variation. △ Less

Submitted 13 October, 2023; originally announced October 2023.

Comments: 41 pages, 7 figures, 7 tables

Journal ref: 2023, Nature, 621, 711-715

arXiv:2309.13461 [pdf, other]

doi 10.1103/PhysRevLett.132.180805

Tight bounds on Pauli channel learning without entanglement

Authors: Senrui Chen, Changhun Oh, Sisi Zhou, Hsin-Yuan Huang, Liang Jiang

Abstract: Quantum entanglement is a crucial resource for learning properties from nature, but a precise characterization of its advantage can be challenging. In this work, we consider learning algorithms without entanglement to be those that only utilize states, measurements, and operations that are separable between the main system of interest and an ancillary system. Interestingly, we show that these algo… ▽ More Quantum entanglement is a crucial resource for learning properties from nature, but a precise characterization of its advantage can be challenging. In this work, we consider learning algorithms without entanglement to be those that only utilize states, measurements, and operations that are separable between the main system of interest and an ancillary system. Interestingly, we show that these algorithms are equivalent to those that apply quantum circuits on the main system interleaved with mid-circuit measurements and classical feedforward. Within this setting, we prove a tight lower bound for Pauli channel learning without entanglement that closes the gap between the best-known upper and lower bound. In particular, we show that $Θ(2^n\varepsilon^{-2})$ rounds of measurements are required to estimate each eigenvalue of an $n$-qubit Pauli channel to $\varepsilon$ error with high probability when learning without entanglement. In contrast, a learning algorithm with entanglement only needs $Θ(\varepsilon^{-2})$ copies of the Pauli channel. The tight lower bound strengthens the foundation for an experimental demonstration of entanglement-enhanced advantages for Pauli noise characterization. △ Less

Submitted 17 April, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

Comments: 20 pages, 2 figure; v2: add Fig.2 comparing our lower bound with noisy-entanglement-assisted upper bound, close to accepted version

Journal ref: Phys. Rev. Lett. 132, 180805 (2024)

arXiv:2308.06427 [pdf, other]

Restriction estimates for quadratic manifolds of arbitrary codimensions

Authors: Shengwen Gan, Larry Guth, Changkeun Oh

Abstract: The restriction conjecture is one of the famous problems in harmonic analysis. There have been many methods developed in the study of the conjecture for the paraboloid. In this paper, we generalize the multilinear method of Bourgain and Guth for the paraboloid, and obtain restriction estimates for all quadratic manifolds of arbitrary codimensions. In particular, our theorem recovers the main theor… ▽ More The restriction conjecture is one of the famous problems in harmonic analysis. There have been many methods developed in the study of the conjecture for the paraboloid. In this paper, we generalize the multilinear method of Bourgain and Guth for the paraboloid, and obtain restriction estimates for all quadratic manifolds of arbitrary codimensions. In particular, our theorem recovers the main theorem of Bourgain and Guth for the paraboloid. A new ingredient is a covering lemma for varieties whose proof relies on Tarski's projection theorem in real algebraic geometry. We also provide algorithms to compute several algebraic quantities that naturally appear in the argument. These algorithms rely on a cylindrical decomposition in real algebraic geometry. △ Less

Submitted 11 August, 2023; originally announced August 2023.

Comments: 43 pages

arXiv:2307.06784 [pdf, other]

Robotic surface exploration with vision and tactile sensing for cracks detection and characterisation

Authors: Francesca Palermo, Bukeikhan Omarali, Changae Oh, Kaspar Althoefer, Ildar Farkhatdinov

Abstract: This paper presents a novel algorithm for crack localisation and detection based on visual and tactile analysis via fibre-optics. A finger-shaped sensor based on fibre-optics is employed for the data acquisition to collect data for the analysis and the experiments. To detect the possible locations of cracks a camera is used to scan an environment while running an object detection algorithm. Once t… ▽ More This paper presents a novel algorithm for crack localisation and detection based on visual and tactile analysis via fibre-optics. A finger-shaped sensor based on fibre-optics is employed for the data acquisition to collect data for the analysis and the experiments. To detect the possible locations of cracks a camera is used to scan an environment while running an object detection algorithm. Once the crack is detected, a fully-connected graph is created from a skeletonised version of the crack. A minimum spanning tree is then employed for calculating the shortest path to explore the crack which is then used to develop the motion planner for the robotic manipulator. The motion planner divides the crack into multiple nodes which are then explored individually. Then, the manipulator starts the exploration and performs the tactile data classification to confirm if there is indeed a crack in that location or just a false positive from the vision algorithm. If a crack is detected, also the length, width, orientation and number of branches are calculated. This is repeated until all the nodes of the crack are explored. In order to validate the complete algorithm, various experiments are performed: comparison of exploration of cracks through full scan and motion planning algorithm, implementation of frequency-based features for crack classification and geometry analysis using a combination of vision and tactile data. From the results of the experiments, it is shown that the proposed algorithm is able to detect cracks and improve the results obtained from vision to correctly classify cracks and their geometry with minimal cost thanks to the motion planning algorithm. △ Less

Submitted 13 July, 2023; originally announced July 2023.

Comments: 12 pages

arXiv:2307.06445 [pdf, ps, other]

Small cap decoupling for the paraboloid in $\mathbb{R}^n$

Authors: Larry Guth, Dominique Maldague, Changkeun Oh

Abstract: We extend the small cap decoupling program established by Demeter, Guth, and Want to paraboloids in $\mathbb{R}^n$ for some range of $p$. We extend the small cap decoupling program established by Demeter, Guth, and Want to paraboloids in $\mathbb{R}^n$ for some range of $p$. △ Less

Submitted 27 March, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

Comments: 17 pages, small corrections following referee report

arXiv:2306.10671 [pdf, ps, other]

doi 10.1103/PhysRevA.109.052613

Exploring Shallow-Depth Boson Sampling: Towards Scalable Quantum Supremacy

Authors: Byeongseon Go, Changhun Oh, Liang Jiang, Hyunseok Jeong

Abstract: Boson sampling is a sampling task proven to be hard to simulate efficiently using classical computers under plausible assumptions, which makes it an appealing candidate for quantum supremacy. However, due to a large noise rate for near-term quantum devices, it is still unclear whether those noisy devices maintain the quantum advantage for much larger quantum systems. Since the noise rate typically… ▽ More Boson sampling is a sampling task proven to be hard to simulate efficiently using classical computers under plausible assumptions, which makes it an appealing candidate for quantum supremacy. However, due to a large noise rate for near-term quantum devices, it is still unclear whether those noisy devices maintain the quantum advantage for much larger quantum systems. Since the noise rate typically grows with the circuit depth, an alternative is to find evidence of simulation hardness at the shallow-depth quantum circuit. To find the evidence, one way is to identify the minimum depth required for the average-case hardness of approximating output probabilities, which is considered a necessary condition for the state-of-the-art technique to prove the simulation hardness of boson sampling. In this work, we analyze the output probability distribution of shallow-depth boson sampling for Fock-states and Gaussian states, and examine the limitation of the average-case hardness argument at this shallow-depth regime for geometrically local architectures. We propose a shallow-depth linear optical circuit architecture that can overcome the problems associated with geometrically local architectures. Our numerical results suggest that this architecture demonstrates possibilities of average-case hardness properties in a shallow-depth regime, through its resemblance to the global Haar-random boson sampling circuit. This result implies that the corresponding architecture has the potential to be utilized for scalable quantum supremacy with its shallow-depth boson sampling. △ Less

Submitted 7 May, 2024; v1 submitted 18 June, 2023; originally announced June 2023.

Journal ref: Phys. Rev. A 109, 052613 (2024)

arXiv:2306.03709 [pdf, other]

doi 10.1038/s41567-024-02535-8

Classical algorithm for simulating experimental Gaussian boson sampling

Authors: Changhun Oh, Minzhao Liu, Yuri Alexeev, Bill Fefferman, Liang Jiang

Abstract: Gaussian boson sampling is a promising candidate for showing experimental quantum advantage. While there is evidence that noiseless Gaussian boson sampling is hard to efficiently simulate using a classical computer, the current Gaussian boson sampling experiments inevitably suffer from loss and other noise models. Despite a high photon loss rate and the presence of noise, they are currently claime… ▽ More Gaussian boson sampling is a promising candidate for showing experimental quantum advantage. While there is evidence that noiseless Gaussian boson sampling is hard to efficiently simulate using a classical computer, the current Gaussian boson sampling experiments inevitably suffer from loss and other noise models. Despite a high photon loss rate and the presence of noise, they are currently claimed to be hard to classically simulate with the best-known classical algorithm. In this work, we present a classical tensor-network algorithm that simulates Gaussian boson sampling and whose complexity can be significantly reduced when the photon loss rate is high. By generalizing the existing thermal-state approximation algorithm of lossy Gaussian boson sampling, the proposed algorithm allows us to achieve increased accuracy as the running time of the algorithm scales, as opposed to the algorithm that samples from the thermal state, which can give only a fixed accuracy. This generalization enables us to simulate the largest scale Gaussian boson sampling experiment so far using relatively modest computational resources, even though the output state of these experiments is not believed to be close to a thermal state. By demonstrating that our new classical algorithm outperforms the large-scale experiments on the benchmarks used as evidence for quantum advantage, we exhibit evidence that our classical sampler can simulate the ground-truth distribution better than the experiment can, which disputes the experimental quantum advantage claims. △ Less

Submitted 4 December, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

Comments: 23 pages, 13 figures

Journal ref: Nature Physics (2024)

arXiv:2305.07857 [pdf, other]

AURA : Automatic Mask Generator using Randomized Input Sampling for Object Removal

Authors: Changsuk Oh, Dongseok Shim, H. ** Kim

Abstract: The objective of the image inpainting task is to fill missing regions of an image in a visually plausible way. Recently, deep-learning-based image inpainting networks have generated outstanding results, and some utilize their models as object removers by masking unwanted objects in an image. However, while trying to better remove objects using their networks, the previous works pay less attention… ▽ More The objective of the image inpainting task is to fill missing regions of an image in a visually plausible way. Recently, deep-learning-based image inpainting networks have generated outstanding results, and some utilize their models as object removers by masking unwanted objects in an image. However, while trying to better remove objects using their networks, the previous works pay less attention to the importance of the input mask. In this paper, we focus on generating the input mask to better remove objects using the off-the-shelf image inpainting network. We propose an automatic mask generator inspired by the explainable AI (XAI) method, whose output can better remove objects than a semantic segmentation mask. The proposed method generates an importance map using randomly sampled input masks and quantitatively estimated scores of the completed images obtained from the random masks. The output mask is selected by a judge module among the candidate masks which are generated from the importance map. We design the judge module to quantitatively estimate the quality of the object removal results. In addition, we empirically find that the evaluation methods used in the previous works reporting object removal results are not appropriate for estimating the performance of an object remover. Therefore, we propose new evaluation metrics (FID$^*$ and U-IDS$^*$) to properly evaluate the quality of object removers. Experiments confirm that our method shows better performance in removing target class objects than the masks generated from the semantic segmentation maps, and the two proposed metrics make judgments consistent with humans. △ Less

Submitted 13 May, 2023; originally announced May 2023.

arXiv:2305.00448 [pdf, ps, other]

General construction scheme for geometrically nontrivial flat band models

Authors: Hyeongseop Kim, Chang-geun Oh, Jun-Won Rhim

Abstract: A singular flat band(SFB), a distinct class of the flat band, has been shown to exhibit various intriguing material properties characterized by a geometric quantity of the Bloch wave function called the quantum distance. We present a general construction scheme for a tight-binding model hosting an SFB, where the quantum distance profile can be controlled. We first introduce how to build a compact… ▽ More A singular flat band(SFB), a distinct class of the flat band, has been shown to exhibit various intriguing material properties characterized by a geometric quantity of the Bloch wave function called the quantum distance. We present a general construction scheme for a tight-binding model hosting an SFB, where the quantum distance profile can be controlled. We first introduce how to build a compact localized state(CLS), a characteristic eigenstate of the flat band, providing the flat band with a band-touching point, where a specific value of the maximum quantum distance is assigned. Then, we develop a scheme designing a tight-binding Hamiltonian hosting an SFB starting from the obtained CLS, satisfying the desired hop** range and symmetries by applying the construction scheme. While the scheme can be applied to any dimensions and lattice structures, we propose several simple SFB models on the square and kagome lattices. Finally, we establish a bulk-boundary correspondence between the maximum quantum distance and the boundary modes for the open boundary condition, which can be used to detect the quantum distance via the electronic structure of the boundary states. △ Less

Submitted 30 April, 2023; originally announced May 2023.

Comments: 10 pages, 6 figures

arXiv:2304.07432 [pdf, other]

doi 10.1103/PhysRevResearch.6.013058

Revisiting electromagnetic response of superconductors in mean-field approximation

Authors: Chang-geun Oh, Haruki Watanabe

Abstract: In the standard mean-field treatment of superconductors, the electron-electron interactions are assumed to be written in terms of local density operators. However, more general interactions, such as pair-hop** interactions, may exist or may be generated in a low-energy effective Hamiltonian. In this work, we study the effect of correlated hop** interactions toward the electromagnetic response… ▽ More In the standard mean-field treatment of superconductors, the electron-electron interactions are assumed to be written in terms of local density operators. However, more general interactions, such as pair-hop** interactions, may exist or may be generated in a low-energy effective Hamiltonian. In this work, we study the effect of correlated hop** interactions toward the electromagnetic response of superconductors. When only the Hamiltonian after the mean-field approximation is provided, one cannot unambiguously determine its electromagnetic response whenever such interactions are allowed. This work demonstrates that such interactions induce additional terms in the current operator, leading to modifications in the Meissner weight and optical conductivities that deviate from conventional expectations. These results underscore the need for caution when incorporating gauge fields into the BdG Hamiltonian. △ Less

Submitted 26 October, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

Comments: 6+7 pages, 3 figures; v2: discussions clarified, a new figure added

Journal ref: Phys. Rev. Research 6, 013058 (2024)

arXiv:2303.15838 [pdf, other]

doi 10.1103/PhysRevA.109.022410

Efficacy of virtual purification-based error mitigation on quantum metrology

Authors: Hyukgun Kwon, Changhun Oh, Youngrong Lim, Hyunseok Jeong, Liang Jiang

Abstract: Noise is the main source that hinders us from fully exploiting quantum advantages in various quantum informational tasks. However, characterizing and calibrating the effect of noise is not always feasible in practice. Especially for quantum parameter estimation, an estimator constructed without precise knowledge of noise entails an inevitable bias. Recently, virtual purification-based error mitiga… ▽ More Noise is the main source that hinders us from fully exploiting quantum advantages in various quantum informational tasks. However, characterizing and calibrating the effect of noise is not always feasible in practice. Especially for quantum parameter estimation, an estimator constructed without precise knowledge of noise entails an inevitable bias. Recently, virtual purification-based error mitigation (VPEM) has been proposed to apply for quantum metrology to reduce such a bias occurring from unknown noise. While it was demonstrated to work for particular cases, whether VPEM always reduces a bias for general estimation schemes is unclear yet. For more general applications of VPEM to quantum metrology, we study factors determining whether VPEM can reduce the bias. We find that the closeness between the dominant eigenvector of a noisy state and the ideal quantum probe (without noise) with respect to an observable determines the reducible amount of bias by VPEM. Next, we show that one should carefully choose the reference point of the target parameter, which gives a smaller bias than others because the bias depends on the reference point. Otherwise, even if the dominant eigenvector and the ideal quantum probe are close, the bias of the mitigated case could be larger than the non-mitigated one. Finally, we analyze the error mitigation for a phase estimation scheme under various noises. Based on our analysis, we predict whether VPEM can effectively reduce a bias and numerically verify our results. △ Less

Submitted 17 December, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

Comments: 14 pages, 10 figures

Journal ref: Phys. Rev. A 109, 022410 (2024)

arXiv:2303.14773 [pdf, other]

BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning

Authors: Changdae Oh, Hyeji Hwang, Hee-young Lee, YongTaek Lim, Geunyoung Jung, Jiyoung Jung, Hosik Choi, Kyungwoo Song

Abstract: With the surge of large-scale pre-trained models (PTMs), fine-tuning these models to numerous downstream tasks becomes a crucial problem. Consequently, parameter efficient transfer learning (PETL) of large models has grasped huge attention. While recent PETL methods showcase impressive performance, they rely on optimistic assumptions: 1) the entire parameter set of a PTM is available, and 2) a suf… ▽ More With the surge of large-scale pre-trained models (PTMs), fine-tuning these models to numerous downstream tasks becomes a crucial problem. Consequently, parameter efficient transfer learning (PETL) of large models has grasped huge attention. While recent PETL methods showcase impressive performance, they rely on optimistic assumptions: 1) the entire parameter set of a PTM is available, and 2) a sufficiently large memory capacity for the fine-tuning is equipped. However, in most real-world applications, PTMs are served as a black-box API or proprietary software without explicit parameter accessibility. Besides, it is hard to meet a large memory requirement for modern PTMs. In this work, we propose black-box visual prompting (BlackVIP), which efficiently adapts the PTMs without knowledge about model architectures and parameters. BlackVIP has two components; 1) Coordinator and 2) simultaneous perturbation stochastic approximation with gradient correction (SPSA-GC). The Coordinator designs input-dependent image-shaped visual prompts, which improves few-shot adaptation and robustness on distribution/location shift. SPSA-GC efficiently estimates the gradient of a target model to update Coordinator. Extensive experiments on 16 datasets demonstrate that BlackVIP enables robust adaptation to diverse domains without accessing PTMs' parameters, with minimal memory requirements. Code: \url{https://github.com/changdaeoh/BlackVIP} △ Less

Submitted 8 July, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

Comments: Accepted to CVPR 2023 (v2: citation error was fixed)

arXiv:2303.11409 [pdf, other]

Supercomputing tensor networks for U(1) symmetric quantum many-body systems

Authors: Minzhao Liu, Changhun Oh, Junyu Liu, Liang Jiang, Yuri Alexeev

Abstract: Simulation of many-body systems is extremely computationally intensive, and tensor network schemes have long been used to make these tasks more tractable via approximation. Recently, tensor network algorithms that can exploit the inherent symmetries of the underlying quantum systems have been proposed to further reduce computational complexity. One class of systems, namely those exhibiting a globa… ▽ More Simulation of many-body systems is extremely computationally intensive, and tensor network schemes have long been used to make these tasks more tractable via approximation. Recently, tensor network algorithms that can exploit the inherent symmetries of the underlying quantum systems have been proposed to further reduce computational complexity. One class of systems, namely those exhibiting a global U(1) symmetry, is especially interesting. We provide a state-of-the-art, graphical processing unit-accelerated, and highly parallel supercomputer implementation of the tensor network algorithm that takes advantage of U(1) symmetry, opening up the possibility of a wide range of quantum systems for future numerical investigations. △ Less

Submitted 20 March, 2023; originally announced March 2023.

Comments: 7 pages, 5 figures

arXiv:2303.01448 [pdf, other]

Co-Design of Topology, Scheduling, and Path Planning in Automated Warehouses

Authors: Christopher Leet, Chanwook Oh, Michele Lora, Sven Koenig, Pierluigi Nuzzo

Abstract: We address the warehouse servicing problem (WSP) in automated warehouses, which use teams of mobile agents to bring products from shelves to packing stations. Given a list of products, the WSP amounts to finding a plan for a team of agents which brings every product on the list to a station within a given timeframe. The WSP consists of four subproblems, concerning what tasks to perform (task formu… ▽ More We address the warehouse servicing problem (WSP) in automated warehouses, which use teams of mobile agents to bring products from shelves to packing stations. Given a list of products, the WSP amounts to finding a plan for a team of agents which brings every product on the list to a station within a given timeframe. The WSP consists of four subproblems, concerning what tasks to perform (task formulation), who will perform them (task allocation), and when (scheduling) and how (path planning) to perform them. These subproblems are NP-hard individually and become more challenging in combination. The difficulty of the WSP is compounded by the scale of automated warehouses, which frequently use teams of hundreds of agents. In this paper, we present a methodology that can solve the WSP at such scales. We introduce a novel, contract-based design framework which decomposes an automated warehouse into traffic system components. By assigning each of these components a contract describing the traffic flows it can support, we can synthesize a traffic flow satisfying a given WSP instance. Component-wise search-based path planning is then used to transform this traffic flow into a plan for discrete agents in a modular way. Evaluation shows that this methodology can solve WSP instances on real automated warehouses. △ Less

Submitted 2 March, 2023; originally announced March 2023.

Comments: To be published in Design, Automation and Test in Europe Conference (DATE) 2023

arXiv:2303.01014 [pdf, other]

doi 10.1051/0004-6361/202142988

Spectral analysis of a parsec-scale jet in M87: Observational constraint on the magnetic field strengths in the jet

Authors: Hyunwook Ro, Motoki Kino, Bong Won Sohn, Kazuhiro Hada, Jongho Park, Masanori Nakamura, Yuzhu Cui, Kunwoo Yi, Aeree Chung, Jeffrey Hodgson, Tomohisa Kawashima, Tao An, Sascha Trippe, Juan-Carlos Algaba, Jae-Young Kim, Satoko Sawada-Satoh, Kiyoaki Wajima, Zhiqiang Shen, Xiaopeng Cheng, Ilje Cho, Wu Jiang, Taehyun Jung, Jee-Won Lee, Kotaro Niinuma, Junghwan Oh , et al. (27 additional authors not shown)

Abstract: Because of its proximity and the large size of its black hole, M87 is one of the best targets for studying the launching mechanism of active galactic nucleus jets. Currently, magnetic fields are considered to be an essential factor in the launching and accelerating of the jet. However, current observational estimates of the magnetic field strength of the M87 jet are limited to the innermost part o… ▽ More Because of its proximity and the large size of its black hole, M87 is one of the best targets for studying the launching mechanism of active galactic nucleus jets. Currently, magnetic fields are considered to be an essential factor in the launching and accelerating of the jet. However, current observational estimates of the magnetic field strength of the M87 jet are limited to the innermost part of the jet or to HST-1. No attempt has yet been made to measure the magnetic field strength in between. We aim to infer the magnetic field strength of the M87 jet out to a distance of several thousand $r_s$ by tracking the distance-dependent changes in the synchrotron spectrum of the jet from high-resolution very long baseline interferometry observations. In order to obtain high-quality spectral index maps, quasi-simultaneous observations at 22 and 43 GHz were conducted using the KVN and VERA Array (KaVA) and the VLBA. We compared the spectral index distributions obtained from the observations with a model and placed limits on the magnetic field strengths as a function of distance. The overall spectral morphology is broadly consistent over the course of these observations. The observed synchrotron spectrum rapidly steepens from $α_{22-43 GHz}$ ~ -0.7 at ~ 2 mas to $α_{22-43 GHz}$ ~ -2.5 at ~ 6 mas. A spectral index model in which nonthermal electron injections inside the jet decrease with distance can adequately reproduce the observed trend. This suggests the magnetic field strength of the jet at a distance of 2 - 10 mas (~ 900 $r_s$ - ~ 4500 $r_s$ in the deprojected distance) has a range of $B=(0.3 - 1.0 G)(z/2 mas)^{-0.73}$. Extrapolating to the EHT scale yields consistent results, suggesting that the majority of the magnetic flux of the jet near the black hole is preserved out to ~ 4500 $r_s$ without significant dissipation. △ Less

Submitted 2 March, 2023; originally announced March 2023.

Comments: 16 pages, 9 figures. Accepted for publication in Astronomy & Astrophysics

Journal ref: A&A 673, A159 (2023)

arXiv:2302.00536 [pdf, ps, other]

doi 10.1103/PRXQuantum.5.020341

Quantum-inspired classical algorithm for graph problems by Gaussian boson sampling

Authors: Changhun Oh, Bill Fefferman, Liang Jiang, Nicolás Quesada

Abstract: We present a quantum-inspired classical algorithm that can be used for graph-theoretical problems, such as finding the densest $k$-subgraph and finding the maximum weight clique, which are proposed as applications of a Gaussian boson sampler. The main observation from Gaussian boson samplers is that a given graph's adjacency matrix to be encoded in a Gaussian boson sampler is nonnegative, which do… ▽ More We present a quantum-inspired classical algorithm that can be used for graph-theoretical problems, such as finding the densest $k$-subgraph and finding the maximum weight clique, which are proposed as applications of a Gaussian boson sampler. The main observation from Gaussian boson samplers is that a given graph's adjacency matrix to be encoded in a Gaussian boson sampler is nonnegative, which does not necessitate quantum interference. We first provide how to program a given graph problem into our efficient classical algorithm. We then numerically compare the performance of ideal and lossy Gaussian boson samplers, our quantum-inspired classical sampler, and the uniform sampler for finding the densest $k$-subgraph and finding the maximum weight clique and show that the advantage from Gaussian boson samplers is not significant in general. We finally discuss the potential advantage of a Gaussian boson sampler over the proposed sampler. △ Less

Submitted 23 April, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

Comments: 11 pages, 5 figures

Journal ref: PRX Quantum 5, 020341 (2024)

arXiv:2301.12814 [pdf, other]

doi 10.1103/PhysRevA.108.052604

Simulating lossy Gaussian boson sampling with matrix product operators

Authors: Minzhao Liu, Changhun Oh, Junyu Liu, Liang Jiang, Yuri Alexeev

Abstract: Gaussian boson sampling, a computational model that is widely believed to admit quantum supremacy, has already been experimentally demonstrated and is claimed to surpass the classical simulation capabilities of even the most powerful supercomputers today. However, whether the current approach limited by photon loss and noise in such experiments prescribes a scalable path to quantum advantage is an… ▽ More Gaussian boson sampling, a computational model that is widely believed to admit quantum supremacy, has already been experimentally demonstrated and is claimed to surpass the classical simulation capabilities of even the most powerful supercomputers today. However, whether the current approach limited by photon loss and noise in such experiments prescribes a scalable path to quantum advantage is an open question. To understand the effect of photon loss on the scalability of Gaussian boson sampling, we analytically derive the asymptotic operator entanglement entropy scaling, which relates to the simulation complexity. As a result, we observe that efficient tensor network simulations are likely possible under the $N_\text{out}\propto\sqrt{N}$ scaling of the number of surviving photons orange$N_\text{out}$ in the number of input photons $N$. We numerically verify this result using a tensor network algorithm with $U(1)$ symmetry, and overcome previous challenges due to the large local Hilbert space dimensions in Gaussian boson sampling with hardware acceleration. Additionally, we observe that increasing the photon number through larger squeezing does not increase the entanglement entropy significantly. Finally, we numerically find the bond dimension necessary for fixed accuracy simulations, providing more direct evidence for the complexity of tensor networks. △ Less

Submitted 13 October, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

Comments: 16 pages, 11 figures. To appear in PRA. This article supersedes arXiv:2303.11409

Journal ref: Phys. Rev. A 108, 052604, (2023)

arXiv:2301.11532 [pdf, ps, other]

On classical simulation algorithms for noisy Boson Sampling

Authors: Changhun Oh, Liang Jiang, Bill Fefferman

Abstract: We present a classical algorithm that approximately samples from the output distribution of certain noisy Boson Sampling experiments. This algorithm is inspired by a recent result of Aharonov, Gao, Landau, Liu and Vazirani and makes use of an observation originally due to Kalai and Kindler that the output probability of Boson Sampling experiments with a Gaussian noise model can be approximated by… ▽ More We present a classical algorithm that approximately samples from the output distribution of certain noisy Boson Sampling experiments. This algorithm is inspired by a recent result of Aharonov, Gao, Landau, Liu and Vazirani and makes use of an observation originally due to Kalai and Kindler that the output probability of Boson Sampling experiments with a Gaussian noise model can be approximated by sparse low-degree polynomials. This observation alone does not suffice for classical sampling, because its marginal probabilities might not be approximated by sparse low-degree polynomials, and furthermore, the approximated probabilities might be negative. We solve this problem by employing the first quantization representation to give an algorithm for computing the marginal probabilities of these experiments. We prove that when the overall noise rate is constant, the algorithm runs in time quasi-polynomial in the number of input photons $N$ and accuracy. When the overall noise rate scales as $1-x_1^γ$ for constant $x_1$ and $γ=Ω(\log N)$, the running time becomes polynomial. Furthermore, we study noisy Boson Sampling with practically relevant noise models such as partial distinguishability and photon loss. We show that the same technique does not immediately apply in these settings, leaving open the possibility of a scalable demonstration of noisy quantum advantage for these noise models in certain parameter regimes. △ Less

Submitted 26 January, 2023; originally announced January 2023.

Comments: 29 pages

arXiv:2212.07040 [pdf, other]

doi 10.3390/galaxies10060113

Overview of the Observing System and Initial Scientific Accomplishments of the East Asian VLBI Network (EAVN)

Authors: Kazunori Akiyama, Juan-Carlos Algaba, Tao An, Keiichi Asada, Kitiyanee Asanok, Do-Young Byun, Thanapol Chanapote, Wen Chen, Zhong Chen, Xiaopeng Cheng, James O. Chibueze, Ilje Cho, Se-Hyung Cho, Hyun-Soo Chung, Lang Cui, Yuzhu Cui, Akihiro Doi, Jian Dong, Kenta Fujisawa, Wei Gou, Wen Guo, Kazuhiro Hada, Yoshiaki Hagiwara, Tomoya Hirota, Jeffrey A. Hodgson , et al. (79 additional authors not shown)

Abstract: The East Asian VLBI Network (EAVN) is an international VLBI facility in East Asia and is operated under mutual collaboration between East Asian countries, as well as part of Southeast Asian and European countries. EAVN currently consists of 16 radio telescopes and three correlators located in China, Japan, and Korea, and is operated mainly at three frequency bands, 6.7, 22, and 43 GHz with the lon… ▽ More The East Asian VLBI Network (EAVN) is an international VLBI facility in East Asia and is operated under mutual collaboration between East Asian countries, as well as part of Southeast Asian and European countries. EAVN currently consists of 16 radio telescopes and three correlators located in China, Japan, and Korea, and is operated mainly at three frequency bands, 6.7, 22, and 43 GHz with the longest baseline length of 5078 km, resulting in the highest angular resolution of 0.28 milliarcseconds at 43 GHz. One of distinct capabilities of EAVN is multi-frequency simultaneous data reception at nine telescopes, which enable us to employ the frequency phase transfer technique to obtain better sensitivity at higher observing frequencies. EAVN started its open-use program in the second half of 2018, providing a total observing time of more than 1100 hours in a year. EAVN fills geographical gap in global VLBI array, resulting in enabling us to conduct contiguous high-resolution VLBI observations. EAVN has produced various scientific accomplishments especially in observations toward active galactic nuclei, evolved stars, and star-forming regions. These activities motivate us to initiate launch of the 'Global VLBI Alliance' to provide an opportunity of VLBI observation with the longest baselines on the earth. △ Less

Submitted 14 December, 2022; originally announced December 2022.

Comments: 27 pages, appeared in Galaxies special issue 'Challenges in Understanding Black Hole Powered Jets with VLBI' as an invited review

Journal ref: Galaxies 2022, 10(6), 113

arXiv:2211.12534 [pdf, ps, other]

doi 10.1093/pasj/psac102

EAVN Astrometry toward the Extreme Outer Galaxy: Kinematic distance with the proper motion of G034.84-00.95

Authors: Nobuyuki Sakai, Bo Zhang, Shuang**g Xu, Daisuke Sakai, Yoshiaki Tamura, Takaaki Jike, Taehyun Jung, Chungsik Oh, Jeong-Sook Kim, Noriyuki Kawaguchi, Hiroshi Imai, Wu Jiang, Lang Cui, Soon-Wook Kim, Pengfei Jiang, Tomoharu Kurayama, Jeong Ae Lee, Kazuya Hachisuka, Dong-Kyu Jung, Bo Xia, Guanghui Li, Mareki Honma, Kee-Tae Kim, Zhi-Qiang Shen, Na Wang

Abstract: We aim to reveal the structure and kinematics of the Outer-Scutum-Centaurus (OSC) arm located on the far side of the Milky Way through very long baseline interferometry (VLBI) astrometry using KaVA, which is composed of KVN (Korean VLBI Network) and VERA (VLBI Exploration of Radio Astrometry). We report the proper motion of a 22 GHz H$_{2}$O maser source, which is associated with the star-forming… ▽ More We aim to reveal the structure and kinematics of the Outer-Scutum-Centaurus (OSC) arm located on the far side of the Milky Way through very long baseline interferometry (VLBI) astrometry using KaVA, which is composed of KVN (Korean VLBI Network) and VERA (VLBI Exploration of Radio Astrometry). We report the proper motion of a 22 GHz H$_{2}$O maser source, which is associated with the star-forming region G034.84$-$00.95, to be ($μ_α \rm{cos}δ$, $μ_δ$) = ($-$1.61$\pm$0.18, $-$4.29$\pm$0.16) mas yr$^{-1}$ in equatorial coordinates (J2000). We estimate the 2D kinematic distance to the source to be 18.6$\pm$1.0 kpc, which is derived from the variance-weighted average of kinematic distances with LSR velocity and the Galactic-longitude component of the measured proper motion. Our result places the source in the OSC arm and implies that G034.84$-$00.95 is moving away from the Galactic plane with a vertical velocity of $-$38$\pm$16 km s$^{-1}$. Since the H I supershell GS033+06$-$49 is located at a kinematic distance roughly equal to that of G034.84$-$00.95, it is expected that gas circulation occurs between the outer Galactic disk around G034.84$-$00.95 with a Galactocentric distance of 12.8$^{+1.0}_{-0.9}$ kpc and halo. We evaluate possible origins of the fast vertical motion of G034.84$-$00.95, which are (1) supernova explosions and (2) cloud collisions with the Galactic disk. However, neither of the possibilities are matched with the results of VLBI astrometry as well as spatial distributions of H II regions and H I gas. △ Less

Submitted 22 November, 2022; originally announced November 2022.

Comments: Accepted for publication in PASJ. 14 figures; 8 tables

arXiv:2211.07184 [pdf, other]

doi 10.1038/s41534-023-00791-9

Approximating outcome probabilities of linear optical circuits

Authors: Youngrong Lim, Changhun Oh

Abstract: Quasiprobability representation is an important tool for analyzing a quantum system, such as a quantum state or a quantum circuit. In this work, we propose classical algorithms specialized for approximating outcome probabilities of a linear optical circuit using $s$-parameterized quasiprobability distributions. Notably, we can reduce the negativity bound of a circuit from exponential to at most… ▽ More Quasiprobability representation is an important tool for analyzing a quantum system, such as a quantum state or a quantum circuit. In this work, we propose classical algorithms specialized for approximating outcome probabilities of a linear optical circuit using $s$-parameterized quasiprobability distributions. Notably, we can reduce the negativity bound of a circuit from exponential to at most polynomial for specific cases by modulating the shapes of quasiprobability distributions thanks to the norm-preserving property of a linear optical transformation. Consequently, our scheme renders an efficient estimation of outcome probabilities with precision depending on the classicality of the circuit. Surprisingly, when the classicality is high enough, we reach a polynomial-time estimation algorithm within a multiplicative error. Our results provide quantum-inspired algorithms for approximating various matrix functions beating best-known results. Moreover, we give sufficient conditions for the classical simulability of Gaussian boson sampling using the approximating algorithm for any (marginal) outcome probability under the poly-sparse condition. Our study sheds light on the power of linear optics, providing plenty of quantum-inspired algorithms for problems in computational complexity. △ Less

Submitted 17 December, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

Comments: 27 pages, 2 figures, 2 tables

Journal ref: npj Quantum Information 9, 124 (2023)

arXiv:2210.15021 [pdf, ps, other]

doi 10.1103/PhysRevLett.131.010401

Spoofing cross entropy measure in boson sampling

Authors: Changhun Oh, Liang Jiang, Bill Fefferman

Abstract: Cross entropy (XE) measure is a widely used benchmarking to demonstrate quantum computational advantage from sampling problems, such as random circuit sampling using superconducting qubits and boson sampling (BS). We present a heuristic classical algorithm that attains a better XE than the current BS experiments in a verifiable regime and is likely to attain a better XE score than the near-future… ▽ More Cross entropy (XE) measure is a widely used benchmarking to demonstrate quantum computational advantage from sampling problems, such as random circuit sampling using superconducting qubits and boson sampling (BS). We present a heuristic classical algorithm that attains a better XE than the current BS experiments in a verifiable regime and is likely to attain a better XE score than the near-future BS experiments in a reasonable running time. The key idea behind the algorithm is that there exist distributions that correlate with the ideal BS probability distribution and that can be efficiently computed. The correlation and the computability of the distribution enable us to post-select heavy outcomes of the ideal probability distribution without computing the ideal probability, which essentially leads to a large XE. Our method scores a better XE than the recent Gaussian BS experiments when implemented at intermediate, verifiable system sizes. Much like current state-of-the-art experiments, we cannot verify that our spoofer works for quantum advantage size systems. However, we demonstrate that our approach works for much larger system sizes in fermion sampling, where we can efficiently compute output probabilities. Finally, we provide analytic evidence that the classical algorithm is likely to spoof noisy BS efficiently. △ Less

Submitted 10 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

Comments: 7+11 pages, 5+6 figures

Journal ref: Phys. Rev. Lett. 131, 010401 (2023)

arXiv:2210.14191 [pdf]

A Database of Ultrastable MOFs Reassembled from Stable Fragments with Machine Learning Models

Authors: Aditya Nandy, Shuwen Yue, Changhwan Oh, Chenru Duan, Gianmarco G. Terrones, Yongchul G. Chung, Heather J. Kulik

Abstract: High-throughput screening of large hypothetical databases of metal-organic frameworks (MOFs) can uncover new materials, but their stability in real-world applications is often unknown. We leverage community knowledge and machine learning (ML) models to identify MOFs that are thermally stable and stable upon activation. We separate these MOFs into their building blocks and recombine them to make a… ▽ More High-throughput screening of large hypothetical databases of metal-organic frameworks (MOFs) can uncover new materials, but their stability in real-world applications is often unknown. We leverage community knowledge and machine learning (ML) models to identify MOFs that are thermally stable and stable upon activation. We separate these MOFs into their building blocks and recombine them to make a new hypothetical MOF database of over 50,000 structures that samples orders of magnitude more connectivity nets and inorganic building blocks than prior databases. This database shows an order of magnitude enrichment of ultrastable MOF structures that are stable upon activation and more than one standard deviation more thermally stable than the average experimentally characterized MOF. For the nearly 10,000 ultrastable MOFs, we compute bulk elastic moduli to confirm these materials have good mechanical stability, and we report methane deliverable capacities. Our work identifies privileged metal nodes in ultrastable MOFs that optimize gas storage and mechanical stability simultaneously. △ Less

Submitted 25 October, 2022; originally announced October 2022.

arXiv:2209.06418 [pdf, other]

Graph Perceiver IO: A General Architecture for Graph Structured Data

Authors: Seyun Bae, Hoyoon Byun, Changdae Oh, Yoon-Sik Cho, Kyungwoo Song

Abstract: Multimodal machine learning has been widely studied for the development of general intelligence. Recently, the remarkable multimodal algorithms, the Perceiver and Perceiver IO, show competitive results for diverse dataset domains and tasks. However, recent works, Perceiver and Perceiver IO, have focused on heterogeneous modalities, including image, text, and speech, and there are few research work… ▽ More Multimodal machine learning has been widely studied for the development of general intelligence. Recently, the remarkable multimodal algorithms, the Perceiver and Perceiver IO, show competitive results for diverse dataset domains and tasks. However, recent works, Perceiver and Perceiver IO, have focused on heterogeneous modalities, including image, text, and speech, and there are few research works for graph structured datasets. A graph is one of the most generalized dataset structures, and we can represent the other dataset, including images, text, and speech, as graph structured data. A graph has an adjacency matrix different from other dataset domains such as text and image, and it is not trivial to handle the topological information, relational information, and canonical positional information. In this study, we provide a Graph Perceiver IO, the Perceiver IO for the graph structured dataset. We keep the main structure of the Graph Perceiver IO as the Perceiver IO because the Perceiver IO already handles the diverse dataset well, except for the graph structured dataset. The Graph Perceiver IO is a general method, and it can handle diverse datasets such as graph structured data as well as text and images. Comparing the graph neural networks, the Graph Perceiver IO requires a lower complexity, and it can incorporate the local and global information efficiently. We show that Graph Perceiver IO shows competitive results for diverse graph-related tasks, including node classification, graph classification, and link prediction. △ Less

Submitted 14 September, 2022; originally announced September 2022.

arXiv:2209.05784 [pdf]

doi 10.1364/OE.475610

Non-resonant lasing in a deep-hole scattering cavity

Authors: ChulMin Oh, Ho ** Ma, KyeoReh Lee, Do Kyung Kim, YongKeun Park

Abstract: Random lasers are promising in the spectral regime, wherein conventional lasers are unavailable, with advantages of low fabrication costs and applicability of diverse gain materials. However, their practical application is hindered by high threshold powers, low power efficiency, and difficulties in light collection. Here, we demonstrate a power-efficient easy-to-fabricate non-resonant laser using… ▽ More Random lasers are promising in the spectral regime, wherein conventional lasers are unavailable, with advantages of low fabrication costs and applicability of diverse gain materials. However, their practical application is hindered by high threshold powers, low power efficiency, and difficulties in light collection. Here, we demonstrate a power-efficient easy-to-fabricate non-resonant laser using a deep hole on a porous gain material. The laser action in this counterintuitive cavity was enabled by nonresonant feedback from strong diffuse reflections on the inner surface. Additionally, significant enhancements in slope efficiency, threshold power, and directionality were obtained from cavities fabricated on a porous Nd:YAG ceramic. △ Less

Submitted 13 September, 2022; originally announced September 2022.

Journal ref: Opt. Express 30, 47816-47825 (2022)

arXiv:2208.09835 [pdf, other]

A toy model for DLA arm growth in a wedge

Authors: Oren Louidor, Chanwoo Oh, Eviatar B. Procaccia

Abstract: In this paper, we consider a non-homogeneous discrete-time Markov chain which can be seen as a toy model for the growth of the arms of the DLA (Diffusion limited aggregation) process in a sub-linear wedge. It is conjectured that in a thin enough linear wedge there is only one infinite arm in the DLA cluster and we demonstrate this phenomenon in our model. The technique follows a bootstrap** argu… ▽ More In this paper, we consider a non-homogeneous discrete-time Markov chain which can be seen as a toy model for the growth of the arms of the DLA (Diffusion limited aggregation) process in a sub-linear wedge. It is conjectured that in a thin enough linear wedge there is only one infinite arm in the DLA cluster and we demonstrate this phenomenon in our model. The technique follows a bootstrap** argument, in which we iteratively prove ever faster growth rate. △ Less

Submitted 21 August, 2022; originally announced August 2022.

Comments: 8 pages, 1 figure

arXiv:2207.08398 [pdf, other]

Bayesian Optimization for Macro Placement

Authors: Changyong Oh, Roberto Bondesan, Dana Kianfar, Rehan Ahmed, Rishubh Khurana, Payal Agarwal, Romain Lepert, Mysore Sriram, Max Welling

Abstract: Macro placement is the problem of placing memory blocks on a chip canvas. It can be formulated as a combinatorial optimization problem over sequence pairs, a representation which describes the relative positions of macros. Solving this problem is particularly challenging since the objective function is expensive to evaluate. In this paper, we develop a novel approach to macro placement using Bayes… ▽ More Macro placement is the problem of placing memory blocks on a chip canvas. It can be formulated as a combinatorial optimization problem over sequence pairs, a representation which describes the relative positions of macros. Solving this problem is particularly challenging since the objective function is expensive to evaluate. In this paper, we develop a novel approach to macro placement using Bayesian optimization (BO) over sequence pairs. BO is a machine learning technique that uses a probabilistic surrogate model and an acquisition function that balances exploration and exploitation to efficiently optimize a black-box objective function. BO is more sample-efficient than reinforcement learning and therefore can be used with more realistic objectives. Additionally, the ability to learn from data and adapt the algorithm to the objective function makes BO an appealing alternative to other black-box optimization methods such as simulated annealing, which relies on problem-dependent heuristics and parameter-tuning. We benchmark our algorithm on the fixed-outline macro placement problem with the half-perimeter wire length objective and demonstrate competitive performance. △ Less

Submitted 18 July, 2022; originally announced July 2022.

Comments: ICML2022 Workshop on Adaptive Experimental Design and Active Learning in the Real World

arXiv:2206.08743 [pdf, other]

Learning Fair Representation via Distributional Contrastive Disentanglement

Authors: Changdae Oh, Heeji Won, Junhyuk So, Taero Kim, Yewon Kim, Hosik Choi, Kyungwoo Song

Abstract: Learning fair representation is crucial for achieving fairness or debiasing sensitive information. Most existing works rely on adversarial representation learning to inject some invariance into representation. However, adversarial learning methods are known to suffer from relatively unstable training, and this might harm the balance between fairness and predictiveness of representation. We propose… ▽ More Learning fair representation is crucial for achieving fairness or debiasing sensitive information. Most existing works rely on adversarial representation learning to inject some invariance into representation. However, adversarial learning methods are known to suffer from relatively unstable training, and this might harm the balance between fairness and predictiveness of representation. We propose a new approach, learning FAir Representation via distributional CONtrastive Variational AutoEncoder (FarconVAE), which induces the latent space to be disentangled into sensitive and nonsensitive parts. We first construct the pair of observations with different sensitive attributes but with the same labels. Then, FarconVAE enforces each non-sensitive latent to be closer, while sensitive latents to be far from each other and also far from the non-sensitive latent by contrasting their distributions. We provide a new type of contrastive loss motivated by Gaussian and Student-t kernels for distributional contrastive learning with theoretical analysis. Besides, we adopt a new swap-reconstruction loss to boost the disentanglement further. FarconVAE shows superior performance on fairness, pretrained model debiasing, and domain generalization tasks from various modalities, including tabular, image, and text. △ Less

Submitted 17 June, 2022; originally announced June 2022.

Comments: Accepted by KDD 2022 (Research Track)

arXiv:2205.09711 [pdf, other]

doi 10.22331/q-2023-03-06-939

Information transmission with continuous variable quantum erasure channels

Authors: Changchun Zhong, Changhun Oh, Liang Jiang

Abstract: Quantum capacity, as the key figure of merit for a given quantum channel, upper bounds the channel's ability in transmitting quantum information. Identifying different type of channels, evaluating the corresponding quantum capacity and finding the capacity-approaching coding scheme are the major tasks in quantum communication theory. Quantum channel in discrete variables has been discussed enormou… ▽ More Quantum capacity, as the key figure of merit for a given quantum channel, upper bounds the channel's ability in transmitting quantum information. Identifying different type of channels, evaluating the corresponding quantum capacity and finding the capacity-approaching coding scheme are the major tasks in quantum communication theory. Quantum channel in discrete variables has been discussed enormously involving various error models, while error model in the continuous variable channel has been less studied due to the infinite dimensional problem. In this paper, we investigate a general continuous variable quantum erasure channel. By defining an effective subspace of the continuous variable system, we find a continuous variable random coding model. We then derive the quantum capacity of the continuous variable erasure channel in the framework of decoupling theory. The discussion in this paper fills the gap of quantum erasure channel in continuous variable settings and sheds light on the understanding of other type of continuous variable quantum channels. △ Less

Submitted 2 March, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

Journal ref: Quantum 7, 939 (2023)

Showing 1–50 of 341 results for author: Oh, C