-
VBIM-Net: Variational Born Iterative Network for Inverse Scattering Problems
Authors:
Ziqing Xing,
Zhaoyang Zhang,
Zirui Chen,
Yusong Wang,
Haoran Ma,
Zhun Wei,
Gang Bao
Abstract:
Recently, studies have shown the potential of integrating field-type iterative methods with deep learning (DL) techniques in solving inverse scattering problems (ISPs). In this article, we propose a novel Variational Born Iterative Network, namely, VBIM-Net, to solve the full-wave ISPs with significantly improved flexibility and inversion quality. The proposed VBIM-Net emulates the alternating upd…
▽ More
Recently, studies have shown the potential of integrating field-type iterative methods with deep learning (DL) techniques in solving inverse scattering problems (ISPs). In this article, we propose a novel Variational Born Iterative Network, namely, VBIM-Net, to solve the full-wave ISPs with significantly improved flexibility and inversion quality. The proposed VBIM-Net emulates the alternating updates of the total electric field and the contrast in the variational Born iterative method (VBIM) by multiple layers of subnetworks. We embed the calculation of the contrast variation into each of the subnetworks, converting the scattered field residual into an approximate contrast variation and then enhancing it by a U-Net, thus avoiding the requirement of matched measurement dimension and grid resolution as in existing approaches. The total field and contrast of each layer's output is supervised in the loss function of VBIM-Net, which guarantees the physical interpretability of variables of the subnetworks. In addition, we design a training scheme with extra noise to enhance the model's stability. Extensive numerical results on synthetic and experimental data both verify the inversion quality, generalization ability, and robustness of the proposed VBIM-Net. This work may provide some new inspiration for the design of efficient field-type DL schemes.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
DoRA: Enhancing Parameter-Efficient Fine-Tuning with Dynamic Rank Distribution
Authors:
Yulong Mao,
Kaiyu Huang,
Changhao Guan,
Ganglin Bao,
Fengran Mo,
**an Xu
Abstract:
Fine-tuning large-scale pre-trained models is inherently a resource-intensive task. While it can enhance the capabilities of the model, it also incurs substantial computational costs, posing challenges to the practical application of downstream tasks. Existing parameter-efficient fine-tuning (PEFT) methods such as Low-Rank Adaptation (LoRA) rely on a bypass framework that ignores the differential…
▽ More
Fine-tuning large-scale pre-trained models is inherently a resource-intensive task. While it can enhance the capabilities of the model, it also incurs substantial computational costs, posing challenges to the practical application of downstream tasks. Existing parameter-efficient fine-tuning (PEFT) methods such as Low-Rank Adaptation (LoRA) rely on a bypass framework that ignores the differential parameter budget requirements across weight matrices, which may lead to suboptimal fine-tuning outcomes. To address this issue, we introduce the Dynamic Low-Rank Adaptation (DoRA) method. DoRA decomposes high-rank LoRA layers into structured single-rank components, allowing for dynamic pruning of parameter budget based on their importance to specific tasks during training, which makes the most of the limited parameter budget. Experimental results demonstrate that DoRA can achieve competitive performance compared with LoRA and full model fine-tuning, and outperform various strong baselines with the same storage parameter budget. Our code is available at https://github.com/MIkumikumi0116/DoRA
△ Less
Submitted 26 June, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
Quantum Metrology with Higher-order Exceptional Points in Atom-cavity Magnonics
Authors:
Minwei Shi,
Guzhi Bao,
**xian Guo,
Wei** Zhang
Abstract:
Exceptional points (EPs), early arising from non-Hermitian physics, significantly amplify the system's response to minor perturbations, and act as a useful concept to enhance measurement in metrology. In particular, such a metrological enhancement grows dramatically with the EP's order. However, the Langevin noises intrinsically existing in the non-Hermitian systems diminish this enhancement. In t…
▽ More
Exceptional points (EPs), early arising from non-Hermitian physics, significantly amplify the system's response to minor perturbations, and act as a useful concept to enhance measurement in metrology. In particular, such a metrological enhancement grows dramatically with the EP's order. However, the Langevin noises intrinsically existing in the non-Hermitian systems diminish this enhancement. In this study, we propose a protocol for quantum metrology with the construction of higher-order EPs (HOEPs) in atom-cavity system through Hermitian magnon-photon interaction. The construction of HOEPs utilizes the atom-cavity non-Hermitian-like dynamical behavior but avoids the external Langevin noises via the Hermitian interaction. A general analysis is exhibited for the construction of arbitrary $n$-th order EP (EPn). As a demonstration of the superiority of these HOEPs in quantum metrology, we work out an EP3/4-based atomic sensor with sensitivity being orders of magnitude higher than that achievable in an EP2-based one. We further unveil the mechanism behind the sensitivity enhancement from HOEPs. The experimental establishment for this proposal is suggested with potential candidates. This EP-based atomic sensor, taking advantage of the atom-light interface, offers new insight into quantum metrology with HOEPs.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Wills Aligner: A Robust Multi-Subject Brain Representation Learner
Authors:
Guangyin Bao,
Zixuan Gong,
Qi Zhang,
Jialei Zhou,
Wei Fan,
Kun Yi,
Usman Naseem,
Liang Hu,
Duoqian Miao
Abstract:
Decoding visual information from human brain activity has seen remarkable advancements in recent research. However, due to the significant variability in cortical parcellation and cognition patterns across subjects, current approaches personalized deep models for each subject, constraining the practicality of this technology in real-world contexts. To tackle the challenges, we introduce Wills Alig…
▽ More
Decoding visual information from human brain activity has seen remarkable advancements in recent research. However, due to the significant variability in cortical parcellation and cognition patterns across subjects, current approaches personalized deep models for each subject, constraining the practicality of this technology in real-world contexts. To tackle the challenges, we introduce Wills Aligner, a robust multi-subject brain representation learner. Our Wills Aligner initially aligns different subjects' brains at the anatomical level. Subsequently, it incorporates a mixture of brain experts to learn individual cognition patterns. Additionally, it decouples the multi-subject learning task into a two-stage training, propelling the deep model and its plugin network to learn inter-subject commonality knowledge and various cognition patterns, respectively. Wills Aligner enables us to overcome anatomical differences and to efficiently leverage a single model for multi-subject brain representation learning. We meticulously evaluate the performance of our approach across coarse-grained and fine-grained visual decoding tasks. The experimental results demonstrate that our Wills Aligner achieves state-of-the-art performance.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction
Authors:
Zixuan Gong,
Qi Zhang,
Guangyin Bao,
Lei Zhu,
Ke Liu,
Liang Hu,
Duoqian Miao
Abstract:
Decoding natural visual scenes from brain activity has flourished, with extensive research in single-subject tasks and, however, less in cross-subject tasks. Reconstructing high-quality images in cross-subject tasks is a challenging problem due to profound individual differences between subjects and the scarcity of data annotation. In this work, we proposed MindTuner for cross-subject visual decod…
▽ More
Decoding natural visual scenes from brain activity has flourished, with extensive research in single-subject tasks and, however, less in cross-subject tasks. Reconstructing high-quality images in cross-subject tasks is a challenging problem due to profound individual differences between subjects and the scarcity of data annotation. In this work, we proposed MindTuner for cross-subject visual decoding, which achieves high-quality and rich-semantic reconstructions using only 1 hour of fMRI training data benefiting from the phenomena of visual fingerprint in the human visual system and a novel fMRI-to-text alignment paradigm. Firstly, we pre-train a multi-subject model among 7 subjects and fine-tune it with scarce data on new subjects, where LoRAs with Skip-LoRAs are utilized to learn the visual fingerprint. Then, we take the image modality as the intermediate pivot modality to achieve fMRI-to-text alignment, which achieves impressive fMRI-to-text retrieval performance and corrects fMRI-to-image reconstruction with fine-tuned semantics. The results of both qualitative and quantitative analyses demonstrate that MindTuner surpasses state-of-the-art cross-subject visual decoding models on the Natural Scenes Dataset (NSD), whether using training data of 1 hour or 40 hours.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
On the comparison between phenomenological and kinetic theories of gas mixtures with applications to flocking
Authors:
Gi-Chan Bae,
Seung-Yeal Ha,
Gyuyoung Hwang,
Tommaso Ruggeri
Abstract:
We study the compression between the phenomenological and kinetic models for a mixture of gases from the viewpoint of collective dynamics. In the case in which constituents are Eulerian gases, balance equations for mass, momentum, and energy are the same in the main differential part, but production terms due to the interchanges between constituents are different. They coincide only when the therm…
▽ More
We study the compression between the phenomenological and kinetic models for a mixture of gases from the viewpoint of collective dynamics. In the case in which constituents are Eulerian gases, balance equations for mass, momentum, and energy are the same in the main differential part, but production terms due to the interchanges between constituents are different. They coincide only when the thermal and mechanical diffusion are sufficiently small. In this paper, we first verify that both models satisfy the universal requirements of conservation laws of total mass, momentum, and energy, Galilean invariance and entropy principle. Following the work of Ha and Ruggeri (ARMA 2017), we consider spatially homogeneous models which correspond to the generalizations of the Cucker Smale model with the thermal effect. In these circumstances, we provide analytical results for the comparison between two resulting models and also present several numerical simulations to complement analytical results.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
U-ARE-ME: Uncertainty-Aware Rotation Estimation in Manhattan Environments
Authors:
Aalok Patwardhan,
Callum Rhodes,
Gwangbin Bae,
Andrew J. Davison
Abstract:
Camera rotation estimation from a single image is a challenging task, often requiring depth data and/or camera intrinsics, which are generally not available for in-the-wild videos. Although external sensors such as inertial measurement units (IMUs) can help, they often suffer from drift and are not applicable in non-inertial reference frames. We present U-ARE-ME, an algorithm that estimates camera…
▽ More
Camera rotation estimation from a single image is a challenging task, often requiring depth data and/or camera intrinsics, which are generally not available for in-the-wild videos. Although external sensors such as inertial measurement units (IMUs) can help, they often suffer from drift and are not applicable in non-inertial reference frames. We present U-ARE-ME, an algorithm that estimates camera rotation along with uncertainty from uncalibrated RGB images. Using a Manhattan World assumption, our method leverages the per-pixel geometric priors encoded in single-image surface normal predictions and performs optimisation over the SO(3) manifold. Given a sequence of images, we can use the per-frame rotation estimates and their uncertainty to perform multi-frame optimisation, achieving robustness and temporal consistency. Our experiments demonstrate that U-ARE-ME performs comparably to RGB-D methods and is more robust than sparse feature-based SLAM methods. We encourage the reader to view the accompanying video at https://callum-rhodes.github.io/U-ARE-ME for a visual overview of our method.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens
Authors:
Cunxiang Wang,
Ruoxi Ning,
Boqi Pan,
Tonghui Wu,
Qipeng Guo,
Cheng Deng,
Guangsheng Bao,
Xiangkun Hu,
Zheng Zhang,
Qian Wang,
Yue Zhang
Abstract:
The rapid advancement of Large Language Models (LLMs) has introduced a new frontier in natural language processing, particularly in understanding and processing long-context information. However, the evaluation of these models' long-context abilities remains a challenge due to the limitations of current benchmarks. To address this gap, we introduce NovelQA, a benchmark specifically designed to tes…
▽ More
The rapid advancement of Large Language Models (LLMs) has introduced a new frontier in natural language processing, particularly in understanding and processing long-context information. However, the evaluation of these models' long-context abilities remains a challenge due to the limitations of current benchmarks. To address this gap, we introduce NovelQA, a benchmark specifically designed to test the capabilities of LLMs with extended texts. Constructed from English novels, NovelQA offers a unique blend of complexity, length, and narrative coherence, making it an ideal tool for assessing deep textual understanding in LLMs. This paper presents the design and construction of NovelQA, highlighting its manual annotation, and diverse question types. Our evaluation of Long-context LLMs on NovelQA reveals significant insights into the models' performance, particularly emphasizing the challenges they face with multi-hop reasoning, detail-oriented questions, and extremely long input with an average length more than 200,000 tokens. The results underscore the necessity for further advancements in LLMs to improve their long-context comprehension.
△ Less
Submitted 17 June, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Rethinking Inductive Biases for Surface Normal Estimation
Authors:
Gwangbin Bae,
Andrew J. Davison
Abstract:
Despite the growing demand for accurate surface normal estimation models, existing methods use general-purpose dense prediction models, adopting the same inductive biases as other tasks. In this paper, we discuss the inductive biases needed for surface normal estimation and propose to (1) utilize the per-pixel ray direction and (2) encode the relationship between neighboring surface normals by lea…
▽ More
Despite the growing demand for accurate surface normal estimation models, existing methods use general-purpose dense prediction models, adopting the same inductive biases as other tasks. In this paper, we discuss the inductive biases needed for surface normal estimation and propose to (1) utilize the per-pixel ray direction and (2) encode the relationship between neighboring surface normals by learning their relative rotation. The proposed method can generate crisp - yet, piecewise smooth - predictions for challenging in-the-wild images of arbitrary resolution and aspect ratio. Compared to a recent ViT-based state-of-the-art model, our method shows a stronger generalization ability, despite being trained on an orders of magnitude smaller dataset. The code is available at https://github.com/baegwangbin/DSINE.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
LLMs with Chain-of-Thought Are Non-Causal Reasoners
Authors:
Guangsheng Bao,
Hongbo Zhang,
Linyi Yang,
Cunxiang Wang,
Yue Zhang
Abstract:
This paper explores the role of the Chain of Thought (CoT) in Large Language Models (LLMs) reasoning. Despite its potential to improve task performance, our analysis reveals a surprising frequency of correct answers following incorrect CoTs and vice versa. We employ causal analysis to assess the cause-effect relationship between CoTs/instructions and answers in LLMs, uncovering the Structural Caus…
▽ More
This paper explores the role of the Chain of Thought (CoT) in Large Language Models (LLMs) reasoning. Despite its potential to improve task performance, our analysis reveals a surprising frequency of correct answers following incorrect CoTs and vice versa. We employ causal analysis to assess the cause-effect relationship between CoTs/instructions and answers in LLMs, uncovering the Structural Causal Model (SCM) that LLMs approximate. By comparing the implied SCM with that of human reasoning, we highlight discrepancies between LLM and human reasoning processes. We further examine the factors influencing the causal structure of the implied SCM, revealing that in-context learning, supervised fine-tuning, and reinforcement learning on human feedback significantly impact the causal relations. We release the code and results at https://github.com/StevenZHB/CoT_Causal_Analysis.
△ Less
Submitted 25 February, 2024;
originally announced February 2024.
-
PML-based boundary integral equation method for electromagnetic scattering problems in a layered-medium
Authors:
Gang Bao,
Wangtao Lu,
Tao Yin,
Lu Zhang
Abstract:
This paper proposes a new boundary integral equation (BIE) methodology based on the perfectly matched layer (PML) truncation technique for solving the electromagnetic scattering problems in a multi-layered medium. Instead of using the original PML stretched fields, artificial fields which are also equivalent to the solutions in the physical region are introduced. This significantly simplifies the…
▽ More
This paper proposes a new boundary integral equation (BIE) methodology based on the perfectly matched layer (PML) truncation technique for solving the electromagnetic scattering problems in a multi-layered medium. Instead of using the original PML stretched fields, artificial fields which are also equivalent to the solutions in the physical region are introduced. This significantly simplifies the study of the proposed methodology to derive the PML problem. Then some PML transformed layer potentials and the associated boundary integral operators (BIOs) are defined and the corresponding jump relations are shown. Under the assumption that the fields vanish on the PML boundary, the solution representations, as well as the related BIEs and regularization of the hyper-singular operators, in terms of the current density functions on the truncated interface, are derived. Numerical experiments are presented to demonstrate the efficiency and accuracy of the method.
△ Less
Submitted 28 January, 2024;
originally announced January 2024.
-
Hankel matrices acting on the Dirichlet space
Authors:
Guanlong Bao,
Kunyu Guo,
Fangmei Sun,
Zipeng Wang
Abstract:
The characterization of the boundedness of operators induced by Hankel matrices on analytic function spaces can be traced back to the work of Z. Nehari and H. Widom on the Hardy space, and has been extensively studied on many other analytic function spaces recently. However, this question remains open in the context of the Dirichlet space [20]. By Carleson measures, the Widom type condition and th…
▽ More
The characterization of the boundedness of operators induced by Hankel matrices on analytic function spaces can be traced back to the work of Z. Nehari and H. Widom on the Hardy space, and has been extensively studied on many other analytic function spaces recently. However, this question remains open in the context of the Dirichlet space [20]. By Carleson measures, the Widom type condition and the reproducing kernel thesis, this paper provides a comprehensive solution to this question. As a beneficial product, characterizations of the boundedness and compactness of operators induced by Cesàro type matrices on the Dirichlet space are given. In addition, we also show that a random Dirichlet function almost surely induces a compact Hankel type operator on the Dirichlet space.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Supervised Knowledge Makes Large Language Models Better In-context Learners
Authors:
Linyi Yang,
Shuibai Zhang,
Zhuohao Yu,
Guangsheng Bao,
Yidong Wang,
**dong Wang,
Ruochen Xu,
Wei Ye,
Xing Xie,
Weizhu Chen,
Yue Zhang
Abstract:
Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering. The recent progress in large-scale generative models has further expanded their use in real-world language applications. However, the critical challenge of improving the generalizability and factuality of LLMs in natural language understanding and question answering remains under-explored. While…
▽ More
Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering. The recent progress in large-scale generative models has further expanded their use in real-world language applications. However, the critical challenge of improving the generalizability and factuality of LLMs in natural language understanding and question answering remains under-explored. While previous in-context learning research has focused on enhancing models to adhere to users' specific instructions and quality expectations, and to avoid undesired outputs, little to no work has explored the use of task-Specific fine-tuned Language Models (SLMs) to improve LLMs' in-context learning during the inference stage. Our primary contribution is the establishment of a simple yet effective framework that enhances the reliability of LLMs as it: 1) generalizes out-of-distribution data, 2) elucidates how LLMs benefit from discriminative models, and 3) minimizes hallucinations in generative tasks. Using our proposed plug-in method, enhanced versions of Llama 2 and ChatGPT surpass their original versions regarding generalizability and factuality. We offer a comprehensive suite of resources, including 16 curated datasets, prompts, model checkpoints, and LLM outputs across 9 distinct tasks. The code and data are released at: https://github.com/YangLinyi/Supervised-Knowledge-Makes-Large-Language-Models-Better-In-context-Learners. Our empirical analysis sheds light on the advantages of incorporating discriminative models into LLMs and highlights the potential of our methodology in fostering more reliable LLMs.
△ Less
Submitted 11 April, 2024; v1 submitted 26 December, 2023;
originally announced December 2023.
-
Multimodal Federated Learning with Missing Modality via Prototype Mask and Contrast
Authors:
Guangyin Bao,
Qi Zhang,
Duoqian Miao,
Zixuan Gong,
Liang Hu,
Ke Liu,
Yang Liu,
Chongyang Shi
Abstract:
In real-world scenarios, multimodal federated learning often faces the practical challenge of intricate modality missing, which poses constraints on building federated frameworks and significantly degrades model inference accuracy. Existing solutions for addressing missing modalities generally involve develo** modality-specific encoders on clients and training modality fusion modules on servers.…
▽ More
In real-world scenarios, multimodal federated learning often faces the practical challenge of intricate modality missing, which poses constraints on building federated frameworks and significantly degrades model inference accuracy. Existing solutions for addressing missing modalities generally involve develo** modality-specific encoders on clients and training modality fusion modules on servers. However, these methods are primarily constrained to specific scenarios with either unimodal clients or complete multimodal clients, struggling to generalize effectively in the intricate modality missing scenarios. In this paper, we introduce a prototype library into the FedAvg-based Federated Learning framework, thereby empowering the framework with the capability to alleviate the global model performance degradation resulting from modality missing during both training and testing. The proposed method utilizes prototypes as masks representing missing modalities to formulate a task-calibrated training loss and a model-agnostic uni-modality inference strategy. In addition, a proximal term based on prototypes is constructed to enhance local training. Experimental results demonstrate the state-of-the-art performance of our approach. Compared to the baselines, our method improved inference accuracy by 3.7\% with 50\% modality missing during training and by 23.8\% during uni-modality inference. Code is available at https://github.com/BaoGuangYin/PmcmFL.
△ Less
Submitted 4 February, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
SuperPrimitive: Scene Reconstruction at a Primitive Level
Authors:
Kirill Mazur,
Gwangbin Bae,
Andrew J. Davison
Abstract:
Joint camera pose and dense geometry estimation from a set of images or a monocular video remains a challenging problem due to its computational complexity and inherent visual ambiguities. Most dense incremental reconstruction systems operate directly on image pixels and solve for their 3D positions using multi-view geometry cues. Such pixel-level approaches suffer from ambiguities or violations o…
▽ More
Joint camera pose and dense geometry estimation from a set of images or a monocular video remains a challenging problem due to its computational complexity and inherent visual ambiguities. Most dense incremental reconstruction systems operate directly on image pixels and solve for their 3D positions using multi-view geometry cues. Such pixel-level approaches suffer from ambiguities or violations of multi-view consistency (e.g. caused by textureless or specular surfaces).
We address this issue with a new image representation which we call a SuperPrimitive. SuperPrimitives are obtained by splitting images into semantically correlated local regions and enhancing them with estimated surface normal directions, both of which are predicted by state-of-the-art single image neural networks. This provides a local geometry estimate per SuperPrimitive, while their relative positions are adjusted based on multi-view observations.
We demonstrate the versatility of our new representation by addressing three 3D reconstruction tasks: depth completion, few-view structure from motion, and monocular dense visual odometry.
△ Less
Submitted 17 April, 2024; v1 submitted 10 December, 2023;
originally announced December 2023.
-
Lite-Mind: Towards Efficient and Robust Brain Representation Network
Authors:
Zixuan Gong,
Qi Zhang,
Guangyin Bao,
Lei Zhu,
Yu Zhang,
Ke Liu,
Liang Hu,
Duoqian Miao
Abstract:
The limited data availability and the low signal-to-noise ratio of fMRI signals lead to the challenging task of fMRI-to-image retrieval. State-of-the-art MindEye remarkably improves fMRI-to-image retrieval performance by leveraging a large model, i.e., a 996M MLP Backbone per subject, to align fMRI embeddings to the final hidden layer of CLIP's Vision Transformer (ViT). However, significant indivi…
▽ More
The limited data availability and the low signal-to-noise ratio of fMRI signals lead to the challenging task of fMRI-to-image retrieval. State-of-the-art MindEye remarkably improves fMRI-to-image retrieval performance by leveraging a large model, i.e., a 996M MLP Backbone per subject, to align fMRI embeddings to the final hidden layer of CLIP's Vision Transformer (ViT). However, significant individual variations exist among subjects, even under identical experimental setups, mandating the training of large subject-specific models. The substantial parameters pose significant challenges in deploying fMRI decoding on practical devices. To this end, we propose Lite-Mind, a lightweight, efficient, and robust brain representation learning paradigm based on Discrete Fourier Transform (DFT), which efficiently aligns fMRI voxels to fine-grained information of CLIP. We elaborately design a DFT backbone with Spectrum Compression and Frequency Projector modules to learn informative and robust voxel embeddings. Our experiments demonstrate that Lite-Mind achieves an impressive 94.6% fMRI-to-image retrieval accuracy on the NSD dataset for Subject 1, with 98.7% fewer parameters than MindEye. Lite-Mind is also proven to be able to be migrated to smaller fMRI datasets and establishes a new state-of-the-art for zero-shot classification on the GOD dataset.
△ Less
Submitted 19 April, 2024; v1 submitted 6 December, 2023;
originally announced December 2023.
-
Convergence of the PML method for the biharmonic wave scattering problem in periodic structures
Authors:
Gang Bao,
Peijun Li,
Xiaokai Yuan
Abstract:
This paper investigates the scattering of biharmonic waves by a one-dimensional periodic array of cavities embedded in an infinite elastic thin plate. The transparent boundary conditions are introduced to formulate the problem from an unbounded domain to a bounded one. The well-posedness of the associated variational problem is demonstrated utilizing the Fredholm alternative theorem. The perfectly…
▽ More
This paper investigates the scattering of biharmonic waves by a one-dimensional periodic array of cavities embedded in an infinite elastic thin plate. The transparent boundary conditions are introduced to formulate the problem from an unbounded domain to a bounded one. The well-posedness of the associated variational problem is demonstrated utilizing the Fredholm alternative theorem. The perfectly matched layer (PML) method is employed to reformulate the original scattering problem, transforming it from an unbounded domain to a bounded one. The transparent boundary conditions for the PML problem are deduced, and the well-posedness of its variational problem is established. Moreover, exponential convergence is achieved between the solution of the PML problem and that of the original scattering problem.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
FOUND: Foot Optimization with Uncertain Normals for Surface Deformation Using Synthetic Data
Authors:
Oliver Boyne,
Gwangbin Bae,
James Charles,
Roberto Cipolla
Abstract:
Surface reconstruction from multi-view images is a challenging task, with solutions often requiring a large number of sampled images with high overlap. We seek to develop a method for few-view reconstruction, for the case of the human foot. To solve this task, we must extract rich geometric cues from RGB images, before carefully fusing them into a final 3D object. Our FOUND approach tackles this,…
▽ More
Surface reconstruction from multi-view images is a challenging task, with solutions often requiring a large number of sampled images with high overlap. We seek to develop a method for few-view reconstruction, for the case of the human foot. To solve this task, we must extract rich geometric cues from RGB images, before carefully fusing them into a final 3D object. Our FOUND approach tackles this, with 4 main contributions: (i) SynFoot, a synthetic dataset of 50,000 photorealistic foot images, paired with ground truth surface normals and keypoints; (ii) an uncertainty-aware surface normal predictor trained on our synthetic dataset; (iii) an optimization scheme for fitting a generative foot model to a series of images; and (iv) a benchmark dataset of calibrated images and high resolution ground truth geometry. We show that our normal predictor outperforms all off-the-shelf equivalents significantly on real images, and our optimization scheme outperforms state-of-the-art photogrammetry pipelines, especially for a few-view setting. We release our synthetic dataset and baseline 3D scans to the research community.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Authors:
Guangsheng Bao,
Yanbin Zhao,
Zhiyang Teng,
Linyi Yang,
Yue Zhang
Abstract:
Large language models (LLMs) have shown the ability to produce fluent and cogent content, presenting both productivity opportunities and societal risks. To build trustworthy AI systems, it is imperative to distinguish between machine-generated and human-authored content. The leading zero-shot detector, DetectGPT, showcases commendable performance but is marred by its intensive computational costs.…
▽ More
Large language models (LLMs) have shown the ability to produce fluent and cogent content, presenting both productivity opportunities and societal risks. To build trustworthy AI systems, it is imperative to distinguish between machine-generated and human-authored content. The leading zero-shot detector, DetectGPT, showcases commendable performance but is marred by its intensive computational costs. In this paper, we introduce the concept of conditional probability curvature to elucidate discrepancies in word choices between LLMs and humans within a given context. Utilizing this curvature as a foundational metric, we present **Fast-DetectGPT**, an optimized zero-shot detector, which substitutes DetectGPT's perturbation step with a more efficient sampling step. Our evaluations on various datasets, source models, and test conditions indicate that Fast-DetectGPT not only surpasses DetectGPT by a relative around 75% in both the white-box and black-box settings but also accelerates the detection process by a factor of 340, as detailed in Table 1. See \url{https://github.com/baoguangsheng/fast-detect-gpt} for code, data, and results.
△ Less
Submitted 22 February, 2024; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Numerical schemes for a multi-species quantum BGK model
Authors:
Gi-Chan Bae,
Marlies Pirner,
Sandra Warnecke
Abstract:
We consider a kinetic model of an N-species gas mixture modeled with quantum Bhatnagar-Gross-Krook (BGK) collision operators. The collision operators consist of a relaxation to a Maxwell distribution in the classical case, a Fermi distribution for fermions and a Bose-Einstein distribution for bosons. In this paper we present a numerical method for simulating this model, which uses an Implicit-Expl…
▽ More
We consider a kinetic model of an N-species gas mixture modeled with quantum Bhatnagar-Gross-Krook (BGK) collision operators. The collision operators consist of a relaxation to a Maxwell distribution in the classical case, a Fermi distribution for fermions and a Bose-Einstein distribution for bosons. In this paper we present a numerical method for simulating this model, which uses an Implicit-Explicit (IMEX) scheme to minimize a certain potential function. This is motivated by theoretical considerations coming from entropy minimization. We show that theoretical properties such as conservation of mass, total momentum and total energy as well as positivity of the distribution functions are preserved by the numerical method presented in this paper, and illustrate its usefulness and effectiveness with numerical examples
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Quantum Locking of Intrinsic Spin Squeezed State in Earth-field-range Magnetometry
Authors:
Peiyu Yang,
Guzhi Bao,
Jun Chen,
Wei Du,
**xian Guo,
Wei** Zhang
Abstract:
In the Earth-field range, the nonlinear Zeeman (NLZ) effect has been a bottleneck limiting the sensitivity and accuracy of atomic magnetometry from physical mechanism. To break this bottleneck, various techniques are introduced to suppress the NLZ effect. Here we revisit the spin dynamics in the Earth-field-range magnetometry and identify the existence of the intrinsic spin squeezed state (SSS) ge…
▽ More
In the Earth-field range, the nonlinear Zeeman (NLZ) effect has been a bottleneck limiting the sensitivity and accuracy of atomic magnetometry from physical mechanism. To break this bottleneck, various techniques are introduced to suppress the NLZ effect. Here we revisit the spin dynamics in the Earth-field-range magnetometry and identify the existence of the intrinsic spin squeezed state (SSS) generated from the geomagnetically induced NLZ effect with the oscillating squeezing degree and squeezing axis. Such oscillating features of the SSS prevent its direct observation and as well, accessibility to magnetic sensing. To exploit quantum advantage of the intrinsic SSS in the Earth-field-range magnetometry, it's essential to lock the oscillating SSS to a persistent one. Hence, we develop a quantum locking technique to achieve a persistent SSS, benefiting from which the sensitivity of the Earth-field-range magnetometer is quantum-enhanced. This work presents an innovative way turning the drawback of NLZ effect into the quantum advantage and opens a new access to quantum-enhanced magnetometry in the Earth-field range.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Peculiar orbital characteristics of Earth quasi-satellite 469219 Kamo`oalewa: implications for the Yarkovsky detection and orbital uncertainty propagation
Authors:
Shoucun Hu,
Bin Li,
Haoxuan Jiang,
Gang Bao,
Jianghui Ji
Abstract:
469219 Kamo`oalewa is selected as one of the primary targets of Tianwen-2 mission, which is currently believed to be the most stable quasi-satellite of Earth. Here we derive a weak detection of the Yarkovsky effect for Kamo`oalewa, giving $A_2 = -1.075\pm0.447\times 10^{-13} \rm{au/d}^2$, with the available ground-based optical observations from Minor Planet Center and a relatively conservative we…
▽ More
469219 Kamo`oalewa is selected as one of the primary targets of Tianwen-2 mission, which is currently believed to be the most stable quasi-satellite of Earth. Here we derive a weak detection of the Yarkovsky effect for Kamo`oalewa, giving $A_2 = -1.075\pm0.447\times 10^{-13} \rm{au/d}^2$, with the available ground-based optical observations from Minor Planet Center and a relatively conservative weighting scheme. Due to the quasi-satellite resonance with Earth, we show that the detection of Yarkovsky effect by orbital fitting with astrometric observations becomes difficult as its orbital drift shows a slow oscillatory growth resulting from the Yarkovsky effect. In addition, we extensively explore the characteristics of orbital uncertainty propagation and find that the positional uncertainty mainly arises from the geocentric radial direction in 2010-2020, and then concentrates in the heliocentric transverse direction in 2020-2030. Furthermore, the heliocentric transverse uncertainty is clearly monthly dependent, which can arrive at a minimum around January and a maximum around July as the orbit moves towards the leading and trailing edges, respectively, in 2025-2027. Finally, we investigate a long-term uncertainty propagation in the quasi-satellite regime, implying that the quasi-satellite resonance with Earth may play a crucial role in constraining the increase of uncertainty over time. Such interesting feature further implies that the orbital precision of Kamo`oalewa is relatively stable at its quasi-satellite phase, which may also be true for other quasi-satellites of Earth.
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
Image or Information? Examining the Nature and Impact of Visualization Perceptual Classification
Authors:
Anjana Arunkumar,
Lace Padilla,
Gi-Yeul Bae,
Chris Bryan
Abstract:
How do people internalize visualizations: as images or information? In this study, we investigate the nature of internalization for visualizations (i.e., how the mind encodes visualizations in memory) and how memory encoding affects its retrieval. This exploratory work examines the influence of various design elements on a user's perception of a chart. Specifically, which design elements lead to p…
▽ More
How do people internalize visualizations: as images or information? In this study, we investigate the nature of internalization for visualizations (i.e., how the mind encodes visualizations in memory) and how memory encoding affects its retrieval. This exploratory work examines the influence of various design elements on a user's perception of a chart. Specifically, which design elements lead to perceptions of visualization as an image or as information? Understanding how design elements contribute to viewers perceiving a visualization more as an image or information will help designers decide which elements to include to achieve their communication goals. For this study, we annotated 500 visualizations and analyzed the responses of 250 online participants, who rated the visualizations on a bilinear scale as image or information. We then conducted an in-person study (n = 101) using a free recall task to examine how the image/information ratings and design elements impact memory. The results revealed several interesting findings: Image-rated visualizations were perceived as more aesthetically appealing, enjoyable, and pleasing. Information-rated visualizations were perceived as less difficult to understand and more aesthetically likable and nice, though participants expressed higher positive sentiment when viewing image-rated visualizations and felt less guided to a conclusion. We also found different patterns among participants that were older. Importantly, we show that visualizations internalized as images are less effective in conveying trends and messages, though they elicit a more positive emotional judgment, while informative visualizations exhibit annotation focused recall and elicit a more positive design judgment. We discuss the implications of this dissociation between aesthetic pleasure and perceived ease of use in visualization design.
△ Less
Submitted 21 July, 2023; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Quantum-enhanced Electrometer based on Microwave-dressed Rydberg Atoms
Authors:
Shuhe Wu,
Dong Zhang,
Zhengchun Li,
Minwei Shi,
Peiyu Yang,
**xian Guo,
Wei Du,
Guzhi Bao,
Wei** Zhang
Abstract:
Rydberg atoms have been shown remarkable performance in sensing microwave field. The sensitivity of such an electrometer based on optical readout of atomic ensemble has been demonstrated to approach the photon-shot-noise limit. However, the sensitivity can not be promoted infinitely by increasing the power of probe light due to the increased collision rates and power broadening. Compared with clas…
▽ More
Rydberg atoms have been shown remarkable performance in sensing microwave field. The sensitivity of such an electrometer based on optical readout of atomic ensemble has been demonstrated to approach the photon-shot-noise limit. However, the sensitivity can not be promoted infinitely by increasing the power of probe light due to the increased collision rates and power broadening. Compared with classical light, the use of quantum light may lead to a better sensitivity with lower number of photons. In this paper, we exploit entanglement in a microwave-dressed Rydberg electrometer to suppress the fluctuation of noise. The results show a sensitivity enhancement beating the shot noise limit in both cold and hot atom schemes. Through optimizing the transmission of optical readout, our quantum advantage can be maintained with different absorptive index of atomic vapor, which makes it possible to apply quantum light source in the absorptive electrometer.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Non-Autoregressive Document-Level Machine Translation
Authors:
Guangsheng Bao,
Zhiyang Teng,
Hao Zhou,
Jianhao Yan,
Yue Zhang
Abstract:
Non-autoregressive translation (NAT) models achieve comparable performance and superior speed compared to auto-regressive translation (AT) models in the context of sentence-level machine translation (MT). However, their abilities are unexplored in document-level MT, hindering their usage in real scenarios. In this paper, we conduct a comprehensive examination of typical NAT models in the context o…
▽ More
Non-autoregressive translation (NAT) models achieve comparable performance and superior speed compared to auto-regressive translation (AT) models in the context of sentence-level machine translation (MT). However, their abilities are unexplored in document-level MT, hindering their usage in real scenarios. In this paper, we conduct a comprehensive examination of typical NAT models in the context of document-level MT and further propose a simple but effective design of sentence alignment between source and target. Experiments show that NAT models achieve high acceleration on documents, and sentence alignment significantly enhances their performance.
However, current NAT models still have a significant performance gap compared to their AT counterparts. Further investigation reveals that NAT models suffer more from the multi-modality and misalignment issues in the context of document-level MT, and current NAT models struggle with exploiting document context and handling discourse phenomena. We delve into these challenges and provide our code at \url{https://github.com/baoguangsheng/nat-on-doc}.
△ Less
Submitted 9 December, 2023; v1 submitted 22 May, 2023;
originally announced May 2023.
-
ParticleWNN: a Novel Neural Networks Framework for Solving Partial Differential Equations
Authors:
Yaohua Zang,
Gang Bao
Abstract:
Deep neural networks (DNNs) have been widely used to solve partial differential equations (PDEs) in recent years. In this work, a novel deep learning-based framework named Particle Weak-form based Neural Networks (ParticleWNN) is developed for solving PDEs in the weak form. In this framework, the trial space is defined as the space of DNNs, while the test space consists of functions compactly supp…
▽ More
Deep neural networks (DNNs) have been widely used to solve partial differential equations (PDEs) in recent years. In this work, a novel deep learning-based framework named Particle Weak-form based Neural Networks (ParticleWNN) is developed for solving PDEs in the weak form. In this framework, the trial space is defined as the space of DNNs, while the test space consists of functions compactly supported in extremely small regions, centered around particles. To facilitate the training of neural networks, an R-adaptive strategy is designed to adaptively modify the radius of regions during training. The ParticleWNN inherits the benefits of weak/variational formulation, requiring less regularity of the solution and a small number of quadrature points for computing integrals. Additionally, due to the special construction of the test functions, ParticleWNN enables parallel implementation and integral calculations only in extremely small regions. This framework is particularly desirable for solving problems with high-dimensional and complex domains. The efficiency and accuracy of ParticleWNN are demonstrated through several numerical examples, showcasing its superiority over state-of-the-art methods. The source code for the numerical examples presented in this paper is available at https://github.com/yaohua32/ParticleWNN.
△ Less
Submitted 12 November, 2023; v1 submitted 21 May, 2023;
originally announced May 2023.
-
Monogamy of entanglement for tripartite systems
Authors:
Xue-Na Zhu,
Gui Bao,
Zhi-Xiang **,
Shao-Ming Fei
Abstract:
We study the monogamy of arbitrary quantum entanglement measures $E$ for tripartite quantum systems. Both sufficient and necessary conditions for $E$ to be monogamous in terms of the $α$th power of $E$ are explicitly derived. It is shown that such monogamy of a entanglement measure $E$ only depends on the boundedness of the solution set of certain equations. Moreover, the monogamy conditions have…
▽ More
We study the monogamy of arbitrary quantum entanglement measures $E$ for tripartite quantum systems. Both sufficient and necessary conditions for $E$ to be monogamous in terms of the $α$th power of $E$ are explicitly derived. It is shown that such monogamy of a entanglement measure $E$ only depends on the boundedness of the solution set of certain equations. Moreover, the monogamy conditions have been also obtained with respect to certain subsets of quantum states for a given quantum correlation. Detailed examples are given to illustrate our results.
△ Less
Submitted 12 May, 2023;
originally announced May 2023.
-
Singularity swap** method for nearly singular integrals based on trapezoidal rule
Authors:
Gang Bao,
Wenmao Hua,
Jun Lai,
**rui Zhang
Abstract:
Accurate evaluation of nearly singular integrals plays an important role in many boundary integral equation based numerical methods. In this paper, we propose a variant of singularity swap** method to accurately evaluate the layer potentials for arbitrarily close targets. Our method is based on the global trapezoidal rule and trigonometric interpolation, resulting in an explicit quadrature formu…
▽ More
Accurate evaluation of nearly singular integrals plays an important role in many boundary integral equation based numerical methods. In this paper, we propose a variant of singularity swap** method to accurately evaluate the layer potentials for arbitrarily close targets. Our method is based on the global trapezoidal rule and trigonometric interpolation, resulting in an explicit quadrature formula. The method achieves spectral accuracy for nearly singular integrals on closed analytic curves. In order to extract the singularity from the complexified distance function, an efficient root finding method is proposed based on contour integration. Through the change of variables, we also extend the quadrature method to integrals on the piecewise analytic curves. Numerical examples for Laplace's and Helmholtz equations show that high order accuracy can be achieved for arbitrarily close field evaluation.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
Target-Side Augmentation for Document-Level Machine Translation
Authors:
Guangsheng Bao,
Zhiyang Teng,
Yue Zhang
Abstract:
Document-level machine translation faces the challenge of data sparsity due to its long input length and a small amount of training data, increasing the risk of learning spurious patterns. To address this challenge, we propose a target-side augmentation method, introducing a data augmentation (DA) model to generate many potential translations for each source document. Learning on these wider range…
▽ More
Document-level machine translation faces the challenge of data sparsity due to its long input length and a small amount of training data, increasing the risk of learning spurious patterns. To address this challenge, we propose a target-side augmentation method, introducing a data augmentation (DA) model to generate many potential translations for each source document. Learning on these wider range translations, an MT model can learn a smoothed distribution, thereby reducing the risk of data sparsity. We demonstrate that the DA model, which estimates the posterior distribution, largely improves the MT performance, outperforming the previous best system by 2.30 s-BLEU on News and achieving new state-of-the-art on News and Europarl benchmarks. Our code is available at https://github.com/baoguangsheng/target-side-augmentation.
△ Less
Submitted 4 June, 2023; v1 submitted 8 May, 2023;
originally announced May 2023.
-
Token-Level Fitting Issues of Seq2seq Models
Authors:
Guangsheng Bao,
Zhiyang Teng,
Yue Zhang
Abstract:
Sequence-to-sequence (seq2seq) models have been widely used for natural language processing, computer vision, and other deep learning tasks. We find that seq2seq models trained with early-stop** suffer from issues at the token level. In particular, while some tokens in the vocabulary demonstrate overfitting, others underfit when training is stopped. Experiments show that the phenomena are pervas…
▽ More
Sequence-to-sequence (seq2seq) models have been widely used for natural language processing, computer vision, and other deep learning tasks. We find that seq2seq models trained with early-stop** suffer from issues at the token level. In particular, while some tokens in the vocabulary demonstrate overfitting, others underfit when training is stopped. Experiments show that the phenomena are pervasive in different models, even in fine-tuned large pretrained-models. We identify three major factors that influence token-level fitting, which include token frequency, parts-of-speech, and prediction discrepancy. Further, we find that external factors such as language, model size, domain, data scale, and pretraining can also influence the fitting of tokens.
△ Less
Submitted 22 June, 2023; v1 submitted 8 May, 2023;
originally announced May 2023.
-
A Family of Bipartite Separability Criteria Based on Bloch Representation of Density Matrices
Authors:
Xue-Na Zhu,
**g Wang,
Gui Bao,
Ming Li,
Shu-Qian Shen,
Shao-Ming Fei
Abstract:
We study the separability of bipartite quantum systems in arbitrary dimensions based on the Bloch representation of density matrices. We present two separability criteria for quantum states in terms of the matrices $T_{αβ}(ρ)$ and $W_{ab,αβ}(ρ)$ constructed from the correlation tensors in the Bloch representation. These separability criteria can be simplified and detect more entanglement than the…
▽ More
We study the separability of bipartite quantum systems in arbitrary dimensions based on the Bloch representation of density matrices. We present two separability criteria for quantum states in terms of the matrices $T_{αβ}(ρ)$ and $W_{ab,αβ}(ρ)$ constructed from the correlation tensors in the Bloch representation. These separability criteria can be simplified and detect more entanglement than the previous separability criteria. Detailed examples are given to illustrate the advantages of results.
△ Less
Submitted 9 May, 2023; v1 submitted 30 April, 2023;
originally announced May 2023.
-
GEMINI: Controlling the Sentence-level Writing Style for Abstractive Text Summarization
Authors:
Guangsheng Bao,
Zebin Ou,
Yue Zhang
Abstract:
Human experts write summaries using different techniques, including extracting a sentence from the document and rewriting it, or fusing various information from the document to abstract it. These techniques are flexible and thus difficult to be imitated by any single method. To address this issue, we propose an adaptive model, GEMINI, that integrates a rewriter and a generator to mimic the sentenc…
▽ More
Human experts write summaries using different techniques, including extracting a sentence from the document and rewriting it, or fusing various information from the document to abstract it. These techniques are flexible and thus difficult to be imitated by any single method. To address this issue, we propose an adaptive model, GEMINI, that integrates a rewriter and a generator to mimic the sentence rewriting and abstracting techniques, respectively. GEMINI adaptively chooses to rewrite a specific document sentence or generate a summary sentence from scratch. Experiments demonstrate that our adaptive approach outperforms the pure abstractive and rewriting baselines on three benchmark datasets, achieving the best results on WikiHow. Interestingly, empirical results show that the human summary styles of summary sentences are consistently predictable given their context. We release our code and model at \url{https://github.com/baoguangsheng/gemini}.
△ Less
Submitted 9 December, 2023; v1 submitted 7 April, 2023;
originally announced April 2023.
-
Programming hydrogel adhesion with engineered polymer network topology
Authors:
Zhen Yang,
Guangyu Bao,
Shuaibing Jiang,
Xingwei Yang,
Ran Huo,
Xiang Ni,
Luc Mongeau,
Rong Long,
Jianyu Li
Abstract:
Hydrogel adhesion that can be easily modulated in magnitude, space, and time is desirable in many emerging applications ranging from tissue engineering, and soft robotics, to wearable devices. In synthetic materials, these complex adhesion behaviors are often achieved individually with mechanisms and apparatus that are difficult to integrate. Here, we report a universal strategy to embody multifac…
▽ More
Hydrogel adhesion that can be easily modulated in magnitude, space, and time is desirable in many emerging applications ranging from tissue engineering, and soft robotics, to wearable devices. In synthetic materials, these complex adhesion behaviors are often achieved individually with mechanisms and apparatus that are difficult to integrate. Here, we report a universal strategy to embody multifaceted adhesion programmability in synthetic hydrogels. By designing the surface network topology of a hydrogel, supramolecular linkages that result in contrasting adhesion behaviors are formed on the hydrogel interface. The incorporation of different topological linkages leads to dynamically tunable adhesion with high-resolution spatial programmability without alteration of bulk mechanics and chemistry. Further, the association of linkages enables stable and tunable adhesion kinetics that can be tailored to suit different applications. We rationalize the physics of chain slippage, rupture, and diffusion that underpins emergent programmable behaviors. We then incorporate the strategy into the designs of various devices such as smart wound patches, fluidic channels, drug-eluting devices, and reconfigurable soft robotics. Our study presents a simple and robust platform in which adhesion controllability in multiple aspects can be easily integrated into a single design of a hydrogel network.
△ Less
Submitted 5 April, 2023; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Duality for $α$-Möbius invariant Besov spaces
Authors:
Guanlong Bao,
Zengjian Lou,
Xiao**g Zhou
Abstract:
For $1\leq p\leq \infty$ and $α>0$, Besov spaces $B^p_α$ play a key role in the theory of $α$-Möbius invariant function spaces. In some sense, $B^1_α$ is the minimal $α$-Möbius invariant function space, $B^2_α$ is the unique $α$-Möbius invariant Hilbert space, and $B^\infty_α$ is the maximal $α$-Möbius invariant function space. In this paper, under the $α$-Möbius invariant pairing and by the space…
▽ More
For $1\leq p\leq \infty$ and $α>0$, Besov spaces $B^p_α$ play a key role in the theory of $α$-Möbius invariant function spaces. In some sense, $B^1_α$ is the minimal $α$-Möbius invariant function space, $B^2_α$ is the unique $α$-Möbius invariant Hilbert space, and $B^\infty_α$ is the maximal $α$-Möbius invariant function space. In this paper, under the $α$-Möbius invariant pairing and by the space $B^\infty_α$, we identify the predual and dual spaces of $B^1_α$. In particular, the corresponding identifications are isometric isomorphisms. The duality theorem via the $α$-Möbius invariant pairing for $B^p_α$ with $p>1$ is also given.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
Large amplitude problem of BGK model: Relaxation to quadratic nonlinearity
Authors:
Gi-Chan Bae,
Gyounghun Ko,
Donghyun Lee,
Seok-Bae Yun
Abstract:
Bhatnagar-Gross-Krook (BGK) equation is a relaxation model of the Boltzmann equation which is widely used in place of the Boltzmann equation for the simulation of various kinetic flow problems. In this work, we study the asymptotic stability of the BGK model when the initial data is not necessarily close to the global equilibrium pointwisely. Due to the highly nonlinear structure of the relaxation…
▽ More
Bhatnagar-Gross-Krook (BGK) equation is a relaxation model of the Boltzmann equation which is widely used in place of the Boltzmann equation for the simulation of various kinetic flow problems. In this work, we study the asymptotic stability of the BGK model when the initial data is not necessarily close to the global equilibrium pointwisely. Due to the highly nonlinear structure of the relaxation operator, the argument developed to derive the bootstrap estimate for the Boltzmann equation leads to a weaker estimate in the case of the BGK model, which does not exclude the possible blow-up of the perturbation. To overcome this issue, we carry out a refined analysis of the macroscopic fields to guarantee that the system transits from a highly nonlinear regime into a quadratic nonlinear regime after a long but finite time, in which the highly nonlinear perturbative term relaxes to essentially quadratic nonlinearity.
△ Less
Submitted 25 January, 2023; v1 submitted 24 January, 2023;
originally announced January 2023.
-
Current State of Community-Driven Radiological AI Deployment in Medical Imaging
Authors:
Vikash Gupta,
Barbaros Selnur Erdal,
Carolina Ramirez,
Ralf Floca,
Laurence Jackson,
Brad Genereaux,
Sidney Bryson,
Christopher P Bridge,
Jens Kleesiek,
Felix Nensa,
Rickmer Braren,
Khaled Younis,
Tobias Penzkofer,
Andreas Michael Bucher,
Ming Melvin Qin,
Gigon Bae,
Hyeonhoon Lee,
M. Jorge Cardoso,
Sebastien Ourselin,
Eric Kerfoot,
Rahul Choudhury,
Richard D. White,
Tessa Cook,
David Bericat,
Matthew Lungren
, et al. (2 additional authors not shown)
Abstract:
Artificial Intelligence (AI) has become commonplace to solve routine everyday tasks. Because of the exponential growth in medical imaging data volume and complexity, the workload on radiologists is steadily increasing. We project that the gap between the number of imaging exams and the number of expert radiologist readers required to cover this increase will continue to expand, consequently introd…
▽ More
Artificial Intelligence (AI) has become commonplace to solve routine everyday tasks. Because of the exponential growth in medical imaging data volume and complexity, the workload on radiologists is steadily increasing. We project that the gap between the number of imaging exams and the number of expert radiologist readers required to cover this increase will continue to expand, consequently introducing a demand for AI-based tools that improve the efficiency with which radiologists can comfortably interpret these exams. AI has been shown to improve efficiency in medical-image generation, processing, and interpretation, and a variety of such AI models have been developed across research labs worldwide. However, very few of these, if any, find their way into routine clinical use, a discrepancy that reflects the divide between AI research and successful AI translation. To address the barrier to clinical deployment, we have formed MONAI Consortium, an open-source community which is building standards for AI deployment in healthcare institutions, and develo** tools and infrastructure to facilitate their implementation. This report represents several years of weekly discussions and hands-on problem solving experience by groups of industry experts and clinicians in the MONAI Consortium. We identify barriers between AI-model development in research labs and subsequent clinical deployment and propose solutions. Our report provides guidance on processes which take an imaging AI model from development to clinical implementation in a healthcare institution. We discuss various AI integration points in a clinical Radiology workflow. We also present a taxonomy of Radiology AI use-cases. Through this report, we intend to educate the stakeholders in healthcare and AI (AI researchers, radiologists, imaging informaticists, and regulators) about cross-disciplinary challenges and possible solutions.
△ Less
Submitted 8 May, 2023; v1 submitted 29 December, 2022;
originally announced December 2022.
-
Optimal Transportation for Electrical Impedance Tomography
Authors:
Gang Bao,
Yixuan Zhang
Abstract:
This work establishes a framework for solving inverse boundary problems with the geodesic based quadratic Wasserstein distance ($W_{2}$). A general form of the Fréchet gradient is systematically derived by optimal transportation (OT) theory. In addition, a fast algorithm based on the new formulation of OT on $\mathbb{S}^{1}$ is developed to solve the corresponding optimal transport problem. The co…
▽ More
This work establishes a framework for solving inverse boundary problems with the geodesic based quadratic Wasserstein distance ($W_{2}$). A general form of the Fréchet gradient is systematically derived by optimal transportation (OT) theory. In addition, a fast algorithm based on the new formulation of OT on $\mathbb{S}^{1}$ is developed to solve the corresponding optimal transport problem. The computational complexity of the algorithm is reduced to $O(N)$ from $O(N^{3})$ of the traditional method. Combining with the adjoint-state method, this framework provides a new computational approach for solving the challenging electrical impedance tomography (EIT) problem. Numerical examples are presented to illustrate the effectiveness of our method.
△ Less
Submitted 28 October, 2022;
originally announced October 2022.
-
Stability for the Helmholtz equation in deterministic and random periodic structures
Authors:
Gang Bao,
Yiwen Lin,
Xiang Xu
Abstract:
Stability results for the Helmholtz equations in both deterministic and random periodic structures are proved in this paper. Under the assumption of excluding resonances, by a variational method and Fourier analysis in the energy space, the stability estimate for the Helmholtz equation in a deterministic periodic structure is established. For the stochastic case, by introducing a variable transfor…
▽ More
Stability results for the Helmholtz equations in both deterministic and random periodic structures are proved in this paper. Under the assumption of excluding resonances, by a variational method and Fourier analysis in the energy space, the stability estimate for the Helmholtz equation in a deterministic periodic structure is established. For the stochastic case, by introducing a variable transform, the variational formulation of the scattering problem in a random domain is reduced to that in a definite domain with random medium. Combining the stability result for the deteministic case with regularity and stochastic regularity of the scattering surface, Pettis measurability theorem and Bochner's Theorem further yield the stability result for the scattering problem by random periodic structures. Both stability estimates are explicit with respect to the wavenumber.
△ Less
Submitted 19 October, 2022;
originally announced October 2022.
-
IronDepth: Iterative Refinement of Single-View Depth using Surface Normal and its Uncertainty
Authors:
Gwangbin Bae,
Ignas Budvytis,
Roberto Cipolla
Abstract:
Single image surface normal estimation and depth estimation are closely related problems as the former can be calculated from the latter. However, the surface normals computed from the output of depth estimation methods are significantly less accurate than the surface normals directly estimated by networks. To reduce such discrepancy, we introduce a novel framework that uses surface normal and its…
▽ More
Single image surface normal estimation and depth estimation are closely related problems as the former can be calculated from the latter. However, the surface normals computed from the output of depth estimation methods are significantly less accurate than the surface normals directly estimated by networks. To reduce such discrepancy, we introduce a novel framework that uses surface normal and its uncertainty to recurrently refine the predicted depth-map. The depth of each pixel can be propagated to a query pixel, using the predicted surface normal as guidance. We thus formulate depth refinement as a classification of choosing the neighboring pixel to propagate from. Then, by propagating to sub-pixel points, we upsample the refined, low-resolution output. The proposed method shows state-of-the-art performance on NYUv2 and iBims-1 - both in terms of depth and normal. Our refinement module can also be attached to the existing depth estimation methods to improve their accuracy. We also show that our framework, only trained for depth estimation, can also be used for depth completion. The code is available at https://github.com/baegwangbin/IronDepth.
△ Less
Submitted 7 October, 2022;
originally announced October 2022.
-
DigiFace-1M: 1 Million Digital Face Images for Face Recognition
Authors:
Gwangbin Bae,
Martin de La Gorce,
Tadas Baltrusaitis,
Charlie Hewitt,
Dong Chen,
Julien Valentin,
Roberto Cipolla,
**g**g Shen
Abstract:
State-of-the-art face recognition models show impressive accuracy, achieving over 99.8% on Labeled Faces in the Wild (LFW) dataset. Such models are trained on large-scale datasets that contain millions of real human face images collected from the internet. Web-crawled face images are severely biased (in terms of race, lighting, make-up, etc) and often contain label noise. More importantly, the fac…
▽ More
State-of-the-art face recognition models show impressive accuracy, achieving over 99.8% on Labeled Faces in the Wild (LFW) dataset. Such models are trained on large-scale datasets that contain millions of real human face images collected from the internet. Web-crawled face images are severely biased (in terms of race, lighting, make-up, etc) and often contain label noise. More importantly, the face images are collected without explicit consent, raising ethical concerns. To avoid such problems, we introduce a large-scale synthetic dataset for face recognition, obtained by rendering digital faces using a computer graphics pipeline. We first demonstrate that aggressive data augmentation can significantly reduce the synthetic-to-real domain gap. Having full control over the rendering pipeline, we also study how each attribute (e.g., variation in facial pose, accessories and textures) affects the accuracy. Compared to SynFace, a recent method trained on GAN-generated synthetic faces, we reduce the error rate on LFW by 52.5% (accuracy from 91.93% to 96.17%). By fine-tuning the network on a smaller number of real face images that could reasonably be obtained with consent, we achieve accuracy that is comparable to the methods trained on millions of real face images.
△ Less
Submitted 5 October, 2022;
originally announced October 2022.
-
SPARC: Sparse Render-and-Compare for CAD model alignment in a single RGB image
Authors:
Florian Langer,
Gwangbin Bae,
Ignas Budvytis,
Roberto Cipolla
Abstract:
Estimating 3D shapes and poses of static objects from a single image has important applications for robotics, augmented reality and digital content creation. Often this is done through direct mesh predictions which produces unrealistic, overly tessellated shapes or by formulating shape prediction as a retrieval task followed by CAD model alignment. Directly predicting CAD model poses from 2D image…
▽ More
Estimating 3D shapes and poses of static objects from a single image has important applications for robotics, augmented reality and digital content creation. Often this is done through direct mesh predictions which produces unrealistic, overly tessellated shapes or by formulating shape prediction as a retrieval task followed by CAD model alignment. Directly predicting CAD model poses from 2D image features is difficult and inaccurate. Some works, such as ROCA, regress normalised object coordinates and use those for computing poses. While this can produce more accurate pose estimates, predicting normalised object coordinates is susceptible to systematic failure. Leveraging efficient transformer architectures we demonstrate that a sparse, iterative, render-and-compare approach is more accurate and robust than relying on normalised object coordinates. For this we combine 2D image information including sparse depth and surface normal values which we estimate directly from the image with 3D CAD model information in early fusion. In particular, we reproject points sampled from the CAD model in an initial, random pose and compute their depth and surface normal values. This combined information is the input to a pose prediction network, SPARC-Net which we train to predict a 9 DoF CAD model pose update. The CAD model is reprojected again and the next pose update is predicted. Our alignment procedure converges after just 3 iterations, improving the state-of-the-art performance on the challenging real-world dataset ScanNet from 25.0% to 31.8% instance alignment accuracy. Code will be released at https://github.com/florianlanger/SPARC .
△ Less
Submitted 3 October, 2022;
originally announced October 2022.
-
SLiDE: Self-supervised LiDAR De-snowing through Reconstruction Difficulty
Authors:
Gwangtak Bae,
Byungjun Kim,
Seongyong Ahn,
Jihong Min,
Inwook Shim
Abstract:
LiDAR is widely used to capture accurate 3D outdoor scene structures. However, LiDAR produces many undesirable noise points in snowy weather, which hamper analyzing meaningful 3D scene structures. Semantic segmentation with snow labels would be a straightforward solution for removing them, but it requires laborious point-wise annotation. To address this problem, we propose a novel self-supervised…
▽ More
LiDAR is widely used to capture accurate 3D outdoor scene structures. However, LiDAR produces many undesirable noise points in snowy weather, which hamper analyzing meaningful 3D scene structures. Semantic segmentation with snow labels would be a straightforward solution for removing them, but it requires laborious point-wise annotation. To address this problem, we propose a novel self-supervised learning framework for snow points removal in LiDAR point clouds. Our method exploits the structural characteristic of the noise points: low spatial correlation with their neighbors. Our method consists of two deep neural networks: Point Reconstruction Network (PR-Net) reconstructs each point from its neighbors; Reconstruction Difficulty Network (RD-Net) predicts point-wise difficulty of the reconstruction by PR-Net, which we call reconstruction difficulty. With simple post-processing, our method effectively detects snow points without any label. Our method achieves the state-of-the-art performance among label-free approaches and is comparable to the fully-supervised method. Moreover, we demonstrate that our method can be exploited as a pretext task to improve label-efficiency of supervised training of de-snowing.
△ Less
Submitted 8 August, 2022;
originally announced August 2022.
-
A Carleson type measure and a family of Möbius invariant function spaces
Authors:
Guanlong Bao,
Fangqin Ye
Abstract:
For $0<s<1$, let $\{z_n\}$ be a sequence in the open unit disk such that $\sum_n (1-|z_n|^2)^s δ_{z_n}$ is an $s$-Carleson measure. In this paper, we consider the connections between this $s$-Carleson measure and the theory of Möbius invariant $F(p, p-2, s)$ spaces by the Volterra type operator, the reciprocal of a Blaschke product, and second order complex differential equations having a prescrib…
▽ More
For $0<s<1$, let $\{z_n\}$ be a sequence in the open unit disk such that $\sum_n (1-|z_n|^2)^s δ_{z_n}$ is an $s$-Carleson measure. In this paper, we consider the connections between this $s$-Carleson measure and the theory of Möbius invariant $F(p, p-2, s)$ spaces by the Volterra type operator, the reciprocal of a Blaschke product, and second order complex differential equations having a prescribed zero sequence.
△ Less
Submitted 10 December, 2022; v1 submitted 7 August, 2022;
originally announced August 2022.
-
A General Contextualized Rewriting Framework for Text Summarization
Authors:
Guangsheng Bao,
Yue Zhang
Abstract:
The rewriting method for text summarization combines extractive and abstractive approaches, improving the conciseness and readability of extractive summaries using an abstractive model. Exiting rewriting systems take each extractive sentence as the only input, which is relatively focused but can lose necessary background knowledge and discourse context. In this paper, we investigate contextualized…
▽ More
The rewriting method for text summarization combines extractive and abstractive approaches, improving the conciseness and readability of extractive summaries using an abstractive model. Exiting rewriting systems take each extractive sentence as the only input, which is relatively focused but can lose necessary background knowledge and discourse context. In this paper, we investigate contextualized rewriting, which consumes the entire document and considers the summary context. We formalize contextualized rewriting as a seq2seq with group-tag alignments, introducing group-tag as a solution to model the alignments, identifying extractive sentences through content-based addressing. Results show that our approach significantly outperforms non-contextualized rewriting systems without requiring reinforcement learning, achieving strong improvements on ROUGE scores upon multiple extractors.
△ Less
Submitted 12 July, 2022;
originally announced July 2022.
-
Carleson measures and the range of a Cesàro-like operator acting on $H^\infty$
Authors:
Guanlong Bao,
Fangmei Sun,
Hasi Wulan
Abstract:
In this paper, by describing characterizations of Carleson type measures on $[0,1)$, we determine the range of a Cesàro-like operator acting on $H^\infty$. A special case of our result gives an answer to a question posed by P. Galanopoulos, D. Girela and N. Merchán recently.
In this paper, by describing characterizations of Carleson type measures on $[0,1)$, we determine the range of a Cesàro-like operator acting on $H^\infty$. A special case of our result gives an answer to a question posed by P. Galanopoulos, D. Girela and N. Merchán recently.
△ Less
Submitted 4 April, 2022;
originally announced April 2022.
-
Double integral estimates for Besov type spaces and their applications
Authors:
Guanlong Bao,
Juntao Du,
Hasi Wulan
Abstract:
For $0<p<\infty$, we give a complete description of nonnegative radial weight functions $ω$ on the open unit disk $\mathbb{D}$ such that $$
\int_{\mathbb{D}} |f'(z)|^p (1-|z|^2)^{p-2}ω(z)dA(z)<\infty
$$ if and only if $$ \int_{\mathbb{D}}\int_{\mathbb{D}}\frac{|f(z)-f(ζ)|^p}{|1-\overlineζz|^{4+τ+σ}}(1-|z|^2)^τ(1-|ζ|^2)^σω(ζ)dA(z)A(ζ)<\infty $$ for all analytic functions $f$ in $\mathbb{D}$, wh…
▽ More
For $0<p<\infty$, we give a complete description of nonnegative radial weight functions $ω$ on the open unit disk $\mathbb{D}$ such that $$
\int_{\mathbb{D}} |f'(z)|^p (1-|z|^2)^{p-2}ω(z)dA(z)<\infty
$$ if and only if $$ \int_{\mathbb{D}}\int_{\mathbb{D}}\frac{|f(z)-f(ζ)|^p}{|1-\overlineζz|^{4+τ+σ}}(1-|z|^2)^τ(1-|ζ|^2)^σω(ζ)dA(z)A(ζ)<\infty $$ for all analytic functions $f$ in $\mathbb{D}$, where $τ$ and $σ$ are some real numbers. As applications, we give some geometric descriptions of functions in Besove type spaces $B_p(ω)$ with doubling weights, and characterize the boundedness and compactness of Hankel type operators related to Besov type spaces with radial Békollé-Bonami weights. Some special cases of our results are new even for some standard weighted Besov spaces.
△ Less
Submitted 4 August, 2022; v1 submitted 22 March, 2022;
originally announced March 2022.
-
An Adaptive Finite Element DtN Method for Maxwell's Equations
Authors:
Gang Bao,
Mingming Zhang,
Xue Jiang,
Peijun Li,
Xiaokai Yuan
Abstract:
This paper is concerned with a numerical solution to the scattering of a time-harmonic electromagnetic wave by a bounded and impenetrable obstacle in three dimensions. The electromagnetic wave propagation is modeled by a boundary value problem of Maxwell's equations in the exterior domain of the obstacle. Based on the Dirichlet-to-Neumann (DtN) operator, which is defined by an infinite series, an…
▽ More
This paper is concerned with a numerical solution to the scattering of a time-harmonic electromagnetic wave by a bounded and impenetrable obstacle in three dimensions. The electromagnetic wave propagation is modeled by a boundary value problem of Maxwell's equations in the exterior domain of the obstacle. Based on the Dirichlet-to-Neumann (DtN) operator, which is defined by an infinite series, an exact transparent boundary condition is introduced and the scattering problem is reduced equivalently into a bounded domain. An a posteriori error estimate based adaptive finite element DtN method is developed to solve the discrete variational problem, where the DtN operator is truncated into a sum of finitely many terms. The a posteriori error estimate takes into account both the finite element approximation error and the truncation error of the DtN operator. The latter is shown to decay exponentially with respect to the truncation parameter. Numerical experiments are presented to illustrate the effectiveness of the proposed method.
△ Less
Submitted 18 February, 2022;
originally announced February 2022.
-
Multi-Scale Hybrid Vision Transformer for Learning Gastric Histology: AI-Based Decision Support System for Gastric Cancer Treatment
Authors:
Yu** Oh,
Go Eun Bae,
Kyung-Hee Kim,
Min-Kyung Yeo,
Jong Chul Ye
Abstract:
Gastric endoscopic screening is an effective way to decide appropriate gastric cancer (GC) treatment at an early stage, reducing GC-associated mortality rate. Although artificial intelligence (AI) has brought a great promise to assist pathologist to screen digitalized whole slide images, existing AI systems are limited in fine-grained cancer subclassifications and have little usability in planning…
▽ More
Gastric endoscopic screening is an effective way to decide appropriate gastric cancer (GC) treatment at an early stage, reducing GC-associated mortality rate. Although artificial intelligence (AI) has brought a great promise to assist pathologist to screen digitalized whole slide images, existing AI systems are limited in fine-grained cancer subclassifications and have little usability in planning cancer treatment. We propose a practical AI system that enables five subclassifications of GC pathology, which can be directly matched to general GC treatment guidance. The AI system is designed to efficiently differentiate multi-classes of GC through multi-scale self-attention mechanism using 2-stage hybrid Vision Transformer (ViT) networks, by mimicking the way how human pathologists understand histology. The AI system demonstrates reliable diagnostic performance by achieving class-average sensitivity of above 0.85 on a total of 1,212 slides from multicentric cohort. Furthermore, AI-assisted pathologists show significantly improved diagnostic sensitivity by 12% in addition to 18% reduced screening time compared to human pathologists. Our results demonstrate that AI-assisted gastric endoscopic screening has a great potential for providing presumptive pathologic opinion and appropriate cancer treatment of gastric cancer in practical clinical settings.
△ Less
Submitted 15 August, 2023; v1 submitted 17 February, 2022;
originally announced February 2022.
-
Sensing performance enhancement via asymmetric gain optimization in the atom-light hybrid interferometer
Authors:
Zhifei Yu,
Bo Fang,
Shuying Chen,
Pan Liu,
Guzhi Bao,
Chun-hua Yuan,
L. Q Chen
Abstract:
The SU (1,1)-type atom-light hybrid interferometer (SALHI) is a kind of interferometer that is sensitive to both the optical phase and atomic phase. However, the loss has been an unavoidable problem in practical applications and greatly limits the use of interferometers. Visibility is an important parameter to evaluate the sensing performance of interferometers. Here, we experimentally demonstrate…
▽ More
The SU (1,1)-type atom-light hybrid interferometer (SALHI) is a kind of interferometer that is sensitive to both the optical phase and atomic phase. However, the loss has been an unavoidable problem in practical applications and greatly limits the use of interferometers. Visibility is an important parameter to evaluate the sensing performance of interferometers. Here, we experimentally demonstrate the mitigating effect of the loss on visibility of the SALHI via asymmetric gain optimization, where the maximum threshold of loss to visibility close to $100\%$ is increased. Furthermore, we theoretically find that the optimal condition for the largest visibility is the same as that for the enhancement of signal-to-noise ratio (SNR) to the best value in the presence of losses using the intensity detection, indicating that the visibility can act as an experimental operational criterion for SNR improvement in practical applications. Improvement of the interference visibility means achievement of SNR enhancement. Our results provide a significant foundation for practical application of the SALHI in radar and ranging measurements.
△ Less
Submitted 11 January, 2022;
originally announced January 2022.
-
BGK model for multi-component gases near a global Maxwellian
Authors:
Gi-Chan Bae,
Christian Klingenberg,
Marlies Pirner,
Seok-Bae Yun
Abstract:
In this paper, we establish the existence of the unique global-in-time classical solutions to the multi-component BGK model suggested in \cite{mixmodel} when the initial data is a small perturbation of global equilibrium. For this, we carefully analyze the dissipative nature of the linearized multi-component relaxation operator, and observe that the partial dissipation from the intra-species and t…
▽ More
In this paper, we establish the existence of the unique global-in-time classical solutions to the multi-component BGK model suggested in \cite{mixmodel} when the initial data is a small perturbation of global equilibrium. For this, we carefully analyze the dissipative nature of the linearized multi-component relaxation operator, and observe that the partial dissipation from the intra-species and the inter-species linearized relaxation operators are combined in a complementary manner to give rise to the desired dissipation estimate of the model We also observe that the convergence rate of the distribution function increases as the momentum-energy interchange rate between the different components of the gas increases.
△ Less
Submitted 5 January, 2022;
originally announced January 2022.