Search | arXiv e-print repository

arXiv:2406.19130 [pdf, other]

Evidential Concept Embedding Models: Towards Reliable Concept Explanations for Skin Disease Diagnosis

Authors: Yibo Gao, Zheyao Gao, Xin Gao, Yuanye Liu, Bomin Wang, Xiahai Zhuang

Abstract: Due to the high stakes in medical decision-making, there is a compelling demand for interpretable deep learning methods in medical image analysis. Concept Bottleneck Models (CBM) have emerged as an active interpretable framework incorporating human-interpretable concepts into decision-making. However, their concept predictions may lack reliability when applied to clinical diagnosis, impeding concept explanations' quality. To address this, we propose an evidential Concept Embedding Model (evi-CEM), which employs evidential learning to model the concept uncertainty. Additionally, we offer to leverage the concept uncertainty to rectify concept misalignments that arise when training CBMs using vision-language models without complete concept supervision. With the proposed methods, we can enhance concept explanations' reliability for both supervised and label-efficient settings. Furthermore, we introduce concept uncertainty for effective test-time intervention. Our evaluation demonstrates that evi-CEM achieves superior performance in terms of concept prediction, and the proposed concept rectification effectively mitigates concept misalignments for label-efficient training. Our code is available at https://github.com/obiyoag/evi-CEM.

Submitted 27 June, 2024; originally announced June 2024.

Comments: accepted by MICCAI 2024

arXiv:2406.19043 [pdf]

CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Yajing Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, Jing Qin, Xiahai Zhuang, Claudia Prieto, et al. (7 additional authors not shown)

Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover high-quality, clinically interpretable images from undersampled measurements. However, the lack of publicly available cardiac MRI k-space dataset in terms of both quantity and diversity has severely hindered substantial technological progress, particularly for data-driven artificial intelligence. Here, we provide a standardized, diverse, and high-quality CMRxRecon2024 dataset to facilitate the technical development, fair evaluation, and clinical transfer of cardiac MRI reconstruction approaches, towards promoting the universal frameworks that enable fast and robust reconstructions across different cardiac MRI protocols in clinical practice. To the best of our knowledge, the CMRxRecon2024 dataset is the largest and most diverse publicly available cardiac k-space dataset. It is acquired from 330 healthy volunteers, covering commonly used modalities, anatomical views, and acquisition trajectories in clinical cardiac MRI workflows. Besides, an open platform with tutorials, benchmarks, and data processing tools is provided to facilitate data usage, advanced method development, and fair performance evaluation.

Submitted 27 June, 2024; originally announced June 2024.

Comments: 19 pages, 3 figures, 2 tables

arXiv:2406.17575 [pdf, other]

Toward Universal Medical Image Registration via Sharpness-Aware Meta-Continual Learning

Authors: Bomin Wang, Xinzhe Luo, Xiahai Zhuang

Abstract: Current deep learning approaches in medical image registration usually face the challenges of distribution shift and data collection, hindering real-world deployment. In contrast, universal medical image registration aims to perform registration on a wide range of clinically relevant tasks simultaneously, thus having tremendous potential for clinical applications. In this paper, we present the first attempt to achieve the goal of universal 3D medical image registration in sequential learning scenarios by proposing a continual learning method. Specifically, we utilize meta-learning with experience replay to mitigate the problem of catastrophic forgetting. To promote the generalizability of meta-continual learning, we further propose sharpness-aware meta-continual learning (SAMCL). We validate the effectiveness of our method on four datasets in a continual learning setup, including brain MR, abdomen CT, lung CT, and abdomen MR-CT image pairs. Results have shown the potential of SAMCL in realizing universal image registration, which performs better than or on par with vanilla sequential or centralized multi-task training strategies. The source code will be available from https://github.com/xzluo97/Continual-Reg.

Submitted 25 June, 2024; originally announced June 2024.

Comments: Accepted by MICCAI 2024

arXiv:2406.11045 [pdf, other]

Kolmogorov Arnold Informed neural network: A physics-informed deep learning framework for solving PDEs based on Kolmogorov Arnold Networks

Authors: Yizheng Wang, Jia Sun, Jinshuai Bai, Cosmin Anitescu, Mohammad Sadegh Eshaghi, Xiaoying Zhuang, Timon Rabczuk, Yinghua Liu

Abstract: AI for partial differential equations (PDEs) has garnered significant attention, particularly with the emergence of Physics-informed neural networks (PINNs). The recent advent of Kolmogorov-Arnold Network (KAN) indicates that there is potential to revisit and enhance the previously MLP-based PINNs. Compared to MLPs, KANs offer interpretability and require fewer parameters. PDEs can be described in various forms, such as strong form, energy form, and inverse form. While mathematically equivalent, these forms are not computationally equivalent, making the exploration of different PDE formulations significant in computational physics. Thus, we propose different PDE forms based on KAN instead of MLP, termed Kolmogorov-Arnold-Informed Neural Network (KINN). We systematically compare MLP and KAN in various numerical examples of PDEs, including multi-scale, singularity, stress concentration, nonlinear hyperelasticity, heterogeneous, and complex geometry problems. Our results demonstrate that KINN significantly outperforms MLP in terms of accuracy and convergence speed for numerous PDEs in computational solid mechanics, except for the complex geometry problem. This highlights KINN's potential for more efficient and accurate PDE solutions in AI for PDEs.

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2406.09676 [pdf, other]

Optimizing Byte-level Representation for End-to-end ASR

Authors: Roger Hsiao, Liuhui Deng, Erik McDermott, Ruchir Travadi, Xiaodan Zhuang

Abstract: We propose a novel approach to optimizing a byte-level representation for end-to-end automatic speech recognition (ASR). Byte-level representation is often used by large scale multilingual ASR systems when the character set of the supported languages is large. The compactness and universality of byte-level representation allow the ASR models to use smaller output vocabularies and therefore, provide more flexibility. UTF-8 is a commonly used byte-level representation for multilingual ASR, but it is not designed to optimize machine learning tasks directly. By using auto-encoder and vector quantization, we show that we can optimize a byte-level representation for ASR and achieve better accuracy. Our proposed framework can incorporate information from different modalities, and provides an error correction mechanism. In an English/Mandarin dictation task, we show that a bilingual ASR model built with this approach can outperform UTF-8 representation by 5% relative in error rate.

Submitted 13 June, 2024; originally announced June 2024.

Comments: 5 pages, 1 figure

arXiv:2406.09098 [pdf, other]

SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models

Authors: Kehua Feng, Keyan Ding, Weijie Wang, Xiang Zhuang, Zeyuan Wang, Ming Qin, Yu Zhao, Jianhua Yao, Qiang Zhang, Huajun Chen

Abstract: The burgeoning utilization of Large Language Models (LLMs) in scientific research necessitates advanced benchmarks capable of evaluating their understanding and application of scientific knowledge comprehensively. To address this need, we introduce the SciKnowEval benchmark, a novel framework that systematically evaluates LLMs across five progressive levels of scientific knowledge: studying extensively, inquiring earnestly, thinking profoundly, discerning clearly, and practicing assiduously. These levels aim to assess the breadth and depth of scientific knowledge in LLMs, including knowledge coverage, inquiry and exploration capabilities, reflection and reasoning abilities, ethic and safety considerations, as well as practice proficiency. Specifically, we take biology and chemistry as the two instances of SciKnowEval and construct a dataset encompassing 50K multi-level scientific problems and solutions. By leveraging this dataset, we benchmark 20 leading open-source and proprietary LLMs using zero-shot and few-shot prompting strategies. The results reveal that despite achieving state-of-the-art performance, the proprietary LLMs still have considerable room for improvement, particularly in addressing scientific computations and applications. We anticipate that SciKnowEval will establish a comprehensive standard for benchmarking LLMs in science research and discovery, and promote the development of LLMs that integrate scientific knowledge with strong safety awareness. The dataset and code are publicly available at https://github.com/hicai-zju/sciknoweval.

Submitted 13 June, 2024; originally announced June 2024.

Comments: 48 pages, 2 figures

arXiv:2406.06063 [pdf, other]

Enabling Large-Scale and High-Precision Fluid Simulations on Near-Term Quantum Computers

Authors: Zhao-Yun Chen, Teng-Yang Ma, Chuang-Chao Ye, Liang Xu, Ming-Yang Tan, Xi-Ning Zhuang, Xiao-Fan Xu, Yun-Jie Wang, Tai-Ping Sun, Yong Chen, Lei Du, Liang-Liang Guo, Hai-Feng Zhang, Hao-Ran Tao, Tian-Le Wang, Xiao-Yan Yang, Ze-An Zhao, Peng Wang, Sheng Zhang, Chi Zhang, Ren-Ze Zhao, Zhi-Long Jia, Wei-Cheng Kong, Meng-Han Dou, Jun-Chao Wang, et al. (7 additional authors not shown)

Abstract: Quantum computational fluid dynamics (QCFD) offers a promising alternative to classical computational fluid dynamics (CFD) by leveraging quantum algorithms for higher efficiency. This paper introduces a comprehensive QCFD method, including an iterative method "Iterative-QLS" that suppresses error in quantum linear solver, and a subspace method to scale the solution to a larger size. We implement our method on a superconducting quantum computer, demonstrating successful simulations of steady Poiseuille flow and unsteady acoustic wave propagation. The Poiseuille flow simulation achieved a relative error of less than $0.2\%$, and the unsteady acoustic wave simulation solved a 5043-dimensional matrix. We emphasize the utilization of the quantum-classical hybrid approach in applications of near-term quantum computers. By adapting to quantum hardware constraints and offering scalable solutions for large-scale CFD problems, our method paves the way for practical applications of near-term quantum computers in computational science.

Submitted 19 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

Comments: 31 pages, 10 figures

arXiv:2406.02430 [pdf, other]

Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

Authors: Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, Yuanyuan Huo, Dongya Jia, Chumin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu, et al. (21 additional authors not shown)

Abstract: We introduce Seed-TTS, a family of large-scale autoregressive text-to-speech (TTS) models capable of generating speech that is virtually indistinguishable from human speech. Seed-TTS serves as a foundation model for speech generation and excels in speech in-context learning, achieving performance in speaker similarity and naturalness that matches ground truth human speech in both objective and subjective evaluations. With fine-tuning, we achieve even higher subjective scores across these metrics. Seed-TTS offers superior controllability over various speech attributes such as emotion and is capable of generating highly expressive and diverse speech for speakers in the wild. Furthermore, we propose a self-distillation method for speech factorization, as well as a reinforcement learning approach to enhance model robustness, speaker similarity, and controllability. We additionally present a non-autoregressive (NAR) variant of the Seed-TTS model, named $\text{Seed-TTS}_\text{DiT}$, which utilizes a fully diffusion-based architecture. Unlike previous NAR-based TTS systems, $\text{Seed-TTS}_\text{DiT}$ does not depend on pre-estimated phoneme durations and performs speech generation through end-to-end processing. We demonstrate that this variant achieves comparable performance to the language model-based variant and showcase its effectiveness in speech editing. We encourage readers to listen to demos at https://bytedancespeech.github.io/seedtts_tech_report.

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2406.01335 [pdf, other]

Statistics-Informed Parameterized Quantum Circuit via Maximum Entropy Principle for Data Science and Finance

Authors: Xi-Ning Zhuang, Zhao-Yun Chen, Cheng Xue, Xiao-Fan Xu, Chao Wang, Huan-Yu Liu, Tai-Ping Sun, Yun-Jie Wang, Yu-Chun Wu, Guo-Ping Guo

Abstract: Quantum machine learning has demonstrated significant potential in solving practical problems, particularly in statistics-focused areas such as data science and finance. However, challenges remain in preparing and learning statistical models on a quantum processor due to issues with trainability and interpretability. In this letter, we utilize the maximum entropy principle to design a statistics-informed parameterized quantum circuit (SI-PQC) for efficient preparation and training of quantum computational statistical models, including arbitrary distributions and their weighted mixtures. The SI-PQC features a static structure with trainable parameters, enabling in-depth optimized circuit compilation, exponential reductions in resource and time consumption, and improved trainability and interpretability for learning quantum states and classical model parameters simultaneously. As an efficient subroutine for preparing and learning in various quantum algorithms, the SI-PQC addresses the input bottleneck and facilitates the injection of prior knowledge.

Submitted 18 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: 19 pages, 5 figures

arXiv:2405.20712 [pdf, other]

Simulation of open quantum systems on universal quantum computers

Authors: Huan-Yu Liu, Xiaoshui Lin, Zhao-Yun Chen, Cheng Xue, Tai-Ping Sun, Qing-Song Li, Xi-Ning Zhuang, Yun-Jie Wang, Yu-Chun Wu, Ming Gong, Guo-Ping Guo

Abstract: The rapid development of quantum computers has enabled demonstrations of quantum advantages on various tasks. However, real quantum systems are always dissipative due to their inevitable interaction with the environment, and the resulting non-unitary dynamics make quantum simulation challenging with only unitary quantum gates. In this work, we present an innovative and scalable method to simulate open quantum systems using quantum computers. We define an adjoint density matrix as a counterpart of the true density matrix, which reduces to a mixed-unitary quantum channel and thus can be effectively sampled using quantum computers. This method has several benefits, including no need for auxiliary qubits and noteworthy scalability. Moreover, accurate long-time simulation can also be achieved as the adjoint density matrix and the true dissipated one converge to the same state. Finally, we present deployments of this theory in the dissipative quantum $XY$ model for the evolution of correlation and entropy with short-time dynamics and the disordered Heisenberg model for many-body localization with long-time dynamics. This work promotes the study of real-world many-body dynamics with quantum computers, highlighting the potential to demonstrate practical quantum advantages.

Submitted 31 May, 2024; originally announced May 2024.

Comments: 13 pages, 6 figures

arXiv:2405.19689 [pdf, other]

Uncertainty-aware sign language video retrieval with probability distribution modeling

Authors: Xuan Wu, Hongxiang Li, Yuanjiang Luo, Xuxin Cheng, Xianwei Zhuang, Meng Cao, Keren Fu

Abstract: Sign language video retrieval plays a key role in facilitating information access for the deaf community. Despite significant advances in video-text retrieval, the complexity and inherent uncertainty of sign language preclude the direct application of these techniques. Previous methods achieve the mapping between sign language video and text through fine-grained modal alignment. However, due to the scarcity of fine-grained annotation, the uncertainty inherent in sign language video is underestimated, limiting the further development of sign language retrieval tasks. To address this challenge, we propose a novel Uncertainty-aware Probability Distribution Retrieval (UPRet) that conceptualizes the mapping process of sign language video and text in terms of probability distributions, explores their potential interrelationships, and enables flexible mappings. Experiments on three benchmarks demonstrate the effectiveness of our method, which achieves state-of-the-art results on How2Sign (59.1%), PHOENIX-2014T (72.0%), and CSL-Daily (78.4%).

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.10607 [pdf, ps, other]

On the existence and estimates of nested spherical designs

Authors: Ruigang Zheng, Xiaosheng Zhuang

Abstract: In this paper, we prove the existence of a spherical $t$-design formed by adding extra points to an arbitrarily given point set on the sphere and, subsequently, deduce the existence of nested spherical designs. Estimates on the number of required points are also given. For the case that the given point set is a spherical $t_1$-design such that $t_1 < t$ and the number of points is of optimal order $t_1^d$, we show that the upper bound of the total number of extra points and given points for forming nested spherical $t$-design is of order $t^{2d+1}$. A brief discussion concerning the optimal order in nested spherical designs is also given.

Submitted 17 May, 2024; originally announced May 2024.

MSC Class: 41A10; 41A44; 41A55; 65D32

arXiv:2405.04828 [pdf, other]

ChuXin: 1.6B Technical Report

Authors: Xiaomin Zhuang, Yufan Jiang, Qiaozhi He, Zhihua Wu

Abstract: In this report, we present ChuXin, an entirely open-source language model with a size of 1.6 billion parameters. Unlike the majority of works that only open-sourced the model weights and architecture, we have made everything needed to train a model available, including the training data, the training process, and the evaluation code. Our goal is to empower and strengthen the open research community, fostering transparency and enabling a new wave of innovation in the field of language modeling. Furthermore, we extend the context length to 1M tokens through lightweight continual pretraining and demonstrate strong needle-in-a-haystack retrieval performance. The weights for both models are available at Hugging Face to download and use.

Submitted 8 May, 2024; originally announced May 2024.

Comments: Technical Report

arXiv:2405.02918 [pdf, other]

MERIT: Multi-view Evidential learning for Reliable and Interpretable liver fibrosis sTaging

Authors: Yuanye Liu, Zheyao Gao, Nannan Shi, Fuping Wu, Yuxin Shi, Qingchao Chen, Xiahai Zhuang

Abstract: Accurate staging of liver fibrosis from magnetic resonance imaging (MRI) is crucial in clinical practice. While conventional methods often focus on a specific sub-region, multi-view learning captures more information by analyzing multiple patches simultaneously. However, previous multi-view approaches could not typically calculate uncertainty by nature, and they generally integrate features from different views in a black-box fashion, hence compromising reliability as well as interpretability of the resulting models. In this work, we propose a new multi-view method based on evidential learning, referred to as MERIT, which tackles the two challenges in a unified framework. MERIT enables uncertainty quantification of the predictions to enhance reliability, and employs a logic-based combination rule to improve interpretability. Specifically, MERIT models the prediction from each sub-view as an opinion with quantified uncertainty under the guidance of the subjective logic theory. Furthermore, a distribution-aware base rate is introduced to enhance performance, particularly in scenarios involving class distribution shifts. Finally, MERIT adopts a feature-specific combination rule to explicitly fuse multi-view predictions, thereby enhancing interpretability. Results have showcased the effectiveness of the proposed MERIT, highlighting the reliability and offering both ad-hoc and post-hoc interpretability. They also illustrate that MERIT can elucidate the significance of each view in the decision-making process for liver fibrosis staging.

Submitted 5 May, 2024; originally announced May 2024.

Comments: Submitted to Medical Image Analysis

MSC Class: 68U10 ACM Class: I.4.6

arXiv:2404.08979 [pdf, other]

BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection

Authors: Jian Zhang, Ruiteng Zhang, Xinyue Yan, Xiting Zhuang, Ruicheng Cao

Abstract: Degraded underwater images decrease the accuracy of underwater object detection. However, existing methods for underwater image enhancement mainly focus on improving the indicators in visual aspects, which may not benefit the tasks of underwater image detection, and may lead to serious degradation in performance. To alleviate this problem, we propose a bidirectional-guided method for underwater object detection, referred to as BG-YOLO. In the proposed method, the network is organized as an enhancement branch and a detection branch in parallel. The enhancement branch consists of a cascade of an image enhancement subnet and an object detection subnet. The detection branch consists only of a detection subnet. A feature guided module connects the shallow convolution layers of the two branches. When training the enhancement branch, the object detection subnet in the enhancement branch guides the image enhancement subnet to be optimized towards the direction that is most conducive to the detection task. The shallow feature map of the trained enhancement branch is output to the feature guided module, constraining the optimization of the detection branch through a consistency loss and prompting it to learn more detailed information about the objects. Hence the detection performance is refined. During detection, only the detection branch is retained, so no additional computational cost is introduced. Extensive experiments demonstrate that the proposed method shows significant improvement in performance of the detector in severely degraded underwater scenes while maintaining a remarkable detection speed.

Submitted 13 April, 2024; originally announced April 2024.

Comments: 15 pages, 8 figures, 4 tables

MSC Class: 68T07; 68T45 ACM Class: I.4.3; I.4.8; I.4.9; I.4.10; I.2.10

arXiv:2404.08317 [pdf, other]

Technical Design Report of the Spin Physics Detector at NICA

Authors: The SPD Collaboration, V. Abazov, V. Abramov, L. Afanasyev, R. Akhunzyanov, A. Akindinov, I. Alekseev, A. Aleshko, V. Alexakhin, G. Alexeev, L. Alimov, A. Allakhverdieva, A. Amoroso, V. Andreev, V. Andreev, E. Andronov, Yu. Anikin, S. Anischenko, A. Anisenkov, V. Anosov, E. Antokhin, A. Antonov, S. Antsupov, A. Anufriev, K. Asadova, et al. (392 additional authors not shown)

Abstract: The Spin Physics Detector collaboration proposes to install a universal detector in the second interaction point of the NICA collider under construction (JINR, Dubna) to study the spin structure of the proton and deuteron and other spin-related phenomena using a unique possibility to operate with polarized proton and deuteron beams at a collision energy up to 27 GeV and a luminosity up to $10^{32}$ cm$^{-2}$ s$^{-1}$. As the main goal, the experiment aims to provide access to the gluon TMD PDFs in the proton and deuteron, as well as the gluon transversity distribution and tensor PDFs in the deuteron, via the measurement of specific single and double spin asymmetries using different complementary probes such as charmonia, open charm, and prompt photon production processes. Other polarized and unpolarized physics is possible, especially at the first stage of NICA operation with reduced luminosity and collision energy of the proton and ion beams. This document is dedicated exclusively to technical issues of the SPD setup construction.

Submitted 28 May, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

arXiv:2404.07435 [pdf]

Encoding Urban Ecologies: Automated Building Archetype Generation through Self-Supervised Learning for Energy Modeling

Authors: Xinwei Zhuang, Zixun Huang, Wentao Zeng, Luisa Caldas

Abstract: As the global population and urbanization expand, the building sector has emerged as the predominant energy consumer and carbon emission contributor. The need for innovative Urban Building Energy Modeling grows, yet existing building archetypes often fail to capture the unique attributes of local buildings and the nuanced distinctions between different cities, jeopardizing the precision of energy modeling. This paper presents an alternative tool employing self-supervised learning to distill complex geometric data into representative, locale-specific archetypes. This study attempts to foster a new paradigm of interaction with built environments, incorporating local parameters to conduct bespoke energy simulations at the community level. The catered archetypes can augment the precision and applicability of energy consumption modeling at different scales across diverse building inventories. This tool provides a potential solution that encourages the exploration of emerging local ecologies. By integrating building envelope characteristics and cultural granularity into the building archetype generation process, we seek a future where architecture and urban design are intricately interwoven with the energy sector in shaping our built environments.

Submitted 10 April, 2024; originally announced April 2024.

arXiv:2404.01082 [pdf, other]

The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Liping Zhang, Weitian Chen, Yidong Zhao , et al. (25 additional authors not shown)

Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation platforms hinders the development of data-driven reconstruction algorithms. To address this issue, we organized the Cardiac MRI Reconstruction Challenge (CMRxRecon) in 2023, in collaboration with the 26th International Conference on MICCAI. CMRxRecon presented an extensive k-space dataset comprising cine and mapping raw data, accompanied by detailed annotations of cardiac anatomical structures. With overwhelming participation, the challenge attracted more than 285 teams and over 600 participants. Among them, 22 teams successfully submitted Docker containers for the testing phase, with 7 teams submitting for both cine and mapping tasks. All teams use deep learning based approaches, indicating that deep learning has predominately become a promising solution for the problem. The first-place winner of both tasks utilizes the E2E-VarNet architecture as backbones. In contrast, U-Net is still the most popular backbone for both multi-coil and single-coil reconstructions.
This paper provides a comprehensive overview of the challenge design, presents a summary of the submitted results, reviews the employed methods, and offers an in-depth discussion that aims to inspire future advancements in cardiac MRI reconstruction models. The summary emphasizes the effective strategies observed in Cardiac MRI reconstruction, including backbone architecture, loss function, pre-processing techniques, physical modeling, and model complexity, thereby providing valuable insights for further developments in this field.

Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

Comments: 25 pages, 17 figures

arXiv:2403.19121 [pdf, other]

Code Comparison Tuning for Code Large Language Models

Authors: Yufan Jiang, Qiaozhi He, Xiaomin Zhuang, Zhihua Wu

Abstract: We present Code Comparison Tuning (CCT), a simple and effective tuning method for code large language models (Code LLMs) to better handle subtle code errors. Specifically, we integrate the concept of comparison into instruction tuning, both at the token and sequence levels, enabling the model to discern even the slightest deviations in code. To compare the original code with an erroneous version containing manually added code errors, we use token-level preference loss for detailed token-level comparisons. Additionally, we combine code segments to create a new instruction tuning sample for sequence-level comparisons, enhancing the model's bug-fixing capability. Experimental results on the HumanEvalFix benchmark show that CCT surpasses instruction tuning in pass@1 scores by up to 4 points across diverse code LLMs, and extensive analysis demonstrates the effectiveness of our method.

Submitted 5 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

Comments: Preprint

arXiv:2403.16406 [pdf]

Development of a Chinese Human-Automation Trust Scale

Authors: Zixin Cui, Xiangling Zhuang, Seul Chan Lee, Jieun Lee, Xintong Li, Makoto Itoh

Abstract: The development of a reliable and valid assessment tool of human-automation trust is an important topic. This study aimed to develop a Chinese version of human-automation trust scale (C-HATS) with reasonable reliability and validity based on Lee and See (2004)'s trust model. After three phases of assessments including exploratory factor analysis, item analysis, and confirmatory factor analysis, different dimensions and items were considered for initial and post-task human-automation trust. For post-task trust, the scale had three dimensions and 11 items and reflected Lee and See (2004)'s model, whereas different from Lee and See (2004)'s model, the final scale had 14 items but only two dimensions for initial trust. Nevertheless, for both initial and post-task trust, reasonable reliability and validity of the scale were verified with various consumer automation products. Although further verification is still necessary, the developed C-HATS could be used to effectively assess human-automation trust in the Chinese context.

Submitted 24 March, 2024; originally announced March 2024.

Comments: 26 pages with 3 figures

arXiv:2403.01582 [pdf, other]

Selection, Ensemble, and Adaptation: Advancing Multi-Source-Free Domain Adaptation via Architecture Zoo

Authors: Jiangbo Pei, Ruizhe Li, Aidong Men, Yang Liu, Xiahai Zhuang, Qingchao Chen

Abstract: Conventional Multi-Source Free Domain Adaptation (MSFDA) assumes that each source domain provides a single source model, and all source models adopt a uniform architecture. This paper introduces Zoo-MSFDA, a more general setting that allows each source domain to offer a zoo of multiple source models with different architectures. While it enriches the source knowledge, Zoo-MSFDA risks being dominated by suboptimal/harmful models. To address this issue, we theoretically analyze the model selection problem in Zoo-MSFDA, and introduce two principles: transferability principle and diversity principle. Recognizing the challenge of measuring transferability, we subsequently propose a novel Source-Free Unsupervised Transferability Estimation (SUTE). It enables assessing and comparing transferability across multiple source models with different architectures under domain shift, without requiring target labels and source data. Based on the above, we introduce a Selection, Ensemble, and Adaptation (SEA) framework to address Zoo-MSFDA, which consists of: 1) source model selection based on the proposed principles and SUTE; 2) ensemble construction based on SUTE-estimated transferability; 3) target-domain adaptation of the ensemble model. Evaluations demonstrate that our SEA framework, with the introduced Zoo-MSFDA setting, significantly improves adaptation performance (e.g., 13.5% on DomainNet). Additionally, our SUTE achieves state-of-the-art performance in transferability estimation.

Submitted 23 May, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

arXiv:2402.18940 [pdf, other]

End-to-End Quantum Vision Transformer: Towards Practical Quantum Speedup in Large-Scale Models

Authors: Cheng Xue, Zhao-Yun Chen, Xi-Ning Zhuang, Yun-Jie Wang, Tai-Ping Sun, Jun-Chao Wang, Huan-Yu Liu, Yu-Chun Wu, Zi-Lei Wang, Guo-Ping Guo

Abstract: The field of quantum deep learning presents significant opportunities for advancing computational capabilities, yet it faces a major obstacle in the form of the "information loss problem" due to the inherent limitations of the necessary quantum tomography in scaling quantum deep neural networks. This paper introduces an end-to-end Quantum Vision Transformer (QViT), which incorporates an innovative quantum residual connection technique, to overcome these challenges and therefore optimize quantum computing processes in deep learning. Our thorough complexity analysis of the QViT reveals a theoretically exponential and empirically polynomial speedup, showcasing the model's efficiency and potential in quantum computing applications. We conducted extensive numerical tests on modern, large-scale transformers and datasets, establishing the QViT as a pioneering advancement in applying quantum deep neural networks in practical scenarios. Our work provides a comprehensive quantum deep learning paradigm, which not only demonstrates the versatility of current quantum linear algebra algorithms but also promises to enhance future research and development in quantum deep learning.

Submitted 1 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

Comments: 24 pages, 10 figures

arXiv:2402.10959 [pdf]

Ultra-low Frequency Acoustic Luneburg Lens

Authors: Liuxian Zhao, Xuxu Zhuang, Hao Guo, Chuanxing Bi, Zhaoyong Sun

Abstract: In this paper, a novel structural Luneburg lens with local resonators is proposed. This lens allows for the realization of subwavelength focusing in the low-frequency range. The lens is achieved by a graded refractive index from the lens centre to the outer surface. Numerical simulations are conducted to obtain data on wave propagation waveform, maximum displacement amplitude, and full width at half maximum of the lens's focal region. The results show that a broadband frequency range can be achieved for subwavelength focusing. This provides a straightforward and adaptable method for designing the structural Luneburg lens for numerous applications.

Submitted 12 February, 2024; originally announced February 2024.

Comments: 12 pages, 5 figures

MSC Class: 06Bxx ACM Class: B.1.1

arXiv:2402.06858 [pdf, other]

Evidence of genuine quantum effects in nonequilibrium entropy production

Authors: Qing-Feng Xue, Xu-Cai Zhuang, De-Yang Duan, Ying-Jie Zhang, Wei-Bin Yan, Yun-Jie Xia, Rosario Lo Franco, Zhong-Xiao Man

Abstract: Entropy production is a fundamental concept that plays a crucial role in the second law of thermodynamics and the measure of irreversibility. It imposes rigorous constraints on the kinds of transformations allowed in thermodynamic processes. Using an optical setup, here we experimentally demonstrate the division of entropy production of an open quantum system into a population-related component and a coherence-related component, validating previous theoretical predictions. The coherence-related component represents a genuine quantum contribution with no classical counterpart. By adjusting bath temperatures and initial coherences of the system, we first derive the total entropy production due to both populations and coherences, then remove all the coherences of the system to solely obtain the population-related contribution. The difference between these two results allows us to isolate the coherence-related term. Based on this division, our experiment ultimately proves that irreversibility at the quantum level can be reduced through properly harnessing the two contributions to entropy production.

Submitted 15 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

Comments: 7 pages, 3 figures

arXiv:2402.04779 [pdf, other]

StableMask: Refining Causal Masking in Decoder-only Transformer

Authors: Qingyu Yin, Xuzheng He, Xiang Zhuang, Yu Zhao, Jianhua Yao, Xiaoyu Shen, Qiang Zhang

Abstract: The decoder-only Transformer architecture with causal masking and relative position encoding (RPE) has become the de facto choice in language modeling. Despite its exceptional performance across various tasks, we have identified two limitations: First, it requires all attention scores to be non-zero and sum up to 1, even if the current embedding has sufficient self-contained information. This compels the model to assign disproportionately excessive attention to specific tokens. Second, RPE-based Transformers are not universal approximators due to their limited capacity at encoding absolute positional information, which limits their application in position-critical tasks. In this work, we propose StableMask: a parameter-free method to address both limitations by refining the causal mask. It introduces pseudo-attention values to balance attention distributions and encodes absolute positional information via a progressively decreasing mask ratio. StableMask's effectiveness is validated both theoretically and empirically, showing significant enhancements in language models with parameter sizes ranging from 71M to 1.4B across diverse datasets and encoding methods. We further show that it naturally supports (1) efficient extrapolation without special tricks such as StreamingLLM and (2) easy integration with existing attention optimization techniques.

Submitted 7 February, 2024; originally announced February 2024.

Comments: Preprint

arXiv:2402.04285 [pdf]

Elastic wave imaging with Maxwell's fish-eye lens

Authors: Liuxian Zhao, Chunlin Li, Xuxu Zhuang, Hao Guo, Yongquan Liu

Abstract: In this paper, a modified Maxwell's fish-eye lens is proposed in order to achieve super-resolution imaging. This lens possesses an elevated refractive index profile compared with the traditional Maxwell's fish-eye lens. The refractive index profile is achieved with a variable thickness configuration defined in a sheet plate structure, to realise desired changes in refractive indices. The wave propagation behaviours and the full width at half maximum (FWHM) are obtained from numerical simulations and experimental studies at the focal region of the lens. Super-resolution imaging is observed in a broadband frequency scope, with the FWHM around 0.2λ from 5 to 10 kHz. This work provides a straightforward and flexible approach to the engineering of the MFEL imaging characteristics and energy distributions for related applications.

Submitted 5 February, 2024; originally announced February 2024.

Comments: 18 pages, 10 figures

MSC Class: 74Bxx ACM Class: A.0

arXiv:2401.14656 [pdf, other]

Scientific Large Language Models: A Survey on Biological & Chemical Domains

Authors: Qiang Zhang, Keyang Ding, Tianwen Lyv, Xinda Wang, Qingyu Yin, Yiwen Zhang, Jing Yu, Yuhao Wang, Xiaotong Li, Zhuoyi Xiang, Xiang Zhuang, Zeyuan Wang, Ming Qin, Mengyao Zhang, Jinlu Zhang, Jiyu Cui, Renjun Xu, Hongyang Chen, Xiaohui Fan, Huabin Xing, Huajun Chen

Abstract: Large Language Models (LLMs) have emerged as a transformative power in enhancing natural language comprehension, representing a significant stride toward artificial general intelligence. The application of LLMs extends beyond conventional linguistic boundaries, encompassing specialized linguistic systems developed within various scientific disciplines. This growing interest has led to the advent of scientific LLMs, a novel subclass specifically engineered for facilitating scientific discovery. As a burgeoning area in the community of AI for Science, scientific LLMs warrant comprehensive exploration. However, a systematic and up-to-date survey introducing them is currently lacking. In this paper, we endeavor to methodically delineate the concept of "scientific language", whilst providing a thorough review of the latest advancements in scientific LLMs. Given the expansive realm of scientific disciplines, our analysis adopts a focused lens, concentrating on the biological and chemical domains. This includes an in-depth examination of LLMs for textual knowledge, small molecules, macromolecular proteins, genomic sequences, and their combinations, analyzing them in terms of model architectures, capabilities, datasets, and evaluation. Finally, we critically examine the prevailing challenges and point out promising research directions along with the advances of LLMs. By offering a comprehensive overview of technical developments in this field, this survey aspires to be an invaluable resource for researchers navigating the intricate landscape of scientific LLMs.

Submitted 26 January, 2024; originally announced January 2024.

arXiv:2401.13225 [pdf, ps, other]

A New Look at the Scalar Meson $f_0(500)$ via $D^+\to π^+π^-\ell^+ν_\ell$ Decays

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, X. Cai , et al. (615 additional authors not shown)

Abstract: Using $2.93~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV, we investigate the semileptonic decays $D^+\to π^+π^- \ell^+ν_\ell$ ($\ell=e$ and $μ$). The $D^+\to f_0(500)μ^+ν_μ$ decay is observed for the first time. By analyzing simultaneously the differential decay rates of $D^+\to f_0(500) μ^+ν_μ$ and $D^+\to f_0(500) e^+ν_e$ in different $\ell^+ν_\ell$ four-momentum transfer intervals, the product of the relevant hadronic form factor $f^{f_0}_{+}(0)$ and the magnitude of the $c\to d$ Cabibbo-Kobayashi-Maskawa matrix element $|V_{cd}|$ is determined to be $f_{+}^{f_0} (0)|V_{cd}|=0.0787\pm0.0060_{\rm stat}\pm0.0033_{\rm syst}$ for the first time. With the input of $|V_{cd}|$ from the global fit in the standard model, we determine $f_{+}^{f_0} (0)=0.350\pm0.027_{\rm stat}\pm0.015_{\rm syst}$. The absolute branching fractions of $D^+\to f_0(500)_{(π^+π^-)}μ^+ν_μ$ and $D^+\to ρ^0_{(π^+π^-)} μ^+ν_μ$ are determined as $(0.72\pm0.13_{\rm stat}\pm0.10_{\rm syst})\times10^{-3}$ and $(1.64\pm0.13_{\rm stat}\pm0.11_{\rm syst})\times 10^{-3}$. Combining these results with those of previous BESIII measurements on their semielectronic counterparts from the same data sample, we test lepton flavor universality by measuring the branching fraction ratios ${\mathcal B}_{D^+\to ρ^0 μ^+ν_μ}/{\mathcal B}_{D^+\to ρ^0 e^+ν_e}=0.88\pm0.10$ and ${\mathcal B}_{D^+\to f_0(500) μ^+ν_μ}/{\mathcal B}_{D^+\to f_0(500) e^+ν_e}=1.14\pm0.28$, which are compatible with the standard model expectation.

Submitted 4 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

Comments: Supplemental Materials added in this version

Report number: BAM-00660

arXiv:2401.02982 [pdf, other]

FinDABench: Benchmarking Financial Data Analysis Ability of Large Language Models

Authors: Shu Liu, Shangqing Zhao, Chenghao Jia, Xinlin Zhuang, Zhaoguang Long, Jie Zhou, Aimin Zhou, Man Lan, Qingquan Wu, Chong Yang

Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities across a wide range of tasks. However, their proficiency and reliability in the specialized domain of financial data analysis, particularly focusing on data-driven thinking, remain uncertain. To bridge this gap, we introduce FinDABench, a comprehensive benchmark designed to evaluate the financial data analysis capabilities of LLMs within this context. FinDABench assesses LLMs across three dimensions: 1) Foundational Ability, evaluating the models' ability to perform financial numerical calculation and corporate sentiment risk assessment; 2) Reasoning Ability, determining the models' ability to quickly comprehend textual information and analyze abnormal financial reports; and 3) Technical Skill, examining the models' use of technical knowledge to address real-world data analysis challenges involving analysis generation and charts visualization from multiple perspectives. We will release FinDABench and the evaluation scripts at https://github.com/cubenlp/BIBench. FinDABench aims to provide a measure for in-depth analysis of LLM abilities and foster the advancement of LLMs in the field of financial data analysis.

Submitted 14 June, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

arXiv:2401.02141 [pdf, other]

Bayesian Intrinsic Groupwise Image Registration: Unsupervised Disentanglement of Anatomy and Geometry

Authors: Xinzhe Luo, Xin Wang, Linda Shapiro, Chun Yuan, Jianfeng Feng, Xiahai Zhuang

Abstract: This article presents a general Bayesian learning framework for multi-modal groupwise registration on medical images. The method builds on probabilistic modelling of the image generative process, where the underlying common anatomy and geometric variations of the observed images are explicitly disentangled as latent variables. Thus, groupwise registration is achieved through the solution to Bayesian inference. We propose a novel hierarchical variational auto-encoding architecture to realize the inference procedure of the latent variables, where the registration parameters can be calculated in a mathematically interpretable fashion. Remarkably, this new paradigm can learn groupwise registration in an unsupervised closed-loop self-reconstruction process, sparing the burden of designing complex intensity-based similarity measures. The computationally efficient disentangled architecture is also inherently scalable and flexible, allowing for groupwise registration on large-scale image groups with variable sizes. Furthermore, the inferred structural representations from disentanglement learning are capable of capturing the latent anatomy of the observations with visual semantics. Extensive experiments were conducted to validate the proposed framework, including four datasets from cardiac, brain and abdominal medical images. The results have demonstrated the superiority of our method over conventional similarity-based approaches in terms of accuracy, efficiency, scalability and interpretability.

Submitted 4 January, 2024; originally announced January 2024.

arXiv:2312.05556 [pdf, ps, other]

Second-order computational homogenization of flexoelectric composites

Authors: Xiaoying Zhuang, Bin Li, S. S. Nanthakumar, Thomas Bohlke

Abstract: Flexoelectricity shows promising applications for self-powered devices with its increased power density. This paper presents a second-order computational homogenization strategy for flexoelectric composites. The macro-micro scale transition, Hill-Mandel energy condition, periodic boundary conditions, and macroscopic constitutive tangents for the two-scale electromechanical coupling are investigated and considered in the homogenization formulation. The macrostructure and microstructure are discretized using $C^1$ triangular finite elements. The second-order multiscale solution scheme is implemented using ABAQUS with user subroutines. Finally, we present numerical examples including parametric analysis of a square plate with holes and the design of piezoelectric materials made of non-piezoelectric materials to demonstrate the numerical implementation and the size-dependent effects of flexoelectricity.

Submitted 9 December, 2023; originally announced December 2023.

arXiv:2311.18333 [pdf, other]

Spherical Designs for Function Approximation and Beyond

Authors: Yuchen Xiao, Xiaosheng Zhuang

Abstract: In this paper, we compare two optimization algorithms using full Hessian and approximation Hessian to obtain numerical spherical designs through their variational characterization. Based on the obtained spherical design point sets, we investigate the approximation of smooth and non-smooth functions by spherical harmonics with spherical designs. Finally, we use spherical framelets for denoising Wendland functions as an application, which shows the great potential of spherical designs in spherical data processing.

Submitted 30 November, 2023; originally announced November 2023.

Comments: 29 pages, 9 figures, 7 tables

MSC Class: 42C05; 58C35; 65K10; 65D15; 65D32

arXiv:2311.16879 [pdf]

Understanding the role of rock heterogeneity in controlling fault strength and stability

Authors: Shaobo Han, Xiaoying Zhuang, Quanzhou Yao, Qianlong Zhou, Xiaodong Hu

Abstract: Rock heterogeneity exists widely in fault zones; however, the intrinsic mechanism of how it affects the mechanical behavior of faults is poorly understood. To develop a quantitative understanding of the effect of rock heterogeneity on the strength and stability of faults, here we investigate a pore-pressure model based on rate-and-state friction in the manner of two-degree-of-freedom spring-sliders and analyze the causes of fault weakening and the conditions of frictional instability by carrying out nonlinear simulations and a linear stability analysis. We find that the strength of heterogeneous faults depends largely on the compaction difference (or differential compaction) between the two gouges (e.g. quartz and clay), and the stability is affected by the proportion of the two gouge patches. Our model implies that rock heterogeneity is likely to weaken faults and reduce the stability of faults.

Submitted 28 November, 2023; originally announced November 2023.

arXiv:2311.06120 [pdf]

Converse Flexoelectricity of Low-Dimensional Bismuth Selenite (Bi2Se3) Revealed by Piezoresponse Force Microscopy (PFM)

Authors: Qiong Liu, S. S. Nanthakumar, Bin Li, Teresa Cheng, Florian Bittner, Chenxi Ma, Fei Ding, Lei Zheng, Bernhard Roth, Xiaoying Zhuang

Abstract: Many kinds of two-dimensional (2D) van der Waals (vdW) materials have been demonstrated to exhibit electromechanical coupling effects, which makes them promising candidates for next-generation devices, such as piezotronics and nanogenerators. Recently, flexoelectricity was found to account for the out-of-plane electromechanical coupling in many 2D transition metal dichalcogenides (TMDs) that only exhibit in-plane piezoelectricity. However, low dimensional vdW three-dimensional (3D) topological insulators (TIs) have been overlooked regarding their electromechanical properties. In this study, for the first time, we experimentally investigate the electromechanical coupling of low dimensional 3D TIs with a centrosymmetric crystal structure, where a binary compound, bismuth selenite (Bi2Se3), is taken as an example. The results of piezoresponse force microscope (PFM) tests on the Bi2Se3 nanoflakes show that the material exhibits both out-of-plane and in-plane electromechanical responses. The Bi2Se3 nanoflake with a thickness of 37 nm possesses an effective out-of-plane piezoelectric coefficient of ~0.65 pm V-1. With careful analyses, the electromechanical responses are verified to arise from the converse flexoelectricity. The measured effective out-of-plane piezoelectric coefficient is mainly contributed by the flexoelectric coefficient, μ_39, which is estimated to be approximately 0.13 nC m-1. However, it is rather difficult to obtain the in-plane component of the flexoelectric tensor from the in-plane PFM measurements since the direction of the in-plane stress is always not normal to the AFM cantilever axis. The results provide useful guidance for understanding the flexoelectric effect of low dimensional vdW materials with centrosymmetric crystal structures. Moreover, the work can pave the way to explore electromechanical devices based on the flexoelectricity of vdW TIs.

Submitted 10 November, 2023; originally announced November 2023.

Comments: 6 figures

arXiv:2311.05323 [pdf, other]

Spatial Attention-based Distribution Integration Network for Human Pose Estimation

Authors: Sihan Gao, Jing Zhu, Xiaoxuan Zhuang, Zhaoyue Wang, Qijin Li

Abstract: In recent years, human pose estimation has made significant progress through the implementation of deep learning techniques. However, these techniques still face limitations when confronted with challenging scenarios, including occlusion, diverse appearances, variations in illumination, and overlap. To cope with such drawbacks, we present the Spatial Attention-based Distribution Integration Network (SADI-NET) to improve the accuracy of localization in such situations. Our network consists of three efficient modules: the receptive fortified module (RFM), spatial fusion module (SFM), and distribution learning module (DLM). Building upon the classic HourglassNet architecture, we replace the basic block with our proposed RFM. The RFM incorporates a dilated residual block and attention mechanism to expand receptive fields while enhancing sensitivity to spatial information. In addition, the SFM incorporates multi-scale characteristics by employing both global and local attention mechanisms. Furthermore, the DLM, inspired by residual log-likelihood estimation (RLE), reconfigures a predicted heatmap using a trainable distribution weight. For the purpose of determining the efficacy of our model, we conducted extensive experiments on the MPII and LSP benchmarks. Particularly, our model obtained a remarkable $92.10\%$ accuracy on the MPII test dataset, demonstrating significant improvements over existing models and establishing state-of-the-art performance.

Submitted 9 November, 2023; originally announced November 2023.

arXiv:2310.19485 [pdf]

doi 10.1016/j.flatc.2023.100575

Anomalous tensile strength and thermal expansion, and low thermal conductivity in wide band gap boron monoxide monolayer

Authors: Bohayra Mortazavi, Fazel Shojaei, Fei Ding, Xiaoying Zhuang

Abstract: Most recently the formation of boron monoxide (BO) in the two-dimensional (2D) form has been confirmed experimentally (J. Am. Chem. Soc. 2023, 145, 14660). Motivated by the aforementioned finding, herein we theoretically explore the key physical properties of the single-layer and suspended BO. Density functional theory (DFT) results reveal that BO monolayer yields a large indirect band gap of 3.78 (2.18) eV on the basis of HSE06(PBE) functional. Ab-initio molecular dynamics results reveal the remarkable thermal stability of the BO monolayer at 1000 K. The thermal and mechanical properties at room temperature are furthermore investigated using a machine learning interatomic potential (MLIP). The developed MLIP-based model close to the ground state could very precisely reproduce the DFT predictions for the mechanical properties of the BO monolayer. The elastic modulus, tensile strength and lattice thermal conductivity of the BO monolayer at room temperature are predicted to be 107 GPa, 25 GPa and 5.6 W/mK, respectively. At room temperature the BO monolayer is notably predicted to yield an ultrahigh negative thermal expansion coefficient, almost 17-fold larger than that of single-layer graphene. The presented results reveal the large indirect electronic band gap, decent thermal and dynamical stability, anomalously low elastic modulus to tensile strength ratio, ultrahigh negative thermal expansion coefficients and low lattice thermal conductivity of the BO monolayer.

Submitted 30 October, 2023; originally announced October 2023.

Journal ref: FlatChem 2023

arXiv:2310.17016 [pdf]

Boosting output performance of contact-separation mode triboelectric nanogenerators by adopting discontinuity and fringing effect: experiment and modelling studies

Authors: Teresa Cheng, Han Hu, Navid Valizadeh, Qiong Liu, Florian Bittner, Ling Yang, Timon Rabczuk, Xiaoning Jiang, Xiaoying Zhuang

Abstract: Triboelectric nanogenerators (TENGs) are promising self-powering supplies for a diverse range of intelligent sensing and monitoring devices, especially due to their capability of harvesting electric energy from low frequency and small-scale mechanical motions. Inspired by the fact that contact-separation mode TENGs with small contact areas harvest high electrical outputs due to the fringing effect, this study employed discontinuity on the dielectric side of contact-separation mode TENGs to promote fringing electric fields for the enhancement of electrical outputs. The results reveal that the TENGs with more discontinuities show higher overall electric performance. Compared to pristine TENGs, the TENGs with cross discontinuities increased the surface charge by 50% and the power density by 114%. However, one should avoid generating discontinuities on the tribonegative side of TENGs using a metal blade within a positive-ion atmosphere due to the neutralization through the electrically conductive metal blade. The computational simulation validated that the TENGs with discontinuities obtained higher electrical outputs, and further investigated the effect of discontinuity gap size and array distance on TENG performance. This study has provided a promising method for the future design of TENGs using discontinuous structures.

Submitted 25 October, 2023; originally announced October 2023.

Comments: 23 pages, 8 figures

arXiv:2310.14170 [pdf, other]

Learning Invariant Molecular Representation in Latent Discrete Space

Authors: Xiang Zhuang, Qiang Zhang, Keyan Ding, Yatao Bian, Xiao Wang, Jingsong Lv, Hongyang Chen, Huajun Chen

Abstract: Molecular representation learning lays the foundation for drug discovery. However, existing methods suffer from poor out-of-distribution (OOD) generalization, particularly when data for training and testing originate from different environments. To address this issue, we propose a new framework for learning molecular representations that exhibit invariance and robustness against distribution shifts. Specifically, we propose a strategy called "first-encoding-then-separation" to identify invariant molecule features in the latent space, which deviates from conventional practices. Prior to the separation step, we introduce a residual vector quantization module that mitigates the over-fitting to training data distributions while preserving the expressivity of encoders. Furthermore, we design a task-agnostic self-supervised learning objective to encourage precise invariance identification, which makes our method widely applicable to a variety of tasks, such as regression and multi-label classification. Extensive experiments on 18 real-world molecular datasets demonstrate that our model achieves stronger generalization against state-of-the-art baselines in the presence of various distribution shifts. Our code is available at https://github.com/HICAI-ZJU/iMoLD.

Submitted 22 October, 2023; originally announced October 2023.

arXiv:2310.03269 [pdf, other]

InstructProtein: Aligning Human and Protein Language via Knowledge Instruction

Authors: Zeyuan Wang, Qiang Zhang, Keyan Ding, Ming Qin, Xiang Zhuang, Xiaotong Li, Huajun Chen

Abstract: Large Language Models (LLMs) have revolutionized the field of natural language processing, but they fall short in comprehending biological sequences such as proteins. To address this challenge, we propose InstructProtein, an innovative LLM that possesses bidirectional generation capabilities in both human and protein languages: (i) taking a protein sequence as input to predict its textual function description and (ii) using natural language to prompt protein sequence generation. To achieve this, we first pre-train an LLM on both protein and natural language corpora, enabling it to comprehend individual languages. Then supervised instruction tuning is employed to facilitate the alignment of these two distinct languages. Herein, we introduce a knowledge graph-based instruction generation framework to construct a high-quality instruction dataset, addressing annotation imbalance and instruction deficits in existing protein-text corpus. In particular, the instructions inherit the structural relations between proteins and function annotations in knowledge graphs, which empowers our model to engage in the causal modeling of protein functions, akin to the chain-of-thought processes in natural languages. Extensive experiments on bidirectional protein-text generation tasks show that InstructProtein outperforms state-of-the-art LLMs by large margins. Moreover, InstructProtein serves as a pioneering step towards text-based protein function prediction and sequence design, effectively bridging the gap between protein and human language understanding.

Submitted 4 October, 2023; originally announced October 2023.

arXiv:2310.00180 [pdf, other]

MARL: Multi-scale Archetype Representation Learning for Urban Building Energy Modeling

Authors: Xinwei Zhuang, Zixun Huang, Wentao Zeng, Luisa Caldas

Abstract: Building archetypes, representative models of building stock, are crucial for precise energy simulations in Urban Building Energy Modeling. The current widely adopted building archetypes are developed on a nationwide scale, potentially neglecting the impact of local buildings' geometric specificities. We present Multi-scale Archetype Representation Learning (MARL), an approach that leverages representation learning to extract geometric features from a specific building stock. Built upon VQ-AE, MARL encodes building footprints and purifies geometric information into latent vectors constrained by multiple architectural downstream tasks. These tailored representations are proven valuable for further clustering and building energy modeling. The advantages of our algorithm are its adaptability with respect to the different building footprint sizes, the ability for automatic generation across multi-scale regions, and the preservation of geometric features across neighborhoods and local ecologies. In our study spanning five regions in LA County, we show MARL surpasses both conventional and VQ-AE extracted archetypes in performance. Results demonstrate that geometric feature embeddings significantly improve the accuracy and reliability of energy consumption estimates. Code, dataset and trained models are publicly available: https://github.com/ZixunHuang1997/MARL-BuildingEnergyEstimation

Submitted 29 September, 2023; originally announced October 2023.

Comments: *Equal Contribution

arXiv:2309.10836 [pdf, other]

CMRxRecon: An open cardiac MRI dataset for the competition of accelerated image reconstruction

Authors: Chengyan Wang, Jun Lyu, Shuo Wang, Chen Qin, Kunyuan Guo, Xinyu Zhang, Xiaotong Yu, Yan Li, Fanwen Wang, Jianhua Jin, Zhang Shi, Ziqiang Xu, Yapeng Tian, Sha Hua, Zhensen Chen, Meng Liu, Mengting Sun, Xutong Kuang, Kang Wang, Haoran Wang, Hao Li, Yinghua Chu, Guang Yang, Wenjia Bai, Xiahai Zhuang, et al. (3 additional authors not shown)

Abstract: Cardiac magnetic resonance imaging (CMR) has emerged as a valuable diagnostic tool for cardiac diseases. However, a limitation of CMR is its slow imaging speed, which causes patient discomfort and introduces artifacts in the images. There has been growing interest in deep learning-based CMR imaging algorithms that can reconstruct high-quality images from highly under-sampled k-space data. However, the development of deep learning methods requires large training datasets, which have not been publicly available for CMR. To address this gap, we released a dataset that includes multi-contrast, multi-view, multi-slice and multi-coil CMR imaging data from 300 subjects. Imaging studies include cardiac cine and mapping sequences. Manual segmentations of the myocardium and chambers of all the subjects are also provided within the dataset. Scripts of state-of-the-art reconstruction algorithms were also provided as a point of reference. Our aim is to facilitate the advancement of state-of-the-art CMR image reconstruction by introducing standardized evaluation criteria and making the dataset freely accessible to the research community. Researchers can access the dataset at https://www.synapse.org/#!Synapse:syn51471091/wiki/.

Submitted 19 September, 2023; originally announced September 2023.

Comments: 14 pages, 8 figures

arXiv:2309.08584 [pdf, other]

doi 10.1016/j.tafmec.2020.102523

Phase field method for quasi-static hydro-fracture in porous media under stress boundary condition considering the effect of initial stress field

Authors: Shuwei Zhou, Xiaoying Zhuang, Timon Rabczuk

Abstract: Phase field model (PFM) is an efficient fracture modeling method and has high potential for hydraulic fracturing (HF). However, the current PFMs in HF do not consider well the effect of in-situ stress field and the numerical examples of porous media with stress boundary conditions were rarely presented. The main reason is that if the remote stress is applied on the boundaries of the calculation domain, there will be relatively large deformation induced on these stress boundaries, which is not consistent with the engineering observations. To eliminate this limitation, this paper proposes a new phase field method to describe quasi-static hydraulic fracture propagation in porous media subjected to stress boundary conditions, and the new method is more in line with engineering practice. A new energy functional, which considers the effect of initial in-situ stress field, is established and then it is used to achieve the governing equations for the displacement and phase fields through the variational approach. Biot poroelasticity theory is used to couple the fluid pressure field and the displacement field while the phase field is used for determining the fluid properties from the intact domain to the fully broken domain. In addition, we present several 2D and 3D examples to show the effects of in-situ stress on hydraulic fracture propagation. The numerical examples indicate that under stress boundary condition our approach obtains correct displacement distribution and it is capable of capturing complex hydraulic fracture growth patterns.

Submitted 11 July, 2023; originally announced September 2023.

Journal ref: Theoretical and Applied Fracture Mechanics, 2020, 107: 102523

arXiv:2309.08579 [pdf, ps, other]

Polytopal composite finite elements for modeling concrete fracture based on nonlocal damage models

Authors: Hai D. Huynh, S. Natarajan, H. Nguyen-Xuan, Xiaoying Zhuang

Abstract: The paper presents an assumed strain formulation over polygonal meshes to accurately evaluate the strain fields in nonlocal damage models. An assumed strain technique based on the Hu-Washizu variational principle is employed to generate a new strain approximation instead of direct derivation from the basis functions and the displacement fields. The underlying idea embedded in arbitrary finite polygons is named Polytopal composite finite elements (PCFEM). The PCFEM is accordingly applied within the framework of the nonlocal model of continuum damage mechanics to enhance the description of damage behaviours in which highly localized deformations must be captured accurately. This application is helpful to reduce the mesh-sensitivity and elaborate the process-zone of damage models. Several numerical examples are designed for various cases of fracture to discuss and validate the computational capability of the present method through comparison with published numerical results and experimental data from the literature.

Submitted 11 July, 2023; originally announced September 2023.

arXiv:2309.04000 [pdf, other]

doi 10.1007/s11440-020-00913-z

Phase field modeling of hydraulic fracture propagation in transversely isotropic poroelastic media

Authors: Shuwei Zhou, Xiaoying Zhuang

Abstract: This paper proposes a phase field model (PFM) for describing hydraulic fracture propagation in transversely isotropic media. The coupling between the fluid flow and displacement fields is established according to the classical Biot poroelasticity theory while the phase field model characterizes the fracture behavior. The proposed method uses a transversely isotropic constitutive relationship between stress and strain as well as anisotropy in fracture toughness and permeability. An additional pressure-related term and an anisotropic fracture toughness tensor are added in the energy functional, which is then used to obtain the governing equations of strong form via the variational approach. In addition, the phase field is used to construct indicator functions that transit the fluid property from the intact domain to the fully fractured one. Moreover, the proposed PFM is implemented using the finite element method where a staggered scheme is applied and the displacement and fluid pressure are monolithically solved in a staggered step. Afterwards, two examples are tested to initially verify the proposed PFM: a transversely isotropic single-edge-notched square plate subjected to tension and an isotropic porous medium subjected to internal fluid pressure. Finally, numerical examples of 2D and 3D transversely isotropic media with one or two interior notches subjected to internal fluid pressure are presented to further prove the capability of the proposed PFM in 2D and 3D problems.

Submitted 11 July, 2023; originally announced September 2023.

Journal ref: Acta Geotechnica, 2020, 15(9): 2599-2618

arXiv:2309.03996 [pdf, other]

doi 10.1016/j.engfracmech.2022.108234

Phase field modeling and computer implementation: A review

Authors: X. Zhuang, S. Zhou, G. D. Huynh, P. Areias, T. Rabczuk

Abstract: This paper presents an overview of the theories and computer implementation aspects of phase field models (PFM) of fracture. The advantage of PFM over discontinuous approaches to fracture is that PFM can elegantly simulate complicated fracture processes including fracture initiation, propagation, coalescence, and branching by using only a scalar field, the phase field. In addition, fracture is a natural outcome of the simulation and obtained through the solution of an additional differential equation related to the phase field. No extra fracture criteria are needed and an explicit representation of a crack surface as well as complex crack tracking procedures are avoided in PFM for fracture, which in turn dramatically facilitates the implementation. The PFM is thermodynamically consistent and can be easily extended to multi-physics problems by 'changing' the energy functional accordingly. Besides an overview of different PFMs, we also present comparative numerical benchmark examples to show the capability of PFMs.

Submitted 30 August, 2023; originally announced September 2023.

Journal ref: Engineering Fracture Mechanics, 2022, 262: 108234

arXiv:2309.03909 [pdf, other]

doi 10.1061/(ASCE)GM.1943-5622.0001930

Phase Field Characterization of Rock Fractures in Brazilian Splitting Test Specimens Containing Voids and Inclusions

Authors: Shuwei Zhou, Xiaoying Zhuang, Jiaming Zhou, Fang Liu

Abstract: The Brazilian splitting test is a widely used testing procedure for characterizing the tensile strength of natural rock or rock-like material due to the fact. However, the results of Brazilian tests on specimens with naturally existing voids and inclusions are strongly influenced by size effects and boundary conditions, while numerical modeling can assist in explaining and understanding the mechanisms. On the other hand, the potential of utilizing Brazilian test to characterize inhomogeneous deformation of rock samples with voids and inclusions of dissimilar materials still awaits to be explored. In the present study, fracture mechanisms in Brazilian discs with circular voids and filled inclusions are investigated by using the phase field model (PFM). The PFM is implemented within the framework of finite element method to study the influence of diameter, eccentricity, and quantity of the voids and inclusions on the fracture patterns and stress-strain curves. The phase field simulations can reproduce previous experimental phenomena and furthermore it deepens the understanding of the influence of inclusion and voids on the fracture pattern, overall strength and deformation behavior of inhomogeneous rock. The findings in the study highlight the potential of characterizing inhomogeneous rock through combining Brazilian tests and numerical modeling.

Submitted 11 July, 2023; originally announced September 2023.

Journal ref: International Journal of Geomechanics, 2021, 21(3): 04021006

arXiv:2309.03537 [pdf, other]

Data-Adaptive Graph Framelets with Generalized Vanishing Moments for Graph Signal Processing

Authors: Ruigang Zheng, Xiaosheng Zhuang

Abstract: In this paper, we propose a novel and general framework to construct tight framelet systems on graphs with localized supports based on hierarchical partitions. Our construction provides parametrized graph framelet systems with great generality based on partition trees, by which we are able to find the size of a low-dimensional subspace that best fits the low-rank structure of a family of signals. The orthogonal decomposition of subspaces provides a key ingredient for the definition of "generalized vanishing moments" for graph framelets. In a data-adaptive setting, the graph framelet systems can be learned by solving an optimization problem on Stiefel manifolds with respect to our parameterization. Moreover, such graph framelet systems can be further improved by solving a subsequent optimization problem on Stiefel manifolds, aiming at providing the utmost sparsity for a given family of graph signals. Experimental results show that our learned graph framelet systems perform superiorly in non-linear approximation and denoising tasks.

Submitted 30 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

MSC Class: 43A99; 41A45; 94A11; 94A16

arXiv:2309.02438 [pdf, other]

doi 10.1016/j.cma.2019.03.001

Phase-field modeling of fluid-driven dynamic cracking in porous media

Authors: Shuwei Zhou, Xiaoying Zhuang, Timon Rabczuk

Abstract: A phase field model for fluid-driven dynamic crack propagation in poroelastic media is proposed. Therefore, classical Biot poroelasticity theory is applied in the porous medium while arbitrary crack growth is naturally captured by the phase field model. We also account for the transition of the fluid property from the intact medium to the fully broken one by employing indicator functions. We employ a staggered scheme and implement our approach into the software package COMSOL Multiphysics. Our approach is first verified through three classical benchmark problems which are compared to analytical solutions for dynamic consolidation and pressure distribution in a single crack and in a specimen with two sets of joints. Subsequently, we present several 2D and 3D examples of dynamic crack branching and their interaction with pre-existing natural fractures. All presented examples demonstrate the capability of the proposed approach of handling dynamic crack propagation, branching and coalescence of fluid-driven fracture.

Submitted 11 July, 2023; originally announced September 2023.

Journal ref: Computer Methods in Applied Mechanics and Engineering, 2019, 350: 169-198

arXiv:2308.11760 [pdf, ps, other]

The Jacobian of a Sixth-Root-of-Unity Matroid

Authors: Matthew Baker, Changxin Ding, Xu Zhuang

Abstract: The Jacobian group (also called the sandpile group, Picard group, or critical group) of a graph or, more generally, of a regular matroid has been well studied. Sixth-root-of-unity matroids, also called complex unimodular matroids, are generalizations of regular matroids. This paper provides a definition, and establishes some basic properties, of the Jacobian group of a sixth-root-of-unity matroid.

Submitted 4 September, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

arXiv:2308.05748 [pdf, other]

doi 10.1016/j.cma.2019.06.021

Phase field modeling of brittle compressive-shear fractures in rock-like materials: A new driving force and a hybrid formulation

Authors: Shuwei Zhou, Xiaoying Zhuang, Timon Rabczuk

Abstract: Compressive-shear fracture is commonly observed in rock-like materials. However, this fracture type cannot be captured by current phase field models (PFMs), which have been proven an effective tool for modeling fracture initiation, propagation, coalescence, and branching in solids. The existing PFMs also cannot describe the influence of cohesion and internal friction angle on load-displacement curve during compression tests. Therefore, to develop a new phase field model that can simulate well compressive-shear fractures in rock-like materials, we construct a new driving force in the evolution equation of phase field. Strain spectral decomposition is applied and only the compressive part of the strain is used in the new driving force with consideration of the influence of cohesion and internal friction angle. For ease of implementation, a hybrid formulation is established for the phase field modeling. Then, we test the brittle compressive-shear fractures in uniaxial compression tests on intact rock-like specimens as well as those with a single or two parallel inclined flaws. All numerical results are in good agreement with the experimental observation, validating the feasibility and practicability of the proposed PFM for simulating brittle compressive-shear fractures.

Submitted 11 July, 2023; originally announced August 2023.

Journal ref: Computer Methods in Applied Mechanics and Engineering, 2019, 355: 729-752

Showing 1–50 of 243 results for author: Zhuang, X