Search | arXiv e-print repository

Provable Preconditioned Plug-and-Play Approach for Compressed Sensing MRI Reconstruction

Authors: Tao Hong, Xiaojian Xu, Jason Hu, Jeffrey A. Fessler

Abstract: Model-based methods play a key role in the reconstruction of compressed sensing (CS) MRI. Finding an effective prior to describe the statistical distribution of the image family of interest is crucial for model-based methods. Plug-and-play (PnP) is a general framework that uses denoising algorithms as the prior or regularizer. Recent work showed that PnP methods with denoisers based on pretrained convolutional neural networks outperform other classical regularizers in CS MRI reconstruction. However, the numerical solvers for PnP can be slow for CS MRI reconstruction. This paper proposes a preconditioned PnP (P^2nP) method to accelerate the convergence speed. Moreover, we provide proofs of the fixed-point convergence of the P^2nP iterates. Numerical experiments on CS MRI reconstruction with non-Cartesian sampling trajectories illustrate the effectiveness and efficiency of the P^2nP approach.

Submitted 6 May, 2024; originally announced May 2024.

Comments: 14 figures, 4 tables

arXiv:2405.03817 [pdf, other]

Search for joint multimessenger signals from potential Galactic PeVatrons with HAWC and IceCube

Authors: R. Alfaro, C. Alvarez, J. C. Arteaga-Velázquez, D. Avila Rojas, H. A. Ayala Solares, R. Babu, E. Belmont-Moreno, K. S. Caballero-Mora, T. Capistrán, A. Carramiñana, S. Casanova, U. Cotti, J. Cotzomi, S. Coutiño de León, E. De la Fuente, D. Depaoli, N. Di Lalla, R. Diaz Hernandez, J. C. Díaz-Vélez, K. Engel, T. Ergin, K. L. Fan, K. Fang, N. Fraija, S. Fraija, et al. (469 additional authors not shown)

Abstract: Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis that combines gamma rays and neutrinos is required. In this study, we use the Multi-Mission Maximum Likelihood framework (3ML) with IceCube Maximum Likelihood Analysis software (i3mla) and HAWC Accelerated Likelihood (HAL) to search for a correlation between 22 known gamma-ray sources from the third HAWC gamma-ray catalog and 14 years of IceCube track-like data. No significant neutrino emission from the direction of the HAWC sources was found. We report the best-fit gamma-ray model and 90% CL neutrino flux limit from the 22 sources. From the neutrino flux limit, we conclude that the gamma-ray emission from five of the sources cannot be produced purely from hadronic interactions. We report the limit for the fraction of gamma rays produced by hadronic interactions for these five sources.

Submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.03814 [pdf, other]

Stochastic behavior of an n-node blockchain under cyber attacks from multiple hackers with random re-setting times

Authors: Xiufeng Xu, Liang Hong

Abstract: This paper investigates the stochastic behavior of an n-node blockchain which is continuously monitored and faces non-stop cyber attacks from multiple hackers. The blockchain will start being re-set once hacking is detected, forfeiting previous efforts of all hackers. It is assumed the re-setting process takes a random amount of time. Multiple independent hackers will keep attempting to hack into the blockchain until one of them succeeds. For arbitrary distributions of the hacking times, detecting times, and re-setting times, we derive the instantaneous functional probability, the limiting functional probability, and the mean functional time of the blockchain. Moreover, we establish that these quantities are increasing functions of the number of nodes, formalizing the intuition that the more nodes a blockchain has the more secure it is.

Submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.03131 [pdf, other]

WDMoE: Wireless Distributed Large Language Models with Mixture of Experts

Authors: Nan Xue, Yaping Sun, Zhiyong Chen, Meixia Tao, Xiaodong Xu, Liang Qian, Shuguang Cui, Ping Zhang

Abstract: Large Language Models (LLMs) have achieved significant success in various natural language processing tasks, but how wireless communications can support LLMs has not been extensively studied. In this paper, we propose a wireless distributed LLMs paradigm based on Mixture of Experts (MoE), named WDMoE, deploying LLMs collaboratively across edge servers of base station (BS) and mobile devices in the wireless communications system. Specifically, we decompose the MoE layer in LLMs by deploying the gating network and the preceding neural network layer at BS, while distributing the expert networks across the devices. This arrangement leverages the parallel capabilities of expert networks on distributed devices. Moreover, to overcome the instability of wireless communications, we design an expert selection policy by taking into account both the performance of the model and the end-to-end latency, which includes both transmission delay and inference delay. Evaluations conducted across various LLMs and multiple datasets demonstrate that WDMoE not only outperforms existing models, such as Llama 2 with 70 billion parameters, but also significantly reduces end-to-end latency.

Submitted 5 May, 2024; originally announced May 2024.

Comments: submitted to IEEE conference

arXiv:2405.03125 [pdf, other]

MambaJSCC: Deep Joint Source-Channel Coding with Visual State Space Model

Authors: Tong Wu, Zhiyong Chen, Meixia Tao, Xiaodong Xu, Wenjun Zhang, Ping Zhang

Abstract: Lightweight and efficient deep joint source-channel coding (JSCC) is a key technology for semantic communications. In this paper, we design a novel JSCC scheme named MambaJSCC, which utilizes a visual state space model with channel adaptation (VSSM-CA) block as its backbone for transmitting images over wireless channels. The VSSM-CA block utilizes VSSM to integrate two-dimensional images with the state space, enabling feature extraction and encoding processes to operate with linear complexity. It also incorporates channel state information (CSI) via a newly proposed CSI embedding method. This method deploys a shared CSI encoding module within both the encoder and decoder to encode and inject the CSI into each VSSM-CA block, improving the adaptability of a single model to varying channel conditions. Experimental results show that MambaJSCC not only outperforms Swin Transformer based JSCC (SwinJSCC) but also significantly reduces parameter size, computational overhead, and inference delay (ID). For example, when employing an equal number of VSSM-CA blocks and Swin Transformer blocks, MambaJSCC achieves a 0.48 dB gain in peak signal-to-noise ratio (PSNR) over SwinJSCC while requiring only 53.3% of the multiply-accumulate operations, 53.8% of the parameters, and 44.9% of the ID.

Submitted 5 May, 2024; originally announced May 2024.

Comments: submitted to IEEE conference

arXiv:2405.03026 [pdf, other]

Enhanced Detection Classification via Clustering SVM for Various Robot Collaboration Task

Authors: Rui Liu, Xuanzhen Xu, Yuwei Shen, Armando Zhu, Chang Yu, Tianjian Chen, Ye Zhang

Abstract: We introduce an advanced, swift pattern recognition strategy for various multiple robotics during curve negotiation. This method, leveraging a sophisticated k-means clustering-enhanced Support Vector Machine algorithm, distinctly categorizes robotics into flying or mobile robots. Initially, the paradigm considers robot locations and features as quintessential parameters indicative of divergent robot patterns. Subsequently, employing the k-means clustering technique facilitates the efficient segregation and consolidation of robotic data, significantly optimizing the support vector delineation process and expediting the recognition phase. Following this preparatory phase, the SVM methodology is adeptly applied to construct a discriminative hyperplane, enabling precise classification and prognostication of the robot category. To substantiate the efficacy and superiority of the k-means framework over traditional SVM approaches, a rigorous cross-validation experiment was orchestrated, evidencing the former's enhanced performance in robot group classification.

Submitted 5 May, 2024; originally announced May 2024.

Comments: This paper has been received by CISCE 2024 Conference

arXiv:2405.02978 [pdf]

doi 10.3847/1538-4357/ad31a6

Four-hundred Very Metal-poor Stars Studied with LAMOST and Subaru. III. Dynamically Tagged Groups and Chemodynamical Properties

Authors: Ruizhi Zhang, Tadafumi Matsuno, Haining Li, Wako Aoki, Xiang-Xiang Xue, Takuma Suda, Gang Zhao, Yuqin Chen, Miho N. Ishigaki, Jianrong Shi, Qianfan Xing, Jingkun Zhao

Abstract: Very metal-poor (VMP) stars record the signatures of early accreted galaxies, making them essential tools for unraveling the early stages of Galaxy formation. Understanding the origin of VMP stars requires comprehensive studies of their chemical compositions and kinematics, which are currently lacking. Hence, we conduct a chemodynamical analysis of 352 VMP stars selected from one of the largest uniform high-resolution VMP star samples, jointly obtained from LAMOST and Subaru. We apply a friends-of-friends clustering algorithm to the master catalog of this high-resolution sample, which consists of 5778 VMP stars. It results in 131 dynamically tagged groups with 89 associated with known substructures in the Milky Way, including Gaia-Sausage-Enceladus (GSE), Thamnos, Helmi streams, Sequoia, Wukong, Pontus, and the very metal-poor disk (VMPD). Our findings are: (i) the VMPD shows lower Zn abundances than the rest, which indicates that it could be a relic of small stellar systems; (ii) Sequoia shows moderately high r-process abundances; (iii) Helmi streams show deficiencies in carbon and light neutron-capture elements; (iv) the fraction of carbon-enhanced metal-poor stars with no enhancement in heavy elements (CEMP-no stars) seems low in the VMPD and the Helmi streams; and (v) a subgroup in GSE exhibits a very high fraction of r-process enhanced stars, with four out of five showing [Eu/Fe] > +1.0. The abundance patterns of other elements in VMP substructures largely match the whole VMP sample. We also study large-scale correlations between abundance ratios and kinematics without classifying stars into substructures, but it does not yield significant correlations once the overall chemical evolution is considered for most elements.

Submitted 5 May, 2024; originally announced May 2024.

Comments: 30 pages, 24 figures, published in ApJ

arXiv:2405.02940 [pdf]

Spherulite-enhanced Macroscopic Polarization in Molecular Ferroelectric Films from Vacuum Deposition

Authors: Bibek Tiwari, Yuanyuan Ni, Jackson Savage, Ellen Daugherty, Bharat Giri, Xin Li, Xiaoshan Xu

Abstract: Proton-transfer type molecular ferroelectrics hold great application potential due to their large spontaneous polarizations, high Curie temperatures, and small switching fields. However, it is puzzling that preparation of quasi-2D films with macroscopic ferroelectric behaviors has only been reported in a few molecular ferroelectrics. To resolve this puzzle, we studied the effect of microstructures on macroscopic ferroelectric properties of 5,6-Dichloro-2-methylbenzimidazole (DC-MBI) films grown using the low-temperature deposition followed by restrained crystallization (LDRC) method. We revealed a competition between dense spherulites and porous microstructures containing randomly oriented nanograins in as-grown films. Post-growth annealing at moderate temperature promotes the formation of spherulites, which leads to macroscopic ferroelectric polarization switching. These results highlight microstructure density as a critical factor for macroscopic ferroelectric properties, potentially resolving the puzzle of the absence of macroscopic ferroelectric behavior in molecular ferroelectric films. We expect the approach for enhancing microstructure density offered in this work to greatly advance fabrication of quasi-2D molecular ferroelectric films and to unlock their potential in device applications.

Submitted 5 May, 2024; originally announced May 2024.

arXiv:2405.02549 [pdf, other]

Defect-Assisted Domain Nucleation Drives Unique Exchange Bias Phenomena in $\bf{MnBi_2Te_4}$

Authors: Shiqi Yang, Xiaolong Xu, Yuchen Gao, Roger Guzman, Pingfan Gu, Huan Wang, Yuan Huang, Wu Zhou, Tianlong Xia, Yu Ye

Abstract: The study of the mechanism of exchange bias phenomena and the achievement of its efficient control are of great importance, as it promotes the revelation of unique exchange interactions and the development of exotic applications. However, it is challenging due to the elusive interface between magnetic phases. In this study, we report an unprecedented exchange bias phenomenon observed in ultrathin uncompensated antiferromagnetic MnBi$_2$Te$_4$. The magnitude and direction of the exchange field can be intentionally controlled by designing a magnetic field sweep protocol without a field cooling process. The combined experimental and theoretical simulation results indicate that the spin-flip process assisted by the ubiquitous defect-induced pinning domain sites with varying inner exchange interactions might give rise to the emergence and robustness of this peculiar exchange bias. The temperature and thickness dependence of the exchange bias phenomena are systematically investigated for further study and exploitation of its unique properties. This mechanism holds promise for highly tunable exchange bias in prevalent magnetic systems by engineering the properties of domain structures, and also offers promising avenues for the design of spintronic devices combining its topology based on MnBi$_2$Te$_4$.

Submitted 3 May, 2024; originally announced May 2024.

Comments: 8 pages, 4 figures

arXiv:2405.02335 [pdf, other]

sDAC -- Semantic Digital Analog Converter for Semantic Communications

Authors: Zhicheng Bao, Chen Dong, Xiaodong Xu

Abstract: In this paper, we propose a novel semantic digital analog converter (sDAC) for the compatibility of semantic communications and digital communications. Most of the current semantic communication systems are based on analog modulations, ignoring their incorporation with digital communication systems, which are more common in practice. In fact, quantization methods in traditional communication systems are not appropriate for use in the era of semantic communication as these methods do not consider the semantic information inside symbols. In this case, any bit flip caused by channel noise can lead to a great performance drop. To address this challenge, sDAC is proposed. It is a simple yet efficient and generative module used to realize digital and analog bi-directional conversion. On the transmitter side, continuous values from the encoder are converted to binary bits and then can be modulated by any existing methods. After transmitting through the noisy channel, these bits get demodulated by paired methods and converted back to continuous values for further semantic decoding. The whole process does not depend on any specific semantic model, modulation methods, or channel conditions. In the experiment section, the performance of sDAC is tested across different semantic models, semantic tasks, modulation methods, channel conditions and quantization orders. Test results show that the proposed sDAC has great generative properties and channel robustness.

Submitted 26 April, 2024; originally announced May 2024.

arXiv:2405.01982 [pdf, ps, other]

Global regularity for solutions of magnetohydrodynamic equations with large initial data

Authors: Xiangsheng Xu

Abstract: We study the existence of a strong solution to the initial value problem for the magnetohydrodynamic equations in $\mathbb{R}^N, N\geq 3$. We obtain a global in-time strong solution without any smallness assumptions on the initial data.

Submitted 9 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2404.16433

arXiv:2405.01847 [pdf, other]

A Model-based Multi-Agent Personalized Short-Video Recommender System

Authors: Peilun Zhou, Xiaoxiao Xu, Lantao Hu, Han Li, Peng Jiang

Abstract: A recommender selects and presents top-K items to the user at each online request, and a recommendation session consists of several sequential requests. Formulating a recommendation session as a Markov decision process and solving it with a reinforcement learning (RL) framework has attracted increasing attention from both academic and industry communities. In this paper, we propose an RL-based industrial short-video recommender ranking framework, which models and maximizes user watch-time in an environment of user multi-aspect preferences by a collaborative multi-agent formulization. Moreover, our proposed framework adopts a model-based learning approach to alleviate the sample selection bias, which is a crucial but intractable problem in industrial recommender systems. Extensive offline evaluations and live experiments confirm the effectiveness of our proposed method over alternatives. Our proposed approach has been deployed in our real large-scale short-video sharing platform, successfully serving hundreds of millions of users.

Submitted 3 May, 2024; originally announced May 2024.

arXiv:2405.01520 [pdf]

AI for Manufacturing and Healthcare: a chemistry and engineering perspective

Authors: Jihua Chen, Yue Yuan, Amir Koushyar Ziabari, Xuan Xu, Honghai Zhang, Panagiotis Christakopoulos, Peter V. Bonnesen, Ilia N. Ivanov, Panchapakesan Ganesh, Chen Wang, Karen Patino Jaimes, Guang Yang, Rajeev Kumar, Bobby G. Sumpter, Rigoberto Advincula

Abstract: Artificial Intelligence (AI) approaches are increasingly being applied to more and more domains of Science, Engineering, Chemistry, and Industries to not only improve efficiencies and enhance productivity, but also enable new capabilities. The new opportunities range from automated molecule design and screening, properties prediction, gaining insights of chemical reactions, to computer-aided design, predictive maintenance of systems, robotics, and autonomous vehicles. This review focuses on the new applications of AI in manufacturing and healthcare. For the Manufacturing Industries, we focus on AI and algorithms for (1) Battery, (2) Flow Chemistry, (3) Additive Manufacturing, (4) Sensors, and (5) Machine Vision. For Healthcare applications, we focus on: (1) Medical Vision, (2) Diagnosis, (3) Protein Design, and (4) Drug Discovery. In the end, related topics are discussed, including physics integrated machine learning, model explainability, security, and governance during model deployment.

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2405.01518 [pdf, other]

Driven Multiphoton Qubit-Resonator Interactions

Authors: Mohammad Ayyash, Xicheng Xu, Sahel Ashhab, M. Mariantoni

Abstract: We develop a general theory for multiphoton qubit-resonator interactions enhanced by a qubit drive. The interactions generate qubit-conditional operations in the resonator when the driving is near $n$-photon cross-resonance, namely, the qubit drive is $n$-times the resonator frequency. We pay special attention to the strong driving regime, where the interactions are conditioned on the qubit dressed states. We consider the specific case where $n=2$, which results in qubit-conditional squeezing (QCS). We propose to use the QCS protocol for amplifying resonator displacements and their superpositions. We find the QCS protocol to generate a superposition of orthogonally squeezed states following a properly chosen qubit measurement. We outline quantum information processing applications for these states, including encoding a qubit in a resonator and performing a quantum non-demolition measurement of the qubit inferred from the resonator's second statistical moment. Next, we employ a two-tone drive to engineer an effective $n$-photon Rabi Hamiltonian in any desired coupling regime. In other words, the effective coupling strengths can be tuned over a wide range, thus allowing for the realization of new regimes that have so far been inaccessible. Finally, we propose a multiphoton circuit QED implementation based on a transmon qubit coupled to a resonator via an asymmetric SQUID. We provide realistic parameter estimates for the two-photon operation regime that can host the aforementioned two-photon protocols. We use numerical simulations to show that even in the presence of spurious terms and decoherence, our analytical predictions are robust.

Submitted 16 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

Comments: Updated circuit diagram and derivation, 14 pages, 7 figures

arXiv:2405.00833 [pdf, other]

Modelling the nanopore sequencing process with Helicase HMMs

Authors: Xuechun Xu, Joakim Jaldén

Abstract: Recent advancements in nanopore sequencing technology, particularly the R10 nanopore from Oxford Nanopore Technology, have necessitated the development of improved data processing methods to utilize their potential for more than 9-mer resolution fully. The processing of the ion currents predominantly utilizes neural network-based methods known for their high basecalling accuracy but face developmental bottlenecks at higher resolutions. In light of this, we introduce the Helicase Hidden Markov Model (HHMM), a novel framework designed to incorporate the dynamics of the helicase motor protein alongside the nucleotide sequence during nanopore sequencing. This model supports the analysis of millions of distinct states, enhancing our understanding of raw ion currents and their alignment with nucleotide sequences. Our findings demonstrate the utility of HHMM not only as a potent visualization tool but also as an effective base for developing advanced basecalling algorithms. This approach offers a promising avenue for leveraging the full capabilities of emerging high-resolution nanopore sequencing technologies.

Submitted 1 May, 2024; originally announced May 2024.

Comments: 8 pages, 7 figures and 1 table. Journal manuscript

arXiv:2405.00452 [pdf, other]

Predictive Accuracy-Based Active Learning for Medical Image Segmentation

Authors: Jun Shi, Shulan Ruan, Ziqi Zhu, Minfan Zhao, Hong An, Xudong Xue, Bing Yan

Abstract: Active learning is considered a viable solution to alleviate the contradiction between the high dependency of deep learning-based segmentation methods on annotated data and the expensive pixel-level annotation cost of medical images. However, most existing methods suffer from unreliable uncertainty assessment and the struggle to balance diversity and informativeness, leading to poor performance in segmentation tasks. In response, we propose an efficient Predictive Accuracy-based Active Learning (PAAL) method for medical image segmentation, first introducing predictive accuracy to define uncertainty. Specifically, PAAL mainly consists of an Accuracy Predictor (AP) and a Weighted Polling Strategy (WPS). The former is an attached learnable module that can accurately predict the segmentation accuracy of unlabeled samples relative to the target model with the predicted posterior probability. The latter provides an efficient hybrid querying scheme by combining predicted accuracy and feature representation, aiming to ensure the uncertainty and diversity of the acquired samples. Extensive experiment results on multiple datasets demonstrate the superiority of PAAL. PAAL achieves comparable accuracy to fully annotated data while reducing annotation costs by approximately 50% to 80%, showcasing significant potential in clinical applications. The code is available at https://github.com/shijun18/PAAL-MedSeg.

Submitted 29 June, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

Comments: 9 pages, 4 figures

arXiv:2405.00339 [pdf]

Canonized then Minimized RMSD for Three-Dimensional Structures

Authors: Jie Li, Qian Chen, Jingwei Weng, Jianming Wu, Xin Xu

Abstract: Existing molecular canonization algorithms typically operate on one-dimensional (1D) string representations or two-dimensional (2D) connectivity graphs of a molecule and are not able to differentiate equivalent atoms based on three-dimensional (3D) structures. The stereochemical tags on each atom are in fact determined according to established Cahn-Ingold-Prelog (CIP) rules for comparing grades, which can help to further differentiate atoms with similar environments. Therefore, a stereochemical-rule-based canonization algorithm that is capable of assigning canonical indices using 3D structural information is of great value. On top of the Schneider-Sayle-Landrum (SSL) partition-based canonization algorithm, we propose an enhanced canonization algorithm to expand its applicability. The initial index assignment rules are redesigned, so that the obtained canonical indices are compatible with most of the common CIP Sequence Rules, which greatly eases the stereochemical assignment. Furthermore, a branching tiebreaking step is added to secure an accurate evaluation of the structural difference through the minimized root-mean-square deviation (RMSD) between structures, with an option to include hydrogen atoms or not. Our algorithm is implemented in Python and can efficiently obtain the minimized RMSD taking into account the symmetry of molecular systems, contributing to the fields of drug design, molecular docking, and data analysis of molecular dynamics simulation.

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2405.00286 [pdf]

Ultrafast Photocurrent Hysteresis in Photoferroelectric α-In2Se3

Authors: Zhen Lei, Jiawei Chang, Qiyi Zhao, Jian Zhou, Yuanyuan Huang, Qihua Xiong, Xinlong Xu

Abstract: The photon-electron interactions are generally volatile and the intricate multiphysics details of photoexcited carrier dynamics are not yet distinguished. How to control the physical state in a nonvolatile manner through all-optical means and clarify the intricate physical processes has been a long-term goal pursued in polar materials. Photoferroelectric α-In2Se3 holds great potential for capturing multimodal nonvolatile states due to the spontaneous reversible in-plane and out-of-plane polarizations and its tunable light-matter interactions arising from the electronic degree of freedom. Here we uncover a nonvolatile zero-bias ultrafast photocurrent hysteresis response with an all-optical scheme, diagnosed by in-plane and out-of-plane terahertz waves emitted from the photoferroelectric α-In2Se3. The mechanism of such ultrafast photocurrent hysteresis emerges as a result of anomalous bulk linear and circular photovoltaic effect synchronously driven by local polarization rearrangement. Utilizing anisotropic ferroelectric kinetics-induced relative phase between the in-plane and out-of-plane directions, we further show flexibly selective chirality, tunable rotational angle, and optimizable ellipticity of terahertz wave polarizations. Our finding offers a promising avenue towards direct ultrafast nonvolatile processing of photocurrent signals through an all-optical scheme.

Submitted 30 April, 2024; originally announced May 2024.

arXiv:2405.00233 [pdf, other]

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Authors: Haohe Liu, Xuenan Xu, Yi Yuan, Mengyue Wu, Wenwu Wang, Mark D. Plumbley

Abstract: Large language models (LLMs) have significantly advanced audio processing through audio codecs that convert audio into discrete tokens, enabling the application of language modelling techniques to audio data. However, traditional codecs often operate at high bitrates or within narrow domains such as speech and lack the semantic clues required for efficient language modelling. Addressing these challenges, we introduce SemantiCodec, a novel codec designed to compress audio into fewer than a hundred tokens per second across diverse audio types, including speech, general audio, and music, without compromising quality. SemantiCodec features a dual-encoder architecture: a semantic encoder using a self-supervised AudioMAE, discretized using k-means clustering on extensive audio data, and an acoustic encoder to capture the remaining details. The semantic and acoustic encoder outputs are used to reconstruct audio via a diffusion-model-based decoder. SemantiCodec is presented in three variants with token rates of 25, 50, and 100 per second, supporting a range of ultra-low bit rates between 0.31 kbps and 1.43 kbps. Experimental results demonstrate that SemantiCodec significantly outperforms the state-of-the-art Descript codec on reconstruction quality. Our results also suggest that SemantiCodec contains significantly richer semantic information than all evaluated audio codecs, even at significantly lower bitrates. Our code and demos are available at https://haoheliu.github.io/SemantiCodec/.

Submitted 30 April, 2024; originally announced May 2024.

Comments: Demo and code: https://haoheliu.github.io/SemantiCodec/

arXiv:2404.19655 [pdf, other]

The Local Dark Matter Kinematic Substructure Based on LAMOST K Giants

Authors: Hai Zhu, Rui Guo, Juntai Shen, Jianglai Liu, Chao Liu, Xiang-Xiang Xue, Lan Zhang, Shude Mao

Abstract: Numerical simulations indicate that correlations exist between the velocity distributions of stars and dark matter (DM). We study the local DM velocity distribution based on these correlations. We select K giants from LAMOST DR8 cross-matched with Gaia DR3, which has robust measurements of three-dimensional velocity and metallicity, and separate them into the disk, halo substructure and main halo components in the chemo-dynamical space utilizing the Gaussian Mixture Model. The substructure component is highly radially anisotropic, and possibly related to the Gaia-Enceladus-Sausage (GES) merger event, while the halo component is isotropic and accreted from the earliest mergers following the Maxwell-Boltzmann Distribution (Standard Halo Model, SHM). We find that the GES-like substructure contributes $\sim85\%$ of the local non-disk stars in the Solar neighbourhood, which is nearly invariant when applying different volume cuts or additional angular momentum constraints. Utilizing the metallicity-stellar-mass relation and the stellar-mass-halo-mass relation, we find that $\sim25_{-15}^{+24}\%$ of local DM is in the kinematic substructure. Combined with the stellar distributions of non-disk components, we compute the velocity distribution of local DM. The modified heliocentric velocity distribution of local DM shifts to a lower speed and has a sharper peak compared to the SHM, which yields updated detection limits for the DM direct detection experiments. Our work confirms that the local DM velocity distribution deviates from the SHM, and needs to be properly accounted for in DM detection experiments.

Submitted 30 April, 2024; originally announced April 2024.

arXiv:2404.19589 [pdf, other]

Acceptance Tests of more than 10 000 Photomultiplier Tubes for the multi-PMT Digital Optical Modules of the IceCube Upgrade

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

Abstract: More than 10,000 photomultiplier tubes (PMTs) with a diameter of 80 mm will be installed in multi-PMT Digital Optical Modules (mDOMs) of the IceCube Upgrade. These have been tested and pre-calibrated at two sites. A throughput of more than 1000 PMTs per week with both sites was achieved with a modular design of the testing facilities and highly automated testing procedures. The testing facilities can easily be adapted to other PMTs, such that they can, e.g., be re-used for testing the PMTs for IceCube-Gen2. Single photoelectron response, high voltage dependence, time resolution, prepulse, late pulse, afterpulse probabilities, and dark rates were measured for each PMT. We describe the design of the testing facilities, the testing procedures, and the results of the acceptance tests.

Submitted 20 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

Comments: 24 pages, 19 figures, 2 tables, submitted to JINST

arXiv:2404.18523 [pdf, other]

Dynamical Blockade Optimizing via Particle Swarm Optimization Algorithm

Authors: Guang-Yu Zhang, Zhi-Hao Liu, Xun-Wei Xu

Abstract: Photon blockade in the weak nonlinear regime is an exciting and promising subject that has been extensively studied in the steady state. However, how to achieve dynamic blockade in a single bosonic mode with weak nonlinearity using only a pulsed driving field remains unexplored. Here, we propose to optimize the parameters of the pulsed driving field to achieve dynamic blockade in a single bosonic mode with weak nonlinearity via the particle swarm optimization (PSO) algorithm. We demonstrate that both Gaussian and rectangular pulses can be used to generate dynamic photon blockade in a single bosonic mode with weak nonlinearity. Based on the Fourier series expansions of the pulsed driving field, we identify that there are many paths for two-photon excitation in the bosonic mode, even when it is driven only by a pulsed field, and the dynamic blockade in the weak nonlinear regime is induced by the destructive interference between them. Our work not only highlights the effectiveness of the PSO algorithm in optimizing dynamical blockade, but also opens a way to optimize the parameters for other quantum effects, such as quantum entanglement and quantum squeezing.

Submitted 29 April, 2024; originally announced April 2024.

Comments: 8 pages, 6 figures

arXiv:2404.17806 [pdf, other]

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Authors: Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang

Abstract: Contrastive language-audio pretraining (CLAP) has been developed to align the representations of audio and language, achieving remarkable performance in retrieval and classification tasks. However, current CLAP struggles to capture temporal information within audio and text features, presenting substantial limitations for tasks such as audio retrieval and generation. To address this gap, we introduce T-CLAP, a temporal-enhanced CLAP model. We use Large Language Models (LLMs) and mixed-up strategies to generate temporal-contrastive captions for audio clips from extensive audio-text datasets. Subsequently, a new temporal-focused contrastive loss is designed to fine-tune the CLAP model by incorporating these synthetic data. We conduct comprehensive experiments and analysis in multiple downstream tasks. T-CLAP shows improved capability in capturing the temporal relationship of sound events and outperforms state-of-the-art models by a significant margin.

Submitted 27 April, 2024; originally announced April 2024.

Comments: Preprint submitted to IEEE MLSP 2024

arXiv:2404.16748 [pdf, other]

TELA: Text to Layer-wise 3D Clothed Human Generation

Authors: Junting Dong, Qi Fang, Zehuan Huang, Xudong Xu, Jingbo Wang, Sida Peng, Bo Dai

Abstract: This paper addresses the task of 3D clothed human generation from textual descriptions. Previous works usually encode the human body and clothes as a holistic model and generate the whole model in a single-stage optimization, which makes them struggle with clothing editing and lose fine-grained control over the whole generation process. To solve this, we propose a layer-wise clothed human representation combined with a progressive optimization strategy, which produces clothing-disentangled 3D human models while providing control capacity for the generation process. The basic idea is progressively generating a minimal-clothed human body and layer-wise clothes. During clothing generation, a novel stratified compositional rendering method is proposed to fuse multi-layer human models, and a new loss function is utilized to help decouple the clothing model from the human body. The proposed method achieves high-quality disentanglement, which thereby provides an effective way for 3D garment generation. Extensive experiments demonstrate that our approach achieves state-of-the-art 3D clothed human generation while also supporting cloth editing applications such as virtual try-on. Project page: http://jtdong.com/tela_layer/

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.16687 [pdf, other]

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

Authors: Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng , et al. (89 additional authors not shown)

Abstract: This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Content (AIGC). The challenge is divided into the image track and the video track. The image track uses the AIGIQA-20K, which contains 20,000 AI-Generated Images (AIGIs) generated by 15 popular generative models. The image track has a total of 318 registered participants. A total of 1,646 submissions are received in the development phase, and 221 submissions are received in the test phase. Finally, 16 participating teams submitted their models and fact sheets. The video track uses the T2VQA-DB, which contains 10,000 AI-Generated Videos (AIGVs) generated by 9 popular Text-to-Video (T2V) models. A total of 196 participants have registered in the video track. A total of 991 submissions are received in the development phase, and 185 submissions are received in the test phase. Finally, 12 participating teams submitted their models and fact sheets. Some methods have achieved better results than baseline methods, and the winning methods in both tracks have demonstrated superior prediction performance on AIGC.

Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.16433 [pdf, ps, other]

Global existence of a strong solution to the initial value problem for the Nernst-Planck-Navier-Stokes system in $\mathbb{R}^N$

Authors: Xiangsheng Xu

Abstract: We study the existence of a strong solution to the initial value problem for the Nernst-Planck-Navier-Stokes (NPNS) system in $\mathbb{R}^N, N\geq 3$. We obtain a global in-time strong solution without any smallness assumptions on the initial data.

Submitted 4 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.16425 [pdf, other]

Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja , et al. (170 additional authors not shown)

Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a, whose bright peak was also detected by the Swift Burst Alert Telescope and Konus-Wind through off-line analyses. At a redshift of $z=4.859$, EP240315a showed a much longer and more complicated light curve in the soft X-ray band than in gamma-rays. Benefiting from a large field-of-view ($\sim$3600 deg$^2$) and a high sensitivity, EP-WXT captured the earlier engine activation and extended late engine activity through a continuous detection. With a peak X-ray flux at the faint end of previously known high-$z$ GRBs, the detection of EP240315a demonstrates the great potential for EP to study the early universe via GRBs.

Submitted 25 April, 2024; originally announced April 2024.

Comments: 41 pages, 8 figures, 7 tables

arXiv:2404.16205 [pdf, other]

AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results

Authors: Marcos V. Conde, Saman Zadtootaghaj, Nabajeet Barman, Radu Timofte, Chenlong He, Qi Zheng, Ruoxi Zhu, Zhengzhong Tu, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Zicheng Zhang, Haoning Wu, Yingjie Zhou, Chunyi Li, Xiaohong Liu, Weisi Lin, Guangtao Zhai , et al. (11 additional authors not shown)

Abstract: This paper reviews the AIS 2024 Video Quality Assessment (VQA) Challenge, focused on User-Generated Content (UGC). The aim of this challenge is to gather deep learning-based methods capable of estimating the perceptual quality of UGC videos. The user-generated videos from the YouTube UGC Dataset include diverse content (sports, games, lyrics, anime, etc.), quality and resolutions. The proposed methods must process 30 FHD frames under 1 second. In the challenge, a total of 102 participants registered, and 15 submitted code and models. The performance of the top-5 submissions is reviewed and provided here as a survey of diverse deep models for efficient video quality assessment of user-generated content.

Submitted 24 April, 2024; originally announced April 2024.

Comments: CVPR 2024 Workshop -- AI for Streaming (AIS) Video Quality Assessment Challenge

arXiv:2404.16201 [pdf, other]

JWST NIRSpec High-resolution Spectroscopy of MACS0647-JD at z=10.167: Resolved [OII] Doublet and Electron Density in an Early Galaxy

Authors: Abdurro'uf, Rebecca L. Larson, Dan Coe, Tiger Yu-Yang Hsiao, Javier Álvarez-Márquez, Alejandro Crespo Gómez, Angela Adamo, Rachana Bhatawdekar, Arjan Bik, Larry D. Bradley, Christopher J. Conselice, Pratika Dayal, Jose M. Diego, Seiji Fujimoto, Lukas J. Furtak, Taylor A. Hutchison, Intae Jung, Meghana Killi, Vasily Kokorev, Matilde Mingozzi, Colin Norman, Tom Resseguier, Massimo Ricotti, Jane R. Rigby, Eros Vanzella , et al. (4 additional authors not shown)

Abstract: We present JWST/NIRSpec high-resolution spectroscopy G395H/F290LP of MACS0647-JD, a gravitationally lensed galaxy merger at $z=10.167$. The new spectroscopy, which is acquired for the two lensed images (JD1 and JD2), detects and resolves emission lines in the rest-frame ultraviolet (UV) and blue optical, including the resolved [OII]3726,3729 doublet, [NeIII]3870, [HeI]3890, H$δ$, H$γ$, and [OIII]4363. This is the first observation of the resolved [OII]3726,3729 doublet for a galaxy at $z>8$. We measure a line flux ratio [OII]3729/3726 $= 0.9 \pm 0.3$, which corresponds to an estimated electron density of $\log(n_{e} / \rm{cm}^{-3}) = 2.9 \pm 0.5$. This is significantly higher than the electron densities of local galaxies reported in the literature. We compile the measurements from the literature and further analyze the redshift evolution of $n_{e}$. We find that the redshift evolution follows the power-law form of $n_{e} = A\times (1+z)^{p}$ with $A=54^{+31}_{-23}$ cm$^{-3}$ and $p=1.2^{+0.4}_{-0.4}$. This power-law form may be explained by a combination of metallicity and morphological evolution of galaxies, which become, on average, more metal-poor and more compact with increasing redshift.

Submitted 24 April, 2024; originally announced April 2024.

Comments: 20 pages, 11 figures, 5 tables. Comments are welcome!
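
As a rough consistency check on the power law quoted in the abstract above, the sketch below evaluates $n_{e} = A\times (1+z)^{p}$ at the redshift of MACS0647-JD using only the central values $A=54$ cm$^{-3}$ and $p=1.2$. The function name `electron_density` is hypothetical, and the quoted uncertainties on $A$ and $p$ are deliberately ignored, so this is an illustration rather than the authors' analysis.

```python
import math

# Central best-fit values quoted in the abstract; uncertainties are ignored here.
A_CM3 = 54.0  # normalisation A in cm^-3
P = 1.2       # power-law index p


def electron_density(z: float, a: float = A_CM3, p: float = P) -> float:
    """Evaluate n_e = A * (1 + z)**p and return it in cm^-3 (hypothetical helper)."""
    return a * (1.0 + z) ** p


z_jd = 10.167                      # redshift of MACS0647-JD from the abstract
n_e = electron_density(z_jd)       # predicted electron density in cm^-3
log_n_e = math.log10(n_e)          # same quantity as log10(n_e / cm^-3)

print(f"n_e(z={z_jd}) = {n_e:.0f} cm^-3, log10(n_e) = {log_n_e:.2f}")
```

With these central values the prediction comes out at roughly $10^{3}$ cm$^{-3}$, which falls inside the measured $\log(n_{e} / \rm{cm}^{-3}) = 2.9 \pm 0.5$ reported in the abstract.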

arXiv:2404.16200 [pdf, other]

JWST MIRI detections of H$α$ and [O III] and direct metallicity measurement of the $z=10.17$ lensed galaxy MACS0647$-$JD

Authors: Tiger Yu-Yang Hsiao, Javier Álvarez-Márquez, Dan Coe, Alejandro Crespo Gómez, Abdurro'uf, Pratika Dayal, Rebecca L. Larson, Arjan Bik, Carmen Blanco-Prieto, Luis Colina, Pablo Guillermo Pérez-González, Luca Costantin, Carlota Prieto-Jiménez, Angela Adamo, Larry D. Bradley, Christopher J. Conselice, Seiji Fujimoto, Lukas J. Furtak, Taylor A. Hutchison, Bethan L. James, Yolanda Jiménez-Teja, Intae Jung, Vasily Kokorev, Matilde Mingozzi, Colin Norman , et al. (8 additional authors not shown)

Abstract: JWST spectroscopy has revolutionized our understanding of galaxies in the early universe. Covering wavelengths up to $5.3\,{\rm μm}$, NIRSpec can detect rest-frame optical emission lines H$α$ out to $z = 7$ and [O III] to $z = 9.5$. Observing these lines in more distant galaxies requires longer wavelength spectroscopy with MIRI. Here we present MIRI MRS IFU observations of the lensed galaxy merger MACS0647$-$JD at $z = 10.165$. With exposure times of 4.2 hours in each of two bands, we detect H$α$ at $9σ$, [O III]$\,\lambda5008$ at $11σ$, and [O III]$\,\lambda4960$ at $3σ$. Combined with previously reported NIRSpec spectroscopy that yields seven emission lines including the auroral line [O III]$\,\lambda4363$, we present the first direct metallicity measurement of a $z > 10$ galaxy: $12+{\rm log(O/H)}= 7.79\pm0.09$, or $0.13^{+0.02}_{-0.03}\,Z_{\odot}$. This is similar to galaxies at $z \sim 4 - 9$ with direct metallicity measurements, though higher than expected given the high specific star formation rate ${\rm log(sSFR / yr^{-1})} = -7.4 \pm 0.3$. We further constrain the ionization parameter ${\rm log}(U)$ = $-1.9 \pm 0.1$, ionizing photon production efficiency ${\rm log}(ξ_{\rm ion})$ = $25.3\pm0.1$, and star formation rate $5.0\pm0.6\,M_{\odot}/{\rm yr}$ within the past $10\,{\rm Myr}$. These observations demonstrate the combined power of JWST NIRSpec and MIRI for studying galaxies in the first $500$ million years.

Submitted 24 April, 2024; originally announced April 2024.

Comments: 15 pages, 8 figures, submitted to ApJ

arXiv:2404.16058 [pdf, ps, other]

Sign Changing Critical Points for Locally Lipschitz Functionals

Authors: Xian Xu, Baoxia Qin

Abstract: In this paper, some existence results for sign-changing critical points of locally Lipschitz functionals in real Banach space are obtained by combining the method of invariant sets of descending flow with a quantitative deformation. First we assume the locally Lipschitz functionals to be outwardly directed on the boundary of some closed convex sets of the real Banach space. By using the relation between the critical points on the Banach space and those of the closed convex sets, we construct a quantitative deformation lemma, and then we obtain some linking-type critical point theorems. These theoretical results can be applied to the study of the existence of sign-changing solutions for differential inclusion problems. In contrast with the related results in the literature, the main results of this paper relax the requirement on the functional from C1 continuity to local Lipschitz continuity.

Submitted 15 April, 2024; originally announced April 2024.

Comments: arXiv admin note: text overlap with arXiv:2404.13072

arXiv:2404.15276 [pdf, other]

SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation

Authors: Xiangyu Xu, Lijuan Liu, Shuicheng Yan

Abstract: Existing Transformers for monocular 3D human shape and pose estimation typically have a quadratic computation and memory complexity with respect to the feature length, which hinders the exploitation of fine-grained information in high-resolution features that is beneficial for accurate reconstruction. In this work, we propose an SMPL-based Transformer framework (SMPLer) to address this issue. SMPLer incorporates two key ingredients: a decoupled attention operation and an SMPL-based target representation, which allow effective utilization of high-resolution features in the Transformer. In addition, based on these two designs, we also introduce several novel modules including a multi-scale attention and a joint-aware attention to further boost the reconstruction performance. Extensive experiments demonstrate the effectiveness of SMPLer against existing 3D human shape and pose estimation methods both quantitatively and qualitatively. Notably, the proposed algorithm achieves an MPJPE of 45.2 mm on the Human3.6M dataset, improving upon Mesh Graphormer by more than 10% with fewer than one-third of the parameters. Code and pretrained models are available at https://github.com/xuxy09/SMPLer.

Submitted 23 April, 2024; originally announced April 2024.

Comments: Published at TPAMI 2024

Journal ref: https://www.computer.org/csdl/journal/tp/2024/05/10354384/1SP2qWh8Fq0

arXiv:2404.15155 [pdf, other]

Adaptive Collaboration Strategy for LLMs in Medical Decision Making

Authors: Yubin Kim, Chanwoo Park, Hyewon Jeong, Yik Siu Chan, Xuhai Xu, Daniel McDuff, Cynthia Breazeal, Hae Won Park

Abstract: Foundation models have become invaluable in advancing the medical field. Despite their promise, the strategic deployment of LLMs for effective utility in complex medical tasks remains an open question. Our novel framework, Medical Decision-making Agents (MDAgents), aims to address this gap by automatically assigning the effective collaboration structure for LLMs. The assigned solo or group collaboration structure is tailored to the complexity of the medical task at hand, emulating real-world medical decision making processes. We evaluate our framework and baseline methods with state-of-the-art LLMs across a suite of challenging medical benchmarks: MedQA, MedMCQA, PubMedQA, DDXPlus, PMC-VQA, Path-VQA, and MedVidQA, achieving the best performance in 5 out of 7 benchmarks that require an understanding of multi-modal medical reasoning. Ablation studies reveal that MDAgents excels in adapting the number of collaborating agents to optimize efficiency and accuracy, showcasing its robustness in diverse scenarios. We also explore the dynamics of group consensus, offering insights into how collaborative agents could behave in complex clinical team dynamics. Our code can be found at https://github.com/mitmedialab/MDAgents.

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.14757 [pdf, other]

Integrating Mamba and Transformer for Long-Short Range Time Series Forecasting

Authors: Xiongxiao Xu, Yueqing Liang, Baixiang Huang, Zhiling Lan, Kai Shu

Abstract: Time series forecasting is an important problem and plays a key role in a variety of applications including weather forecasting, stock market, and scientific simulations. Although transformers have proven to be effective in capturing dependency, the quadratic complexity of their attention mechanism prevents their further adoption in long-range time series forecasting, thus limiting them to short-range dependencies. Recent progress on state space models (SSMs) has shown impressive performance on modeling long-range dependency due to their subquadratic complexity. Mamba, as a representative SSM, enjoys linear time complexity and has achieved strong scalability on tasks that require scaling to long sequences, such as language, audio, and genomics. In this paper, we propose to leverage a hybrid framework, Mambaformer, that internally combines Mamba for long-range dependency and Transformer for short-range dependency, for long-short range forecasting. To the best of our knowledge, this is the first paper to combine Mamba and Transformer architecture in time series data. We investigate possible hybrid architectures to combine Mamba layer and attention layer for long-short range time series forecasting. The comparative study shows that the Mambaformer family can outperform Mamba and Transformer in the long-short range time series forecasting problem. The code is available at https://github.com/XiongxiaoXu/Mambaformerin-Time-Series.

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.14665 [pdf, other]

Illuminating the Unseen: Investigating the Context-induced Harms in Behavioral Sensing

Authors: Han Zhang, Vedant Das Swain, Leijie Wang, Nan Gao, Yilun Sheng, Xuhai Xu, Flora D. Salim, Koustuv Saha, Anind K. Dey, Jennifer Mankoff

Abstract: Behavioral sensing technologies are rapidly evolving across a range of well-being applications. Despite their potential, concerns about the responsible use of such technology are escalating. In response, recent research on sensing technology has started to address these issues. While promising, these efforts primarily focus on broad demographic categories and overlook more nuanced, context-specific identities. These approaches lack grounding within domain-specific harms that arise from deploying sensing technology in diverse social, environmental, and technological settings. Additionally, existing frameworks for evaluating harms are designed for a generic ML life cycle, and fail to adapt to the dynamic and longitudinal considerations for behavioral sensing technology. To address these gaps, we introduce a framework specifically designed for evaluating behavioral sensing technologies. This framework emphasizes a comprehensive understanding of context, particularly the situated identities of users and the deployment settings of the sensing technology. It also highlights the necessity for iterative harm mitigation and continuous maintenance to adapt to the evolving nature of technology and its use. We demonstrate the feasibility and generalizability of our framework through post-hoc evaluations on two real-world behavioral sensing studies conducted in different international contexts, involving varied population demographics and machine learning tasks. Our evaluations provide empirical evidence of both situated identity-based harm and more domain-specific harms, and discuss the trade-offs introduced by implementing bias mitigation techniques.

Submitted 5 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

Comments: 26 pages, 8 tables, and 1 figure (excluding appendix)

MSC Class: 68U35 ACM Class: H.5.0; I.2.m

arXiv:2404.14568 [pdf, other]

UVMap-ID: A Controllable and Personalized UV Map Generative Model

Authors: Weijie Wang, Jichao Zhang, Chang Liu, Xia Li, Xingqian Xu, Humphrey Shi, Nicu Sebe, Bruno Lepri

Abstract: Recently, diffusion models have made significant strides in synthesizing realistic 2D human images based on provided text prompts. Building upon this, researchers have extended 2D text-to-image diffusion models into the 3D domain for generating human textures (UV Maps). However, some important problems about UV Map Generative models are still not solved, i.e., how to generate personalized texture maps for any given face image, and how to define and evaluate the quality of these generated texture maps. To solve the above problems, we introduce a novel method, UVMap-ID, which is a controllable and personalized UV Map generative model. Unlike traditional large-scale training methods in 2D, we propose to fine-tune a pre-trained text-to-image diffusion model which is integrated with a face fusion module for achieving ID-driven customized generation. To support the finetuning strategy, we introduce a small-scale attribute-balanced training dataset, including high-quality textures with labeled text and Face ID. Additionally, we introduce some metrics to evaluate the multiple aspects of the textures. Finally, both quantitative and qualitative analyses demonstrate the effectiveness of our method in controllable and personalized UV Map generation. Code is publicly available via https://github.com/twowwj/UVMap-ID.

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.14441 [pdf]

Optimizing Contrail Detection: A Deep Learning Approach with EfficientNet-b4 Encoding

Authors: Qunwei Lin, Qian Leng, Zhicheng Ding, Chao Yan, Xiaonan Xu

Abstract: In the pursuit of environmental sustainability, the aviation industry faces the challenge of minimizing its ecological footprint. Among the key solutions is contrail avoidance, targeting the linear ice-crystal clouds produced by aircraft exhaust. These contrails exacerbate global warming by trapping atmospheric heat, necessitating precise segmentation and comprehensive analysis of contrail images to gauge their environmental impact. However, this segmentation task is complex due to the varying appearances of contrails under different atmospheric conditions and potential misalignment issues in predictive modeling. This paper presents a deep-learning approach that uses the EfficientNet-b4 encoder for feature extraction and integrates misalignment correction, soft labeling, and pseudo-labeling techniques to enhance the accuracy and efficiency of contrail detection in satellite imagery. The proposed methodology aims to redefine contrail image analysis and contribute to the objectives of sustainable aviation by providing a robust framework for precise contrail detection and analysis in satellite imagery, thus aiding in the mitigation of aviation's environmental impact.

Submitted 19 April, 2024; originally announced April 2024.

arXiv:2404.14309 [pdf, other]

Towards Better Adversarial Purification via Adversarial Denoising Diffusion Training

Authors: Yiming Liu, Kezhao Liu, Yao Xiao, Ziyi Dong, Xiaogang Xu, Pengxu Wei, Liang Lin

Abstract: Recently, diffusion-based purification (DBP) has emerged as a promising approach for defending against adversarial attacks. However, previous studies have used questionable methods to evaluate the robustness of DBP models, and their explanations of DBP robustness also lack experimental support. We re-examine DBP robustness using precise gradients, and discuss the impact of stochasticity on DBP robustness. To better explain DBP robustness, we assess DBP robustness under a novel attack setting, Deterministic White-box, and pinpoint stochasticity as the main factor in DBP robustness. Our results suggest that DBP models rely on stochasticity to evade the most effective attack direction, rather than directly countering adversarial perturbations. To improve the robustness of DBP models, we propose Adversarial Denoising Diffusion Training (ADDT). This technique uses Classifier-Guided Perturbation Optimization (CGPO) to generate adversarial perturbations through guidance from a pre-trained classifier, and uses Rank-Based Gaussian Mapping (RBGM) to convert adversarial perturbations into a normal Gaussian distribution. Empirical results show that ADDT improves the robustness of DBP models. Further experiments confirm that ADDT equips DBP models with the ability to directly counter adversarial perturbations.

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.14173 [pdf, other]

Noiseless linear amplification-based quantum Ziv-Zakai bound for phase estimation and its Heisenberg error limits in noisy scenarios

Authors: Wei Ye, Peng Xiao, Xiaofan Xu, Xiang Zhu, Yunbin Yan, Lu Wang, Jie Ren, Yuxuan Zhu, Ying Xia, Xuan Rao, Shoukang Chang

Abstract: In this work, we address the central problem of how to effectively find the available precision limit of unknown parameters. In the framework of the quantum Ziv-Zakai bound (QZZB), we apply noiseless linear amplification (NLA) techniques to an initial coherent state (CS) as the probe state, and focus on whether the phase estimation performance is improved significantly in noisy scenarios, involving the photon-loss and phase-diffusion cases. More importantly, we also obtain two kinds of Heisenberg error limits of the QZZB with the NLA-based CS in these noisy scenarios, making comparisons with both the Margolus-Levitin (ML) type bound and the Mandelstam-Tamm (MT) type bound. Our analytical results show that in cases of photon loss and phase diffusion, the phase estimation performance of the QZZB can be improved remarkably by increasing the NLA gain factor. In particular, the improvement is more pronounced with severe photon losses. Furthermore, under minimal photon losses, our Heisenberg error limit shows better compactness than the ML-type and MT-type bounds. Our findings provide useful guidance for accomplishing more complex quantum information processing tasks.

Submitted 22 April, 2024; originally announced April 2024.

Comments: 10 pages, 9 figures

arXiv:2404.13971 [pdf, other]

HamilToniQ: An Open-Source Benchmark Toolkit for Quantum Computers

Authors: Xiaotian Xu, Kuan-Cheng Chen, Robert Wille

Abstract: In this paper, we introduce HamilToniQ, an open-source, application-oriented benchmarking toolkit for the comprehensive evaluation of Quantum Processing Units (QPUs). Designed to navigate the complexities of quantum computations, HamilToniQ incorporates a methodological framework assessing QPU types, topologies, and multi-QPU systems. The toolkit facilitates the evaluation of QPUs' performance through multiple steps including quantum circuit compilation and quantum error mitigation (QEM), integrating strategies that are unique to each stage. HamilToniQ's standardized score, H-Score, quantifies the fidelity and reliability of QPUs, providing a multidimensional perspective of QPU performance. With a focus on the Quantum Approximate Optimization Algorithm (QAOA), the toolkit enables direct, comparable analysis of QPUs, enhancing transparency and equity in benchmarking. As demonstrated in this paper, HamilToniQ has been validated on various IBM QPUs, affirming its effectiveness and robustness. Overall, HamilToniQ contributes to the advancement of the quantum computing field by offering precise and equitable benchmarking metrics.

Submitted 22 April, 2024; originally announced April 2024.

Comments: 11 pages, 13 figures

arXiv:2404.13840 [pdf, other]

Study of $e^+e^-\toωX(3872)$ and $γX(3872)$ from 4.66 to 4.95 GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

Abstract: Using data samples with an integrated luminosity of $4.5~\text{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies ranging from 4.66 to 4.95 GeV, we study the processes of $e^+e^-\toωX(3872)$ and $e^+e^-\toγX(3872)$. With the $e^+e^-\toωX(3872)$ process, the branching fraction ratio $R\equiv\frac{\mathcal{B}(X(3872)\toγJ/ψ)}{\mathcal{B}(X(3872)\toπ^+π^- J/ψ)}$ is measured to be $0.38\pm0.20_\text{stat.}\pm0.01_\text{syst.}$ ($R< 0.83$ at 90\% confidence level). In addition, we measure the ratio of the average cross section of $e^+e^-\toωX(3872)$ to $e^+e^-\toωχ_{c1}(ωχ_{c2})$ to be $σ_{ωX(3872)}/σ_{ωχ_{c1}}~(σ_{ωX(3872)}/σ_{ωχ_{c2}})=5.2\pm1.0_\text{stat.}\pm1.9_\text{syst.}~ (5.5\pm1.1_\text{stat.}\pm2.4_\text{syst.})$. Finally, we search for the process of $e^+e^-\toγX(3872)$, and no obvious signal is observed. The upper limit on the ratio of the average cross section of $e^+e^-\toγX(3872)$ to $e^+e^-\toωX(3872)$ is set as $σ_{γX(3872)}/σ_{ωX(3872)}<0.23$ at 90\% confidence level.

Submitted 21 April, 2024; originally announced April 2024.

Comments: 19 pages, 10 figures

arXiv:2404.13812 [pdf, other]

A Comparative Study on Enhancing Prediction in Social Network Advertisement through Data Augmentation

Authors: Qikai Yang, Panfeng Li, Xinhe Xu, Zhicheng Ding, Wenjing Zhou, Yi Nian

Abstract: In the ever-evolving landscape of social network advertising, the volume and accuracy of data play a critical role in the performance of predictive models. However, the development of robust predictive algorithms is often hampered by the limited size and potential bias present in real-world datasets. This study presents and explores a generative augmentation framework of social network advertising data. Our framework explores three generative models for data augmentation - Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and Gaussian Mixture Models (GMMs) - to enrich data availability and diversity in the context of social network advertising analytics effectiveness. By performing synthetic extensions of the feature space, we find that through data augmentation, the performance of various classifiers has been quantitatively improved. Furthermore, we compare the relative performance gains brought by each data augmentation technique, providing insights for practitioners to select appropriate techniques to enhance model performance. This paper contributes to the literature by showing that synthetic data augmentation alleviates the limitations imposed by small or imbalanced datasets in the field of social network advertising. At the same time, this article also provides a comparative perspective on the practicality of different data augmentation methods, thereby guiding practitioners to choose appropriate techniques to enhance model performance.

Submitted 28 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

Comments: Accepted by 2024 4th International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)

arXiv:2404.13409 [pdf, other]

"I Wish There Were an AI": Challenges and AI Potential in Cancer Patient-Provider Communication

Authors: Ziqi Yang, Xuhai Xu, Bingsheng Yao, Jiachen Li, Jennifer Bagdasarian, Guodong Gao, Dakuo Wang

Abstract: Patient-provider communication has been crucial to cancer patients' survival after their cancer treatments. However, the research community and patients themselves often overlook the communication challenges after cancer treatments as they are overshadowed by the severity of the patient's illness and the variety and rarity of the cancer disease itself. Meanwhile, the recent technical advances in AI, especially in Large Language Models (LLMs) with versatile natural language interpretation and generation ability, demonstrate great potential to support communication in complex real-world medical situations. By interviewing six healthcare providers and eight cancer patients, our goal is to explore the providers' and patients' communication barriers in the post-cancer treatment recovery period, their expectations for future communication technologies, and the potential of AI technologies in this context. Our findings reveal several challenges in current patient-provider communication, including the knowledge and timing gaps between cancer patients and providers, their collaboration obstacles, and resource limitations. Moreover, based on providers' and patients' needs and expectations, we summarize a set of design implications for intelligent communication systems, especially with the power of LLMs. Our work sheds light on the design of future AI-powered systems for patient-provider communication under high-stake and high-uncertainty situations.

Submitted 20 April, 2024; originally announced April 2024.

Comments: 18 pages, 2 figures, submission to CSCW'24

arXiv:2404.13261 [pdf, ps, other]

On the local solvability and stability of the partial inverse problems for the non-self-adjoint Sturm-Liouville operators with a discontinuity

Authors: Xiao-Chuan Xu, Chuan-Fu Yang, Natalia Pavlovna Bondarenko

Abstract: In this work, we study the inverse spectral problems for the Sturm-Liouville operators on [0,1] with complex coefficients and a discontinuity at $x=a\in(0,1)$. Assume that the potential on (a,1) and some parameters in the discontinuity and boundary conditions are given. We recover the potential on (0,a) and the other parameters from the eigenvalues. This is the so-called partial inverse problem. The local solvability and stability of the partial inverse problems are obtained for $a\in(0,1)$, in which the error caused by the given partial potential is considered. As a by-product, we also obtain two new uniqueness theorems for the partial inverse problem.

Submitted 20 April, 2024; originally announced April 2024.

Comments: 26 pages

MSC Class: 34A55; 34B05; 34L40; 47E05

arXiv:2404.13153 [pdf, other]

Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring

Authors: Chengxu Liu, Xuan Wang, Xiangyu Xu, Ruhao Tian, Shuai Li, Xueming Qian, Ming-Hsuan Yang

Abstract: Eliminating image blur produced by various kinds of motion has been a challenging problem. Dominant approaches rely heavily on model capacity to remove blurring by reconstructing residual from blurry observation in feature space. These practices not only prevent the capture of spatially variable motion in the real world but also ignore the tailored handling of various motions in image space. In this paper, we propose a novel real-world deblurring filtering model called the Motion-adaptive Separable Collaborative (MISC) Filter. In particular, we use a motion estimation network to capture motion information from neighborhoods, thereby adaptively estimating spatially-variant motion flow, mask, kernels, weights, and offsets to obtain the MISC Filter. The MISC Filter first aligns the motion-induced blurring patterns to the motion middle along the predicted flow direction, and then collaboratively filters the aligned image through the predicted kernels, weights, and offsets to generate the output. This design can handle more generalized and complex motion in a spatially differentiated manner. Furthermore, we analyze the relationships between the motion estimation network and the residual reconstruction network. Extensive experiments on four widely used benchmarks demonstrate that our method provides an effective solution for real-world motion blur removal and achieves state-of-the-art performance. Code is available at https://github.com/ChengxuLiu/MISCFilter

Submitted 19 April, 2024; originally announced April 2024.

Comments: CVPR 2024

arXiv:2404.13072 [pdf, ps, other]

The Method of Invariant Sets of Descending Flow for Locally Lipschitz Functionals

Authors: Xian Xu, Baoxia Qin

Abstract: In this paper, we extend the method of invariant sets of descending flow, proposed by Sun Jingxian for smooth functionals, to locally Lipschitz functionals. In this way, we obtain existence results for the positive, negative and sign-changing critical points of locally Lipschitz functionals, and apply these theoretical results to the study of differential inclusion problems with p-Laplacian. In order to obtain the above results, we develop some new techniques: 1) We establish a method for extending the pseudo-gradient field to the whole space while preserving the useful information of the local pseudo-gradient field; 2) In the case of set-valued mapping, a pseudo-gradient field is established that makes both the cone and the negative cone invariant sets of descending flow. To obtain our main results, a new class of (PS) condition is also proposed.

Submitted 15 April, 2024; originally announced April 2024.

arXiv:2404.12938 [pdf, other]

MAiDE-up: Multilingual Deception Detection of GPT-generated Hotel Reviews

Authors: Oana Ignat, Xiaomeng Xu, Rada Mihalcea

Abstract: Deceptive reviews are becoming increasingly common, especially given the increase in performance and the prevalence of LLMs. While work to date has addressed the development of models to differentiate between truthful and deceptive human reviews, much less is known about the distinction between real reviews and AI-authored fake reviews. Moreover, most of the research so far has focused primarily on English, with very little work dedicated to other languages. In this paper, we compile and make publicly available the MAiDE-up dataset, consisting of 10,000 real and 10,000 AI-generated fake hotel reviews, balanced across ten languages. Using this dataset, we conduct extensive linguistic analyses to (1) compare the AI fake hotel reviews to real hotel reviews, and (2) identify the factors that influence the deception detection model performance. We explore the effectiveness of several models for deception detection in hotel reviews across three main dimensions: sentiment, location, and language. We find that these dimensions influence how well we can detect AI-generated fake reviews.

Submitted 18 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

arXiv:2404.12833 [pdf, other]

How Far Can We Go with Practical Function-Level Program Repair?

Authors: Jiahong Xiang, Xiaoyang Xu, Fanchu Kong, Mingyuan Wu, Haotian Zhang, Yuqun Zhang

Abstract: Recently, multiple Automated Program Repair (APR) techniques based on Large Language Models (LLMs) have been proposed to enhance the repair performance. While these techniques mainly focus on the single-line or hunk-level repair, they face significant challenges in real-world application due to the limited repair task scope and costly statement-level fault localization. However, the more practical function-level APR, which broadens the scope of the APR task to fix entire buggy functions and requires only cost-efficient function-level fault localization, remains underexplored. In this paper, we conduct the first comprehensive study of LLM-based function-level APR, including investigating the effect of the few-shot learning mechanism and the auxiliary repair-relevant information. Specifically, we adopt six widely-studied LLMs and construct a benchmark in both the Defects4J 1.2 and 2.0 datasets. Our study demonstrates that LLMs with zero-shot learning are already powerful function-level APR techniques, while applying the few-shot learning mechanism leads to disparate repair performance. Moreover, we find that directly applying the auxiliary repair-relevant information to LLMs significantly increases function-level repair performance. Inspired by our findings, we propose an LLM-based function-level APR technique, namely SRepair, which adopts a dual-LLM framework to leverage the power of the auxiliary repair-relevant information for advancing the repair performance. The evaluation results demonstrate that SRepair can correctly fix 300 single-function bugs in the Defects4J dataset, largely surpassing all previous APR techniques by at least 85%, without the need for the costly statement-level fault location information. Furthermore, SRepair successfully fixes 32 multi-function bugs in the Defects4J dataset, which, to the best of our knowledge, no previous APR technique has achieved.

Submitted 19 April, 2024; originally announced April 2024.

Comments: https://github.com/GhabiX/SRepair/

arXiv:2404.12817 [pdf, other]

Determination of the CKM angle $φ_{3}$ from a combination of Belle and Belle II results

Authors: Belle and Belle II Collaborations, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, S. Al Said, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, et al. (377 additional authors not shown)

Abstract: We report a determination of the CKM angle $φ_{3}$, also known as $γ$, from a combination of measurements using samples of up to 711~fb$^{-1}$ from the Belle experiment and up to 362~fb$^{-1}$ from the Belle II experiment. We combine results from analyses of $B^+\to DK^+, B^+\to Dπ^+$, and $B^+ \to D^{*}K^+$ decays, where $D$ is an admixture of $D^0$ and $\overline{D}{}^{0}$ mesons, in a likelihood fit to obtain $φ_{3} = (78.6^{+7.2}_{-7.3})^{\circ}$. We also briefly discuss the interpretation of this result.

Submitted 19 April, 2024; originally announced April 2024.

Comments: 31 pages, 4 figures

Report number: Belle II Preprint 2023-015, KEK Preprint 2023-31

arXiv:2404.12426 [pdf, other]

Unequal-mass, highly-spinning binary black hole mergers in the stable mass transfer formation channel

Authors: Aleksandra Olejak, Jakub Klencki, Xiao-Tian Xu, Chen Wang, Krzysztof Belczynski, Jean-Pierre Lasota

Abstract: The growing database of gravitational-wave (GW) detections with the binary black holes (BHs) merging in the distant Universe contains subtle insights into their formation scenarios. One of the puzzling properties of detected GW sources is the possible (anti)correlation between mass ratio q of BH-BH binaries and their effective spin. We use rapid binary evolution models to demonstrate that the isolated binary evolution followed by efficient tidal spin-up of stripped helium core produces a similar pattern in Xeff vs q distributions of BH-BH mergers. In our models, the progenitors of unequal BH-BH systems in the stable mass transfer formation scenario are more likely to efficiently shrink their orbits during the second Roche-lobe overflow than the binaries that evolve into nearly equal-mass component systems. This makes it easier for unequal-mass progenitors to enter the tidal spin-up regime and later merge due to GW emission. Our results are, however, sensitive to some input assumptions, especially, the stability of mass transfer and the angular momentum loss during non-conservative mass transfer. We note that mass transfer prescriptions widely adopted in rapid codes favor the formation of BH-BH merger progenitors with unequal masses and moderate separations. We compare our results with detailed stellar model grids and find reasonable agreement after appropriate calibration of the physics models. We anticipate that future detections of unequal-mass BH-BH mergers could provide valuable constraints on the role of the stable mass transfer formation channel. A significant fraction of BH-BH detections with mass ratio q in range (0.4 - 0.7) would be consistent with the mass ratio reversal scenario during the first, relatively conservative mass transfer, and a non-enhanced angular momentum loss during the second, highly non-conservative mass transfer phase.

Submitted 11 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

Comments: Version accepted for publication in A&A

Showing 201–250 of 5,369 results for author: Xu, X