Search | arXiv e-print repository

doi 10.1109/TGRS.2024.3368760

A Multi-scale Generalized Shrinkage Threshold Network for Image Blind Deblurring in Remote Sensing

Authors: Yujie Feng, Yin Yang, Xiaohong Fan, Zhengpeng Zhang, Jianping Zhang

Abstract: Remote sensing images are essential for many applications in the Earth sciences, but their quality is often degraded by limitations in sensor technology and complex imaging environments. To address this, various remote sensing image deblurring methods have been developed to restore sharp and high-quality images from degraded observational data. However, most traditional model-based deblurring methods usually require predefined hand-crafted prior assumptions, which are difficult to handle in complex applications. On the other hand, deep learning-based deblurring methods are often considered as black boxes, lacking transparency and interpretability. In this work, we propose a new blind deblurring learning framework that utilizes alternating iterations of shrinkage thresholds. This framework involves updating blurring kernels and images, with a theoretical foundation in network design. Additionally, we propose a learnable blur kernel proximal mapping module to improve the accuracy of the blur kernel reconstruction. Furthermore, we propose a deep proximal mapping module in the image domain, which combines a generalized shrinkage threshold with a multi-scale prior feature extraction block. This module also incorporates an attention mechanism to adaptively learn the importance of prior information, improving the flexibility and robustness of prior terms, and avoiding limitations similar to hand-crafted image prior terms.
Consequently, we design a novel multi-scale generalized shrinkage threshold network (MGSTNet) that focuses specifically on learning deep geometric prior features to enhance image restoration. Experimental results on real and synthetic remote sensing image datasets demonstrate the superiority of our MGSTNet framework compared to existing deblurring methods.

Submitted 21 February, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

Comments: 16 pages, accepted to IEEE Transactions on Geoscience and Remote Sensing, 2024

MSC Class: 54H30; 68U10; 94A08

Journal ref: IEEE Transactions on Geoscience and Remote Sensing, 2024

arXiv:2309.07210 [pdf, other]

TRINITY IV: Predictions for Supermassive Black Holes at $z \gtrsim 7$

Authors: Haowen Zhang, Peter Behroozi, Marta Volonteri, Joseph Silk, Xiaohui Fan, James Aird, Jinyi Yang, Feige Wang, Philip F. Hopkins

Abstract: We present predictions for the high-redshift halo-galaxy-supermassive black hole (SMBH) connection from the TRINITY model. Constrained by a comprehensive compilation of galaxy ($0\leq z \leq 10$) and SMBH datasets ($0\leq z \leq 6.5$), TRINITY finds: 1) The number of SMBHs with $M_\bullet > 10^9 M_\odot$ in the observable Universe increases by six orders of magnitude from $z\sim10$ to $z\sim2$, and by another factor of $\sim 3$ from $z\sim2$ to $z=0$; 2) The $M_\bullet > 10^9/10^{10} M_\odot$ SMBHs at $z\sim 6$ live in haloes with $\sim (2-3)/(3-5) \times 10^{12} M_\odot$; 3) The new JWST AGNs at $7\lesssim z \lesssim 11$ are broadly consistent with the median SMBH mass-galaxy mass relation for AGNs from TRINITY; 4) Seeds from runaway mergers in nuclear star clusters are viable progenitors for the SMBHs in GN-z11 ($z=10.6$) and CEERS_1019 ($z=8.7$); 5) $z=6-10$ quasar luminosity functions from wide area surveys by, e.g., Roman and Euclid, will reduce uncertainties in the $z=6-10$ SMBH mass-galaxy mass relation by up to $\sim 0.5$ dex.

Submitted 13 September, 2023; originally announced September 2023.

Comments: 15 pages, 12 figures, submitted to MNRAS. Questions and comments are welcome!

arXiv:2309.07136 [pdf, other]

Masked Transformer for Electrocardiogram Classification

Authors: Ya Zhou, Xiaolin Diao, Yanni Huo, Yang Liu, Xiaohan Fan, Wei Zhao

Abstract: The electrocardiogram (ECG) is one of the most important diagnostic tools in clinical applications. With the advent of advanced algorithms, various deep learning models have been adopted for ECG tasks. However, the potential of the Transformer for ECG data has not been fully realized, despite its widespread success in computer vision and natural language processing. In this work, we present Masked Transformer for ECG classification (MTECG), a simple yet effective method which significantly outperforms recent state-of-the-art algorithms in ECG classification. Our approach adapts the image-based masked autoencoders to self-supervised representation learning from ECG time series. We utilize a lightweight Transformer for the encoder and a 1-layer Transformer for the decoder. The ECG signal is split into a sequence of non-overlapping segments along the time dimension, and learnable positional embeddings are added to preserve the sequential information. We construct the Fuwai dataset comprising 220,251 ECG recordings with a broad range of diagnoses, annotated by medical experts, to explore the potential of the Transformer. A strong pre-training and fine-tuning recipe is proposed from the empirical study. The experiments demonstrate that the proposed method increases the macro F1 scores by 3.4%-27.5% on the Fuwai dataset, 9.9%-32.0% on the PTB-XL dataset, and 9.4%-39.1% on a multicenter dataset, compared to the alternative methods. We hope that this study could direct future research on the application of the Transformer to more ECG tasks.

Submitted 22 April, 2024; v1 submitted 31 August, 2023; originally announced September 2023.

Comments: more experimental results; more implementation details; different abstracts
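
The segmentation step described in the abstract above (splitting the signal into non-overlapping segments, then hiding some of them as in a masked autoencoder) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' code: the function names, the segment length, the 0.75 mask ratio, and the choice to drop a trailing remainder are all assumptions made here.

```python
import random


def split_into_segments(signal, segment_len):
    """Split a 1-D sequence into non-overlapping segments.

    Assumption: any trailing samples that do not fill a whole segment are dropped.
    """
    n_segments = len(signal) // segment_len
    return [signal[i * segment_len:(i + 1) * segment_len] for i in range(n_segments)]


def random_mask(n_segments, mask_ratio, seed=0):
    """Choose which segment indices are hidden from the encoder (hypothetical helper)."""
    rng = random.Random(seed)
    n_masked = int(n_segments * mask_ratio)
    masked = sorted(rng.sample(range(n_segments), n_masked))
    masked_set = set(masked)
    visible = [i for i in range(n_segments) if i not in masked_set]
    return visible, masked


# Toy signal of 100 samples, cut into 10 segments of 10 samples each.
signal = [float(i) for i in range(100)]
segments = split_into_segments(signal, segment_len=10)
visible, masked = random_mask(len(segments), mask_ratio=0.75)
```

In a masked-autoencoder setup, only the `visible` segments would be passed to the encoder, and the decoder would be trained to reconstruct the `masked` ones.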

arXiv:2309.06037 [pdf, other]

doi 10.1103/PhysRevD.107.123029

Fast resolving Galactic binaries in LISA data and its ability to study the Milky Way

Authors: Pin Gao, Xi-Long Fan, Zhou-Jian Cao, Xue-Hao Zhang

Abstract: Resolving individual gravitational waves from tens of millions of double white dwarf (DWD) binaries in the Milky Way is a challenge for future space-based gravitational wave detection programs. By using previous data to define the priors for the next search, we propose an accelerated approach of searching the DWD binaries and demonstrate its efficiency based on the GBSIEVER detection pipeline. Compared to the traditional GBSIEVER method, our method can obtain $\sim 50\%$ of sources with 2.5\% of the searching time for LDC1-4 data. In addition, we find that both methods have a similar ability to detect the Milky Way structure by their confirmed sources. The relative error of distance and chirp mass is about 20\% for DWD binaries whose gravitational wave frequency is higher than $4\times10^{-3}$ Hz, even if they are close to the Galactic center. Finally, we propose a signal-to-noise ratio (SNR) threshold for LISA to confirm the detection of DWD binaries. The threshold should be 16 when the gravitational wave frequency is lower than $4\times10^{-3}$ Hz and 9 when the frequency range is from $4\times10^{-3}$ Hz to $1.5\times10^{-2}$ Hz.

Submitted 12 September, 2023; originally announced September 2023.

Comments: 16 pages, 19 figures

Journal ref: Phys. Rev. D 107, 123029, 2023
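
The frequency-dependent SNR threshold quoted at the end of the abstract above can be written as a small lookup. This is a sketch of the stated rule only: the function names are hypothetical, the abstract gives no threshold above $1.5\times10^{-2}$ Hz (so `None` is returned there), and treating "SNR at or above the threshold" as confirmation is an assumption.

```python
def snr_threshold(freq_hz):
    """Return the SNR threshold quoted in the abstract for a given GW frequency.

    Below 4e-3 Hz the threshold is 16; from 4e-3 Hz to 1.5e-2 Hz it is 9.
    Outside that range the abstract states no value, so None is returned.
    """
    if freq_hz < 4e-3:
        return 16.0
    if freq_hz <= 1.5e-2:
        return 9.0
    return None


def is_confirmed(snr, freq_hz):
    """Assumption: a source counts as confirmed when its SNR meets the threshold."""
    threshold = snr_threshold(freq_hz)
    return threshold is not None and snr >= threshold
```

For example, a source with SNR 10 would pass at 5 mHz but not at 1 mHz under this reading.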

arXiv:2309.06011 [pdf, other]

Waveform Reconstruction of Core-Collapse Supernovae Gravitational-Waves with Ensemble Empirical Mode Decomposition

Authors: Yong Yuan, Xi-Long Fan, Hou-Jun Lü, Yang-Yi Sun, Kai Lin

Abstract: The gravitational waves (GW) from core-collapse supernovae (CCSN) have been proposed as a probe to investigate physical properties inside of the supernova. However, how to search for and extract the GW signals from core-collapse supernovae remains an open question due to their complicated time-frequency structure. In this paper, we apply the Ensemble Empirical Mode Decomposition (EEMD) method to decompose and reconstruct simulated GW data generated by the magnetorotational mechanism and the neutrino-driven mechanism within advanced LIGO, using the match score as the criterion for assessing the quality of the reconstruction. The results indicate that by decomposing the data, the sum of the first six intrinsic mode functions (IMFs) can be used as the reconstructed waveform. To determine the probability that our reconstructed waveform corresponds to a real GW waveform, we calculate the false alarm probability of reconstruction (FAPR). By setting the threshold of the match score to be 0.75, we obtain FAPR of GW sources at a distance of 5 kpc and 10 kpc to be $6\times10^{-3}$ and $1\times10^{-2}$ respectively. If we normalize the maximum amplitude of the GW signal to $5\times10^{-21}$, the FAPR at this threshold is $4\times10^{-3}$. Furthermore, in our study, the reconstruction distance is not equivalent to the detection distance. When the strain of GW reaches $7 \times 10^{-21}$, and the match score threshold is set at 0.75, we can reconstruct GW waveform up to approximately 36 kpc.

Submitted 22 February, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

Comments: 9 pages, 6 figures. Accepted by MNRAS
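
The two steps named in the abstract above (summing the first six IMFs, then scoring the result against a reference waveform) can be illustrated with a simplified sketch. Several things here are assumptions and not the paper's procedure: the IMFs are taken as already computed by some EEMD routine, the function names are hypothetical, and the match score is reduced to a plain normalized inner product with no detector-noise weighting and no maximization over time or phase shifts.

```python
import math


def reconstruct(imfs, n_modes=6):
    """Sum the first n_modes IMFs sample by sample.

    imfs is assumed to be a list of equal-length sequences produced elsewhere.
    """
    selected = imfs[:n_modes]
    return [sum(samples) for samples in zip(*selected)]


def match_score(a, b):
    """Simplified match: <a, b> / sqrt(<a, a> <b, b>), assuming white noise."""
    inner = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return inner / norm if norm > 0 else 0.0


def passes_threshold(reconstructed, reference, threshold=0.75):
    """Apply the 0.75 match-score threshold mentioned in the abstract."""
    return match_score(reconstructed, reference) >= threshold
```

A score of 1 means the two waveforms agree up to an overall positive scale factor, and a score of 0 means they are orthogonal.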

arXiv:2309.05267 [pdf, other]

Diving into Darkness: A Dual-Modulated Framework for High-Fidelity Super-Resolution in Ultra-Dark Environments

Authors: Jiaxin Gao, Ziyu Yue, Yaohua Liu, Sihan Xie, Xin Fan, Risheng Liu

Abstract: Super-resolution of images captured in ultra-dark environments is a practical yet challenging problem that has received little attention. Due to uneven illumination and low signal-to-noise ratio in dark environments, a multitude of problems such as lack of detail and color distortion may be magnified in the super-resolution process compared to normal-lighting environments. Consequently, conventional low-light enhancement or super-resolution methods, whether applied individually or in a cascaded manner for such problems, often encounter limitations in recovering luminance, color fidelity, and intricate details. To conquer these issues, this paper proposes a specialized dual-modulated learning framework that, for the first time, attempts to deeply dissect the nature of the low-light super-resolution task. Leveraging natural image color characteristics, we introduce a self-regularized luminance constraint as a prior for addressing uneven lighting. Expanding on this, we develop Illuminance-Semantic Dual Modulation (ISDM) components to enhance feature-level preservation of illumination and color details. Besides, instead of deploying naive up-sampling strategies, we design the Resolution-Sensitive Merging Up-sampler (RSMU) module that brings together different sampling modalities as substrates, effectively mitigating the presence of artifacts and halos.
Comprehensive experiments showcase the applicability and generalizability of our approach to diverse and challenging ultra-low-light conditions, outperforming state-of-the-art methods with a notable improvement (i.e., $\uparrow$5\% in PSNR, and $\uparrow$43\% in LPIPS). Especially noteworthy is the 19-fold increase in the RMSE score, underscoring our method's exceptional generalization across different darkness levels. The code will be available online upon publication of the paper.

Submitted 11 September, 2023; originally announced September 2023.

Comments: 9 pages

arXiv:2309.05266 [pdf, ps, other]

Self-normalized Cramér type moderate deviations for martingales and applications

Authors: Xiequan Fan, Qi-Man Shao

Abstract: Cramér's moderate deviations give a quantitative estimate for the relative error of the normal approximation and provide theoretical justifications for many estimators used in statistics. In this paper, we establish self-normalized Cramér type moderate deviations for martingales under some mild conditions. The result extends an earlier work of Fan, Grama, Liu and Shao [Bernoulli, 2019]. Moreover, applications of our result to Student's statistic, stationary martingale difference sequences and branching processes in a random environment are also discussed. In particular, we establish Cramér type moderate deviations for Student's $t$-statistic for branching processes in a random environment.

Submitted 11 September, 2023; originally announced September 2023.

Comments: 24 pages

MSC Class: Primary 60G42; 60F10; Secondary 60K37; 60J80

arXiv:2309.05226 [pdf, other]

Joint Beamforming and Compression Design for Per-Antenna Power Constrained Cooperative Cellular Networks

Authors: Xilai Fan, Ya-Feng Liu, Bo Jiang

Abstract: In the cooperative cellular network, relay-like base stations are connected to the central processor (CP) via rate-limited fronthaul links and the joint processing is performed at the CP, which thus can effectively mitigate the multiuser interference. In this paper, we consider the joint beamforming and compression problem with per-antenna power constraints in the cooperative cellular network. We first establish the equivalence between the considered problem and its semidefinite relaxation (SDR). Then we further derive the partial Lagrangian dual of the SDR problem and show that the objective function of the obtained dual problem is differentiable. Based on the differentiability, we propose two efficient projected gradient ascent algorithms for solving the dual problem, which are projected exact gradient ascent (PEGA) and projected inexact gradient ascent (PIGA). While PEGA is guaranteed to find the global solution of the dual problem (and hence the global solution of the original problem), PIGA is more computationally efficient due to the lower complexity in inexactly computing the gradient. Global optimality and high efficiency of the proposed algorithms are demonstrated via numerical experiments.

Submitted 23 December, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

Comments: 5 pages, 2 figures, accepted for publication in IEEE ICASSP 2024
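
For readers unfamiliar with the projected gradient ascent family that PEGA and PIGA belong to, the generic iteration is "take a gradient step, then project back onto the feasible set". The sketch below shows that pattern on a toy concave problem. It is not the paper's dual problem or either of its algorithms: the objective, the nonnegativity constraint, the step size, and all function names are assumptions chosen for illustration.

```python
def projected_gradient_ascent(grad, project, x0, step=0.5, n_iters=100):
    """Generic projected gradient ascent: x <- project(x + step * grad(x))."""
    x = list(x0)
    for _ in range(n_iters):
        g = grad(x)
        x = project([xi + step * gi for xi, gi in zip(x, g)])
    return x


# Toy concave objective: g(x) = -0.5 * sum((x_i - c_i)^2), maximized over x >= 0.
c = [1.0, -2.0]


def toy_grad(x):
    """Gradient of the toy objective: c - x."""
    return [ci - xi for ci, xi in zip(c, x)]


def project_nonnegative(x):
    """Euclidean projection onto the nonnegative orthant."""
    return [max(0.0, xi) for xi in x]


solution = projected_gradient_ascent(toy_grad, project_nonnegative, [0.0, 0.0])
```

Here the unconstrained maximizer is `c` itself, so the projection clips the second coordinate to zero and the iterate approaches `[1.0, 0.0]`.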

arXiv:2309.04171 [pdf, other]

PRISTA-Net: Deep Iterative Shrinkage Thresholding Network for Coded Diffraction Patterns Phase Retrieval

Authors: Aoxu Liu, Xiaohong Fan, Yin Yang, Jianping Zhang

Abstract: The problem of phase retrieval (PR) involves recovering an unknown image from limited amplitude measurement data and is a challenging nonlinear inverse problem in computational imaging and image processing. However, many of the PR methods are based on black-box network models that lack interpretability and plug-and-play (PnP) frameworks that are computationally complex and require careful parameter tuning. To address this, we have developed PRISTA-Net, a deep unfolding network (DUN) based on the first-order iterative shrinkage thresholding algorithm (ISTA). This network utilizes a learnable nonlinear transformation to address the proximal-point mapping sub-problem associated with the sparse priors, and an attention mechanism to focus on phase information containing image edges, textures, and structures. Additionally, the fast Fourier transform (FFT) is used to learn global features to enhance local information, and the designed logarithmic-based loss function leads to significant improvements when the noise level is low. All parameters in the proposed PRISTA-Net framework, including the nonlinear transformation, threshold parameters, and step size, are learned end-to-end instead of being manually set. This method combines the interpretability of traditional methods with the fast inference ability of deep learning and is able to handle noise at each iteration during the unfolding stage, thus improving recovery quality.
Experiments on Coded Diffraction Patterns (CDPs) measurements demonstrate that our approach outperforms the existing state-of-the-art methods in terms of qualitative and quantitative evaluations. Our source codes are available at https://github.com/liuaxou/PRISTA-Net.

Submitted 8 September, 2023; originally announced September 2023.

Comments: 12 pages
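
As background for the abstract above, the classical ISTA iteration that deep unfolding networks build on alternates a gradient step on the data term with a soft-thresholding (shrinkage) step. The sketch below shows that textbook iteration for the linear problem of minimizing $\frac{1}{2}\|Ax-b\|^2 + \lambda\|x\|_1$. It is not PRISTA-Net: phase retrieval is nonlinear, and the network learns its transform, thresholds, and step size, whereas here the names, the fixed step, and the toy data are all assumptions.

```python
def soft_threshold(v, tau):
    """Proximal map of tau * |v|: shrink v toward zero by tau."""
    if v > tau:
        return v - tau
    if v < -tau:
        return v + tau
    return 0.0


def ista(A, b, lam, step, n_iters=200):
    """Classical ISTA for min_x 0.5 * ||A x - b||^2 + lam * ||x||_1.

    A is a list of rows; step should not exceed 1 / (largest eigenvalue of A^T A).
    """
    n = len(A[0])
    x = [0.0] * n
    for _ in range(n_iters):
        # Residual A x - b, then gradient A^T (A x - b) of the data term.
        residual = [sum(a * xj for a, xj in zip(row, x)) - bi for row, bi in zip(A, b)]
        grad = [sum(A[i][j] * residual[i] for i in range(len(A))) for j in range(n)]
        # Gradient step followed by shrinkage with threshold step * lam.
        x = [soft_threshold(xj - step * gj, step * lam) for xj, gj in zip(x, grad)]
    return x


# Toy denoising case with A = identity: the minimizer is soft_threshold(b, lam).
A = [[1.0, 0.0], [0.0, 1.0]]
b = [3.0, 0.5]
x_hat = ista(A, b, lam=1.0, step=1.0)
```

With the identity matrix the iteration reaches its fixed point immediately: the large entry of `b` is shrunk by `lam` and the small one is set to zero, which is the sparsity-promoting behaviour the shrinkage step is meant to provide.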

arXiv:2309.04166 [pdf, ps, other]

Metastable Charge Distribution Between Degenerate Landau Levels

Authors: Wenlu Lin, Xing Fan, Lili Zhao, Yoon Jang Chung, Adbhut Gupta, Kirk W. Baldwin, Loren Pfeiffer, Hong Lu, Yang Liu

Abstract: We study two-dimensional electron systems confined in wide quantum wells whose subband separation is comparable with the Zeeman energy. Two N = 0 Landau levels from different subbands and with opposite spins are pinned in energy when they cross each other and electrons can freely transfer between them. When the disorder is strong, we observe clear hysteresis in our data corresponding to instability of the electron distribution in the two crossing levels. When the intra-layer interaction dominates, multiple minima appear when a Landau level is 1/3 or 2/3 filled and the fractional quantum Hall effect can be stabilized.

Submitted 26 February, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

arXiv:2309.04051 [pdf]

Observation of Hybrid-Order Topological Pump in a Kekule-Textured Graphene Lattice

Authors: Tianzhi Xia, Yuzeng Li, Qicheng Zhang, Xiying Fan, Meng Xiao, Chunyin Qiu

Abstract: The Thouless charge pumping protocol provides an effective route for realizing topological particle transport. To date, the first-order and higher-order topological pumps, exhibiting transitions of edge-bulk-edge and corner-bulk-corner states, respectively, have been observed in a variety of experimental platforms. Here, we propose a concept of hybrid-order topological pump, which involves a transition of bulk, edge, and corner states simultaneously. More specifically, we consider a Kekulé-textured graphene lattice that features a tunable phase parameter. The finite sample of zigzag boundaries, where the corner configuration is abnormal and inaccessible by repeating unit cells, hosts topological responses at both the edges and corners. The former is protected by a nonzero winding number, while the latter can be explained by a nontrivial vector Chern number. Using our skillful acoustic experiments, we verify those nontrivial boundary landmarks and visualize the consequent hybrid-order topological pump process directly. This work deepens our understanding of higher-order topological phases and broadens the scope of topological pumps.

Submitted 7 September, 2023; originally announced September 2023.

Comments: 5 figures

arXiv:2309.03609 [pdf]

doi 10.1002/aelm.202300799

Conduction modulation of solution-processed two-dimensional materials

Authors: Songwei Liu, Xiaoyue Fan, Yingyi Wen, Pengyu Liu, Yang Liu, Jingfang Pei, Wenchen Yang, Lekai Song, Danmei Pan, Teng Ma, Yue Lin, Gang Wang, Guohua Hu

Abstract: Solution-processed two-dimensional (2D) materials hold promise for their scalable applications. However, the random, fragmented nature of the solution-processed nanoflakes and the poor percolative conduction through their discrete networks limit the performance of the enabled devices. To overcome the problem, we report conduction modulation of the solution-processed 2D materials via the Stark effect. Using liquid-phase exfoliated molybdenum disulfide (MoS2) as an example, we demonstrate nonlinear conduction modulation with a switching ratio of $>10^5$ by the local fields from the interfacial ferroelectric P(VDF-TrFE). Through density-functional theory calculations and in situ Raman scattering and photoluminescence spectroscopic analysis, we understand the modulation arises from a charge redistribution in the solution-processed MoS2. Beyond MoS2, we show the modulation may be viable for the other solution-processed 2D materials and low-dimensional materials. The effective modulation can open their electronic device applications.

Submitted 7 September, 2023; originally announced September 2023.

arXiv:2309.03548 [pdf, other]

Trash to Treasure: Low-Light Object Detection via Decomposition-and-Aggregation

Authors: Xiaohan Cui, Long Ma, Tengyu Ma, Jinyuan Liu, Xin Fan, Risheng Liu

Abstract: Object detection in low-light scenarios has attracted much attention in the past few years. A mainstream and representative scheme introduces enhancers as the pre-processing for regular detectors. However, because of the disparity in task objectives between the enhancer and detector, this paradigm cannot shine at its best ability. In this work, we try to arouse the potential of enhancer + detector. Different from existing works, we extend the illumination-based enhancers (our newly designed or existing) as a scene decomposition module, whose removed illumination is exploited as the auxiliary in the detector for extracting detection-friendly features. A semantic aggregation module is further established for integrating multi-scale scene-related semantic information in the context space. Actually, our built scheme successfully transforms the "trash" (i.e., the ignored illumination in the detector) into the "treasure" for the detector. Plenty of experiments are conducted to reveal our superiority against other state-of-the-art methods. The code will be public if it is accepted.

Submitted 7 September, 2023; originally announced September 2023.

arXiv:2309.02434 [pdf, other]

ReliTalk: Relightable Talking Portrait Generation from a Single Video

Authors: Haonan Qiu, Zhaoxi Chen, Yuming Jiang, Hang Zhou, Xiangyu Fan, Lei Yang, Wayne Wu, Ziwei Liu

Abstract: Recent years have witnessed great progress in creating vivid audio-driven portraits from monocular videos. However, how to seamlessly adapt the created video avatars to other scenarios with different backgrounds and lighting conditions remains unsolved. On the other hand, existing relighting studies mostly rely on dynamically lighted or multi-view data, which are too expensive for creating video portraits. To bridge this gap, we propose ReliTalk, a novel framework for relightable audio-driven talking portrait generation from monocular videos. Our key insight is to decompose the portrait's reflectance from implicitly learned audio-driven facial normals and images. Specifically, we involve 3D facial priors derived from audio features to predict delicate normal maps through implicit functions. These initially predicted normals then take a crucial part in reflectance decomposition by dynamically estimating the lighting condition of the given video. Moreover, the stereoscopic face representation is refined using the identity-consistent loss under simulated multiple lighting conditions, addressing the ill-posed problem caused by limited views available from a single monocular video. Extensive experiments validate the superiority of our proposed framework on both real and synthetic datasets. Our code is released in https://github.com/arthur-qiu/ReliTalk.

Submitted 5 September, 2023; originally announced September 2023.

arXiv:2309.01113 [pdf, other]

Hybrid-Supervised Dual-Search: Leveraging Automatic Learning for Loss-free Multi-Exposure Image Fusion

Authors: Guanyao Wu, Hongming Fu, Jinyuan Liu, Long Ma, Xin Fan, Risheng Liu

Abstract: Multi-exposure image fusion (MEF) has emerged as a prominent solution to address the limitations of digital imaging in representing varied exposure levels. Despite its advancements, the field grapples with challenges, notably the reliance on manual designs for network structures and loss functions, and the constraints of utilizing simulated reference images as ground truths. Consequently, current methodologies often suffer from color distortions and exposure artifacts, further complicating the quest for authentic image representation. In addressing these challenges, this paper presents a Hybrid-Supervised Dual-Search approach for MEF, dubbed HSDS-MEF, which introduces a bi-level optimization search scheme for automatic design of both network structures and loss functions. More specifically, we harness a unique dual research mechanism rooted in a novel weighted structure refinement architecture search. Besides, a hybrid supervised contrast constraint seamlessly guides and integrates with the searching process, facilitating a more adaptive and comprehensive search for optimal loss functions. We realize the state-of-the-art performance in comparison to various competitive schemes, yielding a 10.61% and 4.38% improvement in Visual Information Fidelity (VIF) for general and no-reference scenarios, respectively, while providing results with high contrast, rich details and colors.

Submitted 3 September, 2023; originally announced September 2023.

arXiv:2309.01106 [pdf, other]

AdvMono3D: Advanced Monocular 3D Object Detection with Depth-Aware Robust Adversarial Training

Authors: Xingyuan Li, Jinyuan Liu, Long Ma, Xin Fan, Risheng Liu

Abstract: Monocular 3D object detection plays a pivotal role in the field of autonomous driving and numerous deep learning-based methods have made significant breakthroughs in this area. Despite the advancements in detection accuracy and efficiency, these models tend to fail when faced with adversarial attacks, rendering them ineffective. Therefore, bolstering the adversarial robustness of 3D detection models has become a crucial issue that demands immediate attention and innovative solutions. To mitigate this issue, we propose a depth-aware robust adversarial training method for monocular 3D object detection, dubbed DART3D. Specifically, we first design an adversarial attack that iteratively degrades the 2D and 3D perception capabilities of 3D object detection models (IDP), which serves as the foundation for our subsequent defense mechanism. In response to this attack, we propose an uncertainty-based residual learning method for adversarial training. Our adversarial training approach capitalizes on the inherent uncertainty, enabling the model to significantly improve its robustness against adversarial attacks. We conducted extensive experiments on the KITTI 3D datasets, demonstrating that DART3D surpasses direct adversarial training (the most popular approach) under attacks in 3D object detection $AP_{R40}$ of car category for the Easy, Moderate, and Hard settings, with improvements of 4.415%, 4.112%, and 3.195%, respectively.

Submitted 3 September, 2023; originally announced September 2023.

arXiv:2309.01099 [pdf, other]

Enhancing Infrared Small Target Detection Robustness with Bi-Level Adversarial Framework

Authors: Zhu Liu, Zihang Chen, Jinyuan Liu, Long Ma, Xin Fan, Risheng Liu

Abstract: The detection of small infrared targets against blurred and cluttered backgrounds has remained an enduring challenge. In recent years, learning-based schemes have become the mainstream methodology to establish the mapping directly. However, these methods are susceptible to the inherent complexities of changing backgrounds and real-world disturbances, leading to unreliable and compromised target estimations. In this work, we propose a bi-level adversarial framework to promote the robustness of detection in the presence of distinct corruptions. We first propose a bi-level optimization formulation to introduce dynamic adversarial learning. Specifically, it is composed of the learnable generation of corruptions to maximize the losses as the lower-level objective and the robustness promotion of detectors as the upper-level one. We also provide a hierarchical reinforced learning strategy to discover the most detrimental corruptions and balance the performance between robustness and accuracy. To better disentangle the corruptions from salient features, we also propose a spatial-frequency interaction network for target detection. Extensive experiments demonstrate that our scheme improves IOU by 21.96% across a wide array of corruptions and by 4.97% on the general benchmark. The source codes are available at https://github.com/LiuZhu-CV/BALISTD.

Submitted 3 September, 2023; originally announced September 2023.

Comments: 9 pages, 6 figures

arXiv:2309.00872 [pdf, other]

doi 10.1145/3581783.3612436

Fearless Luminance Adaptation: A Macro-Micro-Hierarchical Transformer for Exposure Correction

Authors: Gehui Li, Jinyuan Liu, Long Ma, Zhiying Jiang, Xin Fan, Risheng Liu

Abstract: Photographs taken with less-than-ideal exposure settings often display poor visual quality. Since the correction procedures vary significantly, it is difficult for a single neural network to handle all exposure problems. Moreover, the inherent limitations of convolutions hinder the model's ability to restore faithful color or details in extremely over-/under-exposed regions. To overcome these limitations, we propose a Macro-Micro-Hierarchical transformer, which consists of a macro attention to capture long-range dependencies, a micro attention to extract local features, and a hierarchical structure for coarse-to-fine correction. Specifically, the complementary macro-micro attention designs enhance locality while allowing global interactions. The hierarchical structure enables the network to correct exposure errors of different scales layer by layer. Furthermore, we propose a contrast constraint and couple it seamlessly in the loss function, where the corrected image is pulled towards the positive sample and pushed away from the dynamically generated negative samples. Thus the remaining color distortion and loss of detail can be removed. We also extend our method as an image enhancer for low-light face recognition and low-light semantic segmentation. Experiments demonstrate that our approach obtains more attractive results than state-of-the-art methods quantitatively and qualitatively.

Submitted 17 December, 2023; v1 submitted 2 September, 2023; originally announced September 2023.

Comments: Accepted by ACM MM 2023

arXiv:2308.16101 [pdf]

Stripe charge order driven manipulation of Majorana bound states in 2M-WS2 topological superconductor

Authors: Xuemin Fan, Xiaoqi Sun, Penghao Zhu, Yuqiang Fang, Yongkang Ju, Yonghao Yuan, Fuqiang Huang, Taylor L. Hughes, Peizhe Tang, Qi-Kun Xue, Wei Li

Abstract: Majorana bound states (MBSs) are building blocks for topological quantum computing. They can be generated via the combination of electronic topology and superconductivity. To achieve logic operations via Majorana braiding, positional control of the MBS must be established. To this end, exotic co-existing phases or collective modes in an intrinsic topological superconductor can provide a tuning knob to manipulate the MBS. Here we report the observation of a striped surface charge order coexisting with superconductivity and its controllable tuning of the MBS in the topological superconductor 2M-WS2 using low-temperature scanning tunneling microscopy. By applying an out-of-plane magnetic field, we observe that MBS is absent in vortices in the region with strong stripe order. This is in contrast to adjacent underlying layers without charge order where vortex-bound MBSs are observed. Via theoretical simulations, we show that the surface stripe order does not destroy the bulk topology, but it can effectively modify the spatial distribution of MBS, i.e., it pushes them downward away from the 2M-WS2 surface. Our findings demonstrate that the interplay of charge order and topological superconductivity can be used to manipulate the positions of the MBS, and to explore new states of matter.

Submitted 30 August, 2023; originally announced August 2023.

Comments: 20 pages, 4 figures

arXiv:2308.15918 [pdf, other]

Physics-Informed DeepMRI: Bridging the Gap from Heat Diffusion to k-Space Interpolation

Authors: Zhuo-Xu Cui, Congcong Liu, Xiaohong Fan, Chentao Cao, Jing Cheng, Qingyong Zhu, Yuanyuan Liu, Sen Jia, Yihang Zhou, Haifeng Wang, Yanjie Zhu, Jianping Zhang, Qiegen Liu, Dong Liang

Abstract: In the field of parallel imaging (PI), alongside image-domain regularization methods, substantial research has been dedicated to exploring $k$-space interpolation. However, the interpretability of these methods remains an unresolved issue. Furthermore, these approaches currently face acceleration limitations that are comparable to those experienced by image-domain methods. In order to enhance interpretability and overcome the acceleration limitations, this paper introduces an interpretable framework that unifies both $k$-space interpolation techniques and image-domain methods, grounded in the physical principles of heat diffusion equations. Building upon this foundational framework, a novel $k$-space interpolation method is proposed. Specifically, we model the process of high-frequency information attenuation in $k$-space as a heat diffusion equation, while the effort to reconstruct high-frequency information from low-frequency regions can be conceptualized as a reverse heat equation. However, solving the reverse heat equation poses a challenging inverse problem. To tackle this challenge, we modify the heat equation to align with the principles of magnetic resonance PI physics and employ the score-based generative method to precisely execute the modified reverse heat diffusion. Finally, experimental validation conducted on publicly available datasets demonstrates the superiority of the proposed approach over traditional $k$-space interpolation methods, deep learning-based $k$-space interpolation methods, and conventional diffusion models in terms of reconstruction accuracy, particularly in high-frequency regions.

Submitted 30 August, 2023; originally announced August 2023.

arXiv:2308.15796 [pdf, other]

doi 10.1016/j.physletb.2023.138359

A Neural Network Approach for Orienting Heavy-Ion Collision Events

Authors: Zu-Xing Yang, Xiao-Hua Fan, Zhi-Pan Li, Shunji Nishimura

Abstract: A convolutional neural network-based classifier is elaborated to retrace the initial orientation of deformed nucleus-nucleus collisions by integrating multiple typical experimental observables. The isospin-dependent Boltzmann-Uehling-Uhlenbeck transport model is employed to generate data for random orientations of ultra-central uranium-uranium collisions at $E_\text{beam} = 1\, \text{GeV/nucleon}$. Statistically, the data-driven polarization scheme is essentially accomplished via the classifier, whose distinct categories filter out specific orientation-biased collision events. This will advance the deformed nucleus-based studies on nuclear symmetry energy, neutron skin, etc.

Submitted 28 November, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

arXiv:2308.15091 [pdf, other]

doi 10.1103/physrevc.108.034607

Impact of quadrupole deformation on intermediate-energy heavy-ion collisions

Authors: Xiao-Hua Fan, Zu-Xing Yang, Peng-Hui Chen, Shunji Nishimura, Zhi-Pan Li

Abstract: This study employs the isospin-dependent Boltzmann-Uehling-Uhlenbeck model to simulate intermediate-energy heavy-ion collisions between prolate nuclei $^{24}$Mg. The emphasis is on investigating the influence of centrality and orientation in several collision scenarios. The final-state particle multiplicities and anisotropic flows are primarily determined by the eccentricity and the area of the initial overlap. This not only provides feedback on the collision systems, but also, to some extent, provides a means to explore the fine structure inside deformed nuclei. Additionally, non-polarized collisions have been further discussed. These results contribute to the understanding of the geometric effects in nuclear reactions, and aid in the exploration of other information on reaction systems, such as the equation of state and nuclear high-momentum tail.

Submitted 29 August, 2023; originally announced August 2023.

arXiv:2308.14296 [pdf, other]

RecMind: Large Language Model Powered Agent For Recommendation

Authors: Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Yingzhen Yang

Abstract: While the recommendation system (RS) has advanced significantly through deep learning, current RS approaches usually train and fine-tune models on task-specific datasets, limiting their generalizability to new recommendation tasks and their ability to leverage external knowledge due to model scale and data size constraints. Thus, we designed an LLM-powered autonomous recommender agent, RecMind, which is capable of leveraging external knowledge, utilizing tools with careful planning to provide zero-shot personalized recommendations. We propose a Self-Inspiring algorithm to improve the planning ability. At each intermediate step, the LLM self-inspires to consider all previously explored states to plan for the next step. This mechanism greatly improves the model's ability to comprehend and utilize historical information in planning for recommendation. We evaluate RecMind's performance in various recommendation scenarios. Our experiment shows that RecMind outperforms existing zero/few-shot LLM-based recommendation baseline methods in various tasks and achieves comparable performance to a fully trained recommendation model P5.

Submitted 20 March, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

Comments: Accepted by NAACL 2024 (Findings)

arXiv:2308.13630 [pdf, other]

Degrees of Freedom: Search Cost and Self-consistency

Authors: Lijun Wang, Hongyu Zhao, Xiaodan Fan

Abstract: Model degrees of freedom ($\df$) is a fundamental concept in statistics because it quantifies the flexibility of a fitting procedure and is indispensable in model selection. The $\df$ is often intuitively equated with the number of independent variables in the fitting procedure. But for adaptive regressions that perform variable selection (e.g., the best subset regressions), the model $\df$ is larger than the number of selected variables. The excess part has been defined as the \emph{search degrees of freedom} ($\sdf$) to account for model selection. However, this definition is limited since it does not consider fitting procedures in augmented space, such as splines and regression trees; and it does not use the same fitting procedure for $\sdf$ and $\df$. For example, the lasso's $\sdf$ is defined through the \emph{relaxed} lasso's $\df$ instead of the lasso's $\df$. Here we propose a \emph{modified search degrees of freedom} ($\msdf$) to directly account for the cost of searching in the original or augmented space. Since many fitting procedures can be characterized by a linear operator, we define the search cost as the effort to determine such a linear operator. When we construct a linear operator for the lasso via the iterative ridge regression, $\msdf$ offers a new perspective for its search cost. For some complex procedures such as the multivariate adaptive regression splines (MARS), the search cost needs to be pre-determined to serve as a tuning parameter for the procedure itself, but it might be inaccurate. To investigate the inaccurate pre-determined search cost, we develop two concepts, \emph{nominal} $\df$ and \emph{actual} $\df$, and formulate a property named \emph{self-consistency} when there is no gap between the \emph{nominal} $\df$ and the \emph{actual} $\df$.

Submitted 25 August, 2023; originally announced August 2023.

arXiv:2308.12738 [pdf, other]

Learning Heavily-Degraded Prior for Underwater Object Detection

Authors: Chenping Fu, Xin Fan, Jiewen Xiao, Wanqi Yuan, Risheng Liu, Zhongxuan Luo

Abstract: Underwater object detection suffers from low detection performance because the distance- and wavelength-dependent imaging process yields evident image quality degradations such as haze-like effects, low visibility, and color distortions. Therefore, we commit to resolving the issue of underwater object detection with compounded environmental degradations. Typical approaches attempt to develop sophisticated deep architectures to generate high-quality images or features. However, these methods only work for limited ranges because imaging factors are either unstable, too sensitive, or compounded. Unlike these approaches catering for high-quality images or features, this paper seeks transferable prior knowledge from detector-friendly images. The prior guides detectors in removing degradations that interfere with detection. It is based on the statistical observation that the heavily degraded regions of detector-friendly (DFUI) and underwater images have evident feature distribution gaps, while their lightly degraded regions overlap each other. Therefore, we propose a residual feature transference module (RFTM) to learn a mapping between deep representations of the heavily degraded patches of DFUI and underwater images, and use the mapping as a heavily degraded prior (HDP) for underwater detection. Since the statistical properties are independent of image content, HDP can be learned without the supervision of semantic labels and plugged into popular CNN-based feature extraction networks to improve their performance on underwater object detection. Without bells and whistles, evaluations on URPC2020 and UODD show that our methods outperform CNN-based detectors by a large margin. Our method, with higher speeds and fewer parameters, still performs better than transformer-based detectors. Our code and DFUI dataset can be found at https://github.com/xiaoDetection/Learning-Heavily-Degraed-Prior.

Submitted 24 August, 2023; originally announced August 2023.

arXiv:2308.12331 [pdf, other]

doi 10.3847/2041-8213/ad0158

JWST CEERS & JADES Active Galaxies at z = 4-7 Violate the Local $M_\bullet-M_\star$ Relation at $>3σ$: Implications for Low-Mass Black Holes and Seeding Models

Authors: Fabio Pacucci, Bao Nguyen, Stefano Carniani, Roberto Maiolino, Xiaohui Fan

Abstract: JWST is revolutionizing our understanding of the high-$z$ Universe by expanding the black hole horizon, looking farther and to smaller masses, and revealing the stellar light of their hosts. By examining JWST galaxies at $z=4-7$ that host H$α$-detected black holes, we investigate (i) the high-$z$ $M_\bullet-M_\star$ relation and (ii) the black hole mass distribution, especially in its low-mass range ($M_\bullet \lesssim 10^{6.5} M_\odot$). With a detailed statistical analysis, our findings conclusively reveal a high-$z$ $M_\bullet-M_\star$ relation that deviates at $>3σ$ confidence level from the local relation. The high-$z$ relation is: $\log(M_\bullet/M_\odot) = -2.43^{+0.83}_{-0.83} + 1.06^{+0.09}_{-0.09} \log(M_\star/M_\odot)$. Black holes are overmassive by $\sim 10-100\times$ compared to their low-$z$ counterparts in galactic hosts of the same stellar mass. This fact is not due to a selection effect in surveys. Moreover, our analysis predicts the possibility of detecting in high-$z$ JWST surveys $5-15\times$ more black holes with $M_\bullet \lesssim 10^{6.5} M_\odot$, and $10-30\times$ more with $M_\bullet \lesssim 10^{8.5} M_\odot$, compared to local relation's predictions. The lighter black holes preferentially occupy galaxies with a stellar mass of $\sim 10^{7.5}-10^8 M_\odot$. We have yet to detect these sources because (i) they may be inactive (duty cycles $1\%-10\%$), (ii) the host overshines the AGN, or (iii) the AGN is obscured and not immediately recognizable by line diagnostics. A search of low-mass black holes in existing JWST surveys will further test the $M_\bullet-M_\star$ relation. Current JWST fields represent a treasure trove of black hole systems at $z = 4-7$; their detection will provide crucial insights into their early evolution and co-evolution with their galactic hosts.

Submitted 9 October, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

Comments: Accepted for publication in The Astrophysical Journal Letters. 14 pages, 5 figures

Journal ref: The Astrophysical Journal Letters, 2023, Volume 957, Number 1

arXiv:2308.12278 [pdf, other]

Predicting the Yields of $z$ > 6.5 Quasar Surveys in the Era of Roman and Rubin

Authors: Wei Leong Tee, Xiaohui Fan, Feige Wang, Jinyi Yang, Sangeeta Malhotra, James E. Rhoads

Abstract: Around 70 $z>6.5$ luminous quasars have been discovered, strongly biased toward the bright end, thus not providing a comprehensive view of quasar abundance beyond cosmic dawn. We present the predicted results of a Roman/Rubin high-redshift quasar survey, yielding 3 times more, $2-4$ magnitudes deeper quasar samples, probing high-redshift quasars across a broad range of luminosities, especially faint quasars at $L_\mathrm{bol}\sim 10^{10}\;L_{\odot}$ or $M_\mathrm{1450} \sim-22$ that are currently poorly explored. We include high-$z$ quasars, galactic dwarfs and low-$z$ compact galaxies with similar colors as quasar candidates. We create mock catalogs based on population models to evaluate selection completeness and efficiency. We utilize the classical color dropout method in the $z$ and $Y$ bands to select primary quasar candidates, followed by a Bayesian selection method to identify quasars. We show that the overall selection completeness is $> 80\%$ and the efficiency is $\sim 10\%$ at $6.5<z<9$, with 180 quasars at $z>6.5$, 20 at $z > 7.5$ and 2 at $z > 8.5$. The quasar yields depend sensitively on the assumed quasar luminosity shape and redshift evolution. Brown dwarf rejection through proper motion up to 50$\%$ can be made for stars brighter than 25 mag; low-$z$ galaxies dominate at fainter magnitudes. Our results show that Roman/Rubin are able to discover a statistical sample of the earliest and faintest quasars in the Universe. The valuable new datasets are worth follow-up studies with the James Webb Space Telescope and Extremely Large Telescopes, to determine the faint-end slope of the quasar luminosity function and constrain supermassive black hole growth in the early Universe.

Submitted 23 August, 2023; originally announced August 2023.

Comments: Accepted for publication in ApJ

arXiv:2308.11165 [pdf, other]

Improving Misaligned Multi-modality Image Fusion with One-stage Progressive Dense Registration

Authors: Di Wang, Jinyuan Liu, Long Ma, Risheng Liu, Xin Fan

Abstract: Misalignments between multi-modality images pose challenges in image fusion, manifesting as structural distortions and edge ghosts. Existing efforts commonly resort to registering first and fusing later, typically employing two cascaded stages for registration, i.e., coarse registration and fine registration. Both stages directly estimate the respective target deformation fields. In this paper, we argue that the separated two-stage registration is not compact, and the direct estimation of the target deformation fields is not accurate enough. To address these challenges, we propose a Cross-modality Multi-scale Progressive Dense Registration (C-MPDR) scheme, which accomplishes the coarse-to-fine registration exclusively using a one-stage optimization, thus improving the fusion performance of misaligned multi-modality images. Specifically, two pivotal components are involved, a dense Deformation Field Fusion (DFF) module and a Progressive Feature Fine (PFF) module. The DFF aggregates the predicted multi-scale deformation sub-fields at the current scale, while the PFF progressively refines the remaining misaligned features. Both work together to accurately estimate the final deformation fields. In addition, we develop a Transformer-Conv-based Fusion (TCF) subnetwork that considers local and long-range feature dependencies, allowing us to capture more informative features from the registered infrared and visible images for the generation of high-quality fused images. Extensive experimental analysis demonstrates the superiority of the proposed method in the fusion of misaligned cross-modality images.

Submitted 21 August, 2023; originally announced August 2023.

arXiv:2308.05880 [pdf, other]

Smart Data Mapping for Connecting Power System Model and Geospatial Data

Authors: Xue Li, Kishan Prudhvi Guddanti, Samrat Acharya, Patrick Royer, Xiaoyuan Fan, Marcelo Elizondo

Abstract: Knowing the geospatial locations of power system model elements and linking load models with end users and their communities are the foundation for analyzing system resilience and vulnerability to natural hazards. However, power system models and geospatial data for power grid assets are often developed asynchronously without close coordination. Creating a direct mapping between the two is a challenging task, mainly due to heterogeneous data structures, target uses, historical legacies, and human errors. This work aims to build an automatic data mapping workflow to connect the two, and to support energy grid resilience studies for Puerto Rico. The primary steps in this workflow include constructing graphs using geospatial data, and aligning them to the transmission networks defined in the power system data. The results have been evaluated against existing manual mapping practices for part of the Puerto Rico Power Grid model to illustrate the performance of such auto-mapping solutions.

Submitted 10 August, 2023; originally announced August 2023.

Comments: 5 pages, 1 figure, 1 table

arXiv:2308.04614 [pdf, other]

Probing Ultra-late Reionization: Direct Measurements of the Mean Free Path over $5<z<6$

Authors: Yongda Zhu, George D. Becker, Holly M. Christenson, Anson D'Aloisio, Sarah E. I. Bosman, Tom Bakx, Valentina D'Odorico, Manuela Bischetti, Christopher Cain, Frederick B. Davies, Rebecca L. Davies, Anna-Christina Eilers, Xiaohui Fan, Prakash Gaikwad, Martin G. Haehnelt, Laura C. Keating, Girish Kulkarni, Samuel Lai, Hai-Xia Ma, Andrei Mesinger, Yuxiang Qin, Sindhu Satyavolu, Tsutomu T. Takeuchi, Hideki Umehata, Jinyi Yang

Abstract: The mean free path of ionizing photons, $λ_{\rm mfp}$, is a critical parameter for modeling the intergalactic medium (IGM) both during and after reionization. We present direct measurements of $λ_{\rm mfp}$ from QSO spectra over the redshift range $5<z<6$, including the first measurements at $z\simeq5.3$ and 5.6. Our sample includes data from the XQR-30 VLT large program, as well as new Keck/ESI observations of QSOs near $z \sim 5.5$, for which we also acquire new [C II] 158$μ$m redshifts with ALMA. By measuring the Lyman continuum transmission profile in stacked QSO spectra, we find $λ_{\rm mfp} = 9.33_{-1.80}^{+2.06}$, $5.40_{-1.40}^{+1.47}$, $3.31_{-1.34}^{+2.74}$, and $0.81_{-0.48}^{+0.73}$ pMpc at $z=5.08$, 5.31, 5.65, and 5.93, respectively. Our results demonstrate that $λ_{\rm mfp}$ increases steadily and rapidly with time over $5<z<6$. Notably, we find that $λ_{\rm mfp}$ deviates significantly from predictions based on a fully ionized and relaxed IGM as late as $z=5.3$. By comparing our results to model predictions and indirect $λ_{\rm mfp}$ constraints based on IGM Ly$α$ opacity, we find that the $λ_{\rm mfp}$ evolution is consistent with scenarios wherein the IGM is still undergoing reionization and/or retains large fluctuations in the ionizing UV background well below redshift six.

Submitted 8 August, 2023; originally announced August 2023.

Comments: 17 pages, 6 figures, 2 tables; accepted for publication in ApJ

arXiv:2308.03979 [pdf, other]

PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant Semantic Segmentation

Authors: Zhu Liu, Jinyuan Liu, Benzhuang Zhang, Long Ma, Xin Fan, Risheng Liu

Abstract: Infrared and visible image fusion is a powerful technique that combines complementary information from different modalities for downstream semantic perception tasks. Existing learning-based methods show remarkable performance, but suffer from an inherent vulnerability to adversarial attacks, causing a significant decrease in accuracy. In this work, a perception-aware fusion framework is proposed to promote segmentation robustness in adversarial scenes. We first conduct systematic analyses of the components of image fusion, investigating the correlation with segmentation robustness under adversarial perturbations. Based on these analyses, we propose a harmonized architecture search with a decomposition-based structure to balance standard accuracy and robustness. We also propose an adaptive learning strategy to improve the parameter robustness of image fusion, which can learn effective feature extraction under diverse adversarial perturbations. Thus, the goals of image fusion (i.e., extracting complementary features from source modalities and defending against attacks) can be realized from the perspectives of architectural and learning strategies. Extensive experimental results demonstrate that our scheme substantially enhances the robustness, with gains of 15.3% mIOU of segmentation in the adversarial scene, compared with advanced competitors. The source codes are available at https://github.com/LiuZhu-CV/PAIF.

Submitted 7 August, 2023; originally announced August 2023.

Comments: Accepted by ACM MM'2023; the source code is available at https://github.com/LiuZhu-CV/PAIF

arXiv:2308.03807 [pdf, other]

doi 10.1109/TCI.2023.3315853

Nest-DGIL: Nesterov-optimized Deep Geometric Incremental Learning for CS Image Reconstruction

Authors: Xiaohong Fan, Yin Yang, Ke Chen, Yujie Feng, Jian** Zhang

Abstract: Proximal gradient-based optimization is one of the most common strategies for solving image inverse problems, and it is easy to implement. However, these techniques often generate heavy artifacts in image reconstruction. One of the most popular refinement methods is to fine-tune the regularization parameter to alleviate such artifacts, but it may not always be sufficient or applicable due to increased computational costs. In this work, we propose a deep geometric incremental learning framework based on the second Nesterov proximal gradient optimization. The proposed end-to-end network not only has a powerful learning ability for high-/low-frequency image features, but can also theoretically guarantee that geometric texture details will be reconstructed from the preliminary linear reconstruction. Furthermore, it can avoid the risk of intermediate reconstruction results falling outside the geometric decomposition domains and achieve fast convergence. Our reconstruction framework is decomposed into four modules: general linear reconstruction, cascade geometric incremental restoration, Nesterov acceleration, and post-processing. In the image restoration step, a cascade geometric incremental learning module is designed to compensate for missing texture information from different geometric spectral decomposition domains. Inspired by the overlap-tile strategy, we also develop a post-processing module to remove the block effect in patch-wise natural image reconstruction. All parameters in the proposed model are learnable, and an adaptive initialization technique for the physical parameters is also employed to make the model flexible and ensure smooth convergence. We compare the reconstruction performance of the proposed method with existing state-of-the-art methods to demonstrate its superiority. Our source code is available at https://github.com/fanxiaohong/Nest-DGIL.

Submitted 11 October, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

Comments: 15 pages; our source code is available at https://github.com/fanxiaohong/Nest-DGIL

Journal ref: This work is published in IEEE Transactions on Computational Imaging, vol. 9, pp. 819-833, 2023

arXiv:2308.03381 [pdf, other]

Bilevel Generative Learning for Low-Light Vision

Authors: Yingchi Liu, Zhu Liu, Long Ma, **yuan Liu, Xin Fan, Zhongxuan Luo, Risheng Liu

Abstract: Recently, there has been a growing interest in constructing deep learning schemes for Low-Light Vision (LLV). Existing techniques primarily focus on designing task-specific and data-dependent vision models on the standard RGB domain, which inherently contain latent data associations. In this study, we propose a generic low-light vision solution by introducing a generative block to convert data from the RAW to the RGB domain. This novel approach connects diverse vision problems by explicitly depicting data generation, which is the first in the field. To precisely characterize the latent correspondence between the generative procedure and the vision task, we establish a bilevel model with the parameters of the generative block defined as the upper level and the parameters of the vision task defined as the lower level. We further develop two types of learning strategies targeting different goals, namely low cost and high accuracy, to acquire a new bilevel generative learning paradigm. The generative blocks embrace a strong generalization ability in other low-light vision tasks through the bilevel optimization on enhancement tasks. Extensive experimental evaluations on three representative low-light vision tasks, namely enhancement, detection, and segmentation, fully demonstrate the superiority of our proposed approach. The code will be available at https://github.com/Yingchi1998/BGL.

Submitted 7 August, 2023; originally announced August 2023.

Comments: Accepted by ACM MM'2023; the code will be available at https://github.com/Yingchi1998/BGL

arXiv:2308.03032 [pdf, other]

doi 10.1088/1674-4527/ad0498

Understanding the predication mechanism of deep learning through error propagation among parameters in strong lensing case

Authors: Xilong Fan, Peizheng Wang, ** Li, Nan Yang

Abstract: The error propagation among estimated parameters reflects the correlation among the parameters. We study the capability of machine learning to "learn" the correlation of estimated parameters. We show that machine learning can recover the relation between the uncertainties of different parameters, in particular as predicted by the error propagation formula. Gravitational lensing can be used to probe both astrophysics and cosmology. As a practical application, we show that machine learning is able to find the error propagation among the gravitational lens parameters (effective lens mass $M_{L}$ and Einstein radius $θ_{E}$) in accordance with the theoretical formula for the singular isothermal ellipse (SIE) lens model. The relation between the errors of the lens mass and the Einstein radius (e.g., the ratio of standard deviations $\mathcal{F}=σ_{\hat{M_{L}}}/σ_{\hat{θ_{E}}}$) predicted by the deep convolutional neural network is consistent with the error propagation formula of the SIE lens model. As a proof-of-principle test, a toy model of a linear relation with Gaussian noise is presented. We found that the predictions obtained by machine learning indeed carry information about the law of error propagation and the distribution of noise. Error propagation plays a crucial role in identifying a physical relation among parameters, rather than a coincidental one; we therefore anticipate that our case study on the error propagation of machine learning predictions could extend to other physical systems in searching for correlations among parameters.

Submitted 9 January, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

Journal ref: Research in Astronomy and Astrophysics 23.12 (2023): 125022
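
The toy model mentioned in the abstract above (a linear relation with Gaussian noise) can be illustrated with first-order error propagation: for $y = a x + b$, the standard deviations satisfy $σ_y = |a| σ_x$. The following is a minimal Python sketch under that assumption only. The slope `a`, intercept `b`, noise level `sigma_x`, and the function name are hypothetical choices for illustration, not values or code from the paper, and the sketch does not model the SIE lens or any neural network.

```python
import random
import statistics


def std_ratio_linear(a, b, sigma_x, n=10_000, seed=0):
    """Return stdev(y) / stdev(x) for y = a*x + b with Gaussian x.

    Illustrative only: a, b and sigma_x are made-up values, not taken
    from the paper.
    """
    rng = random.Random(seed)
    # Draw Gaussian "measurements" of x with standard deviation sigma_x.
    xs = [rng.gauss(0.0, sigma_x) for _ in range(n)]
    # Propagate each sample through the linear relation.
    ys = [a * x + b for x in xs]
    return statistics.stdev(ys) / statistics.stdev(xs)


# For a linear map the ratio of standard deviations equals |a|
# up to floating-point rounding, independent of b and sigma_x.
ratio = std_ratio_linear(a=-2.5, b=1.0, sigma_x=0.3)
print(ratio)
```

For a nonlinear relation the same ratio would hold only approximately, with $|a|$ replaced by the magnitude of the local derivative.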

arXiv:2308.02097 [pdf, other]

Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation

Authors: **yuan Liu, Zhu Liu, Guanyao Wu, Long Ma, Risheng Liu, Wei Zhong, Zhongxuan Luo, Xin Fan

Abstract: Multi-modality image fusion and segmentation play a vital role in autonomous driving and robotic operation. Early efforts focus on boosting the performance for only one task, e.g., fusion or segmentation, making it hard to reach the 'Best of Both Worlds'. To overcome this issue, in this paper, we propose a Multi-interactive Feature learning architecture for image fusion and Segmentation, namely SegMiF, and exploit dual-task correlation to promote the performance of both tasks. SegMiF has a cascade structure, containing a fusion sub-network and a commonly used segmentation sub-network. By slickly bridging intermediate features between the two components, the knowledge learned from the segmentation task can effectively assist the fusion task. Also, the benefited fusion network supports the segmentation one to perform more pretentiously. Besides, a hierarchical interactive attention block is established to ensure fine-grained mapping of all the vital information between the two tasks, so that the modality/semantic features can be fully mutual-interactive. In addition, a dynamic weight factor is introduced to automatically adjust the corresponding weights of each task, which can balance the interactive feature correspondence and break through the limitation of laborious tuning. Furthermore, we construct a smart multi-wave binocular imaging system and collect a full-time multi-modality benchmark with 15 annotated pixel-level categories for image fusion and segmentation. Extensive experiments on several public datasets and our benchmark demonstrate that the proposed method outputs visually appealing fused images and achieves on average $7.66\%$ higher segmentation mIoU in the real-world scene than the state-of-the-art approaches. The source code and benchmark are available at https://github.com/**yuanLiu-CV/SegMiF.

Submitted 3 August, 2023; originally announced August 2023.

Comments: Accepted by ICCV 2023. The source code and benchmark are available at https://github.com/**yuanLiu-CV/SegMiF

arXiv:2308.01474 [pdf, other]

doi 10.1145/3594556.3594626

Decentralized Translator of Trust: Supporting Heterogeneous TEE for Critical Infrastructure Protection

Authors: Rabimba Karanjai, Rowan Collier, Zhimin Gao, Lin Chen, Xinxin Fan, Taeweon Suh, Weidong Shi, Lei Xu

Abstract: Trusted execution environment (TEE) technology has found many applications in mitigating various security risks in an efficient manner, which is attractive for critical infrastructure protection. First, the nature of critical infrastructure requires it to be well protected from various cyber attacks. Second, performance is usually important for critical infrastructure, and it cannot afford an expensive protection mechanism. While a large number of TEE-based critical infrastructure protection systems have been proposed to address various security challenges (e.g., secure sensing and reliable control), most existing works ignore one important feature, i.e., the devices that comprise the critical infrastructure may be equipped with multiple incompatible TEE technologies and belong to different owners. This feature makes it hard for these devices to establish mutual trust and form a unified TEE environment. To address these challenges and fully unleash the potential of TEE technology for critical infrastructure protection, we propose DHTee, a decentralized coordination mechanism. DHTee uses blockchain technology to support key TEE functions in a heterogeneous TEE environment, especially the attestation service. A device equipped with one TEE can interact securely with the blockchain to verify whether another potential collaborating device claiming to have a different TEE meets the security requirements. DHTee is also flexible and can support new TEE schemes without affecting devices using existing TEEs that have been supported by the system.

Submitted 2 August, 2023; originally announced August 2023.

Comments: Appeared in ACM BSCI'23

Journal ref: 12 September 2023

arXiv:2308.00953 [pdf]

doi 10.1021/acs.nanolett.3c02015

Gate-Tunable Critical Current of the Three-Dimensional Niobium Nano-Bridge Josephson Junction

Authors: Shujie Yu, Lei Chen, Yin** Pan, Yue Wang, Denghui Zhang, Guangting Wu, Xinxin Fan, Xiaoyu Liu, Ling Wu, Lu Zhang, Wei Peng, Jie Ren, Zhen Wang

Abstract: Recent studies have shown that the critical currents of several metallic superconducting nanowires and Dayem bridges can be locally tuned using a gate voltage $V_g$. Here, we report a gate-tunable Josephson junction structure constructed from a three-dimensional (3D) niobium nano-bridge junction (NBJ) with a voltage gate on top. Measurements up to 6 K showed that the critical current of this structure can be tuned to zero by increasing $V_g$. The critical gate voltage $V_{gc}$ was reduced to 16 V and may possibly be reduced further by reducing the thickness of the insulation layer between the gate and the NBJ. Furthermore, the flux modulation generated by Josephson interference of two parallel 3D NBJs can also be tuned using $V_g$ in a similar manner. Therefore, we believe that this gate-tunable Josephson junction structure is promising for superconducting circuit fabrication at high integration levels.

Submitted 2 August, 2023; originally announced August 2023.

Comments: 15 pages, 5 figures

arXiv:2308.00931 [pdf, other]

WaterFlow: Heuristic Normalizing Flow for Underwater Image Enhancement and Beyond

Authors: Zengxi Zhang, Zhiying Jiang, **yuan Liu, Xin Fan, Risheng Liu

Abstract: Underwater images suffer from light refraction and absorption, which impair visibility and interfere with subsequent applications. Existing underwater image enhancement methods mainly focus on image quality improvement, ignoring the effect on practical applications. To balance visual quality and application, we propose a heuristic normalizing flow for detection-driven underwater image enhancement, dubbed WaterFlow. Specifically, we first develop an invertible mapping to achieve the translation between the degraded image and its clear counterpart. Considering differentiability and interpretability, we incorporate the heuristic prior into the data-driven mapping procedure, where the ambient light and medium transmission coefficient benefit credible generation. Furthermore, we introduce a detection perception module to transmit implicit semantic guidance into the enhancement procedure, where the enhanced images hold more detection-favorable features and are able to promote detection performance. Extensive experiments demonstrate the superiority of WaterFlow over state-of-the-art methods, both quantitatively and qualitatively.

Submitted 2 August, 2023; originally announced August 2023.

Comments: 10 pages, 13 figures

arXiv:2307.16741 [pdf, other]

Multi-Spectral Image Stitching via Spatial Graph Reasoning

Authors: Zhiying Jiang, Zengxi Zhang, **yuan Liu, Xin Fan, Risheng Liu

Abstract: Multi-spectral image stitching leverages the complementarity between infrared and visible images to generate a robust and reliable wide field-of-view (FOV) scene. The primary challenge of this task is to explore the relations between multi-spectral images for aligning and integrating multi-view scenes. Capitalizing on the strengths of Graph Convolutional Networks (GCNs) in modeling feature relationships, we propose a spatial graph reasoning based multi-spectral image stitching method that effectively distills the deformation and integration of multi-spectral images across different viewpoints. To accomplish this, we embed multi-scale complementary features from the same view position into a set of nodes. The correspondence across different views is learned through powerful dense feature embeddings, where both inter- and intra-correlations are developed to exploit cross-view matching and enhance inner feature disparity. By introducing long-range coherence along spatial and channel dimensions, the complementarity of pixel relations and channel interdependencies aids in the reconstruction of aligned multi-view features, generating informative and reliable wide FOV scenes. Moreover, we release a challenging dataset named ChaMS, comprising both real-world and synthetic sets with significant parallax, providing a new option for comprehensive evaluation. Extensive experiments demonstrate that our method surpasses state-of-the-art methods.

Submitted 31 July, 2023; originally announced July 2023.

Comments: 9 pages, 5 figures

arXiv:2307.15257 [pdf, other]

Learning with Constraint Learning: New Perspective, Solution Strategy and Various Applications

Authors: Risheng Liu, Jiaxin Gao, Xuan Liu, Xin Fan

Abstract: The complexity of learning problems, such as Generative Adversarial Network (GAN) and its variants, multi-task and meta-learning, hyper-parameter learning, and a variety of real-world vision applications, demands a deeper understanding of their underlying coupling mechanisms. Existing approaches often address these problems in isolation, lacking a unified perspective that can reveal commonalities and enable effective solutions. Therefore, in this work, we propose a new framework, named Learning with Constraint Learning (LwCL), that can holistically examine challenges and provide a unified methodology to tackle all the above-mentioned complex learning and vision problems. Specifically, LwCL is designed as a general hierarchical optimization model that captures the essence of these diverse learning and vision problems. Furthermore, we develop a gradient-response based fast solution strategy to overcome optimization challenges of the LwCL framework. Our proposed framework efficiently addresses a wide range of applications in learning and vision, encompassing three categories and nine different problem types. Extensive experiments on synthetic tasks and real-world applications verify the effectiveness of our approach. The LwCL framework offers a comprehensive solution for tackling complex machine learning and computer vision problems, bridging the gap between theory and practice.

Submitted 27 July, 2023; originally announced July 2023.

arXiv:2307.12213 [pdf, other]

LiveRetro: Visual Analytics for Strategic Retrospect in Livestream E-Commerce

Authors: Yuchen Wu, Yuansong Xu, Shenghan Gao, Xingbo Wang, Wenkai Song, Zhiheng Nie, Xiaomeng Fan, Quan Li

Abstract: Livestream e-commerce integrates live streaming and online shopping, allowing viewers to make purchases while watching. However, effective marketing strategies remain a challenge due to limited empirical research and subjective biases from the absence of quantitative data. Current tools fail to capture the interdependence between live performances and feedback. This study identified computational features, formulated design requirements, and developed LiveRetro, an interactive visual analytics system. It enables comprehensive retrospective analysis of livestream e-commerce for streamers, viewers, and merchandise. LiveRetro employs enhanced visualization and time-series forecasting models to align performance features and feedback, identifying influences at channel, merchandise, feature, and segment levels. Through case studies and expert interviews, the system provides deep insights into the relationship between live performance and streaming statistics, enabling efficient strategic analysis from multiple perspectives.

Submitted 2 August, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

Comments: Accepted by IEEE VIS 2023

arXiv:2307.11429 [pdf, other]

Emergence of solid-like Debye scaling in the vibrational density of states of liquids under nanoconfinement

Authors: Yuanxi Yu, Sha **, Xue Fan, Mona Sarter, Dehong Yu, Matteo Baggioli, Liang Hong

Abstract: At frequencies higher than the inverse of the structural relaxation time $τ$, the dynamics of liquids display several solid-like properties, including propagating collective shear waves and emergent elasticity. However, in classical bulk liquids, where $τ$ is typically of the order of 1 ps or less, this solid-like behavior remains elusive in the low-frequency region of the vibrational density of states (VDOS). Here, we provide compelling evidence for the emergent solid-like nature of liquids at short distances through inelastic neutron scattering measurements of the low-frequency VDOS in liquid water and glycerol confined within graphene oxide membranes. In particular, upon increasing the strength of confinement, we observe a transition from a liquid-like VDOS (linear in the frequency $ω$) to a solid-like behavior (Debye law, $\sim ω^2$) in the range of $1$-$4$ meV. Molecular dynamics simulations confirm these findings and reveal additional solid-like features, including propagating collective shear waves and a reduction in the self-diffusion constant. Finally, we show that the onset of solid-like dynamics is pushed towards low frequency along with the slowing-down of the relaxation processes upon confinement, and that the scale at which solidity emerges is qualitatively compatible with k-gap theory and the concept of gapped momentum states. Our results provide convincing experimental evidence of the continuity between liquids and solids, as originally advocated by Frenkel and Maxwell, and a deeper understanding of the dynamics of liquids across a wide range of length scales.

Submitted 8 February, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

Comments: v2: revised version

arXiv:2307.08953 [pdf, other]

doi 10.1007/JHEP11(2023)141

Dynamics of null particles and shadow for general rotating black hole

Authors: Kun Meng, Xi-Long Fan, Song Li, Wen-Biao Han, Hongsheng Zhang

Abstract: The Johannsen black hole (BH) is a generic rotating BH admitting three constants of motion (energy, angular momentum, and Carter constant) and is characterized by four deviation parameters besides mass and spin, which could be a model-independent probe of the no-hair theorem. We systematically study the dynamics of null particles around the Johannsen BH, revealing the effects of the deviation parameters on the BH shadow as well as the effects of spin. By using the shadow boundaries of M87* and SgrA*, the deviation parameters of those BHs are constrained for the first time. The detailed results depend on the spin $a$ and inclination angle $θ_0$. Assuming $a=0.2$ and $θ_0=15^{\circ}$, the deviation parameter $α_{13}$ is constrained within $\sim$ [-3.5, 6] for the M87* observation and [-3, 0.5] for the SgrA* observation. We also show images of a Johannsen BH surrounded by a Page-Thorne thin accretion disk as seen by a remote observer, obtained with a ray-tracing method, and discuss the effects of the deviation parameters on deforming the accretion disk image, which could be tested by observations with higher sensitivities in the future.

Submitted 28 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

Comments: 21 pages, 9 figures

Journal ref: J. High Energ. Phys. 2023, 141 (2023)

arXiv:2307.06554 [pdf, other]

TPU as Cryptographic Accelerator

Authors: Rabimba Karanjai, Sangwon Shin, Xinxin Fan, Lin Chen, Tianwei Zhang, Taeweon Suh, Weidong Shi, Lei Xu

Abstract: Polynomials defined on specific rings are heavily involved in various cryptographic schemes, and the corresponding operations are usually the computation bottleneck of the whole scheme. We propose to utilize TPU, an emerging type of hardware designed for AI applications, to speed up polynomial operations and convert TPU to a cryptographic accelerator. We also conduct a preliminary evaluation and discuss the limitations of the current work and future plans.

Submitted 13 July, 2023; originally announced July 2023.

arXiv:2307.03536 [pdf, other]

Joint Perceptual Learning for Enhancement and Object Detection in Underwater Scenarios

Authors: Chen** Fu, Wanqi Yuan, Jiewen Xiao, Risheng Liu, Xin Fan

Abstract: Underwater degraded images greatly challenge existing algorithms to detect objects of interest. Recently, researchers have attempted to adopt attention mechanisms or composite connections to improve the feature representation of detectors. However, this solution does not eliminate the impact of degradation on image content such as color and texture, achieving minimal improvements. Another feasible solution for underwater object detection is to develop sophisticated deep architectures in order to enhance image quality or features. Nevertheless, the visually appealing output of these enhancement modules does not necessarily yield high accuracy for deep detectors. More recently, some multi-task learning methods jointly learn underwater detection and image enhancement, achieving promising improvements. Typically, these methods involve huge architectures and expensive computation, making inference inefficient. Clearly, underwater object detection and image enhancement are two interrelated tasks, and leveraging information from both can benefit each task. Based on these observations, we propose a bilevel optimization formulation for jointly learning underwater object detection and image enhancement, and then unroll it into a dual perception network (DPNet) for the two tasks. DPNet, with one shared module and two task subnets, learns from the two different tasks, seeking a shared representation. The shared representation provides more structural details for image enhancement and rich content information for object detection. Finally, we derive a cooperative training strategy to optimize the parameters of DPNet. Extensive experiments on real-world and synthetic underwater datasets demonstrate that our method outputs visually favorable images and higher detection accuracy.

Submitted 7 July, 2023; originally announced July 2023.

arXiv:2307.02701 [pdf]

Touch, press and stroke: a soft capacitive sensor skin

Authors: Mirza S. Sarwar, Ryusuke Ishizaki, Kieran Morton, Claire Preston, Tan Nguyen, Xu Fan, Bertille Dupont, Leanna Hogarth, Takahide Yoshiike, Shahriar Mirabbasi, John D. W. Madden

Abstract: Soft sensors that can discriminate shear and normal force could help provide machines the fine control desirable for safe and effective physical interactions with people. A capacitive sensor is made for this purpose, composed of patterned elastomer and containing both fixed and sliding pillars that allow the sensor to deform and buckle, much like skin itself. The sensor differentiates between simultaneously applied pressure and shear. In addition, finger proximity is detectable up to 15 mm, with a pressure and shear sensitivity of 1 kPa and a displacement resolution of 50 $μ$m. The operation is demonstrated on a simple gripper holding a cup. The combination of features and the straightforward fabrication method make this sensor a candidate for implementation as a sensing skin for humanoid robotics applications.

Submitted 5 July, 2023; originally announced July 2023.

Comments: 9 pages, 5 figures, submitted to Scientific Reports Nature

arXiv:2307.01748 [pdf, other]

Monotone Cubic B-Splines with a Neural-Network Generator

Authors: Lijun Wang, Xiaodan Fan, Huabai Li, Jun S. Liu

Abstract: We present a method for fitting monotone curves using cubic B-splines, which is equivalent to putting a monotonicity constraint on the coefficients. We explore different ways of enforcing this constraint and analyze their theoretical and empirical properties. We propose two algorithms for solving the spline fitting problem: one that uses standard optimization techniques and one that trains a Multi-Layer Perceptron (MLP) generator to approximate the solutions under various settings and perturbations. The generator approach can speed up the fitting process when we need to solve the problem repeatedly, such as when constructing confidence bands using bootstrap. We evaluate our method against several existing methods, some of which do not use the monotonicity constraint, on some monotone curves with varying noise levels. We demonstrate that our method outperforms the other methods, especially in high-noise scenarios. We also apply our method to analyze the polarization-hole phenomenon during star formation in astrophysics. The source code is accessible at https://github.com/szcf-weiya/MonotoneSplines.jl.

Submitted 17 November, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

arXiv:2307.01084 [pdf, ps, other]

Wasserstein-$1$ distance and nonuniform Berry-Esseen bound for a supercritical branching process in a random environment

Authors: Hao Wu, Xiequan Fan, Zhiqiang Gao, Yinna Ye

Abstract: Let $ (Z_{n})_{n\geq 0} $ be a supercritical branching process in an independent and identically distributed random environment. We establish an optimal convergence rate in the Wasserstein-$1$ distance for the process $ (Z_{n})_{n\geq 0} $, which completes a result of Grama et al. [Stochastic Process. Appl., 127(4), 1255-1281, 2017]. Moreover, an exponential nonuniform Berry-Esseen bound is also given. At last, some applications of the main results to the confidence interval estimation for the criticality parameter and the population size $Z_n$ are discussed.

Submitted 4 December, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

Comments: Corrected typos, updated publication information. 19 pages, published in "Journal of Mathematical Research with Applications" (ISSN: 2095-2651), 2023, 43(6): 737-753

MSC Class: 60J80; 60K37; 60F05; 62E20

arXiv:2306.16413 [pdf, other]

MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning

Authors: Paul Pu Liang, Yiwei Lyu, Xiang Fan, Arav Agarwal, Yun Cheng, Louis-Philippe Morency, Ruslan Salakhutdinov

Abstract: Learning multimodal representations involves integrating information from multiple heterogeneous sources of data. In order to accelerate progress towards understudied modalities and tasks while ensuring real-world robustness, we release MultiZoo, a public toolkit consisting of standardized implementations of > 20 core multimodal algorithms and MultiBench, a large-scale benchmark spanning 15 datasets, 10 modalities, 20 prediction tasks, and 6 research areas. Together, these provide an automated end-to-end machine learning pipeline that simplifies and standardizes data loading, experimental setup, and model evaluation. To enable holistic evaluation, we offer a comprehensive methodology to assess (1) generalization, (2) time and space complexity, and (3) modality robustness. MultiBench paves the way towards a better understanding of the capabilities and limitations of multimodal models, while ensuring ease of use, accessibility, and reproducibility. Our toolkits are publicly available, will be regularly updated, and welcome inputs from the community.

Submitted 28 June, 2023; originally announced June 2023.

Comments: JMLR Open Source Software 2023, Code available at https://github.com/pliang279/MultiBench

arXiv:2306.15980 [pdf, ps, other]

Sharp moderate and large deviations for sample quantiles

Authors: Xiequan Fan

Abstract: In this article, we discuss the sharp moderate and large deviations between the quantiles of population and the quantiles of samples. Cramér type moderate deviations and Bahadur-Rao type large deviations are established with some mild conditions. The results refine the moderate and large deviation principles of Xu and Miao [Filomat 2011; 25(2): 197-206].

Submitted 28 June, 2023; originally announced June 2023.

Comments: 10 pages

MSC Class: 60E15; 60F10; 62G30

Journal ref: Statistics and Probability Letters 2023

Showing 151–200 of 1,411 results for author: Fan, X