Search | arXiv e-print repository

Exploiting the dynamics of commodity futures curves

Authors: Robert J Bianchi, John Hua Fan, Joelle Miffre, Tingxi Zhang

Abstract: The Nelson-Siegel framework is employed to model the term structure of commodity futures prices. Exploiting the information embedded in the level, slope and curvature parameters, we develop novel investment strategies that assume short-term continuation of recent parallel, slope or butterfly movements of futures curves. Systematic strategies based on the change in the slope generate significant pr… ▽ More The Nelson-Siegel framework is employed to model the term structure of commodity futures prices. Exploiting the information embedded in the level, slope and curvature parameters, we develop novel investment strategies that assume short-term continuation of recent parallel, slope or butterfly movements of futures curves. Systematic strategies based on the change in the slope generate significant profits that are unrelated to previously documented risk factors and can survive reasonable transaction costs. Further analysis demonstrates that the profitability of the slope strategy increases with investor sentiment and is in part a compensation for the drawdowns incurred during economic slowdowns. The profitability can also be magnified through timing and persists under alternative specifications of the Nelson-Siegel model. △ Less

Submitted 1 August, 2023; originally announced August 2023.

Journal ref: Journal of Banking and Finance, 2023, 154 (106965)

arXiv:2307.16803 [pdf, other]

DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation

Authors: Yue Zhang, Hehe Fan, Yi Yang, Mohan Kankanhalli

Abstract: In this technical report, we present our findings from the research conducted on the Human-Object Interaction 4D (HOI4D) dataset for egocentric action segmentation task. As a relatively novel research area, point cloud video methods might not be good at temporal modeling, especially for long point cloud videos (\eg, 150 frames). In contrast, traditional video understanding methods have been well d… ▽ More In this technical report, we present our findings from the research conducted on the Human-Object Interaction 4D (HOI4D) dataset for egocentric action segmentation task. As a relatively novel research area, point cloud video methods might not be good at temporal modeling, especially for long point cloud videos (\eg, 150 frames). In contrast, traditional video understanding methods have been well developed. Their effectiveness on temporal modeling has been widely verified on many large scale video datasets. Therefore, we convert point cloud videos into depth videos and employ traditional video modeling methods to improve 4D action segmentation. By ensembling depth and point cloud video methods, the accuracy is significantly improved. The proposed method, named Mixture of Depth and Point cloud video experts (DPMix), achieved the first place in the 4D Action Segmentation Track of the HOI4D Challenge 2023. △ Less

Submitted 31 July, 2023; originally announced July 2023.

arXiv:2307.16146 [pdf, other]

Theory of expansion and compression of polymeric materials

Authors: P. M. Biesheuvel, H. Fan, M. Elimelech

Abstract: We extend classical Flory-Rehner theory for the expansion and compression of porous materials such as cross-linked polymer networks. The theory includes volume exclusion, affinity with the solvent, and finite stretching of the polymer chains. We also modify this equilibrium theory -- that applies to equal expansion of a material in all directions -- to the situation that a material can only expand… ▽ More We extend classical Flory-Rehner theory for the expansion and compression of porous materials such as cross-linked polymer networks. The theory includes volume exclusion, affinity with the solvent, and finite stretching of the polymer chains. We also modify this equilibrium theory -- that applies to equal expansion of a material in all directions -- to the situation that a material can only expand in a single direction, as is the case when a thin layer is tightly bound to a support structure. We extend this equilibrium model to the case that a pressure is applied across such a thin layer of the polymer material, for instance a membrane, and liquid flows across this layer. The theory describes how in the direction of liquid flow the membrane is increasingly compacted (becomes less porous), and the more so at higher applied pressures. We provide results of example calculations for a thick membrane with significant changes in compaction across its thickness, and a thin membrane for which compaction due to flow is minor. In the last section we model the dynamics of the change of size of a porous material in time after a step change in the solvent-polymer attraction parameter. △ Less

Submitted 27 September, 2023; v1 submitted 30 July, 2023; originally announced July 2023.

arXiv:2307.15894 [pdf, ps, other]

doi 10.1103/PhysRevLett.132.081904

Determination of the $Σ^{+}$ Timelike Electromagnetic Form Factors

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

Abstract: Based on data samples collected with the BESIII detector at the BEPCII collider, the process $e^{+}e^{-} \to Σ^{+}\barΣ^{-}$ is studied at center-of-mass energies $\sqrt{s}$ = 2.3960, 2.6454, and 2.9000 GeV. Using a fully differential angular description of the final state particles, both the relative magnitude and phase information of the $Σ^{+}$ electromagnetic form factors in the timelike regio… ▽ More Based on data samples collected with the BESIII detector at the BEPCII collider, the process $e^{+}e^{-} \to Σ^{+}\barΣ^{-}$ is studied at center-of-mass energies $\sqrt{s}$ = 2.3960, 2.6454, and 2.9000 GeV. Using a fully differential angular description of the final state particles, both the relative magnitude and phase information of the $Σ^{+}$ electromagnetic form factors in the timelike region are extracted. The relative phase between the electric and magnetic form factors is determined to be $\sinΔΦ$ = -0.67~$\pm$~0.29~(stat)~$\pm$~0.18~(syst) at $\sqrt{s}$ = 2.3960 GeV, $ΔΦ$ = 55$^{\circ}$~$\pm$~19$^{\circ}$~(stat) $\pm$~14$^{\circ}$~(syst) at $\sqrt{s}$ = 2.6454 GeV, and 78$^{\circ}$~$\pm$~22$^{\circ}$~(stat) $\pm$~9$^{\circ}$~(syst) at $\sqrt{s}$ = 2.9000 GeV. For the first time, the phase of the hyperon electromagnetic form factors is explored in a wide range of four-momentum transfer. The evolution of the phase along with four-momentum transfer is an important input for understanding its asymptotic behavior and the dynamics of baryons. △ Less

Submitted 5 March, 2024; v1 submitted 29 July, 2023; originally announced July 2023.

Journal ref: Phys. Rev. Lett. 132, 081904 (2024)

arXiv:2307.15353 [pdf, other]

Supervised Homography Learning with Realistic Dataset Generation

Authors: Hai Jiang, Haipeng Li, Songchen Han, Haoqiang Fan, Bing Zeng, Shuaicheng Liu

Abstract: In this paper, we propose an iterative framework, which consists of two phases: a generation phase and a training phase, to generate realistic training data and yield a supervised homography network. In the generation phase, given an unlabeled image pair, we utilize the pre-estimated dominant plane masks and homography of the pair, along with another sampled homography that serves as ground truth… ▽ More In this paper, we propose an iterative framework, which consists of two phases: a generation phase and a training phase, to generate realistic training data and yield a supervised homography network. In the generation phase, given an unlabeled image pair, we utilize the pre-estimated dominant plane masks and homography of the pair, along with another sampled homography that serves as ground truth to generate a new labeled training pair with realistic motion. In the training phase, the generated data is used to train the supervised homography network, in which the training data is refined via a content consistency module and a quality assessment module. Once an iteration is finished, the trained network is used in the next data generation phase to update the pre-estimated homography. Through such an iterative strategy, the quality of the dataset and the performance of the network can be gradually and simultaneously improved. Experimental results show that our method achieves state-of-the-art performance and existing supervised methods can be also improved based on the generated dataset. Code and dataset are available at https://github.com/JianghaiSCU/RealSH. △ Less

Submitted 15 August, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

Comments: Accepted by ICCV 2023

arXiv:2307.13250 [pdf, other]

Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering

Authors: Yi Cheng, Hehe Fan, Dongyun Lin, Ying Sun, Mohan Kankanhalli, Joo-Hwee Lim

Abstract: The main challenge in video question answering (VideoQA) is to capture and understand the complex spatial and temporal relations between objects based on given questions. Existing graph-based methods for VideoQA usually ignore keywords in questions and employ a simple graph to aggregate features without considering relative relations between objects, which may lead to inferior performance. In this… ▽ More The main challenge in video question answering (VideoQA) is to capture and understand the complex spatial and temporal relations between objects based on given questions. Existing graph-based methods for VideoQA usually ignore keywords in questions and employ a simple graph to aggregate features without considering relative relations between objects, which may lead to inferior performance. In this paper, we propose a Keyword-aware Relative Spatio-Temporal (KRST) graph network for VideoQA. First, to make question features aware of keywords, we employ an attention mechanism to assign high weights to keywords during question encoding. The keyword-aware question features are then used to guide video graph construction. Second, because relations are relative, we integrate the relative relation modeling to better capture the spatio-temporal dynamics among object nodes. Moreover, we disentangle the spatio-temporal reasoning into an object-level spatial graph and a frame-level temporal graph, which reduces the impact of spatial and temporal relation reasoning on each other. Extensive experiments on the TGIF-QA, MSVD-QA and MSRVTT-QA datasets demonstrate the superiority of our KRST over multiple state-of-the-art methods. △ Less

Submitted 25 July, 2023; originally announced July 2023.

Comments: under review

arXiv:2307.12736 [pdf, other]

Measurement of $e^{+}e^{-}\toφη'$ cross sections at center-of-mass energies from 3.508 to 4.951 GeV and search for the decay $ψ(3770)\toφη'$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (600 additional authors not shown)

Abstract: The cross sections of the $e^{+}e^{-}\toφη'$ process at center-of-mass energies from 3.508 to 4.951 GeV are measured with high precision using 26.1 fb$^{-1}$ data collected with the BESIII detector operating at the BEPCII storage ring. The cross sections are of the order of a few picobarn, and decrease as the center-of-mass energy increases as $s^{-n/2}$ with $n=4.35\pm 0.14$. This result is in ag… ▽ More The cross sections of the $e^{+}e^{-}\toφη'$ process at center-of-mass energies from 3.508 to 4.951 GeV are measured with high precision using 26.1 fb$^{-1}$ data collected with the BESIII detector operating at the BEPCII storage ring. The cross sections are of the order of a few picobarn, and decrease as the center-of-mass energy increases as $s^{-n/2}$ with $n=4.35\pm 0.14$. This result is in agreement with the Nambu-Jona-Lasinio model prediction of $n=3.5\pm 0.9$. In addition, the charmless decay $ψ(3770)\toφη'$ is searched for by fitting the measured cross sections, yet no significant signal is observed. The upper limit of ${\cal B}(ψ(3770)\toφη')$ at the 90\% confidence level is determined to be $2.3\times 10^{-5}$. △ Less

Submitted 11 September, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

arXiv:2307.10948 [pdf, ps, other]

doi 10.1103/PhysRevLett.132.191902

First Observation of a Three-Resonance Structure in $e^+e^-\rightarrow$Nonopen Charm Hadrons

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

Abstract: We report the measurement of the inclusive cross sections for $e^+e^-$$\rightarrow$nOCH (where nOCH denotes non-open charm hadrons) with improved precision at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe three resonances: $\mathcal R(3760)$, $\mathcal R(3780)$, and $\mathcal R(3810)$ with significances of $8.1σ$, $13.7σ$, and $8.8σ$, respectively. The $\mathcal R(3810)$ state… ▽ More We report the measurement of the inclusive cross sections for $e^+e^-$$\rightarrow$nOCH (where nOCH denotes non-open charm hadrons) with improved precision at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe three resonances: $\mathcal R(3760)$, $\mathcal R(3780)$, and $\mathcal R(3810)$ with significances of $8.1σ$, $13.7σ$, and $8.8σ$, respectively. The $\mathcal R(3810)$ state is observed for the first time, while the $\mathcal R(3760)$ and $\mathcal R(3780)$ states are observed for the first time in the nOCH cross sections. Two sets of resonance parameters describe the energy-dependent line shape of the cross sections well. In set I [set II], the $\mathcal R(3810)$ state has mass $(3805.7 \pm 1.1 \pm 2.7)$ [$(3805.7 \pm 1.1 \pm 2.7)$] MeV/$c^2$, total width $(11.6 \pm 2.9 \pm 1.9)$ [$(11.5 \pm 2.8 \pm 1.9)$] MeV, and an electronic width multiplied by the nOCH decay branching fraction of $(10.9\pm 3.8\pm 2.5)$ [$(11.0\pm 3.4\pm 2.5)$] eV. In addition, we measure the branching fractions ${\mathcal B}[{\mathcal R}(3760)$$\rightarrow$nOCH$]=(25.2 \pm 16.1 \pm 30.4)\% [(6.4 \pm 4.8 \pm 7.7)\%]$ and ${\mathcal B}[\mathcal R(3780)$$\rightarrow$nOCH$]=(12.3 \pm 6.6 \pm 8.3)\% [(10.4 \pm 4.8 \pm 7.0)\%]$ for the first time. The $\mathcal R(3760)$ state can be interpreted as an open-charm (OC) molecular state, but containing a simple four-quark state component. The $\mathcal R(3810)$ state can be interpreted as a hadrocharmonium state. △ Less

Submitted 11 May, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

Journal ref: Physical Review Letters 132, 191902 (2024)

arXiv:2307.10046 [pdf, other]

Divert More Attention to Vision-Language Object Tracking

Authors: Mingzhe Guo, Zhipeng Zhang, Li** **g, Haibin Ling, Heng Fan

Abstract: Multimodal vision-language (VL) learning has noticeably pushed the tendency toward generic intelligence owing to emerging large foundation models. However, tracking, as a fundamental vision problem, surprisingly enjoys less bonus from recent flourishing VL learning. We argue that the reasons are two-fold: the lack of large-scale vision-language annotated videos and ineffective vision-language inte… ▽ More Multimodal vision-language (VL) learning has noticeably pushed the tendency toward generic intelligence owing to emerging large foundation models. However, tracking, as a fundamental vision problem, surprisingly enjoys less bonus from recent flourishing VL learning. We argue that the reasons are two-fold: the lack of large-scale vision-language annotated videos and ineffective vision-language interaction learning of current works. These nuisances motivate us to design more effective vision-language representation for tracking, meanwhile constructing a large database with language annotation for model learning. Particularly, in this paper, we first propose a general attribute annotation strategy to decorate videos in six popular tracking benchmarks, which contributes a large-scale vision-language tracking database with more than 23,000 videos. We then introduce a novel framework to improve tracking by learning a unified-adaptive VL representation, where the cores are the proposed asymmetric architecture search and modality mixer (ModaMixer). To further improve VL representation, we introduce a contrastive loss to align different modalities. To thoroughly evidence the effectiveness of our method, we integrate the proposed framework on three tracking methods with different designs, i.e., the CNN-based SiamCAR, the Transformer-based OSTrack, and the hybrid structure TransT. The experiments demonstrate that our framework can significantly improve all baselines on six benchmarks. Besides empirical results, we theoretically analyze our approach to show its rationality. By revealing the potential of VL representation, we expect the community to divert more attention to VL tracking and hope to open more possibilities for future tracking with diversified multimodal messages. △ Less

Submitted 19 July, 2023; originally announced July 2023.

Comments: 16 pages, 9 figures

arXiv:2307.09866 [pdf, other]

Detecting Vulnerable Nodes in Urban Infrastructure Interdependent Network

Authors: **zhu Mao, Liu Cao, Chen Gao, Huandong Wang, Hangyu Fan, Depeng **, Yong Li

Abstract: Understanding and characterizing the vulnerability of urban infrastructures, which refers to the engineering facilities essential for the regular running of cities and that exist naturally in the form of networks, is of great value to us. Potential applications include protecting fragile facilities and designing robust topologies, etc. Due to the strong correlation between different topological ch… ▽ More Understanding and characterizing the vulnerability of urban infrastructures, which refers to the engineering facilities essential for the regular running of cities and that exist naturally in the form of networks, is of great value to us. Potential applications include protecting fragile facilities and designing robust topologies, etc. Due to the strong correlation between different topological characteristics and infrastructure vulnerability and their complicated evolution mechanisms, some heuristic and machine-assisted analysis fall short in addressing such a scenario. In this paper, we model the interdependent network as a heterogeneous graph and propose a system based on graph neural network with reinforcement learning, which can be trained on real-world data, to characterize the vulnerability of the city system accurately. The presented system leverages deep learning techniques to understand and analyze the heterogeneous graph, which enables us to capture the risk of cascade failure and discover vulnerable infrastructures of cities. Extensive experiments with various requests demonstrate not only the expressive power of our system but also transferring ability and necessity of the specific components. △ Less

Submitted 1 August, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

arXiv:2307.09729 [pdf, other]

NTIRE 2023 Quality Assessment of Video Enhancement Challenge

Authors: Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Wei Wu, Shuming Hu, Sibin Deng, Pengxiang Xiao, Ying Chen, Kai Li, Kai Zhao, Kun Yuan, Ming Sun, Heng Cong, Hao Wang, Lingzhi Fu , et al. (47 additional authors not shown)

Abstract: This paper reports on the NTIRE 2023 Quality Assessment of Video Enhancement Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2023. This challenge is to address a major challenge in the field of video processing, namely, video quality assessment (VQA) for enhanced videos. The challenge uses the VQA Dataset for Perceptual… ▽ More This paper reports on the NTIRE 2023 Quality Assessment of Video Enhancement Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2023. This challenge is to address a major challenge in the field of video processing, namely, video quality assessment (VQA) for enhanced videos. The challenge uses the VQA Dataset for Perceptual Video Enhancement (VDPVE), which has a total of 1211 enhanced videos, including 600 videos with color, brightness, and contrast enhancements, 310 videos with deblurring, and 301 deshaked videos. The challenge has a total of 167 registered participants. 61 participating teams submitted their prediction results during the development phase, with a total of 3168 submissions. A total of 176 submissions were submitted by 37 participating teams during the final testing phase. Finally, 19 participating teams submitted their models and fact sheets, and detailed the methods they used. Some methods have achieved better results than baseline methods, and the winning methods have demonstrated superior prediction performance. △ Less

Submitted 18 July, 2023; originally announced July 2023.

arXiv:2307.09266 [pdf, other]

doi 10.1007/JHEP11(2023)137

Measurement of the branching fractions of the singly Cabibbo-suppressed decays $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, J. Bloms, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

Abstract: Based on 4.5 $\mbox{fb$^{-1}$}$ $e^{+}e^{-}$ collision data collected with BESIII detector at seven energy points between 4.600 and 4.699 GeV, the branching fractions for $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ were measured by means of single-tag method. The branching fractions of $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ are determined to be… ▽ More Based on 4.5 $\mbox{fb$^{-1}$}$ $e^{+}e^{-}$ collision data collected with BESIII detector at seven energy points between 4.600 and 4.699 GeV, the branching fractions for $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ were measured by means of single-tag method. The branching fractions of $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ are determined to be $(1.57\pm0.11_{\rm {stat}}\pm0.04_{\rm{syst}})\times10^{-3}$ and $(1.11\pm0.20_{\rm{stat}}\pm0.07_{\rm{syst}})\times10^{-3}$, with a statistical significance of greater than 10 $σ$ and 5.7 $σ$, respectively. These results are consistent with the previous measurements by BESIII, LHCb and Belle, and the result of $Λ_{c}^{+}\to pη$ is the most precise to date. △ Less

Submitted 17 October, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

Comments: 24 pages, 4 figures

Journal ref: J. High Energ. Phys. 2023, 137 (2023)

arXiv:2307.08629 [pdf, other]

Deficiency-Aware Masked Transformer for Video Inpainting

Authors: Yongsheng Yu, Heng Fan, Libo Zhang

Abstract: Recent video inpainting methods have made remarkable progress by utilizing explicit guidance, such as optical flow, to propagate cross-frame pixels. However, there are cases where cross-frame recurrence of the masked video is not available, resulting in a deficiency. In such situation, instead of borrowing pixels from other frames, the focus of the model shifts towards addressing the inverse probl… ▽ More Recent video inpainting methods have made remarkable progress by utilizing explicit guidance, such as optical flow, to propagate cross-frame pixels. However, there are cases where cross-frame recurrence of the masked video is not available, resulting in a deficiency. In such situation, instead of borrowing pixels from other frames, the focus of the model shifts towards addressing the inverse problem. In this paper, we introduce a dual-modality-compatible inpainting framework called Deficiency-aware Masked Transformer (DMT), which offers three key advantages. Firstly, we pretrain a image inpainting model DMT_img serve as a prior for distilling the video model DMT_vid, thereby benefiting the hallucination of deficiency cases. Secondly, the self-attention module selectively incorporates spatiotemporal tokens to accelerate inference and remove noise signals. Thirdly, a simple yet effective Receptive Field Contextualizer is integrated into DMT, further improving performance. Extensive experiments conducted on YouTube-VOS and DAVIS datasets demonstrate that DMT_vid significantly outperforms previous solutions. The code and video demonstrations can be found at github.com/yeates/DMT. △ Less

Submitted 17 July, 2023; originally announced July 2023.

arXiv:2307.08231 [pdf, other]

doi 10.1093/mnras/stad3296

Observing white dwarf tidal strip** with TianQin gravitational wave observatory

Authors: Chang-Qing Ye, **-Hong Chen, Jian-dong Zhang, Hui-Min Fan, Yi-Ming Hu

Abstract: Recently discovered regular X-ray bursts known as quasi-periodic eruptions have a proposed model that suggests a tidal strip** white dwarf inspiralling into the galaxy's central black hole on an eccentric orbit. According to this model, the interaction of the strip** white dwarf with the central black hole would emit gravitational wave signals as well, their detection can help explore the form… ▽ More Recently discovered regular X-ray bursts known as quasi-periodic eruptions have a proposed model that suggests a tidal strip** white dwarf inspiralling into the galaxy's central black hole on an eccentric orbit. According to this model, the interaction of the strip** white dwarf with the central black hole would emit gravitational wave signals as well, their detection can help explore the formation mechanism of quasi-periodic eruptions and facilitate multi-messenger observations. In this paper, we aim to perform a preliminary study of the gravitation wave observation of TianQin on this strip** white dwarf model. We investigated the horizon distance of TianQin on this type of gravitation wave signal and found it can be set to 200Mpc. We also find that those strip** white dwarf model sources with central black hole mass within $10^4\sim10^{5.5}M_\odot$ are more likely to be detected by TianQin. We assessed the parameter estimation precision of TianQin on those strip** white dwarf model sources. Our result shows that, even in the worst case, TianQin can determine the central black hole mass, the white dwarf mass, the central black hole spin, and the orbital initial eccentricity with a precision of $10^{-2}$. In the optimistic case, TianQin can determine the central black hole mass and the white dwarf mass with a precision of $10^{-7}$, determine the central black hole spin with a precision of $10^{-5}$, and determine the orbital initial eccentricity with a precision of $10^{-8}$. Moreover, TianQin can determine the luminosity distance with a precision of $10^{-1}$ and determine the sky localization with a precision of $10^{-2}\sim10$ $\rm deg^2$. △ Less

Submitted 29 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

Comments: 9 pages, 4 figures

Journal ref: MNRAS,stad3296,2023

arXiv:2307.07316 [pdf, other]

doi 10.1103/PhysRevLett.131.191901

Measurement of the Energy-Dependent Electromagnetic Form Factors of a Charmed Baryon

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (598 additional authors not shown)

Abstract: We study the process $e^{+}e^{-}\toΛ_{c}^{+}\barΛ_c^{-}$ at twelve center-of-mass energies from $4.6119$ to $4.9509~\mathrm{GeV}$ using data samples collected by the BESIII detector at the BEPCII collider. The Born cross sections and effective form factors ($|G_{\mathrm{eff}}|$) are determined with unprecedented precision after combining the single and double-tag methods based on the decay process… ▽ More We study the process $e^{+}e^{-}\toΛ_{c}^{+}\barΛ_c^{-}$ at twelve center-of-mass energies from $4.6119$ to $4.9509~\mathrm{GeV}$ using data samples collected by the BESIII detector at the BEPCII collider. The Born cross sections and effective form factors ($|G_{\mathrm{eff}}|$) are determined with unprecedented precision after combining the single and double-tag methods based on the decay process $Λ_{c}^{+}\to pK^{-}π^{+}$. Flat cross sections around $4.63~\mathrm{GeV}$ are obtained and no indication of the resonant structure $Y(4630)$, as reported by Belle, is found. In addition, no oscillatory behavior is discerned in the $|G_{\mathrm{eff}}|$ energy-dependence of $Λ_{c}^{+}$, in contrast to what is seen for the proton and neutron cases. Analyzing the cross section together with the polar-angle distribution of the $Λ_{c}^{+}$ baryon at each energy point, the moduli of electric and magnetic form factors ($|G_{E}|$ and $|G_{M}|$) are extracted and separated. For the first time, the energy-dependence of the form factor ratio $|G_{E}/G_{M}|$ is observed, which can be well described by an oscillatory function. △ Less

Submitted 14 July, 2023; originally announced July 2023.

Comments: 10 pages, 3 figures

Journal ref: Phys. Rev. Lett. 131, 191901 (2023)

arXiv:2307.06569 [pdf, other]

A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023

Authors: Yi Cheng, Ziwei Xu, Fen Fang, Dongyun Lin, Hehe Fan, Yongkang Wong, Ying Sun, Mohan Kankanhalli

Abstract: In this technical report, we present our findings from a study conducted on the EPIC-KITCHENS-100 Unsupervised Domain Adaptation task for Action Recognition. Our research focuses on the innovative application of a differentiable logic loss in the training to leverage the co-occurrence relations between verb and noun, as well as the pre-trained Large Language Models (LLMs) to generate the logic rul… ▽ More In this technical report, we present our findings from a study conducted on the EPIC-KITCHENS-100 Unsupervised Domain Adaptation task for Action Recognition. Our research focuses on the innovative application of a differentiable logic loss in the training to leverage the co-occurrence relations between verb and noun, as well as the pre-trained Large Language Models (LLMs) to generate the logic rules for the adaptation to unseen action labels. Specifically, the model's predictions are treated as the truth assignment of a co-occurrence logic formula to compute the logic loss, which measures the consistency between the predictions and the logic constraints. By using the verb-noun co-occurrence matrix generated from the dataset, we observe a moderate improvement in model performance compared to our baseline framework. To further enhance the model's adaptability to novel action labels, we experiment with rules generated using GPT-3.5, which leads to a slight decrease in performance. These findings shed light on the potential and challenges of incorporating differentiable logic and LLMs for knowledge extraction in unsupervised domain adaptation for action recognition. Our final submission (entitled `NS-LLM') achieved the first place in terms of top-1 action recognition accuracy. △ Less

Submitted 13 July, 2023; originally announced July 2023.

Comments: Technical report submitted to CVPR 2023 EPIC-Kitchens challenges

arXiv:2307.04455 [pdf, other]

SAM-IQA: Can Segment Anything Boost Image Quality Assessment?

Authors: Xinpeng Li, Ting Jiang, Haoqiang Fan, Shuaicheng Liu

Abstract: Image Quality Assessment (IQA) is a challenging task that requires training on massive datasets to achieve accurate predictions. However, due to the lack of IQA data, deep learning-based IQA methods typically rely on pre-trained networks trained on massive datasets as feature extractors to enhance their generalization ability, such as the ResNet network trained on ImageNet. In this paper, we utili… ▽ More Image Quality Assessment (IQA) is a challenging task that requires training on massive datasets to achieve accurate predictions. However, due to the lack of IQA data, deep learning-based IQA methods typically rely on pre-trained networks trained on massive datasets as feature extractors to enhance their generalization ability, such as the ResNet network trained on ImageNet. In this paper, we utilize the encoder of Segment Anything, a recently proposed segmentation model trained on a massive dataset, for high-level semantic feature extraction. Most IQA methods are limited to extracting spatial-domain features, while frequency-domain features have been shown to better represent noise and blur. Therefore, we leverage both spatial-domain and frequency-domain features by applying Fourier and standard convolutions on the extracted features, respectively. Extensive experiments are conducted to demonstrate the effectiveness of all the proposed components, and results show that our approach outperforms the state-of-the-art (SOTA) in four representative datasets, both qualitatively and quantitatively. Our experiments confirm the powerful feature extraction capabilities of Segment Anything and highlight the value of combining spatial-domain and frequency-domain features in IQA tasks. Code: https://github.com/Hedlen/SAM-IQA △ Less

Submitted 10 July, 2023; originally announced July 2023.

arXiv:2307.02574 [pdf, other]

Semi-supervised Learning from Street-View Images and OpenStreetMap for Automatic Building Height Estimation

Authors: Hao Li, Zhendong Yuan, Gabriel Dax, Gefei Kong, Hongchao Fan, Alexander Zipf, Martin Werner

Abstract: Accurate building height estimation is key to the automatic derivation of 3D city models from emerging big geospatial data, including Volunteered Geographical Information (VGI). However, an automatic solution for large-scale building height estimation based on low-cost VGI data is currently missing. The fast development of VGI data platforms, especially OpenStreetMap (OSM) and crowdsourced street-… ▽ More Accurate building height estimation is key to the automatic derivation of 3D city models from emerging big geospatial data, including Volunteered Geographical Information (VGI). However, an automatic solution for large-scale building height estimation based on low-cost VGI data is currently missing. The fast development of VGI data platforms, especially OpenStreetMap (OSM) and crowdsourced street-view images (SVI), offers a stimulating opportunity to fill this research gap. In this work, we propose a semi-supervised learning (SSL) method of automatically estimating building height from Mapillary SVI and OSM data to generate low-cost and open-source 3D city modeling in LoD1. The proposed method consists of three parts: first, we propose an SSL schema with the option of setting a different ratio of "pseudo label" during the supervised regression; second, we extract multi-level morphometric features from OSM data (i.e., buildings and streets) for the purposed of inferring building height; last, we design a building floor estimation workflow with a pre-trained facade object detection network to generate "pseudo label" from SVI and assign it to the corresponding OSM building footprint. In a case study, we validate the proposed SSL method in the city of Heidelberg, Germany and evaluate the model performance against the reference data of building heights. Based on three different regression models, namely Random Forest (RF), Support Vector Machine (SVM), and Convolutional Neural Network (CNN), the SSL method leads to a clear performance boosting in estimating building heights with a Mean Absolute Error (MAE) around 2.1 meters, which is competitive to state-of-the-art approaches. The preliminary result is promising and motivates our future work in scaling up the proposed method based on low-cost VGI data, with possibilities in even regions and areas with diverse data quality and availability. △ Less

Submitted 5 July, 2023; originally announced July 2023.

Comments: Accepted for GIScience 2023

arXiv:2307.00252 [pdf, other]

An ML approach to resolution of singularities

Authors: Gergely Bérczi, Honglu Fan, Mingcong Zeng

Abstract: The solution set of a system of polynomial equations typically contains ill-behaved, singular points. Resolution is a fundamental process in geometry in which we replace singular points with smooth points, while kee** the rest of the solution set unchanged. Resolutions are not unique: the usual way to describe them involves repeatedly performing a fundamental operation known as "blowing-up", and… ▽ More The solution set of a system of polynomial equations typically contains ill-behaved, singular points. Resolution is a fundamental process in geometry in which we replace singular points with smooth points, while kee** the rest of the solution set unchanged. Resolutions are not unique: the usual way to describe them involves repeatedly performing a fundamental operation known as "blowing-up", and the complexity of the resolution highly depends on certain choices. The process can be translated into various versions of a 2-player game, the so-called Hironaka game, and a winning strategy for the first player provides a solution to the resolution problem. In this paper we introduce a new approach to the Hironaka game that uses reinforcement learning agents to find optimal resolutions of singularities. In certain domains, the trained model outperforms state-of-the-art selection heuristics in total number of polynomial additions performed, which provides a proof-of-concept that recent developments in machine learning have the potential to improve performance of algorithms in symbolic computation. △ Less

Submitted 22 August, 2023; v1 submitted 1 July, 2023; originally announced July 2023.

Comments: To appear in Proceedings of the 40th International Conference on Machine Learning TAG Workshop (ICML-TAG 2023)

arXiv:2306.17806 [pdf, other]

Stay on topic with Classifier-Free Guidance

Authors: Guillaume Sanchez, Honglu Fan, Alexander Spangher, Elad Levi, Pawan Sasanka Ammanamanchi, Stella Biderman

Abstract: Classifier-Free Guidance (CFG) has recently emerged in text-to-image generation as a lightweight technique to encourage prompt-adherence in generations. In this work, we demonstrate that CFG can be used broadly as an inference-time technique in pure language modeling. We show that CFG (1) improves the performance of Pythia, GPT-2 and LLaMA-family models across an array of tasks: Q\&A, reasoning, c… ▽ More Classifier-Free Guidance (CFG) has recently emerged in text-to-image generation as a lightweight technique to encourage prompt-adherence in generations. In this work, we demonstrate that CFG can be used broadly as an inference-time technique in pure language modeling. We show that CFG (1) improves the performance of Pythia, GPT-2 and LLaMA-family models across an array of tasks: Q\&A, reasoning, code generation, and machine translation, achieving SOTA on LAMBADA with LLaMA-7B over PaLM-540B; (2) brings improvements equivalent to a model with twice the parameter-count; (3) can stack alongside other inference-time methods like Chain-of-Thought and Self-Consistency, yielding further improvements in difficult tasks; (4) can be used to increase the faithfulness and coherence of assistants in challenging form-driven and content-driven prompts: in a human evaluation we show a 75\% preference for GPT4All using CFG over baseline. △ Less

Submitted 30 June, 2023; originally announced June 2023.

arXiv:2306.16194 [pdf, other]

doi 10.1103/PhysRevResearch.5.043285

Variational generation of spin squeezing on one-dimensional quantum devices with nearest-neighbor interactions

Authors: Zheng-Hang Sun, Yong-Yi Wang, Yu-Ran Zhang, Franco Nori, Heng Fan

Abstract: Efficient preparation of spin-squeezed states is important for quantum-enhanced metrology. Current protocols for generating strong spin squeezing rely on either high dimensionality or long-range interactions. A key challenge is how to generate considerable spin squeezing in one-dimensional systems with only nearest-neighbor interactions. Here, we develop variational spin-squeezing algorithms to so… ▽ More Efficient preparation of spin-squeezed states is important for quantum-enhanced metrology. Current protocols for generating strong spin squeezing rely on either high dimensionality or long-range interactions. A key challenge is how to generate considerable spin squeezing in one-dimensional systems with only nearest-neighbor interactions. Here, we develop variational spin-squeezing algorithms to solve this problem. We consider both digital and analog quantum circuits for these variational algorithms. After the closed optimization loop of the variational spin-squeezing algorithms, the generated squeezing can be comparable to the strongest squeezing created from two-axis twisting. By analyzing the experimental imperfections, the variational spin-squeezing algorithms proposed in this work are feasible in recent developed noisy intermediate-scale quantum computers. △ Less

Submitted 26 December, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

Journal ref: Phys. Rev. Research 5, 043285 (2023)

arXiv:2306.10417 [pdf, ps, other]

Amending the Lonely Runner Spectrum Conjecture

Authors: Ho Tin Fan, Alec Sun

Abstract: Let $\|x\|$ be the absolute distance from $x$ to the nearest integer. For a set of distinct positive integral speeds $v_1, \ldots, v_n$, we define its maximum loneliness to be $$\text{ML}(v_1,\ldots,v_n) = \max_{t \in \mathbb{R}}\min_{1 \leq i \leq n} \|tv_i\|$$ The Loneliness Spectrum Conjecture, recently proposed by Kravitz, asserts that… ▽ More Let $\|x\|$ be the absolute distance from $x$ to the nearest integer. For a set of distinct positive integral speeds $v_1, \ldots, v_n$, we define its maximum loneliness to be $$\text{ML}(v_1,\ldots,v_n) = \max_{t \in \mathbb{R}}\min_{1 \leq i \leq n} \|tv_i\|$$ The Loneliness Spectrum Conjecture, recently proposed by Kravitz, asserts that $$\exists s \in \mathbb{N}, \text{ML}(v_1,\ldots,v_n) = \frac{s} {sn + 1} \text{ or } \text{ML}(v_1,\ldots,v_n) \ge \frac{1}{n}$$ We disprove the Loneliness Spectrum Conjecture for $n = 4$ and propose an alternative conjecture. We confirm the amended conjecture for $n = 4$ if any pair of speeds share a common factor of at least $3$ and also prove some related results. △ Less

Submitted 17 June, 2023; originally announced June 2023.

Comments: 21 pages

arXiv:2306.05312 [pdf, other]

Tunable Coupling Architectures with Capacitively Connecting Pads for Large-Scale Superconducting Multi-Qubit Processors

Authors: Gui-Han Liang, Xiao-Hui Song, Cheng-Lin Deng, Xu-Yang Gu, Yu Yan, Zheng-Yang Mei, Si-Lu Zhao, Yi-Zhou Bu, Yong-Xi Xiao, Yi-Han Yu, Ming-Chuan Wang, Tong Liu, Yun-Hao Shi, He Zhang, Xiang Li, Li Li, **g-Zhe Wang, Ye Tian, Shi-** Zhao, Kai Xu, Heng Fan, Zhong-Cheng Xiang, Dong-Ning Zheng

Abstract: We have proposed and experimentally verified a tunable inter-qubit coupling scheme for large-scale integration of superconducting qubits. The key feature of the scheme is the insertion of connecting pads between qubit and tunable coupling element. In such a way, the distance between two qubits can be increased considerably to a few millimeters, leaving enough space for arranging control lines, rea… ▽ More We have proposed and experimentally verified a tunable inter-qubit coupling scheme for large-scale integration of superconducting qubits. The key feature of the scheme is the insertion of connecting pads between qubit and tunable coupling element. In such a way, the distance between two qubits can be increased considerably to a few millimeters, leaving enough space for arranging control lines, readout resonators and other necessary structures. The increased inter-qubit distance provides more wiring space for flip-chip process and reduces crosstalk between qubits and from control lines to qubits. We use the term Tunable Coupler with Capacitively Connecting Pad (TCCP) to name the tunable coupling part that consists of a transmon coupler and capacitively connecting pads. With the different placement of connecting pads, different TCCP architectures can be realized. We have designed and fabricated a few multi-qubit devices in which TCCP is used for coupling. The measured results show that the performance of the qubits coupled by the TCCP, such as $T_1$ and $T_2$, was similar to that of the traditional transmon qubits without TCCP. Meanwhile, our TCCP also exhibited a wide tunable range of the effective coupling strength and a low residual ZZ interaction between the qubits by properly tuning the parameters on the design. Finally, we successfully implemented an adiabatic CZ gate with TCCP. Furthermore, by introducing TCCP, we also discuss the realization of the flip-chip process and tunable coupling qubits between different chips. △ Less

Submitted 8 June, 2023; originally announced June 2023.

Comments: Main text: 7 pages, 6 figures

arXiv:2306.00989 [pdf, other]

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Authors: Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer

Abstract: Modern hierarchical vision transformers have added several vision-specific components in the pursuit of supervised classification performance. While these components lead to effective accuracies and attractive FLOP counts, the added complexity actually makes these transformers slower than their vanilla ViT counterparts. In this paper, we argue that this additional bulk is unnecessary. By pretraini… ▽ More Modern hierarchical vision transformers have added several vision-specific components in the pursuit of supervised classification performance. While these components lead to effective accuracies and attractive FLOP counts, the added complexity actually makes these transformers slower than their vanilla ViT counterparts. In this paper, we argue that this additional bulk is unnecessary. By pretraining with a strong visual pretext task (MAE), we can strip out all the bells-and-whistles from a state-of-the-art multi-stage vision transformer without losing accuracy. In the process, we create Hiera, an extremely simple hierarchical vision transformer that is more accurate than previous models while being significantly faster both at inference and during training. We evaluate Hiera on a variety of tasks for image and video recognition. Our code and models are available at https://github.com/facebookresearch/hiera. △ Less

Submitted 1 June, 2023; originally announced June 2023.

Comments: ICML 2023 Oral version. Code+Models: https://github.com/facebookresearch/hiera

arXiv:2306.00306 [pdf, other]

Low-Light Image Enhancement with Wavelet-based Diffusion Models

Authors: Hai Jiang, Ao Luo, Songchen Han, Haoqiang Fan, Shuaicheng Liu

Abstract: Diffusion models have achieved promising results in image restoration tasks, yet suffer from time-consuming, excessive computational resource consumption, and unstable restoration. To address these issues, we propose a robust and efficient Diffusion-based Low-Light image enhancement approach, dubbed DiffLL. Specifically, we present a wavelet-based conditional diffusion model (WCDM) that leverages… ▽ More Diffusion models have achieved promising results in image restoration tasks, yet suffer from time-consuming, excessive computational resource consumption, and unstable restoration. To address these issues, we propose a robust and efficient Diffusion-based Low-Light image enhancement approach, dubbed DiffLL. Specifically, we present a wavelet-based conditional diffusion model (WCDM) that leverages the generative power of diffusion models to produce results with satisfactory perceptual fidelity. Additionally, it also takes advantage of the strengths of wavelet transformation to greatly accelerate inference and reduce computational resource usage without sacrificing information. To avoid chaotic content and diversity, we perform both forward diffusion and denoising in the training phase of WCDM, enabling the model to achieve stable denoising and reduce randomness during inference. Moreover, we further design a high-frequency restoration module (HFRM) that utilizes the vertical and horizontal details of the image to complement the diagonal information for better fine-grained restoration. Extensive experiments on publicly available real-world benchmarks demonstrate that our method outperforms the existing state-of-the-art methods both quantitatively and visually, and it achieves remarkable improvements in efficiency compared to previous diffusion-based methods. In addition, we empirically show that the application for low-light face detection also reveals the latent practical values of our method. Code is available at https://github.com/JianghaiSCU/Diffusion-Low-Light. △ Less

Submitted 25 September, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

Comments: Accepted by Siggraph Aisa 2023 (ACM Transactions on Graphics)

arXiv:2305.17495 [pdf, ps, other]

Quantum collapse and exponential growth of out-of-time-ordered correlator in anisotropic quantum Rabi model

Authors: Shangyun Wang, Songbai Chen, Jiliang **g, Jieci Wang, Heng Fan

Abstract: Quantum chaos is an intriguing topic and has attracting a great deal of interests in quantum mechanics and black hole physics. Recently, the exponential growth of out-of-time-ordered correlator (OTOC) has been proposed to diagnose quantum chaos and verify the correspondence principle. Here, we demonstrate that the exponential growth of the OTOC at early times for the initial states centered both i… ▽ More Quantum chaos is an intriguing topic and has attracting a great deal of interests in quantum mechanics and black hole physics. Recently, the exponential growth of out-of-time-ordered correlator (OTOC) has been proposed to diagnose quantum chaos and verify the correspondence principle. Here, we demonstrate that the exponential growth of the OTOC at early times for the initial states centered both in the chaotic and stable regions of the anisotropic quantum Rabi model. We attribute the exponential growth of the OTOC to quantum collapse which provides a novel mechanism of yielding exponential growth of the OTOC in quantum systems. Moreover, the quantum collapse effect is more obvious for the initial states centered in the chaotic one. Our results show that compared with the OTOC, the linear entanglement entropy and Loschmidt echo seem to be more effective to diagnose the signals of quantum chaos in the anisotropic quantum Rabi model. △ Less

Submitted 31 May, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

Comments: 6 pages, 8 figures

arXiv:2305.17030 [pdf, other]

doi 10.3847/1538-4365/acfd29

The First LHAASO Catalog of Gamma-Ray Sources

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022.… ▽ More We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022. This catalog represents the main result from the most sensitive large coverage gamma-ray survey of the sky above 1 TeV, covering declination from $-$20$^{\circ}$ to 80$^{\circ}$. In total, the catalog contains 90 sources with an extended size smaller than $2^\circ$ and a significance of detection at $> 5σ$. Based on our source association criteria, 32 new TeV sources are proposed in this study. Among the 90 sources, 43 sources are detected with ultra-high energy ($E > 100$ TeV) emission at $> 4σ$ significance level. We provide the position, extension, and spectral characteristics of all the sources in this catalog. △ Less

Submitted 27 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

Comments: 40 pages, 13 figures, 4 tables

Journal ref: The Astrophysical Journal Supplement Series, 271 (2024) 25

arXiv:2305.15778 [pdf, other]

Automatic Root Cause Analysis via Large Language Models for Cloud Incidents

Authors: Yinfang Chen, Huaibing Xie, Minghua Ma, Yu Kang, Xin Gao, Liu Shi, Yunjie Cao, Xuedong Gao, Hao Fan, Ming Wen, Jun Zeng, Supriyo Ghosh, Xuchao Zhang, Chaoyun Zhang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Tianyin Xu

Abstract: Ensuring the reliability and availability of cloud services necessitates efficient root cause analysis (RCA) for cloud incidents. Traditional RCA methods, which rely on manual investigations of data sources such as logs and traces, are often laborious, error-prone, and challenging for on-call engineers. In this paper, we introduce RCACopilot, an innovative on-call system empowered by the large lan… ▽ More Ensuring the reliability and availability of cloud services necessitates efficient root cause analysis (RCA) for cloud incidents. Traditional RCA methods, which rely on manual investigations of data sources such as logs and traces, are often laborious, error-prone, and challenging for on-call engineers. In this paper, we introduce RCACopilot, an innovative on-call system empowered by the large language model for automating RCA of cloud incidents. RCACopilot matches incoming incidents to corresponding incident handlers based on their alert types, aggregates the critical runtime diagnostic information, predicts the incident's root cause category, and provides an explanatory narrative. We evaluate RCACopilot using a real-world dataset consisting of a year's worth of incidents from Microsoft. Our evaluation demonstrates that RCACopilot achieves RCA accuracy up to 0.766. Furthermore, the diagnostic information collection component of RCACopilot has been successfully in use at Microsoft for over four years. △ Less

Submitted 13 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

arXiv:2305.14631 [pdf, other]

Determination of spin and parity of $D^{*}_{(s)}$ mesons

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, J. Bloms, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (598 additional authors not shown)

Abstract: The spin and parity of the charmed mesons $D_{s}^{*+}$, $D^{*0}$ and $D^{*+}$ are determined for the first time to be $J^P=1^{-}$ with significances greater than 10$σ$ over other hypotheses of $2^{+}$ and $3^{-}$, using an $e^+e^-$ collision data sample with an integrated luminosity of 3.19 fb$^{-1}$ collected by the BESIII detector at a center-of-mass energy of 4.178 GeV. Different spin-parity hy… ▽ More The spin and parity of the charmed mesons $D_{s}^{*+}$, $D^{*0}$ and $D^{*+}$ are determined for the first time to be $J^P=1^{-}$ with significances greater than 10$σ$ over other hypotheses of $2^{+}$ and $3^{-}$, using an $e^+e^-$ collision data sample with an integrated luminosity of 3.19 fb$^{-1}$ collected by the BESIII detector at a center-of-mass energy of 4.178 GeV. Different spin-parity hypotheses for $D_{s}^{*+}$, $D^{*0}$, and $D^{*+}$ mesons are tested via a helicity amplitude analysis of the processes $e^+e^-\to D^{*+}_{s}D^{-}_{s}$, $D^{*0}D^{0}$ and $D^{*+}D^{-}$, with $D^{*+}_{s}\to D^{+}_{s} γ$, $D^{*0}\to D^{0}π^{0}$, and $D^{*+}\to D^{+}π^{0}$. The results confirm the quark model predictions. △ Less

Submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.14374 [pdf, other]

Inferring Attracting Basins of Power System with Machine Learning

Authors: Yao Du, Qing Li, Huawei Fan, Meng Zhan, **ghua Xiao, Xingang Wang

Abstract: Power systems dominated by renewable energy encounter frequently large, random disturbances, and a critical challenge faced in power-system management is how to anticipate accurately whether the perturbed systems will return to the functional state after the transient or collapse. Whereas model-based studies show that the key to addressing the challenge lies in the attracting basins of the functio… ▽ More Power systems dominated by renewable energy encounter frequently large, random disturbances, and a critical challenge faced in power-system management is how to anticipate accurately whether the perturbed systems will return to the functional state after the transient or collapse. Whereas model-based studies show that the key to addressing the challenge lies in the attracting basins of the functional and dysfunctional states in the phase space, the finding of the attracting basins for realistic power systems remains a challenge, as accurate models describing the system dynamics are generally unavailable. Here we propose a new machine learning technique, namely balanced reservoir computing, to infer the attracting basins of a typical power system based on measured data. Specifically, trained by the time series of a handful of perturbation events, we demonstrate that the trained machine can predict accurately whether the system will return to the functional state in response to a large, random perturbation, thereby reconstructing the attracting basin of the functional state. The working mechanism of the new machine is analyzed, and it is revealed that the success of the new machine is attributed to the good balance between the echo and fading properties of the reservoir network; the effect of noisy signals on the prediction performance is also investigated, and a stochastic-resonance-like phenomenon is observed. Finally, we demonstrate that the new technique can be also utilized to infer the attracting basins of coexisting attractors in typical chaotic systems. △ Less

Submitted 20 May, 2023; originally announced May 2023.

Comments: 13 pages, 7 figures

arXiv:2305.14022 [pdf, other]

Realistic Noise Synthesis with Diffusion Models

Authors: Qi Wu, Mingyan Han, Ting Jiang, Haoqiang Fan, Bing Zeng, Shuaicheng Liu

Abstract: Deep image denoising models often rely on large amount of training data for the high quality performance. However, it is challenging to obtain sufficient amount of data under real-world scenarios for the supervised training. As such, synthesizing realistic noise becomes an important solution. However, existing techniques have limitations in modeling complex noise distributions, resulting in residu… ▽ More Deep image denoising models often rely on large amount of training data for the high quality performance. However, it is challenging to obtain sufficient amount of data under real-world scenarios for the supervised training. As such, synthesizing realistic noise becomes an important solution. However, existing techniques have limitations in modeling complex noise distributions, resulting in residual noise and edge artifacts in denoising methods relying on synthetic data. To overcome these challenges, we propose a novel method that synthesizes realistic noise using diffusion models, namely Realistic Noise Synthesize Diffusor (RNSD). In particular, the proposed time-aware controlling module can simulate various environmental conditions under given camera settings. RNSD can incorporate guided multiscale content, such that more realistic noise with spatial correlations can be generated at multiple frequencies. In addition, we construct an inversion mechanism to predict the unknown camera setting, which enables the extension of RNSD to datasets without setting information. Extensive experiments demonstrate that our RNSD method significantly outperforms the existing methods not only in the synthesized noise under multiple realism metrics, but also in the single image denoising performances. △ Less

Submitted 3 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.13622 [pdf, other]

Continual Learning with Strong Experience Replay

Authors: Tao Zhuo, Zhiyong Cheng, Zan Gao, Hehe Fan, Mohan Kankanhalli

Abstract: Continual Learning (CL) aims at incrementally learning new tasks without forgetting the knowledge acquired from old ones. Experience Replay (ER) is a simple and effective rehearsal-based strategy, which optimizes the model with current training data and a subset of old samples stored in a memory buffer. To further reduce forgetting, recent approaches extend ER with various techniques, such as mode… ▽ More Continual Learning (CL) aims at incrementally learning new tasks without forgetting the knowledge acquired from old ones. Experience Replay (ER) is a simple and effective rehearsal-based strategy, which optimizes the model with current training data and a subset of old samples stored in a memory buffer. To further reduce forgetting, recent approaches extend ER with various techniques, such as model regularization and memory sampling. However, the prediction consistency between the new model and the old one on current training data has been seldom explored, resulting in less knowledge preserved when few previous samples are available. To address this issue, we propose a CL method with Strong Experience Replay (SER), which additionally utilizes future experiences mimicked on the current training data, besides distilling past experience from the memory buffer. In our method, the updated model will produce approximate outputs as its original ones, which can effectively preserve the acquired knowledge. Experimental results on multiple image classification datasets show that our SER method surpasses the state-of-the-art methods by a noticeable margin. △ Less

Submitted 3 December, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

arXiv:2305.11818 [pdf, other]

MaGIC: Multi-modality Guided Image Completion

Authors: Yongsheng Yu, Hao Wang, Tiejian Luo, Heng Fan, Libo Zhang

Abstract: Vanilla image completion approaches exhibit sensitivity to large missing regions, attributed to the limited availability of reference information for plausible generation. To mitigate this, existing methods incorporate the extra cue as a guidance for image completion. Despite improvements, these approaches are often restricted to employing a single modality (e.g., segmentation or sketch maps), whi… ▽ More Vanilla image completion approaches exhibit sensitivity to large missing regions, attributed to the limited availability of reference information for plausible generation. To mitigate this, existing methods incorporate the extra cue as a guidance for image completion. Despite improvements, these approaches are often restricted to employing a single modality (e.g., segmentation or sketch maps), which lacks scalability in leveraging multi-modality for more plausible completion. In this paper, we propose a novel, simple yet effective method for Multi-modal Guided Image Completion, dubbed MaGIC, which not only supports a wide range of single modality as the guidance (e.g., text, canny edge, sketch, segmentation, depth, and pose), but also adapts to arbitrarily customized combination of these modalities (i.e., arbitrary multi-modality) for image completion. For building MaGIC, we first introduce a modality-specific conditional U-Net (MCU-Net) that injects single-modal signal into a U-Net denoiser for single-modal guided image completion. Then, we devise a consistent modality blending (CMB) method to leverage modality signals encoded in multiple learned MCU-Nets through gradient guidance in latent space. Our CMB is training-free, thereby avoids the cumbersome joint re-training of different modalities, which is the secret of MaGIC to achieve exceptional flexibility in accommodating new modalities for completion. Experiments show the superiority of MaGIC over state-of-the-art methods and its generalization to various completion tasks. Our project with code and models is available at yeates.github.io/MaGIC-Page/. △ Less

Submitted 21 November, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

Comments: 23 pages, 15 figures

arXiv:2305.08334 [pdf, other]

Logarithmic light cone, slow entanglement growth and scrambling, and quantum memory

Authors: Yu Zeng, Alioscia Hamma, Yu-Ran Zhang, Qiang Liu, Rengang Li, Heng Fan, Wu-Ming Liu

Abstract: Effective light cones may emerge in non-relativistic local quantum systems from the Lieb-Robinson bounds, resulting in exponentially decaying commutator norms of two space-time separated operators in the Heisenberg picture. Here, we derive a mechanism for the emergence and consequences of a logarithmic light cone (LLC). As a possible way, the LLC can emerge from a phenomenological model of many-bo… ▽ More Effective light cones may emerge in non-relativistic local quantum systems from the Lieb-Robinson bounds, resulting in exponentially decaying commutator norms of two space-time separated operators in the Heisenberg picture. Here, we derive a mechanism for the emergence and consequences of a logarithmic light cone (LLC). As a possible way, the LLC can emerge from a phenomenological model of many-body-localization. We show that the information scrambling is logarithmically slow in the regime of the LLC. We prove that the bipartite entanglement entropy grows logarithmically with time for arbitrary finite space dimensions and arbitrary initial pure states. As an application in quantum information processing, the LLC supports long-lived quantum memory after unitary time evolution: a quantum code with macroscopic code distance and exponentially long lifetime. △ Less

Submitted 25 July, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

Comments: 7+3 pages, 2 figures.In version 2, typos were fixed; the proof of theorem 1 was slightly modified

arXiv:2305.05875 [pdf, other]

Quantization Aware Attack: Enhancing Transferable Adversarial Attacks by Model Quantization

Authors: Yulong Yang, Chenhao Lin, Qian Li, Zhengyu Zhao, Haoran Fan, Dawei Zhou, Nannan Wang, Tongliang Liu, Chao Shen

Abstract: Quantized neural networks (QNNs) have received increasing attention in resource-constrained scenarios due to their exceptional generalizability. However, their robustness against realistic black-box adversarial attacks has not been extensively studied. In this scenario, adversarial transferability is pursued across QNNs with different quantization bitwidths, which particularly involve unknown arch… ▽ More Quantized neural networks (QNNs) have received increasing attention in resource-constrained scenarios due to their exceptional generalizability. However, their robustness against realistic black-box adversarial attacks has not been extensively studied. In this scenario, adversarial transferability is pursued across QNNs with different quantization bitwidths, which particularly involve unknown architectures and defense methods. Previous studies claim that transferability is difficult to achieve across QNNs with different bitwidths on the condition that they share the same architecture. However, we discover that under different architectures, transferability can be largely improved by using a QNN quantized with an extremely low bitwidth as the substitute model. We further improve the attack transferability by proposing \textit{quantization aware attack} (QAA), which fine-tunes a QNN substitute model with a multiple-bitwidth training objective. In particular, we demonstrate that QAA addresses the two issues that are commonly known to hinder transferability: 1) quantization shifts and 2) gradient misalignments. Extensive experimental results validate the high transferability of the QAA to diverse target models. For instance, when adopting the ResNet-34 substitute model on ImageNet, QAA outperforms the current best attack in attacking standardly trained DNNs, adversarially trained DNNs, and QNNs with varied bitwidths by 4.3\% $\sim$ 20.9\%, 8.7\% $\sim$ 15.5\%, and 2.6\% $\sim$ 31.1\% (absolute), respectively. In addition, QAA is efficient since it only takes one epoch for fine-tuning. In the end, we empirically explain the effectiveness of QAA from the view of the loss landscape. Our code is available at https://github.com/yyl-github-1896/QAA/ △ Less

Submitted 16 February, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

Comments: Accepted by IEEE Transactions on Information Forensics and Security in 2024

arXiv:2305.05372 [pdf, other]

doi 10.1103/PhysRevLett.131.151001

Measurement of ultra-high-energy diffuse gamma-ray emission of the Galactic plane from 10 TeV to 1 PeV with LHAASO-KM2A

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: The diffuse Galactic $γ$-ray emission, mainly produced via interactions between cosmic rays and the interstellar medium and/or radiation field, is a very important probe of the distribution, propagation, and interaction of cosmic rays in the Milky Way. In this work we report the measurements of diffuse $γ$-rays from the Galactic plane between 10 TeV and 1 PeV energies, with the square kilometer ar… ▽ More The diffuse Galactic $γ$-ray emission, mainly produced via interactions between cosmic rays and the interstellar medium and/or radiation field, is a very important probe of the distribution, propagation, and interaction of cosmic rays in the Milky Way. In this work we report the measurements of diffuse $γ$-rays from the Galactic plane between 10 TeV and 1 PeV energies, with the square kilometer array of the Large High Altitude Air Shower Observatory (LHAASO). Diffuse emissions from the inner ($15^{\circ}<l<125^{\circ}$, $|b|<5^{\circ}$) and outer ($125^{\circ}<l<235^{\circ}$, $|b|<5^{\circ}$) Galactic plane are detected with $29.1σ$ and $12.7σ$ significance, respectively. The outer Galactic plane diffuse emission is detected for the first time in the very- to ultra-high-energy domain ($E>10$~TeV). The energy spectrum in the inner Galaxy regions can be described by a power-law function with an index of $-2.99\pm0.04$, which is different from the curved spectrum as expected from hadronic interactions between locally measured cosmic rays and the line-of-sight integrated gas content. Furthermore, the measured flux is higher by a factor of $\sim3$ than the prediction. A similar spectrum with an index of $-2.99\pm0.07$ is found in the outer Galaxy region, and the absolute flux for $10\lesssim E\lesssim60$ TeV is again higher than the prediction for hadronic cosmic ray interactions. The latitude distributions of the diffuse emission are consistent with the gas distribution, while the longitude distributions show clear deviation from the gas distribution. The LHAASO measurements imply that either additional emission sources exist or cosmic ray intensities have spatial variations. △ Less

Submitted 19 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

Comments: 12 pages, 8 figures, 5 tables; accepted for publication in Physical Review Letters; source mask file provided as ancillary file

Journal ref: Phys. Rev. Lett. 131, 151001 (2023)

arXiv:2305.03976 [pdf, ps, other]

doi 10.1088/1674-4527/acd589

The chromatic Point Spread Function of weak lensing measurement in Chinese Space Station survey Telescope

Authors: Q. Y. Liu, X. Z. Er, Z. H. Fan, D. Z. Liu, G. L. Li, C. L. Wei, Z. Ban, X. B. Li, D. Yue

Abstract: The weak gravitational lensing is a powerful tool in modern cosmology. To accurately measure the weak lensing signal, one has to control the systematic bias to a small level. One of the most difficult problems is how to correct the smearing effect of the Point Spread Function (PSF) on the shape of the galaxies. The chromaticity of PSF for a broad-band observation can lead to new subtle effects. Si… ▽ More The weak gravitational lensing is a powerful tool in modern cosmology. To accurately measure the weak lensing signal, one has to control the systematic bias to a small level. One of the most difficult problems is how to correct the smearing effect of the Point Spread Function (PSF) on the shape of the galaxies. The chromaticity of PSF for a broad-band observation can lead to new subtle effects. Since the PSF is wavelength dependent and the spectrum energy distributions between stars and galaxies are different, the effective PSF measured from the star images will be different from that smears the galaxies. Such a bias is called colour bias. We estimate it in the optical bands of the Chinese Space Station Survey Telescope from simulated PSFs, and show the dependence on the colour and redshift of the galaxies. Moreover, due to the spatial variation of spectra over the galaxy image, there exists another higher-order bias, colour gradient bias. Our results show that both colour bias and colour gradient bias are generally below $0.1$ percent in CSST. Only for small-size galaxies, one needs to be careful about the colour gradient bias in the weak lensing analysis using CSST data. △ Less

Submitted 6 May, 2023; originally announced May 2023.

arXiv:2305.03546 [pdf, other]

Breast Cancer Immunohistochemical Image Generation: a Benchmark Dataset and Challenge Review

Authors: Chuang Zhu, Shengjie Liu, Zekuan Yu, Feng Xu, Arpit Aggarwal, Germán Corredor, Anant Madabhushi, Qixun Qu, Hongwei Fan, Fangda Li, Yueheng Li, Xianchao Guan, Yongbing Zhang, Vivek Kumar Singh, Farhan Akram, Md. Mostafa Kamal Sarker, Zhongyue Shi, Mulan **

Abstract: For invasive breast cancer, immunohistochemical (IHC) techniques are often used to detect the expression level of human epidermal growth factor receptor-2 (HER2) in breast tissue to formulate a precise treatment plan. From the perspective of saving manpower, material and time costs, directly generating IHC-stained images from Hematoxylin and Eosin (H&E) stained images is a valuable research direct… ▽ More For invasive breast cancer, immunohistochemical (IHC) techniques are often used to detect the expression level of human epidermal growth factor receptor-2 (HER2) in breast tissue to formulate a precise treatment plan. From the perspective of saving manpower, material and time costs, directly generating IHC-stained images from Hematoxylin and Eosin (H&E) stained images is a valuable research direction. Therefore, we held the breast cancer immunohistochemical image generation challenge, aiming to explore novel ideas of deep learning technology in pathological image generation and promote research in this field. The challenge provided registered H&E and IHC-stained image pairs, and participants were required to use these images to train a model that can directly generate IHC-stained images from corresponding H&E-stained images. We selected and reviewed the five highest-ranking methods based on their PSNR and SSIM metrics, while also providing overviews of the corresponding pipelines and implementations. In this paper, we further analyze the current limitations in the field of breast cancer immunohistochemical image generation and forecast the future development of this field. We hope that the released dataset and the challenge will inspire more scholars to jointly study higher-quality IHC-stained image generation. △ Less

Submitted 22 September, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

Comments: 12 pages, 12 figures, 2tables

arXiv:2304.12159 [pdf, ps, other]

First Experimental Study of the Purely Leptonic Decay $D_s^{*+}\to e^+ν_e$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, J. Bloms, A. Bortone, I. Boyko, R. A. Briere , et al. (604 additional authors not shown)

Abstract: Using $7.33~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data taken with the BESIII detector at the BEPCII collider, we report the first experimental study of the purely leptonic decay $D_s^{*+}\to e^+ν_e$. A signal for the decay $D_s^{*+}\to e^+ν_e$ is observed with a statistical significance of $2.9σ$. The branching fraction of ${D_s^{*+}\to e^+ν_e}$ is measured to be… ▽ More Using $7.33~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data taken with the BESIII detector at the BEPCII collider, we report the first experimental study of the purely leptonic decay $D_s^{*+}\to e^+ν_e$. A signal for the decay $D_s^{*+}\to e^+ν_e$ is observed with a statistical significance of $2.9σ$. The branching fraction of ${D_s^{*+}\to e^+ν_e}$ is measured to be $(2.1{^{+1.2}_{-0.9}}_{\rm stat.}\pm0.2_{\rm syst.})\times 10^{-5}$, corresponding to an upper limit of $4.0\times10^{-5}$ at the 90\% confidence level. Taking the total width of the $D_s^{*+}$~(($0.070\pm0.028$) keV) predicted by lattice quantum chromodynamics as input, the decay constant of the $D^{*+}_s$ is determined to be $f_{D_s^{*+}}=(213.6{^{+61.0}_{-45.8}}_{\rm stat.}\pm43.9_{\rm syst.})$ MeV, corresponding to an upper limit of 353.8 MeV at the 90\% confidence level. △ Less

Submitted 24 April, 2023; originally announced April 2023.

arXiv:2304.11335 [pdf, other]

Two Birds, One Stone: A Unified Framework for Joint Learning of Image and Video Style Transfers

Authors: Bohai Gu, Heng Fan, Libo Zhang

Abstract: Current arbitrary style transfer models are limited to either image or video domains. In order to achieve satisfying image and video style transfers, two different models are inevitably required with separate training processes on image and video domains, respectively. In this paper, we show that this can be precluded by introducing UniST, a Unified Style Transfer framework for both images and vid… ▽ More Current arbitrary style transfer models are limited to either image or video domains. In order to achieve satisfying image and video style transfers, two different models are inevitably required with separate training processes on image and video domains, respectively. In this paper, we show that this can be precluded by introducing UniST, a Unified Style Transfer framework for both images and videos. At the core of UniST is a domain interaction transformer (DIT), which first explores context information within the specific domain and then interacts contextualized domain information for joint learning. In particular, DIT enables exploration of temporal information from videos for the image style transfer task and meanwhile allows rich appearance texture from images for video style transfer, thus leading to mutual benefits. Considering heavy computation of traditional multi-head self-attention, we present a simple yet effective axial multi-head self-attention (AMSA) for DIT, which improves computational efficiency while maintains style transfer performance. To verify the effectiveness of UniST, we conduct extensive experiments on both image and video style transfer tasks and show that UniST performs favorably against state-of-the-art approaches on both tasks. Code is available at https://github.com/NevSNev/UniST. △ Less

Submitted 1 September, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

Comments: Conference on International Conference on Computer Vision.(ICCV 2023)

arXiv:2304.09405 [pdf, other]

doi 10.1007/JHEP09(2023)125

Measurement of branching fractions of $Λ_{c}^{+}$ decays to $Σ^{+} K^{+} K^{-}$, $Σ^{+}φ$ and $Σ^{+} K^{+} π^{-}(π^{0})$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, J. Bloms, A. Bortone, I. Boyko, R. A. Briere , et al. (601 additional authors not shown)

Abstract: Based on 4.5 fb$^{-1}$ data taken at seven center-of-mass energies ranging from 4.600 to 4.699 GeV with the BESIII detector at the BEPCII collider, we measure the branching fractions of $Λ_{c}^{+}\rightarrowΣ^{+}+hadrons$ relative to $Λ_{c}^{+}\rightarrow Σ^+ π^+ π^-$. Combining with the world average branching fraction of $Λ_{c}^{+}\rightarrow Σ^+ π^+ π^-$, their branching fractions are measured… ▽ More Based on 4.5 fb$^{-1}$ data taken at seven center-of-mass energies ranging from 4.600 to 4.699 GeV with the BESIII detector at the BEPCII collider, we measure the branching fractions of $Λ_{c}^{+}\rightarrowΣ^{+}+hadrons$ relative to $Λ_{c}^{+}\rightarrow Σ^+ π^+ π^-$. Combining with the world average branching fraction of $Λ_{c}^{+}\rightarrow Σ^+ π^+ π^-$, their branching fractions are measured to be $(0.377\pm0.042\pm0.018\pm0.021)\%$ for $Λ_{c}^{+}\rightarrowΣ^{+} K^{+} K^{-}$, $(0.200\pm0.023\pm0.010\pm0.011)\%$ for $Λ_{c}^{+}\rightarrowΣ^{+} K^{+} π^{-}$, $(0.414\pm0.080\pm0.029\pm0.023)\%$ for $Λ_{c}^{+}\rightarrowΣ^{+}φ$ and $(0.197\pm0.036\pm0.008\pm0.011)\%$ for $Λ_{c}^{+}\rightarrowΣ^{+}K^{+} K^{-}$(non-$φ$). In all the above results, the first uncertainties are statistical, the second are systematic and the third are from external input of the branching fraction of $Λ_{c}^{+}\rightarrow Σ^+ π^+ π^-$. Since no signal for $Λ_{c}^{+}\rightarrowΣ^{+} K^{+} π^{-}π^{0}$ is observed, the upper limit of its branching fraction is determined to be 0.11\% at the 90$\%$ confidence level. △ Less

Submitted 30 August, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

Journal ref: JHEP09(2023)125

arXiv:2304.07783 [pdf, other]

doi 10.1103/PhysRevD.108.032004

Cross section measurements of $e^+e^- \to ΦK^+ K^-$ and $e^+ e^- \to ΦK_S^0 K_S^0$ at center-of-mass energies between 3.7730 GeV and 4.7008 GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, J. Bloms, A. Bortone, I. Boyko, R. A. Briere , et al. (600 additional authors not shown)

Abstract: Based on 22.7 fb$^{-1}$ of $e^+e^-$ annihilation data collected at 33 different center-of-mass energies between 3.7730 GeV and 4.7008 GeV with the BESIII detector at the BEPCII collider, Born cross sections of the two processes $e^+e^-\to φK^+ K^-$ and $e^+ e^- \to φK_{S}^{0} K_{S}^{0}$ are measured for the first time. No indication of resonant production through an intermediate vector state $V$ i… ▽ More Based on 22.7 fb$^{-1}$ of $e^+e^-$ annihilation data collected at 33 different center-of-mass energies between 3.7730 GeV and 4.7008 GeV with the BESIII detector at the BEPCII collider, Born cross sections of the two processes $e^+e^-\to φK^+ K^-$ and $e^+ e^- \to φK_{S}^{0} K_{S}^{0}$ are measured for the first time. No indication of resonant production through an intermediate vector state $V$ is observed, and the upper limits on the product of the electronic width $Γ_{e^+e^-}$ and the branching fraction $Br(V\rightarrow φK \bar{K})$ of the processes $e^+e^- \to V \to φK^+ K^-$ and $e^+e^- \to V \to φK_S^0K_S^0$ at the $90\%$ confidence level are obtained for a large parameter space in resonance masses and widths. For the current world average mass and width of the $ψ(4230)$ of $m=4.2187$ GeV$/c^2$ and $Γ=44$ MeV, we set upper limits on the $φK^+ K^-$ and $φK_S^0K_S^0$ final states of 1.75 eV and 0.47 eV at the $90\%$ confidence level, respectively. △ Less

Submitted 16 April, 2023; originally announced April 2023.

arXiv:2304.07018 [pdf, other]

DIPNet: Efficiency Distillation and Iterative Pruning for Image Super-Resolution

Authors: Lei Yu, Xinpeng Li, Youwei Li, Ting Jiang, Qi Wu, Haoqiang Fan, Shuaicheng Liu

Abstract: Efficient deep learning-based approaches have achieved remarkable performance in single image super-resolution. However, recent studies on efficient super-resolution have mainly focused on reducing the number of parameters and floating-point operations through various network designs. Although these methods can decrease the number of parameters and floating-point operations, they may not necessari… ▽ More Efficient deep learning-based approaches have achieved remarkable performance in single image super-resolution. However, recent studies on efficient super-resolution have mainly focused on reducing the number of parameters and floating-point operations through various network designs. Although these methods can decrease the number of parameters and floating-point operations, they may not necessarily reduce actual running time. To address this issue, we propose a novel multi-stage lightweight network boosting method, which can enable lightweight networks to achieve outstanding performance. Specifically, we leverage enhanced high-resolution output as additional supervision to improve the learning ability of lightweight student networks. Upon convergence of the student network, we further simplify our network structure to a more lightweight level using reparameterization techniques and iterative network pruning. Meanwhile, we adopt an effective lightweight network training strategy that combines multi-anchor distillation and progressive learning, enabling the lightweight network to achieve outstanding performance. Ultimately, our proposed method achieves the fastest inference time among all participants in the NTIRE 2023 efficient super-resolution challenge while maintaining competitive super-resolution performance. Additionally, extensive experiments are conducted to demonstrate the effectiveness of the proposed components. The results show that our approach achieves comparable performance in representative dataset DIV2K, both qualitatively and quantitatively, with faster inference and fewer number of network parameters. △ Less

Submitted 14 April, 2023; originally announced April 2023.

arXiv:2304.06649 [pdf]

doi 10.1016/j.compag.2023.107808

Prediction method of cigarette draw resistance based on correlation analysis

Authors: Linsheng Chen, Zhonghua Yu, Bo Zhang, Qiang Zhu, Hu Fan, Yucan Qiu

Abstract: The cigarette draw resistance monitoring method is incomplete and single, and the lacks correlation analysis and preventive modeling, resulting in substandard cigarettes in the market. To address this problem without increasing the hardware cost, in this paper, multi-indicator correlation analysis is used to predict cigarette draw resistance. First, the monitoring process of draw resistance is ana… ▽ More The cigarette draw resistance monitoring method is incomplete and single, and the lacks correlation analysis and preventive modeling, resulting in substandard cigarettes in the market. To address this problem without increasing the hardware cost, in this paper, multi-indicator correlation analysis is used to predict cigarette draw resistance. First, the monitoring process of draw resistance is analyzed based on the existing quality control framework, and optimization ideas are proposed. In addition, for the three production units, the cut tobacco supply (VE), the tobacco rolling (SE), and the cigarette-forming (MAX), direct and potential factors associated with draw resistance are explored, based on the linear and non-linear correlation analysis. Then, the correlates of draw resistance are used as inputs for the machine learning model, and the predicted values of draw resistance are used as outputs. Finally, this research also innovatively verifies the practical application value of draw resistance prediction: the distribution characteristics of substandard cigarettes are analyzed based on the prediction results, the time interval of substandard cigarettes being produced is determined, the probability model of substandard cigarettes being sampled is derived, and the reliability of the prediction result is further verified by the example. The results show that the prediction model based on correlation analysis has good performance in three months of actual production. △ Less

Submitted 12 April, 2023; originally announced April 2023.

Comments: Preprint, submitted to Computers and Electronics in Agriculture. For any suggestions or improvements, please contact me directly by e-mail

arXiv:2304.05791 [pdf, ps, other]

doi 10.1103/PhysRevA.108.012423

Sequential sharing of two-qudit entanglement based on the entropic uncertainty relation

Authors: Ming-Liang Hu, Heng Fan

Abstract: Entanglement and uncertainty relation are two focuses of quantum theory. We relate entanglement sharing to the entropic uncertainty relation in a $(d\times d)$-dimensional system via weak measurements with different pointers. We consider both the scenarios of one-sided sequential measurements in which the entangled pair is distributed to multiple Alices and one Bob and two-sided sequential measure… ▽ More Entanglement and uncertainty relation are two focuses of quantum theory. We relate entanglement sharing to the entropic uncertainty relation in a $(d\times d)$-dimensional system via weak measurements with different pointers. We consider both the scenarios of one-sided sequential measurements in which the entangled pair is distributed to multiple Alices and one Bob and two-sided sequential measurements in which the entangled pair is distributed to multiple Alices and Bobs. It is found that the maximum number of observers sharing the entanglement strongly depends on the measurement scenarios, the pointer states of the apparatus, and the local dimension $d$ of each subsystem, while the required minimum measurement precision to achieve entanglement sharing decreases to its asymptotic value with the increase of $d$. The maximum number of observers remain unaltered even when the state is not maximally entangled but has strong-enough entanglement. △ Less

Submitted 24 July, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

Comments: 11 pages, 6 figures; Final version published in Phys. Rev. A

Journal ref: Phys. Rev. A 108, 012423 (2023)

arXiv:2304.03283 [pdf, other]

Diffusion Models as Masked Autoencoders

Authors: Chen Wei, Karttikeya Mangalam, Po-Yao Huang, Yanghao Li, Haoqi Fan, Hu Xu, Huiyu Wang, Cihang Xie, Alan Yuille, Christoph Feichtenhofer

Abstract: There has been a longstanding belief that generation can facilitate a true understanding of visual data. In line with this, we revisit generatively pre-training visual representations in light of recent interest in denoising diffusion models. While directly pre-training with diffusion models does not produce strong representations, we condition diffusion models on masked input and formulate diffus… ▽ More There has been a longstanding belief that generation can facilitate a true understanding of visual data. In line with this, we revisit generatively pre-training visual representations in light of recent interest in denoising diffusion models. While directly pre-training with diffusion models does not produce strong representations, we condition diffusion models on masked input and formulate diffusion models as masked autoencoders (DiffMAE). Our approach is capable of (i) serving as a strong initialization for downstream recognition tasks, (ii) conducting high-quality image inpainting, and (iii) being effortlessly extended to video where it produces state-of-the-art classification accuracy. We further perform a comprehensive study on the pros and cons of design choices and build connections between diffusion models and masked autoencoders. △ Less

Submitted 6 April, 2023; originally announced April 2023.

Comments: Tech report. Project page: https://weichen582.github.io/diffmae.html

arXiv:2303.16383 [pdf, ps, other]

The jet apparent motion and central engine study of Fermi blazars

Authors: H. B. Xiao, J. T. Zhu, J. H. Fan, Z. Y. Pei, Z. J. Luo, S. H. Zhang

Abstract: The study of blazar jet has been performed for several decades via the VLBI technique, while its generation and propagation stay unclear. In the present work, we compiled a sample of 407 VLBI detected \textit{Fermi} blazars (VFBs) and studied the correlations between apparent velocity (${\rm log}\,β_{\rm app}$) and jet/accretion disk properties. We found a positive correlation between $γ$-ray lumi… ▽ More The study of blazar jet has been performed for several decades via the VLBI technique, while its generation and propagation stay unclear. In the present work, we compiled a sample of 407 VLBI detected \textit{Fermi} blazars (VFBs) and studied the correlations between apparent velocity (${\rm log}\,β_{\rm app}$) and jet/accretion disk properties. We found a positive correlation between $γ$-ray luminosity (${\rm log}\,L_{\rm γ}$) and ${\rm log}\,β_{\rm app}$, the correlation suggests that the apparent motion of jet knot is related to the jet power. △ Less

Submitted 28 March, 2023; originally announced March 2023.

Comments: 11 pages, 7 figures, 3 tables

Journal ref: Published on MNRAS in 2022

arXiv:2303.15781 [pdf, other]

doi 10.1103/PhysRevD.107.116010

Momentum spirals in multiphoton pair production revisited

Authors: Li-Na Hu, Orkash Amat, Li Wang, Adiljan Sawut, Hong-Hao Fan, B. S. Xie

Abstract: Spirals in multiphoton pair production are revisited by two counter-rotating fields with time delay for different cycles in pulse. Novel findings include that for subcycle fields, the remarkable spiral structure in the momentum spectrum can be still caused by a large time delay compared to the previous study for supercycle case where it is easier to be generated by a small time delay. And also the… ▽ More Spirals in multiphoton pair production are revisited by two counter-rotating fields with time delay for different cycles in pulse. Novel findings include that for subcycle fields, the remarkable spiral structure in the momentum spectrum can be still caused by a large time delay compared to the previous study for supercycle case where it is easier to be generated by a small time delay. And also there exist a range of critical polarization values for the spirals appearance corresponding to the different cycle number. The relative phase difference between two fields causes not only severe symmetry breaking of the momentum spectra pattern and spiral, but also a significant change for the shape and the number of spiral arm. Upon the number density, it is found a more sensitive to the cycle number, in particularly, it is enhanced by more than one order of magnitude for small cycle pulse, while it is increased about few times when the time delay is small. These results provide an abundant theoretical testbed for the possible experimental observation on the multiphoton pair production in future. Meanwhile, it is applicable to regard the particles momentum signatures as a new probing to the laser field information with it from the vacuum. △ Less

Submitted 16 June, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

Comments: 31 pages, 12 figures, 2 tables

Journal ref: Physical Review D 107, 116010 (2023)

arXiv:2303.13496 [pdf, other]

The effectiveness of MAE pre-pretraining for billion-scale pretraining

Authors: Mannat Singh, Quentin Duval, Kalyan Vasudev Alwala, Haoqi Fan, Vaibhav Aggarwal, Aaron Adcock, Armand Joulin, Piotr Dollár, Christoph Feichtenhofer, Ross Girshick, Rohit Girdhar, Ishan Misra

Abstract: This paper revisits the standard pretrain-then-finetune paradigm used in computer vision for visual recognition tasks. Typically, state-of-the-art foundation models are pretrained using large scale (weakly) supervised datasets with billions of images. We introduce an additional pre-pretraining stage that is simple and uses the self-supervised MAE technique to initialize the model. While MAE has on… ▽ More This paper revisits the standard pretrain-then-finetune paradigm used in computer vision for visual recognition tasks. Typically, state-of-the-art foundation models are pretrained using large scale (weakly) supervised datasets with billions of images. We introduce an additional pre-pretraining stage that is simple and uses the self-supervised MAE technique to initialize the model. While MAE has only been shown to scale with the size of models, we find that it scales with the size of the training dataset as well. Thus, our MAE-based pre-pretraining scales with both model and data size making it applicable for training foundation models. Pre-pretraining consistently improves both the model convergence and the downstream transfer performance across a range of model scales (millions to billions of parameters), and dataset sizes (millions to billions of images). We measure the effectiveness of pre-pretraining on 10 different visual recognition tasks spanning image classification, video recognition, object detection, low-shot classification and zero-shot recognition. Our largest model achieves new state-of-the-art results on iNaturalist-18 (91.7%), ImageNet-ReaL (91.1%), 1-shot ImageNet-1k (63.6%), and zero-shot transfer on Food-101 (96.2%). Our study reveals that model initialization plays a significant role, even for web-scale pretraining with billions of images, and our models are available publicly. △ Less

Submitted 24 January, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

Comments: ICCV 2023. Models available at https://github.com/facebookresearch/maws/

arXiv:2303.11243 [pdf, other]

Augment and Criticize: Exploring Informative Samples for Semi-Supervised Monocular 3D Object Detection

Authors: Zhenyu Li, Zhipeng Zhang, Heng Fan, Yuan He, Ke Wang, Xianming Liu, Junjun Jiang

Abstract: In this paper, we improve the challenging monocular 3D object detection problem with a general semi-supervised framework. Specifically, having observed that the bottleneck of this task lies in lacking reliable and informative samples to train the detector, we introduce a novel, simple, yet effective `Augment and Criticize' framework that explores abundant informative samples from unlabeled data fo… ▽ More In this paper, we improve the challenging monocular 3D object detection problem with a general semi-supervised framework. Specifically, having observed that the bottleneck of this task lies in lacking reliable and informative samples to train the detector, we introduce a novel, simple, yet effective `Augment and Criticize' framework that explores abundant informative samples from unlabeled data for learning more robust detection models. In the `Augment' stage, we present the Augmentation-based Prediction aGgregation (APG), which aggregates detections from various automatically learned augmented views to improve the robustness of pseudo label generation. Since not all pseudo labels from APG are beneficially informative, the subsequent `Criticize' phase is presented. In particular, we introduce the Critical Retraining Strategy (CRS) that, unlike simply filtering pseudo labels using a fixed threshold (e.g., classification score) as in 2D semi-supervised tasks, leverages a learnable network to evaluate the contribution of unlabeled images at different training timestamps. This way, the noisy samples prohibitive to model evolution could be effectively suppressed. To validate our framework, we apply it to MonoDLE and MonoFlex. The two new detectors, dubbed 3DSeMo_DLE and 3DSeMo_FLEX, achieve state-of-the-art results with remarkable improvements for over 3.5% AP_3D/BEV (Easy) on KITTI, showing its effectiveness and generality. Code and models will be released. △ Less

Submitted 20 March, 2023; originally announced March 2023.

Showing 201–250 of 990 results for author: Fan, H