Showing 151–200 of 577 results for author: Wan, X

  1. arXiv:2302.03622  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Novel three-dimensional Fermi surface and electron-correlation-induced charge density wave in FeGe

    Authors: Lin Wu, Yating Hu, Di Wang, Xiangang Wan

    Abstract: As the first magnetic kagome material to exhibit the charge density wave (CDW) order, FeGe has attracted much attention in recent studies. Similar to AV$_{3}$Sb$_{5}$ (A = K, Cs, Rb), FeGe exhibits the CDW pattern with an in-plane 2$\times $2 structure and the existence of van Hove singularities (vHSs) near the Fermi level. However, sharply different from AV$_{3}$Sb$_{5}$ which has phonon instabil…

    Submitted 13 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Journal ref: Chinese Physics Letters 40, 117103 (2023)

  2. arXiv:2301.13538  [pdf, other]

    cs.CV

    AMD: Adaptive Masked Distillation for Object Detection

    Authors: Guang Yang, Yin Tang, Jun Li, Jianhua Xu, Xili Wan

    Abstract: As a general model compression paradigm, feature-based knowledge distillation allows the student model to learn expressive features from the teacher counterpart. In this paper, we mainly focus on designing an effective feature-distillation framework and propose a spatial-channel adaptive masked distillation (AMD) network for object detection. More specifically, in order to accurately reconstruct i…

    Submitted 10 February, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

  3. arXiv:2301.12132  [pdf, other]

    cs.CL cs.AI cs.LG

    AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning

    Authors: Han Zhou, Xingchen Wan, Ivan Vulić, Anna Korhonen

    Abstract: Large pretrained language models are widely used in downstream NLP tasks via task-specific fine-tuning, but such procedures can be costly. Recently, Parameter-Efficient Fine-Tuning (PEFT) methods have achieved strong task performance while updating much fewer parameters than full model fine-tuning (FFT). However, it is non-trivial to make informed design choices on the PEFT configurations, such as…

    Submitted 29 January, 2024; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: Accepted to TACL; pre-MIT Press publication version

  4. arXiv:2301.06277  [pdf, ps, other]

    cs.SD cs.AI cs.LG eess.AS

    Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings

    Authors: Kai Liu, Xucheng Wan, Ziqing Du, Huan Zhou

    Abstract: As a practical alternative to speech separation, target speaker extraction (TSE) aims to extract the speech of the desired speaker using an additional speaker cue extracted from that speaker. Its main challenge lies in how to properly extract and leverage the speaker cue to benefit the extracted speech quality. The cue extraction method adopted in the majority of existing TSE studies is to directly utilize…

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: ACCEPTED by NCMMSC 2022

  5. arXiv:2301.04904  [pdf, other]

    eess.IV cs.CV

    Lesion-aware Dynamic Kernel for Polyp Segmentation

    Authors: Ruifei Zhang, Peiwen Lai, Xiang Wan, De-Jun Fan, Feng Gao, Xiao-Jian Wu, Guanbin Li

    Abstract: Automatic and accurate polyp segmentation plays an essential role in early colorectal cancer diagnosis. However, it has always been a challenging task due to 1) the diverse shape, size, brightness and other appearance characteristics of polyps, 2) the tiny contrast between concealed polyps and their surrounding regions. To address these problems, we propose a lesion-aware dynamic network (LDNet) f…

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: Accepted by MICCAI2022

  6. Positivity of Schur forms for strongly decomposably positive vector bundles

    Authors: Xueyuan Wan

    Abstract: In this paper, we define two types of strongly decomposable positivity, which serve as generalizations of (dual) Nakano positivity and are stronger than the decomposable positivity introduced by S. Finski. We provide the criteria for strongly decomposable positivity of type I and type II and prove that the Schur forms of a strongly decomposable positive vector bundle of type I are weakly positive,…

    Submitted 7 December, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: 31 pages, 1 figure, final version, to appear in Forum of Mathematics, Sigma

  7. arXiv:2212.13963  [pdf, other]

    cond-mat.mtrl-sci cond-mat.str-el

    First-principles study of spin orbit coupling contribution to anisotropic magnetic interaction

    Authors: Di Wang, Xiangyan Bo, Feng Tang, Xiangang Wan

    Abstract: Anisotropic magnetic exchange interactions lead to a surprisingly rich variety of the magnetic properties. Considering the spin orbit coupling (SOC) as perturbation, we extract the general expression of a bilinear spin Hamiltonian, including isotropic exchange interaction, antisymmetric Dzyaloshinskii-Moriya (DM) interaction and symmetric $Γ$ term. Though it is commonly believed that the magnitude…

    Submitted 28 December, 2022; originally announced December 2022.

    Journal ref: Phys. Rev. B 108, 085140 (2023)

  8. Which Pixel to Annotate: a Label-Efficient Nuclei Segmentation Framework

    Authors: Wei Lou, Haofeng Li, Guanbin Li, Xiaoguang Han, Xiang Wan

    Abstract: Recently deep neural networks, which require a large amount of annotated samples, have been widely applied in nuclei instance segmentation of H&E stained pathology images. However, it is inefficient and unnecessary to label all pixels for a dataset of nuclei images which usually contain similar and redundant patterns. Although unsupervised and semi-supervised learning methods have been studied fo…

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: IEEE TMI 2022, Released code: https://github.com/lhaof/NuSeg

    ACM Class: I.4.6

  9. arXiv:2212.10171  [pdf, other]

    cs.CL

    Document-level Relation Extraction with Relation Correlations

    Authors: Ridong Han, Tao Peng, Benyou Wang, Lu Liu, Xiang Wan

    Abstract: Document-level relation extraction faces two overlooked challenges: the long-tail problem and the multi-label problem. Previous work focuses mainly on obtaining better contextual representations for entity pairs and hardly addresses the above challenges. In this paper, we analyze the co-occurrence correlation of relations, and introduce it into the DocRE task for the first time. We argue that the correlations can…

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 13 pages

  10. arXiv:2212.07019  [pdf]

    econ.GN

    Data-Driven Prediction and Evaluation on Future Impact of Energy Transition Policies in Smart Regions

    Authors: Chunmeng Yang, Siqi Bu, Yi Fan, Wayne Xinwei Wan, Ruoheng Wang, Aoife Foley

    Abstract: To meet widely recognised carbon neutrality targets, over the last decade metropolitan regions around the world have implemented policies to promote the generation and use of sustainable energy. Nevertheless, there is an availability gap in formulating and evaluating these policies in a timely manner, since sustainable energy capacity and generation are dynamically determined by various factors al…

    Submitted 13 December, 2022; originally announced December 2022.

  11. arXiv:2212.03609  [pdf, ps, other]

    hep-th hep-ph

    Fractional Path Integrals and its degeneration to Dimensional Regularization

    Authors: Zheng-Wei Cheng, You-Kai Wang, Xia Wan

    Abstract: In this work we study particles propagating along a fractional path and use fractional derivatives to extend the dynamic dimension of Quantum Field Theory. We construct the Lagrangian of fractional scalar, vector and spinor fields to obtain their propagators by path integral. Then we compute the typical tree level and one loop diagrams which correspond to QED cases. The calculations show the dimension…

    Submitted 14 September, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: 17 pages

  12. arXiv:2212.02040  [pdf]

    cond-mat.mtrl-sci

    Discovery of a metallic oxide with ultralow thermal conductivity

    Authors: Jianhong Dai, Zhehong Liu, Jialin Ji, Xuejuan Dong, Jihai Yu, Xubin Ye, Weipeng Wang, RiCheng Yu, Zhiwei Hu, Huaizhou Zhao, Xiangang Wan, Wenqing Zhang, Youwen Long

    Abstract: A compound with metallic electrical conductivity usually has a considerable total thermal conductivity because both electrons and phonons contribute to thermal transport. Here, we show an exceptional example of iridium oxide, Bi$_3$Ir$_3$O$_{11}$, that concurrently displays metallic electrical conductivity and ultralow thermal conductivity approaching 0.61 W m$^{-1}$ K$^{-1}$ at 300 K. The compound crystallizes into a…

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: 20 pages, 4 figures

  13. arXiv:2211.15545  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci cond-mat.supr-con

    Magnetic interactions and possible structural distortion in kagome FeGe from first-principles study and symmetry analysis

    Authors: Hanjing Zhou, Songsong Yan, Dongze Fan, Di Wang, Xiangang Wan

    Abstract: Based on density functional theory and symmetry analysis, we present a comprehensive investigation of electronic structure, magnetic properties and possible structural distortion of magnetic kagome metal FeGe. We estimate the magnetic parameters including Heisenberg and Dzyaloshinskii-Moriya (DM) interactions, and find that the ferromagnetic nearest-neighbor $J_{1}$ dominates over the others, whil…

    Submitted 28 November, 2022; originally announced November 2022.

    Journal ref: Phys. Rev. B 108, 035138 (2023)

  14. arXiv:2211.11618  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el

    Topological exact flat bands in two dimensional materials under periodic strain

    Authors: Xiaohan Wan, Siddhartha Sarkar, Shi-Zeng Lin, Kai Sun

    Abstract: We study flat bands and their topology in 2D materials with quadratic band crossing points (QBCPs) under periodic strain. In contrast to Dirac points in graphene, where strain acts as a vector potential, strain for QBCPs serves as a director potential with angular momentum $\ell=2$. We prove that when the strengths of the strain fields hit certain "magic" values, exact flat bands with $C=\pm 1$ e…

    Submitted 26 May, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Journal ref: Phys. Rev. Lett. 130, 216401 (2023)

  15. arXiv:2211.10992  [pdf, other]

    cs.CV cs.CL

    How to Describe Images in a More Funny Way? Towards a Modular Approach to Cross-Modal Sarcasm Generation

    Authors: Jie Ruan, Yue Wu, Xiaojun Wan, Yuesheng Zhu

    Abstract: Sarcasm generation has been investigated in previous studies by considering it as a text-to-text generation problem, i.e., generating a sarcastic sentence for an input sentence. In this paper, we study a new problem of cross-modal sarcasm generation (CMSG), i.e., generating a sarcastic description for a given image. CMSG is challenging as models need to satisfy the characteristics of sarcasm, as w…

    Submitted 20 November, 2022; originally announced November 2022.

  16. arXiv:2211.08909  [pdf]

    cond-mat.mtrl-sci

    Continuous Electrical Manipulation of Magnetic Anisotropy and Spin Flopping in van der Waals Ferromagnetic Devices

    Authors: Ming Tang, Junwei Huang, Feng Qin, Kun Zhai, Toshiya Ideue, Zeya Li, Fanhao Meng, Anmin Nie, Linglu Wu, Xiangyu Bi, Caorong Zhang, Ling Zhou, Peng Chen, Caiyu Qiu, Peizhe Tang, Haijun Zhang, Xiangang Wan, Lin Wang, Zhongyuan Liu, Yongjun Tian, Yoshihiro Iwasa, Hongtao Yuan

    Abstract: Controlling the magnetic anisotropy of ferromagnetic materials plays a key role in magnetic switching devices and spintronic applications. Examples of spin-orbit torque devices with different magnetic anisotropy geometries (in-plane or out-of-plane directions) have been demonstrated with novel magnetization switching mechanisms for extended device functionalities. Normally, the intrinsic magnetic…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 4 figures

  17. Toward expanding the scope of radiology report summarization to multiple anatomies and modalities

    Authors: Zhihong Chen, Maya Varma, Xiang Wan, Curtis Langlotz, Jean-Benoit Delbrouck

    Abstract: Radiology report summarization (RRS) is a growing area of research. Given the Findings section of a radiology report, the goal is to generate a summary (called an Impression section) that highlights the key observations and conclusions of the radiology study. However, RRS currently faces essential limitations. First, many prior studies conduct experiments on private datasets, preventing reproductio…

    Submitted 21 July, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Journal ref: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2023

  18. arXiv:2211.07843  [pdf, other]

    cs.CL

    Error-Robust Retrieval for Chinese Spelling Check

    Authors: Xunjian Yin, Xinyu Hu, Jin Jiang, Xiaojun Wan

    Abstract: Chinese Spelling Check (CSC) aims to detect and correct error tokens in Chinese contexts, which has a wide range of applications. However, it is confronted with the challenges of insufficient annotated data and the issue that previous methods may actually not fully leverage the existing datasets. In this paper, we introduce our plug-and-play retrieval method with error-robust information for Chine…

    Submitted 25 February, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: 11 pages, 3 figures

  19. arXiv:2211.01543  [pdf, other]

    hep-ph

    Custodial Symmetry Violation in Scalar Extensions of the Standard Model

    Authors: Huayang Song, Xia Wan, Jiang-Hao Yu

    Abstract: The new measurement of the W boson mass from the CDF collaboration shows a significant tension with the Standard Model prediction, which evidences violation of custodial symmetry in the scalar sector. We study the scalar extensions of the Standard Model, which can be categorized into two classes, scalar sector with custodial symmetry (Georgi-Machacek model and its generalizations) and scalar secto…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 24 pages, 4 figures

  20. arXiv:2210.16098  [pdf, other]

    q-bio.BM cs.LG

    Predicting Protein-Ligand Binding Affinity with Equivariant Line Graph Network

    Authors: Yiqiang Yi, Xu Wan, Kangfei Zhao, Le Ou-Yang, Peilin Zhao

    Abstract: Binding affinity prediction of three-dimensional (3D) protein ligand complexes is critical for drug repositioning and virtual drug screening. Existing approaches transform a 3D protein-ligand complex to a two-dimensional (2D) graph, and then use graph neural networks (GNNs) to predict its binding affinity. However, the node and edge features of the 2D graph are extracted based on invariant local c…

    Submitted 26 October, 2022; originally announced October 2022.

  21. arXiv:2210.14402  [pdf, other]

    cs.LG math.NA

    Adaptive deep density approximation for fractional Fokker-Planck equations

    Authors: Li Zeng, Xiaoliang Wan, Tao Zhou

    Abstract: In this work, we propose adaptive deep learning approaches based on normalizing flows for solving fractional Fokker-Planck equations (FPEs). The solution of an FPE is a probability density function (PDF). Traditional mesh-based methods are ineffective because of the unbounded computation domain, a large number of dimensions and the nonlocal fractional operator. To this end, we represent the solutio…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: 25 pages, 22 figures

  22. Gate-tunable Lifshitz transition of Fermi arcs and its nonlocal transport signatures

    Authors: Yue Zheng, Wei Chen, Xiangang Wan, D. Y. Xing

    Abstract: One hallmark of the Weyl semimetal is the emergence of Fermi arcs (FAs) in the surface Brillouin zone that connect the projected Weyl nodes of opposite chirality. The unclosed FAs can give rise to various exotic effects that have attracted tremendous research interest. The configurations of the FAs are usually thought to be determined fully by the band topology of the bulk states, which seems impo…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 9 pages, 5 figures

    Journal ref: Chinese Phys. Lett. 40, 097301 (2023)

  23. Competing ferromagnetic superconducting states in europium-based iron pnictides

    Authors: Huai-Xiang Huang, Yu-Qian Cao, Xin Wan

    Abstract: In europium-based iron pnictides superconducting Fe-planes can be influenced by a Zeeman field originated from the neighboring Eu-planes. The field tends to induce spin-density waves with a ferromagnetic average which coexists with the superconducting order by forming complementary patterns of the superconducting and magnetic order parameters in a Fulde-Ferrell-Larkin-Ovchinnikov phase and a two-d…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 7 pages, 11 figures

    Journal ref: Phys. Rev. B 106, 144503 (2022)

  24. arXiv:2210.10199  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    Bayesian Optimization over Discrete and Mixed Spaces via Probabilistic Reparameterization

    Authors: Samuel Daulton, Xingchen Wan, David Eriksson, Maximilian Balandat, Michael A. Osborne, Eytan Bakshy

    Abstract: Optimizing expensive-to-evaluate black-box functions of discrete (and potentially continuous) design parameters is a ubiquitous problem in scientific and engineering applications. Bayesian optimization (BO) is a popular, sample-efficient method that leverages a probabilistic surrogate model and an acquisition function (AF) to select promising designs to evaluate. However, maximizing the AF over mi…

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: To appear in Advances in Neural Information Processing Systems 35, 2022. Code available at: https://github.com/facebookresearch/bo_pr

  25. arXiv:2210.08859  [pdf, other]

    cs.CL cs.AI

    Social Biases in Automatic Evaluation Metrics for NLG

    Authors: Mingqi Gao, Xiaojun Wan

    Abstract: Many studies have revealed that word embeddings, language models, and models for specific downstream tasks in NLP are prone to social biases, especially gender bias. Recently these techniques have been gradually applied to automatic evaluation metrics for text generation. In the paper, we propose an evaluation method based on Word Embeddings Association Test (WEAT) and Sentence Embeddings Associat…

    Submitted 17 October, 2022; originally announced October 2022.

  26. arXiv:2210.08303  [pdf, other]

    cs.CV cs.AI cs.CL

    Improving Radiology Summarization with Radiograph and Anatomy Prompts

    Authors: Jinpeng Hu, Zhihong Chen, Yang Liu, Xiang Wan, Tsung-Hui Chang

    Abstract: The impression is crucial for the referring physicians to grasp key information since it is concluded from the findings and reasoning of radiologists. To alleviate the workload of radiologists and reduce repetitive human labor in impression writing, many researchers have focused on automatic impression generation. However, recent works on this task mainly summarize the corresponding findings and p…

    Submitted 27 December, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: 11 pages, ACL2023 Findings

  27. arXiv:2210.02954  [pdf, other]

    cond-mat.mtrl-sci cond-mat.supr-con

    All hourglass bosonic excitations in the 1651 magnetic space groups and 528 magnetic layer groups

    Authors: Dongze Fan, Xiangang Wan, Feng Tang

    Abstract: The band connectivity as imposed by the compatibility relations between the irreducible representations of little groups can give rise to the exotic hourglass-like shape composed of four branches of bands and five band crossings (BCs). Such an hourglass band connectivity could enforce the emergence of nontrivial excitations like Weyl fermion, Dirac fermion or even beyond them. On the other hand, t…

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Supplementary Material can be found in the ancillary file

  28. arXiv:2209.13761  [pdf, other]

    eess.IV cs.CV cs.MM

    Image Compressed Sensing with Multi-scale Dilated Convolutional Neural Network

    Authors: Zhifeng Wang, Zhenghui Wang, Chunyan Zeng, Yan Yu, Xiangkui Wan

    Abstract: Deep Learning (DL) based Compressed Sensing (CS) has been applied for better performance of image reconstruction than traditional CS methods. However, most existing DL methods utilize the block-by-block measurement and each measurement block is restored separately, which introduces harmful blocking effects for reconstruction. Furthermore, the neuronal receptive fields of those methods are designed…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: 28 pages, 8 figures, MsDCNN for CS

  29. arXiv:2209.11906  [pdf, other]

    cs.SD cs.AI cs.CL cs.LG eess.AS

    Joint Speech Activity and Overlap Detection with Multi-Exit Architecture

    Authors: Ziqing Du, Kai Liu, Xucheng Wan, Huan Zhou

    Abstract: Overlapped speech detection (OSD) is critical for speech applications in multi-party conversation scenarios. Despite numerous research efforts and much progress, compared with voice activity detection (VAD), OSD remains an open challenge and its overall performance is far from satisfactory. The majority of prior research typically formulates the OSD problem as a standard classification problem, to…

    Submitted 23 September, 2022; originally announced September 2022.

  30. arXiv:2209.11905  [pdf, other]

    cs.SD cs.AI cs.CL cs.LG eess.AS

    Speech Enhancement with Perceptually-motivated Optimization and Dual Transformations

    Authors: Xucheng Wan, Kai Liu, Ziqing Du, Huan Zhou

    Abstract: To address the monaural speech enhancement problem, numerous research studies have been conducted to enhance speech via operations either in time-domain on the inner-domain learned from the speech mixture or in time-frequency domain on the fixed full-band short time Fourier transform (STFT) spectrograms. Very recently, a few studies on sub-band based speech enhancement have been proposed. By enha…

    Submitted 23 September, 2022; originally announced September 2022.

  31. arXiv:2209.09694  [pdf]

    cond-mat.mtrl-sci

    Modulating Thermal Conductivity via Targeted Phonon Excitation

    Authors: Xiao Wan, Dongkai Pan, Jing-Tao Lü, Sebastian Volz, Lifa Zhang, Qing Hao, Yangjun Qin, Zhicheng Zong, Nuo Yang

    Abstract: Thermal conductivity is a critical material property in numerous applications, such as those related to thermoelectric devices and heat dissipation. Effectively modulating thermal conductivity has become a great concern in the field of heat conduction. In this study, a quantum strategy is proposed to modulate thermal conductivity by exciting targeted phonons. The results show that the thermal cond…

    Submitted 5 April, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

  32. View-Disentangled Transformer for Brain Lesion Detection

    Authors: Haofeng Li, Junjia Huang, Guanbin Li, Zhou Liu, Yihong Zhong, Yingying Chen, Yunfei Wang, Xiang Wan

    Abstract: Deep neural networks (DNNs) have been widely adopted in brain lesion detection and segmentation. However, locating small lesions in 2D MRI slices is challenging, and requires balancing the granularity of 3D context aggregation against the computational complexity. In this paper, we propose a novel view-disentangled transformer to enhance the extraction of MRI features for more accurate tumour…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: International Symposium on Biomedical Imaging (ISBI) 2022, code: https://github.com/lhaof/ISBI-VDFormer

  33. arXiv:2209.08887  [pdf, other]

    cs.CV

    Attentive Symmetric Autoencoder for Brain MRI Segmentation

    Authors: Junjia Huang, Haofeng Li, Guanbin Li, Xiang Wan

    Abstract: Self-supervised learning methods based on image patch reconstruction have witnessed great success in training auto-encoders, whose pre-trained weights can be transferred to fine-tune other downstream tasks of image understanding. However, existing methods seldom study the varying importance of reconstructed patches and the symmetry of anatomical structures, when they are applied to 3D medical imag…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: MICCAI 2022, code: https://github.com/lhaof/ASA

  34. arXiv:2209.07759  [pdf, other]

    cs.CL

    An Empirical Study of Automatic Post-Editing

    Authors: Xu Zhang, Xiaojun Wan

    Abstract: Automatic post-editing (APE) aims to reduce manual post-editing efforts by automatically correcting errors in machine-translated output. Due to the limited amount of human-annotated training data, data scarcity is one of the main challenges faced by all APE systems. To alleviate the lack of genuine training data, most of the current APE systems employ data augmentation methods to generate large-sc…

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: 14 pages, 4 figures

  35. arXiv:2209.07280  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Observation of robust zero-energy state and enhanced superconducting gap in a tri-layer heterostructure of MnTe/Bi2Te3/Fe(Te, Se)

    Authors: Shuyue Ding, Chen Chen, Zhipeng Cao, Di Wang, Yongqiang Pan, Ran Tao, Dongming Zhao, Yining Hu, Tianxing Jiang, Yajun Yan, Zhixiang Shi, Xiangang Wan, Donglai Feng, Tong Zhang

    Abstract: The interface between magnetic materials and superconductors has long been predicted to host unconventional superconductivity, such as spin-triplet pairing and topologically nontrivial pairing states, particularly when spin-orbital coupling (SOC) is incorporated. To identify these novel pairing states, fabricating homogeneous heterostructures that combine these various properties is preferred, but oft…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 33 pages, supplementary materials included

    Journal ref: Sci. Adv. 8, eabq4578 (2022)

  36. arXiv:2209.07118  [pdf, other]

    cs.CL cs.CV

    Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Knowledge

    Authors: Zhihong Chen, Guanbin Li, Xiang Wan

    Abstract: Medical vision-and-language pre-training (Med-VLP) has received considerable attention owing to its applicability to extracting generic vision-and-language representations from medical images and texts. Most existing methods mainly contain three elements: uni-modal encoders (i.e., a vision encoder and a language encoder), a multi-modal fusion module, and pretext tasks, with few studies considering…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Natural Language Processing. 10 pages, 3 figures

  37. arXiv:2209.07098  [pdf, other]

    cs.CV cs.CL

    Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training

    Authors: Zhihong Chen, Yuhao Du, Jinpeng Hu, Yang Liu, Guanbin Li, Xiang Wan, Tsung-Hui Chang

    Abstract: Medical vision-and-language pre-training provides a feasible solution to extract effective vision-and-language representations from medical images and texts. However, few studies have been dedicated to this field to facilitate medical vision-and-language understanding. In this paper, we propose a self-supervised learning paradigm with multi-modal masked autoencoders (M$^3$AE), which learn cross-mo…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Natural Language Processing. 11 pages, 3 figures

  38. arXiv:2209.06209  [pdf, other]

    cs.CV cs.IR cs.MM

    Look Before You Leap: Improving Text-based Person Retrieval by Learning A Consistent Cross-modal Common Manifold

    Authors: Zijie Wang, Aichun Zhu, Jingyi Xue, Xili Wan, Chao Liu, Tian Wang, Yifeng Li

    Abstract: The core problem of text-based person retrieval is how to bridge the heterogeneous gap between multi-modal data. Many previous approaches contrive to learn a latent common manifold mapping paradigm following a cross-modal distribution consensus prediction (CDCP) manner. When mapping features from distribution of one certain modality into the common manifold, feature distribution of the…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted on ACM MM '22. arXiv admin note: text overlap with arXiv:2209.05773

  39. arXiv:2209.05773  [pdf, other]

    cs.CV cs.IR cs.MM

    CAIBC: Capturing All-round Information Beyond Color for Text-based Person Retrieval

    Authors: Zijie Wang, Aichun Zhu, Jingyi Xue, Xili Wan, Chao Liu, Tian Wang, Yifeng Li

    Abstract: Given a natural language description, text-based person retrieval aims to identify images of a target person from a large-scale person image database. Existing methods generally face a color over-reliance problem, which means that the models rely heavily on color information when matching cross-modal data. Indeed, color information is an important decision-making accordance for retrieval,…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted on ACM MM '22

  40. arXiv:2209.01370  [pdf, other]

    cs.CL

    CrossDial: An Entertaining Dialogue Dataset of Chinese Crosstalk

    Authors: Baizhou Huang, Shikang Du, Xiaojun Wan

    Abstract: Crosstalk is a traditional Chinese theatrical performance art. It is commonly performed by two performers in the form of a dialogue. With the typical features of dialogues, crosstalks are also designed to be hilarious for the purpose of amusing the audience. In this study, we introduce CrossDial, the first open-source dataset containing most classic Chinese crosstalks crawled from the Web. Moreove…

    Submitted 3 September, 2022; originally announced September 2022.

  41. arXiv:2208.12753  [pdf, other]

    cs.SD cs.AI eess.AS

    Spatio-Temporal Representation Learning Enhanced Source Cell-phone Recognition from Speech Recordings

    Authors: Chunyan Zeng, Shixiong Feng, Zhifeng Wang, Xiangkui Wan, Yunfan Chen, Nan Zhao

    Abstract: The existing source cell-phone recognition method lacks the long-term feature characterization of the source device, resulting in inaccurate representation of the source cell-phone related features which leads to insufficient recognition accuracy. In this paper, we propose a source cell-phone recognition method based on spatio-temporal representation learning, which includes two main parts: extrac…

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: 29 pages, 4 figures

  42. arXiv:2208.11920  [pdf]

    cs.SD eess.AS

    Digital Audio Tampering Detection Based on ENF Spatio-temporal Features Representation Learning

    Authors: Chunyan Zeng, Shuai Kong, Zhifeng Wang, Xiangkui Wan, Yunfan Chen

    Abstract: Most digital audio tampering detection methods based on electrical network frequency (ENF) only utilize the static spatial information of ENF, ignoring the variation of ENF in time series, which limits the ability of ENF feature representation and reduces the accuracy of tampering detection. This paper proposes a new method for digital audio tampering detection based on ENF spatio-temporal features…

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: 19 pages, 6 figures

  43. arXiv:2208.06964  [pdf, ps, other]

    math.DG math.GT math.MG

    Curvature of the total space of a Griffiths negative vector bundle and quasi-Fuchsian space

    Authors: Inkang Kim, Xueyuan Wan, Genkai Zhang

    Abstract: For a holomorphic vector bundle $E$ over a Hermitian manifold $M$ there are two important notions of curvature positivity, the Griffiths positivity and Nakano positivity. We study the consequence of these positivities and the relevant estimates. If $E$ is Griffiths negative over Kähler manifold, then there is a Kähler metric on its total space $E$, and we calculate the curvature and prove the non-…

    Submitted 14 August, 2022; originally announced August 2022.

    Comments: 27 pages, extension of the previous paper arXiv:1902.04523 New Kähler metric on quasifuchsian space and its curvature properties

  44. arXiv:2208.01433  [pdf]

    econ.GN

    Review of Energy Transition Policies in Singapore, London, and California

    Authors: Chunmeng Yang, Siqi Bu, Yi Fan, Wayne Xinwei Wan, Ruoheng Wang, Aoife Foley

    Abstract: The paper contains the online supplementary materials for "Data-Driven Prediction and Evaluation on Future Impact of Energy Transition Policies in Smart Regions". We review the renewable energy development and policies in the three metropolitan cities/regions over recent decades. Depending on the geographic variations in the types and quantities of renewable energy resources and the levels of poli…

    Submitted 2 August, 2022; originally announced August 2022.

  45. Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration

    Authors: Haotian Bai, Ruimao Zhang, Jiong Wang, Xiang Wan

    Abstract: Weakly Supervised Object Localization (WSOL), which aims to localize objects by only using image-level labels, has attracted much attention because of its low annotation cost in real applications. Recent studies leverage the advantage of self-attention in visual Transformer for long-range dependency to re-active semantic regions, aiming to avoid partial activation in traditional class activation m…

    Submitted 10 March, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV2022

  46. arXiv:2207.09405  [pdf, other]

    cs.LG cs.AI

    Bayesian Generational Population-Based Training

    Authors: Xingchen Wan, Cong Lu, Jack Parker-Holder, Philip J. Ball, Vu Nguyen, Binxin Ru, Michael A. Osborne

    Abstract: Reinforcement learning (RL) offers the potential for training generally capable agents that can interact autonomously in the real world. However, one key limitation is the brittleness of RL algorithms to core hyperparameters and network architecture choice. Furthermore, non-stationarities such as evolving training data and increased agent complexity mean that different hyperparameters and architec…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: AutoML Conference 2022. 10 pages, 4 figures, 3 tables (28 pages, 10 figures, 7 tables including references and appendices)

  47. arXiv:2207.06141  [pdf, other]

    math.DG math-ph

    The mass of an asymptotically hyperbolic end and distance estimates

    Authors: Xiaoxiang Chai, Xueyuan Wan

    Abstract: Let $(M,g)$ be a complete connected $n$-dimensional Riemannian spin manifold without boundary such that the scalar curvature satisfies $R_g\geq -n(n-1)$ and $\mathcal{E}\subset M$ be an asymptotically hyperbolic end, we prove that the mass functional of the end $\mathcal{E}$ is timelike future-directed or zero. Moreover, it vanishes if and only if $(M,g)$ is isometric to the hyperbolic space. We a…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: 24 pages, 3 figures

  48. arXiv:2206.13778  [pdf, other]

    cs.CL

    CC-Riddle: A Question Answering Dataset of Chinese Character Riddles

    Authors: Fan Xu, Yunxiang Zhang, Xiaojun Wan

    Abstract: The Chinese character riddle is a unique form of cultural entertainment specific to the Chinese language. It typically comprises two parts: the riddle description and the solution. The solution to the riddle is a single character, while the riddle description primarily describes the glyph of the solution, occasionally supplemented with its explanation and pronunciation. Solving Chinese character r…

    Submitted 24 September, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    ACM Class: I.2.7

  49. arXiv:2206.08023  [pdf, other]

    eess.IV cs.CV cs.LG

    AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation

    Authors: Yuanfeng Ji, Haotian Bai, Jie Yang, Chongjian Ge, Ye Zhu, Ruimao Zhang, Zhen Li, Lingyan Zhang, Wanling Ma, Xiang Wan, Ping Luo

    Abstract: Despite the considerable progress in automatic abdominal multi-organ segmentation from CT/MRI scans in recent years, a comprehensive evaluation of the models' capabilities is hampered by the lack of a large-scale benchmark from diverse clinical scenarios. Constraint by the high cost of collecting and labeling 3D medical data, most of the deep learning models to date are driven by datasets with a l…

    Submitted 1 September, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

  50. arXiv:2206.02624  [pdf, other]

    math.DG

    Band width estimates of CMC initial data sets

    Authors: Xiaoxiang Chai, Xueyuan Wan

    Abstract: We generalize a band width estimate of Gromov to CMC initial data sets. We give three independent proofs: via the stability of a hypersurface with prescribed null expansion, via a perturbation of the spacetime harmonic function and via the Dirac operator.

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 20 pages, 1 figure