Skip to main content

Showing 1–27 of 27 results for author: Feng, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.06714  [pdf, other

    cs.CL cs.SD eess.AS

    Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness

    Authors: Xincan Feng, Akifumi Yoshimoto

    Abstract: Recent advancements in Natural Language Processing (NLP) have seen Large-scale Language Models (LLMs) excel at producing high-quality text for various purposes. Notably, in Text-To-Speech (TTS) systems, the integration of BERT for semantic token generation has underscored the importance of semantic content in producing coherent speech outputs. Despite this, the specific utility of LLMs in enhancin… ▽ More

    Submitted 17 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: 9 pages, 2 figures, 4 tables; accepted at LREC-COLING 2024

  2. arXiv:2402.14099  [pdf, other

    eess.IV cs.CV physics.med-ph

    EXACT-Net:EHR-guided lung tumor auto-segmentation for non-small cell lung cancer radiotherapy

    Authors: Hamed Hooshangnejad, Xue Feng, Gaofeng Huang, Rui Zhang, Quan Chen, Kai Ding

    Abstract: Lung cancer is a devastating disease with the highest mortality rate among cancer types. Over 60% of non-small cell lung cancer (NSCLC) patients, which accounts for 87% of diagnoses, require radiation therapy. Rapid treatment initiation significantly increases the patient's survival rate and reduces the mortality rate. Accurate tumor segmentation is a critical step in the diagnosis and treatment o… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  3. arXiv:2308.13993  [pdf

    eess.SY

    Comprehensive performance comparison among different types of features in data-driven battery state of health estimation

    Authors: Xinhong Feng, Yongzhi Zhang, Rui Xiong, Chun Wang

    Abstract: Battery state of health (SOH), which informs the maximal available capacity of the battery, is a key indicator of battery aging failure. Accurately estimating battery SOH is a vital function of the battery management system that remains to be addressed. In this study, a physics-informed Gaussian process regression (GPR) model is developed for battery SOH estimation, with the performance being syst… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

  4. arXiv:2302.10193  [pdf

    physics.med-ph eess.IV

    Non-line-of-sight photoacoustic imaging

    Authors: Yuting Shen, Xiaohua Feng, Fei Gao

    Abstract: Photoacoustic imaging is a promising imaging technique for human brain due to its high sensitivity and functional imaging ability. However, the skull would cause strong attenuation and distortion to the photoacoustic signals, which makes non-invasive transcranial imaging difficult. In this work, the temporal bone is selected as an imaging window to minimize the influence of the skull. Moreover, no… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

  5. arXiv:2301.06321  [pdf

    eess.IV physics.optics

    Deep-learning-based on-chip rapid spectral imaging with high spatial resolution

    Authors: Jiawei Yang, Kaiyu Cui, Yidong Huang, Wei Zhang, Xue Feng, Fang Liu

    Abstract: Spectral imaging extends the concept of traditional color cameras to capture images across multiple spectral channels and has broad application prospects. Conventional spectral cameras based on scanning methods suffer from low acquisition speed and large volume. On-chip computational spectral imaging based on metasurface filters provides a promising scheme for portable applications, but endures lo… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

  6. Efficient Visual Computing with Camera RAW Snapshots

    Authors: Zhihao Li, Ming Lu, Xu Zhang, Xin Feng, M. Salman Asif, Zhan Ma

    Abstract: Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP). The images can then be used for photography or visual computing tasks in a variety of applications, such as public safety surveillance and autonomous driving. One can argue that since RAW images contain all the captured information, the conversion of RAW to RGB using an ISP… ▽ More

    Submitted 25 January, 2024; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted by T-PAMI 2024. Homepage: https://njuvision.github.io/rho-vision

  7. arXiv:2207.12941  [pdf, other

    cs.CV eess.IV

    Learning Generalizable Latent Representations for Novel Degradations in Super Resolution

    Authors: Fengjun Li, Xin Feng, Fanglin Chen, Guangming Lu, Wenjie Pei

    Abstract: Typical methods for blind image super-resolution (SR) focus on dealing with unknown degradations by directly estimating them or learning the degradation representations in a latent space. A potential limitation of these methods is that they assume the unknown degradations can be simulated by the integration of various handcrafted degradations (e.g., bicubic downsampling), which is not necessarily… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  8. arXiv:2206.02705  [pdf

    eess.SP cs.AI

    Human Behavior Recognition Method Based on CEEMD-ES Radar Selection

    Authors: Zhaolin Zhang, Mingqi Song, Wugang Meng, Yuhan Liu, Fengcong Li, Xiang Feng, Yinan Zhao

    Abstract: In recent years, the millimeter-wave radar to identify human behavior has been widely used in medical,security, and other fields. When multiple radars are performing detection tasks, the validity of the features contained in each radar is difficult to guarantee. In addition, processing multiple radar data also requires a lot of time and computational cost. The Complementary Ensemble Empirical Mode… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 4 pages, 5 figures

  9. arXiv:2205.09933  [pdf, other

    cs.CV eess.IV

    Hyperspectral Unmixing Based on Nonnegative Matrix Factorization: A Comprehensive Review

    Authors: Xin-Ru Feng, Heng-Chao Li, Rui Wang, Qian Du, ** Jia, Antonio Plaza

    Abstract: Hyperspectral unmixing has been an important technique that estimates a set of endmembers and their corresponding abundances from a hyperspectral image (HSI). Nonnegative matrix factorization (NMF) plays an increasingly significant role in solving this problem. In this article, we present a comprehensive survey of the NMF-based methods proposed for hyperspectral unmixing. Taking the NMF model as a… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  10. arXiv:2203.04767  [pdf, other

    eess.AS cs.SD

    A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling

    Authors: Yike Zhang, Xiaobing Feng, Yi Liu, Songjun Cao, Long Ma

    Abstract: Automatic speech recognition (ASR) systems used on smart phones or vehicles are usually required to process speech queries from very different domains. In such situations, a vanilla ASR system usually fails to perform well on every domain. This paper proposes a multi-domain ASR framework for Tencent Map, a navigation app used on smart phones and in-vehicle infotainment systems. The proposed framew… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 7 pages, 1 figure

  11. arXiv:2112.10074  [pdf, other

    eess.IV cs.CV cs.LG

    QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking Results

    Authors: Raghav Mehta, Angelos Filos, Ujjwal Baid, Chiharu Sako, Richard McKinley, Michael Rebsamen, Katrin Datwyler, Raphael Meier, Piotr Radojewski, Gowtham Krishnan Murugesan, Sahil Nalawade, Chandan Ganesh, Ben Wagner, Fang F. Yu, Baowei Fei, Ananth J. Madhuranthakam, Joseph A. Maldjian, Laura Daza, Catalina Gomez, Pablo Arbelaez, Chengliang Dai, Shuo Wang, Hadrien Reynaud, Yuan-han Mo, Elsa Angelini , et al. (67 additional authors not shown)

    Abstract: Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying… ▽ More

    Submitted 23 August, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA): https://www.melba-journal.org/papers/2022:026.html

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

  12. arXiv:2111.10633  [pdf, other

    cs.CV eess.IV

    Sparse Tensor-based Multiscale Representation for Point Cloud Geometry Compression

    Authors: Jianqiang Wang, Dandan Ding, Zhu Li, Xiaoxing Feng, Chuntong Cao, Zhan Ma

    Abstract: This study develops a unified Point Cloud Geometry (PCG) compression method through the processing of multiscale sparse tensor-based voxelized PCG. We call this compression method SparsePCGC. The proposed SparsePCGC is a low complexity solution because it only performs the convolutions on sparsely-distributed Most-Probable Positively-Occupied Voxels (MP-POV). The multiscale representation also all… ▽ More

    Submitted 21 October, 2022; v1 submitted 20 November, 2021; originally announced November 2021.

    Comments: 17 pages, 15 figures

  13. arXiv:2108.04016  [pdf, other

    eess.IV cs.CV

    Deep Learning methods for automatic evaluation of delayed enhancement-MRI. The results of the EMIDEC challenge

    Authors: Alain Lalande, Zhihao Chen, Thibaut Pommier, Thomas Decourselle, Abdul Qayyum, Michel Salomon, Dominique Ginhac, Youssef Skandarani, Arnaud Boucher, Khawla Brahim, Marleen de Bruijne, Robin Camarasa, Teresa M. Correia, Xue Feng, Kibrom B. Girum, Anja Hennemuth, Markus Huellebrand, Raabid Hussain, Matthias Ivantsits, Jun Ma, Craig Meyer, Rishabh Sharma, Jixi Shi, Nikolaos V. Tsekos, Marta Varela , et al. (8 additional authors not shown)

    Abstract: A key factor for assessing the state of the heart after myocardial infarction (MI) is to measure whether the myocardium segment is viable after reperfusion or revascularization therapy. Delayed enhancement-MRI or DE-MRI, which is performed several minutes after injection of the contrast agent, provides high contrast between viable and nonviable myocardium and is therefore a method of choice to eva… ▽ More

    Submitted 10 August, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: Submitted to Medical Image Analysis

  14. arXiv:2107.03165  [pdf, other

    eess.AS cs.SD

    Improving Speech Recognition Accuracy of Local POI Using Geographical Models

    Authors: Songjun Cao, Yike Zhang, Xiaobing Feng, Long Ma

    Abstract: Nowadays voice search for points of interest (POI) is becoming increasingly popular. However, speech recognition for local POI has remained to be a challenge due to multi-dialect and massive POI. This paper improves speech recognition accuracy for local POI from two aspects. Firstly, a geographic acoustic model (Geo-AM) is proposed. The Geo-AM deals with multi-dialect problem using dialect-specifi… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted by SLT 2021

  15. arXiv:2104.02474  [pdf

    physics.optics eess.IV

    All-Optical Image Identification with Programmable Matrix Transformation

    Authors: Shikang Li, Baohua Ni, Xue Feng, Kaiyu Cui, Fang Liu, Wei Zhang, Yidong Huang

    Abstract: An optical neural network is proposed and demonstrated with programmable matrix transformation and nonlinear activation function of photodetection (square-law detection). Based on discrete phase-coherent spatial modes, the dimensionality of programmable optical matrix operations is 30~37, which is implemented by spatial light modulators. With this architecture, all-optical classification tasks of… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

    Journal ref: optics express 2021

  16. arXiv:2010.01291  [pdf, other

    cs.CV eess.IV

    Unsupervised Shadow Removal Using Target Consistency Generative Adversarial Network

    Authors: Chao Tan, Xin Feng

    Abstract: Unsupervised shadow removal aims to learn a non-linear function to map the original image from shadow domain to non-shadow domain in the absence of paired shadow and non-shadow data. In this paper, we develop a simple yet efficient target-consistency generative adversarial network (TC-GAN) for the shadow removal task in the unsupervised manner. Compared with the bidirectional map** in cycle-cons… ▽ More

    Submitted 30 May, 2021; v1 submitted 3 October, 2020; originally announced October 2020.

  17. arXiv:2009.00454  [pdf

    cs.NI eess.SP

    A Digital Twin for Reconfigurable Intelligent Surface Assisted Wireless Communication

    Authors: Baoling Sheen, ** Yang, Xianglong Feng, Md Moin Uddin Chowdhury

    Abstract: Reconfigurable Intelligent Surface (RIS) has emerged as one of the key technologies for 6G in recent years, which comprise a large number of low-cost passive elements that can smartly interact with the im**ing electromagnetic waves for performance enhancement. However, optimally configuring massive number of RIS elements remains a challenge. In this paper, we present a novel digital-twin framewo… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

  18. arXiv:2008.09352  [pdf, other

    eess.IV cs.CV

    Deep Learning Methods for Lung Cancer Segmentation in Whole-slide Histopathology Images -- the ACDC@LungHP Challenge 2019

    Authors: Zhang Li, Jiehua Zhang, Tao Tan, Xichao Teng, Xiaoliang Sun, Yang Li, Lihong Liu, Yang Xiao, Byungjae Lee, Yilong Li, Qianni Zhang, Shujiao Sun, Yushan Zheng, Junyu Yan, Ni Li, Yiyu Hong, Junsu Ko, Hyun Jung, Yanling Liu, Yu-cheng Chen, Ching-wei Wang, Vladimir Yurovskiy, Pavel Maevskikh, Vahid Khanagha, Yi Jiang , et al. (8 additional authors not shown)

    Abstract: Accurate segmentation of lung cancer in pathology slides is a critical step in improving patient care. We proposed the ACDC@LungHP (Automatic Cancer Detection and Classification in Whole-slide Lung Histopathology) challenge for evaluating different computer-aided diagnosis (CADs) methods on the automatic diagnosis of lung cancer. The ACDC@LungHP 2019 focused on segmentation (pixel-wise detection)… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

  19. arXiv:2007.02096  [pdf

    eess.IV cs.CV cs.LG

    Multi-Site Infant Brain Segmentation Algorithms: The iSeg-2019 Challenge

    Authors: Yue Sun, Kun Gao, Zhengwang Wu, Zhihao Lei, Ying Wei, Jun Ma, ** Yang, Xue Feng, Li Zhao, Trung Le Phan, Jitae Shin, Tao Zhong, Yu Zhang, Lequan Yu, Caizi Li, Ramesh Basnet, M. Omair Ahmad, M. N. S. Swamy, Wenao Ma, Qi Dou, Toan Duc Bui, Camilo Bermudez Noguera, Bennett Landman, Ian H. Gotlib, Kathryn L. Humphreys , et al. (8 additional authors not shown)

    Abstract: To better understand early brain growth patterns in health and disorder, it is critical to accurately segment infant brain magnetic resonance (MR) images into white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF). Deep learning-based methods have achieved state-of-the-art performance; however, one of major limitations is that the learning-based methods may suffer from the multi-site i… ▽ More

    Submitted 11 July, 2020; v1 submitted 4 July, 2020; originally announced July 2020.

    Journal ref: IEEE Transactions on Medical Imaging, 40(5), 1363-1376, 2021

  20. arXiv:2005.02689  [pdf

    physics.optics eess.IV physics.app-ph

    Dynamic brain spectrum acquired by a real-time ultra-spectral imaging chip with reconfigurable metasurfaces

    Authors: Jian Xiong, Xusheng Cai, Kaiyu Cui, Yidong Huang, Jiawei Yang, Hongbo Zhu, Wenzheng Li, Bo Hong, Shijie Rao, Zekun Zheng, Sheng Xu, Yuhan He, Fang Liu, Xue Feng, Wei Zhang

    Abstract: Spectral imaging paves way for various fields and particular in biomedical research. However, spectral imaging mainly depending on spatial or temporal scanning, cannot achieve high temporal, spatial and spectral resolution simultaneously. In this study, we demonstrated a silicon real-time ultra-spectral imaging chip based on reconfigurable metasurfaces, comprising of 155,216 (356$\times$436) image… ▽ More

    Submitted 28 March, 2023; v1 submitted 6 May, 2020; originally announced May 2020.

    Journal ref: Optica 9, 461-468 (2022)

  21. arXiv:2001.05551  [pdf, other

    q-bio.QM cs.CV eess.IV

    Substituting Gadolinium in Brain MRI Using DeepContrast

    Authors: Haoran Sun, Xueqing Liu, Xinyang Feng, Chen Liu, Nanyan Zhu, Sabrina J. Gjerswold-Selleck, Hong-Jian Wei, Pavan S. Upadhyayula, Angeliki Mela, Cheng-Chia Wu, Peter D. Canoll, Andrew F. Laine, J. Thomas Vaughan, Scott A. Small, Jia Guo

    Abstract: Cerebral blood volume (CBV) is a hemodynamic correlate of oxygen metabolism and reflects brain activity and function. High-resolution CBV maps can be generated using the steady-state gadolinium-enhanced MRI technique. Such a technique requires an intravenous injection of exogenous gadolinium based contrast agent (GBCA) and recent studies suggest that the GBCA can accumulate in the brain after freq… ▽ More

    Submitted 15 January, 2020; originally announced January 2020.

    Journal ref: The IEEE International Symposium on Biomedical Imaging (ISBI) 2020

  22. arXiv:1912.08011  [pdf

    cs.CL cs.LG eess.AS

    Application of Word2vec in Phoneme Recognition

    Authors: Xin Feng, Lei Wang

    Abstract: In this paper, we present how to hybridize a Word2vec model and an attention-based end-to-end speech recognition model. We build a phoneme recognition system based on Listen, Attend and Spell model. And the phoneme recognition model uses a word2vec model to initialize the embedding matrix for the improvement of the performance, which can increase the distance among the phoneme vectors. At the same… ▽ More

    Submitted 19 December, 2019; v1 submitted 17 December, 2019; originally announced December 2019.

  23. arXiv:1911.09982  [pdf

    eess.IV cs.CV cs.LG

    HybridNetSeg: A Compact Hybrid Network for Retinal Vessel Segmentation

    Authors: Ling Luo, Dingyu Xue, Xinglong Feng

    Abstract: A large number of retinal vessel analysis methods based on image segmentation have emerged in recent years. However, existing methods depend on cumbersome backbones, such as VGG16 and ResNet-50, benefiting from their powerful feature extraction capabilities but suffering from high computational costs. In this paper, we propose a novel neural network (HybridNetSeg) dedicated to solving this drawbac… ▽ More

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: 16 pages, 3 figures

  24. arXiv:1910.08375  [pdf, other

    eess.IV cs.CV

    Detecting intracranial aneurysm rupture from 3D surfaces using a novel GraphNet approach

    Authors: Z. Ma, L. Song, X. Feng, G. Yang, W. Zhu, J. Liu, Y. Zhang, X. Yang, Y. Yin

    Abstract: Intracranial aneurysm (IA) is a life-threatening blood spot in human's brain if it ruptures and causes cerebral hemorrhage. It is challenging to detect whether an IA has ruptured from medical images. In this paper, we propose a novel graph based neural network named GraphNet to detect IA rupture from 3D surface data. GraphNet is based on graph convolution network (GCN) and is designed for graph-le… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: Submitted to ISBI 2020

  25. arXiv:1910.04919  [pdf

    cs.CV cs.LG eess.IV

    From Species to Cultivar: Soybean Cultivar Recognition using Multiscale Sliding Chord Matching of Leaf Images

    Authors: Bin Wang, Yongsheng Gao, Xiaohan Yu, Xiaohui Yuan, Shengwu Xiong, Xianzhong Feng

    Abstract: Leaf image recognition techniques have been actively researched for plant species identification. However it remains unclear whether leaf patterns can provide sufficient information for cultivar recognition. This paper reports the first attempt on soybean cultivar recognition from plant leaves which is not only a challenging research problem but also important for soybean cultivar evaluation, sele… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: 33 pages, 8 figures

  26. arXiv:1907.00943  [pdf, other

    cs.CV eess.IV q-bio.QM

    Estimating brain age based on a healthy population with deep learning and structural MRI

    Authors: Xinyang Feng, Zachary C. Lipton, Jie Yang, Scott A. Small, Frank A. Provenzano

    Abstract: Numerous studies have established that estimated brain age, as derived from statistical models trained on healthy populations, constitutes a valuable biomarker that is predictive of cognitive decline and various neurological diseases. In this work, we curate a large-scale heterogeneous dataset (N = 10,158, age range 18 - 97) of structural brain MRIs in a healthy population from multiple publicly-a… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 32 pages, 9 figures, 6 tables

  27. Subspace Stabilization Analysis for Non-Markovian Open Quantum Systems

    Authors: Shikun Zhang, Kun Liu, Daoyi Dong, Xiaoxue Feng, Feng Pan

    Abstract: Studied in this article is non-Markovian open quantum systems parametrized by Hamiltonian H, coupling operator L, and memory kernel function γ, which is a proper candidate for describing the dynamics of various solid-state quantum information processing devices. We look into the subspace stabilization problem of the system from the perspective of dynamical systems and control. The problem translat… ▽ More

    Submitted 28 January, 2019; originally announced January 2019.

    Comments: 7 pages, 1 figure

    Journal ref: Phys. Rev. A 101, 042327 (2020)