Showing 1–50 of 54 results for author: Zeng, X

  1. arXiv:2406.13150  [pdf]

    eess.IV cs.CV

    MCAD: Multi-modal Conditioned Adversarial Diffusion Model for High-Quality PET Image Reconstruction

    Authors: Jiaqi Cui, Xinyi Zeng, Pinxian Zeng, Bo Liu, Xi Wu, Jiliu Zhou, Yan Wang

    Abstract: Radiation hazards associated with standard-dose positron emission tomography (SPET) images remain a concern, whereas the quality of low-dose PET (LPET) images fails to meet clinical requirements. Therefore, there is great interest in reconstructing SPET images from LPET images. However, prior studies focus solely on image data, neglecting vital complementary information from other modalities, e.g.…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Early accepted by MICCAI2024

  2. arXiv:2406.07880  [pdf, other]

    cs.CV eess.IV

    A Comprehensive Survey on Machine Learning Driven Material Defect Detection: Challenges, Solutions, and Future Prospects

    Authors: Jun Bai, Di Wu, Tristan Shelley, Peter Schubel, David Twine, John Russell, Xuesen Zeng, Ji Zhang

    Abstract: Material defects (MD) represent a primary challenge affecting product performance and giving rise to safety issues in related products. The rapid and accurate identification and localization of MD constitute crucial research endeavours in addressing contemporary challenges associated with MD. Although conventional non-destructive testing methods such as ultrasonic and X-ray approaches have mitigat…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.00492  [pdf, other]

    eess.IV cs.CV cs.LG

    SAM-VMNet: Deep Neural Networks For Coronary Angiography Vessel Segmentation

    Authors: Xueying Zeng, Baixiang Huang, Yu Luo, Guangyu Wei, Songyan He, Yushuang Shao

    Abstract: Coronary artery disease (CAD) is one of the most prevalent diseases in the cardiovascular field and one of the major contributors to death worldwide. Computed Tomography Angiography (CTA) images are regarded as the authoritative standard for the diagnosis of coronary artery disease, and by performing vessel segmentation and stenosis detection on CTA images, physicians are able to diagnose coronary…

    Submitted 1 June, 2024; originally announced June 2024.

  4. arXiv:2405.02809  [pdf, other]

    eess.SY

    Does Optimal Control Always Benefit from Better Prediction? An Analysis Framework for Predictive Optimal Control

    Authors: Xiangrui Zeng, Cheng Yin, Zhou** Yin

    Abstract: The "prediction + optimal control" scheme has shown good performance in many applications of automotive, traffic, robot, and building control. In practice, the prediction results are simply considered correct in the optimal control design process. However, in reality, these predictions may never be perfect. Under a conventional stochastic optimal control formulation, it is difficult to answer qu…

    Submitted 5 May, 2024; originally announced May 2024.

  5. arXiv:2404.14862  [pdf, other]

    eess.SP

    Deep Learning Based Multi-Node ISAC 4D Environmental Reconstruction with Uplink-Downlink Cooperation

    Authors: Bohao Lu, Zhiqing Wei, Huici Wu, Xinrui Zeng, Lin Wang, Xi Lu, Dongyang Mei, Zhiyong Feng

    Abstract: Utilizing widely distributed communication nodes to achieve environmental reconstruction is one of the significant scenarios for Integrated Sensing and Communication (ISAC) and a crucial technology for 6G. To achieve this crucial functionality, we propose a deep learning based multi-node ISAC 4D environment reconstruction method with Uplink-Downlink (UL-DL) cooperation, which employs virtual apert…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 13 pages, 21 figures, 4 tables

  6. arXiv:2404.01723  [pdf, other]

    eess.IV cs.CV

    Contextual Embedding Learning to Enhance 2D Networks for Volumetric Image Segmentation

    Authors: Zhuoyuan Wang, Dong Sun, Xiangyun Zeng, Ruodai Wu, Yi Wang

    Abstract: The segmentation of organs in volumetric medical images plays an important role in computer-aided diagnosis and treatment/surgery planning. Conventional 2D convolutional neural networks (CNNs) can hardly exploit the spatial correlation of volumetric data. Current 3D CNNs have the advantage to extract more powerful volumetric representations but they usually suffer from occupying excessive memory a…

    Submitted 17 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 15 pages, 9 figures

  7. arXiv:2401.17681  [pdf, ps, other]

    cs.IT eess.SP

    Joint Transceiver Optimization for MmWave/THz MU-MIMO ISAC Systems

    Authors: Peilan Wang, Jun Fang, Xianlong Zeng, Zhi Chen, Hongbin Li

    Abstract: In this paper, we consider the problem of joint transceiver design for millimeter wave (mmWave)/Terahertz (THz) multi-user MIMO integrated sensing and communication (ISAC) systems. Such a problem is formulated into a nonconvex optimization problem, with the objective of maximizing a weighted sum of communication users' rates and the passive radar's signal-to-clutter-and-noise-ratio (SCNR). By expl…

    Submitted 31 January, 2024; originally announced January 2024.

  8. arXiv:2401.03623  [pdf]

    eess.IV

    A Video Coding Method Based on Neural Network for CLIC2024

    Authors: Zhengang Li, **gchi Zhang, Yonghua Wang, Xing Zeng, Zhen Zhang, Yunlin Long, Menghu Jia, Ning Wang

    Abstract: This paper presents a video coding scheme that combines traditional optimization methods with deep learning methods based on the Enhanced Compression Model (ECM). In this paper, the traditional optimization methods adaptively adjust the quantization parameter (QP). The key frame QP offset is set according to the video content characteristics, and the coding tree unit (CTU) level QP of all frames i…

    Submitted 7 January, 2024; originally announced January 2024.

  9. arXiv:2312.05279  [pdf]

    eess.IV cs.CV

    Quantitative perfusion maps using a novelty spatiotemporal convolutional neural network

    Authors: Anbo Cao, Pin-Yu Le, Zhonghui Qie, Haseeb Hassan, Yingwei Guo, Asim Zaman, Jiaxi Lu, Xueqiang Zeng, Huihui Yang, Xiaoqiang Miao, Taiyu Han, Guangtao Huang, Yan Kang, Yu Luo, Jia Guo

    Abstract: Dynamic susceptibility contrast magnetic resonance imaging (DSC-MRI) is widely used to evaluate acute ischemic stroke to distinguish salvageable tissue and infarct core. For this purpose, traditional methods employ deconvolution techniques, like singular value decomposition, which are known to be vulnerable to noise, potentially distorting the derived perfusion parameters. However, deep learning t…

    Submitted 8 December, 2023; originally announced December 2023.

  10. arXiv:2311.11151  [pdf, ps, other]

    eess.SY cs.LG stat.ML

    On the Hardness of Learning to Stabilize Linear Systems

    Authors: Xiong Zeng, Zexiang Liu, Zhe Du, Necmiye Ozay, Mario Sznaier

    Abstract: Inspired by the work of Tsiamis et al., in this paper we study the statistical hardness of learning to stabilize linear time-invariant systems. Hardness is measured by the number of samples required to achieve a learning task with a given probability. The work of Tsiamis et al. shows that there exist system classes that are hard to learn to stabilize with the…

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 7 pages, 2 figures, accepted by CDC 2023

  11. arXiv:2311.09770  [pdf, other]

    cs.SD eess.AS

    DINO-VITS: Data-Efficient Zero-Shot TTS with Self-Supervised Speaker Verification Loss for Noise Robustness

    Authors: Vikentii Pankov, Valeria Pronina, Alexander Kuzmin, Maksim Borisov, Nikita Usoltsev, Xingshan Zeng, Alexander Golubkov, Nikolai Ermolenko, Aleksandra Shirshova, Yulia Matveeva

    Abstract: We address zero-shot TTS systems' noise-robustness problem by proposing a dual-objective training for the speaker encoder using self-supervised DINO loss. This approach enhances the speaker encoder with the speech synthesis objective, capturing a wider range of speech characteristics beneficial for voice cloning. At the same time, the DINO objective improves speaker representation learning, ensuri…

    Submitted 18 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to Interspeech2024

  12. arXiv:2310.05374  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis

    Authors: Jianqiao Lu, Wenyong Huang, Nianzu Zheng, Xingshan Zeng, Yu Ting Yeung, Xiao Chen

    Abstract: Training a high performance end-to-end speech (E2E) processing model requires an enormous amount of labeled speech data, especially in the era of data-centric artificial intelligence. However, labeled speech data are usually scarcer and more expensive for collection, compared to textual data. We propose Latent Synthesis (LaSyn), an efficient textual data utilization framework for E2E speech proces…

    Submitted 24 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: 15 pages, 8 figures, 8 tables, Accepted to EMNLP 2023 Findings

  13. arXiv:2309.11850  [pdf, ps, other]

    cs.IT eess.SP

    Joint Beamforming for RIS Aided Full-Duplex Integrated Sensing and Uplink Communication

    Authors: Yuan Guo, Yang Liu, Qingqing Wu, Xin Zeng, Qingjiang Shi

    Abstract: This paper studies integrated sensing and communication (ISAC) technology in a full-duplex (FD) uplink communication system. As opposed to the half-duplex system, where sensing is conducted in a first-emit-then-listen manner, FD ISAC system emits and listens simultaneously and hence conducts uninterrupted target sensing. Besides, impressed by the recently emerging reconfigurable intelligent surfac…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2309.02648

  14. arXiv:2308.05365  [pdf]

    eess.IV cs.CV

    TriDo-Former: A Triple-Domain Transformer for Direct PET Reconstruction from Low-Dose Sinograms

    Authors: Jiaqi Cui, Pinxian Zeng, Xinyi Zeng, Peng Wang, Xi Wu, Jiliu Zhou, Yan Wang, Dinggang Shen

    Abstract: To obtain high-quality positron emission tomography (PET) images while minimizing radiation exposure, various methods have been proposed for reconstructing standard-dose PET (SPET) images from low-dose PET (LPET) sinograms directly. However, current methods often neglect boundaries during sinogram-to-image reconstruction, resulting in high-frequency distortion in the frequency domain and diminishe…

    Submitted 10 August, 2023; originally announced August 2023.

  15. arXiv:2307.01665  [pdf]

    eess.SP

    Multicarrier Modulation-Based Digital Radio-over-Fibre System Achieving Unequal Bit Protection with Over 10 dB SNR Gain

    Authors: Yicheng Xu, Yixiao Zhu, Xiaobo Zeng, Mengfan Fu, Hexun Jiang, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: We propose a multicarrier modulation-based digital radio-over-fibre system achieving unequal bit protection by bit and power allocation for subcarriers. A theoretical SNR gain of 16.1 dB is obtained in the AWGN channel and the simulation results show a 13.5 dB gain in the bandwidth-limited case.

    Submitted 4 July, 2023; originally announced July 2023.

  16. arXiv:2305.12111  [pdf, other]

    eess.AS cs.SD

    Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection

    Authors: Xiao-Min Zeng, Yan Song, Zhu Zhuo, Yu Zhou, Yu-Hong Li, Hui Xue, Li-Rong Dai, Ian McLoughlin

    Abstract: In this paper, we propose a joint generative and contrastive representation learning method (GeCo) for anomalous sound detection (ASD). GeCo exploits a Predictive AutoEncoder (PAE) equipped with self-attention as a generative model to perform frame-level prediction. The output of the PAE together with original normal samples, are used for supervised contrastive representative learning in a multi-t…

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted by ICASSP2023

  17. arXiv:2212.08911  [pdf, other]

    cs.CL cs.SD eess.AS

    AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation

    Authors: Xingshan Zeng, Liangyou Li, Qun Liu

    Abstract: To alleviate the data scarcity problem in End-to-end speech translation (ST), pre-training on data for speech recognition and machine translation is considered as an important technique. However, the modality gap between speech and text prevents the ST model from efficiently inheriting knowledge from the pre-trained models. In this work, we propose AdaTranS for end-to-end ST. It adapts the speech…

    Submitted 17 December, 2022; originally announced December 2022.

  18. Towards Better Dermoscopic Image Feature Representation Learning for Melanoma Classification

    Authors: ChengHui Yu, MingKang Tang, ShengGe Yang, MingQing Wang, Zhe Xu, JiangPeng Yan, HanMo Chen, Yu Yang, Xiao-Jun Zeng, Xiu Li

    Abstract: Deep learning-based melanoma classification with dermoscopic images has recently shown great potential in automatic early-stage melanoma diagnosis. However, limited by the significant data imbalance and obvious extraneous artifacts, i.e., the hair and ruler markings, discriminative feature extraction from dermoscopic images is very challenging. In this study, we seek to resolve these problems resp…

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: ICONIP 2021 conference

  19. arXiv:2206.13882  [pdf, other]

    cs.IT eess.SP

    CSI Sensing from Heterogeneous User Feedbacks: A Constrained Phase Retrieval Approach

    Authors: Lei Li, Xing Zeng, Ya-Feng Liu, Yanqing Xu, Tsung-Hui Chang

    Abstract: This paper investigates the downlink channel state information (CSI) sensing in 5G heterogeneous networks composed of user equipments (UEs) with different feedback capabilities. We aim to enhance the CSI accuracy of UEs only affording the low-resolution Type-I codebook. While existing works have demonstrated that the task can be accomplished by solving a phase retrieval (PR) formulation based on t…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  20. arXiv:2204.04956  [pdf, other]

    eess.IV cs.CV

    Segmentation Network with Compound Loss Function for Hydatidiform Mole Hydrops Lesion Recognition

    Authors: Chengze Zhu, **ge Hu, Xianxu Zeng, Xingtong Wang, Zehua Ji, Li Shi

    Abstract: Pathological morphology diagnosis is the standard diagnosis method of hydatidiform mole. As a disease with malignant potential, the hydatidiform mole section of hydrops lesions is an important basis for diagnosis. Due to incomplete lesion development, early hydatidiform mole is difficult to distinguish, resulting in a low accuracy of clinical diagnosis. As a remarkable machine learning technology,…

    Submitted 11 April, 2022; originally announced April 2022.

  21. arXiv:2204.04949  [pdf]

    eess.IV cs.CV

    A Semantic Segmentation Network Based Real-Time Computer-Aided Diagnosis System for Hydatidiform Mole Hydrops Lesion Recognition in Microscopic View

    Authors: Chengze Zhu, **ge Hu, Xianxu Zeng, Xingtong Wang, Zehua Ji, Li Shi

    Abstract: As a disease with malignant potential, hydatidiform mole (HM) is one of the most common gestational trophoblastic diseases. For pathologists, the HM section of hydrops lesions is an important basis for diagnosis. In pathology departments, the diverse microscopic manifestations of HM lesions and the limited view under the microscope mean that physicians with extensive diagnostic experience are requ…

    Submitted 11 April, 2022; originally announced April 2022.

  22. SHREC 2021: Classification in cryo-electron tomograms

    Authors: Ilja Gubins, Marten L. Chaillet, Gijs van der Schot, M. Cristina Trueba, Remco C. Veltkamp, Friedrich Förster, Xiao Wang, Daisuke Kihara, Emmanuel Moebel, Nguyen P. Nguyen, Tommi White, Filiz Bunyak, Giorgos Papoulias, Stavros Gerolymatos, Evangelia I. Zacharaki, Konstantinos Moustakas, Xiangrui Zeng, Sinuo Liu, Min Xu, Yaoyu Wang, Cheng Chen, Xuefeng Cui, Fa Zhang

    Abstract: Cryo-electron tomography (cryo-ET) is an imaging technique that allows three-dimensional visualization of macro-molecular assemblies under near-native conditions. Cryo-ET comes with a number of challenges, mainly low signal-to-noise and inability to obtain images from all angles. Computational methods are key to analyze cryo-electron tomograms. To promote innovation in computational methods, we…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Workshop version of the paper can be found here: https://diglib.eg.org/handle/10.2312/3dor20211307

  23. arXiv:2201.01492  [pdf, other]

    eess.IV cs.CV

    FAVER: Blind Quality Prediction of Variable Frame Rate Videos

    Authors: Qi Zheng, Zhengzhong Tu, Pavan C. Madhusudana, Xiaoyang Zeng, Alan C. Bovik, Yibo Fan

    Abstract: Video quality assessment (VQA) remains an important and challenging problem that affects many applications at the widest scales. Recent advances in mobile devices and cloud computing techniques have made it possible to capture, process, and share high resolution, high frame rate (HFR) videos across the Internet nearly instantaneously. Being able to monitor and control the quality of these streamed…

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: 12 pages, 8 figures

  24. arXiv:2112.14420  [pdf, other]

    cs.CV eess.IV

    Invertible Image Dataset Protection

    Authors: Kejiang Chen, Xianhan Zeng, Qichao Ying, Sheng Li, Zhenxing Qian, Xinpeng Zhang

    Abstract: Deep learning has achieved enormous success in various industrial applications. Companies do not want their valuable data to be stolen by malicious employees to train pirated models. Nor do they wish the data analyzed by the competitors after using them online. We propose a novel solution for dataset protection in this scenario by robustly and reversibly transform the images into adversarial image…

    Submitted 29 December, 2021; originally announced December 2021.

    Comments: Submitted to ICME 2022. Authors are from University of Science and Technology of China, Fudan University, China. A potential extended version of this work is under way

  25. arXiv:2112.10683  [pdf, other]

    cs.CV eess.IV

    SelFSR: Self-Conditioned Face Super-Resolution in the Wild via Flow Field Degradation Network

    Authors: Xianfang Zeng, Jiangning Zhang, Liang Liu, Guangzhong Tian, Yong Liu

    Abstract: In spite of the success on benchmark datasets, most advanced face super-resolution models perform poorly in real scenarios since the remarkable domain gap between the real images and the synthesized training pairs. To tackle this problem, we propose a novel domain-adaptive degradation network for face super-resolution in the wild. This degradation network predicts a flow field along with an interm…

    Submitted 20 December, 2021; originally announced December 2021.

  26. arXiv:2112.06149   

    eess.IV cs.CV

    Two New Stenosis Detection Methods of Coronary Angiograms

    Authors: Yaofang Liu, Xinyue Zhang, Wenlong Wan, Shaoyu Liu, Yingdi Liu, Hu Liu, Xueying Zeng, Qing Zhang

    Abstract: Coronary angiography is the "gold standard" for diagnosing coronary artery disease (CAD). At present, the methods for detecting and evaluating coronary artery stenosis cannot satisfy the clinical needs, e.g., there is no prior study of detecting stenoses in prespecified vessel segments, which is necessary in clinical practice. Two vascular stenosis detection methods are proposed to assist the diag…

    Submitted 14 December, 2021; v1 submitted 11 December, 2021; originally announced December 2021.

    Comments: We submitted the paper due to an operational error. This paper is a modified version of the original paper Two New Stenoses Detection Methods of Coronary Angiograms (arXiv:2108.01516). And we will update the revised paper to the original paper later

  27. arXiv:2111.00485  [pdf, other]

    cs.CV eess.IV

    Learned Image Compression with Separate Hyperprior Decoders

    Authors: Zhao Zan, Chao Liu, Heming Sun, Xiaoyang Zeng, Yibo Fan

    Abstract: Learned image compression techniques have achieved considerable development in recent years. In this paper, we find that the performance bottleneck lies in the use of a single hyperprior decoder, in which case the ternary Gaussian model collapses to a binary one. To solve this, we propose to use three hyperprior decoders to separate the decoding process of the mixed parameters in discrete Gaussian…

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: This paper has been accepted by IEEE Open Journal of Circuits and Systems

  28. arXiv:2109.00617  [pdf, other]

    eess.SY cs.LG

    LinEasyBO: Scalable Bayesian Optimization Approach for Analog Circuit Synthesis via One-Dimensional Subspaces

    Authors: Shuhan Zhang, Fan Yang, Changhao Yan, Dian Zhou, Xuan Zeng

    Abstract: A large body of literature has proved that the Bayesian optimization framework is especially efficient and effective in analog circuit synthesis. However, most of the previous research works only focus on designing informative surrogate models or efficient acquisition functions. Even if searching for the global optimum over the acquisition function surface is itself a difficult task, it has been l…

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: 6 pages, 4 figures

  29. arXiv:2108.08551  [pdf, other]

    eess.IV cs.CV cs.MM

    Learned Video Compression with Residual Prediction and Loop Filter

    Authors: Chao Liu, Heming Sun, Jiro Katto, Xiaoyang Zeng, Yibo Fan

    Abstract: In this paper, we propose a learned video codec with a residual prediction network (RP-Net) and a feature-aided loop filter (LF-Net). For the RP-Net, we exploit the residual of previous multiple frames to further eliminate the redundancy of the current frame residual. For the LF-Net, the features from residual decoding network and the motion compensation network are used to aid the reconstruction…

    Submitted 19 August, 2021; originally announced August 2021.

  30. CarveMix: A Simple Data Augmentation Method for Brain Lesion Segmentation

    Authors: Xinru Zhang, Chenghao Liu, Ni Ou, Xiangzhu Zeng, Xiaoliang Xiong, Yizhou Yu, Zhiwen Liu, Chuyang Ye

    Abstract: Brain lesion segmentation provides a valuable tool for clinical diagnosis, and convolutional neural networks (CNNs) have achieved unprecedented success in the task. Data augmentation is a widely used strategy that improves the training of CNNs, and the design of the augmentation method for brain lesion segmentation is still an open problem. In this work, we propose a simple data augmentation appro…

    Submitted 16 August, 2021; v1 submitted 15 August, 2021; originally announced August 2021.

    Comments: accepted by MICCAI 2021

    Journal ref: MICCAI (2021) Part of the Lecture Notes in Computer Science book series (LNCS, volume 12901) pp 196-205

  31. arXiv:2108.01516  [pdf, other]

    eess.IV cs.CV

    Two New Stenosis Detection Methods of Coronary Angiograms

    Authors: Yaofang Liu, Xinyue Zhang, Wenlong Wan, Shaoyu Liu, Yingdi Liu, Hu Liu, Xueying Zeng, Qing Zhang

    Abstract: Coronary angiography is the "gold standard" for diagnosing coronary artery disease (CAD). At present, the methods for detecting and evaluating coronary artery stenosis cannot satisfy the clinical needs, e.g., there is no prior study of detecting stenoses in prespecified vessel segments, which is necessary in clinical practice. Two vascular stenosis detection methods are proposed to assist the diag…

    Submitted 14 December, 2021; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: Correspondence should be addressed to Qing Zhang

  32. An Efficient Batch Constrained Bayesian Optimization Approach for Analog Circuit Synthesis via Multi-objective Acquisition Ensemble

    Authors: Shuhan Zhang, Fan Yang, Changhao Yan, Dian Zhou, Xuan Zeng

    Abstract: Bayesian optimization is a promising methodology for analog circuit synthesis. However, the sequential nature of the Bayesian optimization framework significantly limits its ability to fully utilize real-world computational resources. In this paper, we propose an efficient parallelizable Bayesian optimization algorithm via Multi-objective ACquisition function Ensemble (MACE) to further accelerate…

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: 14 pages, 5 figures

  33. An Efficient Asynchronous Batch Bayesian Optimization Approach for Analog Circuit Synthesis

    Authors: Shuhan Zhang, Fan Yang, Dian Zhou, Xuan Zeng

    Abstract: In this paper, we propose EasyBO, an Efficient ASYnchronous Batch Bayesian Optimization approach for analog circuit synthesis. In this proposed approach, instead of waiting for the slowest simulations in the batch to finish, we accelerate the optimization procedure by asynchronously issuing the next query points whenever there is an idle worker. We introduce a new acquisition function that can bet…

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: 6 pages, 6 figures

  34. arXiv:2106.05905  [pdf, other]

    eess.SY cs.AI math.OC

    Multiple Dynamic Pricing for Demand Response with Adaptive Clustering-based Customer Segmentation in Smart Grids

    Authors: Fanlin Meng, Qian Ma, Zixu Liu, Xiao-Jun Zeng

    Abstract: In this paper, we propose a realistic multiple dynamic pricing approach to demand response in the retail market. First, an adaptive clustering-based customer segmentation framework is proposed to categorize customers into different groups to enable the effective identification of usage patterns. Second, customized demand models with important market constraints which capture the price-demand relat…

    Submitted 10 June, 2021; originally announced June 2021.

  35. arXiv:2106.04833  [pdf, other]

    cs.CL cs.SD eess.AS

    RealTranS: End-to-End Simultaneous Speech Translation with Convolutional Weighted-Shrinking Transformer

    Authors: Xingshan Zeng, Liangyou Li, Qun Liu

    Abstract: End-to-end simultaneous speech translation (SST), which directly translates speech in one language into text in another language in real-time, is useful in many scenarios but has not been fully investigated. In this work, we propose RealTranS, an end-to-end model for SST. To bridge the modality gap between speech and text, RealTranS gradually downsamples the input speech with interleaved convoluti…

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted by ACL2021 Findings

  36. arXiv:2106.00197  [pdf, other]

    cs.CL cs.SD eess.AS

    Multilingual Speech Translation with Unified Transformer: Huawei Noah's Ark Lab at IWSLT 2021

    Authors: Xingshan Zeng, Liangyou Li, Qun Liu

    Abstract: This paper describes the system submitted to the IWSLT 2021 Multilingual Speech Translation (MultiST) task from Huawei Noah's Ark Lab. We use a unified transformer architecture for our MultiST model, so that the data from different modalities (i.e., speech and text) and different tasks (i.e., Speech Recognition, Machine Translation, and Speech Translation) can be exploited to enhance the model's a…

    Submitted 21 June, 2021; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: IWSLT 2021

  37. Predictive Optimal Control with Data-Based Disturbance Scenario Tree Approximation

    Authors: Ran **g, Xiangrui Zeng

    Abstract: Efficiently computing the optimal control policy concerning a complicated future with stochastic disturbance has always been a challenge. The predicted stochastic future disturbance can be represented by a scenario tree, but solving the optimal control problem with a scenario tree is usually computationally demanding. In this paper, we propose a data-based clustering approximation method for the s…

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: accepted by American Control Conference (ACC) 2021

    Journal ref: 2021 American Control Conference (ACC), 2021, pp. 992-997

  38. Feedback-Based Dynamic Feature Selection for Constrained Continuous Data Acquisition

    Authors: Alp Sahin, Xiangrui Zeng

    Abstract: Relevant and high-quality data are critical to successful development of machine learning applications. For machine learning applications on dynamic systems equipped with a large number of sensors, such as connected vehicles and robots, how to find relevant and high-quality data features in an efficient way is a challenging problem. In this work, we address the problem of feature selection in cons…

    Submitted 22 February, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: to be published in ACC 2021

    Journal ref: 2021 American Control Conference (ACC), 2021, pp. 3507-3512

  39. arXiv:2011.02880  [pdf, other]

    eess.IV cs.CV

    Covariance Self-Attention Dual Path UNet for Rectal Tumor Segmentation

    Authors: Haijun Gao, Bochuan Zheng, Dazhi Pan, Xiangyin Zeng

    Abstract: Deep learning algorithms are preferable for rectal tumor segmentation. However, it is still a challenge task to accurately segment and identify the locations and sizes of rectal tumors by using deep learning methods. To increase the capability of extracting enough feature information for rectal tumor segmentation, we propose a Covariance Self-Attention Dual Path UNet (CSA-DPUNet). The proposed net…

    Submitted 5 January, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

  40. arXiv:2010.13059  [pdf, other]

    eess.IV cs.LG cs.MM

    A QP-adaptive Mechanism for CNN-based Filter in Video Coding

    Authors: Chao Liu, Heming Sun, Jiro Katto, Xiaoyang Zeng, Yibo Fan

    Abstract: Convolutional neural network (CNN)-based filters have achieved great success in video coding. However, in most previous works, individual models are needed for each quantization parameter (QP) band. This paper presents a generic method to help an arbitrary CNN-filter handle different quantization noise. We model the quantization noise problem and implement a feasible solution on CNN, which introdu…

    Submitted 25 October, 2020; originally announced October 2020.

  41. arXiv:2009.09826  [pdf, other]

    eess.SY cs.AI cs.LG

    Learning Safe Neural Network Controllers with Barrier Certificates

    Authors: Hengjun Zhao, Xia Zeng, Taolue Chen, Zhiming Liu, Jim Woodcock

    Abstract: We provide a novel approach to synthesize controllers for nonlinear continuous dynamical systems with control against safety properties. The controllers are based on neural networks (NNs). To certify the safety property we utilize barrier functions, which are represented by NNs as well. We train the controller-NN and barrier-NN simultaneously, achieving a verification-in-the-loop synthesis. We pro…

    Submitted 18 September, 2020; originally announced September 2020.

  42. arXiv:2009.02733  [pdf, other]

    eess.IV cs.MM

    A Convolutional Neural Network-Based Low Complexity Filter

    Authors: Chao Liu, Heming Sun, Jiro Katto, Xiaoyang Zeng, Yibo Fan

    Abstract: Convolutional Neural Network (CNN)-based filters have achieved significant performance in video artifacts reduction. However, the high complexity of existing methods makes it difficult to be applied in real usage. In this paper, a CNN-based low complexity filter is proposed. We utilize depth separable convolution (DSC) merged with the batch normalization (BN) as the backbone of our proposed CNN-ba…

    Submitted 6 September, 2020; originally announced September 2020.

  43. arXiv:2004.02396  [pdf, other]

    cs.LG eess.SP stat.ML

    A Learning Framework for n-bit Quantized Neural Networks toward FPGAs

    Authors: Jun Chen, Liang Liu, Yong Liu, Xianfang Zeng

    Abstract: The quantized neural network (QNN) is an efficient approach for network compression and can be widely used in the implementation of FPGAs. This paper proposes a novel learning framework for n-bit QNNs, whose weights are constrained to the power of two. To solve the gradient vanishing problem, we propose a reconstructed gradient function for QNNs in back-propagation algorithm that can directly get…

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: This paper has been accepted for publication in the IEEE Transactions on Neural Networks and Learning Systems

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems 2020

  44. arXiv:2003.06529  [pdf]

    eess.IV cs.CV cs.LG

    Boundary Guidance Hierarchical Network for Real-Time Tongue Segmentation

    Authors: Xinyi Zeng, Qian Zhang, Jia Chen, Guixu Zhang, Aimin Zhou, Yiqin Wang

    Abstract: Automated tongue image segmentation in tongue images is a challenging task for two reasons: 1) there are many pathological details on the tongue surface, which affect the extraction of the boundary; 2) the shapes of the tongues captured from various persons (with different diseases) are quite different. To deal with the challenge, a novel end-to-end Boundary Guidance Hierarchical Network (BGHNet)…

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: 10 pages, 8 figures

  45. arXiv:2003.01768  [pdf, ps, other]

    cs.CV eess.IV

    A Robust Imbalanced SAR Image Change Detection Approach Based on Deep Difference Image and PCANet

    Authors: Xinzheng Zhang, Hang Su, Ce Zhang, Peter M. Atkinson, Xiaoheng Tan, ** Zeng, Xin Jian

    Abstract: In this research, a novel robust change detection approach is presented for imbalanced multi-temporal synthetic aperture radar (SAR) image based on deep learning. Our main contribution is to develop a novel method for generating difference image and a parallel fuzzy c-means (FCM) clustering method. The main steps of our proposed approach are as follows: 1) Inspired by convolution and pooling in de…

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: 5 pages, 4 figures

  46. arXiv:1912.00402  [pdf, other]

    cs.LG eess.SY stat.ML

    Bayesian Optimization Approach for Analog Circuit Synthesis Using Neural Network

    Authors: Shuhan Zhang, Wenlong Lyu, Fan Yang, Changhao Yan, Dian Zhou, Xuan Zeng

    Abstract: Bayesian optimization with Gaussian process as surrogate model has been successfully applied to analog circuit synthesis. In the traditional Gaussian process regression model, the kernel functions are defined explicitly. The computational complexity of training is O(N^3), and the computation complexity of prediction is O(N^2), where N is the number of training data. Gaussian process model can al…

    Submitted 1 December, 2019; originally announced December 2019.

    Journal ref: 2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

  47. An Efficient Multi-fidelity Bayesian Optimization Approach for Analog Circuit Synthesis

    Authors: Shuhan Zhang, Wenlong Lyu, Fan Yang, Changhao Yan, Dian Zhou, Xuan Zeng, Xiangdong Hu

    Abstract: This paper presents an efficient multi-fidelity Bayesian optimization approach for analog circuit synthesis. The proposed method can significantly reduce the overall computational cost by fusing the simple but potentially inaccurate low-fidelity model and a few accurate but expensive high-fidelity data. Gaussian Process (GP) models are employed to model the low- and high-fidelity black-box functio…

    Submitted 1 December, 2019; originally announced December 2019.

    Journal ref: The 56th Annual Design Automation Conference 2019

  48. arXiv:1911.09857  [pdf, ps, other]

    eess.IV cs.LG cs.MM

    Dual Learning-based Video Coding with Inception Dense Blocks

    Authors: Chao Liu, Heming Sun, Junan Chen, Zhengxue Cheng, Masaru Takeuchi, Jiro Katto, Xiaoyang Zeng, Yibo Fan

    Abstract: In this paper, a dual learning-based method in intra coding is introduced for PCS Grand Challenge. This method is mainly composed of two parts: intra prediction and reconstruction filtering. They use different network structures, the neural network-based intra prediction uses the full-connected network to predict the block while the neural network-based reconstruction filtering utilizes the convol…

    Submitted 21 November, 2019; originally announced November 2019.

  49. arXiv:1911.03044  [pdf, other]

    q-bio.QM cs.LG eess.IV

    AITom: Open-source AI platform for cryo-electron tomography data analysis

    Authors: Xiangrui Zeng, Min Xu

    Abstract: Cryo-electron tomography (cryo-ET) is an emerging technology for the 3D visualization of structural organizations and interactions of subcellular components at near-native state and sub-molecular resolution. Tomograms captured by cryo-ET contain heterogeneous structures representing the complex and dynamic subcellular environment. Since the structures are not purified or fluorescently labeled, the…

    Submitted 30 October, 2020; v1 submitted 7 November, 2019; originally announced November 2019.

    Comments: 2 figures

  50. arXiv:1908.09993  [pdf, other]

    eess.IV cs.CV cs.LG q-bio.QM

    Deep Learning-Based Strategy for Macromolecules Classification with Imbalanced Data from Cellular Electron Cryotomography

    Authors: Ziqian Luo, Xiangrui Zeng, Zhipeng Bao, Min Xu

    Abstract: Deep learning model trained by imbalanced data may not work satisfactorily since it could be determined by major classes and thus may ignore the classes with small amount of data. In this paper, we apply deep learning based imbalanced data classification for the first time to cellular macromolecular complexes captured by Cryo-electron tomography (Cryo-ET). We adopt a range of strategies to cope wi…

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: 13 pages. arXiv admin note: text overlap with arXiv:1710.09412, arXiv:1710.05381, arXiv:1708.02002 by other authors

    Journal ref: 2019 International Joint Conference on Neural Networks (IJCNN)