Skip to main content

Showing 1–27 of 27 results for author: Du, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.12943  [pdf

    eess.IV

    A square cross-section FOV rotational CL (SC-CL) and its analytical reconstruction method

    Authors: Xiang Zou, Wuliang Shi, Muge Du, Yuxiang Xing

    Abstract: Rotational computed laminography (CL) has broad application potential in three-dimensional imaging of plate-like objects, as it only needs x-ray to pass through the tested object in the thickness direction during the imaging process. In this study, a square cross-section FOV rotational CL (SC-CL) was proposed. Then, the FDK-type analytical reconstruction algorithm applicable to the SC-CL was deriv… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2401.08469  [pdf, other

    eess.IV cs.CV cs.LG

    Explanations of Classifiers Enhance Medical Image Segmentation via End-to-end Pre-training

    Authors: Jiamin Chen, Xuhong Li, Yanwu Xu, Mengnan Du, Haoyi Xiong

    Abstract: Medical image segmentation aims to identify and locate abnormal structures in medical images, such as chest radiographs, using deep neural networks. These networks require a large number of annotated images with fine-grained masks for the regions of interest, making pre-training strategies based on classification datasets essential for sample efficiency. Based on a large-scale medical image classi… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  3. arXiv:2401.01755  [pdf, other

    cs.SD cs.AI eess.AS

    Incremental FastPitch: Chunk-based High Quality Text to Speech

    Authors: Muyang Du, Chuan Liu, Junjie Lai

    Abstract: Parallel text-to-speech models have been widely applied for real-time speech synthesis, and they offer more controllability and a much faster synthesis process compared with conventional auto-regressive models. Although parallel models have benefits in many aspects, they become naturally unfit for incremental synthesis due to their fully parallel architecture such as transformer. In this work, we… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 5 pages, 4 figures, 1 table

  4. arXiv:2312.00308  [pdf, other

    cs.CV eess.IV stat.AP

    A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing

    Authors: Longfeng Nie, Yuntian Chen, Mengge Du, Changqi Sun, Dongxiao Zhang

    Abstract: Cloud types, as a type of meteorological data, are of particular significance for evaluating changes in rainfall, heatwaves, water resources, floods and droughts, food security and vegetation cover, as well as land use. In order to effectively utilize high-resolution geostationary observations, a knowledge-based data-driven (KBDD) framework for all-day identification of cloud types based on spectr… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  5. arXiv:2311.10349  [pdf, other

    eess.IV cs.CV cs.LG

    Pseudo Label-Guided Data Fusion and Output Consistency for Semi-Supervised Medical Image Segmentation

    Authors: Tao Wang, Yuanbin Chen, Xinlin Zhang, Yuanbo Zhou, Junlin Lan, Bizhe Bai, Tao Tan, Min Du, Qinquan Gao, Tong Tong

    Abstract: Supervised learning algorithms based on Convolutional Neural Networks have become the benchmark for medical image segmentation tasks, but their effectiveness heavily relies on a large amount of labeled data. However, annotating medical image datasets is a laborious and time-consuming process. Inspired by semi-supervised algorithms that use both labeled and unlabeled data for training, we propose t… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  6. arXiv:2308.08181  [pdf, ps, other

    cs.SD cs.CL eess.AS

    ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 2023

    Authors: Mengjie Du, Xiang Fang, Jie Li

    Abstract: This technical report describes ChinaTelecom system for Track 1 (closed) of the VoxCeleb2023 Speaker Recognition Challenge (VoxSRC 2023). Our system consists of several ResNet variants trained only on VoxCeleb2, which were fused for better performance later. Score calibration was also applied for each variant and the fused system. The final submission achieved minDCF of 0.1066 and EER of 1.980%.

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: System description of VoxSRC 2023

  7. arXiv:2306.16918  [pdf, other

    eess.IV cs.CV

    PCDAL: A Perturbation Consistency-Driven Active Learning Approach for Medical Image Segmentation and Classification

    Authors: Tao Wang, Xinlin Zhang, Yuanbo Zhou, Junlin Lan, Tao Tan, Min Du, Qinquan Gao, Tong Tong

    Abstract: In recent years, deep learning has become a breakthrough technique in assisting medical image diagnosis. Supervised learning using convolutional neural networks (CNN) provides state-of-the-art performance and has served as a benchmark for various medical image segmentation and classification. However, supervised learning deeply relies on large-scale annotated data, which is expensive, time-consumi… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  8. arXiv:2302.12864  [pdf, other

    eess.SY

    A Data-Driven Polynomial Chaos Expansion-Based Method for Microgrid Ram** Support Capability Assessment and Enhancement

    Authors: Mohan Du, Xiaozhe Wang

    Abstract: Microgrids (MGs) are regarded as effective solutions to provide ram** support to the main grid during heavy-load periods. Nevertheless, the uncertain renewable energy sources (RES) and electric vehicles (EVs) integrated into an MG may affect the ram** support capability (RSC) of an MG. To address the challenge, this paper develops a data-driven sparse polynomial chaos expansion (DDSPCE)-based… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: This paper is accepted and will appear in 2023 IEEE Power & Energy Society General Meeting (GM). 5 pages, 4 figures

  9. arXiv:2301.06595  [pdf, other

    physics.comp-ph eess.IV physics.optics

    PtyLab.m/py/jl: a cross-platform, open-source inverse modeling toolbox for conventional and Fourier ptychography

    Authors: Lars Loetgering, Mengqi Du, Dirk Boonzajer Flaes, Tomas Aidukas, Felix Wechsler, Daniel S. Penagos Molina, Max Rose, Antonios Pelekanidis, Wilhelm Eschen, Jürgen Hess, Thomas Wilhein, Rainer Heintzmann, Jan Rothhardt, Stefan Witte

    Abstract: Conventional (CP) and Fourier (FP) ptychography have emerged as versatile quantitative phase imaging techniques. While the main application cases for each technique are different, namely lens-less short wavelength imaging for CP and lens-based visible light imaging for FP, both methods share a common algorithmic ground. CP and FP have in part independently evolved to include experimentally robust… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

  10. arXiv:2211.13939  [pdf, other

    cs.SD cs.LG eess.AS

    Efficient Incremental Text-to-Speech on GPUs

    Authors: Muyang Du, Chuan Liu, Jiaxing Qi, Junjie Lai

    Abstract: Incremental text-to-speech, also known as streaming TTS, has been increasingly applied to online speech applications that require ultra-low response latency to provide an optimal user experience. However, most of the existing speech synthesis pipelines deployed on GPU are still non-incremental, which uncovers limitations in high-concurrency scenarios, especially when the pipeline is built with end… ▽ More

    Submitted 5 December, 2022; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: 5 pages, 4 figures

  11. arXiv:2205.14850  [pdf, other

    cs.RO cs.LG cs.SD eess.AS

    Play it by Ear: Learning Skills amidst Occlusion through Audio-Visual Imitation Learning

    Authors: Maximilian Du, Olivia Y. Lee, Suraj Nair, Chelsea Finn

    Abstract: Humans are capable of completing a range of challenging manipulation tasks that require reasoning jointly over modalities such as vision, touch, and sound. Moreover, many such tasks are partially-observed; for example, taking a notebook out of a backpack will lead to visual occlusion and require reasoning over the history of audio or tactile information. While robust tactile sensing can be costly… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Journal ref: Robotics Science and Systems (RSS) 2022

  12. arXiv:2112.06226  [pdf, other

    eess.IV cs.CV

    Attention based Broadly Self-guided Network for Low light Image Enhancement

    Authors: Zilong Chen, Yaling Liang, Minghui Du

    Abstract: During the past years,deep convolutional neural networks have achieved impressive success in low-light Image Enhancement.Existing deep learning methods mostly enhance the ability of feature extraction by stacking network structures and deepening the depth of the network.which causes more runtime cost on single image.In order to reduce inference time while fully extracting local features and global… ▽ More

    Submitted 15 December, 2021; v1 submitted 12 December, 2021; originally announced December 2021.

    Comments: 10 Pages,8 Figures,4 Tables

  13. arXiv:2111.12983  [pdf, other

    cs.CV eess.IV

    Investigation of domain gap problem in several deep-learning-based CT metal artefact reduction methods

    Authors: Muge Du, Kaichao Liang, Yinong Liu, Yuxiang Xing

    Abstract: Metal artefacts in CT images may disrupt image quality and interfere with diagnosis. Recently many deep-learning-based CT metal artefact reduction (MAR) methods have been proposed. Current deep MAR methods may be troubled with domain gap problem, where methods trained on simulated data cannot perform well on practical data. In this work, we experimentally investigate two image-domain supervised me… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

  14. arXiv:2108.11558  [pdf, ps, other

    eess.SY

    Targeted False Data Injection Attacks Against AC State Estimation Without Network Parameters

    Authors: Mingqiu Du, Georgia Pierrou, Xiaozhe Wang, Marthe Kassouf

    Abstract: State estimation is a data processing algorithm for converting redundant meter measurements and other information into an estimate of the state of a power system. Relying heavily on meter measurements, state estimation has proven to be vulnerable to cyber attacks. In this paper, a novel targeted false data injection attack (FDIA) model against AC state estimation is proposed. Leveraging on the int… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

  15. arXiv:2107.00279  [pdf, other

    cs.CL cs.SD eess.AS

    The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021

    Authors: Dan Liu, Mengge Du, Xiaoxi Li, Yuchen Hu, Lirong Dai

    Abstract: This paper describes USTC-NELSLIP's submissions to the IWSLT2021 Simultaneous Speech Translation task. We proposed a novel simultaneous translation model, Cross Attention Augmented Transducer (CAAT), which extends conventional RNN-T to sequence-to-sequence tasks without monotonic constraints, e.g., simultaneous translation. Experiments on speech-to-text (S2T) and text-to-text (T2T) simultaneous tr… ▽ More

    Submitted 9 July, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

  16. arXiv:2103.05114  [pdf, other

    eess.IV cs.CV cs.LG

    Learning Invariant Representations across Domains and Tasks

    Authors: **dong Wang, Wenjie Feng, Chang Liu, Chaohui Yu, Mingxuan Du, Renjun Xu, Tao Qin, Tie-Yan Liu

    Abstract: Being expensive and time-consuming to collect massive COVID-19 image samples to train deep classification models, transfer learning is a promising approach by transferring knowledge from the abundant typical pneumonia datasets for COVID-19 image classification. However, negative transfer may deteriorate the performance due to the feature distribution divergence between two datasets and task semant… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: Technical report, 12 pages

  17. arXiv:2102.11896  [pdf, ps, other

    eess.SY

    Targeted False Data Injection Attack against DC State Estimation without Line Parameters

    Authors: Mingqiu Du, Georgia Pierrou, Xiaozhe Wang

    Abstract: A novel false data injection attack (FDIA) model against DC state estimation is proposed, which requires no network parameters and exploits only limited phasor measurement unit (PMU) data. The proposed FDIA model can target specific states and launch large deviation attacks using estimated line parameters. Sufficient conditions for the proposed method are also presented. Different attack vectors a… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

  18. Using a modified double deep image prior for crosstalk mitigation in multislice ptychography

    Authors: Ming Du, Xiao**g Huang, Chris Jacobsen

    Abstract: Multislice ptychography is a high-resolution microscopy technique used to image multiple separate axial planes using a single illumination direction. However, multislice ptychography reconstructions are often degraded by crosstalk, where some features on one plane erroneously contribute to the reconstructed image of another plane. Here, we demonstrate the use of a modified "double deep image prior… ▽ More

    Submitted 29 January, 2021; originally announced February 2021.

    Comments: 10 pages, 5 figures

  19. arXiv:2012.12686  [pdf, other

    eess.IV math.NA physics.comp-ph

    Adorym: A multi-platform generic x-ray image reconstruction framework based on automatic differentiation

    Authors: Ming Du, Saugat Kandel, Jun**g Deng, Xiao**g Huang, Arnaud Demortiere, Tuan Tu Nguyen, Remi Tucoulou, Vincent De Andrade, Qiaoling **, Chris Jacobsen

    Abstract: We describe and demonstrate an optimization-based x-ray image reconstruction framework called Adorym. Our framework provides a generic forward model, allowing one code framework to be used for a wide range of imaging methods ranging from near-field holography to and fly-scan ptychographic tomography. By using automatic differentiation for optimization, Adorym has the flexibility to refine experime… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

    MSC Class: 78-04

  20. arXiv:1912.03449  [pdf, other

    eess.SP cs.LG

    Fully Dense Neural Network for the Automatic Modulation Recognition

    Authors: Miao Du, Qin Yu, Shaomin Fei, Chen Wang, Xiaofeng Gong, Ruisen Luo

    Abstract: Nowadays, we mainly use various convolution neural network (CNN) structures to extract features from radio data or spectrogram in AMR. Based on expert experience and spectrograms, they not only increase the difficulty of preprocessing, but also consume a lot of memory. In order to directly use in-phase and quadrature (IQ) data obtained by the receiver and enhance the efficiency of network extracti… ▽ More

    Submitted 7 December, 2019; originally announced December 2019.

  21. Stain Style Transfer using Transitive Adversarial Networks

    Authors: Shao** Cai, Yuyang Xue3 Qinquan Gao, Min Du, Gang Chen, Hejun Zhang, Tong Tong

    Abstract: Digitized pathological diagnosis has been in increasing demand recently. It is well known that color information is critical to the automatic and visual analysis of pathological slides. However, the color variations due to various factors not only have negative impact on pathologist's diagnosis, but also will reduce the robustness of the algorithms. The factors that cause the color differences are… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: MICCAI 2019 MLMIR Workshop, Oral Paper

  22. Learning Enhanced Resolution-wise features for Human Pose Estimation

    Authors: Kun Zhang, Peng He, ** Yao, Ge Chen, Rui Wu, Min Du, Huimin Li, Li Fu, Tianyao Zheng

    Abstract: Recently, multi-resolution networks (such as Hourglass, CPN, HRNet, etc.) have achieved significant performance on pose estimation by combining feature maps of various resolutions. In this paper, we propose a Resolution-wise Attention Module (RAM) and Gradual Pyramid Refinement (GPR), to learn enhanced resolution-wise feature maps for precise pose estimation. Specifically, RAM learns a group of we… ▽ More

    Submitted 13 December, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: Published on ICIP 2020

  23. arXiv:1908.06770  [pdf, other

    eess.IV physics.app-ph physics.med-ph

    Near, far, wherever you are: simulations on the dose efficiency of holographic and ptychographic coherent imaging

    Authors: Ming Du, Doga Gursoy, Chris Jacobsen

    Abstract: Different studies in x-ray microscopy have arrived at conflicting conclusions about the dose efficiency of imaging modes involving the recording of intensity distributions in the near (Fresnel regime) or far (Fraunhofer regime) field downstream of a specimen. We present here a numerical study on the dose efficiency of near-field holography (NFH), near-field ptychography (NFP), and far-field ptycho… ▽ More

    Submitted 11 March, 2020; v1 submitted 16 August, 2019; originally announced August 2019.

    Journal ref: Journal of Applied Crystallography. 53, 748-759 (2020)

  24. arXiv:1907.04536  [pdf

    cs.LG cs.SD eess.AS stat.ML

    Multi-layer Attention Mechanism for Speech Keyword Recognition

    Authors: Ruisen Luo, Tianran Sun, Chen Wang, Miao Du, Zuodong Tang, Kai Zhou, Xiaofeng Gong, Xiaomei Yang

    Abstract: As an important part of speech recognition technology, automatic speech keyword recognition has been intensively studied in recent years. Such technology becomes especially pivotal under situations with limited infrastructures and computational resources, such as voice command recognition in vehicles and robot interaction. At present, the mainstream methods in automatic speech keyword recognition… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

  25. arXiv:1907.02244  [pdf, other

    cs.CV eess.IV

    Searching for Apparel Products from Images in the Wild

    Authors: Son Tran, Ming Du, Sampath Chanda, R. Manmatha, Cj Taylor

    Abstract: In this age of social media, people often look at what others are wearing. In particular, Instagram and Twitter influencers often provide images of themselves wearing different outfits and their followers are often inspired to buy similar clothes.We propose a system to automatically find the closest visually similar clothes in the online Catalog (street-to-shop searching). The problem is challengi… ▽ More

    Submitted 7 April, 2022; v1 submitted 4 July, 2019; originally announced July 2019.

    Comments: KDD2019, AI for Fashion Workshop

  26. arXiv:1905.10433  [pdf, other

    eess.IV physics.app-ph physics.optics

    Three dimensions, two microscopes, one code: automatic differentiation for x-ray nanotomography beyond the depth of focus limit

    Authors: Ming Du, Youssef S. G. Nashed, Saugat Kandel, Doga Gursoy, Chris Jacobsen

    Abstract: Conventional tomographic reconstruction algorithms assume that one has obtained pure projection images, involving no within-specimen diffraction effects nor multiple scattering. Advances in x-ray nanotomography are leading towards the violation of these assumptions, by combining the high penetration power of x-rays which enables thick specimens to be imaged, with improved spatial resolution which… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

    Journal ref: Science Advances. 6, eaay3700 (2020)

  27. X-ray tomography of extended objects: a comparison of data acquisition approaches

    Authors: Ming Du, Rafael Vescovi, Kamel Fezzaa, Chris Jacobsen, Doga Gursoy

    Abstract: The penetration power of x-rays allows one to image large objects. For example, centimeter-sized specimens can be imaged with micron-level resolution using synchrotron sources. In this case, however, the limited beam diameter and detector size preclude the acquisition of the full sample in a single take, necessitating strategies for combining data from multiple regions. Object stitching involves t… ▽ More

    Submitted 11 July, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: Under review

    Journal ref: Journal of the Optical Society of America A. 35, 1871 (2018)