Skip to main content

Showing 1–25 of 25 results for author: Guan, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.03118  [pdf, other

    cs.SD eess.AS

    Determined Multichannel Blind Source Separation with Clustered Source Model

    Authors: Jianyu Wang, Shanzheng Guan

    Abstract: The independent low-rank matrix analysis (ILRMA) method stands out as a prominent technique for multichannel blind audio source separation. It leverages nonnegative matrix factorization (NMF) and nonnegative canonical polyadic decomposition (NCPD) to model source parameters. While it effectively captures the low-rank structure of sources, the NMF model overlooks inter-channel dependencies. On the… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  2. arXiv:2401.01763  [pdf, other

    cs.SD eess.AS

    Multichannel blind speech source separation with a disjoint constraint source model

    Authors: Jianyu Wang, Shanzheng Guan

    Abstract: Multichannel convolutive blind speech source separation refers to the problem of separating different speech sources from the observed multichannel mixtures without much a priori information about the mixing system. Multichannel nonnegative matrix factorization (MNMF) has been proven to be one of the most powerful separation frameworks and the representative algorithms such as MNMF and the indepen… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  3. arXiv:2401.01762  [pdf, other

    cs.SD eess.AS

    Independent low-rank matrix analysis based on the Sinkhorn divergence source model for blind source separation

    Authors: Jianyu Wang, Shanzheng Guan, **gdong Chen, Jacob Benesty

    Abstract: The so-called independent low-rank matrix analysis (ILRMA) has demonstrated a great potential for dealing with the problem of determined blind source separation (BSS) for audio and speech signals. This method assumes that the spectra from different frequency bands are independent and the spectral coefficients in any frequency band are Gaussian distributed. The Itakura-Saito divergence is then empl… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  4. arXiv:2305.08408  [pdf, other

    cs.CV eess.IV

    SB-VQA: A Stack-Based Video Quality Assessment Framework for Video Enhancement

    Authors: Ding-Jiun Huang, Yu-Ting Kao, Tieh-Hung Chuang, Ya-Chun Tsai, **g-Kai Lou, Shuen-Huei Guan

    Abstract: In recent years, several video quality assessment (VQA) methods have been developed, achieving high performance. However, these methods were not specifically trained for enhanced videos, which limits their ability to predict video quality accurately based on human subjective perception. To address this issue, we propose a stack-based framework for VQA that outperforms existing state-of-the-art met… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: CVPR NTIRE 2023

  5. CaraNet: Context Axial Reverse Attention Network for Segmentation of Small Medical Objects

    Authors: Ange Lou, Shuyue Guan, Murray Loew

    Abstract: Segmenting medical images accurately and reliably is important for disease diagnosis and treatment. It is a challenging task because of the wide variety of objects' sizes, shapes, and scanning modalities. Recently, many convolutional neural networks (CNN) have been designed for segmentation tasks and achieved great success. Few studies, however, have fully considered the sizes of objects, and thus… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: text overlap with arXiv:2108.07368

    Journal ref: Journal of Medical Imaging 10(1), 014005 (18 February 2023)

  6. arXiv:2212.14828  [pdf, other

    eess.IV cs.CV

    Informing selection of performance metrics for medical image segmentation evaluation using configurable synthetic errors

    Authors: Shuyue Guan, Ravi K. Samala, Weijie Chen

    Abstract: Machine learning-based segmentation in medical imaging is widely used in clinical applications from diagnostics to radiotherapy treatment planning. Segmented medical images with ground truth are useful for investigating the properties of different segmentation performance metrics to inform metric selection. Regular geometrical shapes are often used to synthesize segmentation errors and illustrate… ▽ More

    Submitted 30 December, 2022; originally announced December 2022.

    Comments: 8 pages, 8 figures. Accepted by IEEE AIPR 2022 (Oral)

    Report number: 25

  7. arXiv:2202.08065  [pdf, other

    eess.SY math.OC

    Graph Neural Network and Koopman Models for Learning Networked Dynamics: A Comparative Study on Power Grid Transients Prediction

    Authors: Sai Pushpak Nandanoori, Sheng Guan, Soumya Kundu, Seemita Pal, Khushbu Agarwal, Yinghui Wu, Sutanay Choudhury

    Abstract: Continuous monitoring of the spatio-temporal dynamic behavior of critical infrastructure networks, such as the power systems, is a challenging but important task. In particular, accurate and timely prediction of the (electro-mechanical) transient dynamic trajectories of the power grid is necessary for early detection of any instability and prevention of catastrophic failures. Existing approaches f… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: 17 pages, this paper is currently under review in a journal

  8. arXiv:2201.02771  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    A Sneak Attack on Segmentation of Medical Images Using Deep Neural Network Classifiers

    Authors: Shuyue Guan, Murray Loew

    Abstract: Instead of using current deep-learning segmentation models (like the UNet and variants), we approach the segmentation problem using trained Convolutional Neural Network (CNN) classifiers, which automatically extract important features from images for classification. Those extracted features can be visualized and formed into heatmaps using Gradient-weighted Class Activation Map** (Grad-CAM). This… ▽ More

    Submitted 27 January, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

    Comments: 8 pages, 10 figures. Accepted by IEEE AIPR 2021 (Oral)

    Report number: 13

  9. arXiv:2108.09374  [pdf

    eess.IV cs.CV

    Fourier Neural Operator Networks: A Fast and General Solver for the Photoacoustic Wave Equation

    Authors: Steven Guan, Ko-Tsung Hsu, Parag V. Chitnis

    Abstract: Simulation tools for photoacoustic wave propagation have played a key role in advancing photoacoustic imaging by providing quantitative and qualitative insights into parameters affecting image quality. Classical methods for numerically solving the photoacoustic wave equation relies on a fine discretization of space and can become computationally expensive for large computational grids. In this wor… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

  10. arXiv:2108.07368  [pdf

    eess.IV cs.CV

    CaraNet: Context Axial Reverse Attention Network for Segmentation of Small Medical Objects

    Authors: Ange Lou, Shuyue Guan, Hanseok Ko, Murray Loew

    Abstract: Segmenting medical images accurately and reliably is important for disease diagnosis and treatment. It is a challenging task because of the wide variety of objects' sizes, shapes, and scanning modalities. Recently, many convolutional neural networks (CNN) have been designed for segmentation tasks and achieved great success. Few studies, however, have fully considered the sizes of objects, and thus… ▽ More

    Submitted 13 January, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Accepted by SPIE Medical Imaging: Image Processing (oral presentation)

  11. arXiv:2107.03067  [pdf

    cs.LG eess.SY

    Distributed adaptive algorithm based on the asymmetric cost of error functions

    Authors: Sihai Guan, Qing Cheng, Yong Zhao

    Abstract: In this paper, a family of novel diffusion adaptive estimation algorithm is proposed from the asymmetric cost function perspective by combining diffusion strategy and the linear-linear cost (LLC), quadratic-quadratic cost (QQC), and linear-exponential cost (LEC), at all distributed network nodes, and named diffusion LLCLMS (DLLCLMS), diffusion QQCLMS (DQQCLMS), and diffusion LECLMS (DLECLMS), resp… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

  12. arXiv:2107.00178  [pdf, other

    cs.SD eess.AS

    Attention-based multi-channel speaker verification with ad-hoc microphone arrays

    Authors: Chengdong Liang, Junqi Chen, Shanzheng Guan, Xiao-Lei Zhang

    Abstract: Recently, ad-hoc microphone array has been widely studied. Unlike traditional microphone array settings, the spatial arrangement and number of microphones of ad-hoc microphone arrays are not known in advance, which hinders the adaptation of traditional speaker verification technologies to ad-hoc microphone arrays. To overcome this weakness, in this paper, we propose attention-based multi-channel s… ▽ More

    Submitted 30 June, 2021; originally announced July 2021.

    Comments: Submitted to APSIPA ASC 2021

  13. arXiv:2105.04075  [pdf

    cs.CV eess.IV

    CFPNet-M: A Light-Weight Encoder-Decoder Based Network for Multimodal Biomedical Image Real-Time Segmentation

    Authors: Ange Lou, Shuyue Guan, Murray Loew

    Abstract: Currently, developments of deep learning techniques are providing instrumental to identify, classify, and quantify patterns in medical images. Segmentation is one of the important applications in medical image analysis. In this regard, U-Net is the predominant approach to medical image segmentation tasks. However, we found that those U-Net based models have limitations in several aspects, for exam… ▽ More

    Submitted 30 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

  14. arXiv:2104.03130  [pdf

    eess.IV cs.CV cs.LG

    Dense Dilated UNet: Deep Learning for 3D Photoacoustic Tomography Image Reconstruction

    Authors: Steven Guan, Ko-Tsung Hsu, Matthias Eyassu, Parag V. Chitnis

    Abstract: In photoacoustic tomography (PAT), the acoustic pressure waves produced by optical excitation are measured by an array of detectors and used to reconstruct an image. Sparse spatial sampling and limited-view detection are two common challenges faced in PAT. Reconstructing from incomplete data using standard methods results in severe streaking artifacts and blurring. We propose a modified convolutio… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

  15. arXiv:2103.15118  [pdf, other

    eess.AS

    Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays

    Authors: Shanzheng Guan, Shupei Liu, Junqi Chen, Wenbo Zhu, Shengqiang Li, Xu Tan, Ziye Yang, Menglong Xu, Yijiang Chen, Jianyu Wang, Xiao-Lei Zhang

    Abstract: Recently, there is a research trend on ad-hoc microphone arrays. However, most research was conducted on simulated data. Although some data sets were collected with a small number of distributed devices, they were not synchronized which hinders the fundamental theoretical research to ad-hoc microphone arrays. To address this issue, this paper presents a synchronized speech corpus, named Libri-adho… ▽ More

    Submitted 6 April, 2021; v1 submitted 28 March, 2021; originally announced March 2021.

  16. arXiv:2101.06398  [pdf, other

    cs.SD eess.AS

    Minimum-volume Multichannel Nonnegative matrix factorization for blind source separation

    Authors: Jianyu Wang, Shanzheng Guan, Shupei Liu, Xiao-Lei Zhang

    Abstract: Multichannel blind audio source separation aims to recover the latent sources from their multichannel mixtures without supervised information. One state-of-the-art blind audio source separation method, named independent low-rank matrix analysis (ILRMA), unifies independent vector analysis (IVA) and nonnegative matrix factorization (NMF). However, the spectra matrix produced from NMF may not find a… ▽ More

    Submitted 29 March, 2021; v1 submitted 16 January, 2021; originally announced January 2021.

  17. arXiv:2012.00403  [pdf, other

    cs.SD cs.CL eess.AS

    Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation

    Authors: Ziye Yang, Shanzheng Guan, Xiao-Lei Zhang

    Abstract: Recently, the research on ad-hoc microphone arrays with deep learning has drawn much attention, especially in speech enhancement and separation. Because an ad-hoc microphone array may cover such a large area that multiple speakers may locate far apart and talk independently, target-dependent speech separation, which aims to extract a target speaker from a mixed speech, is important for extracting… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  18. arXiv:2011.00376  [pdf

    eess.IV cs.CV

    Segmentation of Infrared Breast Images Using MultiResUnet Neural Network

    Authors: Ange Lou, Shuyue Guan, Nada Kamona, Murray Loew

    Abstract: Breast cancer is the second leading cause of death for women in the U.S. Early detection of breast cancer is key to higher survival rates of breast cancer patients. We are investigating infrared (IR) thermography as a noninvasive adjunct to mammography for breast cancer screening. IR imaging is radiation-free, pain-free, and non-contact. Automatic segmentation of the breast area from the acquired… ▽ More

    Submitted 31 October, 2020; originally announced November 2020.

    Comments: 6 pages. Accepted by IEEE AIPR 2019 (Oral)

  19. arXiv:2006.00414  [pdf

    eess.IV cs.CV

    DC-UNet: Rethinking the U-Net Architecture with Dual Channel Efficient CNN for Medical Images Segmentation

    Authors: Ange Lou, Shuyue Guan, Murray Loew

    Abstract: Recently, deep learning has become much more popular in computer vision area. The Convolution Neural Network (CNN) has brought a breakthrough in images segmentation areas, especially, for medical images. In this regard, U-Net is the predominant approach to medical image segmentation task. The U-Net not only performs well in segmenting multimodal medical images generally, but also in some tough cas… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

  20. arXiv:2002.12345  [pdf, other

    cs.CV cs.LG eess.IV

    A Novel Measure to Evaluate Generative Adversarial Networks Based on Direct Analysis of Generated Images

    Authors: Shuyue Guan, Murray Loew

    Abstract: The Generative Adversarial Network (GAN) is a state-of-the-art technique in the field of deep learning. A number of recent papers address the theory and applications of GANs in various fields of image processing. Fewer studies, however, have directly evaluated GAN outputs. Those that have been conducted focused on using classification performance, e.g., Inception Score (IS) and statistical metrics… ▽ More

    Submitted 7 April, 2021; v1 submitted 27 February, 2020; originally announced February 2020.

    Comments: 16 pages, 11 figures. Accepted by the Neural Computing and Applications journal

    Report number: NCAA-D-20-03011

    Journal ref: Neural Comput & Applic 33, 13921-13936 (2021)

  21. arXiv:1911.04357  [pdf

    eess.IV cs.CV cs.LG physics.med-ph

    Limited View and Sparse Photoacoustic Tomography for Neuroimaging with Deep Learning

    Authors: Steven Guan, Amir A. Khan, Siddhartha Sikdar, Parag V. Chitnis

    Abstract: Photoacoustic tomography (PAT) is a nonionizing imaging modality capable of acquiring high contrast and resolution images of optical absorption at depths greater than traditional optical imaging techniques. Practical considerations with instrumentation and geometry limit the number of available acoustic sensors and their view of the imaging target, which result in significant image reconstruction… ▽ More

    Submitted 27 June, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Journal ref: Sci Rep 10, 8510 (2020)

  22. arXiv:1908.09730  [pdf

    eess.SP

    Diffusion probabilistic LMS algorithm

    Authors: Sihai Guan, Chun Meng, Bharat Biswal

    Abstract: In this paper, a novel diffusion estimation algorithm is proposed from a probabilistic perspective by combining diffusion strategy and the probabilistic least-mean-squares (PLMS) at all agents. The proposed method diffusion probabilistic LMS (DPLMS) is more robust to input signal and impulsive interference than the DSE-LMS, DRVSSLMS and DLLAD algorithms. Instead of minimizing the estimate error, t… ▽ More

    Submitted 21 August, 2019; originally announced August 2019.

    Comments: 13 pages, 8 figures

  23. arXiv:1908.08165  [pdf

    eess.SP cs.IT

    Optimal step-size of least mean absolute fourth algorithm in low SNR

    Authors: Sihai Guan, Chun Meng, Bharat Biswal

    Abstract: There is a need to improve the capability of the adaptive filtering algorithm against Gaussian or multiple types of non-Gaussian noises, time-varying system, and systems with low SNR. In this paper, we propose an optimized least mean absolute fourth (OPLMF) algorithm, especially for a time-varying unknown system with low signal-noise-rate (SNR). The optimal step-size of OPLMF is obtained by minimi… ▽ More

    Submitted 21 August, 2019; originally announced August 2019.

  24. arXiv:1805.01307  [pdf

    eess.SY

    Convex Combination of Overlap-Save Frequency-Domain Adaptive Filters

    Authors: Sihai Guan, Zhi Li

    Abstract: In order to decrease the steady-state error and reduce the computational complexity and increase the ability to identify a large unknown system, a convex combination of overlap-save frequency-domain adaptive filters (COSFDAF) algorithm is proposed. From the articles available, most papers discuss convex combinations of adaptive-filter algorithms focusing on the time domain. Those algorithms show b… ▽ More

    Submitted 3 May, 2018; originally announced May 2018.

  25. arXiv:1805.01305  [pdf

    eess.SY

    Noise constrained least mean absolute third algorithm

    Authors: Sihai Guan, Zhi Li

    Abstract: The learning speed of an adaptive algorithm can be improved by properly constraining the cost function of the adaptive algorithm. Besides, the stabilization of the NCLMF algorithm is more complicated, whose stability depends solely on the input power of the adaptive filter and the NCLMF algorithm with unbounded repressors is not mean square stability even for a small value of the step-size. So, in… ▽ More

    Submitted 3 May, 2018; originally announced May 2018.