Skip to main content

Showing 1–8 of 8 results for author: Gonzalez, P

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.18247  [pdf, other

    eess.IV cs.CV cs.LG

    Generative artificial intelligence in ophthalmology: multimodal retinal images for the diagnosis of Alzheimer's disease with convolutional neural networks

    Authors: I. R. Slootweg, M. Thach, K. R. Curro-Tafili, F. D. Verbraak, F. H. Bouwman, Y. A. L. Pijnenburg, J. F. Boer, J. H. P. de Kwisthout, L. Bagheriye, P. J. González

    Abstract: Background/Aim. This study aims to predict Amyloid Positron Emission Tomography (AmyloidPET) status with multimodal retinal imaging and convolutional neural networks (CNNs) and to improve the performance through pretraining with synthetic data. Methods. Fundus autofluorescence, optical coherence tomography (OCT), and OCT angiography images from 328 eyes of 59 AmyloidPET positive subjects and 108 A… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.06160  [pdf, other

    eess.AS

    The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems

    Authors: Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May

    Abstract: The performance of deep neural network-based speech enhancement systems typically increases with the training dataset size. However, studies that investigated the effect of training dataset size on speech enhancement performance did not consider recent approaches, such as diffusion-based generative models. Diffusion models are typically trained with massive datasets for image generation tasks, but… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2312.04370  [pdf, other

    eess.AS cs.LG cs.SD

    Investigating the Design Space of Diffusion Models for Speech Enhancement

    Authors: Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May

    Abstract: Diffusion models are a new class of generative models that have shown outstanding performance in image generation literature. As a consequence, studies have attempted to apply diffusion models to other tasks, such as speech enhancement. A popular approach in adapting diffusion models to speech enhancement consists in modelling a progressive transformation between the clean and noisy speech signals… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  4. arXiv:2312.02683  [pdf, other

    eess.AS cs.LG cs.SD

    Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler

    Authors: Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May

    Abstract: Diffusion models are a new class of generative models that have recently been applied to speech enhancement successfully. Previous works have demonstrated their superior performance in mismatched conditions compared to state-of-the art discriminative models. However, this was investigated with a single database for training and another one for testing, which makes the results highly dependent on t… ▽ More

    Submitted 16 January, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024

  5. arXiv:2311.11742  [pdf, other

    eess.IV cs.CV

    Fuzzy Information Seeded Region Growing for Automated Lesions After Stroke Segmentation in MR Brain Images

    Authors: Mario Pascual González

    Abstract: In the realm of medical imaging, precise segmentation of stroke lesions from brain MRI images stands as a critical challenge with significant implications for patient diagnosis and treatment. Addressing this, our study introduces an innovative approach using a Fuzzy Information Seeded Region Growing (FISRG) algorithm. Designed to effectively delineate the complex and irregular boundaries of stroke… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 10 pages, 14 figures. Associated code and data available at: https://github.com/Mawio02/FISRG-for-Automated-Lesion-After-Stroke-Segmentation-in-MRI

    MSC Class: 92C55

  6. arXiv:2309.06183  [pdf, other

    eess.AS cs.LG cs.SD

    Assessing the Generalization Gap of Learning-Based Speech Enhancement Systems in Noisy and Reverberant Environments

    Authors: Philippe Gonzalez, Tommy Sonne Alstrøm, Tobias May

    Abstract: The acoustic variability of noisy and reverberant speech mixtures is influenced by multiple factors, such as the spectro-temporal characteristics of the target speaker and the interfering noise, the signal-to-noise ratio (SNR) and the room characteristics. This large variability poses a major challenge for learning-based speech enhancement systems, since a mismatch between the training and testing… ▽ More

    Submitted 8 November, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE/ACM TASLP

  7. On Batching Variable Size Inputs for Training End-to-End Speech Enhancement Systems

    Authors: Philippe Gonzalez, Tommy Sonne Alstrøm, Tobias May

    Abstract: The performance of neural network-based speech enhancement systems is primarily influenced by the model architecture, whereas training times and computational resource utilization are primarily affected by training parameters such as the batch size. Since noisy and reverberant speech mixtures can have different duration, a batching strategy is required to handle variable size inputs during trainin… ▽ More

    Submitted 31 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Accepted to ICASSP 2023

  8. arXiv:1906.05791  [pdf

    eess.SY physics.app-ph

    Modeling and Control of Combustion Phasing in Dual-Fuel Compression Ignition Engines

    Authors: Wenbo Sui, Jorge Pulpeiro González, Carrie M. Hall

    Abstract: Dual fuel engines can achieve high efficiencies and low emissions but also can encounter high cylinder-to-cylinder variations on multi-cylinder engines. In order to avoid these variations, they require a more complex method for combustion phasing control such as model-based control. Since the combustion process in these engines is complex, typical models of the system are complex as well and there… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

    Journal ref: J. Eng. Gas Turbines Power 141(5), 051005 (Nov 28, 2018)