Search | arXiv e-print repository

doi 10.1038/s41597-024-03182-7

OCTDL: Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods

Authors: Mikhail Kulyabin, Aleksei Zhdanov, Anastasia Nikiforova, Andrey Stepichev, Anna Kuznetsova, Mikhail Ronkin, Vasilii Borisov, Alexander Bogachev, Sergey Korotkich, Paul A Constable, Andreas Maier

Abstract: Optical coherence tomography (OCT) is a non-invasive imaging technique with extensive clinical applications in ophthalmology. OCT enables the visualization of the retinal layers, playing a vital role in the early detection and monitoring of retinal diseases. OCT uses the principle of light wave interference to create detailed images of the retinal microstructures, making it a valuable tool for dia… ▽ More Optical coherence tomography (OCT) is a non-invasive imaging technique with extensive clinical applications in ophthalmology. OCT enables the visualization of the retinal layers, playing a vital role in the early detection and monitoring of retinal diseases. OCT uses the principle of light wave interference to create detailed images of the retinal microstructures, making it a valuable tool for diagnosing ocular conditions. This work presents an open-access OCT dataset (OCTDL) comprising over 2000 OCT images labeled according to disease group and retinal pathology. The dataset consists of OCT records of patients with Age-related Macular Degeneration (AMD), Diabetic Macular Edema (DME), Epiretinal Membrane (ERM), Retinal Artery Occlusion (RAO), Retinal Vein Occlusion (RVO), and Vitreomacular Interface Disease (VID). The images were acquired with an Optovue Avanti RTVue XR using raster scanning protocols with dynamic scan length and image resolution. Each retinal b-scan was acquired by centering on the fovea and interpreted and cataloged by an experienced retinal specialist. In this work, we applied Deep Learning classification techniques to this new open-access dataset. △ Less

Submitted 31 March, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

arXiv:2211.07493 [pdf, ps, other]

The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement

Authors: Anastasia Kuznetsova, Aswin Sivaraman, Minje Kim

Abstract: With the advances in deep learning, speech enhancement systems benefited from large neural network architectures and achieved state-of-the-art quality. However, speaker-agnostic methods are not always desirable, both in terms of quality and their complexity, when they are to be used in a resource-constrained environment. One promising way is personalized speech enhancement (PSE), which is a smalle… ▽ More With the advances in deep learning, speech enhancement systems benefited from large neural network architectures and achieved state-of-the-art quality. However, speaker-agnostic methods are not always desirable, both in terms of quality and their complexity, when they are to be used in a resource-constrained environment. One promising way is personalized speech enhancement (PSE), which is a smaller and easier speech enhancement problem for small models to solve, because it focuses on a particular test-time user. To achieve the personalization goal, while dealing with the typical lack of personal data, we investigate the effect of data augmentation based on neural speech synthesis (NSS). In the proposed method, we show that the quality of the NSS system's synthetic data matters, and if they are good enough the augmented dataset can be used to improve the PSE system that outperforms the speaker-agnostic baseline. The proposed PSE systems show significant complexity reduction while preserving the enhancement quality. △ Less

Submitted 14 November, 2022; originally announced November 2022.

arXiv:2202.08883 [pdf, other]

Curriculum optimization for low-resource speech recognition

Authors: Anastasia Kuznetsova, Anurag Kumar, Jennifer Drexler Fox, Francis Tyers

Abstract: Modern end-to-end speech recognition models show astonishing results in transcribing audio signals into written text. However, conventional data feeding pipelines may be sub-optimal for low-resource speech recognition, which still remains a challenging task. We propose an automated curriculum learning approach to optimize the sequence of training examples based on both the progress of the model wh… ▽ More Modern end-to-end speech recognition models show astonishing results in transcribing audio signals into written text. However, conventional data feeding pipelines may be sub-optimal for low-resource speech recognition, which still remains a challenging task. We propose an automated curriculum learning approach to optimize the sequence of training examples based on both the progress of the model while training and prior knowledge about the difficulty of the training examples. We introduce a new difficulty measure called compression ratio that can be used as a scoring function for raw audio in various noise conditions. The proposed method improves speech recognition Word Error Rate performance by up to 33% relative over the baseline system △ Less

Submitted 17 February, 2022; originally announced February 2022.

arXiv:2102.03662 [pdf, other]

A bandit approach to curriculum generation for automatic speech recognition

Authors: Anastasia Kuznetsova, Anurag Kumar, Francis M. Tyers

Abstract: The Automated Speech Recognition (ASR) task has been a challenging domain especially for low data scenarios with few audio examples. This is the main problem in training ASR systems on the data from low-resource or marginalized languages. In this paper we present an approach to mitigate the lack of training data by employing Automated Curriculum Learning in combination with an adversarial bandit a… ▽ More The Automated Speech Recognition (ASR) task has been a challenging domain especially for low data scenarios with few audio examples. This is the main problem in training ASR systems on the data from low-resource or marginalized languages. In this paper we present an approach to mitigate the lack of training data by employing Automated Curriculum Learning in combination with an adversarial bandit approach inspired by Reinforcement learning. The goal of the approach is to optimize the training sequence of mini-batches ranked by the level of difficulty and compare the ASR performance metrics against the random training sequence and discrete curriculum. We test our approach on a truly low-resource language and show that the bandit framework has a good improvement over the baseline transfer-learning model. △ Less

Submitted 6 February, 2021; originally announced February 2021.

arXiv:1810.04470 [pdf, other]

Analysis Of Congestion Control In Data Channels With Frequent Frame Loss

Authors: Yuri Monakhov, Anna Kuznetsova

Abstract: Development of optimal control procedures for congested networks is a key factor in maintaining efficient network utilization. The absence of congestion control mechanism or its failure can lead to the lack of availability for certain network segments, and in severe cases -- for the entire network. The paper presents an analytical model describing the operation of the TCP Reno congestion control a… ▽ More Development of optimal control procedures for congested networks is a key factor in maintaining efficient network utilization. The absence of congestion control mechanism or its failure can lead to the lack of availability for certain network segments, and in severe cases -- for the entire network. The paper presents an analytical model describing the operation of the TCP Reno congestion control algorithm in terms of differential calculus and queuing systems. The purpose of this research is to explore the possibilities and ways of increasing the virtual channel capacity utilization efficiency in a lossy environment. △ Less

Submitted 10 October, 2018; originally announced October 2018.

Comments: 5 pages, 4 figures, 2nd European Conference on Electrical Engineering & Computer Science EECS 2018: Bern, Switzerland, December 20-22, 2018

MSC Class: 94C99; 68M12; 68M20; 90B25 ACM Class: C.2.2; C.2.3; C.4

arXiv:1810.02609 [pdf, other]

Comment on "Analysis of a Charge-Pump PLL: A New Model" by M. van Paemel

Authors: N. V. Kuznetsov, M. V. Yuldashev, R. V. Yuldashev, M. V. Blagov, E. V. Kudryashova, O. A. Kuznetsova, T. N. Mokaev

Abstract: In this short communication we comment on the non-linear mathematical model of CP-PLL introduced by V.Paemel. We reveal and obviate shortcomings in the model. In this short communication we comment on the non-linear mathematical model of CP-PLL introduced by V.Paemel. We reveal and obviate shortcomings in the model. △ Less

Submitted 23 February, 2019; v1 submitted 5 October, 2018; originally announced October 2018.

Comments: arXiv admin note: substantial text overlap with arXiv:1901.01468

arXiv:1711.10302 [pdf, other]

Hidden attractors in aircraft control systems with saturated inputs

Authors: B. R. Andrievsky, E. V. Kudryashova, N. V. Kuznetsov, O. A. Kuznetsova, G. A. Leonov

Abstract: In the paper, the control problem with limitations on the magnitude and rate of the control action in aircraft control systems, is studied. Existence of hidden oscillations in the case of actuator position and rate limitations is demonstrated by the examples of piloted aircraft pilot involved oscillations (PIO) phenomenon and the airfoil flutter suppression system. In the paper, the control problem with limitations on the magnitude and rate of the control action in aircraft control systems, is studied. Existence of hidden oscillations in the case of actuator position and rate limitations is demonstrated by the examples of piloted aircraft pilot involved oscillations (PIO) phenomenon and the airfoil flutter suppression system. △ Less

Submitted 23 November, 2017; originally announced November 2017.

Showing 1–7 of 7 results for author: Kuznetsova, A