-
OCTDL: Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods
Authors:
Mikhail Kulyabin,
Aleksei Zhdanov,
Anastasia Nikiforova,
Andrey Stepichev,
Anna Kuznetsova,
Mikhail Ronkin,
Vasilii Borisov,
Alexander Bogachev,
Sergey Korotkich,
Paul A Constable,
Andreas Maier
Abstract:
Optical coherence tomography (OCT) is a non-invasive imaging technique with extensive clinical applications in ophthalmology. OCT enables the visualization of the retinal layers, playing a vital role in the early detection and monitoring of retinal diseases. OCT uses the principle of light wave interference to create detailed images of the retinal microstructures, making it a valuable tool for diagnosing ocular conditions. This work presents an open-access OCT dataset (OCTDL) comprising over 2000 OCT images labeled according to disease group and retinal pathology. The dataset consists of OCT records of patients with Age-related Macular Degeneration (AMD), Diabetic Macular Edema (DME), Epiretinal Membrane (ERM), Retinal Artery Occlusion (RAO), Retinal Vein Occlusion (RVO), and Vitreomacular Interface Disease (VID). The images were acquired with an Optovue Avanti RTVue XR using raster scanning protocols with dynamic scan length and image resolution. Each retinal B-scan was acquired centered on the fovea, then interpreted and cataloged by an experienced retinal specialist. In this work, we applied Deep Learning classification techniques to this new open-access dataset.
Submitted 31 March, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement
Authors:
Anastasia Kuznetsova,
Aswin Sivaraman,
Minje Kim
Abstract:
With advances in deep learning, speech enhancement systems have benefited from large neural network architectures and achieved state-of-the-art quality. However, speaker-agnostic methods are not always desirable, in terms of both quality and complexity, when they are to be used in a resource-constrained environment. One promising alternative is personalized speech enhancement (PSE), which is a smaller and easier speech enhancement problem for small models to solve because it focuses on a particular test-time user. To achieve the personalization goal while dealing with the typical lack of personal data, we investigate the effect of data augmentation based on neural speech synthesis (NSS). We show that the quality of the NSS system's synthetic data matters: if it is good enough, the augmented dataset can be used to improve the PSE system so that it outperforms the speaker-agnostic baseline. The proposed PSE systems show significant complexity reduction while preserving the enhancement quality.
Submitted 14 November, 2022;
originally announced November 2022.
-
Curriculum optimization for low-resource speech recognition
Authors:
Anastasia Kuznetsova,
Anurag Kumar,
Jennifer Drexler Fox,
Francis Tyers
Abstract:
Modern end-to-end speech recognition models show astonishing results in transcribing audio signals into written text. However, conventional data feeding pipelines may be sub-optimal for low-resource speech recognition, which remains a challenging task. We propose an automated curriculum learning approach that optimizes the sequence of training examples based on both the progress of the model during training and prior knowledge about the difficulty of the training examples. We introduce a new difficulty measure called compression ratio that can be used as a scoring function for raw audio in various noise conditions. The proposed method yields a relative Word Error Rate improvement of up to 33% over the baseline system.
Submitted 17 February, 2022;
originally announced February 2022.
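The compression-ratio difficulty measure named in the abstract above can be illustrated with a minimal sketch. The abstract does not define the measure, so the definition used here (compressed size divided by raw size of 16-bit PCM audio, computed with zlib) is an assumption, as are the helper names `compression_ratio` and `to_pcm16`, the 16 kHz sampling rate, and the synthetic signals.

```python
import math
import random
import struct
import zlib


def compression_ratio(pcm_bytes: bytes) -> float:
    """Compressed size / raw size (assumed definition; higher suggests harder audio)."""
    if not pcm_bytes:
        raise ValueError("empty audio buffer")
    return len(zlib.compress(pcm_bytes, 9)) / len(pcm_bytes)


def to_pcm16(samples):
    """Pack floats in [-1, 1] as little-endian 16-bit PCM bytes."""
    clipped = (max(-1.0, min(1.0, s)) for s in samples)
    return b"".join(struct.pack("<h", int(c * 32767)) for c in clipped)


rng = random.Random(0)
n = 16000  # one second at an assumed 16 kHz sampling rate
tone = [0.5 * math.sin(2 * math.pi * 200 * t / n) for t in range(n)]
noisy = [s + rng.gauss(0.0, 0.3) for s in tone]  # same tone with added noise

clean_score = compression_ratio(to_pcm16(tone))
noisy_score = compression_ratio(to_pcm16(noisy))

# Order utterances from easy (highly compressible) to hard (barely compressible).
curriculum = sorted(
    [("noisy", noisy_score), ("clean", clean_score)], key=lambda item: item[1]
)
```

Under this reading, noisier recordings compress less well and receive a higher difficulty score, so a curriculum could present them later in training.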
-
A bandit approach to curriculum generation for automatic speech recognition
Authors:
Anastasia Kuznetsova,
Anurag Kumar,
Francis M. Tyers
Abstract:
The Automatic Speech Recognition (ASR) task is challenging, especially in low-data scenarios with few audio examples. This is the main problem in training ASR systems on data from low-resource or marginalized languages. In this paper we present an approach to mitigate the lack of training data by employing Automated Curriculum Learning in combination with an adversarial bandit approach inspired by reinforcement learning. The goal of the approach is to optimize the training sequence of mini-batches ranked by level of difficulty and to compare the ASR performance metrics against a random training sequence and a discrete curriculum. We test our approach on a truly low-resource language and show that the bandit framework improves on the baseline transfer-learning model.
Submitted 6 February, 2021;
originally announced February 2021.
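As an illustration of the adversarial bandit idea in the abstract above, the sketch below uses EXP3, a standard adversarial bandit algorithm. The abstract does not name the algorithm, so EXP3 is a stand-in; the `Exp3` class, the three difficulty levels, and the fixed `toy_reward` table are all hypothetical. In a real curriculum the reward would come from training progress, for example a normalized loss decrease.

```python
import math
import random


class Exp3:
    """EXP3 adversarial bandit; each arm stands for one difficulty level."""

    def __init__(self, n_arms: int, gamma: float = 0.1, seed: int = 0):
        self.n_arms = n_arms
        self.gamma = gamma  # exploration rate in (0, 1]
        self.weights = [1.0] * n_arms
        self.rng = random.Random(seed)

    def probabilities(self):
        """Mix the weight distribution with uniform exploration."""
        total = sum(self.weights)
        return [
            (1 - self.gamma) * w / total + self.gamma / self.n_arms
            for w in self.weights
        ]

    def select(self) -> int:
        """Sample the difficulty level for the next mini-batch."""
        return self.rng.choices(range(self.n_arms), weights=self.probabilities())[0]

    def update(self, arm: int, reward: float) -> None:
        """Update the chosen arm with a reward in [0, 1]."""
        estimate = reward / self.probabilities()[arm]  # importance-weighted reward
        self.weights[arm] *= math.exp(self.gamma * estimate / self.n_arms)
        largest = max(self.weights)  # rescale to keep weights bounded
        self.weights = [w / largest for w in self.weights]


bandit = Exp3(n_arms=3)
toy_reward = [0.2, 0.8, 0.2]  # hypothetical: the middle level helps most
for _ in range(2000):
    level = bandit.select()
    bandit.update(level, toy_reward[level])
probs = bandit.probabilities()
```

After the loop, `probs` concentrates on the level with the highest toy reward while the `gamma` term keeps a small amount of exploration on the others.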
-
Analysis Of Congestion Control In Data Channels With Frequent Frame Loss
Authors:
Yuri Monakhov,
Anna Kuznetsova
Abstract:
Development of optimal control procedures for congested networks is a key factor in maintaining efficient network utilization. The absence or failure of a congestion control mechanism can make certain network segments unavailable and, in severe cases, the entire network. The paper presents an analytical model describing the operation of the TCP Reno congestion control algorithm in terms of differential calculus and queuing systems. The purpose of this research is to explore ways of increasing the efficiency of virtual channel capacity utilization in a lossy environment.
Submitted 10 October, 2018;
originally announced October 2018.
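To make the effect of frame loss on TCP Reno concrete, here is a rough round-based simulation of its window dynamics (slow start, additive increase, halving on loss). It is not the analytical model from the paper above: the function name `simulate_reno`, the independent per-segment loss assumption, and all parameter values are illustrative, and retransmission timeouts are not modeled.

```python
import random


def simulate_reno(loss_prob: float, rounds: int = 5000,
                  ssthresh: float = 64.0, seed: int = 0) -> float:
    """Return the mean congestion window (in segments) over `rounds` RTTs."""
    rng = random.Random(seed)
    cwnd = 1.0
    total = 0.0
    for _ in range(rounds):
        total += cwnd
        # Probability that at least one of the cwnd segments in this RTT is lost.
        lost = rng.random() < 1.0 - (1.0 - loss_prob) ** cwnd
        if lost:
            ssthresh = max(cwnd / 2.0, 2.0)  # multiplicative decrease
            cwnd = ssthresh
        elif cwnd < ssthresh:
            cwnd = min(cwnd * 2.0, ssthresh)  # slow start: double per RTT
        else:
            cwnd += 1.0  # congestion avoidance: one segment per RTT
    return total / rounds


low_loss_window = simulate_reno(loss_prob=0.001)
high_loss_window = simulate_reno(loss_prob=0.05)
```

Comparing the two averages shows how frequent loss keeps the window small, and with it the share of channel capacity a Reno flow can use.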
-
Comment on "Analysis of a Charge-Pump PLL: A New Model" by M. van Paemel
Authors:
N. V. Kuznetsov,
M. V. Yuldashev,
R. V. Yuldashev,
M. V. Blagov,
E. V. Kudryashova,
O. A. Kuznetsova,
T. N. Mokaev
Abstract:
In this short communication we comment on the non-linear mathematical model of the CP-PLL introduced by M. van Paemel. We reveal and obviate shortcomings in the model.
Submitted 23 February, 2019; v1 submitted 5 October, 2018;
originally announced October 2018.
-
Hidden attractors in aircraft control systems with saturated inputs
Authors:
B. R. Andrievsky,
E. V. Kudryashova,
N. V. Kuznetsov,
O. A. Kuznetsova,
G. A. Leonov
Abstract:
This paper studies the control problem with limitations on the magnitude and rate of the control action in aircraft control systems. The existence of hidden oscillations under actuator position and rate limitations is demonstrated by the examples of the pilot-involved oscillations (PIO) phenomenon in piloted aircraft and of an airfoil flutter suppression system.
Submitted 23 November, 2017;
originally announced November 2017.