-
XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models
Authors:
Shashi Kumar,
Srikanth Madikeri,
Juan Zuluaga-Gomez,
Esaú Villatoro-Tello,
Iuliia Nigmatulina,
Petr Motlicek,
Manjunath K E,
Aravind Ganapathiraju
Abstract:
Self-supervised pretrained models exhibit competitive performance in automatic speech recognition on finetuning, even with limited in-domain supervised data for training. However, popular pretrained models are not suitable for streaming ASR because they are trained with full attention context. In this paper, we introduce XLSR-Transducer, where the XLSR-53 model is used as encoder in transducer set…
▽ More
Self-supervised pretrained models exhibit competitive performance in automatic speech recognition on finetuning, even with limited in-domain supervised data for training. However, popular pretrained models are not suitable for streaming ASR because they are trained with full attention context. In this paper, we introduce XLSR-Transducer, where the XLSR-53 model is used as encoder in transducer setup. Our experiments on the AMI dataset reveal that the XLSR-Transducer achieves 4% absolute WER improvement over Whisper large-v2 and 8% over a Zipformer transducer model trained from scratch.To enable streaming capabilities, we investigate different attention masking patterns in the self-attention computation of transformer layers within the XLSR-53 model. We validate XLSR-Transducer on AMI and 5 languages from CommonVoice under low-resource scenarios. Finally, with the introduction of attention sinks, we reduce the left context by half while achieving a relative 12% improvement in WER.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Photochemistry and Haze Formation
Authors:
Mandt K. E.,
Luspay-Kuti A.,
Cheng A.,
Jessup K. -L.,
Gao P
Abstract:
One of the many exciting revelations of the New Horizons flyby of Pluto was the observation of global haze layers at altitudes as high as 200 km in the visible wavelengths. This haze is produced in the upper atmosphere through photochemical processes, similar to the processes in Titan's atmosphere. As the haze particles grow in size and descend to the lower atmosphere, they coagulate and interact…
▽ More
One of the many exciting revelations of the New Horizons flyby of Pluto was the observation of global haze layers at altitudes as high as 200 km in the visible wavelengths. This haze is produced in the upper atmosphere through photochemical processes, similar to the processes in Titan's atmosphere. As the haze particles grow in size and descend to the lower atmosphere, they coagulate and interact with the gases in the atmosphere through condensation and sticking processes that serve as temporary and permanent loss processes. New Horizons observations confirm studies of Titan haze analogs suggesting that photochemically produced haze particles harden as they grow in size. We outline in this chapter what is known about the photochemical processes that lead to haze production and outline feedback processes resulting from the presence of haze in the atmosphere, connect this to the evolution of Pluto's atmosphere, and discuss open questions that need to be addressed in future work.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
A Proactive Flow Admission and Re-Routing Scheme for Load Balancing and Mitigation of Congestion Propagation in SDN Data Plane
Authors:
Sminesh C. N.,
Grace Mary Kanaga E.,
Ranjitha K
Abstract:
The centralized architecture in software-defined network (SDN) provides a global view of the underlying network, paving the way for enormous research in the area of SDN traffic engineering (SDN TE). This research focuses on the load balancing aspects of SDN TE, given that the existing reactive methods for data-plane load balancing eventually result in packet loss and proactive schemes for data pla…
▽ More
The centralized architecture in software-defined network (SDN) provides a global view of the underlying network, paving the way for enormous research in the area of SDN traffic engineering (SDN TE). This research focuses on the load balancing aspects of SDN TE, given that the existing reactive methods for data-plane load balancing eventually result in packet loss and proactive schemes for data plane load balancing do not address congestion propagation. In the proposed work, the SDN controller periodically monitors flow level statistics and utilization on each link in the network and over-utilized links that cause network congestion and packet loss are identified as bottleneck links. For load balancing the identified largest flow and further traffic through these bottleneck links are rerouted through the lightly-loaded alternate path. The proposed scheme models a Bayesian Network using the observed port utilization and residual bandwidth to decide whether the newly computed alternate path can handle the new flow load before flow admission which in turn reduces congestion propagation. The simulation results show that when the network traffic increases the proposed method efficiently re-routes the flows and balance the network load which substantially improves the network efficiency and the quality of service (QoS) parameters.
△ Less
Submitted 6 December, 2018;
originally announced December 2018.