Search | arXiv e-print repository

arXiv:2012.03860 [pdf, other]

doi 10.1121/10.0005314

Modeling the effects of dynamic range compression on signals in noise

Authors: Ryan M. Corey, Andrew C. Singer

Abstract: Hearing aids use dynamic range compression (DRC), a form of automatic gain control, to make quiet sounds louder and loud sounds quieter. Compression can improve listening comfort, but it can also cause distortion in noisy environments. It has been widely reported that DRC performs poorly in noise, but there has been little mathematical analysis of these distortion effects. This work introduces a m… ▽ More Hearing aids use dynamic range compression (DRC), a form of automatic gain control, to make quiet sounds louder and loud sounds quieter. Compression can improve listening comfort, but it can also cause distortion in noisy environments. It has been widely reported that DRC performs poorly in noise, but there has been little mathematical analysis of these distortion effects. This work introduces a mathematical model to study the behavior of DRC in noise. Using statistical assumptions about the signal envelopes, we define an effective compression function that models the compression applied to one signal in the presence of another. This framework is used to prove results about DRC that have been previously observed experimentally: that when DRC is applied to a mixture of signals, uncorrelated signal envelopes become negatively correlated; that the effective compression applied to each sound in a mixture is weaker than it would have been for the signal alone; and that compression can reduce the long-term signal-to-noise ratio in certain conditions. These theoretical results are supported by software experiments using recorded speech signals. △ Less

Submitted 7 December, 2020; originally announced December 2020.

arXiv:2008.04521 [pdf, other]

doi 10.1121/10.0002279

Acoustic effects of medical, cloth, and transparent face masks on speech signals

Authors: Ryan M. Corey, Uriah Jones, Andrew C. Singer

Abstract: Face masks muffle speech and make communication more difficult, especially for people with hearing loss. This study examines the acoustic attenuation caused by different face masks, including medical, cloth, and transparent masks, using a head-shaped loudspeaker and a live human talker. The results suggest that all masks attenuate frequencies above 1 kHz, that attenuation is greatest in front of t… ▽ More Face masks muffle speech and make communication more difficult, especially for people with hearing loss. This study examines the acoustic attenuation caused by different face masks, including medical, cloth, and transparent masks, using a head-shaped loudspeaker and a live human talker. The results suggest that all masks attenuate frequencies above 1 kHz, that attenuation is greatest in front of the talker, and that there is substantial variation between mask types, especially cloth masks with different materials and weaves. Transparent masks have poor acoustic performance compared to both medical and cloth masks. Most masks have little effect on lapel microphones, suggesting that existing sound reinforcement and assistive listening systems may be effective for verbal communication with masks. △ Less

Submitted 11 August, 2020; originally announced August 2020.

Journal ref: The Journal of the Acoustical Society of America, 148(4), pp. 2371-2375, Oct. 2020

arXiv:2004.11956 [pdf, other]

Binaural Audio Source Remixing with Microphone Array Listening Devices

Authors: Ryan M. Corey, Andrew C. Singer

Abstract: Augmented listening devices, such as hearing aids and augmented reality headsets, enhance human perception by changing the sounds that we hear. Microphone arrays can improve the performance of listening systems in noisy environments, but most array-based listening systems are designed to isolate a single sound source from a mixture. This work considers a source-remixing filter that alters the rela… ▽ More Augmented listening devices, such as hearing aids and augmented reality headsets, enhance human perception by changing the sounds that we hear. Microphone arrays can improve the performance of listening systems in noisy environments, but most array-based listening systems are designed to isolate a single sound source from a mixture. This work considers a source-remixing filter that alters the relative level of each source independently. Remixing rather than separating sounds can help to improve perceptual transparency: it causes less distortion to the signal spectrum and especially to the interaural cues that humans use to localize sounds in space. △ Less

Submitted 24 April, 2020; originally announced April 2020.

Comments: To appear at ICASSP 2020

arXiv:1912.05043 [pdf, other]

Motion-Tolerant Beamforming with Deformable Microphone Arrays

Authors: Ryan M. Corey, Andrew C. Singer

Abstract: Microphone arrays are usually assumed to have rigid geometries: the microphones may move with respect to the sound field but remain fixed relative to each other. However, many useful arrays, such as those in wearable devices, have sensors that can move relative to each other. We compare two approaches to beamforming with deformable microphone arrays: first, by explicitly tracking the geometry of t… ▽ More Microphone arrays are usually assumed to have rigid geometries: the microphones may move with respect to the sound field but remain fixed relative to each other. However, many useful arrays, such as those in wearable devices, have sensors that can move relative to each other. We compare two approaches to beamforming with deformable microphone arrays: first, by explicitly tracking the geometry of the array as it changes over time, and second, by designing a time-invariant beamformer based on the second-order statistics of the moving array. The time-invariant approach is shown to be appropriate when the motion of the array is small relative to the acoustic wavelengths of interest. The performance of the proposed beamforming system is demonstrated using a wearable microphone array on a moving human listener in a cocktail-party scenario. △ Less

Submitted 10 December, 2019; originally announced December 2019.

Comments: Presented at WASPAA 2019

arXiv:1912.05038 [pdf, other]

Cooperative Audio Source Separation and Enhancement Using Distributed Microphone Arrays and Wearable Devices

Authors: Ryan M. Corey, Matthew D. Skarha, Andrew C. Singer

Abstract: Augmented listening devices such as hearing aids often perform poorly in noisy and reverberant environments with many competing sound sources. Large distributed microphone arrays can improve performance, but data from remote microphones often cannot be used for delay-constrained real-time processing. We present a cooperative audio source separation and enhancement system that leverages wearable li… ▽ More Augmented listening devices such as hearing aids often perform poorly in noisy and reverberant environments with many competing sound sources. Large distributed microphone arrays can improve performance, but data from remote microphones often cannot be used for delay-constrained real-time processing. We present a cooperative audio source separation and enhancement system that leverages wearable listening devices and other microphone arrays spread around a room. The full distributed array is used to separate sound sources and estimate their statistics. Each listening device uses these statistics to design real-time binaural audio enhancement filters using its own local microphones. The system is demonstrated experimentally using 10 speech sources and 160 microphones in a large, reverberant room. △ Less

Submitted 10 December, 2019; originally announced December 2019.

Comments: To appear at CAMSAP 2019

arXiv:1903.02094 [pdf, other]

doi 10.1109/ICASSP.2019.8682733

Acoustic Impulse Responses for Wearable Audio Devices

Authors: Ryan M. Corey, Naoki Tsuda, Andrew C. Singer

Abstract: We present an open-access dataset of over 8000 acoustic impulse from 160 microphones spread across the body and affixed to wearable accessories. The data can be used to evaluate audio capture and array processing systems using wearable devices such as hearing aids, headphones, eyeglasses, jewelry, and clothing. We analyze the acoustic transfer functions of different parts of the body, measure the… ▽ More We present an open-access dataset of over 8000 acoustic impulse from 160 microphones spread across the body and affixed to wearable accessories. The data can be used to evaluate audio capture and array processing systems using wearable devices such as hearing aids, headphones, eyeglasses, jewelry, and clothing. We analyze the acoustic transfer functions of different parts of the body, measure the effects of clothing worn over microphones, compare measurements from a live human subject to those from a mannequin, and simulate the noise-reduction performance of several beamformers. The results suggest that arrays of microphones spread across the body are more effective than those confined to a single device. △ Less

Submitted 5 March, 2019; originally announced March 2019.

Comments: To appear at ICASSP 2019

Journal ref: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

arXiv:1808.00096 [pdf, other]

doi 10.1109/IWAENC.2018.8521260

Speech Separation Using Partially Asynchronous Microphone Arrays Without Resampling

Authors: Ryan M. Corey, Andrew C. Singer

Abstract: We consider the problem of separating speech sources captured by multiple spatially separated devices, each of which has multiple microphones and samples its signals at a slightly different rate. Most asynchronous array processing methods rely on sample rate offset estimation and resampling, but these offsets can be difficult to estimate if the sources or microphones are moving. We propose a sourc… ▽ More We consider the problem of separating speech sources captured by multiple spatially separated devices, each of which has multiple microphones and samples its signals at a slightly different rate. Most asynchronous array processing methods rely on sample rate offset estimation and resampling, but these offsets can be difficult to estimate if the sources or microphones are moving. We propose a source separation method that does not require offset estimation or signal resampling. Instead, we divide the distributed array into several synchronous subarrays. All arrays are used jointly to estimate the time-varying signal statistics, and those statistics are used to design separate time-varying spatial filters in each array. We demonstrate the method for speech mixtures recorded on both stationary and moving microphone arrays. △ Less

Submitted 31 July, 2018; originally announced August 2018.

Comments: To appear at the International Workshop on Acoustic Signal Enhancement (IWAENC 2018)

Journal ref: 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC)

arXiv:1808.00082 [pdf, other]

doi 10.1109/IWAENC.2018.8521263

Delay-Performance Tradeoffs in Causal Microphone Array Processing

Authors: Ryan M. Corey, Naoki Tsuda, Andrew C. Singer

Abstract: In real-time listening enhancement applications, such as hearing aid signal processing, sounds must be processed with no more than a few milliseconds of delay to sound natural to the listener. Listening devices can achieve better performance with lower delay by using microphone arrays to filter acoustic signals in both space and time. Here, we analyze the tradeoff between delay and squared-error p… ▽ More In real-time listening enhancement applications, such as hearing aid signal processing, sounds must be processed with no more than a few milliseconds of delay to sound natural to the listener. Listening devices can achieve better performance with lower delay by using microphone arrays to filter acoustic signals in both space and time. Here, we analyze the tradeoff between delay and squared-error performance of causal multichannel Wiener filters for microphone array noise reduction. We compute exact expressions for the delay-error curves in two special cases and present experimental results from real-world microphone array recordings. We find that delay-performance characteristics are determined by both the spatial and temporal correlation structures of the signals. △ Less

Submitted 31 July, 2018; originally announced August 2018.

Comments: To appear at the International Workshop on Acoustic Signal Enhancement (IWAENC 2018)

Journal ref: 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC)

Showing 1–8 of 8 results for author: Corey, R M