Unsupervised Opinion Aggregation -- A Statistical Perspective

Authors: Noyan C. Sevuktekin, Andrew C. Singer

Abstract: Complex decision-making systems rarely have direct access to the current state of the world; instead, they rely on opinions to form an understanding of what the ground truth could be. Even in problems where experts provide opinions without any intention to manipulate the decision maker, it is challenging to decide which expert's opinion is more reliable -- a challenge that is further amplified when the decision maker has limited, delayed, or no access to the ground truth after the fact. This paper explores a statistical approach to infer the competence of each expert based on their opinions without any need for the ground truth. Echoing the logic behind what is commonly referred to as the wisdom of crowds, we propose measuring the competence of each expert by their likelihood of agreeing with their peers. We further show that the more reliable an expert is, the more likely it is that they agree with their peers. We leverage this fact to propose a completely unsupervised version of the naïve Bayes classifier and show that the proposed technique is asymptotically optimal for a large class of problems. In addition to aggregating a large block of opinions, we further apply our technique to online opinion aggregation and to decision-making based on a limited number of opinions.

Submitted 20 August, 2023; originally announced August 2023.

Comments: This research was conducted during Noyan Sevuktekin's time at University of Illinois at Urbana-Champaign and the results were first presented in Chapter 3 of his dissertation, entitled "Learning From Opinions". Permalink: https://hdl.handle.net/2142/110814
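
The agreement idea in the abstract above can be illustrated with a toy simulation. This is only a sketch under assumptions that are not taken from the paper: binary opinions, experts whose errors are independent given the hidden truth, and hypothetical competence values. The names `simulate_opinions` and `peer_agreement` are illustrative, and the score below is a plain average agreement rate, not necessarily the estimator the authors propose.

```python
import random


def simulate_opinions(competences, n_tasks, seed=0):
    """Draw a hidden binary truth per task; expert i reports it
    correctly with probability competences[i] (assumed model)."""
    rng = random.Random(seed)
    opinions = []
    for _ in range(n_tasks):
        truth = rng.randint(0, 1)
        row = [truth if rng.random() < p else 1 - truth for p in competences]
        opinions.append(row)
    return opinions


def peer_agreement(opinions):
    """Fraction of (task, peer) pairs on which each expert agrees
    with a peer. The ground truth is never consulted."""
    n_experts = len(opinions[0])
    counts = [0] * n_experts
    for row in opinions:
        ones = sum(row)
        for i, x in enumerate(row):
            # Experts sharing expert i's opinion on this task, minus expert i.
            same = ones if x == 1 else n_experts - ones
            counts[i] += same - 1
    denom = len(opinions) * (n_experts - 1)
    return [c / denom for c in counts]


# Hypothetical competences, chosen only for illustration.
competences = [0.6, 0.7, 0.8, 0.9]
scores = peer_agreement(simulate_opinions(competences, n_tasks=20000))
# Experts sorted from lowest to highest agreement score.
ranking = sorted(range(len(scores)), key=scores.__getitem__)
print(scores, ranking)
```

Under these assumptions, with every expert better than chance, the expected agreement score increases with competence, so sorting by score should recover the competence ordering without any access to the truth.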

arXiv:2305.19127 [pdf, other]

doi 10.1109/UComms56954.2022.9905690

Online Segmented Recursive Least-Squares for Multipath Doppler Tracking

Authors: Jae Won Choi, Girish Chowdhary, Andrew C. Singer, Hari Vishnu, Amir Weiss, Gregory W. Wornell, Grant Deane

Abstract: Underwater communication signals typically suffer from distortion due to motion-induced Doppler. Especially in shallow water environments, recovering the signal is challenging due to the time-varying Doppler effects distorting each path differently. However, conventional Doppler estimation algorithms typically model uniform Doppler across all paths and often fail to provide robust Doppler tracking in multipath environments. In this paper, we propose a dynamic programming-inspired method, called online segmented recursive least-squares (OSRLS), to sequentially estimate the time-varying non-uniform Doppler across different multipath arrivals. By approximating the non-linear time distortion as a piecewise-linear Markov model, we formulate the problem in a dynamic programming framework known as segmented least-squares (SLS). In order to circumvent an ill-conditioned formulation, perturbations are added to the Doppler model during the linearization process. The successful operation of the algorithm is demonstrated in a simulation on a synthetic channel with time-varying non-uniform Doppler.

Submitted 30 May, 2023; originally announced May 2023.

arXiv:2305.17920 [pdf, other]

doi 10.1109/ICASSP49357.2023.10094981

Towards Robust Data-Driven Underwater Acoustic Localization: A Deep CNN Solution with Performance Guarantees for Model Mismatch

Authors: Amir Weiss, Andrew C. Singer, Gregory W. Wornell

Abstract: Key challenges in developing underwater acoustic localization methods are related to the combined effects of high reverberation in intricate environments. To address such challenges, recent studies have shown that with a properly designed architecture, neural networks can lead to unprecedented localization capabilities and enhanced accuracy. However, the robustness of such methods to environmental mismatch is typically hard to characterize, and is usually assessed only empirically. In this work, we consider the recently proposed data-driven method [19] based on a deep convolutional neural network, and demonstrate that it can learn to localize in complex and mismatched environments. To explain this robustness, we provide an upper bound on the localization mean squared error (MSE) in the "true" environment, in terms of the MSE in a "presumed" environment and an additional penalty term related to the environmental discrepancy. Our theoretical results are corroborated via simulation results in a rich, highly reverberant, and mismatched channel.

Submitted 29 May, 2023; originally announced May 2023.

arXiv:2210.09489 [pdf]

Through Tissue Ultra-high-definition Video Transmission Using an Ultrasound Communication Channel

Authors: Zhengchang Kou, Andrew C. Singer, Michael L. Oelze

Abstract: Wireless capsule endoscopy (WCE) has been widely adopted as complementary to traditional wired gastroendoscopy, especially for small bowel diseases which are beyond the latter's reach. However, both the video resolution and frame rates are limited in current WCE solutions due to the limited wireless data rate. The reasons behind this are that the electromagnetic (EM), radio frequency (RF) based communication scheme used by WCE has strict limits on usable bandwidth and power, and that attenuation in the human body is high compared to air. Ultrasound communication could be a potential alternative solution as it has access to much higher bandwidths and transmitted power with much lower attenuation. In this paper, we propose an ultrasound communication scheme specially designed for high-data-rate, through-tissue data transmission and validate this communication scheme by successfully transmitting ultra-high-definition (UHD) video (3840 × 2160 pixels at 60 FPS) through 5 cm of pork belly. An error-free payload data rate of over 8.3 Mbps was achieved with the proposed communication scheme and our custom-built field programmable gate array (FPGA) based test platform.

Submitted 17 October, 2022; originally announced October 2022.

arXiv:2110.14767 [pdf, other]

doi 10.1109/TSP.2022.3173731

A Semi-Blind Method for Localization of Underwater Acoustic Sources

Authors: Amir Weiss, Toros Arikan, Hari Vishnu, Grant B. Deane, Andrew C. Singer, Gregory W. Wornell

Abstract: Underwater acoustic localization has traditionally been challenging due to the presence of unknown environmental structure and dynamic conditions. The problem is richer still when such structure includes occlusion, which causes the loss of line-of-sight (LOS) between the acoustic source and the receivers, on which many of the existing localization algorithms rely. We develop a semi-blind passive localization method capable of accurately estimating the source's position even in the possible absence of LOS between the source and all receivers. Based on typically-available prior knowledge of the water surface and bottom, we derive a closed-form expression for the optimal estimator under a multi-ray propagation model, which is suitable for shallow-water environments and high-frequency signals. By exploiting a computationally efficient form of this estimator, our methodology makes comparatively high-resolution localization feasible. We also derive the Cramér-Rao bound for this model, which can be used to guide the placement of collections of receivers so as to optimize localization accuracy. The method offers an improved balance of accuracy and robustness to environmental model mismatch relative to existing localization methods that are useful in similar settings. The method is validated with simulations and water tank experiments.

Submitted 2 February, 2023; v1 submitted 27 October, 2021; originally announced October 2021.

arXiv:2106.13655 [pdf, other]

Video-Streaming Biomedical Implants using Ultrasonic Waves for Communication

Authors: Gizem Tabak, Jae Won Choi, Rita J. Miller, Michael L. Oelze, Andrew C. Singer

Abstract: The use of wireless implanted medical devices (IMDs) is growing because they facilitate continuous monitoring of patients during normal activities, simplify medical procedures required for data retrieval and reduce the likelihood of infection associated with trailing wires. However, most of the state-of-the-art IMDs are passive and offline devices. One of the key obstacles to an active and online IMD is the infeasibility of real-time, high-quality video broadcast from the IMD. Such broadcast would help develop innovative devices such as a video-streaming capsule endoscopy (CE) pill with therapeutic intervention capabilities. State-of-the-art IMDs employ radio-frequency electromagnetic waves for information transmission. However, high attenuation of RF-EM waves in tissues and federal restrictions on the transmit power and operable bandwidth lead to fundamental performance constraints for IMDs employing RF links, and prevent achieving high data rates that could accommodate video broadcast. In this work, ultrasonic waves were used for video transmission and broadcast through biological tissues. The proposed proof-of-concept system was tested on a porcine intestine ex vivo and a rabbit in vivo. It was demonstrated that using a millimeter-sized, implanted biocompatible transducer operating at 1.1-1.2 MHz, it was possible to transmit endoscopic video with high resolution (1280 pixels by 720 pixels) through porcine intestine wrapped with bacon, and to broadcast standard definition (640 pixels by 480 pixels) video near real-time through rabbit abdomen in vivo. A media repository that includes experimental demonstrations and media files accompanies this paper. The accompanying media repository can be found at this link: https://bit.ly/3wuc7tk.

Submitted 27 June, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

Comments: arXiv admin note: text overlap with arXiv:1909.13172

arXiv:2104.01078 [pdf, other]

Blind Exploration and Exploitation of Stochastic Experts

Authors: Noyan C. Sevuktekin, Andrew C. Singer

Abstract: We present blind exploration and exploitation (BEE) algorithms for identifying the most reliable stochastic expert based on formulations that employ posterior sampling, upper-confidence bounds, empirical Kullback-Leibler divergence, and minimax methods for the stochastic multi-armed bandit problem. Joint sampling and consultation of experts whose opinions depend on the hidden and random state of the world becomes challenging in the unsupervised, or blind, framework as feedback from the true state is not available. We propose an empirically realizable measure of expert competence that can be inferred instantaneously using only the opinions of other experts. This measure preserves the ordering of true competences and thus enables joint sampling and consultation of stochastic experts based on their opinions on dynamically changing tasks. Statistics derived from the proposed measure are instantaneously available, allowing both blind exploration-exploitation and unsupervised opinion aggregation. We discuss how the lack of supervision affects the asymptotic regret of BEE architectures that rely on UCB1, KL-UCB, MOSS, IMED, and Thompson sampling. We demonstrate the performance of different BEE algorithms empirically and compare them to their standard, or supervised, counterparts.

Submitted 2 April, 2021; originally announced April 2021.

arXiv:2103.11261 [pdf, other]

High Data Rate Near-Ultrasonic Communication with Consumer Devices

Authors: Gizem Tabak, Xintian Eddie Lin, Andrew C. Singer

Abstract: Automating device pairing and credential exchange in consumer devices reduces the time users spend on mundane tasks and improves the user experience. Acoustic communication is gaining traction as a practical alternative to Bluetooth or Wi-Fi because it can enable quick and localized information transfer between consumer devices with built-in hardware. However, achieving high data rates (>1 kbps) in such systems has been a challenge because the systems and methods chosen for communication were not tailored to the application. In this work, a high data rate, near-ultrasonic communication (NUSC) system is proposed to transfer personal identification numbers (PINs) to establish a connection between consumer laptops using built-in microphones and speakers. The similarities between indoor near-ultrasonic and underwater acoustic communication (UWAC) channels are identified, and appropriate UWAC techniques are tailored to the NUSC system. The proposed system uses the near-ultrasonic band at 18-20 kHz, and employs coherent modulation and phase-coherent adaptive equalization. The capability of the proposed system is explored in simulated and field experiments that span different device orientations and distances. The experiments demonstrate data rates of 4 kbps over distances of up to 5 meters, which is an order of magnitude higher than the data rates reported with similar systems in the literature.

Submitted 27 May, 2021; v1 submitted 20 March, 2021; originally announced March 2021.

Comments: Accepted for presentation at EUSIPCO '21

arXiv:2012.03860 [pdf, other]

doi 10.1121/10.0005314

Modeling the effects of dynamic range compression on signals in noise

Authors: Ryan M. Corey, Andrew C. Singer

Abstract: Hearing aids use dynamic range compression (DRC), a form of automatic gain control, to make quiet sounds louder and loud sounds quieter. Compression can improve listening comfort, but it can also cause distortion in noisy environments. It has been widely reported that DRC performs poorly in noise, but there has been little mathematical analysis of these distortion effects. This work introduces a mathematical model to study the behavior of DRC in noise. Using statistical assumptions about the signal envelopes, we define an effective compression function that models the compression applied to one signal in the presence of another. This framework is used to prove results about DRC that have been previously observed experimentally: that when DRC is applied to a mixture of signals, uncorrelated signal envelopes become negatively correlated; that the effective compression applied to each sound in a mixture is weaker than it would have been for the signal alone; and that compression can reduce the long-term signal-to-noise ratio in certain conditions. These theoretical results are supported by software experiments using recorded speech signals.

Submitted 7 December, 2020; originally announced December 2020.

arXiv:2009.13683 [pdf]

doi 10.1109/tbme.2021.3070477

Real-time video streaming in vivo using ultrasound as the communication channel

Authors: Zhengchang Kou, Rita J. Miller, Andrew C. Singer, Michael L. Oelze

Abstract: The emergence of capsule endoscopy has provided a means of capturing video of the small intestines without having to resort to an invasive procedure involving intubation. However, real-time video streaming to a receiver outside the body remains challenging for capsule endoscopy. Traditional electromagnetic-based solutions are limited in their data rates and available power. Recently, ultrasound was investigated as a communication channel for through-tissue data transmission. To achieve real-time video streaming through tissue, data rates of ultrasound need to exceed 1 Mbps. In a previous study, we demonstrated ultrasound communications with data rates greater than 30 Mbps with two focused ultrasound transducers using a large footprint laboratory system through slabs of lossy tissues [1]. While the form factor of the transmitter is also crucial for capsule endoscopy, it is obvious that a large, focused transducer cannot fit within the size of a capsule. Several other challenges for achieving high-speed ultrasonic communication through tissue include strong reflections leading to multipath effects and attenuation. In this work, we demonstrate ultrasonic video communications using a mm-scale microcrystal transmitter with video streaming supplied by a camera connected to a Field Programmable Gate Array (FPGA). The signals were transmitted through a tissue-mimicking phantom and through the abdomen of a rabbit in vivo. The ultrasound signal was recorded by an array probe connected to a Verasonics Vantage system and decoded back to video. To improve the received signal quality, we combined the signal from multiple channels of the array probe. Orthogonal frequency division multiplexing (OFDM) modulation was used to reduce the receiver complexity under a strong multipath environment.

Submitted 28 September, 2020; originally announced September 2020.

arXiv:2008.04521 [pdf, other]

doi 10.1121/10.0002279

Acoustic effects of medical, cloth, and transparent face masks on speech signals

Authors: Ryan M. Corey, Uriah Jones, Andrew C. Singer

Abstract: Face masks muffle speech and make communication more difficult, especially for people with hearing loss. This study examines the acoustic attenuation caused by different face masks, including medical, cloth, and transparent masks, using a head-shaped loudspeaker and a live human talker. The results suggest that all masks attenuate frequencies above 1 kHz, that attenuation is greatest in front of the talker, and that there is substantial variation between mask types, especially cloth masks with different materials and weaves. Transparent masks have poor acoustic performance compared to both medical and cloth masks. Most masks have little effect on lapel microphones, suggesting that existing sound reinforcement and assistive listening systems may be effective for verbal communication with masks.

Submitted 11 August, 2020; originally announced August 2020.

Journal ref: The Journal of the Acoustical Society of America, 148(4), pp. 2371-2375, Oct. 2020

arXiv:2006.03664 [pdf, other]

doi 10.1109/TBCAS.2020.3020702

Low-Complexity System and Algorithm for an Emergency Ventilator Sensor and Alarm

Authors: Ryan M. Corey, Evan M. Widloski, David Null, Brian Ricconi, Mark Johnson, Karen White, Jennifer R. Amos, Alex Pagano, Michael Oelze, Rachel Switzky, Matthew B. Wheeler, Eliot Bethke, Clifford Shipley, Andrew C. Singer

Abstract: In response to the shortage of ventilators caused by the COVID-19 pandemic, many organizations have designed low-cost emergency ventilators. Many of these devices are pressure-cycled pneumatic ventilators, which are easy to produce but often do not include the sensing or alarm features found on commercial ventilators. This work reports a low-cost, easy-to-produce electronic sensor and alarm system for pressure-cycled ventilators that estimates clinically useful metrics such as pressure and respiratory rate and sounds an alarm when the ventilator malfunctions. A low-complexity signal processing algorithm uses a pair of nonlinear recursive envelope trackers to monitor the signal from an electronic pressure sensor connected to the patient airway. The algorithm, inspired by those used in hearing aids, requires little memory and performs only a few calculations on each sample so that it can run on nearly any microcontroller.

Submitted 5 June, 2020; originally announced June 2020.

Comments: Open-source hardware and software: https://rapidalarm.github.io/

Journal ref: IEEE Transactions on Biomedical Circuits and Systems 14(5), Oct. 2020
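
The "pair of nonlinear recursive envelope trackers" mentioned in the abstract above can be pictured with a generic attack/release follower of the kind used in audio dynamics processing. The sketch below is an assumption-laden illustration, not the algorithm from the paper or its open-source repository: the function name `track_envelopes`, the smoothing constants, and the square-wave test signal are all hypothetical.

```python
def track_envelopes(samples, attack=0.5, release=0.01):
    """Return (low, high) envelope estimates for each sample.

    Each tracker is a one-pole recursive filter whose coefficient
    depends on the direction of the error, which makes it nonlinear:
    the high tracker rises quickly and decays slowly, and the low
    tracker falls quickly and rises slowly. The state is two numbers.
    """
    high = low = samples[0]
    out = []
    for x in samples:
        # Upper envelope: fast when the signal exceeds it, slow otherwise.
        a_high = attack if x > high else release
        high += a_high * (x - high)
        # Lower envelope: fast when the signal drops below it, slow otherwise.
        a_low = attack if x < low else release
        low += a_low * (x - low)
        out.append((low, high))
    return out


# Hypothetical pressure waveform: 50 samples at 20 units, then 50 at 5 units.
pressure = ([20.0] * 50 + [5.0] * 50) * 10
envelopes = track_envelopes(pressure)
peak_estimate = envelopes[949][1]      # end of the last high phase
baseline_estimate = envelopes[999][0]  # end of the last low phase
print(peak_estimate, baseline_estimate)
```

Each sample costs two comparisons and two multiply-adds, which is consistent with the abstract's point that such trackers need little memory and few calculations per sample. How the envelopes map to clinical metrics or alarm conditions is not specified here and would follow the paper.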

arXiv:2004.11956 [pdf, other]

Binaural Audio Source Remixing with Microphone Array Listening Devices

Authors: Ryan M. Corey, Andrew C. Singer

Abstract: Augmented listening devices, such as hearing aids and augmented reality headsets, enhance human perception by changing the sounds that we hear. Microphone arrays can improve the performance of listening systems in noisy environments, but most array-based listening systems are designed to isolate a single sound source from a mixture. This work considers a source-remixing filter that alters the relative level of each source independently. Remixing rather than separating sounds can help to improve perceptual transparency: it causes less distortion to the signal spectrum and especially to the interaural cues that humans use to localize sounds in space.

Submitted 24 April, 2020; originally announced April 2020.

Comments: To appear at ICASSP 2020

arXiv:1912.05043 [pdf, other]

Motion-Tolerant Beamforming with Deformable Microphone Arrays

Authors: Ryan M. Corey, Andrew C. Singer

Abstract: Microphone arrays are usually assumed to have rigid geometries: the microphones may move with respect to the sound field but remain fixed relative to each other. However, many useful arrays, such as those in wearable devices, have sensors that can move relative to each other. We compare two approaches to beamforming with deformable microphone arrays: first, by explicitly tracking the geometry of the array as it changes over time, and second, by designing a time-invariant beamformer based on the second-order statistics of the moving array. The time-invariant approach is shown to be appropriate when the motion of the array is small relative to the acoustic wavelengths of interest. The performance of the proposed beamforming system is demonstrated using a wearable microphone array on a moving human listener in a cocktail-party scenario.

Submitted 10 December, 2019; originally announced December 2019.

Comments: Presented at WASPAA 2019

arXiv:1912.05038 [pdf, other]

Cooperative Audio Source Separation and Enhancement Using Distributed Microphone Arrays and Wearable Devices

Authors: Ryan M. Corey, Matthew D. Skarha, Andrew C. Singer

Abstract: Augmented listening devices such as hearing aids often perform poorly in noisy and reverberant environments with many competing sound sources. Large distributed microphone arrays can improve performance, but data from remote microphones often cannot be used for delay-constrained real-time processing. We present a cooperative audio source separation and enhancement system that leverages wearable listening devices and other microphone arrays spread around a room. The full distributed array is used to separate sound sources and estimate their statistics. Each listening device uses these statistics to design real-time binaural audio enhancement filters using its own local microphones. The system is demonstrated experimentally using 10 speech sources and 160 microphones in a large, reverberant room.

Submitted 10 December, 2019; originally announced December 2019.

Comments: To appear at CAMSAP 2019

arXiv:1905.09940 [pdf, other]

On the Reusability of Post-Experimental Field Data for Underwater Acoustic Communications R&D

Authors: Sijung Yang, Grant Deane, James C. Preisig, Noyan C. Sevüktekin, Jae W. Choi, Andrew C. Singer

Abstract: Field data are often expensive to collect, time-consuming to prepare to collect, and even more time-consuming to process after the experiment has concluded. However, it is often the practice that such data are used for little after the funded research activity that was concomitant with the experiment is completed. Immutability of the original experimental configuration results either in re-gathering of expensive field data or, in the absence of such data, in model-dependent analysis that partially captures the real-world dynamics. For underwater acoustic research and development, the standard communication pipeline might be modified to enable greater reusability of experimental field data. This paper first characterizes the necessary modifications to the standard communication pipeline to prepare signals for transmission and subsequent recording such that research trades for different modulation and coding schemes may be undertaken post-experiment, without the need for re-transmission of additional waveforms. Then, using the modified mathematical framework, sufficient conditions for reliable post-experimental replay of the environment are recognized. Finally, techniques are discussed to collect sufficient environmental statistics such that subsequent research can be accomplished long after the experiment has been completed, and that results from a given experiment may be reasonably compared with those of another. Examples are provided using both synthetic and experimental data collected from at-sea field tests.

Submitted 23 May, 2019; originally announced May 2019.

Comments: The manuscript is 39 pages long, including 17 figures and 2 tables. The manuscript was submitted to the IEEE Journal of Oceanic Engineering in Jan 2019 and is under review

arXiv:1903.02094 [pdf, other]

doi 10.1109/ICASSP.2019.8682733

Acoustic Impulse Responses for Wearable Audio Devices

Authors: Ryan M. Corey, Naoki Tsuda, Andrew C. Singer

Abstract: We present an open-access dataset of over 8000 acoustic impulse responses from 160 microphones spread across the body and affixed to wearable accessories. The data can be used to evaluate audio capture and array processing systems using wearable devices such as hearing aids, headphones, eyeglasses, jewelry, and clothing. We analyze the acoustic transfer functions of different parts of the body, measure the effects of clothing worn over microphones, compare measurements from a live human subject to those from a mannequin, and simulate the noise-reduction performance of several beamformers. The results suggest that arrays of microphones spread across the body are more effective than those confined to a single device.

Submitted 5 March, 2019; originally announced March 2019.

Comments: To appear at ICASSP 2019

Journal ref: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

arXiv:1808.00096

doi 10.1109/IWAENC.2018.8521260

Speech Separation Using Partially Asynchronous Microphone Arrays Without Resampling

Authors: Ryan M. Corey, Andrew C. Singer

Abstract: We consider the problem of separating speech sources captured by multiple spatially separated devices, each of which has multiple microphones and samples its signals at a slightly different rate. Most asynchronous array processing methods rely on sample rate offset estimation and resampling, but these offsets can be difficult to estimate if the sources or microphones are moving. We propose a source separation method that does not require offset estimation or signal resampling. Instead, we divide the distributed array into several synchronous subarrays. All arrays are used jointly to estimate the time-varying signal statistics, and those statistics are used to design separate time-varying spatial filters in each array. We demonstrate the method for speech mixtures recorded on both stationary and moving microphone arrays.

Submitted 31 July, 2018; originally announced August 2018.

Comments: To appear at the International Workshop on Acoustic Signal Enhancement (IWAENC 2018)

Journal ref: 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC)

arXiv:1808.00082

doi 10.1109/IWAENC.2018.8521263

Delay-Performance Tradeoffs in Causal Microphone Array Processing

Authors: Ryan M. Corey, Naoki Tsuda, Andrew C. Singer

Abstract: In real-time listening enhancement applications, such as hearing aid signal processing, sounds must be processed with no more than a few milliseconds of delay to sound natural to the listener. Listening devices can achieve better performance with lower delay by using microphone arrays to filter acoustic signals in both space and time. Here, we analyze the tradeoff between delay and squared-error performance of causal multichannel Wiener filters for microphone array noise reduction. We compute exact expressions for the delay-error curves in two special cases and present experimental results from real-world microphone array recordings. We find that delay-performance characteristics are determined by both the spatial and temporal correlation structures of the signals.

Submitted 31 July, 2018; originally announced August 2018.

Comments: To appear at the International Workshop on Acoustic Signal Enhancement (IWAENC 2018)

Journal ref: 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC)

arXiv:1806.08968

doi 10.1109/JSTSP.2018.2863189

A Modulo-Based Architecture for Analog-to-Digital Conversion

Authors: Or Ordentlich, Gizem Tabak, Pavan Kumar Hanumolu, Andrew C. Singer, Gregory W. Wornell

Abstract: Systems that capture and process analog signals must first acquire them through an analog-to-digital converter. While subsequent digital processing can remove statistical correlations present in the acquired data, the dynamic range of the converter is typically scaled to match that of the input analog signal. The present paper develops an approach for analog-to-digital conversion that aims at minimizing the number of bits per sample at the output of the converter. This is attained by reducing the dynamic range of the analog signal by performing a modulo operation on its amplitude, and then quantizing the result. While the converter itself is universal and agnostic of the statistics of the signal, the decoder operation on the output of the quantizer can exploit the statistical structure in order to unwrap the modulo folding. The performance of this method is shown to approach information theoretical limits, as captured by the rate-distortion function, in various settings. An architecture for modulo analog-to-digital conversion via ring oscillators is suggested, and its merits are numerically demonstrated.

Submitted 23 June, 2018; originally announced June 2018.

arXiv:1705.07779

Cost-Performance Tradeoffs in Fusing Unreliable Computational Units

Authors: Mehmet A. Donmez, Maxim Raginsky, Andrew C. Singer, Lav R. Varshney

Abstract: We investigate fusing several unreliable computational units that perform the same task. We model an unreliable computational outcome as an additive perturbation to its error-free result in terms of its fidelity and cost. We analyze the performance of repetition-based strategies that distribute cost across several unreliable units and fuse their outcomes. When the cost is a convex function of fidelity, the optimal repetition-based strategy in terms of incurred cost while achieving a target mean-square error (MSE) performance may fuse several computational units. For concave and linear costs, a single more reliable unit incurs lower cost than a fusion of several lower-cost, less reliable units while achieving the same MSE performance. We show how our results give insight into problems from theoretical neuroscience, circuits, and crowdsourcing.

Submitted 22 May, 2017; originally announced May 2017.

arXiv:1705.07070

EE-Grad: Exploration and Exploitation for Cost-Efficient Mini-Batch SGD

Authors: Mehmet A. Donmez, Maxim Raginsky, Andrew C. Singer

Abstract: We present a generic framework for trading off fidelity and cost in computing stochastic gradients when the costs of acquiring stochastic gradients of different quality are not known a priori. We consider a mini-batch oracle that distributes a limited query budget over a number of stochastic gradients and aggregates them to estimate the true gradient. Since the optimal mini-batch size depends on the unknown cost-fidelity function, we propose an algorithm, EE-Grad, that sequentially explores the performance of mini-batch oracles and exploits the accumulated knowledge to estimate the one achieving the best performance in terms of cost-efficiency. We provide performance guarantees for EE-Grad with respect to the optimal mini-batch oracle, and illustrate these results in the case of strongly convex objectives. We also provide a simple numerical example that corroborates our theoretical findings.

Submitted 19 May, 2017; originally announced May 2017.

arXiv:1605.07905

Timing Channel: Achievable Rate in the Finite Block-Length Regime

Authors: Thomas J. Riedl, Todd P. Coleman, Andrew C. Singer

Abstract: The exponential server timing channel is known to be the simplest, and in some sense canonical, queuing timing channel. The capacity of this infinite-memory channel is known. Here, we discuss practical finite-length restrictions on the codewords and attempt to understand the maximal rate that can be achieved for a target error probability. By using Markov chain analysis, we prove a lower bound on the maximal channel coding rate achievable at blocklength $n$ and error probability $\varepsilon$. The bound is approximated by $C - n^{-1/2} \sigma Q^{-1}(\varepsilon)$, where $Q$ denotes the Q-function and $\sigma^2$ is the asymptotic variance of the underlying Markov chain. A closed form expression for $\sigma^2$ is given.

Submitted 25 May, 2016; originally announced May 2016.

Comments: Full technical report on the work originally presented at the Information Theory Workshop (ITW) in 2011

arXiv:1203.4206

doi 10.1109/LCOMM.2014.2316172

Low Complexity Turbo-Equalization: A Clustering Approach

Authors: Kyeongyeon Kim, Jun Won Choi, Suleyman S. Kozat, Andrew C. Singer

Abstract: We introduce a low complexity approach to iterative equalization and decoding, or "turbo equalization", that uses clustered models to better match the nonlinear relationship that exists between likelihood information from a channel decoder and the symbol estimates that arise in soft-input channel equalization. The introduced clustered turbo equalizer uses piecewise linear models to capture the nonlinear dependency of the linear minimum mean square error (MMSE) symbol estimate on the symbol likelihoods produced by the channel decoder and maintains a computational complexity that is only linear in the channel memory. By partitioning the space of likelihood information from the decoder, based on either hard or soft clustering, and using locally-linear adaptive equalizers within each clustered region, the performance gap between the linear MMSE equalizer and low-complexity, LMS-based linear turbo equalizers can be dramatically narrowed.

Submitted 19 March, 2012; originally announced March 2012.

Comments: Submitted to the IEEE Signal Processing Letters

arXiv:1203.4168

Linear MMSE-Optimal Turbo Equalization Using Context Trees

Authors: Nargiz Kalantarova, Kyeongyeon Kim, Suleyman S. Kozat, Andrew C. Singer

Abstract: Formulations of the turbo equalization approach to iterative equalization and decoding vary greatly when channel knowledge is either partially or completely unknown. Maximum a posteriori probability (MAP) and minimum mean square error (MMSE) approaches leverage channel knowledge to make explicit use of soft information (priors over the transmitted data bits) in a manner that is distinctly nonlinear, appearing either in a trellis formulation (MAP) or inside an inverted matrix (MMSE). To date, nearly all adaptive turbo equalization methods either estimate the channel or use a direct adaptation equalizer in which estimates of the transmitted data are formed from an expressly linear function of the received data and soft information, with this latter formulation being most common. We study a class of direct adaptation turbo equalizers that are both adaptive and nonlinear functions of the soft information from the decoder. We introduce piecewise linear models based on context trees that can adaptively approximate the nonlinear dependence of the equalizer on the soft information such that it can choose both the partition regions and the locally linear equalizer coefficients in each region independently, with computational complexity that remains of the order of a traditional direct adaptive linear equalizer. This approach is guaranteed to asymptotically achieve the performance of the best piecewise linear equalizer, and we quantify the MSE performance of the resulting algorithm and the convergence of its MSE to that of the linear minimum MSE estimator as the depth of the context tree and the data length increase.

Submitted 19 March, 2012; originally announced March 2012.

Comments: Submitted to the IEEE Transactions on Signal Processing

arXiv:1105.1482

Efficient Soft-Input Soft-Output Tree Detection Via an Improved Path Metric

Authors: J. W. Choi, B. Shim, A. C. Singer

Abstract: Tree detection techniques are often used to reduce the complexity of a posteriori probability (APP) detection in high dimensional multi-antenna wireless communication systems. In this paper, we introduce an efficient soft-input soft-output tree detection algorithm that employs a new type of look-ahead path metric in the computation of its branch pruning (or sorting). While conventional path metrics depend only on symbols on a visited path, the new path metric accounts for unvisited parts of the tree in advance through an unconstrained linear estimator and adds a bias term that reflects the contribution of as-yet undecided symbols. By applying the linear estimate-based look-ahead path metric to an M-algorithm that selects the best M paths for each level of the tree, we develop a new soft-input soft-output tree detector, called an improved soft-input soft-output M-algorithm (ISS-MA). Based on an analysis of the probability of correct path loss, we show that the improved path metric offers substantial performance gain over the conventional path metric. We also demonstrate through simulations that the ISS-MA provides a better performance-complexity trade-off than existing soft-input soft-output detection algorithms.

Submitted 7 May, 2011; originally announced May 2011.

Comments: 16 pages, 7 figures

Showing 1–26 of 26 results for author: Singer, A C