Showing 1–34 of 34 results for author: Erdogan, H

  1. arXiv:2401.08864  [pdf, other]

    eess.AS cs.LG cs.SD

    Binaural Angular Separation Network

    Authors: Yang Yang, George Sung, Shao-Fu Shih, Hakan Erdogan, Chehung Lee, Matthias Grundmann

    Abstract: We propose a neural network model that can separate target speech sources from interfering sources at different angular regions using two microphones. The model is trained with simulated room impulse responses (RIRs) using omni-directional microphones without needing to collect real RIRs. By relying on specific angular regions and multiple room simulations, the model utilizes consistent time diffe…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  2. arXiv:2312.04229  [pdf]

    eess.SP eess.SY

    Accelerated Real-Life (ARL) Testing and Characterization of Automotive LiDAR Sensors to facilitate the Development and Validation of Enhanced Sensor Models

    Authors: Marcel Kettelgerdes, Tjorven Hillmann, Thomas Hirmer, Hüseyin Erdogan, Bernhard Wunderle, Gordon Elger

    Abstract: In the realm of automated driving simulation and sensor modeling, the need for highly accurate sensor models is paramount for ensuring the reliability and safety of advanced driving assistance systems (ADAS). Hence, numerous works focus on the development of high-fidelity models of ADAS sensors, such as camera, Radar as well as modern LiDAR systems to simulate the sensor behavior in different driv…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 9th Symposium Driving Simulation 2023, Brunswick, Germany

  3. arXiv:2308.10415  [pdf, other]

    cs.SD cs.LG eess.AS

    TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition

    Authors: Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey

    Abstract: We present TokenSplit, a speech separation model that acts on discrete token sequences. The model is trained on multiple tasks simultaneously: separate and transcribe each speech source, and generate speech from text. The model operates on transcripts and audio token sequences and achieves multiple tasks through masking of inputs. The model is a sequence-to-sequence encoder-decoder model that uses…

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: INTERSPEECH 2023, project webpage with audio demos at https://google-research.github.io/sound-separation/papers/tokensplit

  4. arXiv:2303.07486  [pdf, other]

    eess.AS cs.LG cs.SD

    Guided Speech Enhancement Network

    Authors: Yang Yang, Shao-Fu Shih, Hakan Erdogan, Jamie Menjay Lin, Chehung Lee, Yunpeng Li, George Sung, Matthias Grundmann

    Abstract: High quality speech capture has been widely studied for both voice communication and human computer interface reasons. To improve the capture performance, we can often find multi-microphone speech enhancement techniques deployed on various devices. Multi-microphone speech enhancement problem is often decomposed into two decoupled steps: a beamformer that provides spatial filtering and a single-cha…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Accepted to ICASSP 2023

  5. arXiv:2203.15652  [pdf, other]

    eess.AS cs.SD

    CycleGAN-Based Unpaired Speech Dereverberation

    Authors: Hannah Muckenhirn, Aleksandr Safin, Hakan Erdogan, Felix de Chaumont Quitry, Marco Tagliasacchi, Scott Wisdom, John R. Hershey

    Abstract: Typically, neural network-based speech dereverberation models are trained on paired data, composed of a dry utterance and its corresponding reverberant utterance. The main limitation of this approach is that such models can only be trained on large amounts of data and a variety of room impulse responses when the data is synthetically reverberated, since acquiring real paired data is costly. In thi…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Submitted to Interspeech 2022

  6. arXiv:2110.10739  [pdf, other]

    cs.SD eess.AS

    Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training

    Authors: Aswin Sivaraman, Scott Wisdom, Hakan Erdogan, John R. Hershey

    Abstract: The recently-proposed mixture invariant training (MixIT) is an unsupervised method for training single-channel sound separation models in the sense that it does not require ground-truth isolated reference sources. In this paper, we investigate using MixIT to adapt a separation model on real far-field overlapping reverberant and noisy speech data from the AMI Corpus. The models are tested on real A…

    Submitted 20 October, 2021; originally announced October 2021.

  7. arXiv:2106.15813  [pdf, other]

    eess.AS cs.SD

    DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement

    Authors: Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani

    Abstract: Single-channel speech enhancement (SE) is an important task in speech processing. A widely used framework combines an analysis/synthesis filterbank with a mask prediction network, such as the Conv-TasNet architecture. In such systems, the denoising performance and computational efficiency are mainly affected by the structure of the mask prediction network. In this study, we aim to improve the sequ…

    Submitted 5 August, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: 5 pages, 2 figures. Accepted for WASPAA 2021

  8. arXiv:2106.00847  [pdf, other]

    eess.AS cs.SD

    Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation

    Authors: Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey

    Abstract: Supervised neural network training has led to significant progress on single-channel sound separation. This approach relies on ground truth isolated sources, which precludes scaling to widely available mixture data and limits progress on open-domain tasks. The recent mixture invariant training (MixIT) method enables training on in-the-wild data; however, it suffers from two outstanding problems. F…

    Submitted 16 October, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: 5 pages, 1 figure. WASPAA 2021

  9. arXiv:2105.02096  [pdf, other]

    cs.SD cs.LG eess.AS

    End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings

    Authors: Soumi Maiti, Hakan Erdogan, Kevin Wilson, Scott Wisdom, Shinji Watanabe, John R. Hershey

    Abstract: We present an end-to-end deep network model that performs meeting diarization from single-channel audio recordings. End-to-end diarization models have the advantage of handling speaker overlap and enabling straightforward handling of discriminative training, unlike traditional clustering-based diarization methods. The proposed system is designed to handle meetings with unknown numbers of speakers,…

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 5 pages, 2 figures, ICASSP 2021

    Journal ref: ICASSP 2021, SPE-54.1

  10. arXiv:2012.09727  [pdf, other]

    eess.AS cs.SD eess.SP

    Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording

    Authors: Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita, Shinji Watanabe, Marc Delcroix, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen

    Abstract: Leveraging additional speaker information to facilitate speech separation has received increasing attention in recent years. Recent research includes extracting target speech by using the target speaker's voice snippet and jointly separating all participating speakers by using a pool of additional speaker signals, which is known as speech separation using speaker inventory (SSUSI). However, all th…

    Submitted 18 December, 2020; v1 submitted 17 December, 2020; originally announced December 2020.

  11. arXiv:2011.02014  [pdf, other]

    eess.AS cs.SD

    Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis

    Authors: Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Maokui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey

    Abstract: Multi-speaker speech recognition of unsegmented recordings has diverse applications such as meeting transcription and automatic subtitle generation. With technical advances in systems dealing with speech separation, speaker diarization, and automatic speech recognition (ASR) in the last decade, it has become possible to build pipelines that achieve reasonable error rates on this task. In this pape…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted to IEEE SLT 2021

  12. arXiv:2011.00803  [pdf, other]

    cs.SD eess.AS

    What's All the FUSS About Free Universal Sound Separation Data?

    Authors: Scott Wisdom, Hakan Erdogan, Daniel Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John Hershey

    Abstract: We introduce the Free Universal Sound Separation (FUSS) dataset, a new corpus for experiments in separating mixtures of an unknown number of sounds from an open domain of sound types. The dataset consists of 23 hours of single-source audio data drawn from 357 classes, which are used to create mixtures of one to four sources. To simulate reverberation, an acoustic room simulator is used to generate…

    Submitted 2 November, 2020; originally announced November 2020.

  13. arXiv:2011.00801  [pdf, other]

    cs.SD eess.AS

    Sound Event Detection and Separation: a Benchmark on Desed Synthetic Soundscapes

    Authors: Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon

    Abstract: We propose a benchmark of state-of-the-art sound event detection systems (SED). We designed synthetic evaluation sets to focus on specific sound event detection challenges. We analyze the performance of the submissions to DCASE 2021 task 4 depending on time related modifications (time position of an event and length of clips) and we study the impact of non-target sound events and reverberation. We…

    Submitted 2 November, 2020; originally announced November 2020.

  14. arXiv:2007.03932  [pdf, other]

    cs.SD eess.AS eess.SP

    Improving Sound Event Detection In Domestic Environments Using Sound Separation

    Authors: Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John Hershey, Romain Serizel, Eduardo Fonseca, Prem Seetharaman, Justin Salamon

    Abstract: Performing sound event detection on real-world recordings often implies dealing with overlapping target sound events and non-target sounds, also referred to as interference or noise. Until now these problems were mainly tackled at the classifier level. We propose to use sound separation as a pre-processing for sound event detection. In this paper we start from a sound separation model trained on t…

    Submitted 8 July, 2020; originally announced July 2020.

  15. arXiv:2006.12701  [pdf, other]

    eess.AS cs.LG cs.SD

    Unsupervised Sound Separation Using Mixture Invariant Training

    Authors: Scott Wisdom, Efthymios Tzinis, Hakan Erdogan, Ron J. Weiss, Kevin Wilson, John R. Hershey

    Abstract: In recent years, rapid progress has been made on the problem of single-channel sound separation using supervised training of deep neural networks. In such supervised approaches, a model is trained to predict the component sources from synthetic mixtures created by adding up isolated ground-truth sources. Reliance on this synthetic training data is problematic because good performance depends upon…

    Submitted 23 October, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Accepted for spotlight presentation at NeurIPS 2020

  16. arXiv:1911.07953  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement

    Authors: Zhong-Qiu Wang, Hakan Erdogan, Scott Wisdom, Kevin Wilson, Desh Raj, Shinji Watanabe, Zhuo Chen, John R. Hershey

    Abstract: This work introduces sequential neural beamforming, which alternates between neural network based spectral separation and beamforming based spatial separation. Our neural networks for separation use an advanced convolutional architecture trained with a novel stabilized signal-to-noise ratio loss function. For beamforming, we explore multiple ways of computing time-varying covariance matrices, incl…

    Submitted 3 November, 2020; v1 submitted 18 November, 2019; originally announced November 2019.

    Comments: 7 pages, 7 figures, IEEE SLT 2021 (slt2020.org)

  17. arXiv:1905.03330  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Universal Sound Separation

    Authors: Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin Wilson, Jonathan Le Roux, John R. Hershey

    Abstract: Recent deep learning approaches have achieved impressive performance on speech enhancement and separation tasks. However, these approaches have not been investigated for separating mixtures of arbitrary sounds of different types, a task we refer to as universal sound separation, and it is unknown how performance on speech tasks carries over to non-speech tasks. To study this question, we develop a…

    Submitted 2 August, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

    Comments: 5 pages, accepted to WASPAA 2019

  18. arXiv:1904.06478  [pdf, other]

    eess.AS cs.CL cs.SD

    Low-Latency Speaker-Independent Continuous Speech Separation

    Authors: Takuya Yoshioka, Zhuo Chen, Changliang Liu, Xiong Xiao, Hakan Erdogan, Dimitrios Dimitriadis

    Abstract: Speaker independent continuous speech separation (SI-CSS) is a task of converting a continuous audio stream, which may contain overlapping voices of unknown speakers, into a fixed number of continuous signals each of which contains no overlapping speech segment. A separated, or cleaned, version of each utterance is generated from one of SI-CSS's output channels nondeterministically without being s…

    Submitted 13 April, 2019; originally announced April 2019.

  19. arXiv:1811.02508  [pdf, other]

    cs.SD eess.AS

    SDR - half-baked or well done?

    Authors: Jonathan Le Roux, Scott Wisdom, Hakan Erdogan, John R. Hershey

    Abstract: In speech enhancement and source separation, signal-to-noise ratio is a ubiquitous objective measure of denoising/separation quality. A decade ago, the BSS_eval toolkit was developed to give researchers worldwide a way to evaluate the quality of their algorithms in a simple, fair, and hopefully insightful way: it attempted to account for channel variations, and to not only evaluate the total disto…

    Submitted 6 November, 2018; originally announced November 2018.

  20. Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks

    Authors: Takuya Yoshioka, Hakan Erdogan, Zhuo Chen, Xiong Xiao, Fil Alleva

    Abstract: The goal of this work is to develop a meeting transcription system that can recognize speech even when utterances of different speakers are overlapped. While speech overlaps have been regarded as a major obstacle in accurately transcribing meetings, a traditional beamformer with a single output has been exclusively used because previously proposed speech separation techniques have critical constra…

    Submitted 8 October, 2018; originally announced October 2018.

    Journal ref: Proc. Interspeech 2018, 3038-3042

  21. arXiv:1711.08016  [pdf, other]

    eess.AS cs.CL cs.SD

    Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition

    Authors: Zhong Meng, Shinji Watanabe, John R. Hershey, Hakan Erdogan

    Abstract: Far-field speech recognition in noisy and reverberant conditions remains a challenging problem despite recent deep learning breakthroughs. This problem is commonly addressed by acquiring a speech signal from multiple microphones and performing beamforming over them. In this paper, we propose to use a recurrent neural network with long short-term memory (LSTM) architecture to adaptively estimate re…

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

    Journal ref: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, 2017, pp. 271-275

  22. PLDA-Based Diarization of Telephone Conversations

    Authors: Ahmet E. Bulut, Hakan Demir, Yusuf Ziya Isik, Hakan Erdogan

    Abstract: This paper investigates the application of the probabilistic linear discriminant analysis (PLDA) to speaker diarization of telephone conversations. We introduce using a variational Bayes (VB) approach for inference under a PLDA model for modeling segmental i-vectors in speaker diarization. Deterministic annealing (DA) algorithm is imposed in order to avoid local optimal solutions in VB iterations.…

    Submitted 29 September, 2017; originally announced October 2017.

    Journal ref: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings 1 (2015) 4809-4813

  23. arXiv:1507.02826  [pdf, ps, other]

    cs.IT

    Comments On "Multipath Matching Pursuit" by Kwon, Wang and Shim

    Authors: Nazim Burak Karahanoglu, Hakan Erdogan

    Abstract: Straightforward combination of tree search with matching pursuits, which was suggested in 2001 by Cotter and Rao, and then later developed by some other authors, has been revisited recently as multipath matching pursuit (MMP). In this comment, we would like to point out some major issues regarding this publication. First, the idea behind MMP is not novel, and the related literature has not been pr…

    Submitted 10 July, 2015; originally announced July 2015.

  24. THRIVE: Threshold Homomorphic encryption based secure and privacy preserving bIometric VErification system

    Authors: Cagatay Karabat, Mehmet Sabir Kiraz, Hakan Erdogan, Erkay Savas

    Abstract: In this paper, we propose a new biometric verification and template protection system which we call the THRIVE system. The system includes novel enrollment and authentication protocols based on threshold homomorphic cryptosystem where the private key is shared between a user and the verifier. In the THRIVE system, only encrypted binary biometric templates are stored in the database and verificatio…

    Submitted 29 September, 2014; originally announced September 2014.

  25. arXiv:1311.2746  [pdf, other]

    cs.NE cs.LG

    Deep neural networks for single channel source separation

    Authors: Emad M. Grais, Mehmet Umut Sen, Hakan Erdogan

    Abstract: In this paper, a novel approach for single channel source separation (SCSS) using a deep neural network (DNN) architecture is introduced. Unlike previous studies in which DNN and other classifiers were used for classifying time-frequency bins to obtain hard masks for each source, we use the DNN to classify estimated source spectra to check for their validity during separation. In the training stag…

    Submitted 12 November, 2013; originally announced November 2013.

    Comments: 5 pages, 2 figures, 2 tables, submitted to ICASSP2014

  26. Improving A*OMP: Theoretical and Empirical Analyses With a Novel Dynamic Cost Model

    Authors: Nazim Burak Karahanoglu, Hakan Erdogan

    Abstract: Best-first search has been recently utilized for compressed sensing (CS) by the A* orthogonal matching pursuit (A*OMP) algorithm. In this work, we concentrate on theoretical and empirical analyses of A*OMP. We present a restricted isometry property (RIP) based general condition for exact recovery of sparse signals via A*OMP. In addition, we develop online guarantees which promise improved recovery…

    Submitted 10 July, 2015; v1 submitted 6 July, 2013; originally announced July 2013.

    Journal ref: Signal Processing 118 (2016) 62-74

  27. arXiv:1302.7283  [pdf, other]

    cs.LG math.NA

    Source Separation using Regularized NMF with MMSE Estimates under GMM Priors with Online Learning for The Uncertainties

    Authors: Emad M. Grais, Hakan Erdogan

    Abstract: We propose a new method to enforce priors on the solution of the nonnegative matrix factorization (NMF). The proposed algorithm can be used for denoising or single-channel source separation (SCSS) applications. The NMF solution is guided to follow the Minimum Mean Square Error (MMSE) estimates under Gaussian mixture prior models (GMM) for the source signal. In SCSS applications, the spectra of the…

    Submitted 28 February, 2013; originally announced February 2013.

  28. arXiv:1210.5991  [pdf, ps, other]

    cs.IT

    Online Recovery Guarantees and Analytical Results for OMP

    Authors: Nazim Burak Karahanoglu, Hakan Erdogan

    Abstract: Orthogonal Matching Pursuit (OMP) is a simple, yet empirically competitive algorithm for sparse recovery. Recent developments have shown that OMP guarantees exact recovery of K-sparse signals with K or more than K iterations if the observation matrix satisfies the restricted isometry property (RIP) with some conditions. We develop RIP-based online guarantees for recovery of a K-sparse signal with…

    Submitted 29 March, 2013; v1 submitted 22 October, 2012; originally announced October 2012.

  29. Compressed Sensing Signal Recovery via Forward-Backward Pursuit

    Authors: Nazim Burak Karahanoglu, Hakan Erdogan

    Abstract: Recovery of sparse signals from compressed measurements constitutes an l0 norm minimization problem, which is unpractical to solve. A number of sparse recovery approaches have appeared in the literature, including l1 minimization techniques, greedy pursuit algorithms, Bayesian methods and nonconvex optimization techniques among others. This manuscript introduces a novel two stage greedy approach,…

    Submitted 6 July, 2013; v1 submitted 20 October, 2012; originally announced October 2012.

    Comments: accepted for publication in Digital Signal Processing

    Journal ref: Digital Signal Processing 23 (2013), pp. 1539-1548

  30. arXiv:1108.3260  [pdf, other]

    cs.AI cs.LO cs.PL

    Finding Similar/Diverse Solutions in Answer Set Programming

    Authors: Thomas Eiter, Esra Erdem, Halit Erdogan, Michael Fink

    Abstract: For some computational problems (e.g., product configuration, planning, diagnosis, query answering, phylogeny reconstruction) computing a set of similar/diverse solutions may be desirable for better decision-making. With this motivation, we studied several decision/optimization versions of this problem in the context of Answer Set Programming (ASP), analyzed their computational complexity, and int…

    Submitted 16 August, 2011; originally announced August 2011.

    Comments: 57 pages, 17 figures, 4 tables. To appear in Theory and Practice of Logic Programming (TPLP)

    Journal ref: Theory and Practice of Logic Programming, 13(3), 303-359, 2013

  31. arXiv:1107.5850   

    cs.CV

    Confidence-Based Dynamic Classifier Combination For Mean-Shift Tracking

    Authors: Ibrahim Saygin Topkaya, Hakan Erdogan

    Abstract: We introduce a novel tracking technique which uses dynamic confidence-based fusion of two different information sources for robust and efficient tracking of visual objects. Mean-shift tracking is a popular and well known method used in object tracking problems. Originally, the algorithm uses a similarity measure which is optimized by shifting a search area to the center of a generated weight image…

    Submitted 22 July, 2014; v1 submitted 28 July, 2011; originally announced July 2011.

    Comments: This paper has been withdrawn by the author due to an implementation issue

    ACM Class: I.4.8

  32. arXiv:1106.1684  [pdf, other]

    cs.LG

    Max-Margin Stacking and Sparse Regularization for Linear Classifier Combination and Selection

    Authors: Mehmet Umut Sen, Hakan Erdogan

    Abstract: The main principle of stacked generalization (or Stacking) is using a second-level generalizer to combine the outputs of base classifiers in an ensemble. In this paper, we investigate different combination types under the stacking framework; namely weighted sum (WS), class-dependent weighted sum (CWS) and linear stacked generalization (LSG). For learning the weights, we propose using regularized e…

    Submitted 8 June, 2011; originally announced June 2011.

    Comments: 8 pages, 3 figures, 6 tables, journal

  33. arXiv:1012.1899  [pdf, other]

    cs.AI

    Querying Biomedical Ontologies in Natural Language using Answer Set

    Authors: Halit Erdogan, Umut Oztok, Yelda Erdem, Esra Erdem

    Abstract: In this work, we develop an intelligent user interface that allows users to enter biomedical queries in a natural language, and that presents the answers (possibly with explanations if requested) in a natural language. We develop a rule layer over biomedical ontologies and databases, and use automated reasoners to answer queries considering relevant parts of the rule layer.

    Submitted 8 December, 2010; originally announced December 2010.

    Comments: in Adrian Paschke, Albert Burger, Andrea Splendiani, M. Scott Marshall, Paolo Romano: Proceedings of the 3rd International Workshop on Semantic Web Applications and Tools for the Life Sciences, Berlin, Germany, December 8-10, 2010

    Report number: SWAT4LS 2010

    ACM Class: J.3

  34. A* Orthogonal Matching Pursuit: Best-First Search for Compressed Sensing Signal Recovery

    Authors: Nazim Burak Karahanoglu, Hakan Erdogan

    Abstract: Compressed sensing is a developing field aiming at reconstruction of sparse signals acquired in reduced dimensions, which make the recovery process under-determined. The required solution is the one with minimum $\ell_0$ norm due to sparsity, however it is not practical to solve the $\ell_0$ minimization problem. Commonly used techniques include $\ell_1$ minimization, such as Basis Pursuit (BP) an…

    Submitted 14 March, 2012; v1 submitted 2 September, 2010; originally announced September 2010.

    Comments: accepted for publication in Digital Signal Processing

    Journal ref: Digital Signal Processing, Volume 22, Issue 4, 2012, Pages 555-568