Skip to main content

Showing 1–6 of 6 results for author: Kavalerov, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2203.04420  [pdf, other

    eess.AS cs.AI cs.LG cs.SD eess.SP

    Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation Systems

    Authors: Rahil Parikh, Ilya Kavalerov, Carol Espy-Wilson, Shihab Shamma

    Abstract: Recent advancements in deep learning have led to drastic improvements in speech segregation models. Despite their success and growing applicability, few efforts have been made to analyze the underlying principles that these networks learn to perform segregation. Here we analyze the role of harmonicity on two state-of-the-art Deep Neural Networks (DNN)-based models- Conv-TasNet and DPT-Net. We eval… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: 5 pages, IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2022

  2. arXiv:2103.01303  [pdf, other

    cs.CV

    Exploring the high dimensional geometry of HSI features

    Authors: Wojciech Czaja, Ilya Kavalerov, Weilin Li

    Abstract: We explore feature space geometries induced by the 3-D Fourier scattering transform and deep neural network with extended attribute profiles on four standard hyperspectral images. We examine the distances and angles of class means, the variability of classes, and their low-dimensional structures. These statistics are compared to that of raw features, and our results provide insight into the vastly… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: 5 pages, 4 figures, to appear in WHISPERS 2021

  3. arXiv:2102.00313  [pdf, other

    cs.SD cs.LG eess.AS

    Cortical Features for Defense Against Adversarial Audio Attacks

    Authors: Ilya Kavalerov, Ruijie Zheng, Wojciech Czaja, Rama Chellappa

    Abstract: We propose using a computational model of the auditory cortex as a defense against adversarial attacks on audio. We apply several white-box iterative optimization-based adversarial attacks to an implementation of Amazon Alexa's HW network, and a modified version of this network with an integrated cortical representation, and show that the cortical features help defend against universal adversarial… ▽ More

    Submitted 17 November, 2021; v1 submitted 30 January, 2021; originally announced February 2021.

    Comments: Co-author legal name changed

  4. arXiv:1912.04216  [pdf, other

    cs.LG cs.CV stat.ML

    cGANs with Multi-Hinge Loss

    Authors: Ilya Kavalerov, Wojciech Czaja, Rama Chellappa

    Abstract: We propose a new algorithm to incorporate class conditional information into the critic of GANs via a multi-class generalization of the commonly used Hinge loss that is compatible with both supervised and semi-supervised settings. We study the compromise between training a state of the art generator and an accurate classifier simultaneously, and propose a way to use our algorithm to measure the de… ▽ More

    Submitted 21 November, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: Accepted to Winter Conference on Applications of Computer Vision (WACV) 2021

  5. arXiv:1906.06804  [pdf, other

    cs.CV

    Three-Dimensional Fourier Scattering Transform and Classification of Hyperspectral Images

    Authors: Ilya Kavalerov, Weilin Li, Wojciech Czaja, Rama Chellappa

    Abstract: Recent developments in machine learning and signal processing have resulted in many new techniques that are able to effectively capture the intrinsic yet complex properties of hyperspectral imagery. Tasks ranging from anomaly detection to classification can now be solved by taking advantage of very efficient algorithms which have their roots in representation theory and in computational approximat… ▽ More

    Submitted 21 November, 2020; v1 submitted 16 June, 2019; originally announced June 2019.

    Comments: Accepted to IEEE Transactions On Geoscience And Remote Sensing

  6. arXiv:1905.03330  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Universal Sound Separation

    Authors: Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin Wilson, Jonathan Le Roux, John R. Hershey

    Abstract: Recent deep learning approaches have achieved impressive performance on speech enhancement and separation tasks. However, these approaches have not been investigated for separating mixtures of arbitrary sounds of different types, a task we refer to as universal sound separation, and it is unknown how performance on speech tasks carries over to non-speech tasks. To study this question, we develop a… ▽ More

    Submitted 2 August, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

    Comments: 5 pages, accepted to WASPAA 2019