Showing 1–9 of 9 results for author: Heydari, M

Searching in archive eess.
  1. arXiv:2402.08592  [pdf]

    eess.IV cs.CV cs.LG

    Convolutional Neural Networks Towards Facial Skin Lesions Detection

    Authors: Reza Sarshar, Mohammad Heydari, Elham Akhondzadeh Noughabi

    Abstract: Facial analysis has emerged as a prominent area of research with diverse applications, including cosmetic surgery programs, the beauty industry, photography, and entertainment. Manipulating patient images often necessitates professional image processing software. This study contributes by providing a model that facilitates the detection of blemishes and skin lesions on facial images through a conv…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 6 pages, 11 figures

  2. arXiv:2309.07525  [pdf, other]

    cs.SD cs.AI eess.AS

    SingFake: Singing Voice Deepfake Detection

    Authors: Yongyi Zang, You Zhang, Mojtaba Heydari, Zhiyao Duan

    Abstract: The rise of singing voice synthesis presents critical challenges to artists and industry stakeholders over unauthorized voice usage. Unlike synthesized speech, synthesized singing voices are typically released in songs containing strong background music that may hide synthesis artifacts. Additionally, singing voices present different acoustic and linguistic characteristics from speech utterances.…

    Submitted 21 January, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted at ICASSP 2024

  3. arXiv:2306.02372  [pdf, other]

    eess.AS

    SingNet: A Real-time Singing Voice Beat and Downbeat Tracking System

    Authors: Mojtaba Heydari, Ju-Chiang Wang, Zhiyao Duan

    Abstract: Singing voice beat and downbeat tracking has several applications in automatic music production, analysis, and manipulation. Among them, some require real-time processing, such as live performance processing and auto-accompaniment for singing inputs. This task is challenging owing to the non-trivial rhythmic and harmonic patterns in singing signals. For real-time processing, it introduces furthe…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted for 2023 International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2023)

  4. arXiv:2208.14578  [pdf, other]

    eess.AS

    Singing Beat Tracking With Self-supervised Front-end and Linear Transformers

    Authors: Mojtaba Heydari, Zhiyao Duan

    Abstract: Tracking beats of singing voices without the presence of musical accompaniment can find many applications in music production, automatic song arrangement, and social media interaction. Its main challenge is the lack of strong rhythmic and harmonic patterns that are important for music rhythmic analysis in general. Even for human listeners, this can be a challenging task. As a result, existing musi…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 23rd International Society for Music Information Retrieval Conference (ISMIR 2022)

  5. arXiv:2111.00704  [pdf, other]

    cs.SD cs.IR eess.AS eess.SP

    A Novel 1D State Space for Efficient Music Rhythmic Analysis

    Authors: Mojtaba Heydari, Matthew McCallum, Andreas Ehmann, Zhiyao Duan

    Abstract: Inferring music time structures has a broad range of applications in music production, processing and analysis. Scholars have proposed various methods to analyze different aspects of time structures, such as beat, downbeat, tempo and meter. Many state-of-the-art (SOTA) methods, however, are computationally expensive. This makes them inapplicable in real-world industrial settings where the scale of…

    Submitted 20 February, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: International Conference on Acoustics, Speech and Signal Processing (ICASSP), May. 2022

  6. arXiv:2108.10382  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Learning Sparse Analytic Filters for Piano Transcription

    Authors: Frank Cwitkowitz, Mojtaba Heydari, Zhiyao Duan

    Abstract: In recent years, filterbank learning has become an increasingly popular strategy for various audio-related machine learning tasks. This is partly due to its ability to discover task-specific audio characteristics which can be leveraged in downstream processing. It is also a natural extension of the nearly ubiquitous deep learning methods employed to tackle a diverse array of audio applications. In…

    Submitted 10 November, 2022; v1 submitted 23 August, 2021; originally announced August 2021.

    Comments: Sound and Music Computing Conference (SMC) 2022

  7. arXiv:2108.03576  [pdf, other]

    eess.AS cs.AI cs.IR cs.LG cs.SD eess.SP

    BeatNet: CRNN and Particle Filtering for Online Joint Beat Downbeat and Meter Tracking

    Authors: Mojtaba Heydari, Frank Cwitkowitz, Zhiyao Duan

    Abstract: The online estimation of rhythmic information, such as beat positions, downbeat positions, and meter, is critical for many real-time music applications. Musical rhythm comprises complex hierarchical relationships across time, rendering its analysis intrinsically challenging and at times subjective. Furthermore, systems which attempt to estimate rhythmic information in real-time must be causal and…

    Submitted 8 August, 2021; originally announced August 2021.

    Comments: 22nd International Society for Music Information Retrieval (ISMIR) Conference Paper, Fall 2021. 8 Pages (Total), 3 Figures, 2 Tables, 1 Algorithm

  8. arXiv:2011.02619  [pdf]

    eess.AS cs.SD

    Don't look back: an online beat tracking method using RNN and enhanced particle filtering

    Authors: Mojtaba Heydari, Zhiyao Duan

    Abstract: Online beat tracking (OBT) has always been a challenging task due to the inaccessibility of future data and the need to make inferences in real time. We propose Do not Look back! (DLB), a novel approach optimized for efficiency when performing OBT. DLB feeds the activations of a unidirectional RNN into an enhanced Monte-Carlo localization model to infer beat positions. Most preexisting OBT methods…

    Submitted 1 March, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

    Comments: IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2021). (ACCEPTED)

  9. arXiv:2009.04411  [pdf]

    eess.SP eess.SY

    Design and Fabrication of Novel Digital Transcranial Electrical Stimulator for Medical and Psychiatry Applications

    Authors: HoseinAli Jafari, Mohammad Bagher Heydari, Niloofar Jafari, Hamid Mirhosseini

    Abstract: In this article, we design a novel Transcranial Electrical Stimulator for medical applications, which is very cheap and can produce the desired signals very accurately. Our fabricated stimulator generates all current signals related to Transcranial Electrical Stimulation (TES) methods, i.e. Transcranial Direct Current Stimulation (tDCS), Transcranial Pulsed Current Stimulation (tPCS), Cranial Elec…

    Submitted 9 September, 2020; originally announced September 2020.