Showing 1–13 of 13 results for author: Avramidis, K

  1. arXiv:2406.08644

    eess.SP cs.AI cs.SD eess.AS

    Toward Fully-End-to-End Listened Speech Decoding from EEG Signals

    Authors: Jihwan Lee, Aditya Kommineni, Tiantian Feng, Kleanthis Avramidis, Xuan Shi, Sudarsana Kadiri, Shrikanth Narayanan

    Abstract: Speech decoding from EEG signals is a challenging task, where brain activity is modeled to estimate salient characteristics of acoustic stimuli. We propose FESDE, a novel framework for Fully-End-to-end Speech Decoding from EEG signals. Our approach aims to directly reconstruct listened speech waveforms given EEG signals, where no intermediate acoustic feature processing step is required. The propo…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: accepted to Interspeech2024

  2. arXiv:2403.03222

    cs.LG cs.AI eess.SP

    Knowledge-guided EEG Representation Learning

    Authors: Aditya Kommineni, Kleanthis Avramidis, Richard Leahy, Shrikanth Narayanan

    Abstract: Self-supervised learning has produced impressive results in multimedia domains of audio, vision and speech. This paradigm is equally, if not more, relevant for the domain of biosignals, owing to the scarcity of labelled data in such scenarios. The ability to leverage large-scale unlabelled data to learn robust representations could help improve the performance of numerous inference tasks on biosig…

    Submitted 14 February, 2024; originally announced March 2024.

    Comments: 6 Pages, 5 figures, Submitted to EMBC 2024

  3. arXiv:2402.09655

    eess.SP eess.IV

    Evaluating Atypical Gaze Patterns through Vision Models: The Case of Cortical Visual Impairment

    Authors: Kleanthis Avramidis, Melinda Y. Chang, Rahul Sharma, Mark S. Borchert, Shrikanth Narayanan

    Abstract: A wide range of neurological and cognitive disorders exhibit distinct behavioral markers aside from their clinical manifestations. Cortical Visual Impairment (CVI) is a prime example of such conditions, resulting from damage to visual pathways in the brain, and adversely impacting low- and high-level visual function. The characteristics impacted by CVI are primarily described qualitatively, challe…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures, submitted to IEEE EMBC 2024

  4. arXiv:2309.15292

    cs.LG eess.SP

    Scaling Representation Learning from Ubiquitous ECG with State-Space Models

    Authors: Kleanthis Avramidis, Dominika Kunc, Bartosz Perz, Kranti Adsul, Tiantian Feng, Przemysław Kazienko, Stanisław Saganowski, Shrikanth Narayanan

    Abstract: Ubiquitous sensing from wearable devices in the wild holds promise for enhancing human well-being, from diagnosing clinical conditions and measuring stress to building adaptive health promoting scaffolds. But the large volumes of data therein across heterogeneous contexts pose challenges for conventional supervised learning approaches. Representation Learning from biological signals is an emerging…

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: Pre-print, currently under review

  5. arXiv:2308.12610

    cs.MM cs.SD eess.AS

    Emotion-Aligned Contrastive Learning Between Images and Music

    Authors: Shanti Stewart, Kleanthis Avramidis, Tiantian Feng, Shrikanth Narayanan

    Abstract: Traditional music search engines rely on retrieval methods that match natural language queries with music metadata. There have been increasing efforts to expand retrieval methods to consider the audio characteristics of music itself, using queries of various modalities including text, video, and speech. While most approaches aim to match general music semantics to the input queries, only a few foc…

    Submitted 20 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 4 pages + 1 reference page, 1 figure, 3 tables. Under review for publication

  6. arXiv:2304.08614

    eess.SP cs.LG

    Signal Processing Grand Challenge 2023 -- e-Prevention: Sleep Behavior as an Indicator of Relapses in Psychotic Patients

    Authors: Kleanthis Avramidis, Kranti Adsul, Digbalay Bose, Shrikanth Narayanan

    Abstract: This paper presents the approach and results of USC SAIL's submission to the Signal Processing Grand Challenge 2023 - e-Prevention (Task 2), on detecting relapses in psychotic patients. Relapse prediction has proven to be challenging, primarily due to the heterogeneity of symptoms and responses to treatment between individuals. We address these challenges by investigating the use of sleep behavior…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 2 pages, 1 table, ICASSP 2023, Grand Challenges Track

  7. arXiv:2210.15828

    cs.SD cs.MM eess.AS

    On the Role of Visual Context in Enriching Music Representations

    Authors: Kleanthis Avramidis, Shanti Stewart, Shrikanth Narayanan

    Abstract: Human perception and experience of music is highly context-dependent. Contextual variability contributes to differences in how we interpret and interact with music, challenging the design of robust models for information retrieval. Incorporating multimodal context from diverse sources provides a promising approach toward modeling this variability. Music presented in media such as movies and music…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 5 pages, 4 figures, 1 table

  8. arXiv:2210.15826

    eess.SP cs.HC

    Multimodal Estimation of Change Points of Physiological Arousal in Drivers

    Authors: Kleanthis Avramidis, Tiantian Feng, Digbalay Bose, Shrikanth Narayanan

    Abstract: Detecting unsafe driving states, such as stress, drowsiness, and fatigue, is an important component of ensuring driving safety and an essential prerequisite for automatic intervention systems in vehicles. These concerning conditions are primarily connected to the driver's low or high arousal levels. In this study, we describe a framework for processing multimodal physiological time-series from wea…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 tables, 4 figures

  9. arXiv:2207.04565

    eess.IV cs.LG

    Automating Detection of Papilledema in Pediatric Fundus Images with Explainable Machine Learning

    Authors: Kleanthis Avramidis, Mohammad Rostami, Melinda Chang, Shrikanth Narayanan

    Abstract: Papilledema is an ophthalmic neurologic disorder in which increased intracranial pressure leads to swelling of the optic nerves. Undiagnosed papilledema in children may lead to blindness and may be a sign of life-threatening conditions, such as brain tumors. Robust and accurate clinical diagnosis of this syndrome can be facilitated by automated analysis of fundus images using deep learning, especi…

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: 5 pages, 4 figures, 2 tables, 2022 IEEE International Conference on Image Processing (ICIP)

  10. arXiv:2202.09750

    cs.SD cs.IR cs.LG eess.AS

    Enhancing Affective Representations of Music-Induced EEG through Multimodal Supervision and latent Domain Adaptation

    Authors: Kleanthis Avramidis, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos

    Abstract: The study of Music Cognition and neural responses to music has been invaluable in understanding human emotions. Brain signals, though, manifest a highly complex structure that makes processing and retrieving meaningful features challenging, particularly of abstract constructs like affect. Moreover, the performance of learning models is undermined by the limited amount of available neuronal data an…

    Submitted 20 February, 2022; originally announced February 2022.

    Comments: 5 pages, 3 figures, IEEE ICASSP 2022

  11. arXiv:2102.06930

    cs.SD cs.LG eess.AS

    Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms

    Authors: Kleanthis Avramidis, Agelos Kratimenos, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos

    Abstract: Sound Event Detection and Audio Classification tasks are traditionally addressed through time-frequency representations of audio signals such as spectrograms. However, the emergence of deep neural networks as efficient feature extractors has enabled the direct use of audio signals for classification purposes. In this paper, we attempt to recognize musical instruments in polyphonic audio by only fe…

    Submitted 13 February, 2021; originally announced February 2021.

    Comments: 5 pages, 4 figures, 6 tables, to be published in the Proc. of the 46th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021) @ Toronto, Ontario, Canada

  12. Multiscale Fractal Analysis on EEG Signals for Music-Induced Emotion Recognition

    Authors: Kleanthis Avramidis, Athanasia Zlatintsi, Christos Garoufis, Petros Maragos

    Abstract: Emotion Recognition from EEG signals has long been researched as it can assist numerous medical and rehabilitative applications. However, their complex and noisy structure has proven to be a serious barrier for traditional modeling methods. In this paper, we employ multifractal analysis to examine the behavior of EEG signals in terms of presence of fluctuations and the degree of fragmentation alon…

    Submitted 12 December, 2021; v1 submitted 30 October, 2020; originally announced October 2020.

    Comments: 5 pages, 3 figures, 3 tables, European Signal Processing Conference (EUSIPCO) 2021, Dublin, Ireland

  13. arXiv:1911.12505

    cs.LG cs.SD eess.AS stat.ML

    Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music

    Authors: Agelos Kratimenos, Kleanthis Avramidis, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos

    Abstract: Instrument classification is one of the fields in Music Information Retrieval (MIR) that has attracted a lot of research interest. However, the majority of that is dealing with monophonic music, while efforts on polyphonic material mainly focus on predominant instrument recognition. In this paper, we propose an approach for instrument classification in polyphonic music from purely monophonic data,…

    Submitted 2 March, 2020; v1 submitted 27 November, 2019; originally announced November 2019.