Skip to main content

Showing 1–29 of 29 results for author: Salah, A a

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19006  [pdf, other

    cs.CV

    VideoMambaPro: A Leap Forward for Mamba in Video Understanding

    Authors: Hui Lu, Albert Ali Salah, Ronald Poppe

    Abstract: Video understanding requires the extraction of rich spatio-temporal representations, which transformer models achieve through self-attention. Unfortunately, self-attention poses a computational burden. In NLP, Mamba has surfaced as an efficient alternative for transformers. However, Mamba's successes do not trivially extend to computer vision tasks, including those in video analysis. In this paper… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2405.12642  [pdf

    cs.SI cs.CY

    Combining Twitter and Mobile Phone Data to Observe Border-Rush: The Turkish-European Border Opening

    Authors: Carlos Arcila Calderón, Bilgeçağ Aydoğdu, Tuba Bircan, Bünyamin Gündüz, Onur Önes, Albert Ali Salah, Alina Sîrbu

    Abstract: Following Turkey's 2020 decision to revoke border controls, many individuals journeyed towards the Greek, Bulgarian, and Turkish borders. However, the lack of verifiable statistics on irregular migration and discrepancies between media reports and actual migration patterns require further exploration. The objective of this study is to bridge this knowledge gap by harnessing novel data sources, spe… ▽ More

    Submitted 22 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2403.16128  [pdf, other

    cs.CV

    Enhancing Video Transformers for Action Understanding with VLM-aided Training

    Authors: Hui Lu, Hu Jian, Ronald Poppe, Albert Ali Salah

    Abstract: Owing to their ability to extract relevant spatio-temporal video embeddings, Vision Transformers (ViTs) are currently the best performing models in video action understanding. However, their generalization over domains or datasets is somewhat limited. In contrast, Visual Language Models (VLMs) have demonstrated exceptional generalization performance, but are currently unable to process videos. Con… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  4. arXiv:2403.11818  [pdf, other

    cs.CV

    TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions

    Authors: Hui Lu, Albert Ali Salah, Ronald Poppe

    Abstract: A key challenge in continuous sign language recognition (CSLR) is to efficiently capture long-range spatial interactions over time from the video input. To address this challenge, we propose TCNet, a hybrid network that effectively models spatio-temporal information from Trajectories and Correlated regions. TCNet's trajectory module transforms frames into aligned trajectories composed of continuou… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  5. arXiv:2401.17134  [pdf, other

    cs.HC

    Wrist movement classification for adaptive mobile phone based rehabilitation of children with motor skill impairments

    Authors: Kayleigh Schoorl, Tamara Pinos Cisneros, Albert Ali Salah, Ben Schouten

    Abstract: Rehabilitation exercises performed by children with cerebral palsy are tedious and repetitive. To make them more engaging, we propose to use an exergame approach, where an adaptive application can help the child remain stimulated and interested during exercises. In this paper, we describe how the mobile phone sensors can be used to classify wrist movements of the user during the rehabilitation exe… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 25 pages, 11 figures

  6. arXiv:2312.06285  [pdf, other

    cs.CV cs.AI

    Compensation Sampling for Improved Convergence in Diffusion Models

    Authors: Hui Lu, Albert ali Salah, Ronald Poppe

    Abstract: Diffusion models achieve remarkable quality in image generation, but at a cost. Iterative denoising requires many time steps to produce high fidelity images. We argue that the denoising process is crucially limited by an accumulation of the reconstruction error due to an initial inaccurate reconstruction of the target data. This leads to lower quality outputs, and slower convergence. To address th… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  7. arXiv:2308.15321  [pdf, other

    cs.LG cs.AI cs.CV

    Elucidating the Exposure Bias in Diffusion Models

    Authors: Mang Ning, Mingxiao Li, Jianlin Su, Albert Ali Salah, Itir Onal Ertugrul

    Abstract: Diffusion models have demonstrated impressive generative capabilities, but their \textit{exposure bias} problem, described as the input mismatch between training and sampling, lacks in-depth exploration. In this paper, we systematically investigate the exposure bias problem in diffusion models by first analytically modelling the sampling distribution, based on which we then attribute the predictio… ▽ More

    Submitted 10 April, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: ICLR 2024

  8. arXiv:2211.03705  [pdf, other

    cs.CV eess.IV

    A Survey on Computer Vision based Human Analysis in the COVID-19 Era

    Authors: Fevziye Irem Eyiokur, Alperen Kantarcı, Mustafa Ekrem Erakın, Naser Damer, Ferda Ofli, Muhammad Imran, Janez Križaj, Albert Ali Salah, Alexander Waibel, Vitomir Štruc, Hazım Kemal Ekenel

    Abstract: The emergence of COVID-19 has had a global and profound impact, not only on society as a whole, but also on the lives of individuals. Various prevention measures were introduced around the world to limit the transmission of the disease, including face masks, mandates for social distancing and regular disinfection in public spaces, and the use of screening applications. These developments also trig… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Submitted to Image and Vision Computing, 44 pages, 7 figures

  9. arXiv:2210.15769  [pdf, other

    cs.CV

    Fully-attentive and interpretable: vision and video vision transformers for pain detection

    Authors: Giacomo Fiorentini, Itir Onal Ertugrul, Albert Ali Salah

    Abstract: Pain is a serious and costly issue globally, but to be treated, it must first be detected. Vision transformers are a top-performing architecture in computer vision, with little research on their use for pain detection. In this paper, we propose the first fully-attentive automated pain detection pipeline that achieves state-of-the-art performance on binary pain detection from facial expressions. Th… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 9 pages (12 with references), 10 figures, VTTA2022

  10. arXiv:2210.08860  [pdf, other

    cs.CV

    Automatic Analysis of Human Body Representations in Western Art

    Authors: Shu Zhao, Almıla Akdağ Salah, Albert Ali Salah

    Abstract: The way the human body is depicted in classical and modern paintings is relevant for art historical analyses. Each artist has certain themes and concerns, resulting in different poses being used more heavily than others. In this paper, we propose a computer vision pipeline to analyse human pose and representations in paintings, which can be used for specific artists or periods. Specifically, we co… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  11. arXiv:2209.13296  [pdf, other

    cs.CV

    Video-based estimation of pain indicators in dogs

    Authors: Hongyi Zhu, Yasemin Salgırlı, Pınar Can, Durmuş Atılgan, Albert Ali Salah

    Abstract: Dog owners are typically capable of recognizing behavioral cues that reveal subjective states of their dogs, such as pain. But automatic recognition of the pain state is very challenging. This paper proposes a novel video-based, two-stream deep neural network approach for this problem. We extract and preprocess body keypoints, and compute features from both keypoints and the RGB representation ove… ▽ More

    Submitted 26 November, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: 20 pages, 7 figures

  12. arXiv:2207.01487  [pdf

    cs.CY cs.AI cs.HC cs.SD eess.AS

    State of the Art of Audio- and Video-Based Solutions for AAL

    Authors: Slavisa Aleksic, Michael Atanasov, Jean Calleja Agius, Kenneth Camilleri, Anto Cartolovni, Pau Climent-Peerez, Sara Colantonio, Stefania Cristina, Vladimir Despotovic, Hazim Kemal Ekenel, Ekrem Erakin, Francisco Florez-Revuelta, Danila Germanese, Nicole Grech, Steinunn Gróa Sigurðardóttir, Murat Emirzeoglu, Ivo Iliev, Mladjan Jovanovic, Martin Kampel, William Kearns, Andrzej Klimczuk, Lambros Lambrinos, Jennifer Lumetzberger, Wiktor Mucha, Sophie Noiret , et al. (14 additional authors not shown)

    Abstract: The report illustrates the state of the art of the most successful AAL applications and functions based on audio and video data, namely (i) lifelogging and self-monitoring, (ii) remote monitoring of vital signs, (iii) emotional state recognition, (iv) food intake monitoring, activity and behaviour recognition, (v) activity and personal assistance, (vi) gesture recognition, (vii) fall detection and… ▽ More

    Submitted 5 July, 2022; v1 submitted 26 June, 2022; originally announced July 2022.

    ACM Class: I.2

  13. arXiv:2206.08405  [pdf, ps, other

    cs.CV

    Going Deeper than Tracking: a Survey of Computer-Vision Based Recognition of Animal Pain and Affective States

    Authors: Sofia Broomé, Marcelo Feighelstein, Anna Zamansky, Gabriel Carreira Lencioni, Pia Haubro Andersen, Francisca Pessanha, Marwa Mahmoud, Hedvig Kjellström, Albert Ali Salah

    Abstract: Advances in animal motion tracking and pose recognition have been a game changer in the study of animal behavior. Recently, an increasing number of works go 'deeper' than tracking, and address automated recognition of animals' internal states such as emotions and pain with the aim of improving animal welfare, making this a timely moment for a systematization of the field. This paper provides a com… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  14. Federated learning for violence incident prediction in a simulated cross-institutional psychiatric setting

    Authors: Thomas Borger, Pablo Mosteiro, Heysem Kaya, Emil Rijcken, Albert Ali Salah, Floortje Scheepers, Marco Spruit

    Abstract: Inpatient violence is a common and severe problem within psychiatry. Knowing who might become violent can influence staffing levels and mitigate severity. Predictive machine learning models can assess each patient's likelihood of becoming violent based on clinical notes. Yet, while machine learning models benefit from having more data, data availability is limited as hospitals typically do not sha… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Journal ref: Expert Systems with Applications Volume 199, 1 August 2022, 116720

  15. arXiv:2202.06766  [pdf, other

    eess.AS cs.LG cs.SD

    Speech Analysis for Automatic Mania Assessment in Bipolar Disorder

    Authors: Pınar Baki, Heysem Kaya, Elvan Çiftçi, Hüseyin Güleç, Albert Ali Salah

    Abstract: Bipolar disorder is a mental disorder that causes periods of manic and depressive episodes. In this work, we classify recordings from Bipolar Disorder corpus that contain 7 different tasks, into hypomania, mania, and remission classes using only speech features. We perform our experiments on splitted tasks from the interviews. Best results achieved on the model trained with 6th and 7th tasks toget… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: Conference, 5 pages, in Turkish language

  16. Benchmarking Quality-Dependent and Cost-Sensitive Score-Level Multimodal Biometric Fusion Algorithms

    Authors: Norman Poh, Thirimachos Bourlai, Josef Kittler, Lorene Allano, Fernando Alonso-Fernandez, Onkar Ambekar, John Baker, Bernadette Dorizzi, Omolara Fatukasi, Julian Fierrez, Harald Ganster, Javier Ortega-Garcia, Donald Maurer, Albert Ali Salah, Tobias Scheidat, Claus Vielhauer

    Abstract: Automatically verifying the identity of a person by means of biometrics is an important application in day-to-day activities such as accessing banking services and security control in airports. To increase the system reliability, several biometric devices are often used. Such a combined system is known as a multimodal biometric system. This paper reports a benchmarking study carried out within the… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Published at IEEE Transactions on Information Forensics and Security journal

  17. arXiv:2009.03432  [pdf, other

    cs.CL cs.HC cs.LG

    Is Everything Fine, Grandma? Acoustic and Linguistic Modeling for Robust Elderly Speech Emotion Recognition

    Authors: Gizem Soğancıoğlu, Oxana Verkholyak, Heysem Kaya, Dmitrii Fedotov, Tobias Cadèe, Albert Ali Salah, Alexey Karpov

    Abstract: Acoustic and linguistic analysis for elderly emotion recognition is an under-studied and challenging research direction, but essential for the creation of digital assistants for the elderly, as well as unobtrusive telemonitoring of elderly in their residences for mental healthcare purposes. This paper presents our contribution to the INTERSPEECH 2020 Computational Paralinguistics Challenge (ComPar… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: 5 pages, 1 figure, Interspeech 2020

  18. arXiv:2003.12347  [pdf

    cs.CY

    Mobile phone data and COVID-19: Missing an opportunity?

    Authors: Nuria Oliver, Emmanuel Letouzé, Harald Sterly, Sébastien Delataille, Marco De Nadai, Bruno Lepri, Renaud Lambiotte, Richard Benjamins, Ciro Cattuto, Vittoria Colizza, Nicolas de Cordes, Samuel P. Fraiberger, Till Koebe, Sune Lehmann, Juan Murillo, Alex Pentland, Phuong N Pham, Frédéric Pivetta, Albert Ali Salah, Jari Saramäki, Samuel V. Scarpino, Michele Tizzoni, Stefaan Verhulst, Patrick Vinck

    Abstract: This paper describes how mobile phone data can guide government and public health authorities in determining the best course of action to control the COVID-19 pandemic and in assessing the effectiveness of control measures such as physical distancing. It identifies key gaps and reasons why this kind of data is only scarcely used, although their value in similar epidemics has proven in a number of… ▽ More

    Submitted 27 March, 2020; originally announced March 2020.

  19. arXiv:1807.00523  [pdf, other

    cs.CY

    Data for Refugees: The D4R Challenge on Mobility of Syrian Refugees in Turkey

    Authors: Albert Ali Salah, Alex Pentland, Bruno Lepri, Emmanuel Letouze, Patrick Vinck, Yves-Alexandre de Montjoye, Xiaowen Dong, Ozge Dagdelen

    Abstract: The Data for Refugees (D4R) Challenge is a non-profit challenge initiated to improve the conditions of the Syrian refugees in Turkey by providing a special database to scientific community for enabling research on urgent problems concerning refugees, including health, education, unemployment, safety, and social integration. The collected database is based on anonymised mobile Call Detail Record (C… ▽ More

    Submitted 14 October, 2018; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: See http://d4r.turktelekom.com.tr/ for more information on the D4R Challenge

  20. arXiv:1802.00745  [pdf, other

    cs.CV

    Explaining First Impressions: Modeling, Recognizing, and Explaining Apparent Personality from Videos

    Authors: Hugo Jair Escalante, Heysem Kaya, Albert Ali Salah, Sergio Escalera, Yagmur Gucluturk, Umut Guclu, Xavier Baro, Isabelle Guyon, Julio Jacques Junior, Meysam Madadi, Stephane Ayache, Evelyne Viegas, Furkan Gurpinar, Achmadnoer Sukma Wicaksana, Cynthia C. S. Liem, Marcel A. J. van Gerven, Rob van Lier

    Abstract: Explainability and interpretability are two critical aspects of decision support systems. Within computer vision, they are critical in certain tasks related to human behavior analysis such as in health care applications. Despite their importance, it is only recently that researchers are starting to explore these aspects. This paper provides an introduction to explainability and interpretability in… ▽ More

    Submitted 28 September, 2019; v1 submitted 2 February, 2018; originally announced February 2018.

    Comments: Preprint submitted to TAC

  21. arXiv:1507.02801  [pdf, other

    stat.ML cs.IT cs.LG

    Adaptive Mixtures of Factor Analyzers

    Authors: Heysem Kaya, Albert Ali Salah

    Abstract: A mixture of factor analyzers is a semi-parametric density estimator that generalizes the well-known mixtures of Gaussians model by allowing each Gaussian in the mixture to be represented in a different lower-dimensional manifold. This paper presents a robust and parsimonious model selection algorithm for training a mixture of factor analyzers, carrying out simultaneous clustering and locally line… ▽ More

    Submitted 22 October, 2015; v1 submitted 10 July, 2015; originally announced July 2015.

    Comments: Pre-print has 30 pages including the appendix and references. A MATLAB tool of the proposed method is available (see the conclusions section)

    ACM Class: G.3; I.5.4

  22. arXiv:1306.3783  [pdf

    cs.DL physics.soc-ph

    UDC in Action

    Authors: Richard Smiraglia, Andrea Scharnhorst, Almila Akdag Salah, Cheng Gao

    Abstract: The UDC (Universal Decimal Classification) is not only a classification language with a long history; it also presents a complex cognitive system worthy of the attention of complexity theory. The elements of the UDC: classes, auxiliaries, and operations are combined into symbolic strings, which in essence represent a complex networks of concepts. This network forms a backbone of ordering of knowle… ▽ More

    Submitted 17 June, 2013; originally announced June 2013.

    Comments: Accepted for the UDCC seminar 2013

    ACM Class: H.3.7

  23. arXiv:1304.5753  [pdf

    cs.DL

    Map** EINS -- An exercise in map** the Network of Excellence in Internet Science

    Authors: Almila Akdag Salah, Sally Wyatt, Samir Passi, Andrea Scharnhorst

    Abstract: This paper demonstrates the application of bibliometric map** techniques in the area of funded research networks. We discuss how science maps can be used to facilitate communication inside newly formed communities, but also to account for their activities to funding agencies. We present the map** of EINS as case -- an FP7 funded Network of Excellence. Finally, we discuss how these techniques c… ▽ More

    Submitted 16 July, 2013; v1 submitted 21 April, 2013; originally announced April 2013.

    Journal ref: Conference Proceedings of the First International Conference on Internet Science, April 9-11, 2013 Brussels. Pages 75-78

  24. arXiv:1204.3769  [pdf

    cs.DL physics.soc-ph

    The evolution of classification systems: Ontogeny of the UDC

    Authors: Almila Akdag Salah, Cheng Gao, Krzysztof Suchecki, Andrea Scharnhorst, Richard P. Smiraglia

    Abstract: To classify is to put things in meaningful groups, but the criteria for doing so can be problematic. Study of evolution of classification includes ontogenetic analysis of change in classification over time. We present an empirical analysis of the UDC over the entire period of its development. We demonstrate stability in main classes, with major change driven by 20th century scientific developments… ▽ More

    Submitted 17 April, 2012; originally announced April 2012.

    Comments: ISKO conference 2012

  25. arXiv:1203.0788  [pdf, ps, other

    physics.soc-ph cs.DL cs.SI

    Evolution of Wikipedia's Category Structure

    Authors: Krzysztof Suchecki, Alkim Almila Akdag Salah, Cheng Gao, Andrea Scharnhorst

    Abstract: Wikipedia, as a social phenomenon of collaborative knowledge creating, has been studied extensively from various points of views. The category system of Wikipedia, introduced in 2004, has attracted relatively little attention. In this study, we focus on the documentation of knowledge, and the transformation of this documentation with time. We take Wikipedia as a proxy for knowledge in general and… ▽ More

    Submitted 4 March, 2012; originally announced March 2012.

    Comments: Preprint of an article submitted for consideration in Advances in Complex Systems (2012) http://www.worldscinet.com/acs/, 19 pages, 7 figures

  26. arXiv:1105.5912  [pdf

    cs.DL cs.IR physics.soc-ph

    Need to categorize: A comparative look at the categories of the Universal Decimal Classification system (UDC) and Wikipedia

    Authors: Almila Akdag Salah, Cheng Gao, Krzysztof Suchecki, Andrea Scharnhorst

    Abstract: This study analyzes the differences between the category structure of the Universal Decimal Classification (UDC) system (which is one of the widely used library classification systems in Europe) and Wikipedia. In particular, we compare the emerging structure of category-links to the structure of classes in the UDC. With this comparison we would like to scrutinize the question of how do knowledge m… ▽ More

    Submitted 30 May, 2011; originally announced May 2011.

    Comments: Paper for High Throughput Humanities - a satellite meeting at the European Conference on Complex Systems 2010; Sept. 15, 2010 Lisbon University Institute ISCTE, Lisbon, Portugal

  27. arXiv:1102.1934  [pdf

    cs.DL physics.soc-ph

    The structure of the Arts & Humanities Citation Index: A map** on the basis of aggregated citations among 1,157 journals

    Authors: Loet Leydesdorff, Björn Hammarfelt, Alkim Almila Akdag Salah

    Abstract: Using the Arts & Humanities Citation Index (A&HCI) 2008, we apply map** techniques previously developed for map** journal structures in the Science and Social Science Citation Indices. Citation relations among the 110,718 records were aggregated at the level of 1,157 journals specific to the A&HCI, and the journal structures are questioned on whether a cognitive structure can be reconstructed… ▽ More

    Submitted 20 July, 2011; v1 submitted 9 February, 2011; originally announced February 2011.

  28. arXiv:1009.1466  [pdf

    cs.DL

    The Development of the Journal Environment of Leonardo

    Authors: Alkim Almila Akdag Salah, Loet Leydesdorff

    Abstract: We present animations based on the aggregated journal-journal citations of Leonardo during the period 1974-2008. Leonardo is mainly cited by journals outside the arts domain for cultural reasons, for example, in neuropsychology and physics. Articles in Leonardo itself cite a large number of journals, but with a focus on the arts. Animations at this level of aggregation enable us to show the histor… ▽ More

    Submitted 8 September, 2010; originally announced September 2010.

  29. arXiv:0912.3098  [pdf

    cs.DL physics.soc-ph

    Maps on the basis of the Arts & Humanities Citation Index: The journals Leonardo and Art Journal versus "Digital Humanities" as a topic

    Authors: Loet Leydesdorff, Alkim Almila Akdag Salah

    Abstract: The possibilities of using the Arts & Humanities Citation Index (A&HCI) for journal map** have not been sufficiently recognized because of the absence of a Journal Citations Report (JCR) for this database. A quasi-JCR for the A&HCI (2008) was constructed from the data contained in the Web-of-Science and is used for the evaluation of two journals as examples: Leonardo and Art Journal. The maps… ▽ More

    Submitted 16 December, 2009; originally announced December 2009.