Showing 1–50 of 81 results for author: Smeaton, A

Searching in archive cs.
  1. arXiv:2404.01322  [pdf, other]

    cs.CL cs.AI

    A Review of Multi-Modal Large Language and Vision Models

    Authors: Kilian Carolan, Laura Fennelly, Alan F. Smeaton

    Abstract: Large Language Models (LLMs) have recently emerged as a focal point of research and application, driven by their unprecedented ability to understand and generate text with human-like quality. Even more recently, LLMs have been extended into multi-modal large language models (MM-LLMs) which extend their capabilities to deal with image, video and audio information, in addition to text. This opens u… ▽ More

    Submitted 28 March, 2024; originally announced April 2024.

    Comments: 33 pages, 1 figure

  2. arXiv:2401.15448  [pdf, other]

    cs.CV

    A Systematic Review of Available Datasets in Additive Manufacturing

    Authors: Xiao Liu, Alessandra Mileo, Alan F. Smeaton

    Abstract: In-situ monitoring incorporating data from visual and other sensor technologies allows the collection of extensive datasets during the Additive Manufacturing (AM) process. These datasets have potential for determining the quality of the manufactured output and the detection of defects through the use of Machine Learning during the manufacturing process. Open and annotated datasets derived from AM… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 24 pages

  3. arXiv:2401.05767  [pdf, other]

    cs.IR cs.HC

    Lifelogging As An Extreme Form of Personal Information Management -- What Lessons To Learn

    Authors: Ly-Duyen Tran, Cathal Gurrin, Alan F. Smeaton

    Abstract: Personal data includes the digital footprints that we leave behind as part of our everyday activities, both online and offline in the real world. It includes data we collect ourselves, such as from wearables, as well as the data collected by others about our online behaviour and activities. Sometimes we are able to use the personal data we ourselves collect, in order to examine some parts of our l… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Journal ref: IEEE Data Engineering Bulletin 47 (4), 18-29, 2023

  4. arXiv:2311.16514  [pdf, other]

    cs.CV cs.AI cs.LG

    Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation: A Unified Approach

    Authors: Ayush K. Rai, Tarun Krishna, Feiyan Hu, Alexandru Drimbarean, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: Video Anomaly Detection (VAD) is an open-set recognition task, which is usually formulated as a one-class classification (OCC) problem, where training data is comprised of videos with normal instances while test data contains both normal and anomalous instances. Recent works have investigated the creation of pseudo-anomalies (PAs) using only the normal data and making strong assumptions about real… ▽ More

    Submitted 7 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted in CVPRW 2024 - VAND Workshop

  5. arXiv:2311.06221  [pdf, other]

    cs.CL

    A Comparison of Lexicon-Based and ML-Based Sentiment Analysis: Are There Outlier Words?

    Authors: Siddhant Jaydeep Mahajani, Shashank Srivastava, Alan F. Smeaton

    Abstract: Lexicon-based approaches to sentiment analysis of text are based on each word or lexical entry having a pre-defined weight indicating its sentiment polarity. These are usually manually assigned, but their accuracy when compared against machine learning based approaches to computing sentiment is not known. It may be that there are lexical entries whose sentiment values cause a lexicon-based… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 4 pages, to appear in Proceedings of the 31st Irish Conference on Artificial Intelligence and Cognitive Science. December 7th-8th, 2023

  6. arXiv:2309.16704  [pdf, other]

    q-bio.NC cs.CV eess.SP

    Memories in the Making: Predicting Video Memorability with Encoding Phase EEG

    Authors: Lorin Sweeney, Graham Healy, Alan F. Smeaton

    Abstract: In a world of ephemeral moments, our brain diligently sieves through a cascade of experiences, like a skilled gold prospector searching for precious nuggets amidst the river's relentless flow. This study delves into the elusive "moment of memorability" -- a fleeting, yet vital instant where experiences are prioritised for consolidation in our memory. By transforming subjects' encoding phase electr… ▽ More

    Submitted 16 August, 2023; originally announced September 2023.

    Comments: Content-Based Multimedia Indexing, CBMI, September 20-22, Orleans, France, 2023

  7. arXiv:2309.11891  [pdf, other]

    eess.IV cs.CV

    Heart Rate Detection Using an Event Camera

    Authors: Aniket Jagtap, RamaKrishna Venkatesh Saripalli, Joe Lemley, Waseem Shariff, Alan F. Smeaton

    Abstract: Event cameras, also known as neuromorphic cameras, are an emerging technology that offers advantages over traditional shutter and frame-based cameras, including high temporal resolution, low power consumption, and selective data acquisition. In this study, we propose to harness the capabilities of event-based cameras to capture subtle changes in the surface of the skin caused by the pulsatile flo… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Dataset available at https://doi.org/10.6084/m9.figshare.24039501.v1

  8. Using Saliency and Cropping to Improve Video Memorability

    Authors: Vaibhav Mudgal, Qingyang Wang, Lorin Sweeney, Alan F. Smeaton

    Abstract: Video memorability is a measure of how likely a particular video is to be remembered by a viewer when that viewer has no emotional connection with the video content. It is an important characteristic as videos that are more memorable are more likely to be shared, viewed, and discussed. This paper presents results of a series of experiments where we improved the memorability of a video by selective… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 12 pages

  9. arXiv:2309.08009  [pdf, other]

    cs.CV cs.MM

    Measuring the Quality of Text-to-Video Model Outputs: Metrics and Dataset

    Authors: Iya Chivileva, Philip Lynch, Tomas E. Ward, Alan F. Smeaton

    Abstract: Evaluating the quality of videos generated from text-to-video (T2V) models is important if they are to produce plausible outputs that convince a viewer of their authenticity. We examine some of the metrics used in this area and highlight their limitations. The paper presents a dataset of more than 1,000 generated videos from 5 very recent T2V models on which some of those commonly used quality met… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 13 pages

  10. Handwriting Analysis on the Diaries of Rosamond Jacob

    Authors: Sharmistha S. Sawant, Saloni D. Thakare, Derek Greene, Gerardine Meaney, Alan F. Smeaton

    Abstract: Handwriting is an art form that most people learn at an early age. Each person's writing style is unique with small changes as we grow older and as our mood changes. Here we analyse handwritten text in a culturally significant personal diary. We compare changes in handwriting and relate this to the sentiment of the written material and to the topic of diary entries. We identify handwritten text fr… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: International Conference on Content-based Multimedia Indexing, September 20-22, 2023, Orleans, France

  11. Domain Generalisation with Bidirectional Encoder Representations from Vision Transformers

    Authors: Hamza Riaz, Alan F. Smeaton

    Abstract: Domain generalisation involves pooling knowledge from source domain(s) into a single model that can generalise to unseen target domain(s). Recent research in domain generalisation has faced challenges when using deep learning models as they interact with data distributions which differ from those they are trained on. Here we perform domain generalisation on out-of-distribution (OOD) vision benchma… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: 4 pages, accepted at the Irish Machine Vision and Image Processing Conference (IMVIP), Galway, August 2023

  12. Defect Classification in Additive Manufacturing Using CNN-Based Vision Processing

    Authors: Xiao Liu, Alessandra Mileo, Alan F. Smeaton

    Abstract: The development of computer vision and in-situ monitoring using visual sensors allows the collection of large datasets from the additive manufacturing (AM) process. Such datasets could be used with machine learning techniques to improve the quality of AM. This paper examines two scenarios: first, using convolutional neural networks (CNNs) to accurately classify defects in an image dataset from AM… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: 4 pages, accepted at the Irish Machine Vision and Image Processing Conference (IMVIP), Galway, August 2023

  13. Automatically detecting activities of daily living from in-home sensors as indicators of routine behaviour in an older population

    Authors: Claire M. Timon, Pamela Hussey, Hyowon Lee, Catriona Murphy, Harsh Vardan Rai, Alan F. Smeaton

    Abstract: Objective: The NEX project has developed an integrated Internet of Things (IoT) system coupled with data analytics to offer unobtrusive health and wellness monitoring supporting older adults living independently at home. Monitoring currently involves visualising a set of automatically detected activities of daily living (ADLs) for each participant. The detection of ADLs is achieved to allow t… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 11 pages, 7 Figures, 2 tables

    Journal ref: DIGITAL HEALTH. 2023;9

  14. Calculating the matrix profile from noisy data

    Authors: Colin Hehir, Alan F. Smeaton

    Abstract: The matrix profile (MP) is a data structure computed from a time series which encodes the data required to locate motifs and discords, corresponding to recurring patterns and outliers respectively. When the time series contains noisy data then the conventional approach is to pre-filter it in order to remove noise but this cannot apply in unsupervised settings where patterns and outliers are not an… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 16 pages

    Journal ref: PLoS ONE 18(6): e0286763

  15. arXiv:2305.05780  [pdf, other]

    cs.SD cs.LG eess.AS

    Enhancing Gappy Speech Audio Signals with Generative Adversarial Networks

    Authors: Deniss Strods, Alan F. Smeaton

    Abstract: Gaps, dropouts and short clips of corrupted audio are a common problem and particularly annoying when they occur in speech. This paper uses machine learning to regenerate gaps of up to 320ms in an audio speech signal. Audio regeneration is translated into image regeneration by transforming audio into a Mel-spectrogram and using image in-painting to regenerate the gaps. The full Mel-spectrogram is… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 7 pages, 4 figures, 4 tables. 34th Irish Signals and Systems Conferences, 13-14 June 2023

  16. Automatic Detection of Signalling Behaviour from Assistance Dogs as they Forecast the Onset of Epileptic Seizures in Humans

    Authors: Hitesh Raju, Ankit Sharma, Aoife Smeaton, Alan F. Smeaton

    Abstract: Epilepsy, or the occurrence of epileptic seizures, is one of the world's most well-known neurological disorders affecting millions of people. Seizures mostly occur due to non-coordinated electrical discharges in the human brain and may cause damage, including collapse and loss of consciousness. If the onset of a seizure can be forecast then the subject can be placed into a safe environment or posit… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 8 pages, 5 tables, 6 figures

    Journal ref: The 38th ACM/SIGAPP Symposium on Applied Computing (SAC '23), March 27-April 2, 2023, Tallinn, Estonia

  17. arXiv:2302.09293  [pdf, other]

    cs.CY physics.data-an

    Periodicity Intensity Reveals Insights into Time Series Data: Three Use Cases

    Authors: Alan F. Smeaton, Feiyan Hu

    Abstract: Periodic phenomena are oscillating signals found in many naturally-occurring time series. A periodogram can be used to measure the intensities of oscillations at different frequencies over an entire time series but sometimes we are interested in measuring how periodicity intensity at a specific frequency varies throughout the time series. This can be done by calculating periodicity intensity withi… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 14 pages, 6 figures, a

    Journal ref: Algorithms 2023, 16, 119

  18. arXiv:2301.09164  [pdf, other]

    cs.LG cs.CV

    Unifying Synergies between Self-supervised Learning and Dynamic Computation

    Authors: Tarun Krishna, Ayush K Rai, Alexandru Drimbarean, Eric Arazo, Paul Albert, Alan F Smeaton, Kevin McGuinness, Noel E O'Connor

    Abstract: Computationally expensive training strategies make self-supervised learning (SSL) impractical for resource constrained industrial settings. Techniques like knowledge distillation (KD), dynamic computation (DC), and pruning are often used to obtain a lightweight model, which usually involves multiple epochs of fine-tuning (or distilling steps) of a large pre-trained model, making it more computation… ▽ More

    Submitted 9 September, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: Accepted in BMVC 2023

  19. Vision Based Machine Learning Algorithms for Out-of-Distribution Generalisation

    Authors: Hamza Riaz, Alan F. Smeaton

    Abstract: There are many computer vision applications including object segmentation, classification, object detection, and reconstruction for which machine learning (ML) shows state-of-the-art performance. Nowadays, we can build ML tools for such applications with real-world accuracy. However, each tool works well within the domain in which it has been trained and developed. Often, when we train a model on… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: Computing Conference, 22-23 June 2023, London, United Kingdom. 15 pages, 5 Figures, 3 Tables

  20. arXiv:2212.10273  [pdf, other]

    cs.LG cs.AI math.NA

    Managing Large Dataset Gaps in Urban Air Quality Prediction: DCU-Insight-AQ at MediaEval 2022

    Authors: Dinh Viet Cuong, Phuc H. Le-Khac, Adam Stapleton, Elke Eichlemann, Mark Roantree, Alan F. Smeaton

    Abstract: Calculating an Air Quality Index (AQI) typically uses data streams from air quality sensors deployed at fixed locations and the calculation is a real time process. If one or a number of sensors are broken or offline, then the real time AQI value cannot be computed. Estimating AQI values for some point in the future is a predictive process and uses historical AQI values to train and build models. I… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 5 pages, 1 Figure, 1 Table

  21. arXiv:2212.09308  [pdf, other]

    cs.CV cs.AI

    Diffusing Surrogate Dreams of Video Scenes to Predict Video Memorability

    Authors: Lorin Sweeney, Graham Healy, Alan F. Smeaton

    Abstract: As part of the MediaEval 2022 Predicting Video Memorability task we explore the relationship between visual memorability, the visual representation that characterises it, and the underlying concept portrayed by that visual representation. We achieve state-of-the-art memorability prediction performance with a model trained and tested exclusively on surrogate dream images, elevating concepts to the… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 5 pages, 3 figures, 1 table, MediaEval-22: Multimedia Evaluation Workshop, 13-15 January 2023, Bergen, Norway and Online

  22. arXiv:2212.06516  [pdf, other]

    cs.CV cs.AI cs.MM

    Overview of The MediaEval 2022 Predicting Video Memorability Task

    Authors: Lorin Sweeney, Mihai Gabriel Constantin, Claire-Hélène Demarty, Camilo Fosco, Alba G. Seco de Herrera, Sebastian Halder, Graham Healy, Bogdan Ionescu, Ana Matran-Fernandez, Alan F. Smeaton, Mushfika Sultana

    Abstract: This paper describes the 5th edition of the Predicting Video Memorability Task as part of MediaEval2022. This year we have reorganised and simplified the task in order to lubricate a greater depth of inquiry. Similar to last year, two datasets are provided in order to facilitate generalisation, however, this year we have replaced the TRECVid2019 Video-to-Text dataset with the VideoMem dataset in o… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: 6 pages. In: MediaEval Multimedia Benchmark Workshop Working Notes, 2022

  23. arXiv:2212.06153  [pdf, other]

    cs.LG cs.AI cs.CV

    An adaptive human-in-the-loop approach to emission detection of Additive Manufacturing processes and active learning with computer vision

    Authors: Xiao Liu, Alan F. Smeaton, Alessandra Mileo

    Abstract: Recent developments in in-situ monitoring and process control in Additive Manufacturing (AM), also known as 3D-printing, allow the collection of large amounts of emission data during the build process of the parts being manufactured. This data can be used as input into 3D and 2D representations of the 3D-printed parts. However, the analysis and use, as well as the characterization of this data sti… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: 7 pages, 9 figures, 1 table. Presented at The 6th IEEE Workshop on Human-in-the-Loop Methods and Future of Work in BigData (IEEE HMData 2022) December 2022

  24. arXiv:2212.03955  [pdf, other]

    cs.CV cs.AI

    Experiences from the MediaEval Predicting Media Memorability Task

    Authors: Alba García Seco de Herrera, Mihai Gabriel Constantin, Claire-Hélène Demarty, Camilo Fosco, Sebastian Halder, Graham Healy, Bogdan Ionescu, Ana Matran-Fernandez, Alan F. Smeaton, Mushfika Sultana, Lorin Sweeney

    Abstract: The Predicting Media Memorability task in the MediaEval evaluation campaign has been running annually since 2018 and several different tasks and data sets have been used in this time. This has allowed us to compare the performance of many memorability prediction techniques on the same data and in a reproducible way and to refine and improve on those techniques. The resources created to compute med… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 7 pages, 2 figures, 1 table. Presented at the NeurIPS 2022 Workshop on Memory in Artificial and Real Intelligence (MemARI), 2 December 2022, New Orleans, USA

  25. arXiv:2210.05574  [pdf, other]

    cs.CV cs.AI cs.LG

    Motion Aware Self-Supervision for Generic Event Boundary Detection

    Authors: Ayush K. Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The task of Generic Event Boundary Detection (GEBD) aims to detect moments in videos that are naturally perceived by humans as generic and taxonomy-free event boundaries. Modeling the dynamically evolving temporal and spatial changes in a video makes GEBD a difficult problem to solve. Existing approaches involve very complex and sophisticated pipelines in terms of architectural design choices, hen… ▽ More

    Submitted 12 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  26. arXiv:2208.03479  [pdf, other]

    cs.CV cs.AI cs.MM

    Analysing the Memorability of a Procedural Crime-Drama TV Series, CSI

    Authors: Sean Cummins, Lorin Sweeney, Alan F. Smeaton

    Abstract: We investigate the memorability of a 5-season span of a popular crime-drama TV series, CSI, through the application of a vision transformer fine-tuned on the task of predicting video memorability. By investigating the popular genre of crime-drama TV through the use of a detailed annotated corpus combined with video memorability scores, we show how to extrapolate meaning from the memorability score… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 7 pages, accepted to CBMI 2022

  27. Playback-centric visualisations of video usage using weighted interactions to guide where to watch in an educational context

    Authors: Hyowon Lee, Mingming Liu, Michael Scriney, Alan F. Smeaton

    Abstract: The increase in use of online educational tools has led to a large amount of educational video materials made available for students. Finding the right video content is usually supported by the overarching learning management system and its interface that organises video items by course, categories and weeks, and makes them searchable. However, once a video is found, students are left without furt… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Journal ref: Front. Educ. 7:733646 (2022)

  28. arXiv:2207.12065  [pdf, other]

    cs.CV

    Dynamic Channel Selection in Self-Supervised Learning

    Authors: Tarun Krishna, Ayush K. Rai, Yasser A. D. Djilali, Alan F. Smeaton, Kevin McGuinness, Noel E. O'Connor

    Abstract: Whilst computer vision models built using self-supervised approaches are now commonplace, some important questions remain. Do self-supervised models learn highly redundant channel features? What if a self-supervised network could dynamically select the important channels and get rid of the unnecessary ones? Currently, convnets pre-trained with self-supervision have obtained comparable performance… ▽ More

    Submitted 16 December, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted in Irish Machine Vision and Image Processing Conference 2022

  29. Analysis of Individual Conversational Volatility in Tandem Telecollaboration for Second Language Learning

    Authors: Alan F. Smeaton, Aparajita Dey-Plissonneau, Hyowon Lee, Mingming Liu, Michael Scriney

    Abstract: Second language learning can be enabled by tandem collaboration where students are grouped into video conference calls while learning the native language of other student(s) on the calls. This places students in an online environment where the more outgoing can actively contribute and engage in dialogue while those more shy and unsure of their second language skills can sit back and coast through… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 21st European Conference on e-Learning, October 2022, Brighton, UK

  30. An Analysis of Conversational Volatility During Telecollaboration Sessions for Second Language Learning

    Authors: Aparajita Dey-Plissonneau, Hyowon Lee, Mingming Liu, Vyoma Patel, Michael Scriney, Alan F. Smeaton

    Abstract: Tandem telecollaboration is a pedagogy used in second language learning where mixed groups of students meet online in videoconferencing sessions to practice their conversational skills in their target language. We have built and deployed a system called L2 Learning to support post-session review and self-reflection on students' participation in such meetings. We automatically compute a metric calle… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: 8 pages

  31. arXiv:2201.00620  [pdf, other]

    q-bio.NC cs.HC cs.LG eess.SP

    Overview of the EEG Pilot Subtask at MediaEval 2021: Predicting Media Memorability

    Authors: Lorin Sweeney, Ana Matran-Fernandez, Sebastian Halder, Alba G. Seco de Herrera, Alan Smeaton, Graham Healy

    Abstract: The aim of the Memorability-EEG pilot subtask at MediaEval'2021 is to promote interest in the use of neural signals -- either alone or in combination with other data sources -- in the context of predicting video memorability by highlighting the utility of EEG data. The dataset created consists of pre-extracted features from EEG recordings of subjects while watching a subset of videos from Predicti… ▽ More

    Submitted 15 December, 2021; originally announced January 2022.

    Comments: 3 pages

  32. arXiv:2112.07969  [pdf, ps, other]

    cs.CV cs.AI

    Predicting Media Memorability: Comparing Visual, Textual and Auditory Features

    Authors: Lorin Sweeney, Graham Healy, Alan F. Smeaton

    Abstract: This paper describes our approach to the Predicting Media Memorability task in MediaEval 2021, which aims to address the question of media memorability by setting the task of automatically predicting video memorability. This year we tackle the task from a comparative standpoint, looking to gain deeper insights into each of three explored modalities, and using our results from last year's submissio… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 3 pages

  33. arXiv:2112.05982  [pdf, ps, other]

    cs.CV cs.AI cs.MM

    Overview of The MediaEval 2021 Predicting Media Memorability Task

    Authors: Rukiye Savran Kiziltepe, Mihai Gabriel Constantin, Claire-Helene Demarty, Graham Healy, Camilo Fosco, Alba Garcia Seco de Herrera, Sebastian Halder, Bogdan Ionescu, Ana Matran-Fernandez, Alan F. Smeaton, Lorin Sweeney

    Abstract: This paper describes the MediaEval 2021 Predicting Media Memorability task, which is in its 4th edition this year, as the prediction of short-term and long-term video memorability remains a challenging task. In 2021, two datasets of videos are used: first, a subset of the TRECVid 2019 Video-to-Text dataset; second, the Memento10K dataset in order to provide opportunities to explore cross-dataset g… ▽ More

    Submitted 11 December, 2021; originally announced December 2021.

    Comments: 3 pages, to appear in Proceedings of MediaEval 2021, December 13-15 2021, Online

  34. An Annotated Video Dataset for Computing Video Memorability

    Authors: Rukiye Savran Kiziltepe, Lorin Sweeney, Mihai Gabriel Constantin, Faiyaz Doctor, Alba Garcia Seco de Herrera, Claire-Helene Demarty, Graham Healy, Bogdan Ionescu, Alan F. Smeaton

    Abstract: Using a collection of publicly available links to short form video clips of an average of 6 seconds duration each, 1,275 users manually annotated each video multiple times to indicate both long-term and short-term memorability of the videos. The annotations were gathered as part of an online memory game and measured a participant's ability to recall having seen the video previously when shown a co… ▽ More

    Submitted 4 December, 2021; originally announced December 2021.

    Comments: 11 pages

    Journal ref: Data in Brief, Volume 39, 107671, (2021), ISSN 2352-3409

  35. Using a GAN to Generate Adversarial Examples to Facial Image Recognition

    Authors: Andrew Merrigan, Alan F. Smeaton

    Abstract: Images posted online present a privacy concern in that they may be used as reference examples for a facial recognition system. Such abuse of images is in violation of privacy rights but is difficult to counter. It is well established that adversarial example images can be created for recognition systems which are based on deep neural networks. These adversarial examples can be used to disrupt the… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: 8 pages, to appear at the Media Watermarking, Security, and Forensics Conference at Electronic Imaging, January, 2022

  36. arXiv:2111.14557  [pdf, other]

    cs.CV

    Image Segmentation to Identify Safe Landing Zones for Unmanned Aerial Vehicles

    Authors: Joe Kinahan, Alan F. Smeaton

    Abstract: There is a marked increase in delivery services in urban areas, and with Jeff Bezos claiming that 86% of the orders that Amazon ships weigh less than 5 lbs, the time is ripe for investigation into economical methods of automating the final stage of the delivery process. With the advent of semi-autonomous drone delivery services, such as Irish startup 'Manna', and Malta's 'Skymax', the final step o… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 12 pages, to appear in Proceedings of the 29th Irish Conference on Artificial Intelligence and Cognitive Science AICS'2021, December 2021

    Journal ref: CEUR Workshop Proceedings Volume 3105, pp.235-247 urn:nbn:de:0074-3105-7 2021

  37. arXiv:2111.09243  [pdf, other]

    cs.HC

    An Investigation into Keystroke Dynamics and Heart Rate Variability as Indicators of Stress

    Authors: Srijith Unni, Sushma Suryanarayana Gowda, Alan F. Smeaton

    Abstract: Lifelogging has become a prominent research topic in recent years. Wearable sensors like Fitbits and smart watches are now increasingly popular for recording one's activities. Some researchers are also exploring keystroke dynamics for lifelogging. Keystroke dynamics refers to the process of measuring and assessing a person's typing rhythm on digital devices. A digital footprint is created when a use… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 12 pages. To appear at MMM 2022, 28th International Conference on Multimedia Modeling, 5-8 April 2022, Phu Quoc, Vietnam

  38. Facilitating reflection in teletandem through automatically generated conversation metrics and playback video

    Authors: Aparajita Dey-Plissonneau, Hyowon Lee, Michael Scriney, Alan F. Smeaton, Vincent Pradier, Hamza Riaz

    Abstract: This pilot study focuses on a tool called L2L that allows second language (L2) learners to visualise and analyse their Zoom interactions with native speakers. L2L uses the Zoom transcript to automatically generate conversation metrics and its playback feature with timestamps allows students to replay any chosen portion of the conversation for post-session reflection and self-review. This explorato… ▽ More

    Submitted 18 November, 2021; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: 5 pages

    Journal ref: CALL and professionalisation: short papers from EUROCALL 2021

  39. Computer Vision for Supporting Image Search

    Authors: Alan F. Smeaton

    Abstract: Computer vision and multimedia information processing have made extreme progress within the last decade and many tasks can be done with a level of accuracy as if done by humans, or better. This is because we leverage the benefits of huge amounts of data available for training, we have enormous computer processing available and we have seen the evolution of machine learning as a suite of techniques… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 10 pages

    Journal ref: Advances in Visual Informatics. H. Badioze Zaman et al (Eds). IVIC 2021, LNCS 13051, pp1-10, 2021

  40. Visual Selective Attention System to Intervene User Attention in Sharing COVID-19 Misinformation

    Authors: Zaid Amin, Nazlena Mohamad Ali, Alan F. Smeaton

    Abstract: Information sharing on social media must be accompanied by attentive behavior so that in a distorted digital environment, users are not rushed and distracted in deciding to share information. The spread of misinformation, especially those related to the COVID-19, can divide and create negative effects of falsehood in society. Individuals can also cause feelings of fear, health anxiety, and confusi… ▽ More

    Submitted 9 November, 2021; v1 submitted 26 October, 2021; originally announced October 2021.

    Report number: Volume 12 Issue 10, 2021

    Journal ref: International Journal of Advanced Computer Science and Applications (IJACSA) Volume 12 Issue 10, 2021

  41. arXiv:2106.13512

    cs.HC

    The L2L System for Second Language Learning Using Visualised Zoom Calls Among Students

    Authors: Aparajita Dey-Plissonneau, Hyowon Lee, Vincent Pradier, Michael Scriney, Alan F. Smeaton

    Abstract: An important part of second language learning is conversation which is best practised with speakers whose native language is the language being learned. We facilitate this by pairing students from different countries learning each others' native language. Mixed groups of students have Zoom calls, half in one language and half in the other, in order to practice and improve their conversation skills…

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: 16th European Conference on Technology-Enhanced Learning (EC-TEL), Bozen-Bolzano, Italy (online), September 2021

  42. arXiv:2106.13504

    cs.MM cs.HC

    Usage-based Summaries of Learning Videos

    Authors: Hyowon Lee, Mingming Liu, Michael Scriney, Alan F. Smeaton

    Abstract: Much of the delivery of University education is now by synchronous or asynchronous video. For students, one of the challenges is managing the sheer volume of such video material as video presentations of taught material are difficult to abbreviate and summarise because they do not have highlights which stand out. Apart from video bookmarks there are no tools available to determine which parts of v…

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: 16th European Conference on Technology-Enhanced Learning (EC-TEL), Bozen-Bolzano, Italy (online), September 2021

  43. arXiv:2106.10090

    cs.CV cs.AI

    Discerning Generic Event Boundaries in Long-Form Wild Videos

    Authors: Ayush K Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F Smeaton, Noel E O'Connor

    Abstract: Detecting generic, taxonomy-free event boundaries in videos represents a major stride forward towards holistic video understanding. In this paper we present a technique for generic event boundary detection based on a two stream inflated 3D convolutions architecture, which can learn spatio-temporal features from videos. Our work is inspired from the Generic Event Boundary Detection Challenge (part of…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: Technical Report for Generic Event Boundary Challenge - LOVEU Challenge (CVPR 2021)

  44. arXiv:2106.08936

    eess.IV cs.CV cs.LG cs.MM

    Improved CNN-based Learning of Interpolation Filters for Low-Complexity Inter Prediction in Video Coding

    Authors: Luka Murn, Saverio Blasi, Alan F. Smeaton, Marta Mrak

    Abstract: The versatility of recent machine learning approaches makes them ideal for improvement of next generation video compression solutions. Unfortunately, these approaches typically bring significant increases in computational complexity and are difficult to interpret into explainable models, affecting their potential for implementation within practical video coding applications. This paper introduces…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: IEEE Open Journal of Signal Processing Special Issue on Applied AI and Machine Learning for Video Coding and Streaming, June 2021

  45. arXiv:2105.03311

    cs.CL

    Translation Quality Assessment: A Brief Survey on Manual and Automatic Methods

    Authors: Lifeng Han, Gareth J. F. Jones, Alan F. Smeaton

    Abstract: To facilitate effective translation modeling and translation studies, one of the crucial questions to address is how to assess translation quality. From the perspectives of accuracy, reliability, repeatability and cost, translation quality assessment (TQA) itself is a rich and challenging task. In this work, we present a high-level and concise survey of TQA methods, including both manual judgement…

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: Accepted to 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021): Workshop on Modelling Translation: Translatology in the Digital Age (MoTra21). arXiv admin note: substantial text overlap with arXiv:1605.04515

  46. arXiv:2105.01705

    eess.IV cs.CV cs.LG cs.MM

    Attention-based Stylisation for Exemplar Image Colourisation

    Authors: Marc Gorriz Blanch, Issa Khalifeh, Alan Smeaton, Noel O'Connor, Marta Mrak

    Abstract: Exemplar-based colourisation aims to add plausible colours to a grayscale image using the guidance of a colour reference image. Most of the existing methods tackle the task as a style transfer problem, using a convolutional neural network (CNN) to obtain deep representations of the content of both inputs. Stylised outputs are then obtained by computing similarities between both feature representat…

    Submitted 4 May, 2021; originally announced May 2021.

  47. arXiv:2104.13473

    cs.CV cs.AI

    TRECVID 2020: A comprehensive campaign for evaluating video retrieval tasks across multiple application domains

    Authors: George Awad, Asad A. Butt, Keith Curtis, Jonathan Fiscus, Afzal Godil, Yooyoung Lee, Andrew Delgado, Jesse Zhang, Eliot Godard, Baptiste Chocot, Lukas Diduch, Jeffrey Liu, Alan F. Smeaton, Yvette Graham, Gareth J. F. Jones, Wessel Kraaij, Georges Quenot

    Abstract: The TREC Video Retrieval Evaluation (TRECVID) is a TREC-style video analysis and retrieval evaluation with the goal of promoting progress in research and development of content-based exploitation and retrieval of information from digital video via open, metrics-based evaluation. Over the last twenty years this effort has yielded a better understanding of how systems can effectively accomplish such…

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: TRECVID 2020 Workshop Overview Paper. arXiv admin note: substantial text overlap with arXiv:2009.09984

  48. The Influence of Audio on Video Memorability with an Audio Gestalt Regulated Video Memorability System

    Authors: Lorin Sweeney, Graham Healy, Alan F. Smeaton

    Abstract: Memories are the tethering threads that tie us to the world, and memorability is the measure of their tensile strength. The threads of memory are spun from fibres of many modalities, obscuring the contribution of a single fibre to a thread's overall tensile strength. Unfurling these fibres is the key to understanding the nature of their interaction, and how we can ultimately create more meaningful…

    Submitted 23 April, 2021; originally announced April 2021.

    Comments: 6 pages, 3 figures, 4 tables, paper accepted in CBMI 2021 for publication and oral presentation

  49. arXiv:2104.04497

    cs.CL cs.LG

    Chinese Character Decomposition for Neural MT with Multi-Word Expressions

    Authors: Lifeng Han, Gareth J. F. Jones, Alan F. Smeaton, Paolo Bolzoni

    Abstract: Chinese character decomposition has been used as a feature to enhance Machine Translation (MT) models, combining radicals into character and word level models. Recent work has investigated ideograph or stroke level embedding. However, questions remain about different decomposition levels of Chinese character representations, radical and strokes, best suited for MT. To investigate the impact of Chi…

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted to publish in NoDaLiDa2021

  50. arXiv:2102.04993

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding

    Authors: Marc Górriz, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to b…

    Submitted 9 February, 2021; originally announced February 2021.

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, 2020