Skip to main content

Showing 1–50 of 64 results for author: O'Connor, N E

.
  1. arXiv:2404.06941  [pdf, other

    eess.IV cs.CV

    Accelerating Cardiac MRI Reconstruction with CMRatt: An Attention-Driven Approach

    Authors: Anam Hashmi, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor

    Abstract: Cine cardiac magnetic resonance (CMR) imaging is recognised as the benchmark modality for the comprehensive assessment of cardiac function. Nevertheless, the acquisition process of cine CMR is considered as an impediment due to its prolonged scanning time. One commonly used strategy to expedite the acquisition process is through k-space undersampling, though it comes with a drawback of introducing… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: This paper has been submitted for the 32nd European Signal Processing Conference EUSIPCO 2024 in Lyon

  2. arXiv:2404.06362  [pdf, other

    cs.CV cs.AI

    Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation

    Authors: Sidra Aleem, Fangyijie Wang, Mayug Maniparambil, Eric Arazo, Julia Dietlmeier, Guenole Silvestre, Kathleen Curran, Noel E. O'Connor, Suzanne Little

    Abstract: The Segment Anything Model (SAM) and CLIP are remarkable vision foundation models (VFMs). SAM, a prompt driven segmentation model, excels in segmentation tasks across diverse domains, while CLIP is renowned for its zero shot recognition capabilities. However, their unified potential has not yet been explored in medical image segmentation. To adapt SAM to medical imaging, existing methods primarily… ▽ More

    Submitted 30 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  3. arXiv:2401.05224  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Do Vision and Language Encoders Represent the World Similarly?

    Authors: Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam, Noel E. O'Connor

    Abstract: Aligned text-image encoders such as CLIP have become the de facto model for vision-language tasks. Furthermore, modality-specific encoders achieve impressive performances in their respective domains. This raises a central question: does an alignment exist between uni-modal vision and language encoders since they fundamentally represent the same physical world? Analyzing the latent spaces structure… ▽ More

    Submitted 22 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Accepted CVPR 2024

  4. arXiv:2311.16514  [pdf, other

    cs.CV cs.AI cs.LG

    Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach

    Authors: Ayush K. Rai, Tarun Krishna, Feiyan Hu, Alexandru Drimbarean, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: Video Anomaly Detection (VAD) is an open-set recognition task, which is usually formulated as a one-class classification (OCC) problem, where training data is comprised of videos with normal instances while test data contains both normal and anomalous instances. Recent works have investigated the creation of pseudo-anomalies (PAs) using only the normal data and making strong assumptions about real… ▽ More

    Submitted 7 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted in CVPRW 2024 - VAND Workshop

  5. arXiv:2307.15401  [pdf, other

    physics.soc-ph

    Breathing Green: Maximising Health and Environmental Benefits for Active Transportation Users Leveraging Large Scale Air Quality Data

    Authors: Sen Yan, Shaoshu Zhu, Jaime B. Fernandez, Eric Arazo Sánchez, Yingqi Gu, Noel E. O'Connor, David O'Connor, Mingming Liu

    Abstract: Pollution in urban areas can have significant adverse effects on the health and well-being of citizens, with traffic-related air pollution being a major concern in many cities. Pollutants emitted by vehicles, such as nitrogen oxides, carbon monoxide, and particulate matter, can cause respiratory and cardiovascular problems, particularly for vulnerable road users like pedestrians and cyclists. Furt… ▽ More

    Submitted 4 August, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: The manuscript has been accepted by the IEEE ITSC 2023

  6. arXiv:2307.12033  [pdf, other

    cs.CV

    Self-Supervised and Semi-Supervised Polyp Segmentation using Synthetic Data

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Early detection of colorectal polyps is of utmost importance for their treatment and for colorectal cancer prevention. Computer vision techniques have the potential to aid professionals in the diagnosis stage, where colonoscopies are manually carried out to examine the entirety of the patient's colon. The main challenge in medical imaging is the lack of data, and a further challenge specific to po… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  7. arXiv:2307.11661  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts

    Authors: Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O'Connor

    Abstract: Contrastive pretrained large Vision-Language Models (VLMs) like CLIP have revolutionized visual representation learning by providing good performance on downstream datasets. VLMs are 0-shot adapted to a downstream dataset by designing prompts that are relevant to the dataset. Such prompt engineering makes use of domain expertise and a validation dataset. Meanwhile, recent developments in generativ… ▽ More

    Submitted 8 August, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: Paper accepted at ICCV-W 2023. V2 contains additional comparisons with concurrent works

  8. Joint one-sided synthetic unpaired image translation and segmentation for colorectal cancer prevention

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We propose CUT-seg, a joint training where a segmentation model and a… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.08680

  9. Fashion CUT: Unsupervised domain adaptation for visual pattern classification in clothes using synthetic data and pseudo-labels

    Authors: Enric Moreu, Alex Martinelli, Martina Naughton, Philip Kelly, Noel E. O'Connor

    Abstract: Accurate product information is critical for e-commerce stores to allow customers to browse, filter, and search for products. Product data quality is affected by missing or incorrect information resulting in poor customer experience. While machine learning can be used to correct inaccurate or missing information, achieving high performance on fashion image classification tasks requires large amoun… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  10. arXiv:2303.03152  [pdf, other

    physics.soc-ph

    U-Park: A User-Centric Smart Parking Recommendation System for Electric Shared Micromobility Services

    Authors: Sen Yan, Rakesh D. Murthy, Noel E. O'Connor, Mingming Liu

    Abstract: At present, electric shared micromobility services (ESMS) have become an important part of the mobility as a service (MaaS) paradigm for sustainable transportation systems. However, current ESMS suffer from critical design issues such as a lack of integration, transparency and user-centric approaches, resulting in high operational costs and poor service quality. A key operational challenge in ESMS… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: The manuscript has been submitted to the IEEE Transactions on Intelligent Transportation Systems Journal. This manuscript includes 16 pages with 7 figures and 13 tables

  11. arXiv:2301.13019  [pdf, other

    cs.RO cs.LG

    Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Francisco Roldan Sanchez, Kevin McGuinness, Noel E. O'Connor, Stephen J. Redmond

    Abstract: This paper presents our solution for the Real Robot Challenge (RRC) III, a competition featured in the NeurIPS 2022 Competition Track, aimed at addressing dexterous robotic manipulation tasks through learning from pre-collected offline data. Participants were provided with two types of datasets for each task: expert and mixed datasets with varying skill levels. While the simplest offline policy le… ▽ More

    Submitted 21 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  12. arXiv:2301.11734  [pdf, other

    cs.LG cs.RO

    Improving Behavioural Cloning with Positive Unlabeled Learning

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Kevin McGuinness, Noel E. O'Connor, Nico Gürtler, Felix Widmaier, Francisco Roldan Sanchez, Stephen J. Redmond

    Abstract: Learning control policies offline from pre-recorded datasets is a promising avenue for solving challenging real-world problems. However, available datasets are typically of mixed quality, with a limited number of the trajectories that we would consider as positive examples; i.e., high-quality demonstrations. Therefore, we propose a novel iterative learning algorithm for identifying expert trajecto… ▽ More

    Submitted 21 September, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  13. arXiv:2301.09164  [pdf, other

    cs.LG cs.CV

    Unifying Synergies between Self-supervised Learning and Dynamic Computation

    Authors: Tarun Krishna, Ayush K Rai, Alexandru Drimbarean, Eric Arazo, Paul Albert, Alan F Smeaton, Kevin McGuinness, Noel E O'Connor

    Abstract: Computationally expensive training strategies make self-supervised learning (SSL) impractical for resource constrained industrial settings. Techniques like knowledge distillation (KD), dynamic computation (DC), and pruning are often used to obtain a lightweightmodel, which usually involves multiple epochs of fine-tuning (or distilling steps) of a large pre-trained model, making it more computation… ▽ More

    Submitted 9 September, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: Accepted in BMVC 2023

  14. arXiv:2210.05574  [pdf, other

    cs.CV cs.AI cs.LG

    Motion Aware Self-Supervision for Generic Event Boundary Detection

    Authors: Ayush K. Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The task of Generic Event Boundary Detection (GEBD) aims to detect moments in videos that are naturally perceived by humans as generic and taxonomy-free event boundaries. Modeling the dynamically evolving temporal and spatial changes in a video makes GEBD a difficult problem to solve. Existing approaches involve very complex and sophisticated pipelines in terms of architectural design choices, hen… ▽ More

    Submitted 12 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  15. arXiv:2210.04578  [pdf, other

    cs.CV cs.LG

    Is your noise correction noisy? PLS: Robustness to label noise with two stage detection

    Authors: Paul Albert, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness

    Abstract: Designing robust algorithms capable of training accurate neural networks on uncurated datasets from the web has been the subject of much research as it reduces the need for time consuming human labor. The focus of many previous research contributions has been on the detection of different types of label noise; however, this paper proposes to improve the correction accuracy of noisy samples once th… ▽ More

    Submitted 15 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 9 pages 4 figures. Accepted at WACV 2023

  16. arXiv:2209.09714  [pdf, other

    eess.IV cs.CV

    Cardiac Segmentation using Transfer Learning under Respiratory Motion Artifacts

    Authors: Carles Garcia-Cabrera, Eric Arazo, Kathleen M. Curran, Noel E. O'Connor, Kevin McGuinness

    Abstract: Methods that are resilient to artifacts in the cardiac magnetic resonance imaging (MRI) while performing ventricle segmentation, are crucial for ensuring quality in structural and functional analysis of those tissues. While there has been significant efforts on improving the quality of the algorithms, few works have tackled the harm that the artifacts generate in the predictions. In this work, we… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: accepted for the STACOM2022 workshop @ MICCAI2022

  17. arXiv:2207.12065  [pdf, other

    cs.CV

    Dynamic Channel Selection in Self-Supervised Learning

    Authors: Tarun Krishna, Ayush K. Rai, Yasser A. D. Djilali, Alan F. Smeaton, Kevin McGuinness, Noel E. O'Connor

    Abstract: Whilst computer vision models built using self-supervised approaches are now commonplace, some important questions remain. Do self-supervised models learn highly redundant channel features? What if a self-supervised network could dynamically select the important channels and get rid of the unnecessary ones? Currently, convnets pre-trained with self-supervision have obtained comparable performance… ▽ More

    Submitted 16 December, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted in Irish Machine Vision and Image Processing Conference 2022

  18. arXiv:2207.01573  [pdf, other

    cs.CV

    Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets

    Authors: Paul Albert, Eric Arazo, Noel E. O'Connor, Kevin McGuinness

    Abstract: Using search engines for web image retrieval is a tempting alternative to manual curation when creating an image dataset, but their main drawback remains the proportion of incorrect (noisy) samples retrieved. These noisy samples have been evidenced by previous works to be a mixture of in-distribution (ID) samples, assigned to the incorrect category but presenting similar visual semantics to other… ▽ More

    Submitted 18 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  19. arXiv:2204.09343  [pdf

    cs.CV

    Utilizing unsupervised learning to improve sward content prediction and herbage mass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Brian Mac Namee, Deirdre Hennessy, Aisling H. O'Connor, Noel E. O'Connor, Kevin McGuinness

    Abstract: Sward species composition estimation is a tedious one. Herbage must be collected in the field, manually separated into components, dried and weighed to estimate species composition. Deep learning approaches using neural networks have been used in previous work to propose faster and more cost efficient alternatives to this process by estimating the biomass information from a picture of an area of p… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 3 pages. Accepted at the 29th EGF General Meeting 2022

  20. arXiv:2204.08271  [pdf, other

    cs.CV

    Unsupervised domain adaptation and super resolution on drone images for autonomous dry herbage biomass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Jaime Fernandez, Brian Mac Namee, Deirdre Hennessey, Noel E. O'Connor, Kevin McGuinness

    Abstract: Herbage mass yield and composition estimation is an important tool for dairy farmers to ensure an adequate supply of high quality herbage for grazing and subsequently milk production. By accurately estimating herbage mass and composition, targeted nitrogen fertiliser application strategies can be deployed to improve localised regions in a herbage field, effectively reducing the negative impacts of… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 11 pages, 5 figures. Accepted at the Agriculture-Vision CVPR 2022 Workshop

  21. Parking Behaviour Analysis of Shared E-Bike Users Based on a Real-World Dataset -- A Case Study in Dublin, Ireland

    Authors: Sen Yan, Mingming Liu, Noel E. O'Connor

    Abstract: In recent years, an increasing number of shared E-bikes have been rolling out rapidly in our cities. It therefore becomes important to understand new behaviour patterns of the cyclists in using these E-bikes as a foundation for the novel design of shared micromobility services as part of the realisation for next generation intelligent transportation systems. In this paper, we deeply investigate th… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: The manuscript has been accepted by the IEEE VTC 2022-Spring Conference

  22. arXiv:2202.08680  [pdf, other

    eess.IV cs.CV

    Synthetic data for unsupervised polyp segmentation

    Authors: Enric Moreu, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We use zero annotations from medical professionals in our pipeline. Ou… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  23. arXiv:2202.08670  [pdf, other

    cs.CV cs.AI

    Domain Randomization for Object Counting

    Authors: Enric Moreu, Kevin McGuinness, Diego Ortego, Noel E. O'Connor

    Abstract: Recently, the use of synthetic datasets based on game engines has been shown to improve the performance of several tasks in computer vision. However, these datasets are typically only appropriate for the specific domains depicted in computer games, such as urban scenes involving vehicles and people. In this paper, we present an approach to generate synthetic datasets for object counting for any do… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  24. arXiv:2201.10243  [pdf, other

    cs.CV cs.LG

    BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment

    Authors: Luis Lebron, Yvette Graham, Kevin McGuinness, Konstantinos Kouramas, Noel E. O'Connor

    Abstract: Evaluating video captioning systems is a challenging task as there are multiple factors to consider; for instance: the fluency of the caption, multiple actions happening in a single scene, and the human bias of what is considered important. Most metrics try to measure how similar the system generated captions are to a single or a set of human-annotated captions. This paper presents a new method ba… ▽ More

    Submitted 16 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: In press in Language Resources and Evaluation Conference(LREC) 2022

  25. arXiv:2111.09056  [pdf, other

    cs.CV cs.CY cs.MM

    Improving Person Re-Identification with Temporal Constraints

    Authors: Julia Dietlmeier, Feiyan Hu, Frances Ryan, Noel E. O'Connor, Kevin McGuinness

    Abstract: In this paper we introduce an image-based person re-identification dataset collected across five non-overlap** camera views in the large and busy airport in Dublin, Ireland. Unlike all publicly available image-based datasets, our dataset contains timestamp information in addition to frame number, and camera and person IDs. Also our dataset has been fully anonymized to comply with modern data pri… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 10 pages, RWS @ WACV2022

  26. arXiv:2110.14283  [pdf, other

    cs.CV

    How Important is Importance Sampling for Deep Budgeted Training?

    Authors: Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Long iterative training processes for Deep Neural Networks (DNNs) are commonly required to achieve state-of-the-art performance in many computer vision tasks. Importance sampling approaches might play a key role in budgeted training regimes, i.e. when limiting the number of training iterations. These approaches aim at dynamically estimating the importance of each sample to focus on the most releva… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: British Machine Vision Conference (BMVC) 2021, oral presentation

  27. arXiv:2106.10090  [pdf, other

    cs.CV cs.AI

    Discerning Generic Event Boundaries in Long-Form Wild Videos

    Authors: Ayush K Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F Smeaton, Noel E O'Connor

    Abstract: Detecting generic, taxonomy-free event boundaries invideos represents a major stride forward towards holisticvideo understanding. In this paper we present a technique forgeneric event boundary detection based on a two stream in-flated 3D convolutions architecture, which can learn spatio-temporal features from videos. Our work is inspired from theGeneric Event Boundary Detection Challenge (part of… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: Technical Report for Generic Event Boundary Challenge - LOVEU Challenge (CVPR 2021)

  28. arXiv:2105.09460  [pdf, other

    cs.NI eess.SY

    Optimal Distributed Bandwidth Allocation in NB-IoT Networks

    Authors: Hongde Wu, Zhengyong Chen, Noel E. O'Connor, Mingming Liu

    Abstract: In this paper, we investigate a key problem of Narrowband-Internet of Things (NB-IoT) in the context of 5G with Mobile Edge Computing (MEC). We address the challenge that IoT devices may have different priorities when demanding bandwidth for data transmission in specific applications and services. Due to the scarcity of bandwidth in an MEC enabled IoT network, our objective is to optimize bandwidt… ▽ More

    Submitted 5 March, 2021; originally announced May 2021.

    Comments: The paper has been accepted by the 6th ACM/IEEE Conference on Internet of Things Design and Implementation

  29. arXiv:2104.10644  [pdf, other

    cs.LG eess.SY

    A Comparative Study of Using Spatial-Temporal Graph Convolutional Networks for Predicting Availability in Bike Sharing Schemes

    Authors: Zhengyong Chen, Hongde Wu, Noel E. O'Connor, Mingming Liu

    Abstract: Accurately forecasting transportation demand is crucial for efficient urban traffic guidance, control and management. One solution to enhance the level of prediction accuracy is to leverage graph convolutional networks (GCN), a neural network based modelling approach with the ability to process data contained in graph based structures. As a powerful extension of GCN, a spatial-temporal graph convo… ▽ More

    Submitted 6 July, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: This manuscript has been accepted at the IEEE ITSC 2021

  30. arXiv:2104.07614  [pdf, other

    eess.SY

    An ADMM-based Optimal Transmission Frequency Management System for IoT Edge Intelligence

    Authors: Hongde Wu, Noel E. O'Connor, Jennifer Bruton, Mingming Liu

    Abstract: In this paper, we investigate a key problem of Internet of Things (IoT) applications in practice. Our research objective is to optimize the transmission frequencies for a group of IoT edge devices under practical constraints. Our key assumption is that different IoT devices may have different priority levels when transmitting data in a resource-constrained environment and that those priority level… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: The paper has been accepted at the 7th IEEE World Forum on Internet of Things (IEEE WF-IoT)

  31. arXiv:2103.00548  [pdf, other

    eess.SY

    An Intelligent Multi-Speed Advisory System using Improved Whale Optimisation Algorithm

    Authors: Beiran Chen, Mingming Liu, Yi Zhang, Zhengyong Chen, Yingqi Gu, Noel E. O'Connor

    Abstract: An intelligent speed advisory system can be used to recommend speed for vehicles travelling in a given road network in cities. In this paper, we extend our previous work where a distributed speed advisory system has been devised to recommend an optimal consensus speed for a fleet of Internal Combustion Engine Vehicles (ICEVs) in a highway scenario. In particular, we propose a novel optimisation fr… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: This paper has been accepted by IEEE VTC2021-Spring for presentation

  32. arXiv:2102.04993  [pdf, other

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding

    Authors: Marc Górriz, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to b… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, 2020

  33. arXiv:2101.06451  [pdf, other

    eess.SY

    MPC-CSAS: Multi-Party Computation for Real-time Privacy-preserving Speed Advisory Systems

    Authors: Mingming Liu, Long Cheng, Yingqi Gu, Ying Wang, Qingzhi Liu, Noel E. O'Connor

    Abstract: As a part of Advanced Driver Assistance Systems (ADASs), Consensus-based Speed Advisory Systems (CSAS) have been proposed to recommend a common speed to a group of vehicles for specific application purposes, such as emission control and energy management. With Vehicle-to-Vehicle (V2V), Vehicle-to-Infrastructure (V2I) technologies and advanced control theories in place, state-of-the-art CSAS can be… ▽ More

    Submitted 16 January, 2021; originally announced January 2021.

    Comments: This manuscript has been accepted by the IEEE Transactions on Intelligent Transportation Systems

  34. arXiv:2012.15641  [pdf, other

    cs.MM cs.AI cs.CV

    Investigating Memorability of Dynamic Media

    Authors: Phuc H. Le-Khac, Ayush K. Rai, Graham Healy, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The Predicting Media Memorability task in MediaEval'20 has some challenging aspects compared to previous years. In this paper we identify the high-dynamic content in videos and dataset of limited size as the core challenges for the task, we propose directions to overcome some of these challenges and we present our initial result in these directions.

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 3 pages, 1 figure. 1 table

    Journal ref: MediaEval Multimedia Benchmark Workshop Working Notes, 14-15 December 2020

  35. arXiv:2012.04462  [pdf, other

    cs.CV

    Multi-Objective Interpolation Training for Robustness to Label Noise

    Authors: Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Deep neural networks trained with standard cross-entropy loss memorize noisy labels, which degrades their performance. Most research to mitigate this memorization proposes new robust classification loss functions. Conversely, we propose a Multi-Objective Interpolation Training (MOIT) approach that jointly exploits contrastive learning and classification to mutually help each other and boost perfor… ▽ More

    Submitted 18 March, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted to CVPR 2021. 10 pages, 1 figure, and 9 tables

  36. arXiv:2011.07616  [pdf, other

    cs.SD cs.LG eess.AS

    Unsupervised Contrastive Learning of Sound Event Representations

    Authors: Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra

    Abstract: Self-supervised representation learning can mitigate the limitations in recognition tasks with few manually labeled data but abundant unlabeled data---a common scenario in sound event research. In this work, we explore unsupervised contrastive learning as a way to learn sound event representations. To this end, we propose to use the pretext task of contrasting differently augmented views of sound… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: A 4-page version is submitted to ICASSP 2021

  37. arXiv:2010.06307  [pdf, other

    cs.CV cs.AI cs.LG

    How important are faces for person re-identification?

    Authors: Julia Dietlmeier, Joseph Antony, Kevin McGuinness, Noel E. O'Connor

    Abstract: This paper investigates the dependence of existing state-of-the-art person re-identification models on the presence and visibility of human faces. We apply a face detection and blurring algorithm to create anonymized versions of several popular person re-identification datasets including Market1501, DukeMTMC-reID, CUHK03, Viper, and Airport. Using a cross-section of existing state-of-the-art model… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: 25th International Conference on Pattern Recognition (ICPR2020), Milan, Italy, 10-15 January 2021

  38. arXiv:2008.00106  [pdf, other

    cs.CV

    Utilising Visual Attention Cues for Vehicle Detection and Tracking

    Authors: Feiyan Hu, Venkatesh G M, Noel E. O'Connor, Alan F. Smeaton, Suzanne Little

    Abstract: Advanced Driver-Assistance Systems (ADAS) have been attracting attention from many researchers. Vision-based sensors are the closest way to emulate human driver visual behavior while driving. In this paper, we explore possible ways to use visual attention (saliency) for object detection and tracking. We investigate: 1) How a visual attention map such as a \emph{subjectness} attention or saliency m… ▽ More

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: Accepted in ICPR2020

  39. arXiv:2007.11866  [pdf, other

    cs.CV

    Reliable Label Bootstrap** for Semi-Supervised Learning

    Authors: Paul Albert, Diego Ortego, Eric Arazo, Noel E. O'Connor, Kevin McGuinness

    Abstract: Reducing the amount of labels required to train convolutional neural networks without performance degradation is key to effectively reduce human annotation efforts. We propose Reliable Label Bootstrap** (ReLaB), an unsupervised preprossessing algorithm which improves the performance of semi-supervised algorithms in extremely low supervision settings. Given a dataset with few labeled samples, we… ▽ More

    Submitted 25 February, 2021; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 10 pages, 3 figures

  40. arXiv:2006.15349  [pdf, other

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Chroma Intra Prediction with attention-based CNN architectures

    Authors: Marc Górriz, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Neural networks can be used in video coding to improve chroma intra-prediction. In particular, usage of fully-connected networks has enabled better cross-component prediction with respect to traditional linear models. Nonetheless, state-of-the-art architectures tend to disregard the location of individual reference samples in the prediction process. This paper proposes a new neural network archite… ▽ More

    Submitted 27 June, 2020; originally announced June 2020.

    Comments: 27th IEEE International Conference on Image Processing, 25-28 Oct 2020, Abu Dhabi, United Arab Emirates

  41. arXiv:2006.06392  [pdf, other

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Interpreting CNN for Low Complexity Learned Sub-pixel Motion Compensation in Video Coding

    Authors: Luka Murn, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Deep learning has shown great potential in image and video compression tasks. However, it brings bit savings at the cost of significant increases in coding complexity, which limits its potential for implementation within practical applications. In this paper, a novel neural network-based tool is presented which improves the interpolation of reference samples needed for fractional precision motion… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 27th IEEE International Conference on Image Processing, 25-28 Oct 2020, Abu Dhabi, United Arab Emirates

    Journal ref: 2020 IEEE International Conference on Image Processing (ICIP), 2020, pp. 798-802

  42. arXiv:2005.00430  [pdf, other

    cs.CV

    Investigating Class-level Difficulty Factors in Multi-label Classification Problems

    Authors: Mark Marsden, Kevin McGuinness, Joseph Antony, Haolin Wei, Milan Redzic, Jian Tang, Zhilan Hu, Alan Smeaton, Noel E O'Connor

    Abstract: This work investigates the use of class-level difficulty factors in multi-label classification problems for the first time. Four class-level difficulty factors are proposed: frequency, visual variation, semantic abstraction, and class co-occurrence. Once computed for a given multi-label classification dataset, these difficulty factors are shown to have several potential applications including the… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: Published in ICME 2020

  43. arXiv:1912.08741  [pdf, other

    cs.CV

    Towards Robust Learning with Different Label Noise Distributions

    Authors: Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Noisy labels are an unavoidable consequence of labeling processes and detecting them is an important step towards preventing performance degradations in Convolutional Neural Networks. Discarding noisy labels avoids a harmful memorization, while the associated image content can still be exploited in a semi-supervised learning (SSL) setup. Clean samples are usually identified using the small loss tr… ▽ More

    Submitted 27 July, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

  44. arXiv:1908.09873  [pdf, other

    eess.IV cs.CV cs.LG

    End-to-End Conditional GAN-based Architectures for Image Colourisation

    Authors: Marc Górriz, Marta Mrak, Alan F. Smeaton, Noel E. O'Connor

    Abstract: In this work recent advances in conditional adversarial networks are investigated to develop an end-to-end architecture based on Convolutional Neural Networks (CNNs) to directly map realistic colours to an input greyscale image. Observing that existing colourisation methods sometimes exhibit a lack of colourfulness, this paper proposes a method to improve colourisation results. In particular, the… ▽ More

    Submitted 5 September, 2019; v1 submitted 26 August, 2019; originally announced August 2019.

    Comments: IEEE 21st International Workshop on Multimedia Signal Processing, 27-29 Sept 2019, Kuala Lumpur, Malaysia

  45. arXiv:1908.08873  [pdf, other

    eess.IV cs.CV cs.LG

    Predicting knee osteoarthritis severity: comparative modeling based on patient's data and plain X-ray images

    Authors: Jaynal Abedin, Joseph Antony, Kevin McGuinness, Kieran Moran, Noel E O'Connor, Dietrich Rebholz-Schuhmann, John Newell

    Abstract: Knee osteoarthritis (KOA) is a disease that impairs knee function and causes pain. A radiologist reviews knee X-ray images and grades the severity level of the impairments according to the Kellgren and Lawrence grading scheme; a five-point ordinal scale (0--4). In this study, we used Elastic Net (EN) and Random Forests (RF) to build predictive models using patient assessment data (i.e. signs and s… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: Published in Nature Scientific Reports, 2019

    Journal ref: Scientific reports 9, no. 1 (2019): 5761

  46. arXiv:1908.08856  [pdf, other

    eess.IV cs.CV cs.LG

    Assessing Knee OA Severity with CNN attention-based end-to-end architectures

    Authors: Marc Górriz, Joseph Antony, Kevin McGuinness, Xavier Giró-i-Nieto, Noel E. O'Connor

    Abstract: This work proposes a novel end-to-end convolutional neural network (CNN) architecture to automatically quantify the severity of knee osteoarthritis (OA) using X-Ray images, which incorporates trainable attention modules acting as unsupervised fine-grained detectors of the region of interest (ROI). The proposed attention modules can be applied at different levels and scales across any CNN pipeline… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: Proceedings of the 2nd International Conference on Medical Imaging with Deep Learning

    Journal ref: Proceedings of The 2nd International Conference on Medical Imaging with Deep Learning, PMLR 102:197-214, 2019

  47. arXiv:1908.02983  [pdf, other

    cs.CV

    Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning

    Authors: Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Semi-supervised learning, i.e. jointly learning from labeled and unlabeled samples, is an active research topic due to its key role on relaxing human supervision. In the context of image classification, recent advances to learn from unlabeled samples are mainly focused on consistency regularization methods that encourage invariant predictions for different perturbations of unlabeled samples. We, c… ▽ More

    Submitted 29 June, 2020; v1 submitted 8 August, 2019; originally announced August 2019.

  48. arXiv:1907.01869  [pdf, other

    cs.CV cs.LG

    Simple vs complex temporal recurrences for video saliency prediction

    Authors: Panagiotis Linardos, Eva Mohedano, Juan Jose Nieto, Noel E. O'Connor, Xavier Giro-i-Nieto, Kevin McGuinness

    Abstract: This paper investigates modifying an existing neural network architecture for static saliency prediction using two types of recurrences that integrate information from the temporal domain. The first modification is the addition of a ConvLSTM within the architecture, while the second is a conceptually simple exponential moving average of an internal convolutional state. We use weights pre-trained o… ▽ More

    Submitted 16 July, 2019; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Accepted at BMVC 2019

  49. arXiv:1904.11256  [pdf, other

    cs.CV

    On guiding video object segmentation

    Authors: Diego Ortego, Kevin McGuinness, Juan C. SanMiguel, Eric Arazo, José M. Martínez, Noel E. O'Connor

    Abstract: This paper presents a novel approach for segmenting moving objects in unconstrained environments using guided convolutional neural networks. This guiding process relies on foreground masks from independent algorithms (i.e. state-of-the-art algorithms) to implement an attention mechanism that incorporates the spatial location of foreground and background to compute their separated representations.… ▽ More

    Submitted 25 April, 2019; originally announced April 2019.

  50. arXiv:1904.11238  [pdf, other

    cs.CV

    Unsupervised Label Noise Modeling and Loss Correction

    Authors: Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Despite being robust to small amounts of label noise, convolutional neural networks trained with stochastic gradient methods have been shown to easily fit random labels. When there are a mixture of correct and mislabelled targets, networks tend to fit the former before the latter. This suggests using a suitable two-component mixture model as an unsupervised generative model of sample loss values d… ▽ More

    Submitted 5 June, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

    Comments: Accepted to ICML 2019