
Showing 1–50 of 58 results for author: O'Connor, N E

Searching in archive cs.
  1. arXiv:2404.06941  [pdf, other]

    eess.IV cs.CV

    Accelerating Cardiac MRI Reconstruction with CMRatt: An Attention-Driven Approach

    Authors: Anam Hashmi, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor

    Abstract: Cine cardiac magnetic resonance (CMR) imaging is recognised as the benchmark modality for the comprehensive assessment of cardiac function. Nevertheless, the acquisition process of cine CMR is considered as an impediment due to its prolonged scanning time. One commonly used strategy to expedite the acquisition process is through k-space undersampling, though it comes with a drawback of introducing…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: This paper has been submitted for the 32nd European Signal Processing Conference EUSIPCO 2024 in Lyon

  2. arXiv:2404.06362  [pdf, other]

    cs.CV cs.AI

    Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation

    Authors: Sidra Aleem, Fangyijie Wang, Mayug Maniparambil, Eric Arazo, Julia Dietlmeier, Guenole Silvestre, Kathleen Curran, Noel E. O'Connor, Suzanne Little

    Abstract: The Segment Anything Model (SAM) and CLIP are remarkable vision foundation models (VFMs). SAM, a prompt driven segmentation model, excels in segmentation tasks across diverse domains, while CLIP is renowned for its zero shot recognition capabilities. However, their unified potential has not yet been explored in medical image segmentation. To adapt SAM to medical imaging, existing methods primarily…

    Submitted 30 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  3. arXiv:2401.05224  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Do Vision and Language Encoders Represent the World Similarly?

    Authors: Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam, Noel E. O'Connor

    Abstract: Aligned text-image encoders such as CLIP have become the de facto model for vision-language tasks. Furthermore, modality-specific encoders achieve impressive performances in their respective domains. This raises a central question: does an alignment exist between uni-modal vision and language encoders since they fundamentally represent the same physical world? Analyzing the latent spaces structure…

    Submitted 22 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Accepted CVPR 2024

  4. arXiv:2311.16514  [pdf, other]

    cs.CV cs.AI cs.LG

    Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach

    Authors: Ayush K. Rai, Tarun Krishna, Feiyan Hu, Alexandru Drimbarean, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: Video Anomaly Detection (VAD) is an open-set recognition task, which is usually formulated as a one-class classification (OCC) problem, where training data is comprised of videos with normal instances while test data contains both normal and anomalous instances. Recent works have investigated the creation of pseudo-anomalies (PAs) using only the normal data and making strong assumptions about real…

    Submitted 7 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted in CVPRW 2024 - VAND Workshop

  5. arXiv:2307.12033  [pdf, other]

    cs.CV

    Self-Supervised and Semi-Supervised Polyp Segmentation using Synthetic Data

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Early detection of colorectal polyps is of utmost importance for their treatment and for colorectal cancer prevention. Computer vision techniques have the potential to aid professionals in the diagnosis stage, where colonoscopies are manually carried out to examine the entirety of the patient's colon. The main challenge in medical imaging is the lack of data, and a further challenge specific to po…

    Submitted 22 July, 2023; originally announced July 2023.

  6. arXiv:2307.11661  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts

    Authors: Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O'Connor

    Abstract: Contrastive pretrained large Vision-Language Models (VLMs) like CLIP have revolutionized visual representation learning by providing good performance on downstream datasets. VLMs are 0-shot adapted to a downstream dataset by designing prompts that are relevant to the dataset. Such prompt engineering makes use of domain expertise and a validation dataset. Meanwhile, recent developments in generativ…

    Submitted 8 August, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: Paper accepted at ICCV-W 2023. V2 contains additional comparisons with concurrent works

  7. Joint one-sided synthetic unpaired image translation and segmentation for colorectal cancer prevention

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due to privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We propose CUT-seg, a joint training where a segmentation model and a…

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.08680

  8. Fashion CUT: Unsupervised domain adaptation for visual pattern classification in clothes using synthetic data and pseudo-labels

    Authors: Enric Moreu, Alex Martinelli, Martina Naughton, Philip Kelly, Noel E. O'Connor

    Abstract: Accurate product information is critical for e-commerce stores to allow customers to browse, filter, and search for products. Product data quality is affected by missing or incorrect information resulting in poor customer experience. While machine learning can be used to correct inaccurate or missing information, achieving high performance on fashion image classification tasks requires large amoun…

    Submitted 9 May, 2023; originally announced May 2023.

  9. arXiv:2301.13019  [pdf, other]

    cs.RO cs.LG

    Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Francisco Roldan Sanchez, Kevin McGuinness, Noel E. O'Connor, Stephen J. Redmond

    Abstract: This paper presents our solution for the Real Robot Challenge (RRC) III, a competition featured in the NeurIPS 2022 Competition Track, aimed at addressing dexterous robotic manipulation tasks through learning from pre-collected offline data. Participants were provided with two types of datasets for each task: expert and mixed datasets with varying skill levels. While the simplest offline policy le…

    Submitted 21 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  10. arXiv:2301.11734  [pdf, other]

    cs.LG cs.RO

    Improving Behavioural Cloning with Positive Unlabeled Learning

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Kevin McGuinness, Noel E. O'Connor, Nico Gürtler, Felix Widmaier, Francisco Roldan Sanchez, Stephen J. Redmond

    Abstract: Learning control policies offline from pre-recorded datasets is a promising avenue for solving challenging real-world problems. However, available datasets are typically of mixed quality, with a limited number of the trajectories that we would consider as positive examples; i.e., high-quality demonstrations. Therefore, we propose a novel iterative learning algorithm for identifying expert trajecto…

    Submitted 21 September, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  11. arXiv:2301.09164  [pdf, other]

    cs.LG cs.CV

    Unifying Synergies between Self-supervised Learning and Dynamic Computation

    Authors: Tarun Krishna, Ayush K Rai, Alexandru Drimbarean, Eric Arazo, Paul Albert, Alan F Smeaton, Kevin McGuinness, Noel E O'Connor

    Abstract: Computationally expensive training strategies make self-supervised learning (SSL) impractical for resource constrained industrial settings. Techniques like knowledge distillation (KD), dynamic computation (DC), and pruning are often used to obtain a lightweight model, which usually involves multiple epochs of fine-tuning (or distilling steps) of a large pre-trained model, making it more computation…

    Submitted 9 September, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: Accepted in BMVC 2023

  12. arXiv:2210.05574  [pdf, other]

    cs.CV cs.AI cs.LG

    Motion Aware Self-Supervision for Generic Event Boundary Detection

    Authors: Ayush K. Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The task of Generic Event Boundary Detection (GEBD) aims to detect moments in videos that are naturally perceived by humans as generic and taxonomy-free event boundaries. Modeling the dynamically evolving temporal and spatial changes in a video makes GEBD a difficult problem to solve. Existing approaches involve very complex and sophisticated pipelines in terms of architectural design choices, hen…

    Submitted 12 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  13. arXiv:2210.04578  [pdf, other]

    cs.CV cs.LG

    Is your noise correction noisy? PLS: Robustness to label noise with two stage detection

    Authors: Paul Albert, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness

    Abstract: Designing robust algorithms capable of training accurate neural networks on uncurated datasets from the web has been the subject of much research as it reduces the need for time consuming human labor. The focus of many previous research contributions has been on the detection of different types of label noise; however, this paper proposes to improve the correction accuracy of noisy samples once th…

    Submitted 15 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 9 pages 4 figures. Accepted at WACV 2023

  14. arXiv:2209.09714  [pdf, other]

    eess.IV cs.CV

    Cardiac Segmentation using Transfer Learning under Respiratory Motion Artifacts

    Authors: Carles Garcia-Cabrera, Eric Arazo, Kathleen M. Curran, Noel E. O'Connor, Kevin McGuinness

    Abstract: Methods that are resilient to artifacts in cardiac magnetic resonance imaging (MRI) while performing ventricle segmentation are crucial for ensuring quality in structural and functional analysis of those tissues. While there have been significant efforts on improving the quality of the algorithms, few works have tackled the harm that the artifacts generate in the predictions. In this work, we…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: accepted for the STACOM2022 workshop @ MICCAI2022

  15. arXiv:2207.12065  [pdf, other]

    cs.CV

    Dynamic Channel Selection in Self-Supervised Learning

    Authors: Tarun Krishna, Ayush K. Rai, Yasser A. D. Djilali, Alan F. Smeaton, Kevin McGuinness, Noel E. O'Connor

    Abstract: Whilst computer vision models built using self-supervised approaches are now commonplace, some important questions remain. Do self-supervised models learn highly redundant channel features? What if a self-supervised network could dynamically select the important channels and get rid of the unnecessary ones? Currently, convnets pre-trained with self-supervision have obtained comparable performance…

    Submitted 16 December, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted in Irish Machine Vision and Image Processing Conference 2022

  16. arXiv:2207.01573  [pdf, other]

    cs.CV

    Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets

    Authors: Paul Albert, Eric Arazo, Noel E. O'Connor, Kevin McGuinness

    Abstract: Using search engines for web image retrieval is a tempting alternative to manual curation when creating an image dataset, but their main drawback remains the proportion of incorrect (noisy) samples retrieved. These noisy samples have been evidenced by previous works to be a mixture of in-distribution (ID) samples, assigned to the incorrect category but presenting similar visual semantics to other…

    Submitted 18 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  17. arXiv:2204.09343  [pdf]

    cs.CV

    Utilizing unsupervised learning to improve sward content prediction and herbage mass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Brian Mac Namee, Deirdre Hennessy, Aisling H. O'Connor, Noel E. O'Connor, Kevin McGuinness

    Abstract: Sward species composition estimation is a tedious task. Herbage must be collected in the field, manually separated into components, dried and weighed to estimate species composition. Deep learning approaches using neural networks have been used in previous work to propose faster and more cost efficient alternatives to this process by estimating the biomass information from a picture of an area of p…

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 3 pages. Accepted at the 29th EGF General Meeting 2022

  18. arXiv:2204.08271  [pdf, other]

    cs.CV

    Unsupervised domain adaptation and super resolution on drone images for autonomous dry herbage biomass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Jaime Fernandez, Brian Mac Namee, Deirdre Hennessey, Noel E. O'Connor, Kevin McGuinness

    Abstract: Herbage mass yield and composition estimation is an important tool for dairy farmers to ensure an adequate supply of high quality herbage for grazing and subsequently milk production. By accurately estimating herbage mass and composition, targeted nitrogen fertiliser application strategies can be deployed to improve localised regions in a herbage field, effectively reducing the negative impacts of…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 11 pages, 5 figures. Accepted at the Agriculture-Vision CVPR 2022 Workshop

  19. arXiv:2202.08680  [pdf, other]

    eess.IV cs.CV

    Synthetic data for unsupervised polyp segmentation

    Authors: Enric Moreu, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due to privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We use zero annotations from medical professionals in our pipeline. Ou…

    Submitted 17 February, 2022; originally announced February 2022.

  20. arXiv:2202.08670  [pdf, other]

    cs.CV cs.AI

    Domain Randomization for Object Counting

    Authors: Enric Moreu, Kevin McGuinness, Diego Ortego, Noel E. O'Connor

    Abstract: Recently, the use of synthetic datasets based on game engines has been shown to improve the performance of several tasks in computer vision. However, these datasets are typically only appropriate for the specific domains depicted in computer games, such as urban scenes involving vehicles and people. In this paper, we present an approach to generate synthetic datasets for object counting for any do…

    Submitted 17 February, 2022; originally announced February 2022.

  21. arXiv:2201.10243  [pdf, other]

    cs.CV cs.LG

    BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment

    Authors: Luis Lebron, Yvette Graham, Kevin McGuinness, Konstantinos Kouramas, Noel E. O'Connor

    Abstract: Evaluating video captioning systems is a challenging task as there are multiple factors to consider; for instance: the fluency of the caption, multiple actions happening in a single scene, and the human bias of what is considered important. Most metrics try to measure how similar the system generated captions are to a single or a set of human-annotated captions. This paper presents a new method ba…

    Submitted 16 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: In press in Language Resources and Evaluation Conference (LREC) 2022

  22. arXiv:2111.09056  [pdf, other]

    cs.CV cs.CY cs.MM

    Improving Person Re-Identification with Temporal Constraints

    Authors: Julia Dietlmeier, Feiyan Hu, Frances Ryan, Noel E. O'Connor, Kevin McGuinness

    Abstract: In this paper we introduce an image-based person re-identification dataset collected across five non-overlapping camera views in the large and busy airport in Dublin, Ireland. Unlike all publicly available image-based datasets, our dataset contains timestamp information in addition to frame number, and camera and person IDs. Also our dataset has been fully anonymized to comply with modern data pri…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 10 pages, RWS @ WACV2022

  23. arXiv:2110.14283  [pdf, other]

    cs.CV

    How Important is Importance Sampling for Deep Budgeted Training?

    Authors: Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Long iterative training processes for Deep Neural Networks (DNNs) are commonly required to achieve state-of-the-art performance in many computer vision tasks. Importance sampling approaches might play a key role in budgeted training regimes, i.e. when limiting the number of training iterations. These approaches aim at dynamically estimating the importance of each sample to focus on the most releva…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: British Machine Vision Conference (BMVC) 2021, oral presentation

  24. arXiv:2106.10090  [pdf, other]

    cs.CV cs.AI

    Discerning Generic Event Boundaries in Long-Form Wild Videos

    Authors: Ayush K Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F Smeaton, Noel E O'Connor

    Abstract: Detecting generic, taxonomy-free event boundaries in videos represents a major stride forward towards holistic video understanding. In this paper we present a technique for generic event boundary detection based on a two stream inflated 3D convolutions architecture, which can learn spatio-temporal features from videos. Our work is inspired from the Generic Event Boundary Detection Challenge (part of…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: Technical Report for Generic Event Boundary Challenge - LOVEU Challenge (CVPR 2021)

  25. arXiv:2105.09460  [pdf, other]

    cs.NI eess.SY

    Optimal Distributed Bandwidth Allocation in NB-IoT Networks

    Authors: Hongde Wu, Zhengyong Chen, Noel E. O'Connor, Mingming Liu

    Abstract: In this paper, we investigate a key problem of Narrowband-Internet of Things (NB-IoT) in the context of 5G with Mobile Edge Computing (MEC). We address the challenge that IoT devices may have different priorities when demanding bandwidth for data transmission in specific applications and services. Due to the scarcity of bandwidth in an MEC enabled IoT network, our objective is to optimize bandwidt…

    Submitted 5 March, 2021; originally announced May 2021.

    Comments: The paper has been accepted by the 6th ACM/IEEE Conference on Internet of Things Design and Implementation

  26. arXiv:2104.10644  [pdf, other]

    cs.LG eess.SY

    A Comparative Study of Using Spatial-Temporal Graph Convolutional Networks for Predicting Availability in Bike Sharing Schemes

    Authors: Zhengyong Chen, Hongde Wu, Noel E. O'Connor, Mingming Liu

    Abstract: Accurately forecasting transportation demand is crucial for efficient urban traffic guidance, control and management. One solution to enhance the level of prediction accuracy is to leverage graph convolutional networks (GCN), a neural network based modelling approach with the ability to process data contained in graph based structures. As a powerful extension of GCN, a spatial-temporal graph convo…

    Submitted 6 July, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: This manuscript has been accepted at the IEEE ITSC 2021

  27. arXiv:2102.04993  [pdf, other]

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding

    Authors: Marc Górriz, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to b…

    Submitted 9 February, 2021; originally announced February 2021.

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, 2020

  28. arXiv:2012.15641  [pdf, other]

    cs.MM cs.AI cs.CV

    Investigating Memorability of Dynamic Media

    Authors: Phuc H. Le-Khac, Ayush K. Rai, Graham Healy, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The Predicting Media Memorability task in MediaEval'20 has some challenging aspects compared to previous years. In this paper we identify the high-dynamic content in videos and dataset of limited size as the core challenges for the task, we propose directions to overcome some of these challenges and we present our initial result in these directions.

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 3 pages, 1 figure, 1 table

    Journal ref: MediaEval Multimedia Benchmark Workshop Working Notes, 14-15 December 2020

  29. arXiv:2012.04462  [pdf, other]

    cs.CV

    Multi-Objective Interpolation Training for Robustness to Label Noise

    Authors: Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Deep neural networks trained with standard cross-entropy loss memorize noisy labels, which degrades their performance. Most research to mitigate this memorization proposes new robust classification loss functions. Conversely, we propose a Multi-Objective Interpolation Training (MOIT) approach that jointly exploits contrastive learning and classification to mutually help each other and boost perfor…

    Submitted 18 March, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted to CVPR 2021. 10 pages, 1 figure, and 9 tables

  30. arXiv:2011.07616  [pdf, other]

    cs.SD cs.LG eess.AS

    Unsupervised Contrastive Learning of Sound Event Representations

    Authors: Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra

    Abstract: Self-supervised representation learning can mitigate the limitations in recognition tasks with few manually labeled data but abundant unlabeled data---a common scenario in sound event research. In this work, we explore unsupervised contrastive learning as a way to learn sound event representations. To this end, we propose to use the pretext task of contrasting differently augmented views of sound…

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: A 4-page version is submitted to ICASSP 2021

  31. arXiv:2010.06307  [pdf, other]

    cs.CV cs.AI cs.LG

    How important are faces for person re-identification?

    Authors: Julia Dietlmeier, Joseph Antony, Kevin McGuinness, Noel E. O'Connor

    Abstract: This paper investigates the dependence of existing state-of-the-art person re-identification models on the presence and visibility of human faces. We apply a face detection and blurring algorithm to create anonymized versions of several popular person re-identification datasets including Market1501, DukeMTMC-reID, CUHK03, Viper, and Airport. Using a cross-section of existing state-of-the-art model…

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: 25th International Conference on Pattern Recognition (ICPR2020), Milan, Italy, 10-15 January 2021

  32. arXiv:2008.00106  [pdf, other]

    cs.CV

    Utilising Visual Attention Cues for Vehicle Detection and Tracking

    Authors: Feiyan Hu, Venkatesh G M, Noel E. O'Connor, Alan F. Smeaton, Suzanne Little

    Abstract: Advanced Driver-Assistance Systems (ADAS) have been attracting attention from many researchers. Vision-based sensors are the closest way to emulate human driver visual behavior while driving. In this paper, we explore possible ways to use visual attention (saliency) for object detection and tracking. We investigate: 1) How a visual attention map such as a subjectness attention or saliency m…

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: Accepted in ICPR2020

  33. arXiv:2007.11866  [pdf, other]

    cs.CV

    Reliable Label Bootstrapping for Semi-Supervised Learning

    Authors: Paul Albert, Diego Ortego, Eric Arazo, Noel E. O'Connor, Kevin McGuinness

    Abstract: Reducing the amount of labels required to train convolutional neural networks without performance degradation is key to effectively reduce human annotation efforts. We propose Reliable Label Bootstrapping (ReLaB), an unsupervised preprocessing algorithm which improves the performance of semi-supervised algorithms in extremely low supervision settings. Given a dataset with few labeled samples, we…

    Submitted 25 February, 2021; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 10 pages, 3 figures

  34. arXiv:2006.15349  [pdf, other]

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Chroma Intra Prediction with attention-based CNN architectures

    Authors: Marc Górriz, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Neural networks can be used in video coding to improve chroma intra-prediction. In particular, usage of fully-connected networks has enabled better cross-component prediction with respect to traditional linear models. Nonetheless, state-of-the-art architectures tend to disregard the location of individual reference samples in the prediction process. This paper proposes a new neural network archite…

    Submitted 27 June, 2020; originally announced June 2020.

    Comments: 27th IEEE International Conference on Image Processing, 25-28 Oct 2020, Abu Dhabi, United Arab Emirates

  35. arXiv:2006.06392  [pdf, other]

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Interpreting CNN for Low Complexity Learned Sub-pixel Motion Compensation in Video Coding

    Authors: Luka Murn, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Deep learning has shown great potential in image and video compression tasks. However, it brings bit savings at the cost of significant increases in coding complexity, which limits its potential for implementation within practical applications. In this paper, a novel neural network-based tool is presented which improves the interpolation of reference samples needed for fractional precision motion…

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 27th IEEE International Conference on Image Processing, 25-28 Oct 2020, Abu Dhabi, United Arab Emirates

    Journal ref: 2020 IEEE International Conference on Image Processing (ICIP), 2020, pp. 798-802

  36. arXiv:2005.00430  [pdf, other]

    cs.CV

    Investigating Class-level Difficulty Factors in Multi-label Classification Problems

    Authors: Mark Marsden, Kevin McGuinness, Joseph Antony, Haolin Wei, Milan Redzic, Jian Tang, Zhilan Hu, Alan Smeaton, Noel E O'Connor

    Abstract: This work investigates the use of class-level difficulty factors in multi-label classification problems for the first time. Four class-level difficulty factors are proposed: frequency, visual variation, semantic abstraction, and class co-occurrence. Once computed for a given multi-label classification dataset, these difficulty factors are shown to have several potential applications including the…

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: Published in ICME 2020

  37. arXiv:1912.08741  [pdf, other]

    cs.CV

    Towards Robust Learning with Different Label Noise Distributions

    Authors: Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Noisy labels are an unavoidable consequence of labeling processes and detecting them is an important step towards preventing performance degradations in Convolutional Neural Networks. Discarding noisy labels avoids a harmful memorization, while the associated image content can still be exploited in a semi-supervised learning (SSL) setup. Clean samples are usually identified using the small loss tr…

    Submitted 27 July, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

  38. arXiv:1908.09873  [pdf, other]

    eess.IV cs.CV cs.LG

    End-to-End Conditional GAN-based Architectures for Image Colourisation

    Authors: Marc Górriz, Marta Mrak, Alan F. Smeaton, Noel E. O'Connor

    Abstract: In this work recent advances in conditional adversarial networks are investigated to develop an end-to-end architecture based on Convolutional Neural Networks (CNNs) to directly map realistic colours to an input greyscale image. Observing that existing colourisation methods sometimes exhibit a lack of colourfulness, this paper proposes a method to improve colourisation results. In particular, the…

    Submitted 5 September, 2019; v1 submitted 26 August, 2019; originally announced August 2019.

    Comments: IEEE 21st International Workshop on Multimedia Signal Processing, 27-29 Sept 2019, Kuala Lumpur, Malaysia

  39. arXiv:1908.08873  [pdf, other]

    eess.IV cs.CV cs.LG

    Predicting knee osteoarthritis severity: comparative modeling based on patient's data and plain X-ray images

    Authors: Jaynal Abedin, Joseph Antony, Kevin McGuinness, Kieran Moran, Noel E O'Connor, Dietrich Rebholz-Schuhmann, John Newell

    Abstract: Knee osteoarthritis (KOA) is a disease that impairs knee function and causes pain. A radiologist reviews knee X-ray images and grades the severity level of the impairments according to the Kellgren and Lawrence grading scheme; a five-point ordinal scale (0--4). In this study, we used Elastic Net (EN) and Random Forests (RF) to build predictive models using patient assessment data (i.e. signs and s…

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: Published in Nature Scientific Reports, 2019

    Journal ref: Scientific reports 9, no. 1 (2019): 5761

  40. arXiv:1908.08856  [pdf, other]

    eess.IV cs.CV cs.LG

    Assessing Knee OA Severity with CNN attention-based end-to-end architectures

    Authors: Marc Górriz, Joseph Antony, Kevin McGuinness, Xavier Giró-i-Nieto, Noel E. O'Connor

    Abstract: This work proposes a novel end-to-end convolutional neural network (CNN) architecture to automatically quantify the severity of knee osteoarthritis (OA) using X-Ray images, which incorporates trainable attention modules acting as unsupervised fine-grained detectors of the region of interest (ROI). The proposed attention modules can be applied at different levels and scales across any CNN pipeline…

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: Proceedings of the 2nd International Conference on Medical Imaging with Deep Learning

    Journal ref: Proceedings of The 2nd International Conference on Medical Imaging with Deep Learning, PMLR 102:197-214, 2019

  41. arXiv:1908.02983  [pdf, other]

    cs.CV

    Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning

    Authors: Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Semi-supervised learning, i.e. jointly learning from labeled and unlabeled samples, is an active research topic due to its key role in relaxing human supervision. In the context of image classification, recent advances to learn from unlabeled samples are mainly focused on consistency regularization methods that encourage invariant predictions for different perturbations of unlabeled samples. We, c…

    Submitted 29 June, 2020; v1 submitted 8 August, 2019; originally announced August 2019.

  42. arXiv:1907.01869  [pdf, other]

    cs.CV cs.LG

    Simple vs complex temporal recurrences for video saliency prediction

    Authors: Panagiotis Linardos, Eva Mohedano, Juan Jose Nieto, Noel E. O'Connor, Xavier Giro-i-Nieto, Kevin McGuinness

    Abstract: This paper investigates modifying an existing neural network architecture for static saliency prediction using two types of recurrences that integrate information from the temporal domain. The first modification is the addition of a ConvLSTM within the architecture, while the second is a conceptually simple exponential moving average of an internal convolutional state. We use weights pre-trained o…

    Submitted 16 July, 2019; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Accepted at BMVC 2019

  43. arXiv:1904.11256  [pdf, other]

    cs.CV

    On guiding video object segmentation

    Authors: Diego Ortego, Kevin McGuinness, Juan C. SanMiguel, Eric Arazo, José M. Martínez, Noel E. O'Connor

    Abstract: This paper presents a novel approach for segmenting moving objects in unconstrained environments using guided convolutional neural networks. This guiding process relies on foreground masks from independent algorithms (i.e. state-of-the-art algorithms) to implement an attention mechanism that incorporates the spatial location of foreground and background to compute their separated representations.…

    Submitted 25 April, 2019; originally announced April 2019.

  44. arXiv:1904.11238  [pdf, other]

    cs.CV

    Unsupervised Label Noise Modeling and Loss Correction

    Authors: Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Despite being robust to small amounts of label noise, convolutional neural networks trained with stochastic gradient methods have been shown to easily fit random labels. When there is a mixture of correct and mislabelled targets, networks tend to fit the former before the latter. This suggests using a suitable two-component mixture model as an unsupervised generative model of sample loss values d…

    Submitted 5 June, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

    Comments: Accepted to ICML 2019

  45. arXiv:1809.00567  [pdf, other]

    cs.CV cs.AI

    PathGAN: Visual Scanpath Prediction with Generative Adversarial Networks

    Authors: Marc Assens, Xavier Giro-i-Nieto, Kevin McGuinness, Noel E. O'Connor

    Abstract: We introduce PathGAN, a deep neural network for visual scanpath prediction trained on adversarial examples. A visual scanpath is defined as the sequence of fixation points over an image defined by a human observer with its gaze. PathGAN is composed of two parts, the generator and the discriminator. Both parts extract features from images using off-the-shelf networks, and train recurrent layers to…

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: ECCV 2018 Workshop on Egocentric Perception, Interaction and Computing (EPIC). This work obtained the 2nd award in Prediction of Head-gaze Scan-paths for Images, and the 2nd award in Prediction of Eye-gaze Scan-paths for Images at the IEEE ICME 2018 Salient360! Challenge

  46. arXiv:1711.10795  [pdf, other]

    cs.CV cs.AI cs.IR

    Saliency Weighted Convolutional Features for Instance Search

    Authors: Eva Mohedano, Kevin McGuinness, Xavier Giro-i-Nieto, Noel E. O'Connor

    Abstract: This work explores attention models to weight the contribution of local convolutional representations for the instance search task. We present a retrieval framework based on bags of local convolutional features (BLCF) that benefits from saliency weighting to build an efficient image representation. The use of human visual attention models (saliency) allows significant improvements in retrieval per…

    Submitted 29 November, 2017; originally announced November 2017.

  47. arXiv:1711.05586  [pdf, other]

    cs.CV

    People, Penguins and Petri Dishes: Adapting Object Counting Models To New Visual Domains And Object Types Without Forgetting

    Authors: Mark Marsden, Kevin McGuinness, Suzanne Little, Ciara E. Keogh, Noel E. O'Connor

    Abstract: In this paper we propose a technique to adapt a convolutional neural network (CNN) based object counter to additional visual domains and object types while still preserving the original counting function. Domain-specific normalisation and scaling operators are trained to allow the model to adjust to the statistical distributions of the various visual domains. The developed adaptation technique is…

    Submitted 15 November, 2017; originally announced November 2017.

    Comments: 10 pages

  48. arXiv:1707.03123  [pdf, other]

    cs.CV cs.MM

    SaltiNet: Scan-path Prediction on 360 Degree Images using Saliency Volumes

    Authors: Marc Assens, Kevin McGuinness, Xavier Giro-i-Nieto, Noel E. O'Connor

    Abstract: We introduce SaltiNet, a deep neural network for scanpath prediction trained on 360-degree images. The model is based on a temporal-aware novel representation of saliency information named the saliency volume. The first part of the network consists of a model trained to generate saliency volumes, whose parameters are fit by back-propagation computed from a binary cross entropy (BCE) loss over down…

    Submitted 17 August, 2017; v1 submitted 11 July, 2017; originally announced July 2017.

    Comments: Winner of the Best Scan-path Award at the Salient360!: Visual attention modeling for 360 degrees Images Grand Challenge of ICME 2017. Presented at the ICCV 2017 Workshop on Egocentric Perception, Interaction and Computing (EPIC)

  49. arXiv:1705.10698  [pdf, other]

    cs.CV

    ResnetCrowd: A Residual Deep Learning Architecture for Crowd Counting, Violent Behaviour Detection and Crowd Density Level Classification

    Authors: Mark Marsden, Kevin McGuinness, Suzanne Little, Noel E. O'Connor

    Abstract: In this paper we propose ResnetCrowd, a deep residual architecture for simultaneous crowd counting, violent behaviour detection and crowd density level classification. To train and evaluate the proposed multi-objective technique, a new 100 image dataset referred to as Multi Task Crowd is constructed. This new dataset is the first computer vision dataset fully annotated for crowd counting, violent…

    Submitted 30 May, 2017; originally announced May 2017.

    Comments: 7 Pages, AVSS 2017

  50. arXiv:1703.09856  [pdf, other]

    cs.CV

    Automatic Detection of Knee Joints and Quantification of Knee Osteoarthritis Severity using Convolutional Neural Networks

    Authors: Joseph Antony, Kevin McGuinness, Kieran Moran, Noel E O'Connor

    Abstract: This paper introduces a new approach to automatically quantify the severity of knee OA using X-ray images. Automatically quantifying knee OA severity involves two steps: first, automatically localizing the knee joints; next, classifying the localized knee joint images. We introduce a new approach to automatically detect the knee joints using a fully convolutional neural network (FCN). We train con…

    Submitted 28 March, 2017; originally announced March 2017.