Showing 1–50 of 76 results for author: McGuinness, K

Searching in archive cs.
  1. arXiv:2402.09550  [pdf, other]

    cs.LG cs.RO

    Dataset Clustering for Improved Offline Policy Learning

    Authors: Qiang Wang, Yixin Deng, Francisco Roldan Sanchez, Keru Wang, Kevin McGuinness, Noel O'Connor, Stephen J. Redmond

    Abstract: Offline policy learning aims to discover decision-making policies from previously-collected datasets without additional online interactions with the environment. As the training dataset is fixed, its quality becomes a crucial determining factor in the performance of the learned policy. This paper studies a dataset characteristic that we refer to as multi-behavior, indicating that the dataset is co…

    Submitted 14 February, 2024; originally announced February 2024.

  2. arXiv:2311.16514  [pdf, other]

    cs.CV cs.AI cs.LG

    Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach

    Authors: Ayush K. Rai, Tarun Krishna, Feiyan Hu, Alexandru Drimbarean, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: Video Anomaly Detection (VAD) is an open-set recognition task, which is usually formulated as a one-class classification (OCC) problem, where training data is comprised of videos with normal instances while test data contains both normal and anomalous instances. Recent works have investigated the creation of pseudo-anomalies (PAs) using only the normal data and making strong assumptions about real…

    Submitted 7 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted in CVPRW 2024 - VAND Workshop

  3. arXiv:2310.01827  [pdf, other]

    cs.RO cs.AI

    Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency

    Authors: Francisco Roldan Sanchez, Qiang Wang, David Cordova Bulens, Kevin McGuinness, Stephen Redmond, Noel O'Connor

    Abstract: Hindsight Experience Replay (HER) is a technique used in reinforcement learning (RL) that has proven to be very efficient for training off-policy RL-based agents to solve goal-based robotic manipulation tasks using sparse rewards. Even though HER improves the sample efficiency of RL-based agents by learning from mistakes made in past experiences, it does not provide any guidance while exploring th…

    Submitted 19 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: 6 pages, 2 figures, 1 algorithm, 1 table. Version accepted to ICARA 2024

  4. arXiv:2307.12033  [pdf, other]

    cs.CV

    Self-Supervised and Semi-Supervised Polyp Segmentation using Synthetic Data

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Early detection of colorectal polyps is of utmost importance for their treatment and for colorectal cancer prevention. Computer vision techniques have the potential to aid professionals in the diagnosis stage, where colonoscopies are manually carried out to examine the entirety of the patient's colon. The main challenge in medical imaging is the lack of data, and a further challenge specific to po…

    Submitted 22 July, 2023; originally announced July 2023.

  5. arXiv:2307.11661  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts

    Authors: Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O'Connor

    Abstract: Contrastive pretrained large Vision-Language Models (VLMs) like CLIP have revolutionized visual representation learning by providing good performance on downstream datasets. VLMs are 0-shot adapted to a downstream dataset by designing prompts that are relevant to the dataset. Such prompt engineering makes use of domain expertise and a validation dataset. Meanwhile, recent developments in generativ…

    Submitted 8 August, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: Paper accepted at ICCV-W 2023. V2 contains additional comparisons with concurrent works

  6. Joint one-sided synthetic unpaired image translation and segmentation for colorectal cancer prevention

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due to privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We propose CUT-seg, a joint training where a segmentation model and a…

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.08680

  7. arXiv:2305.10115  [pdf, other]

    eess.IV cs.CV cs.LG

    An Ensemble Deep Learning Approach for COVID-19 Severity Prediction Using Chest CT Scans

    Authors: Sidra Aleem, Mayug Maniparambil, Suzanne Little, Noel O'Connor, Kevin McGuinness

    Abstract: Chest X-rays have been widely used for COVID-19 screening; however, 3D computed tomography (CT) is a more effective modality. We present our findings on COVID-19 severity prediction from chest CT scans using the STOIC dataset. We developed an ensemble deep learning based model that incorporates multiple neural networks to improve predictions. To address data imbalance, we used slicing functions an…

    Submitted 17 May, 2023; originally announced May 2023.

  8. arXiv:2302.01052  [pdf, other]

    cs.LG

    Site-specific Deep Learning Path Loss Models based on the Method of Moments

    Authors: Conor Brennan, Kevin McGuinness

    Abstract: This paper describes deep learning models based on convolutional neural networks applied to the problem of predicting EM wave propagation over rural terrain. A surface integral equation formulation, solved with the method of moments and accelerated using the Fast Far Field approximation, is used to generate synthetic training data which comprises path loss computed over randomly generated 1D terra…

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: EuCAP 2023

  9. arXiv:2301.13019  [pdf, other]

    cs.RO cs.LG

    Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Francisco Roldan Sanchez, Kevin McGuinness, Noel E. O'Connor, Stephen J. Redmond

    Abstract: This paper presents our solution for the Real Robot Challenge (RRC) III, a competition featured in the NeurIPS 2022 Competition Track, aimed at addressing dexterous robotic manipulation tasks through learning from pre-collected offline data. Participants were provided with two types of datasets for each task: expert and mixed datasets with varying skill levels. While the simplest offline policy le…

    Submitted 21 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  10. arXiv:2301.11734  [pdf, other]

    cs.LG cs.RO

    Improving Behavioural Cloning with Positive Unlabeled Learning

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Kevin McGuinness, Noel E. O'Connor, Nico Gürtler, Felix Widmaier, Francisco Roldan Sanchez, Stephen J. Redmond

    Abstract: Learning control policies offline from pre-recorded datasets is a promising avenue for solving challenging real-world problems. However, available datasets are typically of mixed quality, with a limited number of the trajectories that we would consider as positive examples; i.e., high-quality demonstrations. Therefore, we propose a novel iterative learning algorithm for identifying expert trajecto…

    Submitted 21 September, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  11. arXiv:2301.09164  [pdf, other]

    cs.LG cs.CV

    Unifying Synergies between Self-supervised Learning and Dynamic Computation

    Authors: Tarun Krishna, Ayush K Rai, Alexandru Drimbarean, Eric Arazo, Paul Albert, Alan F Smeaton, Kevin McGuinness, Noel E O'Connor

    Abstract: Computationally expensive training strategies make self-supervised learning (SSL) impractical for resource constrained industrial settings. Techniques like knowledge distillation (KD), dynamic computation (DC), and pruning are often used to obtain a lightweight model, which usually involves multiple epochs of fine-tuning (or distilling steps) of a large pre-trained model, making it more computation…

    Submitted 9 September, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: Accepted in BMVC 2023

  12. arXiv:2301.04619  [pdf, other]

    cs.CV

    TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation

    Authors: Feiyan Hu, Simone Palazzo, Federica Proietto Salanitri, Giovanni Bellitto, Morteza Moradi, Concetto Spampinato, Kevin McGuinness

    Abstract: Video saliency prediction has recently attracted the attention of the research community, as it is an upstream task for several practical applications. However, current solutions are particularly computationally demanding, especially due to the wide usage of spatio-temporal 3D convolutions. We observe that, while different model architectures achieve similar performance on benchmarks, visual variation…

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: WACV2023

  13. arXiv:2210.05574  [pdf, other]

    cs.CV cs.AI cs.LG

    Motion Aware Self-Supervision for Generic Event Boundary Detection

    Authors: Ayush K. Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The task of Generic Event Boundary Detection (GEBD) aims to detect moments in videos that are naturally perceived by humans as generic and taxonomy-free event boundaries. Modeling the dynamically evolving temporal and spatial changes in a video makes GEBD a difficult problem to solve. Existing approaches involve very complex and sophisticated pipelines in terms of architectural design choices, hen…

    Submitted 12 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  14. arXiv:2210.04578  [pdf, other]

    cs.CV cs.LG

    Is your noise correction noisy? PLS: Robustness to label noise with two stage detection

    Authors: Paul Albert, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness

    Abstract: Designing robust algorithms capable of training accurate neural networks on uncurated datasets from the web has been the subject of much research as it reduces the need for time consuming human labor. The focus of many previous research contributions has been on the detection of different types of label noise; however, this paper proposes to improve the correction accuracy of noisy samples once th…

    Submitted 15 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 9 pages 4 figures. Accepted at WACV 2023

  15. arXiv:2210.02476  [pdf, other]

    cs.CV cs.AI cs.LG

    BaseTransformers: Attention over base data-points for One Shot Learning

    Authors: Mayug Maniparambil, Kevin McGuinness, Noel O'Connor

    Abstract: Few shot classification aims to learn to recognize novel categories using only limited samples per category. Most current few shot methods use a base dataset rich in labeled examples to train an encoder that is used for obtaining representations of support instances for novel classes. Since the test instances are from a distribution different to the base distribution, their feature representations…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Paper accepted at British Machine Vision Conference 2022

  16. arXiv:2210.00824  [pdf, other]

    eess.IV cs.CV cs.LG

    Random Data Augmentation based Enhancement: A Generalized Enhancement Approach for Medical Datasets

    Authors: Sidra Aleem, Teerath Kumar, Suzanne Little, Malika Bendechache, Rob Brennan, Kevin McGuinness

    Abstract: Over the years, the paradigm of medical image analysis has shifted from manual expertise to automated systems, often using deep learning (DL) systems. The performance of deep learning algorithms is highly dependent on data quality. Particularly for the medical domain, it is an important aspect as medical data is very sensitive to quality and poor quality can lead to misdiagnosis. To improve the di…

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: Our paper is accepted at 24th Irish Machine Vision and Image Processing (IMVIP) Conference, Belfast. Paper got BCS NI Best Poster Presentation Award and copy of proceeding is at https://imvipconference.github.io/IMVIP2022_Proceedings.pdf

  17. arXiv:2210.00795  [pdf, other]

    cs.RO

    Hierarchical reinforcement learning for in-hand robotic manipulation using Davenport chained rotations

    Authors: Francisco Roldan Sanchez, Qiang Wang, David Cordova Bulens, Kevin McGuinness, Stephen Redmond, Noel O'Connor

    Abstract: End-to-end reinforcement learning techniques are among the most successful methods for robotic manipulation tasks. However, the training time required to find a good policy capable of solving complex tasks is prohibitively large. Therefore, depending on the computing resources available, it might not be feasible to use such techniques. The use of domain knowledge to decompose manipulation tasks in…

    Submitted 15 November, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures, 3 tables, submitted to ICARA 2023

  18. arXiv:2210.00507  [pdf, other]

    cs.CV cs.LG

    Fast and Robust Video-Based Exercise Classification via Body Pose Tracking and Scalable Multivariate Time Series Classifiers

    Authors: Ashish Singh, Antonio Bevilacqua, Thach Le Nguyen, Feiyan Hu, Kevin McGuinness, Martin OReilly, Darragh Whelan, Brian Caulfield, Georgiana Ifrim

    Abstract: Technological advancements have spurred the usage of machine learning based applications in sports science. Physiotherapists, sports coaches and athletes actively look to incorporate the latest technologies in order to further improve performance and avoid injuries. While wearable sensors are very popular, their use is hindered by constraints on battery power and sensor calibration, especially for…

    Submitted 2 October, 2022; originally announced October 2022.

  19. arXiv:2209.09714  [pdf, other]

    eess.IV cs.CV

    Cardiac Segmentation using Transfer Learning under Respiratory Motion Artifacts

    Authors: Carles Garcia-Cabrera, Eric Arazo, Kathleen M. Curran, Noel E. O'Connor, Kevin McGuinness

    Abstract: Methods that are resilient to artifacts in cardiac magnetic resonance imaging (MRI) while performing ventricle segmentation are crucial for ensuring quality in structural and functional analysis of those tissues. While there have been significant efforts on improving the quality of the algorithms, few works have tackled the harm that the artifacts generate in the predictions. In this work, we…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: accepted for the STACOM2022 workshop @ MICCAI2022

  20. arXiv:2209.08903  [pdf, other]

    cs.RO

    Towards advanced robotic manipulation

    Authors: Francisco Roldan Sanchez, Stephen Redmond, Kevin McGuinness, Noel O'Connor

    Abstract: Robotic manipulation and control has increased in importance in recent years. However, state of the art techniques still have limitations when required to operate in real world applications. This paper explores Hindsight Experience Replay both in simulated and real environments, highlighting its weaknesses and proposing reinforcement-learning based alternatives based on reward and goal shaping. Ad…

    Submitted 26 September, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: 4 pages, 1 figure, Submitted to PhD Workshop submission at IRC 2022. Updated figure 1

  21. arXiv:2207.12065  [pdf, other]

    cs.CV

    Dynamic Channel Selection in Self-Supervised Learning

    Authors: Tarun Krishna, Ayush K. Rai, Yasser A. D. Djilali, Alan F. Smeaton, Kevin McGuinness, Noel E. O'Connor

    Abstract: Whilst computer vision models built using self-supervised approaches are now commonplace, some important questions remain. Do self-supervised models learn highly redundant channel features? What if a self-supervised network could dynamically select the important channels and get rid of the unnecessary ones? Currently, convnets pre-trained with self-supervision have obtained comparable performance…

    Submitted 16 December, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted in Irish Machine Vision and Image Processing Conference 2022

  22. arXiv:2207.01573  [pdf, other]

    cs.CV

    Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets

    Authors: Paul Albert, Eric Arazo, Noel E. O'Connor, Kevin McGuinness

    Abstract: Using search engines for web image retrieval is a tempting alternative to manual curation when creating an image dataset, but their main drawback remains the proportion of incorrect (noisy) samples retrieved. These noisy samples have been evidenced by previous works to be a mixture of in-distribution (ID) samples, assigned to the incorrect category but presenting similar visual semantics to other…

    Submitted 18 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  23. arXiv:2206.04449  [pdf, other]

    cs.CV

    Segmentation Enhanced Lameness Detection in Dairy Cows from RGB and Depth Video

    Authors: Eric Arazo, Robin Aly, Kevin McGuinness

    Abstract: Cow lameness is a severe condition that affects the life cycle and life quality of dairy cows and results in considerable economic losses. Early lameness detection helps farmers address illnesses early and avoid negative effects caused by the degeneration of cows' condition. We collected a dataset of short clips of cows passing through a hallway exiting a milking station and annotated the degree o…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted at the CV4Animals workshop in CVPR 2022

  24. Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks

    Authors: Qiang Wang, Francisco Roldan Sanchez, Robert McCarthy, David Cordova Bulens, Kevin McGuinness, Noel O'Connor, Manuel Wüthrich, Felix Widmaier, Stefan Bauer, Stephen J. Redmond

    Abstract: This paper describes a deep reinforcement learning (DRL) approach that won Phase 1 of the Real Robot Challenge (RRC) 2021, and then extends this method to a more difficult manipulation task. The RRC consisted of using a TriFinger robot to manipulate a cube along a specified positional trajectory, but with no requirement for the cube to have any specific orientation. We used a relatively simple rew…

    Submitted 27 January, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: This paper has been submitted to Expert Systems: the Journal of Knowledge Engineering for reviewing. arXiv admin note: text overlap with arXiv:2109.15233

  25. arXiv:2204.09343  [pdf]

    cs.CV

    Utilizing unsupervised learning to improve sward content prediction and herbage mass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Brian Mac Namee, Deirdre Hennessy, Aisling H. O'Connor, Noel E. O'Connor, Kevin McGuinness

    Abstract: Sward species composition estimation is a tedious task. Herbage must be collected in the field, manually separated into components, dried and weighed to estimate species composition. Deep learning approaches using neural networks have been used in previous work to propose faster and more cost efficient alternatives to this process by estimating the biomass information from a picture of an area of p…

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 3 pages. Accepted at the 29th EGF General Meeting 2022

  26. arXiv:2204.08271  [pdf, other]

    cs.CV

    Unsupervised domain adaptation and super resolution on drone images for autonomous dry herbage biomass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Jaime Fernandez, Brian Mac Namee, Deirdre Hennessey, Noel E. O'Connor, Kevin McGuinness

    Abstract: Herbage mass yield and composition estimation is an important tool for dairy farmers to ensure an adequate supply of high quality herbage for grazing and subsequently milk production. By accurately estimating herbage mass and composition, targeted nitrogen fertiliser application strategies can be deployed to improve localised regions in a herbage field, effectively reducing the negative impacts of…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 11 pages, 5 figures. Accepted at the Agriculture-Vision CVPR 2022 Workshop

  27. arXiv:2202.08680  [pdf, other]

    eess.IV cs.CV

    Synthetic data for unsupervised polyp segmentation

    Authors: Enric Moreu, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due to privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We use zero annotations from medical professionals in our pipeline. Ou…

    Submitted 17 February, 2022; originally announced February 2022.

  28. arXiv:2202.08670  [pdf, other]

    cs.CV cs.AI

    Domain Randomization for Object Counting

    Authors: Enric Moreu, Kevin McGuinness, Diego Ortego, Noel E. O'Connor

    Abstract: Recently, the use of synthetic datasets based on game engines has been shown to improve the performance of several tasks in computer vision. However, these datasets are typically only appropriate for the specific domains depicted in computer games, such as urban scenes involving vehicles and people. In this paper, we present an approach to generate synthetic datasets for object counting for any do…

    Submitted 17 February, 2022; originally announced February 2022.

  29. arXiv:2201.10243  [pdf, other]

    cs.CV cs.LG

    BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment

    Authors: Luis Lebron, Yvette Graham, Kevin McGuinness, Konstantinos Kouramas, Noel E. O'Connor

    Abstract: Evaluating video captioning systems is a challenging task as there are multiple factors to consider; for instance: the fluency of the caption, multiple actions happening in a single scene, and the human bias of what is considered important. Most metrics try to measure how similar the system generated captions are to a single or a set of human-annotated captions. This paper presents a new method ba…

    Submitted 16 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: In press in Language Resources and Evaluation Conference (LREC) 2022

  30. arXiv:2111.09056  [pdf, other]

    cs.CV cs.CY cs.MM

    Improving Person Re-Identification with Temporal Constraints

    Authors: Julia Dietlmeier, Feiyan Hu, Frances Ryan, Noel E. O'Connor, Kevin McGuinness

    Abstract: In this paper we introduce an image-based person re-identification dataset collected across five non-overlapping camera views in the large and busy airport in Dublin, Ireland. Unlike all publicly available image-based datasets, our dataset contains timestamp information in addition to frame number, and camera and person IDs. Also our dataset has been fully anonymized to comply with modern data pri…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 10 pages, RWS @ WACV2022

  31. arXiv:2110.14283  [pdf, other]

    cs.CV

    How Important is Importance Sampling for Deep Budgeted Training?

    Authors: Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Long iterative training processes for Deep Neural Networks (DNNs) are commonly required to achieve state-of-the-art performance in many computer vision tasks. Importance sampling approaches might play a key role in budgeted training regimes, i.e. when limiting the number of training iterations. These approaches aim at dynamically estimating the importance of each sample to focus on the most releva…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: British Machine Vision Conference (BMVC) 2021, oral presentation

  32. arXiv:2110.13719  [pdf, other]

    cs.CV

    Semi-supervised dry herbage mass estimation using automatic data and synthetic images

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Brian Mac Namee, Deirdre Hennessy, Aisling O'Connor, Noel O'Connor, Kevin McGuinness

    Abstract: Monitoring species-specific dry herbage biomass is an important aspect of pasture-based milk production systems. Being aware of the herbage biomass in the field enables farmers to manage surpluses and deficits in herbage supply, as well as using targeted nitrogen fertilization when necessary. Deep learning for computer vision is a powerful tool in this context as it can accurately estimate the dry…

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Published at CVPPA 2021, ICCVW 2021

  33. arXiv:2110.13699  [pdf, other]

    cs.CV

    Addressing out-of-distribution label noise in webly-labelled data

    Authors: Paul Albert, Diego Ortego, Eric Arazo, Noel O'Connor, Kevin McGuinness

    Abstract: A recurring focus of the deep learning community is towards reducing the labeling effort. Data gathering and annotation using a search engine is a simple alternative to generating a fully human-annotated and human-gathered dataset. Although web crawling is very time efficient, some of the retrieved images are unavoidably noisy, i.e. incorrectly labeled. Designing robust algorithms for training on…

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at WACV 2022

  34. arXiv:2109.15233  [pdf, other]

    cs.RO cs.LG

    Solving the Real Robot Challenge using Deep Reinforcement Learning

    Authors: Robert McCarthy, Francisco Roldan Sanchez, Qiang Wang, David Cordova Bulens, Kevin McGuinness, Noel O'Connor, Stephen J. Redmond

    Abstract: This paper details our winning submission to Phase 1 of the 2021 Real Robot Challenge; a challenge in which a three-fingered robot must carry a cube along specified goal trajectories. To solve Phase 1, we use a pure reinforcement learning approach which requires minimal expert knowledge of the robotic system, or of robotic grasping in general. A sparse, goal-based reward is employed in conjunction…

    Submitted 27 June, 2022; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: Published in AICS 2021 (http://ceur-ws.org/Vol-3105/paper41.pdf). Paper updated to clarify procedure used to train the policy

  35. arXiv:2109.10957  [pdf, other]

    cs.RO stat.AP

    Real Robot Challenge: A Robotics Competition in the Cloud

    Authors: Stefan Bauer, Felix Widmaier, Manuel Wüthrich, Annika Buchholz, Sebastian Stark, Anirudh Goyal, Thomas Steinbrenner, Joel Akpo, Shruti Joshi, Vincent Berenz, Vaibhav Agrawal, Niklas Funk, Julen Urain De Jesus, Jan Peters, Joe Watson, Claire Chen, Krishnan Srinivasan, Junwu Zhang, Jeffrey Zhang, Matthew R. Walter, Rishabh Madan, Charles Schaff, Takahiro Maeda, Takuma Yoneda, Denis Yarats, et al. (17 additional authors not shown)

    Abstract: Dexterous manipulation remains an open problem in robotics. To coordinate efforts of the research community towards tackling this problem, we propose a shared benchmark. We designed and built robotic platforms that are hosted at MPI for Intelligent Systems and can be accessed remotely. Each platform consists of three robotic fingers that are capable of dexterous object manipulation. Users are able…

    Submitted 10 June, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

  36. arXiv:2106.10090  [pdf, other]

    cs.CV cs.AI

    Discerning Generic Event Boundaries in Long-Form Wild Videos

    Authors: Ayush K Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F Smeaton, Noel E O'Connor

    Abstract: Detecting generic, taxonomy-free event boundaries in videos represents a major stride forward towards holistic video understanding. In this paper we present a technique for generic event boundary detection based on a two stream inflated 3D convolutions architecture, which can learn spatio-temporal features from videos. Our work is inspired from the Generic Event Boundary Detection Challenge (part of…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: Technical Report for Generic Event Boundary Challenge - LOVEU Challenge (CVPR 2021)

  37. arXiv:2106.09814  [pdf, other]

    cs.MM cs.SD eess.AS

    PixInWav: Residual Steganography for Hiding Pixels in Audio

    Authors: Margarita Geleta, Cristina Punti, Kevin McGuinness, Jordi Pons, Cristian Canton, Xavier Giro-i-Nieto

    Abstract: Steganography comprises the mechanics of hiding data in a host media that may be publicly available. While previous works focused on unimodal setups (e.g., hiding images in images, or hiding audio in audio), PixInWav targets the multimodal case of hiding images in audio. To this end, we propose a novel residual architecture operating on top of short-time discrete cosine transform (STDCT) audio spe…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Extended abstract presented in CVPR 2021 Women in Computer Vision Workshop

  38. Evaluating Contrastive Models for Instance-based Image Retrieval

    Authors: Tarun Krishna, Kevin McGuinness, Noel O'Connor

    Abstract: In this work, we evaluate contrastive models for the task of image retrieval. We hypothesise that models that are learned to encode semantic similarity among instances via discriminative learning should perform well on the task of image retrieval, where relevancy is defined in terms of instances of the same object. Through our extensive evaluation, we find that representations from models trained…

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: Accepted In Proceedings of the 2021 International Conference on Multimedia Retrieval (ICMR 21)

  39. arXiv:2101.03198  [pdf, other]

    cs.CV cs.LG

    Extracting Pasture Phenotype and Biomass Percentages using Weakly Supervised Multi-target Deep Learning on a Small Dataset

    Authors: Badri Narayanan, Mohamed Saadeldin, Paul Albert, Kevin McGuinness, Brian Mac Namee

    Abstract: The dairy industry uses clover and grass as fodder for cows. Accurate estimation of grass and clover biomass yield enables smart decisions in optimizing fertilization and seeding density, resulting in increased productivity and positive environmental impact. Grass and clover are usually planted together, since clover is a nitrogen-fixing plant that brings nutrients to the soil. Adjusting the right…

    Submitted 8 January, 2021; originally announced January 2021.

    Journal ref: Irish Machine Vision and Image Processing Conference (2020) 21-28

  40. arXiv:2012.10283  [pdf, other]

    cs.CV cs.LG

    Temporal Bilinear Encoding Network of Audio-Visual Features at Low Sampling Rates

    Authors: Feiyan Hu, Eva Mohedano, Noel O'Connor, Kevin McGuinness

    Abstract: Current deep learning based video classification architectures are typically trained end-to-end on large volumes of data and require extensive computational resources. This paper aims to exploit audio-visual information in video classification with a 1 frame per second sampling rate. We propose Temporal Bilinear Encoding Networks (TBEN) for encoding both audio and visual long range temporal inform…

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 8 pages

  41. arXiv:2012.04462  [pdf, other]

    cs.CV

    Multi-Objective Interpolation Training for Robustness to Label Noise

    Authors: Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Deep neural networks trained with standard cross-entropy loss memorize noisy labels, which degrades their performance. Most research to mitigate this memorization proposes new robust classification loss functions. Conversely, we propose a Multi-Objective Interpolation Training (MOIT) approach that jointly exploits contrastive learning and classification to mutually help each other and boost perfor…

    Submitted 18 March, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted to CVPR 2021. 10 pages, 1 figure, and 9 tables

  42. arXiv:2011.10600  [pdf, other]

    cs.CV cs.LG

    ATSal: An Attention Based Architecture for Saliency Prediction in 360 Videos

    Authors: Yasser Dahou, Marouane Tliba, Kevin McGuinness, Noel O'Connor

    Abstract: The spherical domain representation of 360 video/image presents many challenges related to the storage, processing, transmission and rendering of omnidirectional videos (ODV). Models of human visual attention can be used so that only a single viewport is rendered at a time, which is important when developing systems that allow users to explore ODV with head mounted displays (HMD). Accordingly, res…

    Submitted 20 November, 2020; originally announced November 2020.

  43. arXiv:2011.07616  [pdf, other]

    cs.SD cs.LG eess.AS

    Unsupervised Contrastive Learning of Sound Event Representations

    Authors: Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra

    Abstract: Self-supervised representation learning can mitigate the limitations in recognition tasks with few manually labeled data but abundant unlabeled data, a common scenario in sound event research. In this work, we explore unsupervised contrastive learning as a way to learn sound event representations. To this end, we propose to use the pretext task of contrasting differently augmented views of sound…

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: A 4-page version is submitted to ICASSP 2021

  44. arXiv:2010.06307  [pdf, other]

    cs.CV cs.AI cs.LG

    How important are faces for person re-identification?

    Authors: Julia Dietlmeier, Joseph Antony, Kevin McGuinness, Noel E. O'Connor

    Abstract: This paper investigates the dependence of existing state-of-the-art person re-identification models on the presence and visibility of human faces. We apply a face detection and blurring algorithm to create anonymized versions of several popular person re-identification datasets including Market1501, DukeMTMC-reID, CUHK03, Viper, and Airport. Using a cross-section of existing state-of-the-art model…

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: 25th International Conference on Pattern Recognition (ICPR2020), Milan, Italy, 10-15 January 2021

  45. arXiv:2010.01947  [pdf, other]

    eess.IV cs.CV cs.LG

    A Comparative Study of Existing and New Deep Learning Methods for Detecting Knee Injuries using the MRNet Dataset

    Authors: David Azcona, Kevin McGuinness, Alan F. Smeaton

    Abstract: This work presents a comparative study of existing and new techniques to detect knee injuries by leveraging Stanford's MRNet Dataset. All approaches are based on deep learning and we explore the comparative performances of transfer learning and a deep residual network trained from scratch. We also exploit some characteristics of Magnetic Resonance Imaging (MRI) data by, for example, using a fixed…

    Submitted 5 October, 2020; originally announced October 2020.

  46. arXiv:2008.11151  [pdf, other]

    cs.CV cs.LG

    FastSal: a Computationally Efficient Network for Visual Saliency Prediction

    Authors: Feiyan Hu, Kevin McGuinness

    Abstract: This paper focuses on the problem of visual saliency prediction, predicting regions of an image that tend to attract human visual attention, under a constrained computational budget. We modify and test various recent efficient convolutional neural network architectures like EfficientNet and MobileNetV2 and compare them with existing state-of-the-art saliency models such as SalGAN and DeepGaze II b…

    Submitted 25 August, 2020; originally announced August 2020.

  47. arXiv:2007.11866  [pdf, other]

    cs.CV

    Reliable Label Bootstrapping for Semi-Supervised Learning

    Authors: Paul Albert, Diego Ortego, Eric Arazo, Noel E. O'Connor, Kevin McGuinness

    Abstract: Reducing the number of labels required to train convolutional neural networks without performance degradation is key to effectively reducing human annotation efforts. We propose Reliable Label Bootstrapping (ReLaB), an unsupervised preprocessing algorithm which improves the performance of semi-supervised algorithms in extremely low supervision settings. Given a dataset with few labeled samples, we…

    Submitted 25 February, 2021; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 10 pages, 3 figures

  48. arXiv:2005.00430  [pdf, other]

    cs.CV

    Investigating Class-level Difficulty Factors in Multi-label Classification Problems

    Authors: Mark Marsden, Kevin McGuinness, Joseph Antony, Haolin Wei, Milan Redzic, Jian Tang, Zhilan Hu, Alan Smeaton, Noel E O'Connor

    Abstract: This work investigates the use of class-level difficulty factors in multi-label classification problems for the first time. Four class-level difficulty factors are proposed: frequency, visual variation, semantic abstraction, and class co-occurrence. Once computed for a given multi-label classification dataset, these difficulty factors are shown to have several potential applications including the…

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: Published in ICME 2020

  49. arXiv:1912.08741  [pdf, other]

    cs.CV

    Towards Robust Learning with Different Label Noise Distributions

    Authors: Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Noisy labels are an unavoidable consequence of labeling processes and detecting them is an important step towards preventing performance degradations in Convolutional Neural Networks. Discarding noisy labels avoids a harmful memorization, while the associated image content can still be exploited in a semi-supervised learning (SSL) setup. Clean samples are usually identified using the small loss tr…

    Submitted 27 July, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

  50. arXiv:1910.11603  [pdf, other]

    cs.LG cs.CR

    MediaEval 2019: Concealed FGSM Perturbations for Privacy Preservation

    Authors: Panagiotis Linardos, Suzanne Little, Kevin McGuinness

    Abstract: This work tackles the Pixel Privacy task put forth by MediaEval 2019. Our goal is to manipulate images in a way that conceals them from automatic scene classifiers while preserving the original image quality. We use the fast gradient sign method, which normally has a corrupting influence on image appeal, and devise two methods to minimize the damage. The first approach uses a map of pixel location…

    Submitted 25 October, 2019; originally announced October 2019.

    Comments: MediaEval 2019 - Pixel Privacy