Search | arXiv e-print repository

A Complete System for Automated 3D Semantic-Geometric Map** of Corrosion in Industrial Environments

Authors: Rui Pimentel de Figueiredo, Stefan Nordborg Eriksen, Ignacio Rodriguez, Simon Bøgh

Abstract: Corrosion, a naturally occurring process leading to the deterioration of metallic materials, demands diligent detection for quality control and the preservation of metal-based objects, especially within industrial contexts. Traditional techniques for corrosion identification, including ultrasonic testing, radio-graphic testing, and magnetic flux leakage, necessitate the deployment of expensive and… ▽ More Corrosion, a naturally occurring process leading to the deterioration of metallic materials, demands diligent detection for quality control and the preservation of metal-based objects, especially within industrial contexts. Traditional techniques for corrosion identification, including ultrasonic testing, radio-graphic testing, and magnetic flux leakage, necessitate the deployment of expensive and bulky equipment on-site for effective data acquisition. An unexplored alternative involves employing lightweight, conventional camera systems, and state-of-the-art computer vision methods for its identification. In this work, we propose a complete system for semi-automated corrosion identification and map** in industrial environments. We leverage recent advances in LiDAR-based methods for localization and map**, with vision-based semantic segmentation deep learning techniques, in order to build semantic-geometric maps of industrial environments. Unlike previous corrosion identification systems available in the literature, our designed multi-modal system is low-cost, portable, semi-autonomous and allows collecting large datasets by untrained personnel. A set of experiments in an indoor laboratory environment, demonstrate quantitatively the high accuracy of the employed LiDAR based 3D map** and localization system, with less then $0.05m$ and 0.02m average absolute and relative pose errors. Also, our data-driven semantic segmentation model, achieves around 70\% precision when trained with our pixel-wise manually annotated dataset. △ Less

Submitted 21 April, 2024; originally announced April 2024.

arXiv:2209.13284 [pdf, other]

Frame Interpolation for Dynamic Scenes with Implicit Flow Encoding

Authors: Pedro Figueirêdo, Avinash Paliwal, Nima Khademi Kalantari

Abstract: In this paper, we propose an algorithm to interpolate between a pair of images of a dynamic scene. While in the past years significant progress in frame interpolation has been made, current approaches are not able to handle images with brightness and illumination changes, which are common even when the images are captured shortly apart. We propose to address this problem by taking advantage of the… ▽ More In this paper, we propose an algorithm to interpolate between a pair of images of a dynamic scene. While in the past years significant progress in frame interpolation has been made, current approaches are not able to handle images with brightness and illumination changes, which are common even when the images are captured shortly apart. We propose to address this problem by taking advantage of the existing optical flow methods that are highly robust to the variations in the illumination. Specifically, using the bidirectional flows estimated using an existing pre-trained flow network, we predict the flows from an intermediate frame to the two input images. To do this, we propose to encode the bidirectional flows into a coordinate-based network, powered by a hypernetwork, to obtain a continuous representation of the flow across time. Once we obtain the estimated flows, we use them within an existing blending network to obtain the final intermediate frame. Through extensive experiments, we demonstrate that our approach is able to produce significantly better results than state-of-the-art frame interpolation algorithms. △ Less

Submitted 16 November, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

Comments: Accepted to WACV 2023. Project website: https://people.engr.tamu.edu/nimak/Papers/WACV2023_Interp . Code: https://github.com/pedrovfigueiredo/frameintIFE . YouTube: https://youtu.be/Re_c-CBlSfI

arXiv:2109.01474 [pdf, other]

Real-Time Volumetric-Semantic Exploration and Map**: An Uncertainty-Aware Approach

Authors: Rui Pimentel de Figueiredo, Jonas le Fevre Sejersen, Jakob Grimm Hansen, Martim Brandão, Erdal Kayacan

Abstract: In this work we propose a holistic framework for autonomous aerial inspection tasks, using semantically-aware, yet, computationally efficient planning and map** algorithms. The system leverages state-of-the-art receding horizon exploration techniques for next-best-view (NBV) planning with geometric and semantic segmentation information provided by state-of-the-art deep convolutional neural netwo… ▽ More In this work we propose a holistic framework for autonomous aerial inspection tasks, using semantically-aware, yet, computationally efficient planning and map** algorithms. The system leverages state-of-the-art receding horizon exploration techniques for next-best-view (NBV) planning with geometric and semantic segmentation information provided by state-of-the-art deep convolutional neural networks (DCNNs), with the goal of enriching environment representations. The contributions of this article are threefold, first we propose an efficient sensor observation model, and a reward function that encodes the expected information gains from the observations taken from specific view points. Second, we extend the reward function to incorporate not only geometric but also semantic probabilistic information, provided by a DCNN for semantic segmentation that operates in real-time. The incorporation of semantic information in the environment representation allows biasing exploration towards specific objects, while ignoring task-irrelevant ones during planning. Finally, we employ our approaches in an autonomous drone shipyard inspection task. A set of simulations in realistic scenarios demonstrate the efficacy and efficiency of the proposed framework when compared with the state-of-the-art. △ Less

Submitted 3 September, 2021; originally announced September 2021.

Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021

arXiv:2108.03862 [pdf, other]

Safe Vessel Navigation Visually Aided by Autonomous Unmanned Aerial Vehicles in Congested Harbors and Waterways

Authors: Jonas le Fevre Sejersen, Rui Pimentel de Figueiredo, Erdal Kayacan

Abstract: In the maritime sector, safe vessel navigation is of great importance, particularly in congested harbors and waterways. The focus of this work is to estimate the distance between an object of interest and potential obstacles using a companion UAV. The proposed approach fuses GPS data with long-range aerial images. First, we employ semantic segmentation DNN for discriminating the vessel of interest… ▽ More In the maritime sector, safe vessel navigation is of great importance, particularly in congested harbors and waterways. The focus of this work is to estimate the distance between an object of interest and potential obstacles using a companion UAV. The proposed approach fuses GPS data with long-range aerial images. First, we employ semantic segmentation DNN for discriminating the vessel of interest, water, and potential solid objects using raw image data. The network is trained with both real and images generated and automatically labeled from a realistic AirSim simulation environment. Then, the distances between the extracted vessel and non-water obstacle blobs are computed using a novel GSD estimation algorithm. To the best of our knowledge, this work is the first attempt to detect and estimate distances to unknown objects from long-range visual data captured with conventional RGB cameras and auxiliary absolute positioning systems (e.g. GPS). The simulation results illustrate the accuracy and efficacy of the proposed method for visually aided navigation of vessels assisted by UAV. △ Less

Submitted 9 August, 2021; originally announced August 2021.

Comments: Accepted by "International Conference on Automation Science and Engineering" - Case 2021, waiting for publishment

arXiv:2105.12691 [pdf, other]

On the Advantages of Multiple Stereo Vision Camera Designs for Autonomous Drone Navigation

Authors: Rui Pimentel de Figueiredo, Jakob Grimm Hansen, Jonas Le Fevre, Martim Brandão, Erdal Kayacan

Abstract: In this work we showcase the design and assessment of the performance of a multi-camera UAV, when coupled with state-of-the-art planning and map** algorithms for autonomous navigation. The system leverages state-of-the-art receding horizon exploration techniques for Next-Best-View (NBV) planning with 3D and semantic information, provided by a reconfigurable multi stereo camera system. We employ… ▽ More In this work we showcase the design and assessment of the performance of a multi-camera UAV, when coupled with state-of-the-art planning and map** algorithms for autonomous navigation. The system leverages state-of-the-art receding horizon exploration techniques for Next-Best-View (NBV) planning with 3D and semantic information, provided by a reconfigurable multi stereo camera system. We employ our approaches in an autonomous drone-based inspection task and evaluate them in an autonomous exploration and map** scenario. We discuss the advantages and limitations of using multi stereo camera flying systems, and the trade-off between number of cameras and map** performance. △ Less

Submitted 26 May, 2021; originally announced May 2021.

Journal ref: in ICRA workshop on Resilient and Long-Term Autonomy for Aerial Robotic Systems, 2021

arXiv:2012.12794 [pdf, other]

doi 10.1109/MELECON53508.2022.9842925

NeuXus: A Biosignal Processing and Classification Pipeline for Real-Time Brain-Computer Interaction

Authors: Athanasios Vourvopoulos, Simon Legeay, Patricia Figueiredo

Abstract: In the last few years,Brain-Computer Interfaces (BCIs) have progressed as an emerging research area in the fields of human-computer interaction and interactive systems.This is primarily due to the introduction of low-cost electroencephalographic (EEG) systems that render BCI technology accessible for non-medical research but also due to the advancements of signal processing and machine learning me… ▽ More In the last few years,Brain-Computer Interfaces (BCIs) have progressed as an emerging research area in the fields of human-computer interaction and interactive systems.This is primarily due to the introduction of low-cost electroencephalographic (EEG) systems that render BCI technology accessible for non-medical research but also due to the advancements of signal processing and machine learning methods.Consequently,BCIs could provide a wide new range of possibilities in the way users interact with a computer system (e.g., neuroadaptive interfaces).However,major challenges must still be addressed for BCI systems to mature into an established communication medium for effective human-computer interaction. One of the major challenges involves the easy integration of real-time processing pipelines with portable EEG systems for an out-of-the-lab use. To date, despite the amount of options current open-source tools provide, most toolboxes focus mainly in extending the processing and classification methods but lack on the ability to provide an easy-to-design yet extensible architecture for ubiquitous use.Here, we present NeuXus, a modular toolbox in Python for real-time biosignal processing and pipeline design.NeuXus is open-source and platform independent,providing high-level implementation of processing pipelines for easy BCI design and deployment. △ Less

Submitted 23 December, 2020; originally announced December 2020.

Comments: 12 pages, 2 figures, 1 table

MSC Class: 00A99 ACM Class: H.5.0; H.5.2; J.7; J.2; J.3

arXiv:2007.11070 [pdf, other]

doi 10.4204/EPTCS.321.3

How to Increase Interest in Studying Functional Programming via Interdisciplinary Application

Authors: Pedro Figueirêdo, Yuri Kim, Nghia Le Minh, Evan Sitt, Xue Ying, Viktória Zsók

Abstract: Functional programming represents a modern tool for applying and implementing software. The state of the art in functional programming reports an increasing number of methodologies in this paradigm. However, extensive interdisciplinary applications are missing. Our goal is to increase student interest in pursuing further studies in functional programming with the use of an application: the ray tra… ▽ More Functional programming represents a modern tool for applying and implementing software. The state of the art in functional programming reports an increasing number of methodologies in this paradigm. However, extensive interdisciplinary applications are missing. Our goal is to increase student interest in pursuing further studies in functional programming with the use of an application: the ray tracer. We conducted a teaching experience, with positive results and student feedback, described here in this paper. △ Less

Submitted 24 August, 2020; v1 submitted 21 July, 2020; originally announced July 2020.

Comments: In Proceedings TFPIE 2019 and 2020, arXiv:2008.08923

Journal ref: EPTCS 321, 2020, pp. 37-54

arXiv:2003.09995 [pdf, ps, other]

Gravitational Wave Detection and Information Extraction via Neural Networks

Authors: Gerson R. Santos, Marcela P. Figueiredo, Antonio de Pádua Santos, Pavlos Protopapas, Tiago A. E. Ferreira

Abstract: Laser Interferometer Gravitational-Wave Observatory (LIGO) was the first laboratory to measure the gravitational waves. It was needed an exceptional experimental design to measure distance changes much less than a radius of a proton. In the same way, the data analyses to confirm and extract information is a tremendously hard task. Here, it is shown a computational procedure base on artificial neur… ▽ More Laser Interferometer Gravitational-Wave Observatory (LIGO) was the first laboratory to measure the gravitational waves. It was needed an exceptional experimental design to measure distance changes much less than a radius of a proton. In the same way, the data analyses to confirm and extract information is a tremendously hard task. Here, it is shown a computational procedure base on artificial neural networks to detect a gravitation wave event and extract the knowledge of its ring-down time from the LIGO data. With this proposal, it is possible to make a probabilistic thermometer for gravitational wave detection and obtain physical information about the astronomical body system that created the phenomenon. Here, the ring-down time is determined with a direct data measure, without the need to use numerical relativity techniques and high computational power. △ Less

Submitted 22 March, 2020; originally announced March 2020.

arXiv:1508.03170 [pdf, other]

Generation of Multimedia Artifacts: An Extractive Summarization-based Approach

Authors: Paulo Figueiredo, Marta Aparício, David Martins de Matos, Ricardo Ribeiro

Abstract: We explore methods for content selection and address the issue of coherence in the context of the generation of multimedia artifacts. We use audio and video to present two case studies: generation of film tributes, and lecture-driven science talks. For content selection, we use centrality-based and diversity-based summarization, along with topic analysis. To establish coherence, we use the emotion… ▽ More We explore methods for content selection and address the issue of coherence in the context of the generation of multimedia artifacts. We use audio and video to present two case studies: generation of film tributes, and lecture-driven science talks. For content selection, we use centrality-based and diversity-based summarization, along with topic analysis. To establish coherence, we use the emotional content of music, for film tributes, and ensure topic similarity between lectures and documentaries, for science talks. Composition techniques for the production of multimedia artifacts are addressed as a means of organizing content, in order to improve coherence. We discuss our results considering the above aspects. △ Less

Submitted 13 August, 2015; originally announced August 2015.

Comments: 7 pages, 2 figures

ACM Class: I.2.7

arXiv:1506.01273 [pdf, other]

doi 10.1016/j.patrec.2015.12.016

Summarization of Films and Documentaries Based on Subtitles and Scripts

Authors: Marta Aparício, Paulo Figueiredo, Francisco Raposo, David Martins de Matos, Ricardo Ribeiro, Luís Marujo

Abstract: We assess the performance of generic text summarization algorithms applied to films and documentaries, using the well-known behavior of summarization of news articles as reference. We use three datasets: (i) news articles, (ii) film scripts and subtitles, and (iii) documentary subtitles. Standard ROUGE metrics are used for comparing generated summaries against news abstracts, plot summaries, and s… ▽ More We assess the performance of generic text summarization algorithms applied to films and documentaries, using the well-known behavior of summarization of news articles as reference. We use three datasets: (i) news articles, (ii) film scripts and subtitles, and (iii) documentary subtitles. Standard ROUGE metrics are used for comparing generated summaries against news abstracts, plot summaries, and synopses. We show that the best performing algorithms are LSA, for news articles and documentaries, and LexRank and Support Sets, for films. Despite the different nature of films and documentaries, their relative behavior is in accordance with that obtained for news articles. △ Less

Submitted 9 March, 2016; v1 submitted 3 June, 2015; originally announced June 2015.

Comments: 7 pages, 9 tables, 4 figures, submitted to Pattern Recognition Letters (Elsevier)

ACM Class: I.2.7

Journal ref: Pattern Recognition Letters, Volume 73, 1 April 2016, Pages 7-12

arXiv:1504.06206 [pdf, other]

An Elastic Image Registration Approach for Wireless Capsule Endoscope Localization

Authors: Isabel N. Figueiredo, Carlos Leal, Luís Pinto, Pedro N. Figueiredo, Richard Tsai

Abstract: Wireless Capsule Endoscope (WCE) is an innovative imaging device that permits physicians to examine all the areas of the Gastrointestinal (GI) tract. It is especially important for the small intestine, where traditional invasive endoscopies cannot reach. Although WCE represents an extremely important advance in medical imaging, a major drawback that remains unsolved is the WCE precise location in… ▽ More Wireless Capsule Endoscope (WCE) is an innovative imaging device that permits physicians to examine all the areas of the Gastrointestinal (GI) tract. It is especially important for the small intestine, where traditional invasive endoscopies cannot reach. Although WCE represents an extremely important advance in medical imaging, a major drawback that remains unsolved is the WCE precise location in the human body during its operating time. This is mainly due to the complex physiological environment and the inherent capsule effects during its movement. When an abnormality is detected, in the WCE images, medical doctors do not know precisely where this abnormality is located relative to the intestine and therefore they can not proceed efficiently with the appropriate therapy. The primary objective of the present paper is to give a contribution to WCE localization, using image-based methods. The main focus of this work is on the description of a multiscale elastic image registration approach, its experimental application on WCE videos, and comparison with a multiscale affine registration. The proposed approach includes registrations that capture both rigid-like and non-rigid deformations, due respectively to the rigid-like WCE movement and the elastic deformation of the small intestine originated by the GI peristaltic movement. Under this approach a qualitative information about the WCE speed can be obtained, as well as the WCE location and orientation via projective geometry. The results of the experimental tests with real WCE video frames show the good performance of the proposed approach, when elastic deformations of the small intestine are involved in successive frames, and its superiority with respect to a multiscale affine image registration, which accounts for rigid-like deformations only and discards elastic deformations. △ Less

Submitted 23 April, 2015; originally announced April 2015.

arXiv:1503.02192 [pdf, other]

Uplink Performance Evaluation of Massive MU-MIMO Systems

Authors: Felipe A. P. de Figueiredo, Joao Paulo Miranda, Fabricio L. Figueiredo, Fabbryccio A. C. M. Cardoso

Abstract: The present paper deals with an OFDM-based uplink within a multi-user MIMO (MU-MIMO) system where a massive MIMO approach is employed. In this context, the linear detectors Minimum Mean-Squared Error (MMSE), Zero Forcing (ZF) and Maximum Ratio Combining (MRC) are considered and assessed. This papers includes Bit Error Rate (BER) results for uncoded QPSK/OFDM transmissions through a flat Rayleigh f… ▽ More The present paper deals with an OFDM-based uplink within a multi-user MIMO (MU-MIMO) system where a massive MIMO approach is employed. In this context, the linear detectors Minimum Mean-Squared Error (MMSE), Zero Forcing (ZF) and Maximum Ratio Combining (MRC) are considered and assessed. This papers includes Bit Error Rate (BER) results for uncoded QPSK/OFDM transmissions through a flat Rayleigh fading channel under the assumption of perfect power control and channel estimation. BER results are obtained through Monte Carlo simulations. Performance results are discussed in detail and we confirm the achievable "massive MIMO" effects, even for a reduced complexity detection technique, when the number of receive antennas at BS is much larger than the number of transmit antennas. △ Less

Submitted 7 March, 2015; originally announced March 2015.

arXiv:1411.1108 [pdf, other]

High-level Reasoning and Low-level Learning for Gras**: A Probabilistic Logic Pipeline

Authors: Laura Antanas, Plinio Moreno, Marion Neumann, Rui Pimentel de Figueiredo, Kristian Kersting, José Santos-Victor, Luc De Raedt

Abstract: While grasps must satisfy the gras** stability criteria, good grasps depend on the specific manipulation scenario: the object, its properties and functionalities, as well as the task and grasp constraints. In this paper, we consider such information for robot gras** by leveraging manifolds and symbolic object parts. Specifically, we introduce a new probabilistic logic module to first semantica… ▽ More While grasps must satisfy the gras** stability criteria, good grasps depend on the specific manipulation scenario: the object, its properties and functionalities, as well as the task and grasp constraints. In this paper, we consider such information for robot gras** by leveraging manifolds and symbolic object parts. Specifically, we introduce a new probabilistic logic module to first semantically reason about pre-grasp configurations with respect to the intended tasks. Further, a map** is learned from part-related visual features to good gras** points. The probabilistic logic module makes use of object-task affordances and object/task ontologies to encode rules that generalize over similar object parts and object/task categories. The use of probabilistic logic for task-dependent gras** contrasts with current approaches that usually learn direct map**s from visual perceptions to task-dependent gras** points. We show the benefits of the full probabilistic logic pipeline experimentally and on a real robot. △ Less

Submitted 4 November, 2014; originally announced November 2014.

arXiv:1305.1912 [pdf, other]

doi 10.1109/TMI.2014.2314959

Automated polyp detection in colon capsule endoscopy

Authors: Alexander V. Mamonov, Isabel N. Figueiredo, Pedro N. Figueiredo, Yen-Hsi Richard Tsai

Abstract: Colorectal polyps are important precursors to colon cancer, a major health problem. Colon capsule endoscopy (CCE) is a safe and minimally invasive examination procedure, in which the images of the intestine are obtained via digital cameras on board of a small capsule ingested by a patient. The video sequence is then analyzed for the presence of polyps. We propose an algorithm that relieves the lab… ▽ More Colorectal polyps are important precursors to colon cancer, a major health problem. Colon capsule endoscopy (CCE) is a safe and minimally invasive examination procedure, in which the images of the intestine are obtained via digital cameras on board of a small capsule ingested by a patient. The video sequence is then analyzed for the presence of polyps. We propose an algorithm that relieves the labor of a human operator analyzing the frames in the video sequence. The algorithm acts as a binary classifier, which labels the frame as either containing polyps or not, based on the geometrical analysis and the texture content of the frame. The geometrical analysis is based on a segmentation of an image with the help of a mid-pass filter. The features extracted by the segmentation procedure are classified according to an assumption that the polyps are characterized as protrusions that are mostly round in shape. Thus, we use a best fit ball radius as a decision parameter of a binary classifier. We present a statistical study of the performance of our approach on a data set containing over 18,900 frames from the endoscopic video sequences of five adult patients. The algorithm demonstrates a solid performance, achieving 47% sensitivity per frame and over 81% sensitivity per polyp at a specificity level of 90%. On average, with a video sequence length of 3747 frames, only 367 false positive frames need to be inspected by a human operator. △ Less

Submitted 27 March, 2014; v1 submitted 8 May, 2013; originally announced May 2013.

Comments: 16 pages, 9 figures, 4 tables

ACM Class: I.4.8

Journal ref: IEEE Transactions on Medical Imaging 33(7):1488-1502, 2014

Showing 1–14 of 14 results for author: Figueirêdo, P