-
Unraveling many-body effects in ZnO: Combined study using momentum-resolved electron energy-loss spectroscopy and first-principles calculations
Authors:
Dario A. Leon,
Cana Elgvin,
Phuong Dan Nguyen,
Øystein Prytz,
Fredrik S. Hage,
Kristian Berland
Abstract:
We present a detailed study of the dielectric response of ZnO using a combination of low-loss momentum-resolved electron energy-loss spectroscopy (EELS) and first-principles calculations at several levels of theory, from the independent-particle and random-phase approximations with different variants of density functional theory (DFT), including hybrid and DFT$+U$ schemes, to the Bethe-Salpeter equation (BSE). We use a method based on the $f$-sum rule to obtain the momentum-resolved experimental loss function and absorption spectra from EELS measurements. We characterize the main features in the direct and inverse dielectric functions of ZnO and their dispersion, associating them with single-particle features in the electronic band structure, while highlighting the important role of many-body effects such as plasmons and excitons. We discuss different signatures of the high anisotropy in the response function of ZnO, including the symmetry of the excitonic wave-functions.
Submitted 13 March, 2024;
originally announced March 2024.
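The $f$-sum rule mentioned in the abstract above can be illustrated with a short numerical sketch. This is not the authors' procedure; it only shows the general idea that the first frequency moment of the loss function, the integral of ω·Im[−1/ε(q, ω)] over ω, equals (π/2)·ω_p², so an unnormalized spectrum can be rescaled to satisfy it. The function names, the plasma frequency `omega_p`, the arbitrary units (ħ = 1), and the Drude model used to generate test data are all assumptions made for illustration.

```python
import numpy as np


def trapezoid(y, x):
    """Trapezoidal integral of y(x), written out to avoid NumPy version differences."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))


def drude_loss(omega, omega_p, gamma):
    """Drude-model loss function Im[-1/eps(omega)] (illustrative test data only)."""
    return omega_p**2 * gamma * omega / (
        (omega**2 - omega_p**2) ** 2 + (gamma * omega) ** 2
    )


def normalize_by_f_sum(omega, raw_loss, omega_p):
    """Rescale an unnormalized loss spectrum so that its first moment
    matches (pi / 2) * omega_p**2, as required by the f-sum rule.

    Assumes the grid extends far enough that the truncated tail is negligible.
    """
    first_moment = trapezoid(omega * raw_loss, omega)
    scale = (np.pi / 2.0) * omega_p**2 / first_moment
    return scale * raw_loss


if __name__ == "__main__":
    omega = np.linspace(0.0, 200.0, 200001)
    reference = drude_loss(omega, omega_p=1.5, gamma=0.2)
    # Pretend the measurement is known only up to an arbitrary intensity factor.
    measured = 37.0 * reference
    recovered = normalize_by_f_sum(omega, measured, omega_p=1.5)
    print("max relative deviation:",
          np.max(np.abs(recovered[1:] / reference[1:] - 1.0)))
```

In a real measurement the integration range is finite and the effective electron density depends on that range, so the right-hand side of the sum rule would need to be chosen with care; the sketch ignores this.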
-
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
Authors:
Phuc D. A. Nguyen,
Tuan Duc Ngo,
Evangelos Kalogerakis,
Chuang Gan,
Anh Tran,
Cuong Pham,
Khoi Nguyen
Abstract:
We introduce Open3DIS, a novel solution designed to tackle the problem of Open-Vocabulary Instance Segmentation within 3D scenes. Objects within 3D environments exhibit diverse shapes, scales, and colors, making precise instance-level identification a challenging task. Recent advancements in Open-Vocabulary scene understanding have made significant strides in this area by employing class-agnostic 3D instance proposal networks for object localization and learning queryable features for each 3D mask. While these methods produce high-quality instance proposals, they struggle with identifying small-scale and geometrically ambiguous objects. The key idea of our method is a new module that aggregates 2D instance masks across frames and maps them to geometrically coherent point cloud regions as high-quality object proposals addressing the above limitations. These are then combined with 3D class-agnostic instance proposals to include a wide range of objects in the real world. To validate our approach, we conducted experiments on three prominent datasets, including ScanNet200, S3DIS, and Replica, demonstrating significant performance gains in segmenting objects with diverse categories over the state-of-the-art approaches.
Submitted 5 April, 2024; v1 submitted 17 December, 2023;
originally announced December 2023.
-
Acousto-optic reconstruction of exterior sound field based on concentric circle sampling with circular harmonic expansion
Authors:
Phuc Duc Nguyen,
Kenji Ishikawa,
Noboru Harada,
Takehiro Moriya
Abstract:
Acousto-optic sensing provides an alternative approach to traditional microphone arrays by shedding light on the interaction of light with an acoustic field. Sound field reconstruction is a fascinating and advanced technique used in acousto-optics sensing. Current challenges in sound-field reconstruction methods pertain to scenarios in which the sound source is located within the reconstruction area, known as the exterior problem. Existing reconstruction algorithms, primarily designed for interior scenarios, often exhibit suboptimal performance when applied to exterior cases. This paper introduces a novel technique for exterior sound-field reconstruction. The proposed method leverages concentric circle sampling and a two-dimensional exterior sound-field reconstruction approach based on circular harmonic extensions. To evaluate the efficacy of this approach, both numerical simulations and practical experiments are conducted. The results highlight the superior accuracy of the proposed method when compared to conventional reconstruction methods, all while utilizing a minimal amount of measured projection data.
Submitted 3 November, 2023;
originally announced November 2023.
-
Advancing Wound Filling Extraction on 3D Faces: Auto-Segmentation and Wound Face Regeneration Approach
Authors:
Duong Q. Nguyen,
Thinh D. Le,
Phuong D. Nguyen,
Nga T. K. Le,
H. Nguyen-Xuan
Abstract:
Facial wound segmentation plays a crucial role in preoperative planning and optimizing patient outcomes in various medical applications. In this paper, we propose an efficient approach for automating 3D facial wound segmentation using a two-stream graph convolutional network. Our method leverages the Cir3D-FaIR dataset and addresses the challenge of data imbalance through extensive experimentation with different loss functions. To achieve accurate segmentation, we conducted thorough experiments and selected a high-performing model from the trained models. The selected model demonstrates exceptional segmentation performance for complex 3D facial wounds. Furthermore, based on the segmentation model, we propose an improved approach for extracting 3D facial wound fillers and compare it to the results of the previous study. Our method achieved a remarkable accuracy of 0.9999986% on the test suite, surpassing the performance of the previous method. From this result, we use 3D printing technology to illustrate the shape of the wound filling. The outcomes of this study have significant implications for physicians involved in preoperative planning and intervention design. By automating facial wound segmentation and improving the accuracy of wound-filling extraction, our approach can assist in carefully assessing and optimizing interventions, leading to enhanced patient outcomes. Additionally, it contributes to advancing facial reconstruction techniques by utilizing machine learning and 3D bioprinting for printing skin tissue implants. Our source code is available at https://github.com/SIMOGroup/WoundFilling3D.
Submitted 12 July, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.
-
Bearing-Based Network Localization Under Randomized Gossip Protocol
Authors:
Nhat-Minh Le-Phan,
Minh Hoang Trinh,
Phuoc Doan Nguyen
Abstract:
In this paper, we consider a randomized gossip algorithm for the bearing-based network localization problem. Let each sensor node be able to obtain the bearing vectors and communicate its position estimates with several neighboring agents. Each update involves two agents, and the update sequence follows a stochastic process. Under the assumption that the network is infinitesimally bearing rigid and contains at least two beacon nodes, we show that when the updating step-size is properly selected, the proposed algorithm can successfully estimate the actual sensor nodes' positions with probability one. The randomized update provides a simple, distributed, and cost-effective method for localizing the network. The theoretical result is supported with a simulation of a 1089-node sensor network.
Submitted 17 January, 2024; v1 submitted 27 April, 2023;
originally announced April 2023.
-
Application of Self-Supervised Learning to MICA Model for Reconstructing Imperfect 3D Facial Structures
Authors:
Phuong D. Nguyen,
Thinh D. Le,
Duong Q. Nguyen,
Binh Nguyen,
H. Nguyen-Xuan
Abstract:
In this study, we emphasize the integration of a pre-trained MICA model with an imperfect face dataset, employing a self-supervised learning approach. We present an innovative method for regenerating flawed facial structures, yielding 3D printable outputs that effectively support physicians in their patient treatment process. Our results highlight the model's capacity for concealing scars and achieving comprehensive facial reconstructions without discernible scarring. By capitalizing on pre-trained models and necessitating only a few hours of supplementary training, our methodology adeptly devises an optimal model for reconstructing damaged and imperfect facial features. Harnessing contemporary 3D printing technology, we institute a standardized protocol for fabricating realistic, camouflaging mask models for patients in a laboratory environment.
Submitted 8 April, 2023;
originally announced April 2023.
-
Randomized Matrix Weighted Consensus
Authors:
Nhat-Minh Le-Phan,
Minh Hoang Trinh,
Phuoc Doan Nguyen
Abstract:
In this paper, randomized gossip-type matrix-weighted consensus algorithms are proposed for both leaderless and leader-follower topologies. First, we introduce the notion of expected matrix-weighted network, which captures the multi-dimensional interactions between any two agents in a probabilistic sense. Under some mild assumptions on the distribution of the expected matrix weights and the upper bound of the updating step size, the proposed asynchronous pairwise update algorithms drive the network to achieve a consensus in expectation. An upper bound of the $ε$-convergence time of the algorithm is then derived. Furthermore, the proposed algorithms are applied to the bearing-based network localization and formation control problems. The theoretical results are supported by several numerical examples.
Submitted 6 February, 2024; v1 submitted 26 March, 2023;
originally announced March 2023.
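The asynchronous pairwise update described in the abstract above can be sketched as a toy simulation. This is a minimal illustration under stated assumptions, not the paper's algorithm: the function name `gossip_matrix_consensus`, the uniform random choice of one edge per step, the symmetric positive definite weight matrices, and the fixed step size are all hypothetical choices. At each step one edge (i, j) is drawn, and both agents move toward each other along the direction shaped by the matrix weight of that edge.

```python
import numpy as np


def gossip_matrix_consensus(x0, edges, weights, step, n_iter, seed=0):
    """Toy randomized gossip with matrix-weighted pairwise updates.

    x0      : initial states, shape (n_agents, d)
    edges   : list of (i, j) index pairs
    weights : list of (d, d) symmetric positive definite matrices, one per edge
    step    : step size; step * (largest eigenvalue of any weight) should be < 1
    """
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)  # copy so the caller's data is untouched
    for _ in range(n_iter):
        k = rng.integers(len(edges))       # wake up one edge uniformly at random
        i, j = edges[k]
        delta = step * weights[k] @ (x[j] - x[i])
        x[i] += delta                      # agent i moves toward agent j
        x[j] -= delta                      # agent j moves toward agent i
    return x


if __name__ == "__main__":
    x0 = [[0.0, 0.0], [1.0, 2.0], [4.0, -1.0], [3.0, 3.0]]
    edges = [(0, 1), (1, 2), (2, 3)]       # path graph on four agents
    A = np.array([[1.0, 0.0], [0.0, 0.5]])
    x = gossip_matrix_consensus(x0, edges, [A] * len(edges), step=0.4, n_iter=2000)
    print(x)
```

Because the two updates in each step are equal and opposite, the sum of all states is preserved, so with positive definite weights on a connected graph the agents settle on the average of the initial states. Positive semidefinite weights, which the matrix-weighted setting generally allows, need extra conditions that this sketch does not cover.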
-
3D Facial Imperfection Regeneration: Deep learning approach and 3D printing prototypes
Authors:
Phuong D. Nguyen,
Thinh D. Le,
Duong Q. Nguyen,
Thanh Q. Nguyen,
Li-Wei Chou,
H. Nguyen-Xuan
Abstract:
This study explores the potential of a fully convolutional mesh autoencoder model for regenerating 3D natural faces in the presence of imperfect areas. We use deep learning approaches to graph processing and analysis to investigate the model's capability to recreate a filling part for facial scars. Our approach to dataset creation can generate a facial scar rationally in a virtual space that corresponds to the unique circumstances. In particular, we propose a new method, named 3D Facial Imperfection Regeneration (3D-FaIR), for reproducing a complete face reconstruction based on the remaining features of the patient's face. To further enhance the applicability of the present research, we develop an improved outlier technique to separate the wounds of patients and provide appropriate wound cover models. In addition, the Cir3D-FaIR dataset of imperfect faces and open-source code were released at https://github.com/SIMOGroup/3DFaIR. Our findings demonstrate the potential of the proposed approach to help patients recover more quickly and safely through convenient techniques. We hope that this research can contribute to the development of new products and innovative solutions for facial scar regeneration.
Submitted 25 March, 2023;
originally announced March 2023.
-
Intelligent problem-solving as integrated hierarchical reinforcement learning
Authors:
Manfred Eppe,
Christian Gumbsch,
Matthias Kerzel,
Phuong D. H. Nguyen,
Martin V. Butz,
Stefan Wermter
Abstract:
According to cognitive psychology and related disciplines, the development of complex problem-solving behaviour in biological agents depends on hierarchical cognitive mechanisms. Hierarchical reinforcement learning is a promising computational approach that may eventually yield comparable problem-solving behaviour in artificial agents and robots. However, to date the problem-solving abilities of many human and non-human animals are clearly superior to those of artificial systems. Here, we propose steps to integrate biologically inspired hierarchical mechanisms to enable advanced problem-solving skills in artificial agents. Therefore, we first review the literature in cognitive psychology to highlight the importance of compositional abstraction and predictive processing. Then we relate the gained insights with contemporary hierarchical reinforcement learning methods. Interestingly, our results suggest that all identified cognitive mechanisms have been implemented individually in isolated computational architectures, raising the question of why there exists no single unifying architecture that integrates them. As our final contribution, we address this question by providing an integrative perspective on the computational challenges to develop such a unifying architecture. We expect our results to guide the development of more sophisticated cognitively inspired hierarchical machine learning architectures.
Submitted 18 August, 2022;
originally announced August 2022.
-
Long-term quantification and characterisation of wind farm noise amplitude modulation
Authors:
Phuc D. Nguyen,
Kristy L. Hansen,
Peter Catcheside,
Colin Hansen,
Branko Zajamsek
Abstract:
The large-scale expansion of wind farms has prompted community debate regarding adverse impacts of wind farm noise (WFN). One of the most annoying and potentially sleep-disturbing components of WFN is amplitude modulation (AM). Here we quantified and characterised AM over one year using acoustical and meteorological data measured at three locations near three wind farms. We found that the diurnal variation of outdoor AM prevalence was substantial: the nighttime prevalence was approximately 2 to 5 times higher than the daytime prevalence. On average, indoor AM occurred during the nighttime from 1.1 to 1.7 times less often than outdoor AM, but the indoor AM depth was higher than that measured outdoors. We observed an association between AM prevalence and sunset and sunrise. AM occurred more often in downwind and crosswind conditions. These findings provide important insights into long-term WFN characteristics that will help to inform future WFN assessment guidelines.
Submitted 29 May, 2022;
originally announced June 2022.
-
Multi-input model uncertainty analysis for long-range wind farm noise predictions
Authors:
Phuc D. Nguyen,
Kristy L. Hansen,
Branko Zajamsek,
Peter Catcheside,
Colin H. Hansen
Abstract:
Major sources of uncertainty in predictions of wind farm noise (WFN) include parametric and model structure uncertainty. The model structure uncertainty is a systematic uncertainty, which relates to uncertainty about the appropriate mathematical structure of the models. Here we quantified the model structure uncertainty in predicting WFN arising from multi-input models, including nine ground impedance and four wind speed profile models. We used a numerical ray tracing sound propagation model for predicting the noise level at different receivers. We found that variations between different ground impedance models and wind speed profile models were significant sources of uncertainty, and that these sources contributed to predicted noise level differences in excess of 10 dBA at distances greater than 3.5 km. We also found that differences between atmospheric vertical wind speed profile models were the main source of uncertainty in predicting WFN at long range. When predicting WFN, it is important to acknowledge variability associated with different models, as this contributes to the uncertainty of the predicted values.
Submitted 26 May, 2022;
originally announced May 2022.
-
Region-level Active Detector Learning
Authors:
Michael Laielli,
Giscard Biamby,
Dian Chen,
Ritwik Gupta,
Adam Loeffler,
Phat Dat Nguyen,
Ross Luo,
Trevor Darrell,
Sayna Ebrahimi
Abstract:
Active learning for object detection is conventionally achieved by applying techniques developed for classification in a way that aggregates individual detections into image-level selection criteria. This is typically coupled with the costly assumption that every image selected for labelling must be exhaustively annotated. This yields incremental improvements on well-curated vision datasets and struggles in the presence of data imbalance and visual clutter that occurs in real-world imagery. Alternatives to the image-level approach are surprisingly under-explored in the literature. In this work, we introduce a new strategy that subsumes previous Image-level and Object-level approaches into a generalized, Region-level approach that promotes spatial-diversity by avoiding nearby redundant queries from the same image and minimizes context-switching for the labeler. We show that this approach significantly decreases labeling effort and improves rare object search on realistic data with inherent class-imbalance and cluttered scenes.
Submitted 17 January, 2022; v1 submitted 20 August, 2021;
originally announced August 2021.
-
Hierarchical principles of embodied reinforcement learning: A review
Authors:
Manfred Eppe,
Christian Gumbsch,
Matthias Kerzel,
Phuong D. H. Nguyen,
Martin V. Butz,
Stefan Wermter
Abstract:
Cognitive Psychology and related disciplines have identified several critical mechanisms that enable intelligent biological agents to learn to solve complex problems. There exists pressing evidence that the cognitive mechanisms that enable problem-solving skills in these species build on hierarchical mental representations. Among the most promising computational approaches to provide comparable learning-based problem-solving abilities for artificial agents and robots is hierarchical reinforcement learning. However, so far the existing computational approaches have not been able to equip artificial agents with problem-solving abilities that are comparable to intelligent animals, including human and non-human primates, crows, or octopuses. Here, we first survey the literature in Cognitive Psychology, and related disciplines, and find that many important mental mechanisms involve compositional abstraction, curiosity, and forward models. We then relate these insights with contemporary hierarchical reinforcement learning methods, and identify the key machine intelligence approaches that realise these mechanisms. As our main result, we show that all important cognitive mechanisms have been implemented independently in isolated computational architectures, and there is simply a lack of approaches that integrate them appropriately. We expect our results to guide the development of more sophisticated cognitively inspired hierarchical methods, so that future artificial agents achieve a problem-solving performance on the level of intelligent animals.
Submitted 18 August, 2022; v1 submitted 18 December, 2020;
originally announced December 2020.
-
Sensorimotor representation learning for an "active self" in robots: A model survey
Authors:
Phuong D. H. Nguyen,
Yasmin Kim Georgie,
Ezgi Kayhan,
Manfred Eppe,
Verena Vanessa Hafner,
Stefan Wermter
Abstract:
Safe human-robot interactions require robots to be able to learn how to behave appropriately in spaces populated by people and thus to cope with the challenges posed by our dynamic and unstructured environment, rather than being provided a rigid set of rules for operations. In humans, these capabilities are thought to be related to our ability to perceive our body in space, sensing the location of our limbs during movement, being aware of other objects and agents, and controlling our body parts to interact with them intentionally. Toward the next generation of robots with bio-inspired capacities, in this paper, we first review the developmental processes of underlying mechanisms of these abilities: the sensory representations of body schema, peripersonal space, and the active self in humans. Second, we provide a survey of robotics models of these sensory representations and robotics models of the self, and we compare these models with the human counterparts. Finally, we analyse what is missing from these robotics models and propose a theoretical computational framework, which aims to allow the emergence of the sense of self in artificial agents by developing sensory representations through self-exploration.
Submitted 12 January, 2021; v1 submitted 25 November, 2020;
originally announced November 2020.
-
Robotic self-representation improves manipulation skills and transfer learning
Authors:
Phuong D. H. Nguyen,
Manfred Eppe,
Stefan Wermter
Abstract:
Cognitive science suggests that the self-representation is critical for learning and problem-solving. However, there is a lack of computational methods that relate this claim to cognitively plausible robots and reinforcement learning. In this paper, we bridge this gap by developing a model that learns bidirectional action-effect associations to encode the representations of body schema and the peripersonal space from multisensory information, which is named multimodal BidAL. Through three different robotic experiments, we demonstrate that this approach significantly stabilizes the learning-based problem-solving under noisy conditions and that it improves transfer learning of robotic manipulation skills.
Submitted 13 November, 2020;
originally announced November 2020.
-
Reinforcement Learning with Time-dependent Goals for Robotic Musicians
Authors:
Thilo Fryen,
Manfred Eppe,
Phuong D. H. Nguyen,
Timo Gerkmann,
Stefan Wermter
Abstract:
Reinforcement learning is a promising method to accomplish robotic control tasks. The task of playing musical instruments is, however, largely unexplored because it involves the challenge of achieving sequential goals - melodies - that have a temporal dimension. In this paper, we address robotic musicianship by introducing a temporal extension to goal-conditioned reinforcement learning: Time-dependent goals. We demonstrate that these can be used to train a robotic musician to play the theremin instrument. We train the robotic agent in simulation and transfer the acquired policy to a real-world robotic thereminist. Supplemental video: https://youtu.be/jvC9mPzdQN4
Submitted 11 November, 2020;
originally announced November 2020.
-
Interpretation of smartphone-captured radiographs utilizing a deep learning-based approach
Authors:
Hieu X. Le,
Phuong D. Nguyen,
Thang H. Nguyen,
Khanh N. Q. Le,
Thanh T. Nguyen
Abstract:
Computer-aided diagnostic systems (CADs) that can automatically and effectively interpret medical images have recently attracted growing academic attention. For radiographs, several deep learning-based systems or models have been developed to study multi-label disease recognition tasks. However, none of them have been trained to work on smartphone-captured chest radiographs. In this study, we propose a system that comprises a sequence of deep learning-based neural networks trained on the newly released CheXphoto dataset to tackle this issue. The proposed approach achieved promising results of 0.684 in AUC and 0.699 in average F1 score. To the best of our knowledge, this is the first published study shown to be capable of processing smartphone-captured radiographs.
Submitted 13 September, 2020;
originally announced September 2020.
-
A novel approach to remove foreign objects from chest X-ray images
Authors:
Hieu X. Le,
Phuong D. Nguyen,
Thang H. Nguyen,
Khanh N. Q. Le,
Thanh T. Nguyen
Abstract:
We initially proposed a deep learning approach for inpainting foreign objects in smartphone-captured chest radiographs using the CheXphoto dataset. Foreign objects, which can significantly affect the quality of a computer-aided diagnostic prediction, are captured under various settings. In this paper, we use a multi-stage method to tackle both the removal of foreign objects and the inpainting of chest radiographs. First, an object detection model is trained to separate the foreign objects from the given image. Subsequently, the binary mask of each object is extracted using a segmentation model. Each pair of binary mask and extracted object is then used for inpainting. Finally, the inpainted regions are merged back into the original image, resulting in a clean output free of foreign objects. To conclude, we achieved state-of-the-art accuracy. The experimental results point to possible new applications of this method for detection in chest X-ray images.
Submitted 15 August, 2020;
originally announced August 2020.
-
Curious Hierarchical Actor-Critic Reinforcement Learning
Authors:
Frank Röder,
Manfred Eppe,
Phuong D. H. Nguyen,
Stefan Wermter
Abstract:
Hierarchical abstraction and curiosity-driven exploration are two common paradigms in current reinforcement learning approaches to break down difficult problems into a sequence of simpler ones and to overcome reward sparsity. However, there is a lack of approaches that combine these paradigms, and it is currently unknown whether curiosity also helps to perform the hierarchical abstraction. As a novelty and scientific contribution, we tackle this issue and develop a method that combines hierarchical reinforcement learning with curiosity. Herein, we extend a contemporary hierarchical actor-critic approach with a forward model to develop a hierarchical notion of curiosity. We demonstrate in several continuous-space environments that curiosity can more than double the learning performance and success rates for most of the investigated benchmarking problems. We also provide our source code and a supplementary video.
Submitted 17 August, 2020; v1 submitted 7 May, 2020;
originally announced May 2020.
-
From semantics to execution: Integrating action planning with reinforcement learning for robotic causal problem-solving
Authors:
Manfred Eppe,
Phuong D. H. Nguyen,
Stefan Wermter
Abstract:
Reinforcement learning is an appropriate and successful method to robustly perform low-level robot control under noisy conditions. Symbolic action planning is useful to resolve causal dependencies and to break a causally complex problem down into a sequence of simpler high-level actions. A problem with the integration of both approaches is that action planning is based on discrete high-level action- and state spaces, whereas reinforcement learning is usually driven by a continuous reward function. However, recent advances in reinforcement learning, specifically, universal value function approximators and hindsight experience replay, have focused on goal-independent methods based on sparse rewards. In this article, we build on these novel methods to facilitate the integration of action planning with reinforcement learning by exploiting the reward-sparsity as a bridge between the high-level and low-level state- and control spaces. As a result, we demonstrate that the integrated neuro-symbolic method is able to solve object manipulation problems that involve tool use and non-trivial causal dependencies under noisy conditions, exploiting both data and knowledge.
Submitted 8 December, 2019; v1 submitted 23 May, 2019;
originally announced May 2019.