Search | arXiv e-print repository

Closed-Loop Binary Media-Based Modulation

Authors: Majid Nasiri Khormuji, Branislav M. Popovic

Abstract: This paper presents analytical results for Binary Media-Based Modulation (B-MBM) over fading channels for single-antenna receivers. It illustrates that open-loop B-MBM, in the absence of feedback, achieves a diversity order of only one. However, with feedback and optimal weight selection in closed-loop configurations, a diversity order of two becomes achievable. Notably, closed-loop B-MBM with analytically computed optimal weights performs equivalently to Alamouti-coded BPSK transmission, demonstrating feasibility even with just one radio frequency chain when feedback is available.

Submitted 3 July, 2024; originally announced July 2024.

Comments: 5 pages, 4 figures, 1 table

Journal ref: EuCNC and 6G Summit 2024

arXiv:2406.11580 [pdf, other]

Error Span Annotation: A Balanced Approach for Human Evaluation of Machine Translation

Authors: Tom Kocmi, Vilém Zouhar, Eleftherios Avramidis, Roman Grundkiewicz, Marzena Karpinska, Maja Popović, Mrinmaya Sachan, Mariya Shmatova

Abstract: High-quality Machine Translation (MT) evaluation relies heavily on human judgments. Comprehensive error classification methods, such as Multidimensional Quality Metrics (MQM), are expensive as they are time-consuming and can only be done by experts, whose availability may be limited, especially for low-resource languages. On the other hand, just assigning overall scores, like Direct Assessment (DA), is simpler and faster and can be done by translators of any level, but is less reliable. In this paper, we introduce Error Span Annotation (ESA), a human evaluation protocol which combines the continuous rating of DA with the high-level error severity span marking of MQM. We validate ESA by comparing it to MQM and DA for 12 MT systems and one human reference translation (English to German) from WMT23. The results show that ESA offers faster and cheaper annotations than MQM at the same quality level, without the requirement of expensive MQM experts.

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2405.15015 [pdf, ps, other]

Fast Transaction Scheduling in Blockchain Sharding

Authors: Ramesh Adhikari, Costas Busch, Miroslav Popovic

Abstract: Sharding is a promising technique for addressing the scalability issues of blockchain. It divides the $n$ participating nodes into $s$ disjoint groups called shards, where each shard processes transactions in parallel. We investigate scheduling algorithms for the blockchain sharding systems, where each transaction resides in a shard of the communication graph and attempts to access accounts at possibly remote shards. We examine batch scheduling problems on the shard graph $G_s$, where given a set of transactions, we aim to find efficient schedules to execute them as fast as possible. First, we present a centralized scheduler where one of the shards has global knowledge of transactions to be processed. For general graphs, where the transaction and its accessing objects are arbitrarily far from each other with a maximum distance $d$, the centralized scheduler provides $O(kd)$ approximation to the optimal schedule, where $k$ is the maximum number of shards each transaction accesses. Consequently, for a Clique graph where shards are at a unit distance from each other, we obtain $O(k)$ approximation to the optimal schedule. We also get $O(k \log s)$ approximation for Hypercube, Butterfly, and $g$-dimensional Grid, where $g=O(\log s)$. Next, we provide a centralized scheduler with a bucketing approach that offers improved bounds for special cases. Finally, we provide a distributed scheduler where shards do not require global transaction information. We achieve this by using a hierarchical clustering of the shards and using the centralized scheduler in each cluster. We show that the distributed scheduler has a competitive ratio of $O(\mathcal{A}_{\mathcal{CS}} \log^2 s)$, where $\mathcal{A}_{\mathcal{CS}}$ is the approximation ratio of the centralized scheduler. To our knowledge, we are the first to give provably fast transaction scheduling algorithms for blockchain sharding systems.

Submitted 23 May, 2024; originally announced May 2024.

Comments: 12 pages

arXiv:2405.09423 [pdf]

MicroPython Testbed for Federated Learning Algorithms

Authors: Miroslav Popovic, Marko Popovic, Ivan Kastelan, Miodrag Djukic, Ilija Basicevic

Abstract: Recently, the Python Testbed for Federated Learning Algorithms emerged as a low-code framework, amenable to generative large language models, for developing decentralized and distributed applications, primarily targeting edge systems, by nonprofessional programmers with the help of emerging artificial intelligence tools. This light framework is written in pure Python to be easy to install and to fit into a small IoT memory. It supports formally verified generic centralized and decentralized federated learning algorithms, as well as the peer-to-peer data exchange used in time division multiplexing communication, and its current main limitation is that all the application instances can run only on a single PC. This paper presents the MicroPython Testbed for Federated Learning Algorithms, the new framework that overcomes its predecessor's limitation such that individual application instances may run on different network nodes like PCs and IoTs, primarily in edge systems. The new framework carries on the pure Python ideal, is based on asynchronous I/O abstractions, and runs on MicroPython, and therefore is a great match for IoTs and devices in edge systems. The new framework was experimentally validated on a wireless network comprising PCs and Raspberry Pi Pico W boards, by using application examples originally developed for the predecessor framework.

Submitted 15 May, 2024; originally announced May 2024.

Comments: 20 pages, 6 figures, 12 tables, the extended paper preprint

arXiv:2404.06940 [pdf, other]

Robotic Learning for Adaptive Informative Path Planning

Authors: Marija Popovic, Joshua Ott, Julius Rückin, Mykel J. Kochenderfer

Abstract: Adaptive informative path planning (AIPP) is important to many robotics applications, enabling mobile robots to efficiently collect useful data about initially unknown environments. In addition, learning-based methods are increasingly used in robotics to enhance adaptability, versatility, and robustness across diverse and complex tasks. Our survey explores research on applying robotic learning to AIPP, bridging the gap between these two research fields. We begin by providing a unified mathematical framework for general AIPP problems. Next, we establish two complementary taxonomies of current work from the perspectives of (i) learning algorithms and (ii) robotic applications. We explore synergies, recent trends, and highlight the benefits of learning-based methods in AIPP frameworks. Finally, we discuss key challenges and promising future directions to enable more generally applicable and robust robotic data-gathering systems through learning. We provide a comprehensive catalogue of papers reviewed in our survey, including publicly available repositories, to facilitate future studies in the field.

Submitted 15 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

Comments: 22 pages, 1 figure

arXiv:2403.16803 [pdf, other]

Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning

Authors: Sicong Pan, Liren Jin, Xuying Huang, Cyrill Stachniss, Marija Popović, Maren Bennewitz

Abstract: Object reconstruction is relevant for many autonomous robotic tasks that require interaction with the environment. A key challenge in such scenarios is planning view configurations to collect informative measurements for reconstructing an initially unknown object. One-shot view planning enables efficient data collection by predicting view configurations and planning the globally shortest path connecting all views at once. However, geometric priors about the object are required to conduct one-shot view planning. In this work, we propose a novel one-shot view planning approach that utilizes the powerful 3D generation capabilities of diffusion models as priors. By incorporating such geometric priors into our pipeline, we achieve effective one-shot view planning starting with only a single RGB image of the object to be reconstructed. Our planning experiments in simulation and real-world setups indicate that our approach balances well between object reconstruction quality and movement cost.

Submitted 25 March, 2024; originally announced March 2024.

Comments: Sicong Pan and Liren Jin have equal contribution. Submitted to IROS 2024

arXiv:2403.11233 [pdf, other]

STAIR: Semantic-Targeted Active Implicit Reconstruction

Authors: Liren Jin, Haofei Kuang, Yue Pan, Cyrill Stachniss, Marija Popović

Abstract: Many autonomous robotic applications require object-level understanding when deployed. Actively reconstructing objects of interest, i.e. objects with specific semantic meanings, is therefore relevant for a robot to perform downstream tasks in an initially unknown environment. In this work, we propose a novel framework for semantic-targeted active reconstruction using posed RGB-D measurements and 2D semantic labels as input. The key components of our framework are a semantic implicit neural representation and a compatible planning utility function based on semantic rendering and uncertainty estimation, enabling adaptive view planning to target objects of interest. Our planning approach achieves better reconstruction performance in terms of mesh and novel view rendering quality compared to implicit reconstruction baselines that do not consider semantics for view planning. Our framework further outperforms a state-of-the-art semantic-targeted active reconstruction pipeline based on explicit maps, justifying our choice of utilising implicit neural representations to tackle semantic-targeted active reconstruction problems.

Submitted 17 March, 2024; originally announced March 2024.

arXiv:2402.04894 [pdf, other]

Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning

Authors: Apoorva Vashisth, Julius Rückin, Federico Magistri, Cyrill Stachniss, Marija Popović

Abstract: Autonomous robots are often employed for data collection due to their efficiency and low labour costs. A key task in robotic data acquisition is planning paths through an initially unknown environment to collect observations given platform-specific resource constraints, such as limited battery life. Adaptive online path planning in 3D environments is challenging due to the large set of valid actions and the presence of unknown occlusions. To address these issues, we propose a novel deep reinforcement learning approach for adaptively replanning robot paths to map targets of interest in unknown 3D environments. A key aspect of our approach is a dynamically constructed graph that restricts planning actions local to the robot, allowing us to quickly react to newly discovered obstacles and targets of interest. For replanning, we propose a new reward function that balances between exploring the unknown environment and exploiting online-collected data about the targets of interest. Our experiments show that our method enables more efficient target detection compared to state-of-the-art learning and non-learning baselines. We also show the applicability of our approach for orchard monitoring using an unmanned aerial vehicle in a photorealistic simulator.

Submitted 7 February, 2024; originally announced February 2024.

Comments: 8 pages, 6 figures

arXiv:2312.04412 [pdf]

doi 10.1109/TELFOR59449.2023.10372714

Developing Elementary Federated Learning Algorithms Leveraging the ChatGPT

Authors: Miroslav Popovic, Marko Popovic, Ivan Kastelan, Miodrag Djukic, Ilija Basicevic

Abstract: The Python Testbed for Federated Learning Algorithms is a simple Python FL framework that is easy to use by ML&AI developers who do not need to be professional programmers, and this paper shows that it is also amenable to emerging AI tools. In this paper, we successfully developed three elementary FL algorithms using the following three-step process: (i) specify context, (ii) ask ChatGPT to complete server and clients' callback functions, and (iii) verify the generated code.

Submitted 8 January, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

Comments: 4 pages, 6 tables, submitted to TELFOR 2023, Published by IEEE Xplore

arXiv:2312.04402 [pdf, other]

Semi-Supervised Active Learning for Semantic Segmentation in Unknown Environments Using Informative Path Planning

Authors: Julius Rückin, Federico Magistri, Cyrill Stachniss, Marija Popović

Abstract: Semantic segmentation enables robots to perceive and reason about their environments beyond geometry. Most of such systems build upon deep learning approaches. As autonomous robots are commonly deployed in initially unknown environments, pre-training on static datasets cannot always capture the variety of domains and limits the robot's perception performance during missions. Recently, self-supervised and fully supervised active learning methods emerged to improve a robot's vision. These approaches rely on large in-domain pre-training datasets or require substantial human labelling effort. We propose a planning method for semi-supervised active learning of semantic segmentation that substantially reduces human labelling requirements compared to fully supervised approaches. We leverage an adaptive map-based planner guided towards the frontiers of unexplored space with high model uncertainty collecting training data for human labelling. A key aspect of our approach is to combine the sparse high-quality human labels with pseudo labels automatically extracted from highly certain environment map areas. Experimental results show that our method reaches segmentation performance close to fully supervised approaches with drastically reduced human labelling effort while outperforming self-supervised approaches.

Submitted 26 January, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

Comments: 8 pages, 9 figures

arXiv:2311.16312 [pdf]

doi 10.1109/icdmw58026.2022.00041

Domain-Specific Deep Learning Feature Extractor for Diabetic Foot Ulcer Detection

Authors: Reza Basiri, Milos R. Popovic, Shehroz S. Khan

Abstract: Diabetic Foot Ulcer (DFU) is a condition requiring constant monitoring and evaluations for treatment. The DFU patient population is on the rise and will soon outpace the available health resources. Autonomous monitoring and evaluation of DFU wounds is a much-needed area in health care. In this paper, we evaluate and identify the most accurate feature extractor that is the core basis for developing a deep-learning wound detection network. For the evaluation, we used mAP and F1-score on the publicly available DFU2020 dataset. A combination of UNet and EfficientNetb3 feature extractor resulted in the best evaluation among the 14 networks compared. UNet and EfficientNetb3 can be used as the classifier in the development of a comprehensive DFU domain-specific autonomous wound detection pipeline.

Submitted 27 November, 2023; originally announced November 2023.

Comments: 5 pages, 2 figures, 3 tables, 2022 IEEE International Conference on Data Mining Workshops

Journal ref: 2022 IEEE International Conference on Data Mining Workshops. pp. 1-5

arXiv:2311.05015 [pdf, ps, other]

doi 10.1109/TWC.2023.3328494

Beamforming Performances of Holographic Surfaces

Authors: Peng Wang, Majid Nasiri Khormuji, Branislav M. Popovic

Abstract: In this paper, we investigate the beamforming performances of holographic surfaces implemented as lossless antenna arrays with less than half-wavelength spacing. We first develop a method to quantify the mutual coupling effect among the antennas in an array. The developed coupling model is general and applicable to arrays with arbitrary distribution of any type of antennas with arbitrary structure, physical size and radiation power pattern. In particular, it reduces to a neat analytical expression for arbitrarily deployed isotropic antenna arrays. We then discuss the beamforming design for holographic surfaces, and in particular provide analytical beamforming characterizations for arrays with two arbitrarily spaced isotropic antennas. Numerical results indicate that, by accounting for the mutual coupling effect between antennas, the array densification by packing more antennas in a given surface aperture can significantly enhance both the beamforming gain and spatial resolution of the system. The beamforming gain enhancement and beamwidth reduction can be several dBs higher than, and more than half of, those achieved by the conventional half-wavelength spaced antenna arrays in the same surface aperture. The gains of densification become saturated when the antenna spacing is below a critical value, and the saturated gain reduces as the surface aperture increases.

Submitted 8 November, 2023; originally announced November 2023.

Comments: 16 pages, 26 figures

Journal ref: IEEE Transactions on Wireless Communications, 2023

arXiv:2310.20140 [pdf]

Synthesizing Diabetic Foot Ulcer Images with Diffusion Model

Authors: Reza Basiri, Karim Manji, Francois Harton, Alisha Poonja, Milos R. Popovic, Shehroz S. Khan

Abstract: Diabetic Foot Ulcer (DFU) is a serious skin wound requiring specialized care. However, real DFU datasets are limited, hindering clinical training and research activities. In recent years, generative adversarial networks and diffusion models have emerged as powerful tools for generating synthetic images with remarkable realism and diversity in many applications. This paper explores the potential of diffusion models for synthesizing DFU images and evaluates their authenticity through expert clinician assessments. Additionally, evaluation metrics such as Frechet Inception Distance (FID) and Kernel Inception Distance (KID) are examined to assess the quality of the synthetic DFU images. A dataset of 2,000 DFU images is used for training the diffusion model, and the synthetic images are generated by applying diffusion processes. The results indicate that the diffusion model successfully synthesizes visually indistinguishable DFU images. 70% of the time, clinicians marked synthetic DFU images as real DFUs. However, clinicians demonstrate higher unanimous confidence in rating real images than synthetic ones. The study also reveals that FID and KID metrics do not significantly align with clinicians' assessments, suggesting alternative evaluation approaches are needed. The findings highlight the potential of diffusion models for generating synthetic DFU images and their impact on medical training programs and research in wound detection and classification.

Submitted 30 October, 2023; originally announced October 2023.

Comments: 8 pages, 3 figures, 6th Workshop on AI for Aging, Rehabilitation and Intelligent Assisted Living at European Conference on Machine Learning, Italy, 2023

arXiv:2310.05102 [pdf]

doi 10.1007/978-3-031-49252-5_4

A Federated Learning Algorithms Development Paradigm

Authors: Miroslav Popovic, Marko Popovic, Ivan Kastelan, Miodrag Djukic, Ilija Basicevic

Abstract: At present many distributed and decentralized frameworks for federated learning algorithms are already available. However, development of such a framework targeting smart Internet of Things in edge systems is still an open challenge. A solution to that challenge named Python Testbed for Federated Learning Algorithms (PTB-FLA) appeared recently. This solution is written in pure Python, it supports both centralized and decentralized algorithms, and its usage was validated and illustrated by three simple algorithm examples. In this paper, we present the federated learning algorithms development paradigm based on PTB-FLA. The paradigm comprises the four phases named by the code they produce: (1) the sequential code, (2) the federated sequential code, (3) the federated sequential code with callbacks, and (4) the PTB-FLA code. The development paradigm is validated and illustrated in the case study on logistic regression, where both centralized and decentralized algorithms are developed.

Submitted 3 December, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

Comments: 19 pages, 3 figures, 5 algorithms, submitted to ECBS 2023

Journal ref: Springer, LNCS 14390, 2024

arXiv:2310.00685 [pdf, other]

Active Implicit Reconstruction Using One-Shot View Planning

Authors: Hao Hu, Sicong Pan, Liren Jin, Marija Popović, Maren Bennewitz

Abstract: Active object reconstruction using autonomous robots is gaining great interest. A primary goal in this task is to maximize the information of the object to be reconstructed, given limited on-board resources. Previous view planning methods exhibit inefficiency since they rely on an iterative paradigm based on explicit representations, consisting of (1) planning a path to the next-best view only; and (2) requiring a considerable number of less-gain views in terms of surface coverage. To address these limitations, we propose to integrate implicit representations into the One-Shot View Planning (OSVP). The key idea behind our approach is to use implicit representations to obtain the small missing surface areas instead of observing them with extra views. Therefore, we design a deep neural network, named OSVP, to directly predict a set of views given a dense point cloud refined from an initial sparse observation. To train our OSVP network, we generate supervision labels using dense point clouds refined by implicit representations and set covering optimization problems. Simulated experiments show that our method achieves sufficient reconstruction quality, outperforming several baselines under limited view and movement budgets. We further demonstrate the applicability of our approach in a real-world object reconstruction scenario.

Submitted 13 February, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

Comments: Hao Hu and Sicong Pan have equal contribution. Publication to appear in IEEE International Conference on Robotics and Automation (ICRA), 2024

arXiv:2310.00684 [pdf, other]

How Many Views Are Needed to Reconstruct an Unknown Object Using NeRF?

Authors: Sicong Pan, Liren Jin, Hao Hu, Marija Popović, Maren Bennewitz

Abstract: Neural Radiance Fields (NeRFs) are gaining significant interest for online active object reconstruction due to their exceptional memory efficiency and requirement for only posed RGB inputs. Previous NeRF-based view planning methods exhibit computational inefficiency since they rely on an iterative paradigm, consisting of (1) retraining the NeRF when new images arrive; and (2) planning a path to the next best view only. To address these limitations, we propose a non-iterative pipeline based on the Prediction of the Required number of Views (PRV). The key idea behind our approach is that the required number of views to reconstruct an object depends on its complexity. Therefore, we design a deep neural network, named PRVNet, to predict the required number of views, allowing us to tailor the data acquisition based on the object complexity and plan a globally shortest path. To train our PRVNet, we generate supervision labels using the ShapeNet dataset. Simulated experiments show that our PRV-based view planning method outperforms baselines, achieving good reconstruction quality while significantly reducing movement cost and planning time. We further justify the generalization ability of our approach in a real-world experiment.

Submitted 13 February, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

Comments: Sicong Pan and Liren Jin have equal contribution. Publication to appear in IEEE International Conference on Robotics and Automation (ICRA), 2024

arXiv:2309.10620 [pdf, other]

Perceptual Factors for Environmental Modeling in Robotic Active Perception

Authors: David Morilla-Cabello, Jonas Westheider, Marija Popovic, Eduardo Montijano

Abstract: Accurately assessing the potential value of new sensor observations is a critical aspect of planning for active perception. This task is particularly challenging when reasoning about high-level scene understanding using measurements from vision-based neural networks. Due to appearance-based reasoning, the measurements are susceptible to several environmental effects such as the presence of occluders, variations in lighting conditions, and redundancy of information due to similarity in appearance between nearby viewpoints. To address this, we propose a new active perception framework incorporating an arbitrary number of perceptual effects in planning and fusion. Our method models the correlation with the environment by a set of general functions termed perceptual factors to construct a perceptual map, which quantifies the aggregated influence of the environment on candidate viewpoints. This information is seamlessly incorporated into the planning and fusion processes by adjusting the uncertainty associated with measurements to weigh their contributions. We evaluate our perceptual maps in a simulated environment that reproduces environmental conditions common in robotics applications. Our results show that, by accounting for environmental effects within our perceptual maps, we improve the state estimation by correctly selecting the viewpoints and correctly considering the measurement noise when affected by environmental factors. We furthermore deploy our approach on a ground robot to showcase its applicability for real-world active perception missions.

Submitted 10 October, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

Comments: 7 pages, 9 figures, under review for IEEE ICRA 2023

arXiv:2306.15945 [pdf, ps, other]

Permutation Polynomial Interleaved Zadoff-Chu Sequences

Authors: Fredrik Berggren, Branislav M. Popovic

Abstract: Constant amplitude zero autocorrelation (CAZAC) sequences have modulus one and an ideal periodic autocorrelation function. Such sequences are used in cellular radio communications systems, e.g., for reference signals, synchronization signals and random access preambles. We propose a new family of CAZAC sequences, which is constructed by interleaving a Zadoff-Chu sequence by a quadratic permutation polynomial (QPP), or by a permutation polynomial whose inverse is a QPP. It is demonstrated that a set of orthogonal interleaved Zadoff-Chu sequences can be constructed by proper choice of QPPs.

Submitted 26 April, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: Submitted to IEEE Transactions on Information Theory

arXiv:2306.14529 [pdf, other]

Correct orchestration of Federated Learning generic algorithms: formalisation and verification in CSP

Authors: Ivan Prokić, Silvia Ghilezan, Simona Kašterović, Miroslav Popovic, Marko Popovic, Ivan Kaštelan

Abstract: Federated learning (FL) is a machine learning setting where clients keep the training data decentralised and collaboratively train a model either under the coordination of a central server (centralised FL) or in a peer-to-peer network (decentralised FL). Correct orchestration is one of the main challenges. In this paper, we formally verify the correctness of two generic FL algorithms, a centralised and a decentralised one, using the CSP process calculus and the PAT model checker. The CSP models consist of CSP processes corresponding to generic FL algorithm instances. PAT automatically proves the correctness of the two generic FL algorithms by proving their deadlock freeness (safety property) and successful termination (liveness property). The CSP models are constructed bottom-up by hand as a faithful representation of the real Python code and are automatically checked top-down by PAT.

Submitted 26 June, 2023; originally announced June 2023.

Comments: arXiv admin note: text overlap with arXiv:2305.20027

arXiv:2305.20027 [pdf]

doi 10.1109/ZINC58345.2023.10173859

A Simple Python Testbed for Federated Learning Algorithms

Authors: Miroslav Popovic, Marko Popovic, Ivan Kastelan, Miodrag Djukic, Silvia Ghilezan

Abstract: Nowadays many researchers are developing various distributed and decentralized frameworks for federated learning algorithms. However, development of such a framework targeting smart Internet of Things in edge systems is still an open challenge. In this paper, we present our solution to that challenge called Python Testbed for Federated Learning Algorithms. The solution is written in pure Python, and it supports both centralized and decentralized algorithms. The usage of the presented solution is both validated and illustrated by three simple algorithm examples.

Submitted 18 July, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

Comments: 6 pages, 7 figures, 3 algorithms, as accepted at ZINC 2023, Published by IEEE Xplore

ACM Class: D.2.11

arXiv:2305.08380 [pdf]

doi 10.1145/3459960.3459962

PSTM Transaction Scheduler Verification Based on CSP and Testing

Authors: Miroslav Popovic, Marko Popovic, Branislav Kordic, Huibiao Zhu

Abstract: Many online transaction scheduler architectures and algorithms for various software transactional memories have been designed in order to maintain good system performance even for high concurrency workloads. Most of these algorithms were directly implemented in a target programming language, and experimentally evaluated, without theoretical proofs of correctness and analysis of their performance. Only a small number of these algorithms were modeled using formal methods, such as process algebra CSP, in order to verify that they satisfy properties such as deadlock-freeness and starvation-freeness. However, as this paper shows, using solely formal methods has its disadvantages, too. In this paper, we first analyze the previous CSP model of PSTM transaction scheduler by comparing the model checker PAT results with the manually derived expected results, for the given test workloads. Next, according to the results of this analysis, we correct and extend the CSP model. Finally, based on PAT results for the new CSP model, we analyze the performance of PSTM online transaction scheduling algorithms from the perspective of makespan, number of aborts, and throughput. Based on our findings, we may conclude that for the complete formal verification of trustworthy software, both formal verification and its testing must be jointly used.

Submitted 15 May, 2023; originally announced May 2023.

Comments: 18 pages, 5 figures, 5 tables, 4 algorithms

ACM Class: D.2.4

Journal ref: In Proceedings of 7th Conference on the Engineering of Computer Based Systems (ECBS 2021). ACM, New York, NY, USA, 9 pages. 2021

arXiv:2304.14922 [pdf, other]

Supervised and Unsupervised Deep Learning Approaches for EEG Seizure Prediction

Authors: Zakary Georgis-Yap, Milos R. Popovic, Shehroz S. Khan

Abstract: Epilepsy affects more than 50 million people worldwide, making it one of the world's most prevalent neurological diseases. The main symptom of epilepsy is seizures, which occur abruptly and can cause serious injury or death. The ability to predict the occurrence of an epileptic seizure could alleviate many risks and stresses people with epilepsy face. We formulate the problem of detecting preictal (or pre-seizure) EEG with reference to normal EEG as a precursor to incoming seizure. To this end, we developed several supervised deep learning approaches to identify preictal EEG from normal EEG. We further develop novel unsupervised deep learning approaches to train the models on only normal EEG and detect pre-seizure EEG as an anomalous event. These deep learning models were trained and evaluated on two large EEG seizure datasets in a person-specific manner. We found that both supervised and unsupervised approaches are feasible; however, their performance varies depending on the patient, approach and architecture. This new line of research has the potential to develop therapeutic interventions and save human lives.

Submitted 3 February, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

Comments: 16 figures, 9 tables

Journal ref: Journal of Health Informatics Research, 2024

arXiv:2303.03048 [pdf, other]

Graph-based View Motion Planning for Fruit Detection

Authors: Tobias Zaenker, Julius Rückin, Rohit Menon, Marija Popović, Maren Bennewitz

Abstract: Crop monitoring is crucial for maximizing agricultural productivity and efficiency. However, monitoring large and complex structures such as sweet pepper plants presents significant challenges, especially due to frequent occlusions of the fruits. Traditional next-best view planning can lead to unstructured and inefficient coverage of the crops. To address this, we propose a novel view motion planner that builds a graph network of viable view poses and trajectories between nearby poses, thereby considering robot motion constraints. The planner searches the graphs for view sequences with the highest accumulated information gain, allowing for efficient pepper plant monitoring while minimizing occlusions. The generated view poses aim at both sufficiently covering already detected and discovering new fruits. The graph and the corresponding best view pose sequence are computed with a limited horizon and are adaptively updated in fixed time intervals as the system gathers new information. We demonstrate the effectiveness of our approach through simulated and real-world experiments using a robotic arm equipped with an RGB-D camera and mounted on a trolley. As the experimental results show, our planner produces view pose sequences to systematically cover the crops and leads to increased fruit coverage when given a limited time in comparison to a state-of-the-art single next-best view planner.

Submitted 15 August, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

Comments: 7 pages, 10 figures, accepted at IROS 2023

arXiv:2303.01284 [pdf, other]

NeU-NBV: Next Best View Planning Using Uncertainty Estimation in Image-Based Neural Rendering

Authors: Liren Jin, Xieyuanli Chen, Julius Rückin, Marija Popović

Abstract: Autonomous robotic tasks require actively perceiving the environment to achieve application-specific goals. In this paper, we address the problem of positioning an RGB camera to collect the most informative images to represent an unknown scene, given a limited measurement budget. We propose a novel mapless planning framework to iteratively plan the next best camera view based on collected image measurements. A key aspect of our approach is a new technique for uncertainty estimation in image-based neural rendering, which guides measurement acquisition at the most uncertain view among view candidates, thus maximising the information value during data collection. By incrementally adding new measurements into our image collection, our approach efficiently explores an unknown scene in a mapless manner. We show that our uncertainty estimation is generalisable and valuable for view planning in unknown scenes. Our planning experiments using synthetic and real-world data verify that our uncertainty-guided approach finds informative images leading to more accurate scene representations when compared against baselines.

Submitted 23 July, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

Comments: Accepted to IEEE/RSJ International Conference on Robotics and Intelligent Systems (IROS) 2023

arXiv:2303.01150 [pdf, other]

Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning

Authors: Jonas Westheider, Julius Rückin, Marija Popović

Abstract: Efficient aerial data collection is important in many remote sensing applications. In large-scale monitoring scenarios, deploying a team of unmanned aerial vehicles (UAVs) offers improved spatial coverage and robustness against individual failures. However, a key challenge is cooperative path planning for the UAVs to efficiently achieve a joint mission goal. We propose a novel multi-agent informative path planning approach based on deep reinforcement learning for adaptive terrain monitoring scenarios using UAV teams. We introduce new network feature representations to effectively learn path planning in a 3D workspace. By leveraging a counterfactual baseline, our approach explicitly addresses credit assignment to learn cooperative behaviour. Our experimental evaluation shows improved planning performance, i.e. maps regions of interest more quickly, with respect to non-counterfactual variants. Results on synthetic and real-world data show that our approach has superior performance compared to state-of-the-art non-learning-based methods, while being transferable to varying team sizes and communication constraints.

Submitted 2 March, 2023; originally announced March 2023.

Comments: 8 pages, 8 figures, Submission to IEEE/RSJ International Conference on Robotics and Intelligent Systems

arXiv:2302.03347 [pdf, other]

An Informative Path Planning Framework for Active Learning in UAV-based Semantic Mapping

Authors: Julius Rückin, Federico Magistri, Cyrill Stachniss, Marija Popović

Abstract: Unmanned aerial vehicles (UAVs) are frequently used for aerial mapping and general monitoring tasks. Recent progress in deep learning enabled automated semantic segmentation of imagery to facilitate the interpretation of large-scale complex environments. Commonly used supervised deep learning for segmentation relies on large amounts of pixel-wise labelled data, which is tedious and costly to annotate. The domain-specific visual appearance of aerial environments often prevents the usage of models pre-trained on publicly available datasets. To address this, we propose a novel general planning framework for UAVs to autonomously acquire informative training images for model re-training. We leverage multiple acquisition functions and fuse them into probabilistic terrain maps. Our framework combines the mapped acquisition function information into the UAV's planning objectives. In this way, the UAV adaptively acquires informative aerial images to be manually labelled for model re-training. Experimental results on real-world data and in a photorealistic simulation show that our framework maximises model performance and drastically reduces labelling efforts. Our map-based planners outperform state-of-the-art local planning.

Submitted 6 September, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

Comments: 18 pages, 24 figures

arXiv:2207.12520 [pdf, other]

3D Lidar Reconstruction with Probabilistic Depth Completion for Robotic Navigation

Authors: Yifu Tao, Marija Popović, Yiduo Wang, Sundara Tejaswi Digumarti, Nived Chebrolu, Maurice Fallon

Abstract: Safe motion planning in robotics requires planning into space which has been verified to be free of obstacles. However, obtaining such environment representations using lidars is challenging by virtue of the sparsity of their depth measurements. We present a learning-aided 3D lidar reconstruction framework that upsamples sparse lidar depth measurements with the aid of overlapping camera images so as to generate denser reconstructions with more definitively free space than can be achieved with the raw lidar measurements alone. We use a neural network with an encoder-decoder structure to predict dense depth images along with depth uncertainty estimates which are fused using a volumetric mapping system. We conduct experiments on real-world outdoor datasets captured using a handheld sensing device and a legged robot. Using input data from a 16-beam lidar mapping a building network, our experiments showed that the amount of estimated free space was increased by more than 40% with our approach. We also show that our approach trained on a synthetic dataset generalises well to real-world outdoor scenes without additional fine-tuning. Finally, we demonstrate how motion planning tasks can benefit from these denser reconstructions.

Submitted 25 July, 2022; originally announced July 2022.

Comments: Accepted at IROS 2022; Video at https://www.youtube.com/watch?v=db3HetMq5h4

arXiv:2206.04528 [pdf, ps, other]

Joint radar and communications with multicarrier chirp-based waveform

Authors: Fredrik Berggren, Branislav M. Popovic

Abstract: We consider a multicarrier chirp-based waveform for joint radar and communication (JRC) systems and derive its time discrete periodic ambiguity function (AF). An advantage of the waveform is that it includes a set of waveform parameters (e.g., chirp rate) which together with the transmit sequence, can be selected to flexibly shape the AF to be thumbtack-like, or to be ridge-like, either along the delay axis or the Doppler axis. These shapes are applicable for different use cases, e.g., target detection or time and frequency synchronization. The results show that better signal detection performance than OFDM and DFT-s-OFDM can be achieved on channels with large Doppler frequency. Furthermore, it is shown how transmit sequences can be selected in order to achieve 0 dB peak-to-average-power-ratio (PAPR) of the waveform.

Submitted 2 September, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

Comments: 16 pages, 11 figures, submitted to IEEE Open Journal of the Communications Society

arXiv:2205.10947 [pdf, other]

Deep Direct Discriminative Decoders for High-dimensional Time-series Data Analysis

Authors: Mohammad R. Rezaei, Milos R. Popovic, Milad Lankarany, Ali Yousefi

Abstract: The state-space models (SSMs) are widely utilized in the analysis of time-series data. SSMs rely on an explicit definition of the state and observation processes. Characterizing these processes is not always easy and becomes a modeling challenge when the dimension of observed data grows or the observed data distribution deviates from the normal distribution. Here, we propose a new formulation of SSM for high-dimensional observation processes. We call this solution the deep direct discriminative decoder (D4). The D4 brings deep neural networks' expressiveness and scalability to the SSM formulation letting us build a novel solution that efficiently estimates the underlying state processes through high-dimensional observation signal. We demonstrate the D4 solutions in simulated and real data such as Lorenz attractors, Langevin dynamics, random walk dynamics, and rat hippocampus spiking neural data and show that the D4 performs better than traditional SSMs and RNNs. The D4 can be applied to a broader class of time-series data where the connection between high-dimensional observation and the underlying latent process is hard to characterize.

Submitted 3 July, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

arXiv:2204.05961 [pdf, other]

Quantified Reproducibility Assessment of NLP Results

Authors: Anya Belz, Maja Popović, Simon Mille

Abstract: This paper describes and tests a method for carrying out quantified reproducibility assessment (QRA) that is based on concepts and definitions from metrology. QRA produces a single score estimating the degree of reproducibility of a given system and evaluation measure, on the basis of the scores from, and differences between, different reproductions. We test QRA on 18 system and evaluation measure combinations (involving diverse NLP tasks and types of evaluation), for each of which we have the original results and one to seven reproduction results. The proposed QRA method produces degree-of-reproducibility scores that are comparable across multiple reproductions not only of the same, but of different original studies. We find that the proposed method facilitates insights into causes of variation between reproductions, and allows conclusions to be drawn about what changes to system and/or evaluation design might lead to improved reproducibility.

Submitted 12 April, 2022; originally announced April 2022.

Comments: To be published in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL'22)

arXiv:2203.01652 [pdf, other]

Informative Path Planning for Active Learning in Aerial Semantic Mapping

Authors: Julius Rückin, Liren Jin, Federico Magistri, Cyrill Stachniss, Marija Popović

Abstract: Semantic segmentation of aerial imagery is an important tool for mapping and earth observation. However, supervised deep learning models for segmentation rely on large amounts of high-quality labelled data, which is labour-intensive and time-consuming to generate. To address this, we propose a new approach for using unmanned aerial vehicles (UAVs) to autonomously collect useful data for model training. We exploit a Bayesian approach to estimate model uncertainty in semantic segmentation. During a mission, the semantic predictions and model uncertainty are used as input for terrain mapping. A key aspect of our pipeline is to link the mapped model uncertainty to a robotic planning objective based on active learning. This enables us to adaptively guide a UAV to gather the most informative terrain images to be labelled by a human for model training. Our experimental evaluation on real-world data shows the benefit of using our informative planning approach in comparison to static coverage paths in terms of maximising model performance and reducing labelling efforts.

Submitted 2 September, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

Comments: 8 pages, 9 figures, Submission to IEEE/RSJ International Conference on Robotics and Intelligent Systems

arXiv:2203.01642 [pdf, other]

Adaptive Path Planning for UAVs for Multi-Resolution Semantic Segmentation

Authors: Felix Stache, Jonas Westheider, Federico Magistri, Cyrill Stachniss, Marija Popović

Abstract: Efficient data collection methods play a major role in helping us better understand the Earth and its ecosystems. In many applications, the usage of unmanned aerial vehicles (UAVs) for monitoring and remote sensing is rapidly gaining momentum due to their high mobility, low cost, and flexible deployment. A key challenge is planning missions to maximize the value of acquired data in large environments given flight time limitations. This is, for example, relevant for monitoring agricultural fields. This paper addresses the problem of adaptive path planning for accurate semantic segmentation using UAVs. We propose an online planning algorithm which adapts the UAV paths to obtain high-resolution semantic segmentations necessary in areas with fine details as they are detected in incoming images. This enables us to perform close inspections at low altitudes only where required, without wasting energy on exhaustive mapping at maximum image resolution. A key feature of our approach is a new accuracy model for deep learning-based architectures that captures the relationship between UAV altitude and semantic segmentation accuracy. We evaluate our approach on different domains using real-world data, proving the efficacy and generability of our solution.

Submitted 3 March, 2022; originally announced March 2022.

Comments: 10 pages, 7 figures, Submission to Robotics and Autonomous Systems journal. arXiv admin note: substantial text overlap with arXiv:2108.01884

arXiv:2109.14257 [pdf, other]

Adaptive-Resolution Field Mapping Using Gaussian Process Fusion with Integral Kernels

Authors: Liren Jin, Julius Rückin, Stefan H. Kiss, Teresa Vidal-Calleja, Marija Popović

Abstract: Unmanned aerial vehicles are rapidly gaining popularity in a variety of environmental monitoring tasks. A key requirement for their autonomous operation is the ability to perform efficient environmental mapping online, given limited onboard resources constraining operation time, travel distance, and computational capacity. To address this, we present an online adaptive-resolution approach for mapping terrain based on Gaussian Process fusion. A key aspect of our approach is an integral kernel encoding spatial correlation over the areas of grid cells, which enables modifying map resolution while maintaining correlations in a theoretically sound fashion. This way, we can retain details in areas of interest at higher map resolutions while compressing information in uninteresting areas at coarser resolutions to achieve a compact map representation of the environment. We evaluate the performance of our approach on both synthetic and real-world data. Results show that our method is more efficient in terms of mapping time and memory consumption without compromising on map quality. Finally, we integrate our mapping strategy into an adaptive path planning framework to show that it facilitates information gathering efficiency in online settings.

Submitted 3 March, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

arXiv:2109.13570 [pdf, other]

Adaptive Informative Path Planning Using Deep Reinforcement Learning for UAV-based Active Sensing

Authors: Julius Rückin, Liren Jin, Marija Popović

Abstract: Aerial robots are increasingly being utilized for environmental monitoring and exploration. However, a key challenge is efficiently planning paths to maximize the information value of acquired data as an initially unknown environment is explored. To address this, we propose a new approach for informative path planning based on deep reinforcement learning (RL). Combining recent advances in RL and robotic applications, our method combines tree search with an offline-learned neural network predicting informative sensing actions. We introduce several components making our approach applicable for robotic tasks with high-dimensional state and large action spaces. By deploying the trained network during a mission, our method enables sample-efficient online replanning on platforms with limited computational resources. Simulations show that our approach performs on par with existing methods while reducing runtime by 8-10x. We validate its performance using real-world surface temperature data.

Submitted 3 March, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

Comments: Accepted in IEEE International Conference on Robotics and Automation 2022

arXiv:2108.01884 [pdf, other]

Adaptive Path Planning for UAV-based Multi-Resolution Semantic Segmentation

Authors: Felix Stache, Jonas Westheider, Federico Magistri, Marija Popović, Cyrill Stachniss

Abstract: In this paper, we address the problem of adaptive path planning for accurate semantic segmentation of terrain using unmanned aerial vehicles (UAVs). The usage of UAVs for terrain monitoring and remote sensing is rapidly gaining momentum due to their high mobility, low cost, and flexible deployment. However, a key challenge is planning missions to maximize the value of acquired data in large environments given flight time limitations. To address this, we propose an online planning algorithm which adapts the UAV paths to obtain high-resolution semantic segmentations necessary in areas on the terrain with fine details as they are detected in incoming images. This enables us to perform close inspections at low altitudes only where required, without wasting energy on exhaustive mapping at maximum resolution. A key feature of our approach is a new accuracy model for deep learning-based architectures that captures the relationship between UAV altitude and semantic segmentation accuracy. We evaluate our approach on the application of crop/weed segmentation in precision agriculture using real-world field data.

Submitted 4 August, 2021; originally announced August 2021.

Comments: 6 pages, submission to European Conference on Mobile Robots (ECMR) 2021

arXiv:2107.05987 [pdf, other]

Generating Gender Augmented Data for NLP

Authors: Nishtha Jain, Maja Popovic, Declan Groves, Eva Vanmassenhove

Abstract: Gender bias is a frequent occurrence in NLP-based applications, especially pronounced in gender-inflected languages. Bias can appear through associations of certain adjectives and animate nouns with the natural gender of referents, but also due to unbalanced grammatical gender frequencies of inflected words. This type of bias becomes more evident in generating conversational utterances where gender is not specified within the sentence, because most current NLP applications still work on a sentence-level context. As a step towards more inclusive NLP, this paper proposes an automatic and generalisable rewriting approach for short conversational sentences. The rewriting method can be applied to sentences that, without extra-sentential context, have multiple equivalent alternatives in terms of gender. The method can be applied both for creating gender balanced outputs as well as for creating gender balanced training data. The proposed approach is based on a neural machine translation (NMT) system trained to 'translate' from one gender alternative to another. Both the automatic and manual analysis of the approach show promising results for automatic generation of gender alternatives for conversational sentences in Spanish.

Submitted 13 July, 2021; originally announced July 2021.

Comments: 10 pages, 4 tables

arXiv:2106.07415 [pdf, other]

Accumulative Iterative Codes Based on Feedback

Authors: Alberto G. Perotti, Branislav M. Popovic, Anahid R. Safavi

Abstract: The Accumulative Iterative Code (AIC) proposed in this work is a new error correcting code for channels with feedback. AIC sends the information message to the receiver in a number of transmissions, where the initial transmission contains the uncoded message and each subsequent transmission informs the receiver about the locations of the errors that corrupted the previous transmission. Error locations are determined based on the forward channel output, which is made available to the transmitter through the feedback channel. AIC achieves arbitrarily low error rates, thereby being suitable for applications demanding extremely high reliability. At the same time, AIC achieves spectral efficiencies very close to the channel capacity in a wide range of signal-to-noise ratios even for transmission of short information messages.

Submitted 14 June, 2021; originally announced June 2021.

Comments: 30 pages, 8 figures, 4 tables; submitted to IEEE Transactions on Communications

arXiv:2105.01365 [pdf, other]

Deep Extended Feedback Codes

Authors: Anahid Robert Safavi, Alberto G. Perotti, Branislav M. Popovic, Mahdi Boloursaz Mashhadi, Deniz Gunduz

Abstract: A new deep-neural-network (DNN) based error correction encoder architecture for channels with feedback, called Deep Extended Feedback (DEF), is presented in this paper. The encoder in the DEF architecture transmits an information message followed by a sequence of parity symbols which are generated based on the message as well as the observations of the past forward channel outputs sent to the transmitter through a feedback channel. DEF codes generalize Deepcode [1] in several ways: parity symbols are generated based on forward-channel output observations over longer time intervals in order to provide better error correction capability; and high-order modulation formats are deployed in the encoder so as to achieve increased spectral efficiency. Performance evaluations show that DEF codes have better performance compared to other DNN-based codes for channels with feedback.

Submitted 4 May, 2021; originally announced May 2021.

Comments: 7 pages, 5 figures, 5 tables

arXiv:2012.03023 [pdf, other]

doi 10.1109/LRA.2021.3070308

Volumetric Occupancy Mapping With Probabilistic Depth Completion for Robotic Navigation

Authors: Marija Popovic, Florian Thomas, Sotiris Papatheodorou, Nils Funk, Teresa Vidal-Calleja, Stefan Leutenegger

Abstract: In robotic applications, a key requirement for safe and efficient motion planning is the ability to map obstacle-free space in unknown, cluttered 3D environments. However, commodity-grade RGB-D cameras commonly used for sensing fail to register valid depth values on shiny, glossy, bright, or distant surfaces, leading to missing data in the map. To address this issue, we propose a framework leveraging probabilistic depth completion as an additional input for spatial mapping. We introduce a deep learning architecture providing uncertainty estimates for the depth completion of RGB-D images. Our pipeline exploits the inferred missing depth values and depth uncertainty to complement raw depth images and improve the speed and quality of free space mapping. Evaluations on synthetic data show that our approach maps significantly more correct free space with relatively low error when compared against using raw data alone in different indoor environments; thereby producing more complete maps that can be directly used for robotic navigation tasks. The performance of our framework is validated using real-world data.

Submitted 22 March, 2021; v1 submitted 5 December, 2020; originally announced December 2020.

Comments: 8 pages, 10 figures, submission to IEEE Robotics and Automation Letters (revised)

arXiv:2010.14476 [pdf, other]

doi 10.1371/journal.pone.0249769

Artificial intelligence based writer identification generates new evidence for the unknown scribes of the Dead Sea Scrolls exemplified by the Great Isaiah Scroll (1QIsaa)

Authors: Mladen Popović, Maruf A. Dhali, Lambert Schomaker

Abstract: The Dead Sea Scrolls are tangible evidence of the Bible's ancient scribal culture. Palaeography - the study of ancient handwriting - can provide access to this scribal culture. However, one of the problems of traditional palaeography is to determine writer identity when the writing style is near uniform. This is exemplified by the Great Isaiah Scroll (1QIsaa). To this end, we used pattern recognition and artificial intelligence techniques to innovate the palaeography of the scrolls regarding writer identification and to pioneer the microlevel of individual scribes to open access to the Bible's ancient scribal culture. Although many scholars believe that 1QIsaa was written by one scribe, we report new evidence for a breaking point in the series of columns in this scroll. Without prior assumption of writer identity, based on point clouds of the reduced-dimensionality feature-space, we found that columns from the first and second halves of the manuscript ended up in two distinct zones of such scatter plots, notably for a range of digital palaeography tools, each addressing very different featural aspects of the script samples. In a secondary, independent, analysis, now assuming writer difference and using yet another independent feature method and several different types of statistical testing, a switching point was found in the column series. A clear phase transition is apparent around column 27. Given the statistically significant differences between the two halves, a tertiary, post-hoc analysis was performed.
Demonstrating that two main scribes were responsible for the Great Isaiah Scroll, this study sheds new light on the Bible's ancient scribal culture by providing new, tangible evidence that ancient biblical texts were not copied by a single scribe only but that multiple scribes could closely collaborate on one particular manuscript.

Submitted 27 October, 2020; originally announced October 2020.

Comments: 23 pages, 15 pages of supplementary materials, submitted to PLOS ONE on 19 October 2019

Report number: PLoS ONE 16(4): e0249769

Journal ref: PLoS ONE 2021

arXiv:2010.09232 [pdf, other]

doi 10.1109/ICRA48506.2021.9561736

Elastic and Efficient LiDAR Reconstruction for Large-Scale Exploration Tasks

Authors: Yiduo Wang, Nils Funk, Milad Ramezani, Sotiris Papatheodorou, Marija Popovic, Marco Camurri, Stefan Leutenegger, Maurice Fallon

Abstract: We present an efficient, elastic 3D LiDAR reconstruction framework which can reconstruct up to maximum LiDAR ranges (60 m) at multiple frames per second, thus enabling robot exploration in large-scale environments. Our approach only requires a CPU. We focus on three main challenges of large-scale reconstruction: integration of long-range LiDAR scans at high frequency, the capacity to deform the reconstruction after loop closures are detected, and scalability for long-duration exploration. Our system extends upon a state-of-the-art efficient RGB-D volumetric reconstruction technique, called supereight, to support LiDAR scans and a newly developed submapping technique to allow for dynamic correction of the 3D reconstruction. We then introduce a novel pose graph clustering and submap fusion feature to make the proposed system more scalable for large environments. We evaluate the performance using two public datasets including outdoor exploration with a handheld device and a drone, and with a mobile robot exploring an underground room network. Experimental results demonstrate that our system can reconstruct at 3 Hz with 60 m sensor range and ~5 cm resolution, while state-of-the-art approaches can only reconstruct to 25 cm resolution or 20 m range at the same frequency.

Submitted 9 April, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

Comments: 7 pages, 7 figures

arXiv:2010.07929 [pdf, other]

doi 10.1109/LRA.2021.3061989

Multi-Resolution 3D Mapping with Explicit Free Space Representation for Fast and Accurate Mobile Robot Motion Planning

Authors: Nils Funk, Juan Tarrio, Sotiris Papatheodorou, Marija Popovic, Pablo F. Alcantarilla, Stefan Leutenegger

Abstract: With the aim of bridging the gap between high quality reconstruction and mobile robot motion planning, we propose an efficient system that leverages the concept of adaptive-resolution volumetric mapping, which naturally integrates with the hierarchical decomposition of space in an octree data structure. Instead of a Truncated Signed Distance Function (TSDF), we adopt mapping of occupancy probabilities in log-odds representation, which allows us to represent both surfaces and the entire free, i.e. observed, space, as opposed to unobserved space. We introduce a method for choosing resolution -- on the fly -- in real-time by means of a multi-scale max-min pooling of the input depth image. The notion of explicit free space mapping paired with the spatial hierarchy in the data structure, as well as map resolution, allows for collision queries, as needed for robot motion planning, at unprecedented speed. We quantitatively evaluate mapping accuracy, memory, runtime performance, and planning performance showing improvements over the state of the art, particularly in cases requiring high resolution maps.

Submitted 30 January, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

Comments: 8 pages, 9 figures, 5 tables

arXiv:2008.05994 [pdf]

doi 10.1371/journal.pone.0253612

A community-powered search of machine learning strategy space to find NMR property prediction models

Authors: Lars A. Bratholm, Will Gerrard, Brandon Anderson, Shaojie Bai, Sunghwan Choi, Lam Dang, Pavel Hanchar, Addison Howard, Guillaume Huard, Sanghoon Kim, Zico Kolter, Risi Kondor, Mordechai Kornbluth, Youhan Lee, Youngsoo Lee, Jonathan P. Mailoa, Thanh Tu Nguyen, Milos Popovic, Goran Rakocevic, Walter Reade, Wonho Song, Luka Stojanovic, Erik H. Thiede, Nebojsa Tijanic, Andres Torrubia , et al. (4 additional authors not shown)

Abstract: The rise of machine learning (ML) has created an explosion in the potential strategies for using data to make scientific predictions. For physical scientists wishing to apply ML strategies to a particular domain, it can be difficult to assess in advance what strategy to adopt within a vast space of possibilities. Here we outline the results of an online community-powered effort to swarm search the space of ML strategies and develop algorithms for predicting atomic-pairwise nuclear magnetic resonance (NMR) properties in molecules. Using an open-source dataset, we worked with Kaggle to design and host a 3-month competition which received 47,800 ML model predictions from 2,700 teams in 84 countries. Within 3 weeks, the Kaggle community produced models with comparable accuracy to our best previously published "in-house" efforts. A meta-ensemble model constructed as a linear combination of the top predictions has a prediction accuracy which exceeds that of any individual model, 7-19x better than our previous state-of-the-art. The results highlight the potential of transformer architectures for predicting quantum mechanical (QM) molecular properties.

Submitted 13 August, 2020; originally announced August 2020.

arXiv:2007.08341 [pdf]

Zero Correlation Zone Sequences With Flexible Block-Repetitive Spectral Constraints

Authors: Branislav M. Popovic, Peng Wang, Fredrik Berggren, Renaud-Alexandre Pitaval

Abstract: A general construction of a set of time-domain sequences with sparse periodic correlation functions, having multiple segments of consecutive zero-values, i.e. multiple zero correlation zones (ZCZs), is presented. All such sequences have a common and block-repetitive structure of the positions of zeros in their Discrete Fourier Transform (DFT) sequences, where the exact positions of zeros in a DFT sequence do not impact the positions and sizes of ZCZs. This property offers a completely new degree of flexibility in designing signals with good correlation properties under various spectral constraints. The non-zero values of the DFT sequences are determined by the corresponding frequency-domain modulation sequences, constructed as the element-by-element product of two component sequences: a "long" one, which is common to the set of time-domain sequences, and which controls the peak-to-average power ratio (PAPR) properties of the time-domain sequences; and a "short" one, periodically extended to match the length of the "long" component sequence, which controls the non-zero crosscorrelation values of all time-domain sequences. It is shown that 0 dB PAPR of time-domain sequences can be obtained if the "long" frequency-domain component sequence is selected to be a modulatable constant amplitude zero autocorrelation (MCAZAC) sequence. A generalized and simplified unified construction of MCAZAC sequences is presented.

Submitted 16 July, 2020; originally announced July 2020.

Comments: 7 pages (double-column). This work has been submitted to IEEE Transactions on Information Theory. This version provides in addition a new generalization of the unified construction of modulatable CAZAC sequences in Sect. V, reflected also in Sect. VI

arXiv:2006.02116 [pdf, other]

Aerial Manipulation Using Hybrid Force and Position NMPC Applied to Aerial Writing

Authors: Dimos Tzoumanikas, Felix Graule, Qingyue Yan, Dhruv Shah, Marija Popovic, Stefan Leutenegger

Abstract: Aerial manipulation aims at combining the manoeuvrability of aerial vehicles with the manipulation capabilities of robotic arms. This, however, comes at the cost of the additional control complexity due to the coupling of the dynamics of the two systems. In this paper we present an NMPC specifically designed for MAVs equipped with a robotic arm. We formulate a hybrid control model for the combined MAV-arm system which incorporates interaction forces acting on the end effector. We explain the practical implementation of our algorithm and show extensive experimental results of our custom-built system performing multiple aerial-writing tasks on a whiteboard, revealing accuracy in the order of millimetres.

Submitted 3 June, 2020; originally announced June 2020.

Comments: Accepted for publication in Robotics: Science and Systems (RSS) 2020. Video: https://youtu.be/iE--MO0YF0o

arXiv:2005.00283 [pdf, other]

Facilitating Access to Multilingual COVID-19 Information via Neural Machine Translation

Authors: Andy Way, Rejwanul Haque, Guodong Xie, Federico Gaspari, Maja Popovic, Alberto Poncelas

Abstract: Every day, more people are becoming infected and dying from exposure to COVID-19. Some countries in Europe like Spain, France, the UK and Italy have suffered particularly badly from the virus. Others such as Germany appear to have coped extremely well. Both health professionals and the general public are keen to receive up-to-date information on the effects of the virus, as well as treatments that have proven to be effective. In cases where language is a barrier to access of pertinent information, machine translation (MT) may help people assimilate information published in different languages. Our MT systems trained on COVID-19 data are freely available for anyone to use to help translate information published in German, French, Italian, Spanish into English, as well as the reverse direction.

Submitted 1 May, 2020; originally announced May 2020.

arXiv:1911.03098 [pdf, other]

doi 10.1109/MRA.2020.3012492

Building an Aerial-Ground Robotics System for Precision Farming: An Adaptable Solution

Authors: Alberto Pretto, Stéphanie Aravecchia, Wolfram Burgard, Nived Chebrolu, Christian Dornhege, Tillmann Falck, Freya Fleckenstein, Alessandra Fontenla, Marco Imperoli, Raghav Khanna, Frank Liebisch, Philipp Lottes, Andres Milioto, Daniele Nardi, Sandro Nardi, Johannes Pfeifer, Marija Popović, Ciro Potena, Cédric Pradalier, Elisa Rothacker-Feder, Inkyu Sa, Alexander Schaefer, Roland Siegwart, Cyrill Stachniss, Achim Walter , et al. (3 additional authors not shown)

Abstract: The application of autonomous robots in agriculture is gaining increasing popularity thanks to the high impact it may have on food security, sustainability, resource use efficiency, reduction of chemical treatments, and the optimization of human effort and yield. With this vision, the Flourish research project aimed to develop an adaptable robotic solution for precision farming that combines the aerial survey capabilities of small autonomous unmanned aerial vehicles (UAVs) with targeted intervention performed by multi-purpose unmanned ground vehicles (UGVs). This paper presents an overview of the scientific and technological advances and outcomes obtained in the project. We introduce multi-spectral perception algorithms and aerial and ground-based systems developed for monitoring crop density, weed pressure, crop nitrogen nutrition status, and to accurately classify and locate weeds. We then introduce the navigation and mapping systems tailored to our robots in the agricultural environment, as well as the modules for collaborative mapping. We finally present the ground intervention hardware, software solutions, and interfaces we implemented and tested in different field conditions and with different crops. We describe a real use case in which a UAV collaborates with a UGV to monitor the field and to perform selective spraying without human intervention.

Submitted 7 June, 2022; v1 submitted 8 November, 2019; originally announced November 2019.

Comments: Published in IEEE Robotics & Automation Magazine, vol. 28, no. 3, pp. 29-49, Sept. 2021

Journal ref: IEEE Robotics & Automation Magazine, vol. 28, no. 3, pp. 29-49, Sept. 2021

arXiv:1909.03750 [pdf, other]

Combining SMT and NMT Back-Translated Data for Efficient NMT

Authors: Alberto Poncelas, Maja Popovic, Dimitar Shterionov, Gideon Maillette de Buy Wenniger, Andy Way

Abstract: Neural Machine Translation (NMT) models achieve their best performance when large sets of parallel data are used for training. Consequently, techniques for augmenting the training set have become popular recently. One of these methods is back-translation (Sennrich et al., 2016), which consists of generating synthetic sentences by translating a set of monolingual, target-language sentences using a Machine Translation (MT) model. Generally, NMT models are used for back-translation. In this work, we analyze the performance of models when the training data is extended with synthetic data using different MT approaches. In particular we investigate back-translated data generated not only by NMT but also by Statistical Machine Translation (SMT) models and combinations of both. The results reveal that the models achieve the best performances when the training set is augmented with back-translated data created by merging different MT approaches.

Submitted 9 September, 2019; originally announced September 2019.

Journal ref: Proceedings of Recent Advances in Natural Language Processing (RANLP 2019). pages 922--931

arXiv:1908.11157 [pdf, other]

Active Learning for UAV-based Semantic Mapping

Authors: Hermann Blum, Silvan Rohrbach, Marija Popovic, Luca Bartolomei, Roland Siegwart

Abstract: Unmanned aerial vehicles combined with computer vision systems, such as convolutional neural networks, offer a flexible and affordable solution for terrain monitoring, mapping, and detection tasks. However, a key challenge remains the collection and annotation of training data for the given sensors, application, and mission. We introduce an informative path planning system that incorporates novelty estimation into its objective function, based on research for uncertainty estimation in deep learning. The system is designed for data collection to reduce both the number of flights and of annotated images. We evaluate the approach on real world terrain mapping data and show a significantly smaller collected training dataset compared to standard lawnmower data collection techniques.

Submitted 16 December, 2019; v1 submitted 29 August, 2019; originally announced August 2019.

Comments: presented at RSS'19 "2nd Workshop on Informative Path Planning and Adaptive Sampling"

arXiv:1902.10182 [pdf, other]

Obstacle-aware Adaptive Informative Path Planning for UAV-based Target Search

Authors: Ajith Anil Meera, Marija Popovic, Alexander Millane, Roland Siegwart

Abstract: Target search with unmanned aerial vehicles (UAVs) is a problem relevant to many scenarios, e.g., search and rescue (SaR). However, a key challenge is planning paths for maximal search efficiency given flight time constraints. To address this, we propose the Obstacle-aware Adaptive Informative Path Planning (OA-IPP) algorithm for target search in cluttered environments using UAVs. Our approach leverages a layered planning strategy using a Gaussian Process (GP)-based model of target occupancy to generate informative paths in continuous 3D space. Within this framework, we introduce an adaptive replanning scheme which allows us to trade off between information gain, field coverage, sensor performance, and collision avoidance for efficient target detection. Extensive simulations show that our OA-IPP method performs better than state-of-the-art planners, and we demonstrate its application in a realistic urban SaR scenario.

Submitted 26 February, 2019; originally announced February 2019.

Comments: Paper accepted for International Conference on Robotics and Automation (ICRA-2019) to be held at Montreal, Canada

Showing 1–50 of 68 results for author: Popovic, M