-
Closed-Loop Binary Media-Based Modulation
Authors:
Majid Nasiri Khormuji,
Branislav M. Popovic
Abstract:
This paper presents analytical results for Binary Media-Based Modulation (B-MBM) over fading channels with single-antenna receivers. It shows that open-loop B-MBM, in the absence of feedback, achieves a diversity order of only one. With feedback and optimal weight selection in a closed-loop configuration, however, a diversity order of two becomes achievable. Notably, closed-loop B-MBM with analytically computed optimal weights performs equivalently to Alamouti-coded BPSK transmission, demonstrating feasibility with just one radio frequency chain when feedback is available.
Submitted 3 July, 2024;
originally announced July 2024.
-
Error Span Annotation: A Balanced Approach for Human Evaluation of Machine Translation
Authors:
Tom Kocmi,
Vilém Zouhar,
Eleftherios Avramidis,
Roman Grundkiewicz,
Marzena Karpinska,
Maja Popović,
Mrinmaya Sachan,
Mariya Shmatova
Abstract:
High-quality Machine Translation (MT) evaluation relies heavily on human judgments. Comprehensive error classification methods, such as Multidimensional Quality Metrics (MQM), are expensive as they are time-consuming and can only be done by experts, whose availability may be limited, especially for low-resource languages. On the other hand, just assigning overall scores, like Direct Assessment (DA), is simpler and faster and can be done by translators of any level, but is less reliable. In this paper, we introduce Error Span Annotation (ESA), a human evaluation protocol which combines the continuous rating of DA with the high-level error severity span marking of MQM. We validate ESA by comparing it to MQM and DA for 12 MT systems and one human reference translation (English to German) from WMT23. The results show that ESA offers faster and cheaper annotations than MQM at the same quality level, without the requirement of expensive MQM experts.
Submitted 17 June, 2024;
originally announced June 2024.
-
Cell divisions imprint long lasting elastic strain fields in epithelial tissues
Authors:
Ali Tahaei,
Romina Pisticello-Gómez,
S Suganthan,
Greta Cwikla,
Jana F. Fuhrmann,
Natalie A. Dye,
Marko Popović
Abstract:
A hallmark of biological tissues, viewed as complex cellular materials, is the active generation of mechanical stresses by cellular processes, such as cell divisions. Each cellular event generates a force dipole that deforms the surrounding tissue. Therefore, a quantitative description of these force dipoles, and their consequences for tissue mechanics, is one of the central problems in understanding the overall tissue mechanics. In this work we analyze previously published experimental data on fruit fly \textit{D. melanogaster} wing epithelia to quantitatively describe the deformation fields induced by a cell-scale force dipole. We find that the measured deformation field can be explained by a simple model of fly epithelium as a linearly elastic sheet. This fact allows us to use measurements of the strain field around cellular events, such as cell divisions, to infer the magnitude and dynamics of the mechanical forces they generate. In particular, we find that cell divisions exert a transient isotropic force dipole field, corresponding to the temporary localisation of the cell nucleus to the tissue surface during the division, and a traceless-symmetric force dipole field that remains detectable from the tissue strain field for up to about $3.5$ hours after the division. This is the timescale on which elastic strains are erased by other mechanical processes, and it therefore corresponds to the tissue fluidization timescale. In summary, we have developed a method to infer force dipoles induced by cell divisions by observing the strain field in the surrounding tissues. Using this method we quantitatively characterize mechanical forces generated during a cell division, and their effects on the tissue mechanics.
Submitted 5 June, 2024;
originally announced June 2024.
-
ZTF SN Ia DR2: Study of Type Ia Supernova lightcurve fits
Authors:
M. Rigault,
M. Smith,
N. Regnault,
D. W. Kenworthy,
K. Maguire,
A. Goobar,
G. Dimitriadis,
M. Amenouche,
M. Aubert,
C. Barjou-Delayre,
C. E. Bellm,
U. Burgaz,
B. Carreres,
Y. Copin,
M. Deckers,
T. de Jaeger,
S. Dhawan,
F. Feinstein,
D. Fouchez,
L. Galbany,
M. Ginolin,
J. M. Graham,
Y. -L. Kim,
M. Kowalski,
D. Kuhn,
et al. (12 additional authors not shown)
Abstract:
Type Ia supernova (SN Ia) cosmology relies on the estimation of lightcurve parameters to derive precision distances that lead to the estimation of cosmological parameters. The empirical SALT2 lightcurve modeling that relies on only two parameters, a stretch x1 and a color c, has been used by the community for almost two decades. In this paper we study the ability of the SALT2 model to fit the nearly 3000 cosmology-grade SN Ia lightcurves from the second release of the Zwicky Transient Facility (ZTF) cosmology science working group. While the ZTF data was not used to train SALT2, the algorithm models the ZTF SN Ia optical lightcurves remarkably well, except for lightcurve points prior to -10 d from maximum, where the training critically lacks statistics. We find that the lightcurve fitting is robust against the considered choice of phase-range, but we show the [-10; +40] d range to be optimal in terms of statistics and accuracy. We do not detect any significant features in the lightcurve fit residuals that could be connected to the host environment. Potential systematic population differences related to the SN Ia host properties might thus not be accounted for by the addition of extra lightcurve parameters. However, a small but significant inconsistency between residuals of blue- and red-SN Ia strongly suggests the existence of a phase-dependent color term, with potential implications for the use of SNe Ia in precision cosmology. We thus encourage modellers to explore this avenue, and we emphasize that SN Ia cosmology must include a SALT2 retraining to accurately model the lightcurves and avoid biasing the derivation of cosmological parameters.
Submitted 4 June, 2024;
originally announced June 2024.
-
Fast Transaction Scheduling in Blockchain Sharding
Authors:
Ramesh Adhikari,
Costas Busch,
Miroslav Popovic
Abstract:
Sharding is a promising technique for addressing the scalability issues of blockchain. It divides the $n$ participating nodes into $s$ disjoint groups called shards, where each shard processes transactions in parallel. We investigate scheduling algorithms for blockchain sharding systems, where each transaction resides in a shard of the communication graph and attempts to access accounts at possibly remote shards. We examine batch scheduling problems on the shard graph $G_s$, where given a set of transactions, we aim to find efficient schedules to execute them as fast as possible. First, we present a centralized scheduler where one of the shards has global knowledge of transactions to be processed. For general graphs, where the transaction and its accessing objects are arbitrarily far from each other with a maximum distance $d$, the centralized scheduler provides an $O(kd)$ approximation to the optimal schedule, where $k$ is the maximum number of shards each transaction accesses. Consequently, for a Clique graph where shards are at a unit distance from each other, we obtain an $O(k)$ approximation to the optimal schedule. We also get an $O(k \log s)$ approximation for Hypercube, Butterfly, and $g$-dimensional Grid, where $g=O(\log s)$. Next, we provide a centralized scheduler with a bucketing approach that offers improved bounds for special cases. Finally, we provide a distributed scheduler where shards do not require global transaction information. We achieve this by using a hierarchical clustering of the shards and using the centralized scheduler in each cluster. We show that the distributed scheduler has a competitive ratio of $O(\mathcal{A}_{\mathcal{CS}} \log^2 s)$, where $\mathcal{A}_{\mathcal{CS}}$ is the approximation ratio of the centralized scheduler. To our knowledge, we are the first to give provably fast transaction scheduling algorithms for blockchain sharding systems.
Submitted 23 May, 2024;
originally announced May 2024.
-
MicroPython Testbed for Federated Learning Algorithms
Authors:
Miroslav Popovic,
Marko Popovic,
Ivan Kastelan,
Miodrag Djukic,
Ilija Basicevic
Abstract:
Recently, the Python Testbed for Federated Learning Algorithms emerged as a low-code framework, amenable to generative large language models, for developing decentralized and distributed applications, primarily targeting edge systems, by nonprofessional programmers with the help of emerging artificial intelligence tools. This light framework is written in pure Python to be easy to install and to fit into a small IoT memory. It supports formally verified generic centralized and decentralized federated learning algorithms, as well as the peer-to-peer data exchange used in time division multiplexing communication. Its current main limitation is that all the application instances can run only on a single PC. This paper presents the MicroPython Testbed for Federated Learning Algorithms, a new framework that overcomes its predecessor's limitation so that individual application instances may run on different network nodes such as PCs and IoT devices, primarily in edge systems. The new framework carries on the pure Python ideal, is based on asynchronous I/O abstractions, and runs on MicroPython, and is therefore a great match for IoT devices and other devices in edge systems. The new framework was experimentally validated on a wireless network comprising PCs and Raspberry Pi Pico W boards, using application examples originally developed for the predecessor framework.
Submitted 15 May, 2024;
originally announced May 2024.
-
Robotic Learning for Adaptive Informative Path Planning
Authors:
Marija Popovic,
Joshua Ott,
Julius Rückin,
Mykel J. Kochenderfer
Abstract:
Adaptive informative path planning (AIPP) is important to many robotics applications, enabling mobile robots to efficiently collect useful data about initially unknown environments. In addition, learning-based methods are increasingly used in robotics to enhance adaptability, versatility, and robustness across diverse and complex tasks. Our survey explores research on applying robotic learning to AIPP, bridging the gap between these two research fields. We begin by providing a unified mathematical framework for general AIPP problems. Next, we establish two complementary taxonomies of current work from the perspectives of (i) learning algorithms and (ii) robotic applications. We explore synergies and recent trends, and highlight the benefits of learning-based methods in AIPP frameworks. Finally, we discuss key challenges and promising future directions to enable more generally applicable and robust robotic data-gathering systems through learning. We provide a comprehensive catalogue of papers reviewed in our survey, including publicly available repositories, to facilitate future studies in the field.
Submitted 15 April, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning
Authors:
Sicong Pan,
Liren Jin,
Xuying Huang,
Cyrill Stachniss,
Marija Popović,
Maren Bennewitz
Abstract:
Object reconstruction is relevant for many autonomous robotic tasks that require interaction with the environment. A key challenge in such scenarios is planning view configurations to collect informative measurements for reconstructing an initially unknown object. One-shot view planning enables efficient data collection by predicting view configurations and planning the globally shortest path connecting all views at once. However, geometric priors about the object are required to conduct one-shot view planning. In this work, we propose a novel one-shot view planning approach that utilizes the powerful 3D generation capabilities of diffusion models as priors. By incorporating such geometric priors into our pipeline, we achieve effective one-shot view planning starting with only a single RGB image of the object to be reconstructed. Our planning experiments in simulation and real-world setups indicate that our approach balances well between object reconstruction quality and movement cost.
Submitted 25 March, 2024;
originally announced March 2024.
-
STAIR: Semantic-Targeted Active Implicit Reconstruction
Authors:
Liren Jin,
Haofei Kuang,
Yue Pan,
Cyrill Stachniss,
Marija Popović
Abstract:
Many autonomous robotic applications require object-level understanding when deployed. Actively reconstructing objects of interest, i.e. objects with specific semantic meanings, is therefore relevant for a robot to perform downstream tasks in an initially unknown environment. In this work, we propose a novel framework for semantic-targeted active reconstruction using posed RGB-D measurements and 2D semantic labels as input. The key components of our framework are a semantic implicit neural representation and a compatible planning utility function based on semantic rendering and uncertainty estimation, enabling adaptive view planning to target objects of interest. Our planning approach achieves better reconstruction performance in terms of mesh and novel view rendering quality compared to implicit reconstruction baselines that do not consider semantics for view planning. Our framework further outperforms a state-of-the-art semantic-targeted active reconstruction pipeline based on explicit maps, justifying our choice of utilising implicit neural representations to tackle semantic-targeted active reconstruction problems.
Submitted 17 March, 2024;
originally announced March 2024.
-
Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning
Authors:
Apoorva Vashisth,
Julius Rückin,
Federico Magistri,
Cyrill Stachniss,
Marija Popović
Abstract:
Autonomous robots are often employed for data collection due to their efficiency and low labour costs. A key task in robotic data acquisition is planning paths through an initially unknown environment to collect observations given platform-specific resource constraints, such as limited battery life. Adaptive online path planning in 3D environments is challenging due to the large set of valid actions and the presence of unknown occlusions. To address these issues, we propose a novel deep reinforcement learning approach for adaptively replanning robot paths to map targets of interest in unknown 3D environments. A key aspect of our approach is a dynamically constructed graph that restricts planning actions local to the robot, allowing us to react to newly discovered static obstacles and targets of interest. For replanning, we propose a new reward function that balances between exploring the unknown environment and exploiting online-discovered targets of interest. Our experiments show that our method enables more efficient target discovery compared to state-of-the-art learning and non-learning baselines. We also showcase our approach for orchard monitoring using an unmanned aerial vehicle in a photorealistic simulator. We open-source our code and model at: https://github.com/dmar-bonn/ipp-rl-3d.
Submitted 5 July, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.
-
Ductile-to-brittle transition and yielding in soft amorphous materials: perspectives and open questions
Authors:
Thibaut Divoux,
Elisabeth Agoritsas,
Stefano Aime,
Catherine Barentin,
Jean-Louis Barrat,
Roberto Benzi,
Ludovic Berthier,
Dapeng Bi,
Giulio Biroli,
Daniel Bonn,
Philippe Bourrianne,
Mehdi Bouzid,
Emanuela Del Gado,
Hélène Delanoë-Ayari,
Kasra Farain,
Suzanne Fielding,
Matthias Fuchs,
Jasper van der Gucht,
Silke Henkes,
Maziyar Jalaal,
Yogesh M. Joshi,
Anaël Lemaître,
Robert L. Leheny,
Sébastien Manneville,
Kirsten Martens,
et al. (15 additional authors not shown)
Abstract:
Soft amorphous materials are viscoelastic solids ubiquitously found around us, from clays and cementitious pastes to emulsions and physical gels encountered in food or biomedical engineering. Under an external deformation, these materials undergo a noteworthy transition from a solid to a liquid state that reshapes the material microstructure. This yielding transition was the main theme of a workshop held from January 9 to 13, 2023 at the Lorentz Center in Leiden. The manuscript presented here offers a critical perspective on the subject, synthesizing insights from the various brainstorming sessions and informal discussions that unfolded during this week of vibrant exchange of ideas. The result of these exchanges takes the form of a series of open questions that represent outstanding experimental, numerical, and theoretical challenges to be tackled in the near future.
Submitted 21 December, 2023;
originally announced December 2023.
-
Developing Elementary Federated Learning Algorithms Leveraging the ChatGPT
Authors:
Miroslav Popovic,
Marko Popovic,
Ivan Kastelan,
Miodrag Djukic,
Ilija Basicevic
Abstract:
The Python Testbed for Federated Learning Algorithms is a simple Python FL framework that is easy to use by ML&AI developers who do not need to be professional programmers, and this paper shows that it is also amenable to emerging AI tools. In this paper, we successfully developed three elementary FL algorithms using the following three-step process: (i) specify the context, (ii) ask ChatGPT to complete the server and clients' callback functions, and (iii) verify the generated code.
Submitted 8 January, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
Semi-Supervised Active Learning for Semantic Segmentation in Unknown Environments Using Informative Path Planning
Authors:
Julius Rückin,
Federico Magistri,
Cyrill Stachniss,
Marija Popović
Abstract:
Semantic segmentation enables robots to perceive and reason about their environments beyond geometry. Most such systems build upon deep learning approaches. As autonomous robots are commonly deployed in initially unknown environments, pre-training on static datasets cannot always capture the variety of domains and limits the robot's perception performance during missions. Recently, self-supervised and fully supervised active learning methods emerged to improve a robot's vision. These approaches rely on large in-domain pre-training datasets or require substantial human labelling effort. We propose a planning method for semi-supervised active learning of semantic segmentation that substantially reduces human labelling requirements compared to fully supervised approaches. We leverage an adaptive map-based planner, guided towards the frontiers of unexplored space with high model uncertainty, to collect training data for human labelling. A key aspect of our approach is to combine the sparse high-quality human labels with pseudo labels automatically extracted from highly certain environment map areas. Experimental results show that our method reaches segmentation performance close to fully supervised approaches with drastically reduced human labelling effort while outperforming self-supervised approaches.
Submitted 26 January, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
Domain-Specific Deep Learning Feature Extractor for Diabetic Foot Ulcer Detection
Authors:
Reza Basiri,
Milos R. Popovic,
Shehroz S. Khan
Abstract:
Diabetic Foot Ulcer (DFU) is a condition requiring constant monitoring and evaluations for treatment. The DFU patient population is on the rise and will soon outpace the available health resources. Autonomous monitoring and evaluation of DFU wounds is a much-needed area in health care. In this paper, we evaluate and identify the most accurate feature extractor that is the core basis for developing a deep-learning wound detection network. For the evaluation, we used mAP and F1-score on the publicly available DFU2020 dataset. A combination of UNet and EfficientNetb3 feature extractor resulted in the best evaluation among the 14 networks compared. UNet and EfficientNetb3 can be used as the classifier in the development of a comprehensive DFU domain-specific autonomous wound detection pipeline.
Submitted 27 November, 2023;
originally announced November 2023.
-
Beamforming Performances of Holographic Surfaces
Authors:
Peng Wang,
Majid Nasiri Khormuji,
Branislav M. Popovic
Abstract:
In this paper, we investigate the beamforming performances of holographic surfaces implemented as lossless antenna arrays with less than half-wavelength spacing. We first develop a method to quantify the mutual coupling effect among the antennas in an array. The developed coupling model is general and applicable to arrays with arbitrary distribution of any type of antennas with arbitrary structure, physical size and radiation power pattern. In particular, it reduces to a neat analytical expression for arbitrarily deployed isotropic antenna arrays. We then discuss the beamforming design for holographic surfaces, and in particular provide analytical beamforming characterizations for arrays with two arbitrarily spaced isotropic antennas. Numerical results indicate that, by accounting for the mutual coupling effect between antennas, the array densification by packing more antennas in a given surface aperture can significantly enhance both the beamforming gain and spatial resolution of the system. The beamforming gain enhancement and beamwidth reduction can be several dBs higher than, and more than half of, those achieved by the conventional half-wavelength spaced antenna arrays in the same surface aperture. The gains of densification become saturated when the antenna spacing is below a critical value, and the saturated gain reduces as the surface aperture increases.
Submitted 8 November, 2023;
originally announced November 2023.
-
Synthesizing Diabetic Foot Ulcer Images with Diffusion Model
Authors:
Reza Basiri,
Karim Manji,
Francois Harton,
Alisha Poonja,
Milos R. Popovic,
Shehroz S. Khan
Abstract:
Diabetic Foot Ulcer (DFU) is a serious skin wound requiring specialized care. However, real DFU datasets are limited, hindering clinical training and research activities. In recent years, generative adversarial networks and diffusion models have emerged as powerful tools for generating synthetic images with remarkable realism and diversity in many applications. This paper explores the potential of diffusion models for synthesizing DFU images and evaluates their authenticity through expert clinician assessments. Additionally, evaluation metrics such as Frechet Inception Distance (FID) and Kernel Inception Distance (KID) are examined to assess the quality of the synthetic DFU images. A dataset of 2,000 DFU images is used for training the diffusion model, and the synthetic images are generated by applying diffusion processes. The results indicate that the diffusion model successfully synthesizes visually indistinguishable DFU images. 70% of the time, clinicians marked synthetic DFU images as real DFUs. However, clinicians demonstrate higher unanimous confidence in rating real images than synthetic ones. The study also reveals that FID and KID metrics do not significantly align with clinicians' assessments, suggesting alternative evaluation approaches are needed. The findings highlight the potential of diffusion models for generating synthetic DFU images and their impact on medical training programs and research in wound detection and classification.
Submitted 30 October, 2023;
originally announced October 2023.
-
Transverse Emittance Reduction in Muon Beams by Ionization Cooling
Authors:
The MICE Collaboration,
M. Bogomilov,
R. Tsenov,
G. Vankova-Kirilova,
Y. P. Song,
J. Y. Tang,
Z. H. Li,
R. Bertoni,
M. Bonesini,
F. Chignoli,
R. Mazza,
A. de Bari,
D. Orestano,
L. Tortora,
Y. Kuno,
H. Sakamoto,
A. Sato,
S. Ishimoto,
M. Chung,
C. K. Sung,
F. Filthaut,
M. Fedorov,
D. Jokovic,
D. Maletic,
M. Savic,
et al. (112 additional authors not shown)
Abstract:
Accelerated muon beams have been considered for next-generation studies of high-energy lepton-antilepton collisions and neutrino oscillations. However, high-brightness muon beams have not yet been produced. The main challenge for muon acceleration and storage stems from the large phase-space volume occupied by the beam, derived from the muon production mechanism through the decay of pions from proton collisions. Ionization cooling is the technique proposed to decrease the muon beam phase-space volume. Here we demonstrate a clear signal of ionization cooling through the observation of transverse emittance reduction in beams that traverse lithium hydride or liquid hydrogen absorbers in the Muon Ionization Cooling Experiment (MICE). The measurement is well reproduced by the simulation of the experiment and the theoretical model. The results shown here represent a substantial advance towards the realization of muon-based facilities that could operate at the energy and intensity frontiers.
Submitted 13 October, 2023; v1 submitted 9 October, 2023;
originally announced October 2023.
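The quantity named in the title can be illustrated with a minimal sketch. It assumes the textbook two-dimensional RMS emittance, sqrt(<x^2><x'^2> - <x x'>^2), computed about the beam centroid; the collaboration's own analysis is not described in the abstract and may differ (for example, by using the full transverse phase space). The function name `rms_emittance`, the four-particle sample, and the factor 0.5 applied to the angles are hypothetical stand-ins, not MICE data or MICE physics.

```python
import math

def rms_emittance(x, xp):
    """2D RMS emittance sqrt(<x^2><x'^2> - <x x'>^2) about the beam centroid."""
    n = len(x)
    mean_x = sum(x) / n
    mean_xp = sum(xp) / n
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_xp = sum((b - mean_xp) ** 2 for b in xp) / n
    cov = sum((a - mean_x) * (b - mean_xp) for a, b in zip(x, xp)) / n
    # Clamp tiny negative values caused by floating-point round-off.
    return math.sqrt(max(var_x * var_xp - cov ** 2, 0.0))

# Hypothetical toy sample: positions x and angles x' of four particles.
x = [1.0, -1.0, 0.0, 0.0]
xp = [0.0, 0.0, 1.0, -1.0]

before = rms_emittance(x, xp)
# Stand-in for cooling: shrink every angle by an arbitrary factor of 0.5.
after = rms_emittance(x, [0.5 * b for b in xp])
print(before, after)
```

In this toy case the emittance falls in proportion to the angular spread, which is the sense in which a reduction of the occupied phase-space area counts as cooling.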
-
A Federated Learning Algorithms Development Paradigm
Authors:
Miroslav Popovic,
Marko Popovic,
Ivan Kastelan,
Miodrag Djukic,
Ilija Basicevic
Abstract:
At present many distributed and decentralized frameworks for federated learning algorithms are already available. However, development of such a framework targeting smart Internet of Things in edge systems is still an open challenge. A solution to that challenge named Python Testbed for Federated Learning Algorithms (PTB-FLA) appeared recently. This solution is written in pure Python, it supports both centralized and decentralized algorithms, and its usage was validated and illustrated by three simple algorithm examples. In this paper, we present the federated learning algorithms development paradigm based on PTB-FLA. The paradigm comprises the four phases named by the code they produce: (1) the sequential code, (2) the federated sequential code, (3) the federated sequential code with callbacks, and (4) the PTB-FLA code. The development paradigm is validated and illustrated in the case study on logistic regression, where both centralized and decentralized algorithms are developed.
Submitted 3 December, 2023; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Active Implicit Reconstruction Using One-Shot View Planning
Authors:
Hao Hu,
Sicong Pan,
Liren Jin,
Marija Popović,
Maren Bennewitz
Abstract:
Active object reconstruction using autonomous robots is gaining great interest. A primary goal in this task is to maximize the information of the object to be reconstructed, given limited on-board resources. Previous view planning methods exhibit inefficiency since they rely on an iterative paradigm based on explicit representations, consisting of (1) planning a path to the next-best view only; and (2) requiring a considerable number of less-gain views in terms of surface coverage. To address these limitations, we propose to integrate implicit representations into the One-Shot View Planning (OSVP). The key idea behind our approach is to use implicit representations to obtain the small missing surface areas instead of observing them with extra views. Therefore, we design a deep neural network, named OSVP, to directly predict a set of views given a dense point cloud refined from an initial sparse observation. To train our OSVP network, we generate supervision labels using dense point clouds refined by implicit representations and set covering optimization problems. Simulated experiments show that our method achieves sufficient reconstruction quality, outperforming several baselines under limited view and movement budgets. We further demonstrate the applicability of our approach in a real-world object reconstruction scenario.
Submitted 13 February, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
How Many Views Are Needed to Reconstruct an Unknown Object Using NeRF?
Authors:
Sicong Pan,
Liren Jin,
Hao Hu,
Marija Popović,
Maren Bennewitz
Abstract:
Neural Radiance Fields (NeRFs) are gaining significant interest for online active object reconstruction due to their exceptional memory efficiency and requirement for only posed RGB inputs. Previous NeRF-based view planning methods exhibit computational inefficiency since they rely on an iterative paradigm, consisting of (1) retraining the NeRF when new images arrive; and (2) planning a path to the next best view only. To address these limitations, we propose a non-iterative pipeline based on the Prediction of the Required number of Views (PRV). The key idea behind our approach is that the required number of views to reconstruct an object depends on its complexity. Therefore, we design a deep neural network, named PRVNet, to predict the required number of views, allowing us to tailor the data acquisition based on the object complexity and plan a globally shortest path. To train our PRVNet, we generate supervision labels using the ShapeNet dataset. Simulated experiments show that our PRV-based view planning method outperforms baselines, achieving good reconstruction quality while significantly reducing movement cost and planning time. We further justify the generalization ability of our approach in a real-world experiment.
Submitted 13 February, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
Fine-Resolution Silicon Photonic Wavelength-Selective Switch Using Hybrid Multimode Racetrack Resonators
Authors:
Lucas M. Cohen,
Saleha Fatema,
Vivek V. Wankhade,
Navin B. Lingaraju,
Bohan Zhang,
Deniz Onural,
Milos Popovic,
Andrew M. Weiner
Abstract:
In this work, we describe a procedure for synthesizing racetrack resonators with large quality factors and apply it to realize a multi-channel wavelength-selective switch (WSS) on a silicon photonic chip. We first determine the contribution of each component primitive to propagation loss in a racetrack resonator and use this data to develop a model for the frequency response of arbitrary order, coupled-racetrack channel dropping filters. We design second-order racetrack filters based on this model and cascade multiple such filters to form a 1x7 WSS. We find good agreement between our model and device performance with second-order racetrack filters that have ~1 dB of drop-port loss, ~2 GHz FWHM linewidth, and low optical crosstalk due to the quick filter roll-off of ~5.3 dB/GHz. Using a control algorithm, we show three-channel operation of our WSS with a channel spacing of only 10 GHz. Owing to the high quality factor and quick roll-off of our filter design, adjacent channel crosstalk is measured to be <-25 dB for channels spaced on a 10 GHz grid. As a further demonstration, we use five of seven WSS channels to perform a demultiplexing operation on both an 8 GHz and a 10 GHz grid. These results suggest that a low-loss WSS with fine channel resolution can be realized in a scalable manner using the silicon photonics platform.
Submitted 29 September, 2023;
originally announced September 2023.
-
Perceptual Factors for Environmental Modeling in Robotic Active Perception
Authors:
David Morilla-Cabello,
Jonas Westheider,
Marija Popovic,
Eduardo Montijano
Abstract:
Accurately assessing the potential value of new sensor observations is a critical aspect of planning for active perception. This task is particularly challenging when reasoning about high-level scene understanding using measurements from vision-based neural networks. Due to appearance-based reasoning, the measurements are susceptible to several environmental effects such as the presence of occluders, variations in lighting conditions, and redundancy of information due to similarity in appearance between nearby viewpoints. To address this, we propose a new active perception framework incorporating an arbitrary number of perceptual effects in planning and fusion. Our method models the correlation with the environment by a set of general functions termed perceptual factors to construct a perceptual map, which quantifies the aggregated influence of the environment on candidate viewpoints. This information is seamlessly incorporated into the planning and fusion processes by adjusting the uncertainty associated with measurements to weigh their contributions. We evaluate our perceptual maps in a simulated environment that reproduces environmental conditions common in robotics applications. Our results show that, by accounting for environmental effects within our perceptual maps, we improve state estimation by correctly selecting the viewpoints and correctly accounting for measurement noise when it is affected by environmental factors. We furthermore deploy our approach on a ground robot to showcase its applicability for real-world active perception missions.
Submitted 10 October, 2023; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Permutation Polynomial Interleaved Zadoff-Chu Sequences
Authors:
Fredrik Berggren,
Branislav M. Popovic
Abstract:
Constant amplitude zero autocorrelation (CAZAC) sequences have modulus one and an ideal periodic autocorrelation function. Such sequences are used in cellular radio communications systems, e.g., for reference signals, synchronization signals and random access preambles. We propose a new family of CAZAC sequences, which is constructed by interleaving a Zadoff-Chu sequence by a quadratic permutation polynomial (QPP), or by a permutation polynomial whose inverse is a QPP. It is demonstrated that a set of orthogonal interleaved Zadoff-Chu sequences can be constructed by proper choice of QPPs.
Submitted 26 April, 2024; v1 submitted 28 June, 2023;
originally announced June 2023.
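The construction named in the abstract can be sketched numerically under stated assumptions. The sketch uses the standard odd-length Zadoff-Chu definition x(n) = exp(-j*pi*u*n*(n+1)/N) and a hypothetical QPP, pi(n) = (n + 3n^2) mod 9; the length N = 9, root u = 1 and coefficients (1, 3) are illustrative choices, not parameters from the paper. The conditions under which an interleaved sequence remains CAZAC are derived in the paper and not stated in the abstract, so the code only measures the interleaved sidelobe level rather than asserting it.

```python
import cmath
from math import gcd

def zadoff_chu(u, N):
    """Root-u Zadoff-Chu sequence of odd length N, with u coprime to N."""
    if N % 2 == 0 or gcd(u, N) != 1:
        raise ValueError("this sketch needs odd N and gcd(u, N) == 1")
    return [cmath.exp(-1j * cmath.pi * u * n * (n + 1) / N) for n in range(N)]

def qpp(f1, f2, N):
    """Index map pi(n) = (f1*n + f2*n^2) mod N; a permutation only for suitable f1, f2."""
    return [(f1 * n + f2 * n * n) % N for n in range(N)]

def periodic_autocorr(x):
    """Periodic autocorrelation R(t) = sum_n x(n) * conj(x((n + t) mod N))."""
    N = len(x)
    return [sum(x[n] * x[(n + t) % N].conjugate() for n in range(N))
            for t in range(N)]

N, u = 9, 1                      # hypothetical example parameters
zc = zadoff_chu(u, N)
perm = qpp(1, 3, N)              # hypothetical QPP coefficients
assert sorted(perm) == list(range(N)), "chosen polynomial is not a permutation"

interleaved = [zc[k] for k in perm]

# Largest off-peak autocorrelation magnitude; (numerically) zero for a CAZAC sequence.
sidelobe_zc = max(abs(r) for r in periodic_autocorr(zc)[1:])
sidelobe_interleaved = max(abs(r) for r in periodic_autocorr(interleaved)[1:])
print(perm, sidelobe_zc, sidelobe_interleaved)
```

Interleaving only reorders samples, so the constant-amplitude property is preserved automatically; the autocorrelation property is the part that depends on the choice of polynomial.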
-
Correct orchestration of Federated Learning generic algorithms: formalisation and verification in CSP
Authors:
Ivan Prokić,
Silvia Ghilezan,
Simona Kašterović,
Miroslav Popovic,
Marko Popovic,
Ivan Kaštelan
Abstract:
Federated learning (FL) is a machine learning setting where clients keep the training data decentralised and collaboratively train a model either under the coordination of a central server (centralised FL) or in a peer-to-peer network (decentralised FL). Correct orchestration is one of the main challenges. In this paper, we formally verify the correctness of two generic FL algorithms, a centralised and a decentralised one, using the CSP process calculus and the PAT model checker. The CSP models consist of CSP processes corresponding to generic FL algorithm instances. PAT automatically proves the correctness of the two generic FL algorithms by proving their deadlock freeness (safety property) and successful termination (liveness property). The CSP models are constructed bottom-up by hand as a faithful representation of the real Python code and are automatically checked top-down by PAT.
Submitted 26 June, 2023;
originally announced June 2023.
-
A Simple Python Testbed for Federated Learning Algorithms
Authors:
Miroslav Popovic,
Marko Popovic,
Ivan Kastelan,
Miodrag Djukic,
Silvia Ghilezan
Abstract:
Nowadays many researchers are developing various distributed and decentralized frameworks for federated learning algorithms. However, development of such a framework targeting smart Internet of Things in edge systems is still an open challenge. In this paper, we present our solution to that challenge called Python Testbed for Federated Learning Algorithms. The solution is written in pure Python, and it supports both centralized and decentralized algorithms. The usage of the presented solution is both validated and illustrated by three simple algorithm examples.
Submitted 18 July, 2023; v1 submitted 31 May, 2023;
originally announced May 2023.
-
PSTM Transaction Scheduler Verification Based on CSP and Testing
Authors:
Miroslav Popovic,
Marko Popovic,
Branislav Kordic,
Huibiao Zhu
Abstract:
Many online transaction scheduler architectures and algorithms for various software transactional memories have been designed in order to maintain good system performance even for high concurrency workloads. Most of these algorithms were directly implemented in a target programming language, and experimentally evaluated, without theoretical proofs of correctness and analysis of their performance. Only a small number of these algorithms were modeled using formal methods, such as process algebra CSP, in order to verify that they satisfy properties such as deadlock-freeness and starvation-freeness. However, as this paper shows, using solely formal methods has its disadvantages, too. In this paper, we first analyze the previous CSP model of PSTM transaction scheduler by comparing the model checker PAT results with the manually derived expected results, for the given test workloads. Next, according to the results of this analysis, we correct and extend the CSP model. Finally, based on PAT results for the new CSP model, we analyze the performance of PSTM online transaction scheduling algorithms from the perspective of makespan, number of aborts, and throughput. Based on our findings, we may conclude that for the complete formal verification of trustworthy software, both formal verification and testing must be used jointly.
Submitted 15 May, 2023;
originally announced May 2023.
-
Scaling Description of Dynamical Heterogeneity and Avalanches of Relaxation in Glass-Forming Liquids
Authors:
Ali Tahaei,
Giulio Biroli,
Misaki Ozawa,
Marko Popović,
Matthieu Wyart
Abstract:
We provide a theoretical description of dynamical heterogeneities in glass-forming liquids, based on the premise that relaxation occurs via local rearrangements coupled by elasticity. In our framework, the growth of the dynamical correlation length $ξ$ and of the correlation volume $χ_4$ are controlled by a zero-temperature fixed point. We connect this critical behavior to the properties of the distribution of local energy barriers at zero temperature. Our description makes a direct connection between dynamical heterogeneities and avalanche-type relaxation associated with dynamic facilitation, allowing us to relate the size distribution of heterogeneities to their time evolution. Within an avalanche, a local region relaxes multiple times, and more often the larger the avalanche. This property, related to the nature of the zero-temperature fixed point, directly leads to decoupling of particle diffusion and relaxation time (the so-called Stokes-Einstein violation). Our most salient predictions are tested and confirmed by numerical simulations of scalar and tensorial thermal elasto-plastic models.
Submitted 3 August, 2023; v1 submitted 29 April, 2023;
originally announced May 2023.
-
Supervised and Unsupervised Deep Learning Approaches for EEG Seizure Prediction
Authors:
Zakary Georgis-Yap,
Milos R. Popovic,
Shehroz S. Khan
Abstract:
Epilepsy affects more than 50 million people worldwide, making it one of the world's most prevalent neurological diseases. The main symptom of epilepsy is seizures, which occur abruptly and can cause serious injury or death. The ability to predict the occurrence of an epileptic seizure could alleviate many risks and stresses people with epilepsy face. We formulate the problem of detecting preictal (or pre-seizure) EEG with reference to normal EEG as a precursor to an incoming seizure. To this end, we developed several supervised deep learning approaches to identify preictal EEG from normal EEG. We further develop novel unsupervised deep learning approaches that train the models on only normal EEG and detect pre-seizure EEG as an anomalous event. These deep learning models were trained and evaluated on two large EEG seizure datasets in a person-specific manner. We found that both supervised and unsupervised approaches are feasible; however, their performance varies depending on the patient, approach and architecture. This new line of research has the potential to develop therapeutic interventions and save human lives.
Submitted 3 February, 2024; v1 submitted 24 April, 2023;
originally announced April 2023.
-
Theory of rheology and aging of protein condensates
Authors:
Ryota Takaki,
Louise Jawerth,
Marko Popović,
Frank Jülicher
Abstract:
Biological condensates are assemblies of proteins and nucleic acids that form membraneless compartments in cells and play essential roles in cellular functions. In many cases they exhibit the physical properties of liquid droplets that coexist in a surrounding fluid. Recently, quantitative studies on the material properties of biological condensates have become available, revealing complex material properties. In vitro experiments have shown that protein condensates exhibit time-dependent material properties, similar to aging in glasses. To understand this phenomenon from a theoretical perspective, we develop a rheological model based on the physical picture of protein diffusion and stochastic binding inside condensates. The complex nature of protein interactions is captured by a distribution of binding energies, incorporated in a trap model originally developed to study glass transitions. Our model can describe diffusion of constituent particles, as well as the material response to time-dependent forces, and it recapitulates the age-dependent relaxation time of Maxwell glass observed experimentally both in active and passive rheology. We derive generalized fluctuation-response relations for our model, in which the relaxation function does not obey time translation invariance. Our study sheds light on the complex material properties of biological condensates and provides a theoretical framework for understanding their aging behavior.
Submitted 30 June, 2023; v1 submitted 31 March, 2023;
originally announced March 2023.
-
Graph-based View Motion Planning for Fruit Detection
Authors:
Tobias Zaenker,
Julius Rückin,
Rohit Menon,
Marija Popović,
Maren Bennewitz
Abstract:
Crop monitoring is crucial for maximizing agricultural productivity and efficiency. However, monitoring large and complex structures such as sweet pepper plants presents significant challenges, especially due to frequent occlusions of the fruits. Traditional next-best view planning can lead to unstructured and inefficient coverage of the crops. To address this, we propose a novel view motion planner that builds a graph network of viable view poses and trajectories between nearby poses, thereby considering robot motion constraints. The planner searches the graphs for view sequences with the highest accumulated information gain, allowing for efficient pepper plant monitoring while minimizing occlusions. The generated view poses aim at both sufficiently covering already detected and discovering new fruits. The graph and the corresponding best view pose sequence are computed with a limited horizon and are adaptively updated in fixed time intervals as the system gathers new information. We demonstrate the effectiveness of our approach through simulated and real-world experiments using a robotic arm equipped with an RGB-D camera and mounted on a trolley. As the experimental results show, our planner produces view pose sequences to systematically cover the crops and leads to increased fruit coverage when given a limited time in comparison to a state-of-the-art single next-best view planner.
Submitted 15 August, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
NeU-NBV: Next Best View Planning Using Uncertainty Estimation in Image-Based Neural Rendering
Authors:
Liren Jin,
Xieyuanli Chen,
Julius Rückin,
Marija Popović
Abstract:
Autonomous robotic tasks require actively perceiving the environment to achieve application-specific goals. In this paper, we address the problem of positioning an RGB camera to collect the most informative images to represent an unknown scene, given a limited measurement budget. We propose a novel mapless planning framework to iteratively plan the next best camera view based on collected image measurements. A key aspect of our approach is a new technique for uncertainty estimation in image-based neural rendering, which guides measurement acquisition at the most uncertain view among view candidates, thus maximising the information value during data collection. By incrementally adding new measurements into our image collection, our approach efficiently explores an unknown scene in a mapless manner. We show that our uncertainty estimation is generalisable and valuable for view planning in unknown scenes. Our planning experiments using synthetic and real-world data verify that our uncertainty-guided approach finds informative images leading to more accurate scene representations when compared against baselines.
Submitted 23 July, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning
Authors:
Jonas Westheider,
Julius Rückin,
Marija Popović
Abstract:
Efficient aerial data collection is important in many remote sensing applications. In large-scale monitoring scenarios, deploying a team of unmanned aerial vehicles (UAVs) offers improved spatial coverage and robustness against individual failures. However, a key challenge is cooperative path planning for the UAVs to efficiently achieve a joint mission goal. We propose a novel multi-agent informative path planning approach based on deep reinforcement learning for adaptive terrain monitoring scenarios using UAV teams. We introduce new network feature representations to effectively learn path planning in a 3D workspace. By leveraging a counterfactual baseline, our approach explicitly addresses credit assignment to learn cooperative behaviour. Our experimental evaluation shows improved planning performance, i.e. it maps regions of interest more quickly, with respect to non-counterfactual variants. Results on synthetic and real-world data show that our approach has superior performance compared to state-of-the-art non-learning-based methods, while being transferable to varying team sizes and communication constraints.
Submitted 2 March, 2023;
originally announced March 2023.
-
An Informative Path Planning Framework for Active Learning in UAV-based Semantic Mapping
Authors:
Julius Rückin,
Federico Magistri,
Cyrill Stachniss,
Marija Popović
Abstract:
Unmanned aerial vehicles (UAVs) are frequently used for aerial mapping and general monitoring tasks. Recent progress in deep learning enabled automated semantic segmentation of imagery to facilitate the interpretation of large-scale complex environments. Commonly used supervised deep learning for segmentation relies on large amounts of pixel-wise labelled data, which is tedious and costly to annotate. The domain-specific visual appearance of aerial environments often prevents the usage of models pre-trained on publicly available datasets. To address this, we propose a novel general planning framework for UAVs to autonomously acquire informative training images for model re-training. We leverage multiple acquisition functions and fuse them into probabilistic terrain maps. Our framework combines the mapped acquisition function information into the UAV's planning objectives. In this way, the UAV adaptively acquires informative aerial images to be manually labelled for model re-training. Experimental results on real-world data and in a photorealistic simulation show that our framework maximises model performance and drastically reduces labelling efforts. Our map-based planners outperform state-of-the-art local planning.
Submitted 6 September, 2023; v1 submitted 7 February, 2023;
originally announced February 2023.
-
Electrohydraulic activity of biological cells
Authors:
Marko Popović,
Jacques Prost,
Frank Jülicher
Abstract:
Fluid pumping and the generation of electric current by living tissues are required during morphogenetic processes and for maintenance of homeostasis. How these flows emerge from active and passive ion transport in cells has been well established. However, the interplay between flow and current generation is not well understood. Here, we study the electro-hydraulic coupling that arises from cell ion pumping. We develop a one-dimensional continuum model of fluid and ion transport across active cell membranes. Solving the Nernst-Planck and Poisson equations in the limit of weak charge imbalance allows us to derive approximate analytical solutions of the model. These approximations, consistent with the numerical results in the physiologically relevant parameter regime, allow us to describe electro-hydraulic activity of cells and tissues in terms of experimentally accessible parameters.
Submitted 13 November, 2022;
originally announced November 2022.
-
Photoacoustic characterization of TiO2 thin-films deposited on Silicon substrate using neural networks
Authors:
Katarina Lj Djordjevic,
Dragana K Markushev,
Marica N Popovic,
Mioljub V Nesic,
Slobodanka P Galovic,
Dragan V Lukic,
Dragan D Markushev
Abstract:
In this paper, we analyze the possibility of using neural networks to determine the thermal, elastic and geometric characteristics of a thin TiO2 film deposited on a 30-micron-thick silicon substrate, in the frequency range of 20 Hz to 20 kHz. For this purpose, the substrate parameters were kept known and constant in the two-layer model, and the parameters of the nano-layer thin film were varied: thickness, expansion and thermal diffusivity. Prediction of these three parameters was analyzed separately with three neural networks, and jointly with a fourth neural network. It was shown that the neural network that analyzed all three parameters at the same time achieved the highest accuracy, so networks that provide predictions for only one parameter are less reliable.
Submitted 3 November, 2022;
originally announced November 2022.
-
Random traction yielding transition in epithelial tissues
Authors:
Aboutaleb Amiri,
Charlie Duclut,
Frank Jülicher,
Marko Popović
Abstract:
We investigate how randomly oriented cell traction forces lead to fluidisation in a vertex model of epithelial tissues. We find that the fluidisation occurs at a critical value of the traction force magnitude $F_c$. We show that this transition exhibits critical behaviour, similar to the yielding transition of sheared amorphous solids. However, we find that it belongs to a different universality class, even though it satisfies the same scaling relations between critical exponents established in the yielding transition of sheared amorphous solids. Our work provides a fluidisation mechanism through active force generation that could be relevant in biological tissues.
Submitted 3 November, 2022;
originally announced November 2022.
-
Multiple Coulomb Scattering of muons in Lithium Hydride
Authors:
M. Bogomilov,
R. Tsenov,
G. Vankova-Kirilova,
Y. P. Song,
J. Y. Tang,
Z. H. Li,
R. Bertoni,
M. Bonesini,
F. Chignoli,
R. Mazza,
V. Palladino,
A. de Bari,
D. Orestano,
L. Tortora,
Y. Kuno,
H. Sakamoto,
A. Sato,
S. Ishimoto,
M. Chung,
C. K. Sung,
F. Filthaut,
M. Fedorov,
D. Jokovic,
D. Maletic,
M. Savic
, et al. (112 additional authors not shown)
Abstract:
Multiple Coulomb Scattering (MCS) is a well known phenomenon occurring when charged particles traverse materials. Measurements of muons traversing low $Z$ materials made in the MuScat experiment showed that theoretical models and simulation codes, such as GEANT4 (v7.0), over-estimated the scattering. The Muon Ionization Cooling Experiment (MICE) measured the cooling of a muon beam traversing a liquid hydrogen or lithium hydride (LiH) energy absorber as part of a programme to develop muon accelerator facilities, such as a Neutrino Factory or a Muon Collider. The energy loss and MCS that occur in the absorber material are competing effects that alter the performance of the cooling channel. Therefore measurements of MCS are required in order to validate the simulations used to predict the cooling performance in future accelerator facilities. We report measurements made in the MICE apparatus of MCS using a LiH absorber and muons within the momentum range 160 to 245 MeV/c. The measured RMS scattering width is about 9% smaller than that predicted by the approximate formula proposed by the Particle Data Group. Data at 172, 200 and 240 MeV/c are compared to the GEANT4 (v9.6) default scattering model. These measurements show agreement with this more recent GEANT4 (v9.6) version over the range of incident muon momenta.
Submitted 21 September, 2022;
originally announced September 2022.
-
Regression discontinuity design in perinatal epidemiology and birth cohort research
Authors:
Maja Popovic,
Daniela Zugna,
Lorenzo Richiardi
Abstract:
Regression discontinuity design (RDD) is a quasi-experimental approach to study the causal effects of an intervention/treatment on later health outcomes. It exploits a continuously measured assignment variable with a clearly defined cut-off above or below which the population is at least partially assigned to the intervention/treatment. We describe the RDD and outline the applications of RDD in the context of perinatal epidemiology and birth cohort research.
There is an increasing number of studies using RDD in perinatal and pediatric epidemiology. Most of these studies were conducted in the context of education, social and welfare policies, healthcare organization, insurance, and preventive programs. Additional thematic fields include clinically relevant research questions, shock events, social and environmental factors, and changes in guidelines. Maternal and perinatal characteristics, such as age, birth weight and gestational age are frequently used assignment variables to study the effects of the type and intensity of neonatal care, health insurance, and supplemental newborn benefits. Different socioeconomic measures have been used to study the effects of social, welfare and cash transfer programs, while age or date of birth served as assignment variables to study the effects of vaccination programs, pregnancy-specific guidelines, maternity and paternity leave policies and introduction of newborn-based welfare programs.
RDD has advantages, including relatively weak and testable assumptions, strong internal validity, intuitive interpretation, and transparent and simple graphical representation. However, its use in birth cohort research is hampered by the rarity of settings outside of policy and program evaluations, low statistical power, limited external validity (geographic- and time-specific settings) and potential contamination by other exposures/interventions.
Submitted 23 August, 2022;
originally announced August 2022.
-
3D Lidar Reconstruction with Probabilistic Depth Completion for Robotic Navigation
Authors:
Yifu Tao,
Marija Popović,
Yiduo Wang,
Sundara Tejaswi Digumarti,
Nived Chebrolu,
Maurice Fallon
Abstract:
Safe motion planning in robotics requires planning into space which has been verified to be free of obstacles. However, obtaining such environment representations using lidars is challenging by virtue of the sparsity of their depth measurements. We present a learning-aided 3D lidar reconstruction framework that upsamples sparse lidar depth measurements with the aid of overlapping camera images so as to generate denser reconstructions with more definitively free space than can be achieved with the raw lidar measurements alone. We use a neural network with an encoder-decoder structure to predict dense depth images along with depth uncertainty estimates which are fused using a volumetric mapping system. We conduct experiments on real-world outdoor datasets captured using a handheld sensing device and a legged robot. Using input data from a 16-beam lidar mapping a building network, our experiments showed that the amount of estimated free space was increased by more than 40% with our approach. We also show that our approach trained on a synthetic dataset generalises well to real-world outdoor scenes without additional fine-tuning. Finally, we demonstrate how motion planning tasks can benefit from these denser reconstructions.
Submitted 25 July, 2022;
originally announced July 2022.
-
Joint radar and communications with multicarrier chirp-based waveform
Authors:
Fredrik Berggren,
Branislav M. Popovic
Abstract:
We consider a multicarrier chirp-based waveform for joint radar and communication (JRC) systems and derive its time discrete periodic ambiguity function (AF). An advantage of the waveform is that it includes a set of waveform parameters (e.g., chirp rate) which, together with the transmit sequence, can be selected to flexibly shape the AF to be thumbtack-like, or to be ridge-like, either along the delay axis or the Doppler axis. These shapes are applicable for different use cases, e.g., target detection or time and frequency synchronization. The results show that better signal detection performance than OFDM and DFT-s-OFDM can be achieved on channels with large Doppler frequency. Furthermore, it is shown how transmit sequences can be selected in order to achieve 0 dB peak-to-average-power-ratio (PAPR) of the waveform.
Submitted 2 September, 2022; v1 submitted 9 June, 2022;
originally announced June 2022.
-
Deep Direct Discriminative Decoders for High-dimensional Time-series Data Analysis
Authors:
Mohammad R. Rezaei,
Milos R. Popovic,
Milad Lankarany,
Ali Yousefi
Abstract:
State-space models (SSMs) are widely utilized in the analysis of time-series data. SSMs rely on an explicit definition of the state and observation processes. Characterizing these processes is not always easy and becomes a modeling challenge when the dimension of observed data grows or the observed data distribution deviates from the normal distribution. Here, we propose a new formulation of SSM for high-dimensional observation processes. We call this solution the deep direct discriminative decoder (D4). The D4 brings deep neural networks' expressiveness and scalability to the SSM formulation, letting us build a novel solution that efficiently estimates the underlying state processes through high-dimensional observation signals. We demonstrate the D4 solutions on simulated and real data such as Lorenz attractors, Langevin dynamics, random walk dynamics, and rat hippocampus spiking neural data and show that the D4 performs better than traditional SSMs and RNNs. The D4 can be applied to a broader class of time-series data where the connection between high-dimensional observation and the underlying latent process is hard to characterize.
Submitted 3 July, 2023; v1 submitted 22 May, 2022;
originally announced May 2022.
-
Quantified Reproducibility Assessment of NLP Results
Authors:
Anya Belz,
Maja Popović,
Simon Mille
Abstract:
This paper describes and tests a method for carrying out quantified reproducibility assessment (QRA) that is based on concepts and definitions from metrology. QRA produces a single score estimating the degree of reproducibility of a given system and evaluation measure, on the basis of the scores from, and differences between, different reproductions. We test QRA on 18 system and evaluation measure combinations (involving diverse NLP tasks and types of evaluation), for each of which we have the original results and one to seven reproduction results. The proposed QRA method produces degree-of-reproducibility scores that are comparable across multiple reproductions not only of the same, but of different original studies. We find that the proposed method facilitates insights into causes of variation between reproductions, and allows conclusions to be drawn about what changes to system and/or evaluation design might lead to improved reproducibility.
Submitted 12 April, 2022;
originally announced April 2022.
-
Informative Path Planning for Active Learning in Aerial Semantic Mapping
Authors:
Julius Rückin,
Liren Jin,
Federico Magistri,
Cyrill Stachniss,
Marija Popović
Abstract:
Semantic segmentation of aerial imagery is an important tool for mapping and earth observation. However, supervised deep learning models for segmentation rely on large amounts of high-quality labelled data, which is labour-intensive and time-consuming to generate. To address this, we propose a new approach for using unmanned aerial vehicles (UAVs) to autonomously collect useful data for model training. We exploit a Bayesian approach to estimate model uncertainty in semantic segmentation. During a mission, the semantic predictions and model uncertainty are used as input for terrain mapping. A key aspect of our pipeline is to link the mapped model uncertainty to a robotic planning objective based on active learning. This enables us to adaptively guide a UAV to gather the most informative terrain images to be labelled by a human for model training. Our experimental evaluation on real-world data shows the benefit of using our informative planning approach in comparison to static coverage paths in terms of maximising model performance and reducing labelling efforts.
Submitted 2 September, 2022; v1 submitted 3 March, 2022;
originally announced March 2022.
-
Adaptive Path Planning for UAVs for Multi-Resolution Semantic Segmentation
Authors:
Felix Stache,
Jonas Westheider,
Federico Magistri,
Cyrill Stachniss,
Marija Popović
Abstract:
Efficient data collection methods play a major role in helping us better understand the Earth and its ecosystems. In many applications, the usage of unmanned aerial vehicles (UAVs) for monitoring and remote sensing is rapidly gaining momentum due to their high mobility, low cost, and flexible deployment. A key challenge is planning missions to maximize the value of acquired data in large environments given flight time limitations. This is, for example, relevant for monitoring agricultural fields. This paper addresses the problem of adaptive path planning for accurate semantic segmentation using UAVs. We propose an online planning algorithm which adapts the UAV paths to obtain high-resolution semantic segmentations necessary in areas with fine details as they are detected in incoming images. This enables us to perform close inspections at low altitudes only where required, without wasting energy on exhaustive mapping at maximum image resolution. A key feature of our approach is a new accuracy model for deep learning-based architectures that captures the relationship between UAV altitude and semantic segmentation accuracy. We evaluate our approach on different domains using real-world data, demonstrating the efficacy and generalizability of our solution.
Submitted 3 March, 2022;
originally announced March 2022.
-
Scaling description of creep flow in amorphous solids
Authors:
Marko Popović,
Tom W. J. de Geus,
Wencheng Ji,
Alberto Rosso,
Matthieu Wyart
Abstract:
Amorphous solids such as coffee foam, toothpaste or mayonnaise display a transient creep flow when a stress $\Sigma$ is suddenly imposed. The associated strain rate is commonly found to decay in time as $\dot{\gamma} \sim t^{-\nu}$, followed either by arrest or by a sudden fluidisation. Various empirical laws have been suggested for the creep exponent $\nu$ and fluidisation time $\tau_f$ in experimental and numerical studies. Here, we postulate that plastic flow is governed by the difference between $\Sigma$ and the transient yield stress $\Sigma_t(\gamma)$ that characterises the stability of configurations visited by the system at strain $\gamma$. Assuming the analyticity of $\Sigma_t(\gamma)$ allows us to predict $\nu$ and asymptotic behaviours of $\tau_f$ in terms of properties of stationary flows. We successfully test our predictions using elastoplastic models and published experimental results.
Submitted 6 October, 2022; v1 submitted 7 November, 2021;
originally announced November 2021.
-
Adaptive-Resolution Field Mapping Using Gaussian Process Fusion with Integral Kernels
Authors:
Liren Jin,
Julius Rückin,
Stefan H. Kiss,
Teresa Vidal-Calleja,
Marija Popović
Abstract:
Unmanned aerial vehicles are rapidly gaining popularity in a variety of environmental monitoring tasks. A key requirement for their autonomous operation is the ability to perform efficient environmental mapping online, given limited onboard resources constraining operation time, travel distance, and computational capacity. To address this, we present an online adaptive-resolution approach for mapping terrain based on Gaussian Process fusion. A key aspect of our approach is an integral kernel encoding spatial correlation over the areas of grid cells, which enables modifying map resolution while maintaining correlations in a theoretically sound fashion. This way, we can retain details in areas of interest at higher map resolutions while compressing information in uninteresting areas at coarser resolutions to achieve a compact map representation of the environment. We evaluate the performance of our approach on both synthetic and real-world data. Results show that our method is more efficient in terms of mapping time and memory consumption without compromising on map quality. Finally, we integrate our mapping strategy into an adaptive path planning framework to show that it facilitates information gathering efficiency in online settings.
Submitted 3 March, 2022; v1 submitted 29 September, 2021;
originally announced September 2021.
-
Adaptive Informative Path Planning Using Deep Reinforcement Learning for UAV-based Active Sensing
Authors:
Julius Rückin,
Liren Jin,
Marija Popović
Abstract:
Aerial robots are increasingly being utilized for environmental monitoring and exploration. However, a key challenge is efficiently planning paths to maximize the information value of acquired data as an initially unknown environment is explored. To address this, we propose a new approach for informative path planning based on deep reinforcement learning (RL). Combining recent advances in RL and robotic applications, our method combines tree search with an offline-learned neural network predicting informative sensing actions. We introduce several components making our approach applicable for robotic tasks with high-dimensional state and large action spaces. By deploying the trained network during a mission, our method enables sample-efficient online replanning on platforms with limited computational resources. Simulations show that our approach performs on par with existing methods while reducing runtime by 8-10x. We validate its performance using real-world surface temperature data.
Submitted 3 March, 2022; v1 submitted 28 September, 2021;
originally announced September 2021.
-
Adaptive Path Planning for UAV-based Multi-Resolution Semantic Segmentation
Authors:
Felix Stache,
Jonas Westheider,
Federico Magistri,
Marija Popović,
Cyrill Stachniss
Abstract:
In this paper, we address the problem of adaptive path planning for accurate semantic segmentation of terrain using unmanned aerial vehicles (UAVs). The usage of UAVs for terrain monitoring and remote sensing is rapidly gaining momentum due to their high mobility, low cost, and flexible deployment. However, a key challenge is planning missions to maximize the value of acquired data in large environments given flight time limitations. To address this, we propose an online planning algorithm which adapts the UAV paths to obtain high-resolution semantic segmentations necessary in areas on the terrain with fine details as they are detected in incoming images. This enables us to perform close inspections at low altitudes only where required, without wasting energy on exhaustive mapping at maximum resolution. A key feature of our approach is a new accuracy model for deep learning-based architectures that captures the relationship between UAV altitude and semantic segmentation accuracy. We evaluate our approach on the application of crop/weed segmentation in precision agriculture using real-world field data.
Submitted 4 August, 2021;
originally announced August 2021.
-
Thermodynamics of decoherence
Authors:
Maria Popovic,
Mark T. Mitchison,
John Goold
Abstract:
We investigate the nonequilibrium thermodynamics of pure decoherence. In a pure decoherence process, the system Hamiltonian is a constant of motion and there is no direct energy exchange between the system and its surroundings. Nevertheless, the environment's energy is not generally conserved and in this work we show that this leads to nontrivial heat dissipation as a result of decoherence alone. This heat has some very distinctive properties: it obeys an integral fluctuation relation and can be interpreted in terms of the entropy production associated with populations in the energy eigenbasis of the initial state. We show that the heat distribution for a pure decoherence process is different from the distribution of work done by the initial system-bath interaction quench. Instead, it corresponds to a mixture of work distributions of cyclical processes, each conditioned on a state of the open system. Inspired by recent experiments on impurities in ultra-cold gases, we demonstrate our general results by studying the heat generated by the decoherence of a qubit immersed within a degenerate Fermi gas in the lowest band of a species-selective optical lattice.
Submitted 19 April, 2023; v1 submitted 29 July, 2021;
originally announced July 2021.
-
Generating Gender Augmented Data for NLP
Authors:
Nishtha Jain,
Maja Popovic,
Declan Groves,
Eva Vanmassenhove
Abstract:
Gender bias is a frequent occurrence in NLP-based applications, especially pronounced in gender-inflected languages. Bias can appear through associations of certain adjectives and animate nouns with the natural gender of referents, but also due to unbalanced grammatical gender frequencies of inflected words. This type of bias becomes more evident in generating conversational utterances where gender is not specified within the sentence, because most current NLP applications still work on a sentence-level context. As a step towards more inclusive NLP, this paper proposes an automatic and generalisable rewriting approach for short conversational sentences. The rewriting method can be applied to sentences that, without extra-sentential context, have multiple equivalent alternatives in terms of gender. The method can be applied both for creating gender balanced outputs as well as for creating gender balanced training data. The proposed approach is based on a neural machine translation (NMT) system trained to 'translate' from one gender alternative to another. Both the automatic and manual analysis of the approach show promising results for automatic generation of gender alternatives for conversational sentences in Spanish.
Submitted 13 July, 2021;
originally announced July 2021.