-
Random forests for detecting weak signals and extracting physical information: a case study of magnetic navigation
Authors:
Mohammadamin Moradi,
Zheng-Meng Zhai,
Aaron Nielsen,
Ying-Cheng Lai
Abstract:
It was recently demonstrated that two machine-learning architectures, reservoir computing and time-delayed feed-forward neural networks, can be exploited for detecting the Earth's anomaly magnetic field immersed in overwhelming complex signals for magnetic navigation in a GPS-denied environment. The accuracy of the detected anomaly field corresponds to a positioning accuracy in the range of 10 to 40 meters. To increase the accuracy and reduce the uncertainty of weak signal detection as well as to directly obtain the position information, we exploit the machine-learning model of random forests that combines the output of multiple decision trees to give optimal values of the physical quantities of interest. In particular, from time-series data gathered from the cockpit of a flying airplane during various maneuvering stages, where strong background complex signals are caused by other elements of the Earth's magnetic field and the fields produced by the electronic systems in the cockpit, we demonstrate that the random-forest algorithm performs remarkably well in detecting the weak anomaly field and in filtering the position of the aircraft. With the aid of the conventional inertial navigation system, the positioning error can be reduced to less than 10 meters. We also find that, contrary to the conventional wisdom, the classic Tolles-Lawson model for calibrating and removing the magnetic field generated by the body of the aircraft is not necessary and may even be detrimental for the success of the random-forest method.
Submitted 21 February, 2024;
originally announced February 2024.
-
Deep Learning Based Simulators for the Phosphorus Removal Process Control in Wastewater Treatment via Deep Reinforcement Learning Algorithms
Authors:
Esmaeel Mohammadi,
Mikkel Stokholm-Bjerregaard,
Aviaja Anna Hansen,
Per Halkjær Nielsen,
Daniel Ortiz-Arroyo,
Petar Durdevic
Abstract:
Phosphorus removal is vital in wastewater treatment to reduce reliance on limited resources. Deep reinforcement learning (DRL) is a machine learning technique that can optimize complex and nonlinear systems, including the processes in wastewater treatment plants, by learning control policies through trial and error. However, applying DRL to chemical and biological processes is challenging due to the need for accurate simulators. This study trained six models to identify the phosphorus removal process and used them to create a simulator for the DRL environment. Although the models achieved high accuracy (>97%), uncertainty and incorrect prediction behavior limited their performance as simulators over longer horizons. Compounding errors in the models' predictions were identified as one of the causes of this problem. This approach for improving process control involves creating simulation environments for DRL algorithms, using data from supervisory control and data acquisition (SCADA) systems with a sufficient historical horizon without complex system modeling or parameter estimation.
Submitted 23 January, 2024;
originally announced January 2024.
-
Diffeomorphic Multi-Resolution Deep Learning Registration for Applications in Breast MRI
Authors:
Matthew G. French,
Gonzalo D. Maso Talou,
Thiranja P. Babarenda Gamage,
Martyn P. Nash,
Poul M. Nielsen,
Anthony J. Doyle,
Juan Eugenio Iglesias,
Yaël Balbastre,
Sean I. Young
Abstract:
In breast surgical planning, accurate registration of MR images across patient positions has the potential to improve the localisation of tumours during breast cancer treatment. While learning-based registration methods have recently become the state-of-the-art approach for most medical image registration tasks, these methods have yet to make inroads into breast image registration due to certain difficulties: the lack of rich texture information in breast MR images and the need for the deformations to be diffeomorphic. In this work, we propose learning strategies for breast MR image registration that are amenable to diffeomorphic constraints, together with early experimental results from in-silico and in-vivo experiments. One key contribution of this work is a registration network which produces superior registration outcomes for breast images in addition to providing diffeomorphic guarantees.
Submitted 4 October, 2023; v1 submitted 24 September, 2023;
originally announced September 2023.
-
A Semi-Automated Solution Approach Recommender for a Given Use Case: a Case Study for AI/ML in Oncology via Scopus and OpenAI
Authors:
Deniz Kenan Kılıç,
Alex Elkjær Vasegaard,
Aurélien Desoeuvres,
Peter Nielsen
Abstract:
Nowadays, literature review is a necessary task when trying to solve a given problem. However, an exhaustive literature review is very time-consuming in today's vast literature landscape. It can take weeks, even if looking only for abstracts or surveys. Moreover, choosing a method among others, and targeting searches within relevant problem and solution domains, are not easy tasks. This is especially true for young researchers or engineers starting to work in their field. Even if surveys that provide methods used to solve a specific problem already exist, an automatic way to do it for any use case is missing, especially for those who don't know the existing literature. Our proposed tool, SARBOLD-LLM, allows discovering and choosing among methods related to a given problem, providing additional information about their uses in the literature to derive decision-making insights, in only a few hours. The SARBOLD-LLM comprises three modules: (1: Scopus search) paper selection using a keyword selection scheme to query Scopus API; (2: Scoring and method extraction) relevancy and popularity scores calculation and solution method extraction in papers utilizing OpenAI API (GPT 3.5); (3: Analyses) sensitivity analysis and post-analyses which reveal trends, relevant papers and methods. Comparing the SARBOLD-LLM to manual ground truth using precision, recall, and F1-score metrics, the performance results of AI in the oncology case study are 0.68, 0.9, and 0.77, respectively. SARBOLD-LLM demonstrates successful outcomes across various domains, showcasing its robustness and effectiveness. The SARBOLD-LLM addresses engineers more than researchers, as it proposes methods and trends without adding pros and cons. It is a useful tool to select which methods to investigate first and comes as a complement to surveys. This can limit the global search and accumulation of knowledge for the end user. However...
Submitted 15 May, 2024; v1 submitted 10 July, 2023;
originally announced July 2023.
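The three scores quoted in the SARBOLD-LLM abstract can be checked against each other, since F1 is the harmonic mean of precision and recall. The following is a minimal sketch of that arithmetic only; the function name `f1_score` is an illustrative assumption and is not taken from the paper or its tooling.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        return 0.0  # convention: F1 is 0 when both inputs are 0
    return 2 * precision * recall / (precision + recall)


# Values as reported in the abstract for the oncology case study.
reported_precision = 0.68
reported_recall = 0.9

f1 = f1_score(reported_precision, reported_recall)
print(round(f1, 2))  # 0.77, matching the reported F1-score
```

The computed value agrees with the reported 0.77 after rounding to two decimals.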
-
Smart Meter Data Anomaly Detection using Variational Recurrent Autoencoders with Attention
Authors:
Wenjing Dai,
Xiufeng Liu,
Alfred Heller,
Per Sieverts Nielsen
Abstract:
In the digitization of energy systems, sensors and smart meters are increasingly being used to monitor production, operation and demand. Detection of anomalies based on smart meter data is crucial to identify potential risks and unusual events at an early stage, which can serve as a reference for timely initiation of appropriate actions and improving management. However, smart meter data from energy systems often lack labels and contain noise and various patterns without distinct cyclicity. Meanwhile, the vague definition of anomalies in different energy scenarios and highly complex temporal correlations pose a great challenge for anomaly detection. Many traditional unsupervised anomaly detection algorithms such as cluster-based or distance-based models are not robust to noise and do not fully exploit the temporal dependency in a time series as well as other dependencies amongst multiple variables (sensors). This paper proposes an unsupervised anomaly detection method based on a Variational Recurrent Autoencoder with an attention mechanism. With "dirty" data from smart meters, our method pre-detects missing values and global anomalies to shrink their contribution while training. This paper makes a quantitative comparison with the VAE-based baseline approach and four other unsupervised learning methods, demonstrating its effectiveness and superiority. This paper further validates the proposed method by a real case study of detecting the anomalies of water supply temperature from an industrial heating plant.
Submitted 8 June, 2022;
originally announced June 2022.
-
On the Fitness Landscapes of Interdependency Models in the Travelling Thief Problem
Authors:
Mohamed El Yafrani,
Marcella Scoczynski,
Myriam Delgado,
Ricardo Lüders,
Peter Nielsen,
Markus Wagner
Abstract:
Since its inception in 2013, the Travelling Thief Problem (TTP) has been widely studied as an example of problems with multiple interconnected sub-problems. The dependency in this model arises when tying the travelling time of the "thief" to the weight of the knapsack. However, other forms of dependency as well as combinations of dependencies should be considered for investigation, as they are often found in complex real-world problems. Our goal is to study the impact of different forms of dependency in the TTP using a simple local search algorithm. To achieve this, we use Local Optima Networks, a technique for analysing the fitness landscape.
Submitted 28 February, 2022;
originally announced March 2022.
-
Digital Global Public Goods
Authors:
Johan Ivar Sæbø,
Brian Nicholson,
Petter Nielsen,
Sundeep Sahay
Abstract:
The purpose of this paper is to define and conceptualize digital global public goods (DGPGs) and illustrate the importance of contextual relevance in ICT4D projects. Recent studies have examined the importance of digital artefacts with public goods traits, emphasizing the significant potential for socio-economic development. However, we know little about the theoretical and practical dimensions of how we can align the public goods traits of such artefacts to create relevance in the context in which they are implemented. To address this gap we review the literature firstly to develop a definition and conceptual basis of DGPGs and then to illustrate the importance of relevance: how to align DGPGs with context to meet local needs. The illustration draws from a case study of the District Health Information systems (DHIS2). The paper advances both the theoretical and practical understanding of DGPGs in development processes.
Submitted 22 August, 2021;
originally announced August 2021.
-
Resilient ICT4D: Building and Sustaining our Community in Pandemic Times
Authors:
Silvia Masiero,
Petter Nielsen
Abstract:
The impacts of the COVID-19 pandemic, disproportionally affecting vulnerable people and deepening pre-existing inequalities (Dreze, 2020; Qureshi, 2021), have interested the very same "development" processes that the IFIP Working Group 9.4 on the Implications of Information and Digital Technologies for Development has dealt with over time. A global development paradigm (Oldekop et al., 2020) has emerged in response to the global nature of the crisis, infusing new meaning in the spirit of "making a better world" with ICTs (Walsham, 2012) that has always characterised ICT4D research. Such a new meaning contextualises our research in the landscape of the first pandemic of the datafied society (Milan & Trere, 2020), coming to terms with the silencing of narratives from the margins within the pandemic (Milan et al., 2021) - in Qureshi's (2021) words, a "pandemics within the pandemic" producing new socio-economic inequities in a state of global emergency.
Submitted 22 August, 2021;
originally announced August 2021.
-
SEGSys: A mapping system for segmentation analysis in energy
Authors:
Xiufeng Liu,
Rongling Li,
Yi Wang,
Per Sieverts Nielsen
Abstract:
Customer segmentation analysis can give valuable insights into the energy efficiency of residential buildings. This paper presents a mapping system, SEGSys, that enables segmentation analysis at the individual and the neighborhood levels. SEGSys supports the online and offline classification of customers based on their daily consumption patterns and consumption intensity. It also supports the segmentation analysis according to the social characteristics of customers of individual households or neighborhoods, as well as spatial geometries. SEGSys uses a three-layer architecture to model the segmentation system, including the data layer, the service layer, and the presentation layer. The data layer models data into a star schema within a data warehouse, the service layer provides data service through a RESTful interface, and the presentation layer interacts with users through a visual map. This paper showcases the system on the segmentation analysis using an electricity consumption data set and validates the effectiveness of the system.
Submitted 11 December, 2020;
originally announced December 2020.
-
Signal Enhancement for Magnetic Navigation Challenge Problem
Authors:
Albert R. Gnadt,
Joseph Belarge,
Aaron Canciani,
Glenn Carl,
Lauren Conger,
Joseph Curro,
Alan Edelman,
Peter Morales,
Aaron P. Nielsen,
Michael F. O'Keeffe,
Christopher V. Rackauckas,
Jonathan Taylor,
Allan B. Wollaber
Abstract:
Harnessing the magnetic field of the Earth for navigation has shown promise as a viable alternative to other navigation systems. A magnetic navigation system collects its own magnetic field data using a magnetometer and uses magnetic anomaly maps to determine the current location. The greatest challenge with magnetic navigation arises when the magnetic field measurements from the magnetometer encompass the magnetic field from not just the Earth, but also from the vehicle on which it is mounted. It is difficult to separate the Earth magnetic anomaly field, which is crucial for navigation, from the total magnetic field reading from the sensor. The purpose of this challenge problem is to decouple the Earth and aircraft magnetic signals in order to derive a clean signal from which to perform magnetic navigation. Baseline testing on the dataset has shown that the Earth magnetic field can be extracted from the total magnetic field using machine learning (ML). The challenge is to remove the aircraft magnetic field from the total magnetic field using a trained model. This challenge offers an opportunity to construct an effective model for removing the aircraft magnetic field from the dataset by using a scientific machine learning (SciML) approach comprised of an ML algorithm integrated with the physics of magnetic navigation.
Submitted 6 January, 2023; v1 submitted 23 July, 2020;
originally announced July 2020.
-
MATE: A Model-based Algorithm Tuning Engine
Authors:
Mohamed El Yafrani,
Marcella Scoczynski Ribeiro Martins,
Inkyung Sung,
Markus Wagner,
Carola Doerr,
Peter Nielsen
Abstract:
In this paper, we introduce a Model-based Algorithm Tuning Engine, namely MATE, where the parameters of an algorithm are represented as expressions of the features of a target optimisation problem. In contrast to most static (feature-independent) algorithm tuning engines such as irace and SPOT, our approach aims to derive the best parameter configuration of a given algorithm for a specific problem, exploiting the relationships between the algorithm parameters and the features of the problem. We formulate the problem of finding the relationships between the parameters and the problem features as a symbolic regression problem and we use genetic programming to extract these expressions. For the evaluation, we apply our approach to configuration of the (1+1) EA and RLS algorithms for the OneMax, LeadingOnes, BinValue and Jump optimisation problems, where the theoretically optimal algorithm parameters to the problems are available as functions of the features of the problems. Our study shows that the found relationships typically comply with known theoretical results, thus demonstrating a new opportunity to consider model-based parameter tuning as an effective alternative to the static algorithm tuning engines.
Submitted 15 February, 2021; v1 submitted 27 April, 2020;
originally announced April 2020.
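The idea in the MATE abstract of expressing an algorithm parameter as a function of a problem feature can be illustrated with the (1+1) EA on OneMax. The sketch below is not MATE itself and involves no symbolic regression: it simply passes the mutation rate as a callable of the problem size n and uses the textbook choice 1/n. The names `one_plus_one_ea` and `standard_rate`, the evaluation budget, and the seed are illustrative assumptions.

```python
import random


def one_max(bits):
    """OneMax fitness: the number of ones in the bit string."""
    return sum(bits)


def standard_rate(n):
    """Textbook mutation rate 1/n, written as a function of problem size."""
    return 1.0 / n


def one_plus_one_ea(n, mutation_rate, rng, max_evals=100_000):
    """(1+1) EA: flip each bit independently, keep the child if not worse."""
    parent = [rng.randint(0, 1) for _ in range(n)]
    evals = 1
    rate = mutation_rate(n)  # parameter derived from the feature n
    while one_max(parent) < n and evals < max_evals:
        child = [b ^ 1 if rng.random() < rate else b for b in parent]
        evals += 1
        if one_max(child) >= one_max(parent):
            parent = child
    return parent, evals


rng = random.Random(0)  # fixed seed so the run is repeatable
best, used = one_plus_one_ea(30, standard_rate, rng)
print(one_max(best), used)
```

Swapping `standard_rate` for another callable of n is enough to try a different feature-dependent configuration on the same problem.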
-
Batch 2: Definition of novel Weather & Climate Dwarfs
Authors:
Andreas Müller,
Mike Gillard,
Kristian Pagh Nielsen,
Zbigniew Piotrowski
Abstract:
This document is one of the deliverable reports created for the ESCAPE project. ESCAPE stands for Energy-efficient Scalable Algorithms for Weather Prediction at Exascale. The project develops world-class, extreme-scale computing capabilities for European operational numerical weather prediction and future climate models. This is done by identifying weather & climate dwarfs which are key patterns in terms of computation and communication (in the spirit of the Berkeley dwarfs). These dwarfs are then optimised for different hardware architectures (single and multi-node) and alternative algorithms are explored. Performance portability is addressed through the use of domain specific languages.
This deliverable contains the description of the characteristics of a second set of so-called numerical weather & climate prediction dwarfs that form key functional components of prediction models in terms of the science that they encapsulate and in terms of the computational cost they impose on the forecast production. The ESCAPE work flow between work packages centres on these dwarfs and hence their selection, their performance assessment, code adaptation and optimisation are crucial for the success of the project. These new dwarfs have been chosen with the purpose of extending the range of computational characteristics represented by the dwarfs previously selected in batch 1 (see Deliverable D1.1). The dwarfs have been made, their documentation has been compiled and the software has been made available on the software exchange platform.
The dwarfs in this deliverable include a multigrid elliptic solver, a novel advection scheme for unstructured meshes, an advection scheme for structured meshes and a radiation scheme. This deliverable includes their scientific description and the guidance for installation, execution and testing.
Submitted 16 August, 2019;
originally announced August 2019.
-
Instance Scale, Numerical Properties and Design of Metaheuristics: A Study for the Facility Location Problem
Authors:
David Chalupa,
Peter Nielsen
Abstract:
Metaheuristics are known to be strong in solving large-scale instances of computationally hard problems. However, their efficiency still needs exploration in the context of instance structure, scale and numerical properties for many of these problems. In this paper, we present an in-depth computational study of two local search metaheuristics for the classical uncapacitated facility location problem. We investigate four problem instance models, studied for the same problem size, for which the two metaheuristics exhibit intriguing and contrasting behaviours. The metaheuristics explored include a local search (LS) algorithm that chooses the best moves in the current neighbourhood, while a randomised local search (RLS) algorithm chooses the first move that does not lead to a worsening. The experimental results indicate that the right choice between these two algorithms depends heavily on the distribution of coefficients within the problem instance. This is also put further into context by finding optimal or near-optimal solutions using a mixed-integer linear programming problem solver. Since the facility location problem is a relatively simple example of a choice-and-assignment problem, similar phenomena are likely to be discovered in a number of other, possibly more complex computational problems in science and engineering.
Submitted 10 January, 2018;
originally announced January 2018.
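The contrast drawn in the facility location abstract between a local search that takes the best move in the neighbourhood and a randomised local search that takes a non-worsening move can be sketched on a toy uncapacitated instance. Everything below is an illustrative assumption rather than the authors' setup: the single-facility flip neighbourhood, the toy costs, the step budget, and the RLS variant, which samples one random flip per step and accepts it when the cost does not increase.

```python
import random


def uflp_cost(open_mask, open_cost, assign_cost):
    """Opening costs plus each customer's cheapest open facility."""
    opened = [i for i, is_open in enumerate(open_mask) if is_open]
    if not opened:
        return float("inf")  # at least one facility must be open
    fixed = sum(open_cost[i] for i in opened)
    service = sum(min(row[i] for i in opened) for row in assign_cost)
    return fixed + service


def flip(mask, i):
    """Return a copy of mask with facility i toggled."""
    new = list(mask)
    new[i] = not new[i]
    return new


def best_improvement_ls(mask, open_cost, assign_cost):
    """LS: scan all single flips and take the best strictly improving one."""
    current = uflp_cost(mask, open_cost, assign_cost)
    while True:
        candidates = [flip(mask, i) for i in range(len(mask))]
        best = min(candidates, key=lambda m: uflp_cost(m, open_cost, assign_cost))
        best_cost = uflp_cost(best, open_cost, assign_cost)
        if best_cost >= current:
            return mask, current  # local optimum reached
        mask, current = best, best_cost


def randomised_ls(mask, open_cost, assign_cost, steps, rng):
    """RLS variant: try one random flip per step, accept if not worse."""
    current = uflp_cost(mask, open_cost, assign_cost)
    for _ in range(steps):
        cand = flip(mask, rng.randrange(len(mask)))
        cand_cost = uflp_cost(cand, open_cost, assign_cost)
        if cand_cost <= current:
            mask, current = cand, cand_cost
    return mask, current


# Hypothetical toy instance: 3 facilities (columns), 4 customers (rows).
open_cost = [4, 3, 6]
assign_cost = [[1, 6, 7], [6, 1, 7], [7, 6, 1], [2, 2, 8]]
start = [True, True, True]

ls_mask, ls_cost = best_improvement_ls(start, open_cost, assign_cost)
rls_mask, rls_cost = randomised_ls(start, open_cost, assign_cost, 200, random.Random(1))
print(ls_mask, ls_cost)
print(rls_mask, rls_cost)
```

On this small instance both procedures settle on the same solution; the abstract's point is that on larger instances the better choice depends on how the cost coefficients are distributed.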
-
A Hybrid ICT-Solution for Smart Meter Data Analytics
Authors:
Xiufeng Liu,
Per Sieverts Nielsen
Abstract:
Smart meters are increasingly used worldwide. Smart meters are advanced meters capable of measuring energy consumption at a fine-grained time interval, e.g., every 15 minutes. Smart meter data are typically bundled with socio-economic data in analytics, such as meter geographic locations, weather conditions and user information, which makes the data sets very sizable and the analytics complex. Data mining and emerging cloud computing technologies make collecting, processing, and analyzing the so-called big data possible. This paper proposes an innovative ICT-solution to streamline smart meter data analytics. The proposed solution offers an information integration pipeline for ingesting data from smart meters, a scalable platform for processing and mining big data sets, and a web portal for visualizing analytics results. The implemented system has a hybrid architecture that uses Spark or Hive for big data processing, and the machine learning toolkit MADlib for in-database data analytics in a PostgreSQL database. This paper evaluates the key technologies of the proposed ICT-solution, and the results show the effectiveness and efficiency of using the system for both batch and online analytics.
Submitted 18 June, 2016;
originally announced June 2016.
-
Regression-based Online Anomaly Detection for Smart Grid Data
Authors:
Xiufeng Liu,
Per Sieverts Nielsen
Abstract:
With smart meters widely used in the energy sector, anomaly detection becomes a crucial means to study the unusual consumption behaviors of customers, and to promptly discover unexpected events in energy use. Detecting consumption anomalies is, essentially, a real-time big data analytics problem, which does data mining on a large amount of parallel data streams from smart meters. In this paper, we propose a supervised learning and statistical-based anomaly detection method, and implement a Lambda system using the in-memory distributed computing framework, Spark and its extension Spark Streaming. The system supports not only iterative detection model refreshment from scalable data sets, but also real-time detection on scalable live data streams. This paper empirically evaluates the system and the detection algorithm, and the results show the effectiveness and the scalability of the proposed lambda detection system.
Submitted 18 June, 2016;
originally announced June 2016.