-
New Gravitational Wave Discoveries Enabled by Machine Learning
Authors:
Alexandra E. Koloniari,
Evdokia C. Koursoumpa,
Paraskevi Nousi,
Paraskevas Lampropoulos,
Nikolaos Passalis,
Anastasios Tefas,
Nikolaos Stergioulas
Abstract:
The detection of gravitational waves has revolutionized our understanding of the universe, offering unprecedented insights into its dynamics. A major goal of gravitational wave data analysis is to speed up the detection and parameter estimation process using machine learning techniques, in light of an anticipated surge in detected events that would render traditional methods impractical. Here, we…
▽ More
The detection of gravitational waves has revolutionized our understanding of the universe, offering unprecedented insights into its dynamics. A major goal of gravitational wave data analysis is to speed up the detection and parameter estimation process using machine learning techniques, in light of an anticipated surge in detected events that would render traditional methods impractical. Here, we present the first detections of new gravitational-wave candidate events in data from a network of interferometric detectors enabled by machine learning. We discuss several new enhancements of our ResNet-based deep learning code, AresGW, that increased its sensitivity, including a new hierarchical classification of triggers, based on different noise and frequency filters. The enhancements resulted in a significant reduction in the false alarm rate, allowing AresGW to surpass traditional pipelines in the number of detected events in its effective training range (single source masses between 7 and 50 solar masses and source chirp masses between 10 and 40 solar masses), when the new detections are included. We calculate the astrophysical significance of events detected with AresGW using a logarithmic ranking statistic and injections into O3 data. Furthermore, we present spectrograms, parameter estimation, and reconstruction in the time domain for our new candidate events and discuss the distribution of their properties. In addition, the AresGW code exhibited very good performance when tested across various two-detector setups and on observational data from the O1 and O2 observing periods. Our findings underscore the remarkable potential of AresGW as a fast and sensitive detection algorithm for gravitational-wave astronomy, paving the way for a larger number of future discoveries.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Leveraging Deep Learning and Online Source Sentiment for Financial Portfolio Management
Authors:
Paraskevi Nousi,
Loukia Avramelou,
Georgios Rodinos,
Maria Tzelepi,
Theodoros Manousis,
Konstantinos Tsampazis,
Kyriakos Stefanidis,
Dimitris Spanos,
Manos Kirtas,
Pavlos Tosidis,
Avraam Tsantekidis,
Nikolaos Passalis,
Anastasios Tefas
Abstract:
Financial portfolio management describes the task of distributing funds and conducting trading operations on a set of financial assets, such as stocks, index funds, foreign exchange or cryptocurrencies, aiming to maximize the profit while minimizing the loss incurred by said operations. Deep Learning (DL) methods have been consistently excelling at various tasks and automated financial trading is…
▽ More
Financial portfolio management describes the task of distributing funds and conducting trading operations on a set of financial assets, such as stocks, index funds, foreign exchange or cryptocurrencies, aiming to maximize the profit while minimizing the loss incurred by said operations. Deep Learning (DL) methods have been consistently excelling at various tasks and automated financial trading is one of the most complex one of those. This paper aims to provide insight into various DL methods for financial trading, under both the supervised and reinforcement learning schemes. At the same time, taking into consideration sentiment information regarding the traded assets, we discuss and demonstrate their usefulness through corresponding research studies. Finally, we discuss commonly found problems in training such financial agents and equip the reader with the necessary knowledge to avoid these problems and apply the discussed methods in practice.
△ Less
Submitted 24 October, 2023; v1 submitted 23 July, 2023;
originally announced September 2023.
-
Deep Learning for Energy Time-Series Analysis and Forecasting
Authors:
Maria Tzelepi,
Charalampos Symeonidis,
Paraskevi Nousi,
Efstratios Kakaletsis,
Theodoros Manousis,
Pavlos Tosidis,
Nikos Nikolaidis,
Anastasios Tefas
Abstract:
Energy time-series analysis describes the process of analyzing past energy observations and possibly external factors so as to predict the future. Different tasks are involved in the general field of energy time-series analysis and forecasting, with electric load demand forecasting, personalized energy consumption forecasting, as well as renewable energy generation forecasting being among the most…
▽ More
Energy time-series analysis describes the process of analyzing past energy observations and possibly external factors so as to predict the future. Different tasks are involved in the general field of energy time-series analysis and forecasting, with electric load demand forecasting, personalized energy consumption forecasting, as well as renewable energy generation forecasting being among the most common ones. Following the exceptional performance of Deep Learning (DL) in a broad area of vision tasks, DL models have successfully been utilized in time-series forecasting tasks. This paper aims to provide insight into various DL methods geared towards improving the performance in energy time-series forecasting tasks, with special emphasis in Greek Energy Market, and equip the reader with the necessary knowledge to apply these methods in practice.
△ Less
Submitted 29 June, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Variational Voxel Pseudo Image Tracking
Authors:
Illia Oleksiienko,
Paraskevi Nousi,
Nikolaos Passalis,
Anastasios Tefas,
Alexandros Iosifidis
Abstract:
Uncertainty estimation is an important task for critical problems, such as robotics and autonomous driving, because it allows creating statistically better perception models and signaling the model's certainty in its predictions to the decision method or a human supervisor. In this paper, we propose a Variational Neural Network-based version of a Voxel Pseudo Image Tracking (VPIT) method for 3D Si…
▽ More
Uncertainty estimation is an important task for critical problems, such as robotics and autonomous driving, because it allows creating statistically better perception models and signaling the model's certainty in its predictions to the decision method or a human supervisor. In this paper, we propose a Variational Neural Network-based version of a Voxel Pseudo Image Tracking (VPIT) method for 3D Single Object Tracking. The Variational Feature Generation Network of the proposed Variational VPIT computes features for target and search regions and the corresponding uncertainties, which are later combined using an uncertainty-aware cross-correlation module in one of two ways: by computing similarity between the corresponding uncertainties and adding it to the regular cross-correlation values, or by penalizing the uncertain feature channels to increase influence of the certain features. In experiments, we show that both methods improve tracking performance, while penalization of uncertain features provides the best uncertainty quality.
△ Less
Submitted 12 February, 2023;
originally announced February 2023.
-
Deep Residual Networks for Gravitational Wave Detection
Authors:
Paraskevi Nousi,
Alexandra E. Koloniari,
Nikolaos Passalis,
Panagiotis Iosif,
Nikolaos Stergioulas,
Anastasios Tefas
Abstract:
Traditionally, gravitational waves are detected with techniques such as matched filtering or unmodeled searches based on wavelets. However, in the case of generic black hole binaries with non-aligned spins, if one wants to explore the whole parameter space, matched filtering can become impractical, which sets severe restrictions on the sensitivity and computational efficiency of gravitational-wave…
▽ More
Traditionally, gravitational waves are detected with techniques such as matched filtering or unmodeled searches based on wavelets. However, in the case of generic black hole binaries with non-aligned spins, if one wants to explore the whole parameter space, matched filtering can become impractical, which sets severe restrictions on the sensitivity and computational efficiency of gravitational-wave searches. Here, we use a novel combination of machine-learning algorithms and arrive at sensitive distances that surpass traditional techniques in a specific setting. Moreover, the computational cost is only a small fraction of the computational cost of matched filtering. The main ingredients are a 54-layer deep residual network (ResNet), a Deep Adaptive Input Normalization (DAIN), a dynamic dataset augmentation, and curriculum learning, based on an empirical relation for the signal-to-noise ratio. We compare the algorithm's sensitivity with two traditional algorithms on a dataset consisting of a large number of injected waveforms of non-aligned binary black hole mergers in real LIGO O3a noise samples. Our machine-learning algorithm can be used in upcoming rapid online searches of gravitational-wave events in a sizeable portion of the astrophysically interesting parameter space. We make our code, AResGW, and detailed results publicly available at https://github.com/vivinousi/gw-detection-deep-learning .
△ Less
Submitted 29 June, 2023; v1 submitted 2 November, 2022;
originally announced November 2022.
-
A Novel Dataset for Evaluating and Alleviating Domain Shift for Human Detection in Agricultural Fields
Authors:
Paraskevi Nousi,
Emmanouil Mpampis,
Nikolaos Passalis,
Ole Green,
Anastasios Tefas
Abstract:
In this paper we evaluate the impact of domain shift on human detection models trained on well known object detection datasets when deployed on data outside the distribution of the training set, as well as propose methods to alleviate such phenomena based on the available annotations from the target domain. Specifically, we introduce the OpenDR Humans in Field dataset, collected in the context of…
▽ More
In this paper we evaluate the impact of domain shift on human detection models trained on well known object detection datasets when deployed on data outside the distribution of the training set, as well as propose methods to alleviate such phenomena based on the available annotations from the target domain. Specifically, we introduce the OpenDR Humans in Field dataset, collected in the context of agricultural robotics applications, using the Robotti platform, allowing for quantitatively measuring the impact of domain shift in such applications. Furthermore, we examine the importance of manual annotation by evaluating three distinct scenarios concerning the training data: a) only negative samples, i.e., no depicted humans, b) only positive samples, i.e., only images which contain humans, and c) both negative and positive samples. Our results indicate that good performance can be achieved even when using only negative samples, if additional consideration is given to the training process. We also find that positive samples increase performance especially in terms of better localization. The dataset is publicly available for download at https://github.com/opendr-eu/datasets.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
MLGWSC-1: The first Machine Learning Gravitational-Wave Search Mock Data Challenge
Authors:
Marlin B. Schäfer,
Ondřej Zelenka,
Alexander H. Nitz,
He Wang,
Shichao Wu,
Zong-Kuan Guo,
Zhoujian Cao,
Zhixiang Ren,
Paraskevi Nousi,
Nikolaos Stergioulas,
Panagiotis Iosif,
Alexandra E. Koloniari,
Anastasios Tefas,
Nikolaos Passalis,
Francesco Salemi,
Gabriele Vedovato,
Sergey Klimenko,
Tanmaya Mishra,
Bernd Brügmann,
Elena Cuoco,
E. A. Huerta,
Chris Messenger,
Frank Ohme
Abstract:
We present the results of the first Machine Learning Gravitational-Wave Search Mock Data Challenge (MLGWSC-1). For this challenge, participating groups had to identify gravitational-wave signals from binary black hole mergers of increasing complexity and duration embedded in progressively more realistic noise. The final of the 4 provided datasets contained real noise from the O3a observing run and…
▽ More
We present the results of the first Machine Learning Gravitational-Wave Search Mock Data Challenge (MLGWSC-1). For this challenge, participating groups had to identify gravitational-wave signals from binary black hole mergers of increasing complexity and duration embedded in progressively more realistic noise. The final of the 4 provided datasets contained real noise from the O3a observing run and signals up to a duration of 20 seconds with the inclusion of precession effects and higher order modes. We present the average sensitivity distance and runtime for the 6 entered algorithms derived from 1 month of test data unknown to the participants prior to submission. Of these, 4 are machine learning algorithms. We find that the best machine learning based algorithms are able to achieve up to 95% of the sensitive distance of matched-filtering based production analyses for simulated Gaussian noise at a false-alarm rate (FAR) of one per month. In contrast, for real noise, the leading machine learning search achieved 70%. For higher FARs the differences in sensitive distance shrink to the point where select machine learning submissions outperform traditional search algorithms at FARs $\geq 200$ per month on some datasets. Our results show that current machine learning search algorithms may already be sensitive enough in limited parameter regions to be useful for some production settings. To improve the state-of-the-art, machine learning algorithms need to reduce the false-alarm rates at which they are capable of detecting signals and extend their validity to regions of parameter space where modeled searches are computationally expensive to run. Based on our findings we compile a list of research areas that we believe are the most important to elevate machine learning searches to an invaluable tool in gravitational-wave signal detection.
△ Less
Submitted 22 September, 2022;
originally announced September 2022.
-
VPIT: Real-time Embedded Single Object 3D Tracking Using Voxel Pseudo Images
Authors:
Illia Oleksiienko,
Paraskevi Nousi,
Nikolaos Passalis,
Anastasios Tefas,
Alexandros Iosifidis
Abstract:
In this paper, we propose a novel voxel-based 3D single object tracking (3D SOT) method called Voxel Pseudo Image Tracking (VPIT). VPIT is the first method that uses voxel pseudo images for 3D SOT. The input point cloud is structured by pillar-based voxelization, and the resulting pseudo image is used as an input to a 2D-like Siamese SOT method. The pseudo image is created in the Bird's-eye View (…
▽ More
In this paper, we propose a novel voxel-based 3D single object tracking (3D SOT) method called Voxel Pseudo Image Tracking (VPIT). VPIT is the first method that uses voxel pseudo images for 3D SOT. The input point cloud is structured by pillar-based voxelization, and the resulting pseudo image is used as an input to a 2D-like Siamese SOT method. The pseudo image is created in the Bird's-eye View (BEV) coordinates, and therefore the objects in it have constant size. Thus, only the object rotation can change in the new coordinate system and not the object scale. For this reason, we replace multi-scale search with a multi-rotation search, where differently rotated search regions are compared against a single target representation to predict both position and rotation of the object. Experiments on KITTI Tracking dataset show that VPIT is the fastest 3D SOT method and maintains competitive Success and Precision values. Application of a SOT method in a real-world scenario meets with limitations such as lower computational capabilities of embedded devices and a latency-unforgiving environment, where the method is forced to skip certain data frames if the inference speed is not high enough. We implement a real-time evaluation protocol and show that other methods lose most of their performance on embedded devices, while VPIT maintains its ability to track the object.
△ Less
Submitted 6 June, 2022;
originally announced June 2022.
-
Deep Residual Error and Bag-of-Tricks Learning for Gravitational Wave Surrogate Modeling
Authors:
Styliani-Christina Fragkouli,
Paraskevi Nousi,
Nikolaos Passalis,
Panagiotis Iosif,
Nikolaos Stergioulas,
Anastasios Tefas
Abstract:
Deep learning methods have been employed in gravitational-wave astronomy to accelerate the construction of surrogate waveforms for the inspiral of spin-aligned black hole binaries, among other applications. We face the challenge of modeling the residual error of an artificial neural network that models the coefficients of the surrogate waveform expansion (especially those of the phase of the wavef…
▽ More
Deep learning methods have been employed in gravitational-wave astronomy to accelerate the construction of surrogate waveforms for the inspiral of spin-aligned black hole binaries, among other applications. We face the challenge of modeling the residual error of an artificial neural network that models the coefficients of the surrogate waveform expansion (especially those of the phase of the waveform) which we demonstrate has sufficient structure to be learnable by a second network. Adding this second network, we were able to reduce the maximum mismatch for waveforms in a validation set by 13.4 times. We also explored several other ideas for improving the accuracy of the surrogate model, such as the exploitation of similarities between waveforms, the augmentation of the training set, the dissection of the input space, using dedicated networks per output coefficient and output augmentation. In several cases, small improvements can be observed, but the most significant improvement still comes from the addition of a second network that models the residual error. Since the residual error for more general surrogate waveform models (when e.g., eccentricity is included) may also have a specific structure, one can expect our method to be applicable to cases where the gain in accuracy could lead to significant gains in computational time.
△ Less
Submitted 23 August, 2023; v1 submitted 16 March, 2022;
originally announced March 2022.
-
OpenDR: An Open Toolkit for Enabling High Performance, Low Footprint Deep Learning for Robotics
Authors:
N. Passalis,
S. Pedrazzi,
R. Babuska,
W. Burgard,
D. Dias,
F. Ferro,
M. Gabbouj,
O. Green,
A. Iosifidis,
E. Kayacan,
J. Kober,
O. Michel,
N. Nikolaidis,
P. Nousi,
R. Pieters,
M. Tzelepi,
A. Valada,
A. Tefas
Abstract:
Existing Deep Learning (DL) frameworks typically do not provide ready-to-use solutions for robotics, where very specific learning, reasoning, and embodiment problems exist. Their relatively steep learning curve and the different methodologies employed by DL compared to traditional approaches, along with the high complexity of DL models, which often leads to the need of employing specialized hardwa…
▽ More
Existing Deep Learning (DL) frameworks typically do not provide ready-to-use solutions for robotics, where very specific learning, reasoning, and embodiment problems exist. Their relatively steep learning curve and the different methodologies employed by DL compared to traditional approaches, along with the high complexity of DL models, which often leads to the need of employing specialized hardware accelerators, further increase the effort and cost needed to employ DL models in robotics. Also, most of the existing DL methods follow a static inference paradigm, as inherited by the traditional computer vision pipelines, ignoring active perception, which can be employed to actively interact with the environment in order to increase perception accuracy. In this paper, we present the Open Deep Learning Toolkit for Robotics (OpenDR). OpenDR aims at develo** an open, non-proprietary, efficient, and modular toolkit that can be easily used by robotics companies and research institutions to efficiently develop and deploy AI and cognition technologies to robotics applications, providing a solid step towards addressing the aforementioned challenges. We also detail the design choices, along with an abstract interface that was created to overcome these challenges. This interface can describe various robotic tasks, spanning beyond traditional DL cognition and inference, as known by existing frameworks, incorporating openness, homogeneity and robotics-oriented perception e.g., through active perception, as its core design principles.
△ Less
Submitted 1 March, 2022;
originally announced March 2022.
-
Autoencoder-driven Spiral Representation Learning for Gravitational Wave Surrogate Modelling
Authors:
Paraskevi Nousi,
Styliani-Christina Fragkouli,
Nikolaos Passalis,
Panagiotis Iosif,
Theocharis Apostolatos,
George Pappas,
Nikolaos Stergioulas,
Anastasios Tefas
Abstract:
Recently, artificial neural networks have been gaining momentum in the field of gravitational wave astronomy, for example in surrogate modelling of computationally expensive waveform models for binary black hole inspiral and merger. Surrogate modelling yields fast and accurate approximations of gravitational waves and neural networks have been used in the final step of interpolating the coefficien…
▽ More
Recently, artificial neural networks have been gaining momentum in the field of gravitational wave astronomy, for example in surrogate modelling of computationally expensive waveform models for binary black hole inspiral and merger. Surrogate modelling yields fast and accurate approximations of gravitational waves and neural networks have been used in the final step of interpolating the coefficients of the surrogate model for arbitrary waveforms outside the training sample. We investigate the existence of underlying structures in the empirical interpolation coefficients using autoencoders. We demonstrate that when the coefficient space is compressed to only two dimensions, a spiral structure appears, wherein the spiral angle is linearly related to the mass ratio. Based on this finding, we design a spiral module with learnable parameters, that is used as the first layer in a neural network, which learns to map the input space to the coefficients. The spiral module is evaluated on multiple neural network architectures and consistently achieves better speed-accuracy trade-off than baseline models. A thorough experimental study is conducted and the final result is a surrogate model which can evaluate millions of input parameters in a single forward pass in under 1ms on a desktop GPU, while the mismatch between the corresponding generated waveforms and the ground-truth waveforms is better than the compared baseline methods. We anticipate the existence of analogous underlying structures and corresponding computational gains also in the case of spinning black hole binaries.
△ Less
Submitted 9 July, 2021;
originally announced July 2021.
-
Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human Digitization
Authors:
C. Symeonidis,
P. Nousi,
P. Tosidis,
K. Tsampazis,
N. Passalis,
A. Tefas,
N. Nikolaidis
Abstract:
The performance of supervised deep learning algorithms depends significantly on the scale, quality and diversity of the data used for their training. Collecting and manually annotating large amount of data can be both time-consuming and costly tasks to perform. In the case of tasks related to visual human-centric perception, the collection and distribution of such data may also face restrictions d…
▽ More
The performance of supervised deep learning algorithms depends significantly on the scale, quality and diversity of the data used for their training. Collecting and manually annotating large amount of data can be both time-consuming and costly tasks to perform. In the case of tasks related to visual human-centric perception, the collection and distribution of such data may also face restrictions due to legislation regarding privacy. In addition, the design and testing of complex systems, e.g., robots, which often employ deep learning-based perception models, may face severe difficulties as even state-of-the-art methods trained on real and large-scale datasets cannot always perform adequately due to not having been adapted to the visual differences between the virtual and the real world data. As an attempt to tackle and mitigate the effect of these issues, we present a method that automatically generates realistic synthetic data with annotations for a) person detection, b) face recognition, and c) human pose estimation. The proposed method takes as input real background images and populates them with human figures in various poses. Instead of using hand-made 3D human models, we propose the use of models generated through deep learning methods, further reducing the dataset creation costs, while maintaining a high level of realism. In addition, we provide open-source and easy to use tools that implement the proposed pipeline, allowing for generating highly-realistic synthetic datasets for a variety of tasks. A benchmarking and evaluation in the corresponding tasks shows that synthetic data can be effectively used as a supplement to real data.
△ Less
Submitted 30 June, 2021; v1 submitted 28 June, 2021;
originally announced June 2021.
-
Machine Learning for Forecasting Mid Price Movement using Limit Order Book Data
Authors:
Paraskevi Nousi,
Avraam Tsantekidis,
Nikolaos Passalis,
Adamantios Ntakaris,
Juho Kanniainen,
Anastasios Tefas,
Moncef Gabbouj,
Alexandros Iosifidis
Abstract:
Forecasting the movements of stock prices is one the most challenging problems in financial markets analysis. In this paper, we use Machine Learning (ML) algorithms for the prediction of future price movements using limit order book data. Two different sets of features are combined and evaluated: handcrafted features based on the raw order book data and features extracted by ML algorithms, resulti…
▽ More
Forecasting the movements of stock prices is one the most challenging problems in financial markets analysis. In this paper, we use Machine Learning (ML) algorithms for the prediction of future price movements using limit order book data. Two different sets of features are combined and evaluated: handcrafted features based on the raw order book data and features extracted by ML algorithms, resulting in feature vectors with highly variant dimensionalities. Three classifiers are evaluated using combinations of these sets of features on two different evaluation setups and three prediction scenarios. Even though the large scale and high frequency nature of the limit order book poses several challenges, the scope of the conducted experiments and the significance of the experimental results indicate that Machine Learning highly befits this task carving the path towards future research in this field.
△ Less
Submitted 8 April, 2019; v1 submitted 19 September, 2018;
originally announced September 2018.