Search | arXiv e-print repository

A Hybrid Microscopic Model for Multimodal Traffic with Empirical Observations from Aerial Footage

Authors: Georg Anagnostopoulos, Nikolas Geroliminis

Abstract: Microscopic traffic flow models can be distinguished in lane-based or lane-free depending on the degree of lane-discipline. This distinction holds true only if motorcycles are neglected in lane-based traffic. In cities, as opposed to highways, this is an oversimplification and it would be more accurate to speak of hybrid situations, where lane discipline can be made mode-dependent. Empirical evide… ▽ More Microscopic traffic flow models can be distinguished in lane-based or lane-free depending on the degree of lane-discipline. This distinction holds true only if motorcycles are neglected in lane-based traffic. In cities, as opposed to highways, this is an oversimplification and it would be more accurate to speak of hybrid situations, where lane discipline can be made mode-dependent. Empirical evidence shows that cars follow the lanes as defined by the infrastructure, while motorcycles do not necessarily adhere to predefined norms and may participate in self-organized formation of virtual lanes. This phenomenon is the result of complex interactions between different traffic participants competing for limited space. In order to better understand the dynamics of modal interaction microscopically, we first analyze empirical data from detailed trajectories obtained by the pNEUMA experiment and observe patterns of mixed traffic. Then, we propose a hybrid model for multimodal vehicular traffic. The hybrid model is inspired by the pedestrian flow literature, featuring collision-free and anticipatory properties, and we demonstrate that it is able to reproduce empirical observations from aerial footage. △ Less

Submitted 25 October, 2022; originally announced October 2022.

arXiv:2111.10402 [pdf, other]

doi 10.1007/978-3-030-80387-2_20

Influence Dynamics Among Narratives: A Case Study of the Venezuelan Presidential Crisis

Authors: Akshay Aravamudan, Xi Zhang, Jihye Song, Stephen M. Fiore, Georgios C. Anagnostopoulos

Abstract: It is widely understood that diffusion of and simultaneous interactions between narratives -- defined here as persistent point-of-view messaging -- significantly contributes to the sha** of political discourse and public opinion. In this work, we propose a methodology based on Multi-Variate Hawkes Processes and our newly-introduced Process Influence Measures for quantifying and assessing how suc… ▽ More It is widely understood that diffusion of and simultaneous interactions between narratives -- defined here as persistent point-of-view messaging -- significantly contributes to the sha** of political discourse and public opinion. In this work, we propose a methodology based on Multi-Variate Hawkes Processes and our newly-introduced Process Influence Measures for quantifying and assessing how such narratives influence (Granger-cause) each other. Such an approach may aid social scientists enhance their understanding of socio-geopolitical phenomena as they manifest themselves and evolve in the realm of social media. In order to show its merits, we apply our methodology on Twitter narratives during the 2019 Venezuelan presidential crisis. Our analysis indicates a nuanced, evolving influence structure between 8 distinct narratives, part of which could be explained by landmark historical events. △ Less

Submitted 5 November, 2021; originally announced November 2021.

Comments: This document has 11 pages and 4 figures

Journal ref: Social, Cultural, and Behavioral Modeling (2021) 204--213

arXiv:2102.02706 [pdf, other]

doi 10.1109/IPIN51156.2021.9662590

ProxyFAUG: Proximity-based Fingerprint Augmentation

Authors: Grigorios G. Anagnostopoulos, Alexandros Kalousis

Abstract: The proliferation of data-demanding machine learning methods has brought to light the necessity for methodologies which can enlarge the size of training datasets, with simple, rule-based methods. In-line with this concept, the fingerprint augmentation scheme proposed in this work aims to augment fingerprint datasets which are used to train positioning models. The proposed method utilizes fingerpri… ▽ More The proliferation of data-demanding machine learning methods has brought to light the necessity for methodologies which can enlarge the size of training datasets, with simple, rule-based methods. In-line with this concept, the fingerprint augmentation scheme proposed in this work aims to augment fingerprint datasets which are used to train positioning models. The proposed method utilizes fingerprints which are recorded in spacial proximity, in order to perform fingerprint augmentation, creating new fingerprints which combine the features of the original ones. The proposed method of composing the new, augmented fingerprints is inspired by the crossover and mutation operators of genetic algorithms. The ProxyFAUG method aims to improve the achievable positioning accuracy of fingerprint datasets, by introducing a rule-based, stochastic, proximity-based method of fingerprint augmentation. The performance of ProxyFAUG is evaluated in an outdoor Sigfox setting using a public dataset. The best performing published positioning method on this dataset is improved by 40% in terms of median error and 6% in terms of mean error, with the use of the augmented dataset. The analysis of the results indicate a systematic and significant performance improvement at the lower error quartiles, as indicated by the impressive improvement of the median error. △ Less

Submitted 12 January, 2022; v1 submitted 4 February, 2021; originally announced February 2021.

arXiv:2011.10478 [pdf, other]

Analysing the Data-Driven Approach of Dynamically Estimating Positioning Accuracy

Authors: Grigorios G. Anagnostopoulos, Alexandros Kalousis

Abstract: The primary expectation from positioning systems is for them to provide the users with reliable estimates of their position. An additional piece of information that can greatly help the users utilize position estimates is the level of uncertainty that a positioning system assigns to the position estimate it produced. The concept of dynamically estimating the accuracy of position estimates of finge… ▽ More The primary expectation from positioning systems is for them to provide the users with reliable estimates of their position. An additional piece of information that can greatly help the users utilize position estimates is the level of uncertainty that a positioning system assigns to the position estimate it produced. The concept of dynamically estimating the accuracy of position estimates of fingerprinting positioning systems has been sporadically discussed over the last decade in the literature of the field, where mainly handcrafted rules based on domain knowledge have been proposed. The emergence of IoT devices and the proliferation of data from Low Power Wide Area Networks (LPWANs) have facilitated the conceptualization of data-driven methods of determining the estimated certainty over position estimates. In this work, we analyze the data-driven approach of determining the Dynamic Accuracy Estimation (DAE), considering it in the broader context of a positioning system. More specifically, with the use of a public LoRaWAN dataset, the current work analyses: the repartition of the available training set between the tasks of determining the location estimates and the DAE, the concept of selecting a subset of the most reliable estimates, and the impact that the spatial distribution of the data has to the accuracy of the DAE. The work provides a wide overview of the data-driven approach of DAE determination in the context of the overall design of a positioning system. △ Less

Submitted 24 February, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

Comments: Author's accepted manuscript version. Accepted for publication in IEEE ICC 2021, IoT and Sensor Networks Symposium

arXiv:2006.02781 [pdf, other]

doi 10.1371/journal.pone.0244206

StationRank: Aggregate dynamics of the Swiss railway

Authors: Georg Anagnostopoulos, Vahid Moosavi

Abstract: Increasing availability and quality of actual, as opposed to scheduled, open transport data offers new possibilities for capturing the spatiotemporal dynamics of the railway and other networks of social infrastructure. One way to describe such complex phenomena is in terms of stochastic processes. At its core, a stochastic model is domain-agnostic and algorithms discussed here have been successful… ▽ More Increasing availability and quality of actual, as opposed to scheduled, open transport data offers new possibilities for capturing the spatiotemporal dynamics of the railway and other networks of social infrastructure. One way to describe such complex phenomena is in terms of stochastic processes. At its core, a stochastic model is domain-agnostic and algorithms discussed here have been successfully used in other applications, including Google's PageRank citation ranking. Our key assumption is that train routes constitute meaningful sequences analogous to sentences of literary text. A corpus of routes is thus susceptible to the same analytic tool-set as a corpus of sentences. With our experiment in Switzerland, we introduce a method for building Markov Chains from aggregated daily streams of railway traffic data. The stationary distributions under normal and perturbed conditions are used to define systemic risk measures with non-evident,valuable information about railway infrastructure. △ Less

Submitted 17 October, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

arXiv:2004.01954 [pdf]

doi 10.1088/1361-6528/aae683

Enhancing the Adhesion of Graphene to Polymer Substrates by Controlled Defect Formation

Authors: George Anagnostopoulos, Labrini Sygellou, George Paterakis, Ioannis Polyzos, Christos A. Aggelopoulos, Costas Galiotis

Abstract: The mechanical integrity of composite materials depends primarily on the interface strength and the defect density of the reinforcement which is the provider of enhanced strength and stiffness. In the case of graphene/ polymer nanocomposites which are characterized by an extremely large interface region, any defects in the inclusion (such as folds, cracks, holes etc.) will have a detrimental effec… ▽ More The mechanical integrity of composite materials depends primarily on the interface strength and the defect density of the reinforcement which is the provider of enhanced strength and stiffness. In the case of graphene/ polymer nanocomposites which are characterized by an extremely large interface region, any defects in the inclusion (such as folds, cracks, holes etc.) will have a detrimental effect to the internal strain distribution and the resulting mechanical performance. This conventional wisdom, however, can be challenged if the defect size is reduced beyond the critical size for crack formation to the level of atomic vacancies. In that case, there should be no practical effect on crack propagation and depending on the nature of the vacancies the interface strength may be in fact increase. In this work we employed argon ion (Ar+) bombardment and subsequent exposure to hydrogen (H2) to induce (as revealed by X-ray & Ultraviolet photoelectron spectroscopy (XPS/UPS) and Raman spectroscopy) passivated atomic single vacancies to CVD graphene. The modified graphene was subsequently transferred to PMMA bars and the morphology, wettability and the interface adhesion of the CVD graphene/PMMA system were investigated with Atomic Force Microscopy technique and Raman analysis. The results obtained showed clearly an overall improved mechanical behavior of graphene/polymer interface, since an increase as well a more uniform shift distribution with strain is observed. This paves the way for interface engineering in graphene/polymer systems which, in pristine condition, suffer from premature graphene slippage and subsequent failure. △ Less

Submitted 4 April, 2020; originally announced April 2020.

Journal ref: Nanotechnology 30 015704 (2019)

arXiv:2003.11611 [pdf, other]

doi 10.1007/978-3-030-77517-9_11

Deep Agent: Studying the Dynamics of Information Spread and Evolution in Social Networks

Authors: Ivan Garibay, Toktam A. Oghaz, Niloofar Yousefi, Ece C. Mutlu, Madeline Schiappa, Steven Scheinert, Georgios C. Anagnostopoulos, Christina Bouwens, Stephen M. Fiore, Alexander Mantzaris, John T. Murphy, William Rand, Anastasia Salter, Mel Stanfill, Gita Sukthankar, Nisha Baral, Gabriel Fair, Chathika Gunaratne, Neda B. Hajiakhoond, Jasser Jasser, Chathura Jayalath, Olivia Newton, Samaneh Saadat, Chathurani Senevirathna, Rachel Winter , et al. (1 additional authors not shown)

Abstract: This paper explains the design of a social network analysis framework, developed under DARPA's SocialSim program, with novel architecture that models human emotional, cognitive and social factors. Our framework is both theory and data-driven, and utilizes domain expertise. Our simulation effort helps in understanding how information flows and evolves in social media platforms. We focused on modeli… ▽ More This paper explains the design of a social network analysis framework, developed under DARPA's SocialSim program, with novel architecture that models human emotional, cognitive and social factors. Our framework is both theory and data-driven, and utilizes domain expertise. Our simulation effort helps in understanding how information flows and evolves in social media platforms. We focused on modeling three information domains: cryptocurrencies, cyber threats, and software vulnerabilities for the three interrelated social environments: GitHub, Reddit, and Twitter. We participated in the SocialSim DARPA Challenge in December 2018, in which our models were subjected to extensive performance evaluation for accuracy, generalizability, explainability, and experimental power. This paper reports the main concepts and models, utilized in our social media modeling effort in develo** a multi-resolution simulation at the user, community, population, and content levels. △ Less

Submitted 29 May, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

Comments: 16 pages

arXiv:1908.06851 [pdf, other]

doi 10.1109/IPIN.2019.8911792

A Reproducible Analysis of RSSI Fingerprinting for Outdoor Localization Using Sigfox: Preprocessing and Hyperparameter Tuning

Authors: Grigorios G. Anagnostopoulos, Alexandros Kalousis

Abstract: Fingerprinting techniques, which are a common method for indoor localization, have been recently applied with success into outdoor settings. Particularly, the communication signals of Low Power Wide Area Networks (LPWAN) such as Sigfox, have been used for localization. In this rather recent field of study, not many publicly available datasets, which would facilitate the consistent comparison of di… ▽ More Fingerprinting techniques, which are a common method for indoor localization, have been recently applied with success into outdoor settings. Particularly, the communication signals of Low Power Wide Area Networks (LPWAN) such as Sigfox, have been used for localization. In this rather recent field of study, not many publicly available datasets, which would facilitate the consistent comparison of different positioning systems, exist so far. In the current study, a published dataset of RSSI measurements on a Sigfox network deployed in Antwerp, Belgium is used to analyse the appropriate selection of preprocessing steps and to tune the hyperparameters of a kNN fingerprinting method. Initially, the tuning of hyperparameter k for a variety of distance metrics, and the selection of efficient data transformation schemes, proposed by relevant works, is presented. In addition, accuracy improvements are achieved in this study, by a detailed examination of the appropriate adjustment of the parameters of the data transformation schemes tested, and of the handling of out of range values. With the appropriate tuning of these factors, the achieved mean localization error was 298 meters, and the median error was 109 meters. To facilitate the reproducibility of tests and comparability of results, the code and train/validation/test split used in this study are available. △ Less

Submitted 14 August, 2019; originally announced August 2019.

Comments: Preprint of a paper to be presented in IPIN2019

arXiv:1908.05085 [pdf, other]

doi 10.1109/WPNC47567.2019.8970177

A Reproducible Comparison of RSSI Fingerprinting Localization Methods Using LoRaWAN

Authors: Grigorios G. Anagnostopoulos, Alexandros Kalousis

Abstract: The use of fingerprinting localization techniques in outdoor IoT settings has started to gain popularity over the recent years. Communication signals of Low Power Wide Area Networks (LPWAN), such as LoRaWAN, are used to estimate the location of low power mobile devices. In this study, a publicly available dataset of LoRaWAN RSSI measurements is utilized to compare different machine learning method… ▽ More The use of fingerprinting localization techniques in outdoor IoT settings has started to gain popularity over the recent years. Communication signals of Low Power Wide Area Networks (LPWAN), such as LoRaWAN, are used to estimate the location of low power mobile devices. In this study, a publicly available dataset of LoRaWAN RSSI measurements is utilized to compare different machine learning methods and their accuracy in producing location estimates. The tested methods are: the k Nearest Neighbours method, the Extra Trees method and a neural network approach using a Multilayer Perceptron. To facilitate the reproducibility of tests and the comparability of results, the code and the train/validation/test split of the dataset used in this study have become available. The neural network approach was the method with the highest accuracy, achieving a mean error of 358 meters and a median error of 204 meters. △ Less

Submitted 14 August, 2019; originally announced August 2019.

arXiv:1902.08639 [pdf, ps, other]

Learning Hash Function through Codewords

Authors: Yinjie Huang, Michael Georgiopoulos, Georgios C. Anagnostopoulos

Abstract: In this paper, we propose a novel hash learning approach that has the following main distinguishing features, when compared to past frameworks. First, the codewords are utilized in the Hamming space as ancillary techniques to accomplish its hash learning task. These codewords, which are inferred from the data, attempt to capture grou** aspects of the data's hash codes. Furthermore, the proposed… ▽ More In this paper, we propose a novel hash learning approach that has the following main distinguishing features, when compared to past frameworks. First, the codewords are utilized in the Hamming space as ancillary techniques to accomplish its hash learning task. These codewords, which are inferred from the data, attempt to capture grou** aspects of the data's hash codes. Furthermore, the proposed framework is capable of addressing supervised, unsupervised and, even, semi-supervised hash learning scenarios. Additionally, the framework adopts a regularization term over the codewords, which automatically chooses the codewords for the problem. To efficiently solve the problem, one Block Coordinate Descent algorithm is showcased in the paper. We also show that one step of the algorithms can be casted into several Support Vector Machine problems which enables our algorithms to utilize efficient software package. For the regularization term, a closed form solution of the proximal operator is provided in the paper. A series of comparative experiments focused on content-based image retrieval highlights its performance advantages. △ Less

Submitted 22 February, 2019; originally announced February 2019.

Comments: arXiv admin note: substantial text overlap with arXiv:1508.03285

arXiv:1902.08313 [pdf, other]

Reduced-Rank Local Distance Metric Learning for k-NN Classification

Authors: YInjie Huang, Cong Li, Michael Georgiopoulos, Georgios C. Anagnostopoulos

Abstract: We propose a new method for local distance metric learning based on sample similarity as side information. These local metrics, which utilize conical combinations of metric weight matrices, are learned from the pooled spatial characteristics of the data, as well as the similarity profiles between the pairs of samples, whose distances are measured. The main objective of our framework is to yield me… ▽ More We propose a new method for local distance metric learning based on sample similarity as side information. These local metrics, which utilize conical combinations of metric weight matrices, are learned from the pooled spatial characteristics of the data, as well as the similarity profiles between the pairs of samples, whose distances are measured. The main objective of our framework is to yield metrics, such that the resulting distances between similar samples are small and distances between dissimilar samples are above a certain threshold. For learning and inference purposes, we describe a transductive, as well as an inductive algorithm; the former approach naturally befits our framework, while the latter one is provided in the interest of faster learning. Experimental results on a collection of classification problems imply that the new methods may exhibit notable performance advantages over alternative metric learning approaches that have recently appeared in the literature. △ Less

Submitted 21 February, 2019; originally announced February 2019.

arXiv:1804.10242 [pdf]

doi 10.1016/j.eml.2016.03.016

Compression behavior of simply-supported and fully embedded monolayer graphene: theory and experiment

Authors: Emmanuel N. Koukaras, Charalampos Androulidakis, George Anagnostopoulos, Konstantinos Papagelis, Costas Galiotis

Abstract: Single layer graphene simply-supported on a polymer substrate was subjected to axial compression and its behavior upon loading was monitored with laser Raman spectroscopy (LRS). The graphene was found to fail by wrinkling (buckling) at a critical strain of -0.30% and at a compressive stress of ~1.6 GPa, as revealed by the conversion of the spectroscopic data to actual stress-strain curves. This co… ▽ More Single layer graphene simply-supported on a polymer substrate was subjected to axial compression and its behavior upon loading was monitored with laser Raman spectroscopy (LRS). The graphene was found to fail by wrinkling (buckling) at a critical strain of -0.30% and at a compressive stress of ~1.6 GPa, as revealed by the conversion of the spectroscopic data to actual stress-strain curves. This contrasts with the value of -0.60% and stress of ~3.8 GPa required for failure initiation in the fully embedded case. To elucidate the failure mechanisms in the two cases examined, molecular dynamics simulations employing the AIREBO potential were performed. We assess the impact of surface roughness, graphene-polymer interaction, and of thermal (phonon) ripples on the onset of wrinkle formation. Overall good agreement was found between theory and experiment. As argued herein, the understanding and control of out-of-plane phenomena upon mechanical loading of graphene are important prerequisites for the design and function of new graphene-based devices. △ Less

Submitted 26 April, 2018; originally announced April 2018.

Comments: 20 pages, 9 figures

Journal ref: Extreme Mechanics Letters 8, 191-200, 2016

arXiv:1801.05586 [pdf]

doi 10.1002/pssb.201600233

Stress and charge transfer in uniaxially strained CVD graphene

Authors: Milan Bousa, George Anagnostopoulos, Elena del Corro, Karolina Drogowska, Jan Pekarek, Ladislav Kavan, Martin Kalbac, John Parthenios, Konstantinos Papagelis, Costas Galiotis, Otakar Frank

Abstract: Mechanical properties of graphene prepared by chemical vapor deposition (CVD) are not easily comparable to the properties of nearly perfect graphene prepared by mechanical cleavage. In this work, we attempt to investigate the mechanical performance of CVD graphene (simply supported or embedded in polymer matrix), transferred by two different techniques, under uniaxial loading with simultaneous in-… ▽ More Mechanical properties of graphene prepared by chemical vapor deposition (CVD) are not easily comparable to the properties of nearly perfect graphene prepared by mechanical cleavage. In this work, we attempt to investigate the mechanical performance of CVD graphene (simply supported or embedded in polymer matrix), transferred by two different techniques, under uniaxial loading with simultaneous in-situ monitoring by Raman microspectroscopy. The level of charge transfer do** and strain is assessed using the vector analysis modified for uniaxial strain. The strain distribution across the samples varies significantly, owing to the growth and transfer process, which induces wrinkles and faults in the CVD graphene. In simply supported specimens, the stress transfer efficiency is generally very low and the changes in Raman spectra are dominated by variations in the charge transfer originating from the realignment of the domains on the substrate upon the application of strain. In contrast, samples covered with an additional polymer layer exhibit an improved stress transfer efficiency, and the alterations of charge do** levels are negligible. In fully embedded specimens, the variations in stress transfer efficiencies are caused by the size of the effective graphene domains defined by cracks, folds and or/wrinkles. △ Less

Submitted 17 January, 2018; originally announced January 2018.

Journal ref: Phys. Status Solidi B 253, No. 12, 2355-2361 2016

arXiv:1707.03426 [pdf, ps, other]

Multi-Task Learning Using Neighborhood Kernels

Authors: Niloofar Yousefi, Cong Li, Mansooreh Mollaghasemi, Georgios Anagnostopoulos, Michael Georgiopoulos

Abstract: This paper introduces a new and effective algorithm for learning kernels in a Multi-Task Learning (MTL) setting. Although, we consider a MTL scenario here, our approach can be easily applied to standard single task learning, as well. As shown by our empirical results, our algorithm consistently outperforms the traditional kernel learning algorithms such as uniform combination solution, convex comb… ▽ More This paper introduces a new and effective algorithm for learning kernels in a Multi-Task Learning (MTL) setting. Although, we consider a MTL scenario here, our approach can be easily applied to standard single task learning, as well. As shown by our empirical results, our algorithm consistently outperforms the traditional kernel learning algorithms such as uniform combination solution, convex combinations of base kernels as well as some kernel alignment-based models, which have been proven to give promising results in the past. We present a Rademacher complexity bound based on which a new Multi-Task Multiple Kernel Learning (MT-MKL) model is derived. In particular, we propose a Support Vector Machine-regularized model in which, for each task, an optimal kernel is learned based on a neighborhood-defining kernel that is not restricted to be positive semi-definite. Comparative experimental results are showcased that underline the merits of our neighborhood-defining framework in both classification and regression problems. △ Less

Submitted 11 July, 2017; originally announced July 2017.

arXiv:1606.05281 [pdf]

doi 10.1021/am508482n

Stress Transfer Mechanisms at the Submicron Level for Graphene/Polymer Systems

Authors: George Anagnostopoulos, Charalampos Androulidakis, Emmanuel N. Koukaras, Georgia Tsoukleri, Ioannis Polyzos, John Parthenios, Konstantinos Papagelis, Costas Galiotis

Abstract: The stress transfer mechanism from a polymer substrate to a nano-inclusion, such as a graphene flake, is of extreme interest for the production of effective nanocomposites. Previous work conducted mainly at the micron scale has shown that the intrinsic mechanism of stress transfer is shear at the interface. However, since the interfacial shear takes its maximum value at the very edge of the nano-i… ▽ More The stress transfer mechanism from a polymer substrate to a nano-inclusion, such as a graphene flake, is of extreme interest for the production of effective nanocomposites. Previous work conducted mainly at the micron scale has shown that the intrinsic mechanism of stress transfer is shear at the interface. However, since the interfacial shear takes its maximum value at the very edge of the nano-inclusion it is of extreme interest to assess the effect of edge integrity upon axial stress transfer at the submicron scale. Here, we conduct a detailed Raman line map** near the edges of a monolayer graphene flake which is simply supported onto an epoxy based photoresist (SU8)/poly(methyl methacrylate) (PMMA) matrix at steps as small as 100 nm. We show for the first time that, the distribution of axial strain (stress) along the flake deviates somewhat from the classical shear-lag prediction for a region of about 2 um from the edge. This behavior is mainly attributed to the presence of residual stresses, unintentional do** and/or edge effects (deviation from the equilibrium values of bond lengths and angles, as well as different edge chiralities). By considering a simple balance of shear-to-normal stresses at the interface we are able to directly convert the strain (stress) gradient to values of interfacial shear stress for all the applied tensile levels without assuming classical shear-lag behavior. For large flakes a maximum value of interfacial shear stress (ISS) of 0.4 MPa is obtained prior to flake slip**. △ Less

Submitted 16 June, 2016; originally announced June 2016.

Journal ref: ACS Applied Materials and Interfaces 7 (2015) 4216-4223

arXiv:1602.05916 [pdf, ps, other]

Local Rademacher Complexity-based Learning Guarantees for Multi-Task Learning

Authors: Niloofar Yousefi, Yunwen Lei, Marius Kloft, Mansooreh Mollaghasemi, Georgios Anagnostopoulos

Abstract: We show a Talagrand-type concentration inequality for Multi-Task Learning (MTL), using which we establish sharp excess risk bounds for MTL in terms of distribution- and data-dependent versions of the Local Rademacher Complexity (LRC). We also give a new bound on the LRC for norm regularized as well as strongly convex hypothesis classes, which applies not only to MTL but also to the standard i.i.d.… ▽ More We show a Talagrand-type concentration inequality for Multi-Task Learning (MTL), using which we establish sharp excess risk bounds for MTL in terms of distribution- and data-dependent versions of the Local Rademacher Complexity (LRC). We also give a new bound on the LRC for norm regularized as well as strongly convex hypothesis classes, which applies not only to MTL but also to the standard i.i.d. setting. Combining both results, one can now easily derive fast-rate bounds on the excess risk for many prominent MTL methods, including---as we demonstrate---Schatten-norm, group-norm, and graph-regularized MTL. The derived bounds reflect a relationship akeen to a conservation law of asymptotic convergence rates. This very relationship allows for trading off slower rates w.r.t. the number of tasks for faster rates with respect to the number of available samples per task, when compared to the rates obtained via a traditional, global Rademacher analysis. △ Less

Submitted 9 February, 2017; v1 submitted 18 February, 2016; originally announced February 2016.

Comments: In this version, some arguments and results (of the previous version) have been corrected, or modified

arXiv:1508.03329 [pdf, ps, other]

Multi-Task Learning with Group-Specific Feature Space Sharing

Authors: Niloofar Yousefi, Michael Georgiopoulos, Georgios C. Anagnostopoulos

Abstract: When faced with learning a set of inter-related tasks from a limited amount of usable data, learning each task independently may lead to poor generalization performance. Multi-Task Learning (MTL) exploits the latent relations between tasks and overcomes data scarcity limitations by co-learning all these tasks simultaneously to offer improved performance. We propose a novel Multi-Task Multiple Kern… ▽ More When faced with learning a set of inter-related tasks from a limited amount of usable data, learning each task independently may lead to poor generalization performance. Multi-Task Learning (MTL) exploits the latent relations between tasks and overcomes data scarcity limitations by co-learning all these tasks simultaneously to offer improved performance. We propose a novel Multi-Task Multiple Kernel Learning framework based on Support Vector Machines for binary classification tasks. By considering pair-wise task affinity in terms of similarity between a pair's respective feature spaces, the new framework, compared to other similar MTL approaches, offers a high degree of flexibility in determining how similar feature spaces should be, as well as which pairs of tasks should share a common feature space in order to benefit overall performance. The associated optimization problem is solved via a block coordinate descent, which employs a consensus-form Alternating Direction Method of Multipliers algorithm to optimize the Multiple Kernel Learning weights and, hence, to determine task affinities. Empirical evaluation on seven data sets exhibits a statistically significant improvement of our framework's results compared to the ones of several other Clustered Multi-Task Learning methods. △ Less

Submitted 13 August, 2015; originally announced August 2015.

arXiv:1508.03285 [pdf, ps, other]

Hash Function Learning via Codewords

Authors: Yinjie Huang, Michael Georgiopoulos, Georgios C. Anagnostopoulos

Abstract: In this paper we introduce a novel hash learning framework that has two main distinguishing features, when compared to past approaches. First, it utilizes codewords in the Hamming space as ancillary means to accomplish its hash learning task. These codewords, which are inferred from the data, attempt to capture similarity aspects of the data's hash codes. Secondly and more importantly, the same fr… ▽ More In this paper we introduce a novel hash learning framework that has two main distinguishing features, when compared to past approaches. First, it utilizes codewords in the Hamming space as ancillary means to accomplish its hash learning task. These codewords, which are inferred from the data, attempt to capture similarity aspects of the data's hash codes. Secondly and more importantly, the same framework is capable of addressing supervised, unsupervised and, even, semi-supervised hash learning tasks in a natural manner. A series of comparative experiments focused on content-based image retrieval highlights its performance advantages. △ Less

Submitted 18 August, 2015; v1 submitted 13 August, 2015; originally announced August 2015.

arXiv:1503.06414 [pdf]

doi 10.1021/nn507202c

The deformation of wrinkled graphene

Authors: Zheling Li, Ian A. Kinloch, Robert J. Young, Kostya S. Novoselov, George Anagnostopoulos, John Parthenios, Costas Galiotis, Konstantinos Papagelis, Ching-Yu Lu, Liam Britnell

Abstract: The deformation of monolayer graphene, produced by chemical vapor deposition (CVD), on a polyester film substrate has been investigated through the use of Raman spectroscopy. It has been found that the microstructure of the CVD graphene consists of a hexagonal array of islands of flat monolayer graphene separated by wrinkled material. During deformation, it was found that the rate of shift of the… ▽ More The deformation of monolayer graphene, produced by chemical vapor deposition (CVD), on a polyester film substrate has been investigated through the use of Raman spectroscopy. It has been found that the microstructure of the CVD graphene consists of a hexagonal array of islands of flat monolayer graphene separated by wrinkled material. During deformation, it was found that the rate of shift of the Raman 2D band wavenumber per unit strain was less than 25% of that of flat flakes of mechanically-exfoliated graphene, whereas the rate of band broadening per unit strain was about 75% of that of the exfoliated material. This unusual deformation behavior has been modeled in terms of mechanically-isolated graphene islands separated by the graphene wrinkles, with the strain distribution in each graphene island determined using shear lag analysis. The effect of the size and position of the Raman laser beam spot has also been incorporated in the model. The predictions fit well with the behavior observed experimentally for the Raman band shifts and broadening of the wrinkled CVD graphene. The effect of wrinkles upon the efficiency of graphene to reinforce nanocomposites is also discussed. △ Less

Submitted 22 March, 2015; originally announced March 2015.

Comments: ACS Nano (2015)

Journal ref: ACS Nano, 9(4), 3917-3925 (2015)

arXiv:1408.4714 [pdf, ps, other]

Conic Multi-Task Classification

Authors: Cong Li, Michael Georgiopoulos, Georgios C. Anagnostopoulos

Abstract: Traditionally, Multi-task Learning (MTL) models optimize the average of task-related objective functions, which is an intuitive approach and which we will be referring to as Average MTL. However, a more general framework, referred to as Conic MTL, can be formulated by considering conic combinations of the objective functions instead; in this framework, Average MTL arises as a special case, when al… ▽ More Traditionally, Multi-task Learning (MTL) models optimize the average of task-related objective functions, which is an intuitive approach and which we will be referring to as Average MTL. However, a more general framework, referred to as Conic MTL, can be formulated by considering conic combinations of the objective functions instead; in this framework, Average MTL arises as a special case, when all combination coefficients equal 1. Although the advantage of Conic MTL over Average MTL has been shown experimentally in previous works, no theoretical justification has been provided to date. In this paper, we derive a generalization bound for the Conic MTL method, and demonstrate that the tightest bound is not necessarily achieved, when all combination coefficients equal 1; hence, Average MTL may not always be the optimal choice, and it is important to consider Conic MTL. As a byproduct of the generalization bound, it also theoretically explains the good experimental results of previous relevant works. Finally, we propose a new Conic MTL model, whose conic combination coefficients minimize the generalization bound, instead of choosing them heuristically as has been done in previous methods. The rationale and advantage of our model is demonstrated and verified via a series of experiments by comparing with several other methods. △ Less

Submitted 20 August, 2014; originally announced August 2014.

Comments: Accepted by European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD)-2014

arXiv:1404.3190 [pdf, ps, other]

doi 10.1109/TNNLS.2014.2309939

Pareto-Path Multi-Task Multiple Kernel Learning

Authors: Cong Li, Michael Georgiopoulos, Georgios C. Anagnostopoulos

Abstract: A traditional and intuitively appealing Multi-Task Multiple Kernel Learning (MT-MKL) method is to optimize the sum (thus, the average) of objective functions with (partially) shared kernel function, which allows information sharing amongst tasks. We point out that the obtained solution corresponds to a single point on the Pareto Front (PF) of a Multi-Objective Optimization (MOO) problem, which con… ▽ More A traditional and intuitively appealing Multi-Task Multiple Kernel Learning (MT-MKL) method is to optimize the sum (thus, the average) of objective functions with (partially) shared kernel function, which allows information sharing amongst tasks. We point out that the obtained solution corresponds to a single point on the Pareto Front (PF) of a Multi-Objective Optimization (MOO) problem, which considers the concurrent optimization of all task objectives involved in the Multi-Task Learning (MTL) problem. Motivated by this last observation and arguing that the former approach is heuristic, we propose a novel Support Vector Machine (SVM) MT-MKL framework, that considers an implicitly-defined set of conic combinations of task objectives. We show that solving our framework produces solutions along a path on the aforementioned PF and that it subsumes the optimization of the average of objective functions as a special case. Using algorithms we derived, we demonstrate through a series of experimental results that the framework is capable of achieving better classification performance, when compared to other similar MTL approaches. △ Less

Submitted 11 April, 2014; originally announced April 2014.

Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems

arXiv:1401.5136 [pdf, other]

A Unifying Framework for Typical Multi-Task Multiple Kernel Learning Problems

Authors: Cong Li, Michael Georgiopoulos, Georgios C. Anagnostopoulos

Abstract: Over the past few years, Multi-Kernel Learning (MKL) has received significant attention among data-driven feature selection techniques in the context of kernel-based learning. MKL formulations have been devised and solved for a broad spectrum of machine learning problems, including Multi-Task Learning (MTL). Solving different MKL formulations usually involves designing algorithms that are tailored… ▽ More Over the past few years, Multi-Kernel Learning (MKL) has received significant attention among data-driven feature selection techniques in the context of kernel-based learning. MKL formulations have been devised and solved for a broad spectrum of machine learning problems, including Multi-Task Learning (MTL). Solving different MKL formulations usually involves designing algorithms that are tailored to the problem at hand, which is, typically, a non-trivial accomplishment. In this paper we present a general Multi-Task Multi-Kernel Learning (Multi-Task MKL) framework that subsumes well-known Multi-Task MKL formulations, as well as several important MKL approaches on single-task problems. We then derive a simple algorithm that can solve the unifying framework. To demonstrate the flexibility of the proposed framework, we formulate a new learning problem, namely Partially-Shared Common Space (PSCS) Multi-Task MKL, and demonstrate its merits through experimentation. △ Less

Submitted 20 January, 2014; originally announced January 2014.

Comments: 17 pages, 1 figure. Accepted by IEEE Transactions on Neural Networks and Learning Systems; currently published as Early Access Article

arXiv:1312.2606 [pdf, ps, other]

Multi-Task Classification Hypothesis Space with Improved Generalization Bounds

Authors: Cong Li, Michael Georgiopoulos, Georgios C. Anagnostopoulos

Abstract: This paper presents a RKHS, in general, of vector-valued functions intended to be used as hypothesis space for multi-task classification. It extends similar hypothesis spaces that have previously considered in the literature. Assuming this space, an improved Empirical Rademacher Complexity-based generalization bound is derived. The analysis is itself extended to an MKL setting. The connection betw… ▽ More This paper presents a RKHS, in general, of vector-valued functions intended to be used as hypothesis space for multi-task classification. It extends similar hypothesis spaces that have previously considered in the literature. Assuming this space, an improved Empirical Rademacher Complexity-based generalization bound is derived. The analysis is itself extended to an MKL setting. The connection between the proposed hypothesis space and a Group-Lasso type regularizer is discussed. Finally, experimental results, with some SVM-based Multi-Task Learning problems, underline the quality of the derived bounds and validate the paper's analysis. △ Less

Submitted 9 December, 2013; originally announced December 2013.

Comments: 18 pages, 4 figures, submitted to IEEE Transactions on Neural Networks and Learning Systems

arXiv:1312.2578 [pdf, ps, other]

doi 10.1109/IJCNN.2013.6706862

Kernel-based Distance Metric Learning in the Output Space

Authors: Cong Li, Michael Georgiopoulos, Georgios C. Anagnostopoulos

Abstract: In this paper we present two related, kernel-based Distance Metric Learning (DML) methods. Their respective models non-linearly map data from their original space to an output space, and subsequent distance measurements are performed in the output space via a Mahalanobis metric. The dimensionality of the output space can be directly controlled to facilitate the learning of a low-rank metric. Both… ▽ More In this paper we present two related, kernel-based Distance Metric Learning (DML) methods. Their respective models non-linearly map data from their original space to an output space, and subsequent distance measurements are performed in the output space via a Mahalanobis metric. The dimensionality of the output space can be directly controlled to facilitate the learning of a low-rank metric. Both methods allow for simultaneous inference of the associated metric and the map** to the output space, which can be used to visualize the data, when the output space is 2- or 3-dimensional. Experimental results for a collection of classification tasks illustrate the advantages of the proposed methods over other traditional and kernel-based DML approaches. △ Less

Submitted 28 April, 2014; v1 submitted 9 December, 2013; originally announced December 2013.

Comments: 11 pages, 7 figures, appeared in the Proceedings of 2013 International Joint Conference on Neural Networks (IJCNN)

arXiv:1302.6680 [pdf]

Enhanced ULF electromagnetic activity detected by DEMETER above seismogenic regions

Authors: M. Athanasiou, G. Machairidis, C. David, G. Anagnostopoulos

Abstract: In this paper we present results of a comparison between ultra low frequency (ULF) electromagnetic (EM) radiation, recorded by an electric field instrument (ICE) onboard the satellite DEMETER in the topside ionosphere, and the seismicity of regions with high and lower seiismic activity. In particular we evaluated the energy variations of the ULF Ez-electric field component during a period of four… ▽ More In this paper we present results of a comparison between ultra low frequency (ULF) electromagnetic (EM) radiation, recorded by an electric field instrument (ICE) onboard the satellite DEMETER in the topside ionosphere, and the seismicity of regions with high and lower seiismic activity. In particular we evaluated the energy variations of the ULF Ez-electric field component during a period of four years (2006-2009), in order to examine check the possible relation of ULF EM radiation with seismogenic regions located in central America, Indonesia, Eastern Mediterranean Basin and Greece. As a tool of evaluating the ULF Ez energy variations we used Singular Spectrum Analysis (SSA) techniques. The results of our analysis clearly show a significant increase of the ULF EM energy emmited from regions of highest seismic activity at the tectonic plates boundaries. We interpret these results as suggesting that the highest ULF EM energy detected in the topside ionosphere is originated from seismic processes within Earth's crust. We understand the results of the present study as confirming previous evidence that the ULF EM anomalous wave activity in the ionosphere can be considered a useful earthquake precursory signal △ Less

Submitted 27 February, 2013; originally announced February 2013.

Comments: 17 pages, PDF, Submitted for publication in Natural Hazards and Earth System Sciences

arXiv:1203.6446 [pdf]

doi 10.1016/j.pss.2012.05.010

The large-scale energetic ion layer in the high latitude Jovian magnetosphere as revealed by Ulysses/HI-SCALE cross-field intensity-gradient measurements

Authors: G. C. Anagnostopoulos, I. Karanikola, P. K. Marhavilas, E. T. Sarris

Abstract: Ulysses investigated the high latitude Jovian magnetosphere for a second time after Pioneer 11 mission and gave us the opportunity to search the structure and the dynamics of this giant magnetosphere above the magnetodisc. Kivelson(1976) and Kennel & Coroniti(1979) reported that Pioneer 11 observed energetic particle intensities at high latitudes at the same level with those measured in the plasma… ▽ More Ulysses investigated the high latitude Jovian magnetosphere for a second time after Pioneer 11 mission and gave us the opportunity to search the structure and the dynamics of this giant magnetosphere above the magnetodisc. Kivelson(1976) and Kennel & Coroniti(1979) reported that Pioneer 11 observed energetic particle intensities at high latitudes at the same level with those measured in the plasma sheet and inferred that they were not consistent with the magnetodisc model. Ulysses observations supported the idea about a large-scale layer of energetic ions and electrons in the outer high latitude Jovian magnetosphere (Cowley et al.1996; Anagnostopoulos et al. 2001). This study perform a number of further tests for the existence of the large scale layer of energetic ions in the outer high latitude Jovian magnetosphere by studying appropriate cross-B field anisotropies in order to monitor the ion northward/southward intensity gradients. In particular, we examined Ulysses/HI-SCALE observations of energetic ions with large gyro-radius (0.5-1.6MeV protons and >2.5MeV heavy(Z>5) ions) in order to compare instant intensity changes with remote sensing intensity gradients. Our analysis confirms the existence of an energetic particle layer in the north hemisphere, during the inbound trajectory of Ulysses traveling at moderate latitudes, and in the south high-latitude duskside magnetosphere, during the outbound segment of the spacecraft trajectory. Our Ulysses/HI-SCALE data analysis also provides evidence for the detection of an energetic proton magnetopause boundary layer during the outbound trajectory of the spacecraft. During Ulysses flyby of Jupiter the almost permanent appearance of alternative northward and southward intensity gradients suggests that the high latitude layer appeared to be a third major area of energetic particles, which coexisted with the radiation belts and the magnetodisc. △ Less

Submitted 29 March, 2012; originally announced March 2012.

Comments: 37 pages, 11 figures, 1 table

arXiv:1012.3588 [pdf]

Radiation belt electron precipitation in the upper ionosphere at middle latitudes before strong earthquakes

Authors: G. Anagnostopoulos, V. Rigas, E. Vassiliadis

Abstract: In this article we present examples of a wider study of space-time correlation of electron precipitation event of the Van Allen belts with the position and time of occurrence of strong (M>6.5) earthquakes. The study is based on the analysis of observations of electron bursts (EBs) with energies 70 - 2350 keV at middle geographic latitudes, which were detected by DEMETER satellite (at an altitude o… ▽ More In this article we present examples of a wider study of space-time correlation of electron precipitation event of the Van Allen belts with the position and time of occurrence of strong (M>6.5) earthquakes. The study is based on the analysis of observations of electron bursts (EBs) with energies 70 - 2350 keV at middle geographic latitudes, which were detected by DEMETER satellite (at an altitude of ~700 km). The EBs show a relative peak-to-background increase usually < 100, they have a time duration ~0.5 - 3 min, energy spectrum with peaks moving in higher energies as the satellite moves towards the equator, and highest energy limit <~500 keV. The EBs are observed in the presence of VLF waves. The flux-time profile of the EBs varies in East Asia and Mediterranean Sea at the similar geographic latitudes, due to the differentiation of the magnitude of the earth's magnetic field. The most important result of our study is the characteristic temporal variation of electron precipitation variation which begins with incremental rate several days / a few weeks before major earthquakes, then presents a maximum, and finally become weaker, with a minimun or a pause of the electron precipitation some hours before the onset of the earthquake. △ Less

Submitted 16 December, 2010; originally announced December 2010.

Comments: 23 pages, 17 figures

arXiv:1012.3585 [pdf]

Solar wind triggering of geomagnetic disturbances and strong (M>6.8) earthquakes during the November - December 2004 period

Authors: G. Anagnostopoulos, A. Papandreou, P. Antoniou

Abstract: This paper brings space weather prediction close to earthquake (EQ) prediction research. The results of this paper support conclusions of previously presented statistical studies that solar activity influences the seismic activity, this influence is mediated through rapid geomagnetic disturbances and the geomagnetic disturbances are related with increases of solar wind speed. Our study concern an… ▽ More This paper brings space weather prediction close to earthquake (EQ) prediction research. The results of this paper support conclusions of previously presented statistical studies that solar activity influences the seismic activity, this influence is mediated through rapid geomagnetic disturbances and the geomagnetic disturbances are related with increases of solar wind speed. Our study concern an example of 40 days with direct response of a series of 7 strong-to-giant (M=6.8-9.3) EQs (including the Andaman-Sumatra EQ) to solar wind speed increases and subsequent geomagnetic fast disturbances. Our analysis for 10 M>6 EQs from November 23 to December 28, 2004 suggests a mean time response delay of EQs to fast geomagnetic disturbances of ~1.5 days. The two giant EQs during this period occurred after the two fastest geomagnetic variations, as revealed by the ratio of the daily Kp index variation over a day ΔKp/Δt (12 and 15, respectively). It suggests that the fast disturbance of the magnetosphere, as a result of the solar wind speed increase, is a key parameter in a related space weather-earthquake prediction research. The Solar-magnetosphere-lithosphere coupling and their possible special characteristics during the period examined needs further investigation, since it could provide significant information on the underlying physical relation processes of strong earthquakes. △ Less

Submitted 16 December, 2010; originally announced December 2010.

Comments: 26 pages, 8 figures

arXiv:1012.1533 [pdf]

doi 10.5194/nhess-11-1091-2011

Enhanced ULF radiation observed by DEMETER two months around the strong 2010 Haiti earthquake

Authors: M. Athanasiou, G. Anagnostopoulos, A. Iliopoulos, G. Pavlos, K. David

Abstract: In this paper we study the energy of ULF electromagnetic waves that have been recorded by the satellite DEMETER, during its passing over Haiti before and after a destructive earthquake. This earthquake occurred on 12/1/2010, at geographic Latitude 18.46o and Longitude 287.47o, with Magnitude 7.0 R. Specifically, we are focusing on the variations of energy of Ez-electric field component concerning… ▽ More In this paper we study the energy of ULF electromagnetic waves that have been recorded by the satellite DEMETER, during its passing over Haiti before and after a destructive earthquake. This earthquake occurred on 12/1/2010, at geographic Latitude 18.46o and Longitude 287.47o, with Magnitude 7.0 R. Specifically, we are focusing on the variations of energy of Ez-electric field component concerning a time period of 100 days before and 50 days after the strong earthquake. In order to study these variations, we developed a novel method that can be divided in two stages: first we filter the signal kee** only the very low frequencies and afterwards we eliminate its trend using techniques of Singular Spectrum Analysis, combined with a third-degree polynomial filter. As it is shown, a significant increase in energy is observed for the time interval of 30 days before the strong earthquake. This result clearly indicates that the change in the energy of ULF electromagnetic waves could be related to strong precursory earthquake phenomena. Moreover, changes in energy were also observed 25 days after the strong earthquake associated with strong aftershock activity. Finally, we present results concerning the comparison in changes in Energy during night and day passes of the satellite over Haiti, which showed differences in the mean energy values, but similar results as far as the rate of energy change is concerned. △ Less

Submitted 7 December, 2010; originally announced December 2010.

Comments: 16 pages, 7 figures, submitted to NHESS

Showing 1–29 of 29 results for author: Anagnostopoulos, G