-
Mobile Traffic Classification through Physical Channel Fingerprinting: a Deep Learning Approach
Authors:
Hoang Duy Trinh,
Angel Fernandez Gambin,
Lorenza Giupponi,
Michele Rossi,
Paolo Dini
Abstract:
The automatic classification of applications and services is an invaluable feature for new generation mobile networks. Here, we propose and validate algorithms to perform this task, at runtime, from the raw physical channel of an operative mobile network, without having to decode and/or decrypt the transmitted flows. Towards this, we decode Downlink Control Information (DCI) messages carried within the LTE Physical Downlink Control CHannel (PDCCH). DCI messages are sent by the radio cell in clear text and, in this paper, are utilized to classify the applications and services executed at the connected mobile terminals. Two datasets are collected through a large measurement campaign: one labeled, used to train the classification algorithms, and one unlabeled, collected from four radio cells in the metropolitan area of Barcelona, in Spain. Among other approaches, our Convolutional Neural Network (CNN) classifier provides the highest classification accuracy of 99%. The CNN classifier is then augmented with the capability of rejecting sessions whose patterns do not conform to those learned during the training phase, and is subsequently utilized to attain a fine grained decomposition of the traffic for the four monitored radio cells, in an online and unsupervised fashion.
Submitted 7 February, 2020; v1 submitted 25 October, 2019;
originally announced October 2019.
-
Adaptive Resource Management for a Virtualized Computing Platform within Edge Computing
Authors:
Thembelihle Dlamini,
Ángel Fernandez Gambin
Abstract:
In virtualized computing platforms, energy consumption is related to the computing-plus-communication processes. However, most of the proposed energy consumption models and energy saving solutions found in the literature consider only the active Virtual Machines (VMs), thus the overall operational energy expenditure is usually related solely to the computation process. To address this shortcoming, in this paper we consider a computing-plus-communication energy model, within the Multi-access Edge Computing (MEC) paradigm, and then put forward a combination of a traffic engineering- and MEC Location Service-based online server management algorithm with Energy Harvesting (EH) capabilities, called Automated Resource Controller for Energy-aware Server (ARCES), for autoscaling and reconfiguring the computing-plus-communication resources. The main goal is to minimize the overall energy consumption, under hard per-task delay constraints (i.e., Quality of Service (QoS)). ARCES jointly performs (i) a short-term server demand and harvested solar energy forecasting, (ii) VM soft-scaling, workload and processing rate allocation and lastly, (iii) switching on/off of transmission drivers (i.e., fast tunable lasers) coupled with the location-aware traffic scheduling. Our numerical results reveal that ARCES achieves on average energy savings of 69%, and an energy consumption ranging from 31%-45% and from 21%-25% at different values of per-VM reconfiguration cost, with respect to the case where no energy management is applied.
Submitted 12 June, 2019;
originally announced June 2019.
-
Jaccard/Tanimoto similarity test and estimation methods
Authors:
Neo Christopher Chung,
Błażej Miasojedow,
Michał Startek,
Anna Gambin
Abstract:
Binary data are used in a broad area of biological sciences. Using binary presence-absence data, we can evaluate species co-occurrences that help elucidate relationships among organisms and environments. To summarize similarity between occurrences of species, we routinely use the Jaccard/Tanimoto coefficient, which is the ratio of their intersection to their union. It is natural, then, to identify statistically significant Jaccard/Tanimoto coefficients, which suggest non-random co-occurrences of species. However, statistical hypothesis testing using this similarity coefficient has been seldom used or studied.
We introduce a hypothesis test for similarity for biological presence-absence data, using the Jaccard/Tanimoto coefficient. Several key improvements are presented including unbiased estimation of expectation and centered Jaccard/Tanimoto coefficients, that account for occurrence probabilities. We derived the exact and asymptotic solutions and developed the bootstrap and measurement concentration algorithms to compute statistical significance of binary similarity. Comprehensive simulation studies demonstrate that our proposed methods produce accurate p-values and false discovery rates. The proposed estimation methods are orders of magnitude faster than the exact solution. The proposed methods are implemented in an open source R package called jaccard (https://cran.r-project.org/package=jaccard).
We introduce a suite of statistical methods for the Jaccard/Tanimoto similarity coefficient, that enable straightforward incorporation of probabilistic measures in analysis for species co-occurrences. Due to their generality, the proposed methods and implementations are applicable to a wide range of binary data arising from genomics, biochemistry, and other areas of science.
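As a brief illustration of the coefficient defined in the abstract above (the ratio of the intersection to the union of two presence-absence vectors), here is a minimal sketch in Python. It is not the authors' R package and does not implement their hypothesis test or centered estimator; the function name `jaccard_coefficient` and the convention of returning 0.0 when both vectors are empty are assumptions made for this example.

```python
from typing import Sequence


def jaccard_coefficient(x: Sequence[int], y: Sequence[int]) -> float:
    """Plain Jaccard/Tanimoto coefficient of two equal-length binary vectors.

    Hypothetical helper for illustration only; it is unrelated to the
    API of the `jaccard` R package mentioned in the abstract.
    """
    if len(x) != len(y):
        raise ValueError("presence-absence vectors must have the same length")
    # Sites where both species are present.
    intersection = sum(1 for a, b in zip(x, y) if a and b)
    # Sites where at least one species is present.
    union = sum(1 for a, b in zip(x, y) if a or b)
    # Assumed convention: two all-absent vectors give 0.0 rather than an error.
    return intersection / union if union else 0.0


# Two species observed at five sites: both present at sites 0 and 3,
# at least one present at sites 0, 1, 3 and 4.
species_a = [1, 1, 0, 1, 0]
species_b = [1, 0, 0, 1, 1]
similarity = jaccard_coefficient(species_a, species_b)
```

Here the coefficient is 2/4 = 0.5; judging whether such a value is statistically significant is the question the paper's test addresses.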
Submitted 27 March, 2019;
originally announced March 2019.
-
Online Resource Management in Energy Harvesting BS Sites through Prediction and Soft-Scaling of Computing Resources
Authors:
Thembelihle Dlamini,
Angel Fernandez Gambin,
Daniele Munaretto,
Michele Rossi
Abstract:
Multi-Access Edge Computing (MEC) is a paradigm for handling delay sensitive services that require ultra-low latency at the access network. With it, computing and communications are performed within one Base Station (BS) site, where the computation resources are in the form of Virtual Machines (VMs) (computer emulators) in the MEC server. MEC and Energy Harvesting (EH) BSs, i.e., BSs equipped with EH equipment, are foreseen as key to next-generation mobile networks. In fact, EH systems are expected to decrease the energy drained from the electricity grid and facilitate the deployment of BSs in remote places, extending network coverage and making energy self-sufficiency possible in remote/rural sites. In this paper, we propose an online optimization algorithm called ENergy Aware and Adaptive Management (ENAAM), for managing remote BS sites through foresighted control policies exploiting (short-term) traffic load and harvested energy forecasts. Our numerical results reveal that ENAAM achieves energy savings with respect to the case where no energy management is applied, ranging from 56% to 66% through the scaling of computing resources, and keeps the server utilization factor between 30% and 96% over time (with an average of 75%). Notable benefits are also found against heuristic energy management techniques.
Submitted 14 February, 2019;
originally announced February 2019.
-
Online Supervisory Control and Resource Management for Energy Harvesting BS Sites Empowered with Computation Capabilities
Authors:
Thembelihle Dlamini,
Angel Fernandez Gambin,
Daniele Munaretto,
Michele Rossi
Abstract:
The convergence of communication and computing has led to the emergence of Multi-access Edge Computing (MEC), where computing resources (supported by Virtual Machines (VMs)) are distributed at the edge of the Mobile Network (MN), i.e., in Base Stations (BSs), with the aim of ensuring reliable and ultra-low latency services. Moreover, BSs equipped with Energy Harvesting (EH) systems can decrease the amount of energy drained from the power grid, resulting in energetically self-sufficient MNs. The combination of these paradigms is considered here. Specifically, we propose an online optimization algorithm, called ENergy Aware and Adaptive Management (ENAAM), based on foresighted control policies exploiting (short-term) traffic load and harvested energy forecasts, where BSs and VMs are dynamically switched on/off towards energy savings and QoS provisioning. Our numerical results reveal that ENAAM achieves energy savings with respect to the case where no energy management is applied, ranging from 57% to 69%. Moreover, the extension of ENAAM within a cluster of BSs provides a further gain ranging from 9% to 16% in energy savings with respect to the optimization performed in isolation for each BS.
Submitted 14 February, 2019;
originally announced February 2019.
-
Energy Sustainable Mobile Networks via Energy Routing, Learning and Foresighted Optimization
Authors:
Angel Fernandez Gambin,
Maria Scalabrin,
Michele Rossi
Abstract:
The design of self-sustainable base station (BS) deployments is addressed in this paper: BSs have energy harvesting and storage capabilities, they can use ambient energy to serve the local traffic or store it for later use. A dedicated power packet grid allows energy transfer across BSs, compensating for imbalance in the harvested energy or in the traffic load. Some BSs are offgrid, i.e., they can only use the locally harvested energy and that transferred from other BSs, whereas others are ongrid, i.e., they can also purchase energy from the power grid. Within this setup, an optimization problem is formulated where: energy harvested and traffic processes are estimated at the BSs through Gaussian Processes (GPs), and a Model Predictive Control (MPC) framework is devised for the computation of energy allocation and transfer schedules. Numerical results, obtained using real energy harvesting and traffic profiles, show substantial improvements in terms of energy self-sustainability of the system, outage probability (zero in most cases), and in the amount of energy purchased from the power grid, which is more than halved with respect to the case where the optimization does not consider GP forecasting and MPC.
Submitted 16 March, 2018;
originally announced March 2018.
-
Energy sustainable paradigms and methods for future mobile networks: A survey
Authors:
Nicola Piovesan,
Angel Fernandez Gambin,
Marco Miozzo,
Michele Rossi,
Paolo Dini
Abstract:
In this survey, we discuss the role of energy in the design of future mobile networks and, in particular, we advocate and elaborate on the use of energy harvesting (EH) hardware as a means to decrease the environmental footprint of 5G technology. To take full advantage of the harvested (renewable) energy, while still meeting the quality of service required by dense 5G deployments, suitable management techniques are here reviewed, highlighting the open issues that are still to be solved to provide eco-friendly and cost-effective mobile architectures. Several solutions have recently been proposed to tackle capacity, coverage and efficiency problems, including: C-RAN, Software Defined Networking (SDN) and fog computing, among others. However, these are not explicitly tailored to increase the energy efficiency of networks featuring renewable energy sources, and have the following limitations: (i) their energy savings are in many cases still insufficient and (ii) they do not consider network elements possessing energy harvesting capabilities. In this paper, we systematically review existing energy sustainable paradigms and methods to address points (i) and (ii), discussing how these can be exploited to obtain highly efficient, energy self-sufficient and high capacity networks. Several open issues have emerged from our review, ranging from the need for accurate energy, transmission and consumption models, to the lack of accurate data traffic profiles, to the use of power transfer, energy cooperation and energy trading techniques. These challenges are here discussed along with some research directions to follow for achieving sustainable 5G systems.
Submitted 23 January, 2018;
originally announced January 2018.
-
Assigning peaks and modeling ETD in top-down mass spectrometry
Authors:
Mateusz Krzysztof Łącki,
Frederik Lermyte,
Błażej Miasojedow,
Mikołaj Olszański,
Michał Startek,
Frank Sobott,
Dirk Valkenborg,
Anna Gambin
Abstract:
Among the many techniques of modern mass spectrometry, top-down methods are becoming increasingly popular in the overall effort to describe the proteome. These techniques are based on the fragmentation of ions inside mass spectrometers rather than on proteolytic digestion. In some of these techniques, the fragmentation is induced by electron transfer. It can trigger several concurrent reactions: electron transfer dissociation, electron transfer without dissociation, and proton transfer reaction. Evaluating the extent of these reactions is important for a proper understanding of the functioning of the instrument and, even more importantly, for knowing whether it can be used to reveal important structural information. We present a workflow for assigning peaks and interpreting the results of electron transfer driven reactions. We also present software written in Python and available under GNU v3 license.
Submitted 25 August, 2017; v1 submitted 1 August, 2017;
originally announced August 2017.
-
Modelling the proliferation of transposable elements in populations under environmental stress
Authors:
K. Gogolewski,
M. Startek,
A. Gambin,
A. Le Rouzic
Abstract:
In this article, we investigate the evolution of sexual diploid populations that host active transposable element (TE) families. Our purpose is to explore the relationship between the environmental change that influences such populations and the activity of the TEs present in their genomes.
Submitted 15 November, 2016;
originally announced November 2016.
-
Computational model of sphingolipids metabolism: a case study of Alzheimer's disease
Authors:
Agata Charzyńska,
Weronika Wronowska,
Karol Nienałtowski,
Anna Gambin
Abstract:
Background: Sphingolipids - as suggested by the prefix in their name - are mysterious molecules, which play surprisingly varied roles in opposing cellular processes, like autophagy, apoptosis, proliferation and differentiation. Recently they have also been recognized as important messengers in cellular signalling pathways. More importantly, sphingolipid metabolism disorders have been observed in various pathological conditions such as cancer and neurodegeneration. Results: Existing formal models of sphingolipid metabolism concentrate mostly on de novo ceramide synthesis or restrict their focus to biochemical transformations of a particular subspecies. We propose the first comprehensive computational model of sphingolipid metabolism in human tissue. In contrast to previous approaches, we explicitly model compartmentalization, which allows us to emphasize the differences among individual organelles. Conclusions: The model presented here was validated by means of recently proposed model analysis techniques, which allow for the detection of the most sensitive and experimentally non-identifiable parameters and the determination of the main sources of model variance. Moreover, we demonstrate the utility of the model for the study of molecular processes underlying Alzheimer's disease.
Submitted 10 November, 2014;
originally announced November 2014.
-
Law of Localised Fine Structure with application in mass spectrometry
Authors:
Mateusz Krzysztof Łącki,
Anna Gambin
Abstract:
This paper presents a brand new methodology to deal with isotopic fine structure calculations. By using the Poisson approximation in an entirely novel way, we introduce mathematical elegance into the discussion on the trade-off between resolution and tractability. Our considerations unify the concepts of fine-structure, equatransneutronic configurations, and aggregate isotopic structure in a natural and simple way. We show how to boost the theoretical resolution in a seemingly costless way by several orders of magnitude with respect to the already very efficient algorithms operating on isotopic aggregates. We also develop an effective new way to obtain the important peaks in the most disaggregated isotopic structure localised in a precise region in the mass domain.
Submitted 24 October, 2014;
originally announced October 2014.
-
StochDecomp - Matlab package for noise decomposition in stochastic biochemical systems
Authors:
Tomasz Jetka,
Agata Charzynska,
Anna Gambin,
Michael P. H. Stumpf,
Michal Komorowski
Abstract:
Stochasticity is an indispensable aspect of biochemical processes at the cellular level. Studies on how noise enters and propagates in biochemical systems have provided us with nontrivial insights into the origins of stochasticity; in total, however, they constitute a patchwork of different theoretical analyses. Here we present a flexible and generally applicable noise decomposition tool that allows us to calculate contributions of individual reactions to the total variability of a system's output. With the package it is therefore possible to quantify how the noise enters and propagates in biochemical systems. We also demonstrate, using the JAK-STAT signalling pathway as an example, that it is possible to infer noise contributions resulting from individual reactions directly from experimental data. This is the first computational tool that allows noise to be decomposed into contributions resulting from individual reactions.
Submitted 14 August, 2013;
originally announced August 2013.
-
Modelling the efficacy of hyperthermia treatment
Authors:
Mikołaj Rybiński,
Zuzanna Szymańska,
Sławomir Lasota,
Anna Gambin
Abstract:
Multimodal oncological strategies which combine chemotherapy or radiotherapy with hyperthermia have the potential to improve the efficacy of the non-surgical methods of cancer treatment. Hyperthermia engages the heat-shock response mechanism (HSR), the main components of which are heat-shock proteins (HSP). Cancer cells have already partially activated HSR; therefore, hyperthermia may be more toxic to them than to normal cells. On the other hand, HSR triggers thermotolerance, i.e. hyperthermia treated cells show an impairment in their susceptibility to a subsequent heat-induced stress. This poses questions about efficacy and optimal strategy of the anti-cancer therapy combined with hyperthermia treatment.
To address these questions, we adapt our previous HSR model and propose its stochastic extension. We formalise the notion of an HSP-induced thermotolerance. Next, we estimate the intensity and the duration of the thermotolerance. Finally, we quantify the effect of a multimodal therapy based on hyperthermia and a cytotoxic effect of bortezomib, a clinically approved proteasome inhibitor. Consequently, we propose an optimal strategy for combining hyperthermia and proteasome inhibition modalities.
In summary, by a proof of concept mathematical analysis of HSR we are able to support the common belief that the combination of cancer treatment strategies increases therapy efficacy.
Submitted 6 March, 2013; v1 submitted 18 September, 2012;
originally announced September 2012.
-
On subset seeds for protein alignment
Authors:
Mikhail A. Roytberg,
Anna Gambin,
Laurent Noé,
Slawomir Lasota,
Eugenia Furletova,
Ewa Szczurek,
Gregory Kucherov
Abstract:
We apply the concept of subset seeds proposed in [1] to similarity search in protein sequences. The main question studied is the design of efficient seed alphabets to construct seeds with optimal sensitivity/selectivity trade-offs. We propose several different design methods and use them to construct several alphabets. We then perform a comparative analysis of seeds built over those alphabets and compare them with the standard BLASTP seeding method [2], [3], as well as with the family of vector seeds proposed in [4]. While the formalism of subset seeds is less expressive (but less costly to implement) than the cumulative principle used in BLASTP and vector seeds, our seeds show a similar or even better performance than BLASTP on Bernoulli models of proteins compatible with the common BLOSUM62 matrix. Finally, we perform a large-scale benchmarking of our seeds against several main databases of protein alignments. Here again, the results show a comparable or better performance of our seeds vs. BLASTP.
Submitted 21 January, 2009;
originally announced January 2009.
-
Efficient seeding techniques for protein similarity search
Authors:
Mikhail Roytberg,
Anna Gambin,
Laurent Noé,
Slawomir Lasota,
Eugenia Furletova,
Ewa Szczurek,
Gregory Kucherov
Abstract:
We apply the concept of subset seeds proposed in [1] to similarity search in protein sequences. The main question studied is the design of efficient seed alphabets to construct seeds with optimal sensitivity/selectivity trade-offs. We propose several different design methods and use them to construct several alphabets. We then perform an analysis of seeds built over those alphabets and compare them with the standard Blastp seeding method [2,3], as well as with the family of vector seeds proposed in [4]. While the formalism of subset seeds is less expressive (but less costly to implement) than the cumulative principle used in Blastp and vector seeds, our seeds show a similar or even better performance than Blastp on Bernoulli models of proteins compatible with the common BLOSUM62 matrix.
Submitted 30 October, 2008;
originally announced October 2008.