-
Study of a Hybrid Photovoltaic-Wind Smart Microgrid using Data Science Approach
Authors:
Josimar Edinson Chire Saire,
José Armando Gastelo Roque,
Franco Canziani
Abstract:
In this paper, a smart microgrid implemented in Paracas, Ica, Peru, composed of 6 kWp of PV and 6 kW of wind generation and providing electricity to a rural community of 40 families, was studied using a data science approach. Real data on solar irradiance, wind speed, energy demand, and battery bank voltage from two periods of operation were studied to find patterns, seasonality, and correlations among the variables. Among the main results are the periodicity of renewable resources and demand, the weekly behavior of electricity demand and its progressive increase from an average of 0.7 kW in 2019 to 1.2 kW in 2021, and the recurrence of power outages at certain hours in the morning when resources are low or there is a failure in the battery bank. These data will be used to improve sizing techniques and provide recommendations for energy management to optimize the performance of smart microgrids.
Submitted 13 May, 2021;
originally announced May 2021.
-
Analysis of Users Reaction around Impeachment in Peru using Twitter
Authors:
Josimar Edinson Chire Saire,
Esteban Wilfredo Vilca Zuñiga
Abstract:
The Covid-19 pandemic generated many problems and exposed other hidden issues in South American countries. Each government analyzed its own context and decided which health policies to adopt. Peru is a country in the middle of the South American region; its first reported case was on March 6. In addition, a lockdown was established at land, sea, and air borders. The Peruvian government analyzed the context and proposed many policies on health, the economy, employment, and transport. However, these actions were not enough because of the existing lack of hospital infrastructure, a result of past governments. On the other hand, the variety of political parties in Parliament and their pursuit of their own interests became evident during this pandemic period. Given the lack of success of the health and economic policies, discussion of a possible impeachment started. Therefore, the main aim of this work is to find evidence of what users were talking about and what the impact was on the Peruvian population, using Twitter.
Submitted 12 October, 2020; v1 submitted 9 October, 2020;
originally announced October 2020.
-
Parameter Experimental Analysis of the Reservoirs Observers using Echo State Network Approach
Authors:
Diana C. Roca Arroyo,
Josimar E. Chire Saire
Abstract:
Dynamical systems have a variety of applications because new information is generated over time. Many phenomena, whether physical, chemical, or social, are not static, so an analysis over time is necessary. In this work, an experimental analysis of the parameters of the Echo State Network model is performed, and the kind of complex network used is explored to understand its influence on performance. The experiments are performed using the Rössler attractor.
Submitted 28 September, 2020;
originally announced September 2020.
-
Text Mining over Curriculum Vitae of Peruvian Professionals using Official Scientific Site DINA
Authors:
Josimar Edinson Chire Saire,
Honorio Apaza Alanoca
Abstract:
During the last decade, the Peruvian government started to invest in and promote science and technology through Concytec (National Council of Science and Technology). Many programs support research projects, expenses for paper presentations, the organization of conferences and events, and more. Concytec created a National Directory of Researchers (DINA) where professionals can create and add their curriculum vitae, and Concytec can grant the official title of Researcher according to evaluation criteria. This paper conducts an exploratory analysis of the curricula vitae of Peruvian professionals using a data mining approach to understand the Peruvian context.
Submitted 7 September, 2020;
originally announced September 2020.
-
Data Mining Approach to Analyze Covid19 Dataset of Brazilian Patients
Authors:
Josimar E. Chire Saire
Abstract:
The pandemic was caused by a coronavirus, and the disease was named Covid-19 by the World Health Organization in early 2020. Currently, almost all countries have reported positive Covid-19 cases. Governments are choosing different health policies to stop the infection, many research groups are working on patient data to understand the virus, and scientists are looking for a vaccine to strengthen the immune system against the Covid-19 virus. Brazil is one of the countries with the most infections; as of August 11, it had a total of 3,112,393 cases. The Research Foundation of Sao Paulo State (Fapesp) released a dataset in an innovative collaboration with hospitals (Einstein, Sirio-Libanes), a laboratory (Fleury), and Sao Paulo University to foster research on this trending topic. This paper presents an exploratory analysis of the datasets using a data mining approach. Several inconsistencies were found, namely NaN values, null reference values for analytes, outliers in analyte results, and encoding issues. The results are cleaned datasets for future studies, although at least 20% of the data were discarded because of non-numerical values, null values, and numbers outside the reference range.
Submitted 25 August, 2020;
originally announced August 2020.
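As an illustration of the cleaning step described in the abstract above, the following minimal sketch discards records whose result is non-numerical, null, or outside the reference range. The field names (`value`, `ref_low`, `ref_high`) and the sample records are hypothetical and do not come from the Fapesp dataset; the authors' actual procedure may differ.

```python
import math


def clean_results(records):
    """Split records into (kept, dropped).

    A record is kept only if its value and reference bounds parse as
    numbers, none of them is NaN, and the value lies inside the range.
    Field names are hypothetical, chosen for this sketch.
    """
    kept, dropped = [], []
    for rec in records:
        try:
            value = float(rec["value"])
            low = float(rec["ref_low"])
            high = float(rec["ref_high"])
        except (KeyError, TypeError, ValueError):
            # Missing field, null value, or non-numerical text.
            dropped.append(rec)
            continue
        if any(math.isnan(x) for x in (value, low, high)):
            dropped.append(rec)
        elif not (low <= value <= high):
            # Number outside the reference range.
            dropped.append(rec)
        else:
            kept.append(rec)
    return kept, dropped


# Hypothetical sample records, for illustration only.
sample = [
    {"value": "13.5", "ref_low": "12", "ref_high": "16"},
    {"value": "NaN", "ref_low": "12", "ref_high": "16"},
    {"value": None, "ref_low": "12", "ref_high": "16"},
    {"value": "40", "ref_low": "12", "ref_high": "16"},
    {"value": "n/a", "ref_low": "12", "ref_high": "16"},
]
kept, dropped = clean_results(sample)
print(len(kept), "kept;", len(dropped), "dropped")
```

Keeping the dropped records in a separate list, rather than deleting them silently, makes it possible to report what fraction of the data was discarded.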
-
Curriculum Vitae Recommendation Based on Text Mining
Authors:
Honorio Apaza Alanoca,
Americo A. Rubin de Celis Vidal,
Josimar Edinson Chire Saire
Abstract:
In recent years, developments in diverse areas related to computer science and the internet have created new alternatives for decision making in personnel selection for state and private companies. To optimize this selection process, recommendation systems are the most suitable tools for working with explicit information about the likes and dislikes of employers or end users, since this information makes it possible to generate recommendation lists based on collaboration or content similarity. Therefore, this research builds on these characteristics as contained in a database of curricula and job offers from the Peruvian context, which highlights the experience, knowledge, and skills of each candidate, described in textual terms or words. This research focuses on the following problem: how can we take advantage of the growth of unstructured information about job offers and curricula vitae on different websites for CV recommendation? To address it, we use techniques from Text Mining and Natural Language Processing. As the key technique for the present study, we emphasize Term Frequency - Inverse Document Frequency (TF-IDF), which allows identifying the most relevant CVs for a website job offer through the average TF-IDF values. The weighted value can then be used as a score for ranking the relevant curricula vitae for recommendation.
Submitted 21 July, 2020;
originally announced July 2020.
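The TF-IDF scoring mentioned in the abstract above can be sketched as follows. This is one possible reading, not the authors' implementation: it assumes whitespace tokenization, a plain `tf * log(N / df)` weighting, and a CV score equal to the average TF-IDF weight of the job offer's terms in that CV. The function names and the sample texts are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a simplifying assumption)."""
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one {term: tf-idf weight} dict per tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def rank_cvs(job_offer, cvs):
    """Return (score, cv_index) pairs sorted from best to worst match."""
    query = set(tokenize(job_offer))
    if not query:
        return []
    vectors = tfidf_vectors([tokenize(cv) for cv in cvs])
    scores = [
        (sum(vec.get(term, 0.0) for term in query) / len(query), index)
        for index, vec in enumerate(vectors)
    ]
    return sorted(scores, reverse=True)


# Hypothetical CV texts and job offer, for illustration only.
cvs = [
    "python data mining text mining",
    "accounting finance audit",
    "java web developer python",
]
ranking = rank_cvs("python text mining", cvs)
print([index for _, index in ranking])  # CV indices, best first: [0, 2, 1]
```

Terms that appear in every CV receive a weight of zero under this weighting, so only terms that discriminate between candidates contribute to the score.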
-
Machine Learning Pipeline for Pulsar Star Dataset
Authors:
Alexander Ylnner Choquenaira Florez,
Braulio Valentin Sanchez Vinces,
Diana Carolina Roca Arroyo,
Josimar Edinson Chire Saire,
Patrícia Batista Franco
Abstract:
This work brings together some of the most common machine learning (ML) algorithms, with the objective of comparing the results they obtain on an unbalanced dataset. The dataset (HTRU2) is composed of almost 17 thousand observations of astronomical objects, made to identify pulsars. The methodological proposal is based on evaluating the accuracy of these different models on the same database treated with two different strategies for unbalanced data. The results show that, despite the noise and class imbalance present in this type of data, standard ML algorithms can be applied to it and obtain promising accuracy.
Submitted 3 May, 2020;
originally announced May 2020.
-
What is the people posting about symptoms related to Coronavirus in Bogota, Colombia?
Authors:
Josimar E. Chire Saire,
Roberto C. Navarro
Abstract:
In recent months, there has been increasing alarm about a new mutation of coronavirus, named Covid-19 by the World Health Organization (WHO), with an impact on many areas: the economy, health, politics, and others. The situation was declared a pandemic by the WHO because of its fast spread across many countries. At the same time, people are using social networks to express what they think, feel, or experience, so these people act as social sensors and help to analyze what is happening in their city. The objective of this paper is to analyze the posts of Colombian people living within a 50 km radius of Bogota using Text Mining techniques from a symptomatology approach. The results support the understanding of the spread in Colombia in relation to Covid-19 symptoms.
Submitted 24 March, 2020;
originally announced March 2020.