-
A theory of best choice selection through objective arguments grounded in Linear Response Theory concepts
Authors:
Marcel Ausloos,
Giulia Rotundo,
Roy Cerqueti
Abstract:
In this paper, we propose how to use objective arguments grounded in statistical mechanics concepts in order to obtain a single number, obtained after aggregation, which would allow to rank "agents", "opinions", ..., all defined in a very broad sense. We aim toward any process which should a priori demand or lead to some consensus in order to attain the presumably best choice among many possibilit…
▽ More
In this paper, we propose how to use objective arguments grounded in statistical mechanics concepts in order to obtain a single number, obtained after aggregation, which would allow to rank "agents", "opinions", ..., all defined in a very broad sense. We aim toward any process which should a priori demand or lead to some consensus in order to attain the presumably best choice among many possibilities. In order to precise the framework, we discuss previous attempts, recalling trivial "means of scores", - weighted or not, Condorcet paradox, TOPSIS, etc. We demonstrate through geometrical arguments on a toy example, with 4 criteria, that the pre-selected order of criteria in previous attempts makes a difference on the final result. However, it might be unjustified. Thus, we base our "best choice theory" on the linear response theory in statistical mechanics: we indicate that one should be calculating correlations functions between all possible choice evaluations, thereby avoiding an arbitrarily ordered set of criteria. We justify the point through an example with 6 possible criteria. Applications in many fields are suggested. Beside, two toy models serving as practical examples and illustrative arguments are given in an Appendix.
△ Less
Submitted 30 March, 2024;
originally announced May 2024.
-
Hierarchy Selection: New team ranking indicators for cyclist multi-stage races
Authors:
Marcel Ausloos
Abstract:
In this paper, I report some investigation discussing team selection, whence hierarchy, through ranking indicators, for example when measuring professional cyclist team's sportive value, in particular in multistage races. A logical, it seems, constraint is introduced on the riders: they must finish the race. Several new indicators are defined, justified, and compared. These indicators are mainly b…
▽ More
In this paper, I report some investigation discussing team selection, whence hierarchy, through ranking indicators, for example when measuring professional cyclist team's sportive value, in particular in multistage races. A logical, it seems, constraint is introduced on the riders: they must finish the race. Several new indicators are defined, justified, and compared. These indicators are mainly based on the arriving place of (the best 3) riders instead of their time needed for finishing the stage or the race, - as presently classically used. A case study, serving as an illustration containing the necessary ingredients for a wider discussion, is the 2023 Vuelta de San Juan, but without loss of generality.
It is shown that the new indicators offer some new viewpoint for distinguishing the ranking through the cumulative sums of the places of riders rather than their finishing times. On the other hand, the indicators indicate a different team hierarchy if only the finishing riders are considered. Some consideration on the distance between ranking indicators is presented.
Moreover, it is argued that these new ranking indicators should hopefully promote more competitive races, not only till the end of the race, but also until the end of each stage. Generalizations and other applications within operational research topics, like in academia, are suggested.
△ Less
Submitted 22 February, 2024;
originally announced April 2024.
-
Unleashing the Power of AI. A Systematic Review of Cutting-Edge Techniques in AI-Enhanced Scientometrics, Webometrics, and Bibliometrics
Authors:
Hamid Reza Saeidnia,
Elaheh Hosseini,
Shadi Abdoli,
Marcel Ausloos
Abstract:
Purpose: The study aims to analyze the synergy of Artificial Intelligence (AI), with scientometrics, webometrics, and bibliometrics to unlock and to emphasize the potential of the applications and benefits of AI algorithms in these fields.
Design/methodology/approach: By conducting a systematic literature review, our aim is to explore the potential of AI in revolutionizing the methods used to me…
▽ More
Purpose: The study aims to analyze the synergy of Artificial Intelligence (AI), with scientometrics, webometrics, and bibliometrics to unlock and to emphasize the potential of the applications and benefits of AI algorithms in these fields.
Design/methodology/approach: By conducting a systematic literature review, our aim is to explore the potential of AI in revolutionizing the methods used to measure and analyze scholarly communication, identify emerging research trends, and evaluate the impact of scientific publications. To achieve this, we implemented a comprehensive search strategy across reputable databases such as ProQuest, IEEE Explore, EBSCO, Web of Science, and Scopus. Our search encompassed articles published from January 1, 2000, to September 2022, resulting in a thorough review of 61 relevant articles.
Findings: (i) Regarding scientometrics, the application of AI yields various distinct advantages, such as conducting analyses of publications, citations, research impact prediction, collaboration, research trend analysis, and knowledge map**, in a more objective and reliable framework. (ii) In terms of webometrics, AI algorithms are able to enhance web crawling and data collection, web link analysis, web content analysis, social media analysis, web impact analysis, and recommender systems. (iii) Moreover, automation of data collection, analysis of citations, disambiguation of authors, analysis of co-authorship networks, assessment of research impact, text mining, and recommender systems are considered as the potential of AI integration in the field of bibliometrics.
Originality/value: This study covers the particularly new benefits and potential of AI-enhanced scientometrics, webometrics, and bibliometrics to highlight the significant prospects of the synergy of this integration through AI.
△ Less
Submitted 22 February, 2024;
originally announced March 2024.
-
Identification of the most important external features of highly cited scholarly papers through 3 (i.e., Ridge, Lasso, and Boruta) feature selection data mining methods
Authors:
Sepideh Fahimifar,
Khadijeh Mousavi,
Fatemeh Mozaffari,
Marcel Ausloos
Abstract:
Highly cited papers are influenced by external factors that are not directly related to the document's intrinsic quality. In this study, 50 characteristics for measuring the performance of 68 highly cited papers, from the Journal of the American Medical Informatics Association indexed in Web of Sciences (WoS), from 2009 to 2019 were investigated. In the first step, a Pearson correlation analysis i…
▽ More
Highly cited papers are influenced by external factors that are not directly related to the document's intrinsic quality. In this study, 50 characteristics for measuring the performance of 68 highly cited papers, from the Journal of the American Medical Informatics Association indexed in Web of Sciences (WoS), from 2009 to 2019 were investigated. In the first step, a Pearson correlation analysis is performed to eliminate variables with zero or weak correlation with the target (dependent) variable ([number of citations in WOS]). Consequently, 32 variables are selected for the next step. By applying the Ridge technique, 13 features show a positive effect on the number of citations. Using three different algorithms, i.e., Ridge, Lasso, and Boruta, 6 factors appear to be the most relevant ones. The [Number of citations by international researchers], [Journal self-citations in citing documents], and [Authors' self-citations in citing documents], are recognized as the most important features by all three methods here used. The [First author's scientific age], [Open-access paper], and [Number of first author's citations in WOS] are identified as the important features of highly cited papers by only two methods, Ridge and Lasso. Notice that we use specific machine learning algorithms as feature selection methods (Ridge, Lasso, and Boruta) to identify the most important features of highly cited papers, tools that had not previously been used for this purpose. In conclusion, we re-emphasize the performance resulting from such algorithms. Moreover, we do not advise authors to seek to increase the citations of their articles by manipulating the identified performance features. Indeed, ethical rules regarding these characteristics must be strictly obeyed.
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
God ($\equiv Elohim$), the first small world network
Authors:
Marcel Ausloos
Abstract:
In this paper, the approach of network map** of words in literary texts is extended to ''textual factors'': the network nodes are defined as ''concepts''; the links are ''community connexions''. Thereafter, the text network properties are investigated along modern statistical physics approaches of networks, thereby relating network topology and algebraic properties, to literary texts contents. A…
▽ More
In this paper, the approach of network map** of words in literary texts is extended to ''textual factors'': the network nodes are defined as ''concepts''; the links are ''community connexions''. Thereafter, the text network properties are investigated along modern statistical physics approaches of networks, thereby relating network topology and algebraic properties, to literary texts contents. As a practical illustration, the first chapter of the Genesis in the Bible is mapped into a 10 node network, as in the Kabbalah approach, mentioning God ($\equiv Elohim$). The characteristics of the network are studied starting from its adjacency matrix, and the corresponding Laplacian matrix. Triplets of nodes are particularly examined in order to emphasize the ''textual (community) connexions'' of each agent "emanation", through the so called clustering coefficients and the overlap index, whence measuring the ''semantic flow'' between the different nodes. It is concluded that this graph is a small-world network, weakly dis-assortative, because its average local clustering coefficient is significantly higher than a random graph constructed on the same vertex set.
△ Less
Submitted 20 June, 2022;
originally announced August 2022.
-
Are We Standing on Unreliable Shoulders? The Effect of Retracted Papers Citations on Previous and Subsequent Published Papers: A Study of the Web of Science Database
Authors:
Sepideh Fahimifar,
Ali Ghorbi,
Marcel Ausloos
Abstract:
The present research attempts to identify the impact of retracted papers on previous or subsequent papers. We consider the 5693 retracted papers from 1975 to 2020 indexed in the Web of Science database based on bibliometric methods. We use HistCite, Excel, and SPSS software as technical means. The findings suggest a significant difference between the average number of retracted and unretracted pap…
▽ More
The present research attempts to identify the impact of retracted papers on previous or subsequent papers. We consider the 5693 retracted papers from 1975 to 2020 indexed in the Web of Science database based on bibliometric methods. We use HistCite, Excel, and SPSS software as technical means. The findings suggest a significant difference between the average number of retracted and unretracted papers when cited in retracted papers. Furthermore, there is a significant difference between the average number of unretracted and retracted papers citing retracted papers. The reasons for the retraction of an article may not be the previous retracted papers, yet unretracted papers may be retracted later because of referring to (many) retracted papers. It is deduced that proprietors of citation databases should carefully focus on these papers by checking references to each new paper citing previously retracted papers.
△ Less
Submitted 22 January, 2022;
originally announced January 2022.
-
Retracted papers by Iranian authors: Causes, journals, time lags, affiliations, collaborations
Authors:
Ali Ghorbi,
Mohsen Fazeli-Varzaneh,
Erfan Ghaderi-Azad,
Marcel Ausloos,
Marcin Kozak
Abstract:
This study aims to analyze 343 retraction notices indexed in the Scopus database, published in 2001-2019, related to scientific articles (co-)written by at least one author affiliated with an Iranian institution. In order to determine reasons for retractions, we merged this database with the database from Retraction Watch. The data were analyzed using Excel 2016 and IBM-SPSS version 24.0, and visu…
▽ More
This study aims to analyze 343 retraction notices indexed in the Scopus database, published in 2001-2019, related to scientific articles (co-)written by at least one author affiliated with an Iranian institution. In order to determine reasons for retractions, we merged this database with the database from Retraction Watch. The data were analyzed using Excel 2016 and IBM-SPSS version 24.0, and visualized using VOSviewer software. Most of the retractions were due to fake peer review (95 retractions) and plagiarism (90). The average time between a publication and its retraction was 591 days. The maximum time-lag (about 3,000 days) occurred for papers retracted due to duplicate publications; the minimum time-lag (fewer than 100 days) was for papers retracted due to ''unspecified cause'' (most of these were conference papers). As many as 48 (14%) of the retracted papers were published in two medical journals: Tumor Biology (25 papers) and Diagnostic Pathology (23 papers). From the institutional point of view, Islamic Azad University was the inglorious leader, contributing to over one-half (53.1%) of retracted papers. Among the 343 retraction notices, 64 papers pertained to international collaborations with researchers from mainly Asian and European countries; Malaysia having the most retractions (22 papers). Since most retractions were due to fake peer review and plagiarism, the peer review system appears to be a weak point of the submission/publication process; if improved, the number of retractions would likely drop because of increased editorial control.
△ Less
Submitted 29 August, 2021;
originally announced August 2021.
-
Words ranking and Hirsch index for identifying the core of the hapaxes in political texts
Authors:
Valerio Ficcadenti,
Roy Cerqueti,
Marcel Ausloos,
Gurjeet Dhesi
Abstract:
This paper deals with a quantitative analysis of the content of official political speeches. We study a set of about one thousand talks pronounced by the US Presidents, ranging from Washington to Trump. In particular, we search for the relevance of the rare words, i.e. those said only once in each speech -- the so-called hapaxes. We implement a rank-size procedure of Zipf-Mandelbrot type for discu…
▽ More
This paper deals with a quantitative analysis of the content of official political speeches. We study a set of about one thousand talks pronounced by the US Presidents, ranging from Washington to Trump. In particular, we search for the relevance of the rare words, i.e. those said only once in each speech -- the so-called hapaxes. We implement a rank-size procedure of Zipf-Mandelbrot type for discussing the hapaxes' frequencies regularity over the overall set of speeches. Starting from the obtained rank-size law, we define and detect the core of the hapaxes set by means of a procedure based on an Hirsch index variant. We discuss the resulting list of words in the light of the overall US Presidents' speeches. We further show that this core of hapaxes itself can be well fitted through a Zipf-Mandelbrot law and that contains elements producing deviations at the low ranks between scatter plots and fitted curve -- the so-called king and vice-roy effect. Some socio-political insights are derived from the obtained findings about the US Presidents messages.
△ Less
Submitted 13 June, 2020;
originally announced June 2020.
-
Seasonal Entropy, Diversity and Inequality Measures of Submitted and Accepted Papers Distributions In Peer-Reviewed Journals
Authors:
Marcel Ausloos,
Olgica Nedic,
Aleksandar Dekanski
Abstract:
This paper presents a novel method for finding features in the analysis of variable distributions stemming from time series. We apply the methodology to the case of submitted and accepted papers in peer-reviewed journals. We provide a comparative study of editorial decisions for papers submitted to two peer-reviewed journals: the Journal of the Serbian Chemical Society (JSCS) and this MDPI Entropy…
▽ More
This paper presents a novel method for finding features in the analysis of variable distributions stemming from time series. We apply the methodology to the case of submitted and accepted papers in peer-reviewed journals. We provide a comparative study of editorial decisions for papers submitted to two peer-reviewed journals: the Journal of the Serbian Chemical Society (JSCS) and this MDPI Entropy journal. We cover three recent years for which the fate of submitted papers, about 600 papers to JSCS and 2500 to Entropy, is completely determined. Instead of comparing the number distributions of these papers as a function of time with respect to a uniform distribution, we analyze the relevant probabilities, from which we derive the information entropy. It is argued that such probabilities are indeed more relevant for authors than the actual number of submissions. We tie this entropy analysis to the so called diversity of the variable distributions. Furthermore, we emphasize the correspondence between the entropy and the diversity with inequality measures, like the Herfindahl-Hirschman index and the Theil index, itself being in the class of entropy measures; the Gini coefficient which also measures the diversity in ranking is calculated for further discussion. In this sample, the seasonal aspects of the peer review process are outlined. It is found that the use of such indices, non linear transformations of the data distributions, allow to distinguish features and evolutions of peer review process as a function of time as well as comparing non-uniformity of distributions. Furthermore, t- and z- statistical tests are applied in order to measure the significance (p-level) of the findings, i.e. whether papers are more likely to be accepted if they are submitted during a few specific months or "season"; the predictability strength depends on the journal.
△ Less
Submitted 13 October, 2019;
originally announced October 2019.
-
Correlations between submission and acceptance of papers in peer review journals
Authors:
Marcel Ausloos,
Olgica Nedic,
Aleksandar Dekanski
Abstract:
This paper provides a comparative study about seasonal influence on editorial decisions for papers submitted to two peer review journals. We distinguish a specialized one, the Journal of the Serbian Chemical Society (JSCS) and an interdisciplinary one, Entropy. Dates of electronic submission for about 600 papers to JSCS and 2500 to Entropy have been recorded over 3 recent years. Time series of eit…
▽ More
This paper provides a comparative study about seasonal influence on editorial decisions for papers submitted to two peer review journals. We distinguish a specialized one, the Journal of the Serbian Chemical Society (JSCS) and an interdisciplinary one, Entropy. Dates of electronic submission for about 600 papers to JSCS and 2500 to Entropy have been recorded over 3 recent years. Time series of either accepted or rejected papers are subsequently analyzed. We take either editors or authors view points into account, thereby considering magnitudes and probabilities. In this sample, it is found that there are distinguishable peaks and dips in the time series, demonstrating preferred months for the submission of papers. It is also found that papers are more likely accepted if they are submitted during a few specific months, - these depending on the journal. The probability of having a rejected paper also appears to be seasonally biased. In view of clarifying reports with contradictory findings, we discuss previously proposed conjectures for such effects, like holiday effects and the desk rejection by editors. We conclude that, in this sample, the type of journal, specialized or multidisciplinary, seems to be the drastic criterion for distinguishing the outcomes rates.
△ Less
Submitted 13 October, 2019;
originally announced October 2019.
-
Efficiency in managing peer-review of scientific manuscripts -- editors' perspective
Authors:
Olgica Nedic,
Ivana Drvenica,
Marcel Ausloos,
Aleksandar Dekanski
Abstract:
The purpose of this paper is to introduce a model for measuring the efficiency in managing peer-review of scientific manuscripts by editors. The approach employed is based on the assumption that the editorial aim is to manage publication with high efficiency, employing the least amount of editorial resources. Efficiency is defined in this research as a measure based on 7 variables. An on-line surv…
▽ More
The purpose of this paper is to introduce a model for measuring the efficiency in managing peer-review of scientific manuscripts by editors. The approach employed is based on the assumption that the editorial aim is to manage publication with high efficiency, employing the least amount of editorial resources. Efficiency is defined in this research as a measure based on 7 variables. An on-line survey was constructed and editors of journals originating from Serbia regularly publishing articles in the field of chemistry were invited to participate. An evaluation of the model is given based on responses from 24 journals and 50 editors. With this investigation we aimed to contribute to our understanding of the peer-review process and, possibly, offer a tool to improve the "efficiency" in journal editing. The proposed protocol may be adapted by other journals in order to assess the managing potential of editors.
△ Less
Submitted 12 October, 2019;
originally announced October 2019.
-
A joint text mining-rank size investigation of the rhetoric structures of the US Presidents' speeches
Authors:
Valerio Ficcadenti,
Roy Cerqueti,
Marcel Ausloos
Abstract:
This work presents a text mining context and its use for a deep analysis of the messages delivered by the politicians. Specifically, we deal with an expert systems-based exploration of the rhetoric dynamics of a large collection of US Presidents' speeches, ranging from Washington to Trump. In particular, speeches are viewed as complex expert systems whose structures can be effectively analyzed thr…
▽ More
This work presents a text mining context and its use for a deep analysis of the messages delivered by the politicians. Specifically, we deal with an expert systems-based exploration of the rhetoric dynamics of a large collection of US Presidents' speeches, ranging from Washington to Trump. In particular, speeches are viewed as complex expert systems whose structures can be effectively analyzed through rank-size laws. The methodological contribution of the paper is twofold. First, we develop a text mining-based procedure for the construction of the dataset by using a web scra** routine on the Miller Center website -- the repository collecting the speeches. Second, we explore the implicit structure of the discourse data by implementing a rank-size procedure over the individual speeches, being the words of each speech ranked in terms of their frequencies. The scientific significance of the proposed combination of text-mining and rank-size approaches can be found in its flexibility and generality, which let it be reproducible to a wide set of expert systems and text mining contexts. The usefulness of the proposed method and the speech subsequent analysis is demonstrated by the findings themselves. Indeed, in terms of impact, it is worth noting that interesting conclusions of social, political and linguistic nature on how 45 United States Presidents, from April 30, 1789 till February 28, 2017 delivered political messages can be carried out. Indeed, the proposed analysis shows some remarkable regularities, not only inside a given speech, but also among different speeches. Moreover, under a purely methodological perspective, the presented contribution suggests possible ways of generating a linguistic decision-making algorithm.
△ Less
Submitted 9 May, 2019;
originally announced May 2019.
-
Optimization of the post-crisis recovery plans in scale-free networks
Authors:
Mohammad Bahrami,
Narges Chinichian,
Ali Hosseiny,
Gholamreza Jafari,
Marcel Ausloos
Abstract:
General Motors or a local business, which one is better to be stimulated in post-crisis recessions, where government stimulation is meant to overcome recessions? Due to the budget constraints, it is quite relevant to ask how one can increase the chance of economic recovery. One of the key elements to answer this question is to understand metastable features of the economic networks. Ising model ha…
▽ More
General Motors or a local business, which one is better to be stimulated in post-crisis recessions, where government stimulation is meant to overcome recessions? Due to the budget constraints, it is quite relevant to ask how one can increase the chance of economic recovery. One of the key elements to answer this question is to understand metastable features of the economic networks. Ising model has been suggested for studying such features in the literature. In the homogenous networks one needs at least a minimum activation, forcing an Ising network to switch its local equilibria, where such minimum is independent of the nodes characteristics. In the scale free networks however, when one aims to push the network to switch its vacuum, she faces the question of which nodes are better to be stimulated to minimize the cost. In the paper it has been shown that stimulation of the high degree nodes costs less in general. Despite regular networks, in the scale free networks, the stimulation cost depends on the networks features such as assortativity. Though we have utilized the Ising model to tackle a problem in economics, our analysis shed lights on many other problems concerning stimulations of socio-economic systems.
△ Less
Submitted 23 October, 2019; v1 submitted 23 April, 2019;
originally announced April 2019.
-
Artificial intelligence in peer review: How can evolutionary computation support journal editors?
Authors:
Maciej J. Mrowinski,
Piotr Fronczak,
Agata Fronczak,
Marcel Ausloos,
Olgica Nedic
Abstract:
With the volume of manuscripts submitted for publication growing every year, the deficiencies of peer review (e.g. long review times) are becoming more apparent. Editorial strategies, sets of guidelines designed to speed up the process and reduce editors workloads, are treated as trade secrets by publishing houses and are not shared publicly. To improve the effectiveness of their strategies, edito…
▽ More
With the volume of manuscripts submitted for publication growing every year, the deficiencies of peer review (e.g. long review times) are becoming more apparent. Editorial strategies, sets of guidelines designed to speed up the process and reduce editors workloads, are treated as trade secrets by publishing houses and are not shared publicly. To improve the effectiveness of their strategies, editors in small publishing groups are faced with undertaking an iterative trial-and-error approach. We show that Cartesian Genetic Programming, a nature-inspired evolutionary algorithm, can dramatically improve editorial strategies. The artificially evolved strategy reduced the duration of the peer review process by 30%, without increasing the pool of reviewers (in comparison to a typical human-developed strategy). Evolutionary computation has typically been used in technological processes or biological ecosystems. Our results demonstrate that genetic programs can improve real-world social systems that are usually much harder to understand and control than physical systems.
△ Less
Submitted 2 December, 2017;
originally announced December 2017.
-
Fractional Dynamics of Network Growth Constrained by aging Node Interactions
Authors:
Hadiseh Safdari,
Milad Zare Kamali,
Amirhossein Shirazi,
Moein Khalighi,
Gholamreza Jafari,
Marcel Ausloos
Abstract:
In many social complex systems, in which agents are linked by non-linear interactions, the history of events strongly influences the whole network dynamics. However, a class of "commonly accepted beliefs" seems rarely studied. In this paper, we examine how the growth process of a (social) network is influenced by past circumstances. In order to tackle this cause, we simply modify the well known pr…
▽ More
In many social complex systems, in which agents are linked by non-linear interactions, the history of events strongly influences the whole network dynamics. However, a class of "commonly accepted beliefs" seems rarely studied. In this paper, we examine how the growth process of a (social) network is influenced by past circumstances. In order to tackle this cause, we simply modify the well known preferential attachment mechanism by imposing a time dependent kernel function in the network evolution equation. This approach leads to a fractional order Barabasi-Albert (BA) differential equation, generalizing the BA model. Our results show that, with passing time, an aging process is observed for the network dynamics. The aging process leads to a decay for the node degree values, thereby creating an opposing process to the preferential attachment mechanism. On one hand, based on the preferential attachment mechanism, nodes with a high degree are more likely to absorb links; but, on the other hand, a node's age has a reduced chance for new connections. This competitive scenario allows an increased chance for younger members to become a hub. Simulations of such a network growth with aging constraint confirm the results found from solving the fractional BA equation. We also report, as an exemplary application, an investigation of the collaboration network between Hollywood movie actors. It is undubiously shown that a decay in the dynamics of their collaboration rate is found, - even including a sex difference. Such findings suggest a widely universal application of the so generalized BA model.
△ Less
Submitted 9 September, 2017;
originally announced September 2017.
-
Glassy states of aging social networks
Authors:
F. Hassanibesheli,
L. Hedayatifar,
H. Safdari,
M. Ausloos,
G. R. Jafari
Abstract:
Individuals often develop reluctance to change their social relations, called "secondary homebody", even though their interactions with their environment evolve with time. Some memory effect is loosely present deforcing changes. In other words, in presence of memory, relations do not change easily. In order to investigate some history or memory effect on social networks, we introduce a temporal ke…
▽ More
Individuals often develop reluctance to change their social relations, called "secondary homebody", even though their interactions with their environment evolve with time. Some memory effect is loosely present deforcing changes. In other words, in presence of memory, relations do not change easily. In order to investigate some history or memory effect on social networks, we introduce a temporal kernel function into the Heider conventional balance theory, allowing for the "quality" of past relations to contribute to the evolution of the system. This memory effect is shown to lead to the emergence of aged networks, thereby perfectly describing and the more so measuring the aging process of links ("social relations"). It is shown that such a memory does not change the dynamical attractors of the system, but does prolong the time necessary to reach the "balanced states". The general trend goes toward obtaining either global ("paradise" or "bipolar") or local ("jammed") balanced states, but is profoundly affected by aged relations. The resistance of elder links against changes decelerates the evolution of the system and traps it into so named glassy states. In contrast to balance
△ Less
Submitted 9 September, 2017;
originally announced September 2017.
-
Quantitative and Qualitative Analysis of Editor Behavior through Potentially Coercive Citations
Authors:
Claudiu Herteliu,
Marcel Ausloos,
Bogdan Vasile Ileanu,
Giulia Rotundo,
Tudorel Andrei
Abstract:
How much is the h-index of an editor of a well ranked journal improved due to citations which occur after his or her appointment? Scientific recognition within academia is widely measured nowadays by the number of citations or h-index. Our dataset is based on a sample of four editors from a well ranked journal (impact factor - IF - greater than 2). The target group consists of two editors who seem…
▽ More
How much is the h-index of an editor of a well ranked journal improved due to citations which occur after his or her appointment? Scientific recognition within academia is widely measured nowadays by the number of citations or h-index. Our dataset is based on a sample of four editors from a well ranked journal (impact factor - IF - greater than 2). The target group consists of two editors who seem to benefit by their position through an increased citation number (and subsequently h-index) within journal. The total amount of citations for the target group is bigger than 600. The control group is formed by another set of two editors from the same journal whose relations between their positions and their citation records remain neutral. The total amount of citations for the control group is more than 1200. The timespan for which pattern of citations has been studied is 1975-2015. Previous coercive citations for a journal benefit (increase its IF) has been signaled. To the best of our knowledge, this is a pioneering work on coercive citations for personal (or editors) benefit. Editorial teams should be aware about this type of potentially unethical behavior and act accordingly.
△ Less
Submitted 7 June, 2017; v1 submitted 2 May, 2017;
originally announced May 2017.
-
Benford's law: a 'slee** beauty' slee** in the dirty pages of logarithmic tables
Authors:
Tariq Ahmad Mir,
Marcel Ausloos
Abstract:
Benford's law is an empirical observation, first reported by Simon Newcomb in 1881 and then independently by Frank Benford in 1938: the first significant digits of numbers in large data are often distributed according to a logarithmically decreasing function. Being contrary to intuition, the law was forgotten as a mere curious observation. However, in the last two decades, relevant literature has…
▽ More
Benford's law is an empirical observation, first reported by Simon Newcomb in 1881 and then independently by Frank Benford in 1938: the first significant digits of numbers in large data are often distributed according to a logarithmically decreasing function. Being contrary to intuition, the law was forgotten as a mere curious observation. However, in the last two decades, relevant literature has grown exponentially, - an evolution typical of "Slee** Beauties" (SBs) publications that go unnoticed (sleep) for a long time and then suddenly become center of attention (are awakened). Thus, in the present study, we show that Newcomb (1881) and Benford (1938) papers are clearly SBs. The former was in deep sleep for 110 years whereas the latter was in deep sleep for a comparatively lesser period of 31 years up to 1968, and in a state of less deep sleep for another 27 years up to 1995. Both SBs were awakened in the year 1995 by Hill (1995a). In so doing, we show that the waking prince (Hill, 1995a) is more often quoted than the SB whom he kissed, - in this Benford's law case, wondering whether this is a general effect, - to be usefully studied.
△ Less
Submitted 2 February, 2017;
originally announced February 2017.
-
Day of the week effect in paper submission/acceptance/rejection to/in/by peer review journals. II. An ARCH econometric-like modeling
Authors:
Marcel Ausloos,
Olgica Nedic,
Aleksandar Dekanski,
Maciej J. Mrowinski,
Piotr Fronczak,
Agata Fronczak
Abstract:
This paper aims at providing a statistical model for the preferred behavior of authors submitting a paper to a scientific journal. The electronic submission of (about 600) papers to the Journal of the Serbian Chemical Society has been recorded for every day from Jan. 01, 2013 till Dec. 31, 2014, together with the acceptance or rejection paper fate. Seasonal effects and editor roles (through desk r…
▽ More
This paper aims at providing a statistical model for the preferred behavior of authors submitting a paper to a scientific journal. The electronic submission of (about 600) papers to the Journal of the Serbian Chemical Society has been recorded for every day from Jan. 01, 2013 till Dec. 31, 2014, together with the acceptance or rejection paper fate. Seasonal effects and editor roles (through desk rejection and subfield editors) are examined. An ARCH-like econometric model is derived stressing the main determinants of the favorite day-of-week process.
△ Less
Submitted 14 November, 2016;
originally announced November 2016.
-
Day of the week effect in paper submission/acceptance/rejection to/in/by peer review journals
Authors:
Marcel Ausloos,
Olgica Nedic,
Aleksandar Dekanski
Abstract:
This paper aims at providing an introduction to the behavior of authors submitting a paper to a scientific journal. Dates of electronic submission of papers to the Journal of the Serbian Chemical Society have been recorded from the 1st January 2013 till the 31st December 2014, thus over 2 years.
There is no Monday or Friday effect like in financial markets, but rather a Tuesday-Wednesday effect…
▽ More
This paper aims at providing an introduction to the behavior of authors submitting a paper to a scientific journal. Dates of electronic submission of papers to the Journal of the Serbian Chemical Society have been recorded from the 1st January 2013 till the 31st December 2014, thus over 2 years.
There is no Monday or Friday effect like in financial markets, but rather a Tuesday-Wednesday effect occurs: papers are more often submitted on Wednesday; however, the relative number of going to be accepted papers is larger if these are submitted on Tuesday. On the other hand, weekend days (Saturday and Sunday) are not the best days to finalize and submit manuscripts. An interpretation based on the type of submitted work ("experimental chemistry") and on the influence of (senior) coauthors is presented. A thermodynamic connection is proposed within an entropy context. A (new) entropic distance is defined in order to measure the "opaqueness" = disorder) of the submission process.
△ Less
Submitted 6 April, 2016;
originally announced April 2016.
-
Inferring cultural regions from correlation networks of given baby names
Authors:
Mateusz Pomorski,
Malgorzata J. Krawczyk,
Krzysztof Kulakowski,
Jaroslaw Kwapien,
Marcel Ausloos
Abstract:
We report investigations on the statistical characteristics of the baby names given between 1910 and 2010 in the United States of America. For each year, the 100 most frequent names in the USA are sorted out. For these names, the correlations between the names profiles are calculated for all pairs of states (minus Hawaii and Alaska). The correlations are used to form a weighted network which is fo…
▽ More
We report investigations on the statistical characteristics of the baby names given between 1910 and 2010 in the United States of America. For each year, the 100 most frequent names in the USA are sorted out. For these names, the correlations between the names profiles are calculated for all pairs of states (minus Hawaii and Alaska). The correlations are used to form a weighted network which is found to vary mildly in time. In fact, the structure of communities in the network remains quite stable till about 1980. The goal is that the calculated structure approximately reproduces the usually accepted geopolitical regions: the North East, the South, and the "Midwest + West" as the third one. Furthermore, the dataset reveals that the name distribution satisfies the Zipf law, separately for each state and each year, i.e. the name frequency $f\propto r^{-α}$, where r is the name rank. Between 1920 and 1980, the exponent alpha is the largest one for the set of states classified as 'the South', but the smallest one for the set of states classified as "Midwest + West". Our interpretation is that the pool of selected names was quite narrow in the Southern states. The data is compared with some related statistics of names in Belgium, a country also with different regions, but having quite a different scale than the USA. There, the Zipf exponent is low for young people and for the Brussels citizens.
△ Less
Submitted 8 December, 2015; v1 submitted 7 December, 2015;
originally announced December 2015.
-
Quantifying the quality of peer reviewers through Zipf's law
Authors:
Marcel Ausloos,
Olgica Nedic,
Agata Fronczak,
Piotr Fronczak
Abstract:
This paper introduces a statistical and other analysis of peer reviewers in order to approach their "quality" through some quantification measure, thereby leading to some quality metrics. Peer reviewer reports for the Journal of the Serbian Chemical Society are examined. The text of each report has first to be adapted to word counting software in order to avoid jargon inducing confusion when searc…
▽ More
This paper introduces a statistical and other analysis of peer reviewers in order to approach their "quality" through some quantification measure, thereby leading to some quality metrics. Peer reviewer reports for the Journal of the Serbian Chemical Society are examined. The text of each report has first to be adapted to word counting software in order to avoid jargon inducing confusion when searching for the word frequency: e.g. C must be distinguished, depending if it means Carbon or Celsius, etc. Thus, every report has to be carefully "rewritten". Thereafter, the quantity, variety and distribution of words are examined in each report and compared to the whole set. Two separate months, according when reports came in, are distinguished to observe any possible hidden spurious effects. Coherence is found. An empirical distribution is searched for through a Zipf-Pareto rank-size law. It is observed that peer review reports are very far from usual texts in this respect. Deviations from the usual (first) Zipf's law are discussed. A theoretical suggestion for the "best (or worst) report" and by extension "good (or bad) reviewer", within this context, is provided from an entropy argument, through the concept of "distance to average" behavior. Another entropy-based measure also allows to measure the journal reviews (whence reviewers) for further comparison with other journals through their own reviewer reports.
△ Less
Submitted 23 August, 2015;
originally announced August 2015.
-
Review times in peer review: quantitative analysis of editorial workflows
Authors:
Maciej J. Mrowinski,
Agata Fronczak,
Piotr Fronczak,
Olgica Nedic,
Marcel Ausloos
Abstract:
We examine selected aspects of peer review and suggest possible improvements. To this end, we analyse a dataset containing information about 300 papers submitted to the Biochemistry and Biotechnology section of the Journal of the Serbian Chemical Society. After separating the peer review process into stages that each review has to go through, we use a weighted directed graph to describe it in a pr…
▽ More
We examine selected aspects of peer review and suggest possible improvements. To this end, we analyse a dataset containing information about 300 papers submitted to the Biochemistry and Biotechnology section of the Journal of the Serbian Chemical Society. After separating the peer review process into stages that each review has to go through, we use a weighted directed graph to describe it in a probabilistic manner and test the impact of some modifications of the editorial policy on the efficiency of the whole process.
△ Less
Submitted 5 August, 2015;
originally announced August 2015.
-
Test of two hypotheses explaining the size of populations in a system of cities
Authors:
Nikolay K. Vitanov,
Marcel Ausloos
Abstract:
Two classical hypotheses are examined about the population growth in a system of cities: Hypothesis 1 pertains to Gibrat's and Zipf's theory which states that the city growth-decay process is size independent; Hypothesis 2 pertains to the so called Yule process which states that the growth of populations in cities happens when (i) the distribution of the city population initial size obeys a log-no…
▽ More
Two classical hypotheses are examined about the population growth in a system of cities: Hypothesis 1 pertains to Gibrat's and Zipf's theory which states that the city growth-decay process is size independent; Hypothesis 2 pertains to the so called Yule process which states that the growth of populations in cities happens when (i) the distribution of the city population initial size obeys a log-normal function, (ii) the growth of the settlements follows a stochastic process. The basis for the test is some official data on Bulgarian cities at various times. This system was chosen because (i) Bulgaria is a country for which one does not expect biased theoretical conditions; (ii) the city populations were determined rather precisely. The present results show that: (i) the population size growth of the Bulgarian cities is size dependent, whence Hypothesis 1 is not confirmed for Bulgaria; (ii) the population size growth of Bulgarian cities can be described by a double Pareto log-normal distribution, whence Hypothesis 2 is valid for the Bulgarian city system. It is expected that this fine study brings some information and light on other, usually considered to be more pertinent, city systems in various countries.
△ Less
Submitted 29 June, 2015;
originally announced June 2015.
-
Slow-down or speed-up of inter- and intra-cluster diffusion of controversial knowledge in stubborn communities based on a small world network
Authors:
Marcel Ausloos
Abstract:
Diffusion of knowledge is expected to be huge when agents are open minded. The report concerns a more difficult diffusion case when communities are made of stubborn agents. Communities having markedly different opinions are for example the Neocreationist and Intelligent Design Proponents (IDP), on one hand, and the Darwinian Evolution Defenders (DED), on the other hand. The case of knowledge diffu…
▽ More
Diffusion of knowledge is expected to be huge when agents are open minded. The report concerns a more difficult diffusion case when communities are made of stubborn agents. Communities having markedly different opinions are for example the Neocreationist and Intelligent Design Proponents (IDP), on one hand, and the Darwinian Evolution Defenders (DED), on the other hand. The case of knowledge diffusion within such communities is studied here on a network based on an adjacency matrix built from time ordered selected quotations of agents, whence for inter- and intra-communities. The network is intrinsically directed and not necessarily reciprocal. Thus, the adjacency matrices have complex eigenvalues, the eigenvectors present complex components. A quantification of the slow-down or speed-up effects of information diffusion in such temporal networks, with non-Markovian contact sequences, can be made by comparing the real time dependent (directed) network to its counterpart, the time aggregated (undirected) network, - which has real eigenvalues. In order to do so, small world networks which both contain an $odd$ number of nodes are studied and compared to similar networks with an $even$ number of nodes.
It is found that (i) the diffusion of knowledge is more difficult on the largest networks, (ii) the network size influences the slowing-down or speeding-up diffusion process. Interestingly, it is observed that (iii) the diffusion of knowledge is slower in IDP and faster in DED communities. It is suggested that the finding can be "rationalized", if some "scientific quality" and "publication habit" is attributed to the agents, as common sense would guess. This finding offers some opening discussion toward tying scientific knowledge to belief.
△ Less
Submitted 28 June, 2015;
originally announced June 2015.
-
Coherent measures of the impact of co-authors in peer review journals and in proceedings publications
Authors:
Marcel Ausloos
Abstract:
This paper focuses on the coauthor effect in different types of publications, usually not equally respected in measuring research impact. {\it A priori} unexpected relationships are found between the total coauthor core value, $m_a$, of a leading investigator (LI), and the related values for their publications in either peer review journals ($j$) or in proceedings ($p$). A surprisingly linear rela…
▽ More
This paper focuses on the coauthor effect in different types of publications, usually not equally respected in measuring research impact. {\it A priori} unexpected relationships are found between the total coauthor core value, $m_a$, of a leading investigator (LI), and the related values for their publications in either peer review journals ($j$) or in proceedings ($p$). A surprisingly linear relationship is found: $ m_a^{(j)} + 0.4\;m_a^{(p)} = m_a^{(jp)} $. Furthermore, another relationship is found concerning the measure of the total number of citations, $A_a$, i.e. the surface of the citation size-rank histogram up to $m_a$. Another linear relationship exists : $A_a^{(j)} + 1.36\; A_a^{(p)} = A_a^{(jp)} $. These empirical findings coefficients (0.4 and 1.36) are supported by considerations based on an empirical power law found between the number of joint publications of an author and the rank of a coauthor. Moreover, a simple power law relationship is found between $m_a$ and the number ($r_M$) of coauthors of a LI: $m_a\simeq r_M^μ$; the power law exponent $μ$ depends on the type ($j$ or $p$) of publications. These simple relations, at this time limited to publications in physics, imply that coauthors are a "more positive measure" of a principal investigator role, in both types of scientific outputs, than the Hirsch index could indicate. Therefore, to scorn upon co-authors in publications, in particular in proceedings, is incorrect. On the contrary, the findings suggest an immediate test of coherence of scientific authorship in scientific policy processes.
△ Less
Submitted 17 June, 2015;
originally announced June 2015.
-
Assessing the true role of coauthors in the h-index measure of an author scientific impact
Authors:
Marcel Ausloos
Abstract:
A method based on the classical principal component analysis leads to demonstrate that the role of co-authors should give a h-index measure to a group leader higher than usually accepted. The method rather easily gives what is usually searched for, i.e. an estimate of the role (or "weight") of co-authors, as the additional value to an author papers' popularity. The construction of the co-authorshi…
▽ More
A method based on the classical principal component analysis leads to demonstrate that the role of co-authors should give a h-index measure to a group leader higher than usually accepted. The method rather easily gives what is usually searched for, i.e. an estimate of the role (or "weight") of co-authors, as the additional value to an author papers' popularity. The construction of the co-authorship popularity H-matrix is exemplified and the role of eigenvalues and the main eigenvector component are discussed. An example illustrates the points and serves as the basis for suggesting a generally practical application of the concept.
△ Less
Submitted 10 January, 2015;
originally announced January 2015.
-
Spatial interactions in agent-based modeling
Authors:
Marcel Ausloos,
Herbert Dawid,
Ugo Merlone
Abstract:
Agent Based Modeling (ABM) has become a widespread approach to model complex interactions. In this chapter after briefly summarizing some features of ABM the different approaches in modeling spatial interactions are discussed.
It is stressed that agents can interact either indirectly through a shared environment and/or directly with each other. In such an approach, higher-order variables such as…
▽ More
Agent Based Modeling (ABM) has become a widespread approach to model complex interactions. In this chapter after briefly summarizing some features of ABM the different approaches in modeling spatial interactions are discussed.
It is stressed that agents can interact either indirectly through a shared environment and/or directly with each other. In such an approach, higher-order variables such as commodity prices, population dynamics or even institutions, are not exogenously specified but instead are seen as the results of interactions. It is highlighted in the chapter that the understanding of patterns emerging from such spatial interaction between agents is a key problem as much as their description through analytical or simulation means.
The chapter reviews different approaches for modeling agents' behavior, taking into account either explicit spatial (lattice based) structures or networks. Some emphasis is placed on recent ABM as applied to the description of the dynamics of the geographical distribution of economic activities, - out of equilibrium. The Eurace@Unibi Model, an agent-based macroeconomic model with spatial structure, is used to illustrate the potential of such an approach for spatial policy analysis.
△ Less
Submitted 4 May, 2014;
originally announced May 2014.
-
Ranking structures and Rank-Rank Correlations of Countries. The FIFA and UEFA cases
Authors:
Marcel Ausloos,
Rudi Cloots,
Adam Gadomski,
Nikolay K. Vitanov
Abstract:
Ranking of agents competing with each other in complex systems may lead to paradoxes according to the pre-chosen different measures. A discussion is presented on such rank-rank, similar or not, correlations based on the case of European countries ranked by UEFA and FIFA from different soccer competitions. The first question to be answered is whether an empirical and simple law is obtained for such…
▽ More
Ranking of agents competing with each other in complex systems may lead to paradoxes according to the pre-chosen different measures. A discussion is presented on such rank-rank, similar or not, correlations based on the case of European countries ranked by UEFA and FIFA from different soccer competitions. The first question to be answered is whether an empirical and simple law is obtained for such (self-) organizations of complex sociological systems with such different measuring schemes. It is found that the power law form is not the best description contrary to many modern expectations. The stretched exponential is much more adequate. Moreover, it is found that the measuring rules lead to some inner structures, in both cases.
△ Less
Submitted 22 March, 2014;
originally announced March 2014.
-
Binary Scientific Star Coauthors Core Size
Authors:
Marcel Ausloos
Abstract:
It is examined whether the relationship $ J \propto A/r^α$, and the subsequent coauthor core notion (Ausloos 2013), between the number ($J$) of joint publications (JP) by a "main scientist" (LI) with her/his coauthors (CAs) can be extended to a team-like system. This is done by considering that each coauthor can be so strongly tied to the LI that they are forming {\it binary scientific star} (BSS)…
▽ More
It is examined whether the relationship $ J \propto A/r^α$, and the subsequent coauthor core notion (Ausloos 2013), between the number ($J$) of joint publications (JP) by a "main scientist" (LI) with her/his coauthors (CAs) can be extended to a team-like system. This is done by considering that each coauthor can be so strongly tied to the LI that they are forming {\it binary scientific star} (BSS) systems with respect to their other collaborators. Moreover, publications in peer review journals and in "proceedings", both often thought to be of "different quality", are separetely distinguished. The role of a time interval for measuring $J$ and $α$ is also examined. New indirect measures are also introduced.
For making the point, two LI cases with numerous CAs are studied. It is found that only a few BSS need to be usefully examined. The exponent $α$ turns out to be "second scientist" weakly dependent, but still "size" and "publication type" dependent, according to the number of CAs or JP. The CA core value is found to be (CA or JP) size and publication type dependent, but remains in an understandable range. Somewhat unexpectedly, no special qualitative difference on the binary scientific star CA core value is found between publications in peer review journals and in proceedings.
In conclusion, some remark is made on partner cooperation in BSS teams. It is suggested that such measures can serve as criteria for distinguishing the role of scientists in a team.
△ Less
Submitted 15 January, 2014;
originally announced January 2014.
-
A scientometrics law about co-authors and their ranking. The co-author core
Authors:
Marcel Ausloos
Abstract:
Rather than "measuring" a scientist impact through the number of citations which his/her published work can have generated, isn't it more appropriate to consider his/her value through his/her scientific network performance illustrated by his/her co-author role, thus focussing on his/her joint publications, - and their impact through citations? Whence, on one hand, this paper very briefly examines…
▽ More
Rather than "measuring" a scientist impact through the number of citations which his/her published work can have generated, isn't it more appropriate to consider his/her value through his/her scientific network performance illustrated by his/her co-author role, thus focussing on his/her joint publications, - and their impact through citations? Whence, on one hand, this paper very briefly examines bibliometric laws, like the $h$-index and subsequent debate about co-authorship effects, but on the other hand, proposes a measure of collaborative work through a new index. Based on data about the publication output of a specific research group, a new bibliometric law is found.
Let a co-author $C$ have written $J$ (joint) publications with one or several colleagues. Rank all the co-authors of that individual according to their number of joint publications, giving a rank $r$ to each co-author, starting with $r=1$ for the most prolific.
It is empirically found that a very simple relationship holds between the number of joint publications $J$ by coauthors and their rank of importance, i.e. $J \propto 1/r$. Thereafter, in the same spirit as for the Hirsch core, one can define a "co-author core", and introduce indices operating on an author. It is emphasized that the new index has a quite different (philosophical) perspective that the $h$-index. In the present case, one focusses on "relevant" persons rather than on "relevant" publications.
Although the numerical discussion is based on one case, there is little doubt that the law can be verified in many other situations. Therefore, variants and generalizations could be later produced in order to quantify co-author roles, in a temporary or long lasting stable team(s), and lead to criteria about funding, career measurements or even induce career strategies.
△ Less
Submitted 14 January, 2013; v1 submitted 6 July, 2012;
originally announced July 2012.
-
Information Society: Modeling A Complex System With Scarce Data
Authors:
Noemi L. Olivera,
Araceli N. Proto,
Marcel Ausloos
Abstract:
Considering electronic implications in the Information Society (IS) as a complex system, complexity science tools are used to describe the processes that are seen to be taking place. The sometimes troublesome relationship between the information and communication new technologies and e-society gives rise to different problems, some of them being unexpected. Probably, the Digital Divide (DD) and th…
▽ More
Considering electronic implications in the Information Society (IS) as a complex system, complexity science tools are used to describe the processes that are seen to be taking place. The sometimes troublesome relationship between the information and communication new technologies and e-society gives rise to different problems, some of them being unexpected. Probably, the Digital Divide (DD) and the Internet Governance (IG) are among the most conflictive ones of internationally based e-Affairs. Admitting that solutions should be found for these problems, certain international policies are required. In this context, data gathering and subsequent analysis, as well as the construction of adequate physical models are extremely important in order to imagine different future scenarios and suggest some subsequent control. In the main text, mathematical modelization helps for visualizing how policies could e.g. influence the individual and collective behavior in an empirical social agent system. In order to show how this purpose could be achieved, two approaches, (i) the Ising model and (ii) a generalized Lotka-Volterra model are used for DD and IG considerations respectively. It can be concluded that the social modelization of the e-Information Society as a complex system provides insights about how DD can be reduced and how the a large number of weak members of the IS could influence the outcomes of the IG.
△ Less
Submitted 7 January, 2012;
originally announced January 2012.
-
Knowledge epidemics and population dynamics models for describing idea diffusion
Authors:
Nikolay K. Vitanov,
Marcel R. Ausloos
Abstract:
The diffusion of ideas is often closely connected to the creation and diffusion of knowledge and to the technological evolution of society. Because of this, knowledge creation, exchange and its subsequent transformation into innovations for improved welfare and economic growth is briefly described from a historical point of view. Next, three approaches are discussed for modeling the diffusion of i…
▽ More
The diffusion of ideas is often closely connected to the creation and diffusion of knowledge and to the technological evolution of society. Because of this, knowledge creation, exchange and its subsequent transformation into innovations for improved welfare and economic growth is briefly described from a historical point of view. Next, three approaches are discussed for modeling the diffusion of ideas in the areas of science and technology, through (i) deterministic, (ii) stochastic, and (iii) statistical approaches. These are illustrated through their corresponding population dynamics and epidemic models relative to the spreading of ideas, knowledge and innovations. The deterministic dynamical models are considered to be appropriate for analyzing the evolution of large and small societal, scientific and technological systems when the influence of fluctuations is insignificant. Stochastic models are appropriate when the system of interest is small but when the fluctuations become significant for its evolution. Finally statistical approaches and models based on the laws and distributions of Lotka, Bradford, Yule, Zipf-Mandelbrot, and others, provide much useful information for the analysis of the evolution of systems in which development is closely connected to the process of idea diffusion.
△ Less
Submitted 3 January, 2012;
originally announced January 2012.
-
On religion and language evolutions seen through mathematical and agent based models
Authors:
M. Ausloos
Abstract:
(shortened version) Religions and languages are social variables, like age, sex, wealth or political opinions, to be studied like any other organizational parameter. In fact, religiosity is one of the most important sociological aspects of populations. Languages are also a characteristics of the human kind. New religions, new languages appear though others disappear. All religions and languages ev…
▽ More
(shortened version) Religions and languages are social variables, like age, sex, wealth or political opinions, to be studied like any other organizational parameter. In fact, religiosity is one of the most important sociological aspects of populations. Languages are also a characteristics of the human kind. New religions, new languages appear though others disappear. All religions and languages evolve when they adapt to the society developments. On the other hand, the number of adherents of a given religion, the number of persons speaking a language is not fixed. Several questions can be raised. E.g. from a macroscopic point of view : How many religions/languages exist at a given time? What is their distribution? What is their life time? How do they evolve?. From a microscopic view point: can one invent agent based models to describe macroscopic aspects? Does it exist simple evolution equations? It is intuitively accepted, but also found through from statistical analysis of the frequency distribution that an attachment process is the primary cause of the distribution evolution : usually the initial religion/language is that of the mother. Later on, changes can occur either due to heterogeneous agent interaction processes or due to external field constraints, - or both. Such cases can be illustrated with historical facts and data. It is stressed that characteristic time scales are different, and recalled that external fields are very relevant in the case of religions, rending the study more interesting within a mechanistic approach
△ Less
Submitted 28 March, 2011;
originally announced March 2011.
-
Verhulst-Lotka-Volterra (VLV) model of ideological struggles
Authors:
Marcel R. Ausloos,
Nikolay K. Vitanov,
Zlatinka I. Dimitrova
Abstract:
Let the population of e.g. a country where some opinion struggle occurs be varying in time, according to Verhulst equation. Consider next some competition between opinions such as the dynamics be described by Lotka and Volterra equations. Two kinds of influences can be used, in such a model, for describing the dynamics of an agent opinion conversion: this can occur (i) either by means of mass comm…
▽ More
Let the population of e.g. a country where some opinion struggle occurs be varying in time, according to Verhulst equation. Consider next some competition between opinions such as the dynamics be described by Lotka and Volterra equations. Two kinds of influences can be used, in such a model, for describing the dynamics of an agent opinion conversion: this can occur (i) either by means of mass communication tools, under some external field influence, or (ii) by means of direct interactions between agents. It results, among other features, that change(s) in environmental conditions can prevent the extinction of populations of followers of some ideology due to different kinds of resurrection effects. The tension arising in the country population is proposed to be measured by an appropriately defined scale index.
△ Less
Submitted 28 March, 2011;
originally announced March 2011.
-
Punctuation effects in English and Esperanto texts
Authors:
M. Ausloos
Abstract:
A statistical physics study of punctuation effects on sentence lengths is presented for written texts: {\it Alice in wonderland} and {\it Through a looking glass}. The translation of the first text into esperanto is also considered as a test for the role of punctuation in defining a style, and for contrasting natural and artificial, but written, languages. Several log-log plots of the sentence len…
▽ More
A statistical physics study of punctuation effects on sentence lengths is presented for written texts: {\it Alice in wonderland} and {\it Through a looking glass}. The translation of the first text into esperanto is also considered as a test for the role of punctuation in defining a style, and for contrasting natural and artificial, but written, languages. Several log-log plots of the sentence length-rank relationship are presented for the major punctuation marks. Different power laws are observed with characteristic exponents. The exponent can take a value much less than unity ($ca.$ 0.50 or 0.30) depending on how a sentence is defined. The texts are also mapped into time series based on the word frequencies. The quantitative differences between the original and translated texts are very minutes, at the exponent level. It is argued that sentences seem to be more reliable than word distributions in discussing an author style.
△ Less
Submitted 27 April, 2010;
originally announced April 2010.
-
Statistical-mechanics approach to a reinforcement learning model with memory
Authors:
Adam Lipowski,
Krzysztof Gontarek,
Marcel Ausloos
Abstract:
We introduce a two-player model of reinforcement learning with memory. Past actions of an iterated game are stored in a memory and used to determine player's next action. To examine the behaviour of the model some approximate methods are used and confronted against numerical simulations and exact master equation. When the length of memory of players increases to infinity the model undergoes an a…
▽ More
We introduce a two-player model of reinforcement learning with memory. Past actions of an iterated game are stored in a memory and used to determine player's next action. To examine the behaviour of the model some approximate methods are used and confronted against numerical simulations and exact master equation. When the length of memory of players increases to infinity the model undergoes an absorbing-state phase transition. Performance of examined strategies is checked in the prisoner' dilemma game. It turns out that it is advantageous to have a large memory in symmetric games, but it is better to have a short memory in asymmetric ones.
△ Less
Submitted 30 August, 2008; v1 submitted 4 April, 2008;
originally announced April 2008.
-
Equilibrium (Zipf) and Dynamic (Grasseberg-Procaccia) method based analyses of human texts. A comparison of natural (english) and artificial (esperanto) languages
Authors:
M. Ausloos
Abstract:
A comparison of two english texts from Lewis Carroll, one (Alice in wonderland), also translated into esperanto, the other (Through a looking glass) are discussed in order to observe whether natural and artificial languages significantly differ from each other. One dimensional time series like signals are constructed using only word frequencies (FTS) or word lengths (LTS). The data is studied th…
▽ More
A comparison of two english texts from Lewis Carroll, one (Alice in wonderland), also translated into esperanto, the other (Through a looking glass) are discussed in order to observe whether natural and artificial languages significantly differ from each other. One dimensional time series like signals are constructed using only word frequencies (FTS) or word lengths (LTS). The data is studied through (i) a Zipf method for sorting out correlations in the FTS and (ii) a Grassberger-Procaccia (GP) technique based method for finding correlations in LTS. Features are compared : different power laws are observed with characteristic exponents for the ranking properties, and the {\it phase space attractor dimensionality}. The Zipf exponent can take values much less than unity ($ca.$ 0.50 or 0.30) depending on how a sentence is defined. This non-universality is conjectured to be a measure of the author $style$. Moreover the attractor dimension $r$ is a simple function of the so called phase space dimension $n$, i.e., $r = n^λ$, with $λ= 0.79$. Such an exponent should also conjecture to be a measure of the author $creativity$. However, even though there are quantitative differences between the original english text and its esperanto translation, the qualitative differences are very minutes, indicating in this case a translation relatively well respecting, along our analysis lines, the content of the author writing.
△ Less
Submitted 28 February, 2008;
originally announced February 2008.
-
A Comparison of natural (english) and artificial (esperanto) languages. A Multifractal method based analysis
Authors:
J. Gillet,
M. Ausloos
Abstract:
We present a comparison of two english texts, written by Lewis Carroll, one (Alice in wonderland) and the other (Through a looking glass), the former translated into esperanto, in order to observe whether natural and artificial languages significantly differ from each other. We construct one dimensional time series like signals using either word lengths or word frequencies. We use the multifract…
▽ More
We present a comparison of two english texts, written by Lewis Carroll, one (Alice in wonderland) and the other (Through a looking glass), the former translated into esperanto, in order to observe whether natural and artificial languages significantly differ from each other. We construct one dimensional time series like signals using either word lengths or word frequencies. We use the multifractal ideas for sorting out correlations in the writings. In order to check the robustness of the methods we also write the corresponding shuffled texts. We compare characteristic functions and e.g. observe marked differences in the (far from parabolic) f(alpha) curves, differences which we attribute to Tsallis non extensive statistical features in the ''frequency time series'' and ''length time series''. The esperanto text has more extreme vallues. A very rough approximation consists in modeling the texts as a random Cantor set if resulting from a binomial cascade of long and short words (or words and blanks). This leads to parameters characterizing the text style, and most likely in fine the author writings.
△ Less
Submitted 16 January, 2008;
originally announced January 2008.
-
Word statistics in Blogs and RSS feeds: Towards empirical universal evidence
Authors:
R. Lambiotte,
M. Ausloos,
M. Thelwall
Abstract:
We focus on the statistics of word occurrences and of the waiting times between such occurrences in Blogs. Due to the heterogeneity of words' frequencies, the empirical analysis is performed by studying classes of "frequently-equivalent" words, i.e. by grou** words depending on their frequencies. Two limiting cases are considered: the dilute limit, i.e. for those words that are used less than…
▽ More
We focus on the statistics of word occurrences and of the waiting times between such occurrences in Blogs. Due to the heterogeneity of words' frequencies, the empirical analysis is performed by studying classes of "frequently-equivalent" words, i.e. by grou** words depending on their frequencies. Two limiting cases are considered: the dilute limit, i.e. for those words that are used less than once a day, and the dense limit for frequent words. In both cases, extreme events occur more frequently than expected from the Poisson hypothesis. These deviations from Poisson statistics reveal non-trivial time correlations between events that are associated with bursts of activities. The distribution of waiting times is shown to behave like a stretched exponential and to have the same shape for different sets of words sharing a common frequency, thereby revealing universal features.
△ Less
Submitted 15 July, 2007;
originally announced July 2007.
-
Collaborative tagging as a tripartite network
Authors:
R. Lambiotte,
M. Ausloos
Abstract:
We describe online collaborative communities by tripartite networks, the nodes being persons, items and tags. We introduce projection methods in order to uncover the structures of the networks, i.e. communities of users, genre families...
To do so, we focus on the correlations between the nodes, depending on their profiles, and use percolation techniques that consist in removing less correlated…
▽ More
We describe online collaborative communities by tripartite networks, the nodes being persons, items and tags. We introduce projection methods in order to uncover the structures of the networks, i.e. communities of users, genre families...
To do so, we focus on the correlations between the nodes, depending on their profiles, and use percolation techniques that consist in removing less correlated links and observing the sha** of disconnected islands. The structuring of the network is visualised by using a tree representation. The notion of diversity in the system is also discussed.
△ Less
Submitted 29 December, 2005; v1 submitted 23 December, 2005;
originally announced December 2005.
-
On the genre-fication of Music: a percolation approach (long version)
Authors:
R. Lambiotte,
M. Ausloos
Abstract:
In this paper, we analyze web-downloaded data on people sharing their music library. By attributing to each music group usual music genres (Rock, Pop...), and analysing correlations between music groups of different genres with percolation-idea based methods, we probe the reality of these subdivisions and construct a music genre cartography, with a tree representation. We also show the diversity…
▽ More
In this paper, we analyze web-downloaded data on people sharing their music library. By attributing to each music group usual music genres (Rock, Pop...), and analysing correlations between music groups of different genres with percolation-idea based methods, we probe the reality of these subdivisions and construct a music genre cartography, with a tree representation. We also show the diversity of music genres with Shannon entropy arguments, and discuss an alternative objective way to classify music, that is based on the complex structure of the groups audience. Finally, a link is drawn with the theory of hidden variables in complex networks.
△ Less
Submitted 15 October, 2005; v1 submitted 15 September, 2005;
originally announced September 2005.
-
Simple Model for the Dynamics of Correlations in the Evolution of Economic Entities Under Varying Economic Conditions
Authors:
Marcel Ausloos,
Paulette Clippe,
Andrzej Pekalski
Abstract:
From some observations on economic behaviors, in particular changing economic conditions with time and space, we develop a very simple model for the evolution of economic entities within a geographical type of framework. We raise a few questions and attempt to investigate whether some of them can be tackled by our model. Several cases of interest are reported. It is found that the model even in…
▽ More
From some observations on economic behaviors, in particular changing economic conditions with time and space, we develop a very simple model for the evolution of economic entities within a geographical type of framework. We raise a few questions and attempt to investigate whether some of them can be tackled by our model. Several cases of interest are reported. It is found that the model even in its simple forms can lead to a large variety of situations, including: delocalization and cycles, but also pre-chaotic behavior.
△ Less
Submitted 18 October, 2002;
originally announced October 2002.