-
A theory of best choice selection through objective arguments grounded in Linear Response Theory concepts
Authors:
Marcel Ausloos,
Giulia Rotundo,
Roy Cerqueti
Abstract:
In this paper, we propose how to use objective arguments grounded in statistical mechanics concepts in order to obtain a single number, obtained after aggregation, which would allow to rank "agents", "opinions", ..., all defined in a very broad sense. We aim toward any process which should a priori demand or lead to some consensus in order to attain the presumably best choice among many possibilit…
▽ More
In this paper, we propose how to use objective arguments grounded in statistical mechanics concepts in order to obtain a single number, obtained after aggregation, which would allow to rank "agents", "opinions", ..., all defined in a very broad sense. We aim toward any process which should a priori demand or lead to some consensus in order to attain the presumably best choice among many possibilities. In order to precise the framework, we discuss previous attempts, recalling trivial "means of scores", - weighted or not, Condorcet paradox, TOPSIS, etc. We demonstrate through geometrical arguments on a toy example, with 4 criteria, that the pre-selected order of criteria in previous attempts makes a difference on the final result. However, it might be unjustified. Thus, we base our "best choice theory" on the linear response theory in statistical mechanics: we indicate that one should be calculating correlations functions between all possible choice evaluations, thereby avoiding an arbitrarily ordered set of criteria. We justify the point through an example with 6 possible criteria. Applications in many fields are suggested. Beside, two toy models serving as practical examples and illustrative arguments are given in an Appendix.
△ Less
Submitted 30 March, 2024;
originally announced May 2024.
-
Hierarchy Selection: New team ranking indicators for cyclist multi-stage races
Authors:
Marcel Ausloos
Abstract:
In this paper, I report some investigation discussing team selection, whence hierarchy, through ranking indicators, for example when measuring professional cyclist team's sportive value, in particular in multistage races. A logical, it seems, constraint is introduced on the riders: they must finish the race. Several new indicators are defined, justified, and compared. These indicators are mainly b…
▽ More
In this paper, I report some investigation discussing team selection, whence hierarchy, through ranking indicators, for example when measuring professional cyclist team's sportive value, in particular in multistage races. A logical, it seems, constraint is introduced on the riders: they must finish the race. Several new indicators are defined, justified, and compared. These indicators are mainly based on the arriving place of (the best 3) riders instead of their time needed for finishing the stage or the race, - as presently classically used. A case study, serving as an illustration containing the necessary ingredients for a wider discussion, is the 2023 Vuelta de San Juan, but without loss of generality.
It is shown that the new indicators offer some new viewpoint for distinguishing the ranking through the cumulative sums of the places of riders rather than their finishing times. On the other hand, the indicators indicate a different team hierarchy if only the finishing riders are considered. Some consideration on the distance between ranking indicators is presented.
Moreover, it is argued that these new ranking indicators should hopefully promote more competitive races, not only till the end of the race, but also until the end of each stage. Generalizations and other applications within operational research topics, like in academia, are suggested.
△ Less
Submitted 22 February, 2024;
originally announced April 2024.
-
Unleashing the Power of AI. A Systematic Review of Cutting-Edge Techniques in AI-Enhanced Scientometrics, Webometrics, and Bibliometrics
Authors:
Hamid Reza Saeidnia,
Elaheh Hosseini,
Shadi Abdoli,
Marcel Ausloos
Abstract:
Purpose: The study aims to analyze the synergy of Artificial Intelligence (AI), with scientometrics, webometrics, and bibliometrics to unlock and to emphasize the potential of the applications and benefits of AI algorithms in these fields.
Design/methodology/approach: By conducting a systematic literature review, our aim is to explore the potential of AI in revolutionizing the methods used to me…
▽ More
Purpose: The study aims to analyze the synergy of Artificial Intelligence (AI), with scientometrics, webometrics, and bibliometrics to unlock and to emphasize the potential of the applications and benefits of AI algorithms in these fields.
Design/methodology/approach: By conducting a systematic literature review, our aim is to explore the potential of AI in revolutionizing the methods used to measure and analyze scholarly communication, identify emerging research trends, and evaluate the impact of scientific publications. To achieve this, we implemented a comprehensive search strategy across reputable databases such as ProQuest, IEEE Explore, EBSCO, Web of Science, and Scopus. Our search encompassed articles published from January 1, 2000, to September 2022, resulting in a thorough review of 61 relevant articles.
Findings: (i) Regarding scientometrics, the application of AI yields various distinct advantages, such as conducting analyses of publications, citations, research impact prediction, collaboration, research trend analysis, and knowledge map**, in a more objective and reliable framework. (ii) In terms of webometrics, AI algorithms are able to enhance web crawling and data collection, web link analysis, web content analysis, social media analysis, web impact analysis, and recommender systems. (iii) Moreover, automation of data collection, analysis of citations, disambiguation of authors, analysis of co-authorship networks, assessment of research impact, text mining, and recommender systems are considered as the potential of AI integration in the field of bibliometrics.
Originality/value: This study covers the particularly new benefits and potential of AI-enhanced scientometrics, webometrics, and bibliometrics to highlight the significant prospects of the synergy of this integration through AI.
△ Less
Submitted 22 February, 2024;
originally announced March 2024.
-
Identification of the most important external features of highly cited scholarly papers through 3 (i.e., Ridge, Lasso, and Boruta) feature selection data mining methods
Authors:
Sepideh Fahimifar,
Khadijeh Mousavi,
Fatemeh Mozaffari,
Marcel Ausloos
Abstract:
Highly cited papers are influenced by external factors that are not directly related to the document's intrinsic quality. In this study, 50 characteristics for measuring the performance of 68 highly cited papers, from the Journal of the American Medical Informatics Association indexed in Web of Sciences (WoS), from 2009 to 2019 were investigated. In the first step, a Pearson correlation analysis i…
▽ More
Highly cited papers are influenced by external factors that are not directly related to the document's intrinsic quality. In this study, 50 characteristics for measuring the performance of 68 highly cited papers, from the Journal of the American Medical Informatics Association indexed in Web of Sciences (WoS), from 2009 to 2019 were investigated. In the first step, a Pearson correlation analysis is performed to eliminate variables with zero or weak correlation with the target (dependent) variable ([number of citations in WOS]). Consequently, 32 variables are selected for the next step. By applying the Ridge technique, 13 features show a positive effect on the number of citations. Using three different algorithms, i.e., Ridge, Lasso, and Boruta, 6 factors appear to be the most relevant ones. The [Number of citations by international researchers], [Journal self-citations in citing documents], and [Authors' self-citations in citing documents], are recognized as the most important features by all three methods here used. The [First author's scientific age], [Open-access paper], and [Number of first author's citations in WOS] are identified as the important features of highly cited papers by only two methods, Ridge and Lasso. Notice that we use specific machine learning algorithms as feature selection methods (Ridge, Lasso, and Boruta) to identify the most important features of highly cited papers, tools that had not previously been used for this purpose. In conclusion, we re-emphasize the performance resulting from such algorithms. Moreover, we do not advise authors to seek to increase the citations of their articles by manipulating the identified performance features. Indeed, ethical rules regarding these characteristics must be strictly obeyed.
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
Shannon Entropy and Herfindahl-Hirschman Index as Team's Performance and Competitive Balance Indicators in Cyclist Multi-Stage Races
Authors:
Marcel Ausloos
Abstract:
It seems that one cannot find many papers relating entropy to sport competitions. Thus, in this paper, I use (i) the Shannon intrinsic entropy ($S$) as an indicator of "teams sporting value" (or "competition performance") and (ii) the Herfindahl-Hirschman index (HHi) index as a "teams competitive balance" indicator, in the case of (professional) cyclist multi-stage races. The 2022 Tour de France a…
▽ More
It seems that one cannot find many papers relating entropy to sport competitions. Thus, in this paper, I use (i) the Shannon intrinsic entropy ($S$) as an indicator of "teams sporting value" (or "competition performance") and (ii) the Herfindahl-Hirschman index (HHi) index as a "teams competitive balance" indicator, in the case of (professional) cyclist multi-stage races. The 2022 Tour de France and 2023 Tour of Oman are used for numerical illustrations and discussion. The numerical values are obtained from classical and and new ranking indices which measure the teams "final time", on one hand, and "final place", on the other hand, based on the "best three" riders in each stage, but also the corresponding times and places throughout the race, for these finishing riders.
The analysis data demonstrates that the constraint, "only the finishing riders count", makes much sense for obtaining a more objective measure of "team value" and team performance", at the end of a multi-stage race. A graphical analysis allows to distinguish various team levels, with in each a Feller-Pareto distribution, thereby pointing to self-organized processes. In so doing, one hopefully better relates objective scientific measures to sport team competitions, and, besides, even proposes some paths to elaborate on forecasting through standard probability concepts.
△ Less
Submitted 18 June, 2023;
originally announced June 2023.
-
Portfolio Volatility Estimation Relative to Stock Market Cross-Sectional Intrinsic Entropy
Authors:
Claudiu Vinte,
Marcel Ausloos
Abstract:
Selecting stock portfolios and assessing their relative volatility risk compared to the market as a whole, market indices, or other portfolios is of great importance to professional fund managers and individual investors alike. Our research uses the cross-sectional intrinsic entropy (CSIE) model to estimate the cross-sectional volatility of the stock groups that can be considered together as portf…
▽ More
Selecting stock portfolios and assessing their relative volatility risk compared to the market as a whole, market indices, or other portfolios is of great importance to professional fund managers and individual investors alike. Our research uses the cross-sectional intrinsic entropy (CSIE) model to estimate the cross-sectional volatility of the stock groups that can be considered together as portfolio constituents. In our study, we benchmark portfolio volatility risks against the volatility of the entire market provided by the CSIE and the volatility of market indices computed using longitudinal data. This article introduces CSIE-based betas to characterise the relative volatility risk of the portfolio against market indices and the market as a whole. We empirically prove that, through CSIE-based betas, multiple sets of symbols that outperform the market indices in terms of rate of return while maintaining the same level of risk or even lower than the one exhibited by the market index can be discovered, for any given time interval. These sets of symbols can be used as constituent stock portfolios and, in connection with the perspective provided by the CSIE volatility estimates, to hierarchically assess their relative volatility risk within the broader context of the overall volatility of the stock market.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Markov Chain Monte Carlo for generating ranked textual data
Authors:
Roy Cerqueti,
Valerio Ficcadenti,
Gurjeet Dhesi,
Marcel Ausloos
Abstract:
This paper faces a central theme in applied statistics and information science, which is the assessment of the stochastic structure of rank-size laws in text analysis. We consider the words in a corpus by ranking them on the basis of their frequencies in descending order. The starting point is that the ranked data generated in linguistic contexts can be viewed as the realisations of a discrete sta…
▽ More
This paper faces a central theme in applied statistics and information science, which is the assessment of the stochastic structure of rank-size laws in text analysis. We consider the words in a corpus by ranking them on the basis of their frequencies in descending order. The starting point is that the ranked data generated in linguistic contexts can be viewed as the realisations of a discrete states Markov chain, whose stationary distribution behaves according to a discretisation of the best fitted rank-size law. The employed methodological toolkit is Markov Chain Monte Carlo, specifically referring to the Metropolis-Hastings algorithm. The theoretical framework is applied to the rank-size analysis of the hapax legomena occurring in the speeches of the US Presidents. We offer a large number of statistical tests leading to the consistency of our methodological proposal. To pursue our scopes, we also offer arguments supporting that hapaxes are rare (``extreme") events resulting from memory-less-like processes. Moreover, we show that the considered sample has the stochastic structure of a Markov chain of order one. Importantly, we discuss the versatility of the method, which is considered suitable for deducing similar outcomes for other applied science contexts.
△ Less
Submitted 13 October, 2022;
originally announced October 2022.
-
God ($\equiv Elohim$), the first small world network
Authors:
Marcel Ausloos
Abstract:
In this paper, the approach of network map** of words in literary texts is extended to ''textual factors'': the network nodes are defined as ''concepts''; the links are ''community connexions''. Thereafter, the text network properties are investigated along modern statistical physics approaches of networks, thereby relating network topology and algebraic properties, to literary texts contents. A…
▽ More
In this paper, the approach of network map** of words in literary texts is extended to ''textual factors'': the network nodes are defined as ''concepts''; the links are ''community connexions''. Thereafter, the text network properties are investigated along modern statistical physics approaches of networks, thereby relating network topology and algebraic properties, to literary texts contents. As a practical illustration, the first chapter of the Genesis in the Bible is mapped into a 10 node network, as in the Kabbalah approach, mentioning God ($\equiv Elohim$). The characteristics of the network are studied starting from its adjacency matrix, and the corresponding Laplacian matrix. Triplets of nodes are particularly examined in order to emphasize the ''textual (community) connexions'' of each agent "emanation", through the so called clustering coefficients and the overlap index, whence measuring the ''semantic flow'' between the different nodes. It is concluded that this graph is a small-world network, weakly dis-assortative, because its average local clustering coefficient is significantly higher than a random graph constructed on the same vertex set.
△ Less
Submitted 20 June, 2022;
originally announced August 2022.
-
An Intrinsic Entropy Model for Exchange-Traded Securities
Authors:
Claudiu Vinte,
Ion Smeureanu,
Titus-Felix Furtuna,
Marcel Ausloos
Abstract:
This article introduces an intrinsic entropy model that can be used as an indicator to gauge investor interest in a given exchange-traded security, along with the state of the general market corroborated by individual security trade data. Although the syntagma of intrinsic entropy might sound somehow pleonastic, since entropy itself characterizes the fundamentals of a system, we would like to make…
▽ More
This article introduces an intrinsic entropy model that can be used as an indicator to gauge investor interest in a given exchange-traded security, along with the state of the general market corroborated by individual security trade data. Although the syntagma of intrinsic entropy might sound somehow pleonastic, since entropy itself characterizes the fundamentals of a system, we would like to make a clear distinction between entropy models based on the values that a random variable may take and the model that we propose, which employs actual stock exchange trading data. The model we propose for intrinsic entropy does not include any exogenous factor that could influence the level of entropy. The intrinsic entropy signals whether the market is inclined to buy the security or rather to sell it. We further explore the usage of the intrinsic entropy model for algorithmic trading, in order to demonstrate the value of our model in assisting investors in the selection of the intraday stock portfolio, along with timely generated signals to support the buy / sell decision making process. The test results provide empirical evidence that the proposed intrinsic entropy model can be used as an indicator to evaluate the direction and intensity of intraday trading activity of an exchange-traded security. The data used for the test consisted of historical intraday transactions executed on The Bucharest Stock Exchange (BVB).
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
A Volatility Estimator of Stock Market Indices Based on the Intrinsic Entropy Model
Authors:
Claudiu Vinte,
Marcel Ausloos,
Titus Felix Furtuna
Abstract:
Gras** the historical volatility of stock market indices and accurately estimating are two of the major focuses of those involved in the financial securities industry and derivative instruments pricing. This paper presents the results of employing the intrinsic entropy model as a substitute for estimating the volatility of stock market indices. Diverging from the widely used volatility models th…
▽ More
Gras** the historical volatility of stock market indices and accurately estimating are two of the major focuses of those involved in the financial securities industry and derivative instruments pricing. This paper presents the results of employing the intrinsic entropy model as a substitute for estimating the volatility of stock market indices. Diverging from the widely used volatility models that take into account only the elements related to the traded prices, namely the open, high, low, and close prices of a trading day (OHLC), the intrinsic entropy model takes into account the traded volumes during the considered time frame as well. We adjust the intraday intrinsic entropy model that we introduced earlier for exchange-traded securities in order to connect daily OHLC prices with the ratio of the corresponding daily volume to the overall volume traded in the considered period. The intrinsic entropy model conceptualizes this ratio as entropic probability or market credence assigned to the corresponding price level. The intrinsic entropy is computed using historical daily data for traded market indices (S&P 500, Dow 30, NYSE Composite, NASDAQ Composite, Nikkei 225, and Hang Seng Index). We compare the results produced by the intrinsic entropy model with the volatility estimates obtained for the same data sets using widely employed industry volatility estimators. The intrinsic entropy model proves to consistently deliver reliable estimates for various time frames while showing peculiarly high values for the coefficient of variation, with the estimates falling in a significantly lower interval range compared with those provided by the other advanced volatility estimators.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
The Cross-Sectional Intrinsic Entropy. A Comprehensive Stock Market Volatility Estimator
Authors:
Claudiu Vinte,
Marcel Ausloos
Abstract:
To take into account the temporal dimension of uncertainty in stock markets, this paper introduces a cross-sectional estimation of stock market volatility based on the intrinsic entropy model. The proposed cross-sectional intrinsic entropy (CSIE) is defined and computed as a daily volatility estimate for the entire market, grounded on the daily traded prices: open, high, low, and close prices (OHL…
▽ More
To take into account the temporal dimension of uncertainty in stock markets, this paper introduces a cross-sectional estimation of stock market volatility based on the intrinsic entropy model. The proposed cross-sectional intrinsic entropy (CSIE) is defined and computed as a daily volatility estimate for the entire market, grounded on the daily traded prices: open, high, low, and close prices (OHLC), along with the daily traded volume for all symbols listed on The New York Stock Exchange (NYSE) and The National Association of Securities Dealers Automated Quotations (NASDAQ). We perform a comparative analysis between the time series obtained from the CSIE and the historical volatility as provided by the estimators: close-to-close, Parkinson, Garman-Klass, Rogers-Satchell, Yang-Zhang, and intrinsic entropy (IE), defined and computed from historical OHLC daily prices of the Standard & Poor's 500 index (S&P500), Dow Jones Industrial Average (DJIA), and the NASDAQ Composite index, respectively, for various time intervals. Our study uses approximately 6000 day reference points, starting on 1 Jan. 2001, until 23 Jan. 2022, for both the NYSE and the NASDAQ. We found that the CSIE market volatility estimator is consistently at least 10 times more sensitive to market changes, compared to the volatility estimate captured through the market indices. Furthermore, beta values confirm a consistently lower volatility risk for market indices overall, between 50% and 90% lower, compared to the volatility risk of the entire market in various time intervals and rolling windows.
△ Less
Submitted 29 April, 2022;
originally announced May 2022.
-
Are We Standing on Unreliable Shoulders? The Effect of Retracted Papers Citations on Previous and Subsequent Published Papers: A Study of the Web of Science Database
Authors:
Sepideh Fahimifar,
Ali Ghorbi,
Marcel Ausloos
Abstract:
The present research attempts to identify the impact of retracted papers on previous or subsequent papers. We consider the 5693 retracted papers from 1975 to 2020 indexed in the Web of Science database based on bibliometric methods. We use HistCite, Excel, and SPSS software as technical means. The findings suggest a significant difference between the average number of retracted and unretracted pap…
▽ More
The present research attempts to identify the impact of retracted papers on previous or subsequent papers. We consider the 5693 retracted papers from 1975 to 2020 indexed in the Web of Science database based on bibliometric methods. We use HistCite, Excel, and SPSS software as technical means. The findings suggest a significant difference between the average number of retracted and unretracted papers when cited in retracted papers. Furthermore, there is a significant difference between the average number of unretracted and retracted papers citing retracted papers. The reasons for the retraction of an article may not be the previous retracted papers, yet unretracted papers may be retracted later because of referring to (many) retracted papers. It is deduced that proprietors of citation databases should carefully focus on these papers by checking references to each new paper citing previously retracted papers.
△ Less
Submitted 22 January, 2022;
originally announced January 2022.
-
Economic Freedom: The Top, the Bottom, and the Reality. I. 1997-2007
Authors:
Marcel Ausloos,
Philippe Bronlet
Abstract:
We recall the historically admitted prerequisites of Economic Freedom (EF). We have examined 908 data points for the Economic Freedom of the World (EFW) index and 1884 points for the Index of Economic Freedom (IEF); the studied periods are 2000-2006 and 1997-2007, respectively, thereby following the Berlin wall collapse, and including Sept. 11, 2001. After discussing EFW index and IEF, in order to…
▽ More
We recall the historically admitted prerequisites of Economic Freedom (EF). We have examined 908 data points for the Economic Freedom of the World (EFW) index and 1884 points for the Index of Economic Freedom (IEF); the studied periods are 2000-2006 and 1997-2007, respectively, thereby following the Berlin wall collapse, and including Sept. 11, 2001. After discussing EFW index and IEF, in order to compare the indices, one needs to study their overlap in time and space. That leaves 138 countries to be examined over a period extending from 2000 to 2006, thus 2 sets of 862 data points. The data analysis pertains to the rank-size law technique. It is examined whether the distributions obey an exponential or a power law. A correlation with the country Gross Domestic Product (GDP), an admittedly major determinant of EF, follows, distinguishing regional aspects, i.e. defining 6 continents. Semi-log plots show that the EFW-rank relationship is exponential for countries of high rank ($\ge 20$); overall the log-log plots point to a behaviour close to a power law. In contrast, for the IEF, the overall ranking has an exponential behaviour; but the log-log plots point to the existence of a transitional point between two different power laws, i.e., near rank 10. Moreover, log-log plots of the EFW index relationship to country GDP is characterised by a power law, with a rather stable exponent ($γ\simeq 0.674$) as a function of time. In contrast, log-log plots of the IEF relationship with the country's gross domestic product point to a downward evolutive power law as a function of time. Markedly the two studied indices provide different aspects of EF.
△ Less
Submitted 22 January, 2022;
originally announced January 2022.
-
An Intergenerational Issue: The Equity Issues due to Public-Private Partnerships. The Critical Aspect of the Social Discount Rate Choice for Future Generations
Authors:
Abeer Al Yaqoobi,
Marcel Ausloos
Abstract:
This paper investigates the impact of Social Discount Rate (SDR) choice on intergenerational equity issues caused by Public-Private Partnerships (PPPs) projects. Indeed, more PPPs mean more debt being accumulated for future generations leading to a fiscal deficit crisis. The paper draws on how the SDR level taken today distributes societies on the Social Welfare Function (SWF). This is done by ans…
▽ More
This paper investigates the impact of Social Discount Rate (SDR) choice on intergenerational equity issues caused by Public-Private Partnerships (PPPs) projects. Indeed, more PPPs mean more debt being accumulated for future generations leading to a fiscal deficit crisis. The paper draws on how the SDR level taken today distributes societies on the Social Welfare Function (SWF). This is done by answering two sub-questions: (i) What is the risk of PPPs debts being off-balance sheet? (ii) How do public policies, based on the envisaged SDR, position society within different ethical perspectives? The answers are obtained from a discussion of the different SDRs (applied in the UK for examples) according to the merits of the pertinent ethical theories, namely libertarian, egalitarian, utilitarian and Rawlsian. We find that public policymakers can manipulate the SDR to make PPPs looking like a better option than the traditional financing form. However, this antagonises the Value for Money principle. We also point out that public policy is not harmonised with ethical theories. We find that at present (in the UK), the SDR is somewhere between weighted utilitarian and Rawlsian societies in the trade-off curve. Alas, our study finds no evidence that the (UK) government is using a sophisticated system to keep pace with the accumulated off-balance sheet debts. Thus, the exact prediction of the final state is hardly made because of the uncertainty factor. We conclude that our study hopefully provides a good analytical framework for policymakers in order to draw on the merits of ethical theories before initiating public policies like PPPs.
△ Less
Submitted 22 January, 2022;
originally announced January 2022.
-
Stock index futures trading impact on spot price volatility. The CSI 300 studied with a TGARCH model
Authors:
Marcel Ausloos,
Yining Zhang,
Gurjeet Dhesi
Abstract:
A TGARCH modeling is argued to be the optimal basis for investigating the impact of index futures trading on spot price variability. We discuss the CSI-300 index (China-Shanghai-Shenzhen-300-Stock Index) as a test case. The results prove that the introduction of CSI-300 index futures (CSI-300-IF) trading significantly reduces the volatility in the corresponding spot market. It is also found that t…
▽ More
A TGARCH modeling is argued to be the optimal basis for investigating the impact of index futures trading on spot price variability. We discuss the CSI-300 index (China-Shanghai-Shenzhen-300-Stock Index) as a test case. The results prove that the introduction of CSI-300 index futures (CSI-300-IF) trading significantly reduces the volatility in the corresponding spot market. It is also found that there is a stationary equilibrium relationship between the CSI-300 spot and CCSI-300-IF markets. A bidirectional Granger causality is also detected. ''Finally'', it is deduced that spot prices are predicted with greater accuracy over a 3 or 4 lag day time span.
△ Less
Submitted 29 August, 2021;
originally announced September 2021.
-
Tsallis entropy for cross-shareholding network configurations
Authors:
Roy Cerqueti,
Giulia Rotundo,
Marcel Ausloos
Abstract:
In this work, we develop the Tsallis entropy approach for examining the cross-shareholding network of companies traded on the Italian stock market. In such a network, the nodes represent the companies, and the links represent the ownership. Within this context, we introduce the out-degree of the nodes -- which represents the diversification -- and the in-degree of them -- capturing the integration…
▽ More
In this work, we develop the Tsallis entropy approach for examining the cross-shareholding network of companies traded on the Italian stock market. In such a network, the nodes represent the companies, and the links represent the ownership. Within this context, we introduce the out-degree of the nodes -- which represents the diversification -- and the in-degree of them -- capturing the integration. Diversification and integration allow a clear description of the industrial structure formed by the considered companies. The stochastic dependence of diversification and integration is modelled through copulas. We argue that copulas are well suited for modelling the joint distribution. The analysis of the stochastic dependence between integration and diversification by means of the Tsallis entropy gives a crucial information on the reaction of the market structure to the external shocks, - on the basis of some relevant cases of dependence between the considered variables. In this respect, the considered entropy framework provides insights on the relationship between in-degree and out-degree dependence structure and market polarisation or fairness. Moreover, the interpretation of the results in the light of the Tsallis entropy parameter gives relevant suggestions for policymakers who aim at sha** the industrial context for having high polarisation or fair joint distribution of diversification and integration. Furthermore, a discussion of possible parametrisations of the in-degree and out-degree marginal distribution, -- by means of power laws or exponential functions, -- is also carried out. An empirical experiment on a large dataset of Italian companies validates the theoretical framework.
△ Less
Submitted 29 August, 2021;
originally announced September 2021.
-
Retracted papers by Iranian authors: Causes, journals, time lags, affiliations, collaborations
Authors:
Ali Ghorbi,
Mohsen Fazeli-Varzaneh,
Erfan Ghaderi-Azad,
Marcel Ausloos,
Marcin Kozak
Abstract:
This study aims to analyze 343 retraction notices indexed in the Scopus database, published in 2001-2019, related to scientific articles (co-)written by at least one author affiliated with an Iranian institution. In order to determine reasons for retractions, we merged this database with the database from Retraction Watch. The data were analyzed using Excel 2016 and IBM-SPSS version 24.0, and visu…
▽ More
This study aims to analyze 343 retraction notices indexed in the Scopus database, published in 2001-2019, related to scientific articles (co-)written by at least one author affiliated with an Iranian institution. In order to determine reasons for retractions, we merged this database with the database from Retraction Watch. The data were analyzed using Excel 2016 and IBM-SPSS version 24.0, and visualized using VOSviewer software. Most of the retractions were due to fake peer review (95 retractions) and plagiarism (90). The average time between a publication and its retraction was 591 days. The maximum time-lag (about 3,000 days) occurred for papers retracted due to duplicate publications; the minimum time-lag (fewer than 100 days) was for papers retracted due to ''unspecified cause'' (most of these were conference papers). As many as 48 (14%) of the retracted papers were published in two medical journals: Tumor Biology (25 papers) and Diagnostic Pathology (23 papers). From the institutional point of view, Islamic Azad University was the inglorious leader, contributing to over one-half (53.1%) of retracted papers. Among the 343 retraction notices, 64 papers pertained to international collaborations with researchers from mainly Asian and European countries; Malaysia having the most retractions (22 papers). Since most retractions were due to fake peer review and plagiarism, the peer review system appears to be a weak point of the submission/publication process; if improved, the number of retractions would likely drop because of increased editorial control.
△ Less
Submitted 29 August, 2021;
originally announced August 2021.
-
Benford's laws tests on S&P500 daily closing values and the corresponding daily log-returns both point to huge non-conformity
Authors:
Marcel Ausloos,
Valerio Ficcadenti,
Gurjeet Dhesi,
Muhammad Shakeel
Abstract:
The so-called Benford's laws are of frequent use in order to observe anomalies and regularities in data sets, in particular, in election results and financial statements. Yet, basic financial market indices have not been much studied, if studied at all, within such a perspective. This paper presents features in the distributions of S\&P500 daily closing values and the corresponding daily log retur…
▽ More
The so-called Benford's laws are of frequent use in order to observe anomalies and regularities in data sets, in particular, in election results and financial statements. Yet, basic financial market indices have not been much studied, if studied at all, within such a perspective. This paper presents features in the distributions of S\&P500 daily closing values and the corresponding daily log returns over a long time interval, [03/01/1950 - 22/08/2014], amounting to 16265 data points. We address the frequencies of the first, second, and first two significant digits counts and explore the conformance to Benford's laws of these distributions at five different (equal size) levels of disaggregation. The log returns are studied for either positive or negative cases. The results for the S&P500 daily closing values are showing a huge lack of non-conformity, whatever the different levels of disaggregation. Some "first digits" and "first two digits" values are even missing. The causes of this non-conformity are discussed, pointing to the danger in taking Benford's laws for granted in huge databases, whence drawing "definite conclusions". The agreements with Benford's laws are much better for the log returns. Such a disparity in agreements finds an explanation in the data set itself: the inherent trend in the index. To further validate this, daily returns have been simulated calibrating the simulations with the observed data averages and tested against Benford's laws. One finds that not only the trend but also the standard deviation of the distributions are relevant parameters in concluding about conformity with Benford's laws.
△ Less
Submitted 16 April, 2021;
originally announced April 2021.
-
Challenging Practical Features of Bitcoin by the Main Altcoins
Authors:
Andrew Spurr,
Marcel Ausloos
Abstract:
We study the fundamental differences that separate: Litecoin; Bitcoin Gold; Bitcoin Cash; Ethereum; and Zcash from Bitcoin, and draw analysis to how these features are appreciated by the market, to ultimately make an inference as to how future successful cryptocurrencies may behave. We use Google Trend data, as well as price, volume and market capitalization data sourced from coinmarketcap.com to…
▽ More
We study the fundamental differences that separate: Litecoin; Bitcoin Gold; Bitcoin Cash; Ethereum; and Zcash from Bitcoin, and draw analysis to how these features are appreciated by the market, to ultimately make an inference as to how future successful cryptocurrencies may behave. We use Google Trend data, as well as price, volume and market capitalization data sourced from coinmarketcap.com to support this analysis. We find that Litecoin's shorter block times offer benefits in commerce, but drawbacks in the mining process through orphaned blocks. Zcash holds a niche use for anonymous transactions, benefitting areas of the world lacking in economic freedom. Bitcoin Cash suffers from centralization in the mining process, while the greater decentralization of Bitcoin Gold has generally left it to stagnate. Ether's greater functionality offers the greatest threat to Bitcoin's dominance in the market. A coin that incorporates several of these features can be technically better than Bitcoin, but the first-to-marketadvantage of Bitcoin should keep its dominant position in the market.
△ Less
Submitted 19 December, 2020;
originally announced January 2021.
-
Simple approaches on how to discover promising strategies for efficient enterprise performance, at time of crisis in the case of SMEs : Voronoi clustering and outlier effects perspective
Authors:
Marcel Ausloos,
Francesca Bartolacci,
Nicola G. Castellano,
Roy Cerqueti
Abstract:
This paper analyzes the connection between innovation activities of companies -- implemented before a financial crisis -- and their performance -- measured after such a time of crisis. Pertinent data about companies listed in the STAR Market Segment of the Italian Stock Exchange is analyzed. Innovation is measured through the level of investments in total tangible and intangible fixed assets in 20…
▽ More
This paper analyzes the connection between innovation activities of companies -- implemented before a financial crisis -- and their performance -- measured after such a time of crisis. Pertinent data about companies listed in the STAR Market Segment of the Italian Stock Exchange is analyzed. Innovation is measured through the level of investments in total tangible and intangible fixed assets in 2006-2007, while performance is captured through growth -- expressed by variations of sales or of total assets, -- profitability -- through ROI or ROS evolution, - and productivity -- through asset turnover or sales/employee in the period 2008-2010. The variables of interest are analyzed and compared through statistical techniques and by adopting a cluster analysis. In particular, a Voronoi tessellation is implemented in a varying centroids framework. In accord with a large part of the literature, we find that the behavior of the performance of the companies is not univocal when they innovate. The statistical outliers are the best cases in order to suggest efficient strategies. In brief, it is found that a positive rate of investments is preferable.
△ Less
Submitted 19 December, 2020;
originally announced December 2020.
-
If Global or Local Investor Sentiments are Prone to Develo** an Impact on Stock Returns, is there an Industry Effect?
Authors:
**g Shi,
Marcel Ausloos,
Tingting Zhu
Abstract:
This paper investigates the heterogeneous impacts of either Global or Local Investor Sentiments on stock returns. We study 10 industry sectors through the lens of 6 (so called) emerging countries: China, Brazil, India, Mexico, Indonesia and Turkey, over the 2000 to 2014 period. Using a panel data framework, our study sheds light on a significant effect of Local Investor Sentiments on expected retu…
▽ More
This paper investigates the heterogeneous impacts of either Global or Local Investor Sentiments on stock returns. We study 10 industry sectors through the lens of 6 (so called) emerging countries: China, Brazil, India, Mexico, Indonesia and Turkey, over the 2000 to 2014 period. Using a panel data framework, our study sheds light on a significant effect of Local Investor Sentiments on expected returns for basic materials, consumer goods, industrial, and financial industries. Moreover, our results suggest that from Global Investor Sentiments alone, one cannot predict expected stock returns in these markets.
△ Less
Submitted 21 December, 2020;
originally announced December 2020.
-
Valuation Models Applied to Value-Based Management. Application to the Case of UK Companies with Problems
Authors:
Marcel Ausloos
Abstract:
Many still rightly wonder whether accounting numbers affect business value. Basic questions are why? and how? I aim at promoting an objective choice on how optimizing the most suitable valuation methods under a value-based management framework through some performance measurement systems. First, I present a comprehensive review of valuation methods. Three valuations methods, (i) Free Cash Flow Val…
▽ More
Many still rightly wonder whether accounting numbers affect business value. Basic questions are why? and how? I aim at promoting an objective choice on how optimizing the most suitable valuation methods under a value-based management framework through some performance measurement systems. First, I present a comprehensive review of valuation methods. Three valuations methods, (i) Free Cash Flow Valuation Model (FCFVM), (ii) Residual Earning Valuation Model (REVM) and (iii) Abnormal Earning Growth Model (AEGM), are presented. I point out to advantages and limitations. As applications, the proofs of the findings are illustrated on three study cases: Marks & Spencer's business pattern (size and growth prospect), which had a recently advertised valuation problem, and two comparable companies, Tesco and Sainsbury's, all three chosen for multiple-based valuation. For the purpose, two value drivers are chosen, EnV/EBIT (entity value/earnings before interests and taxes) and the corresponding EnV/Sales. Thus, the question whether accounting numbers through models based on mathematical economics truly affect business value has an answer: Maybe, yes.
△ Less
Submitted 19 December, 2020;
originally announced December 2020.
-
Insider trading in the run-up to merger announcements. Before and after the UK's Financial Services Act 2012
Authors:
Rebecaa Pham,
Marcel Ausloos
Abstract:
After the 2007/2008 financial crisis, the UK government decided that a change in regulation was required to amend the poor control of financial markets. The Financial Services Act 2012 was developed as a result in order to give more control and authority to the regulators of financial markets. Thus, the Financial Conduct Authority (FCA) succeeded the Financial Services Authority (FSA). An area req…
▽ More
After the 2007/2008 financial crisis, the UK government decided that a change in regulation was required to amend the poor control of financial markets. The Financial Services Act 2012 was developed as a result in order to give more control and authority to the regulators of financial markets. Thus, the Financial Conduct Authority (FCA) succeeded the Financial Services Authority (FSA). An area requiring an improvement in regulation was insider trading. Our study examines the effectiveness of the FCA in its duty of regulating insider trading through utilising the event study methodology to assess abnormal returns in the run-up to the first announcement of mergers. Samples of abnormal returns are examined on periods, under regulation either by the FSA or by the FCA. Practically, stock price data on the London Stock Exchange from 2008-2012 and 2015-2019 is investigated. The results from this study determine that abnormal returns are reduced after the implementation of the Financial Services Act 2012; prices are also found to be noisier in the period before the 2012 Act. Insignificant abnormal returns are found in the run-up to the first announcement of mergers in the 2015-2019 period. This concludes that the FCA is efficient in regulating insider trading.
△ Less
Submitted 19 December, 2020;
originally announced December 2020.
-
Hagiotoponyms in France: Saint popularity, like a herding phase transition
Authors:
Marcel Ausloos
Abstract:
A spectacular order-order-like transition is presented in the distribution of hagiotoponyms in France. Data analysis and displays distinguish male and female cases. The respective hapax values point to a very large variety of saints with a specific devotion. The most popular ones are St. Martin and the apostles. The less popular ones are not so well known. These features are explained in terms of…
▽ More
A spectacular order-order-like transition is presented in the distribution of hagiotoponyms in France. Data analysis and displays distinguish male and female cases. The respective hapax values point to a very large variety of saints with a specific devotion. The most popular ones are St. Martin and the apostles. The less popular ones are not so well known. These features are explained in terms of herding in agent behaviors: people have either preferred popular saints with supposedly good links to God, whence a herding behavior, or (non-herding) agents have preferred to name their local human settlement through a reference to some holy person(s) with more local specificities -- yet with moral or religious leadership, and conjectured to have good contact with God, whence at least locally defined as a saint.
△ Less
Submitted 19 December, 2020;
originally announced December 2020.
-
Corporate Governance and Firms Financial Performance in the United Kingdom
Authors:
Martin Kyere,
Marcel Ausloos
Abstract:
The objective of this study is to examine empirically the impact of good corporate governance on financial performance of United Kingdom non-financial listed firms. Agency theory and stewardship theory serve as the bases of a conceptual model. Five corporate governance mechanisms are examined on two financial performance indicators, return on assets (ROA) and Tobin's Q, employing cross-sectional r…
▽ More
The objective of this study is to examine empirically the impact of good corporate governance on financial performance of United Kingdom non-financial listed firms. Agency theory and stewardship theory serve as the bases of a conceptual model. Five corporate governance mechanisms are examined on two financial performance indicators, return on assets (ROA) and Tobin's Q, employing cross-sectional regression methodology. The conclusion drawn from empirical test so performed on 252 firms listed on London Stock Exchange for the year 2014 indicates a positive or a negative relationship, but also sometimes no effect, of corporate governance mechanisms impact on financial performance. The implications are discussed. Thereby, so distinguishing effects due to causes, we present a proof that, when the right corporate governance mechanisms are chosen, the finances of a firm can be improved. The results of this research should have some implication on academia and policy makers thoughts.
△ Less
Submitted 22 July, 2020;
originally announced August 2020.
-
Words ranking and Hirsch index for identifying the core of the hapaxes in political texts
Authors:
Valerio Ficcadenti,
Roy Cerqueti,
Marcel Ausloos,
Gurjeet Dhesi
Abstract:
This paper deals with a quantitative analysis of the content of official political speeches. We study a set of about one thousand talks pronounced by the US Presidents, ranging from Washington to Trump. In particular, we search for the relevance of the rare words, i.e. those said only once in each speech -- the so-called hapaxes. We implement a rank-size procedure of Zipf-Mandelbrot type for discu…
▽ More
This paper deals with a quantitative analysis of the content of official political speeches. We study a set of about one thousand talks pronounced by the US Presidents, ranging from Washington to Trump. In particular, we search for the relevance of the rare words, i.e. those said only once in each speech -- the so-called hapaxes. We implement a rank-size procedure of Zipf-Mandelbrot type for discussing the hapaxes' frequencies regularity over the overall set of speeches. Starting from the obtained rank-size law, we define and detect the core of the hapaxes set by means of a procedure based on an Hirsch index variant. We discuss the resulting list of words in the light of the overall US Presidents' speeches. We further show that this core of hapaxes itself can be well fitted through a Zipf-Mandelbrot law and that contains elements producing deviations at the low ranks between scatter plots and fitted curve -- the so-called king and vice-roy effect. Some socio-political insights are derived from the obtained findings about the US Presidents messages.
△ Less
Submitted 13 June, 2020;
originally announced June 2020.
-
Coupled criticality analysis of inflation and unemployment
Authors:
Z. Koohi Lai,
A. Namaki,
A. Hosseiny,
G. R. Jafari,
M. Ausloos
Abstract:
In this paper, we are interested to focus on the critical periods in the economy which are characterized by large fluctuations in macroeconomic indicators.
To capture unusual and large fluctuations of inflation and unemployment, we concentrate on the non-Gaussianity of their distributions.
To this aim, by using the coupled multifractal approach, we analyze US data for a period of 70 years from…
▽ More
In this paper, we are interested to focus on the critical periods in the economy which are characterized by large fluctuations in macroeconomic indicators.
To capture unusual and large fluctuations of inflation and unemployment, we concentrate on the non-Gaussianity of their distributions.
To this aim, by using the coupled multifractal approach, we analyze US data for a period of 70 years from 1948 until 2018 and measure the non-Gausianity of the distributions. Then, we investigate how the non-Gaussianity of the variables affects the coupling structure of them. By applying the multifractal method, one can see that the non-Gaussianity depends on the scales. While the non-Gaussianity of unemployment is noticeable only for periods smaller than 1 year and for longer periods tends to Gaussian behavior, the non-Gaussianities of inflation persist for all time scales. Also, it is observed that the coupling structure of these variables tends to a Gaussian behavior after $2$ years.
△ Less
Submitted 27 March, 2020;
originally announced March 2020.
-
Rank-size law, financial inequality indices and gain concentrations by cyclist teams. The case of a multiple stage bicycle race, like Tour de France
Authors:
Marcel Ausloos
Abstract:
This note examines financial distributions to competing teams at the end of the most famous multiple stage professional (male) bicyclist race, TOUR DE FRANCE. A rank-size law (RSL) is calculated for the team financial gains. The RSL is found to be hyperbolic with a surprisingly simple decay exponent (about equal to -1). Yet, the financial gain distributions unexpectedly do not obey Pareto principl…
▽ More
This note examines financial distributions to competing teams at the end of the most famous multiple stage professional (male) bicyclist race, TOUR DE FRANCE. A rank-size law (RSL) is calculated for the team financial gains. The RSL is found to be hyperbolic with a surprisingly simple decay exponent (about equal to -1). Yet, the financial gain distributions unexpectedly do not obey Pareto principle of factor sparsity. Next, several (8) inequality indices are considered : the Entropy, the Hirschman-Herfindahl, Theil, Pietra-Hoover, Gini, Rosenbluth indices, the Coefficient of Variation and the Concentration Index are calculated for outlining diversity measures. The connection between such indices and their concentration aspects meanings are presented as support of the RSL findings. The results emphasize that the sum of skills and team strategies are effectively contributing to the financial gains distributions. From theoretical and practical points of view, the findings suggest that one should investigate other "long multiple stage races" and rewarding rules. Indeed, money prize rules coupling to stage difficulty might influence and maybe enhance (or deteriorate) purely sportive aspects in group competitions. Due to the delay in the peer review process, the 2019 results can be examined. They are discussed in an Appendix; the value of the exponent (-1.2) is pointed out to mainly originating from the so called "king effect"; the tail of the RSL rather looks like an exponential.
△ Less
Submitted 24 October, 2019;
originally announced October 2019.
-
Fundamental Analysis in China: An Empirical Study of the Relationship between Financial Ratios and Stock Prices
Authors:
Lijuan Ma,
Marcel Ausloos,
Christophe Schinckus,
H. L. Felicia Chong
Abstract:
The informational context is regularly questioned in a transitional economic regime like the one implemented in China or Vietnam. This article investigates this issue and the predictive power of fundamental analysis in such context and more precisely in a Chinese context with an analysis of 3 different industries (media, power, and steel). Through 3 different kinds of correlation, we examine 25 fi…
▽ More
The informational context is regularly questioned in a transitional economic regime like the one implemented in China or Vietnam. This article investigates this issue and the predictive power of fundamental analysis in such context and more precisely in a Chinese context with an analysis of 3 different industries (media, power, and steel). Through 3 different kinds of correlation, we examine 25 financial determinants for 60 Chinese listed companies between 2011 and 2015. Our results show that fundamental analysis can effectively be used as an investment tool in transitional economic context. Contrasting with the EMH for which the accounting information is instantaneously integrated into the financial information (stock prices), our study suggests that these two levels of information are not synchronized in China opening therefore a door for a fundamental analysis based prediction. Furthermore, our results also indicate that accounting information illustrates quite well the economic reality since financial reports in each industry can disclose a part of stock value information in line with the economic situation of the industry under consideration.
△ Less
Submitted 12 October, 2019;
originally announced October 2019.
-
Seasonal Entropy, Diversity and Inequality Measures of Submitted and Accepted Papers Distributions In Peer-Reviewed Journals
Authors:
Marcel Ausloos,
Olgica Nedic,
Aleksandar Dekanski
Abstract:
This paper presents a novel method for finding features in the analysis of variable distributions stemming from time series. We apply the methodology to the case of submitted and accepted papers in peer-reviewed journals. We provide a comparative study of editorial decisions for papers submitted to two peer-reviewed journals: the Journal of the Serbian Chemical Society (JSCS) and this MDPI Entropy…
▽ More
This paper presents a novel method for finding features in the analysis of variable distributions stemming from time series. We apply the methodology to the case of submitted and accepted papers in peer-reviewed journals. We provide a comparative study of editorial decisions for papers submitted to two peer-reviewed journals: the Journal of the Serbian Chemical Society (JSCS) and this MDPI Entropy journal. We cover three recent years for which the fate of submitted papers, about 600 papers to JSCS and 2500 to Entropy, is completely determined. Instead of comparing the number distributions of these papers as a function of time with respect to a uniform distribution, we analyze the relevant probabilities, from which we derive the information entropy. It is argued that such probabilities are indeed more relevant for authors than the actual number of submissions. We tie this entropy analysis to the so called diversity of the variable distributions. Furthermore, we emphasize the correspondence between the entropy and the diversity with inequality measures, like the Herfindahl-Hirschman index and the Theil index, itself being in the class of entropy measures; the Gini coefficient which also measures the diversity in ranking is calculated for further discussion. In this sample, the seasonal aspects of the peer review process are outlined. It is found that the use of such indices, non linear transformations of the data distributions, allow to distinguish features and evolutions of peer review process as a function of time as well as comparing non-uniformity of distributions. Furthermore, t- and z- statistical tests are applied in order to measure the significance (p-level) of the findings, i.e. whether papers are more likely to be accepted if they are submitted during a few specific months or "season"; the predictability strength depends on the journal.
△ Less
Submitted 13 October, 2019;
originally announced October 2019.
-
Correlations between submission and acceptance of papers in peer review journals
Authors:
Marcel Ausloos,
Olgica Nedic,
Aleksandar Dekanski
Abstract:
This paper provides a comparative study about seasonal influence on editorial decisions for papers submitted to two peer review journals. We distinguish a specialized one, the Journal of the Serbian Chemical Society (JSCS) and an interdisciplinary one, Entropy. Dates of electronic submission for about 600 papers to JSCS and 2500 to Entropy have been recorded over 3 recent years. Time series of eit…
▽ More
This paper provides a comparative study about seasonal influence on editorial decisions for papers submitted to two peer review journals. We distinguish a specialized one, the Journal of the Serbian Chemical Society (JSCS) and an interdisciplinary one, Entropy. Dates of electronic submission for about 600 papers to JSCS and 2500 to Entropy have been recorded over 3 recent years. Time series of either accepted or rejected papers are subsequently analyzed. We take either editors or authors view points into account, thereby considering magnitudes and probabilities. In this sample, it is found that there are distinguishable peaks and dips in the time series, demonstrating preferred months for the submission of papers. It is also found that papers are more likely accepted if they are submitted during a few specific months, - these depending on the journal. The probability of having a rejected paper also appears to be seasonally biased. In view of clarifying reports with contradictory findings, we discuss previously proposed conjectures for such effects, like holiday effects and the desk rejection by editors. We conclude that, in this sample, the type of journal, specialized or multidisciplinary, seems to be the drastic criterion for distinguishing the outcomes rates.
△ Less
Submitted 13 October, 2019;
originally announced October 2019.
-
Efficiency in managing peer-review of scientific manuscripts -- editors' perspective
Authors:
Olgica Nedic,
Ivana Drvenica,
Marcel Ausloos,
Aleksandar Dekanski
Abstract:
The purpose of this paper is to introduce a model for measuring the efficiency in managing peer-review of scientific manuscripts by editors. The approach employed is based on the assumption that the editorial aim is to manage publication with high efficiency, employing the least amount of editorial resources. Efficiency is defined in this research as a measure based on 7 variables. An on-line surv…
▽ More
The purpose of this paper is to introduce a model for measuring the efficiency in managing peer-review of scientific manuscripts by editors. The approach employed is based on the assumption that the editorial aim is to manage publication with high efficiency, employing the least amount of editorial resources. Efficiency is defined in this research as a measure based on 7 variables. An on-line survey was constructed and editors of journals originating from Serbia regularly publishing articles in the field of chemistry were invited to participate. An evaluation of the model is given based on responses from 24 journals and 50 editors. With this investigation we aimed to contribute to our understanding of the peer-review process and, possibly, offer a tool to improve the "efficiency" in journal editing. The proposed protocol may be adapted by other journals in order to assess the managing potential of editors.
△ Less
Submitted 12 October, 2019;
originally announced October 2019.
-
A joint text mining-rank size investigation of the rhetoric structures of the US Presidents' speeches
Authors:
Valerio Ficcadenti,
Roy Cerqueti,
Marcel Ausloos
Abstract:
This work presents a text mining context and its use for a deep analysis of the messages delivered by the politicians. Specifically, we deal with an expert systems-based exploration of the rhetoric dynamics of a large collection of US Presidents' speeches, ranging from Washington to Trump. In particular, speeches are viewed as complex expert systems whose structures can be effectively analyzed thr…
▽ More
This work presents a text mining context and its use for a deep analysis of the messages delivered by the politicians. Specifically, we deal with an expert systems-based exploration of the rhetoric dynamics of a large collection of US Presidents' speeches, ranging from Washington to Trump. In particular, speeches are viewed as complex expert systems whose structures can be effectively analyzed through rank-size laws. The methodological contribution of the paper is twofold. First, we develop a text mining-based procedure for the construction of the dataset by using a web scra** routine on the Miller Center website -- the repository collecting the speeches. Second, we explore the implicit structure of the discourse data by implementing a rank-size procedure over the individual speeches, being the words of each speech ranked in terms of their frequencies. The scientific significance of the proposed combination of text-mining and rank-size approaches can be found in its flexibility and generality, which let it be reproducible to a wide set of expert systems and text mining contexts. The usefulness of the proposed method and the speech subsequent analysis is demonstrated by the findings themselves. Indeed, in terms of impact, it is worth noting that interesting conclusions of social, political and linguistic nature on how 45 United States Presidents, from April 30, 1789 till February 28, 2017 delivered political messages can be carried out. Indeed, the proposed analysis shows some remarkable regularities, not only inside a given speech, but also among different speeches. Moreover, under a purely methodological perspective, the presented contribution suggests possible ways of generating a linguistic decision-making algorithm.
△ Less
Submitted 9 May, 2019;
originally announced May 2019.
-
Evidence for Gross Domestic Product growth time delay dependence over Foreign Direct Investment. A time-lag dependent correlation study
Authors:
Marcel Ausloos,
Ali Eskandary,
Parmjit Kaur,
Gurjeet Dhesi
Abstract:
This paper considers an often forgotten relationship, the time delay between a cause and its effect in economies and finance. We treat the case of Foreign Direct Investment (FDI) and economic growth, - measured through a country Gross Domestic Product (GDP). The pertinent data refers to 43 countries, over 1970-2015, - for a total of 4278 observations. When countries are grouped
according to the…
▽ More
This paper considers an often forgotten relationship, the time delay between a cause and its effect in economies and finance. We treat the case of Foreign Direct Investment (FDI) and economic growth, - measured through a country Gross Domestic Product (GDP). The pertinent data refers to 43 countries, over 1970-2015, - for a total of 4278 observations. When countries are grouped
according to the Inequality-Adjusted Human Development Index (IHDI), it is found that a time lag dependence effect exists in FDI-GDP correlations.
This is established through a time-dependent Pearson 's product-moment correlation coefficient matrix.
Moreover, such a Pearson correlation coefficient is observed to evolve from positive
to negative values depending on the IHDI, from low to high. It is "politically and policy
"relevant" that
the correlation is statistically significant providing the time lag is less than 3 years. A "rank-size" law is demonstrated.
It is recommended to reconsider such a time lag effect when discussing previous analyses whence conclusions on international business, and thereafter on forecasting.
△ Less
Submitted 5 May, 2019;
originally announced May 2019.
-
Optimization of the post-crisis recovery plans in scale-free networks
Authors:
Mohammad Bahrami,
Narges Chinichian,
Ali Hosseiny,
Gholamreza Jafari,
Marcel Ausloos
Abstract:
General Motors or a local business, which one is better to be stimulated in post-crisis recessions, where government stimulation is meant to overcome recessions? Due to the budget constraints, it is quite relevant to ask how one can increase the chance of economic recovery. One of the key elements to answer this question is to understand metastable features of the economic networks. Ising model ha…
▽ More
General Motors or a local business, which one is better to be stimulated in post-crisis recessions, where government stimulation is meant to overcome recessions? Due to the budget constraints, it is quite relevant to ask how one can increase the chance of economic recovery. One of the key elements to answer this question is to understand metastable features of the economic networks. Ising model has been suggested for studying such features in the literature. In the homogenous networks one needs at least a minimum activation, forcing an Ising network to switch its local equilibria, where such minimum is independent of the nodes characteristics. In the scale free networks however, when one aims to push the network to switch its vacuum, she faces the question of which nodes are better to be stimulated to minimize the cost. In the paper it has been shown that stimulation of the high degree nodes costs less in general. Despite regular networks, in the scale free networks, the stimulation cost depends on the networks features such as assortativity. Though we have utilized the Ising model to tackle a problem in economics, our analysis shed lights on many other problems concerning stimulations of socio-economic systems.
△ Less
Submitted 23 October, 2019; v1 submitted 23 April, 2019;
originally announced April 2019.
-
Exploring how innovation strategies at time of crisis influence performance: a cluster analysis perspective
Authors:
Marcel Ausloos,
Francesca Bartolacci,
Nicola G. Castellano,
Roy Cerqueti
Abstract:
This paper analyzes the connection between innovation activities of companies -- implemented before crisis -- and their performance -- measured at time of crisis. The companies listed in the STAR Market Segment of the Italian Stock Exchange are analyzed. Innovation is measured through the level of investments in total tangible and intangible fixed assets in 2006-2007, while performance is captured…
▽ More
This paper analyzes the connection between innovation activities of companies -- implemented before crisis -- and their performance -- measured at time of crisis. The companies listed in the STAR Market Segment of the Italian Stock Exchange are analyzed. Innovation is measured through the level of investments in total tangible and intangible fixed assets in 2006-2007, while performance is captured through growth -- expressed by variations of sales, total assets and employees -- profitability -- through ROI or ROS -- and productivity -- through asset turnover or sales per employee in the period 2008-2010. The variables of interest are analyzed and compared through statistical techniques and by adopting cluster analysis. In particular, a Voronoi tessellation is also implemented in a varying centroids framework. In accord with a large part of the literature, we find that the behavior of the performance of the companies is not univocal when they innovate.
△ Less
Submitted 17 August, 2018;
originally announced August 2018.
-
A tribute to Marian Smoluchowski's legacy on soft grains assembly and hydrogel formation
Authors:
Adam Gadomski,
Natalia Kruszewska,
Piotr Bełdowski,
Bogdan Lent,
Marcel Ausloos
Abstract:
The paper compares the statistical description of physical-metallurgical processes and ceramic-polycrystalline evolutions, termed the normal grain growth (NGG), as adopted to soft- and chemically-reactive grains, with a Smoluchowski's population-constant kernel cluster-cluster aggregation (CCA) model, concerning irreversible chemical reaction kinetics. The former aiming at comprehending, in a semi…
▽ More
The paper compares the statistical description of physical-metallurgical processes and ceramic-polycrystalline evolutions, termed the normal grain growth (NGG), as adopted to soft- and chemically-reactive grains, with a Smoluchowski's population-constant kernel cluster-cluster aggregation (CCA) model, concerning irreversible chemical reaction kinetics. The former aiming at comprehending, in a semi-quantitative way, the volume-conservative (pressure-drifted) grain-growth process which we propose to adopt for hydrogel systems at quite low temperature (near a gel point). It has been noticed, that by identifying the mean cluster size $<k>$ from the Smoluchowski CCA description with the mean cluster radius' size $R_D$, from the NGG approach of proximate grains, one is able to embark on equivalence of both frameworks, but only under certain conditions. For great enough, close-packed clusters, the equivalence can be obtained by rearranging the time domain with rescaled time variable, where the scaling function originates from the dispersive (long-tail, or fractal) kinetics, with a single exponent equal to $d+1$ (in $d$-dimensional (Euclidean) space). This can be of interest for experimenters, working in the field of thermoresponsive gels formation, where crystalline structural predispositions overwhelm.
△ Less
Submitted 27 June, 2018;
originally announced July 2018.
-
SME investment best strategies. Outliers for assessing how to optimize performance
Authors:
Marcel Ausloos,
Roy Cerqueti,
Francesca Bartolacci,
Nicola G. Castellano
Abstract:
Any research on strategies for reaching business excellence aims at revealing the appropriate course of actions any executive should consider. Thus, discussions take place on how effective a performance measurement system can be estimated, or/and validated. Can one find an adequate measure (i) on the performance result due to whatever level of investment, and (ii) on the timing of such investments…
▽ More
Any research on strategies for reaching business excellence aims at revealing the appropriate course of actions any executive should consider. Thus, discussions take place on how effective a performance measurement system can be estimated, or/and validated. Can one find an adequate measure (i) on the performance result due to whatever level of investment, and (ii) on the timing of such investments? We argue that extreme value statistics provide the answer. We demonstrate that the level and timing of investments allow to be forecasting small and medium size enterprises (SME) performance, - at financial crisis times. The "investment level" is taken as the yearly total tangible asset (TTA). The financial/economic performance indicators defining growth are the sales or total assets variations; profitability is defined from returns on investments or returns on sales. Companies on the Italian Stock Exchange STAR Market serve as example. It is found from the distributions extreme values that outlier companies (with positive performance) are those with the lowest but growing TTA. In contrast, the SME with low TTA, but which did not increase its TTA, before the crisis, became a negative outlier. The outcome of these statistical findings should suggest strategies to SME board members.
△ Less
Submitted 13 June, 2018;
originally announced July 2018.
-
Investigating the configurations in cross-shareholding: a joint copula-entropy approach
Authors:
Roy Cerqueti,
Giulia Rotundo,
Marcel Ausloos
Abstract:
--- the companies populating a Stock market, along with their connections, can be effectively modeled through a directed network, where the nodes represent the companies, and the links indicate the ownership. This paper deals with this theme and discusses the concentration of a market. A cross-shareholding matrix is considered, along with two key factors: the node out-degree distribution which rep…
▽ More
--- the companies populating a Stock market, along with their connections, can be effectively modeled through a directed network, where the nodes represent the companies, and the links indicate the ownership. This paper deals with this theme and discusses the concentration of a market. A cross-shareholding matrix is considered, along with two key factors: the node out-degree distribution which represents the diversification of investments in terms of the number of involved companies, and the node in-degree distribution which reports the integration of a company due to the sales of its own shares to other companies. While diversification is widely explored in the literature, integration is most present in literature on contagions. This paper captures such quantities of interest in the two frameworks and studies the stochastic dependence of diversification and integration through a copula approach. We adopt entropies as measures for assessing the concentration in the market. The main question is to assess the dependence structure leading to a better description of the data or to market polarization (minimal entropy) or market fairness (maximal entropy). In so doing, we derive information on the way in which the in- and out-degrees should be connected in order to shape the market. The question is of interest to regulators bodies, as witnessed by specific alert threshold published on the US mergers guidelines for limiting the possibility of acquisitions and the prevalence of a single company on the market. Indeed, all countries and the EU have also rules or guidelines in order to limit concentrations, in a country or across borders, respectively. The calibration of copulas and model parameters on the basis of real data serves as an illustrative application of the theoretical proposal.
△ Less
Submitted 14 June, 2018;
originally announced July 2018.
-
Intriguing yet simple skewness - kurtosis relation in economic and demographic data distributions; pointing to preferential attachment processes
Authors:
Marcel Ausloos,
Roy Cerqueti
Abstract:
In this paper, we propose that relations between high order moments of data distributions, for example between the skewness (S) and kurtosis (K), allow to point to theoretical models with understandable structural parameters. The illustrative data concerns two cases: (i) the distribution of income taxes and (ii) that of inhabitants, after aggregation over each city in each province of Italy in 201…
▽ More
In this paper, we propose that relations between high order moments of data distributions, for example between the skewness (S) and kurtosis (K), allow to point to theoretical models with understandable structural parameters. The illustrative data concerns two cases: (i) the distribution of income taxes and (ii) that of inhabitants, after aggregation over each city in each province of Italy in 2011. Moreover, from the rank-size relationship, for either S or K, in both cases, it is shown that one obtains the parameters of the underlying (hypothetical) modeling distribution: in the present cases, the 2-parameter Beta function, - itself related to the Yule-Simon distribution function, whence suggesting a growth model based on the preferential attachment process.
△ Less
Submitted 18 July, 2018;
originally announced July 2018.
-
Data on the annual aggregated income taxes of the Italian municipalities over the quinquennium 2007-2011
Authors:
Marcel Ausloos,
Roy Cerqueti,
Tariq A. Mir
Abstract:
This dataset contains the annual aggregated income taxes of all the Italian municipalities over the years 2007-2011. Data are clustered over the Italian regions and provinces. The source of the data is the Italian Ministry of Economics and Finance. The administrative variations in Italy over the quinquennium have been taken into account. Data are useful to understand the economic structure of Ital…
▽ More
This dataset contains the annual aggregated income taxes of all the Italian municipalities over the years 2007-2011. Data are clustered over the Italian regions and provinces. The source of the data is the Italian Ministry of Economics and Finance. The administrative variations in Italy over the quinquennium have been taken into account. Data are useful to understand the economic structure of Italy at the microscopic level of municipalities. They can serve also for making comparisons between economical aspects and other features of the Italian cities.
△ Less
Submitted 16 June, 2018;
originally announced June 2018.
-
Dynamical phase diagrams of a love capacity constrained prey-predator model
Authors:
P. Toranj Simin,
G. R. Jafari,
M. Ausloos,
C. F. Caiafa,
F. Caram,
A. Sonubi,
A. Arcagni,
S. Stefani
Abstract:
One interesting question in love relationships is: finally, what and when is the end of this love relationship? Using a prey-predator Verhulst-Lotka-Volterra (VLV) model we imply cooperation and competition tendency between people in order to describe a "love dilemma game". We select the most simple but immediately most complex case for studying the set of nonlinear differential equations, i.e. th…
▽ More
One interesting question in love relationships is: finally, what and when is the end of this love relationship? Using a prey-predator Verhulst-Lotka-Volterra (VLV) model we imply cooperation and competition tendency between people in order to describe a "love dilemma game". We select the most simple but immediately most complex case for studying the set of nonlinear differential equations, i.e. that implying three persons, being at the same time prey and predator. We describe four different scenarios in such a love game containing either a one-way love or a love triangle. Our results show that it is hard to love more than one person simultaneously. Moreover, to love several people simultaneously is an unstable state. We find some condition in which persons tend to have a friendly relationship and love someone in spite of their antagonistic interaction. We demonstrate the dynamics by displaying flow diagrams.
△ Less
Submitted 14 June, 2018;
originally announced June 2018.
-
Intriguing behavior when testing the impact of quotation marks usage in Google search results
Authors:
Bogdan Vasile Ileanu,
Marcel Ausloos,
Claudiu Herteliu,
Marian Pompiliu Cristescu
Abstract:
Internet research on search engine quality and validity of results demand much concern. Thus, the focus in our study has been to measure the impact of quotation marks usage on the internet search outputs in terms of google search outcomes distributions, through Benford Law. The current paper is focused on applying a Benford Law analysis on two related types of internet searches distinguished by th…
▽ More
Internet research on search engine quality and validity of results demand much concern. Thus, the focus in our study has been to measure the impact of quotation marks usage on the internet search outputs in terms of google search outcomes distributions, through Benford Law. The current paper is focused on applying a Benford Law analysis on two related types of internet searches distinguished by the usage or absence of quotation marks. Both search results values are assumed as variables. We found that the first digit of outcomes does not follow the Benford Law first digit of numbers in the case of searching text without quotation marks. Unexpectedly, the Benford Law is obeyed when quotation marks are used, even if the variability of search outcomes is considerably reduced. By studying outputs demonstrating influences of (apparently at first) "details", in using a search engine, the authors are able to further warn the users concerning the validity of such outputs.
△ Less
Submitted 21 May, 2018;
originally announced May 2018.
-
Artificial intelligence in peer review: How can evolutionary computation support journal editors?
Authors:
Maciej J. Mrowinski,
Piotr Fronczak,
Agata Fronczak,
Marcel Ausloos,
Olgica Nedic
Abstract:
With the volume of manuscripts submitted for publication growing every year, the deficiencies of peer review (e.g. long review times) are becoming more apparent. Editorial strategies, sets of guidelines designed to speed up the process and reduce editors workloads, are treated as trade secrets by publishing houses and are not shared publicly. To improve the effectiveness of their strategies, edito…
▽ More
With the volume of manuscripts submitted for publication growing every year, the deficiencies of peer review (e.g. long review times) are becoming more apparent. Editorial strategies, sets of guidelines designed to speed up the process and reduce editors workloads, are treated as trade secrets by publishing houses and are not shared publicly. To improve the effectiveness of their strategies, editors in small publishing groups are faced with undertaking an iterative trial-and-error approach. We show that Cartesian Genetic Programming, a nature-inspired evolutionary algorithm, can dramatically improve editorial strategies. The artificially evolved strategy reduced the duration of the peer review process by 30%, without increasing the pool of reviewers (in comparison to a typical human-developed strategy). Evolutionary computation has typically been used in technological processes or biological ecosystems. Our results demonstrate that genetic programs can improve real-world social systems that are usually much harder to understand and control than physical systems.
△ Less
Submitted 2 December, 2017;
originally announced December 2017.
-
An Inverse Problem Study: Credit Risk Ratings as a Determinant of Corporate Governance and Capital Structure in Emerging Markets: Evidence from Chinese Listed Companies
Authors:
ManYing Kang,
Marcel Ausloos
Abstract:
Credit risk rating is shown to be a relevant determinant in order to estimate good corporate governance and to self-optimize capital structure. The conclusion is argued from a study on a selected (and justified) sample of (182) companies listed on the Shanghai Stock Exchange and the Shenzhen Stock Exchange and which use the same Shanghai Brilliance Credit Rating & Investors Service Company assessm…
▽ More
Credit risk rating is shown to be a relevant determinant in order to estimate good corporate governance and to self-optimize capital structure. The conclusion is argued from a study on a selected (and justified) sample of (182) companies listed on the Shanghai Stock Exchange and the Shenzhen Stock Exchange and which use the same Shanghai Brilliance Credit Rating & Investors Service Company assessment criteria, for their credit ratings, from 2010 to 2015. Practically, 3 debt ratios are examined in terms of 11 characteristic variables. Moreover, any relationship between credit rating and corporate governance can be thought to be an interesting finding. The relationship between credit rating and leverage is not as evident as that found by other researchers from different countries; it is significantly positively related to the outside director, firm size, tangible assets and firm age, and CEO and chairman office plurality. However, leverage is found to be negatively correlated with board size, profitability, growth opportunity, and non-debt tax shield. Credit rating is positively associated with leverage, but in a less significant way. CEO-Board chairship duality is insignificantly related to leverage. The non-debt tax shield is significantly correlated with leverage. The correlation coefficient between CEO duality and auditor is positive but weakly significant, but seems not consistent with expectations. Finally, profitability cause could be regarded as an interesting finding. Indeed, there is an inverse correlation between profitability and total debt (Notice that the result supports the pecking order theory). In conclusion, it appears that credit rating has less effect on the so listed large Chinese companies than in other countries. Nevertheless, the perspective of assessing credit risk rating by relevant agencies is indubitably a recommended time dependent leverage determinant.
△ Less
Submitted 2 December, 2017;
originally announced December 2017.
-
Benford's law first significant digit and distribution distances for testing the reliability of financial reports in develo** countries
Authors:
**g Shi,
Marcel Ausloos,
Tingting Zhu
Abstract:
We discuss a common suspicion about reported financial data, in 10 industrial sectors of the 6 so called "main develo** countries" over the time interval [2000-2014]. These data are examined through Benford's law first significant digit and through distribution distances tests. It is shown that several visually anomalous data have to be a priori removed. Thereafter, the distributions much better…
▽ More
We discuss a common suspicion about reported financial data, in 10 industrial sectors of the 6 so called "main develo** countries" over the time interval [2000-2014]. These data are examined through Benford's law first significant digit and through distribution distances tests. It is shown that several visually anomalous data have to be a priori removed. Thereafter, the distributions much better follow the first digit significant law, indicating the usefulness of a Benford's law test from the research starting line. The same holds true for distance tests. A few outliers are pointed out.
△ Less
Submitted 30 November, 2017;
originally announced December 2017.
-
Hint of a Universal Law for the Financial Gains of Competitive Sport Teams. The case of Tour de France cycle race
Authors:
Marcel Ausloos
Abstract:
This short note is intended as a "Letter to the Editor" Perspective in order that it serves as a contribution, in view of reaching the physics community caring about rare events and scaling laws and unexpected findings, on a domain of wide interest: sport and money. It is apparent from the data reported and discussed below that the scarcity of such data does not allow to recommend a complex elabor…
▽ More
This short note is intended as a "Letter to the Editor" Perspective in order that it serves as a contribution, in view of reaching the physics community caring about rare events and scaling laws and unexpected findings, on a domain of wide interest: sport and money. It is apparent from the data reported and discussed below that the scarcity of such data does not allow to recommend a complex elaboration of an agent based model, - at this time. In some sense, this also means that much data on sport activities is not necessarily given in terms of physics prone materials, but it could be, and would then attract much attention. Nevertheless the findings tie the data to well known scaling laws and physics processes. It is found that a simple scaling law describes the gains of teams in recent bicycle races, like the Tour de France. An analogous case, ranking teams in Formula 1 races, is shown in an Appendix
△ Less
Submitted 30 November, 2017;
originally announced December 2017.
-
Fractional Dynamics of Network Growth Constrained by aging Node Interactions
Authors:
Hadiseh Safdari,
Milad Zare Kamali,
Amirhossein Shirazi,
Moein Khalighi,
Gholamreza Jafari,
Marcel Ausloos
Abstract:
In many social complex systems, in which agents are linked by non-linear interactions, the history of events strongly influences the whole network dynamics. However, a class of "commonly accepted beliefs" seems rarely studied. In this paper, we examine how the growth process of a (social) network is influenced by past circumstances. In order to tackle this cause, we simply modify the well known pr…
▽ More
In many social complex systems, in which agents are linked by non-linear interactions, the history of events strongly influences the whole network dynamics. However, a class of "commonly accepted beliefs" seems rarely studied. In this paper, we examine how the growth process of a (social) network is influenced by past circumstances. In order to tackle this cause, we simply modify the well known preferential attachment mechanism by imposing a time dependent kernel function in the network evolution equation. This approach leads to a fractional order Barabasi-Albert (BA) differential equation, generalizing the BA model. Our results show that, with passing time, an aging process is observed for the network dynamics. The aging process leads to a decay for the node degree values, thereby creating an opposing process to the preferential attachment mechanism. On one hand, based on the preferential attachment mechanism, nodes with a high degree are more likely to absorb links; but, on the other hand, a node's age has a reduced chance for new connections. This competitive scenario allows an increased chance for younger members to become a hub. Simulations of such a network growth with aging constraint confirm the results found from solving the fractional BA equation. We also report, as an exemplary application, an investigation of the collaboration network between Hollywood movie actors. It is undubiously shown that a decay in the dynamics of their collaboration rate is found, - even including a sex difference. Such findings suggest a widely universal application of the so generalized BA model.
△ Less
Submitted 9 September, 2017;
originally announced September 2017.
-
Glassy states of aging social networks
Authors:
F. Hassanibesheli,
L. Hedayatifar,
H. Safdari,
M. Ausloos,
G. R. Jafari
Abstract:
Individuals often develop reluctance to change their social relations, called "secondary homebody", even though their interactions with their environment evolve with time. Some memory effect is loosely present deforcing changes. In other words, in presence of memory, relations do not change easily. In order to investigate some history or memory effect on social networks, we introduce a temporal ke…
▽ More
Individuals often develop reluctance to change their social relations, called "secondary homebody", even though their interactions with their environment evolve with time. Some memory effect is loosely present deforcing changes. In other words, in presence of memory, relations do not change easily. In order to investigate some history or memory effect on social networks, we introduce a temporal kernel function into the Heider conventional balance theory, allowing for the "quality" of past relations to contribute to the evolution of the system. This memory effect is shown to lead to the emergence of aged networks, thereby perfectly describing and the more so measuring the aging process of links ("social relations"). It is shown that such a memory does not change the dynamical attractors of the system, but does prolong the time necessary to reach the "balanced states". The general trend goes toward obtaining either global ("paradise" or "bipolar") or local ("jammed") balanced states, but is profoundly affected by aged relations. The resistance of elder links against changes decelerates the evolution of the system and traps it into so named glassy states. In contrast to balance
△ Less
Submitted 9 September, 2017;
originally announced September 2017.
-
Data science for assessing possible tax income manipulation: The case of Italy
Authors:
Marcel Ausloos,
Roy Cerqueti,
Tariq A. Mir
Abstract:
This paper explores a real-world fundamental theme under a data science perspective. It specifically discusses whether fraud or manipulation can be observed in and from municipality income tax size distributions, through their aggregation from citizen fiscal reports. The study case pertains to official data obtained from the Italian Ministry of Economics and Finance over the period 2007-2011. All…
▽ More
This paper explores a real-world fundamental theme under a data science perspective. It specifically discusses whether fraud or manipulation can be observed in and from municipality income tax size distributions, through their aggregation from citizen fiscal reports. The study case pertains to official data obtained from the Italian Ministry of Economics and Finance over the period 2007-2011. All Italian (20) regions are considered. The considered data science approach concretizes in the adoption of the Benford first digit law as quantitative tool. Marked disparities are found, - for several regions, leading to unexpected "conclusions". The most eye browsing regions are not the expected ones according to classical imagination about Italy financial shadow matters.
△ Less
Submitted 7 September, 2017;
originally announced September 2017.