-
Innovation and Word Usage Patterns in Machine Learning
Authors:
Vítor Bandeira Borges,
Daniel Oliveira Cajueiro
Abstract:
In this study, we delve into the dynamic landscape of machine learning research evolution. Initially, through the utilization of Latent Dirichlet Allocation, we discern pivotal themes and fundamental concepts that have emerged within the realm of machine learning. Subsequently, we undertake a comprehensive analysis to track the evolutionary trajectories of these identified themes. To quantify the…
▽ More
In this study, we delve into the dynamic landscape of machine learning research evolution. Initially, through the utilization of Latent Dirichlet Allocation, we discern pivotal themes and fundamental concepts that have emerged within the realm of machine learning. Subsequently, we undertake a comprehensive analysis to track the evolutionary trajectories of these identified themes. To quantify the novelty and divergence of research contributions, we employ the Kullback-Leibler Divergence metric. This statistical measure serves as a proxy for ``surprise'', indicating the extent of differentiation between the content of academic papers and the subsequent developments in research. By amalgamating these insights, we gain the ability to ascertain the pivotal roles played by prominent researchers and the significance of specific academic venues (periodicals and conferences) within the machine learning domain.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Questions of science: chatting with ChatGPT about complex systems
Authors:
Nuno Crokidakis,
Marcio Argollo de Menezes,
Daniel O. Cajueiro
Abstract:
We present an overview of the complex systems field using ChatGPT as a representation of the community's understanding. ChatGPT has learned language patterns and styles from a large dataset of internet texts, allowing it to provide answers that reflect common opinions, ideas, and language patterns found in the community. Our exploration covers both teaching and learning, and research topics. We re…
▽ More
We present an overview of the complex systems field using ChatGPT as a representation of the community's understanding. ChatGPT has learned language patterns and styles from a large dataset of internet texts, allowing it to provide answers that reflect common opinions, ideas, and language patterns found in the community. Our exploration covers both teaching and learning, and research topics. We recognize the value of ChatGPT as a source for the community's ideas.
△ Less
Submitted 29 March, 2023;
originally announced March 2023.
-
A comprehensive review of automatic text summarization techniques: method, data, evaluation and coding
Authors:
Daniel O. Cajueiro,
Arthur G. Nery,
Igor Tavares,
Maísa K. De Melo,
Silvia A. dos Reis,
Li Weigang,
Victor R. R. Celestino
Abstract:
We provide a literature review about Automatic Text Summarization (ATS) systems. We consider a citation-based approach. We start with some popular and well-known papers that we have in hand about each topic we want to cover and we have tracked the "backward citations" (papers that are cited by the set of papers we knew beforehand) and the "forward citations" (newer papers that cite the set of pape…
▽ More
We provide a literature review about Automatic Text Summarization (ATS) systems. We consider a citation-based approach. We start with some popular and well-known papers that we have in hand about each topic we want to cover and we have tracked the "backward citations" (papers that are cited by the set of papers we knew beforehand) and the "forward citations" (newer papers that cite the set of papers we knew beforehand). In order to organize the different methods, we present the diverse approaches to ATS guided by the mechanisms they use to generate a summary. Besides presenting the methods, we also present an extensive review of the datasets available for summarization tasks and the methods used to evaluate the quality of the summaries. Finally, we present an empirical exploration of these methods using the CNN Corpus dataset that provides golden summaries for extractive and abstractive methods.
△ Less
Submitted 3 October, 2023; v1 submitted 4 January, 2023;
originally announced January 2023.
-
Controlling self-organized criticality in complex networks
Authors:
Daniel O. Cajueiro,
Roberto F. S. Andrade
Abstract:
A control scheme to reduce the size of avalanches of the Bak-Tang-Wiesenfeld model on complex networks is proposed. Three network types are considered: those proposed by Erdős-Renyi, Goh-Kahng-Kim, and a real network representing the main connections of the electrical power grid of the western United States. The control scheme is based on the idea of triggering avalanches in the highest degree nod…
▽ More
A control scheme to reduce the size of avalanches of the Bak-Tang-Wiesenfeld model on complex networks is proposed. Three network types are considered: those proposed by Erdős-Renyi, Goh-Kahng-Kim, and a real network representing the main connections of the electrical power grid of the western United States. The control scheme is based on the idea of triggering avalanches in the highest degree nodes that are near to become critical. We show that this strategy works in the sense that the dissipation of mass occurs most locally avoiding larger avalanches. We also compare this strategy with a random strategy where the nodes are chosen randomly. Although the random control has some ability to reduce the probability of large avalanches, its performance is much worse than the one based on the choice of the highest degree nodes. Finally, we argue that the ability of the proposed control scheme is related to its ability to reduce the concentration of mass on the network.
△ Less
Submitted 28 May, 2013;
originally announced May 2013.