-
Uncertainty components in profile likelihood fits
Authors:
Andrés Pinto,
Zhibo Wu,
Fabrice Balli,
Nicolas Berger,
Maarten Boonekamp,
Émilien Chapon,
Tatsuo Kawamoto,
Bogdan Malaescu
Abstract:
When a measurement of a physical quantity is reported, the total uncertainty is usually decomposed into statistical and systematic uncertainties. This decomposition is not only useful to understand the contributions to the total uncertainty, but also required to propagate these contributions in subsequent analyses, such as combinations or interpretation fits including results from other measuremen…
▽ More
When a measurement of a physical quantity is reported, the total uncertainty is usually decomposed into statistical and systematic uncertainties. This decomposition is not only useful to understand the contributions to the total uncertainty, but also required to propagate these contributions in subsequent analyses, such as combinations or interpretation fits including results from other measurements or experiments. In profile-likelihood fits, widely applied in high-energy physics analyses, contributions of systematic uncertainties are routinely quantified using "impacts", which are not adequate for such applications. We discuss the difference between impacts and actual uncertainty components, and establish methods to determine the latter in a wide range of statistical models.
△ Less
Submitted 14 March, 2024; v1 submitted 8 July, 2023;
originally announced July 2023.
-
Entropy of microcanonical finite-graph ensembles
Authors:
Tatsuro Kawamoto
Abstract:
The entropy of random graph ensembles has gained widespread attention in the field of graph theory and network science. We consider microcanonical ensembles of simple graphs with prescribed degree sequences. We demonstrate that the mean-field approximations of the generating function using the Chebyshev-Hermite polynomials provide estimates for the entropy of finite-graph ensembles. Our estimate r…
▽ More
The entropy of random graph ensembles has gained widespread attention in the field of graph theory and network science. We consider microcanonical ensembles of simple graphs with prescribed degree sequences. We demonstrate that the mean-field approximations of the generating function using the Chebyshev-Hermite polynomials provide estimates for the entropy of finite-graph ensembles. Our estimate reproduces the Bender-Canfield formula in the limit of large graphs.
△ Less
Submitted 25 August, 2023; v1 submitted 18 May, 2023;
originally announced May 2023.
-
Finding community structure using the ordered random graph model
Authors:
Masaki Ochi,
Tatsuro Kawamoto
Abstract:
Visualization of the adjacency matrix enables us to capture macroscopic features of a network when the matrix elements are aligned properly. Community structure, a network consisting of several densely connected components, is a particularly important feature, and the structure can be identified through the adjacency matrix when it is close to a block-diagonal form. However, classical ordering alg…
▽ More
Visualization of the adjacency matrix enables us to capture macroscopic features of a network when the matrix elements are aligned properly. Community structure, a network consisting of several densely connected components, is a particularly important feature, and the structure can be identified through the adjacency matrix when it is close to a block-diagonal form. However, classical ordering algorithms for matrices fail to align matrix elements such that the community structure is visible. In this study, we propose an ordering algorithm based on the maximum-likelihood estimate of the ordered random graph model. We show that the proposed method allows us to more clearly identify community structures than the existing ordering algorithms.
△ Less
Submitted 10 July, 2023; v1 submitted 17 October, 2022;
originally announced October 2022.
-
Consistency between ordering and clustering methods for graphs
Authors:
Tatsuro Kawamoto,
Masaki Ochi,
Teruyoshi Kobayashi
Abstract:
A relational dataset is often analyzed by optimally assigning a label to each element through clustering or ordering. While similar characterizations of a dataset would be achieved by both clustering and ordering methods, the former has been studied much more actively than the latter, particularly for the data represented as graphs. This study fills this gap by investigating methodological relatio…
▽ More
A relational dataset is often analyzed by optimally assigning a label to each element through clustering or ordering. While similar characterizations of a dataset would be achieved by both clustering and ordering methods, the former has been studied much more actively than the latter, particularly for the data represented as graphs. This study fills this gap by investigating methodological relationships between several clustering and ordering methods, focusing on spectral techniques. Furthermore, we evaluate the resulting performance of the clustering and ordering methods. To this end, we propose a measure called the label continuity error, which generically quantifies the degree of consistency between a sequence and partition for a set of elements. Based on synthetic and real-world datasets, we evaluate the extents to which an ordering method identifies a module structure and a clustering method identifies a banded structure.
△ Less
Submitted 7 April, 2023; v1 submitted 27 August, 2022;
originally announced August 2022.
-
Single-trajectory map equation
Authors:
Tatsuro Kawamoto
Abstract:
Community detection, the process of identifying module structures in complex systems represented on networks, is an effective tool in various fields of science. The map equation, which is an information-theoretic framework based on the random walk on a network, is a particularly popular community detection method. Despite its outstanding performance in many applications, the inner workings of the…
▽ More
Community detection, the process of identifying module structures in complex systems represented on networks, is an effective tool in various fields of science. The map equation, which is an information-theoretic framework based on the random walk on a network, is a particularly popular community detection method. Despite its outstanding performance in many applications, the inner workings of the map equation have not been thoroughly studied. Herein, we revisit the original formulation of the map equation and address the existence of its ``raw form,'' which we refer to as the single-trajectory map equation. This raw form sheds light on many details behind the principle of the map equation that are hidden in the steady-state limit of the random walk. Most importantly, the single-trajectory map equation provides a more balanced community structure, naturally reducing the tendency of the overfitting phenomenon in the map equation.
△ Less
Submitted 23 April, 2023; v1 submitted 8 March, 2022;
originally announced March 2022.
-
Contribution of directedness in graph spectra
Authors:
Masaki Ochi,
Tatsuro Kawamoto
Abstract:
In graph analyses, directed edges are often approximated to undirected ones so that the adjacency matrices may be symmetric. However, such simplification has not been thoroughly verified. In this study, we investigate how directedness affects the graph spectra by introducing random directization, which is an opposite operation of neglecting edge directions. We analytically reveal that uniformly ra…
▽ More
In graph analyses, directed edges are often approximated to undirected ones so that the adjacency matrices may be symmetric. However, such simplification has not been thoroughly verified. In this study, we investigate how directedness affects the graph spectra by introducing random directization, which is an opposite operation of neglecting edge directions. We analytically reveal that uniformly random directization typically conserves the relative spectral structure of the adjacency matrix in the perturbative regime. The result of random directization implies that the spectrum of the adjacency matrix can be conserved after the directedness is ignored.
△ Less
Submitted 18 August, 2022; v1 submitted 8 February, 2022;
originally announced February 2022.
-
Transverse Oscillating Bubble Enhanced Laser-driven Betatron X-ray Radiation Generation
Authors:
Rafal Rakowski,
** Zhang,
Kyle Jensen,
Brendan Kettle,
Tim Kawamoto,
Sudeep Banerjee,
Colton Fruhling,
Grigory Golovin,
Daniel Haden,
Matthew S. Robinson,
Donald Umstadter,
B. A. Shadwick,
Matthias Fuchs
Abstract:
Ultrafast high-brightness X-ray pulses have proven invaluable for a broad range of research. Such pulses are typically generated via synchrotron emission from relativistic electron bunches using large-scale facilities. Recently, significantly more compact X-ray sources based on laser-wakefield accelerated (LWFA) electron beams have been demonstrated. In particular, laser-driven sources, where the…
▽ More
Ultrafast high-brightness X-ray pulses have proven invaluable for a broad range of research. Such pulses are typically generated via synchrotron emission from relativistic electron bunches using large-scale facilities. Recently, significantly more compact X-ray sources based on laser-wakefield accelerated (LWFA) electron beams have been demonstrated. In particular, laser-driven sources, where the radiation is generated by transverse oscillations of electrons within the plasma accelerator structure (so-called betatron oscillations) can generate highly-brilliant ultrashort X-ray pulses using a comparably simple setup. Here, we experimentally demonstrate a method to markedly enhance and control the parameters of LWFA-driven betatron X-ray emission. With our novel Transverse Oscillating Bubble Enhanced Betatron Radiation (TOBER) scheme, we show a significant increase in the number of generated photons by specifically manipulating the amplitude of the betatron oscillations. We realize this through an orchestrated evolution of the temporal laser pulse shape and the accelerating plasma structure. This leads to controlled off-axis injection of electrons that perform large-amplitude collective transverse betatron oscillations, resulting in increased radiation emission. Our concept holds the promise for a method to optimize the X-ray parameters for specific applications, such as time-resolved investigations with spatial and temporal atomic resolution or advanced high-resolution imaging modalities, and the generation of X-ray beams with even higher peak and average brightness.
△ Less
Submitted 2 February, 2022;
originally announced February 2022.
-
Sequential locality of graphs and its hypothesis testing
Authors:
Tatsuro Kawamoto,
Teruyoshi Kobayashi
Abstract:
The adjacency matrix is the most fundamental and intuitive object in graph analysis that is useful not only mathematically but also for visualizing the structures of graphs. Because the appearance of an adjacency matrix is critically affected by the ordering of rows and columns, or vertex ordering, statistical assessment of graphs together with their vertex sequences is important in identifying th…
▽ More
The adjacency matrix is the most fundamental and intuitive object in graph analysis that is useful not only mathematically but also for visualizing the structures of graphs. Because the appearance of an adjacency matrix is critically affected by the ordering of rows and columns, or vertex ordering, statistical assessment of graphs together with their vertex sequences is important in identifying the characteristic structures of graphs. In this paper, we propose a hypothesis testing framework that assesses how locally vertices are connected to each other along a specified vertex sequence, which provides a statistical foundation for an optimization problem called envelope reduction or minimum linear arrangement. The proposed tests are particularly suitable for moderately small data and formulated based on a combinatorial approach and a block model with intrinsic vertex ordering.
△ Less
Submitted 6 April, 2023; v1 submitted 22 November, 2021;
originally announced November 2021.
-
The large inner Micromegas modules for the Atlas Muon Spectrometer Upgrade: construction, quality control and characterization
Authors:
J. Allard,
M. Anfreville,
N. Andari,
D. Attié,
S. Aune,
H. Bachacou,
F. Balli,
F. Bauer,
J. Bennet,
T. Benoit,
J. Beltramelli,
H. Bervas,
T. Bey,
S. Bouaziz,
M. Boyer,
T. Challey,
T. Chevalérias,
X. Copollani,
J. Costa,
G. Cara,
G. Decock,
F. Deliot,
D. Denysiuk,
D. Desforge,
G. Disset
, et al. (49 additional authors not shown)
Abstract:
The steadily increasing luminosity of the LHC requires an upgrade with high-rate and high-resolution detector technology for the inner end cap of the ATLAS muon spectrometer: the New Small Wheels (NSW). In order to achieve the goal of precision tracking at a hit rate of about 15 kHz/cm$^2$ at the inner radius of the NSW, large area Micromegas quadruplets with 100\,\microns spatial resolution per p…
▽ More
The steadily increasing luminosity of the LHC requires an upgrade with high-rate and high-resolution detector technology for the inner end cap of the ATLAS muon spectrometer: the New Small Wheels (NSW). In order to achieve the goal of precision tracking at a hit rate of about 15 kHz/cm$^2$ at the inner radius of the NSW, large area Micromegas quadruplets with 100\,\microns spatial resolution per plane have been produced. % IRFU, from the CEA research center of Saclay, is responsible for the production and validation of LM1 Micromegas modules. The construction, production, qualification and validation of the largest Micromegas detectors ever built are reported here. Performance results under cosmic muon characterisation will also be discussed.
△ Less
Submitted 28 May, 2021;
originally announced May 2021.
-
Graph-based open-ended survey on concerns related to COVID-19
Authors:
Tatsuro Kawamoto,
Takaaki Aoki,
Michiko Ueda
Abstract:
The COVID-19 pandemic is an unprecedented public health crisis with broad social and economic consequences. We conducted four surveys between April and August 2020 using the graph-based open-ended survey (GOS) framework, and investigated the most pressing concerns and issues for the general public in Japan. The GOS framework is a hybrid of the two traditional survey frameworks that allows responde…
▽ More
The COVID-19 pandemic is an unprecedented public health crisis with broad social and economic consequences. We conducted four surveys between April and August 2020 using the graph-based open-ended survey (GOS) framework, and investigated the most pressing concerns and issues for the general public in Japan. The GOS framework is a hybrid of the two traditional survey frameworks that allows respondents to post their opinions in a free-format style, which can subsequently serve as one of the choice items for other respondents, just as in a multiple-choice survey. As a result, this framework generates an opinion graph that relates opinions and respondents. We can also construct annotated opinion graphs to achieve a higher resolution. By clustering the annotated opinion graphs, we revealed the characteristic evolution of the response patterns as well as the interconnectedness and multi-faceted nature of opinions. Substantively, our notable finding is that "social pressure," not "infection risk," was one of the major concerns of our respondents. Social pressure refers to criticism and discrimination that they anticipate receiving from others should they contract COVID-19. It is possible that the collectivist nature of Japanese culture coupled with the government's policy of relying on personal responsibility to combat COVID-19 explains some of the above findings, as the latter has led to the emergence of vigilantes. The presence of mutual surveillance can contribute to growing skepticism toward others as well as fear of ostracism, which may have negative consequences at both the societal and individual levels.
△ Less
Submitted 22 December, 2021; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Identifying macroscopic features in foreign visitor travel pathways
Authors:
Tatsuro Kawamoto,
Ryutaro Hashimoto
Abstract:
Human travel patterns are commonly studied as networks in which the points of departure and destination are encoded as nodes and the travel frequency between two points is recorded as a weighted edge. However, because travelers often visit multiple destinations, which constitute pathways, an analysis incorporating pathway statistics is expected to be more informative over an approach based solely…
▽ More
Human travel patterns are commonly studied as networks in which the points of departure and destination are encoded as nodes and the travel frequency between two points is recorded as a weighted edge. However, because travelers often visit multiple destinations, which constitute pathways, an analysis incorporating pathway statistics is expected to be more informative over an approach based solely on pairwise frequencies. Hence, in this study, we apply a higher-order network representation framework to identify characteristic travel patterns from foreign visitor pathways in Japan. We expect that the results herein are mainly useful for marketing research in the tourism industry.
△ Less
Submitted 9 November, 2020;
originally announced November 2020.
-
Spectral clustering of annotated graphs using a factor graph representation
Authors:
Tatsuro Kawamoto
Abstract:
Graph-structured data commonly have node annotations. A popular approach for inference and learning involving annotated graphs is to incorporate annotations into a statistical model or algorithm. By contrast, we consider a more direct method named scotch-ta**, in which the structural information in a graph and its node annotations are encoded as a factor graph. Specifically, we establish the mat…
▽ More
Graph-structured data commonly have node annotations. A popular approach for inference and learning involving annotated graphs is to incorporate annotations into a statistical model or algorithm. By contrast, we consider a more direct method named scotch-ta**, in which the structural information in a graph and its node annotations are encoded as a factor graph. Specifically, we establish the mathematical basis of this method in the spectral framework.
△ Less
Submitted 6 October, 2020;
originally announced October 2020.
-
Fragility of spectral clustering for networks with an overlap** structure
Authors:
Chihiro Noguchi,
Tatsuro Kawamoto
Abstract:
Communities commonly overlap in real-world networks. This is a motivation to develop overlap** community detection methods, because methods for non-overlap** communities may not perform well. However, deterioration mechanism of the detection methods used for non-overlap** communities have rarely been investigated theoretically. Here, we analyze an accuracy of spectral clustering, which does…
▽ More
Communities commonly overlap in real-world networks. This is a motivation to develop overlap** community detection methods, because methods for non-overlap** communities may not perform well. However, deterioration mechanism of the detection methods used for non-overlap** communities have rarely been investigated theoretically. Here, we analyze an accuracy of spectral clustering, which does not consider overlap** structures, by using the replica method from statistical physics. Our analysis on an overlap** stochastic block model reveals how the structural information is lost from the leading eigenvector because of the overlap** structure.
△ Less
Submitted 25 October, 2020; v1 submitted 5 March, 2020;
originally announced March 2020.
-
Democratic summary of public opinions in free-response surveys
Authors:
Tatsuro Kawamoto,
Takaaki Aoki
Abstract:
Social surveys have been widely used as a method of obtaining public opinion. Sometimes it is more ideal to collect opinions by presenting questions in free-response formats than in multiple-choice formats. Despite their advantages, free-response questions are rarely used in practice because they usually require manual analysis. Therefore, classification of free-format texts can present a formidab…
▽ More
Social surveys have been widely used as a method of obtaining public opinion. Sometimes it is more ideal to collect opinions by presenting questions in free-response formats than in multiple-choice formats. Despite their advantages, free-response questions are rarely used in practice because they usually require manual analysis. Therefore, classification of free-format texts can present a formidable task in large-scale surveys and can be influenced by the interpretations of analysts. In this study, we propose a network-based survey framework in which responses are automatically classified in a statistically principled manner. This can be achieved because in addition to the texts, similarities among responses are also assessed by each respondent. We demonstrate our approach using a poll on the 2016 US presidential election and a survey taken by graduates of a particular university. The proposed approach helps analysts interpret the underlying semantics of responses in large-scale surveys.
△ Less
Submitted 8 July, 2020; v1 submitted 9 July, 2019;
originally announced July 2019.
-
Evaluating network partitions through visualization
Authors:
Chihiro Noguchi,
Tatsuro Kawamoto
Abstract:
Network clustering requires making many decisions manually, such as the number of groups and a statistical model to be used. Even after filtering using an information criterion or regularizing with a nonparametric framework, we are commonly left with multiple candidates with reasonable partitions. In the end, the user has to decide which inferred groups should be regarded as informative. Here we p…
▽ More
Network clustering requires making many decisions manually, such as the number of groups and a statistical model to be used. Even after filtering using an information criterion or regularizing with a nonparametric framework, we are commonly left with multiple candidates with reasonable partitions. In the end, the user has to decide which inferred groups should be regarded as informative. Here we propose a visualization method that efficiently represents network partitioning based on statistical inference algorithms. Our non-statistical assessment procedure based on visualization helps users extract informative groups when they cannot uniquely determine significant groups on the basis of statistical assessments. The proposed visualization is also effective for use as a benchmark test of different clustering algorithms.
△ Less
Submitted 4 June, 2019; v1 submitted 3 June, 2019;
originally announced June 2019.
-
Mean-field theory of graph neural networks in graph partitioning
Authors:
Tatsuro Kawamoto,
Masashi Tsubaki,
Tomoyuki Obuchi
Abstract:
A theoretical performance analysis of the graph neural network (GNN) is presented. For classification tasks, the neural network approach has the advantage in terms of flexibility that it can be employed in a data-driven manner, whereas Bayesian inference requires the assumption of a specific model. A fundamental question is then whether GNN has a high accuracy in addition to this flexibility. More…
▽ More
A theoretical performance analysis of the graph neural network (GNN) is presented. For classification tasks, the neural network approach has the advantage in terms of flexibility that it can be employed in a data-driven manner, whereas Bayesian inference requires the assumption of a specific model. A fundamental question is then whether GNN has a high accuracy in addition to this flexibility. Moreover, whether the achieved performance is predominately a result of the backpropagation or the architecture itself is a matter of considerable interest. To gain a better insight into these questions, a mean-field theory of a minimal GNN architecture is developed for the graph partitioning problem. This demonstrates a good agreement with numerical experiments.
△ Less
Submitted 28 October, 2018;
originally announced October 2018.
-
Counting the number of metastable states in the modularity landscape: Algorithmic detectability limit of greedy algorithms in community detection
Authors:
Tatsuro Kawamoto,
Yoshiyuki Kabashima
Abstract:
Modularity maximization using greedy algorithms continues to be a popular approach toward community detection in graphs, even after various better forming algorithms have been proposed. Apart from its clear mechanism and ease of implementation, this approach is persistently popular because, presumably, its risk of algorithmic failure is not well understood. This Rapid Communication provides insigh…
▽ More
Modularity maximization using greedy algorithms continues to be a popular approach toward community detection in graphs, even after various better forming algorithms have been proposed. Apart from its clear mechanism and ease of implementation, this approach is persistently popular because, presumably, its risk of algorithmic failure is not well understood. This Rapid Communication provides insight into this issue by estimating the algorithmic performance limit of modularity maximization. This is achieved by counting the number of metastable states under a local update rule. Our results offer a quantitative insight into the level of sparsity at which a greedy algorithm typically fails.
△ Less
Submitted 17 January, 2019; v1 submitted 23 August, 2018;
originally announced August 2018.
-
Algorithmic detectability threshold of the stochastic block model
Authors:
Tatsuro Kawamoto
Abstract:
The assumption that the values of model parameters are known or correctly learned, i.e., the Nishimori condition, is one of the requirements for the detectability analysis of the stochastic block model in statistical inference. In practice, however, there is no example demonstrating that we can know the model parameters beforehand, and there is no guarantee that the model parameters can be learned…
▽ More
The assumption that the values of model parameters are known or correctly learned, i.e., the Nishimori condition, is one of the requirements for the detectability analysis of the stochastic block model in statistical inference. In practice, however, there is no example demonstrating that we can know the model parameters beforehand, and there is no guarantee that the model parameters can be learned accurately. In this study, we consider the expectation--maximization (EM) algorithm with belief propagation (BP) and derive its algorithmic detectability threshold. Our analysis is not restricted to the community structure, but includes general modular structures. Because the algorithm cannot always learn the planted model parameters correctly, the algorithmic detectability threshold is qualitatively different from the one with the Nishimori condition.
△ Less
Submitted 7 March, 2018; v1 submitted 24 October, 2017;
originally announced October 2017.
-
Algorithmic infeasibility of community detection in higher-order networks
Authors:
Tatsuro Kawamoto
Abstract:
In principle, higher-order networks that have multiple edge types are more informative than their lower-order counterparts. In practice, however, excessively rich information may be algorithmically infeasible to extract. It requires an algorithm that assumes a high-dimensional model and such an algorithm may perform poorly or be extremely sensitive to the initial estimate of the model parameters.…
▽ More
In principle, higher-order networks that have multiple edge types are more informative than their lower-order counterparts. In practice, however, excessively rich information may be algorithmically infeasible to extract. It requires an algorithm that assumes a high-dimensional model and such an algorithm may perform poorly or be extremely sensitive to the initial estimate of the model parameters. Herein, we address this problem of community detection through a detectability analysis. We focus on the expectation-maximization (EM) algorithm with belief propagation (BP), and analytically derive its algorithmic detectability threshold, i.e., the limit of the modular structure strength below which the algorithm can no longer detect any modular structures. The results indicate the existence of a phase in which the community detection of a lower-order network outperforms its higher-order counterpart.
△ Less
Submitted 24 October, 2017;
originally announced October 2017.
-
Detectability thresholds of general modular graphs
Authors:
Tatsuro Kawamoto,
Yoshiyuki Kabashima
Abstract:
We investigate the detectability thresholds of various modular structures in the stochastic block model. Our analysis reveals how the detectability threshold is related to the details of the modular pattern, including the hierarchy of the clusters. We show that certain planted structures are impossible to infer regardless of their fuzziness.
We investigate the detectability thresholds of various modular structures in the stochastic block model. Our analysis reveals how the detectability threshold is related to the details of the modular pattern, including the hierarchy of the clusters. We show that certain planted structures are impossible to infer regardless of their fuzziness.
△ Less
Submitted 9 January, 2017; v1 submitted 31 August, 2016;
originally announced August 2016.
-
Comparative analysis on the selection of number of clusters in community detection
Authors:
Tatsuro Kawamoto,
Yoshiyuki Kabashima
Abstract:
We conduct a comparative analysis on various estimates of the number of clusters in community detection. An exhaustive comparison requires testing of all possible combinations of frameworks, algorithms, and assessment criteria. In this paper we focus on the framework based on a stochastic block model, and investigate the performance of greedy algorithms, statistical inference, and spectral methods…
▽ More
We conduct a comparative analysis on various estimates of the number of clusters in community detection. An exhaustive comparison requires testing of all possible combinations of frameworks, algorithms, and assessment criteria. In this paper we focus on the framework based on a stochastic block model, and investigate the performance of greedy algorithms, statistical inference, and spectral methods. For the assessment criteria, we consider modularity, map equation, Bethe free energy, prediction errors, and isolated eigenvalues. From the analysis, the tendency of overfit and underfit that the assessment criteria and algorithms have, becomes apparent. In addition, we propose that the alluvial diagram is a suitable tool to visualize statistical inference results and can be useful to determine the number of clusters.
△ Less
Submitted 7 March, 2018; v1 submitted 24 June, 2016;
originally announced June 2016.
-
Cross-validation estimate of the number of clusters in a network
Authors:
Tatsuro Kawamoto,
Yoshiyuki Kabashima
Abstract:
Network science investigates methodologies that summarise relational data to obtain better interpretability. Identifying modular structures is a fundamental task, and assessment of the coarse-grain level is its crucial step. Here, we propose principled, scalable, and widely applicable assessment criteria to determine the number of clusters in modular networks based on the leave-one-out cross-valid…
▽ More
Network science investigates methodologies that summarise relational data to obtain better interpretability. Identifying modular structures is a fundamental task, and assessment of the coarse-grain level is its crucial step. Here, we propose principled, scalable, and widely applicable assessment criteria to determine the number of clusters in modular networks based on the leave-one-out cross-validation estimate of the edge prediction error.
△ Less
Submitted 12 June, 2017; v1 submitted 25 May, 2016;
originally announced May 2016.
-
Detectability of the spectral method for sparse graph partitioning
Authors:
Tatsuro Kawamoto,
Yoshiyuki Kabashima
Abstract:
We show that modularity maximization with the resolution parameter offers a unifying framework of graph partitioning. In this framework, we demonstrate that the spectral method exhibits universal detectability, irrespective of the value of the resolution parameter, as long as the graph is partitioned. Furthermore, we show that when the resolution parameter is sufficiently small, a first-order phas…
▽ More
We show that modularity maximization with the resolution parameter offers a unifying framework of graph partitioning. In this framework, we demonstrate that the spectral method exhibits universal detectability, irrespective of the value of the resolution parameter, as long as the graph is partitioned. Furthermore, we show that when the resolution parameter is sufficiently small, a first-order phase transition occurs, resulting in the graph being unpartitioned.
△ Less
Submitted 18 December, 2015; v1 submitted 22 September, 2015;
originally announced September 2015.
-
Localized eigenvectors of the non-backtracking matrix
Authors:
Tatsuro Kawamoto
Abstract:
In the case of graph partitioning, the emergence of localized eigenvectors can cause the standard spectral method to fail. To overcome this problem, the spectral method using a non-backtracking matrix was proposed. Based on numerical experiments on several examples of real networks, it is clear that the non-backtracking matrix does not exhibit localization of eigenvectors. However, we show that lo…
▽ More
In the case of graph partitioning, the emergence of localized eigenvectors can cause the standard spectral method to fail. To overcome this problem, the spectral method using a non-backtracking matrix was proposed. Based on numerical experiments on several examples of real networks, it is clear that the non-backtracking matrix does not exhibit localization of eigenvectors. However, we show that localized eigenvectors of the non-backtracking matrix can exist outside the spectral band, which may lead to deterioration in the performance of graph partitioning.
△ Less
Submitted 8 February, 2016; v1 submitted 28 May, 2015;
originally announced May 2015.
-
Persistence of activity on Twitter triggered by a natural disaster: A data analysis
Authors:
Tatsuro Kawamoto
Abstract:
In this note, we list the results of a simple analysis of a Twitter dataset: the complete dataset of Japanese tweets in the 1-week period after the Great East Japan earthquake, which occurred on March 11, 2011. Our data analysis shows how people reacted to the earthquake on Twitter and how some users went inactive in the long-term.
In this note, we list the results of a simple analysis of a Twitter dataset: the complete dataset of Japanese tweets in the 1-week period after the Great East Japan earthquake, which occurred on March 11, 2011. Our data analysis shows how people reacted to the earthquake on Twitter and how some users went inactive in the long-term.
△ Less
Submitted 11 March, 2015;
originally announced March 2015.
-
Limitations in the spectral method for graph partitioning: detectability threshold and localization of eigenvectors
Authors:
Tatsuro Kawamoto,
Yoshiyuki Kabashima
Abstract:
Investigating the performance of different methods is a fundamental problem in graph partitioning. In this paper, we estimate the so-called detectability threshold for the spectral method with both unnormalized and normalized Laplacians in sparse graphs. The detectability threshold is the critical point at which the result of the spectral method is completely uncorrelated to the planted partition.…
▽ More
Investigating the performance of different methods is a fundamental problem in graph partitioning. In this paper, we estimate the so-called detectability threshold for the spectral method with both unnormalized and normalized Laplacians in sparse graphs. The detectability threshold is the critical point at which the result of the spectral method is completely uncorrelated to the planted partition. We also analyze whether the localization of eigenvectors affects the partitioning performance in the detectable region. We use the replica method, which is often used in the field of spin-glass theory, and focus on the case of bisection. We show that the gap between the estimated threshold for the spectral method and the threshold obtained from Bayesian inference is considerable in sparse graphs, even without eigenvector localization. This gap closes in a dense limit.
△ Less
Submitted 9 June, 2015; v1 submitted 24 February, 2015;
originally announced February 2015.
-
Estimating the resolution limit of the map equation in community detection
Authors:
Tatsuro Kawamoto,
Martin Rosvall
Abstract:
A community detection algorithm is considered to have a resolution limit if the scale of the smallest modules that can be resolved depends on the size of the analyzed subnetwork. The resolution limit is known to prevent some community detection algorithms from accurately identifying the modular structure of a network. In fact, any global objective function for measuring the quality of a two-level…
▽ More
A community detection algorithm is considered to have a resolution limit if the scale of the smallest modules that can be resolved depends on the size of the analyzed subnetwork. The resolution limit is known to prevent some community detection algorithms from accurately identifying the modular structure of a network. In fact, any global objective function for measuring the quality of a two-level assignment of nodes into modules must have some sort of resolution limit or an external resolution parameter. However, it is yet unknown how the resolution limit affects the so-called map equation, which is known to be an efficient objective function for community detection. We derive an analytical estimate and conclude that the resolution limit of the map equation is set by the total number of links between modules instead of the total number of links in the full network as for modularity. This mechanism makes the resolution limit much less restrictive for the map equation than for modularity, and in practice orders of magnitudes smaller. Furthermore, we argue that the effect of the resolution limit often results from shoehorning multi-level modular structures into two-level descriptions. As we show, the hierarchical map equation effectively eliminates the resolution limit for networks with nested multi-level modular structures.
△ Less
Submitted 14 January, 2015; v1 submitted 18 February, 2014;
originally announced February 2014.
-
Viral spreading of daily information in online social networks
Authors:
Tatsuro Kawamoto,
Naomichi Hatano
Abstract:
We explain a possible mechanism of an information spreading on a network which spreads extremely far from a seed node, namely the viral spreading. On the basis of a model of the information spreading in an online social network, in which the dynamics is expressed as a random multiplicative process of the spreading rates, we will show that the correlation between the spreading rates enhances the ch…
▽ More
We explain a possible mechanism of an information spreading on a network which spreads extremely far from a seed node, namely the viral spreading. On the basis of a model of the information spreading in an online social network, in which the dynamics is expressed as a random multiplicative process of the spreading rates, we will show that the correlation between the spreading rates enhances the chance of the viral spreading, shifting the tip** point at which the spreading goes viral.
△ Less
Submitted 16 March, 2014; v1 submitted 12 November, 2012;
originally announced November 2012.
-
A stochastic model of the tweet diffusion on the Twitter network
Authors:
Tatsuro Kawamoto
Abstract:
We introduce a stochastic model which describes diffusions of tweets on the Twitter network. By dividing the followers into generations, we describe the dynamics of the tweet diffusion as a random multiplicative process. We confirm our model by directly observing the statistics of the multiplicative factors in the Twitter data.
We introduce a stochastic model which describes diffusions of tweets on the Twitter network. By dividing the followers into generations, we describe the dynamics of the tweet diffusion as a random multiplicative process. We confirm our model by directly observing the statistics of the multiplicative factors in the Twitter data.
△ Less
Submitted 25 September, 2012;
originally announced September 2012.
-
Compensation of the Crossing Angle with Crab Cavities at KEKB
Authors:
T. Abe,
K. Akai,
M. Akemoto,
A. Akiyama,
M. Arinaga,
K. Ebihara,
K. Egawa,
A. Enomoto,
J. Flanagan,
S. Fukuda,
H. Fukuma,
Y. Funakoshi,
K. Furukawa,
T. Furuya,
K. Hara,
T. Higo,
S. Hiramatsu,
H. Hisamatsu,
H. Honma,
T. Honma,
K. Hosoyama,
T. Ieiri,
N. Iida,
H. Ikeda,
M. Ikeda
, et al. (90 additional authors not shown)
Abstract:
Crab cavities have been installed in the KEKB B--Factory rings to compensate the crossing angle at the collision point and thus increase luminosity. The beam operation with crab crossing has been done since February 2007. This is the first experience with such cavities in colliders or storage rings. The crab cavities have been working without serious issues. While higher specific luminosity than…
▽ More
Crab cavities have been installed in the KEKB B--Factory rings to compensate the crossing angle at the collision point and thus increase luminosity. The beam operation with crab crossing has been done since February 2007. This is the first experience with such cavities in colliders or storage rings. The crab cavities have been working without serious issues. While higher specific luminosity than the geometrical gain has been achieved, further study is necessary and under way to reach the prediction of simulation.
△ Less
Submitted 21 June, 2007;
originally announced June 2007.