Search | arXiv e-print repository

Uncertainty components in profile likelihood fits

Authors: Andrés Pinto, Zhibo Wu, Fabrice Balli, Nicolas Berger, Maarten Boonekamp, Émilien Chapon, Tatsuo Kawamoto, Bogdan Malaescu

Abstract: When a measurement of a physical quantity is reported, the total uncertainty is usually decomposed into statistical and systematic uncertainties. This decomposition is not only useful to understand the contributions to the total uncertainty, but also required to propagate these contributions in subsequent analyses, such as combinations or interpretation fits including results from other measuremen… ▽ More When a measurement of a physical quantity is reported, the total uncertainty is usually decomposed into statistical and systematic uncertainties. This decomposition is not only useful to understand the contributions to the total uncertainty, but also required to propagate these contributions in subsequent analyses, such as combinations or interpretation fits including results from other measurements or experiments. In profile-likelihood fits, widely applied in high-energy physics analyses, contributions of systematic uncertainties are routinely quantified using "impacts", which are not adequate for such applications. We discuss the difference between impacts and actual uncertainty components, and establish methods to determine the latter in a wide range of statistical models. △ Less

Submitted 14 March, 2024; v1 submitted 8 July, 2023; originally announced July 2023.

Comments: 20 pages

arXiv:2305.10996 [pdf, other]

doi 10.1088/2632-072X/acf01c

Entropy of microcanonical finite-graph ensembles

Authors: Tatsuro Kawamoto

Abstract: The entropy of random graph ensembles has gained widespread attention in the field of graph theory and network science. We consider microcanonical ensembles of simple graphs with prescribed degree sequences. We demonstrate that the mean-field approximations of the generating function using the Chebyshev-Hermite polynomials provide estimates for the entropy of finite-graph ensembles. Our estimate r… ▽ More The entropy of random graph ensembles has gained widespread attention in the field of graph theory and network science. We consider microcanonical ensembles of simple graphs with prescribed degree sequences. We demonstrate that the mean-field approximations of the generating function using the Chebyshev-Hermite polynomials provide estimates for the entropy of finite-graph ensembles. Our estimate reproduces the Bender-Canfield formula in the limit of large graphs. △ Less

Submitted 25 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

Comments: 7 pages, 3 figures

Journal ref: J. Phys. Complex. 4 035005 (2023)

arXiv:2210.08989 [pdf, other]

doi 10.1103/PhysRevE.108.014303

Finding community structure using the ordered random graph model

Authors: Masaki Ochi, Tatsuro Kawamoto

Abstract: Visualization of the adjacency matrix enables us to capture macroscopic features of a network when the matrix elements are aligned properly. Community structure, a network consisting of several densely connected components, is a particularly important feature, and the structure can be identified through the adjacency matrix when it is close to a block-diagonal form. However, classical ordering alg… ▽ More Visualization of the adjacency matrix enables us to capture macroscopic features of a network when the matrix elements are aligned properly. Community structure, a network consisting of several densely connected components, is a particularly important feature, and the structure can be identified through the adjacency matrix when it is close to a block-diagonal form. However, classical ordering algorithms for matrices fail to align matrix elements such that the community structure is visible. In this study, we propose an ordering algorithm based on the maximum-likelihood estimate of the ordered random graph model. We show that the proposed method allows us to more clearly identify community structures than the existing ordering algorithms. △ Less

Submitted 10 July, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

Comments: 14 pages, 12 figures

Journal ref: Phys. Rev. E 108, 014303 (2023)

arXiv:2208.12933 [pdf, other]

doi 10.1103/PhysRevResearch.5.023006

Consistency between ordering and clustering methods for graphs

Authors: Tatsuro Kawamoto, Masaki Ochi, Teruyoshi Kobayashi

Abstract: A relational dataset is often analyzed by optimally assigning a label to each element through clustering or ordering. While similar characterizations of a dataset would be achieved by both clustering and ordering methods, the former has been studied much more actively than the latter, particularly for the data represented as graphs. This study fills this gap by investigating methodological relatio… ▽ More A relational dataset is often analyzed by optimally assigning a label to each element through clustering or ordering. While similar characterizations of a dataset would be achieved by both clustering and ordering methods, the former has been studied much more actively than the latter, particularly for the data represented as graphs. This study fills this gap by investigating methodological relationships between several clustering and ordering methods, focusing on spectral techniques. Furthermore, we evaluate the resulting performance of the clustering and ordering methods. To this end, we propose a measure called the label continuity error, which generically quantifies the degree of consistency between a sequence and partition for a set of elements. Based on synthetic and real-world datasets, we evaluate the extents to which an ordering method identifies a module structure and a clustering method identifies a banded structure. △ Less

Submitted 7 April, 2023; v1 submitted 27 August, 2022; originally announced August 2022.

Comments: 30 pages, 26 figures

Journal ref: Phys. Rev. Research 5, 023006 (2023)

arXiv:2203.04044 [pdf, other]

doi 10.1038/s41598-023-33880-y

Single-trajectory map equation

Authors: Tatsuro Kawamoto

Abstract: Community detection, the process of identifying module structures in complex systems represented on networks, is an effective tool in various fields of science. The map equation, which is an information-theoretic framework based on the random walk on a network, is a particularly popular community detection method. Despite its outstanding performance in many applications, the inner workings of the… ▽ More Community detection, the process of identifying module structures in complex systems represented on networks, is an effective tool in various fields of science. The map equation, which is an information-theoretic framework based on the random walk on a network, is a particularly popular community detection method. Despite its outstanding performance in many applications, the inner workings of the map equation have not been thoroughly studied. Herein, we revisit the original formulation of the map equation and address the existence of its ``raw form,'' which we refer to as the single-trajectory map equation. This raw form sheds light on many details behind the principle of the map equation that are hidden in the steady-state limit of the random walk. Most importantly, the single-trajectory map equation provides a more balanced community structure, naturally reducing the tendency of the overfitting phenomenon in the map equation. △ Less

Submitted 23 April, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

Comments: 20 pages, 14 figures

Journal ref: Sci. Rep. 13, 6597 (2023)

arXiv:2202.03653 [pdf, other]

doi 10.1103/PhysRevResearch.4.033129

Contribution of directedness in graph spectra

Authors: Masaki Ochi, Tatsuro Kawamoto

Abstract: In graph analyses, directed edges are often approximated to undirected ones so that the adjacency matrices may be symmetric. However, such simplification has not been thoroughly verified. In this study, we investigate how directedness affects the graph spectra by introducing random directization, which is an opposite operation of neglecting edge directions. We analytically reveal that uniformly ra… ▽ More In graph analyses, directed edges are often approximated to undirected ones so that the adjacency matrices may be symmetric. However, such simplification has not been thoroughly verified. In this study, we investigate how directedness affects the graph spectra by introducing random directization, which is an opposite operation of neglecting edge directions. We analytically reveal that uniformly random directization typically conserves the relative spectral structure of the adjacency matrix in the perturbative regime. The result of random directization implies that the spectrum of the adjacency matrix can be conserved after the directedness is ignored. △ Less

Submitted 18 August, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

Comments: 15 pages, 12 figures

Journal ref: Phys. Rev. Research 4, 033129 (2022)

arXiv:2202.01321 [pdf, other]

Transverse Oscillating Bubble Enhanced Laser-driven Betatron X-ray Radiation Generation

Authors: Rafal Rakowski, ** Zhang, Kyle Jensen, Brendan Kettle, Tim Kawamoto, Sudeep Banerjee, Colton Fruhling, Grigory Golovin, Daniel Haden, Matthew S. Robinson, Donald Umstadter, B. A. Shadwick, Matthias Fuchs

Abstract: Ultrafast high-brightness X-ray pulses have proven invaluable for a broad range of research. Such pulses are typically generated via synchrotron emission from relativistic electron bunches using large-scale facilities. Recently, significantly more compact X-ray sources based on laser-wakefield accelerated (LWFA) electron beams have been demonstrated. In particular, laser-driven sources, where the… ▽ More Ultrafast high-brightness X-ray pulses have proven invaluable for a broad range of research. Such pulses are typically generated via synchrotron emission from relativistic electron bunches using large-scale facilities. Recently, significantly more compact X-ray sources based on laser-wakefield accelerated (LWFA) electron beams have been demonstrated. In particular, laser-driven sources, where the radiation is generated by transverse oscillations of electrons within the plasma accelerator structure (so-called betatron oscillations) can generate highly-brilliant ultrashort X-ray pulses using a comparably simple setup. Here, we experimentally demonstrate a method to markedly enhance and control the parameters of LWFA-driven betatron X-ray emission. With our novel Transverse Oscillating Bubble Enhanced Betatron Radiation (TOBER) scheme, we show a significant increase in the number of generated photons by specifically manipulating the amplitude of the betatron oscillations. We realize this through an orchestrated evolution of the temporal laser pulse shape and the accelerating plasma structure. This leads to controlled off-axis injection of electrons that perform large-amplitude collective transverse betatron oscillations, resulting in increased radiation emission. Our concept holds the promise for a method to optimize the X-ray parameters for specific applications, such as time-resolved investigations with spatial and temporal atomic resolution or advanced high-resolution imaging modalities, and the generation of X-ray beams with even higher peak and average brightness. △ Less

Submitted 2 February, 2022; originally announced February 2022.

arXiv:2111.11267 [pdf, other]

doi 10.1103/PhysRevResearch.5.023007

Sequential locality of graphs and its hypothesis testing

Authors: Tatsuro Kawamoto, Teruyoshi Kobayashi

Abstract: The adjacency matrix is the most fundamental and intuitive object in graph analysis that is useful not only mathematically but also for visualizing the structures of graphs. Because the appearance of an adjacency matrix is critically affected by the ordering of rows and columns, or vertex ordering, statistical assessment of graphs together with their vertex sequences is important in identifying th… ▽ More The adjacency matrix is the most fundamental and intuitive object in graph analysis that is useful not only mathematically but also for visualizing the structures of graphs. Because the appearance of an adjacency matrix is critically affected by the ordering of rows and columns, or vertex ordering, statistical assessment of graphs together with their vertex sequences is important in identifying the characteristic structures of graphs. In this paper, we propose a hypothesis testing framework that assesses how locally vertices are connected to each other along a specified vertex sequence, which provides a statistical foundation for an optimization problem called envelope reduction or minimum linear arrangement. The proposed tests are particularly suitable for moderately small data and formulated based on a combinatorial approach and a block model with intrinsic vertex ordering. △ Less

Submitted 6 April, 2023; v1 submitted 22 November, 2021; originally announced November 2021.

Comments: 23 pages, 11 figures

Journal ref: Phys. Rev. Research 5, 023007 (2023)

arXiv:2105.13709 [pdf, other]

doi 10.1016/j.nima.2021.166143

The large inner Micromegas modules for the Atlas Muon Spectrometer Upgrade: construction, quality control and characterization

Authors: J. Allard, M. Anfreville, N. Andari, D. Attié, S. Aune, H. Bachacou, F. Balli, F. Bauer, J. Bennet, T. Benoit, J. Beltramelli, H. Bervas, T. Bey, S. Bouaziz, M. Boyer, T. Challey, T. Chevalérias, X. Copollani, J. Costa, G. Cara, G. Decock, F. Deliot, D. Denysiuk, D. Desforge, G. Disset , et al. (49 additional authors not shown)

Abstract: The steadily increasing luminosity of the LHC requires an upgrade with high-rate and high-resolution detector technology for the inner end cap of the ATLAS muon spectrometer: the New Small Wheels (NSW). In order to achieve the goal of precision tracking at a hit rate of about 15 kHz/cm$^2$ at the inner radius of the NSW, large area Micromegas quadruplets with 100\,\microns spatial resolution per p… ▽ More The steadily increasing luminosity of the LHC requires an upgrade with high-rate and high-resolution detector technology for the inner end cap of the ATLAS muon spectrometer: the New Small Wheels (NSW). In order to achieve the goal of precision tracking at a hit rate of about 15 kHz/cm$^2$ at the inner radius of the NSW, large area Micromegas quadruplets with 100\,\microns spatial resolution per plane have been produced. % IRFU, from the CEA research center of Saclay, is responsible for the production and validation of LM1 Micromegas modules. The construction, production, qualification and validation of the largest Micromegas detectors ever built are reported here. Performance results under cosmic muon characterisation will also be discussed. △ Less

Submitted 28 May, 2021; originally announced May 2021.

Comments: To be submitted to NIMA

arXiv:2012.04510 [pdf, other]

doi 10.1371/journal.pone.0256212

Graph-based open-ended survey on concerns related to COVID-19

Authors: Tatsuro Kawamoto, Takaaki Aoki, Michiko Ueda

Abstract: The COVID-19 pandemic is an unprecedented public health crisis with broad social and economic consequences. We conducted four surveys between April and August 2020 using the graph-based open-ended survey (GOS) framework, and investigated the most pressing concerns and issues for the general public in Japan. The GOS framework is a hybrid of the two traditional survey frameworks that allows responde… ▽ More The COVID-19 pandemic is an unprecedented public health crisis with broad social and economic consequences. We conducted four surveys between April and August 2020 using the graph-based open-ended survey (GOS) framework, and investigated the most pressing concerns and issues for the general public in Japan. The GOS framework is a hybrid of the two traditional survey frameworks that allows respondents to post their opinions in a free-format style, which can subsequently serve as one of the choice items for other respondents, just as in a multiple-choice survey. As a result, this framework generates an opinion graph that relates opinions and respondents. We can also construct annotated opinion graphs to achieve a higher resolution. By clustering the annotated opinion graphs, we revealed the characteristic evolution of the response patterns as well as the interconnectedness and multi-faceted nature of opinions. Substantively, our notable finding is that "social pressure," not "infection risk," was one of the major concerns of our respondents. Social pressure refers to criticism and discrimination that they anticipate receiving from others should they contract COVID-19. It is possible that the collectivist nature of Japanese culture coupled with the government's policy of relying on personal responsibility to combat COVID-19 explains some of the above findings, as the latter has led to the emergence of vigilantes. The presence of mutual surveillance can contribute to growing skepticism toward others as well as fear of ostracism, which may have negative consequences at both the societal and individual levels. △ Less

Submitted 22 December, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

Comments: 12 pages, 7 figures, 1 table

Journal ref: PLOS ONE 16(8): e0256212 (2021)

arXiv:2011.04190 [pdf, other]

doi 10.1007/s42973-020-00058-4

Identifying macroscopic features in foreign visitor travel pathways

Authors: Tatsuro Kawamoto, Ryutaro Hashimoto

Abstract: Human travel patterns are commonly studied as networks in which the points of departure and destination are encoded as nodes and the travel frequency between two points is recorded as a weighted edge. However, because travelers often visit multiple destinations, which constitute pathways, an analysis incorporating pathway statistics is expected to be more informative over an approach based solely… ▽ More Human travel patterns are commonly studied as networks in which the points of departure and destination are encoded as nodes and the travel frequency between two points is recorded as a weighted edge. However, because travelers often visit multiple destinations, which constitute pathways, an analysis incorporating pathway statistics is expected to be more informative over an approach based solely on pairwise frequencies. Hence, in this study, we apply a higher-order network representation framework to identify characteristic travel patterns from foreign visitor pathways in Japan. We expect that the results herein are mainly useful for marketing research in the tourism industry. △ Less

Submitted 9 November, 2020; originally announced November 2020.

Comments: 16 pages, 10 figures

Journal ref: The Japanese Economic Review (2020)

arXiv:2010.02791 [pdf, other]

Spectral clustering of annotated graphs using a factor graph representation

Authors: Tatsuro Kawamoto

Abstract: Graph-structured data commonly have node annotations. A popular approach for inference and learning involving annotated graphs is to incorporate annotations into a statistical model or algorithm. By contrast, we consider a more direct method named scotch-ta**, in which the structural information in a graph and its node annotations are encoded as a factor graph. Specifically, we establish the mat… ▽ More Graph-structured data commonly have node annotations. A popular approach for inference and learning involving annotated graphs is to incorporate annotations into a statistical model or algorithm. By contrast, we consider a more direct method named scotch-ta**, in which the structural information in a graph and its node annotations are encoded as a factor graph. Specifically, we establish the mathematical basis of this method in the spectral framework. △ Less

Submitted 6 October, 2020; originally announced October 2020.

Comments: 24 pages, 8 figures

arXiv:2003.02463 [pdf, other]

doi 10.1103/PhysRevResearch.2.043101

Fragility of spectral clustering for networks with an overlap** structure

Authors: Chihiro Noguchi, Tatsuro Kawamoto

Abstract: Communities commonly overlap in real-world networks. This is a motivation to develop overlap** community detection methods, because methods for non-overlap** communities may not perform well. However, deterioration mechanism of the detection methods used for non-overlap** communities have rarely been investigated theoretically. Here, we analyze an accuracy of spectral clustering, which does… ▽ More Communities commonly overlap in real-world networks. This is a motivation to develop overlap** community detection methods, because methods for non-overlap** communities may not perform well. However, deterioration mechanism of the detection methods used for non-overlap** communities have rarely been investigated theoretically. Here, we analyze an accuracy of spectral clustering, which does not consider overlap** structures, by using the replica method from statistical physics. Our analysis on an overlap** stochastic block model reveals how the structural information is lost from the leading eigenvector because of the overlap** structure. △ Less

Submitted 25 October, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

Comments: 23 pages, 16 figures

Journal ref: Phys. Rev. Research 2, 043101 (2020)

arXiv:1907.04359 [pdf, other]

doi 10.1038/s42256-019-0071-y

Democratic summary of public opinions in free-response surveys

Authors: Tatsuro Kawamoto, Takaaki Aoki

Abstract: Social surveys have been widely used as a method of obtaining public opinion. Sometimes it is more ideal to collect opinions by presenting questions in free-response formats than in multiple-choice formats. Despite their advantages, free-response questions are rarely used in practice because they usually require manual analysis. Therefore, classification of free-format texts can present a formidab… ▽ More Social surveys have been widely used as a method of obtaining public opinion. Sometimes it is more ideal to collect opinions by presenting questions in free-response formats than in multiple-choice formats. Despite their advantages, free-response questions are rarely used in practice because they usually require manual analysis. Therefore, classification of free-format texts can present a formidable task in large-scale surveys and can be influenced by the interpretations of analysts. In this study, we propose a network-based survey framework in which responses are automatically classified in a statistically principled manner. This can be achieved because in addition to the texts, similarities among responses are also assessed by each respondent. We demonstrate our approach using a poll on the 2016 US presidential election and a survey taken by graduates of a particular university. The proposed approach helps analysts interpret the underlying semantics of responses in large-scale surveys. △ Less

Submitted 8 July, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

Comments: 7 + 17 pages, 3 + 9 figures, 3 tables, the accepted version

Journal ref: Nature Machine Intelligence, 1, 322-327 (2019)

arXiv:1906.00699 [pdf, other]

Evaluating network partitions through visualization

Authors: Chihiro Noguchi, Tatsuro Kawamoto

Abstract: Network clustering requires making many decisions manually, such as the number of groups and a statistical model to be used. Even after filtering using an information criterion or regularizing with a nonparametric framework, we are commonly left with multiple candidates with reasonable partitions. In the end, the user has to decide which inferred groups should be regarded as informative. Here we p… ▽ More Network clustering requires making many decisions manually, such as the number of groups and a statistical model to be used. Even after filtering using an information criterion or regularizing with a nonparametric framework, we are commonly left with multiple candidates with reasonable partitions. In the end, the user has to decide which inferred groups should be regarded as informative. Here we propose a visualization method that efficiently represents network partitioning based on statistical inference algorithms. Our non-statistical assessment procedure based on visualization helps users extract informative groups when they cannot uniquely determine significant groups on the basis of statistical assessments. The proposed visualization is also effective for use as a benchmark test of different clustering algorithms. △ Less

Submitted 4 June, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

Comments: 7 pages, 4 figures

arXiv:1810.11908 [pdf, other]

doi 10.1088/1742-5468/ab3456

Mean-field theory of graph neural networks in graph partitioning

Authors: Tatsuro Kawamoto, Masashi Tsubaki, Tomoyuki Obuchi

Abstract: A theoretical performance analysis of the graph neural network (GNN) is presented. For classification tasks, the neural network approach has the advantage in terms of flexibility that it can be employed in a data-driven manner, whereas Bayesian inference requires the assumption of a specific model. A fundamental question is then whether GNN has a high accuracy in addition to this flexibility. More… ▽ More A theoretical performance analysis of the graph neural network (GNN) is presented. For classification tasks, the neural network approach has the advantage in terms of flexibility that it can be employed in a data-driven manner, whereas Bayesian inference requires the assumption of a specific model. A fundamental question is then whether GNN has a high accuracy in addition to this flexibility. Moreover, whether the achieved performance is predominately a result of the backpropagation or the architecture itself is a matter of considerable interest. To gain a better insight into these questions, a mean-field theory of a minimal GNN architecture is developed for the graph partitioning problem. This demonstrates a good agreement with numerical experiments. △ Less

Submitted 28 October, 2018; originally announced October 2018.

Comments: 16 pages, 6 figures, Thirty-second Conference on Neural Information Processing Systems (NIPS2018)

arXiv:1808.07690 [pdf, other]

doi 10.1103/PhysRevE.99.010301

Counting the number of metastable states in the modularity landscape: Algorithmic detectability limit of greedy algorithms in community detection

Authors: Tatsuro Kawamoto, Yoshiyuki Kabashima

Abstract: Modularity maximization using greedy algorithms continues to be a popular approach toward community detection in graphs, even after various better forming algorithms have been proposed. Apart from its clear mechanism and ease of implementation, this approach is persistently popular because, presumably, its risk of algorithmic failure is not well understood. This Rapid Communication provides insigh… ▽ More Modularity maximization using greedy algorithms continues to be a popular approach toward community detection in graphs, even after various better forming algorithms have been proposed. Apart from its clear mechanism and ease of implementation, this approach is persistently popular because, presumably, its risk of algorithmic failure is not well understood. This Rapid Communication provides insight into this issue by estimating the algorithmic performance limit of modularity maximization. This is achieved by counting the number of metastable states under a local update rule. Our results offer a quantitative insight into the level of sparsity at which a greedy algorithm typically fails. △ Less

Submitted 17 January, 2019; v1 submitted 23 August, 2018; originally announced August 2018.

Comments: 6+6 pages, 5 figures

Journal ref: Phys. Rev. E 99, 010301 (2019)

arXiv:1710.08841 [pdf, other]

doi 10.1103/PhysRevE.97.032301

Algorithmic detectability threshold of the stochastic block model

Authors: Tatsuro Kawamoto

Abstract: The assumption that the values of model parameters are known or correctly learned, i.e., the Nishimori condition, is one of the requirements for the detectability analysis of the stochastic block model in statistical inference. In practice, however, there is no example demonstrating that we can know the model parameters beforehand, and there is no guarantee that the model parameters can be learned… ▽ More The assumption that the values of model parameters are known or correctly learned, i.e., the Nishimori condition, is one of the requirements for the detectability analysis of the stochastic block model in statistical inference. In practice, however, there is no example demonstrating that we can know the model parameters beforehand, and there is no guarantee that the model parameters can be learned accurately. In this study, we consider the expectation--maximization (EM) algorithm with belief propagation (BP) and derive its algorithmic detectability threshold. Our analysis is not restricted to the community structure, but includes general modular structures. Because the algorithm cannot always learn the planted model parameters correctly, the algorithmic detectability threshold is qualitatively different from the one with the Nishimori condition. △ Less

Submitted 7 March, 2018; v1 submitted 24 October, 2017; originally announced October 2017.

Comments: 15 pages, 8 figures

Journal ref: Phys. Rev. E 97, 032301 (2018)

arXiv:1710.08816 [pdf, other]

Algorithmic infeasibility of community detection in higher-order networks

Authors: Tatsuro Kawamoto

Abstract: In principle, higher-order networks that have multiple edge types are more informative than their lower-order counterparts. In practice, however, excessively rich information may be algorithmically infeasible to extract. It requires an algorithm that assumes a high-dimensional model and such an algorithm may perform poorly or be extremely sensitive to the initial estimate of the model parameters.… ▽ More In principle, higher-order networks that have multiple edge types are more informative than their lower-order counterparts. In practice, however, excessively rich information may be algorithmically infeasible to extract. It requires an algorithm that assumes a high-dimensional model and such an algorithm may perform poorly or be extremely sensitive to the initial estimate of the model parameters. Herein, we address this problem of community detection through a detectability analysis. We focus on the expectation-maximization (EM) algorithm with belief propagation (BP), and analytically derive its algorithmic detectability threshold, i.e., the limit of the modular structure strength below which the algorithm can no longer detect any modular structures. The results indicate the existence of a phase in which the community detection of a lower-order network outperforms its higher-order counterpart. △ Less

Submitted 24 October, 2017; originally announced October 2017.

Comments: 5 pages, 3 figures

arXiv:1608.08908 [pdf, ps, other]

doi 10.1103/PhysRevE.95.012304

Detectability thresholds of general modular graphs

Authors: Tatsuro Kawamoto, Yoshiyuki Kabashima

Abstract: We investigate the detectability thresholds of various modular structures in the stochastic block model. Our analysis reveals how the detectability threshold is related to the details of the modular pattern, including the hierarchy of the clusters. We show that certain planted structures are impossible to infer regardless of their fuzziness. We investigate the detectability thresholds of various modular structures in the stochastic block model. Our analysis reveals how the detectability threshold is related to the details of the modular pattern, including the hierarchy of the clusters. We show that certain planted structures are impossible to infer regardless of their fuzziness. △ Less

Submitted 9 January, 2017; v1 submitted 31 August, 2016; originally announced August 2016.

Comments: 5 pages, 3 figures

Journal ref: Phys. Rev. E 95, 012304 (2017)

arXiv:1606.07668 [pdf, other]

doi 10.1103/PhysRevE.97.022315

Comparative analysis on the selection of number of clusters in community detection

Authors: Tatsuro Kawamoto, Yoshiyuki Kabashima

Abstract: We conduct a comparative analysis on various estimates of the number of clusters in community detection. An exhaustive comparison requires testing of all possible combinations of frameworks, algorithms, and assessment criteria. In this paper we focus on the framework based on a stochastic block model, and investigate the performance of greedy algorithms, statistical inference, and spectral methods… ▽ More We conduct a comparative analysis on various estimates of the number of clusters in community detection. An exhaustive comparison requires testing of all possible combinations of frameworks, algorithms, and assessment criteria. In this paper we focus on the framework based on a stochastic block model, and investigate the performance of greedy algorithms, statistical inference, and spectral methods. For the assessment criteria, we consider modularity, map equation, Bethe free energy, prediction errors, and isolated eigenvalues. From the analysis, the tendency of overfit and underfit that the assessment criteria and algorithms have, becomes apparent. In addition, we propose that the alluvial diagram is a suitable tool to visualize statistical inference results and can be useful to determine the number of clusters. △ Less

Submitted 7 March, 2018; v1 submitted 24 June, 2016; originally announced June 2016.

Comments: 21 pages, 14 figures, 2 tables

Journal ref: Phys. Rev. E 97, 022315 (2018)

arXiv:1605.07915 [pdf, ps, other]

doi 10.1038/s41598-017-03623-x

Cross-validation estimate of the number of clusters in a network

Authors: Tatsuro Kawamoto, Yoshiyuki Kabashima

Abstract: Network science investigates methodologies that summarise relational data to obtain better interpretability. Identifying modular structures is a fundamental task, and assessment of the coarse-grain level is its crucial step. Here, we propose principled, scalable, and widely applicable assessment criteria to determine the number of clusters in modular networks based on the leave-one-out cross-valid… ▽ More Network science investigates methodologies that summarise relational data to obtain better interpretability. Identifying modular structures is a fundamental task, and assessment of the coarse-grain level is its crucial step. Here, we propose principled, scalable, and widely applicable assessment criteria to determine the number of clusters in modular networks based on the leave-one-out cross-validation estimate of the edge prediction error. △ Less

Submitted 12 June, 2017; v1 submitted 25 May, 2016; originally announced May 2016.

Comments: 19 pages, 9 figures

Journal ref: Scientific Reports, 7, 3327 (2017)

arXiv:1509.06484 [pdf, ps, other]

doi 10.1209/0295-5075/112/40007

Detectability of the spectral method for sparse graph partitioning

Authors: Tatsuro Kawamoto, Yoshiyuki Kabashima

Abstract: We show that modularity maximization with the resolution parameter offers a unifying framework of graph partitioning. In this framework, we demonstrate that the spectral method exhibits universal detectability, irrespective of the value of the resolution parameter, as long as the graph is partitioned. Furthermore, we show that when the resolution parameter is sufficiently small, a first-order phas… ▽ More We show that modularity maximization with the resolution parameter offers a unifying framework of graph partitioning. In this framework, we demonstrate that the spectral method exhibits universal detectability, irrespective of the value of the resolution parameter, as long as the graph is partitioned. Furthermore, we show that when the resolution parameter is sufficiently small, a first-order phase transition occurs, resulting in the graph being unpartitioned. △ Less

Submitted 18 December, 2015; v1 submitted 22 September, 2015; originally announced September 2015.

Comments: 6 pages, 2 figures

Journal ref: Europhys. Lett. 112, 40007 (2015)

arXiv:1505.07543 [pdf, ps, other]

Localized eigenvectors of the non-backtracking matrix

Authors: Tatsuro Kawamoto

Abstract: In the case of graph partitioning, the emergence of localized eigenvectors can cause the standard spectral method to fail. To overcome this problem, the spectral method using a non-backtracking matrix was proposed. Based on numerical experiments on several examples of real networks, it is clear that the non-backtracking matrix does not exhibit localization of eigenvectors. However, we show that lo… ▽ More In the case of graph partitioning, the emergence of localized eigenvectors can cause the standard spectral method to fail. To overcome this problem, the spectral method using a non-backtracking matrix was proposed. Based on numerical experiments on several examples of real networks, it is clear that the non-backtracking matrix does not exhibit localization of eigenvectors. However, we show that localized eigenvectors of the non-backtracking matrix can exist outside the spectral band, which may lead to deterioration in the performance of graph partitioning. △ Less

Submitted 8 February, 2016; v1 submitted 28 May, 2015; originally announced May 2015.

Comments: 11 pages, 5 figures, to be published from JSTAT

Journal ref: J. Stat. Mech. 023404 (2016)

arXiv:1503.03199 [pdf, other]

Persistence of activity on Twitter triggered by a natural disaster: A data analysis

Authors: Tatsuro Kawamoto

Abstract: In this note, we list the results of a simple analysis of a Twitter dataset: the complete dataset of Japanese tweets in the 1-week period after the Great East Japan earthquake, which occurred on March 11, 2011. Our data analysis shows how people reacted to the earthquake on Twitter and how some users went inactive in the long-term. In this note, we list the results of a simple analysis of a Twitter dataset: the complete dataset of Japanese tweets in the 1-week period after the Great East Japan earthquake, which occurred on March 11, 2011. Our data analysis shows how people reacted to the earthquake on Twitter and how some users went inactive in the long-term. △ Less

Submitted 11 March, 2015; originally announced March 2015.

Comments: 2 pages, 3 figures

arXiv:1502.06775 [pdf, ps, other]

doi 10.1103/PhysRevE.91.062803

Limitations in the spectral method for graph partitioning: detectability threshold and localization of eigenvectors

Authors: Tatsuro Kawamoto, Yoshiyuki Kabashima

Abstract: Investigating the performance of different methods is a fundamental problem in graph partitioning. In this paper, we estimate the so-called detectability threshold for the spectral method with both unnormalized and normalized Laplacians in sparse graphs. The detectability threshold is the critical point at which the result of the spectral method is completely uncorrelated to the planted partition.… ▽ More Investigating the performance of different methods is a fundamental problem in graph partitioning. In this paper, we estimate the so-called detectability threshold for the spectral method with both unnormalized and normalized Laplacians in sparse graphs. The detectability threshold is the critical point at which the result of the spectral method is completely uncorrelated to the planted partition. We also analyze whether the localization of eigenvectors affects the partitioning performance in the detectable region. We use the replica method, which is often used in the field of spin-glass theory, and focus on the case of bisection. We show that the gap between the estimated threshold for the spectral method and the threshold obtained from Bayesian inference is considerable in sparse graphs, even without eigenvector localization. This gap closes in a dense limit. △ Less

Submitted 9 June, 2015; v1 submitted 24 February, 2015; originally announced February 2015.

Comments: 26 pages, 13 figures

Journal ref: Phys. Rev. E 91, 062803 (2015)

arXiv:1402.4385 [pdf, ps, other]

doi 10.1103/PhysRevE.91.012809

Estimating the resolution limit of the map equation in community detection

Authors: Tatsuro Kawamoto, Martin Rosvall

Abstract: A community detection algorithm is considered to have a resolution limit if the scale of the smallest modules that can be resolved depends on the size of the analyzed subnetwork. The resolution limit is known to prevent some community detection algorithms from accurately identifying the modular structure of a network. In fact, any global objective function for measuring the quality of a two-level… ▽ More A community detection algorithm is considered to have a resolution limit if the scale of the smallest modules that can be resolved depends on the size of the analyzed subnetwork. The resolution limit is known to prevent some community detection algorithms from accurately identifying the modular structure of a network. In fact, any global objective function for measuring the quality of a two-level assignment of nodes into modules must have some sort of resolution limit or an external resolution parameter. However, it is yet unknown how the resolution limit affects the so-called map equation, which is known to be an efficient objective function for community detection. We derive an analytical estimate and conclude that the resolution limit of the map equation is set by the total number of links between modules instead of the total number of links in the full network as for modularity. This mechanism makes the resolution limit much less restrictive for the map equation than for modularity, and in practice orders of magnitudes smaller. Furthermore, we argue that the effect of the resolution limit often results from shoehorning multi-level modular structures into two-level descriptions. As we show, the hierarchical map equation effectively eliminates the resolution limit for networks with nested multi-level modular structures. △ Less

Submitted 14 January, 2015; v1 submitted 18 February, 2014; originally announced February 2014.

Comments: 12 pages, 7 figures

Journal ref: Phys. Rev. E 91, 012809 (2015)

arXiv:1211.2555 [pdf, ps, other]

Viral spreading of daily information in online social networks

Authors: Tatsuro Kawamoto, Naomichi Hatano

Abstract: We explain a possible mechanism of an information spreading on a network which spreads extremely far from a seed node, namely the viral spreading. On the basis of a model of the information spreading in an online social network, in which the dynamics is expressed as a random multiplicative process of the spreading rates, we will show that the correlation between the spreading rates enhances the ch… ▽ More We explain a possible mechanism of an information spreading on a network which spreads extremely far from a seed node, namely the viral spreading. On the basis of a model of the information spreading in an online social network, in which the dynamics is expressed as a random multiplicative process of the spreading rates, we will show that the correlation between the spreading rates enhances the chance of the viral spreading, shifting the tip** point at which the spreading goes viral. △ Less

Submitted 16 March, 2014; v1 submitted 12 November, 2012; originally announced November 2012.

Comments: 15 pages, 3 figures, accepted for publication in Physica A: Statistical Mechanics and its Applications

arXiv:1209.5599 [pdf, ps, other]

doi 10.1016/j.physa.2013.03.048

A stochastic model of the tweet diffusion on the Twitter network

Authors: Tatsuro Kawamoto

Abstract: We introduce a stochastic model which describes diffusions of tweets on the Twitter network. By dividing the followers into generations, we describe the dynamics of the tweet diffusion as a random multiplicative process. We confirm our model by directly observing the statistics of the multiplicative factors in the Twitter data. We introduce a stochastic model which describes diffusions of tweets on the Twitter network. By dividing the followers into generations, we describe the dynamics of the tweet diffusion as a random multiplicative process. We confirm our model by directly observing the statistics of the multiplicative factors in the Twitter data. △ Less

Submitted 25 September, 2012; originally announced September 2012.

Comments: 12 pages, 11 figures

arXiv:0706.3248 [pdf, other]

Compensation of the Crossing Angle with Crab Cavities at KEKB

Authors: T. Abe, K. Akai, M. Akemoto, A. Akiyama, M. Arinaga, K. Ebihara, K. Egawa, A. Enomoto, J. Flanagan, S. Fukuda, H. Fukuma, Y. Funakoshi, K. Furukawa, T. Furuya, K. Hara, T. Higo, S. Hiramatsu, H. Hisamatsu, H. Honma, T. Honma, K. Hosoyama, T. Ieiri, N. Iida, H. Ikeda, M. Ikeda , et al. (90 additional authors not shown)

Abstract: Crab cavities have been installed in the KEKB B--Factory rings to compensate the crossing angle at the collision point and thus increase luminosity. The beam operation with crab crossing has been done since February 2007. This is the first experience with such cavities in colliders or storage rings. The crab cavities have been working without serious issues. While higher specific luminosity than… ▽ More Crab cavities have been installed in the KEKB B--Factory rings to compensate the crossing angle at the collision point and thus increase luminosity. The beam operation with crab crossing has been done since February 2007. This is the first experience with such cavities in colliders or storage rings. The crab cavities have been working without serious issues. While higher specific luminosity than the geometrical gain has been achieved, further study is necessary and under way to reach the prediction of simulation. △ Less

Submitted 21 June, 2007; originally announced June 2007.

Comments: Submitted to Particle Accelerator Conference 2007, MOZAKI01, Albuquerque

Journal ref: Conf.Proc.C070625:27,2007

Showing 1–30 of 30 results for author: Kawamoto, T