Search | arXiv e-print repository

Algebraic Geometrical Analysis of Metropolis Algorithm When Parameters Are Non-identifiable

Authors: Kenji Nagata, Yoh-ichi Mototake

Abstract: The Metropolis algorithm is one of the Markov chain Monte Carlo (MCMC) methods that realize sampling from the target probability distribution. In this paper, we are concerned with the sampling from the distribution in non-identifiable cases that involve models with Fisher information matrices that may fail to be invertible. The theoretical adjustment of the step size, which is the variance of the… ▽ More The Metropolis algorithm is one of the Markov chain Monte Carlo (MCMC) methods that realize sampling from the target probability distribution. In this paper, we are concerned with the sampling from the distribution in non-identifiable cases that involve models with Fisher information matrices that may fail to be invertible. The theoretical adjustment of the step size, which is the variance of the candidate distribution, is difficult for non-identifiable cases. In this study, to establish such a principle, the average acceptance rate, which is used as a guideline to optimize the step size in the MCMC method, was analytically derived in non-identifiable cases. The optimization principle for the step size was developed from the viewpoint of the average acceptance rate. In addition, we performed numerical experiments on some specific target distributions to verify the effectiveness of our theoretical results. △ Less

Submitted 1 June, 2024; originally announced June 2024.

Comments: 14 pages, 3 figures

arXiv:2306.16593 [pdf, other]

Autoregressive with Slack Time Series Model for Forecasting a Partially-Observed Dynamical Time Series

Authors: Akifumi Okuno, Yuya Morishita, Yoh-ichi Mototake

Abstract: This study delves into the domain of dynamical systems, specifically the forecasting of dynamical time series defined through an evolution function. Traditional approaches in this area predict the future behavior of dynamical systems by inferring the evolution function. However, these methods may confront obstacles due to the presence of missing variables, which are usually attributed to challenge… ▽ More This study delves into the domain of dynamical systems, specifically the forecasting of dynamical time series defined through an evolution function. Traditional approaches in this area predict the future behavior of dynamical systems by inferring the evolution function. However, these methods may confront obstacles due to the presence of missing variables, which are usually attributed to challenges in measurement and a partial understanding of the system of interest. To overcome this obstacle, we introduce the autoregressive with slack time series (ARS) model, that simultaneously estimates the evolution function and imputes missing variables as a slack time series. Assuming time-invariance and linearity in the (underlying) entire dynamical time series, our experiments demonstrate the ARS model's capability to forecast future time series. From a theoretical perspective, we prove that a 2-dimensional time-invariant and linear system can be reconstructed by utilizing observations from a single, partially observed dimension of the system. △ Less

Submitted 9 February, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: 15 pages, 6 figures, accepted to IEEE Access

arXiv:2306.03575 [pdf, other]

Quantifying physical insights cooperatively with exhaustive search for Bayesian spectroscopy of X-ray photoelectron spectra

Authors: Hiroyuki Kumazoe, Kazunori Iwamitsu, Masaki Imamura, Kazutoshi Takahashi, Yoh-ichi Mototake, Masato Okada, Ichiro Akai

Abstract: We analyzed the X-ray photoemission spectra (XPS) of carbon 1s states in graphene and oxygen-intercalated graphene grown on SiC(0001) using Bayesian spectroscopy. To realize highly accurate spectral decomposition of the XPS spectra, we proposed a framework for discovering physical constraints from the absence of prior quantified physical knowledge, in which we designed the prior probabilities base… ▽ More We analyzed the X-ray photoemission spectra (XPS) of carbon 1s states in graphene and oxygen-intercalated graphene grown on SiC(0001) using Bayesian spectroscopy. To realize highly accurate spectral decomposition of the XPS spectra, we proposed a framework for discovering physical constraints from the absence of prior quantified physical knowledge, in which we designed the prior probabilities based on the found constraints and the physically required conditions. This suppresses the exchange of peak components during replica exchange Monte Carlo iterations and makes possible to decompose XPS in the case where a reliable structure model or a presumable number of components is not known. As a result, we have successfully decomposed XPS of one monolayer (1ML), two monolayers (2ML), and quasi-freestanding 2ML (qfs-2ML) graphene samples deposited on SiC substrates with the meV order precision of the binding energy, in which the posterior probability distributions of the binding energies were obtained distinguishably between the different components of buffer layer even though they are observed as hump and shoulder structures because of their overlap**. △ Less

Submitted 6 June, 2023; originally announced June 2023.

arXiv:2304.06522 [pdf, other]

Signal identification without signal formulation

Authors: Yoh-ichi Mototake, Y-h. Taguchi

Abstract: When there are signals and noises, physicists try to identify signals by modeling them, whereas statisticians oppositely try to model noise to identify signals. In this study, we applied the statisticians' concept of signal detection of physics data with small-size samples and high dimensions without modeling the signals. Most of the data in nature, whether noises or signals, are assumed to be gen… ▽ More When there are signals and noises, physicists try to identify signals by modeling them, whereas statisticians oppositely try to model noise to identify signals. In this study, we applied the statisticians' concept of signal detection of physics data with small-size samples and high dimensions without modeling the signals. Most of the data in nature, whether noises or signals, are assumed to be generated by dynamical systems; thus, there is essentially no distinction between these generating processes. We propose that the correlation length of a dynamical system and the number of samples are crucial for the practical definition of noise variables among the signal variables generated by such a system. Since variables with short-term correlations reach normal distributions faster as the number of samples decreases, they are regarded to be ``noise-like'' variables, whereas variables with opposite properties are ``signal-like'' variables. Normality tests are not effective for data of small-size samples with high dimensions. Therefore, we modeled noises on the basis of the property of a noise variable, that is, the uniformity of the histogram of the probability that a variable is a noise. We devised a method of detecting signal variables from the structural change of the histogram according to the decrease in the number of samples. We applied our method to the data generated by globally coupled map, which can produce time series data with different correlation lengths, and also applied to gene expression data, which are typical static data of small-size samples with high dimensions, and we successfully detected signal variables from them. Moreover, we verified the assumption that the gene expression data also potentially have a dynamical system as their generation model, and found that the assumption is compatible with the results of signal extraction. △ Less

Submitted 13 April, 2023; originally announced April 2023.

Comments: 22 pages, 16 figures

arXiv:2204.13912 [pdf, ps, other]

Quantitative Prediction of Fracture Toughness $(K_{{\rm I}c})$ of Polymer by Fractography Using Deep Neural Networks

Authors: Yoh-ichi Mototake, Kaita Ito, Masahiko Demura

Abstract: Fracture surfaces provide various types of information about fracture. The fracture toughness $K_{{\rm I}c}$, which represents the resistance to fracture, can be estimated using the three-dimensional (3D) information of a fracture surface, i.e., its roughness. However, this is time-consuming and expensive to obtain the 3D information of a fracture surface; thus, it is desirable to estimate… ▽ More Fracture surfaces provide various types of information about fracture. The fracture toughness $K_{{\rm I}c}$, which represents the resistance to fracture, can be estimated using the three-dimensional (3D) information of a fracture surface, i.e., its roughness. However, this is time-consuming and expensive to obtain the 3D information of a fracture surface; thus, it is desirable to estimate $K_{{\rm I}c}$ from a two-dimensional (2D) image, which can be easily obtained. In recent years, methods of estimating a 3D structure from its 2D image using deep learning have been rapidly developed. In this study, we propose a framework for fractography that directly estimates $K_{{\rm I}c}$ from a 2D fracture surface image using deep neural networks (DNNs). Typically, image recognition using a DNN requires a tremendous amount of image data, which is difficult to acquire for fractography owing to the high experimental cost. To compensate for the limited data, in this study, we used the transfer learning (TL) method, and constructed high-performance prediction models even with a small dataset by transferring machine learning models trained using other large datasets. We found that the regression model obtained using our proposed framework can predict $K_{{\rm I}c}$ in the range of approximately 1-5 [MPa$\sqrt{m}$] with a standard deviation of the estimation error of approximately $\pm$0.37 [MPa$\sqrt{m}$]. The present results demonstrate that the DNN trained with TL opens a new route for quantitative fractography by which parameters of fracture process can be estimated from a fracture surface even with a small dataset. The proposed framework also enables the building of regression models in a few hours. Therefore, our framework enables us to screen a large number of image datasets available in the field of materials science and find candidates that are worth expensive machine learning analysis. △ Less

Submitted 29 April, 2022; originally announced April 2022.

Comments: 13 pages, 4 figures

arXiv:2204.12194 [pdf, other]

Procedure to Reveal the Mechanism of Pattern Formation Process by Topological Data Analysis

Authors: Yoh-ichi Mototake, Masaichiro Mizumaki, Kazue Kudo, Kenji Fukumizu

Abstract: Topological data analysis (TDA) is a versatile tool that can be used to extract scientific knowledge from complex pattern formation processes. However, the physics correspondence between the features obtained from TDA and pattern dynamics does not agree one-to-one, and the physical interpretation of the TDA features needs to be set appropriately according to the phenomenon to be analyzed. In this… ▽ More Topological data analysis (TDA) is a versatile tool that can be used to extract scientific knowledge from complex pattern formation processes. However, the physics correspondence between the features obtained from TDA and pattern dynamics does not agree one-to-one, and the physical interpretation of the TDA features needs to be set appropriately according to the phenomenon to be analyzed. In this study, we propose an analytical procedure to physically interpret pattern dynamics through TDA and machine learning techniques. The proposed procedure was applied to the process of magnetic domain pattern formation to quantify non-trivial domain pattern classifications and reveal the nature of the underlying dynamics. On the basis of these findings, we also propose a candidate reduction model to understand the nature of magnetic domain formation. △ Less

Submitted 8 July, 2024; v1 submitted 26 April, 2022; originally announced April 2022.

Comments: 54 pages, 19 figures

arXiv:2008.01933 [pdf, other]

Robust phase estimation of Gaussian states in the presence of outlier quantum states

Authors: Yukito Mototake, Jun Suzuki

Abstract: In this paper, we investigate the problem of estimating the phase of a coherent state in the presence of unavoidable noisy quantum states. These unwarranted quantum states are represented by outlier quantum states in this study. We first present a statistical framework of robust statistics in a quantum system to handle outlier quantum states. We then apply the method of M-estimators to suppress un… ▽ More In this paper, we investigate the problem of estimating the phase of a coherent state in the presence of unavoidable noisy quantum states. These unwarranted quantum states are represented by outlier quantum states in this study. We first present a statistical framework of robust statistics in a quantum system to handle outlier quantum states. We then apply the method of M-estimators to suppress untrusted measurement outcomes due to outlier quantum states. Our proposal has the advantage over the classical methods in being systematic, easy to implement, and robust against occurrence of noisy states. △ Less

Submitted 5 August, 2020; originally announced August 2020.

Comments: 15 pages, 12 figures. Accepted version

arXiv:2001.00111 [pdf, other]

doi 10.1103/PhysRevE.103.033303

Interpretable Conservation Law Estimation by Deriving the Symmetries of Dynamics from Trained Deep Neural Networks

Authors: Yoh-ichi Mototake

Abstract: Understanding complex systems with their reduced model is one of the central roles in scientific activities. Although physics has greatly been developed with the physical insights of physicists, it is sometimes challenging to build a reduced model of such complex systems on the basis of insights alone. We propose a novel framework that can infer the hidden conservation laws of a complex system fro… ▽ More Understanding complex systems with their reduced model is one of the central roles in scientific activities. Although physics has greatly been developed with the physical insights of physicists, it is sometimes challenging to build a reduced model of such complex systems on the basis of insights alone. We propose a novel framework that can infer the hidden conservation laws of a complex system from deep neural networks (DNNs) that have been trained with physical data of the system. The purpose of the proposed framework is not to analyze physical data with deep learning, but to extract interpretable physical information from trained DNNs. With Noether's theorem and by an efficient sampling method, the proposed framework infers conservation laws by extracting symmetries of dynamics from trained DNNs. The proposed framework is developed by deriving the relationship between a manifold structure of time-series dataset and the necessary conditions for Noether's theorem. The feasibility of the proposed framework has been verified in some primitive cases for which the conservation law is well known. We also apply the proposed framework to conservation law estimation for a more practical case that is a large-scale collective motion system in the metastable state, and we obtain a result consistent with that of a previous study. △ Less

Submitted 18 April, 2020; v1 submitted 31 December, 2019; originally announced January 2020.

Comments: 38 pages, 8 figures

Journal ref: Phys. Rev. E 103, 033303 (2021)

arXiv:1906.04868 [pdf, other]

Semi-flat minima and saddle points by embedding neural networks to overparameterization

Authors: Kenji Fukumizu, Shoichiro Yamaguchi, Yoh-ichi Mototake, Mirai Tanaka

Abstract: We theoretically study the landscape of the training error for neural networks in overparameterized cases. We consider three basic methods for embedding a network into a wider one with more hidden units, and discuss whether a minimum point of the narrower network gives a minimum or saddle point of the wider one. Our results show that the networks with smooth and ReLU activation have different part… ▽ More We theoretically study the landscape of the training error for neural networks in overparameterized cases. We consider three basic methods for embedding a network into a wider one with more hidden units, and discuss whether a minimum point of the narrower network gives a minimum or saddle point of the wider one. Our results show that the networks with smooth and ReLU activation have different partially flat landscapes around the embedded point. We also relate these results to a difference of their generalization abilities in overparameterized realization. △ Less

Submitted 14 June, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

Comments: 38 pages, 4 figures

arXiv:1812.05501 [pdf, other]

doi 10.7566/JPSJ.88.044003

Bayesian Spectral Deconvolution Based on Poisson Distribution: Bayesian Measurement and Virtual Measurement Analytics (VMA)

Authors: Kenji Nagata, Yoh-ichi Mototake, Rei Muraoka, Takehiko Sasaki, Masato Okada

Abstract: In this paper, we propose a new method of Bayesian measurement for spectral deconvolution, which regresses spectral data into the sum of unimodal basis function such as Gaussian or Lorentzian functions. Bayesian measurement is a framework for considering not only the target physical model but also the measurement model as a probabilistic model, and enables us to estimate the parameter of a physica… ▽ More In this paper, we propose a new method of Bayesian measurement for spectral deconvolution, which regresses spectral data into the sum of unimodal basis function such as Gaussian or Lorentzian functions. Bayesian measurement is a framework for considering not only the target physical model but also the measurement model as a probabilistic model, and enables us to estimate the parameter of a physical model with its confidence interval through a Bayesian posterior distribution given a measurement data set. The measurement with Poisson noise is one of the most effective system to apply our proposed method. Since the measurement time is strongly related to the signal-to-noise ratio for the Poisson noise model, Bayesian measurement with Poisson noise model enables us to clarify the relationship between the measurement time and the limit of estimation. In this study, we establish the probabilistic model with Poisson noise for spectral deconvolution. Bayesian measurement enables us to perform virtual and computer simulation for a certain measurement through the established probabilistic model. This property is called "Virtual Measurement Analytics(VMA)" in this paper. We also show that the relationship between the measurement time and the limit of estimation can be extracted by using the proposed method in a simulation of synthetic data and real data for XPS measurement of MoS$_2$. △ Less

Submitted 11 December, 2018; originally announced December 2018.

Comments: 8 pages, 8 figures

arXiv:1812.01205 [pdf, other]

doi 10.7566/JPSJ.88.034004

Bayesian Hamiltonian Selection in X-ray Photoelectron Spectroscopy

Authors: Yoh-ichi Mototake, Masaichiro Mizumaki, Ichiro Akai, Masato Okada

Abstract: Core-level X-ray photoelectron spectroscopy (XPS) is a useful measurement technique for investigating the electronic states of a strongly correlated electron system. Usually, to extract physical information of a target object from a core-level XPS spectrum, we need to set an effective Hamiltonian by physical consideration so as to express complicated electron-to-electron interactions in the transi… ▽ More Core-level X-ray photoelectron spectroscopy (XPS) is a useful measurement technique for investigating the electronic states of a strongly correlated electron system. Usually, to extract physical information of a target object from a core-level XPS spectrum, we need to set an effective Hamiltonian by physical consideration so as to express complicated electron-to-electron interactions in the transition of core-level XPS, and manually tune the physical parameters of the effective Hamiltonian so as to represent the XPS spectrum. Then, we can extract physical information from the tuned parameters. In this paper, we propose an automated method for analyzing core-level XPS spectra based on the Bayesian model selection framework, which selects the effective Hamiltonian and estimates its parameters automatically. The Bayesian model selection, which often has a large computational cost, was carried out by the exchange Monte Carlo sampling method. By applying our proposed method to the 3$d$ core-level XPS spectra of Ce and La compounds, we confirmed that our proposed method selected an effective Hamiltonian and estimated its parameters appropriately; these results were consistent with conventional knowledge obtained from physical studies. Moreover, using our proposed method, we can also evaluate the uncertainty of its estimation values and clarify why the effective Hamiltonian was selected. Such information is difficult to obtain by the conventional analysis method. △ Less

Submitted 3 December, 2018; originally announced December 2018.

Comments: 15page, 10 figures

arXiv:1812.00718 [pdf, other]

Finding Continuity and Discontinuity in Fish Schools via Integrated Information Theory

Authors: Takayuki Niizato, Kotaro Sakamoto, Yoh-ichi Mototake, Takenori Tomaru, Tomotaro Hoshika, Toshiki Fukushima

Abstract: Collective behaviour is known to be the result of diverse dynamics and is sometimes likened to a living system. Although many studies have revealed the dynamics of various collective behaviours, their main focus was on the information process inside the collective, not on the whole system itself. For example, the qualitative difference between two elements and three elements as a system has rarely… ▽ More Collective behaviour is known to be the result of diverse dynamics and is sometimes likened to a living system. Although many studies have revealed the dynamics of various collective behaviours, their main focus was on the information process inside the collective, not on the whole system itself. For example, the qualitative difference between two elements and three elements as a system has rarely been investigated. Tononi et al. have proposed Integrated Information Theory (IIT) to measure the degree of consciousness $Φ$. IIT postulates that the amount of information loss caused by certain partitions is equivalent to the degree of information integration in the system. This measure is not only useful for estimating the degree of consciousness but can also be applied to more general network systems. Here we applied IIT (in particular, IIT 3.0 using PyPhi) to analyse real fish schools ({\it Plecoglossus altivelis}). Our hypothesis in this study is a very simple one: a living system evolves to raise its $Φ$ value. If we accept this hypothesis, IIT reveals the existence of continuous and discontinuous properties as group size varies. For example, leadership in the fish school emerged for a school size of four or above; but not below three. Furthermore, this transition was not observed by measuring mutual information or in a simple Boids model. This result suggests that integrated information $Φ$ can reveal some inherent properties which cannot be observed using other measures. We also discuss how the fish recognition of the figure-ground relation, that is, what determines the relevant ON and OFF states, may reveal various optimal paths for obtaining the functional evolution of collective behaviour. △ Less

Submitted 3 December, 2018; originally announced December 2018.

Showing 1–12 of 12 results for author: Mototake, Y