Search | arXiv e-print repository

doi 10.1038/nature24471

A gravitational-wave standard siren measurement of the Hubble constant

Authors: B. P. Abbott, R. Abbott, T. D. Abbott, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, V. B. Adya, C. Affeldt, M. Afrough, B. Agarwal, M. Agathos, K. Agatsuma, N. Aggarwal, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, B. Allen, G. Allen, A. Allocca, P. A. Altin, A. Amato , et al. (1289 additional authors not shown)

Abstract: The detection of GW170817 in both gravitational waves and electromagnetic waves heralds the age of gravitational-wave multi-messenger astronomy. On 17 August 2017 the Advanced LIGO and Virgo detectors observed GW170817, a strong signal from the merger of a binary neutron-star system. Less than 2 seconds after the merger, a gamma-ray burst (GRB 170817A) was detected within a region of the sky consi… ▽ More The detection of GW170817 in both gravitational waves and electromagnetic waves heralds the age of gravitational-wave multi-messenger astronomy. On 17 August 2017 the Advanced LIGO and Virgo detectors observed GW170817, a strong signal from the merger of a binary neutron-star system. Less than 2 seconds after the merger, a gamma-ray burst (GRB 170817A) was detected within a region of the sky consistent with the LIGO-Virgo-derived location of the gravitational-wave source. This sky region was subsequently observed by optical astronomy facilities, resulting in the identification of an optical transient signal within $\sim 10$ arcsec of the galaxy NGC 4993. These multi-messenger observations allow us to use GW170817 as a standard siren, the gravitational-wave analog of an astronomical standard candle, to measure the Hubble constant. This quantity, which represents the local expansion rate of the Universe, sets the overall scale of the Universe and is of fundamental importance to cosmology. Our measurement combines the distance to the source inferred purely from the gravitational-wave signal with the recession velocity inferred from measurements of the redshift using electromagnetic data. This approach does not require any form of cosmic "distance ladder;" the gravitational wave analysis can be used to estimate the luminosity distance out to cosmological scales directly, without the use of intermediate astronomical distance measurements. We determine the Hubble constant to be $70.0^{+12.0}_{-8.0} \, \mathrm{km} \, \mathrm{s}^{-1} \, \mathrm{Mpc}^{-1}$ (maximum a posteriori and 68% credible interval). This is consistent with existing measurements, while being completely independent of them. Additional standard-siren measurements from future gravitational-wave sources will provide precision constraints of this important cosmological parameter. △ Less

Submitted 16 October, 2017; originally announced October 2017.

Comments: 26 pages, 5 figures, Nature in press. For more information see https://dcc.ligo.org/LIGO-P1700296/public

Report number: LIGO P1700296

arXiv:1710.04942 [pdf, ps, other]

Local rigidity of certain actions of nilpotent-by-cyclic groups on the sphere

Authors: Mao Okada

Abstract: Let G = SU(n,1), n >1 be the orientation-preserving isometry group of the complex hyperbolic space with an Iwasawa decomposition G = KAN. We prove local rigidity of a family of certain actions of a subgroup of AN on the imaginary boundary of the complex hyperbolic space. Let G = SU(n,1), n >1 be the orientation-preserving isometry group of the complex hyperbolic space with an Iwasawa decomposition G = KAN. We prove local rigidity of a family of certain actions of a subgroup of AN on the imaginary boundary of the complex hyperbolic space. △ Less

Submitted 13 October, 2017; originally announced October 2017.

Comments: 32 pages, no figures

arXiv:1710.02327 [pdf, other]

doi 10.1103/PhysRevD.96.122006

First narrow-band search for continuous gravitational waves from known pulsars in advanced detector data

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, B. P. Abbott, R. Abbott, T. D. Abbott, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, V. B. Adya, C. Affeldt, M. Afrough, B. Agarwal, M. Agathos, K. Agatsuma, N. Aggarwal, O. D. Aguiar, L. Aiello, A. Ain, B. Allen, G. Allen, A. Allocca, P. A. Altin , et al. (1074 additional authors not shown)

Abstract: Spinning neutron stars asymmetric with respect to their rotation axis are potential sources of continuous gravitational waves for ground-based interferometric detectors. In the case of known pulsars a fully coherent search, based on matched filtering, which uses the position and rotational parameters obtained from electromagnetic observations, can be carried out. Matched filtering maximizes the si… ▽ More Spinning neutron stars asymmetric with respect to their rotation axis are potential sources of continuous gravitational waves for ground-based interferometric detectors. In the case of known pulsars a fully coherent search, based on matched filtering, which uses the position and rotational parameters obtained from electromagnetic observations, can be carried out. Matched filtering maximizes the signal-to-noise (SNR) ratio, but a large sensitivity loss is expected in case of even a very small mismatch between the assumed and the true signal parameters. For this reason, {\it narrow-band} analyses methods have been developed, allowing a fully coherent search for gravitational waves from known pulsars over a fraction of a hertz and several spin-down values. In this paper we describe a narrow-band search of eleven pulsars using data from Advanced LIGO's first observing run. Although we have found several initial outliers, further studies show no significant evidence for the presence of a gravitational wave signal. Finally, we have placed upper limits on the signal strain amplitude lower than the spin-down limit for 5 of the 11 targets over the bands searched: in the case of J1813-1749 the spin-down limit has been beaten for the first time. For an additional 3 targets, the median upper limit across the search bands is below the spin-down limit. This is the most sensitive narrow-band search for continuous gravitational waves carried out so far. △ Less

Submitted 5 December, 2017; v1 submitted 6 October, 2017; originally announced October 2017.

Comments: 9 Figures, 7 tables, submitted to PRD

Journal ref: Phys. Rev. D 96, 122006 (2017)

arXiv:1709.09660 [pdf]

doi 10.1103/PhysRevLett.119.141101

GW170814: A Three-Detector Observation of Gravitational Waves from a Binary Black Hole Coalescence

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, B. P. Abbott, R. Abbott, T. D. Abbott, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, V. B. Adya, C. Affeldt, M. Afrough, B. Agarwal, M. Agathos, K. Agatsuma, N. Aggarwal, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, B. Allen, G. Allen, A. Allocca , et al. (1085 additional authors not shown)

Abstract: On August 14, 2017 at 10:30:43 UTC, the Advanced Virgo detector and the two Advanced LIGO detectors coherently observed a transient gravitational-wave signal produced by the coalescence of two stellar mass black holes, with a false-alarm-rate of $\lesssim$ 1 in 27000 years. The signal was observed with a three-detector network matched-filter signal-to-noise ratio of 18. The inferred masses of the… ▽ More On August 14, 2017 at 10:30:43 UTC, the Advanced Virgo detector and the two Advanced LIGO detectors coherently observed a transient gravitational-wave signal produced by the coalescence of two stellar mass black holes, with a false-alarm-rate of $\lesssim$ 1 in 27000 years. The signal was observed with a three-detector network matched-filter signal-to-noise ratio of 18. The inferred masses of the initial black holes are $30.5_{-3.0}^{+5.7}$ Msun and $25.3_{-4.2}^{+2.8}$ Msun (at the 90% credible level). The luminosity distance of the source is $540_{-210}^{+130}~\mathrm{Mpc}$, corresponding to a redshift of $z=0.11_{-0.04}^{+0.03}$. A network of three detectors improves the sky localization of the source, reducing the area of the 90% credible region from 1160 deg$^2$ using only the two LIGO detectors to 60 deg$^2$ using all three detectors. For the first time, we can test the nature of gravitational wave polarizations from the antenna response of the LIGO-Virgo network, thus enabling a new class of phenomenological tests of gravity. △ Less

Submitted 13 October, 2017; v1 submitted 27 September, 2017; originally announced September 2017.

Journal ref: Phys. Rev. Lett. 119, 141101 (2017)

arXiv:1709.08156 [pdf, ps, other]

Randomness-induced quantum spin liquid on honeycomb lattice

Authors: H. Yamaguchi, M. Okada, Y. Kono, S. Kittaka, T. Sakakibara, T. Okabe, Y. Iwasaki, Y. Hosokoshi

Abstract: We present a quantu spin liquid state in a spin-1/2 honeycomb lattice with randomness in the exchange interaction. That is, we successfully introduce randomness into the organic radial-based complex and realize a random-singlet (RS) state. All magnetic and thermodynamic experimental results indicate the liquid-like behaviors, which are consistent with those expected in the RS state. These results… ▽ More We present a quantu spin liquid state in a spin-1/2 honeycomb lattice with randomness in the exchange interaction. That is, we successfully introduce randomness into the organic radial-based complex and realize a random-singlet (RS) state. All magnetic and thermodynamic experimental results indicate the liquid-like behaviors, which are consistent with those expected in the RS state. These results demonstrate that the randomness or inhomogeneity in the actual systems stabilize the RS state and yield liquid-like behavior. △ Less

Submitted 24 September, 2017; originally announced September 2017.

arXiv:1707.02050 [pdf, ps, other]

doi 10.7566/JPSJ.87.044802

Exhaustive search for sparse variable selection in linear regression

Authors: Yasuhiko Igarashi, Hikaru Takenaka, Yoshinori Nakanishi-Ohno, Makoto Uemura, Shiro Ikeda, Masato Okada

Abstract: We propose a K-sparse exhaustive search (ES-K) method and a K-sparse approximate exhaustive search method (AES-K) for selecting variables in linear regression. With these methods, K-sparse combinations of variables are tested exhaustively assuming that the optimal combination of explanatory variables is K-sparse. By collecting the results of exhaustively computing ES-K, various approximate methods… ▽ More We propose a K-sparse exhaustive search (ES-K) method and a K-sparse approximate exhaustive search method (AES-K) for selecting variables in linear regression. With these methods, K-sparse combinations of variables are tested exhaustively assuming that the optimal combination of explanatory variables is K-sparse. By collecting the results of exhaustively computing ES-K, various approximate methods for selecting sparse variables can be summarized as density of states. With this density of states, we can compare different methods for selecting sparse variables such as relaxation and sampling. For large problems where the combinatorial explosion of explanatory variables is crucial, the AES-K method enables density of states to be effectively reconstructed by using the replica-exchange Monte Carlo method and the multiple histogram method. Applying the ES-K and AES-K methods to type Ia supernova data, we confirmed the conventional understanding in astronomy when an appropriate K is given beforehand. However, we found the difficulty to determine K from the data. Using virtual measurement and analysis, we argue that this is caused by data shortage. △ Less

Submitted 7 July, 2017; originally announced July 2017.

Comments: 19pages, 3 figures

MSC Class: 62-07; 62Jxx

arXiv:1706.09597 [pdf, other]

Path Integral Networks: End-to-End Differentiable Optimal Control

Authors: Masashi Okada, Luca Rigazio, Takenobu Aoshima

Abstract: In this paper, we introduce Path Integral Networks (PI-Net), a recurrent network representation of the Path Integral optimal control algorithm. The network includes both system dynamics and cost models, used for optimal control based planning. PI-Net is fully differentiable, learning both dynamics and cost models end-to-end by back-propagation and stochastic gradient descent. Because of this, PI-N… ▽ More In this paper, we introduce Path Integral Networks (PI-Net), a recurrent network representation of the Path Integral optimal control algorithm. The network includes both system dynamics and cost models, used for optimal control based planning. PI-Net is fully differentiable, learning both dynamics and cost models end-to-end by back-propagation and stochastic gradient descent. Because of this, PI-Net can learn to plan. PI-Net has several advantages: it can generalize to unseen states thanks to planning, it can be applied to continuous control tasks, and it allows for a wide variety learning schemes, including imitation and reinforcement learning. Preliminary experiment results show that PI-Net, trained by imitation learning, can mimic control demonstrations for two simulated problems; a linear system and a pendulum swing-up problem. We also show that PI-Net is able to learn dynamics and cost models latent in the demonstrations. △ Less

Submitted 29 June, 2017; originally announced June 2017.

arXiv:1706.06953 [pdf, ps, other]

doi 10.7566/JPSJ.86.024002

Statistical Mechanics of Node-perturbation Learning with Noisy Baseline

Authors: Kazuyuki Hara, Kentaro Katahira, Masato Okada

Abstract: Node-perturbation learning is a type of statistical gradient descent algorithm that can be applied to problems where the objective function is not explicitly formulated, including reinforcement learning. It estimates the gradient of an objective function by using the change in the object function in response to the perturbation. The value of the objective function for an unperturbed output is call… ▽ More Node-perturbation learning is a type of statistical gradient descent algorithm that can be applied to problems where the objective function is not explicitly formulated, including reinforcement learning. It estimates the gradient of an objective function by using the change in the object function in response to the perturbation. The value of the objective function for an unperturbed output is called a baseline. Cho et al. proposed node-perturbation learning with a noisy baseline. In this paper, we report on building the statistical mechanics of Cho's model and on deriving coupled differential equations of order parameters that depict learning dynamics. We also show how to derive the generalization error by solving the differential equations of order parameters. On the basis of the results, we show that Cho's results are also apply in general cases and show some general performances of Cho's model. △ Less

Submitted 20 June, 2017; originally announced June 2017.

Comments: 16 pages, 7 figures, submitted to JPSJ

Journal ref: Journal of the Physical Society of Japan 86, 024002 (2017)

arXiv:1609.00438 [pdf, ps, other]

doi 10.1103/PhysRevB.94.094421

Quasi-two-dimensional Bose-Einstein condensation of spin triplets in dimerized quantum magnet Ba$_2$CuSi$_2$O$_6$Cl$_2$

Authors: Makiko Okada, Hidekazu Tanaka, Nobuyuki Kurita, Kohei Johmoto, Hidehiro Uekusa, Atsushi Miyake, Masashi Tokunaga, Satoshi Nishimoto, Masaaki Nakamura, Marcelo Jaime, Guillaume Radtke, Andrés Saúl

Abstract: We synthesized single crystals of composition Ba$_2$CuSi$_2$O$_6$Cl$_2$ and investigated its quantum magnetic properties. The crystal structure is closely related to that of the quasi-two-dimensional (2D) dimerized magnet BaCuSi$_2$O$_6$ also known as Han purple. Ba$_2$CuSi$_2$O$_6$Cl$_2$ has a singlet ground state with an excitation gap of $Δ/k_{\rm B}\,{=}\,20.8$ K. The magnetization curves for… ▽ More We synthesized single crystals of composition Ba$_2$CuSi$_2$O$_6$Cl$_2$ and investigated its quantum magnetic properties. The crystal structure is closely related to that of the quasi-two-dimensional (2D) dimerized magnet BaCuSi$_2$O$_6$ also known as Han purple. Ba$_2$CuSi$_2$O$_6$Cl$_2$ has a singlet ground state with an excitation gap of $Δ/k_{\rm B}\,{=}\,20.8$ K. The magnetization curves for two different field directions almost perfectly coincide when normalized by the $g$-factor except for a small jump anomaly for a magnetic field perpendicular to the $c$ axis. The magnetization curve with a nonlinear slope above the critical field is in excellent agreement with exact-diagonalization calculations based on a 2D coupled spin-dimer model. Individual exchange constants are also evaluated using density functional theory (DFT). The DFT results demonstrate a 2D exchange network and weak frustration between interdimer exchange interactions, supported by weak spin-lattice coupling implied from our magnetostriction data. The magnetic-field-induced spin ordering in Ba$_2$CuSi$_2$O$_6$Cl$_2$ is described as the quasi-2D Bose-Einstein condensation of triplets. △ Less

Submitted 1 September, 2016; originally announced September 2016.

Comments: 9 pages, 7 figures, to appear in Phys. Rev. B

Journal ref: Phys. Rev. B 94, 094421 (2016)

arXiv:1608.06124 [pdf, other]

doi 10.1103/PhysRevB.95.165112

Ultrafast Melting of Spin Density Wave Order in BaFe$_{2}$As$_{2}$ Observed by Time- and Angle-Resolved Photoemission Spectroscopy with Extreme-Ultraviolet Higher Harmonic Generation

Authors: H. Suzuki, K. Okazaki, T. Yamamoto, T. Someya, M. Okada, K. Koshiishi, M. Fujisawa, T. Kanai, N. Ishii, M. Nakajima, H. Eisaki, K. Ono, H. Kumigashira, J. Itatani, A. Fujimori, S. Shin

Abstract: Transient single-particle spectral function of BaFe$_{2}$As$_{2}$, a parent compound of iron-based superconductors, has been studied by time- and angle-resolved photoemission spectroscopy with an extreme-ultraviolet laser generated by higher harmonics from Ar gas, which enables us to investigate the dynamics in the entire Brillouin zone. We observed electronic modifications from the spin-density-w… ▽ More Transient single-particle spectral function of BaFe$_{2}$As$_{2}$, a parent compound of iron-based superconductors, has been studied by time- and angle-resolved photoemission spectroscopy with an extreme-ultraviolet laser generated by higher harmonics from Ar gas, which enables us to investigate the dynamics in the entire Brillouin zone. We observed electronic modifications from the spin-density-wave (SDW) ordered state within $\sim$ 1 ps after the arrival of a 1.5 eV pump pulse. We observed optically excited electrons at the zone center above $E_{F}$ at 0.12 ps, and their rapid decay. After the fast decay of the optically excited electrons, a thermalized state appears and survives for a relatively long time. From the comparison with the density-functional theory band structure for the paramagnetic and SDW states, we interpret the experimental observations as the melting of the SDW. Exponential decay constants for the thermalized state to recover back to the SDW ground state are $\sim$ 0.60 ps both around the zone center and the zone corner. △ Less

Submitted 22 August, 2016; originally announced August 2016.

Journal ref: Phys. Rev. B 95, 165112 (2017)

arXiv:1607.07590 [pdf, ps, other]

doi 10.7566/JPSJ.86.024001

Simultaneous Estimation of Noise Variance and Number of Peaks in Bayesian Spectral Deconvolution

Authors: Satoru Tokuda, Kenji Nagata, Masato Okada

Abstract: The heuristic identification of peaks from noisy complex spectra often leads to misunderstanding of the physical and chemical properties of matter. In this paper, we propose a framework based on Bayesian inference, which enables us to separate multipeak spectra into single peaks statistically and consists of two steps. The first step is estimating both the noise variance and the number of peaks as… ▽ More The heuristic identification of peaks from noisy complex spectra often leads to misunderstanding of the physical and chemical properties of matter. In this paper, we propose a framework based on Bayesian inference, which enables us to separate multipeak spectra into single peaks statistically and consists of two steps. The first step is estimating both the noise variance and the number of peaks as hyperparameters based on Bayes free energy, which generally is not analytically tractable. The second step is fitting the parameters of each peak function to the given spectrum by calculating the posterior density, which has a problem of local minima and saddles since multipeak models are nonlinear and hierarchical. Our framework enables the escape from local minima or saddles by using the exchange Monte Carlo method and calculates Bayes free energy via the multiple histogram method. We discuss a simulation demonstrating how efficient our framework is and show that estimating both the noise variance and the number of peaks prevents overfitting, overpenalizing, and misunderstanding the precision of parameter estimation. △ Less

Submitted 15 December, 2016; v1 submitted 26 July, 2016; originally announced July 2016.

arXiv:1607.07189 [pdf, ps, other]

doi 10.7566/JPSJ.85.093702

Compressed sensing in scanning tunneling microscopy/spectroscopy for observation of quasi-particle interference

Authors: Yoshinori Nakanishi-Ohno, Masahiro Haze, Yasuo Yoshida, Koji Hukushima, Yukio Hasegawa, Masato Okada

Abstract: We applied a method of compressed sensing to the observation of quasi-particle interference (QPI) by scanning tunneling microscopy/spectroscopy to improve efficiency and save measurement time. To solve an ill-posed problem owing to the scarcity of data, the compressed sensing utilizes the sparseness of QPI patterns in momentum space. We examined the performance of a sparsity-inducing algorithm cal… ▽ More We applied a method of compressed sensing to the observation of quasi-particle interference (QPI) by scanning tunneling microscopy/spectroscopy to improve efficiency and save measurement time. To solve an ill-posed problem owing to the scarcity of data, the compressed sensing utilizes the sparseness of QPI patterns in momentum space. We examined the performance of a sparsity-inducing algorithm called least absolute shrinkage and selection operator (LASSO), and demonstrated that LASSO enables us to recover a double-circle QPI pattern of the Ag(111) surface from a dataset whose size is less than that necessary for the conventional Fourier transformation method. In addition, the smallest number of data required for the recovery is discussed on the basis of cross validation. △ Less

Submitted 25 July, 2016; originally announced July 2016.

arXiv:1510.02189 [pdf, ps, other]

doi 10.1088/1742-5468/2016/06/063302

Sparse approximation based on a random overcomplete basis

Authors: Yoshinori Nakanishi-Ohno, Tomoyuki Obuchi, Masato Okada, Yoshiyuki Kabashima

Abstract: We discuss a strategy of sparse approximation that is based on the use of an overcomplete basis, and evaluate its performance when a random matrix is used as this basis. A small combination of basis vectors is chosen from a given overcomplete basis, according to a given compression rate, such that they compactly represent the target data with as small a distortion as possible. As a selection metho… ▽ More We discuss a strategy of sparse approximation that is based on the use of an overcomplete basis, and evaluate its performance when a random matrix is used as this basis. A small combination of basis vectors is chosen from a given overcomplete basis, according to a given compression rate, such that they compactly represent the target data with as small a distortion as possible. As a selection method, we study the $\ell_0$- and $\ell_1$-based methods, which employ the exhaustive search and $\ell_1$-norm regularization techniques, respectively. The performance is assessed in terms of the trade-off relation between the representation distortion and the compression rate. First, we evaluate the performance analytically in the case that the methods are carried out ideally, using methods of statistical mechanics. Our result clarifies the fact that the $\ell_0$-based method greatly outperforms the $\ell_1$-based one. Second, we examine the practical performances of two well-known algorithms, orthogonal matching pursuit and approximate message passing, when they are used to execute the $\ell_0$- and $\ell_1$-based methods, respectively. Our examination shows that orthogonal matching pursuit achieves a much better performance than the exact execution of the $\ell_1$-based method, as well as approximate message passing. However, regarding the $\ell_0$-based method, there is still room to design more effective greedy algorithms than orthogonal matching pursuit. Finally, we evaluate the performances of the algorithms when they are applied to image data compression. △ Less

Submitted 2 March, 2016; v1 submitted 7 October, 2015; originally announced October 2015.

Comments: 35 pages, 11 figures

arXiv:1509.04091 [pdf, ps, other]

Circular Symmetrization, Subordination and Arclength problems on Convex Functions

Authors: Mari Okada, Saminathan Ponnusamy, Allu Vasudevarao, Hiroshi Yanagihara

Abstract: We study the class ${\mathcal C}(Ω)$ of univalent analytic functions $f$ in the unit disk $\mathbb{D} = \{z \in \mathbb{C} :\,|z|<1 \}$ of the form $f(z)=z+\sum_{n=2}^{\infty}a_n z^n$ satisfying \[ 1+\frac{zf"(z)}{f'(z)} \in Ω, \quad z\in \mathbb{D}, \] where $Ω$ will be a proper subdomain of ${\mathbb C}$ which is starlike with respect to $1 (\in Ω)$. Let $φ_Ω$ be the unique conformal map** of… ▽ More We study the class ${\mathcal C}(Ω)$ of univalent analytic functions $f$ in the unit disk $\mathbb{D} = \{z \in \mathbb{C} :\,|z|<1 \}$ of the form $f(z)=z+\sum_{n=2}^{\infty}a_n z^n$ satisfying \[ 1+\frac{zf"(z)}{f'(z)} \in Ω, \quad z\in \mathbb{D}, \] where $Ω$ will be a proper subdomain of ${\mathbb C}$ which is starlike with respect to $1 (\in Ω)$. Let $φ_Ω$ be the unique conformal map** of ${\mathbb D}$ onto $Ω$ with $φ_Ω(0)=1$ and $φ_Ω'(0) > 0$ and $ k_Ω(z) = \int_0^z \exp \left(\int_0^t ζ^{-1} (φ_Ω(ζ) -1) \, d ζ\right) \, dt$. Let $L_r(f)$ denote the arclength of the image of the circle $\{z \in \mathbb{C} : \, |z|=r\}$, $r\in (0,1)$. The first result in this paper is an inequality $L_r(f) \leq L_r(k_Ω)$ for $f \in \mathcal{C} (Ω)$, which solves the general extremal problem $\max_{f \in {\mathcal C}(Ω)} L_r(f)$, and contains many other well-known results of the previous authors as special cases. Other results of this article cover another set of related problems about integral means in the general setting of the class ${\mathcal C}(Ω)$. △ Less

Submitted 14 September, 2015; originally announced September 2015.

Comments: This is appear in Mathematische Nachrichten

MSC Class: 30C45

arXiv:1506.06364 [pdf, other]

doi 10.1088/0952-4746/36/1/49

Measurement and comparison of individual external doses of high-school students living in Japan, France, Poland and Belarus -- the "D-shuttle" project --

Authors: N. Adachi, V. Adamovitch, Y. Adjovi, K. Aida, H. Akamatsu, S. Akiyama, A. Akli, A. Ando, T. Andrault, H. Antonietti, S. Anzai, G. Arkoun, C. Avenoso, D. Ayrault, M. Banasiewicz, M. Banaśkiewicz, L. Bernandini, E. Bernard, E. Berthet, M. Blanchard, D. Boreyko, K. Boros, S. Charron, P. Cornette, K. Czerkas , et al. (208 additional authors not shown)

Abstract: Twelve high schools in Japan (of which six are in Fukushima Prefecture), four in France, eight in Poland and two in Belarus cooperated in the measurement and comparison of individual external doses in 2014. In total 216 high-school students and teachers participated in the study. Each participant wore an electronic personal dosimeter "D-shuttle" for two weeks, and kept a journal of his/her whereab… ▽ More Twelve high schools in Japan (of which six are in Fukushima Prefecture), four in France, eight in Poland and two in Belarus cooperated in the measurement and comparison of individual external doses in 2014. In total 216 high-school students and teachers participated in the study. Each participant wore an electronic personal dosimeter "D-shuttle" for two weeks, and kept a journal of his/her whereabouts and activities. The distributions of annual external doses estimated for each region overlap with each other, demonstrating that the personal external individual doses in locations where residence is currently allowed in Fukushima Prefecture and in Belarus are well within the range of estimated annual doses due to the background radiation level of other regions/countries. △ Less

Submitted 18 November, 2015; v1 submitted 21 June, 2015; originally announced June 2015.

arXiv:1412.7850 [pdf, ps, other]

doi 10.7566/JPSJ.84.033001

Correspondence between phase oscillator network and classical XY model with the same infinite-range interaction in statics

Authors: T. Uezu, T. Kimoto, S. Kiyokawa, M. Okada

Abstract: We study the phase oscillator networks with distributed natural frequencies and classical XY models both of which have a class of infinite-range interactions in common. We find that the integral kernel of the self-consistent equations (SCEs) for oscillator networks correspond to that of the saddle point equations (SPEs) for XY models, and that the quenched randomness (distributed natural frequenci… ▽ More We study the phase oscillator networks with distributed natural frequencies and classical XY models both of which have a class of infinite-range interactions in common. We find that the integral kernel of the self-consistent equations (SCEs) for oscillator networks correspond to that of the saddle point equations (SPEs) for XY models, and that the quenched randomness (distributed natural frequencies) corresponds to thermal noise. We find a sufficient condition that the probability density of natural frequency distributions is one-humped in order that the kernel in the oscillator network is strictly decreasing as that in the XY model. Furthermore, taking the uniform and Mexican-hat type interactions, we prove the one to one correspondence between the solutions of the SCEs and SPEs. As an application of the correspondence, we study the associative memory type interaction. In the XY model with this interaction, there exists a peculiar one-parameter family of solutions. For the oscillator network, we find a non-trivial solution, i.e., a limit cycle oscillation. △ Less

Submitted 25 December, 2014; originally announced December 2014.

Comments: 13 pages, 2 figures

arXiv:1405.2165 [pdf, ps, other]

doi 10.7566/JPSJ.83.124004

Oscillations in Spurious States of the Associative Memory Model with Synaptic Depression

Authors: Shin Murata, Yosuke Otsubo, Kenji Nagata, Masato Okada

Abstract: The associative memory model is a typical neural network model, which can store discretely distributed fixed-point attractors as memory patterns. When the network stores the memory patterns extensively, however, the model has other attractors besides the memory patterns. These attractors are called spurious memories. Both spurious states and memory states are equilibrium, so there is little differ… ▽ More The associative memory model is a typical neural network model, which can store discretely distributed fixed-point attractors as memory patterns. When the network stores the memory patterns extensively, however, the model has other attractors besides the memory patterns. These attractors are called spurious memories. Both spurious states and memory states are equilibrium, so there is little difference between their dynamics. Recent physiological experiments have shown that short-term dynamic synapse called synaptic depression decreases its transmission efficacy to postsynaptic neurons according to the activities of presynaptic neurons. Previous studies have shown that synaptic depression induces oscillation in the network and decreases the storage capacity at finite temperature. How synaptic depression affects spurious states, however, is still unclear. We investigate the effect of synaptic depression on spurious states through Monte Carlo simulation. The results demonstrate that synaptic depression does not affect the memory states but mainly destabilizes the spurious states and induces the periodic oscillations. △ Less

Submitted 9 May, 2014; originally announced May 2014.

Comments: 17 pages, 9 figures

arXiv:1404.2033 [pdf, ps, other]

doi 10.7566/JPSJ.83.103701

Almost Perfect Frustration in the Dimer Magnet Ba$_2$CoSi$_2$O$_6$Cl$_2$

Authors: Hidekazu Tanaka, Nobuyuki Kurita, Makiko Okada, Eiji Kunihiro, Yutaka Shirata, Kotaro Fujii, Hidehiro Uekusa, Akira Matsuo, Koichi Kindo, Hiroyuki Nojiri

Abstract: We determined the crystal structure of Ba$_2$CoSi$_2$O$_6$Cl$_2$, which was synthesized in this work, and investigated its quantum magnetic properties using single crystals. This compound should be described as a two-dimensionally coupled spin-1/2 XY-like spin dimer system. Ba$_2$CoSi$_2$O$_6$Cl$_2$ exhibits a stepwise magnetization process with a plateau at half of the saturation magnetization, i… ▽ More We determined the crystal structure of Ba$_2$CoSi$_2$O$_6$Cl$_2$, which was synthesized in this work, and investigated its quantum magnetic properties using single crystals. This compound should be described as a two-dimensionally coupled spin-1/2 XY-like spin dimer system. Ba$_2$CoSi$_2$O$_6$Cl$_2$ exhibits a stepwise magnetization process with a plateau at half of the saturation magnetization, irrespective of the field direction, although all the Co$^{2+}$ sites are equivalent. This indicates that spin triplets are localized owing to the almost perfect frustration of interdimer exchange interactions. Thus, the spin states for the zero and 1/2 magnetization-plateau states are almost exactly given by the simple product of singlet dimers and the alternate product of singlet and triplet dimers, respectively. △ Less

Submitted 2 September, 2014; v1 submitted 8 April, 2014; originally announced April 2014.

Comments: 8 pages, 8 figures, published in J. Phys. Soc. Jpn. http://dx.doi.org/10.7566/JPSJ.83.103701

Journal ref: J. Phys. Soc. Jpn., Vol.83, No.10, 2014, Article ID: 103701

arXiv:1310.1202 [pdf, ps, other]

doi 10.7566/JPSCP.3.015045

Analysis of Magnetic Field-Angle Dependent Electronic Raman Scattering to Probe the Superconducting Gap

Authors: Masaru Okada, Nobuhiko Hayashi

Abstract: We study the field-angle resolved electronic Raman scattering in 2-dimensional d-wave superconducting vortex states theoretically by quasi-classical approximation, the so-called Doppler-shift method. An analytic expression is obtained for the field angle dependence of the Raman scattering amplitude at zero temperature. After numerical integration, we obtain the scattering intensity for various fie… ▽ More We study the field-angle resolved electronic Raman scattering in 2-dimensional d-wave superconducting vortex states theoretically by quasi-classical approximation, the so-called Doppler-shift method. An analytic expression is obtained for the field angle dependence of the Raman scattering amplitude at zero temperature. After numerical integration, we obtain the scattering intensity for various field angles by changing the Raman shift energy. Field-angle resolved electronic Raman scattering turns out to be an effective method for probing unconventional superconducting gap structures. It shows a novel phenomenon: reversal of extrema as a function of frequency without changing temperature or field magnitude. △ Less

Submitted 21 January, 2014; v1 submitted 4 October, 2013; originally announced October 2013.

Comments: 6 pages, 5 figures. SCES2013 Proceedings (to be published in JPS Conf. Proc.)

Journal ref: JPS Conf. Proc. 3, 015045 (2014)

arXiv:1304.0670 [pdf]

doi 10.1007/s41114-020-00026-9

Prospects for Observing and Localizing Gravitational-Wave Transients with Advanced LIGO, Advanced Virgo and KAGRA

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, B. P. Abbott, R. Abbott, T. D. Abbott, S. Abraham, F. Acernese, K. Ackley, C. Adams, V. B. Adya, C. Affeldt, M. Agathos, K. Agatsuma, N. Aggarwal, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, G. Allen, A. Allocca, M. A. Aloy, P. A. Altin, A. Amato , et al. (1297 additional authors not shown)

Abstract: We present our current best estimate of the plausible observing scenarios for the Advanced LIGO, Advanced Virgo and KAGRA gravitational-wave detectors over the next several years, with the intention of providing information to facilitate planning for multi-messenger astronomy with gravitational waves. We estimate the sensitivity of the network to transient gravitational-wave signals for the third… ▽ More We present our current best estimate of the plausible observing scenarios for the Advanced LIGO, Advanced Virgo and KAGRA gravitational-wave detectors over the next several years, with the intention of providing information to facilitate planning for multi-messenger astronomy with gravitational waves. We estimate the sensitivity of the network to transient gravitational-wave signals for the third (O3), fourth (O4) and fifth observing (O5) runs, including the planned upgrades of the Advanced LIGO and Advanced Virgo detectors. We study the capability of the network to determine the sky location of the source for gravitational-wave signals from the inspiral of binary systems of compact objects, that is BNS, NSBH, and BBH systems. The ability to localize the sources is given as a sky-area probability, luminosity distance, and comoving volume. The median sky localization area (90\% credible region) is expected to be a few hundreds of square degrees for all types of binary systems during O3 with the Advanced LIGO and Virgo (HLV) network. The median sky localization area will improve to a few tens of square degrees during O4 with the Advanced LIGO, Virgo, and KAGRA (HLVK) network. We evaluate sensitivity and localization expectations for unmodeled signal searches, including the search for intermediate mass black hole binary mergers. △ Less

Submitted 24 November, 2020; v1 submitted 2 April, 2013; originally announced April 2013.

Comments: 52 pages, 9 figures, 5 tables. We have updated the detector sensitivities (including the A+ and AdV+ upgrade); added expectations for binary black-holes, neutron-star black-holes, and intermediate-mass black holes; and added 3D volume localization expectations. This update is to change some numbers in Table 5 based on refinements to the simulations. Three authors were added

Report number: LIGO-P1200087, VIR-0288A-12, JGW-P1808427

Journal ref: Living Rev Relativ 23, 3 (2020)

arXiv:1209.4772 [pdf, ps, other]

doi 10.1103/PhysRevE.99.062132

Statistical mechanical evaluation of spread spectrum watermarking model with image restoration

Authors: Masaki Kawamura, Kao Hayashi, Tatsuya Uezu, Masato Okada

Abstract: In cases in which an original image is blind, a decoding method where both the image and the messages can be estimated simultaneously is desirable. We propose a spread spectrum watermarking model with image restoration based on Bayes estimation. We therefore need to assume some prior probabilities. The probability for estimating the messages is given by the uniform distribution, and the ones for t… ▽ More In cases in which an original image is blind, a decoding method where both the image and the messages can be estimated simultaneously is desirable. We propose a spread spectrum watermarking model with image restoration based on Bayes estimation. We therefore need to assume some prior probabilities. The probability for estimating the messages is given by the uniform distribution, and the ones for the image are given by the infinite range model and 2D Ising model. Any attacks from unauthorized users can be represented by channel models. We can obtain the estimated messages and image by maximizing the posterior probability. We analyzed the performance of the proposed method by the replica method in the case of the infinite range model. We first calculated the theoretical values of the bit error rate from obtained saddle point equations and then verified them by computer simulations. For this purpose, we assumed that the image is binary and is generated from a given prior probability. We also assume that attacks can be represented by the Gaussian channel. The computer simulation retults agreed with the theoretical values. In the case of prior probability given by the 2D Ising model, in which each pixel is statically connected with four-neighbors, we evaluated the decoding performance by computer simulations, since the replica theory could not be applied. Results using the 2D Ising model showed that the proposed method with image restoration is as effective as the infinite range model for decoding messages. We compared the performances in a case in which the image was blind and one in which it was informed. The difference between these cases was small as long as the embedding and attack rates were small. This demonstrates that the proposed method with simultaneous estimation is effective as a watermarking decoder. △ Less

Submitted 26 June, 2019; v1 submitted 21 September, 2012; originally announced September 2012.

Journal ref: Phys. Rev. E 99, 062132 (2019)

arXiv:1203.1767 [pdf, ps, other]

doi 10.1143/JPSJ.81.073001

Solvable model of a phase oscillator network on a circle with infinite-range Mexican-hat-type interaction

Authors: Tatsuya Uezu, Tomoyuki Kimoto, Masato Okada

Abstract: We describe a solvable model of a phase oscillator network on a circle with infinite-range Mexican-hat-type interaction. We derive self-consistent equations of the order parameters and obtain three non-trivial solutions characterized by the rotation number. We also derive relevant characteristics such as the location-dependent distributions of the resultant frequencies of desynchronized oscillator… ▽ More We describe a solvable model of a phase oscillator network on a circle with infinite-range Mexican-hat-type interaction. We derive self-consistent equations of the order parameters and obtain three non-trivial solutions characterized by the rotation number. We also derive relevant characteristics such as the location-dependent distributions of the resultant frequencies of desynchronized oscillators. Simulation results closely agree with the theoretical ones. △ Less

Submitted 8 March, 2012; originally announced March 2012.

arXiv:1106.2894 [pdf, ps, other]

doi 10.1143/JPSJ.80.084004

Influence of synaptic depression on memory storage capacity

Authors: Yosuke Otsubo, Kenji Nagata, Masafumi Oizumi, Masato Okada

Abstract: Synaptic efficacy between neurons is known to change within a short time scale dynamically. Neurophysiological experiments show that high-frequency presynaptic inputs decrease synaptic efficacy between neurons. This phenomenon is called synaptic depression, a short term synaptic plasticity. Many researchers have investigated how the synaptic depression affects the memory storage capacity. However,… ▽ More Synaptic efficacy between neurons is known to change within a short time scale dynamically. Neurophysiological experiments show that high-frequency presynaptic inputs decrease synaptic efficacy between neurons. This phenomenon is called synaptic depression, a short term synaptic plasticity. Many researchers have investigated how the synaptic depression affects the memory storage capacity. However, the noise has not been taken into consideration in their analysis. By introducing "temperature", which controls the level of the noise, into an update rule of neurons, we investigate the effects of synaptic depression on the memory storage capacity in the presence of the noise. We analytically compute the storage capacity by using a statistical mechanics technique called Self Consistent Signal to Noise Analysis (SCSNA). We find that the synaptic depression decreases the storage capacity in the case of finite temperature in contrast to the case of the low temperature limit, where the storage capacity does not change. △ Less

Submitted 15 June, 2011; originally announced June 2011.

arXiv:1103.1439 [pdf, ps, other]

Generating Functional Analysis for Iterative CDMA Multiuser Detectors

Authors: Kazushi Mimura, Masato Okada

Abstract: We investigate the detection dynamics of a soft parallel interference canceller (soft-PIC), which includes a hard-PIC as a special case, for code-division multiple-access (CDMA) multiuser detection, applied to a randomly spread, fully synchronous base-band uncoded CDMA channel model with additive white Gaussian noise under perfect power control in the large-system limit. We analyze the detection d… ▽ More We investigate the detection dynamics of a soft parallel interference canceller (soft-PIC), which includes a hard-PIC as a special case, for code-division multiple-access (CDMA) multiuser detection, applied to a randomly spread, fully synchronous base-band uncoded CDMA channel model with additive white Gaussian noise under perfect power control in the large-system limit. We analyze the detection dynamics of some iterative detectors, namely soft-PIC, the Onsager-reaction-cancelling parallel interference canceller (ORC-PIC) and the belief-propagation-based detector (BP-based detector), by the generating functional analysis (GFA). The GFA allows us to study the asymptotic behavior of the dynamics in the infinitely large system without assuming the independence of messages. We study the detection dynamics and the stationary estimates of an iterative algorithm. We also show the decoupling principle in iterative multiuser detection algorithms in the large-system limit. For a generic iterative multiuser detection algorithm with binary input, it is shown that the multiuser channel is equivalent to a bank of independent single-user additive non-Gaussian channels, whose signal-to-noise ratio degrades due to both the multiple-access interference and the Onsager reaction, at each stage of the algorithm. If an algorithm cancels the Onsager reaction, the equivalent single-user channels coincide with an additive white Gaussian noise channel. We also discuss ORC-PIC and the BP-based detector. △ Less

Submitted 10 September, 2013; v1 submitted 8 March, 2011; originally announced March 2011.

Comments: 28 pages, 5 figures

arXiv:1102.1497 [pdf, ps, other]

doi 10.1143/JPSJ.80.034802

Belief Propagation for Error Correcting Codes and Lossy Compression Using Multilayer Perceptrons

Authors: Kazushi Mimura, Florent Cousseau, Masato Okada

Abstract: The belief propagation (BP) based algorithm is investigated as a potential decoder for both of error correcting codes and lossy compression, which are based on non-monotonic tree-like multilayer perceptron encoders. We discuss that whether the BP can give practical algorithms or not in these schemes. The BP implementations in those kind of fully connected networks unfortunately shows strong limita… ▽ More The belief propagation (BP) based algorithm is investigated as a potential decoder for both of error correcting codes and lossy compression, which are based on non-monotonic tree-like multilayer perceptron encoders. We discuss that whether the BP can give practical algorithms or not in these schemes. The BP implementations in those kind of fully connected networks unfortunately shows strong limitation, while the theoretical results seems a bit promising. Instead, it reveals it might have a rich and complex structure of the solution space via the BP-based algorithms. △ Less

Submitted 10 March, 2011; v1 submitted 7 February, 2011; originally announced February 2011.

Comments: 18 pages, 18 figures

Journal ref: J. Phys. Soc. Jpn., 80, 3, 034802 (2011)

arXiv:1011.2575 [pdf, ps, other]

doi 10.1371/journal.pone.0024516

Complex sequencing rules of birdsong can be explained by simple hidden Markov processes

Authors: Kentaro Katahira, Kenta Suzuki, Kazuo Okanoya, Masato Okada

Abstract: Complex sequencing rules observed in birdsongs provide an opportunity to investigate the neural mechanism for generating complex sequential behaviors. To relate the findings from studying birdsongs to other sequential behaviors, it is crucial to characterize the statistical properties of the sequencing rules in birdsongs. However, the properties of the sequencing rules in birdsongs have not yet be… ▽ More Complex sequencing rules observed in birdsongs provide an opportunity to investigate the neural mechanism for generating complex sequential behaviors. To relate the findings from studying birdsongs to other sequential behaviors, it is crucial to characterize the statistical properties of the sequencing rules in birdsongs. However, the properties of the sequencing rules in birdsongs have not yet been fully addressed. In this study, we investigate the statistical propertiesof the complex birdsong of the Bengalese finch (Lonchura striata var. domestica). Based on manual-annotated syllable sequences, we first show that there are significant higher-order context dependencies in Bengalese finch songs, that is, which syllable appears next depends on more than one previous syllable. This property is shared with other complex sequential behaviors. We then analyze acoustic features of the song and show that higher-order context dependencies can be explained using first-order hidden state transition dynamics with redundant hidden states. This model corresponds to hidden Markov models (HMMs), well known statistical models with a large range of application for time series modeling. The song annotation with these models with first-order hidden state dynamics agreed well with manual annotation, the score was comparable to that of a second-order HMM, and surpassed the zeroth-order model (the Gaussian mixture model (GMM)), which does not use context information. Our results imply that the hierarchical representation with hidden state dynamics may underlie the neural implementation for generating complex sequences with higher-order dependencies. △ Less

Submitted 11 November, 2010; originally announced November 2010.

arXiv:1011.1876 [pdf, ps, other]

Statistical mechanics of digital halftoning

Authors: Jun-ichi Inoue, Yohei Saika, Masato Okada

Abstract: We consider the problem of digital halftoning from the view point of statistical mechanics. The digital halftoning is a sort of image processing, namely, representing each grayscale in terms of black and white binary dots. The digital halftoning is achieved by making use of the threshold mask, namely, for each pixel, the halftoned binary pixel is determined as black if the original grayscale pixel… ▽ More We consider the problem of digital halftoning from the view point of statistical mechanics. The digital halftoning is a sort of image processing, namely, representing each grayscale in terms of black and white binary dots. The digital halftoning is achieved by making use of the threshold mask, namely, for each pixel, the halftoned binary pixel is determined as black if the original grayscale pixel is greater than or equal to the mask value and is determined as white vice versa. To determine the optimal value of the mask on each pixel for a given original grayscale image, we first assume that the human-eyes might recognize the black and white binary halftoned image as the corresponding grayscale one by linear filters. The Hamiltonian is constructed as a distance between the original and the recognized images which is written in terms of the threshold mask. We are confirmed that the system described by the Hamiltonian is regarded as a kind of antiferromagnetic Ising model with quenched disorders. By searching the ground state of the Hamiltonian, we obtain the optimal threshold mask and the resulting halftoned binary dots simultaneously. From the power-spectrum analysis, we find that the binary dots image is physiologically plausible from the view point of human-eyes modulation properties. We also propose a theoretical framework to investigate statistical performance of inverse digital halftoning, that is, the inverse process of halftoning. From the Bayesian inference view point, we rigorously show that the Bayes-optimal inverse-halftoning is achieved on a specific condition which is very similar to the so-called Nishimori line in the research field of spin glasses. △ Less

Submitted 8 November, 2010; originally announced November 2010.

Comments: 20 pages, 27 figures, using revtex4

arXiv:1005.3916 [pdf, ps, other]

Instabilities in associative memory model with synaptic depression and switching phenomena among attractors

Authors: Yosuke Otsubo, Kenji Nagata, Masafumi Oizumi, Masato Okada

Abstract: We investigated how the stability of macroscopic states in the associative memory model is affected by synaptic depression. To this model, we applied the dynamical mean-field theory, which has recently been developed in stochastic neural network models with synaptic depression. By introducing a sublattice method, we derived macroscopic equations for firing state variables and depression variables.… ▽ More We investigated how the stability of macroscopic states in the associative memory model is affected by synaptic depression. To this model, we applied the dynamical mean-field theory, which has recently been developed in stochastic neural network models with synaptic depression. By introducing a sublattice method, we derived macroscopic equations for firing state variables and depression variables. By using the macroscopic equations, we obtained the phase diagram when the strength of synaptic depression and the correlation level among stored patterns were changed. We found that there is an unstable region in which both the memory state and mixed state cannot be stable and that various switching phenomena can occur in this region. △ Less

Submitted 21 May, 2010; originally announced May 2010.

Comments: 18pages,6figures

arXiv:1003.1196 [pdf, ps, other]

Mean Field Analysis of Stochastic Neural Network Models with Synaptic Depression

Authors: Yasuhiko Igarashi, Masafumi Oizumi, Masato Okada

Abstract: We investigated the effects of synaptic depression on the macroscopic behavior of stochastic neural networks. Dynamical mean field equations were derived for such networks by taking the average of two stochastic variables: a firing state varialbe and a synaptic variable. In these equations, their average product is decoupled as the product of averaged them because the two stochastic variables ar… ▽ More We investigated the effects of synaptic depression on the macroscopic behavior of stochastic neural networks. Dynamical mean field equations were derived for such networks by taking the average of two stochastic variables: a firing state varialbe and a synaptic variable. In these equations, their average product is decoupled as the product of averaged them because the two stochastic variables are independent. We proved the independence of these two stochastic variables assuming that the synaptic weight is of the order of 1/N with respect to the number of neurons N. Using these equations, we derived macroscopic steady state equations for a network with uniform connections and a ring attractor network with Mexican hat type connectivity and investigated the stability of the steady state solutions. An oscillatory uniform state was observed in the network with uniform connections due to a Hopf instability. With the ring network, high-frequency perturbations were shown not to affect system stability. Two mechanisms destabilize the inhomogeneous steady state, leading two oscillatory states. A Turing instability leads to a rotating bump state, while a Hopf instability leads to an oscillatory bump state, which was previous unreported. Various oscillatory states take place in a network with synaptic depression depending on the strength of the interneuron connections. △ Less

Submitted 5 March, 2010; originally announced March 2010.

Comments: 26 pages, 13 figures. Preliminary results for the present work have been published elsewhere (Y Igarashi et al., 2009. http://www.iop.org/EJ/abstract/1742-6596/197/1/012018)

arXiv:0903.3451 [pdf, ps, other]

doi 10.1143/JPSJ.78.114801

Neural network model with discrete and continuous information representation

Authors: Jun Kitazono, Toshiaki Omori, Masato Okada

Abstract: An associative memory model and a neural network model with a Mexican-hat type interaction are the two most typical attractor networks used in the artificial neural network models. The associative memory model has discretely distributed fixed-point attractors, and achieves a discrete information representation. On the other hand, a neural network model with a Mexican-hat type interaction uses a… ▽ More An associative memory model and a neural network model with a Mexican-hat type interaction are the two most typical attractor networks used in the artificial neural network models. The associative memory model has discretely distributed fixed-point attractors, and achieves a discrete information representation. On the other hand, a neural network model with a Mexican-hat type interaction uses a line attractor to achieves a continuous information representation, which can be seen in the working memory in the prefrontal cortex and columnar activity in the visual cortex. In the present study, we propose a neural network model that achieves discrete and continuous information representation. We use a statistical-mechanical analysis to find that a localized retrieval phase exists in the proposed model, where the memory pattern is retrieved in the localized subpopulation of the network. In the localized retrieval phase, the discrete and continuous information representation is achieved by using the orthogonality of the memory patterns and the neutral stability of fixed points along the positions of the localized retrieval. The obtained phase diagram suggests that the antiferromagnetic interaction and the external field are important for generating the localized retrieval phase. △ Less

Submitted 20 March, 2009; originally announced March 2009.

Comments: 15 pages, 5 figures

arXiv:0903.1915 [pdf, ps, other]

Statistical Mechanical Study on a Neural Network Model with Time Dependent Interactions

Authors: T. Uezu, K. Abe, S. Miyoshi, M. Okada

Abstract: We study a neural network model in which both neurons and synaptic interactions evolve in time simultaneously. The time evolution of synaptic interactions is described by a Langevin equation including a Hebbian learning term, and a bias term which is the interactions of the Hopfield model. We assume that synaptic interactions change much slower than neurons and study the stationary states of syn… ▽ More We study a neural network model in which both neurons and synaptic interactions evolve in time simultaneously. The time evolution of synaptic interactions is described by a Langevin equation including a Hebbian learning term, and a bias term which is the interactions of the Hopfield model. We assume that synaptic interactions change much slower than neurons and study the stationary states of synaptic interactions by the replica method. We find that the order of the phase transition changes from the second to the first and that the existence regions of the Hopfield attractor and mixed states increase as the coefficient of the learning term increases. We also study the AT stability of solutions and find that the temperature region in which the Hopfield attractor is stable increases as the learning coefficient increases. Theoretical results are confirmed by the direct numerical integration of the Langevin equation. Further, we study the characteristics of the resultant synaptic interactions by partial annealing and find that the stability of the attractor which emerges after partial annealing is enhanced and those of the coexistent attractors are reduced. △ Less

Submitted 11 March, 2009; originally announced March 2009.

Comments: 19 pages, 9 figures

arXiv:0811.3476 [pdf, ps, other]

doi 10.1103/PhysRevE.81.021104

Error correcting code using tree-like multilayer perceptron

Authors: Florent Cousseau, Kazushi Mimura, Masato Okada

Abstract: An error correcting code using a tree-like multilayer perceptron is proposed. An original message $\mbi{s}^0$ is encoded into a codeword $\boldmath{y}_0$ using a tree-like committee machine (committee tree) or a tree-like parity machine (parity tree). Based on these architectures, several schemes featuring monotonic or non-monotonic units are introduced. The codeword $\mbi{y}_0$ is then transmit… ▽ More An error correcting code using a tree-like multilayer perceptron is proposed. An original message $\mbi{s}^0$ is encoded into a codeword $\boldmath{y}_0$ using a tree-like committee machine (committee tree) or a tree-like parity machine (parity tree). Based on these architectures, several schemes featuring monotonic or non-monotonic units are introduced. The codeword $\mbi{y}_0$ is then transmitted via a Binary Asymmetric Channel (BAC) where it is corrupted by noise. The analytical performance of these schemes is investigated using the replica method of statistical mechanics. Under some specific conditions, some of the proposed schemes are shown to saturate the Shannon bound at the infinite codeword length limit. The influence of the monotonicity of the units on the performance is also discussed. △ Less

Submitted 16 January, 2010; v1 submitted 21 November, 2008; originally announced November 2008.

Comments: 23 pages, 3 figures, Content has been extended and revised

Journal ref: Phys. Rev. E, 81, 021104 (2010)

arXiv:0807.4009 [pdf, ps, other]

doi 10.1103/PhysRevE.78.021124

Statistical mechanics of lossy compression for non-monotonic multilayer perceptrons

Authors: Florent Cousseau, Kazushi Mimura, Toshiaki Omori, Masato Okada

Abstract: A lossy data compression scheme for uniformly biased Boolean messages is investigated via statistical mechanics techniques. We utilize tree-like committee machine (committee tree) and tree-like parity machine (parity tree) whose transfer functions are non-monotonic. The scheme performance at the infinite code length limit is analyzed using the replica method. Both committee and parity treelike n… ▽ More A lossy data compression scheme for uniformly biased Boolean messages is investigated via statistical mechanics techniques. We utilize tree-like committee machine (committee tree) and tree-like parity machine (parity tree) whose transfer functions are non-monotonic. The scheme performance at the infinite code length limit is analyzed using the replica method. Both committee and parity treelike networks are shown to saturate the Shannon bound. The AT stability of the Replica Symmetric solution is analyzed, and the tuning of the non-monotonic transfer function is also discussed. △ Less

Submitted 25 July, 2008; originally announced July 2008.

Comments: 29 pages, 7 figures

Journal ref: Phys. Rev. E, 78, 021124 (2008)

arXiv:0807.3209 [pdf, ps, other]

doi 10.2478/s11534-009-0066-0

Bayes-optimal inverse halftoning and statistical mechanics of the Q-Ising model

Authors: Yohei Saika, Jun-ichi Inoue, Hiroyuki Tanaka, Masato Okada

Abstract: On the basis of statistical mechanics of the Q-Ising model, we formulate the Bayesian inference to the problem of inverse halftoning, which is the inverse process of representing gray-scales in images by means of black and white dots. Using Monte Carlo simulations, we investigate statistical properties of the inverse process, especially, we reveal the condition of the Bayes-optimal solution for… ▽ More On the basis of statistical mechanics of the Q-Ising model, we formulate the Bayesian inference to the problem of inverse halftoning, which is the inverse process of representing gray-scales in images by means of black and white dots. Using Monte Carlo simulations, we investigate statistical properties of the inverse process, especially, we reveal the condition of the Bayes-optimal solution for which the mean-square error takes its minimum. The numerical result is qualitatively confirmed by analysis of the infinite-range model. As demonstrations of our approach, we apply the method to retrieve a grayscale image, such as standard image `Lenna', from the halftoned version. We find that the Bayes-optimal solution gives a fine restored grayscale image which is very close to the original. △ Less

Submitted 21 July, 2008; originally announced July 2008.

Comments: 13pages, 12figures, using elsart.cls

arXiv:0805.0425 [pdf, ps, other]

Effect of Slow Switching in On-line Learning for Ensemble Teachers

Authors: Seiji Miyoshi, Masato Okada

Abstract: We have analyzed the generalization performance of a student which slowly switches ensemble teachers. By calculating the generalization error analytically using statistical mechanics in the framework of on-line learning, we show that the dynamical behaviors of generalization error have the periodicity that is synchronized with the switching period and the behaviors differ with the number of ense… ▽ More We have analyzed the generalization performance of a student which slowly switches ensemble teachers. By calculating the generalization error analytically using statistical mechanics in the framework of on-line learning, we show that the dynamical behaviors of generalization error have the periodicity that is synchronized with the switching period and the behaviors differ with the number of ensemble teachers. Furthermore, we show that the smaller the switching period is, the larger the difference is. △ Less

Submitted 4 February, 2009; v1 submitted 4 May, 2008; originally announced May 2008.

Comments: 8 pages, 5 figures

arXiv:0705.2491 [pdf, ps, other]

doi 10.1143/JPSJ.76.124801

Sparse and Dense Encoding in Layered Associative Network of Spiking Neurons

Authors: Kazuya Ishibashi, Kosuke Hamaguchi, Masato Okada

Abstract: A synfire chain is a simple neural network model which can propagate stable synchronous spikes called a pulse packet and widely researched. However how synfire chains coexist in one network remains to be elucidated. We have studied the activity of a layered associative network of Leaky Integrate-and-Fire neurons in which connection we embed memory patterns by the Hebbian Learning. We analyzed th… ▽ More A synfire chain is a simple neural network model which can propagate stable synchronous spikes called a pulse packet and widely researched. However how synfire chains coexist in one network remains to be elucidated. We have studied the activity of a layered associative network of Leaky Integrate-and-Fire neurons in which connection we embed memory patterns by the Hebbian Learning. We analyzed their activity by the Fokker-Planck method. In our previous report, when a half of neurons belongs to each memory pattern (memory pattern rate $F=0.5$), the temporal profiles of the network activity is split into temporally clustered groups called sublattices under certain input conditions. In this study, we show that when the network is sparsely connected ($F<0.5$), synchronous firings of the memory pattern are promoted. On the contrary, the densely connected network ($F>0.5$) inhibit synchronous firings. The sparseness and denseness also effect the basin of attraction and the storage capacity of the embedded memory patterns. We show that the sparsely(densely) connected networks enlarge(shrink) the basion of attraction and increase(decrease) the storage capacity. △ Less

Submitted 17 May, 2007; originally announced May 2007.

arXiv:0705.2318 [pdf, ps, other]

doi 10.1143/JPSJ.76.114001

Statistical Mechanics of Nonlinear On-line Learning for Ensemble Teachers

Authors: Hideto Utsumi, Seiji Miyoshi, Masato Okada

Abstract: We analyze the generalization performance of a student in a model composed of nonlinear perceptrons: a true teacher, ensemble teachers, and the student. We calculate the generalization error of the student analytically or numerically using statistical mechanics in the framework of on-line learning. We treat two well-known learning rules: Hebbian learning and perceptron learning. As a result, it… ▽ More We analyze the generalization performance of a student in a model composed of nonlinear perceptrons: a true teacher, ensemble teachers, and the student. We calculate the generalization error of the student analytically or numerically using statistical mechanics in the framework of on-line learning. We treat two well-known learning rules: Hebbian learning and perceptron learning. As a result, it is proven that the nonlinear model shows qualitatively different behaviors from the linear model. Moreover, it is clarified that Hebbian learning and perceptron learning show qualitatively different behaviors from each other. In Hebbian learning, we can analytically obtain the solutions. In this case, the generalization error monotonically decreases. The steady value of the generalization error is independent of the learning rate. The larger the number of teachers is and the more variety the ensemble teachers have, the smaller the generalization error is. In perceptron learning, we have to numerically obtain the solutions. In this case, the dynamical behaviors of the generalization error are non-monotonic. The smaller the learning rate is, the larger the number of teachers is; and the more variety the ensemble teachers have, the smaller the minimum value of the generalization error is. △ Less

Submitted 16 May, 2007; originally announced May 2007.

Comments: 13 pages, 9 figures

arXiv:cond-mat/0702427 [pdf, ps, other]

doi 10.1143/JPSJ.76.044804

Retrieval of branching sequences in associative memory model with common external input and bias input

Authors: Kentaro Katahira, Masaki Kawamura, Kazuo Okanoya, Masato Okada

Abstract: We investigate a recurrent neural network model with common external and bias inputs that can retrieve branching sequences. Retrieval of memory sequences is one of the most important functions of the brain. A lot of research has been done on neural networks that process memory sequences. Most of it has focused on fixed memory sequences. However, many animals can remember and recall branching seq… ▽ More We investigate a recurrent neural network model with common external and bias inputs that can retrieve branching sequences. Retrieval of memory sequences is one of the most important functions of the brain. A lot of research has been done on neural networks that process memory sequences. Most of it has focused on fixed memory sequences. However, many animals can remember and recall branching sequences. Therefore, we propose an associative memory model that can retrieve branching sequences. Our model has bias input and common external input. Kawamura and Okada reported that common external input enables sequential memory retrieval in an associative memory model with auto- and weak cross-correlation connections. We show that retrieval processes along branching sequences are controllable with both the bias input and the common external input. To analyze the behaviors of our model, we derived the macroscopic dynamical description as a probability density function. The results obtained by our theory agree with those obtained by computer simulations. △ Less

Submitted 19 February, 2007; originally announced February 2007.

arXiv:cs/0612117 [pdf, ps, other]

doi 10.1143/JPSJ.76.044003

Statistical Mechanics of On-line Learning when a Moving Teacher Goes around an Unlearnable True Teacher

Authors: Masahiro Urakami, Seiji Miyoshi, Masato Okada

Abstract: In the framework of on-line learning, a learning machine might move around a teacher due to the differences in structures or output functions between the teacher and the learning machine. In this paper we analyze the generalization performance of a new student supervised by a moving machine. A model composed of a fixed true teacher, a moving teacher, and a student is treated theoretically using… ▽ More In the framework of on-line learning, a learning machine might move around a teacher due to the differences in structures or output functions between the teacher and the learning machine. In this paper we analyze the generalization performance of a new student supervised by a moving machine. A model composed of a fixed true teacher, a moving teacher, and a student is treated theoretically using statistical mechanics, where the true teacher is a nonmonotonic perceptron and the others are simple perceptrons. Calculating the generalization errors numerically, we show that the generalization errors of a student can temporarily become smaller than that of a moving teacher, even if the student only uses examples from the moving teacher. However, the generalization error of the student eventually becomes the same value with that of the moving teacher. This behavior is qualitatively different from that of a linear model. △ Less

Submitted 21 December, 2006; originally announced December 2006.

Comments: 12 pages, 5 pages

arXiv:cs/0610066 [pdf, ps, other]

doi 10.1016/S0304-3975(00)00347-9

Inductive-data-type Systems

Authors: Frédéric Blanqui, Jean-Pierre Jouannaud, Mitsuhiro Okada

Abstract: In a previous work ("Abstract Data Type Systems", TCS 173(2), 1997), the last two authors presented a combined language made of a (strongly normalizing) algebraic rewrite system and a typed lambda-calculus enriched by pattern-matching definitions following a certain format, called the "General Schema", which generalizes the usual recursor definitions for natural numbers and similar "basic inductiv… ▽ More In a previous work ("Abstract Data Type Systems", TCS 173(2), 1997), the last two authors presented a combined language made of a (strongly normalizing) algebraic rewrite system and a typed lambda-calculus enriched by pattern-matching definitions following a certain format, called the "General Schema", which generalizes the usual recursor definitions for natural numbers and similar "basic inductive types". This combined language was shown to be strongly normalizing. The purpose of this paper is to reformulate and extend the General Schema in order to make it easily extensible, to capture a more general class of inductive types, called "strictly positive", and to ease the strong normalization proof of the resulting system. This result provides a computation model for the combination of an algebraic specification language based on abstract data types and of a strongly typed functional language with strictly positive inductive types. △ Less

Submitted 16 September, 2013; v1 submitted 11 October, 2006; originally announced October 2006.

Comments: Theoretical Computer Science (2002)

arXiv:cs/0610063 [pdf, ps, other]

The Calculus of Algebraic Constructions

Authors: Frédéric Blanqui, Jean-Pierre Jouannaud, Mitsuhiro Okada

Abstract: This paper is concerned with the foundations of the Calculus of Algebraic Constructions (CAC), an extension of the Calculus of Constructions by inductive data types. CAC generalizes inductive types equipped with higher-order primitive recursion, by providing definitions of functions by pattern-matching which capture recursor definitions for arbitrary non-dependent and non-polymorphic inductive t… ▽ More This paper is concerned with the foundations of the Calculus of Algebraic Constructions (CAC), an extension of the Calculus of Constructions by inductive data types. CAC generalizes inductive types equipped with higher-order primitive recursion, by providing definitions of functions by pattern-matching which capture recursor definitions for arbitrary non-dependent and non-polymorphic inductive types satisfying a strictly positivity condition. CAC also generalizes the first-order framework of abstract data types by providing dependent types and higher-order rewrite rules. △ Less

Submitted 27 May, 2008; v1 submitted 11 October, 2006; originally announced October 2006.

Journal ref: Dans Rewriting Techniques and Applications, 10th International Conference, RTA-99 1631 (1999)

arXiv:cond-mat/0609568 [pdf, ps, other]

doi 10.1143/JPSJ.75.124002

Statistical Mechanics of Linear and Nonlinear Time-Domain Ensemble Learning

Authors: Seiji Miyoshi, Masato Okada

Abstract: Conventional ensemble learning combines students in the space domain. In this paper, however, we combine students in the time domain and call it time-domain ensemble learning. We analyze, compare, and discuss the generalization performances regarding time-domain ensemble learning of both a linear model and a nonlinear model. Analyzing in the framework of online learning using a statistical mecha… ▽ More Conventional ensemble learning combines students in the space domain. In this paper, however, we combine students in the time domain and call it time-domain ensemble learning. We analyze, compare, and discuss the generalization performances regarding time-domain ensemble learning of both a linear model and a nonlinear model. Analyzing in the framework of online learning using a statistical mechanical method, we show the qualitatively different behaviors between the two models. In a linear model, the dynamical behaviors of the generalization error are monotonic. We analytically show that time-domain ensemble learning is twice as effective as conventional ensemble learning. Furthermore, the generalization error of a nonlinear model features nonmonotonic dynamical behaviors when the learning rate is small. We numerically show that the generalization performance can be improved remarkably by using this phenomenon and the divergence of students in the time domain. △ Less

Submitted 22 September, 2006; originally announced September 2006.

Comments: 11 pages, 7 figures

arXiv:cond-mat/0607557 [pdf, ps, other]

doi 10.1143/JPSJ.75.124603

Stochastic transitions of attractors in associative memory models with correlated noise

Authors: Masaki Kawamura, Masato Okada

Abstract: We investigate dynamics of recurrent neural networks with correlated noise to analyze the noise's effect. The mechanism of correlated firing has been analyzed in various models, but its functional roles have not been discussed in sufficient detail. Aoyagi and Aoki have shown that the state transition of a network is invoked by synchronous spikes. We introduce two types of noise to each neuron: t… ▽ More We investigate dynamics of recurrent neural networks with correlated noise to analyze the noise's effect. The mechanism of correlated firing has been analyzed in various models, but its functional roles have not been discussed in sufficient detail. Aoyagi and Aoki have shown that the state transition of a network is invoked by synchronous spikes. We introduce two types of noise to each neuron: thermal independent noise and correlated noise. Due to the effects of correlated noise, the correlation between neural inputs cannot be ignored, so the behavior of the network has sample dependence. We discuss two types of associative memory models: one with auto- and weak cross-correlation connections and one with hierarchically correlated patterns. The former is similar in structure to Aoyagi and Aoki's model. We show that stochastic transition can be presented by correlated rather than thermal noise. In the latter, we show stochastic transition from a memory state to a mixture state using correlated noise. To analyze the stochastic transitions, we derive a macroscopic dynamic description as a recurrence relation form of a probability density function when the correlated noise exists. Computer simulations agree with theoretical results. △ Less

Submitted 2 October, 2006; v1 submitted 21 July, 2006; originally announced July 2006.

Comments: 21 pages

Journal ref: Journal of the Physical Society of Japan, Vol. 75 No. 12, December, 2006, 124603

arXiv:q-bio/0605022 [pdf, ps, other]

doi 10.1143/JPSJ.75.114803

Theory of Interaction of Memory Patterns in Layered Associative Networks

Authors: Kazuya Ishibashi, Kosuke Hamaguchi, Masato Okada

Abstract: A synfire chain is a network that can generate repeated spike patterns with millisecond precision. Although synfire chains with only one activity propagation mode have been intensively analyzed with several neuron models, those with several stable propagation modes have not been thoroughly investigated. By using the leaky integrate-and-fire neuron model, we constructed a layered associative netw… ▽ More A synfire chain is a network that can generate repeated spike patterns with millisecond precision. Although synfire chains with only one activity propagation mode have been intensively analyzed with several neuron models, those with several stable propagation modes have not been thoroughly investigated. By using the leaky integrate-and-fire neuron model, we constructed a layered associative network embedded with memory patterns. We analyzed the network dynamics with the Fokker-Planck equation. First, we addressed the stability of one memory pattern as a propagating spike volley. We showed that memory patterns propagate as pulse packets. Second, we investigated the activity when we activated two different memory patterns. Simultaneous activation of two memory patterns with the same strength led the propagating pattern to a mixed state. In contrast, when the activations had different strengths, the pulse packet converged to a two-peak state. Finally, we studied the effect of the preceding pulse packet on the following pulse packet. The following pulse packet was modified from its original activated memory pattern, and it converged to a two-peak state, mixed state or non-spike state depending on the time interval. △ Less

Submitted 14 May, 2006; originally announced May 2006.

arXiv:cond-mat/0605176 [pdf, ps, other]

doi 10.1143/JPSJ.75.084007

Statistical Mechanics of Time Domain Ensemble Learning

Authors: Seiji Miyoshi, Tatsuya Uezu, Masato Okada

Abstract: Conventional ensemble learning combines students in the space domain. On the other hand, in this paper we combine students in the time domain and call it time domain ensemble learning. In this paper, we analyze the generalization performance of time domain ensemble learning in the framework of online learning using a statistical mechanical method. We treat a model in which both the teacher and t… ▽ More Conventional ensemble learning combines students in the space domain. On the other hand, in this paper we combine students in the time domain and call it time domain ensemble learning. In this paper, we analyze the generalization performance of time domain ensemble learning in the framework of online learning using a statistical mechanical method. We treat a model in which both the teacher and the student are linear perceptrons with noises. Time domain ensemble learning is twice as effective as conventional space domain ensemble learning. △ Less

Submitted 7 May, 2006; originally announced May 2006.

Comments: 10 pages, 10 figures

arXiv:physics/0601162 [pdf, ps, other]

doi 10.1143/JPSJ.75.044002

Statistical Mechanics of Online Learning for Ensemble Teachers

Authors: Seiji Miyoshi, Masato Okada

Abstract: We analyze the generalization performance of a student in a model composed of linear perceptrons: a true teacher, ensemble teachers, and the student. Calculating the generalization error of the student analytically using statistical mechanics in the framework of on-line learning, it is proven that when learning rate $η<1$, the larger the number $K$ and the variety of the ensemble teachers are, t… ▽ More We analyze the generalization performance of a student in a model composed of linear perceptrons: a true teacher, ensemble teachers, and the student. Calculating the generalization error of the student analytically using statistical mechanics in the framework of on-line learning, it is proven that when learning rate $η<1$, the larger the number $K$ and the variety of the ensemble teachers are, the smaller the generalization error is. On the other hand, when $η>1$, the properties are completely reversed. If the variety of the ensemble teachers is rich enough, the direction cosine between the true teacher and the student becomes unity in the limit of $η\to 0$ and $K \to \infty$. △ Less

Submitted 20 January, 2006; originally announced January 2006.

Comments: 11 pages, 9 figures

arXiv:cond-mat/0512036 [pdf, ps, other]

doi 10.1088/0305-4470/39/15/003

Dynamical replica theoretic analysis of CDMA detection dynamics

Authors: J. P. L. Hatchett, M. Okada

Abstract: We investigate the detection dynamics of the Gibbs sampler for code-division multiple access (CDMA) multiuser detection. Our approach is based upon dynamical replica theory which allows an analytic approximation to the dynamics. We use this tool to investigate the basins of attraction when phase coexistence occurs and examine its efficacy via comparison with Monte Carlo simulations. We investigate the detection dynamics of the Gibbs sampler for code-division multiple access (CDMA) multiuser detection. Our approach is based upon dynamical replica theory which allows an analytic approximation to the dynamics. We use this tool to investigate the basins of attraction when phase coexistence occurs and examine its efficacy via comparison with Monte Carlo simulations. △ Less

Submitted 2 December, 2005; originally announced December 2005.

Comments: 18 pages, 2 figures

Journal ref: J. Phys. A: Math. Gen. (2006) 39 3883-3902

arXiv:cond-mat/0510169 [pdf, ps, other]

doi 10.1143/JPSJ.74.2961

Theory of Recurrent Neural Network with Common Synaptic Inputs

Authors: Masaki Kawamura, Michiko Yamana, Masato Okada

Abstract: We discuss the effects of common synaptic inputs in a recurrent neural network. Because of the effects of these common synaptic inputs, the correlation between neural inputs cannot be ignored, and thus the network exhibits sample dependence. Networks of this type do not have well-defined thermodynamic limits, and self-averaging breaks down. We therefore need to develop a suitable theory without… ▽ More We discuss the effects of common synaptic inputs in a recurrent neural network. Because of the effects of these common synaptic inputs, the correlation between neural inputs cannot be ignored, and thus the network exhibits sample dependence. Networks of this type do not have well-defined thermodynamic limits, and self-averaging breaks down. We therefore need to develop a suitable theory without relying on these common properties. While the effects of the common synaptic inputs have been analyzed in layered neural networks, it was apparently difficult to analyze these effects in recurrent neural networks due to feedback connections. We investigated a sequential associative memory model as an example of recurrent networks and succeeded in deriving a macroscopic dynamical description as a recurrence relation form of a probability density function. △ Less

Submitted 7 October, 2005; originally announced October 2005.

Comments: 12 pages

Journal ref: Journal of the Physical Society of Japan, vol.74, no.11, Nov. 2005, pp.2961-2965

arXiv:physics/0509050 [pdf, ps, other]

doi 10.1143/JPSJ.75.024003

Analysis of on-line learning when a moving teacher goes around a true teacher

Authors: Seiji Miyoshi, Masato Okada

Abstract: In the framework of on-line learning, a learning machine might move around a teacher due to the differences in structures or output functions between the teacher and the learning machine or due to noises. The generalization performance of a new student supervised by a moving machine has been analyzed. A model composed of a true teacher, a moving teacher and a student that are all linear perceptr… ▽ More In the framework of on-line learning, a learning machine might move around a teacher due to the differences in structures or output functions between the teacher and the learning machine or due to noises. The generalization performance of a new student supervised by a moving machine has been analyzed. A model composed of a true teacher, a moving teacher and a student that are all linear perceptrons with noises has been treated analytically using statistical mechanics. It has been proven that the generalization errors of a student can be smaller than that of a moving teacher, even if the student only uses examples from the moving teacher. △ Less

Submitted 7 September, 2005; originally announced September 2005.

Comments: 13 pages, 8 figures

arXiv:cond-mat/0508598 [pdf, ps, other]

doi 10.1103/PhysRevE.74.026108

Statistical mechanics of lossy compression using multilayer perceptrons

Authors: Kazushi Mimura, Masato Okada

Abstract: Statistical mechanics is applied to lossy compression using multilayer perceptrons for unbiased Boolean messages. We utilize a tree-like committee machine (committee tree) and tree-like parity machine (parity tree) whose transfer functions are monotonic. For compression using committee tree, a lower bound of achievable distortion becomes small as the number of hidden units K increases. However,… ▽ More Statistical mechanics is applied to lossy compression using multilayer perceptrons for unbiased Boolean messages. We utilize a tree-like committee machine (committee tree) and tree-like parity machine (parity tree) whose transfer functions are monotonic. For compression using committee tree, a lower bound of achievable distortion becomes small as the number of hidden units K increases. However, it cannot reach the Shannon bound even where K -> infty. For a compression using a parity tree with K >= 2 hidden units, the rate distortion function, which is known as the theoretical limit for compression, is derived where the code length becomes infinity. △ Less

Submitted 1 May, 2006; v1 submitted 25 August, 2005; originally announced August 2005.

Comments: 12 pages, 5 figures

Journal ref: Phys. Rev. E, 74, 026108 (2006)

Showing 151–200 of 238 results for author: Okada, M