-
Reversal in Thermally Driven Rotation of Chiral Liquid Crystal Droplets
Authors:
Shunsuke Takano,
Takuya Nakanishi,
Kenta Nakagawa,
Toru Asahi
Abstract:
For chiral liquid crystals that express topologically protected defects and thermally driven mechanical rotation, the size- and temperature-driven reversal of the rotational direction of their droplets was demonstrated even under a fixed temperature gradient. This unconventional reversal indicates the dependence of thermomechanical coupling on the molecular orientational order; this dependence is justified through an examination of the size, temperature, and molecular structure as well as by phenomenological arguments on the order parameter.
Submitted 22 April, 2024;
originally announced April 2024.
-
Fingerprints of Mott and Slater gaps in the core-level photoemission spectra of antiferromagnetic iridates
Authors:
K. Nakagawa,
A. Hariki,
T. Okauchi,
H. Fujiwara,
K. -H. Ahn,
Y. Murakami,
S. Hamamoto,
Y. Kanai-Nakata,
T. Kadono,
A. Higashiya,
K. Tamasaku,
M. Yabashi,
T. Ishikawa,
A. Sekiyama,
S. Imada,
J. Kuneš,
K. Takase,
A. Yamasaki
Abstract:
We present Ir $4f$ core-level hard-x-ray photoemission spectroscopy (HAXPES) experiments conducted across the antiferromagnetic (AFM) ordering transition in Ruddlesden-Popper iridates Sr$_2$IrO$_4$ and Sr$_3$Ir$_2$O$_7$. The Ir $4f$ spectra exhibit distinct changes between the AFM and paramagnetic (PM) phases, with the spectral difference $I_\text{PM}-I_\text{AFM}$ showing a contrasting behavior in the two compounds. By employing computational simulations using the local-density approximation combined with the dynamical mean-field theory method, we elucidate that $I_\text{PM}-I_\text{AFM}$ primarily reflects the Slater or Mott-Hubbard character of the AFM insulating state rather than material-specific details. This sensitivity to fine low-energy electronic structure arises from the dependence of charge-transfer responses to the sudden creation of a localized core hole on both metal-insulator transitions and long-range AFM ordering. Our result broadens the applications of core-level HAXPES as a tool for characterization of electronic structure.
Submitted 19 April, 2024;
originally announced April 2024.
-
Development of a near-infrared wide-field integral field unit by ultra-precision diamond cutting
Authors:
Kosuke Kushibiki,
Shinobu Ozaki,
Masahiro Takeda,
Takuya Hosobata,
Yutaka Yamagata,
Shinya Morita,
Toshihiro Tsuzuki,
Keiichi Nakagawa,
Takao Saiki,
Yutaka Ohtake,
Kenji Mitsui,
Hirofumi Okita,
Yutaro Kitagawa,
Yukihiro Kono,
Kentaro Motohara,
Hidenori Takahashi,
Masahiro Konishi,
Natsuko Kato,
Shuhei Koyama,
Nuo Chen
Abstract:
Integral Field Spectroscopy (IFS) is an observational method to obtain spatially resolved spectra over a specific field of view (FoV) in a single exposure. In recent years, near-infrared IFS has gained importance in observing objects with strong dust attenuation or at high redshift. One limitation of existing near-infrared IFS instruments is their relatively small FoV, less than 100 arcsec$^2$, compared to optical instruments. Therefore, we have developed a near-infrared (0.9-2.5 $\mathrm{\mu}$m) image-slicer type integral field unit (IFU) with a larger FoV of 13.5 $\times$ 10.4 arcsec$^2$ by matching a slice width to a typical seeing size of 0.4 arcsec. The IFU has a compact optical design utilizing off-axis ellipsoidal mirrors to reduce aberrations. Complex optical elements were fabricated using an ultra-precision cutting machine to achieve RMS surface roughness of less than 10 nm and a P-V shape error of less than 300 nm. The ultra-precision machining can also simplify alignment procedures. The on-sky performance evaluation confirmed that the image quality and the throughput of the IFU were as designed. In conclusion, we have successfully developed a compact IFU utilizing an ultra-precision cutting technique, almost fulfilling the requirements.
Submitted 3 March, 2024;
originally announced March 2024.
-
CFTM: Continuous time fractional topic model
Authors:
Kei Nakagawa,
Kohei Hayashi,
Yugo Fujimoto
Abstract:
In this paper, we propose the Continuous Time Fractional Topic Model (cFTM), a new method for dynamic topic modeling. This approach incorporates fractional Brownian motion (fBm) to effectively identify positive or negative correlations in topic and word distribution over time, revealing long-term dependency or roughness. Our theoretical analysis shows that the cFTM can capture this long-term dependency or roughness in both topic and word distributions, mirroring the main characteristics of fBm. Moreover, we prove that the parameter estimation process for the cFTM is on par with that of LDA, a traditional topic model. To demonstrate the cFTM's properties, we conduct an empirical study using economic news articles. The results from these tests support the model's ability to identify and track long-term dependency or roughness in topics over time.
Submitted 6 February, 2024; v1 submitted 29 January, 2024;
originally announced February 2024.
-
Signature of BKT-like spin transport in a quasi-2D antiferromagnet BaNi$_2$V$_2$O$_8$
Authors:
Kurea Nakagawa,
Minoru Kanega,
Tomoyuki Yokouchi,
Masahiro Sato,
Yuki Shiomi
Abstract:
In two-dimensional (2D) spin systems, the augmentation of spin fluctuations gives rise to quasi-long-range order; however, how they manifest in spin transport remains unclear. Here we investigate the spin Seebeck effect (SSE) in a quasi-2D antiferromagnet, BaNi$_2$V$_2$O$_8$, which has been reported to exhibit the Berezinskii-Kosterlitz-Thouless (BKT) transition owing to its distinct 2D nature. We found that the SSE in Pt / BaNi$_2$V$_2$O$_8$ persists well above the Néel temperature, significantly different from the behavior of 3D ordered magnets. Our numerical analysis for a 2D microscopic spin model supports the hypothesis that the observed SSE is linked to strong magnetic correlations in the BKT-like phase.
Submitted 25 December, 2023;
originally announced December 2023.
-
Doubly Robust Mean-CVaR Portfolio
Authors:
Kei Nakagawa,
Masaya Abe,
Seiichi Kuroki
Abstract:
In this study, we address the challenge of portfolio optimization, a critical aspect of managing investment risks and maximizing returns. The mean-CVaR portfolio is considered a promising method given today's unstable financial markets and crises such as the COVID-19 pandemic. It incorporates expected returns into the CVaR, which considers the expected value of losses exceeding a specified probability level. However, the instability associated with input parameter changes and estimation errors can deteriorate portfolio performance. Therefore, in this study, we propose the Doubly Robust mean-CVaR Portfolio, a refined approach to mean-CVaR portfolio optimization. Our method addresses the instability problem by simultaneously optimizing multiple levels of CVaRs and defining uncertainty sets for the mean parameter to perform robust optimization. Theoretically, the proposed method can be formulated as a second-order cone programming problem, which is the same formulation as traditional mean-variance portfolio optimization. In addition, we derive an estimation error bound of the proposed method for the finite-sample case. Finally, experiments with benchmark and real market data show that our proposed method exhibits better performance compared to existing portfolio optimization strategies.
Submitted 20 September, 2023;
originally announced September 2023.
-
Efficient Model Selection for Predictive Pattern Mining Model by Safe Pattern Pruning
Authors:
Takumi Yoshida,
Hiroyuki Hanada,
Kazuya Nakagawa,
Kouichi Taji,
Koji Tsuda,
Ichiro Takeuchi
Abstract:
Predictive pattern mining is an approach used to construct prediction models when the input is represented by structured data, such as sets, graphs, and sequences. The main idea behind predictive pattern mining is to build a prediction model by considering substructures, such as subsets, subgraphs, and subsequences (referred to as patterns), present in the structured data as features of the model. The primary challenge in predictive pattern mining lies in the exponential growth of the number of patterns with the complexity of the structured data. In this study, we propose the Safe Pattern Pruning (SPP) method to address the explosion of pattern numbers in predictive pattern mining. We also discuss how it can be effectively employed throughout the entire model building process in practical data analysis. To demonstrate the effectiveness of the proposed method, we conduct numerical experiments on regression and classification problems involving sets, graphs, and sequences.
Submitted 23 June, 2023;
originally announced June 2023.
-
A New Initial Distribution for Quantum Generative Adversarial Networks to Load Probability Distributions
Authors:
Yuichi Sano,
Ryosuke Koga,
Masaya Abe,
Kei Nakagawa
Abstract:
Quantum computers are gaining attention for their ability to solve certain problems faster than classical computers, and one example is the quantum expectation estimation algorithm that accelerates the widely used Monte Carlo method in fields such as finance. A previous study has shown that quantum generative adversarial networks (qGANs), a quantum circuit version of generative adversarial networks (GANs), can generate the probability distribution necessary for the quantum expectation estimation algorithm in shallow quantum circuits. However, a previous study has also suggested that the convergence speed and accuracy of the generated distribution can vary greatly depending on the initial distribution of the qGANs' generator. In particular, the effectiveness of using a normal distribution as the initial distribution has been claimed, but it requires a deep quantum circuit, which may lose the advantage of qGANs. Therefore, in this study, we propose a novel method for generating an initial distribution that improves the learning efficiency of qGANs. Our method uses the classical process of label replacement to generate various probability distributions in shallow quantum circuits. We demonstrate that our proposed method can generate the log-normal distribution, which is pivotal in financial engineering, as well as the triangular distribution and the bimodal distribution, more efficiently than current methods. Additionally, we show that the initial distribution proposed in our research is related to the problem of determining the initial weights for qGANs.
Submitted 9 August, 2023; v1 submitted 21 June, 2023;
originally announced June 2023.
-
A Large-Scale Pad-Sensor Based Prototype of the Silicon Tungsten Electromagnetic Calorimeter for the Forward Direction in ALICE at LHC
Authors:
R. G. E. Barthel,
T. Chujo,
T. Hachiya,
M. Hatakeyama,
Y. Hoshi,
M. Inaba,
Y. Kawamura,
D. Kawana,
C. Loizides,
Y. Miake,
Y. Minato,
K. Nakagawa,
N. Novitzky,
T. Peitzmann,
M. Rossewij,
M. Shimomura,
T. Sugitate,
T. Suzuki,
K. Tadokoro,
M. Takamura,
S. Takasu,
A. van den Brink,
M. van Leeuwen
Abstract:
We constructed a large-scale electromagnetic calorimeter prototype as a part of the Forward Calorimeter upgrade project (FoCal) for the ALICE experiment at the Large Hadron Collider (LHC). The prototype, also known as ``Mini FoCal'', consists of 20 layers of silicon pad sensors and tungsten alloy plates with printed circuit boards and readout electronics. The constructed detector was tested at the test beam facility of the Super Proton Synchrotron (SPS) at CERN. We obtain an energy resolution of about 4.3% for electron beams at both 150 and 250 GeV/$c$, which is consistent with realistic detector response simulations. Longitudinal profiles of the electromagnetic shower were also measured and found to agree with the simulations. The same prototype detector was installed in the ALICE experimental area about 7.5 m away from the interaction point. It was used to measure inclusive electromagnetic cluster energy distributions and neutral-pion candidate invariant mass distributions for pseudo-rapidity of $η$ = 3.7-4.5 in proton-proton collisions at $\sqrt{s}$ = 13 TeV at the LHC. The measured distributions in different $η$ regions are similar to those obtained from PYTHIA simulations.
Submitted 18 March, 2024; v1 submitted 9 June, 2023;
originally announced June 2023.
-
Bulk superconductivity in Pb-substituted BiS$_{\bf 2}$-based compounds studied by hard-x-ray spectroscopy
Authors:
A. Yamasaki,
T. Oguni,
T. Hayashida,
K. Miyazaki,
N. Tanaka,
K. Nakagawa,
K. Tamura,
K. Mimura,
N. Kawamura,
H. Fujiwara,
G. Nozue,
A. Ose,
Y. Kanai-Nakata,
A. Higashiya,
S. Hamamoto,
K. Tamasaku,
M. Yabashi,
T. Ishikawa,
S. Imada,
A. Sekiyama,
H. Sakata,
S. Demura
Abstract:
In this study, we investigate the bulk electronic structure of Pb-substituted LaO$_{0.5}$F$_{0.5}$BiS$_2$ single crystals, using two types of hard-x-ray spectroscopy. High-energy-resolution fluorescence-detected x-ray absorption spectroscopy revealed a spectral change at low temperatures. Using density functional theory (DFT) simulations, we find that the temperature-induced change originates from a structural phase transition, similar to the pressure-induced transition in LaO$_{0.5}$F$_{0.5}$BiS$_2$. This finding suggests that the mechanism of bulk superconductivity induced by Pb substitution is the same as that under high pressure. Furthermore, a novel low-valence state with a mixture of divalent and trivalent Bi ions is discovered using hard x-ray photoemission spectroscopy with the aid of DFT calculations.
Submitted 4 January, 2024; v1 submitted 31 May, 2023;
originally announced May 2023.
-
Size-controlled quantum dots reveal the impact of intraband transitions on high-order harmonic generation in solids
Authors:
Kotaro Nakagawa,
Hideki Hirori,
Shunsuke A. Sato,
Hirokazu Tahara,
Fumiya Sekiguchi,
Go Yumoto,
Masaki Saruyama,
Ryota Sato,
Toshiharu Teranishi,
Yoshihiko Kanemitsu
Abstract:
Since the discovery of high-order harmonic generation (HHG) in solids, much effort has been devoted to understanding its generation mechanism and both interband and intraband transitions are known to be essential. However, intraband transitions are affected by the electronic structure of a solid, and how they contribute to nonlinear carrier generation and HHG remains an open question. Here, we use mid-infrared laser pulses to study HHG in CdSe and CdS quantum dots (QDs), where quantum confinement can be used to control the intraband transitions. We find that both the HHG intensity per excited volume and the generated carrier density increase when the average QD size is increased from about 2 nm to 3 nm. We show that the reduction of the subband gap energy in larger QDs enhances intraband transitions, and this in turn increases the rate of photocarrier injection by coupling with interband transitions, resulting in enhanced HHG.
Submitted 21 December, 2022;
originally announced December 2022.
-
What Makes An Apology More Effective? Exploring Anthropomorphism, Individual Differences, And Emotion In Human-Automation Trust Repair
Authors:
Peggy Pei-Ying Lu,
Makoto Konishi,
Shin Sano,
Sho Hiruta,
Francis Ken Nakagawa
Abstract:
Recent advances in technology have allowed an automation system to recognize its errors and repair trust more actively than ever. While previous research has called for further studies of different human factors and design features, their effect on human-automation trust repair scenarios remains unknown, especially concerning emotions. This paper seeks to fill such gaps by investigating the impact of anthropomorphism, users' individual differences, and emotional responses on human-automation trust repair. Our experiment manipulated various types of trust violations and apology messages with different emotionally expressive anthropomorphic cues. While no significant effect from the different apology representations was found, our participants displayed polarizing attitudes toward the anthropomorphic cues. We also found that (1) some personality traits, such as openness and conscientiousness, negatively correlate with the effectiveness of the apology messages, and (2) a person's emotional response toward a trust violation positively correlates with the effectiveness of the apology messages.
Submitted 1 December, 2022; v1 submitted 18 November, 2022;
originally announced November 2022.
-
Uncertainty Aware Trader-Company Method: Interpretable Stock Price Prediction Capturing Uncertainty
Authors:
Yugo Fujimoto,
Kei Nakagawa,
Kentaro Imajo,
Kentaro Minami
Abstract:
Machine learning is an increasingly popular tool with some success in predicting stock prices. One promising method is the Trader-Company (TC) method, which takes into account the dynamism of the stock market and has both high predictive power and interpretability. Machine learning-based stock prediction methods, including the TC method, have concentrated on point prediction. However, point prediction in the absence of uncertainty estimates lacks credibility quantification and raises concerns about safety. The challenge in this paper is to make an investment strategy that combines high predictive power and the ability to quantify uncertainty. We propose a novel approach called the Uncertainty Aware Trader-Company (UTC) method. The core idea of this approach is to combine the strengths of both frameworks by merging the TC method with probabilistic modeling, which provides probabilistic predictions and uncertainty estimations. We expect this to retain the predictive power and interpretability of the TC method while capturing the uncertainty. We theoretically prove that the proposed method estimates the posterior variance and does not introduce additional biases from the original TC method. We conduct a comprehensive evaluation of our approach based on synthetic and real market datasets. We confirm with synthetic data that the UTC method can detect situations where the uncertainty increases and the prediction is difficult. We also confirm that the UTC method can detect abrupt changes in data-generating distributions. We demonstrate with real market data that the UTC method can achieve higher returns and lower risks than baselines.
Submitted 2 November, 2022; v1 submitted 30 October, 2022;
originally announced October 2022.
-
On a Proof of the Convergence Speed of a Second-order Recurrence Formula in the Arimoto-Blahut Algorithm
Authors:
Kenji Nakagawa,
Yoshinori Takei,
Shin-ichiro Hara
Abstract:
In [8] (Nakagawa et al., IEEE Trans. IT, 2021), we investigated the convergence speed of the Arimoto-Blahut algorithm. In [8], the convergence of the order $O(1/N)$ was analyzed by focusing on the second-order nonlinear recurrence formula consisting of the first- and second-order terms of the Taylor expansion of the defining function of the Arimoto-Blahut algorithm. However, in [8], an infinite number of inequalities were assumed as a "conjecture," and proofs were given based on the conjecture. In this paper, we report a proof of the convergence of the order $O(1/N)$ for a class of channel matrices without assuming the conjecture. The correctness of the proof will be confirmed by several numerical examples.
Submitted 11 September, 2022;
originally announced September 2022.
-
Schrödinger Risk Diversification Portfolio
Authors:
Yusuke Uchiyama,
Kei Nakagawa
Abstract:
The mean-variance portfolio that considers the trade-off between expected return and risk has been widely used in the problem of asset allocation for multi-asset portfolios. However, since it is difficult to estimate the expected return and the out-of-sample performance of the mean-variance portfolio is poor, risk-based portfolio construction methods focusing only on risk have been proposed, and are attracting attention mainly in practice. In terms of risk, asset fluctuations that make up the portfolio are thought to have common factors behind them, and principal component analysis, which is a dimension reduction method, is applied to extract the factors. In this study, we propose the Schrödinger risk diversification portfolio as a factor risk diversifying portfolio using Schrödinger principal component analysis that applies the Schrödinger equation in quantum mechanics. The Schrödinger principal component analysis can accurately estimate the factors even if the sample points are unequally spaced or in a small number, thus we can make efficient risk diversification. The proposed method was verified to outperform the conventional risk parity and other risk diversification portfolio constructions.
Submitted 20 February, 2022;
originally announced February 2022.
-
GraphTune: A Learning-based Graph Generative Model with Tunable Structural Features
Authors:
Kohei Watabe,
Shohei Nakazawa,
Yoshiki Sato,
Sho Tsugawa,
Kenji Nakagawa
Abstract:
Generative models for graphs have been actively studied for decades, and they have a wide range of applications. Recently, learning-based graph generation that reproduces real-world graphs has been attracting the attention of many researchers. Although several generative models that utilize modern machine learning technologies have been proposed, conditional generation of general graphs has been less explored in the field. In this paper, we propose a generative model that allows us to tune the value of a global-level structural feature as a condition. Our model, called GraphTune, makes it possible to tune the value of any structural feature of generated graphs using Long Short Term Memory (LSTM) and a Conditional Variational AutoEncoder (CVAE). We performed comparative evaluations of GraphTune and conventional models on a real graph dataset. The evaluations show that GraphTune makes it possible to tune the value of a global-level structural feature more clearly than conventional models.
Submitted 5 April, 2023; v1 submitted 27 January, 2022;
originally announced January 2022.
-
Fractional SDE-Net: Generation of Time Series Data with Long-term Memory
Authors:
Kohei Hayashi,
Kei Nakagawa
Abstract:
In this paper, we focus on the generation of time-series data using neural networks. It is often the case that input time-series data have only one realized (and usually irregularly sampled) path, which makes it difficult to extract time-series characteristics, and its noise structure is more complicated than the i.i.d. type. Time-series data, especially from hydrology, telecommunications, economics, and finance, exhibit long-term memory, also called long-range dependency (LRD). The main purpose of this paper is to artificially generate time series with the help of neural networks, taking the LRD of paths into account. We propose fSDE-Net: neural fractional Stochastic Differential Equation Network. It generalizes the neural stochastic differential equation model by using fractional Brownian motion with a Hurst index larger than half, which exhibits the LRD property. We derive the solver of fSDE-Net and theoretically analyze the existence and uniqueness of the solution to fSDE-Net. Our experiments with artificial and real time-series data demonstrate that the fSDE-Net model can replicate distributional properties well.
Submitted 23 August, 2022; v1 submitted 16 January, 2022;
originally announced January 2022.
-
Improving Nonparametric Classification via Local Radial Regression with an Application to Stock Prediction
Authors:
Ruixing Cao,
Akifumi Okuno,
Kei Nakagawa,
Hidetoshi Shimodaira
Abstract:
For supervised classification problems, this paper considers estimating the query's label probability through local regression using observed covariates. The well-known nonparametric kernel smoother and $k$-nearest neighbor ($k$-NN) estimator, which take the label average over a ball around the query, are consistent but asymptotically biased, particularly for a large radius of the ball. To eradicate such bias, local polynomial regression (LPoR) and multiscale $k$-NN (MS-$k$-NN) learn the bias term by local regression around the query and extrapolate it to the query itself. However, their theoretical optimality has been shown only in the limit of an infinite number of training samples. For correcting the asymptotic bias with fewer observations, this paper proposes local radial regression (LRR) and its logistic regression variant called local radial logistic regression (LRLR), by combining the advantages of LPoR and MS-$k$-NN. The idea is quite simple: we fit the local regression to observed labels by taking only the radial distance as the explanatory variable and then extrapolate the estimated label probability to zero distance. The usefulness of the proposed method is shown theoretically and experimentally. We prove the convergence rate of the $L^2$ risk for LRR with reference to MS-$k$-NN, and our numerical experiments, including real-world datasets of daily stock indices, demonstrate that LRLR outperforms LPoR and MS-$k$-NN.
Submitted 21 July, 2022; v1 submitted 27 December, 2021;
originally announced December 2021.
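The radial-regression idea described in the abstract above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function name `local_radial_regression`, the neighbourhood size `k`, and the plain polynomial in the distance `r` are assumptions made here, and ordinary least squares stands in for whatever fitting procedure the paper actually uses.

```python
import numpy as np

def local_radial_regression(X, y, query, k=50, degree=2):
    """Estimate the label probability at `query` by regressing labels on radial distance.

    Hypothetical sketch: the polynomial form and the k-nearest neighbourhood are assumptions.
    """
    r = np.linalg.norm(X - query, axis=1)   # radial distance of every sample to the query
    idx = np.argsort(r)[:k]                 # keep the k samples closest to the query
    # Design matrix with columns 1, r, r^2, ..., r^degree
    A = np.vander(r[idx], N=degree + 1, increasing=True)
    coef, *_ = np.linalg.lstsq(A, y[idx], rcond=None)
    # Extrapolating the fitted curve to r = 0 leaves only the intercept
    return float(np.clip(coef[0], 0.0, 1.0))
```

The only explanatory variable is the scalar distance `r`, so the regression stays one-dimensional regardless of the number of covariates, and the value returned is the fitted curve evaluated at zero distance.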
-
WebRTC-based measurement tool for peer-to-peer applications and preliminary findings with real users
Authors:
Kosuke Nakagawa,
Manabu Tsukada,
Keiichi Shima,
Hiroshi Esaki
Abstract:
Direct peer-to-peer (P2P) communication is often used to minimize the end-to-end latency for real-time applications that require accurate synchronization, such as remote musical ensembles. However, there are few studies on the performance of P2P communication between home network environments, thus hindering the deployment of services that require synchronization. In this study, we developed a P2P performance measurement tool using the Web Real-Time Communication (WebRTC) statistics application programming interface. Using this tool, we can easily measure P2P performance between home network environments on a web browser without downloading client applications. We also verified the reliability of round-trip time (RTT) measurements using WebRTC and confirmed that our system could provide the necessary measurement accuracy for RTT and jitter measurements for real-time applications. In addition, we measured the performance of a full mesh topology connection with 10 users in an actual environment in Japan. Consequently, we found that only 66% of the peer connections had a latency of 30 ms or less, which is the minimum requirement for high synchronization applications, such as musical ensembles.
Submitted 3 December, 2021;
originally announced December 2021.
-
Continuity of isomorphisms applied to rigidity problems of entropy spectra
Authors:
Katsukuni Nakagawa
Abstract:
For a fixed topological Markov shift, we consider measure-preserving dynamical systems of Gibbs measures for 2-locally constant functions on the shift. We also consider isomorphisms between two such systems. We study the set of all 2-locally constant functions $f$ on the shift such that all those isomorphisms defined on the system associated with $f$ are induced from automorphisms of the shift. We prove that this set contains a full-measure open set of the space of all 2-locally constant functions on the shift. We apply this result to rigidity problems of entropy spectra and show that the strong non-rigidity occurs if and only if so does the weak non-rigidity.
Submitted 28 October, 2021; v1 submitted 11 October, 2021;
originally announced October 2021.
-
Dispersive coherent Brillouin scattering spectroscopy
Authors:
Ayumu Ishijima,
Shinga Okabe,
Ichiro Sakuma,
Keiichi Nakagawa
Abstract:
Frequency- and time-domain Brillouin scattering spectroscopy are powerful tools to read out the mechanical properties of complex systems in material and life sciences. Indeed, coherent acoustic phonons in the time-domain method offer superior depth resolution and a stronger signal than incoherent acoustic phonons in the frequency-domain method. However, the time-domain method does not allow multichannel detection and, therefore, falls short in signal acquisition speed. Here, we present Brillouin scattering spectroscopy that spans the time and frequency domains to allow the multichannel detection of Brillouin scattering light from coherent acoustic phonons. Our technique maps the time-evolved Brillouin oscillations at the instantaneous frequency of a chromatic-dispersed laser pulse. The spectroscopic heterodyning of Brillouin oscillations in the frequency domain enhances the signal acquisition speed by at least 100-fold over the time-domain method. As a proof of concept, we imaged heterogeneous thin films and biological cells over a wide bandwidth with nanometer depth resolution. We, therefore, foresee that our approach catalyzes future phonon spectroscopy toward real-time mechanical imaging.
Submitted 11 May, 2022; v1 submitted 4 September, 2021;
originally announced September 2021.
-
High resolution IR spectroscopy and imaging based on graphene micro emitters
Authors:
Kenta Nakagawa,
Yui Shimura,
Yusuke Fukazawa,
Ryosuke Nishizaki,
Shinichiro Matano,
Hideyuki Maki
Abstract:
IR spectroscopy, such as Fourier transform infrared spectroscopy (FTIR), is widely used for the investigation of structure and the quantitative determination of substances in the fields of chemistry, physics, biology, medicine, and astronomy, because the energy of IR absorption corresponds to the energy for each vibrational transition in functional groups within molecules. Microscopic imaging of FTIR is used for various practical applications, as it enables visualization of the composition distribution and changes in molecular structure without fluorescent labels. However, FTIR microscopy with an objective lens has a diffraction limit, which results in a low spatial resolution on the order of 10 $μ$m. Here, we present high-spatial-resolution IR spectroscopy and imaging based on graphene micro-emitters, which have distinct features over conventional IR sources: a planar structure, bright intensity, a small footprint (sub-$μ$m$^2$), and high modulation speed of ~100 kHz. We performed IR absorption spectroscopy on a polymer thin film using graphene micro-emitters, realizing high-resolution IR imaging with a spatial resolution of ~2 $μ$m, far higher than that of conventional FTIR. We show two-dimensional IR chemical imaging that visualizes the distribution of chemical information, such as molecular species and functional groups. This technique can open new routes for novel IR imaging and microanalysis in material science, physics, chemistry, biology, and medicine.
Submitted 6 August, 2021;
originally announced August 2021.
-
Training of deep cross-modality conversion models with a small dataset, and their application in megavoltage CT to kilovoltage CT conversion
Authors:
Sho Ozaki,
Shizuo Kaji,
Kanabu Nawa,
Toshikazu Imae,
Atsushi Aoki,
Takahiro Nakamoto,
Takeshi Ohta,
Yuki Nozawa,
Hideomi Yamashita,
Akihiro Haga,
Keiichi Nakagawa
Abstract:
In recent years, deep-learning-based image processing has emerged as a valuable tool for medical imaging owing to its high performance. However, the quality of deep-learning-based methods heavily relies on the amount of training data; the high cost of acquiring a large dataset is a limitation to their utilization in medical fields. Herein, based on deep learning, we developed a computed tomography (CT) modality conversion method requiring only a few unsupervised images. The proposed method is based on CycleGAN with several extensions tailored for CT images, which aims at preserving the structure in the processed images and reducing the amount of training data. This method was applied to realize the conversion of megavoltage computed tomography (MVCT) to kilovoltage computed tomography (kVCT) images. Training was conducted using several datasets acquired from patients with head and neck cancer. The size of the datasets ranged from 16 slices (two patients) to 2745 slices (137 patients) for MVCT and 2824 slices (98 patients) for kVCT. The required size of the training data was found to be as small as a few hundred slices. By statistical and visual evaluations, the quality improvement and structure preservation of the MVCT images converted by the proposed model were investigated. As a clinical benefit, it was observed by medical doctors that the converted images enhanced the precision of contouring. We developed an MVCT to kVCT conversion model based on deep learning, which can be trained using only a few hundred unpaired images. The stability of the model against changes in data size was demonstrated. This study promotes the reliable use of deep learning in clinical medicine by partially answering commonly asked questions, such as "Is our data sufficient?" and "How much data should we acquire?"
Submitted 5 April, 2022; v1 submitted 12 July, 2021;
originally announced July 2021.
-
Enhancing thermopower and Nernst signal of high-mobility Dirac carriers by Fermi level tuning in the layered magnet EuMnBi$_2$
Authors:
Keigo Tsuruda,
Kento Nakagawa,
Masayuki Ochi,
Kazuhiko Kuroki,
Masashi Tokunaga,
Hiroshi Murakawa,
Noriaki Hanasaki,
Hideaki Sakai
Abstract:
Dirac/Weyl semimetals hosting linearly-dispersing bands have received recent attention for potential thermoelectric applications, since their ultrahigh-mobility carriers could generate large thermoelectric and Nernst power factors. To optimize these efficiencies, the Fermi energy needs to be chemically controlled in a wide range, which is generally difficult in bulk materials because of disorder effects from the substituted ions. Here it is shown that the Fermi energy is tunable across the Dirac point for layered magnet EuMnBi$_2$ by partially substituting Gd$^{3+}$ for Eu$^{2+}$ in the insulating block layer, which dopes electrons into the Dirac fermion layer without degrading the mobility. Clear quantum oscillation observed even in the doped samples allows us to quantitatively estimate the Fermi energy shift and optimize the power factor (exceeding 100 $μ$W/K$^2$cm at low temperatures) in combination with the first-principles calculation. Furthermore, it is shown that Nernst signal steeply increases with decreasing carrier density beyond a simple theoretical prediction, which likely originates from the field-induced gap reduction of the Dirac band due to the exchange interaction with the Eu moments. Thus, the magnetic block layer provides high controllability for the Dirac fermions in EuMnBi$_2$, which would make this series of materials an appealing platform for novel transport phenomena.
Submitted 12 May, 2021;
originally announced May 2021.
-
A Tunable Model for Graph Generation Using LSTM and Conditional VAE
Authors:
Shohei Nakazawa,
Yoshiki Sato,
Kenji Nakagawa,
Sho Tsugawa,
Kohei Watabe
Abstract:
With the development of graph applications, generative models for graphs have become more crucial. Classically, stochastic models that generate graphs with a pre-defined probability of edges and nodes have been studied. Recently, some models that reproduce the structural features of graphs by learning from actual graph data using machine learning have been studied. However, in these conventional machine-learning-based studies, structural features of graphs can be learned from data, but it is not possible to tune features and generate graphs with specific features. In this paper, we propose a generative model that can tune specific features while learning structural features of a graph from data. With a dataset of graphs with various features generated by a stochastic model, we confirm that our model can generate a graph with specific features.
Submitted 15 April, 2021;
originally announced April 2021.
-
CP-violating supersymmetry anomaly
Authors:
Koichiro Nakagawa,
Yu Nakayama
Abstract:
We show that CP-violating Weyl anomaly induces a supersymmetry anomaly in the formulation of superconformal supergravity as is observed in CP-preserving cases. This supersymmetry anomaly can be removed in the old minimal supergravity by adding suitable local counterterms, and it becomes a consistent theory.
Submitted 18 March, 2021;
originally announced March 2021.
-
No-Transaction Band Network: A Neural Network Architecture for Efficient Deep Hedging
Authors:
Shota Imaki,
Kentaro Imajo,
Katsuya Ito,
Kentaro Minami,
Kei Nakagawa
Abstract:
Deep hedging (Buehler et al. 2019) is a versatile framework to compute the optimal hedging strategy of derivatives in incomplete markets. However, this optimal strategy is hard to train due to action dependence, that is, the appropriate hedging action at the next step depends on the current action. To overcome this issue, we leverage the idea of a no-transaction band strategy, which is an existing technique that gives optimal hedging strategies for European options and the exponential utility. We theoretically prove that this strategy is also optimal for a wider class of utilities and derivatives including exotics. Based on this result, we propose a no-transaction band network, a neural network architecture that facilitates fast training and precise evaluation of the optimal hedging strategy. We experimentally demonstrate that for European and lookback options, our architecture quickly attains a better hedging strategy in comparison to a standard feed-forward network.
Submitted 2 March, 2021;
originally announced March 2021.
-
Controlling False Discovery Rates under Cross-Sectional Correlations
Authors:
Junpei Komiyama,
Masaya Abe,
Kei Nakagawa,
Kenichiro McAlinn
Abstract:
We consider controlling the false discovery rate for testing many time series with an unknown cross-sectional correlation structure. Given a large number of hypotheses, false and missing discoveries can plague an analysis. While many procedures have been proposed to control false discovery, most of them either assume independent hypotheses or lack statistical power. A problem of particular interest is in financial asset pricing, where the goal is to determine which "factors" lead to excess returns out of a large number of potential factors. Our contribution is two-fold. First, we show the consistency of Fama and French's prominent method under multiple testing. Second, we propose a novel method for false discovery control using double bootstrapping. We achieve superior statistical power to existing methods and prove that the false discovery rate is controlled. Simulations and a real data application illustrate the efficacy of our method over existing methods.
Submitted 9 June, 2021; v1 submitted 15 February, 2021;
originally announced February 2021.
-
Trader-Company Method: A Metaheuristic for Interpretable Stock Price Prediction
Authors:
Katsuya Ito,
Kentaro Minami,
Kentaro Imajo,
Kei Nakagawa
Abstract:
Investors try to predict returns of financial assets to make successful investments. Many quantitative analysts have used machine learning-based methods to find unknown profitable market rules from large amounts of market data. However, there are several challenges in financial markets hindering practical applications of machine learning-based models. First, in financial markets, there is no single model that can consistently make accurate predictions because traders in markets quickly adapt to newly available information. Instead, there are a number of ephemeral and partially correct models called "alpha factors". Second, since financial markets are highly uncertain, ensuring interpretability of prediction models is quite important to make reliable trading strategies. To overcome these challenges, we propose the Trader-Company method, a novel evolutionary model that mimics the roles of a financial institution and the traders belonging to it. Our method predicts future stock returns by aggregating suggestions from multiple weak learners called Traders. A Trader holds a collection of simple mathematical formulae, each of which represents a candidate alpha factor and would be interpretable for real-world investors. The aggregation algorithm, called a Company, maintains multiple Traders. By randomly generating new Traders and retraining them, Companies can efficiently find financially meaningful formulae whilst avoiding overfitting to a transient state of the market. We show the effectiveness of our method by conducting experiments on real market data.
Submitted 18 December, 2020;
originally announced December 2020.
-
Deep Portfolio Optimization via Distributional Prediction of Residual Factors
Authors:
Kentaro Imajo,
Kentaro Minami,
Katsuya Ito,
Kei Nakagawa
Abstract:
Recent developments in deep learning techniques have motivated intensive research in machine learning-aided stock trading strategies. However, since the financial market has a highly non-stationary nature hindering the application of typical data-hungry machine learning methods, leveraging financial inductive biases is important to ensure better sample efficiency and robustness. In this study, we propose a novel method of constructing a portfolio based on predicting the distribution of a financial quantity called residual factors, which is known to be generally useful for hedging the risk exposure to common market factors. The key technical ingredients are twofold. First, we introduce a computationally efficient extraction method for the residual information, which can be easily combined with various prediction algorithms. Second, we propose a novel neural network architecture that allows us to incorporate widely acknowledged financial inductive biases such as amplitude invariance and time-scale invariance. We demonstrate the efficacy of our method on U.S. and Japanese stock market data. Through ablation experiments, we also verify that each individual technique contributes to improving the performance of trading strategies. We anticipate our techniques may have wide applications in various financial problems.
Submitted 13 December, 2020;
originally announced December 2020.
-
Mean-Variance Efficient Reinforcement Learning by Expected Quadratic Utility Maximization
Authors:
Masahiro Kato,
Kei Nakagawa,
Kenshi Abe,
Tetsuro Morimura
Abstract:
Risk management is critical in decision making, and mean-variance (MV) trade-off is one of the most common criteria. However, in reinforcement learning (RL) for sequential decision making under uncertainty, most of the existing methods for MV control suffer from computational difficulties caused by the double sampling problem. In this paper, in contrast to strict MV control, we consider learning MV efficient policies that achieve Pareto efficiency regarding MV trade-off. To achieve this purpose, we train an agent to maximize the expected quadratic utility function, a common objective of risk management in finance and economics. We call our approach direct expected quadratic utility maximization (EQUM). The EQUM does not suffer from the double sampling issue because it does not include gradient estimation of variance. We confirm that the maximizer of the objective in the EQUM directly corresponds to an MV efficient policy under a certain condition. We conduct experiments with benchmark settings to demonstrate the effectiveness of the EQUM.
Submitted 5 September, 2021; v1 submitted 3 October, 2020;
originally announced October 2020.
-
Conformation of ultra-long-chain fatty acid in lipid bilayer: Molecular dynamics study
Authors:
Kazutomo Kawaguchi,
Koh M. Nakagawa,
Satoshi Nakagawa,
Hideo Shindou,
Hidemi Nagao,
Hiroshi Noguchi
Abstract:
Ultra-long-chain fatty acids (ULCFAs) are biosynthesized in restricted tissues such as the retina, testis, and skin. The conformation of a single ULCFA, in which the sn-1 unsaturated chain has 32 carbons, in three types of tensionless phospholipid bilayers is studied by molecular dynamics simulations. It is found that the ultra-long tail of the ULCFA flips between the two leaflets and fluctuates among three shapes: elongated into the opposite leaflet, lying between the two leaflets, and turned back. As the number ratio of lipids in the opposite leaflet increases, the ratio of the elongated shape linearly decreases in all three cases. Thus, ULCFAs can sense the density differences between the two leaflets and respond to these changes.
Submitted 11 October, 2022; v1 submitted 25 September, 2020;
originally announced September 2020.
-
Analysis of the Convergence Speed of the Arimoto-Blahut Algorithm by the Second Order Recurrence Formula
Authors:
Kenji Nakagawa,
Yoshinori Takei,
Shin-ichiro Hara,
Kohei Watabe
Abstract:
In this paper, we investigate the convergence speed of the Arimoto-Blahut algorithm. For many channel matrices the convergence is exponential, but for some channel matrices it is slower than exponential. By analyzing the Taylor expansion of the defining function of the Arimoto-Blahut algorithm, we will make the conditions clear for the exponential or slower convergence. The analysis of the slow convergence is new in this paper. Based on the analysis, we will compare the convergence speed of the Arimoto-Blahut algorithm numerically with the values obtained in our theorems for several channel matrices. The purpose of this paper is a complete understanding of the convergence speed of the Arimoto-Blahut algorithm.
Submitted 17 September, 2020;
originally announced September 2020.
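As background for the abstract above, the textbook form of the Arimoto-Blahut iteration for channel capacity can be sketched as follows. This is a generic illustration, not the paper's analysis: the names `arimoto_blahut` and `_kl_rows`, the fixed iteration count, and the uniform starting point are choices made here.

```python
import numpy as np

def _kl_rows(P, q):
    """D[i] = KL(P[i] || q) in nats, using the convention 0 * log 0 = 0."""
    safe_q = np.where(q > 0, q, 1.0)            # avoid division by zero for unused outputs
    ratio = np.where(P > 0, P / safe_q, 1.0)    # entries with P = 0 contribute log(1) = 0
    return np.sum(P * np.log(ratio), axis=1)

def arimoto_blahut(P, n_iter=200):
    """P[i, j] = probability of output j given input i.

    Returns (capacity estimate in bits, input distribution after n_iter steps).
    """
    P = np.asarray(P, dtype=float)
    lam = np.full(P.shape[0], 1.0 / P.shape[0])   # start from the uniform input distribution
    for _ in range(n_iter):
        D = _kl_rows(P, lam @ P)                  # divergence of each row from the output law
        lam = lam * np.exp(D)                     # multiplicative update of the input distribution
        lam /= lam.sum()
    capacity_nats = float(lam @ _kl_rows(P, lam @ P))
    return capacity_nats / np.log(2), lam
```

Tracking how quickly `lam` settles across iterations for different channel matrices `P` is one simple way to observe the exponential versus slower-than-exponential convergence that the abstract discusses.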
-
Weak rigidity of entropy spectra
Authors:
Katsukuni Nakagawa
Abstract:
In this paper, we consider entropy spectra on topological Markov shifts. We prove that if two measure-preserving dynamical systems of Gibbs measures with Hölder continuous potentials are isomorphic, then their entropy spectra are the same. This result raises a new rigidity problem. We call this problem the weak rigidity problem, contrasting it with the strong rigidity problem proposed by Barreira and Saraiva. We give a complete answer to the weak rigidity problem for Markov measures on a topological Markov shift with an aperiodic transition matrix of size 2.
Submitted 25 September, 2020; v1 submitted 14 September, 2020;
originally announced September 2020.
-
Decomposition of Normal Operators and Its Application to Spectral Theorem
Authors:
Katsukuni Nakagawa
Abstract:
A decomposition theorem for self-adjoint operators proved by Riesz and Lorch is extended to normal operators. This extension gives a new proof of the spectral theorem for unbounded normal operators.
Submitted 28 June, 2020; v1 submitted 8 June, 2020;
originally announced June 2020.
-
Compactness of Transfer Operators and Spectral Representation of Ruelle Zeta Functions for Super-continuous Functions
Authors:
Katsukuni Nakagawa
Abstract:
Transfer operators and Ruelle zeta functions for super-continuous functions on one-sided topological Markov shifts are considered. For every super-continuous function, we construct a Banach space on which the associated transfer operator is compact. Using this Banach space, we establish the trace formula and spectral representation of Ruelle zeta functions for a certain class of super-continuous functions. Our results include, as a special case, the classical trace formula and spectral representation for the class of locally constant functions.
Submitted 16 June, 2020; v1 submitted 2 June, 2020;
originally announced June 2020.
-
RM-CVaR: Regularized Multiple $β$-CVaR Portfolio
Authors:
Kei Nakagawa,
Shuhei Noma,
Masaya Abe
Abstract:
The problem of finding the optimal portfolio for investors is called the portfolio optimization problem. Such a problem mainly concerns the expectation and variability of return (i.e., mean and variance). Although the variance would be the most fundamental risk measure to be minimized, it has several drawbacks. Conditional Value-at-Risk (CVaR) is a relatively new risk measure that addresses some of the shortcomings of well-known variance-related risk measures, and because of its computational efficiency, it has gained popularity. CVaR is defined as the expected value of the loss that occurs beyond a certain probability level ($β$). However, portfolio optimization problems that use CVaR as a risk measure are formulated with a single $β$ and may output significantly different portfolios depending on how $β$ is selected. We confirm that even small changes in $β$ can result in huge changes in the whole portfolio structure. To address this problem, we propose RM-CVaR: Regularized Multiple $β$-CVaR Portfolio. We perform experiments on well-known benchmarks to evaluate the proposed portfolio. Compared with various portfolios, RM-CVaR demonstrates superior performance, with both higher risk-adjusted returns and lower maximum drawdown.
Submitted 9 May, 2020; v1 submitted 28 April, 2020;
originally announced April 2020.
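The CVaR definition quoted in the abstract above can be illustrated with a simple empirical estimator. This is a sketch under stated assumptions: the function name `empirical_cvar` is hypothetical, and averaging all sampled losses at or above the empirical $β$-quantile is only one common convention, not necessarily the formulation used in the paper.

```python
import numpy as np

def empirical_cvar(losses, beta):
    """Average of the sampled losses at or above the beta-quantile (illustrative estimator)."""
    losses = np.asarray(losses, dtype=float)
    var = np.quantile(losses, beta)     # empirical Value-at-Risk at level beta
    tail = losses[losses >= var]        # losses in the tail beyond that level
    return float(tail.mean())
```

On the losses 1, 2, ..., 10, `empirical_cvar(range(1, 11), 0.8)` averages the two largest losses and gives 9.5, whereas `beta=0.5` averages the five largest and gives 8.0, a small-scale view of how the choice of $β$ changes the risk figure.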
-
Reconstructing particle number distributions with convoluting volume fluctuations
Authors:
ShinIchi Esumi,
Kana Nakagawa,
Toshihiro Nonaka
Abstract:
We propose methods to reconstruct particle distributions with and without considering initial volume fluctuations. This approach enables us to correct for detector efficiencies and initial volume fluctuations simultaneously. Our study suggests that such a tool could be used to investigate the possible bimodal structure of the net-proton distribution in Au+Au collisions at $\sqrt{s_{\rm NN}} = 7.7$ GeV, a signature of a first-order phase transition and critical point [arXiv:1804.04463, arXiv:1811.04456].
Submitted 8 November, 2020; v1 submitted 25 February, 2020;
originally announced February 2020.
-
Cross-sectional Stock Price Prediction using Deep Learning for Actual Investment Management
Authors:
Masaya Abe,
Kei Nakagawa
Abstract:
Stock price prediction has been an important research theme both academically and practically. Various methods to predict stock prices have been studied until now. The feature that explains the stock price by a cross-section analysis is called a "factor" in the field of finance. Many empirical studies in finance have identified which stocks having features in the cross-section relatively increase and which decrease in terms of price. Recently, stock price prediction methods using machine learning, especially deep learning, have been proposed since the relationship between these factors and stock prices is complex and non-linear. However, there are no practical examples for actual investment management. In this paper, therefore, we present a cross-sectional daily stock price prediction framework using deep learning for actual investment management. For example, we build a portfolio with information available at the time of market closing and invest at the time of market opening the next day. We perform empirical analysis in the Japanese stock market and confirm the profitability of our framework.
Submitted 17 February, 2020;
originally announced February 2020.
-
TPLVM: Portfolio Construction by Student's $t$-process Latent Variable Model
Authors:
Yusuke Uchiyama,
Kei Nakagawa
Abstract:
Optimal asset allocation is a key topic in modern finance theory. To realize the optimal asset allocation given an investor's risk aversion, various portfolio construction methods have been proposed. Recently, applications of machine learning have been growing rapidly in the area of finance. In this article, we propose the Student's $t$-process latent variable model (TPLVM) to describe non-Gaussian fluctuations of financial time series by lower-dimensional latent variables. Subsequently, we apply the TPLVM to the minimum-variance portfolio as an alternative to existing nonlinear factor models. To test the performance of the proposed portfolio, we construct minimum-variance portfolios of global stock market indices based on the TPLVM or the Gaussian process latent variable model. By comparing these portfolios, we confirm that the proposed portfolio outperforms that of the existing Gaussian process latent variable model.
Submitted 28 January, 2020;
originally announced February 2020.
-
CP-violating super Weyl anomaly
Authors:
Koichiro Nakagawa,
Yu Nakayama
Abstract:
In CP-violating conformal field theories in four dimensions, the Pontryagin density can appear in the Weyl anomaly. The Pontryagin density in the Weyl anomaly is consistent, but it has a peculiar feature that the parent three-point function of the energy-momentum tensor can violate CP only (semi-)locally. In this paper, we study the supersymmetric completion of the Pontryagin density in the Weyl anomaly, where the central charge $c$ effectively becomes a complex number. The supersymmetry suggests that it accompanies the graviphoton $θ$ term associated with the R-symmetry gauging in the Weyl anomaly. It also accompanies new CP-violating terms in the R-current anomaly. While there are no conclusive perturbative examples of the CP-violating super Weyl anomaly, we construct an explicit supersymmetric dilaton effective action which generates these anomalies.
Submitted 18 March, 2021; v1 submitted 3 February, 2020;
originally announced February 2020.
-
NAPLES;Mining the lead-lag Relationship from Non-synchronous and High-frequency Data
Authors:
Katsuya Ito,
Kei Nakagawa
Abstract:
In time-series analysis, the term "lead-lag effect" is used to describe a delayed effect on a given time series caused by another time series. Lead-lag effects are ubiquitous in practice and are especially critical in formulating investment strategies in high-frequency trading. At present, there are three major challenges in analyzing lead-lag effects. First, in practical applications, not all time series are observed synchronously. Second, the size of the relevant datasets and the rate of change of the environment are increasing, and it is becoming more difficult to complete the computation within a particular time limit. Third, some lead-lag effects are time-varying and only last for a short period, and their delay lengths are often affected by external factors. In this paper, we propose NAPLES (Negative And Positive lead-lag EStimator), a new statistical measure that resolves all these problems. Through experiments on artificial and real datasets, we demonstrate that NAPLES has a strong correlation with the actual lead-lag effects, including those triggered by significant macroeconomic announcements.
Submitted 3 February, 2020;
originally announced February 2020.
-
A Robust Transferable Deep Learning Framework for Cross-sectional Investment Strategy
Authors:
Kei Nakagawa,
Masaya Abe,
Junpei Komiyama
Abstract:
Stock return predictability is an important research theme as it reflects our economic and social organization, and significant efforts are made to explain the dynamism therein. Statistics with strong explanatory power, called "factors", have been proposed to summarize the essence of predictive stock returns. Although machine learning methods are increasingly popular in stock return prediction, inference of stock returns remains highly elusive, and most investors still rely, at least partly, on their intuition to make decisions. The challenge here is to build an investment strategy that is consistent over a reasonably long period, with minimal human decision-making in the entire process. To this end, we propose a new stock return prediction framework that we call Ranked Information Coefficient Neural Network (RIC-NN). RIC-NN is a deep learning approach and includes the following three novel ideas: (1) a nonlinear multi-factor approach, (2) stopping criteria with the ranked information coefficient (rank IC), and (3) deep transfer learning among multiple regions. Experimental comparison with the stocks in the Morgan Stanley Capital International (MSCI) indices shows that RIC-NN outperforms not only off-the-shelf machine learning methods but also the average return of major equity investment funds in the last fourteen years.
Submitted 2 October, 2019;
originally announced October 2019.
-
Government Expenditure on Research Plans and their Diversity
Authors:
Ryosuke Ishii,
Kuninori Nakagawa
Abstract:
In this study, we consider research and development investment by the government. Our study is motivated by the bias in the budget allocation owing to the competitive funding system. In our model, each researcher presents research plans and expenses, and the government selects a research plan in two periods---before and after the government knows its favorite plan---and spends funds on the adopted program in each period. We demonstrate that, in a subgame perfect equilibrium, the government adopts equally as many active plans as possible. In an equilibrium, the selected plans are distributed proportionally. Thus, the investment in research projects is symmetric and unbiased. Our results imply that equally widespread expenditure across all research fields is better than the selection of and concentration in some specific fields.
Submitted 3 August, 2019;
originally announced August 2019.
-
Fast Statistical Iterative Reconstruction for MVCT in TomoTherapy
Authors:
Sho Ozaki,
Akihiro Haga,
Edward Chao,
Calvin Maurer,
Kanabu Nawa,
Takeshi Ohta,
Takahiro Nakamoto,
Yuki Nozawa,
Taiki Magome,
Masahiro Nakano,
Keiichi Nakagawa
Abstract:
Statistical iterative reconstruction is expected to improve the image quality of megavoltage computed tomography (MVCT). However, one of the challenges of iterative reconstruction is its large computational cost. The purpose of this work is to develop a fast iterative reconstruction algorithm by combining several iterative techniques and by optimizing reconstruction parameters. Megavolt projection data was acquired from a TomoTherapy system and reconstructed using our statistical iterative reconstruction. Total variation was used as the regularization term, and the weight of the regularization term was determined by evaluating signal-to-noise ratio (SNR), contrast-to-noise ratio (CNR), and visual assessment of spatial resolution using Gammex and Cheese phantoms. Gradient descent with an adaptive convergence parameter, ordered subset expectation maximization (OSEM), and CPU/GPU parallelization were applied in order to accelerate the present reconstruction algorithm. The SNR and CNR of the iterative reconstruction were several times better than those of filtered back projection (FBP). The GPU parallelization code combined with the OSEM algorithm reconstructed an image several hundred times faster than a CPU calculation. With 500 iterations, which provided good convergence, our method produced a 512$\times$512 pixel image within a few seconds. The image quality of the present algorithm was much better than that of FBP for patient data. An image from the iterative reconstruction in TomoTherapy can be obtained within a few seconds by fine-tuning the parameters. The iterative reconstruction with GPU was fast enough for clinical use and largely improved the MVCT images.
Submitted 24 March, 2019;
originally announced March 2019.
-
Use of Ghost Cytometry to Differentiate Cells with Similar Gross Morphologic Characteristics
Authors:
Hiroaki Adachi,
Yoko Kawamura,
Keiji Nakagawa,
Ryoichi Horisaki,
Issei Sato,
Satoko Yamaguchi,
Katsuhito Fujiu,
Kayo Waki,
Hiroyuki Noji,
Sadao Ota
Abstract:
Imaging flow cytometry shows significant potential for increasing our understanding of heterogeneous and complex life systems and is useful for biomedical applications. Ghost cytometry is a recently proposed approach for directly analyzing compressively measured signals, thereby relieving the computational bottleneck observed in high-throughput cytometry based on morphological information. While this image-free approach could distinguish different cell types using the same fluorescence staining method, further strict controls are sometimes required to clearly demonstrate that the classification is based on detailed morphologic analysis. In this study, we show that ghost cytometry can be used to classify cell populations of the same type but with different fluorescence distributions in space, supporting the strength of our image-free approach for morphologic cell analysis.
Submitted 22 March, 2019;
originally announced March 2019.
-
Trion-based High-speed Electroluminescence from Semiconducting Carbon Nanotube Films
Authors:
Hidenori Takahashi,
Yuji Suzuki,
Norito Yoshida,
Kenta Nakagawa,
Hideyuki Maki
Abstract:
High-speed light emitters integrated on silicon chips can enable novel architectures for silicon-based optoelectronics, such as on-chip optical interconnects and silicon photonics. However, conventional light sources based on compound semiconductors face major challenges for their integration with silicon-based platforms because of the difficulty of their direct growth on a silicon substrate. Here, we report high-brightness, high-speed, ultra-small-size on-chip electroluminescence (EL) emitters based on semiconducting single-walled carbon nanotube (SWNT) thin films. The peaks of the EL emission spectra are red-shifted by 0.2 eV from the peaks of the absorption and photoluminescence emission spectra, which suggests emission from trions. High-speed responses of ~100 ps were experimentally observed from the trion-based EL emitters, which indicates the possibility of several-GHz modulation. Pulsed light generation was also obtained by applying a pulsed voltage. These high-speed and ultra-small-size EL emitters can enable novel on-chip optoelectronic devices for highly integrated optoelectronics and silicon photonics.
Submitted 4 March, 2019;
originally announced March 2019.
-
Deep Recurrent Factor Model: Interpretable Non-Linear and Time-Varying Multi-Factor Model
Authors:
Kei Nakagawa,
Tomoki Ito,
Masaya Abe,
Kiyoshi Izumi
Abstract:
A linear multi-factor model is one of the most important tools in equity portfolio management. Linear multi-factor models are widely used because they can be easily interpreted. However, financial markets are not linear, so the accuracy of these models is limited. Recently, deep learning methods were proposed to predict stock returns in terms of the multi-factor model. Although these methods perform quite well, they have significant disadvantages such as a lack of transparency and limitations in the interpretability of the prediction. It is thus difficult for institutional investors to use black-box-type machine learning techniques in actual investment practice because they must be accountable to their customers. Consequently, the solution we propose is based on LSTM with LRP. Specifically, we extend the linear multi-factor model to be non-linear and time-varying with LSTM. Then, we approximate and linearize the learned LSTM models by LRP. We call this LSTM+LRP model a deep recurrent factor model. Finally, we perform an empirical analysis of the Japanese stock market and show that our recurrent model has better predictive capability than the traditional linear model and fully-connected deep learning methods.
Submitted 20 January, 2019;
originally announced January 2019.
-
Visual enhancement of Cone-beam CT by use of CycleGAN
Authors:
S. Kida,
S. Kaji,
K. Nawa,
T. Imae,
T. Nakamoto,
S. Ozaki,
T. Ohta,
Y. Nozawa,
K. Nakagawa
Abstract:
Cone-beam computed tomography (CBCT) offers advantages over conventional fan-beam CT in that it requires a shorter time and less exposure to obtain images. CBCT has found a wide variety of applications in patient positioning for image-guided radiation therapy, extracting radiomic information for designing patient-specific treatment, and computing fractional dose distributions for adaptive radiation therapy. However, CBCT images suffer from low soft-tissue contrast, noise, and artifacts compared to conventional fan-beam CT images. Therefore, it is essential to improve the image quality of CBCT. In this paper, we propose a synthetic approach to translate CBCT images with deep neural networks. Our method requires only unpaired and unaligned CBCT images and planning fan-beam CT (PlanCT) images for training. Once trained, 3D reconstructed CBCT images can be directly translated to high-quality PlanCT-like images. We demonstrate the effectiveness of our method with images obtained from 24 prostate patients, and we provide a statistical and visual comparison. The image quality of the translated images shows substantial improvement in voxel values, spatial uniformity, and artifact suppression compared to those of the original CBCT. The anatomical structures of the original CBCT images were also well preserved in the translated images. Our method enables more accurate adaptive radiation therapy, and opens up new applications for CBCT that hinge on high-quality images.
Submitted 25 November, 2019; v1 submitted 17 January, 2019;
originally announced January 2019.
-
Complex Valued Risk Diversification
Authors:
Yusuke Uchiyama,
Takanori Kadoya,
Kei Nakagawa
Abstract:
Risk diversification is one of the dominant concerns for portfolio managers. Various portfolio constructions have been proposed to minimize the risk of the portfolio under some constraints, including expected returns. We propose a portfolio construction method that incorporates complex-valued principal component analysis into the risk diversification portfolio construction. The proposed method is verified to outperform the conventional risk parity and risk diversification portfolio constructions.
Submitted 10 October, 2018;
originally announced October 2018.