Skip to main content

Showing 1–19 of 19 results for author: Naganuma, H

.
  1. arXiv:2405.00442  [pdf, other

    stat.ML cs.AI cs.LG

    Geometric Insights into Focal Loss: Reducing Curvature for Enhanced Model Calibration

    Authors: Masanari Kimura, Hiroki Naganuma

    Abstract: The key factor in implementing machine learning algorithms in decision-making situations is not only the accuracy of the model but also its confidence level. The confidence level of a model in a classification problem is often given by the output vector of a softmax function for convenience. However, these values are known to deviate significantly from the actual expected model confidence. This pr… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: This paper is under consideration at Pattern Recognition Letters

  2. arXiv:2404.01334  [pdf, other

    cs.CL cs.LG

    Augmenting NER Datasets with LLMs: Towards Automated and Refined Annotation

    Authors: Yuji Naraki, Ryosuke Yamaki, Yoshikazu Ikeda, Takafumi Horie, Hiroki Naganuma

    Abstract: In the field of Natural Language Processing (NLP), Named Entity Recognition (NER) is recognized as a critical technology, employed across a wide array of applications. Traditional methodologies for annotating datasets for NER models are challenged by high costs and variations in dataset quality. This research introduces a novel hybrid annotation approach that synergizes human effort with the capab… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  3. arXiv:2401.17541  [pdf, other

    cs.LG

    Towards Understanding Variants of Invariant Risk Minimization through the Lens of Calibration

    Authors: Kotaro Yoshida, Hiroki Naganuma

    Abstract: Machine learning models traditionally assume that training and test data are independently and identically distributed. However, in real-world applications, the test distribution often differs from training. This problem, known as out-of-distribution (OOD) generalization, challenges conventional models. Invariant Risk Minimization (IRM) emerges as a solution that aims to identify invariant feature… ▽ More

    Submitted 17 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to TMLR

  4. arXiv:2308.02171  [pdf, other

    cond-mat.mtrl-sci

    First-principle study of spin transport property in $L1_0$-FePd(001)/graphene heterojunction

    Authors: Hayato Adachi, Ryuusuke Endo, Hikari Shinya, Hiroshi Naganuma, Tomoya Ono, Mitsuharu Uemoto

    Abstract: In our previous work, we synthesized a metal/2D material heterointerface consisting of $L1_0$-ordered iron-palladium (FePd) and graphene (Gr) called FePd(001)/Gr. This system has been explored by both experimental measurements and theoretical calculations. In this study, we focus on a heterojunction composed of FePd and multilayer graphene referred to as FePd(001)/$m$-Gr/FePd(001), where $m$ repre… ▽ More

    Submitted 30 December, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: 22 pages, 9 figures

  5. arXiv:2307.08187  [pdf, other

    cs.LG cs.AI

    An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration

    Authors: Hiroki Naganuma, Ryuichiro Hataya, Ioannis Mitliagkas

    Abstract: In out-of-distribution (OOD) generalization tasks, fine-tuning pre-trained models has become a prevalent strategy. Different from most prior work that has focused on advancing learning algorithms, we systematically examined how pre-trained model size, pre-training dataset size, and training strategies impact generalization and uncertainty calibration on downstream tasks. We evaluated 100 models ac… ▽ More

    Submitted 30 May, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

  6. arXiv:2306.11922  [pdf, other

    cs.LG math.OC

    No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths

    Authors: Charles Guille-Escuret, Hiroki Naganuma, Kilian Fatras, Ioannis Mitliagkas

    Abstract: Understanding the optimization dynamics of neural networks is necessary for closing the gap between theory and practice. Stochastic first-order optimization algorithms are known to efficiently locate favorable minima in deep neural networks. This efficiency, however, contrasts with the non-convex and seemingly complex structure of neural loss landscapes. In this study, we delve into the fundamenta… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  7. arXiv:2211.08583  [pdf, other

    cs.LG cs.AI

    Empirical Study on Optimizer Selection for Out-of-Distribution Generalization

    Authors: Hiroki Naganuma, Kartik Ahuja, Shiro Takagi, Tetsuya Motokawa, Rio Yokota, Kohta Ishikawa, Ikuro Sato, Ioannis Mitliagkas

    Abstract: Modern deep learning systems do not generalize well when the test data distribution is slightly different to the training data distribution. While much promising work has been accomplished to address this fragility, a systematic study of the role of optimizers and their out-of-distribution generalization performance has not been undertaken. In this study, we examine the performance of popular firs… ▽ More

    Submitted 5 June, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: Accepted to TMLR

  8. arXiv:2206.11180  [pdf, other

    cs.CV cs.LG stat.ML

    Optimal transport meets noisy label robust loss and MixUp regularization for domain adaptation

    Authors: Kilian Fatras, Hiroki Naganuma, Ioannis Mitliagkas

    Abstract: It is common in computer vision to be confronted with domain shift: images which have the same class but different acquisition conditions. In domain adaptation (DA), one wants to classify unlabeled target images using source labeled images. Unfortunately, deep neural networks trained on a source training set perform poorly on target images which do not belong to the training domain. One strategy t… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  9. arXiv:2203.14495  [pdf, other

    cs.LG cs.CV math.OC

    Conjugate Gradient Method for Generative Adversarial Networks

    Authors: Hiroki Naganuma, Hideaki Iiduka

    Abstract: One of the training strategies of generative models is to minimize the Jensen--Shannon divergence between the model distribution and the data distribution. Since data distribution is unknown, generative adversarial networks (GANs) formulate this problem as a game between two models, a generator and a discriminator. The training can be formulated in the context of game theory and the local Nash equ… ▽ More

    Submitted 20 February, 2023; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted to AISTATS 2023

  10. arXiv:2201.07942  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Density functional study of twisted graphene $L1_0$-FePd heterogeneous interface

    Authors: Mitsuharu Uemoto, Hayato Adachi, Hiroshi Naganuma, Tomoya Ono

    Abstract: Graphene on $L1_0$-FePd(001), which has been experimentally studied in recent years, is a heterogeneous interface with a significant lattice symmetry mismatch between the honeycomb structure of graphene and tetragonal alloy surface. In this work, we report on the density functional study of its atomic-scale configurations, electronic and magnetic properties, and adsorption mechanism, which have no… ▽ More

    Submitted 25 July, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: 10 pages, 11 figures

  11. arXiv:2003.08097  [pdf, other

    cs.DS

    Grammar compression with probabilistic context-free grammar

    Authors: Hiroaki Naganuma, Diptarama Hendrian, Ryo Yoshinaka, Ayumi Shinohara, Naoki Kobayashi

    Abstract: We propose a new approach for universal lossless text compression, based on grammar compression. In the literature, a target string $T$ has been compressed as a context-free grammar $G$ in Chomsky normal form satisfying $L(G) = \{T\}$. Such a grammar is often called a \emph{straight-line program} (SLP). In this paper, we consider a probabilistic grammar $G$ that generates $T$, but not necessarily… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

    Comments: 11 pages, 3 figures, accepted for poster presentation at DCC 2020

  12. Realization of a spin wave switch based on the Spin-Transfer-Torque effect

    Authors: Thomas Meyer, Thomas Brächer, Frank Heussner, Alexander A. Serga, Hiroshi Naganuma, Koki Mukaiyama, Mikihiko Oogane, Yasuo Ando, Burkard Hillebrands, Philipp Pirro

    Abstract: We investigate the amplification of externally excited spin waves via the Spin-Transfer-Torque (STT) effect in combination with the Spin-Hall-Effect (SHE) employing short current pulses. The results reveal that, in the case of an overcompensation of the spin wave dam**, a strong nonlinear shift of the spin wave frequency spectrum occurs. In particular, this shift affects the spin wave amplificat… ▽ More

    Submitted 5 February, 2018; originally announced February 2018.

  13. arXiv:1711.09647  [pdf, other

    cond-mat.mes-hall

    Characterization of Spin-Transfer-Torque effect induced magnetization dynamics driven by short current pulses

    Authors: T. Meyer, T. Brächer, F. Heussner, A. A. Serga, H. Naganuma, K. Mukaiyama, M. Oogane, Y. Ando, B. Hillebrands, P. Pirro

    Abstract: We present a time-resolved study of the magnetization dynamics in a microstructured Cr$|$Heusler$|$Pt waveguide driven by the Spin-Hall-Effect and the Spin-Transfer-Torque effect via short current pulses. In particular, we focus on the determination of the threshold current at which the spin-wave dam** is compensated. We have developed a novel method based on the temporal evolution of the magnon… ▽ More

    Submitted 27 November, 2017; originally announced November 2017.

    Comments: 5 pages, 3 figures. arXiv admin note: text overlap with arXiv:1701.02094

  14. arXiv:1708.09189  [pdf

    cond-mat.mtrl-sci

    Tuning up or down the critical thickness in LaAlO3/SrTiO3 through in situ deposition of metal overlayers

    Authors: D. C. Vaz, E. Lesne, H. Naganuma, E. Jacquet, J. Santamaria, A. Barthelemy, M. Bibes

    Abstract: The quasi 2D electron system (q2DES) that forms at the interface between LaAlO3 and SrTiO3 has attracted much attention from the oxide electronics community. One of its hallmark features is the existence of a critical LaAlO3 thickness of 4 unit-cells (uc) for interfacial conductivity to emerge. In this paper, the chemical, electronic, and transport properties of LaAlO3/SrTiO3 samples capped with d… ▽ More

    Submitted 30 August, 2017; originally announced August 2017.

    Comments: Work supported by ERC Consolidator grant MINT (Contract No. 615759)

    Journal ref: Adv. Mater. 29, 1700486 (2017)

  15. arXiv:1706.00619  [pdf, other

    cond-mat.mes-hall

    Experimental investigation of the temperature-dependent magnon density and its influence on studies of spin-transfer-torque-driven systems

    Authors: Thomas Meyer, Thomas Brächer, Frank Heussner, Alexander A. Serga, Hiroshi Naganuma, Koki Mukaiyama, Mikihiko Oogane, Yasuo Ando, Burkard Hillebrands, Philipp Pirro

    Abstract: We present the temperature dependence of the thermal magnon density in a thin ferromagnetic layer. By employing Brillouin light scattering and varying the temperature, an increase of the magnon density accompanied by a lowering of the spin-wave frequency is observed with increasing temperature. The magnon density follows the temperature according to the Bose-Einstein distribution function which le… ▽ More

    Submitted 2 June, 2017; originally announced June 2017.

    Comments: 5 pages, 4 figures

  16. arXiv:1701.02094   

    cond-mat.mes-hall

    Spin-Wave versus Joule Heating in Spin-Hall-Effect/Spin-Transfer-Torque Driven Cr/Heusler/Pt Waveguides

    Authors: T. Meyer, T. Brächer, F. Heussner, A. A. Serga, H. Naganuma, K. Mukaiyama, M. Oogane, Y. Ando, B. Hillebrands, P. Pirro

    Abstract: We present a time-resolved study of the DC-current driven magnetization dynamics in a microstructured Cr/Heusler/Pt waveguide by means of Brillouin light scattering. A reduction of the effective spin-wave dam** via the spin-transfer-torque effect leads to a strong increase in the magnon density. This is accompanied by a decrease of the spin-wave frequencies. By evaluating the time scales of thes… ▽ More

    Submitted 4 September, 2017; v1 submitted 9 January, 2017; originally announced January 2017.

    Comments: Recently, we found that the experimental setup partially influences the decay of the spin-wave intensity after the current pulse is switched off. Thus, further investigations on the presented effect are needed to allow for a more detailed analysis. For this reason, we need to withdraw the manuscript at this point and might publish an updated version later

  17. arXiv:1609.06464  [pdf

    cond-mat.mtrl-sci

    Highly efficient and tuneable spin-to-charge conversion through Rashba coupling at oxide interfaces

    Authors: E. Lesne, Y. Fu, S. Oyarzun, J. C. Rojas-Sanchez, D. C. Vaz, H. Naganuma, G. Sicoli, J. -P. Attane, M. Jamet, E. Jacquet, J. -M. George, A. Barthelemy, H. Jaffres, A. Fert, M. Bibes, L. Vila

    Abstract: The spin-orbit interaction couples the electrons' motion to their spin. Accordingly, passing a current in a material with strong spin-orbit coupling generates a transverse spin current (spin Hall effect, SHE) and vice-versa (inverse spin Hall effect, ISHE). The emergence of SHE and ISHE as charge-to-spin interconversion mechanisms offers a variety of novel spintronics functionalities and devices,… ▽ More

    Submitted 21 September, 2016; originally announced September 2016.

    Comments: Final version just published in Nature Materials. Contact author for a reprint

    Journal ref: Nature Materials (2016); doi:10.1038/nmat4726

  18. arXiv:1209.4368  [pdf, other

    cond-mat.mtrl-sci

    Revealing the spin and symmetry properties of the buried Co2MnSi/MgO interface by low energy spin-resolved photoemission

    Authors: Roman Fetzer, Marcel Lösch, Yusuke Ohdaira, Hiroshi Naganuma, Mikihiko Oogane, Yasuo Ando, Tomoyuki Taira, Tetsuya Uemura, Masafumi Yamamoto, Martin Aeschlimann, Mirko Cinchetti

    Abstract: We present a novel approach to study the spin and symmetry electronic properties of buried interfaces using low-energy spin-resolved photoemission spectroscopy. We show that this method is sensitive to interfaces buried below more than 20ML (~4nm) MgO, providing a powerful tool for the non-destructive characterization of spintronics interfaces. As a demonstration, we apply this technique to charac… ▽ More

    Submitted 19 September, 2012; originally announced September 2012.

    Comments: To be submitted to Physical Review Letters

  19. arXiv:1209.3669  [pdf, ps, other

    cond-mat.mes-hall physics.optics

    Nonlinear emission of spin-wave caustics from an edge mode of a micro-structured Co2Mn0.6Fe0.4Si waveguide

    Authors: T. Sebastian, P. Pirro, T. Kubota, T. Brächer, A. A. Serga, H. Naganuma, M. Oogane, Y. Ando, B. Hillebrands

    Abstract: Magnetic Heusler materials with very low Gilbert dam** are expected to show novel magnonic transport phenomena. We report nonlinear generation of higher harmonics leading to the emission of caustic spin-wave beams in a low-dam**, micro-structured Co2Mn0.6Fe0.4Si Heusler waveguide. The source for the higher harmonic generation is a localized edge mode formed by the strongly inhomogeneous field… ▽ More

    Submitted 12 December, 2012; v1 submitted 17 September, 2012; originally announced September 2012.