-
Geometric Insights into Focal Loss: Reducing Curvature for Enhanced Model Calibration
Authors:
Masanari Kimura,
Hiroki Naganuma
Abstract:
The key factor in implementing machine learning algorithms in decision-making situations is not only the accuracy of the model but also its confidence level. The confidence level of a model in a classification problem is often given by the output vector of a softmax function for convenience. However, these values are known to deviate significantly from the actual expected model confidence. This pr…
▽ More
The key factor in implementing machine learning algorithms in decision-making situations is not only the accuracy of the model but also its confidence level. The confidence level of a model in a classification problem is often given by the output vector of a softmax function for convenience. However, these values are known to deviate significantly from the actual expected model confidence. This problem is called model calibration and has been studied extensively. One of the simplest techniques to tackle this task is focal loss, a generalization of cross-entropy by introducing one positive parameter. Although many related studies exist because of the simplicity of the idea and its formalization, the theoretical analysis of its behavior is still insufficient. In this study, our objective is to understand the behavior of focal loss by reinterpreting this function geometrically. Our analysis suggests that focal loss reduces the curvature of the loss surface in training the model. This indicates that curvature may be one of the essential factors in achieving model calibration. We design numerical experiments to support this conjecture to reveal the behavior of focal loss and the relationship between calibration performance and curvature.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Augmenting NER Datasets with LLMs: Towards Automated and Refined Annotation
Authors:
Yuji Naraki,
Ryosuke Yamaki,
Yoshikazu Ikeda,
Takafumi Horie,
Hiroki Naganuma
Abstract:
In the field of Natural Language Processing (NLP), Named Entity Recognition (NER) is recognized as a critical technology, employed across a wide array of applications. Traditional methodologies for annotating datasets for NER models are challenged by high costs and variations in dataset quality. This research introduces a novel hybrid annotation approach that synergizes human effort with the capab…
▽ More
In the field of Natural Language Processing (NLP), Named Entity Recognition (NER) is recognized as a critical technology, employed across a wide array of applications. Traditional methodologies for annotating datasets for NER models are challenged by high costs and variations in dataset quality. This research introduces a novel hybrid annotation approach that synergizes human effort with the capabilities of Large Language Models (LLMs). This approach not only aims to ameliorate the noise inherent in manual annotations, such as omissions, thereby enhancing the performance of NER models, but also achieves this in a cost-effective manner. Additionally, by employing a label mixing strategy, it addresses the issue of class imbalance encountered in LLM-based annotations. Through an analysis across multiple datasets, this method has been consistently shown to provide superior performance compared to traditional annotation methods, even under constrained budget conditions. This study illuminates the potential of leveraging LLMs to improve dataset quality, introduces a novel technique to mitigate class imbalances, and demonstrates the feasibility of achieving high-performance NER in a cost-effective way.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.
-
Towards Understanding Variants of Invariant Risk Minimization through the Lens of Calibration
Authors:
Kotaro Yoshida,
Hiroki Naganuma
Abstract:
Machine learning models traditionally assume that training and test data are independently and identically distributed. However, in real-world applications, the test distribution often differs from training. This problem, known as out-of-distribution (OOD) generalization, challenges conventional models. Invariant Risk Minimization (IRM) emerges as a solution that aims to identify invariant feature…
▽ More
Machine learning models traditionally assume that training and test data are independently and identically distributed. However, in real-world applications, the test distribution often differs from training. This problem, known as out-of-distribution (OOD) generalization, challenges conventional models. Invariant Risk Minimization (IRM) emerges as a solution that aims to identify invariant features across different environments to enhance OOD robustness. However, IRM's complexity, particularly its bi-level optimization, has led to the development of various approximate methods. Our study investigates these approximate IRM techniques, using the consistency and variance of calibration across environments as metrics to measure the invariance aimed for by IRM. Calibration, which measures the reliability of model prediction, serves as an indicator of whether models effectively capture environment-invariant features by showing how uniformly over-confident the model remains across varied environments. Through a comparative analysis of datasets with distributional shifts, we observe that Information Bottleneck-based IRM achieves consistent calibration across different environments. This observation suggests that information compression techniques, such as IB, are potentially effective in achieving model invariance. Furthermore, our empirical evidence indicates that models exhibiting consistent calibration across environments are also well-calibrated. This demonstrates that invariance and cross-environment calibration are empirically equivalent. Additionally, we underscore the necessity for a systematic approach to evaluating OOD generalization. This approach should move beyond traditional metrics, such as accuracy and F1 scores, which fail to account for the model's degree of over-confidence, and instead focus on the nuanced interplay between accuracy, calibration, and model invariance.
△ Less
Submitted 17 June, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
First-principle study of spin transport property in $L1_0$-FePd(001)/graphene heterojunction
Authors:
Hayato Adachi,
Ryuusuke Endo,
Hikari Shinya,
Hiroshi Naganuma,
Tomoya Ono,
Mitsuharu Uemoto
Abstract:
In our previous work, we synthesized a metal/2D material heterointerface consisting of $L1_0$-ordered iron-palladium (FePd) and graphene (Gr) called FePd(001)/Gr. This system has been explored by both experimental measurements and theoretical calculations. In this study, we focus on a heterojunction composed of FePd and multilayer graphene referred to as FePd(001)/$m$-Gr/FePd(001), where $m$ repre…
▽ More
In our previous work, we synthesized a metal/2D material heterointerface consisting of $L1_0$-ordered iron-palladium (FePd) and graphene (Gr) called FePd(001)/Gr. This system has been explored by both experimental measurements and theoretical calculations. In this study, we focus on a heterojunction composed of FePd and multilayer graphene referred to as FePd(001)/$m$-Gr/FePd(001), where $m$ represents the number of graphene layers. We perform first-principles calculations to predict their spin-dependent transport properties. The quantitative calculations of spin-resolved conductance and magnetoresistance (MR) ratio (150-200%) suggest that the proposed structure can function as a magnetic tunnel junction in spintronics applications. We also find that an increase in $m$ not only reduces conductance but also changes transport properties from the tunneling behavior to the graphite $π$-band-like behavior. Additionally, we investigate the spin-transfer torque-induced magnetization switching behavior of our \color{blue} junction structures \color{black} using micromagnetic simulations. Furthermore, we examine the impact of lateral displacements (``sliding'') at the interface and find that the spin transport properties remain robust despite these changes; this is the advantage of two-dimensional material hetero-interfaces over traditional insulating barrier layers such as MgO.
△ Less
Submitted 30 December, 2023; v1 submitted 4 August, 2023;
originally announced August 2023.
-
An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration
Authors:
Hiroki Naganuma,
Ryuichiro Hataya,
Ioannis Mitliagkas
Abstract:
In out-of-distribution (OOD) generalization tasks, fine-tuning pre-trained models has become a prevalent strategy. Different from most prior work that has focused on advancing learning algorithms, we systematically examined how pre-trained model size, pre-training dataset size, and training strategies impact generalization and uncertainty calibration on downstream tasks. We evaluated 100 models ac…
▽ More
In out-of-distribution (OOD) generalization tasks, fine-tuning pre-trained models has become a prevalent strategy. Different from most prior work that has focused on advancing learning algorithms, we systematically examined how pre-trained model size, pre-training dataset size, and training strategies impact generalization and uncertainty calibration on downstream tasks. We evaluated 100 models across diverse pre-trained model sizes, \update{five} pre-training datasets, and five data augmentations through extensive experiments on four distribution shift datasets totaling over 120,000 GPU hours. Our results demonstrate the significant impact of pre-trained model selection, with optimal choices substantially improving OOD accuracy over algorithm improvement alone. We find larger models and bigger pre-training data improve OOD performance and calibration, in contrast to some prior studies that found modern deep networks to calibrate worse than classical shallow models. Our work underscores the overlooked importance of pre-trained model selection for out-of-distribution generalization and calibration.
△ Less
Submitted 30 May, 2024; v1 submitted 16 July, 2023;
originally announced July 2023.
-
No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths
Authors:
Charles Guille-Escuret,
Hiroki Naganuma,
Kilian Fatras,
Ioannis Mitliagkas
Abstract:
Understanding the optimization dynamics of neural networks is necessary for closing the gap between theory and practice. Stochastic first-order optimization algorithms are known to efficiently locate favorable minima in deep neural networks. This efficiency, however, contrasts with the non-convex and seemingly complex structure of neural loss landscapes. In this study, we delve into the fundamenta…
▽ More
Understanding the optimization dynamics of neural networks is necessary for closing the gap between theory and practice. Stochastic first-order optimization algorithms are known to efficiently locate favorable minima in deep neural networks. This efficiency, however, contrasts with the non-convex and seemingly complex structure of neural loss landscapes. In this study, we delve into the fundamental geometric properties of sampled gradients along optimization paths. We focus on two key quantities, which appear in the restricted secant inequality and error bound. Both hold high significance for first-order optimization. Our analysis reveals that these quantities exhibit predictable, consistent behavior throughout training, despite the stochasticity induced by sampling minibatches. Our findings suggest that not only do optimization trajectories never encounter significant obstacles, but they also maintain stable dynamics during the majority of training. These observed properties are sufficiently expressive to theoretically guarantee linear convergence and prescribe learning rate schedules mirroring empirical practices. We conduct our experiments on image classification, semantic segmentation and language modeling across different batch sizes, network architectures, datasets, optimizers, and initialization seeds. We discuss the impact of each factor. Our work provides novel insights into the properties of neural network loss functions, and opens the door to theoretical frameworks more relevant to prevalent practice.
△ Less
Submitted 20 June, 2023;
originally announced June 2023.
-
Empirical Study on Optimizer Selection for Out-of-Distribution Generalization
Authors:
Hiroki Naganuma,
Kartik Ahuja,
Shiro Takagi,
Tetsuya Motokawa,
Rio Yokota,
Kohta Ishikawa,
Ikuro Sato,
Ioannis Mitliagkas
Abstract:
Modern deep learning systems do not generalize well when the test data distribution is slightly different to the training data distribution. While much promising work has been accomplished to address this fragility, a systematic study of the role of optimizers and their out-of-distribution generalization performance has not been undertaken. In this study, we examine the performance of popular firs…
▽ More
Modern deep learning systems do not generalize well when the test data distribution is slightly different to the training data distribution. While much promising work has been accomplished to address this fragility, a systematic study of the role of optimizers and their out-of-distribution generalization performance has not been undertaken. In this study, we examine the performance of popular first-order optimizers for different classes of distributional shift under empirical risk minimization and invariant risk minimization. We address this question for image and text classification using DomainBed, WILDS, and Backgrounds Challenge as testbeds for studying different types of shifts -- namely correlation and diversity shift. We search over a wide range of hyperparameters and examine classification accuracy (in-distribution and out-of-distribution) for over 20,000 models. We arrive at the following findings, which we expect to be helpful for practitioners: i) adaptive optimizers (e.g., Adam) perform worse than non-adaptive optimizers (e.g., SGD, momentum SGD) on out-of-distribution performance. In particular, even though there is no significant difference in in-distribution performance, we show a measurable difference in out-of-distribution performance. ii) in-distribution performance and out-of-distribution performance exhibit three types of behavior depending on the dataset -- linear returns, increasing returns, and diminishing returns. For example, in the training of natural language data using Adam, fine-tuning the performance of in-distribution performance does not significantly contribute to the out-of-distribution generalization performance.
△ Less
Submitted 5 June, 2023; v1 submitted 15 November, 2022;
originally announced November 2022.
-
Optimal transport meets noisy label robust loss and MixUp regularization for domain adaptation
Authors:
Kilian Fatras,
Hiroki Naganuma,
Ioannis Mitliagkas
Abstract:
It is common in computer vision to be confronted with domain shift: images which have the same class but different acquisition conditions. In domain adaptation (DA), one wants to classify unlabeled target images using source labeled images. Unfortunately, deep neural networks trained on a source training set perform poorly on target images which do not belong to the training domain. One strategy t…
▽ More
It is common in computer vision to be confronted with domain shift: images which have the same class but different acquisition conditions. In domain adaptation (DA), one wants to classify unlabeled target images using source labeled images. Unfortunately, deep neural networks trained on a source training set perform poorly on target images which do not belong to the training domain. One strategy to improve these performances is to align the source and target image distributions in an embedded space using optimal transport (OT). However OT can cause negative transfer, i.e. aligning samples with different labels, which leads to overfitting especially in the presence of label shift between domains. In this work, we mitigate negative alignment by explaining it as a noisy label assignment to target images. We then mitigate its effect by appropriate regularization. We propose to couple the MixUp regularization \citep{zhang2018mixup} with a loss that is robust to noisy labels in order to improve domain adaptation performance. We show in an extensive ablation study that a combination of the two techniques is critical to achieve improved performance. Finally, we evaluate our method, called \textsc{mixunbot}, on several benchmarks and real-world DA problems.
△ Less
Submitted 22 June, 2022;
originally announced June 2022.
-
Conjugate Gradient Method for Generative Adversarial Networks
Authors:
Hiroki Naganuma,
Hideaki Iiduka
Abstract:
One of the training strategies of generative models is to minimize the Jensen--Shannon divergence between the model distribution and the data distribution. Since data distribution is unknown, generative adversarial networks (GANs) formulate this problem as a game between two models, a generator and a discriminator. The training can be formulated in the context of game theory and the local Nash equ…
▽ More
One of the training strategies of generative models is to minimize the Jensen--Shannon divergence between the model distribution and the data distribution. Since data distribution is unknown, generative adversarial networks (GANs) formulate this problem as a game between two models, a generator and a discriminator. The training can be formulated in the context of game theory and the local Nash equilibrium (LNE). It does not seem feasible to derive guarantees of stability or optimality for the existing methods. This optimization problem is far more challenging than the single objective setting. Here, we use the conjugate gradient method to reliably and efficiently solve the LNE problem in GANs. We give a proof and convergence analysis under mild assumptions showing that the proposed method converges to a LNE with three different learning rate update rules, including a constant learning rate. Finally, we demonstrate that the proposed method outperforms stochastic gradient descent (SGD) and momentum SGD in terms of best Frechet inception distance (FID) score and outperforms Adam on average. The code is available at \url{https://github.com/Hiroki11x/ConjugateGradient_GAN}.
△ Less
Submitted 20 February, 2023; v1 submitted 28 March, 2022;
originally announced March 2022.
-
Density functional study of twisted graphene $L1_0$-FePd heterogeneous interface
Authors:
Mitsuharu Uemoto,
Hayato Adachi,
Hiroshi Naganuma,
Tomoya Ono
Abstract:
Graphene on $L1_0$-FePd(001), which has been experimentally studied in recent years, is a heterogeneous interface with a significant lattice symmetry mismatch between the honeycomb structure of graphene and tetragonal alloy surface. In this work, we report on the density functional study of its atomic-scale configurations, electronic and magnetic properties, and adsorption mechanism, which have no…
▽ More
Graphene on $L1_0$-FePd(001), which has been experimentally studied in recent years, is a heterogeneous interface with a significant lattice symmetry mismatch between the honeycomb structure of graphene and tetragonal alloy surface. In this work, we report on the density functional study of its atomic-scale configurations, electronic and magnetic properties, and adsorption mechanism, which have not been well understood in previous experimental studies. We propose various atomic-scale models, including simple nontwisted and low-strain twisted interfaces, and analyze their energetical stability by performing structural optimizations using the van der Waals interactions of both DFT-D2 and optB86b-vdW functionals. The binding energy of the most stable structure reached $E_\mathrm{B}=-0.22$~eV/atom for DFT-D2 ($E_\mathrm{B}=-0.19$~eV/atom for optB86b-vdW). The calculated FePd-graphene spacing distance was approximately 2~Å, which successfully reproduced the experimental value. We also find out characteristic behaviors: the modulation of $π$-bands, the suppression of the site-dependence of adsorption energy, and the rise of \color{blue} moiré-like \color{black} corrugated buckling. In addition, our atomic structure is expected to help build low-cost computational models for investigating the physical properties of $L1_0$ alloys/two-dimensional interfaces.
△ Less
Submitted 25 July, 2022; v1 submitted 19 January, 2022;
originally announced January 2022.
-
Grammar compression with probabilistic context-free grammar
Authors:
Hiroaki Naganuma,
Diptarama Hendrian,
Ryo Yoshinaka,
Ayumi Shinohara,
Naoki Kobayashi
Abstract:
We propose a new approach for universal lossless text compression, based on grammar compression. In the literature, a target string $T$ has been compressed as a context-free grammar $G$ in Chomsky normal form satisfying $L(G) = \{T\}$. Such a grammar is often called a \emph{straight-line program} (SLP). In this paper, we consider a probabilistic grammar $G$ that generates $T$, but not necessarily…
▽ More
We propose a new approach for universal lossless text compression, based on grammar compression. In the literature, a target string $T$ has been compressed as a context-free grammar $G$ in Chomsky normal form satisfying $L(G) = \{T\}$. Such a grammar is often called a \emph{straight-line program} (SLP). In this paper, we consider a probabilistic grammar $G$ that generates $T$, but not necessarily as a unique element of $L(G)$. In order to recover the original text $T$ unambiguously, we keep both the grammar $G$ and the derivation tree of $T$ from the start symbol in $G$, in compressed form. We show some simple evidence that our proposal is indeed more efficient than SLPs for certain texts, both from theoretical and practical points of view.
△ Less
Submitted 18 March, 2020;
originally announced March 2020.
-
Realization of a spin wave switch based on the Spin-Transfer-Torque effect
Authors:
Thomas Meyer,
Thomas Brächer,
Frank Heussner,
Alexander A. Serga,
Hiroshi Naganuma,
Koki Mukaiyama,
Mikihiko Oogane,
Yasuo Ando,
Burkard Hillebrands,
Philipp Pirro
Abstract:
We investigate the amplification of externally excited spin waves via the Spin-Transfer-Torque (STT) effect in combination with the Spin-Hall-Effect (SHE) employing short current pulses. The results reveal that, in the case of an overcompensation of the spin wave dam**, a strong nonlinear shift of the spin wave frequency spectrum occurs. In particular, this shift affects the spin wave amplificat…
▽ More
We investigate the amplification of externally excited spin waves via the Spin-Transfer-Torque (STT) effect in combination with the Spin-Hall-Effect (SHE) employing short current pulses. The results reveal that, in the case of an overcompensation of the spin wave dam**, a strong nonlinear shift of the spin wave frequency spectrum occurs. In particular, this shift affects the spin wave amplification using the SHE-STT effect. In contrast, this effect allows for the realization of a spin wave switch. By determining the corresponding working point, an efficient spin wave excitation is only possible in the presence of the SHE-STT effect yielding an increased spin wave intensity of a factor of 20 compared to the absence of the SHE-STT effect.
△ Less
Submitted 5 February, 2018;
originally announced February 2018.
-
Characterization of Spin-Transfer-Torque effect induced magnetization dynamics driven by short current pulses
Authors:
T. Meyer,
T. Brächer,
F. Heussner,
A. A. Serga,
H. Naganuma,
K. Mukaiyama,
M. Oogane,
Y. Ando,
B. Hillebrands,
P. Pirro
Abstract:
We present a time-resolved study of the magnetization dynamics in a microstructured Cr$|$Heusler$|$Pt waveguide driven by the Spin-Hall-Effect and the Spin-Transfer-Torque effect via short current pulses. In particular, we focus on the determination of the threshold current at which the spin-wave dam** is compensated. We have developed a novel method based on the temporal evolution of the magnon…
▽ More
We present a time-resolved study of the magnetization dynamics in a microstructured Cr$|$Heusler$|$Pt waveguide driven by the Spin-Hall-Effect and the Spin-Transfer-Torque effect via short current pulses. In particular, we focus on the determination of the threshold current at which the spin-wave dam** is compensated. We have developed a novel method based on the temporal evolution of the magnon density at the beginning of an applied current pulse at which the magnon density deviates from the thermal level. Since this method does not depend on the signal-to-noise ratio, it allows for a robust and reliable determination of the threshold current which is important for the characterization of any future application based on the Spin-Transfer-Torque effect.
△ Less
Submitted 27 November, 2017;
originally announced November 2017.
-
Tuning up or down the critical thickness in LaAlO3/SrTiO3 through in situ deposition of metal overlayers
Authors:
D. C. Vaz,
E. Lesne,
H. Naganuma,
E. Jacquet,
J. Santamaria,
A. Barthelemy,
M. Bibes
Abstract:
The quasi 2D electron system (q2DES) that forms at the interface between LaAlO3 and SrTiO3 has attracted much attention from the oxide electronics community. One of its hallmark features is the existence of a critical LaAlO3 thickness of 4 unit-cells (uc) for interfacial conductivity to emerge. In this paper, the chemical, electronic, and transport properties of LaAlO3/SrTiO3 samples capped with d…
▽ More
The quasi 2D electron system (q2DES) that forms at the interface between LaAlO3 and SrTiO3 has attracted much attention from the oxide electronics community. One of its hallmark features is the existence of a critical LaAlO3 thickness of 4 unit-cells (uc) for interfacial conductivity to emerge. In this paper, the chemical, electronic, and transport properties of LaAlO3/SrTiO3 samples capped with different metals grown in a system combining pulsed laser deposition, sputtering, and in situ X-ray photoemission spectroscopy are investigated. The results show that for metals with low work function a q2DES forms at 1-2 uc of LaAlO3 and is accompanied by a partial oxidation of the metal, a phenomenon that affects the q2DES properties and triggers the formation of defects. In contrast, for noble metals, the critical thickness is increased above 4 uc. The results are discussed in terms of a hybrid mechanism that incorporates electrostatic and chemical effects.
△ Less
Submitted 30 August, 2017;
originally announced August 2017.
-
Experimental investigation of the temperature-dependent magnon density and its influence on studies of spin-transfer-torque-driven systems
Authors:
Thomas Meyer,
Thomas Brächer,
Frank Heussner,
Alexander A. Serga,
Hiroshi Naganuma,
Koki Mukaiyama,
Mikihiko Oogane,
Yasuo Ando,
Burkard Hillebrands,
Philipp Pirro
Abstract:
We present the temperature dependence of the thermal magnon density in a thin ferromagnetic layer. By employing Brillouin light scattering and varying the temperature, an increase of the magnon density accompanied by a lowering of the spin-wave frequency is observed with increasing temperature. The magnon density follows the temperature according to the Bose-Einstein distribution function which le…
▽ More
We present the temperature dependence of the thermal magnon density in a thin ferromagnetic layer. By employing Brillouin light scattering and varying the temperature, an increase of the magnon density accompanied by a lowering of the spin-wave frequency is observed with increasing temperature. The magnon density follows the temperature according to the Bose-Einstein distribution function which leads to an approximately linear dependency. In addition, the influence of this effect in spin-transfer-torque-driven systems is presented. In particular, the increase in the magnon density with temperature sets the limit for a suppression of magnons in charge current-driven systems. Hence, the maximum possible suppression of thermal magnons occurs at a finite current.
△ Less
Submitted 2 June, 2017;
originally announced June 2017.
-
Spin-Wave versus Joule Heating in Spin-Hall-Effect/Spin-Transfer-Torque Driven Cr/Heusler/Pt Waveguides
Authors:
T. Meyer,
T. Brächer,
F. Heussner,
A. A. Serga,
H. Naganuma,
K. Mukaiyama,
M. Oogane,
Y. Ando,
B. Hillebrands,
P. Pirro
Abstract:
We present a time-resolved study of the DC-current driven magnetization dynamics in a microstructured Cr/Heusler/Pt waveguide by means of Brillouin light scattering. A reduction of the effective spin-wave dam** via the spin-transfer-torque effect leads to a strong increase in the magnon density. This is accompanied by a decrease of the spin-wave frequencies. By evaluating the time scales of thes…
▽ More
We present a time-resolved study of the DC-current driven magnetization dynamics in a microstructured Cr/Heusler/Pt waveguide by means of Brillouin light scattering. A reduction of the effective spin-wave dam** via the spin-transfer-torque effect leads to a strong increase in the magnon density. This is accompanied by a decrease of the spin-wave frequencies. By evaluating the time scales of these effects, the origin of this frequency shift can be identified. However, recently, we found that the experimental setup partially influences the decay of the spin-wave intensity after the current pulse is switched off. Thus, further investigations on the presented effect are needed to allow for a more detailed analysis. For this reason, we need to withdraw the manuscript at this point and might publish an updated version later.
△ Less
Submitted 4 September, 2017; v1 submitted 9 January, 2017;
originally announced January 2017.
-
Highly efficient and tuneable spin-to-charge conversion through Rashba coupling at oxide interfaces
Authors:
E. Lesne,
Y. Fu,
S. Oyarzun,
J. C. Rojas-Sanchez,
D. C. Vaz,
H. Naganuma,
G. Sicoli,
J. -P. Attane,
M. Jamet,
E. Jacquet,
J. -M. George,
A. Barthelemy,
H. Jaffres,
A. Fert,
M. Bibes,
L. Vila
Abstract:
The spin-orbit interaction couples the electrons' motion to their spin. Accordingly, passing a current in a material with strong spin-orbit coupling generates a transverse spin current (spin Hall effect, SHE) and vice-versa (inverse spin Hall effect, ISHE). The emergence of SHE and ISHE as charge-to-spin interconversion mechanisms offers a variety of novel spintronics functionalities and devices,…
▽ More
The spin-orbit interaction couples the electrons' motion to their spin. Accordingly, passing a current in a material with strong spin-orbit coupling generates a transverse spin current (spin Hall effect, SHE) and vice-versa (inverse spin Hall effect, ISHE). The emergence of SHE and ISHE as charge-to-spin interconversion mechanisms offers a variety of novel spintronics functionalities and devices, some of which do not require any ferromagnetic material. However, the interconversion efficiency of SHE and ISHE (spin Hall angle) is a bulk property that rarely exceeds ten percent, and does not take advantage of interfacial and low-dimensional effects otherwise ubiquitous in spintronics hetero- and mesostructures. Here, we make use of an interface-driven spin-orbit coupling mechanism - the Rashba effect - in the oxide two-dimensional electron system (2DES) LaAlO3/SrTiO3 to achieve spin-to-charge conversion with unprecedented efficiency. Through spin-pum**, we inject a spin current from a NiFe film into the oxide 2DES and detect the resulting charge current, which can be strongly modulated by a gate voltage. We discuss the amplitude of the effect and its gate dependence on the basis of the electronic structure of the 2DES.
△ Less
Submitted 21 September, 2016;
originally announced September 2016.
-
Revealing the spin and symmetry properties of the buried Co2MnSi/MgO interface by low energy spin-resolved photoemission
Authors:
Roman Fetzer,
Marcel Lösch,
Yusuke Ohdaira,
Hiroshi Naganuma,
Mikihiko Oogane,
Yasuo Ando,
Tomoyuki Taira,
Tetsuya Uemura,
Masafumi Yamamoto,
Martin Aeschlimann,
Mirko Cinchetti
Abstract:
We present a novel approach to study the spin and symmetry electronic properties of buried interfaces using low-energy spin-resolved photoemission spectroscopy. We show that this method is sensitive to interfaces buried below more than 20ML (~4nm) MgO, providing a powerful tool for the non-destructive characterization of spintronics interfaces. As a demonstration, we apply this technique to charac…
▽ More
We present a novel approach to study the spin and symmetry electronic properties of buried interfaces using low-energy spin-resolved photoemission spectroscopy. We show that this method is sensitive to interfaces buried below more than 20ML (~4nm) MgO, providing a powerful tool for the non-destructive characterization of spintronics interfaces. As a demonstration, we apply this technique to characterize the Co2MnSi/MgO interface, a fundamental building block of state-of-the-art magnetic tunnel junctions based on Heusler compounds. We find that a surface state with Δ1 symmetry and minority spin character dominating the electronic structure of the bare Co2MnSi(100) surface is quenched at the Co2MnSi(100)/MgO interface. As a result, the interface spin-dependent electronic structure resembles the theoretically expected Co2MnSi bulk band structure, with majority spin electronic states of both Δ1 and Δ5 symmetry. Furthermore we find an additional thermally-induced contribution in the minority channel, mirroring the Δ1/Δ5 asymmetry of the majority channel.
△ Less
Submitted 19 September, 2012;
originally announced September 2012.
-
Nonlinear emission of spin-wave caustics from an edge mode of a micro-structured Co2Mn0.6Fe0.4Si waveguide
Authors:
T. Sebastian,
P. Pirro,
T. Kubota,
T. Brächer,
A. A. Serga,
H. Naganuma,
M. Oogane,
Y. Ando,
B. Hillebrands
Abstract:
Magnetic Heusler materials with very low Gilbert dam** are expected to show novel magnonic transport phenomena. We report nonlinear generation of higher harmonics leading to the emission of caustic spin-wave beams in a low-dam**, micro-structured Co2Mn0.6Fe0.4Si Heusler waveguide. The source for the higher harmonic generation is a localized edge mode formed by the strongly inhomogeneous field…
▽ More
Magnetic Heusler materials with very low Gilbert dam** are expected to show novel magnonic transport phenomena. We report nonlinear generation of higher harmonics leading to the emission of caustic spin-wave beams in a low-dam**, micro-structured Co2Mn0.6Fe0.4Si Heusler waveguide. The source for the higher harmonic generation is a localized edge mode formed by the strongly inhomogeneous field distribution at the edges of the spin-wave waveguide. The radiation characteristics of the propagating caustic waves observed at twice and three times the excitation frequency are described by an analytical calculation based on the anisotropic dispersion of spin waves in a magnetic thin film.
△ Less
Submitted 12 December, 2012; v1 submitted 17 September, 2012;
originally announced September 2012.