-
Learning Object States from Actions via Large Language Models
Authors:
Masatoshi Tateno,
Takuma Yagi,
Ryosuke Furuta,
Yoichi Sato
Abstract:
Temporally localizing the presence of object states in videos is crucial in understanding human activities beyond actions and objects. This task has suffered from a lack of training data due to object states' inherent ambiguity and variety. To avoid exhaustive annotation, learning from transcribed narrations in instructional videos would be intriguing. However, object states are less described in…
▽ More
Temporally localizing the presence of object states in videos is crucial in understanding human activities beyond actions and objects. This task has suffered from a lack of training data due to object states' inherent ambiguity and variety. To avoid exhaustive annotation, learning from transcribed narrations in instructional videos would be intriguing. However, object states are less described in narrations compared to actions, making them less effective. In this work, we propose to extract the object state information from action information included in narrations, using large language models (LLMs). Our observation is that LLMs include world knowledge on the relationship between actions and their resulting object states, and can infer the presence of object states from past action sequences. The proposed LLM-based framework offers flexibility to generate plausible pseudo-object state labels against arbitrary categories. We evaluate our method with our newly collected Multiple Object States Transition (MOST) dataset including dense temporal annotation of 60 object state categories. Our model trained by the generated pseudo-labels demonstrates significant improvement of over 29% in mAP against strong zero-shot vision-language models, showing the effectiveness of explicitly extracting object state information from actions through LLMs.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Nonequilibrium magnonic thermal transport engineering
Authors:
Takamasa Hirai,
Toshiaki Morita,
Subrata Biswas,
Jun Uzuhashi,
Takashi Yagi,
Yuichiro Yamashita,
Varun Kushwaha Kumar,
Fuya Makino,
Rajkumar Modak,
Yuya Sakuraba,
Tadakatsu Ohkubo,
Rulei Guo,
Bin Xu,
Junichiro Shiomi,
Daichi Chiba,
Ken-ichi Uchida
Abstract:
Thermal conductivity, a fundamental parameter characterizing thermal transport in solids, is typically determined by electron and phonon transport. Although other transport properties including electrical conductivity and thermoelectric conversion coefficients have material-specific values, it is known that thermal conductivity can be modulated artificially via phonon engineering techniques. Here,…
▽ More
Thermal conductivity, a fundamental parameter characterizing thermal transport in solids, is typically determined by electron and phonon transport. Although other transport properties including electrical conductivity and thermoelectric conversion coefficients have material-specific values, it is known that thermal conductivity can be modulated artificially via phonon engineering techniques. Here, we demonstrate another way of artificially modulating the heat conduction in solids: magnonic thermal transport engineering. The time-domain thermoreflectance measurements using ferromagnetic metal/insulator junction systems reveal that the thermal conductivity of the ferromagnetic metals and interfacial thermal conductance vary significantly depending on the spatial distribution of nonequilibrium spin currents. Systematic measurements of the thermal transport properties with changing the boundary conditions for spin currents show that the observed thermal transport modulation stems from magnon origin. This observation unveils that magnons significantly contribute to the heat conduction even in ferromagnetic metals at room temperature, upsetting the conventional wisdom that the thermal conductivity mediated by magnons is very small in metals except at low temperatures. The magnonic thermal transport engineering offers a new principle and method for active thermal management.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
FineBio: A Fine-Grained Video Dataset of Biological Experiments with Hierarchical Annotation
Authors:
Takuma Yagi,
Misaki Ohashi,
Yifei Huang,
Ryosuke Furuta,
Shungo Adachi,
Toutai Mitsuyama,
Yoichi Sato
Abstract:
In the development of science, accurate and reproducible documentation of the experimental process is crucial. Automatic recognition of the actions in experiments from videos would help experimenters by complementing the recording of experiments. Towards this goal, we propose FineBio, a new fine-grained video dataset of people performing biological experiments. The dataset consists of multi-view v…
▽ More
In the development of science, accurate and reproducible documentation of the experimental process is crucial. Automatic recognition of the actions in experiments from videos would help experimenters by complementing the recording of experiments. Towards this goal, we propose FineBio, a new fine-grained video dataset of people performing biological experiments. The dataset consists of multi-view videos of 32 participants performing mock biological experiments with a total duration of 14.5 hours. One experiment forms a hierarchical structure, where a protocol consists of several steps, each further decomposed into a set of atomic operations. The uniqueness of biological experiments is that while they require strict adherence to steps described in each protocol, there is freedom in the order of atomic operations. We provide hierarchical annotation on protocols, steps, atomic operations, object locations, and their manipulation states, providing new challenges for structured activity understanding and hand-object interaction recognition. To find out challenges on activity understanding in biological experiments, we introduce baseline models and results on four different tasks, including (i) step segmentation, (ii) atomic operation detection (iii) object detection, and (iv) manipulated/affected object detection. Dataset and code are available from https://github.com/aistairc/FineBio.
△ Less
Submitted 31 January, 2024;
originally announced February 2024.
-
Quantitative measurement of figure of merit for transverse thermoelectric conversion in Fe/Pt metallic multilayers
Authors:
Takumi Yamazaki,
Takamasa Hirai,
Takashi Yagi,
Yuichiro Yamashita,
Ken-ichi Uchida,
Takeshi Seki,
Koki Takanashi
Abstract:
This study presents a measurement method for determining the figure of merit for transverse thermoelectric conversion ($ z_\mathrm{T}T $) in thin film forms. Leveraging the proposed methodology, we comprehensively investigate the transverse thermoelectric coefficient ($ S_\mathrm{T} $), in-plane electrical conductivity ($ σ_{yy} $), and out-of-plane thermal conductivity ($ κ_{xx} $) in epitaxial a…
▽ More
This study presents a measurement method for determining the figure of merit for transverse thermoelectric conversion ($ z_\mathrm{T}T $) in thin film forms. Leveraging the proposed methodology, we comprehensively investigate the transverse thermoelectric coefficient ($ S_\mathrm{T} $), in-plane electrical conductivity ($ σ_{yy} $), and out-of-plane thermal conductivity ($ κ_{xx} $) in epitaxial and polycrystalline Fe/Pt metallic multilayers. The $ κ_{xx} $ values of multilayers with a number of stacking repetitions ($ N $) of 200 are lower than those of FePt alloy films, indicating that the multilayer structure effectively contributes to the suppression of $ κ_{xx} $. $ z_\mathrm{T}T $ is found to increase with increasing $ N $, which remarkably reflects the $ N $-dependent enhancement of the $ S_\mathrm{T} $ values. Notably, $ S_\mathrm{T} $ and $ σ_{yy} $ are significantly larger in the epitaxial multilayers than those in the polycrystalline counterparts, whereas negligible differences in $ κ_{xx} $ are observed between the epitaxial and polycrystalline multilayers. This discrepancy in $ σ_{yy} $ and $ κ_{xx} $ with respect to crystal growth is due to the different degree of anisotropy in electron transport between epitaxial and polycrystalline multilayers, and epitaxial growth can lead to an enhancement of $ z_\mathrm{T}T $ in the multilayers. This study is the first demonstration in the evaluation of $ z_\mathrm{T}T $ in thin film forms, and our proposed measurement technique reveals the transverse thermoelectric properties inherent to multilayers.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Authors:
Kristen Grauman,
Andrew Westbury,
Lorenzo Torresani,
Kris Kitani,
Jitendra Malik,
Triantafyllos Afouras,
Kumar Ashutosh,
Vijay Baiyya,
Siddhant Bansal,
Bikram Boote,
Eugene Byrne,
Zach Chavis,
Joya Chen,
Feng Cheng,
Fu-Jen Chu,
Sean Crane,
Avijit Dasgupta,
**g Dong,
Maria Escobar,
Cristhian Forigua,
Abrham Gebreselasie,
Sanjay Haresh,
**g Huang,
Md Mohaiminul Islam,
Suyog Jain
, et al. (76 additional authors not shown)
Abstract:
We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair). 740 participants from 13 cities worldwide performed these activities in 123 different natural scene contexts, yielding long-form captures from…
▽ More
We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair). 740 participants from 13 cities worldwide performed these activities in 123 different natural scene contexts, yielding long-form captures from 1 to 42 minutes each and 1,286 hours of video combined. The multimodal nature of the dataset is unprecedented: the video is accompanied by multichannel audio, eye gaze, 3D point clouds, camera poses, IMU, and multiple paired language descriptions -- including a novel "expert commentary" done by coaches and teachers and tailored to the skilled-activity domain. To push the frontier of first-person video understanding of skilled human activity, we also present a suite of benchmark tasks and their annotations, including fine-grained activity understanding, proficiency estimation, cross-view translation, and 3D hand/body pose. All resources are open sourced to fuel new research in the community. Project page: http://ego-exo4d-data.org/
△ Less
Submitted 29 April, 2024; v1 submitted 30 November, 2023;
originally announced November 2023.
-
Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos
Authors:
Takehiko Ohkawa,
Takuma Yagi,
Taichi Nishimura,
Ryosuke Furuta,
Atsushi Hashimoto,
Yoshitaka Ushiku,
Yoichi Sato
Abstract:
We propose a novel benchmark for cross-view knowledge transfer of dense video captioning, adapting models from web instructional videos with exocentric views to an egocentric view. While dense video captioning (predicting time segments and their captions) is primarily studied with exocentric videos (e.g., YouCook2), benchmarks with egocentric videos are restricted due to data scarcity. To overcome…
▽ More
We propose a novel benchmark for cross-view knowledge transfer of dense video captioning, adapting models from web instructional videos with exocentric views to an egocentric view. While dense video captioning (predicting time segments and their captions) is primarily studied with exocentric videos (e.g., YouCook2), benchmarks with egocentric videos are restricted due to data scarcity. To overcome the limited video availability, transferring knowledge from abundant exocentric web videos is demanded as a practical approach. However, learning the correspondence between exocentric and egocentric views is difficult due to their dynamic view changes. The web videos contain mixed views focusing on either human body actions or close-up hand-object interactions, while the egocentric view is constantly shifting as the camera wearer moves. This necessitates the in-depth study of cross-view transfer under complex view changes. In this work, we first create a real-life egocentric dataset (EgoYC2) whose captions are shared with YouCook2, enabling transfer learning between these datasets assuming their ground-truth is accessible. To bridge the view gaps, we propose a view-invariant learning method using adversarial training in both the pre-training and fine-tuning stages. While the pre-training is designed to learn invariant features against the mixed views in the web videos, the view-invariant fine-tuning further mitigates the view gaps between both datasets. We validate our proposed method by studying how effectively it overcomes the view change problem and efficiently transfers the knowledge to the egocentric domain. Our benchmark pushes the study of the cross-view transfer into a new task domain of dense video captioning and will envision methodologies to describe egocentric videos in natural language.
△ Less
Submitted 29 November, 2023; v1 submitted 27 November, 2023;
originally announced November 2023.
-
Canary in Twitter Mine: Collecting Phishing Reports from Experts and Non-experts
Authors:
Hiroki Nakano,
Daiki Chiba,
Takashi Koide,
Naoki Fukushi,
Takeshi Yagi,
Takeo Hariu,
Katsunari Yoshioka,
Tsutomu Matsumoto
Abstract:
The rise in phishing attacks via e-mail and short message service (SMS) has not slowed down at all. The first thing we need to do to combat the ever-increasing number of phishing attacks is to collect and characterize more phishing cases that reach end users. Without understanding these characteristics, anti-phishing countermeasures cannot evolve. In this study, we propose an approach using Twitte…
▽ More
The rise in phishing attacks via e-mail and short message service (SMS) has not slowed down at all. The first thing we need to do to combat the ever-increasing number of phishing attacks is to collect and characterize more phishing cases that reach end users. Without understanding these characteristics, anti-phishing countermeasures cannot evolve. In this study, we propose an approach using Twitter as a new observation point to immediately collect and characterize phishing cases via e-mail and SMS that evade countermeasures and reach users. Specifically, we propose CrowdCanary, a system capable of structurally and accurately extracting phishing information (e.g., URLs and domains) from tweets about phishing by users who have actually discovered or encountered it. In our three months of live operation, CrowdCanary identified 35,432 phishing URLs out of 38,935 phishing reports, 31,960 (90.2%) of these phishing URLs were later detected by the anti-virus engine. We analyzed users who shared phishing threats by categorizing them into two groups: experts and non-experts. As a results, we discovered that CrowdCanary extracts non-expert report-specific information, like company brand name in tweets, phishing attack details from tweet images, and pre-redirect landing page information.
△ Less
Submitted 6 June, 2023; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos
Authors:
Zecheng Yu,
Yifei Huang,
Ryosuke Furuta,
Takuma Yagi,
Yusuke Goutsu,
Yoichi Sato
Abstract:
Object affordance is an important concept in hand-object interaction, providing information on action possibilities based on human motor capacity and objects' physical property thus benefiting tasks such as action anticipation and robot imitation learning. However, the definition of affordance in existing datasets often: 1) mix up affordance with object functionality; 2) confuse affordance with go…
▽ More
Object affordance is an important concept in hand-object interaction, providing information on action possibilities based on human motor capacity and objects' physical property thus benefiting tasks such as action anticipation and robot imitation learning. However, the definition of affordance in existing datasets often: 1) mix up affordance with object functionality; 2) confuse affordance with goal-related action; and 3) ignore human motor capacity. This paper proposes an efficient annotation scheme to address these issues by combining goal-irrelevant motor actions and grasp types as affordance labels and introducing the concept of mechanical action to represent the action possibilities between two objects. We provide new annotations by applying this scheme to the EPIC-KITCHENS dataset and test our annotation with tasks such as affordance recognition, hand-object interaction hotspots prediction, and cross-domain evaluation of affordance. The results show that models trained with our annotation can distinguish affordance from other concepts, predict fine-grained interaction possibilities on objects, and generalize through different domains.
△ Less
Submitted 9 February, 2023; v1 submitted 7 February, 2023;
originally announced February 2023.
-
Low Thermal Conductivity Phase Change Memory Superlattices
Authors:
**g Ning,
Xilin Zhou,
Yunzheng Wang,
Takashi Yagi,
Janne Kalikka,
Siew Lang Teo,
Zhitang Song,
Michel Bosman,
Robert E. Simpson
Abstract:
Phase change memory devices are typically reset by melt-quenching a material to radically lower its electrical conductance. The high power and concomitantly high current density required to reset phase change materials is the major issue that limits the access times of 3D phase change memory architectures. Phase change superlattices were developed to lower the reset energy by confining the phase t…
▽ More
Phase change memory devices are typically reset by melt-quenching a material to radically lower its electrical conductance. The high power and concomitantly high current density required to reset phase change materials is the major issue that limits the access times of 3D phase change memory architectures. Phase change superlattices were developed to lower the reset energy by confining the phase transition to the interface between two different phase change materials. However, the high thermal conductivity of the superlattices means that heat is poorly confined within the phase change material, and most of the thermal energy is wasted to the surrounding materials. Here, we identified Ti as a useful dopant for substantially lowering the thermal conductivity of Sb2Te3-GeTe superlattices whilst also stabilising the layered structure from unwanted disordering. We demonstrate via laser heating that lowering the thermal conductivity by do** the Sb2Te3 layers with Ti halves the switching energy compared to superlattices that only use interfacial phase change transitions and strain engineering. The thermally optimized superlattice has (0 0 l) crystallographic orientation yet a thermal conductivity of just 0.25 W/m.K in the "on" (set) state. Prototype phase change memory devices that incorporate this Ti-doped superlattice switch faster and and at a substantially lower voltage than the undoped superlattice. During switching the Ti-doped Sb2Te3 layers remain stable within the superlattice and only the Ge atoms are active and undergo interfacial phase transitions. In conclusion, we show the potential of thermally optimised Sb2Te3-GeTe superlattices for a new generation of energy-efficient electrical and optical phase change memory.
△ Less
Submitted 30 September, 2022;
originally announced September 2022.
-
EEG-BBNet: a Hybrid Framework for Brain Biometric using Graph Connectivity
Authors:
Payongkit Lakhan,
Nannapas Banluesombatkul,
Natchaya Sricom,
Korn Surapat,
Ratha Rotruchiphong,
Phattarapong Sawangjai,
Tohru Yagi,
Tulaya Limpiti,
Theerawit Wilaiprasitporn
Abstract:
Brain biometrics based on electroencephalography (EEG) have been used increasingly for personal identification. Traditional machine learning techniques as well as modern day deep learning methods have been applied with promising results. In this paper we present EEG-BBNet, a hybrid network which integrates convolutional neural networks (CNN) with graph convolutional neural networks (GCNN). The ben…
▽ More
Brain biometrics based on electroencephalography (EEG) have been used increasingly for personal identification. Traditional machine learning techniques as well as modern day deep learning methods have been applied with promising results. In this paper we present EEG-BBNet, a hybrid network which integrates convolutional neural networks (CNN) with graph convolutional neural networks (GCNN). The benefit of the CNN in automatic feature extraction and the capability of GCNN in learning connectivity between EEG electrodes through graph representation are jointly exploited. We examine various connectivity measures, namely the Euclidean distance, Pearson's correlation coefficient, phase-locked value, phase-lag index, and Rho index. The performance of the proposed method is assessed on a benchmark dataset consisting of various brain-computer interface (BCI) tasks and compared to other state-of-the-art approaches. We found that our models outperform all baselines in the event-related potential (ERP) task with an average correct recognition rates up to 99.26% using intra-session data. EEG-BBNet with Pearson's correlation and RHO index provide the best classification results. In addition, our model demonstrates greater adaptability using inter-session and inter-task data. We also investigate the practicality of our proposed model with smaller number of electrodes. Electrode placements over the frontal lobe region appears to be most appropriate with minimal lost in performance.
△ Less
Submitted 17 August, 2022;
originally announced August 2022.
-
Precise Affordance Annotation for Egocentric Action Video Datasets
Authors:
Zecheng Yu,
Yifei Huang,
Ryosuke Furuta,
Takuma Yagi,
Yusuke Goutsu,
Yoichi Sato
Abstract:
Object affordance is an important concept in human-object interaction, providing information on action possibilities based on human motor capacity and objects' physical property thus benefiting tasks such as action anticipation and robot imitation learning. However, existing datasets often: 1) mix up affordance with object functionality; 2) confuse affordance with goal-related action; and 3) ignor…
▽ More
Object affordance is an important concept in human-object interaction, providing information on action possibilities based on human motor capacity and objects' physical property thus benefiting tasks such as action anticipation and robot imitation learning. However, existing datasets often: 1) mix up affordance with object functionality; 2) confuse affordance with goal-related action; and 3) ignore human motor capacity. This paper proposes an efficient annotation scheme to address these issues by combining goal-irrelevant motor actions and grasp types as affordance labels and introducing the concept of mechanical action to represent the action possibilities between two objects. We provide new annotations by applying this scheme to the EPIC-KITCHENS dataset and test our annotation with tasks such as affordance recognition. We qualitatively verify that models trained with our annotation can distinguish affordance and mechanical actions.
△ Less
Submitted 11 June, 2022;
originally announced June 2022.
-
Object Instance Identification in Dynamic Environments
Authors:
Takuma Yagi,
Md Tasnimul Hasan,
Yoichi Sato
Abstract:
We study the problem of identifying object instances in a dynamic environment where people interact with the objects. In such an environment, objects' appearance changes dynamically by interaction with other entities, occlusion by hands, background change, etc. This leads to a larger intra-instance variation of appearance than in static environments. To discover the challenges in this setting, we…
▽ More
We study the problem of identifying object instances in a dynamic environment where people interact with the objects. In such an environment, objects' appearance changes dynamically by interaction with other entities, occlusion by hands, background change, etc. This leads to a larger intra-instance variation of appearance than in static environments. To discover the challenges in this setting, we newly built a benchmark of more than 1,500 instances built on the EPIC-KITCHENS dataset which includes natural activities and conducted an extensive analysis of it. Experimental results suggest that (i) robustness against instance-specific appearance change (ii) integration of low-level (e.g., color, texture) and high-level (e.g., object category) features (iii) foreground feature selection on overlap** objects are required for further improvement.
△ Less
Submitted 10 June, 2022;
originally announced June 2022.
-
Sm-Co-based amorphous alloy films for zero-field operation of transverse thermoelectric generation
Authors:
Rajkumar Modak,
Yuya Sakuraba,
Takamasa Hirai,
Takashi Yagi,
Hossein Sepehri-Amin,
Weinan Zhou,
Hiroto Masuda,
Takeshi Seki,
Koki Takanashi,
Tadakatsu Ohkubo,
Ken-ichi Uchida
Abstract:
Transverse thermoelectric generation using magnetic materials is essential to develop active thermal engineering technologies, for which the improvements of not only the thermoelectric output but also applicability and versatility are required. In this study, using combinatorial material science and lock-in thermography technique, we have systematically investigated the transverse thermoelectric p…
▽ More
Transverse thermoelectric generation using magnetic materials is essential to develop active thermal engineering technologies, for which the improvements of not only the thermoelectric output but also applicability and versatility are required. In this study, using combinatorial material science and lock-in thermography technique, we have systematically investigated the transverse thermoelectric performance of Sm-Co-based alloy films. The high-throughput material investigation revealed the best Sm-Co-based alloys with the large anomalous Nernst effect (ANE) as well as the anomalous Ettingshausen effect (AEE). In addition to ANE/AEE, we discovered unique and superior material properties in these alloys: the amorphous structure, low thermal conductivity, and large in-plane coercivity and remanent magnetization. These properties make it advantageous over conventional materials to realize heat flux sensing applications based on ANE, as our Sm-Co-based films can generate thermoelectric output without an external magnetic field. Importantly, the amorphous nature enables the fabrication of these films on various substrates including flexible sheets, making the large-scale and low-cost manufacturing easier. Our demonstration will provide a pathway to develop flexible transverse thermoelectric devices for smart thermal management.
△ Less
Submitted 18 November, 2022; v1 submitted 21 March, 2022;
originally announced March 2022.
-
Hand-Object Contact Prediction via Motion-Based Pseudo-Labeling and Guided Progressive Label Correction
Authors:
Takuma Yagi,
Md Tasnimul Hasan,
Yoichi Sato
Abstract:
Every hand-object interaction begins with contact. Despite predicting the contact state between hands and objects is useful in understanding hand-object interactions, prior methods on hand-object analysis have assumed that the interacting hands and objects are known, and were not studied in detail. In this study, we introduce a video-based method for predicting contact between a hand and an object…
▽ More
Every hand-object interaction begins with contact. Despite predicting the contact state between hands and objects is useful in understanding hand-object interactions, prior methods on hand-object analysis have assumed that the interacting hands and objects are known, and were not studied in detail. In this study, we introduce a video-based method for predicting contact between a hand and an object. Specifically, given a video and a pair of hand and object tracks, we predict a binary contact state (contact or no-contact) for each frame. However, annotating a large number of hand-object tracks and contact labels is costly. To overcome the difficulty, we propose a semi-supervised framework consisting of (i) automatic collection of training data with motion-based pseudo-labels and (ii) guided progressive label correction (gPLC), which corrects noisy pseudo-labels with a small amount of trusted data. We validated our framework's effectiveness on a newly built benchmark dataset for hand-object contact prediction and showed superior performance against existing baseline methods. Code and data are available at https://github.com/takumayagi/hand_object_contact_prediction.
△ Less
Submitted 19 October, 2021;
originally announced October 2021.
-
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Authors:
Kristen Grauman,
Andrew Westbury,
Eugene Byrne,
Zachary Chavis,
Antonino Furnari,
Rohit Girdhar,
Jackson Hamburger,
Hao Jiang,
Miao Liu,
Xingyu Liu,
Miguel Martin,
Tushar Nagarajan,
Ilija Radosavovic,
Santhosh Kumar Ramakrishnan,
Fiona Ryan,
Jayant Sharma,
Michael Wray,
Mengmeng Xu,
Eric Zhongcong Xu,
Chen Zhao,
Siddhant Bansal,
Dhruv Batra,
Vincent Cartillier,
Sean Crane,
Tien Do
, et al. (60 additional authors not shown)
Abstract:
We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 931 unique camera wearers from 74 worldwide locations and 9 different countries. The approach to collection is designed to uphold rigorous privacy and ethics standards with cons…
▽ More
We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 931 unique camera wearers from 74 worldwide locations and 9 different countries. The approach to collection is designed to uphold rigorous privacy and ethics standards with consenting participants and robust de-identification procedures where relevant. Ego4D dramatically expands the volume of diverse egocentric video footage publicly available to the research community. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. Furthermore, we present a host of new benchmark challenges centered around understanding the first-person visual experience in the past (querying an episodic memory), present (analyzing hand-object manipulation, audio-visual conversation, and social interactions), and future (forecasting activities). By publicly sharing this massive annotated dataset and benchmark suite, we aim to push the frontier of first-person perception. Project page: https://ego4d-data.org/
△ Less
Submitted 11 March, 2022; v1 submitted 13 October, 2021;
originally announced October 2021.
-
Foreground-Aware Stylization and Consensus Pseudo-Labeling for Domain Adaptation of First-Person Hand Segmentation
Authors:
Takehiko Ohkawa,
Takuma Yagi,
Atsushi Hashimoto,
Yoshitaka Ushiku,
Yoichi Sato
Abstract:
Hand segmentation is a crucial task in first-person vision. Since first-person images exhibit strong bias in appearance among different environments, adapting a pre-trained segmentation model to a new domain is required in hand segmentation. Here, we focus on appearance gaps for hand regions and backgrounds separately. We propose (i) foreground-aware image stylization and (ii) consensus pseudo-lab…
▽ More
Hand segmentation is a crucial task in first-person vision. Since first-person images exhibit strong bias in appearance among different environments, adapting a pre-trained segmentation model to a new domain is required in hand segmentation. Here, we focus on appearance gaps for hand regions and backgrounds separately. We propose (i) foreground-aware image stylization and (ii) consensus pseudo-labeling for domain adaptation of hand segmentation. We stylize source images independently for the foreground and background using target images as style. To resolve the domain shift that the stylization has not addressed, we apply careful pseudo-labeling by taking a consensus between the models trained on the source and stylized source images. We validated our method on domain adaptation of hand segmentation from real and simulation images. Our method achieved state-of-the-art performance in both settings. We also demonstrated promising results in challenging multi-target domain adaptation and domain generalization settings. Code is available at https://github.com/ut-vision/FgSty-CPL.
△ Less
Submitted 27 March, 2022; v1 submitted 6 July, 2021;
originally announced July 2021.
-
Ultrafast dynamics of electronic structure in InN thin film
Authors:
Junjun Jia,
Takashi Yagi,
Toshiki Makimoto
Abstract:
Simultaneous measurements of transient transmission and reflectivity were performed in the unintentionally doped InN film to reveal ultrafast optical bleaching and its recovery behavior under intense laser irradiation. The optical bleaching is attributed to Pauli blocking due to the occupation of photoexcited electrons at the probing energy level. The time constant for the transition from the exci…
▽ More
Simultaneous measurements of transient transmission and reflectivity were performed in the unintentionally doped InN film to reveal ultrafast optical bleaching and its recovery behavior under intense laser irradiation. The optical bleaching is attributed to Pauli blocking due to the occupation of photoexcited electrons at the probing energy level. The time constant for the transition from the excitation state to the conduction band edge is $\sim$260 fs. The interplay between band filling and band gap renormalization caused by electron-hole and electron-electron interactions gives rise to complex spectral characteristics of transient reflectivity, from which the time constants of photoexcited electron-hole direct recombination and band edge recombination are extracted as $\sim$60 fs and 250$\sim$400 fs, respectively. Our results also reveal that the electron-electron interaction suppresses band edge recombination, and mitigates the recovery process. Our experiments highlight the controllability of the band structure of semiconductors by intense laser irradiation.
△ Less
Submitted 15 February, 2021;
originally announced February 2021.
-
GO-Finder: A Registration-Free Wearable System for Assisting Users in Finding Lost Objects via Hand-Held Object Discovery
Authors:
Takuma Yagi,
Takumi Nishiyasu,
Kunimasa Kawasaki,
Moe Matsuki,
Yoichi Sato
Abstract:
People spend an enormous amount of time and effort looking for lost objects. To help remind people of the location of lost objects, various computational systems that provide information on their locations have been developed. However, prior systems for assisting people in finding objects require users to register the target objects in advance. This requirement imposes a cumbersome burden on the u…
▽ More
People spend an enormous amount of time and effort looking for lost objects. To help remind people of the location of lost objects, various computational systems that provide information on their locations have been developed. However, prior systems for assisting people in finding objects require users to register the target objects in advance. This requirement imposes a cumbersome burden on the users, and the system cannot help remind them of unexpectedly lost objects. We propose GO-Finder ("Generic Object Finder"), a registration-free wearable camera based system for assisting people in finding an arbitrary number of objects based on two key features: automatic discovery of hand-held objects and image-based candidate selection. Given a video taken from a wearable camera, Go-Finder automatically detects and groups hand-held objects to form a visual timeline of the objects. Users can retrieve the last appearance of the object by browsing the timeline through a smartphone app. We conducted a user study to investigate how users benefit from using GO-Finder and confirmed improved accuracy and reduced mental load regarding the object search task by providing clear visual cues on object locations.
△ Less
Submitted 12 February, 2021; v1 submitted 18 January, 2021;
originally announced January 2021.
-
Logarithmic and nonlogarithmic scaling laws of two-point statistics in wall turbulence
Authors:
H. Mouri,
T. Morinaga,
T. Yagi,
K. Mori
Abstract:
Wall turbulence has a sublayer where one-point statistics, e.g., the mean velocity and the variances of some velocity fluctuations, vary logarithmically with the distance from the wall. This logarithmic scaling is found here for two-point statistics or specifically two-point cumulants of those fluctuations by means of experiments in a wind tunnel. As for corresponding statistics of the rate of the…
▽ More
Wall turbulence has a sublayer where one-point statistics, e.g., the mean velocity and the variances of some velocity fluctuations, vary logarithmically with the distance from the wall. This logarithmic scaling is found here for two-point statistics or specifically two-point cumulants of those fluctuations by means of experiments in a wind tunnel. As for corresponding statistics of the rate of the energy dissipation, the scaling is found to be not logarithmic. We reproduce these scaling laws with some mathematics and also with a model of energy-containing eddies that are attached to the wall.
△ Less
Submitted 4 May, 2020; v1 submitted 6 April, 2020;
originally announced April 2020.
-
Modified Computation of Correlation Integral for Analyzing Epileptic Signals
Authors:
Prajna Upadhyaya,
Tohru Yagi
Abstract:
Epilepsy is a chronic neurological disorder characterized by recurrent seizures. One method for analyzing seizure activity is to compute the correlation dimension of time-series electroencephalographic signals. The Grasserberg and Proccacia algorithm is commonly used to compute this correlation dimension. The algorithm uses the Heaviside function to determine the correlation integral by counting t…
▽ More
Epilepsy is a chronic neurological disorder characterized by recurrent seizures. One method for analyzing seizure activity is to compute the correlation dimension of time-series electroencephalographic signals. The Grasserberg and Proccacia algorithm is commonly used to compute this correlation dimension. The algorithm uses the Heaviside function to determine the correlation integral by counting the number of distances between vectors (d_ij) that are greater than a threshold. However, information about the chaotic nature of the signal is not completely retained by this function. In this work, instead of using the Heaviside function, we calculated the correlation integral by using an exponential function of d_ij. Greater sensitivity to the interictal and ictal signals using this modified algorithm was verified using three datasets. Comparing heatmaps of d_ij obtained using the original and modified methods showed additional information that was retained with the new algorithm.
△ Less
Submitted 12 December, 2019;
originally announced December 2019.
-
A Single-Channel Consumer-Grade EEG Device for Brain-Computer Interface: Enhancing Detection of SSVEP and Its Amplitude Modulation
Authors:
Phairot Autthasan,
Xiangqian Du,
Jetsada Arnin,
Sirakorn Lamyai,
Maneesha Perera,
Sirawaj Itthipuripat,
Tohru Yagi,
Poramate Manoonpong,
Theerawit Wilaiprasitporn
Abstract:
Brain-Computer interfaces (BCIs) play a significant role in easing neuromuscular patients on controlling computers and prosthetics. Due to their high signal-to-noise ratio, steady-state visually evoked potentials (SSVEPs) has been widely used to build BCIs. However, currently developed algorithms do not predict the modulation of SSVEP amplitude, which is known to change as a function of stimulus l…
▽ More
Brain-Computer interfaces (BCIs) play a significant role in easing neuromuscular patients on controlling computers and prosthetics. Due to their high signal-to-noise ratio, steady-state visually evoked potentials (SSVEPs) has been widely used to build BCIs. However, currently developed algorithms do not predict the modulation of SSVEP amplitude, which is known to change as a function of stimulus luminance contrast. In this study, we aim to develop an integrated approach to simultaneously estimate the frequency and contrast-related amplitude modulations of the SSVEP signal. To achieve that, we developed a behavioral task in which human participants focused on a visual flicking target which the luminance contrast can change through time in several ways. SSVEP signals from 16 subjects were then recorded from electrodes placed at the central occipital site using a low-cost, consumer-grade EEG. Our results demonstrate that the filter bank canonical correlation analysis (FBCCA) performed well in SSVEP frequency recognition, while the support vector regression (SVR) outperformed the other supervised machine learning algorithms in predicting the contrast-dependent amplitude modulations of the SSVEPs. These findings indicate the applicability and strong performance of our integrated method at simultaneously predicting both frequency and amplitude of visually evoked signals, and have proven to be useful for advancing SSVEP-based applications.
△ Less
Submitted 4 December, 2019; v1 submitted 19 September, 2018;
originally announced September 2018.
-
User Blocking Considered Harmful? An Attacker-controllable Side Channel to Identify Social Accounts
Authors:
Takuya Watanabe,
Eitaro Shioji,
Mitsuaki Akiyama,
Keito Sasaoka,
Takeshi Yagi,
Tatsuya Mori
Abstract:
This paper presents a practical side-channel attack that identifies the social web service account of a visitor to an attacker's website. Our attack leverages the widely adopted user-blocking mechanism, abusing its inherent property that certain pages return different web content depending on whether a user is blocked from another user. Our key insight is that an account prepared by an attacker ca…
▽ More
This paper presents a practical side-channel attack that identifies the social web service account of a visitor to an attacker's website. Our attack leverages the widely adopted user-blocking mechanism, abusing its inherent property that certain pages return different web content depending on whether a user is blocked from another user. Our key insight is that an account prepared by an attacker can hold an attacker-controllable binary state of blocking/non-blocking with respect to an arbitrary user on the same service; provided that the user is logged in to the service, this state can be retrieved as one-bit data through the conventional cross-site timing attack when a user visits the attacker's website. We generalize and refer to such a property as visibility control, which we consider as the fundamental assumption of our attack. Building on this primitive, we show that an attacker with a set of controlled accounts can gain a complete and flexible control over the data leaked through the side channel. Using this mechanism, we show that it is possible to design and implement a robust, large-scale user identification attack on a wide variety of social web services. To verify the feasibility of our attack, we perform an extensive empirical study using 16 popular social web services and demonstrate that at least 12 of these are vulnerable to our attack. Vulnerable services include not only popular social networking sites such as Twitter and Facebook, but also other types of web services that provide social features, e.g., eBay and Xbox Live. We also demonstrate that the attack can achieve nearly 100% accuracy and can finish within a sufficiently short time in a practical setting. We discuss the fundamental principles, practical aspects, and limitations of the attack as well as possible defenses.
△ Less
Submitted 14 May, 2018;
originally announced May 2018.
-
Future Person Localization in First-Person Videos
Authors:
Takuma Yagi,
Karttikeya Mangalam,
Ryo Yonetani,
Yoichi Sato
Abstract:
We present a new task that predicts future locations of people observed in first-person videos. Consider a first-person video stream continuously recorded by a wearable camera. Given a short clip of a person that is extracted from the complete stream, we aim to predict that person's location in future frames. To facilitate this future person localization ability, we make the following three key ob…
▽ More
We present a new task that predicts future locations of people observed in first-person videos. Consider a first-person video stream continuously recorded by a wearable camera. Given a short clip of a person that is extracted from the complete stream, we aim to predict that person's location in future frames. To facilitate this future person localization ability, we make the following three key observations: a) First-person videos typically involve significant ego-motion which greatly affects the location of the target person in future frames; b) Scales of the target person act as a salient cue to estimate a perspective effect in first-person videos; c) First-person videos often capture people up-close, making it easier to leverage target poses (e.g., where they look) for predicting their future locations. We incorporate these three observations into a prediction framework with a multi-stream convolution-deconvolution architecture. Experimental results reveal our method to be effective on our new dataset as well as on a public social interaction dataset.
△ Less
Submitted 27 March, 2018; v1 submitted 29 November, 2017;
originally announced November 2017.
-
Logarithmic scaling for fluctuations of a scalar concentration in wall turbulence
Authors:
H. Mouri,
T. Morinaga,
T. Yagi,
K. Mori
Abstract:
Within wall turbulence, there is a sublayer where the mean velocity and the variance of velocity fluctuations vary logarithmically with the height from the wall. This logarithmic scaling is also known for the mean concentration of a passive scalar. By using heat as such a scalar in a laboratory experiment of a turbulent boundary layer, the existence of the logarithmic scaling is shown here for the…
▽ More
Within wall turbulence, there is a sublayer where the mean velocity and the variance of velocity fluctuations vary logarithmically with the height from the wall. This logarithmic scaling is also known for the mean concentration of a passive scalar. By using heat as such a scalar in a laboratory experiment of a turbulent boundary layer, the existence of the logarithmic scaling is shown here for the variance of fluctuations of the scalar concentration. It is reproduced by a model of energy-containing eddies that are attached to the wall.
△ Less
Submitted 16 November, 2017;
originally announced November 2017.
-
A Study on the Vulnerabilities of Mobile Apps associated with Software Modules
Authors:
Takuya Watanabe,
Mitsuaki Akiyama,
Fumihiro Kanei,
Eitaro Shioji,
Yuta Takata,
Bo Sun,
Yuta Ishi,
Toshiki Shibahara,
Takeshi Yagi,
Tatsuya Mori
Abstract:
This paper reports a large-scale study that aims to understand how mobile application (app) vulnerabilities are associated with software libraries. We analyze both free and paid apps. Studying paid apps was quite meaningful because it helped us understand how differences in app development/maintenance affect the vulnerabilities associated with libraries. We analyzed 30k free and paid apps collecte…
▽ More
This paper reports a large-scale study that aims to understand how mobile application (app) vulnerabilities are associated with software libraries. We analyze both free and paid apps. Studying paid apps was quite meaningful because it helped us understand how differences in app development/maintenance affect the vulnerabilities associated with libraries. We analyzed 30k free and paid apps collected from the official Android marketplace. Our extensive analyses revealed that approximately 70%/50% of vulnerabilities of free/paid apps stem from software libraries, particularly from third-party libraries. Somewhat paradoxically, we found that more expensive/popular paid apps tend to have more vulnerabilities. This comes from the fact that more expensive/popular paid apps tend to have more functionality, i.e., more code and libraries, which increases the probability of vulnerabilities. Based on our findings, we provide suggestions to stakeholders of mobile app distribution ecosystems.
△ Less
Submitted 27 March, 2017; v1 submitted 10 February, 2017;
originally announced February 2017.
-
I=2 $π$-$π$ scattering length with dynamical overlap fermion
Authors:
Takuya Yagi,
Shoji Hashimoto,
Osamu Morimatsu,
Munehisa Ohtani
Abstract:
We report on a lattice QCD calculation of the I=2 $ππ$ scattering length using the overlap fermion formulation for both sea and valence quarks. We investigate the consistency of the lattice data with the prediction of the next-to-next-to-leading order chiral perturbation theory after correcting finite volume effects. The calculation is performed on gauge ensembles of two-flavor QCD generated by th…
▽ More
We report on a lattice QCD calculation of the I=2 $ππ$ scattering length using the overlap fermion formulation for both sea and valence quarks. We investigate the consistency of the lattice data with the prediction of the next-to-next-to-leading order chiral perturbation theory after correcting finite volume effects. The calculation is performed on gauge ensembles of two-flavor QCD generated by the JLQCD collaboration on a $16^3\times 32$ lattice at a lattice spacing $\sim$ 0.12 fm.
△ Less
Submitted 15 August, 2011;
originally announced August 2011.
-
Space efficient opposed-anvil high-pressure cell and its application to optical and NMR measurements up to 9 GPa
Authors:
Kentaro Kitagawa,
Hirotada Gotou,
Takehiko Yagi,
Atsushi Yamada,
Takehiko Matsumoto,
Yoshiya Uwatoko,
Masashi Takigawa
Abstract:
We have developed a new type of opposed-anvil high pressure cell with substantially improved space efficiency. The clamp cell and the gasket are made of non-magnetic Ni-Cr-Al alloy. Non-magnetic tungsten carbide (NMWC) is used for the anvils. The assembled cell with the dimension φ29mm \times 41mm is capable of generating pressure up to 9 GPa over a relatively large volume of 7 mm3. Our cell is…
▽ More
We have developed a new type of opposed-anvil high pressure cell with substantially improved space efficiency. The clamp cell and the gasket are made of non-magnetic Ni-Cr-Al alloy. Non-magnetic tungsten carbide (NMWC) is used for the anvils. The assembled cell with the dimension φ29mm \times 41mm is capable of generating pressure up to 9 GPa over a relatively large volume of 7 mm3. Our cell is particularly suitable for those experiments which require large sample space to achieve good signal-to-noise ratio, such as the nuclear magnetic resonance (NMR) experiment. Argon is used as the pressure transmitting medium to obtain good hydrostaticity. The pressure was calibrated in situ by measuring the fluorescence from ruby through a transparent moissanite (6H-SiC) window. We have measured the pressure and temperature dependences of the 63Cu nuclear-quadrupole-resonance (NQR) frequency of Cu2O, the in-plane Knight shift of metallic tin, and the Knight shift of platinum. These quantities can be used as reliable manometers to determine the pressure values in situ during the NMR/NQR experiments up to 9 GPa.
△ Less
Submitted 25 November, 2009; v1 submitted 9 October, 2009;
originally announced October 2009.
-
Spontaneous formation of a superconducting and antiferromagnetic hybrid state in SrFe2As2 under high pressure
Authors:
K. Kitagawa,
N. Katayama,
H. Gotou,
T. Yagi,
K. Ohgushi,
T. Matsumoto,
Y. Uwatoko,
M. Takigawa
Abstract:
We report a novel superconducting (SC) and antiferromagnetic (AF) hybrid state in SrFe2As2 revealed by 75As nuclear magnetic resonance (NMR) experiments on a single crystal under highly hydrostatic pressure up to 7 GPa. The NMR spectra at 5.4 GPa indicate simultaneous development of the SC and AF orders below 30 K. The nuclear spin-lattice relaxation rate in the SC domains shows a substantial re…
▽ More
We report a novel superconducting (SC) and antiferromagnetic (AF) hybrid state in SrFe2As2 revealed by 75As nuclear magnetic resonance (NMR) experiments on a single crystal under highly hydrostatic pressure up to 7 GPa. The NMR spectra at 5.4 GPa indicate simultaneous development of the SC and AF orders below 30 K. The nuclear spin-lattice relaxation rate in the SC domains shows a substantial residual density of states, suggesting proximity effects due to spontaneous formation of a nano-scale SC/AF hybrid structure. This entangled behavior is a remarkable example of a self-organized heterogeneous structure in a clean system.
△ Less
Submitted 24 November, 2009; v1 submitted 25 June, 2009;
originally announced June 2009.
-
CaCrO3: an anomalous antiferromagnetic metallic oxide
Authors:
A. C. Komarek,
S. V. Streltsov,
M. Isobe,
T. Moeller,
M. Hoelzel,
A. Senyshyn,
D. Trots,
M. T. Fernandez-Diaz,
T. Hansen,
H. Gotou,
T. Yagi,
Y. Ueda,
V. I. Anisimov,
M. Grueninger,
D. I. Khomskii,
M. Braden
Abstract:
Combining infrared reflectivity, transport, susceptibility and several diffraction techniques, we find compelling evidence that CaCrO3 is a rare case of a metallic and antiferromagnetic transition-metal oxide with a three-dimensional electronic structure. LSDA calculations correctly describe the metallic behavior as well as the anisotropic magnetic ordering pattern of C type: The high Cr valence…
▽ More
Combining infrared reflectivity, transport, susceptibility and several diffraction techniques, we find compelling evidence that CaCrO3 is a rare case of a metallic and antiferromagnetic transition-metal oxide with a three-dimensional electronic structure. LSDA calculations correctly describe the metallic behavior as well as the anisotropic magnetic ordering pattern of C type: The high Cr valence state induces via sizeable pd hybridization remarkably strong next-nearest neighbor interactions stabilizing this ordering. The subtle balance of magnetic interactions gives rise to magneto-elastic coupling, explaining pronounced structural anomalies observed at the magnetic ordering transition.
△ Less
Submitted 7 April, 2008;
originally announced April 2008.
-
Superconducting properties of Pr-based Filled skutterudite PrRu$_4$As$_{12}$
Authors:
T. Namiki,
Y. Aoki,
H. Sato,
C. Sekine,
I. Shirotani,
T. D. Matsuda,
Y. Haga,
T. Yagi
Abstract:
We report on systematic study of superconducting characteristics and Pr crystalline-electric-field (CEF) levels of filled-skutterudite \pra ($T_{\rm c}$ = 2.33 K). The temperature dependences of the upper critical field $H_{\rm c2}$ and the Ginzburg-Landau (Maki) parameter $κ_2$ suggest an s-wave clean-limit superconductivity. The electronic specific heat coefficient $γ\sim 95$ mJ/K$^2$mol, bein…
▽ More
We report on systematic study of superconducting characteristics and Pr crystalline-electric-field (CEF) levels of filled-skutterudite \pra ($T_{\rm c}$ = 2.33 K). The temperature dependences of the upper critical field $H_{\rm c2}$ and the Ginzburg-Landau (Maki) parameter $κ_2$ suggest an s-wave clean-limit superconductivity. The electronic specific heat coefficient $γ\sim 95$ mJ/K$^2$mol, being $\sim 1.5$ times larger than that for \lra, indicates $4f$-originating quasiparticle mass enhancement. Magnetic susceptibility $χ(T)$ indicates that the CEF ground state is a $Γ_1$ singlet and a $Γ_4^{(1)}$ triplet first excited state lies at $Δ_{\rm CEF}\sim 30$ K above. Systematic comparison among \pos, \prs, \pra and La-based reference compounds suggests that inelastic exchange- and aspherical-charge-scatterings of conduction electrons from CEF-split $4f$ levels play an essential role for the quasiparticle mass enhancement and the value of $T_{\rm c}$ in the Pr-based filled skutterudites.
△ Less
Submitted 2 October, 2007;
originally announced October 2007.
-
Measurements of Thermophysical Property of Thin Films by Light Pulse Heating Thermoreflectance Methods
Authors:
T. Baba,
K. Ishikawa,
T. Yagi,
N. Taketoshi
Abstract:
Thermoreflectance methods by picosecond pulse heating and by nanosecond pulse heating have been developed under the same geometrical configuration as the laser flash method by the National Metrology Institute of JAPAN, AIST. Using these light pulse heating methods, thermal diffusivity of each layer of multilayered thin films and boundary thermal resistance between the layers can be determined fr…
▽ More
Thermoreflectance methods by picosecond pulse heating and by nanosecond pulse heating have been developed under the same geometrical configuration as the laser flash method by the National Metrology Institute of JAPAN, AIST. Using these light pulse heating methods, thermal diffusivity of each layer of multilayered thin films and boundary thermal resistance between the layers can be determined from the observed transient temperature curves based on the response function method. The measurement results of various thin films as transparent conductive films used for flat panel displays, hard coating films and multilayered films of the next generation phase-change optical disk will be presented.
△ Less
Submitted 12 September, 2007;
originally announced September 2007.
-
Effect of spin-orbit impurity scattering in the superconducting state of t-J model
Authors:
Toshifumi Yagi,
Kazuhiro Kuboki
Abstract:
We study the effect of magnetic impurities in the d_{x^2-y^2}-wave superconducting (SC) state of the two dimensional t-J model.The spin-orbit and the spin-exchange interactions are examined by treating the impurity as a classical spin. The Bogoliubov de Gennes equation derived within a slave-boson mean-field approximation is solved numerically at T = 0. The spin-exchange scattering induces spin-…
▽ More
We study the effect of magnetic impurities in the d_{x^2-y^2}-wave superconducting (SC) state of the two dimensional t-J model.The spin-orbit and the spin-exchange interactions are examined by treating the impurity as a classical spin. The Bogoliubov de Gennes equation derived within a slave-boson mean-field approximation is solved numerically at T = 0. The spin-exchange scattering induces spin-triplet p-wave SC order parameters near the impurity, while a SC state with broken time-reversal symmetry and a spontaneous current appears in the presence of the spin-orbit interaction. When both interactions coexist, it turns out that a state which carries a spontaneous spin current occurs.
△ Less
Submitted 29 May, 2000; v1 submitted 29 May, 2000;
originally announced May 2000.