Search | arXiv e-print repository

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

Authors: Haomiao Ni, Bernhard Egger, Suhas Lohit, Anoop Cherian, Ye Wang, Toshiaki Koike-Akino, Sharon X. Huang, Tim K. Marks

Abstract: Text-conditioned image-to-video generation (TI2V) aims to synthesize a realistic video starting from a given image (e.g., a woman's photo) and a text description (e.g., "a woman is drinking water."). Existing TI2V frameworks often require costly training on video-text datasets and specific model designs for text and image conditioning. In this paper, we propose TI2V-Zero, a zero-shot, tuning-free… ▽ More Text-conditioned image-to-video generation (TI2V) aims to synthesize a realistic video starting from a given image (e.g., a woman's photo) and a text description (e.g., "a woman is drinking water."). Existing TI2V frameworks often require costly training on video-text datasets and specific model designs for text and image conditioning. In this paper, we propose TI2V-Zero, a zero-shot, tuning-free method that empowers a pretrained text-to-video (T2V) diffusion model to be conditioned on a provided image, enabling TI2V generation without any optimization, fine-tuning, or introducing external modules. Our approach leverages a pretrained T2V diffusion foundation model as the generative prior. To guide video generation with the additional image input, we propose a "repeat-and-slide" strategy that modulates the reverse denoising process, allowing the frozen diffusion model to synthesize a video frame-by-frame starting from the provided image. To ensure temporal continuity, we employ a DDPM inversion strategy to initialize Gaussian noise for each newly synthesized frame and a resampling technique to help preserve visual details. We conduct comprehensive experiments on both domain-specific and open-domain datasets, where TI2V-Zero consistently outperforms a recent open-domain TI2V model. Furthermore, we show that TI2V-Zero can seamlessly extend to other tasks such as video infilling and prediction when provided with more images. Its autoregressive design also supports long video generation. △ Less

Submitted 24 April, 2024; originally announced April 2024.

Comments: CVPR 2024

arXiv:2404.10141 [pdf, other]

ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis

Authors: Aashish Anantha Ramakrishnan, Sharon X. Huang, Dongwon Lee

Abstract: Text-to-Image (T2I) Synthesis has made tremendous strides in enhancing synthesized image quality, but current datasets evaluate model performance only on descriptive, instruction-based prompts. Real-world news image captions take a more pragmatic approach, providing high-level situational and Named-Entity (NE) information and limited physical object descriptions, making them abstractive. To evalua… ▽ More Text-to-Image (T2I) Synthesis has made tremendous strides in enhancing synthesized image quality, but current datasets evaluate model performance only on descriptive, instruction-based prompts. Real-world news image captions take a more pragmatic approach, providing high-level situational and Named-Entity (NE) information and limited physical object descriptions, making them abstractive. To evaluate the ability of T2I models to capture intended subjects from news captions, we introduce the Abstractive News Captions with High-level cOntext Representation (ANCHOR) dataset, containing 70K+ samples sourced from 5 different news media organizations. With Large Language Models (LLM) achieving success in language and commonsense reasoning tasks, we explore the ability of different LLMs to identify and understand key subjects from abstractive captions. Our proposed method Subject-Aware Finetuning (SAFE), selects and enhances the representation of key subjects in synthesized images by leveraging LLM-generated subject weights. It also adapts to the domain distribution of news images and captions through custom Domain Fine-tuning, outperforming current T2I baselines on ANCHOR. By launching the ANCHOR dataset, we hope to motivate research in furthering the Natural Language Understanding (NLU) capabilities of T2I models. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: 23 pages, 9 figures

MSC Class: 65D19

arXiv:2311.02549 [pdf, other]

3D-Aware Talking-Head Video Motion Transfer

Authors: Haomiao Ni, Jiachen Liu, Yuan Xue, Sharon X. Huang

Abstract: Motion transfer of talking-head videos involves generating a new video with the appearance of a subject video and the motion pattern of a driving video. Current methodologies primarily depend on a limited number of subject images and 2D representations, thereby neglecting to fully utilize the multi-view appearance features inherent in the subject video. In this paper, we propose a novel 3D-aware t… ▽ More Motion transfer of talking-head videos involves generating a new video with the appearance of a subject video and the motion pattern of a driving video. Current methodologies primarily depend on a limited number of subject images and 2D representations, thereby neglecting to fully utilize the multi-view appearance features inherent in the subject video. In this paper, we propose a novel 3D-aware talking-head video motion transfer network, Head3D, which fully exploits the subject appearance information by generating a visually-interpretable 3D canonical head from the 2D subject frames with a recurrent network. A key component of our approach is a self-supervised 3D head geometry learning module, designed to predict head poses and depth maps from 2D subject video frames. This module facilitates the estimation of a 3D head in canonical space, which can then be transformed to align with driving video frames. Additionally, we employ an attention-based fusion network to combine the background and other details from subject frames with the 3D subject head to produce the synthetic target video. Our extensive experiments on two public talking-head video datasets demonstrate that Head3D outperforms both 2D and 3D prior arts in the practical cross-identity setting, with evidence showing it can be readily adapted to the pose-controllable novel view synthesis task. △ Less

Submitted 4 November, 2023; originally announced November 2023.

Comments: WACV2024

arXiv:2309.13240 [pdf, other]

NeRF-Enhanced Outpainting for Faithful Field-of-View Extrapolation

Authors: Rui Yu, Jiachen Liu, Zihan Zhou, Sharon X. Huang

Abstract: In various applications, such as robotic navigation and remote visual assistance, expanding the field of view (FOV) of the camera proves beneficial for enhancing environmental perception. Unlike image outpainting techniques aimed solely at generating aesthetically pleasing visuals, these applications demand an extended view that faithfully represents the scene. To achieve this, we formulate a new… ▽ More In various applications, such as robotic navigation and remote visual assistance, expanding the field of view (FOV) of the camera proves beneficial for enhancing environmental perception. Unlike image outpainting techniques aimed solely at generating aesthetically pleasing visuals, these applications demand an extended view that faithfully represents the scene. To achieve this, we formulate a new problem of faithful FOV extrapolation that utilizes a set of pre-captured images as prior knowledge of the scene. To address this problem, we present a simple yet effective solution called NeRF-Enhanced Outpainting (NEO) that uses extended-FOV images generated through NeRF to train a scene-specific image outpainting model. To assess the performance of NEO, we conduct comprehensive evaluations on three photorealistic datasets and one real-world dataset. Extensive experiments on the benchmark datasets showcase the robustness and potential of our method in addressing this challenge. We believe our work lays a strong foundation for future exploration within the research community. △ Less

Submitted 22 September, 2023; originally announced September 2023.

arXiv:2308.04020 [pdf, other]

Synthetic Augmentation with Large-scale Unconditional Pre-training

Authors: Jiarong Ye, Haomiao Ni, Peng **, Sharon X. Huang, Yuan Xue

Abstract: Deep learning based medical image recognition systems often require a substantial amount of training data with expert annotations, which can be expensive and time-consuming to obtain. Recently, synthetic augmentation techniques have been proposed to mitigate the issue by generating realistic images conditioned on class labels. However, the effectiveness of these methods heavily depends on the repr… ▽ More Deep learning based medical image recognition systems often require a substantial amount of training data with expert annotations, which can be expensive and time-consuming to obtain. Recently, synthetic augmentation techniques have been proposed to mitigate the issue by generating realistic images conditioned on class labels. However, the effectiveness of these methods heavily depends on the representation capability of the trained generative model, which cannot be guaranteed without sufficient labeled training data. To further reduce the dependency on annotated data, we propose a synthetic augmentation method called HistoDiffusion, which can be pre-trained on large-scale unlabeled datasets and later applied to a small-scale labeled dataset for augmented training. In particular, we train a latent diffusion model (LDM) on diverse unlabeled datasets to learn common features and generate realistic images without conditional inputs. Then, we fine-tune the model with classifier guidance in latent space on an unseen labeled dataset so that the model can synthesize images of specific categories. Additionally, we adopt a selective mechanism to only add synthetic samples with high confidence of matching to target labels. We evaluate our proposed method by pre-training on three histopathology datasets and testing on a histopathology dataset of colorectal cancer (CRC) excluded from the pre-training datasets. With HistoDiffusion augmentation, the classification accuracy of a backbone classifier is remarkably improved by 6.4% using a small set of the original labels. Our code is available at https://github.com/karenyyy/HistoDiffAug. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: MICCAI 2023

arXiv:2303.13744 [pdf, other]

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

Authors: Haomiao Ni, Changhao Shi, Kai Li, Sharon X. Huang, Martin Renqiang Min

Abstract: Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video starting from an image (e.g., a person's face) and a condition (e.g., an action class label like smile). The key challenge of the cI2V task lies in the simultaneous generation of realistic spatial appearance and temporal dynamics corresponding to the given image and condition. In this paper, we propose an approac… ▽ More Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video starting from an image (e.g., a person's face) and a condition (e.g., an action class label like smile). The key challenge of the cI2V task lies in the simultaneous generation of realistic spatial appearance and temporal dynamics corresponding to the given image and condition. In this paper, we propose an approach for cI2V using novel latent flow diffusion models (LFDM) that synthesize an optical flow sequence in the latent space based on the given condition to warp the given image. Compared to previous direct-synthesis-based works, our proposed LFDM can better synthesize spatial details and temporal motion by fully utilizing the spatial content of the given image and war** it in the latent space according to the generated temporally-coherent flow. The training of LFDM consists of two separate stages: (1) an unsupervised learning stage to train a latent flow auto-encoder for spatial content generation, including a flow predictor to estimate latent flow between pairs of video frames, and (2) a conditional learning stage to train a 3D-UNet-based diffusion model (DM) for temporal latent flow generation. Unlike previous DMs operating in pixel space or latent feature space that couples spatial and temporal information, the DM in our LFDM only needs to learn a low-dimensional latent flow space for motion generation, thus being more computationally efficient. We conduct comprehensive experiments on multiple datasets, where LFDM consistently outperforms prior arts. Furthermore, we show that LFDM can be easily adapted to new domains by simply finetuning the image decoder. Our code is available at https://github.com/nihaomiao/CVPR23_LFDM. △ Less

Submitted 23 March, 2023; originally announced March 2023.

Comments: CVPR 2023

arXiv:2301.02160 [pdf, other]

ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions

Authors: Aashish Anantha Ramakrishnan, Sharon X. Huang, Dongwon Lee

Abstract: Advancements in Text-to-Image synthesis over recent years have focused more on improving the quality of generated samples using datasets with descriptive prompts. However, real-world image-caption pairs present in domains such as news data do not use simple and directly descriptive captions. With captions containing information on both the image content and underlying contextual cues, they become… ▽ More Advancements in Text-to-Image synthesis over recent years have focused more on improving the quality of generated samples using datasets with descriptive prompts. However, real-world image-caption pairs present in domains such as news data do not use simple and directly descriptive captions. With captions containing information on both the image content and underlying contextual cues, they become abstractive in nature. In this paper, we launch ANNA, an Abstractive News captioNs dAtaset extracted from online news articles in a variety of different contexts. We explore the capabilities of current Text-to-Image synthesis models to generate news domain-specific images using abstractive captions by benchmarking them on ANNA, in both standard training and transfer learning settings. The generated images are judged on the basis of contextual relevance, visual quality, and perceptual similarity to ground-truth image-caption pairs. Through our experiments, we show that techniques such as transfer learning achieve limited success in understanding abstractive captions but still fail to consistently learn the relationships between content and context features. The Dataset is available at https://github.com/aashish2000/ANNA . △ Less

Submitted 1 July, 2024; v1 submitted 5 January, 2023; originally announced January 2023.

Comments: To appear in the ACL 3rd Workshop on Advances in Language and Vision Research (ALVR), Bangkok, Thailand, August 2024, https://alvr-workshop.github.io

MSC Class: 65D19

arXiv:2211.02789 [pdf, other]

Forecasting User Interests Through Topic Tag Predictions in Online Health Communities

Authors: Amogh Subbakrishna Adishesha, Lily Jakielaszek, Fariha Azhar, Peixuan Zhang, Vasant Honavar, Fenglong Ma, Chandra Belani, Prasenjit Mitra, Sharon Xiaolei Huang

Abstract: The increasing reliance on online communities for healthcare information by patients and caregivers has led to the increase in the spread of misinformation, or subjective, anecdotal and inaccurate or non-specific recommendations, which, if acted on, could cause serious harm to the patients. Hence, there is an urgent need to connect users with accurate and tailored health information in a timely ma… ▽ More The increasing reliance on online communities for healthcare information by patients and caregivers has led to the increase in the spread of misinformation, or subjective, anecdotal and inaccurate or non-specific recommendations, which, if acted on, could cause serious harm to the patients. Hence, there is an urgent need to connect users with accurate and tailored health information in a timely manner to prevent such harm. This paper proposes an innovative approach to suggesting reliable information to participants in online communities as they move through different stages in their disease or treatment. We hypothesize that patients with similar histories of disease progression or course of treatment would have similar information needs at comparable stages. Specifically, we pose the problem of predicting topic tags or keywords that describe the future information needs of users based on their profiles, traces of their online interactions within the community (past posts, replies) and the profiles and traces of online interactions of other users with similar profiles and similar traces of past interaction with the target users. The result is a variant of the collaborative information filtering or recommendation system tailored to the needs of users of online health communities. We report results of our experiments on an expert curated data set which demonstrate the superiority of the proposed approach over the state of the art baselines with respect to accurate and timely prediction of topic tags (and hence information sources of interest). △ Less

Submitted 4 November, 2022; originally announced November 2022.

Comments: Healthcare Informatics and NLP

arXiv:2210.01559 [pdf, other]

Cross-identity Video Motion Retargeting with Joint Transformation and Synthesis

Authors: Haomiao Ni, Yihao Liu, Sharon X. Huang, Yuan Xue

Abstract: In this paper, we propose a novel dual-branch Transformation-Synthesis network (TS-Net), for video motion retargeting. Given one subject video and one driving video, TS-Net can produce a new plausible video with the subject appearance of the subject video and motion pattern of the driving video. TS-Net consists of a warp-based transformation branch and a warp-free synthesis branch. The novel desig… ▽ More In this paper, we propose a novel dual-branch Transformation-Synthesis network (TS-Net), for video motion retargeting. Given one subject video and one driving video, TS-Net can produce a new plausible video with the subject appearance of the subject video and motion pattern of the driving video. TS-Net consists of a warp-based transformation branch and a warp-free synthesis branch. The novel design of dual branches combines the strengths of deformation-grid-based transformation and warp-free generation for better identity preservation and robustness to occlusion in the synthesized videos. A mask-aware similarity module is further introduced to the transformation branch to reduce computational overhead. Experimental results on face and dance datasets show that TS-Net achieves better performance in video motion retargeting than several state-of-the-art models as well as its single-branch variants. Our code is available at https://github.com/nihaomiao/WACV23_TSNet. △ Less

Submitted 1 October, 2022; originally announced October 2022.

Comments: WACV 2023

arXiv:2206.02788 [pdf]

doi 10.1073/pnas.2118836119

Accurate Virus Identification with Interpretable Raman Signatures by Machine Learning

Authors: Jiarong Ye, Yin-Ting Yeh, Yuan Xue, Ziyang Wang, Na Zhang, He Liu, Kunyan Zhang, RyeAnne Ricker, Zhuohang Yu, Allison Roder, Nestor Perea Lopez, Lindsey Organtini, Wallace Greene, Susan Hafenstein, Huaguang Lu, Elodie Ghedin, Mauricio Terrones, Shengxi Huang, Sharon Xiaolei Huang

Abstract: Rapid identification of newly emerging or circulating viruses is an important first step toward managing the public health response to potential outbreaks. A portable virus capture device coupled with label-free Raman Spectroscopy holds the promise of fast detection by rapidly obtaining the Raman signature of a virus followed by a machine learning approach applied to recognize the virus based on i… ▽ More Rapid identification of newly emerging or circulating viruses is an important first step toward managing the public health response to potential outbreaks. A portable virus capture device coupled with label-free Raman Spectroscopy holds the promise of fast detection by rapidly obtaining the Raman signature of a virus followed by a machine learning approach applied to recognize the virus based on its Raman spectrum, which is used as a fingerprint. We present such a machine learning approach for analyzing Raman spectra of human and avian viruses. A Convolutional Neural Network (CNN) classifier specifically designed for spectral data achieves very high accuracy for a variety of virus type or subtype identification tasks. In particular, it achieves 99% accuracy for classifying influenza virus type A vs. type B, 96% accuracy for classifying four subtypes of influenza A, 95% accuracy for differentiating enveloped and non-enveloped viruses, and 99% accuracy for differentiating avian coronavirus (infectious bronchitis virus, IBV) from other avian viruses. Furthermore, interpretation of neural net responses in the trained CNN model using a full-gradient algorithm highlights Raman spectral ranges that are most important to virus identification. By correlating ML-selected salient Raman ranges with the signature ranges of known biomolecules and chemical functional groups (for example, amide, amino acid, carboxylic acid), we verify that our ML model effectively recognizes the Raman signatures of proteins, lipids and other vital functional groups present in different viruses and uses a weighted combination of these signatures to identify viruses. △ Less

Submitted 5 June, 2022; originally announced June 2022.

Comments: 23 pages, 8 figures

Journal ref: Proceedings of the National Academy of Sciences of the United States of America (2022)

arXiv:2110.07584 [pdf, other]

Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a Loop

Authors: Peng **, Xitong Zhang, Yinpeng Chen, Sharon Xiaolei Huang, Zicheng Liu, Youzuo Lin

Abstract: This paper investigates unsupervised learning of Full-Waveform Inversion (FWI), which has been widely used in geophysics to estimate subsurface velocity maps from seismic data. This problem is mathematically formulated by a second order partial differential equation (PDE), but is hard to solve. Moreover, acquiring velocity map is extremely expensive, making it impractical to scale up a supervised… ▽ More This paper investigates unsupervised learning of Full-Waveform Inversion (FWI), which has been widely used in geophysics to estimate subsurface velocity maps from seismic data. This problem is mathematically formulated by a second order partial differential equation (PDE), but is hard to solve. Moreover, acquiring velocity map is extremely expensive, making it impractical to scale up a supervised approach to train the map** from seismic data to velocity maps with convolutional neural networks (CNN). We address these difficulties by integrating PDE and CNN in a loop, thus shifting the paradigm to unsupervised learning that only requires seismic data. In particular, we use finite difference to approximate the forward modeling of PDE as a differentiable operator (from velocity map to seismic data) and model its inversion by CNN (from seismic data to velocity map). Hence, we transform the supervised inversion task into an unsupervised seismic data reconstruction task. We also introduce a new large-scale dataset OpenFWI, to establish a more challenging benchmark for the community. Experiment results show that our model (using seismic data alone) yields comparable accuracy to the supervised counterpart (using both seismic data and velocity map). Furthermore, it outperforms the supervised model when involving more seismic data. △ Less

Submitted 18 March, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

arXiv:2105.12887 [pdf]

Multi-turn Dialog System on Single-turn Data in Medical Domain

Authors: Nazib Sorathiya, Chuan-An Lin, Daniel Chen Daniel Xiong, Scott Zin, Yi Zhang, He Sarina Yang, Sharon Xiaolei Huang

Abstract: Recently there has been a huge interest in dialog systems. This interest has also been developed in the field of the medical domain where researchers are focusing on building a dialog system in the medical domain. This research is focused on the multi-turn dialog system trained on the multi-turn dialog data. It is difficult to gather a huge amount of multi-turn conversational data in the medical d… ▽ More Recently there has been a huge interest in dialog systems. This interest has also been developed in the field of the medical domain where researchers are focusing on building a dialog system in the medical domain. This research is focused on the multi-turn dialog system trained on the multi-turn dialog data. It is difficult to gather a huge amount of multi-turn conversational data in the medical domain that is verified by professionals and can be trusted. However, there are several frequently asked questions (FAQs) or single-turn QA pairs that have information that is verified by the experts and can be used to build a multi-turn dialog system. △ Less

Submitted 26 May, 2021; originally announced May 2021.

arXiv:2102.10306 [pdf]

Designation of Intra-layer and Intercalated High Entropy Quasi-2D Compounds

Authors: Hong Xiang Chen, Sheng Li, Shu Xian Huang, Li An Ma, Sheng Liu, Fang Tang, Yong Fang, Pin Qiang Dai

Abstract: Here, we designed two promising schemes to realize the high-entropy structure in a series of quasi-two-dimensional compounds, transition metal dichalcogenides (TMDCs). In the intra-layer high-entropy plan, (HEM)X2 compounds with high-entropy structure in the MX2 slabs were obtained, here HEM means high-entropy metals, such as TiZrNbMoTa. And superconductivity with a Tc~7.4 K was found in a Mo-rich… ▽ More Here, we designed two promising schemes to realize the high-entropy structure in a series of quasi-two-dimensional compounds, transition metal dichalcogenides (TMDCs). In the intra-layer high-entropy plan, (HEM)X2 compounds with high-entropy structure in the MX2 slabs were obtained, here HEM means high-entropy metals, such as TiZrNbMoTa. And superconductivity with a Tc~7.4 K was found in a Mo-rich HEMX2. On the other hand, in the intercalation plan, we intercalated HEM-atoms (FeCoCrNiMn) into the gap between the sandwiched-MX2 slabs resulting in a series of (HEM)xMX2 compounds, x in the range of 0~0.5, in which HEM is mainly composed of 3d transition metal elements, such as FeCoCrNiMn. As the introduction of multi-component magnetic atoms, ferromagnetic spin-glass states with strong 2D characteristics ensued. Tuning the x content, three kinds of two in the high-entropy intercalated layer were observed including the 1*1 triangular lattice and two kinds of superlattices \sqrt3*\sqrt3 and \sqrt3*2 in x=0.333 and x>0.5, respectively. Meanwhile, the spin frustration in the two-dimensional high-entropy magnetic plane will be enhanced with the development of \sqrt3*\sqrt3 and will be reduced significantly when changing into the \sqrt3*2 phase. The high-entropy TMDCs and versatile two-dimensional high-entropy structures found by us possess great potentials to find new physics in low-dimensional high-entropy structures and future applications. △ Less

Submitted 18 April, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

Comments: 12 pages, 6 figures

arXiv:2008.02202 [pdf]

doi 10.1103/PhysRevMaterials.4.084203

Anomalous Hall and Nernst effects in epitaxial films of topological kagome magnet Fe3Sn2

Authors: Durga Khadka, T. R. Thapaliya, Sebastian Hurtado Parra, Jiajia Wen, Ryan Need, James M. Kikkawa, S. X. Huang

Abstract: The topological kagome magnet (TKM) Fe3Sn2 exhibits unusual topological properties, flat electronic bands, and chiral spin textures, making it an exquisite materials platform to explore the interplay between topological band structure, strong electron correlations, and magnetism. Here we report the first synthesis of high-quality epitaxial (0001) Fe3Sn2 films with large intrinsic anomalous Hall ef… ▽ More The topological kagome magnet (TKM) Fe3Sn2 exhibits unusual topological properties, flat electronic bands, and chiral spin textures, making it an exquisite materials platform to explore the interplay between topological band structure, strong electron correlations, and magnetism. Here we report the first synthesis of high-quality epitaxial (0001) Fe3Sn2 films with large intrinsic anomalous Hall effect close to that measured in bulk single crystals. In addition, we measured a large, anisotropic anomalous Nernst coefficient Syx of 1.26 μV/K, roughly 2-5x greater than that of common ferromagnets, suggesting the presence of Berry curvature sources near the Fermi level in this system. Crucially, the realization of high-quality Fe3Sn2 films opens the door to explore emergent interfacial physics and create novel spintronic devices based on TKMs by interfacing Fe3Sn2 with other quantum materials and by nanostructure patterning. △ Less

Submitted 5 August, 2020; originally announced August 2020.

Comments: accepted by Physical Review Materials

Journal ref: Phys. Rev. Materials 4, 084203 (2020)

arXiv:2007.05802 [pdf]

doi 10.1063/5.0011497

High quality epitaxial thin films and exchange bias of antiferromagnetic Dirac semimetal FeSn

Authors: Durga Khadka, T. R. Thapaliya, Jiajia Wen, Ryan F. Need, S. X. Huang

Abstract: FeSn is a topological semimetal (TSM) and kagome antiferromagnet (AFM) composed of alternating Fe3Sn kagome planes and honeycomb Sn planes. This unique structure gives rise to exotic features in the band structures such as the coexistence of Dirac cones and flatbands near the Fermi level, fully spin-polarized 2D surface Dirac fermions, and the ability to open a large gap in the Dirac cone by reori… ▽ More FeSn is a topological semimetal (TSM) and kagome antiferromagnet (AFM) composed of alternating Fe3Sn kagome planes and honeycomb Sn planes. This unique structure gives rise to exotic features in the band structures such as the coexistence of Dirac cones and flatbands near the Fermi level, fully spin-polarized 2D surface Dirac fermions, and the ability to open a large gap in the Dirac cone by reorienting the Néel vector. In this work, we report the synthesis of high quality epitaxial (0001) FeSn films by magnetron sputtering. Using FeSn/Py heterostructures, we show a large exchange bias effect that reaches an exchange field of 220 Oe at 5 K, providing unambiguous evidence of antiferromagnetism and strong interlayer exchange coupling in our films. Field cycling studies show steep initial training effects, highlighting the complex magnetic interactions and anisotropy. Importantly, our work provides a simple, alternative means to fabricate FeSn films and heterostructures, making it easier to explore the topological physics of AFM TSMs and develop FeSn-based spintronics. △ Less

Submitted 11 July, 2020; originally announced July 2020.

Comments: accepted by APL

Journal ref: Appl. Phys. Lett. 117, 032403 (2020)

arXiv:2007.02221 [pdf]

doi 10.1126/sciadv.abc1977

Kondo physics in antiferromagnetic Weyl semimetal Mn3+xSn1-x films

Authors: Durga Khadka, T. R. Thapaliya, Sebastian Hurtado Parra, Xingyue Han, Jiajia Wen, Ryan F. Need, Pravin Khanal, Weigang Wang, Jiadong Zang, James M. Kikkawa, Liang Wu, S. X. Huang

Abstract: Topology and strong electron correlations are crucial ingredients in emerging quantum materials, yet their intersection in experimental systems has been relatively limited to date. Strongly correlated Weyl semimetals, particularly when magnetism is incorporated, offer a unique and fertile platform to explore emergent phenomena in novel topological matter and topological spintronics. The antiferrom… ▽ More Topology and strong electron correlations are crucial ingredients in emerging quantum materials, yet their intersection in experimental systems has been relatively limited to date. Strongly correlated Weyl semimetals, particularly when magnetism is incorporated, offer a unique and fertile platform to explore emergent phenomena in novel topological matter and topological spintronics. The antiferromagnetic Weyl semimetal Mn3Sn exhibits many exotic physical properties such as a large spontaneous Hall effect and has recently attracted intense interest. In this work, we report synthesis of epitaxial Mn3+xSn1-x films with greatly extended compositional range in comparison with that of bulk samples. As Sn atoms are replaced by magnetic Mn atoms, the Kondo effect, which is a celebrated example of strong correlations, emerges, develops coherence, and induces a hybridization energy gap. The magnetic do** and gap opening lead to rich extraordinary properties as exemplified by the prominent DC Hall effects and resonance-enhanced terahertz Faraday rotation. △ Less

Submitted 4 July, 2020; originally announced July 2020.

Journal ref: Science Advances 6, eabc1977 (2020)

arXiv:1901.01242 [pdf, ps, other]

Spin phases of the helimagnetic insulator Cu$_2$OSeO$_3$ probed by magnon heat conduction

Authors: N. Prasai, A. Akopyan, B. A. Trump, G. G. Marcus, S. X. Huang, T. M. McQueen, J. L. Cohn

Abstract: We report studies of thermal conductivity as functions of magnetic field and temperature in the helimagnetic insulator Cu$_2$OSeO$_3$ that reveal novel features of the spin-phase transitions as probed by magnon heat conduction. The tilted conical spiral and low-temperature skyrmion phases, recently identified in small-angle neutron scattering studies, are clearly identified by sharp signatures in… ▽ More We report studies of thermal conductivity as functions of magnetic field and temperature in the helimagnetic insulator Cu$_2$OSeO$_3$ that reveal novel features of the spin-phase transitions as probed by magnon heat conduction. The tilted conical spiral and low-temperature skyrmion phases, recently identified in small-angle neutron scattering studies, are clearly identified by sharp signatures in the magnon thermal conductivity. Magnon scattering associated with the presence of domain boundaries in the tilted conical phase and regions of skyrmion and conical-phase coexistence are identified. △ Less

Submitted 4 January, 2019; originally announced January 2019.

Comments: 5 pp., 3 figures + Supplemental Material (2 pp.) Accepted, PRB Rapid Communications

arXiv:1705.06328 [pdf, ps, other]

doi 10.1103/PhysRevB.95.224407

Ballistic magnon heat conduction and possible Poiseuille flow in the helimagnetic insulator Cu$_2$OSeO$_3$

Authors: N. Prasai, B. A. Trump, G. G. Marcus, A. Akopyan, S. X. Huang, T. M. McQueen, J. L. Cohn

Abstract: We report on the observation of magnon thermal conductivity $κ_m\sim$ 70 W/mK near 5 K in the helimagnetic insulator Cu$_2$OSeO$_3$, exceeding that measured in any other ferromagnet by almost two orders of magnitude. Ballistic, boundary-limited transport for both magnons and phonons is established below 1 K, and Poiseuille flow of magnons is proposed to explain a magnon mean-free path substantiall… ▽ More We report on the observation of magnon thermal conductivity $κ_m\sim$ 70 W/mK near 5 K in the helimagnetic insulator Cu$_2$OSeO$_3$, exceeding that measured in any other ferromagnet by almost two orders of magnitude. Ballistic, boundary-limited transport for both magnons and phonons is established below 1 K, and Poiseuille flow of magnons is proposed to explain a magnon mean-free path substantially exceeding the specimen width for the least defective specimens in the range 2 K $<T<$ 10 K. These observations establish Cu$_2$OSeO$_3$ as a model system for studying long-wavelength magnon dynamics. △ Less

Submitted 29 May, 2017; v1 submitted 17 May, 2017; originally announced May 2017.

Comments: 10pp, 9 figures, accepted PRB (Editor's Suggestion)

arXiv:1409.7869 [pdf]

Universal Ratio of Intrinsic Resistivities of Spin Helix in B20 (Fe-Co)Si Magnets

Authors: S. X. Huang, Jian Kang, Fei Chen, Jiadong Zang, G. J. Shu, F. C. Chou, S. V. Grigoriev, V. A. Dyadkin, C. L. Chien

Abstract: The B20 magnets with the Dzyaloshinskii-Moriya (D-M) interaction exhibit spin helix and Skyrmion spin textures unattainable in traditional Heisenberg ferromagnets. We have determined the intrinsic resistivity of the spin helix, which is a macroscopic Bloch domain wall, in B20 (Fe-Co)Si magnets. We found a universal resistance ratio of gamma = 1.35 with current parallel and perpendicular to the hel… ▽ More The B20 magnets with the Dzyaloshinskii-Moriya (D-M) interaction exhibit spin helix and Skyrmion spin textures unattainable in traditional Heisenberg ferromagnets. We have determined the intrinsic resistivity of the spin helix, which is a macroscopic Bloch domain wall, in B20 (Fe-Co)Si magnets. We found a universal resistance ratio of gamma = 1.35 with current parallel and perpendicular to the helix, independent of composition and temperature. This gamma value is much smaller than 3, the well-known minimum value for domain wall resistivity in traditional ferromagnets, due to the significant spin-orbit coupling in the B20 magnets. △ Less

Submitted 27 September, 2014; originally announced September 2014.

arXiv:1409.7867 [pdf]

Magnetoresistance due to Broken C4 Symmetry in Cubic B20 Chiral Magnets

Authors: S. X. Huang, Fei Chen, Jian Kang, Jiadong Zang, G. J. Shu, F. C. Chou, C. L. Chien

Abstract: The B20 chiral magnets with broken inversion symmetry and C4 rotation symmetry have attracted much attention. The broken inversion symmetry leads to the Dzyaloshinskii-Moriya that gives rise to the helical and Skyrmion states. We report the unusual magnetoresistance (MR) of B20 chiral magnet Fe0.85Co0.15Si that directly reveals the broken C4 rotation symmetry. We present a microscopic theory, a mi… ▽ More The B20 chiral magnets with broken inversion symmetry and C4 rotation symmetry have attracted much attention. The broken inversion symmetry leads to the Dzyaloshinskii-Moriya that gives rise to the helical and Skyrmion states. We report the unusual magnetoresistance (MR) of B20 chiral magnet Fe0.85Co0.15Si that directly reveals the broken C4 rotation symmetry. We present a microscopic theory, a minimal theory with two spin-orbit terms, that satisfies all the symmetry requirements and accounts for the transport experiments. △ Less

Submitted 27 September, 2014; originally announced September 2014.

arXiv:1108.4399 [pdf, ps, other]

doi 10.1063/1.4726209

Pressure effects on strained FeSe0.5Te0.5 thin films

Authors: M. Gooch, B. Lorenz, S. X. Huang, C. L. Chien, C. W. Chu

Abstract: The pressure effect on the resistivity and superconducting Tc of prestrained thin films of the iron chalcogenide superconductor FeSe0.5Te0.5 is studied. Films with different anion heights above the Fe layer showing different values of ambient pressure Tc's are compressed up to a pressure of 1.7 GPa. All films exhibit a significant increase of Tc with pressure. The results cannot solely be explaine… ▽ More The pressure effect on the resistivity and superconducting Tc of prestrained thin films of the iron chalcogenide superconductor FeSe0.5Te0.5 is studied. Films with different anion heights above the Fe layer showing different values of ambient pressure Tc's are compressed up to a pressure of 1.7 GPa. All films exhibit a significant increase of Tc with pressure. The results cannot solely be explained by a pressure-induced decrease of the anion height but other parameters have to be considered to explain the data for all films. △ Less

Submitted 22 August, 2011; originally announced August 2011.

Comments: 4 pages, 3 figures

Journal ref: J. Appl. Phys. 111, 112610 (2012)

arXiv:1002.2669 [pdf]

doi 10.1103/PhysRevLett.104.217002

Control of tetrahedral coordination and superconductivity in FeSe0.5Te0.5 thin films

Authors: S. X. Huang, C. L. Chien, V. Thampy, C. Broholm

Abstract: We demonstrate a close relationship between superconductivity and the dimensions of the Fe-Se(Te) tetrahedron in FeSe0.5Te0.5. This is done by exploiting thin film epitaxy, which provides controlled biaxial stress, both compressive and tensile, to distort the tetrahedron. The Se/Te height within the tetrahedron is found to be of crucial importance to superconductivity, in agreement with the theo… ▽ More We demonstrate a close relationship between superconductivity and the dimensions of the Fe-Se(Te) tetrahedron in FeSe0.5Te0.5. This is done by exploiting thin film epitaxy, which provides controlled biaxial stress, both compressive and tensile, to distort the tetrahedron. The Se/Te height within the tetrahedron is found to be of crucial importance to superconductivity, in agreement with the theoretical proposal that (pi,pi) spin fluctuations promote superconductivity in Fe superconductors. △ Less

Submitted 12 February, 2010; originally announced February 2010.

Journal ref: Phys. Rev. Lett. 104, 217002 (2010)

arXiv:0902.4008 [pdf]

doi 10.1016/j.physc.2009.03.041

Determination of Superconducting Gap of SmFeAsFxO1-x Superconductors by Andreev Reflection Spectroscopy

Authors: T. Y. Chen, S. X. Huang, Z. Tesanovic, R. H. Liu, X. H. Chen, C. L. Chien

Abstract: The superconducting gap in FeAs-based superconductor SmFeAs(O1-xFx) (x = 0.15 and 0.30) and the temperature dependence of the sample with x = 0.15 have been measured by Andreev reflection spectroscopy. The intrinsic superconducting gap is independent of contacts while many other "gap-like" features vary appreciably for different contacts. The determined gap value of 2D = 13.34 +/-0.47 meV for Sm… ▽ More The superconducting gap in FeAs-based superconductor SmFeAs(O1-xFx) (x = 0.15 and 0.30) and the temperature dependence of the sample with x = 0.15 have been measured by Andreev reflection spectroscopy. The intrinsic superconducting gap is independent of contacts while many other "gap-like" features vary appreciably for different contacts. The determined gap value of 2D = 13.34 +/-0.47 meV for SmFeAs(O0.85F0.15) gives 2D/kBTC = 3.68, close to the BCS prediction of 3.53. The superconducting gap decreases with temperature and vanishes at TC, in a manner similar to the BCS behavior but dramatically different from that of the nodal pseudogap behavior in cuprate superconductors. △ Less

Submitted 23 February, 2009; originally announced February 2009.

Comments: 13 pages, 9 figures, Special Issue of Physica C on Superconducting Pnictides

Showing 1–23 of 23 results for author: Huang, S X