Showing 1–23 of 23 results for author: Huang, S X

  1. arXiv:2404.16306  [pdf, other]

    cs.CV

    TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

    Authors: Haomiao Ni, Bernhard Egger, Suhas Lohit, Anoop Cherian, Ye Wang, Toshiaki Koike-Akino, Sharon X. Huang, Tim K. Marks

    Abstract: Text-conditioned image-to-video generation (TI2V) aims to synthesize a realistic video starting from a given image (e.g., a woman's photo) and a text description (e.g., "a woman is drinking water."). Existing TI2V frameworks often require costly training on video-text datasets and specific model designs for text and image conditioning. In this paper, we propose TI2V-Zero, a zero-shot, tuning-free…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  2. arXiv:2404.10141  [pdf, other]

    cs.CV cs.CL cs.MM

    ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis

    Authors: Aashish Anantha Ramakrishnan, Sharon X. Huang, Dongwon Lee

    Abstract: Text-to-Image (T2I) Synthesis has made tremendous strides in enhancing synthesized image quality, but current datasets evaluate model performance only on descriptive, instruction-based prompts. Real-world news image captions take a more pragmatic approach, providing high-level situational and Named-Entity (NE) information and limited physical object descriptions, making them abstractive. To evalua…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 23 pages, 9 figures

    MSC Class: 65D19

  3. arXiv:2311.02549  [pdf, other]

    cs.CV

    3D-Aware Talking-Head Video Motion Transfer

    Authors: Haomiao Ni, Jiachen Liu, Yuan Xue, Sharon X. Huang

    Abstract: Motion transfer of talking-head videos involves generating a new video with the appearance of a subject video and the motion pattern of a driving video. Current methodologies primarily depend on a limited number of subject images and 2D representations, thereby neglecting to fully utilize the multi-view appearance features inherent in the subject video. In this paper, we propose a novel 3D-aware t…

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: WACV 2024

  4. arXiv:2309.13240  [pdf, other]

    cs.CV cs.RO

    NeRF-Enhanced Outpainting for Faithful Field-of-View Extrapolation

    Authors: Rui Yu, Jiachen Liu, Zihan Zhou, Sharon X. Huang

    Abstract: In various applications, such as robotic navigation and remote visual assistance, expanding the field of view (FOV) of the camera proves beneficial for enhancing environmental perception. Unlike image outpainting techniques aimed solely at generating aesthetically pleasing visuals, these applications demand an extended view that faithfully represents the scene. To achieve this, we formulate a new…

    Submitted 22 September, 2023; originally announced September 2023.

  5. arXiv:2308.04020  [pdf, other]

    cs.CV

    Synthetic Augmentation with Large-scale Unconditional Pre-training

    Authors: Jiarong Ye, Haomiao Ni, Peng Jin, Sharon X. Huang, Yuan Xue

    Abstract: Deep learning based medical image recognition systems often require a substantial amount of training data with expert annotations, which can be expensive and time-consuming to obtain. Recently, synthetic augmentation techniques have been proposed to mitigate the issue by generating realistic images conditioned on class labels. However, the effectiveness of these methods heavily depends on the repr…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: MICCAI 2023

  6. arXiv:2303.13744  [pdf, other]

    cs.CV

    Conditional Image-to-Video Generation with Latent Flow Diffusion Models

    Authors: Haomiao Ni, Changhao Shi, Kai Li, Sharon X. Huang, Martin Renqiang Min

    Abstract: Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video starting from an image (e.g., a person's face) and a condition (e.g., an action class label like smile). The key challenge of the cI2V task lies in the simultaneous generation of realistic spatial appearance and temporal dynamics corresponding to the given image and condition. In this paper, we propose an approac…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  7. arXiv:2301.02160  [pdf, other]

    cs.CV

    ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions

    Authors: Aashish Anantha Ramakrishnan, Sharon X. Huang, Dongwon Lee

    Abstract: Advancements in Text-to-Image synthesis over recent years have focused more on improving the quality of generated samples using datasets with descriptive prompts. However, real-world image-caption pairs present in domains such as news data do not use simple and directly descriptive captions. With captions containing information on both the image content and underlying contextual cues, they become…

    Submitted 1 July, 2024; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: To appear in the ACL 3rd Workshop on Advances in Language and Vision Research (ALVR), Bangkok, Thailand, August 2024, https://alvr-workshop.github.io

    MSC Class: 65D19

  8. arXiv:2211.02789  [pdf, other]

    cs.LG

    Forecasting User Interests Through Topic Tag Predictions in Online Health Communities

    Authors: Amogh Subbakrishna Adishesha, Lily Jakielaszek, Fariha Azhar, Peixuan Zhang, Vasant Honavar, Fenglong Ma, Chandra Belani, Prasenjit Mitra, Sharon Xiaolei Huang

    Abstract: The increasing reliance on online communities for healthcare information by patients and caregivers has led to the increase in the spread of misinformation, or subjective, anecdotal and inaccurate or non-specific recommendations, which, if acted on, could cause serious harm to the patients. Hence, there is an urgent need to connect users with accurate and tailored health information in a timely ma…

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: Healthcare Informatics and NLP

  9. arXiv:2210.01559  [pdf, other]

    cs.CV

    Cross-identity Video Motion Retargeting with Joint Transformation and Synthesis

    Authors: Haomiao Ni, Yihao Liu, Sharon X. Huang, Yuan Xue

    Abstract: In this paper, we propose a novel dual-branch Transformation-Synthesis network (TS-Net), for video motion retargeting. Given one subject video and one driving video, TS-Net can produce a new plausible video with the subject appearance of the subject video and motion pattern of the driving video. TS-Net consists of a warp-based transformation branch and a warp-free synthesis branch. The novel desig…

    Submitted 1 October, 2022; originally announced October 2022.

    Comments: WACV 2023

  10. Accurate Virus Identification with Interpretable Raman Signatures by Machine Learning

    Authors: Jiarong Ye, Yin-Ting Yeh, Yuan Xue, Ziyang Wang, Na Zhang, He Liu, Kunyan Zhang, RyeAnne Ricker, Zhuohang Yu, Allison Roder, Nestor Perea Lopez, Lindsey Organtini, Wallace Greene, Susan Hafenstein, Huaguang Lu, Elodie Ghedin, Mauricio Terrones, Shengxi Huang, Sharon Xiaolei Huang

    Abstract: Rapid identification of newly emerging or circulating viruses is an important first step toward managing the public health response to potential outbreaks. A portable virus capture device coupled with label-free Raman Spectroscopy holds the promise of fast detection by rapidly obtaining the Raman signature of a virus followed by a machine learning approach applied to recognize the virus based on i…

    Submitted 5 June, 2022; originally announced June 2022.

    Comments: 23 pages, 8 figures

    Journal ref: Proceedings of the National Academy of Sciences of the United States of America (2022)

  11. arXiv:2110.07584  [pdf, other]

    cs.LG eess.SP physics.geo-ph

    Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a Loop

    Authors: Peng Jin, Xitong Zhang, Yinpeng Chen, Sharon Xiaolei Huang, Zicheng Liu, Youzuo Lin

    Abstract: This paper investigates unsupervised learning of Full-Waveform Inversion (FWI), which has been widely used in geophysics to estimate subsurface velocity maps from seismic data. This problem is mathematically formulated by a second order partial differential equation (PDE), but is hard to solve. Moreover, acquiring velocity map is extremely expensive, making it impractical to scale up a supervised…

    Submitted 18 March, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  12. arXiv:2105.12887  [pdf]

    cs.CL cs.AI

    Multi-turn Dialog System on Single-turn Data in Medical Domain

    Authors: Nazib Sorathiya, Chuan-An Lin, Daniel Chen, Daniel Xiong, Scott Zin, Yi Zhang, He Sarina Yang, Sharon Xiaolei Huang

    Abstract: Recently there has been a huge interest in dialog systems. This interest has also been developed in the field of the medical domain where researchers are focusing on building a dialog system in the medical domain. This research is focused on the multi-turn dialog system trained on the multi-turn dialog data. It is difficult to gather a huge amount of multi-turn conversational data in the medical d…

    Submitted 26 May, 2021; originally announced May 2021.

  13. arXiv:2102.10306  [pdf]

    cond-mat.mtrl-sci

    Designation of Intra-layer and Intercalated High Entropy Quasi-2D Compounds

    Authors: Hong Xiang Chen, Sheng Li, Shu Xian Huang, Li An Ma, Sheng Liu, Fang Tang, Yong Fang, Pin Qiang Dai

    Abstract: Here, we designed two promising schemes to realize the high-entropy structure in a series of quasi-two-dimensional compounds, transition metal dichalcogenides (TMDCs). In the intra-layer high-entropy plan, (HEM)X2 compounds with high-entropy structure in the MX2 slabs were obtained, here HEM means high-entropy metals, such as TiZrNbMoTa. And superconductivity with a Tc~7.4 K was found in a Mo-rich…

    Submitted 18 April, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: 12 pages, 6 figures

  14. arXiv:2008.02202  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Anomalous Hall and Nernst effects in epitaxial films of topological kagome magnet Fe3Sn2

    Authors: Durga Khadka, T. R. Thapaliya, Sebastian Hurtado Parra, Jiajia Wen, Ryan Need, James M. Kikkawa, S. X. Huang

    Abstract: The topological kagome magnet (TKM) Fe3Sn2 exhibits unusual topological properties, flat electronic bands, and chiral spin textures, making it an exquisite materials platform to explore the interplay between topological band structure, strong electron correlations, and magnetism. Here we report the first synthesis of high-quality epitaxial (0001) Fe3Sn2 films with large intrinsic anomalous Hall ef…

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: accepted by Physical Review Materials

    Journal ref: Phys. Rev. Materials 4, 084203 (2020)

  15. arXiv:2007.05802  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    High quality epitaxial thin films and exchange bias of antiferromagnetic Dirac semimetal FeSn

    Authors: Durga Khadka, T. R. Thapaliya, Jiajia Wen, Ryan F. Need, S. X. Huang

    Abstract: FeSn is a topological semimetal (TSM) and kagome antiferromagnet (AFM) composed of alternating Fe3Sn kagome planes and honeycomb Sn planes. This unique structure gives rise to exotic features in the band structures such as the coexistence of Dirac cones and flatbands near the Fermi level, fully spin-polarized 2D surface Dirac fermions, and the ability to open a large gap in the Dirac cone by reori…

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: accepted by APL

    Journal ref: Appl. Phys. Lett. 117, 032403 (2020)

  16. arXiv:2007.02221  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Kondo physics in antiferromagnetic Weyl semimetal Mn3+xSn1-x films

    Authors: Durga Khadka, T. R. Thapaliya, Sebastian Hurtado Parra, Xingyue Han, Jiajia Wen, Ryan F. Need, Pravin Khanal, Weigang Wang, Jiadong Zang, James M. Kikkawa, Liang Wu, S. X. Huang

    Abstract: Topology and strong electron correlations are crucial ingredients in emerging quantum materials, yet their intersection in experimental systems has been relatively limited to date. Strongly correlated Weyl semimetals, particularly when magnetism is incorporated, offer a unique and fertile platform to explore emergent phenomena in novel topological matter and topological spintronics. The antiferrom…

    Submitted 4 July, 2020; originally announced July 2020.

    Journal ref: Science Advances 6, eabc1977 (2020)

  17. arXiv:1901.01242  [pdf, ps, other]

    cond-mat.str-el

    Spin phases of the helimagnetic insulator Cu$_2$OSeO$_3$ probed by magnon heat conduction

    Authors: N. Prasai, A. Akopyan, B. A. Trump, G. G. Marcus, S. X. Huang, T. M. McQueen, J. L. Cohn

    Abstract: We report studies of thermal conductivity as functions of magnetic field and temperature in the helimagnetic insulator Cu$_2$OSeO$_3$ that reveal novel features of the spin-phase transitions as probed by magnon heat conduction. The tilted conical spiral and low-temperature skyrmion phases, recently identified in small-angle neutron scattering studies, are clearly identified by sharp signatures in…

    Submitted 4 January, 2019; originally announced January 2019.

    Comments: 5 pp., 3 figures + Supplemental Material (2 pp.) Accepted, PRB Rapid Communications

  18. Ballistic magnon heat conduction and possible Poiseuille flow in the helimagnetic insulator Cu$_2$OSeO$_3$

    Authors: N. Prasai, B. A. Trump, G. G. Marcus, A. Akopyan, S. X. Huang, T. M. McQueen, J. L. Cohn

    Abstract: We report on the observation of magnon thermal conductivity $\kappa_m \sim 70$ W/mK near 5 K in the helimagnetic insulator Cu$_2$OSeO$_3$, exceeding that measured in any other ferromagnet by almost two orders of magnitude. Ballistic, boundary-limited transport for both magnons and phonons is established below 1 K, and Poiseuille flow of magnons is proposed to explain a magnon mean-free path substantiall…

    Submitted 29 May, 2017; v1 submitted 17 May, 2017; originally announced May 2017.

    Comments: 10pp, 9 figures, accepted PRB (Editor's Suggestion)

  19. arXiv:1409.7869  [pdf]

    cond-mat.mtrl-sci cond-mat.str-el

    Universal Ratio of Intrinsic Resistivities of Spin Helix in B20 (Fe-Co)Si Magnets

    Authors: S. X. Huang, Jian Kang, Fei Chen, Jiadong Zang, G. J. Shu, F. C. Chou, S. V. Grigoriev, V. A. Dyadkin, C. L. Chien

    Abstract: The B20 magnets with the Dzyaloshinskii-Moriya (D-M) interaction exhibit spin helix and Skyrmion spin textures unattainable in traditional Heisenberg ferromagnets. We have determined the intrinsic resistivity of the spin helix, which is a macroscopic Bloch domain wall, in B20 (Fe-Co)Si magnets. We found a universal resistance ratio of gamma = 1.35 with current parallel and perpendicular to the hel…

    Submitted 27 September, 2014; originally announced September 2014.

  20. arXiv:1409.7867  [pdf]

    cond-mat.mtrl-sci cond-mat.str-el

    Magnetoresistance due to Broken C4 Symmetry in Cubic B20 Chiral Magnets

    Authors: S. X. Huang, Fei Chen, Jian Kang, Jiadong Zang, G. J. Shu, F. C. Chou, C. L. Chien

    Abstract: The B20 chiral magnets with broken inversion symmetry and C4 rotation symmetry have attracted much attention. The broken inversion symmetry leads to the Dzyaloshinskii-Moriya interaction that gives rise to the helical and Skyrmion states. We report the unusual magnetoresistance (MR) of B20 chiral magnet Fe0.85Co0.15Si that directly reveals the broken C4 rotation symmetry. We present a microscopic theory, a mi…

    Submitted 27 September, 2014; originally announced September 2014.

  21. arXiv:1108.4399  [pdf, ps, other]

    cond-mat.supr-con cond-mat.str-el

    Pressure effects on strained FeSe0.5Te0.5 thin films

    Authors: M. Gooch, B. Lorenz, S. X. Huang, C. L. Chien, C. W. Chu

    Abstract: The pressure effect on the resistivity and superconducting Tc of prestrained thin films of the iron chalcogenide superconductor FeSe0.5Te0.5 is studied. Films with different anion heights above the Fe layer showing different values of ambient pressure Tc's are compressed up to a pressure of 1.7 GPa. All films exhibit a significant increase of Tc with pressure. The results cannot solely be explaine…

    Submitted 22 August, 2011; originally announced August 2011.

    Comments: 4 pages, 3 figures

    Journal ref: J. Appl. Phys. 111, 112610 (2012)

  22. arXiv:1002.2669  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci

    Control of tetrahedral coordination and superconductivity in FeSe0.5Te0.5 thin films

    Authors: S. X. Huang, C. L. Chien, V. Thampy, C. Broholm

    Abstract: We demonstrate a close relationship between superconductivity and the dimensions of the Fe-Se(Te) tetrahedron in FeSe0.5Te0.5. This is done by exploiting thin film epitaxy, which provides controlled biaxial stress, both compressive and tensile, to distort the tetrahedron. The Se/Te height within the tetrahedron is found to be of crucial importance to superconductivity, in agreement with the theo…

    Submitted 12 February, 2010; originally announced February 2010.

    Journal ref: Phys. Rev. Lett. 104, 217002 (2010)

  23. arXiv:0902.4008  [pdf]

    cond-mat.supr-con

    Determination of Superconducting Gap of SmFeAsFxO1-x Superconductors by Andreev Reflection Spectroscopy

    Authors: T. Y. Chen, S. X. Huang, Z. Tesanovic, R. H. Liu, X. H. Chen, C. L. Chien

    Abstract: The superconducting gap in FeAs-based superconductor SmFeAs(O1-xFx) (x = 0.15 and 0.30) and the temperature dependence of the sample with x = 0.15 have been measured by Andreev reflection spectroscopy. The intrinsic superconducting gap is independent of contacts while many other "gap-like" features vary appreciably for different contacts. The determined gap value of 2D = 13.34 +/-0.47 meV for Sm…

    Submitted 23 February, 2009; originally announced February 2009.

    Comments: 13 pages, 9 figures, Special Issue of Physica C on Superconducting Pnictides