Skip to main content

Showing 51–100 of 248 results for author: Moe, S

.
  1. arXiv:2306.14490  [pdf, other

    cs.CV cs.AI

    TaiChi Action Capture and Performance Analysis with Multi-view RGB Cameras

    Authors: Jianwei Li, Siyu Mo, Yanfei Shen

    Abstract: Recent advances in computer vision and deep learning have influenced the field of sports performance analysis for researchers to track and reconstruct freely moving humans without any marker attachment. However, there are few works for vision-based motion capture and intelligent analysis for professional TaiChi movement. In this paper, we propose a framework for TaiChi performance capture and anal… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  2. arXiv:2305.19458  [pdf, other

    cs.SD cs.CV cs.LG cs.MM eess.AS

    A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition

    Authors: Shentong Mo, Pedro Morgado

    Abstract: The ability to accurately recognize, localize and separate sound sources is fundamental to any audio-visual perception task. Historically, these abilities were tackled separately, with several methods developed independently for each task. However, given the interconnected nature of source localization, separation, and recognition, independent models are likely to yield suboptimal performance as t… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  3. arXiv:2305.14095  [pdf, other

    cs.CV cs.LG

    S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions

    Authors: Sangwoo Mo, Minkyu Kim, Kyungmin Lee, **woo Shin

    Abstract: Vision-language models, such as contrastive language-image pre-training (CLIP), have demonstrated impressive results in natural image domains. However, these models often struggle when applied to specialized domains like remote sensing, and adapting to such domains is challenging due to the limited number of image-text pairs available for training. To address this, we propose S-CLIP, a semi-superv… ▽ More

    Submitted 25 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  4. arXiv:2305.12903  [pdf, other

    cs.CV cs.LG cs.MM

    DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment

    Authors: Shentong Mo, **g Shi, Yapeng Tian

    Abstract: Text-to-audio (TTA) generation is a recent popular problem that aims to synthesize general audio given text descriptions. Previous methods utilized latent diffusion models to learn audio embedding in a latent space with text embedding as the condition. However, they ignored the synchronization between audio and visual content in the video, and tended to generate audio mismatching from video frames… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  5. arXiv:2305.01836  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation

    Authors: Shentong Mo, Yapeng Tian

    Abstract: Segment Anything Model (SAM) has recently shown its powerful effectiveness in visual segmentation tasks. However, there is less exploration concerning how SAM works on audio-visual tasks, such as visual sound localization and segmentation. In this work, we propose a simple yet effective audio-visual localization and segmentation framework based on the Segment Anything Model, namely AV-SAM, that ca… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  6. arXiv:2304.04399  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    CAVL: Learning Contrastive and Adaptive Representations of Vision and Language

    Authors: Shentong Mo, **gfei Xia, Ihor Markevych

    Abstract: Visual and linguistic pre-training aims to learn vision and language representations together, which can be transferred to visual-linguistic downstream tasks. However, there exists semantic confusion between language and vision during the pre-training stage. Moreover, current pre-trained models tend to take lots of computation resources for fine-tuning when transferred to downstream tasks. In this… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  7. arXiv:2304.00425  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Charge order induced Dirac pockets in the nonsymmorphic crystal TaTe$_4$

    Authors: Yichen Zhang, Ruixiang Zhou, Hanlin Wu, Ji Seop Oh, Sheng Li, Jianwei Huang, Jonathan D. Denlinger, Makoto Hashimoto, Donghui Lu, Sung-Kwan Mo, Kevin F. Kelly, Gregory T. McCandless, Julia Y. Chan, Robert J. Birgeneau, Bing Lv, Gang Li, Ming Yi

    Abstract: The interplay between charge order (CO) and nontrivial band topology has spurred tremendous interest in understanding topological excitations beyond the single-particle description. In a quasi-one-dimensional nonsymmorphic crystal TaTe$_4$, the (2a$\times$2b$\times$3c) charge ordered ground state drives the system into a space group where the symmetry indicator features the emergence of Dirac ferm… ▽ More

    Submitted 25 March, 2024; v1 submitted 1 April, 2023; originally announced April 2023.

    Comments: 9 pages, 4 figures. The authorship of this paper has been amended to include new coauthors Dr. Gregory T. McCandless and Dr. Julia Y. Chan of Department of Chemistry and Biochemistry, Baylor University. Drs. McCandless and Chan were responsible for x-ray characterization of the sample used in this study. Erratum to be published on Phys. Rev. B

    Journal ref: Phys. Rev. B. 108, 155121 (2023)

  8. arXiv:2303.17056  [pdf, other

    cs.CV cs.LG cs.MM

    Audio-Visual Grou** Network for Sound Localization from Mixtures

    Authors: Shentong Mo, Yapeng Tian

    Abstract: Sound source localization is a typical and challenging task that predicts the location of sound sources in a video. Previous single-source methods mainly used the audio-visual association as clues to localize sounding objects in each image. Due to the mixed property of multiple sound sources in the original space, there exist rare multi-source approaches to localizing multiple sources simultaneous… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  9. arXiv:2303.12959  [pdf, other

    cs.LG cs.AI

    Variantional autoencoder with decremental information bottleneck for disentanglement

    Authors: Jiantao Wu, Shentong Mo, Xiang Yang, Muhammad Awais, Sara Atito, Xingshen Zhang, Lin Wang, Xiang Yang

    Abstract: One major challenge of disentanglement learning with variational autoencoders is the trade-off between disentanglement and reconstruction fidelity. Previous studies, which increase the information bottleneck during training, tend to lose the constraint of disentanglement, leading to the information diffusion problem. In this paper, we present a novel framework for disentangled representation learn… ▽ More

    Submitted 4 October, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  10. arXiv:2303.11622  [pdf

    cond-mat.supr-con

    Differentiated roles of Lifshitz transition on thermodynamics and superconductivity in La2-xSrxCuO4

    Authors: Yong Zhong, Zhuoyu Chen, Su-Di Chen, Ke-Jun Xu, Makoto Hashimoto, Yu He, Shin-ichi Uchida, Donghui Lu, Sung-Kwan Mo, Zhi-Xun Shen

    Abstract: The effect of Lifshitz transition on thermodynamics and superconductivity in hole-doped cuprates has been heavily debated but remains an open question. In particular, an observed peak of electronic specific heat is proposed to originate from fluctuations of a putative quantum critical point p* (e.g. the termination of pseudogap at zero temperature), which is close to, but distinguishable from the… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Journal ref: Proc. Natl. Acad. Sci. U.S.A. 119, e2204630119 (2022)

  11. arXiv:2303.04549  [pdf

    cond-mat.mtrl-sci

    Observation of plaid-like spin splitting in a noncoplanar antiferromagnet

    Authors: Yu-Peng Zhu, Xiaobing Chen, Xiang-Rui Liu, Yuntian Liu, Pengfei Liu, Heming Zha, Gexing Qu, Caiyun Hong, Jiayu Li, Zhicheng Jiang, Xiao-Ming Ma, Yu-Jie Hao, Ming-Yuan Zhu, Wen**g Liu, Meng Zeng, Sreehari Jayaram, Malik Lenger, Jianyang Ding, Shu Mo, Kiyohisa Tanaka, Masashi Arita, Zhengtai Liu, Mao Ye, Dawei Shen, Jörg Wrachtrup , et al. (5 additional authors not shown)

    Abstract: Spatial, momentum and energy separation of electronic spins in condensed matter systems guides the development of novel devices where spin-polarized current is generated and manipulated. Recent attention on a set of previously overlooked symmetry operations in magnetic materials leads to the emergence of a new type of spin splitting, enabling giant and momentum-dependent spin polarization of energ… ▽ More

    Submitted 4 January, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Version 3, 49 pages, 4 main figures, 13 extended data figures and 2 extended data tables. Nature in press (2024)

    Journal ref: Nature 626, 523-528 (2024)

  12. Thermal hysteretic behavior and negative magnetoresistance in an unusual charge-density-wave material EuTe4

    Authors: Q. Q. Zhang, Y. Shi, K. Y. Zhai, W. X. Zhao, X. Du, J. S. Zhou, X. Gu, R. Z. Xu, Y. D. Li, Y. F. Guo, Z. K. Liu, C. Chen, S. -K. Mo, T. K. Kim, C. Cacho, J. W. Yu, W. Li, Y. L. Chen, Jiun-Haw Chu, L. X. Yang

    Abstract: EuTe4 is a newly-discovered van der Waals material exhibiting a novel charge-density wave (CDW) with a large thermal hysteresis in the resistivity and CDW gap. In this work, we systematically study the electronic structure and transport properties of EuTe4 using high-resolution angle-resolved photoemission spectroscopy (ARPES), magnetoresistance measurements, and scanning tunneling microscopy (STM… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

  13. arXiv:2302.14483  [pdf, other

    cs.LG cs.CV stat.ML

    RoPAWS: Robust Semi-supervised Representation Learning from Uncurated Data

    Authors: Sangwoo Mo, Jong-Chyi Su, Chih-Yao Ma, Mido Assran, Ishan Misra, Licheng Yu, Sean Bell

    Abstract: Semi-supervised learning aims to train a model using limited labels. State-of-the-art semi-supervised methods for image classification such as PAWS rely on self-supervised representations learned with large-scale unlabeled but curated data. However, PAWS is often less effective when using real-world unlabeled data that is uncurated, e.g., contains out-of-class data. We propose RoPAWS, a robust ext… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: ICLR 2023

  14. arXiv:2302.10506  [pdf, other

    cs.LG

    Diffusion Probabilistic Models for Structured Node Classification

    Authors: Hyosoon Jang, Seonghyun Park, Sangwoo Mo, Sungsoo Ahn

    Abstract: This paper studies structured node classification on graphs, where the predictions should consider dependencies between the node labels. In particular, we focus on solving the problem for partially labeled graphs where it is essential to incorporate the information in the known label for predicting the unknown labels. To address this issue, we propose a novel framework leveraging the diffusion pro… ▽ More

    Submitted 18 June, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  15. arXiv:2301.11104  [pdf, other

    cs.LG cs.CV

    Discovering and Mitigating Visual Biases through Keyword Explanation

    Authors: Younghyun Kim, Sangwoo Mo, Minkyu Kim, Kyungmin Lee, Jaeho Lee, **woo Shin

    Abstract: Addressing biases in computer vision models is crucial for real-world AI deployments. However, mitigating visual biases is challenging due to their unexplainable nature, often identified indirectly through visualization or sample statistics, which necessitates additional human supervision for interpretation. To tackle this issue, we propose the Bias-to-Text (B2T) framework, which interprets visual… ▽ More

    Submitted 26 March, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: CVPR 2024. First two authors contributed equally

  16. arXiv:2301.07926  [pdf, ps, other

    math.AP

    Normalized Solutions to Kirchhoff Equation with Nonnegative Potential

    Authors: Shuai Mo, Shiwang Ma

    Abstract: This paper is concerned with the existence of solutions to the problem $$-\left(a+ b\int_{\mathbb{R}^{N}}|\nabla u|^{2} dx \right)Δu +V(x)u+λu = |u|^{p-2}u,\ \ x \in \mathbb{R}^{N},\ \ λ\in \mathbb{R}^{+} $$ where $a, b>0$ are constants, $ V \geq 0$ is a potential, $N \geq 1 $, and $ p \in (2+ \frac{4}{N},2^*$). We use a more subtle analysis to revisit the limited problem($V \equiv 0$), and ob… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  17. arXiv:2301.06667  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Imaging the breakdown and restoration of topological protection in magnetic topological insulator MnBi$_2$Te$_4$

    Authors: Qile Li, Iolanda Di Bernardo, Johnathon Maniatis, Daniel McEwen, Liam Watson, Benjamin Lowe, Thi-Hai-Yen Vu, Chi Xuan Trang, **woong Hwang, Sung-Kwan Mo, Michael S. Fuhrer, Mark T. Edmonds

    Abstract: Quantum anomalous Hall (QAH) insulators transport charge without resistance along topologically protected chiral one-dimensional edge states. Yet, in magnetic topological insulators (MTI) to date, topological protection is far from robust, with the zero-magnetic field QAH effect only realised at temperatures an order of magnitude below the Néel temperature TN, though small magnetic fields can stab… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

  18. arXiv:2212.06595  [pdf, other

    cs.CV cs.LG

    OAMixer: Object-aware Mixing Layer for Vision Transformers

    Authors: Hyunwoo Kang, Sangwoo Mo, **woo Shin

    Abstract: Patch-based models, e.g., Vision Transformers (ViTs) and Mixers, have shown impressive results on various visual recognition tasks, alternating classic convolutional networks. While the initial patch-based models (ViTs) treated all patches equally, recent studies reveal that incorporating inductive bias like spatiality benefits the representations. However, most prior works solely focused on the l… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: CVPR Transformers for Vision Workshop 2022. First two authors contributed equally

  19. arXiv:2212.02090  [pdf, other

    cs.CV cs.AI cs.LG

    Breaking the Spurious Causality of Conditional Generation via Fairness Intervention with Corrective Sampling

    Authors: Junhyun Nam, Sangwoo Mo, Jaeho Lee, **woo Shin

    Abstract: To capture the relationship between samples and labels, conditional generative models often inherit spurious correlations from the training dataset. This can result in label-conditional distributions that are imbalanced with respect to another latent attribute. To mitigate this issue, which we call spurious causality of conditional generation, we propose a general two-step strategy. (a) Fairness I… ▽ More

    Submitted 4 July, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: TMLR 2023

  20. Electronic Origin of Half-metal to Semiconductor Transition and Colossal Magnetoresistance in Spinel HgCr2Se4

    Authors: Aiji Liang, Zhilin Li, Shihao Zhang, Shucui Sun, Shuai Liu, Cheng Chen, Haifeng Yang, Shengtao Cui, Sung-Kwan Mo, Shuai Yang, Yongqing Li, Meixiao Wang, Lexian Yang, Jianpeng Liu, Zhongkai Liu, Yulin Chen

    Abstract: Half-metals are ferromagnets hosting spin-polarized conducting carriers and crucial for spintronics applications. The chromium spinel HgCr2Se4 represents a unique type of half-metal, which features a half-metal to semiconductor transition (HMST) and exhibits colossal magnetoresistance (CMR) across the ferromagnetic-paramagnetic (FM-PM) transition. Using angle-resolved photoemission spectroscopy (A… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  21. arXiv:2211.09074  [pdf, other

    cs.CV

    Where a Strong Backbone Meets Strong Features -- ActionFormer for Ego4D Moment Queries Challenge

    Authors: Fangzhou Mu, Sicheng Mo, Gillian Wang, Yin Li

    Abstract: This report describes our submission to the Ego4D Moment Queries Challenge 2022. Our submission builds on ActionFormer, the state-of-the-art backbone for temporal action localization, and a trio of strong video features from SlowFast, Omnivore and EgoVLP. Our solution is ranked 2nd on the public leaderboard with 21.76% average mAP on the test set, which is nearly three times higher than the offici… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 2nd place in ECCV 2022 Ego4D Moment Queries Challenge

  22. arXiv:2211.08704  [pdf, other

    cs.CV

    A Simple Transformer-Based Model for Ego4D Natural Language Queries Challenge

    Authors: Sicheng Mo, Fangzhou Mu, Yin Li

    Abstract: This report describes Badgers@UW-Madison, our submission to the Ego4D Natural Language Queries (NLQ) Challenge. Our solution inherits the point-based event representation from our prior work on temporal action localization, and develops a Transformer-based model for video grounding. Further, our solution integrates several strong video features including SlowFast, Omnivore and EgoVLP. Without bell… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 5 pages, 2 figures

  23. arXiv:2211.08114  [pdf

    cond-mat.str-el cond-mat.mtrl-sci

    Metal to Mott Insulator Transition in Two-dimensional 1T-TaSe$_2$

    Authors: Ning Tian, Zhe Huang, Bo Gyu Jang, Shuaifei Guo, Ya-Jun Yan, **g**g Gao, Yijun Yu, **woong Hwang, Meixiao Wang, Xuan Luo, Yu ** Sun, Zhongkai Liu, Dong-Lai Feng, Xianhui Chen, Sung-Kwan Mo, Minjae Kim, Young-Woo Son, Dawei Shen, Wei Ruan, Yuanbo Zhang

    Abstract: When electron-electron interaction dominates over other electronic energy scales, exotic, collective phenomena often emerge out of seemingly ordinary matter. The strongly correlated phenomena, such as quantum spin liquid and unconventional superconductivity, represent a major research frontier and a constant source of inspiration. Central to strongly correlated physics is the concept of Mott insul… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  24. arXiv:2210.10194  [pdf, other

    cs.CV cs.AI cs.LG

    Rethinking Prototypical Contrastive Learning through Alignment, Uniformity and Correlation

    Authors: Shentong Mo, Zhun Sun, Chao Li

    Abstract: Contrastive self-supervised learning (CSL) with a prototypical regularization has been introduced in learning meaningful representations for downstream tasks that require strong semantic information. However, to optimize CSL with a loss that performs the prototypical regularization aggressively, e.g., the ProtoNCE loss, might cause the "coagulation" of examples in the embedding space. That is, the… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: BMVC 2022

  25. arXiv:2210.02010  [pdf

    cond-mat.mtrl-sci

    A novel $\sqrt{19}\times\sqrt{19}$ superstructure in epitaxially grown 1T-TaTe$_2$

    Authors: **woong Hwang, Yeongrok **, Canxun Zhang, Tiancong Zhu, Kyoo Kim, Yong Zhong, Ji-Eun Lee, Zongqi Shen, Yi Chen, Wei Ruan, Hye** Ryu, Choongyu Hwang, Jaekwang Lee, Michael F. Crommie, Sung-Kwan Mo, Zhi-Xun Shen

    Abstract: The spontaneous formation of electronic orders is a crucial element for understanding complex quantum states and engineering heterostructures in two-dimensional materials. We report a novel $\sqrt{19}\times\sqrt{19}$ charge order in few-layer thick 1T-TaTe$_2$ transition metal dichalcogenide films grown by molecular beam epitaxy, which has not been realized. Our photoemission and scanning probe me… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Journal ref: Advanced materials 34, 2204579 (2022)

  26. arXiv:2210.00314  [pdf, other

    cs.CV cs.AI cs.LG

    Learning Hierarchical Image Segmentation For Recognition and By Recognition

    Authors: Tsung-Wei Ke, Sangwoo Mo, Stella X. Yu

    Abstract: Large vision and language models learned directly through image-text associations often lack detailed visual substantiation, whereas image segmentation tasks are treated separately from recognition, supervisedly learned without interconnections. Our key observation is that, while an image can be recognized in multiple ways, each has a consistent part-and-whole visual organization. Segmentation thu… ▽ More

    Submitted 2 May, 2024; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: ICLR 2024 (spotlight). First two authors contributed equally. Code available at https://github.com/twke18/CAST

    ACM Class: I.4.6; I.4.10; I.5.3

  27. arXiv:2209.09634  [pdf, other

    cs.SD cs.CV cs.LG cs.MM

    A Closer Look at Weakly-Supervised Audio-Visual Source Localization

    Authors: Shentong Mo, Pedro Morgado

    Abstract: Audio-visual source localization is a challenging task that aims to predict the location of visual sound sources in a video. Since collecting ground-truth annotations of sounding objects can be costly, a plethora of weakly-supervised localization methods that can learn from datasets with no bounding-box annotations have been proposed in recent years, by leveraging the natural co-occurrence of audi… ▽ More

    Submitted 30 August, 2022; originally announced September 2022.

  28. arXiv:2208.08819  [pdf, other

    cs.CV cs.AI cs.LG

    Siamese Prototypical Contrastive Learning

    Authors: Shentong Mo, Zhun Sun, Chao Li

    Abstract: Contrastive Self-supervised Learning (CSL) is a practical solution that learns meaningful visual representations from massive data in an unsupervised approach. The ordinary CSL embeds the features extracted from neural networks onto specific topological structures. During the training progress, the contrastive loss draws the different views of the same input together while pushing the embeddings f… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: BMVC 2021

  29. Persistent exchange splitting in a chiral helimagnet Cr1/3NbS2

    Authors: Na Qin, Cheng Chen, Shiqiao Du, Xian Du, Xin Zhang, Zhongxu Yin, **gsong Zhou, Runzhe Xu, Xu Gu, Qinqin Zhang, Wenxuan Zhao, Yidian Li, Sung-Kwan Mo, Zhongkai Liu, Shilei Zhang, Yanfeng Guo, P. Z. Tang, Yulin Chen, Lexian Yang

    Abstract: Using high-resolution angle-resolved photoemission spectroscopy (ARPES) and ab-initio calculation, we systematically investigate the electronic structure of the chiral helimagnet Cr1/3NbS2 and its temperature evolution. The comparison with NbS2 suggests that the electronic structure of Cr1/3NbS2 is strongly modified by the intercalation of Cr atoms. Our ab-initio calculation, consistent with exper… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

  30. arXiv:2205.14339  [pdf, other

    cond-mat.str-el

    Spectral Evidence for Unidirectional Charge Density Wave in Detwinned BaNi$_2$As$_2$

    Authors: Yucheng Guo, Mason Klemm, Ji Seop Oh, Yaofeng Xie, Bing-Hua Lei, Sergey Gorovikov, Tor Pedersen, Matteo Michiardi, Sergey Zhdanovich, Andrea Damascelli, Jonathan Denlinger, Makoto Hashimoto, Donghui Lu, Sung-Kwan Mo, Rob G. Moore, Robert J. Birgeneau, David J. Singh, Pengcheng Dai, Ming Yi

    Abstract: The emergence of unconventional superconductivity in proximity to intertwined electronic orders is especially relevant in the case of iron-based superconductors. Such order consists of an electronic nematic order and a spin density wave in these systems. BaNi$_2$As$_2$, like its well-known iron-based analog BaFe$_2$As$_2$, also hosts a symmetry-breaking structural transition that is coupled to a u… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

    Comments: 6 pages, 4 figures

  31. arXiv:2205.14338  [pdf, other

    cs.CV cs.LG

    Object-wise Masked Autoencoders for Fast Pre-training

    Authors: Jiantao Wu, Shentong Mo

    Abstract: Self-supervised pre-training for images without labels has recently achieved promising performance in image classification. The success of transformer-based methods, ViT and MAE, draws the community's attention to the design of backbone architecture and self-supervised task. In this work, we show that current masked image encoding models learn the underlying relationship between all objects in the… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

  32. arXiv:2205.01679  [pdf, other

    eess.IV cs.CV

    Physics to the Rescue: Deep Non-line-of-sight Reconstruction for High-speed Imaging

    Authors: Fangzhou Mu, Sicheng Mo, Jiayong Peng, Xiaochun Liu, Ji Hyun Nam, Siddeshwar Raghavan, Andreas Velten, Yin Li

    Abstract: Computational approach to imaging around the corner, or non-line-of-sight (NLOS) imaging, is becoming a reality thanks to major advances in imaging hardware and reconstruction algorithms. A recent development towards practical NLOS imaging, Nam et al. demonstrated a high-speed non-confocal imaging system that operates at 5Hz, 100x faster than the prior art. This enormous gain in acquisition rate,… ▽ More

    Submitted 5 August, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: ICCP 2022 (TPAMI Special Issue on Computational Photography). Project page: https://pages.cs.wisc.edu/~fmu/nlos3d/

  33. arXiv:2204.11263  [pdf

    cond-mat.mtrl-sci

    Observation of Dimension-Crossover of a Tunable 1D Dirac Fermion in Topological Semimetal NbSi$_x$Te$_2$

    Authors: **g Zhang, Yangyang Lv, Xiaolong Feng, Aiji Liang, Wei Xia, Sung-Kwan Mo, Cheng Chen, Jiamin Xue, Shengyuan A. Yang, Lexian Yang, Yanfeng Guo, Yanbin Chen, Yulin Chen, Zhongkai Liu

    Abstract: Condensed matter systems in low dimensions exhibit emergent physics that does not exist in three dimensions. When electrons are confined to one dimension (1D), some significant electronic states appear, such as charge density wave, spin-charge separations and Su-Schrieffer-Heeger (SSH) topological state. However, a clear understanding of how the 1D electronic properties connects with topology is c… ▽ More

    Submitted 24 April, 2022; originally announced April 2022.

    Comments: 24 pages, 4 figures, to be published in npj Quantum Materials

  34. arXiv:2204.11259  [pdf

    cond-mat.mtrl-sci

    Direct Visualization and Manipulation of Tunable Quantum Well State in Semiconducting Nb2SiTe4

    Authors: **g Zhang, Zhilong Yang, Shuai Liu, Wei Xia, Tongshuai Zhu, Cheng Chen, Chengwei Wang, Meixiao Wang, Sung-Kwan Mo, Lexian Yang, Xufeng Kou, Yanfeng Guo, Haijun Zhang, Zhongkai Liu, Yulin Chen

    Abstract: Quantum well states (QWSs) can form at the surface or interfaces of materials with confinement potential. They have broad applications in electronic and optical devices such as high mobility electron transistor, photodetector and quantum well laser. The properties of the QWSs are usually the key factors for the performance of the devices. However, direct visualization and manipulation of such stat… ▽ More

    Submitted 24 April, 2022; originally announced April 2022.

    Comments: 28 pages, 5 figures,

    Journal ref: ACS Nano 2021 15 (10), 15850-15857

  35. arXiv:2204.11204  [pdf, ps, other

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Nematic fluctuations in the non-superconducting iron pnictide BaFe$_{1.9-x}$Ni$_{0.1}$Cr$_{x}$As$_{2}$

    Authors: Dongliang Gong, Ming Yi, Meng Wang, Tao Xie, Wenliang Zhang, Sergey Danilkin, Guochu Deng, Xinzhi Liu, Jitae T. Park, Kazuhiko Ikeuchi, Kazuya Kamazawa, Sung-Kwan Mo, Makoto Hashimoto, Donghui Lu, Rui Zhang, Pengcheng Dai, Robert J. Birgeneau, Shiliang Li, Huiqian Luo

    Abstract: The main driven force of the electronic nematic phase in iron-based superconductors is still under debate. Here, we report a comprehensive study on the nematic fluctuations in a non-superconducting iron pnictide system BaFe$_{1.9-x}$Ni$_{0.1}$Cr$_{x}$As$_{2}$ by electronic transport, angle-resolved photoemission spectroscopy (ARPES) and inelastic neutron scattering (INS) measurements. Previous neu… ▽ More

    Submitted 24 April, 2022; originally announced April 2022.

    Comments: 12 pages, 8 figures. Frontiers in Physics: Topic "Nematicity in Iron-Based Superconductors"

    Journal ref: Front. Phys. 10, 886459 (2022)

  36. arXiv:2204.05627  [pdf, other

    cs.AI

    Proximal Policy Optimization Learning based Control of Congested Freeway Traffic

    Authors: Shurong Mo, Nailong Wu, Jie Qi, Anqi Pan, Zhiguang Feng, Huaicheng Yan, Yueying Wang

    Abstract: This study proposes a delay-compensated feedback controller based on proximal policy optimization (PPO) reinforcement learning to stabilize traffic flow in the congested regime by manipulating the time-gap of adaptive cruise control-equipped (ACC-equipped) vehicles.The traffic dynamics on a freeway segment are governed by an Aw-Rascle-Zhang (ARZ) model, consisting of $2\times 2$ nonlinear first-or… ▽ More

    Submitted 14 January, 2023; v1 submitted 12 April, 2022; originally announced April 2022.

  37. arXiv:2204.00298  [pdf, other

    cs.CV

    Unitail: Detecting, Reading, and Matching in Retail Scene

    Authors: Fangyi Chen, Han Zhang, Zaiwang Li, Jiachen Dou, Shentong Mo, Hao Chen, Yongxin Zhang, Uzair Ahmed, Chenchen Zhu, Marios Savvides

    Abstract: To make full use of computer vision technology in stores, it is required to consider the actual needs that fit the characteristics of the retail scene. Pursuing this goal, we introduce the United Retail Datasets (Unitail), a large-scale benchmark of basic visual tasks on products that challenges algorithms for detecting, reading, and matching. With 1.8M quadrilateral-shaped instances annotated, th… ▽ More

    Submitted 20 July, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: ECCV 2022

  38. arXiv:2203.16769  [pdf

    cond-mat.mtrl-sci

    Large-gap insulating dimer ground state in monolayer IrTe2

    Authors: **woong Hwang, Kyoo Kim, Canxun Zhang, Tiancong Zhu, Charlotte Herbig, Sooran Kim, Bongjae Kim, Yong Zhong, Mohamed Salah, Mohamed M. El-Desoky, Choongyu Hwang, Zhi-Xun Shen, Michael F. Crommie, Sung-Kwan Mo

    Abstract: Monolayers of two-dimensional van der Waals materials exhibit novel electronic phases distinct from their bulk due to the symmetry breaking and reduced screening in the absence of the interlayer coupling. In this work, we combine angle-resolved photoemission spectroscopy and scanning tunneling microscopy/spectroscopy to demonstrate the emergence of a unique insulating 2 x 1 dimer ground state in m… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Journal ref: Nature communications 13, 906 (2022)

  39. arXiv:2203.10584  [pdf, other

    cs.CV

    Point3D: tracking actions as moving points with 3D CNNs

    Authors: Shentong Mo, **gfei Xia, Xiaoqing Tan, Bhiksha Raj

    Abstract: Spatio-temporal action recognition has been a challenging task that involves detecting where and when actions occur. Current state-of-the-art action detectors are mostly anchor-based, requiring sensitive anchor designs and huge computations due to calculating large numbers of anchor boxes. Motivated by nascent anchor-free approaches, we propose Point3D, a flexible and computationally efficient net… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted by the 32nd British Machine Vision Conference (BMVC 2021)

  40. arXiv:2203.09705  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el

    Tailoring Dirac fermions by in-situ tunable high-order moire pattern in graphene-monolayer xenon heterostructure

    Authors: Chunlong Wu, Qiang Wan, Cao Peng, Shangkun Mo, Renzhe Li, Keming Zhao, Yan** Guo, Shengjun Yuan, Fengcheng Wu, Chendong Zhang, Nan Xu

    Abstract: A variety of novel quantum phases have been achieved in twist bilayer graphene (tBLG) and other moire superlattices recently, including correlated insulators, superconductivity, magnetism, and topological states. These phenomena are very sensitive to the moire superlattices, which can hardly be changed rapidly or intensely. Here, we report the experimental realization of a high-order moire pattern… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 17 pages, 4 figures, supplementary materials available from the authors, submitted Feb. 2022

    Journal ref: Phys. Rev. Lett. 129, 176402 (2022)

  41. arXiv:2203.09324  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    Localizing Visual Sounds the Easy Way

    Authors: Shentong Mo, Pedro Morgado

    Abstract: Unsupervised audio-visual source localization aims at localizing visible sound sources in a video without relying on ground-truth localization for training. Previous works often seek high audio-visual similarities for likely positive (sounding) regions and low similarities for likely negative regions. However, accurately distinguishing between sounding and non-sounding regions is challenging witho… ▽ More

    Submitted 29 March, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

  42. arXiv:2203.03838  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Multi-Scale Self-Contrastive Learning with Hard Negative Mining for Weakly-Supervised Query-based Video Grounding

    Authors: Shentong Mo, Daizong Liu, Wei Hu

    Abstract: Query-based video grounding is an important yet challenging task in video understanding, which aims to localize the target segment in an untrimmed video according to a sentence query. Most previous works achieve significant progress by addressing this task in a fully-supervised manner with segment-level labels, which require high labeling cost. Although some recent efforts develop weakly-supervise… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  43. arXiv:2203.01311  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.MM

    High-Modality Multimodal Transformer: Quantifying Modality & Interaction Heterogeneity for High-Modality Representation Learning

    Authors: Paul Pu Liang, Yiwei Lyu, Xiang Fan, Jeffrey Tsaw, Yudong Liu, Shentong Mo, Dani Yogatama, Louis-Philippe Morency, Ruslan Salakhutdinov

    Abstract: Many real-world problems are inherently multimodal, from spoken language, gestures, and paralinguistics humans use to communicate, to force, proprioception, and visual sensors on robots. While there has been an explosion of interest in multimodal learning, these methods are focused on a small set of modalities primarily in language, vision, and audio. In order to accelerate generalization towards… ▽ More

    Submitted 28 June, 2023; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: TMLR 2023, Code available at https://github.com/pliang279/HighMMT

  44. arXiv:2202.10571  [pdf, other

    cs.CV cs.LG

    Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks

    Authors: Sihyun Yu, Jihoon Tack, Sangwoo Mo, Hyunsu Kim, Junho Kim, Jung-Woo Ha, **woo Shin

    Abstract: In the deep learning era, long video generation of high-quality still remains challenging due to the spatio-temporal complexity and continuity of videos. Existing prior works have attempted to model video distribution by representing videos as 3D grids of RGB values, which impedes the scale of generated videos and neglects continuous dynamics. In this paper, we found that the recent emerging parad… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: ICLR 2022. Project page with videos and code: https://sihyun-yu.github.io/digan/

  45. arXiv:2202.07224  [pdf

    cond-mat.str-el cond-mat.mtrl-sci

    Evidence for a spinon Kondo effect in cobalt atoms on single-layer 1T-TaSe$_2$

    Authors: Yi Chen, Wen-Yu He, Wei Ruan, **woong Hwang, Shujie Tang, Ryan L. Lee, Meng Wu, Tiancong Zhu, Canxun Zhang, Hye** Ryu, Feng Wang, Steven G. Louie, Zhi-Xun Shen, Sung-Kwan Mo, Patrick A. Lee, Michael F. Crommie

    Abstract: Quantum spin liquids (QSLs) are highly entangled, disordered magnetic states that arise in frustrated Mott insulators and host exotic fractional excitations such as spinons and chargons. Despite being charge insulators some QSLs are predicted to exhibit gapless itinerant spinons that yield metallic behavior in the spin channel. We have deposited isolated magnetic atoms onto single-layer (SL) 1T-Ta… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Journal ref: Nature Physics 18, 1335 (2022)

  46. Nonsymmorphic Symmetry-Protected Band Crossings in a Square-Net Metal PtPb$_4$

    Authors: Han Wu, Alannah M. Hallas, Xiaochan Cai, Jianwei Huang, Ji Seop Oh, Vaideesh Loganathan, Ashley Weiland, Gregory T. McCandless, Julia Y. Chan, Sung-Kwan Mo, Donghui Lu, Makoto Hashimoto, Jonathan Denlinger, Robert J. Birgeneau, Andriy H. Nevidomskyy, Gang Li, Emilia Morosan, Ming Yi

    Abstract: Topological semimetals with symmetry-protected band crossings have emerged as a rich landscape to explore intriguing electronic phenomena. Nonsymmorphic symmetries in particular have been shown to play an important role in protecting the crossings along a line (rather than a point) in momentum space. Here we report experimental and theoretical evidence for Dirac nodal line crossings along the Bril… ▽ More

    Submitted 25 March, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 21 pages, 4 figures, accepted for publication in npj Quantum Mater

    Journal ref: npj Quantum Mater. 7, 31 (2022)

  47. arXiv:2202.03026  [pdf, other

    cs.CV

    Context Autoencoder for Self-Supervised Representation Learning

    Authors: Xiaokang Chen, Mingyu Ding, Xiaodi Wang, Ying Xin, Shentong Mo, Yunhao Wang, Shumin Han, ** Luo, Gang Zeng, **gdong Wang

    Abstract: We present a novel masked image modeling (MIM) approach, context autoencoder (CAE), for self-supervised representation pretraining. We pretrain an encoder by making predictions in the encoded representation space. The pretraining tasks include two tasks: masked representation prediction - predict the representations for the masked patches, and masked patch reconstruction - reconstruct the masked p… ▽ More

    Submitted 10 August, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Accepted by International Journal of Computer Vision (IJCV)

  48. arXiv:2201.11592  [pdf

    cond-mat.mtrl-sci

    Evidences for the exciton gas phase and its condensation in monolayer 1T-ZrTe2

    Authors: Yekai Song, Chun**g Jia, Hongyu Xiong, Binbin Wang, Zhicheng Jiang, Kui Huang, **woong Hwang, Zhuojun Li, Choongyu Hwang, Zhongkai Liu, Dawei Shen, Jonathan Sobota, Patrick Kirchmann, Jiamin Xue, Thomas P. Devereaux, Sung-Kwan Mo, Zhi-Xun Shen, Shujie Tang

    Abstract: The excitonic insulator (EI) is a Bose-Einstein condensation (BEC) of excitons bound by electron-hole interaction in a solid, which could support high-temperature BEC transition. The material realization of EI has been elusive, which is further challenged by the difficulty of distinguishing it from a conventional charge density wave (CDW) state. In the BEC limit, the pre-condensation exciton gas p… ▽ More

    Submitted 30 March, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: 22 pages, 4 figures

  49. arXiv:2201.02667  [pdf, other

    cond-mat.str-el cond-mat.supr-con

    Correlation-Driven Electronic Reconstruction in FeTe$_{1-x}$Se$_x$

    Authors: Jianwei Huang, Rong Yu, Zhijun Xu, Jian-Xin Zhu, Ji Seop Oh, Qianni Jiang, Meng Wang, Han Wu, Tong Chen, Jonathan D. Denlinger, Sung-Kwan Mo, Makoto Hashimoto, Matteo Michiardi, Tor M. Pedersen, Sergey Gorovikov, Sergey Zhdanovich, Andrea Damascelli, Genda Gu, Pengcheng Dai, Jiun-Haw Chu, Donghui Lu, Qimiao Si, Robert J. Birgeneau, Ming Yi

    Abstract: Electronic correlation is of fundamental importance to high temperature superconductivity. While the low energy electronic states in cuprates are dominantly affected by correlation effects across the phase diagram, observation of correlation-driven changes in fermiology amongst the iron-based superconductors remains rare. Here we present experimental evidence for a correlation-driven reconstructio… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: 25 pages, 5 figures, accepted version to appear in Communications Physics. arXiv admin note: text overlap with arXiv:2010.13913

    Journal ref: Commun Phys 5, 29 (2022)

  50. arXiv:2111.04146  [pdf, other

    eess.SY cs.LG cs.RO

    Optimization of the Model Predictive Control Meta-Parameters Through Reinforcement Learning

    Authors: Eivind Bøhn, Sebastien Gros, Signe Moe, Tor Arne Johansen

    Abstract: Model predictive control (MPC) is increasingly being considered for control of fast systems and embedded applications. However, the MPC has some significant challenges for such systems. Its high computational complexity results in high power consumption from the control algorithm, which could account for a significant share of the energy resources in battery-powered embedded systems. The MPC param… ▽ More

    Submitted 7 November, 2021; originally announced November 2021.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible