Showing 101–150 of 252 results for author: Yao, T

  1. arXiv:2203.05922  [pdf, other]

    cs.CV

    Visualizing and Understanding Patch Interactions in Vision Transformer

    Authors: Jie Ma, Yalong Bai, Bineng Zhong, Wei Zhang, Ting Yao, Tao Mei

    Abstract: Vision Transformer (ViT) has become a leading tool in various computer vision tasks, owing to its unique self-attention mechanism that learns visual representations explicitly through cross-patch information interactions. Despite having good success, the literature seldom explores the explainability of vision transformer, and there is no clear picture of how the attention mechanism with respect to…

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 15 pages, 14 figures

  2. arXiv:2202.00087  [pdf, other]

    eess.IV cs.CV q-bio.QM

    Holistic Fine-grained GGS Characterization: From Detection to Unbalanced Classification

    Authors: Yuzhe Lu, Haichun Yang, Zuhayr Asad, Zheyu Zhu, Tianyuan Yao, Jiachen Xu, Agnes B. Fogo, Yuankai Huo

    Abstract: Recent studies have demonstrated the diagnostic and prognostic values of global glomerulosclerosis (GGS) in IgA nephropathy, aging, and end-stage renal disease. However, the fine-grained quantitative analysis of multiple GGS subtypes (e.g., obsolescent, solidified, and disappearing glomerulosclerosis) is typically a resource extensive manual process. Very few automatic methods, if any, have been d…

    Submitted 31 January, 2022; originally announced February 2022.

  3. Probabilistic Learning of Treatment Trees in Cancer

    Authors: Tsung-Hung Yao, Zhenke Wu, Karthik Bharath, Jinju Li, Veerabhadran Baladandayuthapani

    Abstract: Accurate identification of synergistic treatment combinations and their underlying biological mechanisms is critical across many disease domains, especially cancer. In translational oncology research, preclinical systems such as patient-derived xenografts (PDX) have emerged as a unique study design evaluating multiple treatments administered to samples from the same human tumor implanted into gene…

    Submitted 23 January, 2022; originally announced January 2022.

  4. arXiv:2201.08632  [pdf, ps, other]

    math.CO

    Cross $t$-intersecting families for symplectic polar spaces

    Authors: Tian Yao, Kaishun Wang

    Abstract: Let $\mathscr{P}$ be a symplectic polar space over a finite field $\mathbb{F}_q$, and $\mathscr{P}_m$ denote the collection of all $m$-dimensional totally isotropic subspaces in $\mathscr{P}$. Let $\mathscr{F}_1\subset\mathscr{P}_{m_1}$ and $\mathscr{F}_2\subset\mathscr{P}_{m_2}$ satisfy $\dim(F_1\cap F_2)\ge t$ for any $F_1\in\mathscr{F}_1$ and $F_2\in\mathscr{F}_2$. We say they are cross $t$-inte…

    Submitted 24 February, 2022; v1 submitted 21 January, 2022; originally announced January 2022.

    Comments: Some typos corrected, a reference added, some details of proof added. arXiv admin note: substantial text overlap with arXiv:2201.08084

    MSC Class: 05D05; 05A30; 51A50

  5. arXiv:2201.08084  [pdf, ps, other]

    math.CO

    Cross $t$-intersecting families for finite affine spaces

    Authors: Tian Yao, Kaishun Wang

    Abstract: Denote the collection of all $k$-flats in $AG(n,\mathbb{F}_q)$ by $\mathscr{M}(k,n)$. Let $\mathscr{F}_1\subset\mathscr{M}(k_1,n)$ and $\mathscr{F}_2\subset\mathscr{M}(k_2,n)$ satisfy $\dim(F_1\cap F_2)\ge t$ for any $F_1\in\mathscr{F}_1$ and $F_2\in\mathscr{F}_2$. We say they are cross $t$-intersecting families. Moreover, we say they are trivial if each member of them contains a fixed $t$-flats i…

    Submitted 15 February, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Some typos are corrected

    MSC Class: 05D05

  6. arXiv:2201.04029  [pdf, other]

    cs.CV

    Motion-Focused Contrastive Learning of Video Representations

    Authors: Rui Li, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu, Tao Mei

    Abstract: Motion, as the most distinct phenomenon in a video to involve the changes over time, has been unique and critical to the development of video representation learning. In this paper, we ask the question: how important is the motion particularly for self-supervised video representation learning. To this end, we compose a duet of exploiting the motion for data augmentation and feature learning in the…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: ICCV 2021 (Oral); Code is publicly available at: https://github.com/YihengZhang-CV/MCL-Motion-Focused-Contrastive-Learning

  7. arXiv:2201.04027  [pdf, other]

    cs.CV

    Representing Videos as Discriminative Sub-graphs for Action Recognition

    Authors: Dong Li, Zhaofan Qiu, Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei

    Abstract: Human actions are typically of combinatorial structures or patterns, i.e., subjects, objects, plus spatio-temporal interactions in between. Discovering such structures is therefore a rewarding way to reason about the dynamics of interactions and recognize the actions. In this paper, we introduce a new design of sub-graphs to represent and encode the discriminative patterns of each action in the vi…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: CVPR 2021

  8. arXiv:2201.04026  [pdf, other]

    cs.CV cs.CL cs.MM

    Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training

    Authors: Yehao Li, Jiahao Fan, Yingwei Pan, Ting Yao, Weiyao Lin, Tao Mei

    Abstract: Vision-language pre-training has been an emerging and fast-developing research topic, which transfers multi-modal knowledge from rich-resource pre-training task to limited-resource downstream tasks. Unlike existing works that predominantly learn a single generic encoder, we present a pre-trainable Universal Encoder-DEcoder Network (Uni-EDEN) to facilitate both vision-language perception (e.g., vis…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)

  9. arXiv:2201.04024  [pdf, other]

    cs.CV cs.MM

    Smart Director: An Event-Driven Directing System for Live Broadcasting

    Authors: Yingwei Pan, Yue Chen, Qian Bao, Ning Zhang, Ting Yao, Jingen Liu, Tao Mei

    Abstract: Live video broadcasting normally requires a multitude of skills and expertise with domain knowledge to enable multi-camera productions. As the number of cameras keep increasing, directing a live sports broadcast has now become more complicated and challenging than ever before. The broadcast directors need to be much more concentrated, responsive, and knowledgeable, during the production. To reliev…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)

  10. arXiv:2201.04023  [pdf, other]

    cs.CV

    Boosting Video Representation Learning with Multi-Faceted Integration

    Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Xiao-Ping Zhang, Dong Wu, Tao Mei

    Abstract: Video content is multifaceted, consisting of objects, scenes, interactions or actions. The existing datasets mostly label only one of the facets for model training, resulting in the video representation that biases to only one facet depending on the training dataset. There is no study yet on how to learn a video representation from multifaceted labels, and whether multifaceted information is helpf…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: CVPR 2021

  11. arXiv:2201.04022  [pdf, other]

    cs.CV

    Condensing a Sequence to One Informative Frame for Video Recognition

    Authors: Zhaofan Qiu, Ting Yao, Yan Shu, Chong-Wah Ngo, Tao Mei

    Abstract: Video is complex due to large variations in motion and rich content in fine-grained visual details. Abstracting useful information from such information-intensive media requires exhaustive computing resources. This paper studies a two-step alternative that first condenses the video sequence to an informative "frame" and then exploits off-the-shelf image recognition system on the synthetic frame. A…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: ICCV 2021

  12. arXiv:2201.04021  [pdf, other]

    cs.CV

    Optimization Planning for 3D ConvNets

    Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei

    Abstract: It is not trivial to optimally learn a 3D Convolutional Neural Networks (3D ConvNets) due to high complexity and various options of the training scheme. The most common hand-tuning process starts from learning 3D ConvNets using short video clips and then is followed by learning long-term temporal dependency using lengthy clips, while gradually decaying the learning rate from high to low as trainin…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: ICML 2021; Code is publicly available at: https://github.com/ZhaofanQiu/Optimization-Planning-for-3D-ConvNets

  13. arXiv:2112.13977  [pdf, other]

    cs.CV

    Exploiting Fine-grained Face Forgery Clues via Progressive Enhancement Learning

    Authors: Qiqi Gu, Shen Chen, Taiping Yao, Yang Chen, Shouhong Ding, Ran Yi

    Abstract: With the rapid development of facial forgery techniques, forgery detection has attracted more and more attention due to security concerns. Existing approaches attempt to use frequency information to mine subtle artifacts under high-quality forged faces. However, the exploitation of frequency information is coarse-grained, and more importantly, their vanilla learning process struggles to extract fi…

    Submitted 27 December, 2021; originally announced December 2021.

  14. arXiv:2112.13548  [pdf, other]

    cs.CV

    Responsive Listening Head Generation: A Benchmark Dataset and Baseline

    Authors: Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei

    Abstract: We present a new listening head generation benchmark, for synthesizing responsive feedbacks of a listener (e.g., nod, smile) during a face-to-face conversation. As the indispensable complement to talking heads generation, listening head generation has seldomly been studied in literature. Automatically synthesizing listening behavior that actively responds to a talking head, is critical to applicat…

    Submitted 20 July, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

    Comments: Accepted by ECCV 2022

  15. arXiv:2112.13522  [pdf, other]

    cs.CV

    Dual Contrastive Learning for General Face Forgery Detection

    Authors: Ke Sun, Taiping Yao, Shen Chen, Shouhong Ding, Jilin Li, Rongrong Ji

    Abstract: With various facial manipulation techniques arising, face forgery detection has drawn growing attention due to security concerns. Previous works always formulate face forgery detection as a classification problem based on cross-entropy loss, which emphasizes category-level differences rather than the essential discrepancies between real and fake faces, limiting model generalization in unseen domai…

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: This paper was accepted by AAAI 2022 Conference on Artificial Intelligence

  16. arXiv:2112.10474  [pdf, other]

    cs.CV

    Reciprocal Normalization for Domain Adaptation

    Authors: Zhiyong Huang, Kekai Sheng, Ke Li, Jian Liang, Taiping Yao, Weiming Dong, Dengwen Zhou, Xing Sun

    Abstract: Batch normalization (BN) is widely used in modern deep neural networks, which has been shown to represent the domain-related knowledge, and thus is ineffective for cross-domain tasks like unsupervised domain adaptation (UDA). Existing BN variant methods aggregate source and target domain knowledge in the same channel in normalization module. However, the misalignment between the features of corres…

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: The best feature normalization module for domain adaptation

  17. arXiv:2112.08132  [pdf, other]

    cs.LG cs.AI cs.CV

    Improving Self-supervised Learning with Automated Unsupervised Outlier Arbitration

    Authors: Yu Wang, Jingyang Lin, Jingjing Zou, Yingwei Pan, Ting Yao, Tao Mei

    Abstract: Our work reveals a structured shortcoming of the existing mainstream self-supervised learning methods. Whereas self-supervised learning frameworks usually take the prevailing perfect instance level invariance hypothesis for granted, we carefully investigate the pitfalls behind. Particularly, we argue that the existing augmentation pipeline for generating multiple positive views naturally introduce…

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: NeurIPS 2021; Code is publicly available at: https://github.com/ssl-codelab/uota

  18. arXiv:2112.07517  [pdf, other]

    cs.CV cs.AI cs.LG

    A Style and Semantic Memory Mechanism for Domain Generalization

    Authors: Yang Chen, Yu Wang, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei

    Abstract: Mainstream state-of-the-art domain generalization algorithms tend to prioritize the assumption on semantic invariance across domains. Meanwhile, the inherent intra-domain style invariance is usually underappreciated and put on the shelf. In this paper, we reveal that leveraging intra-domain style invariance is also of pivotal importance in improving the efficiency of domain generalization. We veri…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: ICCV 2021

  19. arXiv:2112.07516  [pdf, other]

    cs.CV cs.AI cs.MM

    Transferrable Contrastive Learning for Visual Domain Adaptation

    Authors: Yang Chen, Yingwei Pan, Yu Wang, Ting Yao, Xinmei Tian, Tao Mei

    Abstract: Self-supervised learning (SSL) has recently become the favorite among feature learning methodologies. It is therefore appealing for domain adaptation approaches to consider incorporating SSL. The intuition is to enforce instance-level feature consistency such that the predictor becomes somehow invariant across domains. However, most existing SSL methods in the regime of domain adaptation usually a…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: ACM Multimedia 2021

  20. arXiv:2112.07515  [pdf, other]

    cs.CV cs.AI cs.CL cs.MM

    CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising

    Authors: Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei

    Abstract: BERT-type structure has led to the revolution of vision-language pre-training and the achievement of state-of-the-art results on numerous vision-language downstream tasks. Existing solutions dominantly capitalize on the multi-modal inputs with mask tokens to trigger mask-based proxy pre-training tasks (e.g., masked language modeling and masked object/frame prediction). In this work, we argue that…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: ACM Multimedia 2021

  21. arXiv:2112.07513  [pdf, other]

    cs.CV cs.AI cs.MM

    CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

    Authors: Jingyang Lin, Yingwei Pan, Rongfeng Lai, Xuehang Yang, Hongyang Chao, Ting Yao

    Abstract: Localizing text instances in natural scenes is regarded as a fundamental challenge in computer vision. Nevertheless, owing to the extremely varied aspect ratios and scales of text instances in real scenes, most conventional text detectors suffer from the sub-text problem that only localizes the fragments of text instance (i.e., sub-texts). In this work, we quantitatively analyze the sub-text probl…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: ICME 2021 (Oral); Code is publicly available at: https://github.com/jylins/CORE-Text

  22. arXiv:2110.15749  [pdf, other]

    math.NA

    A Riemannian Inexact Newton Dogleg Method for Constructing a Symmetric Nonnegative Matrix with Prescribed Spectrum

    Authors: Zhi Zhao, Teng-Teng Yao, Zheng-Jian Bai, Xiao-Qing Jin

    Abstract: This paper is concerned with the inverse problem of constructing a symmetric nonnegative matrix from realizable spectrum. We reformulate the inverse problem as an underdetermined nonlinear matrix equation over a Riemannian product manifold. To solve it, we develop a Riemannian underdetermined inexact Newton dogleg method for solving a general underdetermined nonlinear equation defined between Riem…

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: 32 pages, 6 figures

    MSC Class: 15A18; 65F08; 65F18; 65F15

  23. arXiv:2110.12067  [pdf, other]

    stat.ML cs.LG

    Fast and Accurate Graph Learning for Huge Data via Minipatch Ensembles

    Authors: Tianyi Yao, Minjie Wang, Genevera I. Allen

    Abstract: Gaussian graphical models provide a powerful framework for uncovering conditional dependence relationships between sets of nodes; they have found applications in a wide variety of fields including sensor and communication networks, physics, finance, and computational biology. Often, one observes data on the nodes and the task is to learn the graph structure, or perform graphical model selection. W…

    Submitted 2 January, 2023; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  24. arXiv:2110.02536  [pdf, other]

    gr-qc astro-ph.CO hep-ph

    Exploration of interacting dynamical dark energy model with interaction term including the equation-of-state parameter: alleviation of the $H_{0}$ tension

    Authors: Rui-Yun Guo, Lu Feng, Tian-Ying Yao, Xing-Yu Chen

    Abstract: We explore a scenario of interacting dynamical dark energy model with the interaction term $Q$ including the varying equation-of-state parameter $w$. Using the data combination of the cosmic microwave background, the baryon acoustic oscillation, and the type Ia supernovae, to global fit the interacting dynamical dark energy model, we find that adding a factor of the varying $w$ in the function of…

    Submitted 20 December, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: 14 pages, 4 figures

    Journal ref: JCAP12(2021)036

  25. arXiv:2109.14584  [pdf, other]

    cond-mat.soft

    Topological defect-propelled swimming of nematic colloids

    Authors: Tianyi Yao, Žiga Kos, Yimin Luo, Edward B. Steager, Miha Ravnik, Kathleen J. Stebe

    Abstract: Non-equilibrium dynamics of topological defects can be used as a fundamental propulsion mechanism in microscopic active matter. Here, we demonstrate swimming of topological defect-propelled colloidal particles in (passive) nematic fluids through experiments and numerical simulations. Dynamic swim strokes of the topological defects are driven by colloidal rotation in an external magnetic field, cau…

    Submitted 29 September, 2021; originally announced September 2021.

  26. arXiv:2109.12020  [pdf, other]

    cs.LG cs.MA eess.SY

    Distributed Estimation of Sparse Inverse Covariances

    Authors: Tong Yao, Shreyas Sundaram

    Abstract: Learning the relationships between various entities from time-series data is essential in many applications. Gaussian graphical models have been studied to infer these relationships. However, existing algorithms process data in a batch at a central location, limiting their applications in scenarios where data is gathered by different agents. In this paper, we propose a distributed sparse inverse c…

    Submitted 30 September, 2021; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: 8 pages, 3 figures, 60th IEEE Conference on Decision and Control (CDC)

  27. arXiv:2109.01860  [pdf, other]

    cs.CV

    Spatiotemporal Inconsistency Learning for DeepFake Video Detection

    Authors: Zhihao Gu, Yang Chen, Taiping Yao, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma

    Abstract: The rapid development of facial manipulation techniques has aroused public concerns in recent years. Following the success of deep learning, existing methods always formulate DeepFake video detection as a binary classification problem and develop frame-based and video-based solutions. However, little attention has been paid to capturing the spatial-temporal inconsistency in forged videos. To addre…

    Submitted 11 October, 2021; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: To appear in ACM MM 2021

  28. arXiv:2108.08217  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG cs.MM

    X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics

    Authors: Yehao Li, Yingwei Pan, Jingwen Chen, Ting Yao, Tao Mei

    Abstract: With the rise and development of deep learning over the past decade, there has been a steady momentum of innovation and breakthroughs that convincingly push the state-of-the-art of cross-modal analytics between vision and language in multimedia field. Nevertheless, there has not been an open-source codebase in support of training and deploying numerous neural network models for cross-modal analyti…

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: Accepted by 2021 ACMMM Open Source Software Competition. Source code: https://github.com/YehLi/xmodaler

  29. arXiv:2108.02696  [pdf, other]

    cs.CV cs.AI cs.LG

    A Low Rank Promoting Prior for Unsupervised Contrastive Learning

    Authors: Yu Wang, Jingyang Lin, Qi Cai, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei

    Abstract: Unsupervised learning is just at a tipping point where it could really take off. Among these approaches, contrastive learning has seen tremendous progress and led to state-of-the-art performance. In this paper, we construct a novel probabilistic graphical model that effectively incorporates the low rank promoting prior into the framework of contrastive learning, referred to as LORAC. In contrast t…

    Submitted 5 August, 2021; originally announced August 2021.

  30. arXiv:2108.02667  [pdf, other]

    cs.CV

    Adaptive Normalized Representation Learning for Generalizable Face Anti-Spoofing

    Authors: Shubao Liu, Ke-Yue Zhang, Taiping Yao, Mingwei Bi, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma

    Abstract: With various face presentation attacks arising under unseen scenarios, face anti-spoofing (FAS) based on domain generalization (DG) has drawn growing attention due to its robustness. Most existing methods utilize DG frameworks to align the features to seek a compact and generalized feature space. However, little attention has been paid to the feature extraction process for the FAS task, especially…

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: accepted on ACM MM 2021

  31. arXiv:2107.12292  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Contextual Transformer Networks for Visual Recognition

    Authors: Yehao Li, Ting Yao, Yingwei Pan, Tao Mei

    Abstract: Transformer with self-attention has led to the revolutionizing of natural language processing field, and recently inspires the emergence of Transformer-style architecture design with competitive results in numerous computer vision tasks. Nevertheless, most of existing designs directly employ self-attention over a 2D feature map to obtain the attention matrix based on pairs of isolated queries and…

    Submitted 26 July, 2021; originally announced July 2021.

    Comments: Rank 1 in open-set image classification task of Open World Vision Challenge @ CVPR 2021; The source code and models are publicly available at: https://github.com/JDAI-CV/CoTNet

  32. arXiv:2107.10628  [pdf, other]

    cs.CV

    Structure Destruction and Content Combination for Face Anti-Spoofing

    Authors: Ke-Yue Zhang, Taiping Yao, Jian Zhang, Shice Liu, Bangjie Yin, Shouhong Ding, Jilin Li

    Abstract: In pursuit of consolidating the face verification systems, prior face anti-spoofing studies excavate the hidden cues in original images to discriminate real persons and diverse attack types with the assistance of auxiliary supervision. However, limited by the following two inherent disturbances in their training process: 1) Complete facial structure in a single image. 2) Implicit subdomains in the…

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Accepted by IJCB2021

  33. arXiv:2107.08650  [pdf, other]

    cs.CV

    Compound Figure Separation of Biomedical Images with Side Loss

    Authors: Tianyuan Yao, Chang Qu, Quan Liu, Ruining Deng, Yuanhan Tian, Jiachen Xu, Aadarsh Jha, Shunxing Bao, Mengyang Zhao, Agnes B. Fogo, Bennett A. Landman, Catie Chang, Haichun Yang, Yuankai Huo

    Abstract: Unsupervised learning algorithms (e.g., self-supervised learning, auto-encoder, contrastive learning) allow deep learning models to learn effective image representations from large-scale unlabeled data. In medical image analysis, even unannotated data can be difficult to obtain for individual labs. Fortunately, national-level efforts have been made to provide efficient access to obtain biomedical…

    Submitted 19 July, 2021; originally announced July 2021.

  34. arXiv:2106.16128  [pdf, other]

    cs.CV

    Dual Reweighting Domain Generalization for Face Presentation Attack Detection

    Authors: Shubao Liu, Ke-Yue Zhang, Taiping Yao, Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Yuan Xie, Lizhuang Ma

    Abstract: Face anti-spoofing approaches based on domain generalization (DG) have drawn growing attention due to their robustness for unseen scenarios. Previous methods treat each sample from multiple domains indiscriminately during the training process, and endeavor to extract a common feature space to improve the generalization. However, due to complex and biased data distribution, directly treating them e…

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: accepted on IJCAI 2021

  35. arXiv:2106.11480  [pdf, other]

    cs.CV eess.IV

    VoxelEmbed: 3D Instance Segmentation and Tracking with Voxel Embedding based Deep Learning

    Authors: Mengyang Zhao, Quan Liu, Aadarsh Jha, Ruining Deng, Tianyuan Yao, Anita Mahadevan-Jansen, Matthew J. Tyska, Bryan A. Millis, Yuankai Huo

    Abstract: Recent advances in bioimaging have provided scientists a superior high spatial-temporal resolution to observe dynamics of living cells as 3D volumetric videos. Unfortunately, the 3D biomedical video analysis is lagging, impeded by resource insensitive human curation using off-the-shelf 3D analytic tools. Herein, biologists often need to discard a considerable amount of rich 3D spatial information…

    Submitted 21 June, 2021; originally announced June 2021.

  36. arXiv:2105.03162  [pdf, other]

    cs.CV

    Adv-Makeup: A New Imperceptible and Transferable Attack on Face Recognition

    Authors: Bangjie Yin, Wenxuan Wang, Taiping Yao, Junfeng Guo, Zelun Kong, Shouhong Ding, Jilin Li, Cong Liu

    Abstract: Deep neural networks, particularly face recognition models, have been shown to be vulnerable to both digital and physical adversarial examples. However, existing adversarial examples against face recognition systems either lack transferability to black-box models, or fail to be implemented in practice. In this paper, we propose a unified adversarial face generation method - Adv-Makeup, which can r…

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: 8 pages, 6 figures, 1 tables, 1 algorithm, To appear in IJCAI 2021 as a regular paper

  37. arXiv:2105.02577  [pdf, other]

    cs.CV

    Local Relation Learning for Face Forgery Detection

    Authors: Shen Chen, Taiping Yao, Yang Chen, Shouhong Ding, Jilin Li, Rongrong Ji

    Abstract: With the rapid development of facial manipulation techniques, face forgery detection has received considerable attention in digital media forensics due to security concerns. Most existing methods formulate face forgery detection as a classification problem and utilize binary labels or manipulated region masks as supervision. However, without considering the correlation between local regions, these…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 8 pages, 6 figures, Accepted by AAAI2021

  38. arXiv:2105.02453  [pdf, other]

    cs.CV

    Generalizable Representation Learning for Mixture Domain Face Anti-Spoofing

    Authors: Zhihong Chen, Taiping Yao, Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Feiyue Huang, Xinyu Jin

    Abstract: Face anti-spoofing approach based on domain generalization (DG) has drawn growing attention due to its robustness for unseen scenarios. Existing DG methods assume that the domain label is known. However, in real-world applications, the collected dataset always contains mixture domains, where the domain label is unknown. In this case, most of existing methods may not work. Further, even if we can obta…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in AAAI2021

  39. arXiv:2104.13089  [pdf, ps, other]

    math.CO

    Large non-trivial $t$-intersecting families for signed sets

    Authors: Tian Yao, Benjian Lv, Kaishun Wang

    Abstract: For positive integers $n,r,k$ with $n\ge r$ and $k\ge2$, a set $\{(x_1,y_1),(x_2,y_2),\dots,(x_r,y_r)\}$ is called a $k$-signed $r$-set on $[n]$ if $x_1,\dots,x_r$ are distinct elements of $[n]$ and $y_1\dots,y_r\in[k]$. We say a $t$-intersecting family consisting of $k$-signed $r$-sets on $[n]$ is trivial if each member of this family contains a fixed $k$-signed $t$-set. In this paper, we determi…

    Submitted 27 April, 2021; originally announced April 2021.

    MSC Class: 05D05

  40. arXiv:2104.12378  [pdf, other]

    cs.CV

    Delving into Data: Effectively Substitute Training for Black-box Attack

    Authors: Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, Xiangyang Xue

    Abstract: Deep models have shown their vulnerability when processing adversarial samples. As for the black-box attack, without access to the architecture and weights of the attacked model, training a substitute model for adversarial attacks has attracted wide attention. Previous substitute training approaches focus on stealing the knowledge of the target model based on real training data or synthetic data,…

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: 10 pages, 6 figures, 6 tables, 1 algorithm, To appear in CVPR 2021 as a poster paper

  41. arXiv:2104.05786  [pdf]

    cond-mat.mtrl-sci cs.CV

    Understanding Fission Gas Bubble Distribution, Lanthanide Transportation, and Thermal Conductivity Degradation in Neutron-irradiated α-U Using Machine Learning

    Authors: Lu Cai, Fei Xu, Fidelma Di Lemma, Daniel J. Murray, Cynthia A. Adkins, Larry K Aagesen Jr, Min Xian, Luca Capriotti, Tiankai Yao

    Abstract: UZr based metallic nuclear fuel is the leading candidate for next-generation sodium-cooled fast reactors in the United States. US research reactors have been using and testing this fuel type since the 1960s and accumulated considerable experience and knowledge about the fuel performance. However, most of knowledge remains empirical. The lack of mechanistic understanding of fuel performance is prev…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 19 pages, 12 figures, 2 tables

  42. Deep Learning Techniques for In-Crop Weed Identification: A Review

    Authors: Kun Hu, Zhiyong Wang, Guy Coleman, Asher Bender, Tingting Yao, Shan Zeng, Dezhen Song, Arnold Schumann, Michael Walsh

    Abstract: Weeds are a significant threat to agricultural productivity and the environment. The increasing demand for sustainable agriculture has driven innovations in accurate weed control technologies aimed at reducing the reliance on herbicides. With the great success of deep learning in various vision tasks, many promising image-based weed detection algorithms have been developed. This paper reviews…

    Submitted 4 February, 2024; v1 submitted 27 March, 2021; originally announced March 2021.

  43. arXiv:2103.05585  [pdf, other]

    cs.CV

    SimTriplet: Simple Triplet Representation Learning with a Single GPU

    Authors: Quan Liu, Peter C. Louis, Yuzhe Lu, Aadarsh Jha, Mengyang Zhao, Ruining Deng, Tianyuan Yao, Joseph T. Roland, Haichun Yang, Shilin Zhao, Lee E. Wheless, Yuankai Huo

    Abstract: Contrastive learning is a key technique of modern self-supervised learning. The broader accessibility of earlier approaches is hindered by the need for heavy computational resources (e.g., at least 8 GPUs or 32 TPU cores), which accommodate large-scale negative samples or momentum. The more recent SimSiam approach addresses such key limitations via stop-gradient without momentum encoders. In me…

    Submitted 9 March, 2021; originally announced March 2021.

  44. arXiv:2102.09471  [pdf, other]

    cs.CV cs.LG

    DeeperForensics Challenge 2020 on Real-World Face Forgery Detection: Methods and Results

    Authors: Liming Jiang, Zhengkui Guo, Wayne Wu, Zhaoyang Liu, Ziwei Liu, Chen Change Loy, Shuo Yang, Yuanjun Xiong, Wei Xia, Baoying Chen, Peiyu Zhuang, Sili Li, Shen Chen, Taiping Yao, Shouhong Ding, Jilin Li, Feiyue Huang, Liujuan Cao, Rongrong Ji, Changlei Lu, Ganchao Tan

    Abstract: This paper reports methods and results in the DeeperForensics Challenge 2020 on real-world face forgery detection. The challenge employs the DeeperForensics-1.0 dataset, one of the most extensive publicly available real-world face forgery detection datasets, with 60,000 videos constituted by a total of 17.6 million frames. The model evaluation is conducted online on a high-quality hidden test set…

    Submitted 18 February, 2021; originally announced February 2021.

    Comments: Technical report. Challenge website: https://competitions.codalab.org/competitions/25228

  45. arXiv:2102.00713  [pdf, other]

    cs.CV

    Aurora Guard: Reliable Face Anti-Spoofing via Mobile Lighting System

    Authors: Jian Zhang, Ying Tai, Taiping Yao, Jia Meng, Shouhong Ding, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji

    Abstract: Face authentication on mobile devices has been widely applied in various scenarios. Despite the increasing reliability of cutting-edge face authentication/verification systems to variations like eye blinking and subtle facial expressions, anti-spoofing against high-resolution rendering replay of paper photos or digital videos remains an open problem. In this paper, we propose a simple yet effective…

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:1902.10311

  46. arXiv:2102.00662  [pdf, other]

    cs.LG cs.AI

    Towards Speeding up Adversarial Training in Latent Spaces

    Authors: Yaguan Qian, Qiqi Shao, Tengteng Yao, Bin Wang, Shouling Ji, Shaoning Zeng, Zhaoquan Gu, Wassim Swaileh

    Abstract: Adversarial training is widely considered one of the most effective ways to defend against adversarial examples. However, existing adversarial training methods are prohibitively time-consuming because they need to generate adversarial examples in the large input space. To speed up adversarial training, we propose a novel adversarial training method that does not need to generate real advers…

    Submitted 8 March, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

  47. arXiv:2101.11562  [pdf, other]

    cs.CV cs.CL

    Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network

    Authors: Yehao Li, Yingwei Pan, Ting Yao, Jingwen Chen, Tao Mei

    Abstract: Despite having impressive vision-language (VL) pretraining with BERT-based encoder for VL understanding, the pretraining of a universal encoder-decoder for both VL understanding and generation remains challenging. The difficulty originates from the inherently different peculiarities of the two disciplines, e.g., VL understanding tasks capitalize on the unrestricted message passing across modalitie…

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: AAAI 2021; Code is publicly available at: https://github.com/YehLi/TDEN

  48. arXiv:2101.06400  [pdf, other]

    cs.CL

    ComQA:Compositional Question Answering via Hierarchical Graph Neural Networks

    Authors: Bingning Wang, Ting Yao, Weipeng Chen, Jingfang Xu, Xiaochuan Wang

    Abstract: With the development of deep learning techniques and large scale datasets, the question answering (QA) systems have been quickly improved, providing more accurate and satisfying answers. However, current QA systems either focus on the sentence-level answer, i.e., answer selection, or phrase-level answer, i.e., machine reading comprehension. How to produce compositional answers has not been through…

    Submitted 16 January, 2021; originally announced January 2021.

    Comments: Accepted by WWW2021

  49. arXiv:2010.15982  [pdf, other]

    cs.IR

    A Model of Two Tales: Dual Transfer Learning Framework for Improved Long-tail Item Recommendation

    Authors: Yin Zhang, Derek Zhiyuan Cheng, Tiansheng Yao, Xinyang Yi, Lichan Hong, Ed H. Chi

    Abstract: Highly skewed long-tail item distribution is very common in recommendation systems. It significantly hurts model performance on tail items. To improve tail-item recommendation, we conduct research to transfer knowledge from head items to tail items, leveraging the rich user feedback in head items and the semantic connections between head and tail items. Specifically, we propose a novel dual transf…

    Submitted 7 March, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: Accepted by WWW 2021 as a long paper

  50. Developing Univariate Neurodegeneration Biomarkers with Low-Rank and Sparse Subspace Decomposition

    Authors: Gang Wang, Qunxi Dong, Jianfeng Wu, Yi Su, Kewei Chen, Qingtang Su, Xiaofeng Zhang, Jinguang Hao, Tao Yao, Li Liu, Caiming Zhang, Richard J Caselli, Eric M Reiman, Yalin Wang

    Abstract: Cognitive decline due to Alzheimer's disease (AD) is closely associated with brain structure alterations captured by structural magnetic resonance imaging (sMRI). It supports the validity to develop sMRI-based univariate neurodegeneration biomarkers (UNB). However, existing UNB work either fails to model large group variances or does not capture AD dementia (ADD) induced changes. We propose a nove…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: Accepted by Medical Image Analysis