Skip to main content

Showing 1–27 of 27 results for author: Yao, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19070  [pdf, other

    cs.CV

    FAGhead: Fully Animate Gaussian Head from Monocular Videos

    Authors: Yixin Xuan, Xinyang Li, Gongxin Yao, Shiwei Zhou, Donghui Sun, Xiaoxin Chen, Yu Pan

    Abstract: High-fidelity reconstruction of 3D human avatars has a wild application in visual reality. In this paper, we introduce FAGhead, a method that enables fully controllable human portraits from monocular videos. We explicit the traditional 3D morphable meshes (3DMM) and optimize the neutral 3D Gaussians to reconstruct with complex expressions. Furthermore, we employ a novel Point-based Learnable Repre… ▽ More

    Submitted 28 June, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.00301  [pdf, other

    cs.NI eess.SP

    A Survey on the Use of Partitioning in IoT-Edge-AI Applications

    Authors: Guoxing Yao, Lav Gupta

    Abstract: Centralized clouds processing the large amount of data generated by Internet-of-Things (IoT) can lead to unacceptable latencies for the end user. Against this backdrop, Edge Computing (EC) is an emerging paradigm that can address the shortcomings of traditional centralized Cloud Computing (CC). Its use is associated with improved performance, productivity, and security. Some of its use cases inclu… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  3. arXiv:2405.14074  [pdf

    cs.CR cs.NI

    Enhancing Critical Infrastructure Cybersecurity: Collaborative DNN Synthesis in the Cloud Continuum

    Authors: Lav Gupta, Guoxing Yao

    Abstract: Researchers are exploring the integration of IoT and the cloud continuum, together with AI to enhance the cost-effectiveness and efficiency of critical infrastructure (CI) systems. This integration, however, increases susceptibility of CI systems to cyberattacks, potentially leading to disruptions like power outages, oil spills, or even a nuclear mishap. CI systems are inherently complex and gener… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2405.11993  [pdf, other

    cs.CV

    GGAvatar: Geometric Adjustment of Gaussian Head Avatar

    Authors: Xinyang Li, Jiaxin Wang, Yixin Xuan, Gongxin Yao, Yu Pan

    Abstract: We propose GGAvatar, a novel 3D avatar representation designed to robustly model dynamic head avatars with complex identities and deformations. GGAvatar employs a coarse-to-fine structure, featuring two core modules: Neutral Gaussian Initialization Module and Geometry Morph Adjuster. Neutral Gaussian Initialization Module pairs Gaussian primitives with deformable triangular meshes, employing an ad… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 9 pages, 5 figures

  5. arXiv:2402.08910  [pdf, other

    cs.CV cs.LG

    Learning-based Bone Quality Classification Method for Spinal Metastasis

    Authors: Shiqi Peng, Bolin Lai, Guangyu Yao, Xiaoyun Zhang, Ya Zhang, Yan-Feng Wang, Hui Zhao

    Abstract: Spinal metastasis is the most common disease in bone metastasis and may cause pain, instability and neurological injuries. Early detection of spinal metastasis is critical for accurate staging and optimal treatment. The diagnosis is usually facilitated with Computed Tomography (CT) scans, which requires considerable efforts from well-trained radiologists. In this paper, we explore a learning-based… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  6. arXiv:2402.08892  [pdf, other

    cs.CV cs.LG

    Weakly Supervised Segmentation of Vertebral Bodies with Iterative Slice-propagation

    Authors: Shiqi Peng, Bolin Lai, Guangyu Yao, Xiaoyun Zhang, Ya Zhang, Yan-Feng Wang, Hui Zhao

    Abstract: Vertebral body (VB) segmentation is an important preliminary step towards medical visual diagnosis for spinal diseases. However, most previous works require pixel/voxel-wise strong supervisions, which is expensive, tedious and time-consuming for experts to annotate. In this paper, we propose a Weakly supervised Iterative Spinal Segmentation (WISS) method leveraging only four corner landmark weak l… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:1412.7062 by other authors

  7. arXiv:2310.01377  [pdf, other

    cs.CL cs.AI cs.LG

    UltraFeedback: Boosting Language Models with High-quality Feedback

    Authors: Ganqu Cui, Lifan Yuan, Ning Ding, Guanming Yao, Wei Zhu, Yuan Ni, Guotong Xie, Zhiyuan Liu, Maosong Sun

    Abstract: Reinforcement learning from human feedback (RLHF) has become a pivot technique in aligning large language models (LLMs) with human preferences. In RLHF practice, preference data plays a crucial role in bridging human proclivity and LLMs. However, the scarcity of diverse, naturalistic datasets of human preferences on LLM outputs at scale poses a great challenge to RLHF as well as feedback learning… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  8. arXiv:2307.07142  [pdf, other

    cs.CV

    Quantity-Aware Coarse-to-Fine Correspondence for Image-to-Point Cloud Registration

    Authors: Gongxin Yao, Yixin Xuan, Yiwei Chen, Yu Pan

    Abstract: Image-to-point cloud registration aims to determine the relative camera pose between an RGB image and a reference point cloud, serving as a general solution for locating 3D objects from 2D observations. Matching individual points with pixels can be inherently ambiguous due to modality gaps. To address this challenge, we propose a framework to capture quantity-aware correspondences between local po… ▽ More

    Submitted 18 January, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

  9. arXiv:2302.13479  [pdf, other

    cs.IT

    Age Minimization with Energy and Distortion Constraints

    Authors: Guidan Yao, Chih-Chun Wang, Ness B. Shroff

    Abstract: In this paper, we consider a status update system, where an access point collects measurements from multiple sensors that monitor a common physical process, fuses them, and transmits the aggregated sample to the destination over an erasure channel. Under a typical information fusion scheme, the distortion of the fused sample is inversely proportional to the number of measurements received. Our goa… ▽ More

    Submitted 26 February, 2023; originally announced February 2023.

  10. arXiv:2201.02475  [pdf, other

    eess.IV cs.CV

    Deep Domain Adversarial Adaptation for Photon-efficient Imaging

    Authors: Yiwei Chen, Gongxin Yao, Yong Liu, Hongye Su, Xiaomin Hu, Yu Pan

    Abstract: Photon-efficient imaging with the single-photon light detection and ranging (LiDAR) captures the three-dimensional (3D) structure of a scene by only a few detected signal photons per pixel. However, the existing computational methods for photon-efficient imaging are pre-tuned on a restricted scenario or trained on simulated datasets. When applied to realistic scenarios whose signal-to-background r… ▽ More

    Submitted 27 October, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  11. arXiv:2201.01453  [pdf, other

    eess.IV cs.CV eess.SP

    Robust photon-efficient imaging using a pixel-wise residual shrinkage network

    Authors: Gongxin Yao, Yiwei Chen, Yong Liu, Xiaomin Hu, Yu Pan

    Abstract: Single-photon light detection and ranging (LiDAR) has been widely applied to 3D imaging in challenging scenarios. However, limited signal photon counts and high noises in the collected data have posed great challenges for predicting the depth image precisely. In this paper, we propose a pixel-wise residual shrinkage network for photon-efficient imaging from high-noise data, which adaptively genera… ▽ More

    Submitted 18 May, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

    Journal ref: Optics Express 30(11):18856-18873, 2022

  12. arXiv:2112.12390  [pdf, other

    cs.CV

    Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields

    Authors: Guangming Yao, Hongzhi Wu, Yi Yuan, Lincheng Li, Kun Zhou, Xin Yu

    Abstract: In this paper, we present a novel double diffusion based neural radiance field, dubbed DD-NeRF, to reconstruct human body geometry and render the human body appearance in novel views from a sparse set of images. We first propose a double diffusion mechanism to achieve expressive representations of input images by fully exploiting human body priors and image appearance details at two levels. At the… ▽ More

    Submitted 17 January, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: 6 pages, 5 figures

  13. arXiv:2105.10112  [pdf, other

    cs.CV cs.AI

    IDEAL: Independent Domain Embedding Augmentation Learning

    Authors: Zhiyuan Chen, Guang Yao, Wennan Ma, Lin Xu

    Abstract: Many efforts have been devoted to designing sampling, mining, and weighting strategies in high-level deep metric learning (DML) loss objectives. However, little attention has been paid to low-level but essential data transformation. In this paper, we develop a novel mechanism, the independent domain embedding augmentation learning ({IDEAL}) method. It can simultaneously learn multiple independent… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: 11 pages, 2 figures, 4 tables

  14. arXiv:2105.04286  [pdf, other

    cs.CV

    Primitive Representation Learning for Scene Text Recognition

    Authors: Ruijie Yan, Liangrui Peng, Shanyu Xiao, Gang Yao

    Abstract: Scene text recognition is a challenging task due to diverse variations of text instances in natural scene images. Conventional methods based on CNN-RNN-CTC or encoder-decoder with attention mechanism may not fully investigate stable and efficient feature representations for multi-oriented scene texts. In this paper, we propose a primitive representation learning method that aims to exploit intrins… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

  15. arXiv:2105.02039  [pdf, other

    cs.CV

    Towards an efficient framework for Data Extraction from Chart Images

    Authors: Weihong Ma, Hesuo Zhang, Shuang Yan, Guangshun Yao, Yichao Huang, Hui Li, Yaqiang Wu, Lianwen **

    Abstract: In this paper, we fill the research gap by adopting state-of-the-art computer vision techniques for the data extraction stage in a data mining system. As shown in Fig.1, this stage contains two subtasks, namely, plot element detection and data conversion. For building a robust box detector, we comprehensively compare different deep learning-based methods and find a suitable method to detect box wi… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: accepted by ICDAR2021

  16. arXiv:2102.03984  [pdf, other

    cs.CV

    One-shot Face Reenactment Using Appearance Adaptive Normalization

    Authors: Guangming Yao, Yi Yuan, Tianjia Shao, Shuang Li, Shanqi Liu, Yong Liu, Mengmeng Wang, Kun Zhou

    Abstract: The paper proposes a novel generative adversarial network for one-shot face reenactment, which can animate a single face image to a different pose-and-expression (provided by a driving image) while kee** its original appearance. The core of our network is a novel mechanism called appearance adaptive normalization, which can effectively integrate the appearance information from the input image in… ▽ More

    Submitted 26 April, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: 9 pages, 8 figures,3 tables ,Accepted by AAAI2021

  17. arXiv:2012.09351  [pdf, other

    cs.IT

    Battle between Rate and Error in Minimizing Age of Information

    Authors: Guidan Yao, Ahmed M. Bedewy, Ness B. Shroff

    Abstract: In this paper, we consider a status update system, in which update packets are sent to the destination via a wireless medium that allows for multiple rates, where a higher rate also naturally corresponds to a higher error probability. The data freshness is measured using age of information, which is defined as the age of the recent update at the destination. A packet that is transmitted with a hig… ▽ More

    Submitted 28 December, 2020; v1 submitted 16 December, 2020; originally announced December 2020.

  18. arXiv:2012.02958  [pdf, other

    cs.IT

    Age-Optimal Low-Power Status Update over Time-Correlated Fading Channel

    Authors: Guidan Yao, Ahmed M. Bedewy, Ness B. Shroff

    Abstract: In this paper, we consider transmission scheduling in a status update system, where updates are generated periodically and transmitted over a Gilbert-Elliott fading channel. The goal is to minimize the long-run average age of information (AoI) at the destination under an average energy constraint. We consider two practical cases to obtain channel state information (CSI): (i) \emph{without channel… ▽ More

    Submitted 31 January, 2021; v1 submitted 5 December, 2020; originally announced December 2020.

  19. Mesh Guided One-shot Face Reenactment using Graph Convolutional Networks

    Authors: Guangming Yao, Yi Yuan, Tianjia Shao, Kun Zhou

    Abstract: Face reenactment aims to animate a source face image to a different pose and expression provided by a driving image. Existing approaches are either designed for a specific identity, or suffer from the identity preservation problem in the one-shot or few-shot scenarios. In this paper, we introduce a method for one-shot face reenactment, which uses the reconstructed 3D meshes (i.e., the source mesh… ▽ More

    Submitted 18 September, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: 9 pages, 8 figures,accepted by ACM MM2020

  20. arXiv:2004.05233  [pdf, other

    cs.RO cs.CV eess.SY

    Shape Estimation for Elongated Deformable Object using B-spline Chained Multiple Random Matrices Model

    Authors: Gang Yao, Ryan Saltus, Ashwin Dani

    Abstract: In this paper, a B-spline chained multiple random matrices representation is proposed to model geometric characteristics of an elongated deformable object. The hyper degrees of freedom structure of the elongated deformable object make its shape estimation challenging. Based on the likelihood function of the proposed model, an expectation-maximization (EM) method is derived to estimate the shape of… ▽ More

    Submitted 10 April, 2020; originally announced April 2020.

  21. arXiv:2003.09615  [pdf, other

    cs.LG stat.ML

    DP-Net: Dynamic Programming Guided Deep Neural Network Compression

    Authors: Dingcheng Yang, Wenjian Yu, Ao Zhou, Haoyuan Mu, Gary Yao, Xiaoyi Wang

    Abstract: In this work, we propose an effective scheme (called DP-Net) for compressing the deep neural networks (DNNs). It includes a novel dynamic programming (DP) based algorithm to obtain the optimal solution of weight quantization and an optimization process to train a clustering-friendly DNN. Experiments showed that the DP-Net allows larger compression than the state-of-the-art counterparts while prese… ▽ More

    Submitted 21 March, 2020; originally announced March 2020.

    Comments: 7pages, 4 figures

  22. arXiv:2003.00835   

    cs.CV

    Deep Variational Luenberger-type Observer for Stochastic Video Prediction

    Authors: Dong Wang, Feng Zhou, Zheng Yan, Guang Yao, Zongxuan Liu, Wennan Ma, Cewu Lu

    Abstract: Considering the inherent stochasticity and uncertainty, predicting future video frames is exceptionally challenging. In this work, we study the problem of video prediction by combining interpretability of stochastic state space models and representation learning of deep neural networks. Our model builds upon an variational encoder which transforms the input video into a latent feature space and a… ▽ More

    Submitted 10 September, 2023; v1 submitted 12 February, 2020; originally announced March 2020.

    Comments: rewrite paper

  23. arXiv:1912.01054  [pdf, other

    eess.IV cs.CV cs.LG

    The state of the art in kidney and kidney tumor segmentation in contrast-enhanced CT imaging: Results of the KiTS19 Challenge

    Authors: Nicholas Heller, Fabian Isensee, Klaus H. Maier-Hein, Xiaoshuai Hou, Chunmei Xie, Fengyi Li, Yang Nan, Guangrui Mu, Zhiyong Lin, Miofei Han, Guang Yao, Yaozong Gao, Yao Zhang, Yixin Wang, Feng Hou, Jiawei Yang, Guangwei Xiong, Jiang Tian, Cheng Zhong, Jun Ma, Jack Rickman, Joshua Dean, Bethany Stai, Resha Tejpaul, Makinna Oestreich , et al. (16 additional authors not shown)

    Abstract: There is a large body of literature linking anatomic and geometric characteristics of kidney tumors to perioperative and oncologic outcomes. Semantic segmentation of these tumors and their host kidneys is a promising tool for quantitatively characterizing these lesions, but its adoption is limited due to the manual effort required to produce high-quality 3D segmentations of these structures. Recen… ▽ More

    Submitted 7 August, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: 24 pages, 11 figures

  24. arXiv:1911.01002  [pdf, other

    cs.CR

    Generalized NLFSR Transformation Algorithms and Cryptanalysis of the Class of Espresso-like Stream Ciphers

    Authors: Ge Yao, Udaya Parampalli

    Abstract: Lightweight stream ciphers are highly demanded in IoT applications. In order to optimize the hardware performance, a new class of stream cipher has been proposed. The basic idea is to employ a single Galois NLFSR with maximum period to construct the cipher. As a representative design of this kind of stream ciphers, Espresso is based on a 256-bit Galois NLFSR initialized by a 128-bit key. The… ▽ More

    Submitted 3 November, 2019; originally announced November 2019.

  25. arXiv:1901.00963  [pdf, other

    cs.IT cs.NI

    Integrating Sub-6 GHz and Millimeter Wave to Combat Blockage: Delay-Optimal Scheduling

    Authors: Guidan Yao, Morteza Hashemi, Ness B. Shroff

    Abstract: Millimeter wave (mmWave) technologies have the potential to achieve very high data rates, but suffer from intermittent connectivity. In this paper, we provision an architecture to integrate sub-6 GHz and mmWave technologies, where we incorporate the sub-6 GHz interface as a fallback data transfer mechanism to combat blockage and intermittent connectivity of the mmWave communications. To this end,… ▽ More

    Submitted 21 January, 2019; v1 submitted 3 January, 2019; originally announced January 2019.

  26. arXiv:1804.03036  [pdf, other

    eess.IV cs.RO eess.SY

    Image Moment Models for Extended Object Tracking

    Authors: Gang Yao, Ashwin Dani

    Abstract: In this paper, a novel image moments based model for shape estimation and tracking of an object moving with a complex trajectory is presented. The camera is assumed to be stationary looking at a moving object. Point features inside the object are sampled as measurements. An ellipsoidal approximation of the shape is assumed as a primitive shape. The shape of an ellipse is estimated using a combinat… ▽ More

    Submitted 9 April, 2018; originally announced April 2018.

    Journal ref: IEEE Transactions on Aerospace and Electronic Systems, 2018

  27. arXiv:1804.02470  [pdf, other

    eess.IV cs.CV cs.RO

    Visual Tracking Using Sparse Coding and Earth Mover's Distance

    Authors: Gang Yao, Ashwin Dani

    Abstract: An efficient iterative Earth Mover's Distance (iEMD) algorithm for visual tracking is proposed in this paper. The Earth Mover's Distance (EMD) is used as the similarity measure to search for the optimal template candidates in feature-spatial space in a video sequence. The computation of the EMD is formulated as the transportation problem from linear programming. The efficiency of the EMD optimizat… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.