Showing 1–7 of 7 results for author: Zeng, A

Searching in archive eess.
  1. arXiv:2403.11626

    cs.GR cs.AI cs.CV cs.MM cs.SD eess.AS

    QEAN: Quaternion-Enhanced Attention Network for Visual Dance Generation

    Authors: Zhizhen Zhou, Yejing Huo, Guoheng Huang, An Zeng, Xuhang Chen, Lian Huang, Zinuo Li

    Abstract: The study of music-generated dance is a novel and challenging image generation task. It aims to input a piece of music and seed motions, then generate natural dance movements for the subsequent music. Transformer-based methods face challenges in time series prediction tasks related to human movements and music due to their struggle in capturing the nonlinear relationship and temporal aspects. This…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by The Visual Computer Journal

  2. arXiv:2401.04747

    cs.SD cs.AI cs.CV cs.GR eess.AS

    DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

    Authors: Junming Chen, Yunfei Liu, Jianan Wang, Ailing Zeng, Yu Li, Qifeng Chen

    Abstract: We propose DiffSHEG, a Diffusion-based approach for Speech-driven Holistic 3D Expression and Gesture generation with arbitrary length. While previous works focused on co-speech gesture or expression generation individually, the joint generation of synchronized expressions and gestures remains barely explored. To address this, our diffusion-based co-speech motion generation transformer enables uni-…

    Submitted 6 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted by CVPR 2024. Project page: https://jeremycjm.github.io/proj/DiffSHEG

  3. arXiv:2211.01607

    eess.IV cs.LG

    ImageCAS: A Large-Scale Dataset and Benchmark for Coronary Artery Segmentation based on Computed Tomography Angiography Images

    Authors: An Zeng, Chunbiao Wu, Meiping Huang, Jian Zhuang, Shanshan Bi, Dan Pan, Najeeb Ullah, Kaleem Nawaz Khan, Tianchen Wang, Yiyu Shi, Xiaomeng Li, Guisen Lin, Xiaowei Xu

    Abstract: Cardiovascular disease (CVD) accounts for about half of non-communicable diseases. Vessel stenosis in the coronary artery is considered to be the major risk of CVD. Computed tomography angiography (CTA) is one of the widely used noninvasive imaging modalities in coronary artery diagnosis due to its superior image resolution. Clinically, segmentation of coronary arteries is essential for the diagno…

    Submitted 17 October, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: 17 pages, 12 figures, 4 tables

    Journal ref: Computerized Medical Imaging and Graphics, 2023

  4. arXiv:2203.08715

    cs.RO cs.AI cs.LG eess.SY math.DS

    Multiscale Sensor Fusion and Continuous Control with Neural CDEs

    Authors: Sumeet Singh, Francis McCann Ramirez, Jacob Varley, Andy Zeng, Vikas Sindhwani

    Abstract: Though robot learning is often formulated in terms of discrete-time Markov decision processes (MDPs), physical robots require near-continuous multiscale feedback control. Machines operate on multiple asynchronous sensing modalities, each with different frequencies, e.g., video frames at 30Hz, proprioceptive state at 100Hz, force-torque data at 500Hz, etc. While the classic approach is to batch obs…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Submitted to IEEE IROS 2022

  5. arXiv:2109.07578

    cs.LG cs.AI cs.RO eess.SY

    Multi-Task Learning with Sequence-Conditioned Transporter Networks

    Authors: Michael H. Lim, Andy Zeng, Brian Ichter, Maryam Bandari, Erwin Coumans, Claire Tomlin, Stefan Schaal, Aleksandra Faust

    Abstract: Enabling robots to solve multiple manipulation tasks has a wide range of industrial applications. While learning-based approaches enjoy flexibility and generalizability, scaling these approaches to solve such compositional tasks remains a challenge. In this work, we aim to solve multi-task learning through the lens of sequence-conditioning and weighted sampling. First, we propose a new suite of be…

    Submitted 15 September, 2021; originally announced September 2021.

  6. arXiv:2012.05456

    eess.SP cs.LG

    T-WaveNet: Tree-Structured Wavelet Neural Network for Sensor-Based Time Series Analysis

    Authors: Minhao Liu, Ailing Zeng, Qiuxia Lai, Qiang Xu

    Abstract: Sensor-based time series analysis is an essential task for applications such as activity recognition and brain-computer interface. Recently, features extracted with deep neural networks (DNNs) are shown to be more effective than conventional hand-crafted ones. However, most of these solutions rely solely on the network to extract application-specific information carried in the sensor data. Motivat…

    Submitted 10 December, 2020; originally announced December 2020.

  7. arXiv:1910.02550

    cs.CV cs.RO eess.IV

    ClearGrasp: 3D Shape Estimation of Transparent Objects for Manipulation

    Authors: Shreeyak S. Sajjan, Matthew Moore, Mike Pan, Ganesh Nagaraja, Johnny Lee, Andy Zeng, Shuran Song

    Abstract: Transparent objects are a common part of everyday life, yet they possess unique visual properties that make them incredibly difficult for standard 3D sensors to produce accurate depth estimates for. In many cases, they often appear as noisy or distorted approximations of the surfaces that lie behind them. To address these challenges, we present ClearGrasp -- a deep learning approach for estimating…

    Submitted 14 October, 2019; v1 submitted 6 October, 2019; originally announced October 2019.

    Comments: Project Website: https://sites.google.com/view/cleargrasp, 13 pages, 13 figures, submitted to ICRA