Skip to main content

Showing 1–3 of 3 results for author: Chai, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2402.11164  [pdf

    eess.IV

    TinyLIC-High efficiency lossy image compression method

    Authors: Gaocheng Ma, Yinfeng Chai, Tianhao Jiang, Ming Lu, Tong Chen

    Abstract: Image compression has been the subject of extensive research for several decades, resulting in the development of well-known standards such as JPEG, JPEG2000, and H.264/AVC. However, recent advancements in deep learning have led to the emergence of learned image compression methods that offer significant improvements in coding efficiency compared to traditional codecs. These learned compression te… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  2. arXiv:2307.14491  [pdf, other

    cs.MM cs.SD eess.AS

    A Unified Framework for Modality-Agnostic Deepfakes Detection

    Authors: Cai Yu, Peng Chen, Jiahe Tian, ** Liu, Jiao Dai, Xi Wang, Yesheng Chai, Shan Jia, Siwei Lyu, Jizhong Han

    Abstract: As AI-generated content (AIGC) thrives, deepfakes have expanded from single-modality falsification to cross-modal fake content creation, where either audio or visual components can be manipulated. While using two unimodal detectors can detect audio-visual deepfakes, cross-modal forgery clues could be overlooked. Existing multimodal deepfake detection methods typically establish correspondence betw… ▽ More

    Submitted 24 October, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2302.04456  [pdf, other

    cs.SD cs.AI cs.CL cs.MM eess.AS

    ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

    Authors: Pengfei Zhu, Chao Pang, Yekun Chai, Lei Li, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu

    Abstract: In recent years, the burgeoning interest in diffusion models has led to significant advances in image and speech generation. Nevertheless, the direct synthesis of music waveforms from unrestricted textual prompts remains a relatively underexplored domain. In response to this lacuna, this paper introduces a pioneering contribution in the form of a text-to-waveform music generation model, underpinne… ▽ More

    Submitted 21 September, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted by AACL demo 2023