Skip to main content

Showing 1–8 of 8 results for author: Duan, Q

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.16136  [pdf, other

    cs.AI cs.CL cs.LG cs.SD eess.AS

    C3LLM: Conditional Multimodal Content Generation Using Large Language Models

    Authors: Zixuan Wang, Qinkai Duan, Yu-Wing Tai, Chi-Keung Tang

    Abstract: We introduce C3LLM (Conditioned-on-Three-Modalities Large Language Models), a novel framework combining three tasks of video-to-audio, audio-to-text, and text-to-audio together. C3LLM adapts the Large Language Model (LLM) structure as a bridge for aligning different modalities, synthesizing the given conditional information, and making multimodal generation in a discrete manner. Our contributions… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2308.15144  [pdf, other

    eess.IV

    TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching

    Authors: Yun Liao, Yide Di, Hao Zhou, Kaijun Zhu, Mingyu Lu, Yijia Zhang, Qing Duan, Junhui Liu

    Abstract: Local feature matching remains a challenging task, primarily due to difficulties in matching sparse keypoints and low-texture regions. The key to solving this problem lies in effectively and accurately integrating global and local information. To achieve this goal, we introduce an innovative local feature matching method called TKwinFormer. Our approach employs a multi-stage matching strategy to o… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 11 pages, 7 figures

    ACM Class: I.4.7

  3. arXiv:2202.03424  [pdf, other

    cs.LG cs.AI cs.DM eess.SY math.OC

    Reinforcement learning for multi-item retrieval in the puzzle-based storage system

    Authors: **g He, Xinglu Liu, Qiyao Duan, Wai Kin Victor Chan, Mingyao Qi

    Abstract: Nowadays, fast delivery services have created the need for high-density warehouses. The puzzle-based storage system is a practical way to enhance the storage density, however, facing difficulties in the retrieval process. In this work, a deep reinforcement learning algorithm, specifically the Double&Dueling Deep Q Network, is developed to solve the multi-item retrieval problem in the system with g… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: 32 pages, 13 figures, 5 tables, journal

  4. Dual Optimization for Kolmogorov Model Learning Using Enhanced Gradient Descent

    Authors: Qiyou Duan, Hadi Ghauch, Taejoon Kim

    Abstract: Data representation techniques have made a substantial contribution to advancing data processing and machine learning (ML). Improving predictive power was the focus of previous representation techniques, which unfortunately perform rather poorly on the interpretability in terms of extracting underlying insights of the data. Recently, the Kolmogorov model (KM) was studied, which is an interpretable… ▽ More

    Submitted 20 May, 2022; v1 submitted 11 July, 2021; originally announced July 2021.

    Comments: Published in the IEEE Transactions on Signal Processing (15 pages, 11 figures, and 6 tables)

  5. arXiv:2007.13299  [pdf, other

    eess.SP cs.LG

    Enhanced Beam Alignment for Millimeter Wave MIMO Systems: A Kolmogorov Model

    Authors: Qiyou Duan, Taejoon Kim, Hadi Ghauch

    Abstract: We present an enhancement to the problem of beam alignment in millimeter wave (mmWave) multiple-input multiple-output (MIMO) systems, based on a modification of the machine learning-based criterion, called Kolmogorov model (KM), previously applied to the beam alignment problem. Unlike the previous KM, whose computational complexity is not scalable with the size of the problem, a new approach, cent… ▽ More

    Submitted 26 July, 2020; originally announced July 2020.

    Comments: Submitted to the 2020 IEEE Globecom

  6. arXiv:2004.07031  [pdf, other

    cs.HC eess.IV

    SenseCare: A Research Platform for Medical Image Informatics and Interactive 3D Visualization

    Authors: Qi Duan, Guotai Wang, Rui Wang, Chao Fu, Xinjun Li, Na Wang, Yechong Huang, Xiaodi Huang, Tao Song, Liang Zhao, Xinglong Liu, Qing Xia, Zhiqiang Hu, Yinan Chen, Shaoting Zhang

    Abstract: Clinical research on smart health has an increasing demand for intelligent and clinic-oriented medical image computing algorithms and platforms that support various applications. To this end, we have developed SenseCare research platform, which is designed to facilitate translational research on intelligent diagnosis and treatment planning in various clinical scenarios. To enable clinical research… ▽ More

    Submitted 2 September, 2022; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: 15 pages, 16 figures

  7. arXiv:1910.03729  [pdf, other

    eess.IV cs.CV

    Large-scale Gastric Cancer Screening and Localization Using Multi-task Deep Neural Network

    Authors: Hong Yu, Xiaofan Zhang, Lingjun Song, Liren Jiang, Xiaodi Huang, Wen Chen, Chenbin Zhang, Jiahui Li, Jiji Yang, Zhiqiang Hu, Qi Duan, Wanyuan Chen, Xianglei He, **shuang Fan, Weihai Jiang, Li Zhang, Chengmin Qiu, Minmin Gu, Weiwei Sun, Yangqiong Zhang, Guangyin Peng, Weiwei Shen, Guohui Fu

    Abstract: Gastric cancer is one of the most common cancers, which ranks third among the leading causes of cancer death. Biopsy of gastric mucosa is a standard procedure in gastric cancer screening test. However, manual pathological inspection is labor-intensive and time-consuming. Besides, it is challenging for an automated algorithm to locate the small lesion regions in the gigapixel whole-slide image and… ▽ More

    Submitted 19 September, 2020; v1 submitted 8 October, 2019; originally announced October 2019.

    Comments: under minor revision

  8. Coherence Statistics of Structured Random Ensembles and Support Detection Bounds for OMP

    Authors: Qiyou Duan, Taejoon Kim, Lin Dai, Erik Perrins

    Abstract: A structured random matrix ensemble that maintains constant modulus entries and unit-norm columns, often called a random phase-rotated (RPR) matrix, is considered in this paper. We analyze the coherence statistics of RPR measurement matrices and apply them to acquire probabilistic performance guarantees of orthogonal matching pursuit (OMP) for support detection (SD). It is revealed via numerical s… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: Accepted for publication in the IEEE Signal Processing Letters