
Showing 1–21 of 21 results for author: Yue, D

Searching in archive cs.
  1. arXiv:2312.05856  [pdf, other]

    cs.CV

    A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing

    Authors: Maomao Li, Yu Li, Tianyu Yang, Yunfei Liu, Dongxu Yue, Zhihui Lin, Dong Xu

    Abstract: This paper presents a video inversion approach for zero-shot video editing, which models the input video with low-rank representation during the inversion process. The existing video editing methods usually apply the typical 2D DDIM inversion or naive spatial-temporal DDIM inversion before editing, which leverages time-varying representation for each frame to derive noisy latent. Unlike most exist…

    Submitted 18 June, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: 14 pages, Project page: https://stem-inv.github.io/page/

    Journal ref: CVPR 2024

  2. arXiv:2310.18021  [pdf, other]

    cs.AI

    FormalGeo: An Extensible Formalized Framework for Olympiad Geometric Problem Solving

    Authors: Xiaokai Zhang, Na Zhu, Yiming He, Jia Zou, Qike Huang, Xiaoxiao Jin, Yanjun Guo, Chenyang Mao, Yang Li, Zhe Zhu, Dengfeng Yue, Fangzhen Zhu, Yifan Wang, Yiwen Huang, Runan Wang, Cheng Qin, Zhenbing Zeng, Shaorong Xie, Xiangfeng Luo, Tuo Leng

    Abstract: This is the first paper in a series of work we have accomplished over the past three years. In this paper, we have constructed a consistent formal plane geometry system. This will serve as a crucial bridge between IMO-level plane geometry challenges and readable AI automated reasoning. Within this formal framework, we have been able to seamlessly integrate modern AI models with our formal system.…

    Submitted 14 February, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: 44 pages

  3. arXiv:2305.14742  [pdf, other]

    cs.CV

    ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

    Authors: Dongxu Yue, Qin Guo, Munan Ning, Jiaxi Cui, Yuesheng Zhu, Li Yuan

    Abstract: Editing real facial images is a crucial task in computer vision with significant demand in various real-world applications. While GAN-based methods have showed potential in manipulating images especially when combined with CLIP, these methods are limited in their ability to reconstruct real images due to challenging GAN inversion capability. Despite the successful image reconstruction achieved by…

    Submitted 5 June, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  4. arXiv:2303.09916  [pdf]

    physics.chem-ph cs.LG

    DSDP: A Blind Docking Strategy Accelerated by GPUs

    Authors: YuPeng Huang, Hong Zhang, Siyuan Jiang, Dajiong Yue, Xiaohan Lin, Jun Zhang, Yi Qin Gao

    Abstract: Virtual screening, including molecular docking, plays an essential role in drug discovery. Many traditional and machine-learning based methods are available to fulfil the docking task. The traditional docking methods are normally extensively time-consuming, and their performance in blind docking remains to be improved. Although the runtime of docking based on machine learning is significantly decr…

    Submitted 16 March, 2023; originally announced March 2023.

  5. arXiv:2302.10484  [pdf, other]

    cs.CV

    Lightweight Real-time Semantic Segmentation Network with Efficient Transformer and CNN

    Authors: Guoan Xu, Juncheng Li, Guangwei Gao, Huimin Lu, Jian Yang, Dong Yue

    Abstract: In the past decade, convolutional neural networks (CNNs) have shown prominence for semantic segmentation. Although CNN models have very impressive performance, the ability to capture global representation is still insufficient, which results in suboptimal results. Recently, Transformer achieved huge success in NLP tasks, demonstrating its advantages in modeling long-range dependency. Recently, Tra…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: IEEE Transactions on Intelligent Transportation Systems, 10 pages

  6. arXiv:2212.12744  [pdf, ps, other]

    eess.SP cs.LG

    Energy Efficiency Maximization in IRS-Aided Cell-Free Massive MIMO System

    Authors: Si-Nian Jin, Dian-Wu Yue, Yi-Ling Chen, Qing Hu

    Abstract: In this paper, we consider an intelligent reflecting surface (IRS)-aided cell-free massive multiple-input multiple-output system, where the beamforming at access points and the phase shifts at IRSs are jointly optimized to maximize energy efficiency (EE). To solve EE maximization problem, we propose an iterative optimization algorithm by using quadratic transform and Lagrangian dual transform to f…

    Submitted 24 December, 2022; originally announced December 2022.

    Comments: 6 pages, 4 figures

  7. arXiv:2112.06593  [pdf, ps, other]

    cs.IT eess.SP

    RIS-Aided Cell-Free Massive MIMO Systems: Joint Design of Transmit Beamforming and Phase Shifts

    Authors: Si-Nian Jin, Dian-Wu Yue, Ha H. Nguyen

    Abstract: This paper studies RIS-aided cell-free massive MIMO systems, where multiple RISs are deployed to assist the communication between multiple access points (APs) and multiple users, with either continuous or discrete phase shifts at the RISs. We formulate the max-min fairness problem that maximizes the minimum achievable rate among all users by jointly optimizing the transmit beamforming at active AP…

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 13 pages, 10 figures. Submitted to IEEE for possible publication

  8. arXiv:2103.13044  [pdf, other]

    cs.CV

    MSCFNet: A Lightweight Network With Multi-Scale Context Fusion for Real-Time Semantic Segmentation

    Authors: Guangwei Gao, Guoan Xu, Yi Yu, Jin Xie, Jian Yang, Dong Yue

    Abstract: In recent years, how to strike a good trade-off between accuracy and inference speed has become the core issue for real-time semantic segmentation applications, which plays a vital role in real-world scenarios such as autonomous driving systems and drones. In this study, we devise a novel lightweight network using a multi-scale context fusion (MSCFNet) scheme, which explores an asymmetric encoder-…

    Submitted 16 July, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: IEEE Transactions on Intelligent Transportation Systems, 11 pages, 7 figures

  9. Consolidated Dataset and Metrics for High-Dynamic-Range Image Quality

    Authors: Aliaksei Mikhailiuk, Maria Perez-Ortiz, Dingcheng Yue, Wilson Suen, Rafal K. Mantiuk

    Abstract: Increasing popularity of high-dynamic-range (HDR) image and video content brings the need for metrics that could predict the severity of image impairments as seen on displays of different brightness levels and dynamic range. Such metrics should be trained and validated on a sufficiently large subjective image quality dataset to ensure robust performance. As the existing HDR quality datasets are li…

    Submitted 10 May, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

  10. Heterogeneous Swarms for Maritime Dynamic Target Search and Tracking

    Authors: Hian Lee Kwa, Grgur Tokić, Roland Bouffanais, Dick K. P. Yue

    Abstract: Current strategies employed for maritime target search and tracking are primarily based on the use of agents following a predetermined path to perform a systematic sweep of a search area. Recently, dynamic Particle Swarm Optimization (PSO) algorithms have been used together with swarming multi-robot systems (MRS), giving search and tracking solutions the added properties of robustness, scalability…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted for IEEE/MTS OCEANS 2020, Singapore

    Journal ref: IEEE/MTS Global Oceans 2020: Singapore - U.S. Gulf Coast, October 5-30, 2020, online, pp. 1-8

  11. arXiv:2006.14156  [pdf, ps, other]

    eess.SY cs.LG

    Multi-Agent Deep Reinforcement Learning for HVAC Control in Commercial Buildings

    Authors: Liang Yu, Yi Sun, Zhanbo Xu, Chao Shen, Dong Yue, Tao Jiang, Xiaohong Guan

    Abstract: In commercial buildings, about 40%-50% of the total electricity consumption is attributed to Heating, Ventilation, and Air Conditioning (HVAC) systems, which places an economic burden on building operators. In this paper, we intend to minimize the energy cost of an HVAC system in a multi-zone commercial building under dynamic pricing with the consideration of random zone occupancy, thermal comfort…

    Submitted 22 July, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 14 pages, 21 figures, accepted by IEEE Transactions on Smart Grid

  12. arXiv:2004.05691  [pdf, other]

    cs.LG stat.ML

    Active Sampling for Pairwise Comparisons via Approximate Message Passing and Information Gain Maximization

    Authors: Aliaksei Mikhailiuk, Clifford Wilmot, Maria Perez-Ortiz, Dingcheng Yue, Rafal Mantiuk

    Abstract: Pairwise comparison data arise in many domains with subjective assessment experiments, for example in image and video quality assessment. In these experiments observers are asked to express a preference between two conditions. However, many pairwise comparison protocols require a large number of comparisons to infer accurate scores, which may be unfeasible when each comparison is time-consuming (e…

    Submitted 12 April, 2020; originally announced April 2020.

  13. arXiv:1809.03327  [pdf, other]

    cs.CV cs.AI

    YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark

    Authors: Ning Xu, Linjie Yang, Yuchen Fan, Dingcheng Yue, Yuchen Liang, Jianchao Yang, Thomas Huang

    Abstract: Learning long-term spatial-temporal features are critical for many video analysis tasks. However, existing video segmentation methods predominantly rely on static image segmentation techniques, and methods capturing temporal dependency for segmentation have to depend on pretrained optical flow models, leading to suboptimal solutions for the problem. End-to-end sequential learning to explore spatia…

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: Dataset Report. arXiv admin note: substantial text overlap with arXiv:1809.00461

  14. arXiv:1809.00461  [pdf, other]

    cs.CV

    YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

    Authors: Ning Xu, Linjie Yang, Yuchen Fan, Jianchao Yang, Dingcheng Yue, Yuchen Liang, Brian Price, Scott Cohen, Thomas Huang

    Abstract: Learning long-term spatial-temporal features are critical for many video analysis tasks. However, existing video segmentation methods predominantly rely on static image segmentation techniques, and methods capturing temporal dependency for segmentation have to depend on pretrained optical flow models, leading to suboptimal solutions for the problem. End-to-end sequential learning to explore spatia…

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: ECCV 2018 accepted paper

  15. Gradual Collective Upgrade of a Swarm of Autonomous Buoys for Dynamic Ocean Monitoring

    Authors: Francesco Vallegra, David Mateo, Grgur Tokić, Roland Bouffanais, Dick K. P. Yue

    Abstract: Swarms of autonomous surface vehicles equipped with environmental sensors and decentralized communications bring a new wave of attractive possibilities for the monitoring of dynamic features in oceans and other waterbodies. However, a key challenge in swarm robotics design is the efficient collective operation of heterogeneous systems. We present both theoretical analysis and field experiments on…

    Submitted 31 August, 2018; originally announced August 2018.

    Comments: Proceedings of the OCEANS 2018 conference

    Journal ref: OCEANS 2018 MTS/IEEE Charleston, Charleston, S.C., 2018, p. 1-7

  16. arXiv:1801.02987  [pdf, ps, other]

    cs.IT

    Multiplexing Analysis of Millimeter-Wave Massive MIMO Systems

    Authors: Dian-Wu Yue, Ha H. Nguyen, Shuai Xu

    Abstract: This paper is concerned with spatial multiplexing analysis for millimeter-wave (mmWave) massive MIMO systems. For a single-user mmWave system employing distributed antenna subarray architecture in which the transmitter and receiver consist of Kt and Kr subarrays, respectively, an asymptotic multiplexing gain formula is firstly derived when the numbers of antennas at subarrays go to infinity. Speci…

    Submitted 29 July, 2018; v1 submitted 7 January, 2018; originally announced January 2018.

    Comments: 10 pages, 8 figures. arXiv admin note: substantial text overlap with arXiv:1801.00387

  17. arXiv:1801.00387  [pdf, ps, other]

    cs.IT

    Diversity Analysis of Millimeter-Wave Massive MIMO Systems

    Authors: Dian-Wu Yue, Shuai Xu, Ha H. Nguyen

    Abstract: This paper is concerned with asymptotic diversity analysis for millimeter-wave (mmWave) massive MIMO systems. First, for a single-user mmWave system employing distributed antenna subarray architecture in which the transmitter and receiver consist of Kt and Kr subarrays, respectively, a diversity gain theorem is established when the numbers of antennas at subarrays go to infinity. Specifically, ass…

    Submitted 31 December, 2017; originally announced January 2018.

    Comments: 10 pages, 10 figures

  18. Swarm-Enabling Technology for Multi-Robot Systems

    Authors: Mohammadreza Chamanbaz, David Mateo, Brandon M. Zoss, Grgur Tokić, Erik Wilhelm, Roland Bouffanais, Dick K. P. Yue

    Abstract: Swarm robotics has experienced a rapid expansion in recent years, primarily fueled by specialized multi-robot systems developed to achieve dedicated collective actions. These specialized platforms are in general designed with swarming considerations at the front and center. Key hardware and software elements required for swarming are often deeply embedded and integrated with the particular system.…

    Submitted 11 May, 2017; originally announced May 2017.

    Journal ref: Frontiers in Robotics and AI 4 (2017) 12

  19. arXiv:1702.05729  [pdf, other]

    cs.CV

    Person Search with Natural Language Description

    Authors: Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, Xiaogang Wang

    Abstract: Searching persons in large-scale image databases with the query of natural language description has important applications in video surveillance. Existing methods mainly focused on searching persons with image-based or attribute-based queries, which have major limitations for a practical usage. In this paper, we study the problem of person search with natural language description. Given the textua…

    Submitted 30 March, 2017; v1 submitted 19 February, 2017; originally announced February 2017.

  20. arXiv:1404.1654  [pdf, ps, other]

    cs.IT

    LOS-based Conjugate Beamforming and Power-Scaling Law in Massive-MIMO Systems

    Authors: Dian-Wu Yue, Geoffrey Ye Li

    Abstract: This paper is concerned with massive-MIMO systems over Rician flat fading channels. In order to reduce the overhead to obtain full channel state information and to avoid the pilot contamination problem, by treating the scattered component as interference, we investigate a transmit and receive conjugate beamforming (BF) transmission scheme only based on the line-of-sight (LOS) component. Under Rank…

    Submitted 8 December, 2014; v1 submitted 7 April, 2014; originally announced April 2014.

    Comments: 32 pages, 11 figures

  21. arXiv:1403.6561  [pdf, ps, other]

    cs.IT

    Transmit Power Minimization for MIMO Systems of Exponential Average BER with Fixed Outage Probability

    Authors: Dian-Wu Yue, Yichuang Sun

    Abstract: This paper is concerned with a wireless system operating in MIMO fading channels with channel state information being known at both transmitter and receiver. By spatiotemporal subchannel selection and power control, it aims to minimize the average transmit power (ATP) of the MIMO system while achieving an exponential type of average bit error rate (BER) for each data stream. Under the constraints…

    Submitted 25 March, 2014; originally announced March 2014.

    Comments: 20 pages, 4 figures