Skip to main content

Showing 1–50 of 90 results for author: Sun, C

Searching in archive eess. Search in all archives.
.
  1. Learning Autonomous Race Driving with Action Map** Reinforcement Learning

    Authors: Yuanda Wang, Xin Yuan, Changyin Sun

    Abstract: Autonomous race driving poses a complex control challenge as vehicles must be operated at the edge of their handling limits to reduce lap times while respecting physical and safety constraints. This paper presents a novel reinforcement learning (RL)-based approach, incorporating the action map** (AM) mechanism to manage state-dependent input constraints arising from limited tire-road friction. A… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.20595  [pdf, other

    eess.SP

    Multi-Beam Integrated Sensing and Communication: State-of-the-Art, Challenges and Opportunities

    Authors: Yinxiao Zhuo, Tianqi Mao, Hao** Li, Chen Sun, Zhaocheng Wang, Zhu Han, Sheng Chen

    Abstract: Integrated sensing and communication (ISAC) has been envisioned as a critical enabling technology for the next-generation wireless communication, which can realize location/motion detection of surroundings with communication devices. This additional sensing capability leads to a substantial network quality gain and expansion of the service scenarios. As the system evolves to millimeter wave (mmWav… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.10116  [pdf, other

    eess.SY eess.SP

    Enhancing Energy Efficiency in O-RAN Through Intelligent xApps Deployment

    Authors: Xuanyu Liang, Ahmed Al-Tahmeesschi, Qiao Wang, Swarna Chetty, Chenrui Sun, Hamed Ahmadi

    Abstract: The proliferation of 5G technology presents an unprecedented challenge in managing the energy consumption of densely deployed network infrastructures, particularly Base Stations (BSs), which account for the majority of power usage in mobile networks. The O-RAN architecture, with its emphasis on open and intelligent design, offers a promising framework to address the Energy Efficiency (EE) demands… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures

  4. arXiv:2405.10087  [pdf, other

    eess.SP

    Continuous Transfer Learning for UAV Communication-aware Trajectory Design

    Authors: Chenrui Sun, Gianluca Fontanesi, Swarna Bindu Chetty, Xuanyu Liang, Berk Canberk, Hamed Ahmadi

    Abstract: Deep Reinforcement Learning (DRL) emerges as a prime solution for Unmanned Aerial Vehicle (UAV) trajectory planning, offering proficiency in navigating high-dimensional spaces, adaptability to dynamic environments, and making sequential decisions based on real-time feedback. Despite these advantages, the use of DRL for UAV trajectory planning requires significant retraining when the UAV is confron… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 6 pages

  5. arXiv:2404.07425  [pdf, ps, other

    eess.SP cs.IT

    Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization

    Authors: Rui Sun, Li You, An-An Lu, Chen Sun, Xiqi Gao, Xiang-Gen Xia

    Abstract: In this paper, we investigate the precoder design for user-centric network (UCN) massive multiple-input multiple-output (mMIMO) downlink with matrix manifold optimization. In UCN mMIMO systems, each user terminal (UT) is served by a subset of base stations (BSs) instead of all the BSs, facilitating the implementation of the system and lowering the dimension of the precoders to be designed. By prov… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 13 pages, 9 figures, journal

  6. arXiv:2403.20091  [pdf, other

    cs.IT eess.SP

    A Signature Based Approach Towards Global Channel Charting with Ultra Low Complexity

    Authors: Longhai Zhao, Yunchuan Yang, Qi Xiong, He Wang, Bin Yu, Feifei Sun, Chengjun Sun

    Abstract: Channel charting, an unsupervised learning method that learns a low-dimensional representation from channel information to preserve geometrical property of physical space of user equipments (UEs), has drawn many attentions from both academic and industrial communities, because it can facilitate many downstream tasks, such as indoor localization, UE handover, beam management, and so on. However, ma… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: accepted by IEEE ICC 2024 Workshops

  7. arXiv:2403.18843  [pdf, other

    cs.CV cs.CL cs.LG cs.SD eess.AS

    JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition

    Authors: Chang Sun, Hong Yang, Bo Qin

    Abstract: Visual Speech Recognition (VSR) tasks are generally recognized to have a lower theoretical performance ceiling than Automatic Speech Recognition (ASR), owing to the inherent limitations of conveying semantic information visually. To mitigate this challenge, this paper introduces an advanced knowledge distillation approach using a Joint-Embedding Predictive Architecture (JEPA), named JEP-KD, design… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  8. arXiv:2403.16136  [pdf, ps, other

    eess.SY

    Data-Driven Sliding Mode Control for Partially Unknown Nonlinear Systems

    Authors: Jianglin Lan, Xianxian Zhao, Congcong Sun

    Abstract: This paper introduces a new design method for data-driven control of nonlinear systems with partially unknown dynamics and unknown bounded disturbance. Since it is not possible to achieve exact nonlinearity cancellation in the presence of unknown disturbance, this paper adapts the idea of sliding mode control (SMC) to ensure system stability and robustness without assuming that the nonlinearity go… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE CDC 2024

  9. arXiv:2403.15468  [pdf, other

    eess.SP

    Human Detection in Realistic Through-the-Wall Environments using Raw Radar ADC Data and Parametric Neural Networks

    Authors: Wei Wang, Naike Du, Yuchao Guo, Chao Sun, **gyang Liu, Rencheng Song, Xiuzhu Ye

    Abstract: The radar signal processing algorithm is one of the core components in through-wall radar human detection technology. Traditional algorithms (e.g., DFT and matched filtering) struggle to adaptively handle low signal-to-noise ratio echo signals in challenging and dynamic real-world through-wall application environments, which becomes a major bottleneck in the system. In this paper, we introduce an… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 11pages,13figures

  10. arXiv:2403.13820  [pdf, other

    cs.LG cs.CR eess.SP

    Identity information based on human magnetocardiography signals

    Authors: Pengju Zhang, Chenxi Sun, Jianwei Zhang, Hong Guo

    Abstract: We have developed an individual identification system based on magnetocardiography (MCG) signals captured using optically pumped magnetometers (OPMs). Our system utilizes pattern recognition to analyze the signals obtained at different positions on the body, by scanning the matrices composed of MCG signals with a 2*2 window. In order to make use of the spatial information of MCG signals, we transf… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures. Author manuscript accepted for AAAI 2024 Spring Symposium on Clinical Foundation Models

  11. arXiv:2402.09424  [pdf, other

    eess.SP cs.CV cs.LG cs.NE

    Epilepsy Seizure Detection and Prediction using an Approximate Spiking Convolutional Transformer

    Authors: Qinyu Chen, Congyi Sun, Chang Gao, Shih-Chii Liu

    Abstract: Epilepsy is a common disease of the nervous system. Timely prediction of seizures and intervention treatment can significantly reduce the accidental injury of patients and protect the life and health of patients. This paper presents a neuromorphic Spiking Convolutional Transformer, named Spiking Conformer, to detect and predict epileptic seizure segments from scalped long-term electroencephalogram… ▽ More

    Submitted 21 January, 2024; originally announced February 2024.

    Comments: To be published at the 2024 IEEE International Symposium on Circuits and Systems (ISCAS), Singapore

  12. arXiv:2401.03396  [pdf

    eess.SP

    A Closed-loop Brain-Machine Interface SoC Featuring a 0.2$μ$J/class Multiplexer Based Neural Network

    Authors: Chao Zhang, Yongxiang Guo, Dawid Sheng, Zhixiong Ma, Chao Sun, Yuwei Zhang, Wenxin Zhao, Fenyan Zhang, Tongfei Wang, Xing Sheng, Milin Zhang

    Abstract: This work presents the first fabricated electrophysiology-optogenetic closed-loop bidirectional brain-machine interface (CL-BBMI) system-on-chip (SoC) with electrical neural signal recording, on-chip sleep staging and optogenetic stimulation. The first multiplexer with static assignment based table lookup solution (MUXnet) for multiplier-free NN processor was proposed. A state-of-the-art average a… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 2 pages, 6 figures. Accepted by IEEE Custom Integrated Circuits Conference (CICC) 2024. The codes for the MUXnet (constructing neural networks using multiplexers instead of multipliers) will be open-sourced after the Journal version of this work is accepted

  13. arXiv:2312.01970  [pdf, other

    cs.NI eess.SY

    CaRL: Cascade Reinforcement Learning with State Space Splitting for O-RAN based Traffic Steering

    Authors: Chuanneng Sun, Yu Zhou, Gueyoung Jung, Tuyen Xuan Tran, Dario Pompili

    Abstract: The Open Radio Access Network (O-RAN) architecture empowers intelligent and automated optimization of the RAN through applications deployed on the RAN Intelligent Controller (RIC) platform, enabling capabilities beyond what is achievable with traditional RAN solutions. Within this paradigm, Traffic Steering (TS) emerges as a pivotal RIC application that focuses on optimizing cell-level mobility se… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 14 pages, 10 figures

    ACM Class: C.2.3; I.2.8

  14. arXiv:2312.00308  [pdf, other

    cs.CV eess.IV stat.AP

    A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing

    Authors: Longfeng Nie, Yuntian Chen, Mengge Du, Changqi Sun, Dongxiao Zhang

    Abstract: Cloud types, as a type of meteorological data, are of particular significance for evaluating changes in rainfall, heatwaves, water resources, floods and droughts, food security and vegetation cover, as well as land use. In order to effectively utilize high-resolution geostationary observations, a knowledge-based data-driven (KBDD) framework for all-day identification of cloud types based on spectr… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  15. arXiv:2311.11255  [pdf, other

    cs.SD cs.MM eess.AS

    M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models

    Authors: Shansong Liu, Atin Sakkeer Hussain, Chenshuo Sun, Ying Shan

    Abstract: The current landscape of research leveraging large language models (LLMs) is experiencing a surge. Many works harness the powerful reasoning capabilities of these models to comprehend various modalities, such as text, speech, images, videos, etc. They also utilize LLMs to understand human intention and generate desired outputs like images, videos, and music. However, research that combines both un… ▽ More

    Submitted 4 March, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  16. arXiv:2310.12987  [pdf, other

    eess.IV cs.CV cs.GR

    Spec-NeRF: Multi-spectral Neural Radiance Fields

    Authors: Jiabao Li, Yuqi Li, Ciliang Sun, Chong Wang, **hui Xiang

    Abstract: We propose Multi-spectral Neural Radiance Fields(Spec-NeRF) for jointly reconstructing a multispectral radiance field and spectral sensitivity functions(SSFs) of the camera from a set of color images filtered by different filters. The proposed method focuses on modeling the physical imaging process, and applies the estimated SSFs and radiance field to synthesize novel views of multispectral scenes… ▽ More

    Submitted 14 September, 2023; originally announced October 2023.

  17. arXiv:2310.02459  [pdf, other

    cs.LG cs.RO eess.SY

    Distributionally Safe Reinforcement Learning under Model Uncertainty: A Single-Level Approach by Differentiable Convex Programming

    Authors: Alaa Eddine Chriat, Chuangchuang Sun

    Abstract: Safety assurance is uncompromisable for safety-critical environments with the presence of drastic model uncertainties (e.g., distributional shift), especially with humans in the loop. However, incorporating uncertainty in safe learning will naturally lead to a bi-level problem, where at the lower level the (worst-case) safety constraint is evaluated within the uncertainty ambiguity set. In this pa… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  18. arXiv:2309.13922  [pdf, other

    eess.SP

    Track-before-detect Algorithm based on Cost-reference Particle Filter Bank for Weak Target Detection

    Authors: ** Lu, Guojie Peng, Weichuan Zhang, Changming Sun

    Abstract: Detecting weak target is an important and challenging problem in many applications such as radar, sonar etc. However, conventional detection methods are often ineffective in this case because of low signal-to-noise ratio (SNR). This paper presents a track-before-detect (TBD) algorithm based on an improved particle filter, i.e. cost-reference particle filter bank (CRPFB), which turns the problem of… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  19. arXiv:2309.08700  [pdf, other

    cs.RO cs.LG eess.SY

    Wasserstein Distributionally Robust Control Barrier Function using Conditional Value-at-Risk with Differentiable Convex Programming

    Authors: Alaa Eddine Chriat, Chuangchuang Sun

    Abstract: Control Barrier functions (CBFs) have attracted extensive attention for designing safe controllers for their deployment in real-world safety-critical systems. However, the perception of the surrounding environment is often subject to stochasticity and further distributional shift from the nominal one. In this paper, we present distributional robust CBF (DR-CBF) to achieve resilience under distribu… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  20. arXiv:2309.00723  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Contextual Biasing of Named-Entities with Large Language Models

    Authors: Chuanneng Sun, Zeeshan Ahmed, Yingyi Ma, Zhe Liu, Lucas Kabela, Yutong Pang, Ozlem Kalinli

    Abstract: This paper studies contextual biasing with Large Language Models (LLMs), where during second-pass rescoring additional contextual information is provided to a LLM to boost Automatic Speech Recognition (ASR) performance. We propose to leverage prompts for a LLM without fine tuning during rescoring which incorporate a biasing list and few-shot examples to serve as additional information when calcula… ▽ More

    Submitted 21 September, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: 5 pages, 4 figures. Conference: ICASSP 2024

    MSC Class: 68T10 ACM Class: I.2.7

  21. arXiv:2308.16021  [pdf, other

    cs.SD eess.AS

    CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis

    Authors: Yi Meng, Xiang Li, Zhiyong Wu, Tingtian Li, Zixun Sun, Xinyu Xiao, Chi Sun, Hui Zhan, Helen Meng

    Abstract: To further improve the speaking styles of synthesized speeches, current text-to-speech (TTS) synthesis systems commonly employ reference speeches to stylize their outputs instead of just the input texts. These reference speeches are obtained by manual selection which is resource-consuming, or selected by semantic features. However, semantic features contain not only style-related information, but… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted by InterSpeech 2022

  22. arXiv:2308.11276  [pdf, other

    cs.SD cs.AI cs.CL cs.MM eess.AS

    Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning

    Authors: Shansong Liu, Atin Sakkeer Hussain, Chenshuo Sun, Ying Shan

    Abstract: Text-to-music generation (T2M-Gen) faces a major obstacle due to the scarcity of large-scale publicly available music datasets with natural language captions. To address this, we propose the Music Understanding LLaMA (MU-LLaMA), capable of answering music-related questions and generating captions for music files. Our model utilizes audio representations from a pretrained MERT model to extract musi… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  23. Artificial-Intelligence-Based Triple Phase Shift Modulation for Dual Active Bridge Converter with Minimized Current Stress

    Authors: Xinze Li, Xin Zhang, Fanfan Lin, Changjiang Sun, Kezhi Mao

    Abstract: The dual active bridge (DAB) converter has been popular in many applications for its outstanding power density and bidirectional power transfer capacity. Up to now, triple phase shift (TPS) can be considered as one of the most advanced modulation techniques for DAB converter. It can widen zero voltage switching range and improve power efficiency significantly. Currently, current stress of the DAB… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 12 pages, 29 figures

  24. Artificial-Intelligence-Based Hybrid Extended Phase Shift Modulation for the Dual Active Bridge Converter with Full ZVS Range and Optimal Efficiency

    Authors: Xinze Li, Xin Zhang, Fanfan Lin, Changjiang Sun, Kezhi Mao

    Abstract: Dual active bridge (DAB) converter is the key enabler in many popular applications such as wireless charging, electric vehicle and renewable energy. ZVS range and efficiency are two significant performance indicators for DAB converter. To obtain the desired ZVS and efficiency performance, modulation should be carefully designed. Hybrid modulation considers several single modulation strategies to a… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 13 pages, 32 figures

  25. arXiv:2307.06615  [pdf, other

    eess.SY cs.CV cs.NI

    NLOS Dies Twice: Challenges and Solutions of V2X for Cooperative Perception

    Authors: Lantao Li, Chen Sun

    Abstract: Multi-agent multi-lidar sensor fusion between connected vehicles for cooperative perception has recently been recognized as the best technique for minimizing the blind zone of individual vehicular perception systems and further enhancing the overall safety of autonomous driving systems. This technique relies heavily on the reliability and availability of vehicle-to-everything (V2X) communication.… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Submission to IEEE Vehicular Technology Magazine

  26. arXiv:2307.04402  [pdf

    stat.ME eess.SY

    Moving pattern-based modeling using a new type of interval ARX model

    Authors: Chang** Sun

    Abstract: In this paper,firstly,to overcome the shortcoming of traditional ARX model, a new operator between an interval number and a real matrix is defined, and then it is applied to the traditional ARX model to get a new type of structure interval ARX model that can deal with interval data, which is defined as interval ARX model (IARX). Secondly,the IARX model is applied to moving pattern-based modeling.… ▽ More

    Submitted 12 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  27. arXiv:2306.17434  [pdf

    eess.IV

    A Motion Assessment Method for Reference Stack Selection in Fetal Brain MRI Reconstruction Based on Tensor Rank Approximation

    Authors: Haoan Xu, Wen Shi, Jiwei Sun, Tianshu Zheng, Cong Sun, Sun Yi, Guangbin Wang, Dan Wu

    Abstract: Purpose: Slice-to-volume registration and super-resolution reconstruction (SVR-SRR) is commonly used to generate 3D volumes of the fetal brain from 2D stacks of slices acquired in multiple orientations. A critical initial step in this pipeline is to select one stack with the minimum motion as a reference for registration. An accurate and unbiased motion assessment (MA) is thus crucial for successf… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 6 figures. Correspondence to: Dan Wu, Ph.D. E-mail: [email protected]

  28. arXiv:2305.12107  [pdf, other

    cs.SD cs.CL eess.AS

    EE-TTS: Emphatic Expressive TTS with Linguistic Information

    Authors: Yi Zhong, Chen Zhang, Xule Liu, Chenxi Sun, Weishan Deng, Haifeng Hu, Zhongqian Sun

    Abstract: While Current TTS systems perform well in synthesizing high-quality speech, producing highly expressive speech remains a challenge. Emphasis, as a critical factor in determining the expressiveness of speech, has attracted more attention nowadays. Previous works usually enhance the emphasis by adding intermediate features, but they can not guarantee the overall expressiveness of the speech. To reso… ▽ More

    Submitted 14 April, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted by Interspeech 2023, fix some typos

  29. arXiv:2305.03608  [pdf, other

    cs.LG cs.RO eess.SY math.OC

    On the Optimality, Stability, and Feasibility of Control Barrier Functions: An Adaptive Learning-Based Approach

    Authors: Alaa Eddine Chriat, Chuangchuang Sun

    Abstract: Safety has been a critical issue for the deployment of learning-based approaches in real-world applications. To address this issue, control barrier function (CBF) and its variants have attracted extensive attention for safety-critical control. However, due to the myopic one-step nature of CBF and the lack of principled methods to design the class-$\mathcal{K}$ functions, there are still fundamenta… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  30. arXiv:2304.13085  [pdf, other

    cs.SD cs.MM eess.AS

    AI-Synthesized Voice Detection Using Neural Vocoder Artifacts

    Authors: Chengzhe Sun, Shan Jia, Shuwei Hou, Siwei Lyu

    Abstract: Advancements in AI-synthesized human voices have created a growing threat of impersonation and disinformation, making it crucial to develop methods to detect synthetic human voices. This study proposes a new approach to identifying synthetic human voices by detecting artifacts of vocoders in audio signals. Most DeepFake audio synthesis models use a neural vocoder, a neural network that generates w… ▽ More

    Submitted 27 April, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Paper accepted in CVPRW 2023. Codes and data can be found at https://github.com/csun22/Synthetic-Voice-Detection-Vocoder-Artifacts. arXiv admin note: substantial text overlap with arXiv:2302.09198

  31. Filter-informed Spectral Graph Wavelet Networks for Multiscale Feature Extraction and Intelligent Fault Diagnosis

    Authors: Tianfu Li, Chuang Sun, Olga Fink, Yuangui Yang, Xuefeng Chen, Ruqiang Yan

    Abstract: Intelligent fault diagnosis has been increasingly improved with the evolution of deep learning (DL) approaches. Recently, the emerging graph neural networks (GNNs) have also been introduced in the field of fault diagnosis with the goal to make better use of the inductive bias of the interdependencies between the different sensor measurements. However, there are some limitations with these GNN-base… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Journal ref: IEEE Transactions on Cybernetics,2023

  32. arXiv:2302.09198  [pdf, other

    cs.SD cs.MM eess.AS

    Exposing AI-Synthesized Human Voices Using Neural Vocoder Artifacts

    Authors: Chengzhe Sun, Shan Jia, Shuwei Hou, Ehab AlBadawy, Siwei Lyu

    Abstract: The advancements of AI-synthesized human voices have introduced a growing threat of impersonation and disinformation. It is therefore of practical importance to developdetection methods for synthetic human voices. This work proposes a new approach to detect synthetic human voices based on identifying artifacts of neural vocoders in audio signals. A neural vocoder is a specially designed neural net… ▽ More

    Submitted 27 April, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Dataset and codes will be available at https://github.com/csun22/LibriVoc-Dataset

  33. arXiv:2301.13385  [pdf

    cs.CV eess.IV

    Fisheye traffic data set of point center markers

    Authors: Chung-I Huang, Wei-Yu Chen, Wei Jan Ko, Jih-Sheng Chang, Chen-Kai Sun, Hui Hung Yu, Fang-Pang Lin

    Abstract: This study presents an open data-market platform and a dataset containing 160,000 markers and 18,000 images. We hope that this dataset will bring more new data value and applications In this paper, we introduce the format and usage of the dataset, and we show a demonstration of deep learning vehicle detection trained by this dataset.

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: https://youtu.be/sjUQ-Ayxxtk

  34. arXiv:2212.07656  [pdf, other

    eess.SY

    Hybrid stability augmentation control of multi-rotor UAV in confined space based on adaptive backstep** control

    Authors: QuanXi Zhan, JunRui Zhang, ChenYang Sun, RunJie Shen, Bin He

    Abstract: This paper applies the UAV to the inspection of water diversion pipelines in hydropower stations. The diversion pipeline is an enclosed space, so the airflow disturbance caused by the rotation of the UAV blades and the strong air convection from the chimney effect have a great impact on the flight control of the UAV. Although the traditional linear control PID flight control algorithm has been wid… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: 7 pages

  35. arXiv:2212.03848  [pdf, other

    cs.CV cs.GR eess.IV

    NeRFEditor: Differentiable Style Decomposition for Full 3D Scene Editing

    Authors: Chunyi Sun, Yanbin Liu, Junlin Han, Stephen Gould

    Abstract: We present NeRFEditor, an efficient learning framework for 3D scene editing, which takes a video captured over 360° as input and outputs a high-quality, identity-preserving stylized 3D scene. Our method supports diverse types of editing such as guided by reference images, text prompts, and user interactions. We achieve this by encouraging a pre-trained StyleGAN model and a NeRF model to learn from… ▽ More

    Submitted 8 December, 2022; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: Project page: https://chuny1.github.io/NeRFEditor/nerfeditor.html

  36. arXiv:2210.07539  [pdf

    cs.CV eess.IV

    Superpixel Perception Graph Neural Network for Intelligent Defect Detection

    Authors: Hongbing Shang, Qixiu Yang, Chuang Sun, Xuefeng Chen, Ruqiang Yan

    Abstract: Aero-engine is the core component of aircraft and other spacecraft. The high-speed rotating blades provide power by sucking in air and fully combusting, and various defects will inevitably occur, threatening the operation safety of aero-engine. Therefore, regular inspections are essential for such a complex system. However, existing traditional technology which is borescope inspection is labor-int… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  37. arXiv:2210.05258  [pdf, other

    eess.IV cs.CV cs.LG

    EOCSA: Predicting Prognosis of Epithelial Ovarian Cancer with Whole Slide Histopathological Images

    Authors: Tianling Liu, Ran Su, Changming Sun, Xiuting Li, Leyi Wei

    Abstract: Ovarian cancer is one of the most serious cancers that threaten women around the world. Epithelial ovarian cancer (EOC), as the most commonly seen subtype of ovarian cancer, has rather high mortality rate and poor prognosis among various gynecological cancers. Survival analysis outcome is able to provide treatment advices to doctors. In recent years, with the development of medical imaging technol… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Published in Expert Systems with Applications 2022

  38. arXiv:2209.03437  [pdf, other

    eess.SY

    An efficient approach for nonconvex semidefinite optimization via customized alternating direction method of multipliers

    Authors: Chuangchuang Sun

    Abstract: We investigate a class of general combinatorial graph problems, including MAX-CUT and community detection, reformulated as quadratic objectives over nonconvex constraints and solved via the alternating direction method of multipliers (ADMM). We propose two reformulations: one using vector variables and a binary constraint, and the other further reformulating the Burer-Monteiro form for simpler s… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: text overlap with arXiv:1805.10678

  39. arXiv:2206.07684  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    AVATAR: Unconstrained Audiovisual Speech Recognition

    Authors: Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid

    Abstract: Audio-visual automatic speech recognition (AV-ASR) is an extension of ASR that incorporates visual cues, often from the movements of a speaker's mouth. Unlike works that simply focus on the lip motion, we investigate the contribution of entire visual frames (visual actions, objects, background etc.). This is particularly useful for unconstrained videos, where the speaker is not necessarily visible… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  40. arXiv:2205.06450  [pdf, other

    eess.SP cs.CV

    A microstructure estimation Transformer inspired by sparse representation for diffusion MRI

    Authors: Tianshu Zheng, Cong Sun, Weihao Zheng, Wen Shi, Haotian Li, Yi Sun, Yi Zhang, Guangbin Wang, Chuyang Ye, Dan Wu

    Abstract: Diffusion magnetic resonance imaging (dMRI) is an important tool in characterizing tissue microstructure based on biophysical models, which are complex and highly non-linear. Resolving microstructures with optimization techniques is prone to estimation errors and requires dense sampling in the q-space. Deep learning based approaches have been proposed to overcome these limitations. Motivated by th… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

  41. AFFIRM: Affinity Fusion-based Framework for Iteratively Random Motion correction of multi-slice fetal brain MRI

    Authors: Wen Shi, Haoan Xu, Cong Sun, Jiwei Sun, Yamin Li, Xinyi Xu, Tianshu Zheng, Yi Zhang, Guangbin Wang, Dan Wu

    Abstract: Multi-slice magnetic resonance images of the fetal brain are usually contaminated by severe and arbitrary fetal and maternal motion. Hence, stable and robust motion correction is necessary to reconstruct high-resolution 3D fetal brain volume for clinical diagnosis and quantitative analysis. However, the conventional registration-based correction has a limited capture range and is insufficient for… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

  42. arXiv:2204.00679  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Learning Audio-Video Modalities from Image Captions

    Authors: Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid

    Abstract: A major challenge in text-video and text-audio retrieval is the lack of large-scale training data. This is unlike image-captioning, where datasets are in the order of millions of samples. To close this gap we propose a new video mining pipeline which involves transferring captions from image captioning datasets to video clips with no additional manual effort. Using this pipeline, we create a new l… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

  43. arXiv:2203.06328  [pdf, other

    cs.CV eess.IV

    Image Style Transfer: from Artistic to Photorealistic

    Authors: Chenggui Sun, Li Bin Song

    Abstract: The rapid advancement of deep learning has significantly boomed the development of photorealistic style transfer. In this review, we reviewed the development of photorealistic style transfer starting from artistic style transfer and the contribution of traditional image processing techniques on photorealistic style transfer, including some work that had been completed in the Multimedia lab at the… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

  44. arXiv:2203.03107  [pdf, other

    cs.MM cs.NI eess.SY

    Privacy Leakage in Proactive VR Streaming: Modeling and Tradeoff

    Authors: Xing Wei, Chenyang Yang, Chengjian Sun

    Abstract: Proactive tile-based virtual reality (VR) video streaming employs the viewpoint of a user to predict the tiles to be requested, renders and delivers the predicted tiles before playback. Recently, it has been found that the identity and preference of the user can be inferred from the trace of viewpoint uploaded for proactive streaming, which indicates that viewpoint leakage incurs privacy leakage.… ▽ More

    Submitted 10 April, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: 30 pages, 9 figures, submit to IEEE for possible publication, the proofs in this version of the manuscript is omitted and can be found in version 1

  45. arXiv:2202.13565  [pdf, other

    eess.SY

    A Holistic Review on Advanced Bi-directional EV Charging Control Algorithms

    Authors: Xiaoying Tang, Chenxi Sun, Suzhi Bi, Shuoyao Wang, Angela Yingjun Zhang

    Abstract: The rapid growth of electric vehicles (EVs) has promised a next-generation transportation system with reduced carbon emission. The fast development of EVs and charging facilities is driving the evolution of Internet of Vehicles (IoV) to Internet of Electric Vehicles (IoEV). IoEV benefits from both smart grid and Internet of Things (IoT) technologies which provide advanced bi-directional charging s… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

    Journal ref: ACM SIGEnergy Energy Informatics Review Volume 1 Issue 1 November 2021 pp 78-88

  46. arXiv:2202.09609  [pdf, other

    eess.IV cs.CV

    A Lightweight Dual-Domain Attention Framework for Sparse-View CT Reconstruction

    Authors: Chang Sun, Ken Deng, Yitong Liu, Hongwen Yang

    Abstract: Computed Tomography (CT) plays an essential role in clinical diagnosis. Due to the adverse effects of radiation on patients, the radiation dose is expected to be reduced as low as possible. Sparse sampling is an effective way, but it will lead to severe artifacts on the reconstructed CT image, thus sparse-view CT image reconstruction has been a prevailing and challenging research area. With the po… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

  47. arXiv:2202.06668  [pdf, other

    eess.SP cs.IT

    Resource allocation for reconfigurable intelligent surface aided broadcast channels

    Authors: Cong Sun, Xian Liu, Bile Peng, Eduard Jorswieck

    Abstract: A two-user downlink network aided by a reconfigurable intelligent surface is considered. The weighted sum signal to interference plus noise ratio maximization and the sum rate maximization models are presented, where the precoding vectors and the RIS matrix are jointly optimized. Since the optimization problem is non-convex and difficult, new approximation models are proposed. The upper bounds of… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  48. arXiv:2201.02834  [pdf, other

    eess.SP cs.LG

    Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network

    Authors: Bile Peng, Jan-Aike Termöhlen, Cong Sun, Dan** He, Ke Guan, Tim Fingscheidt, Eduard A. Jorswieck

    Abstract: Reconfigurable intelligent surface (RIS) is an emerging technology for future wireless communication systems. In this work, we consider downlink spatial multiplexing enabled by the RIS for weighted sum-rate (WSR) maximization. In the literature, most solutions use alternating gradient-based optimization, which has moderate performance, high complexity, and limited scalability. We propose to apply… ▽ More

    Submitted 21 September, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

  49. arXiv:2112.03511  [pdf, other

    cs.RO cs.CR eess.SY

    Control Parameters Considered Harmful: Detecting Range Specification Bugs in Drone Configuration Modules via Learning-Guided Search

    Authors: Ruidong Han, Chao Yang, Siqi Ma, JiangFeng Ma, Cong Sun, Juanru Li, Elisa Bertino

    Abstract: In order to support a variety of missions and deal with different flight environments, drone control programs typically provide configurable control parameters. However, such a flexibility introduces vulnerabilities. One such vulnerability, referred to as range specification bugs, has been recently identified. The vulnerability originates from the fact that even though each individual parameter re… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted to ICSE2022 Technical Track

  50. arXiv:2109.05159  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Co-Correcting: Noise-tolerant Medical Image Classification via mutual Label Correction

    Authors: Jiarun Liu, Ruirui Li, Chuan Sun

    Abstract: With the development of deep learning, medical image classification has been significantly improved. However, deep learning requires massive data with labels. While labeling the samples by human experts is expensive and time-consuming, collecting labels from crowd-sourcing suffers from the noises which may degenerate the accuracy of classifiers. Therefore, approaches that can effectively handle la… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: IEEE Transactions on Medical Imaging 2021