Skip to main content

Showing 1–50 of 90 results for author: Zhou, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.00995  [pdf, other

    cs.CY eess.SY physics.app-ph

    Data on the Move: Traffic-Oriented Data Trading Platform Powered by AI Agent with Common Sense

    Authors: Yi Yu, Shengyue Yao, Tianchen Zhou, Yexuan Fu, **gru Yu, Ding Wang, Xuhong Wang, Cen Chen, Yilun Lin

    Abstract: In the digital era, data has become a pivotal asset, advancing technologies such as autonomous driving. Despite this, data trading faces challenges like the absence of robust pricing methods and the lack of trustworthy trading mechanisms. To address these challenges, we introduce a traffic-oriented data trading platform named Data on The Move (DTM), integrating traffic simulation, data trading, an… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.18313  [pdf, other

    cs.SD cs.CL eess.AS

    Advancing Airport Tower Command Recognition: Integrating Squeeze-and-Excitation and Broadcasted Residual Learning

    Authors: Yuanxi Lin, Tonglin Zhou, Yang Xiao

    Abstract: Accurate recognition of aviation commands is vital for flight safety and efficiency, as pilots must follow air traffic control instructions precisely. This paper addresses challenges in speech command recognition, such as noisy environments and limited computational resources, by advancing keyword spotting technology. We create a dataset of standardized airport tower commands, including routine an… ▽ More

    Submitted 28 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted by IALP 2024

  3. arXiv:2406.12463  [pdf, other

    cs.CV eess.IV

    LFMamba: Light Field Image Super-Resolution with State Space Model

    Authors: Wang xia, Yao Lu, Shunzhou Wang, Ziqi Wang, Peiqi Xia, Tianfei Zhou

    Abstract: Recent years have witnessed significant advancements in light field image super-resolution (LFSR) owing to the progress of modern neural networks. However, these methods often face challenges in capturing long-range dependencies (CNN-based) or encounter quadratic computational complexities (Transformer-based), which limit their performance. Recently, the State Space Model (SSM) with selective scan… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2406.00956  [pdf, other

    cs.CV cs.LG eess.IV

    Improving Segment Anything on the Fly: Auxiliary Online Learning and Adaptive Fusion for Medical Image Segmentation

    Authors: Tianyu Huang, Tao Zhou, Weidi Xie, Shuo Wang, Qi Dou, Yizhe Zhang

    Abstract: The current variants of the Segment Anything Model (SAM), which include the original SAM and Medical SAM, still lack the capability to produce sufficiently accurate segmentation for medical images. In medical imaging contexts, it is not uncommon for human experts to rectify segmentations of specific test samples after SAM generates its segmentation predictions. These rectifications typically entai… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Project Link: https://sam-auxol.github.io/AuxOL/

  5. arXiv:2404.15163  [pdf, other

    cs.CV eess.IV

    Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment

    Authors: Tianwei Zhou, Songbai Tan, Wei Zhou, Yu Luo, Yuan-Gen Wang, Guanghui Yue

    Abstract: With the increasing maturity of the text-to-image and image-to-image generative models, AI-generated images (AGIs) have shown great application potential in advertisement, entertainment, education, social media, etc. Although remarkable advancements have been achieved in generative models, very few efforts have been paid to design relevant quality assessment models. In this paper, we propose a nov… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: IEEE Transactions on Broadcasting (TBC)

  6. arXiv:2402.09729  [pdf, other

    cs.AI eess.SY

    Federated Prompt-based Decision Transformer for Customized VR Services in Mobile Edge Computing System

    Authors: Tailin Zhou, Jiadong Yu, Jun Zhang, Danny H. K. Tsang

    Abstract: This paper investigates resource allocation to provide heterogeneous users with customized virtual reality (VR) services in a mobile edge computing (MEC) system. We first introduce a quality of experience (QoE) metric to measure user experience, which considers the MEC system's latency, user attention levels, and preferred resolutions. Then, a QoE maximization problem is formulated for resource al… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  7. arXiv:2401.01151  [pdf, ps, other

    eess.SY physics.app-ph

    Identification of Secondary Resonances of Nonlinear Systems using Phase-Locked Loop Testing

    Authors: Tong Zhou, Gaetan Kerschen

    Abstract: One unique feature of nonlinear dynamical systems is the existence of superharmonic and subharmonic resonances in addition to primary resonances. In this study, an effective vibration testing methodology is introduced for the experimental identification of these secondary resonances. The proposed method relies on phase-locked loop control combined with adaptive filters for online Fourier decomposi… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 20 pages, 24 figures

  8. arXiv:2312.09899  [pdf, other

    eess.IV cs.CV cs.LG

    SQA-SAM: Segmentation Quality Assessment for Medical Images Utilizing the Segment Anything Model

    Authors: Yizhe Zhang, Shuo Wang, Tao Zhou, Qi Dou, Danny Z. Chen

    Abstract: Segmentation quality assessment (SQA) plays a critical role in the deployment of a medical image based AI system. Users need to be informed/alerted whenever an AI system generates unreliable/incorrect predictions. With the introduction of the Segment Anything Model (SAM), a general foundation segmentation model, new research opportunities emerged in how one can utilize SAM for medical image segmen… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Work in progress;

  9. arXiv:2312.09576  [pdf, other

    eess.IV cs.CV

    SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

    Authors: Xiangde Luo, Jia Fu, Yunxin Zhong, Shuolin Liu, Bing Han, Mehdi Astaraki, Simone Bendazzoli, Iuliana Toma-Dasu, Yiwen Ye, Ziyang Chen, Yong Xia, Yanzhou Su, ** Ye, Junjun He, Zhaohu Xing, Hongqiu Wang, Lei Zhu, Kaixiang Yang, Xin Fang, Zhiwei Wang, Chan Woong Lee, Sang Joon Park, Jaehee Chun, Constantin Ulrich, Klaus H. Maier-Hein , et al. (17 additional authors not shown)

    Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)

  10. arXiv:2312.04148  [pdf

    eess.SY

    Generalized Dam** Torque Analysis of Ultra-Low Frequency Oscillation in the Jerk Space

    Authors: Yichen Zhou, Yang Yang, Tao Zhou, Yonggang Li

    Abstract: Ultra low frequency oscillation (ULFO) is significantly threatening the power system stability. Its unstable mechanism is mostly studied via generalized dam** torque analysis method (GDTA). However, the analysis still adopts the framework established for low frequency oscillation. Hence, this letter proposes a GDTA approach in the jerk space for ULFO. A multi-information variable is constructed… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  11. arXiv:2309.10330  [pdf, other

    physics.optics eess.SP

    Time Stretch with Continuous-Wave Lasers

    Authors: Tingyi Zhou, Yuta Goto, Takeshi Makino, Callen MacPhee, Yiming Zhou, Asad M. Madni, Hideaki Furukawa, Naoya Wada, Bahram Jalali

    Abstract: A single-shot measurement technique for ultrafast phenomena with high throughput enables the capture of rare events within a short time scale, facilitating the exploration of rare ultrafast processes. Photonic time stretch stands out as a highly effective method for both detecting rapid events and achieving remarkable speed in imaging and ranging applications. The current time stretch method relie… ▽ More

    Submitted 1 November, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

  12. arXiv:2309.03779  [pdf, other

    cs.LG cs.AI cs.AR cs.OS eess.SY

    CPU frequency scheduling of real-time applications on embedded devices with temporal encoding-based deep reinforcement learning

    Authors: Ti Zhou, Man Lin

    Abstract: Small devices are frequently used in IoT and smart-city applications to perform periodic dedicated tasks with soft deadlines. This work focuses on develo** methods to derive efficient power-management methods for periodic tasks on small devices. We first study the limitations of the existing Linux built-in methods used in small devices. We illustrate three typical workload/system patterns that a… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted to Journal of Systems Architecture

    Journal ref: Journal of Systems Architecture, 2023

  13. Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning

    Authors: Qiang He, Tianyi Zhou, Meng Fang, Setareh Maghsudi

    Abstract: We propose a novel value approximation method, namely Eigensubspace Regularized Critic (ERC) for deep reinforcement learning (RL). ERC is motivated by an analysis of the dynamics of Q-value approximation error in the Temporal-Difference (TD) method, which follows a path defined by the 1-eigensubspace of the transition kernel associated with the Markov Decision Process (MDP). It reveals a fundament… ▽ More

    Submitted 8 November, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: Accepted to ECML23. Code: https://sites.google.com/view/erc-ecml23/

  14. arXiv:2305.15193  [pdf, other

    cs.LG eess.SY

    Adaptive Policy Learning to Additional Tasks

    Authors: Wenjian Hao, Zehui Lu, Zihao Liang, Tianyu Zhou, Shaoshuai Mou

    Abstract: This paper develops a policy learning method for tuning a pre-trained policy to adapt to additional tasks without altering the original task. A method named Adaptive Policy Gradient (APG) is proposed in this paper, which combines Bellman's principle of optimality with the policy gradient approach to improve the convergence rate. This paper provides theoretical analysis which guarantees the converg… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  15. arXiv:2305.08569  [pdf, ps, other

    eess.SY

    Attention-based QoE-aware Digital Twin Empowered Edge Computing for Immersive Virtual Reality

    Authors: Jiadong Yu, Ahmad Alhilal, Tailin Zhou, Pan Hui, Danny H. K. Tsang

    Abstract: Metaverse applications such as virtual reality (VR) content streaming, require optimal resource allocation strategies for mobile edge computing (MEC) to ensure a high-quality user experience. In contrast to online reinforcement learning (RL) algorithms, which can incur substantial communication overheads and longer delays, the majority of existing works employ offline-trained RL algorithms for res… ▽ More

    Submitted 23 May, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

  16. arXiv:2304.13725  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Prediction of brain tumor recurrence location based on multi-modal fusion and nonlinear correlation learning

    Authors: Tongxue Zhou, Alexandra Noeuveglise, Romain Modzelewski, Fethi Ghazouani, Sébastien Thureau, Maxime Fontanilles, Su Ruan

    Abstract: Brain tumor is one of the leading causes of cancer death. The high-grade brain tumors are easier to recurrent even after standard treatment. Therefore, develo** a method to predict brain tumor recurrence location plays an important role in the treatment planning and it can potentially prolong patient's survival time. There is still little work to deal with this issue. In this paper, we present a… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 23 pages, 4 figures

    Journal ref: Computerized Medical Imaging and Graphics, 2023

  17. arXiv:2304.04297  [pdf, other

    cs.CV cs.DC eess.IV

    AI-assisted Automated Workflow for Real-time X-ray Ptychography Data Analysis via Federated Resources

    Authors: Anakha V Babu, Tekin Bicer, Saugat Kandel, Tao Zhou, Daniel J. Ching, Steven Henke, Siniša Veseli, Ryan Chard, Antonino Miceli, Mathew Joseph Cherukara

    Abstract: We present an end-to-end automated workflow that uses large-scale remote compute resources and an embedded GPU platform at the edge to enable AI/ML-accelerated real-time analysis of data collected for x-ray ptychography. Ptychography is a lensless method that is being used to image samples through a simultaneous numerical inversion of a large number of diffraction patterns from adjacent overlappin… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: 7 pages, 1 figure, to be published in High Performance Computing for Imaging Conference, Electronic Imaging (HPCI 2023)

  18. arXiv:2304.02249  [pdf, other

    physics.optics eess.SP

    Low Latency Computing for Time Stretch Instruments

    Authors: Tingyi Zhou, Bahram Jalali

    Abstract: Time stretch instruments have been exceptionally successful in discovering single-shot ultrafast phenomena such as optical rogue waves and have led to record-speed microscopy, spectroscopy, lidar, etc. These instruments encode the ultrafast events into the spectrum of a femtosecond pulse and then dilate the time scale of the data using group velocity dispersion. Generating as much as Tbit per seco… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  19. arXiv:2302.04432  [pdf, ps, other

    eess.SP

    Active Simultaneously Transmitting and Reflecting (STAR)-RISs: Modelling and Analysis

    Authors: Jiaqi Xu, Jiakuo Zuo, Joey Tianyi Zhou, Yuanwei Liu

    Abstract: A hardware model for active simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RISs) is proposed consisting of reflection-type amplifiers. The amplitude gains of the STAR element are derived for both coupled and independent phase-shift scenarios. Based on the proposed hardware model, an active STAR-RIS-aided two-user downlink communication system is investigated.… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 13 pages

  20. arXiv:2212.12134  [pdf, other

    eess.SP

    AMDET: Attention based Multiple Dimensions EEG Transformer for Emotion Recognition

    Authors: Yongling Xu, Yang Du, **g Zou, Tianying Zhou, Lushan Xiao, Li Liu, Pengcheng

    Abstract: Affective computing is an important branch of artificial intelligence, and with the rapid development of brain computer interface technology, emotion recognition based on EEG signals has received broad attention. It is still a great challenge to effectively explore the multi-dimensional information in the EEG data in spite of a large number of deep learning methods. In this paper, we propose a dee… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

  21. Multi-scale Transformer Network with Edge-aware Pre-training for Cross-Modality MR Image Synthesis

    Authors: Yonghao Li, Tao Zhou, Kelei He, Yi Zhou, Dinggang Shen

    Abstract: Cross-modality magnetic resonance (MR) image synthesis can be used to generate missing modalities from given ones. Existing (supervised learning) methods often require a large number of paired multi-modal data to train an effective synthesis model. However, it is often challenging to obtain sufficient paired data for supervised training. In reality, we often have a small number of paired data whil… ▽ More

    Submitted 18 June, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: 13 pages, 16 figures. This paper has been accepted by IEEE TMI

  22. arXiv:2212.00555  [pdf, other

    q-bio.NC cs.AI eess.IV

    A Structure-guided Effective and Temporal-lag Connectivity Network for Revealing Brain Disorder Mechanisms

    Authors: Zhengwang Xia, Tao Zhou, Saqib Mamoon, Amani Alfakih, Jianfeng Lu

    Abstract: Brain network provides important insights for the diagnosis of many brain disorders, and how to effectively model the brain structure has become one of the core issues in the domain of brain imaging analysis. Recently, various computational methods have been proposed to estimate the causal relationship (i.e., effective connectivity) between brain regions. Compared with traditional correlation-base… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  23. arXiv:2210.02245  [pdf, other

    eess.SP eess.IV

    Channel Modeling for UAV-to-Ground Communications with Posture Variation and Fuselage Scattering Effect

    Authors: Boyu Hua, Haoran Ni, Qiuming Zhu, Cheng-Xiang Wang, Tongtong Zhou, Kai Mao, Junwei Bao, Xiaofei Zhang

    Abstract: Unmanned aerial vehicle (UAV)-to-ground (U2G) channel models play a pivotal role for reliable communications between UAV and ground terminal. This paper proposes a three-dimensional (3D) non-stationary hybrid model including both large-scale and small-scale fading for U2G multiple-input-multiple-output (MIMO) channels. Distinctive channel characteristics under U2G scenarios, i.e., 3D trajectory an… ▽ More

    Submitted 13 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

  24. arXiv:2209.09408  [pdf, other

    cs.LG eess.IV

    Deep learning at the edge enables real-time streaming ptychographic imaging

    Authors: Anakha V Babu, Tao Zhou, Saugat Kandel, Tekin Bicer, Zhengchun Liu, William Judge, Daniel J. Ching, Yi Jiang, Sinisa Veseli, Steven Henke, Ryan Chard, Yudong Yao, Ekaterina Sirazitdinova, Geetika Gupta, Martin V. Holt, Ian T. Foster, Antonino Miceli, Mathew J. Cherukara

    Abstract: Coherent microscopy techniques provide an unparalleled multi-scale view of materials across scientific and technological fields, from structural materials to quantum devices, from integrated circuits to biological cells. Driven by the construction of brighter sources and high-rate detectors, coherent X-ray microscopy methods like ptychography are poised to revolutionize nanoscale materials charact… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  25. arXiv:2209.08800  [pdf, ps, other

    eess.SP

    A Realistic 3D Non-Stationary Channel Model for UAV-to-Vehicle Communications Incorporating Fuselage Posture

    Authors: Boyu Hua, Tongtong Zhou, Qiuming Zhu, Kai Mao, Junwei Bao, Weizhi Zhong, Naeem Ahmed

    Abstract: Considering the unmanned aerial vehicle (UAV) three-dimensional (3D) posture, a novel 3D non-stationary geometry-based stochastic model (GBSM) is proposed for multiple-input multiple-output (MIMO) UAV-to-vehicle (U2V) channels. It consists of a line-of-sight (LoS) and non-line-of-sight (NLoS) components. The factor of fuselage posture is considered by introducing a time-variant 3D posture matrix.… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 12 pages, 8 figures, CNCOM

  26. arXiv:2209.05317  [pdf, ps, other

    eess.SP physics.app-ph

    Simultaneously Transmitting and Reflecting (STAR)-RISs: Are they Applicable to Dual-Sided Incidence?

    Authors: Jiaqi Xu, Xidong Mu, Joey Tianyi Zhou, Yuanwei Liu

    Abstract: A hardware model and a signal model are proposed for dual-sided simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RISs), where the signal simultaneously incident on both sides of the surface. Based on the proposed hardware model, signal models for dual-sided STAR-RISs are developed. For elements with scalar surface impedance, it is proved that their transmission… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 13 pages

  27. arXiv:2208.07528  [pdf, other

    eess.SY

    Integrating Satellites and Mobile Edge Computing for 6G Wide-Area Edge Intelligence: Minimal Structures and Systematic Thinking

    Authors: Yueshan Lin, Wei Feng, Ting Zhou, Yanmin Wang, Yunfei Chen, Ning Ge, Cheng-Xiang Wang

    Abstract: The sixth-generation (6G) network will shift its focus to supporting everything including various machine-type devices (MTDs) in an everyone-centric manner. To ubiquitously cover the MTDs working in rural and disastrous areas, satellite communications become indispensable, while mobile edge computing (MEC) also plays an increasingly crucial role. Their sophisticated integration enables wide-area e… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

  28. arXiv:2206.10814  [pdf, ps, other

    eess.SY math.OC q-bio.MN

    Frequency Domain Identifiability and Sloppiness of Descriptor Systems with an LFT Structure

    Authors: Tong Zhou

    Abstract: Identifiability and sloppiness are investigated in this paper for the parameters of a descriptor system based on its frequency response samples. Two metrics are suggested respectively for measuring absolute and relative sloppiness of the parameter vector at a prescribed value. In this descriptor system, system matrices are assumed to depend on its parameters through a linear fractional transformat… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 14 pages

    Journal ref: Automatica, vol. 159, 111362, 2024

  29. arXiv:2203.04313  [pdf, other

    eess.IV cs.CV

    Multi-Scale Adaptive Network for Single Image Denoising

    Authors: Yuanbiao Gou, Peng Hu, Jiancheng Lv, Joey Tianyi Zhou, Xi Peng

    Abstract: Multi-scale architectures have shown effectiveness in a variety of tasks thanks to appealing cross-scale complementarity. However, existing architectures treat different scale features equally without considering the scale-specific characteristics, \textit{i.e.}, the within-scale characteristics are ignored in the architecture design. In this paper, we reveal this missing piece for multi-scale arc… ▽ More

    Submitted 29 October, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Journal ref: the Thirty-Sixth Annual Conference on Neural Information Processing Systems (NeurIPS 2022)

  30. Mobile Device Association and Resource Allocation in Small-Cell IoT Networks with Mobile Edge Computing and Caching

    Authors: Tianqing Zhou, Yali Yue, Dong Qin, Xuefang Nie, Xuan Li, Chunguo Li

    Abstract: To meet the need of computation-sensitive (CS) and high-rate (HR) communications, the framework of mobile edge computing and caching has been widely regarded as a promising solution. When such a framework is implemented in small-cell IoT (Internet of Tings) networks, it is a key and open topic how to assign mobile edge computing and caching servers to mobile devices (MDs) with CS and HR communicat… ▽ More

    Submitted 26 February, 2022; originally announced February 2022.

  31. arXiv:2202.07125  [pdf, other

    cs.LG cs.AI eess.SP stat.ML

    Transformers in Time Series: A Survey

    Authors: Qingsong Wen, Tian Zhou, Chaoli Zhang, Weiqi Chen, Ziqing Ma, Junchi Yan, Liang Sun

    Abstract: Transformers have achieved superior performances in many tasks in natural language processing and computer vision, which also triggered great interest in the time series community. Among multiple advantages of Transformers, the ability to capture long-range dependencies and interactions is especially attractive for time series modeling, leading to exciting progress in various time series applicati… ▽ More

    Submitted 11 May, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted by 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023). 9 pages. The first work to comprehensively and systematically summarize time series Transformers. The GitHub repository is https://github.com/qingsongedu/time-series-transformers-review

    Journal ref: In the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)

  32. arXiv:2111.12260  [pdf, ps, other

    eess.SP

    Federated Dynamic Neural Network for Deep MIMO Detection

    Authors: Yuwen Yang, Feifei Gao, Jiang Xue, Ting Zhou, Zongben Xu

    Abstract: In this paper, we develop a dynamic detection network (DDNet) based detector for multiple-input multiple-output (MIMO) systems. By constructing an improved DetNet (IDetNet) detector and the OAMPNet detector as two independent network branches, the DDNet detector performs sample-wise dynamic routing to adaptively select a better one between the IDetNet and the OAMPNet detectors for every samples un… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

  33. arXiv:2111.04735  [pdf, other

    eess.IV cs.CV physics.med-ph

    Feature-enhanced Generation and Multi-modality Fusion based Deep Neural Network for Brain Tumor Segmentation with Missing MR Modalities

    Authors: Tongxue Zhou, Stéphane Canu, Pierre Vera, Su Ruan

    Abstract: Using multimodal Magnetic Resonance Imaging (MRI) is necessary for accurate brain tumor segmentation. The main problem is that not all types of MRIs are always available in clinical exams. Based on the fact that there is a strong correlation between MR modalities of the same patient, in this work, we propose a novel brain tumor segmentation network in the case of missing one or more modalities. Th… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: 30 pages, 7 figures

    Journal ref: Neurocomputing 2021

  34. A Tri-attention Fusion Guided Multi-modal Segmentation Network

    Authors: Tongxue Zhou, Su Ruan, Pierre Vera, Stéphane Canu

    Abstract: In the field of multimodal segmentation, the correlation between different modalities can be considered for improving the segmentation results. Considering the correlation between different MR modalities, in this paper, we propose a multi-modality segmentation network guided by a novel tri-attention fusion. Our network includes N model-independent encoding paths with N image sources, a tri-attenti… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 33 pages, 11 figures, accepted by Pattern Recognition on 01 November 2021. arXiv admin note: substantial text overlap with arXiv:2102.03111

    Journal ref: Pattern Recognition 2021

  35. arXiv:2110.08080  [pdf, other

    eess.IV cs.CV

    Deep multi-modal aggregation network for MR image reconstruction with auxiliary modality

    Authors: Chun-Mei Feng, Huazhu Fu, Tianfei Zhou, Yong Xu, Ling Shao, David Zhang

    Abstract: Magnetic resonance (MR) imaging produces detailed images of organs and tissues with better contrast, but it suffers from a long acquisition time, which makes the image quality vulnerable to say motion artifacts. Recently, many approaches have been developed to reconstruct full-sampled images from partially observed measurements to accelerate MR imaging. However, most approaches focused on reconstr… ▽ More

    Submitted 21 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  36. arXiv:2108.06233  [pdf, ps, other

    eess.SP physics.app-ph

    Simultaneously Transmitting and Reflecting (STAR) Intelligent Omni-Surfaces, Their Modeling and Implementation

    Authors: Jiaqi Xu, Yuanwei Liu, Xidong Mu, Joey Tianyi Zhou, Lingyang Song, H. Vincent Poor, Lajos Hanzo

    Abstract: With the rapid development of advanced electromagnetic manipulation technologies, researchers and engineers are starting to study smart surfaces that can achieve enhanced coverages, high reconfigurability, and are easy to deploy. Among these efforts, simultaneously transmitting and reflecting intelligent omni-surface (STAR-IOS) is one of the most promising categories. Although pioneering works hav… ▽ More

    Submitted 3 September, 2021; v1 submitted 13 August, 2021; originally announced August 2021.

    Comments: Submitted to IEEE for possible publication

  37. arXiv:2107.02852  [pdf, other

    eess.AS cs.CL cs.SD

    A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio

    Authors: Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

    Abstract: Speaker-attributed automatic speech recognition (SA-ASR) is a task to recognize "who spoke what" from multi-talker recordings. An SA-ASR system usually consists of multiple modules such as speech separation, speaker diarization and ASR. On the other hand, considering the joint optimization, an end-to-end (E2E) SA-ASR model has recently been proposed with promising results on simulation data. In th… ▽ More

    Submitted 17 September, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: To appear in ASRU 2021

  38. Global Structure Identifiability and Reconstructibility of an NDS with Descriptor Subsystems

    Authors: Tong Zhou, Kailin Yin

    Abstract: This paper investigates requirements on a networked dynamic system (NDS) such that its subsystem interactions can be solely determined from experiment data or reconstructed from its overall model. The NDS is constituted from several subsystems whose dynamics are described through a descriptor form. Except regularity on each subsystem and the whole NDS, no other restrictions are put on either subsy… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 15 pages, 4 figures

    Journal ref: Automatica, Vol.142, 110356, 2022

  39. arXiv:2106.04047  [pdf, ps, other

    eess.SP

    Joint Channel Estimation and Mixed-ADCs Allocation for Massive MIMO via Deep Learning

    Authors: Liangyuan Xu, Feifei Gao, Ting Zhou, Shaodan Ma, Wei Zhang

    Abstract: Millimeter wave (mmWave) multi-user massive multi-input multi-output (MIMO) is a promising technique for the next generation communication systems. However, the hardware cost and power consumption grow significantly as the number of radio frequency (RF) components increases, which hampers the deployment of practical massive MIMO systems. To address this issue and further facilitate the commerciali… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: 13 pages, 12 figures. Submitted to IEEE Transactions on Wireless Communications

  40. Deep Unsupervised Learning for Joint Antenna Selection and Hybrid Beamforming

    Authors: Zhiyan Liu, Yuwen Yang, Feifei Gao, Ting Zhou, Hongbing Ma

    Abstract: In this paper, we propose a novel deep unsupervised learning-based approach that jointly optimizes antenna selection and hybrid beamforming to improve the hardware and spectral efficiencies of massive multiple-input-multiple-output (MIMO) downlink systems. By employing ResNet to extract features from the channel matrices, two neural networks, i.e., the antenna selection network (ASNet) and the hyb… ▽ More

    Submitted 21 January, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted by IEEE Transactions on Communications

  41. arXiv:2105.13013  [pdf, ps, other

    eess.IV

    Conditional generator and multi-sourcecorrelation guided brain tumor segmentation with missing MR modalities

    Authors: Tongxue Zhou, Stéphane Canu, Pierre Vera, Su Ruan

    Abstract: Brain tumor is one of the most high-risk cancers which causes the 5-year survival rate of only about 36%. Accurate diagnosis of brain tumor is critical for the treatment planning. However, complete data are not always available in clinical scenarios. In this paper, we propose a novel brain tumor segmentation network to deal with the missing data issue. To compensate for missing data, we propose to… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

    Comments: 10 pages, 4 figures

  42. arXiv:2105.08629  [pdf, other

    eess.IV cs.CV cs.LG

    Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Kim Byeoung-su, Radu Timofte, Angeline Pouget, Fenglong Song, Cheng Li, Shuai Xiao, Zhongqian Fu, Matteo Maggioni, Yibin Huang, Shen Cheng, Xin Lu, Yifeng Zhou, Liangyu Chen, Donghao Liu, Xiangyu Zhang, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Bin Huang , et al. (7 additional authors not shown)

    Abstract: Image denoising is one of the most critical problems in mobile photo processing. While many solutions have been proposed for this task, they are usually working with synthetic data and are too computationally expensive to run on mobile devices. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based image denoising solut… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/. arXiv admin note: substantial text overlap with arXiv:2105.07809, arXiv:2105.07825

  43. arXiv:2104.14119  [pdf, other

    math.OC cs.CE eess.SY

    Adaptive Partitioning Strategy for High-Dimensional Discrete Simulation-based Optimization Problems

    Authors: **g Lu, Tianli Zhou, Carolina Osorio

    Abstract: In this paper, we introduce a technique to enhance the computational efficiency of solution algorithms for high-dimensional discrete simulation-based optimization problems. The technique is based on innovative adaptive partitioning strategies that partition the feasible region using solutions that has already been simulated as well as prior knowledge of the problem of interesting. We integrate the… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  44. Latent Correlation Representation Learning for Brain Tumor Segmentation with Missing MRI Modalities

    Authors: Tongxue Zhou, Stéphane Canu, Pierre Vera, Su Ruan

    Abstract: Magnetic Resonance Imaging (MRI) is a widely used imaging technique to assess brain tumor. Accurately segmenting brain tumor from MR images is the key to clinical diagnostics and treatment planning. In addition, multi-modal MR images can provide complementary information for accurate brain tumor segmentation. However, it's common to miss some imaging modalities in clinical practice. In this paper,… ▽ More

    Submitted 20 April, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: 12 pages, 10 figures, accepted by IEEE Transactions on Image Processing (8 April 2021). arXiv admin note: text overlap with arXiv:2003.08870, arXiv:2102.03111

    Journal ref: IEEE Transactions on Image Processing On page(s): 4263-4274 Print ISSN: 1057-7149 Online ISSN: 1941-0042

  45. arXiv:2103.02378  [pdf, other

    cs.SD cs.AI cs.LG eess.AS eess.SP

    Continuous Speech Separation with Ad Hoc Microphone Arrays

    Authors: Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng

    Abstract: Speech separation has been shown effective for multi-talker speech recognition. Under the ad hoc microphone array setup where the array consists of spatially distributed asynchronous microphones, additional challenges must be overcome as the geometry and number of microphones are unknown beforehand. Prior studies show, with a spatial-temporalinterleaving structure, neural networks can efficiently… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

  46. arXiv:2102.11634  [pdf, other

    eess.AS cs.SD

    Dual-Path Modeling for Long Recording Speech Separation in Meetings

    Authors: Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian

    Abstract: The continuous speech separation (CSS) is a task to separate the speech sources from a long, partially overlapped recording, which involves a varying number of speakers. A straightforward extension of conventional utterance-level speech separation to the CSS task is to segment the long recording with a size-fixed window and process each window separately. Though effective, this extension fails to… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: Accepted by ICASSP 2021

  47. arXiv:2102.03111  [pdf, other

    eess.IV cs.CV cs.LG

    3D Medical Multi-modal Segmentation Network Guided by Multi-source Correlation Constraint

    Authors: Tongxue Zhou, Stéphane Canu, Pierre Vera, Su Ruan

    Abstract: In the field of multimodal segmentation, the correlation between different modalities can be considered for improving the segmentation results. In this paper, we propose a multi-modality segmentation network with a correlation constraint. Our network includes N model-independent encoding paths with N image sources, a correlation constraint block, a feature fusion block, and a decoding path. The mo… ▽ More

    Submitted 5 February, 2021; originally announced February 2021.

    Comments: 8 pages, 8 figures

  48. arXiv:2101.06951  [pdf, other

    eess.SP

    Deep Learning based Antenna Selection and CSI Extrapolation in Massive MIMO Systems

    Authors: Bo Lin, Feifei Gao, Shun Zhang, Ting Zhou, Ahmed Alkhateeb

    Abstract: A critical bottleneck of massive multiple-input multiple-output (MIMO) system is the huge training overhead caused by downlink transmission, like channel estimation, downlink beamforming and covariance observation. In this paper, we propose to use the channel state information (CSI) of a small number of antennas to extrapolate the CSI of the other antennas and reduce the training overhead. Specifi… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

  49. arXiv:2101.01149  [pdf, ps, other

    eess.SP

    Deep Learning for Latent Events Forecasting in Twitter Aided Caching Networks

    Authors: Zhong Yang, Yuanwei Liu, Yue Chen, Joey Tianyi Zhou

    Abstract: A novel Twitter context aided content caching (TAC) framework is proposed for enhancing the caching efficiency by taking advantage of the legibility and massive volume of Twitter data. For the purpose of promoting the caching efficiency, three machine learning models are proposed to predict latent events and events popularity, utilizing collect Twitter data with geo-tags and geographic information… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: 30 pages, 15 figures

  50. arXiv:2012.09727  [pdf, other

    eess.AS cs.SD eess.SP

    Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording

    Authors: Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita, Shinji Watanabe, Marc Delcroix, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen

    Abstract: Leveraging additional speaker information to facilitate speech separation has received increasing attention in recent years. Recent research includes extracting target speech by using the target speaker's voice snippet and jointly separating all participating speakers by using a pool of additional speaker signals, which is known as speech separation using speaker inventory (SSUSI). However, all th… ▽ More

    Submitted 18 December, 2020; v1 submitted 17 December, 2020; originally announced December 2020.