Skip to main content

Showing 1–50 of 59 results for author: Yu, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.03935  [pdf, other

    eess.SY

    Roadside Units Assisted Localized Automated Vehicle Maneuvering: An Offline Reinforcement Learning Approach

    Authors: Kui Wang, Changyang She, Zongdian Li, Tao Yu, Yonghui Li, Kei Sakaguchi

    Abstract: Traffic intersections present significant challenges for the safe and efficient maneuvering of connected and automated vehicles (CAVs). This research proposes an innovative roadside unit (RSU)-assisted cooperative maneuvering system aimed at enhancing road safety and traveling efficiency at intersections for CAVs. We utilize RSUs for real-time traffic data acquisition and train an offline reinforc… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 6 pages, 6 figures

  2. arXiv:2403.18992  [pdf

    eess.IV

    Tractography with T1-weighted MRI and associated anatomical constraints on clinical quality diffusion MRI

    Authors: Tian Yu, Yunhe Li, Michael E. Kim, Chenyu Gao, Qi Yang, Leon Y. Cai, Susane M. Resnick, Lori L. Beason-Held, Daniel C. Moyer, Kurt G. Schilling, Bennett A. Landman

    Abstract: Diffusion MRI (dMRI) streamline tractography, the gold standard for in vivo estimation of brain white matter (WM) pathways, has long been considered indicative of macroscopic relationships with WM microstructure. However, recent advances in tractography demonstrated that convolutional recurrent neural networks (CoRNN) trained with a teacher-student framework have the ability to learn and propagate… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  3. Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept

    Authors: Kui Wang, Zongdian Li, Kazuma Nonomura, Tao Yu, Kei Sakaguchi, Omar Hashash, Walid Saad

    Abstract: Digital twins (DTs) have driven major advancements across various industrial domains over the past two decades. With the rapid advancements in autonomous driving and vehicle-to-everything (V2X) technologies, integrating DTs into vehicular platforms is anticipated to further revolutionize smart mobility systems. In this paper, a new smart mobility DT (SMDT) platform is proposed for the control of c… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 15 pages, 10 figures

  4. arXiv:2402.00565  [pdf, other

    eess.SY

    A Review of Carsickness Mitigation: Navigating Challenges and Exploiting Opportunities in the Era of Intelligent Vehicles

    Authors: Daofei Li, Tingzhe Yu, Binbin Tang

    Abstract: Motion sickness (MS) has long been a common complaint in road transportation. However, in the era of driving automation, MS has become an increasingly significant issue. The future intelligent vehicle is envisioned as a mobile space for work or entertainment, but unfortunately passengers' engagement in non-driving tasks may exacerbate MS. Finding effective MS countermeasures is crucial to ensure a… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 19 pages, 5 figures, 5 tables

  5. arXiv:2401.12235  [pdf

    cs.LG eess.SY

    Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning

    Authors: Bairong Deng, Tao Yu, Zhenning Pan, Xuehan Zhang, Yufeng Wu, Qiaoyi Ding

    Abstract: Reinforcement learning is an emerging approaches to facilitate multi-stage sequential decision-making problems. This paper studies a real-time multi-stage stochastic power dispatch considering multivariate uncertainties. Current researches suffer from low generalization and practicality, that is, the learned dispatch policy can only handle a specific dispatch scenario, its performance degrades sig… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  6. arXiv:2401.08654  [pdf, other

    cs.RO eess.SY

    Smart Mobility Digital Twin for Automated Driving: Design and Proof-of-Concept

    Authors: Kui Wang, Zongdian Li, Tao Yu, Kei Sakaguchi

    Abstract: During the past decade, smart mobility and intelligent vehicles have attracted increasing attention, because they promise to create a highly efficient and safe transportation system in the future. Meanwhile, digital twin, as an emerging technology, will play an important role in automated driving and intelligent transportation systems. This technology is applied in this paper to design a platform… ▽ More

    Submitted 24 December, 2023; originally announced January 2024.

  7. arXiv:2401.02099  [pdf

    cs.CV cs.SD eess.AS

    Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition

    Authors: Zeyu Li, Suncheng Xiang, Tong Yu, **gsheng Gao, Jiacheng Ruan, Yan** Hu, Ting Liu, Yuzhuo Fu

    Abstract: The recognition of underwater audio plays a significant role in identifying a vessel while it is in motion. Underwater target recognition tasks have a wide range of applications in areas such as marine environmental protection, detection of ship radiated noise, underwater noise control, and coastal vessel dispatch. The traditional UATR task involves training a network to extract features from audi… ▽ More

    Submitted 10 June, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted by ICIC 2024

  8. arXiv:2306.13101  [pdf, other

    eess.SP cs.AI cs.LG

    BrainNet: Epileptic Wave Detection from SEEG with Hierarchical Graph Diffusion Learning

    Authors: Junru Chen, Yang Yang, Tao Yu, Yingying Fan, Xiaolong Mo, Carl Yang

    Abstract: Epilepsy is one of the most serious neurological diseases, affecting 1-2% of the world's population. The diagnosis of epilepsy depends heavily on the recognition of epileptic waves, i.e., disordered electrical brainwave activity in the patient's brain. Existing works have begun to employ machine learning models to detect epileptic waves via cortical electroencephalogram (EEG). However, the recentl… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  9. arXiv:2305.18771  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    SFCNeXt: a simple fully convolutional network for effective brain age estimation with small sample size

    Authors: Yu Fu, Yanyan Huang, Shunjie Dong, Yalin Wang, Tianbai Yu, Meng Niu, Cheng Zhuo

    Abstract: Deep neural networks (DNN) have been designed to predict the chronological age of a healthy brain from T1-weighted magnetic resonance images (T1 MRIs), and the predicted brain age could serve as a valuable biomarker for the early detection of development-related or aging-related disorders. Recent DNN models for brain age estimations usually rely too much on large sample sizes and complex network s… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: This paper has been accepted by IEEE ISBI 2023

  10. arXiv:2305.15526  [pdf, other

    eess.SP

    Radiomap Inpainting for Restricted Areas based on Propagation Priority and Depth Map

    Authors: Songyang Zhang, Tianhang Yu, Brian Choi, Feng Ouyang, Zhi Ding

    Abstract: Providing rich and useful information regarding spectrum activities and propagation channels, radiomaps characterize the detailed distribution of power spectral density (PSD) and are important tools for network planning in modern wireless systems. Generally, radiomaps are constructed from radio strength measurements by deployed sensors and user devices. However, not all areas are accessible for ra… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: submitted to IEEE journal for possible publication

  11. arXiv:2305.09011  [pdf, other

    eess.IV cs.CV

    The Brain Tumor Segmentation (BraTS) Challenge 2023: Brain MR Image Synthesis for Tumor Segmentation (BraSyn)

    Authors: Hongwei Bran Li, Gian Marco Conte, Syed Muhammad Anwar, Florian Kofler, Ivan Ezhov, Koen van Leemput, Marie Piraud, Maria Diaz, Byrone Cole, Evan Calabrese, Jeff Rudie, Felix Meissen, Maruf Adewole, Anastasia Janas, Anahita Fathi Kazerooni, Dominic LaBella, Ahmed W. Moawad, Keyvan Farahani, James Eddy, Timothy Bergquist, Verena Chung, Russell Takeshi Shinohara, Farouk Dako, Walter Wiggins, Zachary Reitman , et al. (43 additional authors not shown)

    Abstract: Automated brain tumor segmentation methods have become well-established and reached performance levels offering clear clinical utility. These methods typically rely on four input magnetic resonance imaging (MRI) modalities: T1-weighted images with and without contrast enhancement, T2-weighted images, and FLAIR images. However, some sequences are often missing in clinical practice due to time const… ▽ More

    Submitted 28 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Technical report of BraSyn

  12. arXiv:2305.05644  [pdf, other

    cs.CL cs.DC eess.SY

    Towards Building the Federated GPT: Federated Instruction Tuning

    Authors: Jianyi Zhang, Saeed Vahidian, Martin Kuo, Chunyuan Li, Ruiyi Zhang, Tong Yu, Yufan Zhou, Guoyin Wang, Yiran Chen

    Abstract: While "instruction-tuned" generative large language models (LLMs) have demonstrated an impressive ability to generalize to new tasks, the training phases heavily rely on large amounts of diverse and high-quality instruction data (such as ChatGPT and GPT-4). Unfortunately, acquiring high-quality data, especially when it comes to human-written data, can pose significant challenges both in terms of c… ▽ More

    Submitted 29 January, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Project page: https://github.com/JayZhang42/FederatedGPT-Shepherd

  13. arXiv:2305.02583  [pdf, other

    eess.AS cs.SD

    Hybrid AHS: A Hybrid of Kalman Filter and Deep Learning for Acoustic Howling Suppression

    Authors: Hao Zhang, Meng Yu, Yuzhong Wu, Tao Yu, Dong Yu

    Abstract: Deep learning has been recently introduced for efficient acoustic howling suppression (AHS). However, the recurrent nature of howling creates a mismatch between offline training and streaming inference, limiting the quality of enhanced speech. To address this limitation, we propose a hybrid method that combines a Kalman filter with a self-attentive recurrent neural network (SARNN) to leverage thei… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: submitted to INTERSPEECH 2023. arXiv admin note: text overlap with arXiv:2302.09252

  14. arXiv:2303.07704  [pdf, other

    eess.AS cs.SD

    TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

    Authors: Yukai Ju, Jun Chen, Shimin Zhang, Shulin He, Wei Rao, Weixin Zhu, Yannan Wang, Tao Yu, Shidong Shang

    Abstract: This paper introduces the Unbeatable Team's submission to the ICASSP 2023 Deep Noise Suppression (DNS) Challenge. We expand our previous work, TEA-PSE, to its upgraded version -- TEA-PSE 3.0. Specifically, TEA-PSE 3.0 incorporates a residual LSTM after squeezed temporal convolution network (S-TCN) to enhance sequence modeling capabilities. Additionally, the local-global representation (LGR) struct… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  15. arXiv:2302.06294  [pdf, other

    eess.IV cs.CV cs.LG

    CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection

    Authors: Chinedu Innocent Nwoye, Tong Yu, Saurav Sharma, Aditya Murali, Deepak Alapatt, Armine Vardazaryan, Kun Yuan, Jonas Hajek, Wolfgang Reiter, Amine Yamlahi, Finn-Henri Smidt, Xiaoyang Zou, Guoyan Zheng, Bruno Oliveira, Helena R. Torres, Satoshi Kondo, Satoshi Kasai, Felix Holm, Ege Özsoy, Shuangchun Gui, Han Li, Sista Raviteja, Rachana Sathish, Pranav Poudel, Binod Bhattarai , et al. (24 additional authors not shown)

    Abstract: Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier effor… ▽ More

    Submitted 14 July, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: MICCAI EndoVis CholecTriplet2022 challenge report. Published at Elsevier journal of Medical Image Analysis. 25 pages, 15 figures, 8 tables

    Journal ref: Medical Image Analysis, Volume 89, 2023, 102888, ISSN 1361-8415

  16. arXiv:2301.00504  [pdf

    eess.IV cs.AI cs.CV eess.SP

    Spectral Bandwidth Recovery of Optical Coherence Tomography Images using Deep Learning

    Authors: Timothy T. Yu, Da Ma, Jayden Cole, Myeong ** Ju, Mirza F. Beg, Marinko V. Sarunic

    Abstract: Optical coherence tomography (OCT) captures cross-sectional data and is used for the screening, monitoring, and treatment planning of retinal diseases. Technological developments to increase the speed of acquisition often results in systems with a narrower spectral bandwidth, and hence a lower axial resolution. Traditionally, image-processing-based techniques have been utilized to reconstruct subs… ▽ More

    Submitted 1 January, 2023; originally announced January 2023.

  17. arXiv:2212.11233  [pdf

    eess.IV cs.CR

    Realization Scheme for Visual Cryptography with Computer-generated Holograms

    Authors: Tao Yu, **ge Ma, Guilin Li, Dongyu Yang, Rui Ma, Yishi Shi

    Abstract: We propose to realize visual cryptography in an indirect way with the help of computer-generated hologram. At present, the recovery method of visual cryptography is mainly superimposed on transparent film or superimposed by computer equipment, which greatly limits the application range of visual cryptography. In this paper, the shares of the visual cryptography were encoded with computer-generated… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: International Workshop on Holography and related technologies (IWH) 2018

  18. arXiv:2212.00729  [pdf, other

    eess.SP cs.LG

    Edge Deep Learning Enabled Freezing of Gait Detection in Parkinson's Patients

    Authors: Ourong Lin, Tian Yu, Yuhan Hou, Yi Zhu, Xilin Liu

    Abstract: This paper presents the design of a wireless sensor network for detecting and alerting the freezing of gait (FoG) symptoms in patients with Parkinson's disease. Three sensor nodes, each integrating a 3-axis accelerometer, can be placed on a patient at ankle, thigh, and truck. Each sensor node can independently detect FoG using an on-device deep learning (DL) model, featuring a squeeze and excitati… ▽ More

    Submitted 27 November, 2022; originally announced December 2022.

  19. arXiv:2211.05432  [pdf, other

    cs.SD eess.AS

    Speech Enhancement with Fullband-Subband Cross-Attention Network

    Authors: Jun Chen, Wei Rao, Zilin Wang, Zhiyong Wu, Yannan Wang, Tao Yu, Shidong Shang, Helen Meng

    Abstract: FullSubNet has shown its promising performance on speech enhancement by utilizing both fullband and subband information. However, the relationship between fullband and subband in FullSubNet is achieved by simply concatenating the output of fullband model and subband units. It only supplements the subband units with a small quantity of global information and has not considered the interaction betwe… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted by InterSpeech 2022. arXiv admin note: text overlap with arXiv:2203.12188

  20. arXiv:2210.02883  [pdf, ps, other

    cs.NI eess.SP

    A Novel Energy Efficiency Metric for Next Generation Wireless Communication Networks

    Authors: Tao Yu, Shunqing Zhang, Xiao**g Chen, Xin Wang

    Abstract: As a core performance metric for green communications, the conventional energy efficiency definition has successfully resolved many issues in the energy efficient wireless network design. In the past several generations of wireless communication networks, the traditional energy efficiency measure plays an important role to guide many energy saving techniques for slow varying traffic profiles. Howe… ▽ More

    Submitted 9 September, 2022; originally announced October 2022.

  21. arXiv:2210.00778  [pdf, other

    cs.IT eess.SP

    On Lattice-Code based Multiple Access: Uplink Architecture and Algorithms

    Authors: Tao Yang. Fangtao Yu, Qiuzhuo Chen, Rongke Liu

    Abstract: This paper studies a lattice-code based multiple-access (LCMA) framework, and develops a package of processing techniques that are essential to its practical implementation. In the uplink, $K$ users encode their messages with the same ring coded modulation of $2^{m}$-PAM signaling. With it, the integer sum of multiple codewords belongs to the $n$-dimension lattice of the base code. Such property e… ▽ More

    Submitted 21 June, 2024; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 30 Pages, 11 figures, submitted to IEEE Trans. Wireless Comm

  22. arXiv:2209.10843  [pdf, ps, other

    cs.IT eess.SP

    A Unified Joint Optimization of Training Sequences and Transceivers Based on Matrix-Monotonic Optimization

    Authors: Chengwen Xing, Tao Yu, **peng Song, Zhong Zheng, Lian Zhao, Lajos Hanzo

    Abstract: Channel estimation and data transmission constitute the most fundamental functional modules of multiple-input multiple-output (MIMO) communication systems. The underlying key tasks corresponding to these modules are training sequence optimization and transceiver optimization. Hence, we jointly optimize the linear transmit precoder and the training sequence of MIMO systems using the metrics of thei… ▽ More

    Submitted 17 July, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 39 pages, 9 figures, 1 table, manuscript accepted in IEEE TVT

  23. arXiv:2209.04566  [pdf, other

    eess.SP

    Exemplar-Based Radio Map Reconstruction of Missing Areas Using Propagation Priority

    Authors: Songyang Zhang, Tianhang Yu, Jonathan Tivald, Brian Choi, Feng Ouyang, Zhi Ding

    Abstract: Radio map describes network coverage and is a practically important tool for network planning in modern wireless systems. Generally, radio strength measurements are collected to construct fine-resolution radio maps for analysis. However, certain protected areas are not accessible for measurement due to physical constraints and security considerations, leading to blanked spaces on a radio map. Non-… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: To appear in 2022 IEEE Global Communications Conference (Globecom)

  24. arXiv:2208.14635  [pdf, other

    eess.IV cs.CV cs.LG

    Segmentation-guided Domain Adaptation and Data Harmonization of Multi-device Retinal Optical Coherence Tomography using Cycle-Consistent Generative Adversarial Networks

    Authors: Shuo Chen, Da Ma, Sieun Lee, Timothy T. L. Yu, Gavin Xu, Donghuan Lu, Karteek Popuri, Myeong ** Ju, Marinko V. Sarunic, Mirza Faisal Beg

    Abstract: Optical Coherence Tomography(OCT) is a non-invasive technique capturing cross-sectional area of the retina in micro-meter resolutions. It has been widely used as a auxiliary imaging reference to detect eye-related pathology and predict longitudinal progression of the disease characteristics. Retina layer segmentation is one of the crucial feature extraction techniques, where the variations of reti… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: 16 pages, 10 figures

  25. arXiv:2208.13328  [pdf, other

    eess.IV physics.med-ph

    Slice estimation in diffusion MRI of neonatal and fetal brains in image and spherical harmonics domains using autoencoders

    Authors: Hamza Kebiri, Gabriel Girard, Yasser Aleman-Gomez, Thomas Yu, Andras Jakab, Erick Jorge Canales-Rodriguez, Meritxell Bach Cuadra

    Abstract: Diffusion MRI (dMRI) of the develo** brain can provide valuable insights into the white matter development. However, slice thickness in fetal dMRI is typically high (i.e., 3-5 mm) to freeze the in-plane motion, which reduces the sensitivity of the dMRI signal to the underlying anatomy. In this study, we aim at overcoming this problem by using autoencoders to learn unsupervised efficient represen… ▽ More

    Submitted 28 August, 2022; originally announced August 2022.

  26. arXiv:2207.02663  [pdf, other

    cs.CL cs.SD eess.AS

    Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands

    Authors: Wenliang Dai, Samuel Cahyawijaya, Tiezheng Yu, Elham J Barezi, Pascale Fung

    Abstract: With the rise of deep learning and intelligent vehicles, the smart assistant has become an essential in-car component to facilitate driving and provide extra functionalities. In-car smart assistants should be able to process general as well as car-related commands and perform corresponding actions, which eases driving and improves safety. However, in this research field, most datasets are in major… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

  27. arXiv:2207.00492  [pdf

    eess.SY

    Reinforcement Learning Based User-Guided Motion Planning for Human-Robot Collaboration

    Authors: Tian Yu, Qing Chang

    Abstract: Robots are good at performing repetitive tasks in modern manufacturing industries. However, robot motions are mostly planned and preprogrammed with a notable lack of adaptivity to task changes. Even for slightly changed tasks, the whole system must be reprogrammed by robotics experts. Therefore, it is highly desirable to have a flexible motion planning method, with which robots can adapt to specif… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

  28. arXiv:2205.04264  [pdf, other

    cs.CV cs.MM eess.IV

    SwinIQA: Learned Swin Distance for Compressed Image Quality Assessment

    Authors: Jianzhao Liu, Xin Li, Yanding Peng, Tao Yu, Zhibo Chen

    Abstract: Image compression has raised widespread interest recently due to its significant importance for multimedia storage and transmission. Meanwhile, a reliable image quality assessment (IQA) for compressed images can not only help to verify the performance of various compression algorithms but also help to guide the compression optimization in turn. In this paper, we design a full-reference image quali… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: CVPR2022 Workshop (CLIC) accepted

  29. Live Laparoscopic Video Retrieval with Compressed Uncertainty

    Authors: Tong Yu, Pietro Mascagni, Juan Verde, Jacques Marescaux, Didier Mutter, Nicolas Padoy

    Abstract: Searching through large volumes of medical data to retrieve relevant information is a challenging yet crucial task for clinical care. However the primitive and most common approach to retrieval, involving text in the form of keywords, is severely limited when dealing with complex media formats. Content-based retrieval offers a way to overcome this limitation, by using rich media as the query itsel… ▽ More

    Submitted 12 June, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: 16 pages, 13 figures

    Journal ref: Medical Image Analysis 88 (2023) 102866

  30. arXiv:2202.06722  [pdf

    eess.SY eess.SP

    Active and Passive Hybrid Detection Method for Power CPS False Data Injection Attacks with Improved AKF and GRU-CNN

    Authors: Zhaoyang Qu, Xiaoyong Bo, Tong Yu, Yaowei Liu, Yunchang Dong, Zhongfeng Kan, Lei Wang, Yang Li

    Abstract: Influenced by deep penetration of the new generation of information technology, power systems have gradually evolved into highly coupled cyber-physical systems (CPS). Among many possible power CPS network attacks, a false data injection attacks (FDIAs) is the most serious. Taking account of the fact that the existing knowledge-driven detection process for FDIAs has been in a passive detection stat… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted by IET Renewable Power Generation

    Journal ref: IET Renewable Power Generation 16 (2022) 1490-1508

  31. arXiv:2202.06548  [pdf, other

    eess.IV cs.LG

    A resource-efficient deep learning framework for low-dose brain PET image reconstruction and analysis

    Authors: Yu Fu, Shunjie Dong, Yi Liao, Le Xue, Yuanfan Xu, Feng Li, Qianqian Yang, Tianbai Yu, Mei Tian, Cheng Zhuo

    Abstract: 18F-fluorodeoxyglucose (18F-FDG) Positron Emission Tomography (PET) imaging usually needs a full-dose radioactive tracer to obtain satisfactory diagnostic results, which raises concerns about the potential health risks of radiation exposure, especially for pediatric patients. Reconstructing the low-dose PET (L-PET) images to the high-quality full-dose PET (F-PET) ones is an effective way that both… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  32. arXiv:2201.12535  [pdf, ps, other

    eess.IV cs.AI physics.med-ph

    Validation and Generalizability of Self-Supervised Image Reconstruction Methods for Undersampled MRI

    Authors: Thomas Yu, Tom Hilbert, Gian Franco Piredda, Arun Joseph, Gabriele Bonanno, Salim Zenkhri, Patrick Omoumi, Meritxell Bach Cuadra, Erick Jorge Canales-Rodríguez, Tobias Kober, Jean-Philippe Thiran

    Abstract: Deep learning methods have become the state of the art for undersampled MR reconstruction. Particularly for cases where it is infeasible or impossible for ground truth, fully sampled data to be acquired, self-supervised machine learning methods for reconstruction are becoming increasingly used. However potential issues in the validation of such methods, as well as their generalizability, remain un… ▽ More

    Submitted 12 September, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://www.melba-journal.org/papers/2022:022.html

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

  33. arXiv:2201.05501  [pdf, ps, other

    eess.SP cs.LG

    Study of Frequency domain exponential functional link network filters

    Authors: T. Yu, S. Tana, R. C. de Lamareb, Y. Yu

    Abstract: The exponential functional link network (EFLN) filter has attracted tremendous interest due to its enhanced nonlinear modeling capability. However, the computational complexity will dramatically increase with the dimension growth of the EFLN-based filter. To improve the computational efficiency, we propose a novel frequency domain exponential functional link network (FDEFLN) filter in this paper.… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: 32 pages, 17 figures

  34. arXiv:2201.02419  [pdf, other

    cs.CL cs.SD eess.AS

    Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset

    Authors: Tiezheng Yu, Rita Frieske, Peng Xu, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung

    Abstract: Automatic speech recognition (ASR) on low resource languages improves the access of linguistic minorities to technological advantages provided by artificial intelligence (AI). In this paper, we address the problem of data scarcity for the Hong Kong Cantonese language by creating a new Cantonese dataset. Our dataset, Multi-Domain Cantonese Corpus (MDCC), consists of 73.6 hours of clean read speech… ▽ More

    Submitted 17 January, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  35. arXiv:2112.06382  [pdf

    eess.SY math.OC

    Dynamic Exploitation Gaussian Bare-Bones Bat Algorithm for Optimal Reactive Power Dispatch to Improve the Safety and Stability of Power System

    Authors: Zhaoyang Qu, Yunchang Dong, Sylvère Mugemanyi, Tong Yu, Xiaoyong Bo, Huashun Li, Yang Li, François Xavier Rugema, Christophe Bananeza

    Abstract: In this paper, a novel Gaussian bare-bones bat algorithm (GBBBA) and its modified version named as dynamic exploitation Gaussian bare-bones bat algorithm (DeGBBBA) are proposed for solving optimal reactive power dispatch (ORPD) problem. The optimal reactive power dispatch (ORPD) plays a fundamental role in ensuring stable, secure, reliable as well as economical operation of the power system. The O… ▽ More

    Submitted 12 December, 2021; originally announced December 2021.

    Comments: Accepted by IET Renewable Power Generation

    Journal ref: IET Renewable Power Generation 16 (2022) 1401-1424

  36. arXiv:2111.08387  [pdf, other

    eess.AS cs.SD

    S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement

    Authors: Shubo Lv, Yihui Fu, Mengtao Xing, Jiayao Sun, Lei Xie, Jun Huang, Yannan Wang, Tao Yu

    Abstract: In speech enhancement, complex neural network has shown promising performance due to their effectiveness in processing complex-valued spectrum. Most of the recent speech enhancement approaches mainly focus on wide-band signal with a sampling rate of 16K Hz. However, research on super wide band (e.g., 32K Hz) or even full-band (48K) denoising is still lacked due to the difficulty of modeling more f… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Submitted to ICASSP 2022

  37. arXiv:2109.14956  [pdf

    eess.IV cs.CV cs.LG

    Comparative Validation of Machine Learning Algorithms for Surgical Workflow and Skill Analysis with the HeiChole Benchmark

    Authors: Martin Wagner, Beat-Peter Müller-Stich, Anna Kisilenko, Duc Tran, Patrick Heger, Lars Mündermann, David M Lubotsky, Benjamin Müller, Tornike Davitashvili, Manuela Capek, Annika Reinke, Tong Yu, Armine Vardazaryan, Chinedu Innocent Nwoye, Nicolas Padoy, Xinyang Liu, Eung-Joo Lee, Constantin Disch, Hans Meine, Tong Xia, Fucang Jia, Satoshi Kondo, Wolfgang Reiter, Yueming **, Yonghao Long , et al. (16 additional authors not shown)

    Abstract: PURPOSE: Surgical workflow and skill analysis are key technologies for the next generation of cognitive surgical assistance systems. These systems could increase the safety of the operation through context-sensitive warnings and semi-autonomous robotic assistance or improve training of surgeons via data-driven feedback. In surgical workflow analysis up to 91% average precision has been reported fo… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

  38. arXiv:2109.03624  [pdf, other

    physics.med-ph cs.LG eess.IV

    FaBiAN: A Fetal Brain magnetic resonance Acquisition Numerical phantom

    Authors: Hélène Lajous, Christopher W. Roy, Tom Hilbert, Priscille de Dumast, Sébastien Tourbier, Yasser Alemán-Gómez, Jérôme Yerly, Thomas Yu, Hamza Kebiri, Kelly Payette, Jean-Baptiste Ledoux, Reto Meuli, Patric Hagmann, Andras Jakab, Vincent Dunet, Mériam Koob, Tobias Kober, Matthias Stuber, Meritxell Bach Cuadra

    Abstract: Accurate characterization of in utero human brain maturation is critical as it involves complex and interconnected structural and functional processes that may influence health later in life. Magnetic resonance imaging is a powerful tool to investigate equivocal neurological patterns during fetal development. However, the number of acquisitions of satisfactory quality available in this cohort of s… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: 23 pages, 9 figures (including Supplementary Material), 4 tables, 1 supplement. Submitted to Scientific Reports (2021)

  39. arXiv:2107.02345  [pdf, other

    eess.IV cs.CV cs.LG

    Domain Adaptation via CycleGAN for Retina Segmentation in Optical Coherence Tomography

    Authors: Ricky Chen, Timothy T. Yu, Gavin Xu, Da Ma, Marinko V. Sarunic, Mirza Faisal Beg

    Abstract: With the FDA approval of Artificial Intelligence (AI) for point-of-care clinical diagnoses, model generalizability is of the utmost importance as clinical decision-making must be domain-agnostic. A method of tackling the problem is to increase the dataset to include images from a multitude of domains; while this technique is ideal, the security requirements of medical data is a major limitation. A… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 10 pages, 6 figures, 1 table

    ACM Class: I.4.0

  40. arXiv:2104.00960  [pdf, other

    eess.AS cs.SD

    INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing

    Authors: Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang

    Abstract: The ConferencingSpeech 2021 challenge is proposed to stimulate research on far-field multi-channel speech enhancement for video conferencing. The challenge consists of two separate tasks: 1) Task 1 is multi-channel speech enhancement with single microphone array and focusing on practical application with real-time requirement and 2) Task 2 is multi-channel speech enhancement with multiple distribu… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: 5 pages, submitted to INTERSPEECH 2021

  41. arXiv:2012.06131  [pdf, other

    cs.CV eess.IV

    Learning Omni-frequency Region-adaptive Representations for Real Image Super-Resolution

    Authors: Xin Li, Xin **, Tao Yu, Yingxue Pang, Simeng Sun, Zhizheng Zhang, Zhibo Chen

    Abstract: Traditional single image super-resolution (SISR) methods that focus on solving single and uniform degradation (i.e., bicubic down-sampling), typically suffer from poor performance when applied into real-world low-resolution (LR) images due to the complicated realistic degradations. The key to solving this more challenging real image super-resolution (RealSR) problem lies in learning feature repres… ▽ More

    Submitted 10 January, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: Accepted by AAAI2021

  42. arXiv:2011.09590  [pdf

    cs.SI eess.SP

    Towards mmWave V2X in 5G and Beyond to Support Automated Driving

    Authors: Kei Sakaguchi, Ryuichi Fukatsu, Tao Yu, Eisuke Fukuda, Kim Mahler, Robert Heath, Takeo Fujii, Kazuaki Takahashi, Alexey Khoryaev, Satoshi Nagata, Takayuki Shimizu

    Abstract: Millimeter wave provides high data rates for Vehicle-to-Everything (V2X) communications. This paper motivates millimeter wave to support automated driving and begins by explaining V2X use cases that support automated driving with references to several standardi-zation bodies. The paper gives a classification of existing V2X stand-ards: IEEE802.11p and LTE V2X, along with the status of their com-me… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Journal ref: IEICE Trans. Commun., vol.E104-B, no.6, June 2021

  43. arXiv:2010.14841  [pdf, other

    cs.SD cs.CL eess.AS

    INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices

    Authors: Yiwu Yao, Yuchao Li, Chengyu Wang, Tianhang Yu, Houjiang Chen, Xiaotang Jiang, Jun Yang, Jun Huang, Wei Lin, Hui Shu, Chengfei Lv

    Abstract: The intensive computation of Automatic Speech Recognition (ASR) models obstructs them from being deployed on mobile devices. In this paper, we present a novel quantized Winograd optimization pipeline, which combines the quantization and fast convolution to achieve efficient inference acceleration on mobile devices for ASR models. To avoid the information loss due to the combination of quantization… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

  44. arXiv:2007.12199  [pdf, other

    eess.IV physics.med-ph

    T2 Map** from Super-Resolution-Reconstructed Clinical Fast Spin Echo Magnetic Resonance Acquisitions

    Authors: Hélène Lajous, Tom Hilbert, Christopher W. Roy, Sébastien Tourbier, Priscille de Dumast, Thomas Yu, Jean-Philippe Thiran, Jean-Baptiste Ledoux, Davide Piccini, Patric Hagmann, Reto Meuli, Tobias Kober, Matthias Stuber, Ruud B. van Heeswijk, Meritxell Bach Cuadra

    Abstract: Relaxometry studies in preterm and at-term newborns have provided insight into brain microstructure, thus opening new avenues for studying normal brain development and supporting diagnosis in equivocal neurological situations. However, such quantitative techniques require long acquisition times and therefore cannot be straightforwardly translated to in utero brain developmental studies. In clinica… ▽ More

    Submitted 23 July, 2020; originally announced July 2020.

    Comments: 11 pages, 4 figures, 1 table, 1 supplement, to appear in Proceedings in Medical Image Computing and Computer Assisted Intervention (MICCAI), Peru, October 2020

  45. arXiv:2007.11430  [pdf, other

    cs.CV eess.IV

    Learning Disentangled Feature Representation for Hybrid-distorted Image Restoration

    Authors: Xin Li, Xin **, Jianxin Lin, Tao Yu, Sen Liu, Yaojun Wu, Wei Zhou, Zhibo Chen

    Abstract: Hybrid-distorted image restoration (HD-IR) is dedicated to restore real distorted image that is degraded by multiple distortions. Existing HD-IR approaches usually ignore the inherent interference among hybrid distortions which compromises the restoration performance. To decompose such interference, we introduce the concept of Disentangled Feature Learning to achieve the feature-level divide-and-c… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: Accepted by ECCV2020

  46. arXiv:2007.10225  [pdf, other

    physics.med-ph eess.IV

    Model-Informed Machine Learning for Multi-component T2 Relaxometry

    Authors: Thomas Yu, Erick Jorge Canales Rodriguez, Marco Pizzolato, Gian Franco Piredda, Tom Hilbert, Elda Fischi-Gomez, Matthias Weigel, Muhamed Barakovic, Meritxell Bach-Cuadra, Cristina Granziera, Tobias Kober, Jean-Philippe Thiran

    Abstract: Recovering the T2 distribution from multi-echo T2 magnetic resonance (MR) signals is challenging but has high potential as it provides biomarkers characterizing the tissue micro-structure, such as the myelin water fraction (MWF). In this work, we propose to combine machine learning and aspects of parametric (fitting from the MRI signal using biophysical models) and non-parametric (model-free fitti… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: Preprint submitted to Medical Image Analysis (July 14, 2020)

  47. Recognition of Instrument-Tissue Interactions in Endoscopic Videos via Action Triplets

    Authors: Chinedu Innocent Nwoye, Cristians Gonzalez, Tong Yu, Pietro Mascagni, Didier Mutter, Jacques Marescaux, Nicolas Padoy

    Abstract: Recognition of surgical activity is an essential component to develop context-aware decision support for the operating room. In this work, we tackle the recognition of fine-grained activities, modeled as action triplets <instrument, verb, target> representing the tool activity. To this end, we introduce a new laparoscopic dataset, CholecT40, consisting of 40 videos from the public dataset Cholec80… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: 13 pages, 4 figures, 6 tables. Accepted and to be published in MICCAI 2020

    Journal ref: Medical Image Computing and Computer Assisted Intervention MICCAI 12263 (2020) 364-374

  48. arXiv:2007.04140  [pdf, other

    cs.RO eess.SY

    Mastering the working sequence in human-robot collaborative assembly based on reinforcement learning

    Authors: Tian Yu, **g Huang, Qing Chang

    Abstract: A long-standing goal of the Human-Robot Collaboration (HRC) in manufacturing systems is to increase the collaborative working efficiency. In line with the trend of Industry 4.0 to build up the smart manufacturing system, the Co-robot in the HRC system deserves better designing to be more self-organized and to find the superhuman proficiency by self-learning. Inspired by the impressive machine lear… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: 11 pages, 6 figures

  49. arXiv:2007.03053  [pdf, other

    eess.IV cs.CV

    Benefiting from Bicubically Down-Sampled Images for Learning Real-World Image Super-Resolution

    Authors: Mohammad Saeed Rad, Thomas Yu, Claudiu Musat, Hazim Kemal Ekenel, Behzad Bozorgtabar, Jean-Philippe Thiran

    Abstract: Super-resolution (SR) has traditionally been based on pairs of high-resolution images (HR) and their low-resolution (LR) counterparts obtained artificially with bicubic downsampling. However, in real-world SR, there is a large variety of realistic image degradations and analytically modeling these realistic degradations can prove quite difficult. In this work, we propose to handle real-world SR by… ▽ More

    Submitted 5 November, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: WACV 2021

  50. arXiv:2006.09201  [pdf, other

    eess.SP cs.LG stat.ML

    A Hybrid Deep Learning Model for Predictive Flood Warning and Situation Awareness using Channel Network Sensors Data

    Authors: Shangjia Dong, Tianbo Yu, Hamed Farahmand, Ali Mostafavi

    Abstract: The objective of this study is to create and test a hybrid deep learning model, FastGRNN-FCN (Fast, Accurate, Stable and Tiny Gated Recurrent Neural Network-Fully Convolutional Network), for urban flood prediction and situation awareness using channel network sensors data. The study used Harris County, Texas as the testbed, and obtained channel sensor data from three historical flood events (e.g.,… ▽ More

    Submitted 8 September, 2020; v1 submitted 15 June, 2020; originally announced June 2020.