Skip to main content

Showing 1–28 of 28 results for author: Zhu, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.12408  [pdf, other

    cs.RO eess.SY

    Flexible Active Safety Motion Control for Robotic Obstacle Avoidance: A CBF-Guided MPC Approach

    Authors: **hao Liu, Jun Yang, Jianliang Mao, Tianqi Zhu, Qihang Xie, Yimeng Li, Xiangyu Wang, Shihua Li

    Abstract: A flexible active safety motion (FASM) control approach is proposed for the avoidance of dynamic obstacles and the reference tracking in robot manipulators. The distinctive feature of the proposed method lies in its utilization of control barrier functions (CBF) to design flexible CBF-guided safety criteria (CBFSC) with dynamically optimized decay rates, thereby offering flexibility and active saf… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 11 pages, 11 figures

  2. arXiv:2403.19971  [pdf, other

    eess.AS eess.SP

    3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization

    Authors: Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Tinglong Zhu, Changhe Song, Rongjie Huang, Ziyang Ma, Qian Chen, Shiliang Zhang, Xihao Li

    Abstract: This paper introduces 3D-Speaker-Toolkit, an open source toolkit for multi-modal speaker verification and diarization. It is designed for the needs of academic researchers and industrial practitioners. The 3D-Speaker-Toolkit adeptly leverages the combined strengths of acoustic, semantic, and visual data, seamlessly fusing these modalities to offer robust speaker recognition capabilities. The acous… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  3. arXiv:2402.14543  [pdf

    eess.SY

    Low-frequency Resonances in Grid-Forming Converters: Causes and Dam** Control

    Authors: Fangzhou Zhao, Tianhua Zhu, Zejie Li, Xiongfei Wang

    Abstract: Grid-forming voltage-source converter (GFM-VSC) may experience low-frequency resonances, such as synchronous resonance (SR) and sub-synchronous resonance (SSR), in the output power. This paper offers a comprehensive study on the root causes of low-frequency resonances with GFM-VSC systems and the dam** control methods. The typical GFM control structures are introduced first, along with a map**… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  4. arXiv:2401.10345  [pdf, other

    eess.IV

    Attack and Defense Analysis of Learned Image Compression

    Authors: Tianyu Zhu, Heming Sun, Xiankui Xiong, Xuanpeng Zhu, Yong Gong, Minge **g, Yibo Fan

    Abstract: Learned image compression (LIC) is becoming more and more popular these years with its high efficiency and outstanding compression quality. Still, the practicality against modified inputs added with specific noise could not be ignored. White-box attacks such as FGSM and PGD use only gradient to compute adversarial images that mislead LIC models to output unexpected results. Our experiments compare… ▽ More

    Submitted 27 March, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  5. arXiv:2401.00225  [pdf

    eess.AS cs.AI eess.SP

    Enhancing dysarthria speech feature representation with empirical mode decomposition and Walsh-Hadamard transform

    Authors: Ting Zhu, Shufei Duan, Camille Dingam, Huizhi Liang, Wei Zhang

    Abstract: Dysarthria speech contains the pathological characteristics of vocal tract and vocal fold, but so far, they have not yet been included in traditional acoustic feature sets. Moreover, the nonlinearity and non-stationarity of speech have been ignored. In this paper, we propose a feature enhancement algorithm for dysarthria speech called WHFEMD. It combines empirical mode decomposition (EMD) and fast… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  6. arXiv:2312.15153  [pdf

    cs.OS cs.CR eess.SY

    Design and Implementation Considerations for a Virtual File System Using an Inode Data Structure

    Authors: Qin Sun, Grace McKenzie, Guanqun Song, Ting Zhu

    Abstract: Virtual file systems are a tool to centralize and mobilize a file system that could otherwise be complex and consist of multiple hierarchies, hard disks, and more. In this paper, we discuss the design of Unix-based file systems and how this type of file system layout using inode data structures and a disk emulator can be implemented as a single-file virtual file system in Linux. We explore the way… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  7. arXiv:2312.08998  [pdf

    eess.AS cs.AI cs.SD eess.SP

    Design, construction and evaluation of emotional multimodal pathological speech database

    Authors: Ting Zhu, Shufei Duan, Huizhi Liang, Wei Zhang

    Abstract: The lack of an available emotion pathology database is one of the key obstacles in studying the emotion expression status of patients with dysarthria. The first Chinese multimodal emotional pathological speech database containing multi-perspective information is constructed in this paper. It includes 29 controls and 39 patients with different degrees of motor dysarthria, expressing happy, sad, ang… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  8. arXiv:2312.01785  [pdf

    eess.SY

    Closed-Form Solutions for Grid-Forming Converters: A Design-Oriented Study

    Authors: Fangzhou Zhao, Tianhua Zhu, Lennart Harnefors, Bo Fan, Heng Wu, Zichao Zhou, Yin Sun, Xiongfei Wang

    Abstract: This paper derives closed-form solutions for grid-forming converters with power synchronization control (PSC) by subtly simplifying and factorizing the complex closed-loop models. The solutions can offer clear analytical insights into control-loop interactions, enabling guidelines for robust controller design. It is proved that 1) the proportional gains of PSC and alternating voltage control (AVC)… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  9. arXiv:2307.03898   

    cs.CV eess.IV

    StyleGAN3: Generative Networks for Improving the Equivariance of Translation and Rotation

    Authors: Tianlei Zhu, Junqi Chen, Renzhe Zhu, Gaurav Gupta

    Abstract: StyleGAN can use style to affect facial posture and identity features, and noise to affect hair, wrinkles, skin color and other details. Among these, the outcomes of the picture processing will vary slightly between different versions of styleGAN. As a result, the comparison of performance differences between styleGAN2 and the two modified versions of styleGAN3 will be the main focus of this study… ▽ More

    Submitted 5 February, 2024; v1 submitted 8 July, 2023; originally announced July 2023.

    Comments: But now we feel we haven't fully studied our work and have found some new great results. So after careful consideration, we're going to rework this manuscript and try to give a more accurate model

  10. arXiv:2306.02913  [pdf, other

    cs.LG cs.CY cs.DC eess.SY stat.ML

    Decentralized SGD and Average-direction SAM are Asymptotically Equivalent

    Authors: Tongtian Zhu, Fengxiang He, Kaixuan Chen, Mingli Song, Dacheng Tao

    Abstract: Decentralized stochastic gradient descent (D-SGD) allows collaborative learning on massive devices simultaneously without the control of a central server. However, existing theories claim that decentralization invariably undermines generalization. In this paper, we challenge the conventional belief and present a completely new perspective for understanding decentralized learning. We prove that D-S… ▽ More

    Submitted 9 November, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 40th International Conference on Machine Learning (ICML 2023)

  11. arXiv:2212.14418  [pdf

    eess.SY

    Heterogeneous Computing Systems

    Authors: Dimple P. Khatri, Guanqun Song, Ting Zhu

    Abstract: This survey of heterogeneous computing systems will help in analyzing the technological trends that will be at the basis of heterogeneous computing systems, highlighting the major opportunities and challenges such technologies will bring with them. This will help to understand the importance of heterogeneous computing systems, which are becoming common architectural elements of not only the modern… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

  12. arXiv:2212.08601  [pdf, other

    cs.SD eess.AS

    Source Tracing: Detecting Voice Spoofing

    Authors: Tinglong Zhu, Xingming Wang, Xiaoyi Qin, Ming Li

    Abstract: Recent anti-spoofing systems focus on spoofing detection, where the task is only to determine whether the test audio is fake. However, there are few studies putting attention to identifying the methods of generating fake speech. Common spoofing attack algorithms in the logical access (LA) scenario, such as voice conversion and speech synthesis, can be divided into several stages: input processing,… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: Accepted by APSIPA ASC

  13. arXiv:2202.05397  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Neural Architecture Search for Energy Efficient Always-on Audio Models

    Authors: Daniel T. Speckhard, Karolis Misiunas, Sagi Perel, Tenghui Zhu, Simon Carlile, Malcolm Slaney

    Abstract: Mobile and edge computing devices for always-on classification tasks require energy-efficient neural network architectures. In this paper we present several changes to neural architecture searches (NAS) that improve the chance of success in practical situations. Our search simultaneously optimizes for network accuracy, energy efficiency and memory usage. We benchmark the performance of our search… ▽ More

    Submitted 1 June, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

  14. arXiv:2109.10499  [pdf, other

    eess.IV cs.CV

    Joint Optical Neuroimaging Denoising with Semantic Tasks

    Authors: Tianfang Zhu, Yue Guan, Anan Li

    Abstract: Optical neuroimaging is a vital tool for understanding the brain structure and the connection between regions and nuclei. However, the image noise introduced in the sample preparation and the imaging system hinders the extraction of the possible knowlege from the dataset, thus denoising for the optical neuroimaging is usually necessary. The supervised denoisng methods often outperform the unsuperv… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

  15. A Lightweight Privacy-Preserving Scheme Using Label-based Pixel Block Mixing for Image Classification in Deep Learning

    Authors: Yuexin Xiang, Tiantian Li, Wei Ren, Tianqing Zhu, Kim-Kwang Raymond Choo

    Abstract: To ensure the privacy of sensitive data used in the training of deep learning models, a number of privacy-preserving methods have been designed by the research community. However, existing schemes are generally designed to work with textual data, or are not efficient when a large number of images is used for training. Hence, in this paper we propose a lightweight and efficient approach to preserve… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: 11 pages, 16 figures

    MSC Class: 68T07 ACM Class: I.2.6; I.2.9

    Journal ref: Engineering Applications of Artificial Intelligence 126 (2023): 107180

  16. arXiv:2104.02306  [pdf, other

    cs.SD cs.LG eess.AS

    Binary Neural Network for Speaker Verification

    Authors: Tinglong Zhu, Xiaoyi Qin, Ming Li

    Abstract: Although deep neural networks are successful for many tasks in the speech domain, the high computational and memory costs of deep neural networks make it difficult to directly deploy highperformance Neural Network systems on low-resource embedded devices. There are several mechanisms to reduce the size of the neural networks i.e. parameter pruning, parameter quantization, etc. This paper focuses o… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  17. arXiv:2104.02054  [pdf, ps, other

    eess.SP cs.LG

    DeepMI: Deep Multi-lead ECG Fusion for Identifying Myocardial Infarction and its Occurrence-time

    Authors: Girmaw Abebe Tadesse, Hamza Javed, Yong Liu, ** Liu, Jiyan Chen, Komminist Weldemariam, Tingting Zhu

    Abstract: Myocardial Infarction (MI) has the highest mortality of all cardiovascular diseases (CVDs). Detection of MI and information regarding its occurrence-time in particular, would enable timely interventions that may improve patient outcomes, thereby reducing the global rise in CVD deaths. Electrocardiogram (ECG) recordings are currently used to screen MI patients. However, manual inspection of ECGs is… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

    Comments: 10 pages

  18. arXiv:2012.15387   

    eess.SY

    Indoor Air Quality Improvement

    Authors: A**kya Gawade, Aniket Sanap, Vishal Baviskar, Ryan Jahnige, Qingquan Zhang, Ting Zhu

    Abstract: Poor indoor air quality can contribute to the development of various chronic respiratory diseases such as asthma, heart disease, and lung cancer. Since air quality is extremely difficult for humans to detect though sensory processing, there is a need for efficient ventilation systems that can provide a healthier environment. In this paper, we have designed an energy efficient ventilation system th… ▽ More

    Submitted 1 January, 2021; v1 submitted 30 December, 2020; originally announced December 2020.

    Comments: Evaluation is incomplete. Difference in air quality improvement when outdoor air is used as opposed to circulating indoor air is not considered

  19. arXiv:2012.04192  [pdf

    eess.SY

    Benchmarking Resource Usage of Underlying Datatypes of Apache Spark

    Authors: Brittany Nicholls, Mariama Adangwa, Rachel Estes, Hugues Nelson Iradukunda, Qingquan Zhang, Ting Zhu

    Abstract: The purpose of this paper is to examine how resource usage of an analytic is affected by the different underlying datatypes of Spark analytics - Resilient Distributed Datasets (RDDs), Datasets, and DataFrames. The resource usage of an analytic is explored as a viable and preferred alternative of benchmarking big data analytics instead of the current common benchmarking performed using execution ti… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  20. arXiv:2011.14230  [pdf, other

    eess.SP cs.LG

    CROCS: Clustering and Retrieval of Cardiac Signals Based on Patient Disease Class, Sex, and Age

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: The process of manually searching for relevant instances in, and extracting information from, clinical databases underpin a multitude of clinical tasks. Such tasks include disease diagnosis, clinical trial recruitment, and continuing medical education. This manual search-and-extract process, however, has been hampered by the growth of large-scale clinical databases and the increased prevalence of… ▽ More

    Submitted 3 October, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

    Comments: Accepted at Advances in Neural Information Processing Systems (NeurIPS) 2021

  21. arXiv:2011.14227  [pdf, other

    eess.SP cs.LG

    PCPs: Patient Cardiac Prototypes

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Many clinical deep learning algorithms are population-based and difficult to interpret. Such properties limit their clinical utility as population-based findings may not generalize to individual patients and physicians are reluctant to incorporate opaque models into their clinical workflow. To overcome these obstacles, we propose to learn patient-specific embeddings, entitled patient cardiac proto… ▽ More

    Submitted 28 November, 2020; originally announced November 2020.

  22. arXiv:2005.13249  [pdf, other

    cs.LG eess.SP stat.ML

    CLOCS: Contrastive Learning of Cardiac Signals Across Space, Time, and Patients

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: The healthcare industry generates troves of unlabelled physiological data. This data can be exploited via contrastive learning, a self-supervised pre-training method that encourages representations of instances to be similar to one another. We propose a family of contrastive learning methods, CLOCS, that encourages representations across space, time, \textit{and} patients to be similar to one anot… ▽ More

    Submitted 16 May, 2021; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: Accepted to ICML 2021

  23. arXiv:2005.09059  [pdf, other

    eess.SP cs.LG q-bio.QM

    Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation

    Authors: Taiyu Zhu, Kezhi Li, Pau Herrero, Pantelis Georgiou

    Abstract: People with Type 1 diabetes (T1D) require regular exogenous infusion of insulin to maintain their blood glucose concentration in a therapeutically adequate target range. Although the artificial pancreas and continuous glucose monitoring have been proven to be effective in achieving closed-loop control, significant challenges still remain due to the high complexity of glucose dynamics and limitatio… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Journal ref: IEEE journal of biomedical and health informatics 2020

  24. arXiv:1912.07383  [pdf, other

    eess.SP eess.SY

    A Survey of Predictive Maintenance: Systems, Purposes and Approaches

    Authors: Tianwen Zhu, Yongyi Ran, Xin Zhou, Yonggang Wen

    Abstract: This paper highlights the importance of maintenance techniques in the coming industrial revolution, reviews the evolution of maintenance techniques, and presents a comprehensive literature review on the latest advancement of maintenance techniques, i.e., Predictive Maintenance (PdM), with emphasis on system architectures, optimization objectives, and optimization methods. In industry, any outages… ▽ More

    Submitted 21 March, 2024; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: 38 pages, 23 figures

  25. arXiv:1912.05345  [pdf, other

    eess.SP cs.CV cs.LG

    Severity Detection Tool for Patients with Infectious Disease

    Authors: Girmaw Abebe Tadesse, Tingting Zhu, Nhan Le Nguyen Thanh, Nguyen Thanh Hung, Ha Thi Hai Duong, Truong Huu Khanh, Pham Van Quang, Duc Duong Tran, LamMinh Yen, H Rogier Van Doorn, Nguyen Van Hao, John Prince, Hamza Javed, DaniKiyasseh, Le Van Tan, Louise Thwaites, David A. Clifton

    Abstract: Hand, foot and mouth disease (HFMD) and tetanus are serious infectious diseases in low and middle income countries. Tetanus in particular has a high mortality rate and its treatment is resource-demanding. Furthermore, HFMD often affects a large number of infants and young children. As a result, its treatment consumes enormous healthcare resources, especially when outbreaks occur. Autonomic nervous… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

  26. arXiv:1911.10468  [pdf

    physics.app-ph eess.SP

    Extending the dynamic strain sensing rang of phase-OTDR with frequency modulation pulse and frequency interrogation

    Authors: **gdong Zhang, Haoting Wu, **gsheng Huang, Hua Zheng, Danqi Feng, Guolu Yin, Tao Zhu

    Abstract: We propose and experimentally demonstrate a technique to extend the dynamic sensing range of phase sensitive optical time domain reflectometry system based on the frequency interrogation. Benefitting from the range Doppler coupling feature, the frequency modulation pulse is capable of measuring the frequency shift induced by the dynamic strain, thus the large dynamic strain can be recovered. The p… ▽ More

    Submitted 24 November, 2019; originally announced November 2019.

  27. arXiv:1712.05587  [pdf, ps, other

    eess.SP

    DOA and Polarization Estimation for Non-Circular Signals in 3-D Millimeter Wave Polarized Massive MIMO Systems

    Authors: Liangtian Wan, Kaihui Liu, Ying-Chang Liang, Tong Zhu

    Abstract: In this paper, an algorithm of multiple signal classification (MUSIC) is proposed for two-dimensional (2-D) direction of- arrival (DOA) and polarization estimation of non-circular signal in three-dimensional (3-D) millimeter wave polarized largescale/ massive multiple-input-multiple-output (MIMO) systems. The traditional MUSIC-based algorithms can estimate either the DOA and polarization for circu… ▽ More

    Submitted 15 December, 2017; originally announced December 2017.

  28. Feedback Control of Real-Time Display Advertising

    Authors: Weinan Zhang, Yifei Rong, Jun Wang, Tianchi Zhu, Xiaofan Wang

    Abstract: Real-Time Bidding (RTB) is revolutionising display advertising by facilitating per-impression auctions to buy ad impressions as they are being generated. Being able to use impression-level data, such as user cookies, encourages user behaviour targeting, and hence has significantly improved the effectiveness of ad campaigns. However, a fundamental drawback of RTB is its instability because the bid… ▽ More

    Submitted 3 March, 2016; originally announced March 2016.

    Comments: WSDM 2016