
Showing 1–22 of 22 results for author: Du, R

Searching in archive eess.
  1. arXiv:2406.06295  [pdf, other]

    cs.SD eess.AS

    Zero-Shot Audio Captioning Using Soft and Hard Prompts

    Authors: Yiming Zhang, Xuenan Xu, Ruoyi Du, Haohe Liu, Yuan Dong, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma

    Abstract: In traditional audio captioning methods, a model is usually trained in a fully supervised manner using a human-annotated dataset containing audio-text pairs and then evaluated on the test sets from the same dataset. Such methods have two limitations. First, these methods are often data-hungry and require time-consuming and expensive human annotations to obtain audio-text pairs. Second, these model…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing

  2. arXiv:2406.02233  [pdf, other]

    eess.AS

    Towards Out-of-Distribution Detection in Vocoder Recognition via Latent Feature Reconstruction

    Authors: Renmingyue Du, Jixun Yao, Qiuqiang Kong, Yin Cao

    Abstract: Advancements in synthesized speech have created a growing threat of impersonation, making it crucial to develop deepfake algorithm recognition. One significant aspect is out-of-distribution (OOD) detection, which has gained notable attention due to its important role in deepfake algorithm recognition. However, most of the current approaches for detecting OOD in deepfake algorithm recognition rely…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures

  3. arXiv:2405.17100  [pdf, other]

    cs.CR cs.SD eess.AS

    Sok: Comprehensive Security Overview, Challenges, and Future Directions of Voice-Controlled Systems

    Authors: Haozhe Xu, Cong Wu, Yangyang Gu, Xingcan Shang, Jing Chen, Kun He, Ruiying Du

    Abstract: The integration of Voice Control Systems (VCS) into smart devices and their growing presence in daily life accentuate the importance of their security. Current research has uncovered numerous vulnerabilities in VCS, presenting significant risks to user privacy and security. However, a cohesive and systematic examination of these vulnerabilities and the corresponding solutions is still absent. This…

    Submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2404.15854  [pdf, other]

    cs.CR cs.LG cs.SD eess.AS

    CLAD: Robust Audio Deepfake Detection Against Manipulation Attacks with Contrastive Learning

    Authors: Haolin Wu, Jing Chen, Ruiying Du, Cong Wu, Kun He, Xingcan Shang, Hao Ren, Guowen Xu

    Abstract: The increasing prevalence of audio deepfakes poses significant security threats, necessitating robust detection methods. While existing detection systems exhibit promise, their robustness against malicious audio manipulations remains underexplored. To bridge the gap, we undertake the first comprehensive study of the susceptibility of the most widely adopted audio deepfake detectors to manipulation…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE TDSC

  5. arXiv:2402.09636  [pdf, other]

    eess.IV cs.CV

    Spatiotemporal Disentanglement of Arteriovenous Malformations in Digital Subtraction Angiography

    Authors: Kathleen Baur, Xin Xiong, Erickson Torio, Rose Du, Parikshit Juvekar, Reuben Dorent, Alexandra Golby, Sarah Frisken, Nazim Haouchine

    Abstract: Although Digital Subtraction Angiography (DSA) is the most important imaging for visualizing cerebrovascular anatomy, its interpretation by clinicians remains difficult. This is particularly true when treating arteriovenous malformations (AVMs), where entangled vasculature connecting arteries and veins needs to be carefully identified. The presented method aims to enhance DSA image series by highli…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Paper accepted for publication at SPIE Medical Imaging 2024

  6. arXiv:2402.05887  [pdf, other]

    eess.IV cs.MM

    Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers

    Authors: Onur G. Guleryuz, Philip A. Chou, Berivan Isik, Hugues Hoppe, Danhang Tang, Ruofei Du, Jonathan Taylor, Philip Davidson, Sean Fanello

    Abstract: We propose sandwiching standard image and video codecs between pre- and post-processing neural networks. The networks are jointly trained through a differentiable codec proxy to minimize a given rate-distortion loss. This sandwich architecture not only improves the standard codec's performance on its intended content, it can effectively adapt the codec to other types of image/video content and to…

    Submitted 8 February, 2024; originally announced February 2024.

  7. arXiv:2310.17661  [pdf, other]

    eess.SP cs.NI

    An Overview on IEEE 802.11bf: WLAN Sensing

    Authors: Rui Du, Haocheng Hua, Hailiang Xie, Xianxin Song, Zhonghao Lyu, Mengshi Hu, Narengerile, Yan Xin, Stephen McCann, Michael Montemurro, Tony Xiao Han, Jie Xu

    Abstract: With recent advancements, the wireless local area network (WLAN) or wireless fidelity (Wi-Fi) technology has been successfully utilized to realize sensing functionalities such as detection, localization, and recognition. However, the WLANs standards are developed mainly for the purpose of communication, and thus may not be able to meet the stringent requirements for emerging sensing applications.…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 31 pages, 25 figures, this is a significant updated version of arXiv:2207.04859

  8. arXiv:2305.06777  [pdf]

    eess.IV cs.CV cs.LG q-bio.QM

    Generating high-quality 3DMPCs by adaptive data acquisition and NeREF-based radiometric calibration with UGV plant phenotyping system

    Authors: Pengyao Xie, Zhihong Ma, Ruiming Du, Xin Yang, Haiyan Cen

    Abstract: Fusion of 3D and MS imaging data has a great potential for high-throughput plant phenotyping of structural and biochemical as well as physiological traits simultaneously, which is important for decision support in agriculture and for crop breeders in selecting the best genotypes. However, lacking of 3D data integrity of various plant canopy structures and low-quality of MS images caused by the com…

    Submitted 1 December, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  9. arXiv:2210.11684  [pdf, other]

    eess.SY

    Change Point Detection Approach for Online Control of Unknown Time Varying Dynamical Systems

    Authors: Deepan Muthirayan, Ruijie Du, Yanning Shen, Pramod P. Khargonekar

    Abstract: We propose a novel change point detection approach for online learning control with full information feedback (state, disturbance, and cost feedback) for unknown time-varying dynamical systems. We show that our algorithm can achieve a sub-linear regret with respect to the class of Disturbance Action Control (DAC) policies, which are a widely studied class of policies for online control of dynamica…

    Submitted 24 March, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

  10. arXiv:2207.10306  [pdf, ps, other]

    eess.SP

    Fundamental Limits and Optimization of Multiband Sensing

    Authors: Yubo Wan, An Liu, Rui Du, Tony Xiao Han

    Abstract: Multiband sensing is a promising technology that utilizes multiple non-contiguous frequency bands to achieve high-resolution target sensing. In this paper, we investigate the fundamental limits and optimization of multiband sensing, focusing on the fundamental limits associated with time delay. We first derive a Fisher information matrix (FIM) with a compact form using the Dirichlet kernel and the…

    Submitted 31 January, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

  11. arXiv:2207.04859  [pdf, ps, other]

    cs.NI eess.SP

    An Overview on IEEE 802.11bf: WLAN Sensing

    Authors: Rui Du, Hailiang Xie, Mengshi Hu, Narengerile, Yan Xin, Stephen McCann, Michael Montemurro, Tony Xiao Han, Jie Xu

    Abstract: With recent advancements, the wireless local area network (WLAN) or wireless fidelity (Wi-Fi) technology has been successfully utilized to realize sensing functionalities such as detection, localization, and recognition. However, the WLANs standards are developed mainly for the purpose of communication, and thus may not be able to meet the stringent sensing requirements in emerging applications. T…

    Submitted 11 July, 2022; originally announced July 2022.

  12. arXiv:2206.00493  [pdf, ps, other]

    eess.SP cs.IT

    Networked Sensing in 6G Cellular Networks: Opportunities and Challenges

    Authors: Liang Liu, Shuowen Zhang, Rui Du, Tong Xiao Han, Shuguang Cui

    Abstract: Radar and wireless communication are widely acknowledged as the two most successful applications of radio technology over the past decades. Recently, there has been a trend in both academia and industry to achieve integrated sensing and communication (ISAC) in one system by utilizing a common radio spectrum and the same hardware platform. This article will discuss the possibility of exploitin…

    Submitted 1 June, 2022; originally announced June 2022.

  13. arXiv:2204.08409  [pdf, other]

    cs.SD cs.CL eess.AS

    Caption Feature Space Regularization for Audio Captioning

    Authors: Yiming Zhang, Hong Yu, Ruoyi Du, Zhanyu Ma, Yuan Dong

    Abstract: Audio captioning aims at describing the content of audio clips with human language. Due to the ambiguity of audio, different people may perceive the same audio differently, resulting in caption disparities (i.e., one audio may correlate to several captions with diverse semantics). For that, general audio captioning models achieve the one-to-many training by randomly selecting a correlated caption…

    Submitted 18 April, 2022; originally announced April 2022.

  14. arXiv:2202.11134  [pdf]

    cs.HC cs.LG cs.SD eess.AS

    ProtoSound: A Personalized and Scalable Sound Recognition System for Deaf and Hard-of-Hearing Users

    Authors: Dhruv Jain, Khoa Huynh Anh Nguyen, Steven Goodman, Rachel Grossman-Kahn, Hung Ngo, Aditya Kusupati, Ruofei Du, Alex Olwal, Leah Findlater, Jon E. Froehlich

    Abstract: Recent advances have enabled automatic sound recognition systems for deaf and hard of hearing (DHH) users on mobile devices. However, these tools use pre-trained, generic sound recognition models, which do not meet the diverse needs of DHH users. We introduce ProtoSound, an interactive system for customizing sound recognition models by recording a few examples, thereby enabling personalized and fi…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: Published at the ACM CHI Conference on Human Factors in Computing Systems (CHI) 2022

  15. arXiv:2104.09954  [pdf, other]

    cs.IT eess.SP

    A Survey on Fundamental Limits of Integrated Sensing and Communication

    Authors: An Liu, Zhe Huang, Min Li, Yubo Wan, Wenrui Li, Tony Xiao Han, Chenchen Liu, Rui Du, Danny Tan Kai Pin, Jianmin Lu, Yuan Shen, Fabiola Colone, Kevin Chetty

    Abstract: The integrated sensing and communication (ISAC), in which the sensing and communication share the same frequency band and hardware, has emerged as a key technology in future wireless systems. Early works on ISAC have been focused on the design, analysis and optimization of practical ISAC technologies for various ISAC systems. While this line of works are necessary, it is equally important to study…

    Submitted 22 April, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: 32 pages, submitted to IEEE Communications Surveys and Tutorials

  16. arXiv:2010.05440  [pdf]

    eess.SY

    Using Empirical Trajectory Data to Design Connected Autonomous Vehicle Controllers for Traffic Stabilization

    Authors: Yujie Li, Sikai Chen, Runjia Du, Paul Young Joun Ha, Jiqian Dong, Samuel Labi

    Abstract: Emerging transportation technologies offer unprecedented opportunities to improve the efficiency of the transportation system from the perspectives of energy consumption, congestion, and emissions. One of these technologies is connected and autonomous vehicles (CAVs). With the prospective duality of operations of CAVs and human driven vehicles in the same roadway space (also referred to as a mixed…

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: TRB 2021 Annual Meeting

  17. arXiv:2010.05439  [pdf]

    eess.SY

    A Cooperative Control Framework for CAV Lane Change in a Mixed Traffic Environment

    Authors: Runjia Du, Sikai Chen, Yujie Li, Jiqian Dong, Paul Young Joun Ha, Samuel Labi

    Abstract: In preparing for connected and autonomous vehicles (CAVs), a worrisome aspect is the transition era which will be characterized by mixed traffic (where CAVs and human-driven vehicles (HDVs) share the roadway). Consistent with expectations that CAVs will improve road safety, on-road CAVs may adopt rather conservative control policies, and this will likely cause HDVs to unduly exploit CAV conservati…

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: TRB 2021 Annual Meeting

  18. arXiv:2010.05436  [pdf]

    cs.LG eess.SY

    Leveraging the Capabilities of Connected and Autonomous Vehicles and Multi-Agent Reinforcement Learning to Mitigate Highway Bottleneck Congestion

    Authors: Paul Young Joun Ha, Sikai Chen, Jiqian Dong, Runjia Du, Yujie Li, Samuel Labi

    Abstract: Active Traffic Management strategies are often adopted in real-time to address such sudden flow breakdowns. When queuing is imminent, Speed Harmonization (SH), which adjusts speeds in upstream traffic to mitigate traffic shockwaves downstream, can be applied. However, because SH depends on driver awareness and compliance, it may not always be effective in mitigating congestion. The use of multiag…

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: TRB 2021 Annual Meeting

  19. arXiv:2009.14665  [pdf]

    cs.AI cs.LG eess.SY

    Facilitating Connected Autonomous Vehicle Operations Using Space-weighted Information Fusion and Deep Reinforcement Learning Based Control

    Authors: Jiqian Dong, Sikai Chen, Yujie Li, Runjia Du, Aaron Steinfeld, Samuel Labi

    Abstract: The connectivity aspect of connected autonomous vehicles (CAV) is beneficial because it facilitates dissemination of traffic-related information to vehicles through Vehicle-to-External (V2X) communication. Onboard sensing equipment including LiDAR and camera can reasonably characterize the traffic environment in the immediate locality of the CAV. However, their performance is limited by their sens…

    Submitted 30 September, 2020; originally announced September 2020.

  20. arXiv:2001.08847  [pdf, ps, other]

    cs.NI eess.SY

    Wirelessly-powered Sensor Networks Power Allocation for Channel Estimation and Energy Beamforming

    Authors: Rong Du, Hossein Shokri Ghadikolaei, Carlo Fischione

    Abstract: Wirelessly-powered sensor networks (WPSNs) are becoming increasingly important in different monitoring applications. We consider a WPSN where a multiple-antenna base station, which is dedicated for energy transmission, sends pilot signals to estimate the channel state information and consequently shapes the energy beams toward the sensor nodes. Given a fixed energy budget at the base station, in t…

    Submitted 23 January, 2020; originally announced January 2020.

    Comments: The paper has been accepted in IEEE Transactions on Wireless Communications on Jan. 19th, 2020. 7 figures, 35 pages

  21. arXiv:1907.11861  [pdf, ps, other]

    eess.IV

    Deep convolution neural network model for automatic risk assessment of patients with non-metastatic nasopharyngeal carcinoma

    Authors: Richard Du, Peng Cao, Lujun Han, Qiyong Ai, Ann D. King, Varut Vardhanabhuti

    Abstract: Nasopharyngeal Carcinoma (NPC) is an endemic cancer in south-east Asia. With the advent of intensity-modulated radiotherapy, excellent locoregional control is being achieved. Consequently, this has led to pretreatment clinical staging classification being less prognostic of outcomes such as recurrence after treatment. Alternative pretreatment strategies for prognosis of NPC after treatment are ne…

    Submitted 27 July, 2019; originally announced July 2019.

    Comments: Medical Imaging with Deep Learning 2019 - Extended Abstract. MIDL 2019 [arXiv:1907.08612]

    Report number: MIDL/2019/ExtendedAbstract/S1xEkdTpYN

  22. On Maximizing Sensor Network Lifetime by Energy Balancing

    Authors: Rong Du, Lazaros Gkatzikis, Carlo Fischione, Ming Xiao

    Abstract: Many physical systems, such as water/electricity distribution networks, are monitored by battery-powered Wireless Sensor Networks (WSNs). Since battery replacement of sensor nodes is generally difficult, long-term monitoring can be only achieved if the operation of the WSN nodes contributes to a long WSN lifetime. Two prominent techniques to long WSN lifetime are i) optimal sensor activation and i…

    Submitted 26 April, 2017; originally announced April 2017.

    Comments: 14 pages, 4 figures, extended version of the one accepted by IEEE Transactions on Control of Network Systems