Skip to main content

Showing 1–39 of 39 results for author: Su, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.18089  [pdf, other

    cs.SD cs.MM eess.AS

    A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons

    Authors: Tzu-Yun Hung, Jui-Te Wu, Yu-Chia Kuo, Yo-Wei Hsiao, Ting-Wei Lin, Li Su

    Abstract: Expressive music synthesis (EMS) for violin performance is a challenging task due to the disagreement among music performers in the interpretation of expressive musical terms (EMTs), scarcity of labeled recordings, and limited generalization ability of the synthesis model. These challenges create trade-offs between model effectiveness, diversity of generated results, and controllability of the syn… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 15 pages, 2 figures, 3 tables

  2. arXiv:2406.06375  [pdf, other

    cs.SD cs.AI eess.AS

    MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing

    Authors: Yu-Fen Huang, Nikki Moran, Simon Coleman, Jon Kelly, Shun-Hwa Wei, Po-Yin Chen, Yun-Hsin Huang, Tsung-** Chen, Yu-Chia Kuo, Yu-Chi Wei, Chih-Hsuan Li, Da-Yu Huang, Hsuan-Kai Kao, Ting-Wei Lin, Li Su

    Abstract: In cross-modal music processing, translation between visual, auditory, and semantic content opens up new possibilities as well as challenges. The construction of such a transformative scheme depends upon a benchmark corpus with a comprehensive data infrastructure. In particular, the assembly of a large-scale cross-modal dataset presents major challenges. In this paper, we present the MOSA (Music m… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024. 14 pages, 7 figures. Dataset is available on: https://github.com/yufenhuang/MOSA-Music-mOtion-and-Semantic-Annotation-dataset/tree/main and https://zenodo.org/records/11393449

  3. Multi-Objective Optimization-based Transmit Beamforming for Multi-Target and Multi-User MIMO-ISAC Systems

    Authors: Chunwei Meng, Zhiqing Wei, Dingyou Ma, Wanli Ni, Liyan Su, Zhiyong Feng

    Abstract: Integrated sensing and communication (ISAC) is an enabling technology for the sixth-generation mobile communications, which equips the wireless communication networks with sensing capabilities. In this paper, we investigate transmit beamforming design for multiple-input and multiple-output (MIMO)-ISAC systems in scenarios with multiple radar targets and communication users. A general form of multi… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  4. arXiv:2403.07390  [pdf, other

    eess.IV cs.CV

    Learning Correction Errors via Frequency-Self Attention for Blind Image Super-Resolution

    Authors: Haochen Sun, Yan Yuan, Lijuan Su, Haotian Shao

    Abstract: Previous approaches for blind image super-resolution (SR) have relied on degradation estimation to restore high-resolution (HR) images from their low-resolution (LR) counterparts. However, accurate degradation estimation poses significant challenges. The SR model's incompatibility with degradation estimation methods, particularly the Correction Filter, may significantly impair performance as a res… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 16 pages

  5. arXiv:2312.17156  [pdf, other

    cs.SD eess.AS

    BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer

    Authors: Chih-Cheng Chang, Li Su

    Abstract: Many deep learning models have achieved dominant performance on the offline beat tracking task. However, online beat tracking, in which only the past and present input features are available, still remains challenging. In this paper, we propose BEAt tracking Streaming Transformer (BEAST), an online joint beat and downbeat tracking system based on the streaming Transformer. To deal with online scen… ▽ More

    Submitted 23 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  6. arXiv:2311.12488  [pdf, other

    eess.AS cs.SD

    Adapting pretrained speech model for Mandarin lyrics transcription and alignment

    Authors: Jun-You Wang, Chon-In Leong, Yu-Chen Lin, Li Su, Jyh-Shing Roger Jang

    Abstract: The tasks of automatic lyrics transcription and lyrics alignment have witnessed significant performance improvements in the past few years. However, most of the previous works only focus on English in which large-scale datasets are available. In this paper, we address lyrics transcription and alignment of polyphonic Mandarin pop music in a low-resource setting. To deal with the data scarcity issue… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: Accepted by ASRU 2023

  7. arXiv:2310.19198  [pdf

    q-bio.QM cs.LG eess.SP

    Enhancing Motor Imagery Decoding in Brain Computer Interfaces using Riemann Tangent Space Map** and Cross Frequency Coupling

    Authors: Xiong Xiong, Li Su, **guo Huang, Guixia Kang

    Abstract: Objective: Motor Imagery (MI) serves as a crucial experimental paradigm within the realm of Brain Computer Interfaces (BCIs), aiming to decoding motor intentions from electroencephalogram (EEG) signals. Method: Drawing inspiration from Riemannian geometry and Cross-Frequency Coupling (CFC), this paper introduces a novel approach termed Riemann Tangent Space Map** using Dichotomous Filter Bank wi… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: 22 pages, 7 figures

  8. arXiv:2307.06634  [pdf, ps, other

    eess.SP

    Coherent Compensation based ISAC Signal Processing for Long-range Sensing

    Authors: Lin Wang, Zhiqing Wei, Liyan Su, Zhiyong Feng, Huici Wu, Dongsheng Xue

    Abstract: Integrated sensing and communication (ISAC) will greatly enhance the efficiency of physical resource utilization. The design of ISAC signal based on the orthogonal frequency division multiplex (OFDM) signal is the mainstream. However, when detecting the long-range target, the delay of echo signal exceeds CP duration, which will result in inter-symbol interference (ISI) and inter-carrier interferen… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  9. arXiv:2305.20003  [pdf

    cs.LG eess.SY math.OC

    A Novel Black Box Process Quality Optimization Approach based on Hit Rate

    Authors: Yang Yang, Jian Wu, Xiangman Song, Derun Wu, Lijie Su, Lixin Tang

    Abstract: Hit rate is a key performance metric in predicting process product quality in integrated industrial processes. It represents the percentage of products accepted by downstream processes within a controlled range of quality. However, optimizing hit rate is a non-convex and challenging problem. To address this issue, we propose a data-driven quasi-convex approach that combines factorial hidden Markov… ▽ More

    Submitted 2 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  10. arXiv:2305.19956  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    MicroSegNet: A Deep Learning Approach for Prostate Segmentation on Micro-Ultrasound Images

    Authors: Hongxu Jiang, Muhammad Imran, Preethika Muralidharan, Anjali Patel, Jake Pensa, Muxuan Liang, Tarik Benidir, Joseph R. Grajo, Jason P. Joseph, Russell Terry, John Michael DiBianco, Li-Ming Su, Yuyin Zhou, Wayne G. Brisbane, Wei Shao

    Abstract: Micro-ultrasound (micro-US) is a novel 29-MHz ultrasound technique that provides 3-4 times higher resolution than traditional ultrasound, potentially enabling low-cost, accurate diagnosis of prostate cancer. Accurate prostate segmentation is crucial for prostate volume measurement, cancer diagnosis, prostate biopsy, and treatment planning. However, prostate segmentation on micro-US is challenging… ▽ More

    Submitted 25 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Computerized Medical Imaging and Graphics (2024): 102326

  11. arXiv:2305.19939  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Image Registration of In Vivo Micro-Ultrasound and Ex Vivo Pseudo-Whole Mount Histopathology Images of the Prostate: A Proof-of-Concept Study

    Authors: Muhammad Imran, Brianna Nguyen, Jake Pensa, Sara M. Falzarano, Anthony E. Sisk, Muxuan Liang, John Michael DiBianco, Li-Ming Su, Yuyin Zhou, Wayne G. Brisbane, Wei Shao

    Abstract: Early diagnosis of prostate cancer significantly improves a patient's 5-year survival rate. Biopsy of small prostate cancers is improved with image-guided biopsy. MRI-ultrasound fusion-guided biopsy is sensitive to smaller tumors but is underutilized due to the high cost of MRI and fusion equipment. Micro-ultrasound (micro-US), a novel high-resolution ultrasound technology, provides a cost-effecti… ▽ More

    Submitted 16 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  12. arXiv:2305.19023  [pdf, other

    q-bio.PE eess.SY

    Steady-state analysis of networked epidemic models

    Authors: Sei Zhen Khong, Lanlan Su

    Abstract: Compartmental epidemic models with dynamics that evolve over a graph network have gained considerable importance in recent years but analysis of these models is in general difficult due to their complexity. In this paper, we develop two positive feedback frameworks that are applicable to the study of steady-state values in a wide range of compartmental epidemic models, including both group and… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  13. arXiv:2304.05917  [pdf, other

    cs.SD cs.LG eess.AS

    A Phoneme-Informed Neural Network Model for Note-Level Singing Transcription

    Authors: Sangeon Yong, Li Su, Juhan Nam

    Abstract: Note-level automatic music transcription is one of the most representative music information retrieval (MIR) tasks and has been studied for various instruments to understand music. However, due to the lack of high-quality labeled data, transcription of many instruments is still a challenging task. In particular, in the case of singing, it is difficult to find accurate notes due to its expressivene… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted at ICASSP 2023

  14. arXiv:2206.01945  [pdf, other

    eess.SY math.OC

    On the exponential convergence of input-output signals of nonlinear feedback systems

    Authors: Lanlan Su, Di Zhao, Sei Zhen Khong

    Abstract: This note studies the exponential convergence of input-output signals of discrete-time nonlinear systems composed of a feedback interconnection of a linear time-invariant system and a nonlinear uncertainty. Both the open-loop subsystems are allowed to be unbounded. Integral-quadratic-constraint-based conditions are proposed for these uncertain feedback systems, including the Lurye type, to exhibit… ▽ More

    Submitted 12 June, 2024; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: This paper has been submitted to IEEE Transactions on Automatic Control

  15. arXiv:2112.07456  [pdf, other

    eess.SY

    On the Necessity and Sufficiency of Discrete-Time O'Shea-Zames-Falb Multipliers

    Authors: Lanlan Su, Peter Seiler, Joaquin Carrasco, Sei Zhen Khong

    Abstract: This paper considers the robust stability of a discrete-time Lurye system consisting of the feedback interconnection between a linear system and a bounded and monotone nonlinearity. It has been conjectured that the existence of a suitable linear time-invariant (LTI) O'Shea-Zames-Falb multiplier is not only sufficient but also necessary. Roughly speaking, a successful proof of the conjecture would… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 25 Pages

  16. arXiv:2110.12855  [pdf, other

    cs.SD cs.HC cs.LG cs.MM eess.AS

    Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience

    Authors: Wei-Tsung Lu, Meng-Hsuan Wu, Yuh-Ming Chiu, Li Su

    Abstract: The subjective evaluation of music generation techniques has been mostly done with questionnaire-based listening tests while ignoring the perspectives from music composition, arrangement, and soundtrack editing. In this paper, we propose an editing test to evaluate users' editing experience of music generation models in a systematic way. To do this, we design a new music style transfer model combi… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 9 pages, Proceedings of the 29th ACM International Conference on Multimedia

  17. arXiv:2107.04954  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data

    Authors: Kin Wai Cheuk, Dorien Herremans, Li Su

    Abstract: Most of the current supervised automatic music transcription (AMT) models lack the ability to generalize. This means that they have trouble transcribing real-world music recordings from diverse musical genres that are not presented in the labelled training data. In this paper, we propose a semi-supervised framework, ReconVAT, which solves this issue by leveraging the huge amount of available unlab… ▽ More

    Submitted 29 July, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

    Comments: Accepted in ACMMM 21. Camera ready version

  18. arXiv:2106.00497  [pdf, ps, other

    cs.SD cs.AI eess.AS

    Omnizart: A General Toolbox for Automatic Music Transcription

    Authors: Yu-Te Wu, Yin-Jyun Luo, Tsung-** Chen, I-Chieh Wei, Jui-Yang Hsu, Yi-Chin Chuang, Li Su

    Abstract: We present and release Omnizart, a new Python library that provides a streamlined solution to automatic music transcription (AMT). Omnizart encompasses modules that construct the life-cycle of deep learning-based AMT, and is designed for ease of use with a compact command-line interface. To the best of our knowledge, Omnizart is the first transcription toolkit which offers models covering a wide c… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

  19. arXiv:2011.10947  [pdf, other

    cs.CR eess.SP

    Who is in Control? Practical Physical Layer Attack and Defense for mmWave based Sensing in Autonomous Vehicles

    Authors: Zhi Sun, Sarankumar Balakrishnan, Lu Su, Arupjyoti Bhuyan, Pu Wang, Chunming Qiao

    Abstract: With the wide bandwidths in millimeter wave (mmWave) frequency band that results in unprecedented accuracy, mmWave sensing has become vital for many applications, especially in autonomous vehicles (AVs). In addition, mmWave sensing has superior reliability compared to other sensing counterparts such as camera and LiDAR, which is essential for safety-critical driving. Therefore, it is critical to u… ▽ More

    Submitted 22 November, 2020; originally announced November 2020.

  20. arXiv:2010.12196  [pdf, other

    eess.AS cs.SD

    Toward Expressive Singing Voice Correction: On Perceptual Validity of Evaluation Metrics for Vocal Melody Extraction

    Authors: Yin-Jyun Luo, Yuen-Jen Lin, Li Su

    Abstract: Singing voice correction (SVC) is an appealing application for amateur singers. Commercial products automate SVC by snap** pitch contours to equal-tempered scales, which could lead to deadpan modifications. Together with the neglect of rhythmic errors, extensive manual corrections are still necessary. In this paper, we present a streamlined system to automate expressive SVC for both pitch and rh… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: Submitted to ICASSP 2021

  21. arXiv:2009.13574  [pdf, other

    eess.SY

    Robust Monotonic Convergent Iterative Learning Control Design: an LMI-based Method

    Authors: Lanlan Su

    Abstract: This work investigates robust monotonic convergent iterative learning control (ILC) for uncertain linear systems in both time and frequency domains, and the ILC algorithm optimizing the convergence speed in terms of $l_{2}$ norm of error signals is derived. Firstly, it is shown that the robust monotonic convergence of the ILC system can be established equivalently by the positive definiteness of a… ▽ More

    Submitted 15 January, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

  22. arXiv:2009.13571  [pdf, other

    eess.SY

    On the Necessity and Sufficiency of the Zames-Falb Multipliers for Bounded Operators

    Authors: Sei Zhen Khong, Lanlan Su

    Abstract: This paper analyzes the robust feedback stability of a single-input-single-output stable linear time-invariant (LTI) system against four different classes of nonlinear systems using the Zames-Falb multipliers. The contribution is fourfold. Firstly, we present a generalised S-procedure lossless theorem that involves a countably infinite number of quadratic forms. Secondly, we identify a class of un… ▽ More

    Submitted 18 August, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

  23. arXiv:2009.08015  [pdf, other

    cs.MM cs.AI cs.SD eess.AS eess.IV

    Temporally Guided Music-to-Body-Movement Generation

    Authors: Hsuan-Kai Kao, Li Su

    Abstract: This paper presents a neural network model to generate virtual violinist's 3-D skeleton movements from music audio. Improved from the conventional recurrent neural network models for generating 2-D skeleton data in previous works, the proposed model incorporates an encoder-decoder architecture, as well as the self-attention mechanism to model the complicated dynamics in body movement sequences. To… ▽ More

    Submitted 16 September, 2020; originally announced September 2020.

  24. arXiv:2008.06358  [pdf, other

    eess.AS cs.SD

    Semi-supervised learning using teacher-student models for vocal melody extraction

    Authors: Sangeun Kum, **g-Hua Lin, Li Su, Juhan Nam

    Abstract: The lack of labeled data is a major obstacle in many music information retrieval tasks such as melody extraction, where labeling is extremely laborious or costly. Semi-supervised learning (SSL) provides a solution to alleviate the issue by leveraging a large amount of unlabeled data. In this paper, we propose an SSL method using teacher-student models for vocal melody extraction. The teacher model… ▽ More

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: 8 pages, 5 figures, accepted for the 21st International Society for Music Information Retrieval Conference (ISMIR 2020)

  25. Road Grade Estimation Using Crowd-Sourced Smartphone Data

    Authors: Abhishek Gupta, Shaohan Hu, Weida Zhong, Adel Sadek, Lu Su, Chunming Qiao

    Abstract: Estimates of road grade/slope can add another dimension of information to existing 2D digital road maps. Integration of road grade information will widen the scope of digital map's applications, which is primarily used for navigation, by enabling driving safety and efficiency applications such as Advanced Driver Assistance Systems (ADAS), eco-driving, etc. The huge scale and dynamic nature of road… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: Proceedings of 19th ACM/IEEE Conference on Information Processing in Sensor Networks (IPSN'20)

  26. arXiv:2002.01788  [pdf

    physics.optics eess.SP

    Learning Enabled Dense Space-division Multiplexing through a Single Multimode Fibre

    Authors: Pengfei Fan, Michael Ruddlesden, Yufei Wang, Luming Zhao, Chao Lu, Lei Su

    Abstract: Space-division multiplexing is a promising technology in optical fibre communication to improve the transmission capacity of a single optical fibre. However, the number of channels that can be multiplexed is limited by the crosstalks between channels, and the multiplexing is only applied to few-mode or multi-core fibres. Here, we propose a high-spatial-density channel multiplexing framework employ… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

  27. Analysis of Two-Dimensional Feedback Systems over Networks Using Dissipativity

    Authors: Yang Yan, Lanlan Su, Vijay Gupta, Panos Antsaklis

    Abstract: This paper investigates the closed-loop $\mathcal{L}_2$ stability of two-dimensional (2-D) feedback systems across a digital communication network by introducing the tool of dissipativity. First, sampling of a continuous 2-D system is considered and an analytical characterization of the $QSR$-dissipativity of the sampled system is presented. Next, the input-feedforward output-feedback passivity (I… ▽ More

    Submitted 5 August, 2019; originally announced August 2019.

    Comments: 13 pages, 7 figures

  28. arXiv:1907.13024  [pdf, other

    eess.SY cs.IT math.OC

    Stabilization of Linear Systems Across a Time-Varying AWGN Fading Channel

    Authors: Lanlan Su, Vijay Gupta, Graziano Chesi

    Abstract: This technical note investigates the minimum average transmit power required for mean-square stabilization of a discrete-time linear process across a time-varying additive white Gaussian noise (AWGN) fading channel that is presented between the sensor and the controller. We assume channel state information at both the transmitter and the receiver, and allow the transmit power to vary with the chan… ▽ More

    Submitted 31 July, 2019; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: 6 pages, 2 figures

  29. arXiv:1907.13003  [pdf, other

    cs.MA eess.SY math.OC

    Distributed Resource Allocation over Time-varying Balanced Digraphs with Discrete-time Communication

    Authors: Lanlan Su, Mengmou Li, Vijay Gupta, Graziano Chesi

    Abstract: This work is concerned with the problem of distributed resource allocation in continuous-time setting but with discrete-time communication over infinitely jointly connected and balanced digraphs. We provide a passivity-based perspective for the continuous-time algorithm, based on which an intermittent communication scheme is developed. Particularly, a periodic communication scheme is first derived… ▽ More

    Submitted 15 January, 2021; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: 12 pages, 7 figures

  30. arXiv:1907.12988  [pdf, other

    eess.SY math.OC

    Feedback Passivation of Linear Systems with Fixed-Structured Controllers

    Authors: Lanlan Su, Vijay Gupta, Panos Antsaklis

    Abstract: This paper addresses the problem of designing an optimal output feedback controller with a specified controller structure for linear time-invariant (LTI) systems to maximize the passivity level for the closed-loop system, in both continuous-time (CT) and discrete-time (DT). Specifically, the set of controllers under consideration is linearly parameterized with constrained parameters. Both input fe… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: 8 pages, 1 figure

  31. arXiv:1902.00539  [pdf, other

    eess.AS cs.SD eess.SP

    Multi-layered Cepstrum for Instantaneous Frequency Estimation

    Authors: Chin-Yun Yu, Li Su

    Abstract: We propose the multi-layered cepstrum (MLC) method to estimate multiple fundamental frequencies (MF0) of a signal under challenging contamination such as high-pass filter noise. Taking the operation of cepstrum (i.e., Fourier transform, filtering, and nonlinear activation) recursively, MLC is shown as an efficient method to enhance MF0 saliency in a step-by-step manner. Evaluation on a real-world… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

    Comments: In 2018 6th IEEE Global Conference on Signal and Information Processing

  32. arXiv:1811.12214  [pdf, other

    cs.SD eess.AS

    Play as You Like: Timbre-enhanced Multi-modal Music Style Transfer

    Authors: Chien-Yu Lu, Min-Xin Xue, Chia-Che Chang, Che-Rung Lee, Li Su

    Abstract: Style transfer of polyphonic music recordings is a challenging task when considering the modeling of diverse, imaginative, and reasonable music pieces in the style different from their original one. To achieve this, learning stable multi-modal representations for both domain-variant (i.e., style) and domain-invariant (i.e., content) information of music in an unsupervised manner is critical. In th… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.

  33. arXiv:1810.12947  [pdf, other

    eess.AS cs.SD

    A Streamlined Encoder/Decoder Architecture for Melody Extraction

    Authors: Tsung-Han Hsieh, Li Su, Yi-Hsuan Yang

    Abstract: Melody extraction in polyphonic musical audio is important for music signal processing. In this paper, we propose a novel streamlined encoder/decoder network that is designed for the task. We make two technical contributions. First, drawing inspiration from a state-of-the-art model for semantic pixel-wise segmentation, we pass through the pooling indices between pooling and un-pooling layers to lo… ▽ More

    Submitted 18 February, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: This is a pre-print version of an ICASSP 2019 paper

  34. arXiv:1810.12764  [pdf

    eess.IV physics.optics

    Single-shot image retrieval through a multimode fiber using a genetic algorithm

    Authors: Michael Ruddlesden, **shuai Zhang, Tianrui Zhao, Wen Wang, Lei Su

    Abstract: In this letter, we present a genetic algorithm-based approach for image retrieval through a multimode fiber in a reference-less system. Due to mode interference, when an image is illuminated at one side of a multimode fiber, the transmitted light forms a noise-like speckle pattern at the other end. With the use of a prior-measured transmission matrix of the fiber, a speckle pattern is calculated u… ▽ More

    Submitted 26 October, 2018; originally announced October 2018.

  35. arXiv:1810.10086  [pdf, ps, other

    eess.SY

    Finite-time Guarantees for Byzantine-Resilient Distributed State Estimation with Noisy Measurements

    Authors: Lili Su, Shahin Shahrampour

    Abstract: This work considers resilient, cooperative state estimation in unreliable multi-agent networks. A network of agents aims to collaboratively estimate the value of an unknown vector parameter, while an {\em unknown} subset of agents suffer Byzantine faults. Faulty agents malfunction arbitrarily and may send out {\em highly unstructured} messages to other agents in the network. As opposed to fault-fr… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

  36. arXiv:1809.06970  [pdf, other

    cs.LG cs.NI cs.PF eess.SY stat.ML

    FastDeepIoT: Towards Understanding and Optimizing Neural Network Execution Time on Mobile and Embedded Devices

    Authors: Shuochao Yao, Yiran Zhao, Huajie Shao, Shengzhong Liu, Dongxin Liu, Lu Su, Tarek Abdelzaher

    Abstract: Deep neural networks show great potential as solutions to many sensing application problems, but their excessive resource demand slows down execution time, pausing a serious impediment to deployment on low-end devices. To address this challenge, recent literature focused on compressing neural network size to improve performance. We show that changing neural network size does not proportionally aff… ▽ More

    Submitted 18 September, 2018; originally announced September 2018.

    Comments: Accepted by SenSys '18

  37. arXiv:1804.09202  [pdf, other

    cs.SD eess.AS

    Vocal melody extraction using patch-based CNN

    Authors: Li Su

    Abstract: A patch-based convolutional neural network (CNN) model presented in this paper for vocal melody extraction in polyphonic music is inspired from object detection in image processing. The input of the model is a novel time-frequency representation which enhances the pitch contours and suppresses the harmonic components of a signal. This succinct data representation and the patch-based CNN model enab… ▽ More

    Submitted 24 April, 2018; originally announced April 2018.

    Journal ref: Proc. Int. Conf. Acoustic, Speech and Signal Processing (ICASSP), 2018

  38. arXiv:1711.08600  [pdf, other

    eess.AS

    Singing voice correction using canonical time war**

    Authors: Yin-Jyun Luo, Ming-Tso Chen, Tai-Shih Chi, Li Su

    Abstract: Expressive singing voice correction is an appealing but challenging problem. A robust time-war** algorithm which synchronizes two singing recordings can provide a promising solution. We thereby propose to address the problem by canonical time war** (CTW) which aligns amateur singing recordings to professional ones. A new pitch contour is generated given the alignment information, and a pitch-c… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

  39. arXiv:1705.03955  [pdf

    eess.SY

    VehSense: Slippery Road Detection Using Smartphones

    Authors: Yunfei Hou, Abhishek Gupta, Tong Guan, Shaohan Hu, Lu Su, Chunming Qiao

    Abstract: This paper investigates a new application of vehicular sensing: detecting and reporting the slippery road conditions. We describe a system and associated algorithm to monitor vehicle skidding events using smartphones and OBD-II (On board Diagnostics) adaptors. This system, which we call the VehSense, gathers data from smartphone inertial sensors and vehicle wheel speed sensors, and processes the d… ▽ More

    Submitted 10 May, 2017; originally announced May 2017.

    Comments: 2017 IEEE 85th Vehicular Technology Conference (VTC2017-Spring)