Showing 1–39 of 39 results for author: yin, Y

Searching in archive eess.
  1. arXiv:2406.18536  [pdf, other]

    eess.SY cs.AI cs.AR

    Reliable Interval Prediction of Minimum Operating Voltage Based on On-chip Monitors via Conformalized Quantile Regression

    Authors: Yuxuan Yin, Xiaoxiao Wang, Rebecca Chen, Chen He, Peng Li

    Abstract: Predicting the minimum operating voltage ($V_{min}$) of chips is one of the important techniques for improving the manufacturing testing flow, as well as ensuring the long-term reliability and safety of in-field systems. Current $V_{min}$ prediction methods often provide only point estimates, necessitating additional techniques for constructing prediction confidence intervals to cover uncertaintie…

    Submitted 3 May, 2024; originally announced June 2024.

    Comments: Accepted by DATE 2024. Camera-ready version

  2. arXiv:2405.11895  [pdf, other]

    cs.LG eess.SY

    Sparse Attention-driven Quality Prediction for Production Process Optimization in Digital Twins

    Authors: Yanlei Yin, Lihua Wang, Wenbo Wang, Dinh Thai Hoang

    Abstract: In the process industry, optimizing production lines for long-term efficiency requires real-time monitoring and analysis of operation states to fine-tune production line parameters. However, the complexity in operational logic and the intricate coupling of production process parameters make it difficult to develop an accurate mathematical model for the entire process, thus hindering the deployment…

    Submitted 20 May, 2024; originally announced May 2024.

  3. arXiv:2403.19470  [pdf, other]

    math.NA cs.LG eess.SP

    Deep decomposition method for the limited aperture inverse obstacle scattering problem

    Authors: Yunwen Yin, Liang Yan

    Abstract: In this paper, we consider a deep learning approach to the limited aperture inverse obstacle scattering problem. It is well known that traditional deep learning relies solely on data, which may limit its performance for the inverse problem when only indirect observation data and a physical model are available. A fundamental question arises in light of these limitations: is it possible to enable de…

    Submitted 28 March, 2024; originally announced March 2024.

  4. arXiv:2310.06930  [pdf, other]

    cs.SD cs.LG eess.AS

    Prosody Analysis of Audiobooks

    Authors: Charuta Pethe, Yunting Yin, Steven Skiena

    Abstract: Recent advances in text-to-speech have made it possible to generate natural-sounding audio from text. However, audiobook narrations involve dramatic vocalizations and intonations by the reader, with greater reliance on emotions, dialogues, and descriptions in the narrative. Using our dataset of 93 aligned book-audiobook pairs, we present improved models for prosody prediction properties (pitch, vo…

    Submitted 10 October, 2023; originally announced October 2023.

  5. arXiv:2309.02418  [pdf, other]

    eess.AS cs.SD eess.SP

    Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition

    Authors: Minh Tran, Yufeng Yin, Mohammad Soleymani

    Abstract: There are individual differences in expressive behaviors driven by cultural norms and personality. This between-person variation can result in reduced emotion recognition performance. Therefore, personalization is an important step in improving the generalization and robustness of speech emotion recognition. In this paper, to achieve unsupervised personalized emotion recognition, we first pre-trai…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted by INTERSPEECH 2023

  6. arXiv:2308.05975  [pdf]

    eess.IV

    A Self-supervised SAR Image Despeckling Strategy Based on Parameter-sharing Convolutional Neural Networks

    Authors: Liang Chen, Yifei Yin, Hao Shi, Qingqing Sheng, Wei Li

    Abstract: Speckle noise is generated due to the SAR imaging mechanism, which brings difficulties in SAR image interpretation. Hence, despeckling is a helpful step in SAR pre-processing. Nowadays, deep learning has been proved to be a progressive method for SAR image despeckling. Most deep learning methods for despeckling are based on supervised learning, which needs original SAR images and speckle-free SAR…

    Submitted 11 August, 2023; originally announced August 2023.

  7. arXiv:2306.15530  [pdf, other]

    eess.SY

    Fast and Automatic 3D Modeling of Antenna Structure Using CNN-LSTM Network for Efficient Data Generation

    Authors: Zhaohui Wei, Zhao Zhou, Peng Wang, Jian Ren, Yingzeng Yin, Gert Frølund Pedersen, Ming Shen

    Abstract: Deep learning-assisted antenna design methods such as surrogate models have gained significant popularity in recent years due to their potential to greatly increase design efficiencies by replacing the time-consuming full-wave electromagnetic (EM) simulations. However, a large number of training data with sufficiently diverse and representative samples (antenna structure parameters, scattering pro…

    Submitted 27 June, 2023; originally announced June 2023.

  8. Robust and Efficient Fault Diagnosis of mm-Wave Active Phased Arrays using Baseband Signal

    Authors: Martin H. Nielsen, Yufeng Zhang, Changbin Xue, Jian Ren, Yingzeng Yin, Ming Shen, Gert F. Pedersen

    Abstract: One key communication block in 5G and 6G radios is the active phased array (APA). To ensure reliable operation, efficient and timely fault diagnosis of APAs on-site is crucial. To date, fault diagnosis has relied on measurement of frequency domain radiation patterns using costly equipment and multiple strictly controlled measurement probes, which are time-consuming, complex, and therefore infeasib…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 10 pages

    Journal ref: in IEEE Transactions on Antennas and Propagation, vol. 70, no. 7, pp. 5044-5053, July 2022

  9. arXiv:2302.03033  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Exemplars and Counterexemplars Explanations for Image Classifiers, Targeting Skin Lesion Labeling

    Authors: Carlo Metta, Riccardo Guidotti, Yuan Yin, Patrick Gallinari, Salvatore Rinzivillo

    Abstract: Explainable AI consists in developing mechanisms allowing for an interaction between decision systems and humans by making the decisions of the former understandable. This is particularly important in sensitive contexts like in the medical domain. We propose a use case study, for skin lesion diagnosis, illustrating how it is possible to provide the practitioner with explanations on the decisions…

    Submitted 18 January, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2111.11863

    Journal ref: 2021 IEEE Symposium on Computers and Communications (ISCC)

  10. arXiv:2210.08868  [pdf, other]

    eess.IV cs.CV

    Cerebrovascular Segmentation via Vessel Oriented Filtering Network

    Authors: Zhanqiang Guo, Yao Luan, Jianjiang Feng, Wangsheng Lu, Yin Yin, Guangming Yang, Jie Zhou

    Abstract: Accurate cerebrovascular segmentation from Magnetic Resonance Angiography (MRA) and Computed Tomography Angiography (CTA) is of great significance in diagnosis and treatment of cerebrovascular pathology. Due to the complexity and topology variability of blood vessels, complete and accurate segmentation of vascular network is still a challenge. In this paper, we proposed a Vessel Oriented Filtering…

    Submitted 17 October, 2022; originally announced October 2022.

  11. arXiv:2206.05687  [pdf, other]

    cs.HC cs.CV eess.IV

    DRNet: Decomposition and Reconstruction Network for Remote Physiological Measurement

    Authors: Yuhang Dong, Gongping Yang, Yilong Yin

    Abstract: Remote photoplethysmography (rPPG) based physiological measurement has great application values in affective computing, non-contact health monitoring, telehealth monitoring, etc, which has become increasingly important especially during the COVID-19 pandemic. Existing methods are generally divided into two groups. The first focuses on mining the subtle blood volume pulse (BVP) signals from face vi…

    Submitted 20 June, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

  12. arXiv:2206.03596  [pdf, other]

    cs.LG cs.CV eess.IV

    Neural Network Compression via Effective Filter Analysis and Hierarchical Pruning

    Authors: Ziqi Zhou, Li Lian, Yilong Yin, Ze Wang

    Abstract: Network compression is crucial to making deep networks more efficient, faster, and generalizable to low-end hardware. Current network compression methods have two open problems: first, there is no theoretical framework to estimate the maximum compression rate; second, some layers may get over-pruned, resulting in a significant drop in network performance. To solve these two problems, this…

    Submitted 7 June, 2022; originally announced June 2022.

  13. arXiv:2206.00901  [pdf]

    cs.SD eess.AS

    Musical Instrument Recognition by XGBoost Combining Feature Fusion

    Authors: Yijie Liu, Yanfang Yin, Qigang Zhu, Wenzhuo Cui

    Abstract: Musical instrument classification is one of the focuses of Music Information Retrieval (MIR). In order to solve the problem of poor performance of current musical instrument classification models, we propose a musical instrument classification algorithm based on multi-channel feature fusion and XGBoost. Based on audio feature extraction and fusion of the dataset, the features are input into the XG…

    Submitted 2 June, 2022; originally announced June 2022.

  14. Real-Time Cross-Fleet Pareto-Improving Truck Platoon Coordination

    Authors: Alexander Johansson, Jonas Mårtensson, Xiaotong Sun, Yafeng Yin

    Abstract: This paper studies a multi-fleet platoon coordination system in transport networks that deploy hubs to form trucks into platoons. The trucks belong to different fleets that are interested in increasing their profits by platooning across fleets. The profit of each fleet incorporates platooning rewards and costs for waiting at hubs. Each truck has a fixed route and a waiting time budget to spend at…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: ITSC 2021

  15. arXiv:2112.07415  [pdf, ps, other]

    eess.IV cs.AI cs.CV

    Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration

    Authors: Ziwei Luo, Jing Hu, Xin Wang, Shu Hu, Bin Kong, Youbing Yin, Qi Song, Xi Wu, Siwei Lyu

    Abstract: Large deformations of organs, caused by diverse shapes and nonlinear shape changes, pose a significant challenge for medical image registration. Traditional registration methods need to iteratively optimize an objective function via a specific deformation model along with meticulous parameter tuning, but have limited capabilities in registering images with large deformations. While deep lear…

    Submitted 30 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI 2022

  16. Integrated Decision and Control at Multi-Lane Intersections with Mixed Traffic Flow

    Authors: Jianhua Jiang, Yangang Ren, Yang Guan, Shengbo Eben Li, Yuming Yin, ** **

    Abstract: Autonomous driving at intersections is one of the most complicated and accident-prone traffic scenarios, especially with mixed traffic participants such as vehicles, bicycles and pedestrians. The driving policy should make safe decisions to handle the dynamic traffic conditions and meet the requirements of on-board computation. However, most current research focuses on simplified intersec…

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: 8 pages, 10 figures, 11 equations and 14 conferences

  17. arXiv:2107.11517  [pdf, other]

    eess.IV cs.CV cs.LG

    Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing Vertical and Horizontal Convolutions

    Authors: Qian Yu, Lei Qi, Luping Zhou, Lei Wang, Yilong Yin, Yinghuan Shi, Wuzhang Wang, Yang Gao

    Abstract: Accurate image segmentation plays a crucial role in medical image analysis, yet it faces great challenges of various shapes, diverse sizes, and blurry boundaries. To address these difficulties, square kernel-based encoder-decoder architecture has been proposed and widely used, but its performance remains unsatisfactory. To further cope with these challenges, we present a novel double-branch…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: 13 pages, 12 figures

    MSC Class: 68T07 ACM Class: I.4.6

  18. arXiv:2107.11318  [pdf, other]

    eess.SY

    Heuristics for Customer-focused Ride-pooling Assignment

    Authors: Alexander Sundt, Qi Luo, John Vincent, Mehrdad Shahabi, Yafeng Yin

    Abstract: Ride-pooling has become an important service option offered by ride-hailing platforms as it serves multiple trip requests in a single ride. By leveraging customer data, connected vehicles, and efficient assignment algorithms, ride-pooling can be a critical instrument to address driver shortages and mitigate the negative externalities of ride-hailing operations. Recent literature has focused on com…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: 13 pages, 8 figures, 4 tables

  19. arXiv:2103.05505  [pdf]

    eess.SY cs.LG

    Approximate Optimal Filter for Linear Gaussian Time-invariant Systems

    Authors: Kaiming Tang, Shengbo Eben Li, Yuming Yin, Yang Guan, Jingliang Duan, Wenhan Cao, Jie Li

    Abstract: State estimation is critical to control systems, especially when the states cannot be directly measured. This paper presents an approximate optimal filter, which enables the use of the policy iteration technique to obtain the steady-state gain in linear Gaussian time-invariant systems. This design transforms the optimal filtering problem with minimum mean square error into an optimal control problem, call…

    Submitted 9 March, 2021; originally announced March 2021.

  20. arXiv:2103.00714  [pdf]

    eess.IV physics.med-ph

    Diffusion-weighted MRI-guided needle biopsies permit quantitative tumor heterogeneity assessment and cell load estimation

    Authors: Yi Yin, Kai Breuhahn, Hans-Ulrich Kauczor, Oliver Sedlaczek, Irene E. Vignon-Clementel, Dirk Drasdo

    Abstract: Quantitative information on tumor heterogeneity and cell load could assist in designing effective and refined personalized treatment strategies. It was recently shown by us that such information can be inferred from the diffusion parameter D derived from the diffusion-weighted MRI (DWI) if a relation between D and cell density can be established. However, such relation cannot a priori be assumed t…

    Submitted 28 February, 2021; originally announced March 2021.

  21. arXiv:2103.00430  [pdf, other]

    cs.CV cs.LG eess.IV

    Training Generative Adversarial Networks in One Stage

    Authors: Chengchao Shen, Youtan Yin, Xinchao Wang, Xubin Li, Jie Song, Mingli Song

    Abstract: Generative Adversarial Networks (GANs) have demonstrated unprecedented success in various image generation tasks. The encouraging results, however, come at the price of a cumbersome training process, during which the generator and discriminator are alternately updated in two stages. In this paper, we investigate a general training scheme that enables training GANs efficiently in only one stage. Ba…

    Submitted 16 June, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021

  22. arXiv:2102.11736  [pdf, other]

    eess.SY cs.AI

    Recurrent Model Predictive Control

    Authors: Zhengyu Liu, Jingliang Duan, Wenxuan Wang, Shengbo Eben Li, Yuming Yin, Ziyu Lin, Qi Sun, Bo Cheng

    Abstract: This paper proposes an off-line algorithm, called Recurrent Model Predictive Control (RMPC), to solve general nonlinear finite-horizon optimal control problems. Unlike traditional Model Predictive Control (MPC) algorithms, it can make full use of the current computing resources and adaptively select the longest model prediction horizon. Our algorithm employs a recurrent function to approximate the…

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.10289

  23. Recurrent Model Predictive Control: Learning an Explicit Recurrent Controller for Nonlinear Systems

    Authors: Zhengyu Liu, Jingliang Duan, Wenxuan Wang, Shengbo Eben Li, Yuming Yin, Ziyu Lin, Bo Cheng

    Abstract: This paper proposes an offline control algorithm, called Recurrent Model Predictive Control (RMPC), to solve large-scale nonlinear finite-horizon optimal control problems. It can be regarded as an explicit solver of traditional Model Predictive Control (MPC) algorithms, which can adaptively select appropriate model prediction horizon according to current computing resources, so as to improve the p…

    Submitted 8 April, 2022; v1 submitted 20 February, 2021; originally announced February 2021.

    Journal ref: IEEE Transactions on Industrial Electronics, 2022

  24. arXiv:2012.10716  [pdf, other]

    cs.LG cs.AI eess.SY

    Model-Based Actor-Critic with Chance Constraint for Stochastic System

    Authors: Baiyu Peng, Yao Mu, Yang Guan, Shengbo Eben Li, Yuming Yin, Jianyu Chen

    Abstract: Safety is essential for reinforcement learning (RL) applied in real-world situations. Chance constraints are suitable to represent the safety requirements in stochastic systems. Previous chance-constrained RL methods usually have a low convergence rate, or only learn a conservative policy. In this paper, we propose a model-based chance constrained actor-critic (CCAC) algorithm which can efficientl…

    Submitted 16 March, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

  25. arXiv:2012.05509  [pdf]

    eess.IV cs.CV cs.LG

    COVID-MTL: Multitask Learning with Shift3D and Random-weighted Loss for Automated Diagnosis and Severity Assessment of COVID-19

    Authors: Guoqing Bao, Huai Chen, Tongliang Liu, Guanzhong Gong, Yong Yin, Lisheng Wang, Xiuying Wang

    Abstract: There is an urgent need for automated methods to assist accurate and effective assessment of COVID-19. Radiology and nucleic acid test (NAT) are complementary COVID-19 diagnosis methods. In this paper, we present an end-to-end multitask learning (MTL) framework (COVID-MTL) that is capable of automated and simultaneous detection (against both radiology and NAT) and severity assessment of COVID-19.…

    Submitted 31 December, 2020; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: COVID-19 research; computer vision and pattern recognition; 13 pages, 10 figures and 5 tables

  26. arXiv:2009.12812  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    TernaryBERT: Distillation-aware Ultra-low Bit BERT

    Authors: Wei Zhang, Lu Hou, Yichun Yin, Lifeng Shang, Xiao Chen, Xin Jiang, Qun Liu

    Abstract: Transformer-based pre-training models like BERT have achieved remarkable performance in many natural language processing tasks. However, these models are both computation and memory expensive, hindering their deployment to resource-constrained devices. In this work, we propose TernaryBERT, which ternarizes the weights in a fine-tuned BERT model. Specifically, we use both approximation-based and los…

    Submitted 10 October, 2020; v1 submitted 27 September, 2020; originally announced September 2020.

    Comments: Accepted by EMNLP 2020

  27. arXiv:2008.02492  [pdf, other]

    cs.CV cs.LG eess.IV

    Zero-Shot Multi-View Indoor Localization via Graph Location Networks

    Authors: Meng-Jiun Chiou, Zhenguang Liu, Yifang Yin, Anan Liu, Roger Zimmermann

    Abstract: Indoor localization is a fundamental problem in location-based applications. Current approaches to this problem typically rely on Radio Frequency technology, which requires not only supporting infrastructures but human efforts to measure and calibrate the signal. Moreover, data collection for all locations is indispensable in existing methods, which in turn hinders their large-scale deployment. In…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted at ACM MM 2020. 10 pages, 7 figures. Code and datasets available at https://github.com/coldmanck/zero-shot-indoor-localization-release

    ACM Class: I.2.10

    Journal ref: Proceedings of the 28th ACM International Conference on Multimedia, 2020

  28. arXiv:2007.06810  [pdf]

    eess.SY cs.GT cs.LG

    Ternary Policy Iteration Algorithm for Nonlinear Robust Control

    Authors: Jie Li, Shengbo Eben Li, Yang Guan, Jingliang Duan, Wenyu Li, Yuming Yin

    Abstract: The uncertainties in plant dynamics remain a challenge for nonlinear control problems. This paper develops a ternary policy iteration (TPI) algorithm for solving nonlinear robust control problems with bounded uncertainties. The controller and uncertainty of the system are considered as game players, and the robust control problem is formulated as a two-player zero-sum differential game. In order t…

    Submitted 14 July, 2020; originally announced July 2020.

  29. arXiv:2007.02070  [pdf, other]

    eess.SY

    Continuous-time finite-horizon ADP for automated vehicle controller design with high efficiency

    Authors: Ziyu Lin, Jingliang Duan, Shengbo Eben Li, Haitong Ma, Yuming Yin

    Abstract: The design of an automated vehicle controller can be generally formulated into an optimal control problem. This paper proposes a continuous-time finite-horizon approximate dynamic programming (ADP) method, which can synthesize off-line near-optimal control policy with analytical vehicle dynamics. Lying on the general Policy Iteration framework, it employs value and policy neural networks to approxima…

    Submitted 4 July, 2020; originally announced July 2020.

    Comments: 7 pages, conference

  30. arXiv:2006.08599  [pdf, other]

    cs.CL cs.SD eess.AS

    "Notic My Speech" -- Blending Speech Patterns With Multimedia

    Authors: Dhruva Sahrawat, Yaman Kumar, Shashwat Aggarwal, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann

    Abstract: Speech as a natural signal is composed of three parts - visemes (visual part of speech), phonemes (spoken part of speech), and language (the imposed structure). However, video as a medium for the delivery of speech and a multimedia construct has mostly ignored the cognitive aspects of speech delivery. For example, video applications like transcoding and compression have till now ignored the fact h…

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: Under Review

  31. arXiv:2005.08497  [pdf, other]

    eess.AS cs.CL cs.SD

    Attention-based Transducer for Online Speech Recognition

    Authors: Bin Wang, Yan Yin, Hui Lin

    Abstract: Recent studies reveal the potential of recurrent neural network transducer (RNN-T) for end-to-end (E2E) speech recognition. Among some most popular E2E systems including RNN-T, Attention Encoder-Decoder (AED), and Connectionist Temporal Classification (CTC), RNN-T has some clear advantages given that it supports streaming recognition and does not have frame-independency assumption. Although signif…

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: submitted to Interspeech 2020

  32. arXiv:2004.13577  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Unifying Neural Learning and Symbolic Reasoning for Spinal Medical Report Generation

    Authors: Zhongyi Han, Benzheng Wei, Yilong Yin, Shuo Li

    Abstract: Automated medical report generation in spine radiology, i.e., given spinal medical images and directly create radiologist-level diagnosis reports to support clinical decision making, is a novel yet fundamental study in the domain of artificial intelligence in healthcare. However, it is incredibly challenging because it is an extremely complicated task that involves visual perception and high-level…

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: Under review

  33. arXiv:2002.02909  [pdf, other]

    cs.CV cs.LG eess.IV

    Domain Embedded Multi-model Generative Adversarial Networks for Image-based Face Inpainting

    Authors: Xian Zhang, Xin Wang, Bin Kong, Canghong Shi, Youbing Yin, Qi Song, Siwei Lyu, Jiancheng Lv, Canghong Shi, Xiaojie Li

    Abstract: Prior knowledge of face shape and structure plays an important role in face inpainting. However, traditional face inpainting methods mainly focus on the generated image resolution of the missing portion without consideration of the special particularities of the human face explicitly and generally produce discordant facial parts. To solve this problem, we present a domain embedded multi-model gene…

    Submitted 20 June, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

  34. arXiv:1910.08375  [pdf, other]

    eess.IV cs.CV

    Detecting intracranial aneurysm rupture from 3D surfaces using a novel GraphNet approach

    Authors: Z. Ma, L. Song, X. Feng, G. Yang, W. Zhu, J. Liu, Y. Zhang, X. Yang, Y. Yin

    Abstract: Intracranial aneurysm (IA) is a life-threatening blood spot in human's brain if it ruptures and causes cerebral hemorrhage. It is challenging to detect whether an IA has ruptured from medical images. In this paper, we propose a novel graph based neural network named GraphNet to detect IA rupture from 3D surface data. GraphNet is based on graph convolution network (GCN) and is designed for graph-le…

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: Submitted to ISBI 2020

  35. arXiv:1907.01607  [pdf, other]

    eess.AS cs.LG cs.MM cs.SD

    MIDI-Sandwich: Multi-model Multi-task Hierarchical Conditional VAE-GAN networks for Symbolic Single-track Music Generation

    Authors: Xia Liang, Junmin Wu, Yan Yin

    Abstract: Most existing neural network models for music generation explore how to generate music bars, then directly splice the music bars into a song. However, these methods do not explore the relationship between the bars, and the connected song as a whole has no musical form structure and sense of musical direction. To address this issue, we propose a Multi-model Multi-task Hierarchical Conditional VAE-G…

    Submitted 4 July, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: cast KSEM2019 on May 3, 2019 (weak rejected)

  36. arXiv:1907.01367  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Lipper: Synthesizing Thy Speech using Multi-View Lipreading

    Authors: Yaman Kumar, Rohit Jain, Khwaja Mohd. Salik, Rajiv Ratn Shah, Yifang Yin, Roger Zimmermann

    Abstract: Lipreading has a lot of potential applications such as in the domain of surveillance and video conferencing. Despite this, most of the work in building lipreading systems has been limited to classifying silent videos into classes representing text phrases. However, there are multiple problems associated with making lipreading a text-based classification task like its dependence on a particular lan…

    Submitted 28 June, 2019; originally announced July 2019.

    Comments: Accepted at AAAI 2019

  37. arXiv:1810.13088  [pdf]

    cs.CL cs.LG cs.SD eess.AS

    Attention-based sequence-to-sequence model for speech recognition: development of state-of-the-art system on LibriSpeech and its application to non-native English

    Authors: Yan Yin, Ramon Prieto, Bin Wang, Jianwei Zhou, Yiwei Gu, Yang Liu, Hui Lin

    Abstract: Recent research has shown that attention-based sequence-to-sequence models such as Listen, Attend, and Spell (LAS) yield comparable results to state-of-the-art ASR systems on various tasks. In this paper, we describe the development of such a system and demonstrate its performance on two tasks: first we achieve a new state-of-the-art word error rate of 3.43% on the test clean subset of LibriSpeech…

    Submitted 5 November, 2018; v1 submitted 30 October, 2018; originally announced October 2018.

  38. arXiv:1804.06586  [pdf, other]

    eess.SY cs.RO

    Composite Adaptive Control for Bilateral Teleoperation Systems without Persistency of Excitation

    Authors: Yuling Li, Yixin Yin, Sen Zhang, Jie Dong, Rolf Johansson

    Abstract: Composite adaptive control schemes, which use both the system tracking errors and the prediction error to drive the update laws, have become widespread in achieving an improvement of system performance. However, a strong persistent-excitation (PE) condition should be satisfied to guarantee the parameter convergence. This paper proposes a novel composite adaptive control to guarantee parameter conv…

    Submitted 18 April, 2018; originally announced April 2018.

    Comments: 21 pages, 9 figures, submitted to Journal of The Franklin Institute

  39. arXiv:1804.04290  [pdf, other]

    eess.SY cs.HC math.OC

    Bilateral Teleoperation of Multiple Robots under Scheduling Communication

    Authors: Yuling Li, Kun Liu, Wei He, Yixin Yin, Rolf Johansson, Kai Zhang

    Abstract: In this paper, bilateral teleoperation of multiple slaves coupled to a single master under scheduling communication is investigated. The sampled-data transmission between the master and the multiple slaves is fulfilled over a delayed communication network, and at each sampling instant, only one slave is allowed to transmit its current information to the master side according to some scheduling pro…

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: 13 pages, 12 figures, 4 tables, submitted to IEEE Transactions on Control Systems Technology