Showing 1–48 of 48 results for author: Yin, Z

Searching in archive eess.
  1. arXiv:2406.07857  [pdf, other]

    eess.SY cs.LG cs.NI

    Toward Enhanced Reinforcement Learning-Based Resource Management via Digital Twin: Opportunities, Applications, and Challenges

    Authors: Nan Cheng, Xiucheng Wang, Zan Li, Zhisheng Yin, Tom Luan, Xuemin Shen

    Abstract: This article presents a digital twin (DT)-enhanced reinforcement learning (RL) framework aimed at optimizing performance and reliability in network resource management, since the traditional RL methods face several unified challenges when applied to physical networks, including limited exploration efficiency, slow convergence, poor long-term performance, and safety concerns during the exploration… ▽ More

    Submitted 15 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 7 pages, 6 figures

  2. arXiv:2406.03875  [pdf, other]

    eess.SY

    Energy-storing analysis and fishtail stiffness optimization for a wire-driven elastic robotic fish

    Authors: Xiaocun Liao, Chao Zhou, Junfeng Fan, Zhuoliang Zhang, Zhaoran Yin, Liangwei Deng

    Abstract: The robotic fish with high propulsion efficiency and good maneuverability achieves underwater fishlike propulsion by commonly adopting the motor to drive the fishtail, causing the significant fluctuations of the motor power due to the uneven swing speed of the fishtail in one swing cycle. Hence, we propose a wire-driven robotic fish with a spring-steel-based active-segment elastic spine. This bion… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 14 pages, 19 figures

  3. arXiv:2405.05498  [pdf, other]

    cs.SD eess.AS

    The RoyalFlush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge

    Authors: Jingguang Tian, Shuaishuai Ye, Shunfei Chen, Yang Xiang, Zhaohui Yin, Xinhui Hu, Xinkang Xu

    Abstract: This paper presents our system submission for the In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) Challenge, which focuses on speaker diarization and speech recognition in complex multi-speaker scenarios. To address these challenges, we develop end-to-end speaker diarization models that notably decrease the diarization error rate (DER) by 49.58% compared to the official baseline on t… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  4. arXiv:2405.02809  [pdf, other]

    eess.SY

    Does Optimal Control Always Benefit from Better Prediction? An Analysis Framework for Predictive Optimal Control

    Authors: Xiangrui Zeng, Cheng Yin, Zhouping Yin

    Abstract: The "prediction + optimal control" scheme has shown good performance in many applications of automotive, traffic, robot, and building control. In practice, the prediction results are simply considered correct in the optimal control design process. However, in reality, these predictions may never be perfect. Under a conventional stochastic optimal control formulation, it is difficult to answer qu… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  5. arXiv:2401.06230  [pdf, other]

    physics.geo-ph cs.AI cs.LG eess.SP stat.AP

    WISE: full-Waveform variational Inference via Subsurface Extensions

    Authors: Ziyi Yin, Rafael Orozco, Mathias Louboutin, Felix J. Herrmann

    Abstract: We introduce a probabilistic technique for full-waveform inversion, employing variational inference and conditional normalizing flows to quantify uncertainty in migration-velocity models and its impact on imaging. Our approach integrates generative artificial intelligence with physics-informed common-image gathers, reducing reliance on accurate initial velocity models. Considered case studies demo… ▽ More

    Submitted 10 December, 2023; originally announced January 2024.

  6. arXiv:2312.09620  [pdf, other]

    eess.AS

    A Deep Representation Learning-based Speech Enhancement Method Using Complex Convolution Recurrent Variational Autoencoder

    Authors: Yang Xiang, Jingguang Tian, Xinhui Hu, Xinkang Xu, ZhaoHui Yin

    Abstract: Generally, the performance of deep neural networks (DNNs) heavily depends on the quality of data representation learning. Our preliminary work has emphasized the significance of deep representation learning (DRL) in the context of speech enhancement (SE) applications. Specifically, our initial SE algorithm employed a gated recurrent unit variational autoencoder (VAE) with a Gaussian distribution t… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  7. arXiv:2310.16302  [pdf, other]

    cs.LG eess.SY

    Imperfect Digital Twin Assisted Low Cost Reinforcement Training for Multi-UAV Networks

    Authors: Xiucheng Wang, Nan Cheng, Longfei Ma, Zhisheng Yin, Tom. Luan, Ning Lu

    Abstract: Deep Reinforcement Learning (DRL) is widely used to optimize the performance of multi-UAV networks. However, the training of DRL relies on the frequent interactions between the UAVs and the environment, which consumes lots of energy due to the flying and communication of UAVs in practical experiments. Inspired by the growing digital twin (DT) technology, which can simulate the performance of algor… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  8. arXiv:2310.03749  [pdf]

    eess.SP cs.AI cs.LG

    SCVCNet: Sliding cross-vector convolution network for cross-task and inter-individual-set EEG-based cognitive workload recognition

    Authors: Qi Wang, Li Chen, Zhiyuan Zhan, Jianhua Zhang, Zhong Yin

    Abstract: This paper presents a generic approach for applying the cognitive workload recognizer by exploiting common electroencephalogram (EEG) patterns across different human-machine tasks and individual sets. We propose a neural network called SCVCNet, which eliminates task- and individual-set-related interferences in EEGs by analyzing finer-grained frequency structures in the power spectral densities. Th… ▽ More

    Submitted 21 September, 2023; originally announced October 2023.

    Comments: 12 pages

  9. arXiv:2308.14348  [pdf, other]

    eess.SY cs.CR cs.LG

    Label-free Deep Learning Driven Secure Access Selection in Space-Air-Ground Integrated Networks

    Authors: Zhaowei Wang, Zhisheng Yin, Xiucheng Wang, Nan Cheng, Yuan Zhang, Tom H. Luan

    Abstract: In Space-air-ground integrated networks (SAGIN), the inherent openness and extensive broadcast coverage expose these networks to significant eavesdropping threats. Considering the inherent co-channel interference due to spectrum sharing among multi-tier access networks in SAGIN, it can be leveraged to assist the physical layer security among heterogeneous transmissions. However, it is challenging… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  10. arXiv:2308.07511  [pdf, other]

    cs.LG eess.SY

    Distilling Knowledge from Resource Management Algorithms to Neural Networks: A Unified Training Assistance Approach

    Authors: Longfei Ma, Nan Cheng, Xiucheng Wang, Zhisheng Yin, Haibo Zhou, Wei Quan

    Abstract: As a fundamental problem, numerous methods are dedicated to the optimization of signal-to-interference-plus-noise ratio (SINR), in a multi-user setting. Although traditional model-based optimization methods achieve strong performance, the high complexity raises the research of neural network (NN) based approaches to trade-off the performance and complexity. To fully leverage the high performance o… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  11. arXiv:2308.05987  [pdf, other]

    cs.SD eess.AS

    Large-Scale Learning on Overlapped Speech Detection: New Benchmark and New General System

    Authors: Zhaohui Yin, Jingguang Tian, Xinhui Hu, Xinkang Xu, Yang Xiang

    Abstract: Overlapped Speech Detection (OSD) is an important part of speech applications involving analysis of multi-party conversations. However, most of existing OSD systems are trained and evaluated on small datasets with limited application domains, which led to the robustness of them lacks benchmark for evaluation and the accuracy of them remains inadequate in realistic acoustic environments. To solve t… ▽ More

    Submitted 7 September, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

  12. arXiv:2307.13945  [pdf, ps, other]

    eess.SY cs.AI

    Learning-based Control for PMSM Using Distributed Gaussian Processes with Optimal Aggregation Strategy

    Authors: Zhenxiao Yin, Xiaobing Dai, Zewen Yang, Yang Shen, Georges Hattab, Hang Zhao

    Abstract: The growing demand for accurate control in varying and unknown environments has sparked a corresponding increase in the requirements for power supply components, including permanent magnet synchronous motors (PMSMs). To infer the unknown part of the system, machine learning techniques are widely employed, especially Gaussian process regression (GPR) due to its flexibility of continuous system mode… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  13. arXiv:2307.02002  [pdf, other]

    eess.SY

    Interpretable and Secure Trajectory Optimization for UAV-Assisted Communication

    Authors: Yunhao Quan, Nan Cheng, Xiucheng Wang, Jinglong Shen, Longfei Ma, Zhisheng Yin

    Abstract: Unmanned aerial vehicles (UAVs) have gained popularity due to their flexible mobility, on-demand deployment, and the ability to establish high probability line-of-sight wireless communication. As a result, UAVs have been extensively used as aerial base stations (ABSs) to supplement ground-based cellular networks for various applications. However, existing UAV-assisted communication schemes mainly… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  14. arXiv:2306.06144  [pdf, other]

    eess.SP cs.LG stat.AP

    Bayesian Calibration of MEMS Accelerometers

    Authors: Oliver Dürr, Po-Yu Fan, Zong-Xian Yin

    Abstract: This study aims to investigate the utilization of Bayesian techniques for the calibration of micro-electro-mechanical systems (MEMS) accelerometers. These devices have garnered substantial interest in various practical applications and typically require calibration through error-correcting functions. The parameters of these error-correcting functions are determined during a calibration process. Ho… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted in IEEE Sensors

  15. arXiv:2305.15719  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Efficient Neural Music Generation

    Authors: Max W. Y. Lam, Qiao Tian, Tang Li, Zongyu Yin, Siyuan Feng, Ming Tu, Yuliang Ji, Rui Xia, Mingbo Ma, Xuchen Song, Jitong Chen, Yuping Wang, Yuxuan Wang

    Abstract: Recent progress in music generation has been remarkably advanced by the state-of-the-art MusicLM, which comprises a hierarchy of three LMs, respectively, for semantic, coarse acoustic, and fine acoustic modelings. Yet, sampling with the MusicLM requires processing through these LMs one by one to obtain the fine-grained acoustic tokens, making it computationally expensive and prohibitive for a real… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  16. arXiv:2305.07662  [pdf, other]

    cs.IT cs.LG eess.SP

    Self-information Domain-based Neural CSI Compression with Feature Coupling

    Authors: Ziqing Yin, Renjie Xie, Wei Xu, Zhaohui Yang, Xiaohu You

    Abstract: Deep learning (DL)-based channel state information (CSI) feedback methods compressed the CSI matrix by exploiting its delay and angle features straightforwardly, while the measure in terms of information contained in the CSI matrix has rarely been considered. Based on this observation, we introduce self-information as an informative CSI representation from the perspective of information theory, wh… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  17. Technology-Circuit-Algorithm Tri-Design for Processing-in-Pixel-in-Memory (P2M)

    Authors: Md Abdullah-Al Kaiser, Gourav Datta, Sreetama Sarkar, Souvik Kundu, Zihan Yin, Manas Garg, Ajey P. Jacob, Peter A. Beerel, Akhilesh R. Jaiswal

    Abstract: The massive amounts of data generated by camera sensors motivate data processing inside pixel arrays, i.e., at the extreme-edge. Several critical developments have fueled recent interest in the processing-in-pixel-in-memory paradigm for a wide range of visual machine intelligence tasks, including (1) advances in 3D integration technology to enable complex processing inside each pixel in a 3D integ… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Journal ref: GLSVLSI '23: Great Lakes Symposium on VLSI 2023 Proceedings

  18. arXiv:2303.14095  [pdf, other]

    cs.CV cs.RO eess.IV

    PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View

    Authors: Ze Shi, Hao Shi, Kailun Yang, Zhe Yin, Yining Lin, Kaiwei Wang

    Abstract: Visual place recognition has gained significant attention in recent years as a crucial technology in autonomous driving and robotics. Currently, the two main approaches are the perspective view retrieval (P2P) paradigm and the equirectangular image retrieval (E2E) paradigm. However, it is practical and natural to assume that users only have consumer-grade pinhole cameras to obtain query perspectiv… ▽ More

    Submitted 28 July, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Accepted to ITSC 2023. Code and datasets will be made available at https://github.com/zafirshi/PanoVPR

  19. arXiv:2302.14751  [pdf]

    eess.SP physics.optics

    High speed free-space optical communication using standard fiber communication component without optical amplification

    Authors: Yao Zhang, Hua-Ying Liu, Xiaoyi Liu, Peng Xu, Xiang Dong, Pengfei Fan, Xiaohui Tian, Hua Yu, Dong Pan, Zhijun Yin, Guilu Long, Shi-Ning Zhu, Zhenda Xie

    Abstract: Free-space optical communication (FSO) can achieve fast, secure and license-free communication without need for physical cables, making it a cost-effective, energy-efficient and flexible solution when the fiber connection is unavailable. To establish FSO connection on-demand, it is essential to build portable FSO devices with compact structure and light weight. Here, we develop a miniaturized FSO… ▽ More

    Submitted 16 April, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 7 pages, 5 figures

  20. arXiv:2212.04248  [pdf, other]

    cs.GR cs.CV cs.SD eess.AS

    Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors

    Authors: Zhentao Yu, Zixin Yin, Deyu Zhou, Duomin Wang, Finn Wong, Baoyuan Wang

    Abstract: In this paper, we introduce a simple and novel framework for one-shot audio-driven talking head generation. Unlike prior works that require additional driving sources for controlled synthesis in a deterministic manner, we instead probabilistically sample all the holistic lip-irrelevant facial motions (i.e. pose, expression, blink, gaze, etc.) to semantically match the input audio while still maint… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 16 pages

  21. arXiv:2212.02918  [pdf, other]

    cs.DC cs.HC eess.SP

    Thermal Dissipation Resulting from Everyday Interactions as a Sensing Modality -- The MIDAS Touch

    Authors: Farooq Dar, Hilary Emenike, Zhigang Yin, Mohan Liyanage, Rajesh Sharma, Agustin Zuniga, Mohammad A. Hoque, Marko Radeta, Petteri Nurmi, Huber Flores

    Abstract: We contribute MIDAS as a novel sensing solution for characterizing everyday objects using thermal dissipation. MIDAS takes advantage of the fact that anytime a person touches an object it results in heat transfer. By capturing and modeling the dissipation of the transferred heat, e.g., through the decrease in the captured thermal radiation, MIDAS can characterize the object and determine its mater… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Journal ref: Pervasive and Mobile Computing, Volume 84, 2022

  22. arXiv:2211.16791  [pdf, other]

    cs.CV cs.LG eess.IV

    Adaptive adversarial training method for improving multi-scale GAN based on generalization bound theory

    Authors: Jing Tang, Bo Tao, Zeyu Gong, Zhouping Yin

    Abstract: In recent years, multi-scale generative adversarial networks (GANs) have been proposed to build generalized image processing models based on single sample. Constraining on the sample size, multi-scale GANs have much difficulty converging to the global optimum, which ultimately leads to limitations in their capabilities. In this paper, we pioneered the introduction of PAC-Bayes generalized bound th… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

  23. arXiv:2211.10661  [pdf, other]

    cs.SD cs.CR eess.AS

    Phonemic Adversarial Attack against Audio Recognition in Real World

    Authors: Jiakai Wang, Zhendong Chen, Zixin Yin, Qinghong Yang, Xianglong Liu

    Abstract: Recently, adversarial attacks for audio recognition have attracted much attention. However, most of the existing studies mainly rely on the coarse-grain audio features at the instance level to generate adversarial noises, which leads to expensive generation time costs and weak universal attacking ability. Motivated by the observations that all audio speech consists of fundamental phonemes, this pa… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

  24. arXiv:2211.08880  [pdf]

    eess.SP

    Temporal-spatial Representation Learning Transformer for EEG-based Emotion Recognition

    Authors: Zhe Wang, Yongxiong Wang, Chuanfei Hu, Zhong Yin, Yu Song

    Abstract: Both the temporal dynamics and spatial correlations of Electroencephalogram (EEG), which contain discriminative emotion information, are essential for the emotion recognition. However, some redundant information within the EEG signals would degrade the performance. Specifically, the subjects reach prospective intense emotions for only a fraction of the stimulus duration. Besides, it is a challenge… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

  25. arXiv:2211.03527  [pdf, other]

    physics.geo-ph cs.CV eess.IV math.NA

    De-risking geological carbon storage from high resolution time-lapse seismic to explainable leakage detection

    Authors: Ziyi Yin, Huseyin Tuna Erdinc, Abhinav Prakash Gahlot, Mathias Louboutin, Felix J. Herrmann

    Abstract: Geological carbon storage represents one of the few truly scalable technologies capable of reducing the CO2 concentration in the atmosphere. While this technology has the potential to scale, its success hinges on our ability to mitigate its risks. An important aspect of risk mitigation concerns assurances that the injected CO2 remains within the storage complex. Amongst the different monitoring mo… ▽ More

    Submitted 7 October, 2022; originally announced November 2022.

  26. arXiv:2210.11153  [pdf, other]

    eess.IV cs.CV

    Reversed Image Signal Processing and RAW Reconstruction. AIM 2022 Challenge Report

    Authors: Marcos V. Conde, Radu Timofte, Yibin Huang, Jingyang Peng, Chang Chen, Cheng Li, Eduardo Pérez-Pellitero, Fenglong Song, Furui Bai, Shuai Liu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Yu Zhu, Chenghua Li, Yingying Jiang, Yong A, Peisong Wang, Cong Leng, Jian Cheng, Xiaoyu Liu, Zhicun Yin, Zhilu Zhang, Junyi Li, Ming Liu, et al. (18 additional authors not shown)

    Abstract: Cameras capture sensor RAW images and transform them into pleasant RGB images, suitable for the human eyes, using their integrated Image Signal Processor (ISP). Numerous low-level vision tasks operate in the RAW domain (e.g. image denoising, white balance) due to its linear relationship with the scene irradiance, wide-range of information at 12bits, and sensor designs. Despite this, RAW image data… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: ECCV 2022 Advances in Image Manipulation (AIM) workshop

  27. arXiv:2210.05451  [pdf, other]

    cs.CV eess.IV

    Enabling ISP-less Low-Power Computer Vision

    Authors: Gourav Datta, Zeyu Liu, Zihan Yin, Linyu Sun, Akhilesh R. Jaiswal, Peter A. Beerel

    Abstract: In order to deploy current computer vision (CV) models on resource-constrained low-power devices, recent works have proposed in-sensor and in-pixel computing approaches that try to partly/fully bypass the image signal processor (ISP) and yield significant bandwidth reduction between the image sensor and the CV processing unit by downsampling the activation maps in the initial convolutional neural… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  28. arXiv:2208.12707  [pdf, other]

    eess.IV

    IRIS: Integrated Retinal Functionality in Image Sensors

    Authors: Zihan Yin, Md Abdullah-Al Kaiser, Lamine Ousmane Camara, Mark Camarena, Maryam Parsa, Ajey Jacob, Gregory Schwartz, Akhilesh Jaiswal

    Abstract: Neuromorphic image sensors draw inspiration from the biological retina to implement visual computations in electronic hardware. Gain control in phototransduction and temporal differentiation at the first retinal synapse inspired the first generation of neuromorphic sensors, but processing in downstream retinal circuits, much of which has been discovered in the past decade, has not been implemented… ▽ More

    Submitted 14 August, 2022; originally announced August 2022.

    Comments: 18 pages

  29. arXiv:2208.11087  [pdf]

    eess.SP cs.AI cs.CV cs.HC cs.LG

    Locally temporal-spatial pattern learning with graph attention mechanism for EEG-based emotion recognition

    Authors: Yiwen Zhu, Kaiyu Gan, Zhong Yin

    Abstract: Technique of emotion recognition enables computers to classify human affective states into discrete categories. However, the emotion may fluctuate instead of maintaining a stable state even within a short time interval. There is also a difficulty to take the full use of the EEG spatial distribution due to its 3-D topology structure. To tackle the above issues, we proposed a locally temporal-spatia… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

  30. arXiv:2208.01781  [pdf, other]

    cs.LG cs.AI eess.SY

    Digital Twin-Assisted Efficient Reinforcement Learning for Edge Task Scheduling

    Authors: Xiucheng Wang, Longfei Ma, Haocheng Li, Zhisheng Yin, Tom. Luan, Nan Cheng

    Abstract: Task scheduling is a critical problem when one user offloads multiple different tasks to the edge server. When a user has multiple tasks to offload and only one task can be transmitted to server at a time, while server processes tasks according to the transmission order, the problem is NP-hard. However, it is difficult for traditional optimization methods to quickly obtain the optimal solution, wh… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  31. arXiv:2206.02407  [pdf, other]

    cs.IT eess.SP

    Green Interference Based Symbiotic Security in Integrated Satellite-terrestrial Communications

    Authors: Zhisheng Yin, Nan Cheng, Tom H. Luan, Yilong Hui, Wei Wang

    Abstract: In this paper, we investigate secure transmissions in integrated satellite-terrestrial communications and the green interference based symbiotic security scheme is proposed. Particularly, the co-channel interference induced by the spectrum sharing between satellite and terrestrial networks and the inter-beam interference due to frequency reuse among satellite multi-beam serve as the green interfer… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  32. arXiv:2205.14285  [pdf, other]

    eess.IV

    P2M-DeTrack: Processing-in-Pixel-in-Memory for Energy-efficient and Real-Time Multi-Object Detection and Tracking

    Authors: Gourav Datta, Souvik Kundu, Zihan Yin, Joe Mathai, Zeyu Liu, Zixu Wang, Mulin Tian, Shunlin Lu, Ravi T. Lakkireddy, Andrew Schmidt, Wael Abd-Almageed, Ajey P. Jacob, Akhilesh R. Jaiswal, Peter A. Beerel

    Abstract: Today's high resolution, high frame rate cameras in autonomous vehicles generate a large volume of data that needs to be transferred and processed by a downstream processor or machine learning (ML) accelerator to enable intelligent computing tasks, such as multi-object detection and tracking. The massive amount of data transfer incurs significant energy, latency, and bandwidth bottlenecks, which h… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 6 pages, 4 figures, 4 tables

  33. arXiv:2204.11567  [pdf, other]

    cs.IT eess.SP

    Deep CSI Compression for Massive MIMO: A Self-information Model-driven Neural Network

    Authors: Ziqing Yin, Wei Xu, Renjie Xie, Shaoqing Zhang, Derrick Wing Kwan Ng, Xiaohu You

    Abstract: In order to fully exploit the advantages of massive multiple-input multiple-output (mMIMO), it is critical for the transmitter to accurately acquire the channel state information (CSI). Deep learning (DL)-based methods have been proposed for CSI compression and feedback to the transmitter. Although most existing DL-based methods consider the CSI matrix as an image, structural features of the CSI i… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  34. arXiv:2203.05696  [pdf, other]

    eess.IV cs.CV

    Toward Efficient Hyperspectral Image Processing inside Camera Pixels

    Authors: Gourav Datta, Zihan Yin, Ajey Jacob, Akhilesh R. Jaiswal, Peter A. Beerel

    Abstract: Hyperspectral cameras generate a large amount of data due to the presence of hundreds of spectral bands as opposed to only three channels (red, green, and blue) in traditional cameras. This requires a significant amount of data transmission between the hyperspectral image sensor and a processor used to classify/detect/track the images, frame by frame, expending high energy and causing bandwidth an… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: 6 pages, 3 figures

  35. arXiv:2202.13388  [pdf, other]

    cs.CV cs.RO eess.IV

    PanoFlow: Learning 360° Optical Flow for Surrounding Temporal Understanding

    Authors: Hao Shi, Yifan Zhou, Kailun Yang, Xiaoting Yin, Ze Wang, Yaozu Ye, Zhe Yin, Shi Meng, Peng Li, Kaiwei Wang

    Abstract: Optical flow estimation is a basic task in self-driving and robotics systems, which enables to temporally interpret traffic scenes. Autonomous vehicles clearly benefit from the ultra-wide Field of View (FoV) offered by 360° panoramic sensors. However, due to the unique imaging process of panoramic cameras, models designed for pinhole images do not directly generalize satisfactorily to 360° panoram… ▽ More

    Submitted 29 November, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: Code and dataset are publicly available at https://github.com/MasterHow/PanoFlow

  36. arXiv:2109.03488  [pdf, ps, other]

    cs.NI eess.SP

    Partial Symbol Recovery for Interference Resilience in Low-Power Wide Area Networks

    Authors: Kai Sun, Zhimeng Yin, Weiwei Chen, Shuai Wang, Zeyu Zhang, Tian He

    Abstract: Recent years have witnessed the proliferation of Low-power Wide Area Networks (LPWANs) in the unlicensed band for various Internet-of-Things (IoT) applications. Due to the ultra-low transmission power and long transmission duration, LPWAN devices inevitably suffer from high power Cross Technology Interference (CTI), such as interference from Wi-Fi, coexisting in the same spectrum. To alleviate thi… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

  37. arXiv:2108.00129  [pdf]

    eess.IV

    Point-wise posteriori phase estimation in high-precision fringe projection profilometry

    Authors: Cong Liu, Chuang Zhang, Zhuoyi Yin, Xiaopeng Liu, Zhihong Xu

    Abstract: In fringe projection profilometry, the high-order harmonics information of non-sinusoidal fringes will lead to errors in the phase estimation. In order to solve this problem, a point-wise posterior phase estimation (PWPPE) method based on deep learning technique is proposed in this paper. The complex nonlinear mapping relationship between the multiple gray values and the sine / cosine value of the… ▽ More

    Submitted 30 July, 2021; originally announced August 2021.

  38. arXiv:2104.08337  [pdf]

    cs.AI cs.LG eess.SP

    Identification of mental fatigue in language comprehension tasks based on EEG and deep learning

    Authors: Chunhua Ye, Zhong Yin, Chenxi Wu, Xiayidai Abulaiti, Yixing Zhang, Zhenqi Sun, Jianhua Zhang

    Abstract: Mental fatigue increases the risk of operator error in language comprehension tasks. In order to prevent operator performance degradation, we used EEG signals to assess the mental fatigue of operators in human-computer systems. This study presents an experimental design for fatigue detection in language comprehension tasks. We obtained EEG signals from a 14-channel wireless EEG detector in 15 heal… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

  39. arXiv:2103.06725  [pdf, other]

    eess.IV cs.CV

    Duplex Contextual Relation Network for Polyp Segmentation

    Authors: Zijin Yin, Kongming Liang, Zhanyu Ma, Jun Guo

    Abstract: Polyp segmentation is of great importance in the early diagnosis and treatment of colorectal cancer. Since polyps vary in their shape, size, color, and texture, accurate polyp segmentation is very challenging. One promising way to mitigate the diversity of polyps is to model the contextual relation for each pixel such as using attention mechanism. However, previous methods only focus on learning t… ▽ More

    Submitted 19 January, 2022; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted to ISBI2022

  40. arXiv:2012.04701  [pdf, other]

    eess.IV cs.CV

    3D Graph Anatomy Geometry-Integrated Network for Pancreatic Mass Segmentation, Diagnosis, and Quantitative Patient Management

    Authors: Tianyi Zhao, Kai Cao, Jiawen Yao, Isabella Nogues, Le Lu, Lingyun Huang, Jing Xiao, Zhaozheng Yin, Ling Zhang

    Abstract: The pancreatic disease taxonomy includes ten types of masses (tumors or cysts)[20,8]. Previous work focuses on developing segmentation or classification methods only for certain mass types. Differential diagnosis of all mass types is clinically highly desirable [20] but has not been investigated using an automated image understanding approach. We exploit the feasibility to distinguish pancreatic d… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  41. arXiv:2009.12525  [pdf]

    cs.LG eess.SP

    Cross-individual Recognition of Emotions by a Dynamic Entropy based on Pattern Learning with EEG features

    Authors: Xiaolong Zhong, Zhong Yin

    Abstract: Use of the electroencephalogram (EEG) and machine learning approaches to recognize emotions can facilitate affective human computer interactions. However, the type of EEG data constitutes an obstacle for cross-individual EEG feature modelling and classification. To address this issue, we propose a deep-learning framework denoted as a dynamic entropy-based pattern learning (DEPL) to abstract inform… ▽ More

    Submitted 25 May, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

  42. arXiv:2006.10358  [pdf]

    eess.IV cs.LG

    Cloud detection in Landsat-8 imagery in Google Earth Engine based on a deep neural network

    Authors: Zhixiang Yin, Feng Ling, Giles M. Foody, Xinyan Li, Yun Du

    Abstract: Google Earth Engine (GEE) provides a convenient platform for applications based on optical satellite imagery of large areas. With such data sets, the detection of cloud is often a necessary prerequisite step. Recently, deep learning-based cloud detection methods have shown their potential for cloud detection but they can only be applied locally, leading to inefficient data downloading time and sto… ▽ More

    Submitted 1 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

  43. arXiv:2004.07389  [pdf, other

    physics.geo-ph cs.CE eess.IV

    Extended source imaging, a unifying framework for seismic & medical imaging

    Authors: Ziyi Yin, Rafael Orozco, Philipp Witte, Mathias Louboutin, Gabrio Rizzuti, Felix J. Herrmann

    Abstract: We present three imaging modalities that live on the crossroads of seismic and medical imaging. Through the lens of extended source imaging, we can draw deep connections among the fields of wave-equation based seismic and medical imaging, despite first appearances. From the seismic perspective, we underline the importance to work with the correct physics and spatially varying velocity fields. Medi…

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: Submitted to the Society of Exploration Geophysicists Annual Meeting 2020

  44. arXiv:1911.09275  [pdf, other

    eess.SP cs.LG

    A Machine Learning-enhanced Robust P-Phase Picker for Real-time Seismic Monitoring

    Authors: Dazhong Shen, Qi Zhang, Tong Xu, Hengshu Zhu, Wenjia Zhao, Zikai Yin, Peilun Zhou, Lihua Fang, Enhong Chen, Hui Xiong

    Abstract: Identifying the arrival times of seismic P-phases plays a significant role in real-time seismic monitoring, which provides critical guidance for emergency response activities. While considerable research has been conducted on this topic, efficiently capturing the arrival times of seismic P-phases hidden within intensively distributed and noisy seismic waves, such as those generated by the aftersho…

    Submitted 20 August, 2020; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: Note that this paper is the English version of our work published in SCIENTIA SINICA Informationis (http://engine.scichina.com/doi/10.1360/SSI-2020-0214), which is suggested to be cited if needed

  45. arXiv:1911.02360  [pdf, other

    eess.IV cs.CV cs.MM

    Reversible Adversarial Attack based on Reversible Image Transformation

    Authors: Zhaoxia Yin, Hua Wang, Li Chen, Jie Wang, Weiming Zhang

    Abstract: In order to prevent illegal or unauthorized access of image data such as human faces and ensure legitimate users can use authorization-protected data, reversible adversarial attack technique is rise. Reversible adversarial examples (RAE) get both attack capability and reversibility at the same time. However, the existing technique can not meet application requirements because of serious distortion…

    Submitted 25 May, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: 2021 International Workshop on Safety & Security of Deep Learning

  46. arXiv:1909.09316  [pdf

    physics.ao-ph eess.IV physics.data-an

    Spatially Continuous and High-resolution Land Surface Temperature: A Review of Reconstruction and Spatiotemporal Fusion Techniques

    Authors: Penghai Wu, Zhixiang Yin, Chao Zeng, Sibo Duan, Frank-Michael Gottsche, Xiaoshaung Ma, Xinghua Li, Hui Yang, Huanfeng Shen

    Abstract: Remotely sensed, spatially continuous and high spatiotemporal resolution (hereafter referred to as high resolution) land surface temperature (LST) is a key parameter for studying the thermal environment and has important applications in many fields. However, difficult atmospheric conditions, sensor malfunctioning and scanning gaps between orbits frequently introduce spatial discontinuities into sa…

    Submitted 20 September, 2019; originally announced September 2019.

    Comments: 41 pages, 7 figures, 2 tables

  47. arXiv:1908.07519  [pdf, other

    cs.CV cs.HC cs.LG eess.IV eess.SP

    Multi-Modal Recognition of Worker Activity for Human-Centered Intelligent Manufacturing

    Authors: Wenjin Tao, Ming C. Leu, Zhaozheng Yin

    Abstract: In a human-centered intelligent manufacturing system, sensing and understanding of the worker's activity are the primary tasks. In this paper, we propose a novel multi-modal approach for worker activity recognition by leveraging information from different sensors and in different modalities. Specifically, a smart armband and a visual camera are applied to capture Inertial Measurement Unit (IMU) si…

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: 17 pages, 8 figures, 6 tables

  48. arXiv:1905.08967   

    cs.MM eess.IV

    Multiple reconstruction compression framework based on PNG image

    Authors: Zhiqing Lu, Zhaoxia Yin, Bin Luo

    Abstract: It is shown that neural networks (NNs) achieve excellent performances in image compression and reconstruction. However, there are still many shortcomings in the practical application, which eventually lead to the loss of neural network image processing ability. Based on this, this paper proposes a joint framework based on neural network and zoom compression. The framework first encodes the incomin…

    Submitted 14 November, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: The experimental results cannot reproduced