
Showing 1–50 of 103 results for author: Nie

  1. arXiv:2406.14953  [pdf, other]

    cs.CV cs.AI cs.LG eess.SP

    Deep Imbalanced Regression to Estimate Vascular Age from PPG Data: a Novel Digital Biomarker for Cardiovascular Health

    Authors: Guangkun Nie, Qinghao Zhao, Gongzheng Tang, Jun Li, Shenda Hong

    Abstract: Photoplethysmography (PPG) is emerging as a crucial tool for monitoring human hemodynamics, with recent studies highlighting its potential in assessing vascular aging through deep learning. However, real-world age distributions are often imbalanced, posing significant challenges for deep learning models. In this paper, we introduce a novel, simple, and effective loss function named the Dist Loss t…

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.11296  [pdf]

    eess.SY

    Energy efficiency analysis of ammonia-fueled power systems for vehicles considering residual heat recovery

    Authors: Zexin Nie, Yi Huang, Guangyu Tian

    Abstract: Ammonia, known as a good hydrogen carrier, shows great potential for use as a zero-carbon fuel for vehicles. However, both the internal combustion engine (ICE) and the proton exchange membrane fuel cell (PEMFC), the currently available engines used by the vehicle, require hydrogen decomposed from ammonia. On-board hydrogen production is an energy-intensive process that significantly reduces system…

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2404.09729  [pdf]

    eess.SP cs.IT cs.LG stat.ME

    Amplitude-Phase Fusion for Enhanced Electrocardiogram Morphological Analysis

    Authors: Shuaicong Hu, Yanan Wang, Jian Liu, Jingyu Lin, Shengmei Qin, Zhenning Nie, Zhifeng Yao, Wenjie Cai, Cuiwei Yang

    Abstract: Considering the variability of amplitude and phase patterns in electrocardiogram (ECG) signals due to cardiac activity and individual differences, existing entropy-based studies have not fully utilized these two patterns and lack integration. To address this gap, this paper proposes a novel fusion entropy metric, morphological ECG entropy (MEE) for the first time, specifically designed for ECG mor…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 16 pages, 12 figures

    ACM Class: I.5.2

  4. arXiv:2403.19983  [pdf, other]

    eess.IV cs.CV

    A multi-stage semi-supervised learning for ankle fracture classification on CT images

    Authors: Hongzhi Liu, Guicheng Li, Jiacheng Nie, Hui Tang, Chunfeng Yang, Qian** Feng, Hailin Xu, Yang Chen

    Abstract: Because of the complicated mechanism of ankle injury, it is very difficult to diagnose ankle fracture in clinic. In order to simplify the process of fracture diagnosis, an automatic diagnosis model of ankle fracture was proposed. Firstly, a tibia-fibula segmentation network is proposed for the joint tibiofibular region of the ankle joint, and the corresponding segmentation dataset is established o…

    Submitted 29 March, 2024; originally announced March 2024.

  5. arXiv:2403.12467  [pdf, other]

    eess.SP

    Digital Twin Channel for 6G: Concepts, Architectures and Potential Applications

    Authors: Heng Wang, Jianhua Zhang, Gaofeng Nie, Li Yu, Zhiqiang Yuan, Tongjie Li, Jialin Wang, Guangyi Liu

    Abstract: Digital twin channel (DTC) is the real-time mapping of a wireless channel from the physical world to the digital world, which is expected to provide significant performance enhancements for the sixth-generation (6G) air-interface design. In this work, we first define five evolution levels of channel twins with the progression of wireless communication. The fifth level, autonomous DTC, is elaborate…

    Submitted 31 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures, 15 references. It is submitted to IEEE journal

  6. arXiv:2403.03066  [pdf, ps, other]

    eess.SY

    Tracking-in-range Formulations for Numerical Optimal Control

    Authors: Nikilesh Ramesh, Eric C. Kerrigan, Yuanbo Nie

    Abstract: In contrast to set-point tracking which aims to reduce the tracking error between the tracker and the reference, tracking-in-range problems only focus on whether the tracker is within a given range around the reference, making it more suitable for the mission specifications of many practical applications. In this work, we present novel optimal control formulations to solve tracking-in-range proble…

    Submitted 5 March, 2024; originally announced March 2024.

  7. arXiv:2401.12783  [pdf, other]

    cs.AI cs.LG eess.SP

    A Review of Deep Learning Methods for Photoplethysmography Data

    Authors: Guangkun Nie, Jiabao Zhu, Gongzheng Tang, Deyun Zhang, Shijia Geng, Qinghao Zhao, Shenda Hong

    Abstract: Photoplethysmography (PPG) is a highly promising device due to its advantages in portability, user-friendly operation, and non-invasive capabilities to measure a wide range of physiological information. Recent advancements in deep learning have demonstrated remarkable outcomes by leveraging PPG signals for tasks related to personal health management and other multifaceted applications. In this rev…

    Submitted 23 January, 2024; originally announced January 2024.

  8. arXiv:2401.11449  [pdf, other]

    eess.SP cs.NI

    Energy Consumption Analysis for Continuous Phase Modulation in Smart-Grid Internet of Things of beyond 5G

    Authors: Hongjian Gao, Yang Lu, Shaoshi Yang, Jingsheng Tan, Longlong Nie, Xinyi Qu

    Abstract: Wireless sensor network (WSN) underpinning the smart-grid Internet of Things (SG-IoT) has been a popular research topic in recent years due to its great potential for enabling a wide range of important applications. However, the energy consumption (EC) characteristic of sensor nodes is a key factor that affects the operational performance (e.g., lifetime of sensors) and the total cost of ownership…

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 7 figures, 2 tables

    Journal ref: Sensors, vol. 24, no. 2, pp. 1-14, article number 533, Jan. 2024

  9. arXiv:2312.10287  [pdf, other]

    eess.SP

    Towards 6G Digital Twin Channel Using Radio Environment Knowledge Pool

    Authors: Jialin Wang, Jianhua Zhang, Yuxiang Zhang, Yutong Sun, Gaofeng Nie, Lianzheng Shi, ** Zhang, Guangyi Liu

    Abstract: The digital twin channel (DTC) is crucial for 6G wireless autonomous networks as it replicates the wireless channel fading states in 6G air interface transmissions. It is well known that the physical environment influences channels. A key task for accurately twinning channels in complex 6G scenarios is establishing precise relationships between the environment and the channels. In this article, th…

    Submitted 26 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  10. arXiv:2312.07934  [pdf, other]

    eess.IV cs.CV

    Toward Real World Stereo Image Super-Resolution via Hybrid Degradation Model and Discriminator for Implied Stereo Image Information

    Authors: Yuanbo Zhou, Yuyang Xue, Jiang Bi, Wenlin He, Xinlin Zhang, Jiajun Zhang, Wei Deng, Ruofeng Nie, Junlin Lan, Qinquan Gao, Tong Tong

    Abstract: Real-world stereo image super-resolution has a significant influence on enhancing the performance of computer vision systems. Although existing methods for single-image super-resolution can be applied to improve stereo images, these methods often introduce notable modifications to the inherent disparity, resulting in a loss in the consistency of disparity between the original and the enhanced ster…

    Submitted 13 December, 2023; originally announced December 2023.

  11. arXiv:2312.06462  [pdf, other]

    cs.CV cs.AI cs.SD eess.AS

    Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation

    Authors: Qi Yang, Xing Nie, Tong Li, Pengfei Gao, Ying Guo, Cheng Zhen, Pengfei Yan, Shiming Xiang

    Abstract: Recently, an audio-visual segmentation (AVS) task has been introduced, aiming to group pixels with sounding objects within a given video. This task necessitates a first-ever audio-driven pixel-level understanding of the scene, posing significant challenges. In this paper, we propose an innovative audio-visual transformer framework, termed COMBO, an acronym for COoperation of Multi-order Bilateral…

    Submitted 7 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 Highlight. 13 pages, 10 figures

  12. arXiv:2312.00308  [pdf, other]

    cs.CV eess.IV stat.AP

    A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing

    Authors: Longfeng Nie, Yuntian Chen, Mengge Du, Changqi Sun, Dongxiao Zhang

    Abstract: Cloud types, as a type of meteorological data, are of particular significance for evaluating changes in rainfall, heatwaves, water resources, floods and droughts, food security and vegetation cover, as well as land use. In order to effectively utilize high-resolution geostationary observations, a knowledge-based data-driven (KBDD) framework for all-day identification of cloud types based on spectr…

    Submitted 30 November, 2023; originally announced December 2023.

  13. Convergence Analysis and Latency Minimization for Semi-Federated Learning in Massive IoT Networks

    Authors: Jianyang Ren, Wanli Ni, Hui Tian, Gaofeng Nie

    Abstract: As the number of sensors becomes massive in Internet of Things (IoT) networks, the amount of data is humongous. To process data in real-time while protecting user privacy, federated learning (FL) has been regarded as an enabling technique to push edge intelligence into IoT networks with massive devices. However, FL latency increases dramatically due to the increase of the number of parameters in d…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted by IEEE Transactions on Green Communications and Networking

  14. arXiv:2309.05927  [pdf, other]

    cs.LG cs.AI eess.SP

    Frequency-Aware Masked Autoencoders for Multimodal Pretraining on Biosignals

    Authors: Ran Liu, Ellen L. Zippi, Hadi Pouransari, Chris Sandino, Jingping Nie, Hanlin Goh, Erdrin Azemi, Ali Moin

    Abstract: Leveraging multimodal information from biosignals is vital for building a comprehensive representation of people's physical and mental states. However, multimodal biosignals often exhibit substantial distributional shifts between pretraining and inference datasets, stemming from changes in task specification or variations in modality compositions. To achieve effective pretraining in the presence o…

    Submitted 18 April, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Extended version of ICLR 2024 Learning from Time Series for Health workshop

  15. arXiv:2308.09831  [pdf, other]

    eess.IV cs.CV

    Cross-modality Attention-based Multimodal Fusion for Non-small Cell Lung Cancer (NSCLC) Patient Survival Prediction

    Authors: Ruining Deng, Nazim Shaikh, Gareth Shannon, Yao Nie

    Abstract: Cancer prognosis and survival outcome predictions are crucial for therapeutic response estimation and for stratifying patients into various treatment groups. Medical domains concerned with cancer prognosis are abundant with multiple modalities, including pathological image data and non-image data such as genomic information. To date, multimodal learning has shown potential to enhance clinical pred…

    Submitted 27 February, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

  16. arXiv:2308.08179  [pdf]

    eess.SY

    A Robust Integrated Multi-Strategy Bus Control System via Deep Reinforcement Learning

    Authors: Qinghui Nie, Jishun Ou, Haiyang Zhang, Jiawei Lu, Shen Li, Haotian Shi

    Abstract: An efficient urban bus control system has the potential to significantly reduce travel delays and streamline the allocation of transportation resources, thereby offering enhanced and user-friendly transit services to passengers. However, bus operation efficiency can be impacted by bus bunching. This problem is notably exacerbated when the bus system operates along a signalized corridor with unpred…

    Submitted 16 August, 2023; originally announced August 2023.

  17. arXiv:2308.08172  [pdf, other]

    eess.IV cs.CV cs.LG

    AATCT-IDS: A Benchmark Abdominal Adipose Tissue CT Image Dataset for Image Denoising, Semantic Segmentation, and Radiomics Evaluation

    Authors: Zhiyu Ma, Chen Li, Tianming Du, Le Zhang, Dechao Tang, Deguo Ma, Shanchuan Huang, Yan Liu, Yihao Sun, Zhihao Chen, ** Yuan, Qianqing Nie, Marcin Grzegorzek, Hongzan Sun

    Abstract: Methods: In this study, a benchmark Abdominal Adipose Tissue CT Image Dataset (AATCT-IDS) containing 300 subjects is prepared and published. AATCT-IDS publishes 13,732 raw CT slices, and the researchers individually annotate the subcutaneous and visceral adipose tissue regions of 3,213 of those slices that have the same slice distance to validate denoising methods, train semantic segmentati…

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 17 pages, 7 figures

  18. arXiv:2308.03027  [pdf, other]

    cs.LG cs.CV eess.SP

    Causal Disentanglement Hidden Markov Model for Fault Diagnosis

    Authors: Rihao Chang, Yongtao Ma, Weizhi Nie, Jie Nie, An-an Liu

    Abstract: In modern industries, fault diagnosis has been widely applied with the goal of realizing predictive maintenance. The key issue for the fault diagnosis system is to extract representative characteristics of the fault signal and then accurately predict the fault type. In this paper, we propose a Causal Disentanglement Hidden Markov model (CDHM) to learn the causality in the bearing fault mechanism a…

    Submitted 6 August, 2023; originally announced August 2023.

  19. arXiv:2306.13843  [pdf, other]

    cs.CV eess.IV

    Score-based Generative Models for Photoacoustic Image Reconstruction with Rotation Consistency Constraints

    Authors: Shangqing Tong, Hengrong Lan, Liming Nie, Jianwen Luo, Fei Gao

    Abstract: Photoacoustic tomography (PAT) is a newly emerged imaging modality which enables both high optical contrast and acoustic depth of penetration. Reconstructing images of photoacoustic tomography from a limited amount of sensor data is one of the major challenges in photoacoustic imaging. Previous works based on deep learning were trained in supervised fashion, which directly map the input partia…

    Submitted 23 June, 2023; originally announced June 2023.

  20. arXiv:2306.01232  [pdf, other]

    eess.IV cs.CV

    Deep Reinforcement Learning Framework for Thoracic Diseases Classification via Prior Knowledge Guidance

    Authors: Weizhi Nie, Chen Zhang, Dan Song, Lina Zhao, Yunpeng Bai, Keliang Xie, Anan Liu

    Abstract: The chest X-ray is often utilized for diagnosing common thoracic diseases. In recent years, many approaches have been proposed to handle the problem of automatic diagnosis based on chest X-rays. However, the scarcity of labeled data for related diseases still poses a huge challenge to an accurate diagnosis. In this paper, we focus on the thorax disease diagnostic problem and propose a novel deep r…

    Submitted 1 June, 2023; originally announced June 2023.

  21. arXiv:2305.13774  [pdf, other]

    cs.SD eess.AS

    ADD 2023: the Second Audio Deepfake Detection Challenge

    Authors: Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li

    Abstract: Audio deepfake detection is an emerging topic in the artificial intelligence community. The second Audio Deepfake Detection Challenge (ADD 2023) aims to spur researchers around the world to build new innovative technologies that can further accelerate and foster research on detecting and analyzing deepfake speech utterances. Different from previous challenges (e.g. ADD 2022), ADD 2023 focuses on s…

    Submitted 23 May, 2023; originally announced May 2023.

  22. arXiv:2305.12072  [pdf, other]

    eess.IV cs.CV

    Chest X-ray Image Classification: A Causal Perspective

    Authors: Weizhi Nie, Chen Zhang, Dan Song, Lina Zhao, Yunpeng Bai, Keliang Xie, Anan Liu

    Abstract: The chest X-ray (CXR) is one of the most common and easy-to-get medical tests used to diagnose common diseases of the chest. Recently, many deep learning-based methods have been proposed that are capable of effectively classifying CXRs. Even though these techniques have worked quite well, it is difficult to establish whether what these algorithms actually learn is the cause-and-effect link between…

    Submitted 19 May, 2023; originally announced May 2023.

  23. arXiv:2305.12070  [pdf, other]

    eess.IV cs.CV

    Instrumental Variable Learning for Chest X-ray Classification

    Authors: Weizhi Nie, Chen Zhang, Dan Song, Yunpeng Bai, Keliang Xie, Anan Liu

    Abstract: The chest X-ray (CXR) is commonly employed to diagnose thoracic illnesses, but the challenge of achieving accurate automatic diagnosis through this method persists due to the complex relationship between pathology. In recent years, various deep learning-based approaches have been suggested to tackle this problem but confounding factors such as image resolution or noise problems often damage model…

    Submitted 19 May, 2023; originally announced May 2023.

  24. arXiv:2304.11409  [pdf, other]

    eess.IV cs.CV

    The Devil is in the Upsampling: Architectural Decisions Made Simpler for Denoising with Deep Image Prior

    Authors: Yilin Liu, Jiang Li, Yunkui Pang, Dong Nie, Pew-thian Yap

    Abstract: Deep Image Prior (DIP) shows that some network architectures naturally bias towards smooth images and resist noises, a phenomenon known as spectral bias. Image denoising is an immediate application of this property. Although DIP has removed the requirement of large training sets, it still presents two practical challenges for denoising: architectural design and noise-fitting, which are often inter…

    Submitted 26 August, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

    Comments: Accepted to ICCV 2023

  25. arXiv:2304.07150  [pdf, other]

    eess.SY physics.soc-ph

    FOCUS : A framework for energy system optimization from prosumer to district and city scale

    Authors: Jingyu Gong, Yi Nie, Jonas van Ouwerkerk, Felix Wege, Mauricio Celi Cortés, Christoph von Oy, Jonas Brucksch, Christian Bußar, Thomas Schreiber, Dirk Uwe Sauer, Dirk Müller, Antonello Monti

    Abstract: Decarbonizing the energy sector is one of the main challenges to combat the climate crisis. Cities play an important role to reach climate neutrality as more than 70% of global CO2 emissions originate from urban areas. Decarbonization of energy supply systems can be achieved through various means, including the use of renewable energy sources, improving the efficiency of technologies, the coupling…

    Submitted 14 April, 2023; originally announced April 2023.

  26. arXiv:2303.06548  [pdf, other]

    cs.CV eess.IV

    CoT-MISR:Marrying Convolution and Transformer for Multi-Image Super-Resolution

    Authors: Mingming Xiu, Yang Nie, Qing Song, Chun Liu

    Abstract: As a method of image restoration, image super-resolution has been extensively studied at first. How to transform a low-resolution image to restore its high-resolution image information is a problem that researchers have been exploring. In the early physical transformation methods, the high-resolution pictures generated by these methods always have a serious problem of missing information, and the…

    Submitted 11 March, 2023; originally announced March 2023.

  27. arXiv:2303.02967  [pdf, other]

    eess.IV cs.CV

    Automated Peripancreatic Vessel Segmentation and Labeling Based on Iterative Trunk Growth and Weakly Supervised Mechanism

    Authors: Liwen Zou, Zhenghua Cai, Liang Mao, Ziwei Nie, Yudong Qiu, ** Yang

    Abstract: Peripancreatic vessel segmentation and anatomical labeling play extremely important roles to assist the early diagnosis, surgery planning and prognosis for patients with pancreatic tumors. However, most current techniques cannot achieve satisfactory segmentation performance for peripancreatic veins and usually make predictions with poor integrity and connectivity. Besides, unsupervised labeling al…

    Submitted 6 March, 2023; originally announced March 2023.

  28. arXiv:2303.01507  [pdf, other]

    cs.SD cs.CR cs.LG eess.AS

    Defending against Adversarial Audio via Diffusion Model

    Authors: Shutong Wu, Jiongxiao Wang, Wei **, Weili Nie, Chaowei Xiao

    Abstract: Deep learning models have been widely used in commercial acoustic systems in recent years. However, adversarial audio examples can cause abnormal behaviors for those acoustic systems, while being hard for humans to perceive. Various methods, such as transformation-based defenses and adversarial training, have been proposed to protect acoustic systems from adversarial attacks, but they are less eff…

    Submitted 2 March, 2023; originally announced March 2023.

  29. arXiv:2302.06611  [pdf, other]

    eess.IV

    Deep Learning and Medical Imaging for COVID-19 Diagnosis: A Comprehensive Survey

    Authors: Song Wu, Yazhou Ren, Aodi Yang, Xinyue Chen, Xiaorong Pu, Jing He, Liqiang Nie, Philip S. Yu

    Abstract: COVID-19 (Coronavirus disease 2019) has been quickly spreading since its outbreak, impacting financial markets and healthcare systems globally. Countries all around the world have adopted a number of extraordinary steps to restrict the spreading virus, where early COVID-19 diagnosis is essential. Medical images such as X-ray images and Computed Tomography scans are becoming one of the main diagnos…

    Submitted 12 February, 2023; originally announced February 2023.

  30. arXiv:2301.13326  [pdf, other]

    cs.LG cs.AI cs.DS eess.SY

    A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback

    Authors: Guanyu Nie, Yididiya Y Nadew, Yanhui Zhu, Vaneet Aggarwal, Christopher John Quinn

    Abstract: We investigate the problem of stochastic, combinatorial multi-armed bandits where the learner only has access to bandit feedback and the reward function can be non-linear. We provide a general framework for adapting discrete offline approximation algorithms into sublinear $α$-regret methods that only require bandit feedback, achieving $\mathcal{O}\left(T^\frac{2}{3}\log(T)^\frac{1}{3}\right)$ expe…

    Submitted 11 October, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: This extends the framework in previous version to adapt randomized offline approximation algorithms

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, 2023

  31. arXiv:2210.15149  [pdf]

    eess.IV cs.CV

    Fully Automated Deep Learning-enabled Detection for Hepatic Steatosis on Computed Tomography: A Multicenter International Validation Study

    Authors: Zhongyi Zhang, Guixia Li, Ziqiang Wang, Feng Xia, Ning Zhao, Huibin Nie, Zezhong Ye, Joshua Lin, Yiyi Hui, Xiangchun Liu

    Abstract: Despite high global prevalence of hepatic steatosis, no automated diagnostics demonstrated generalizability in detecting steatosis on multiple international datasets. Traditionally, hepatic steatosis detection relies on clinicians selecting the region of interest (ROI) on computed tomography (CT) to measure liver attenuation. ROI selection demands time and expertise, and therefore is not routinely…

    Submitted 6 November, 2022; v1 submitted 26 October, 2022; originally announced October 2022.

  32. arXiv:2209.15012  [pdf, other]

    eess.IV physics.optics

    Ghost translation

    Authors: Wenhan Ren, Xiaoyu Nie, Tao Peng, Marlan O. Scully

    Abstract: Artificial intelligence has recently been widely used in computational imaging. The deep neural network (DNN) improves the signal-to-noise ratio of the retrieved images, whose quality is otherwise corrupted due to the low sampling ratio or noisy environments. This work proposes a new computational imaging scheme based on the sequence transduction mechanism with the transformer network. The simulat…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: 10 pages, 8 figures

  33. arXiv:2209.07618  [pdf, other]

    cs.GT cs.AI cs.MA eess.SY

    Differentiable Bilevel Programming for Stackelberg Congestion Games

    Authors: Jiayang Li, Jing Yu, Qianni Wang, Boyi Liu, Zhaoran Wang, Yu Marco Nie

    Abstract: In a Stackelberg congestion game (SCG), a leader aims to maximize their own gain by anticipating and manipulating the equilibrium state at which the followers settle by playing a congestion game. Often formulated as bilevel programs, large-scale SCGs are well known for their intractability and complexity. Here, we attempt to tackle this computational challenge by marrying traditional methodologies…

    Submitted 13 May, 2024; v1 submitted 15 September, 2022; originally announced September 2022.

  34. arXiv:2208.08644  [pdf, other]

    physics.optics eess.SP

    Ghost Aperture Synthesis Imaging with Computational Aberration Cancellation

    Authors: Shuai Sun, Zhen-Wu Nie, Yue-Gang Li, Hui-Zu Lin, Wei-Tao Liu, **-Xing Chen

    Abstract: Although optical aperture synthesis has been generally regarded as the only access to very large imager for over a century, the problem of phasing all the giant sub-apertures on the scale of wavelength is still prohibitive. Besides, the accompanied adaptive optics combatting the atmospheric turbulence is also bulky and complicated. We here propose a new paradigm aperture synthesis imager through t…

    Submitted 30 August, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: Ghost Aperture Synthesis, reported in the previous version (v1), is worth a better presentation for a wider audience. To make both the physics and the principle of the imager more readable, we withdrew the previous manuscript and uploaded a new version, rearranged following suggestions from referees of a physics journal

  35. arXiv:2206.12559  [pdf, other]

    cs.SD cs.AI cs.CL eess.AS

    Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

    Authors: Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie

    Abstract: Expressive speech synthesis, like audiobook synthesis, is still challenging for style representation learning and prediction. Deriving from reference audio or predicting style tags from text requires a huge amount of labeled data, which is costly to acquire and difficult to define and annotate accurately. In this paper, we propose a novel framework for learning style representation from abundant p…

    Submitted 25 June, 2022; originally announced June 2022.

    Comments: Accepted by Interspeech 2022

  36. arXiv:2206.05641  [pdf, ps, other]

    cs.CV cs.LG eess.IV

    An Unsupervised Deep-Learning Method for Bone Age Assessment

    Authors: Hao Zhu, Wan-Jing Nie, Yue-Jie Hou, Qi-Meng Du, Si-Jing Li, Chi-Chun Zhou

    Abstract: The bone age, reflecting the degree of development of the bones, can be used to predict the adult height and detect endocrine diseases of children. Both examinations of radiologists and variability of operators have a significant impact on bone age assessment. To decrease human intervention, machine learning algorithms are used to assess the bone age automatically. However, conventional supervise…

    Submitted 11 June, 2022; originally announced June 2022.

  37. Fast and accurate method for computing non-smooth solutions to constrained control problems

    Authors: Lucian Nita, Eduardo M. G. Vila, Marta A. Zagorowska, Eric C. Kerrigan, Yuanbo Nie, Ian McInerney, Paola Falugi

    Abstract: Introducing flexibility in the time-discretisation mesh can improve convergence and computational time when solving differential equations numerically, particularly when the solutions are discontinuous, as commonly found in control problems with constraints. State-of-the-art methods use fixed mesh schemes, which cannot achieve superlinear convergence in the presence of non-smooth solutions. In thi…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 6 pages, 4 figures, Accepted to 20th European Control Conference (ECC 2022)

    Journal ref: Proc. 20th European Control Conference (ECC 2022)

  38. arXiv:2204.06705  [pdf, other]

    eess.SP

    Hierarchical-Absolute Reciprocity Calibration for Millimeter-wave Hybrid Beamforming Systems

    Authors: Li Chen, Rongjiang Nie, Yunfei Chen, Weidong Wang

    Abstract: In time-division duplexing (TDD) millimeter-wave (mmWave) massive multiple-input multiple-output (MIMO) systems, the reciprocity mismatch severely degrades the performance of the hybrid beamforming (HBF). In this work, to mitigate the detrimental effect of the reciprocity mismatch, we investigate reciprocity calibration for the mmWave-HBF system with a fully-connected phase shifter network. To red…

    Submitted 2 November, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

  39. arXiv:2203.14006  [pdf, other]

    math.DS eess.SY physics.data-an q-bio.QM

    Continuity scaling: A rigorous framework for detecting and quantifying causality accurately

    Authors: Xiong Ying, Si-Yang Leng, Huan-Fei Ma, Qing Nie, Ying-Cheng Lai, Wei Lin

    Abstract: Data based detection and quantification of causation in complex, nonlinear dynamical systems is of paramount importance to science, engineering and beyond. Inspired by the widely used methodology in recent years, the cross-map-based techniques, we develop a general framework to advance towards a comprehensive understanding of dynamical causal mechanisms, which is consistent with the natural interp…

    Submitted 26 March, 2022; originally announced March 2022.

    Comments: 7 figures; The article has been peer reviewed and accepted by RESEARCH

  40. Distortion-Tolerant Monocular Depth Estimation On Omnidirectional Images Using Dual-cubemap

    Authors: Zhijie Shen, Chunyu Lin, Lang Nie, Kang Liao, Yao Zhao

    Abstract: Estimating the depth of omnidirectional images is more challenging than that of normal field-of-view (NFoV) images because the varying distortion can significantly twist an object's shape. The existing methods suffer from troublesome distortion while estimating the depth of omnidirectional images, leading to inferior performance. To reduce the negative impact of the distortion influence, we propos…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted by ICME2021, poster

  41. arXiv:2203.00328  [pdf, other]

    cs.CL cs.SD eess.AS

    BERT-LID: Leveraging BERT to Improve Spoken Language Identification

    Authors: Yuting Nie, Junhong Zhao, Wei-Qiang Zhang, **feng Bai

    Abstract: Language identification is the task of automatically determining the identity of a language conveyed by a spoken segment. It has a profound impact on the multilingual interoperability of an intelligent speech system. Despite language identification attaining high accuracy on medium or long utterances (>3s), the performance on short utterances (<=1s) is still far from satisfactory. We propose a BERT…

    Submitted 11 October, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: accepted by ISCSLP 2022

  42. Mobile Device Association and Resource Allocation in Small-Cell IoT Networks with Mobile Edge Computing and Caching

    Authors: Tianqing Zhou, Yali Yue, Dong Qin, Xuefang Nie, Xuan Li, Chunguo Li

    Abstract: To meet the need of computation-sensitive (CS) and high-rate (HR) communications, the framework of mobile edge computing and caching has been widely regarded as a promising solution. When such a framework is implemented in small-cell IoT (Internet of Things) networks, it is a key and open topic how to assign mobile edge computing and caching servers to mobile devices (MDs) with CS and HR communicat…

    Submitted 26 February, 2022; originally announced February 2022.

  43. arXiv:2202.08433  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    ADD 2022: the First Audio Deep Synthesis Detection Challenge

    Authors: Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li, Zheng Lian, Bin Liu

    Abstract: Audio deepfake detection is an emerging topic, which was included in the ASVspoof 2021. However, the recent shared tasks have not covered many real-life and challenging scenarios. The first Audio Deep synthesis Detection challenge (ADD) was motivated to fill in the gap. The ADD 2022 includes three tracks: low-quality fake audio detection (LF), partially fake audio detection (PF) and audio fake gam…

    Submitted 26 February, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted by ICASSP 2022

  44. arXiv:2112.13303  [pdf, other]

    physics.optics eess.IV

    Imaging through scattering media via spatial-temporal encoded pattern illumination

    Authors: Xingchen Zhao, Xiaoyu Nie, Zhenhuan Yi, Tao Peng, Marlan O. Scully

    Abstract: Optical imaging through scattering media is a long-standing challenge. Although many approaches have been developed to focus light or image objects through scattering media, they are either invasive, restricted to stationary or slowly-moving media, or require high-resolution cameras and complex algorithms to retrieve the images. Here we introduce a computational imaging technique that can overcome…

    Submitted 25 December, 2021; originally announced December 2021.

    Comments: 7 pages, 4 figures

  45. arXiv:2112.13293  [pdf, other]

    eess.IV physics.optics

    Deep-learned speckle pattern and its application to ghost imaging

    Authors: Xiaoyu Nie, Haotian Song, Wenhan Ren, Xingchen Zhao, Zhedong Zhang, Tao Peng, Marlan O. Scully

    Abstract: In this paper, we present a method for speckle pattern design using deep learning. The speckle patterns possess unique features after experiencing convolutions in Speckle-Net, our well-designed framework for speckle pattern generation. We then apply our method to the computational ghost imaging system. The standard deep learning-assisted ghost imaging methods use the network to recognize the recon…

    Submitted 27 December, 2021; v1 submitted 25 December, 2021; originally announced December 2021.

    Comments: 12 pages, 12 figures

  46. arXiv:2112.13187  [pdf, ps, other]

    eess.SP

    TeraHertz Band Communication: An Old Problem Revisited and Research Directions for the Next Decade

    Authors: Ian F. Akyildiz, Chong Han, Zhifeng Hu, Shuai Nie, Josep M. Jornet

    Abstract: Terahertz (THz) band communications are envisioned as a key technology for 6G and Beyond. As a fundamental wireless infrastructure, THz communication can boost abundant promising applications. In 2014, our team published two comprehensive roadmaps for the development and progress of THz communication networks [1], [2], which helped the research community to start research on this subject afterward…

    Submitted 26 April, 2022; v1 submitted 25 December, 2021; originally announced December 2021.

    Comments: To appear in IEEE Transactions on Communications, 2022

  47. arXiv:2111.07549  [pdf, other]

    cs.CL cs.SD eess.AS

    Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data

    Authors: Zhu Li, Yuqing Zhang, Mengxi Nie, Ming Yan, Mengnan He, Ruixiong Zhang, Caixia Gong

    Abstract: Recent advancements in end-to-end speech synthesis have made it possible to generate highly natural speech. However, training these models typically requires a large amount of high-fidelity speech data, and for unseen texts, the prosody of synthesized speech is relatively unnatural. To address these issues, we propose to combine a fine-tuned BERT-based front-end with a pre-trained FastSpeech2-base…

    Submitted 15 November, 2021; originally announced November 2021.

  48. arXiv:2111.06400  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Fast T2w/FLAIR MRI Acquisition by Optimal Sampling of Information Complementary to Pre-acquired T1w MRI

    Authors: Junwei Yang, Xiao-Xin Li, Feihong Liu, Dong Nie, Pietro Lio, Haikun Qi, Dinggang Shen

    Abstract: Recent studies on T1-assisted MRI reconstruction for under-sampled images of other modalities have demonstrated the potential of further accelerating MRI acquisition of other modalities. Most of the state-of-the-art approaches have achieved improvement through the development of network architectures for fixed under-sampling patterns, without fully exploiting the complementary information between…

    Submitted 10 November, 2021; originally announced November 2021.

  49. 0.8% Nyquist computational ghost imaging via non-experimental deep learning

    Authors: Haotian Song, Xiaoyu Nie, Hairong Su, Hui Chen, Yu Zhou, Xingchen Zhao, Tao Peng, Marlan O. Scully

    Abstract: We present a framework for computational ghost imaging based on deep learning and customized pink noise speckle patterns. The deep neural network in this work, which can learn the sensing model and enhance image reconstruction quality, is trained merely by simulation. To demonstrate the sub-Nyquist level in our work, the conventional computational ghost imaging results, reconstructed imaging resul…

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: 10 pages, 6 figures

  50. A survey on computational spectral reconstruction methods from RGB to hyperspectral imaging

    Authors: Jingang Zhang, Runmu Su, Wenqi Ren, Qiang Fu, Felix Heide, Yunfeng Nie

    Abstract: Hyperspectral imaging enables versatile applications due to its competence in capturing abundant spatial and spectral information, which are crucial for identifying substances. However, the devices for acquiring hyperspectral images are expensive and complicated. Therefore, many alternative spectral imaging methods have been proposed by directly reconstructing the hyperspectral information from lo…

    Submitted 13 July, 2022; v1 submitted 30 June, 2021; originally announced June 2021.

    Journal ref: Scientific Reports | (2022) 12:11905