Skip to main content

Showing 1–50 of 73 results for author: Wang, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.12292  [pdf, other

    cs.SD cs.AI eess.AS

    JEN-1 DreamStyler: Customized Musical Concept Learning via Pivotal Parameters Tuning

    Authors: Boyu Chen, Peike Li, Yao Yao, Alex Wang

    Abstract: Large models for text-to-music generation have achieved significant progress, facilitating the creation of high-quality and varied musical compositions from provided text prompts. However, input text prompts may not precisely capture user requirements, particularly when the objective is to generate music that embodies a specific concept derived from a designated reference collection. In this paper… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2405.08672  [pdf, other

    eess.IV cs.CV

    EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera

    Authors: Beilei Cui, Mobarakol Islam, Long Bai, An Wang, Hongliang Ren

    Abstract: Depth estimation plays a crucial role in various tasks within endoscopic surgery, including navigation, surface reconstruction, and augmented reality visualization. Despite the significant achievements of foundation models in vision tasks, including depth estimation, their direct application to the medical domain often results in suboptimal performance. This highlights the need for efficient adapt… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: early accepted by MICCAI 2024

  3. arXiv:2404.10640  [pdf, other

    eess.IV

    Adapting SAM for Surgical Instrument Tracking and Segmentation in Endoscopic Submucosal Dissection Videos

    Authors: Jieming Yu, Long Bai, Guankun Wang, An Wang, Xiaoxiao Yang, Huxin Gao, Hongliang Ren

    Abstract: The precise tracking and segmentation of surgical instruments have led to a remarkable enhancement in the efficiency of surgical procedures. However, the challenge lies in achieving accurate segmentation of surgical instruments while minimizing the need for manual annotation and reducing the time required for the segmentation process. To tackle this, we propose a novel framework for surgical instr… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: To appear in IEEE ICRA 2024 C4SR+ Workshop

  4. Cepstral Analysis Based Artifact Detection, Recognition and Removal for Prefrontal EEG

    Authors: Siqi Han, Chao Zhang, Jiaxin Lei, Qingquan Han, Yuhui Du, Anhe Wang, Shuo Bai, Milin Zhang

    Abstract: This paper proposes to use cepstrum for artifact detection, recognition and removal in prefrontal EEG. This work focuses on the artifact caused by eye movement. A database containing artifact-free EEG and eye movement contaminated EEG from different subjects is established. A cepstral analysis-based feature extraction with support vector machine (SVM) based classifier is designed to identify the a… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 5 pages, 4 figures, published by TCAS-II

    Journal ref: IEEE Transactions on Circuits and Systems II: Express Briefs, 2023

  5. arXiv:2403.15411  [pdf, other

    eess.SP

    UAV Deployment Optimization in UAV-assisted Wireless Communications

    Authors: Xueqi Zhang, Aimin Wang, Geng Sun, Lingling Liu, **g Zhang

    Abstract: Due to the fact that the locations of base stations (BSs) cannot be changed after they are installed, it is very difficult to communicate directly with remote user equipment (UE), which will directly affect the lifespan of the system. Unmanned aerial vehicles (UAVs) offer a hopeful solution as mobile relays for fifth-generation wireless communications due to the flexible and cost-effective deploym… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures, 2 tables

  6. arXiv:2403.15410  [pdf, other

    eess.SP eess.SY

    Secure and Energy-efficient Unmanned Aerial Vehicle-enabled Visible Light Communication via A Multi-objective Optimization Approach

    Authors: Lingling Liu, Aimin Wang, **g Wu, Jiao Lu, Jiahui Li, Geng Sun

    Abstract: In this research, a unique approach to provide communication service for terrestrial receivers via using unmanned aerial vehicle-enabled visible light communication is investigated. Specifically, we take into account a unmanned aerial vehicle-enabled visible light communication scenario with multiplex transmitters, multiplex receivers, and a single eavesdropper, each of which is equipped with a si… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 18 pages, 9 tables, 3 tables

  7. arXiv:2403.12985  [pdf, other

    cs.IT eess.SP

    Multi-objective Optimization for Data Collection in UAV-assisted Agricultural IoT

    Authors: Lingling Liu, Aimin Wang, Geng Sun, Jiahui Li, Hongyang Pan, Tony Q. S. Quek

    Abstract: The ground fixed base stations (BSs) are often deployed inflexibly, and have high overheads, as well as are susceptible to the damage from natural disasters, making it impractical for them to continuously collect data from sensor devices. To improve the network coverage and performance of wireless communication, unmanned aerial vehicles (UAVs) have been introduced in diverse wireless networks, the… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 13 pages, 7 figures, 4 tables

  8. arXiv:2403.09327  [pdf, other

    cs.CV eess.IV

    Perspective-Equivariant Imaging: an Unsupervised Framework for Multispectral Pansharpening

    Authors: Andrew Wang, Mike Davies

    Abstract: Ill-posed image reconstruction problems appear in many scenarios such as remote sensing, where obtaining high quality images is crucial for environmental monitoring, disaster management and urban planning. Deep learning has seen great success in overcoming the limitations of traditional methods. However, these inverse problems rarely come with ground truth data, highlighting the importance of unsu… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Pre-print

  9. arXiv:2403.06459  [pdf, other

    eess.IV cs.CV

    From Pixel to Cancer: Cellular Automata in Computed Tomography

    Authors: Yuxiang Lai, Xiaoxi Chen, Angtian Wang, Alan Yuille, Zongwei Zhou

    Abstract: AI for cancer detection encounters the bottleneck of data scarcity, annotation difficulty, and low prevalence of early tumors. Tumor synthesis seeks to create artificial tumors in medical images, which can greatly diversify the data and annotations for AI training. However, current tumor synthesis approaches are not applicable across different organs due to their need for specific expertise and de… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  10. arXiv:2403.04116  [pdf, other

    eess.IV cs.CV

    Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis

    Authors: Yuanhao Cai, Yixun Liang, Jiahao Wang, Angtian Wang, Yulun Zhang, Xiaokang Yang, Zongwei Zhou, Alan Yuille

    Abstract: X-ray is widely applied for transmission imaging due to its stronger penetration than natural light. When rendering novel view X-ray projections, existing methods mainly based on NeRF suffer from long training time and slow inference speed. In this paper, we propose a 3D Gaussian splatting-based framework, namely X-Gaussian, for X-ray novel view synthesis. Firstly, we redesign a radiative Gaussian… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: The first 3D Gaussian Splatting-based method for X-ray 3D reconstruction

  11. arXiv:2402.08159  [pdf, other

    eess.IV cs.CV

    Poisson flow consistency models for low-dose CT image denoising

    Authors: Dennis Hein, Adam Wang, Ge Wang

    Abstract: Diffusion and Poisson flow models have demonstrated remarkable success for a wide range of generative tasks. Nevertheless, their iterative nature results in computationally expensive sampling and the number of function evaluations (NFE) required can be orders of magnitude larger than for single-step methods. Consistency models are a recent class of deep generative models which enable single-step s… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  12. arXiv:2401.09791  [pdf

    eess.IV cs.CV cs.LG

    BreastRegNet: A Deep Learning Framework for Registration of Breast Faxitron and Histopathology Images

    Authors: Negar Golestani, Aihui Wang, Gregory R Bean, Mirabela Rusu

    Abstract: A standard treatment protocol for breast cancer entails administering neoadjuvant therapy followed by surgical removal of the tumor and surrounding tissue. Pathologists typically rely on cabinet X-ray radiographs, known as Faxitron, to examine the excised breast tissue and diagnose the extent of residual disease. However, accurately determining the location, size, and focality of residual cancer c… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  13. arXiv:2312.05832  [pdf, other

    cs.CV eess.IV

    Spatial-wise Dynamic Distillation for MLP-like Efficient Visual Fault Detection of Freight Trains

    Authors: Yang Zhang, Huilin Pan, Mingying Li, An Wang, Yang Zhou, Hongliang Ren

    Abstract: Despite the successful application of convolutional neural networks (CNNs) in object detection tasks, their efficiency in detecting faults from freight train images remains inadequate for implementation in real-world engineering scenarios. Existing modeling shortcomings of spatial invariance and pooling layers in conventional CNNs often ignore the neglect of crucial global information, resulting i… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 10 pages, 6 figures

  14. arXiv:2312.01566  [pdf, other

    physics.med-ph eess.IV

    Coronary Atherosclerotic Plaque Characterization with Photon-counting CT: a Simulation-based Feasibility Study

    Authors: Mengzhou Li, Mingye Wu, Jed Pack, Pengwei Wu, Bruno De Man, Adam Wang, Koen Nieman, Ge Wang

    Abstract: Recent development of photon-counting CT (PCCT) brings great opportunities for plaque characterization with much-improved spatial resolution and spectral imaging capability. While existing coronary plaque PCCT imaging results are based on detectors made of CZT or CdTe materials, deep-silicon photon-counting detectors have unique performance characteristics and promise distinct imaging capabilities… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 13 figures, 5 tables

  15. arXiv:2311.10959  [pdf, other

    eess.IV cs.CV

    Structure-Aware Sparse-View X-ray 3D Reconstruction

    Authors: Yuanhao Cai, Jiahao Wang, Alan Yuille, Zongwei Zhou, Angtian Wang

    Abstract: X-ray, known for its ability to reveal internal structures of objects, is expected to provide richer information for 3D reconstruction than visible light. Yet, existing neural radiance fields (NeRF) algorithms overlook this important nature of X-ray, leading to their limitations in capturing structural contents of imaged objects. In this paper, we propose a framework, Structure-Aware X-ray Neural… ▽ More

    Submitted 23 March, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: CVPR 2024; The first Transformer-based method for X-ray and CT 3D reconstruction

  16. arXiv:2310.19180  [pdf, other

    cs.SD cs.AI cs.CV cs.MM eess.AS

    JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation

    Authors: Yao Yao, Peike Li, Boyu Chen, Alex Wang

    Abstract: With rapid advances in generative artificial intelligence, the text-to-music synthesis task has emerged as a promising direction for music generation from scratch. However, finer-grained control over multi-track generation remains an open challenge. Existing models exhibit strong raw generation capability but lack the flexibility to compose separate tracks and combine them in a controllable manner… ▽ More

    Submitted 2 November, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: Preprints

  17. arXiv:2310.13124  [pdf

    eess.SY

    Efficient online cross-covariance monitoring with incremental SVD: An approach for the detection of emerging dependency patterns in IoT systems

    Authors: Xinmiao Luan, Qing Zou, Jian Li, Andi Wang

    Abstract: The development of the manufacturing systems has made it increasingly necessary to monitor the data generated by multiple interconnected subsystems with rapid incoming of samples. Based on incremental Singular Value Decomposition (ISVD), we develop a general online monitoring approach for the relationship of data generated from two interconnected subsystems, where each subsystem produces big data… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  18. arXiv:2310.00396  [pdf, other

    eess.SY

    Joint Scheduling and Trajectory Optimization of Charging UAV in Wireless Rechargeable Sensor Networks

    Authors: Yanheng Liu, Hongyang Pan, Geng Sun, Aimin Wang, Jiahui Li, Shuang Liang

    Abstract: Wireless rechargeable sensor networks with a charging unmanned aerial vehicle (CUAV) have the broad application prospects in the power supply of the rechargeable sensor nodes (SNs). However, how to schedule a CUAV and design the trajectory to improve the charging efficiency of the entire system is still a vital problem. In this paper, we formulate a joint-CUAV scheduling and trajectory optimizatio… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  19. arXiv:2308.07156  [pdf, other

    eess.IV cs.CV

    SAM Meets Robotic Surgery: An Empirical Study on Generalization, Robustness and Adaptation

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Yang Zhang, Hongliang Ren

    Abstract: The Segment Anything Model (SAM) serves as a fundamental model for semantic segmentation and demonstrates remarkable generalization capabilities across a wide range of downstream scenarios. In this empirical study, we examine SAM's robustness and zero-shot generalizability in the field of robotic surgery. We comprehensively explore different scenarios, including prompted and unprompted situations,… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: Accepted as Oral Presentation at MedAGI Workshop - MICCAI 2023 1st International Workshop on Foundation Models for General Medical AI. arXiv admin note: substantial text overlap with arXiv:2304.14674

  20. arXiv:2308.04729  [pdf, other

    cs.SD cs.AI cs.LG cs.MM eess.AS

    JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models

    Authors: Peike Li, Boyu Chen, Yao Yao, Yikai Wang, Allen Wang, Alex Wang

    Abstract: Music generation has attracted growing interest with the advancement of deep generative models. However, generating music conditioned on textual descriptions, known as text-to-music, remains challenging due to the complexity of musical structures and high sampling rate requirements. Despite the task's significance, prevailing generative models exhibit limitations in music quality, computational ef… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  21. arXiv:2308.03094  [pdf

    physics.optics eess.SP

    A reconfigurable multiple-format coherent-dual-band signal generator based on a single optoelectronic oscillation cavity

    Authors: Yibei Wang, Yalan Wang, Hongyi Wang, Xiaotong Liu, Hong Chen, ** Zhang, Dongyu Li, Dangwei Wang, Anle Wang

    Abstract: An optoelectronic oscillation method with reconfigurable multiple formats for simultaneous generation of coherent dual-band signals is proposed and experimentally demonstrated. By introducing a compatible filtering mechanism based on stimulated Brillouin scattering (SBS) effect into a typical Phase-shifted grating Bragg fiber (PS-FBG) notch filtering cavity, dual mode-selection mechanisms which ha… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: 12 pages, 8 figures

  22. arXiv:2308.02845  [pdf, other

    eess.IV cs.CV cs.RO

    Landmark Detection using Transformer Toward Robot-assisted Nasal Airway Intubation

    Authors: Tianhang Liu, Hechen Li, Long Bai, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren

    Abstract: Robot-assisted airway intubation application needs high accuracy in locating targets and organs. Two vital landmarks, nostrils and glottis, can be detected during the intubation to accommodate the stages of nasal intubation. Automated landmark detection can provide accurate localization and quantitative evaluation. The Detection Transformer (DeTR) leads object detectors to a new paradigm with long… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: ICBIR 2023 (Best Student Paper Award). Code availability: https://github.com/ConorLTH/airway_intubation_landmarks_detection

  23. arXiv:2307.02452  [pdf, other

    eess.IV cs.CV cs.RO

    LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion

    Authors: Long Bai, Tong Chen, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren

    Abstract: Wireless capsule endoscopy (WCE) is a painless and non-invasive diagnostic tool for gastrointestinal (GI) diseases. However, due to GI anatomical constraints and hardware manufacturing limitations, WCE vision signals may suffer from insufficient illumination, leading to a complicated screening and examination procedure. Deep learning-based low-light image enhancement (LLIE) in the medical field gr… ▽ More

    Submitted 22 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: To appear in MICCAI 2023. Code availability: https://github.com/longbai1006/LLCaps

  24. arXiv:2306.16285  [pdf, other

    eess.IV cs.CV

    Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Hongliang Ren

    Abstract: Despite their impressive performance in various surgical scene understanding tasks, deep learning-based methods are frequently hindered from deploying to real-world surgical applications for various causes. Particularly, data collection, annotation, and domain shift in-between sites and patients are the most common obstacles. In this work, we mitigate data-related issues by efficiently leveraging… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: First two authors contributed equally. Accepted by IROS2023

  25. arXiv:2306.12109  [pdf, other

    eess.IV cs.CV

    DiffuseIR:Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images

    Authors: Mingjie Pan, Yulu Gan, Fangxu Zhou, Jiaming Liu, Aimin Wang, Shanghang Zhang, Dawei Li

    Abstract: Three-dimensional microscopy is often limited by anisotropic spatial resolution, resulting in lower axial resolution than lateral resolution. Current State-of-The-Art (SoTA) isotropic reconstruction methods utilizing deep neural networks can achieve impressive super-resolution performance in fixed imaging settings. However, their generality in practical use is limited by degraded performance cause… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  26. arXiv:2306.03511  [pdf, other

    eess.IV cs.CV

    Curriculum-Based Augmented Fourier Domain Adaptation for Robust Medical Image Segmentation

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Hongliang Ren

    Abstract: Accurate and robust medical image segmentation is fundamental and crucial for enhancing the autonomy of computer-aided diagnosis and intervention systems. Medical data collection normally involves different scanners, protocols, and populations, making domain adaptation (DA) a highly demanding research field to alleviate model degradation in the deployment site. To preserve the model performance ac… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Work under review. First three authors contributed equally

  27. arXiv:2306.00451  [pdf, other

    eess.IV cs.CV

    S$^2$ME: Spatial-Spectral Mutual Teaching and Ensemble Learning for Scribble-supervised Polyp Segmentation

    Authors: An Wang, Mengya Xu, Yang Zhang, Mobarakol Islam, Hongliang Ren

    Abstract: Fully-supervised polyp segmentation has accomplished significant triumphs over the years in advancing the early diagnosis of colorectal cancer. However, label-efficient solutions from weak supervision like scribbles are rarely explored yet primarily meaningful and demanding in medical practice due to the expensiveness and scarcity of densely-annotated polyp data. Besides, various deployment issues… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: MICCAI 2023 Early Acceptance

  28. arXiv:2304.14674  [pdf, other

    eess.IV cs.CV cs.RO

    SAM Meets Robotic Surgery: An Empirical Study in Robustness Perspective

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Yang Zhang, Hongliang Ren

    Abstract: Segment Anything Model (SAM) is a foundation model for semantic segmentation and shows excellent generalization capability with the prompts. In this empirical study, we investigate the robustness and zero-shot generalizability of the SAM in the domain of robotic surgery in various settings of (i) prompted vs. unprompted; (ii) bounding box vs. points-based prompt; (iii) generalization under corrupt… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Work under active progress

  29. arXiv:2304.06477  [pdf, other

    eess.SP

    Building Performance Simulations Can Inform IoT Privacy Leaks in Buildings

    Authors: Alan Wang, Bradford Campbell, Arsalan Heydarian

    Abstract: As IoT devices become cheaper, smaller, and more ubiquitously deployed, they can reveal more information than their intended design and threaten user privacy. Indoor Environmental Quality (IEQ) sensors previously installed for energy savings and indoor health monitoring have emerged as an avenue to infer sensitive occupant information. For example, light sensors are a known conduit for inspecting… ▽ More

    Submitted 26 March, 2023; originally announced April 2023.

  30. Arrhythmia Classifier Based on Ultra-Lightweight Binary Neural Network

    Authors: Ninghao Pu, Zhongxing Wu, Ao Wang, Hanshi Sun, Zi** Liu, Hao Liu

    Abstract: Reasonably and effectively monitoring arrhythmias through ECG signals has significant implications for human health. With the development of deep learning, numerous ECG classification algorithms based on deep learning have emerged. However, most existing algorithms trade off high accuracy for complex models, resulting in high storage usage and power consumption. This also inevitably increases the… ▽ More

    Submitted 25 August, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: 6 pages, 3 figures

  31. arXiv:2303.12148  [pdf, other

    eess.IV cs.CV cs.LG

    Neural Pre-Processing: A Learning Framework for End-to-end Brain MRI Pre-processing

    Authors: Xinzi He, Alan Wang, Mert R. Sabuncu

    Abstract: Head MRI pre-processing involves converting raw images to an intensity-normalized, skull-stripped brain in a standard coordinate space. In this paper, we propose an end-to-end weakly supervised learning approach, called Neural Pre-processing (NPP), for solving all three sub-tasks simultaneously via a neural network, trained on a large dataset without individual sub-task supervision. Because the ov… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 8

  32. arXiv:2211.13937  [pdf, other

    cs.LG cs.AI eess.SY math.OC stat.ML

    Operator Splitting Value Iteration

    Authors: Amin Rakhsha, Andrew Wang, Mohammad Ghavamzadeh, Amir-massoud Farahmand

    Abstract: We introduce new planning and reinforcement learning algorithms for discounted MDPs that utilize an approximate model of the environment to accelerate the convergence of the value function. Inspired by the splitting approach in numerical linear algebra, we introduce Operator Splitting Value Iteration (OS-VI) for both Policy Evaluation and Control problems. OS-VI achieves a much faster convergence… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS2022

  33. arXiv:2211.12421  [pdf, other

    q-bio.NC cs.LG eess.IV

    Data-Driven Network Neuroscience: On Data Collection and Benchmark

    Authors: Jiaxing Xu, Yunhan Yang, David Tse Jung Huang, Sophi Shilpa Gururajapathy, Yi** Ke, Miao Qiao, Alan Wang, Haribalan Kumar, Josh McGeown, Eryn Kwon

    Abstract: This paper presents a comprehensive and quality collection of functional human brain network data for potential research in the intersection of neuroscience, machine learning, and graph analytics. Anatomical and functional MRI images have been used to understand the functional connectivity of the human brain and are particularly important in identifying underlying neurodegenerative conditions such… ▽ More

    Submitted 29 October, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  34. arXiv:2211.04894  [pdf, other

    cs.CV cs.LG cs.MM eess.IV

    Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives

    Authors: Haoning Wu, Erli Zhang, Liang Liao, Chaofeng Chen, **gwen Hou, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

    Abstract: The rapid increase in user-generated-content (UGC) videos calls for the development of effective video quality assessment (VQA) algorithms. However, the objective of the UGC-VQA problem is still ambiguous and can be viewed from two perspectives: the technical perspective, measuring the perception of distortions; and the aesthetic perspective, which relates to preference and recommendation on conte… ▽ More

    Submitted 7 March, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  35. arXiv:2209.01386  [pdf, other

    cs.AR cs.LG eess.SP

    SaleNet: A low-power end-to-end CNN accelerator for sustained attention level evaluation using EEG

    Authors: Chao Zhang, Zijian Tang, Taoming Guo, Jiaxin Lei, Jiaxin Xiao, Anhe Wang, Shuo Bai, Milin Zhang

    Abstract: This paper proposes SaleNet - an end-to-end convolutional neural network (CNN) for sustained attention level evaluation using prefrontal electroencephalogram (EEG). A bias-driven pruning method is proposed together with group convolution, global average pooling (GAP), near-zero pruning, weight clustering and quantization for the model compression, achieving a total compression ratio of 183.11x. Th… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 5 pages, 4 figures, to be published in IEEE International Symposium on Circuits and Systems (ISCAS) 2022

  36. arXiv:2203.04294  [pdf, other

    eess.IV cs.AI cs.CV

    NaviAirway: a Bronchiole-sensitive Deep Learning-based Airway Segmentation Pipeline

    Authors: Andong Wang, Terence Chi Chun Tam, Ho Ming Poon, Kun-Chang Yu, Wei-Ning Lee

    Abstract: Airway segmentation is essential for chest CT image analysis. Different from natural image segmentation, which pursues high pixel-wise accuracy, airway segmentation focuses on topology. The task is challenging not only because of its complex tree-like structure but also the severe pixel imbalance among airway branches of different generations. To tackle the problems, we present a NaviAirway method… ▽ More

    Submitted 16 June, 2023; v1 submitted 7 March, 2022; originally announced March 2022.

  37. arXiv:2202.12943  [pdf, other

    eess.SP cs.LG

    Arrhythmia Classifier Using Convolutional Neural Network with Adaptive Loss-aware Multi-bit Networks Quantization

    Authors: Hanshi Sun, Ao Wang, Ninghao Pu, Zhiqing Li, Junguang Huang, Hao Liu, Zhi Qi

    Abstract: Cardiovascular disease (CVDs) is one of the universal deadly diseases, and the detection of it in the early stage is a challenging task to tackle. Recently, deep learning and convolutional neural networks have been employed widely for the classification of objects. Moreover, it is promising that lots of networks can be deployed on wearable devices. An increasing number of methods can be used to re… ▽ More

    Submitted 27 February, 2022; originally announced February 2022.

    Comments: 7 pages, 7 figures

  38. arXiv:2202.12806  [pdf, other

    physics.optics eess.IV

    Deep learning-assisted imaging through stationary scattering media

    Authors: Siddharth Rawat, Jonathan Wendoloski, Anna Wang

    Abstract: Imaging through scattering media is a challenging problem owing to speckle decorrelations from perturbations in the media itself. For in-line imaging modalities, which are appealing because they are compact, require no moving parts, and are robust, negating the effects of such scattering becomes particularly challenging. Here we explore the effect of stationary scattering media on light scattering… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: 4 figures

  39. arXiv:2202.02701  [pdf, other

    eess.IV cs.CV

    Hyper-Convolutions via Implicit Kernels for Medical Imaging

    Authors: Tianyu Ma, Alan Q. Wang, Adrian V. Dalca, Mert R. Sabuncu

    Abstract: The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares weights across all pixels. A standard CNN's capacity, and thus its performance, is directly related to the number of learnable kernel weights, which is determin… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2105.10559

  40. arXiv:2112.10074  [pdf, other

    eess.IV cs.CV cs.LG

    QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking Results

    Authors: Raghav Mehta, Angelos Filos, Ujjwal Baid, Chiharu Sako, Richard McKinley, Michael Rebsamen, Katrin Datwyler, Raphael Meier, Piotr Radojewski, Gowtham Krishnan Murugesan, Sahil Nalawade, Chandan Ganesh, Ben Wagner, Fang F. Yu, Baowei Fei, Ananth J. Madhuranthakam, Joseph A. Maldjian, Laura Daza, Catalina Gomez, Pablo Arbelaez, Chengliang Dai, Shuo Wang, Hadrien Reynaud, Yuan-han Mo, Elsa Angelini , et al. (67 additional authors not shown)

    Abstract: Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying… ▽ More

    Submitted 23 August, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA): https://www.melba-journal.org/papers/2022:026.html

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

  41. arXiv:2112.05893  [pdf, other

    cs.SD cs.LG eess.AS

    Hybrid Neural Networks for On-device Directional Hearing

    Authors: Anran Wang, Maruchi Kim, Hao Zhang, Shyamnath Gollakota

    Abstract: On-device directional hearing requires audio source separation from a given direction while achieving stringent human-imperceptible latency requirements. While neural nets can achieve significantly better performance than traditional beamformers, all existing models fall short of supporting low-latency causal inference on computationally-constrained wearables. We present DeepBeam, a hybrid model t… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Journal ref: AAAI 2022

  42. arXiv:2110.09956  [pdf, other

    eess.SP cs.CY

    Food Odor Recognition via Multi-step Classification

    Authors: Ang Xu, Tianzhang Cai, Dinghao Shen, Asher Wang

    Abstract: Predicting food labels and freshness from its odor remains a decades-old task that requires a complicated algorithm combined with high sensitivity sensors. In this paper, we initiate a multi-step classifier, which firstly clusters food into four categories, then classifies the food label concerning the predicted category, and finally identifies the freshness. We use BME688 gas sensors packed with… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

  43. arXiv:2110.06694  [pdf, ps, other

    cs.IT eess.SP

    Joint Optimization of Beam-Hop** Design and NOMA-Assisted Transmission for Flexible Satellite Systems

    Authors: Anyue Wang, Lei Lei, Eva Lagunas, Ana I. Perez-Neira, Symeon Chatzinotas, Bjorn Ottersten

    Abstract: Next-generation satellite systems require more flexibility in resource management such that available radio resources can be dynamically allocated to meet time-varying and non-uniform traffic demands. Considering potential benefits of beam hop** (BH) and non-orthogonal multiple access (NOMA), we exploit the time-domain flexibility in multi-beam satellite systems by optimizing BH design, and enha… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

  44. arXiv:2110.06634  [pdf, other

    cs.SD cs.CL eess.AS q-bio.NC

    End-to-end translation of human neural activity to speech with a dual-dual generative adversarial network

    Authors: Yina Guo, Xiaofei Zhang, Zhenying Gong, Anhong Wang, Wenwu Wang

    Abstract: In a recent study of auditory evoked potential (AEP) based brain-computer interface (BCI), it was shown that, with an encoder-decoder framework, it is possible to translate human neural activity to speech (T-CAS). However, current encoder-decoder-based methods achieve T-CAS often with a two-step method where the information is passed between the encoder and decoder with a shared dimension reductio… ▽ More

    Submitted 26 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: 12 pages, 13 figures

  45. arXiv:2105.12718  [pdf

    physics.app-ph eess.SY

    Magnetic Particle Spectroscopy (MPS) with One-stage Lock-in Implementation for Magnetic Bioassays with Improved Sensitivities

    Authors: Vinit Kumar Chugh, Kai Wu, Venkatramana D. Krishna, Arturo di Girolamo, Robert P. Bloom, Yongqiang Andrew Wang, Renata Saha, Shuang Liang, Maxim C-J Cheeran, Jian-** Wang

    Abstract: In recent years, magnetic particle spectroscopy (MPS) has become a highly sensitive and versatile sensing technique for quantitative bioassays. It relies on the dynamic magnetic responses of magnetic nanoparticles (MNPs) for the detection of target analytes in liquid phase. There are many research studies reporting the application of MPS for detecting a variety of analytes including viruses, toxin… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: 26 Pages, 11 Figures

  46. arXiv:2105.07961  [pdf, other

    eess.IV cs.CV

    Joint Optimization of Hadamard Sensing and Reconstruction in Compressed Sensing Fluorescence Microscopy

    Authors: Alan Q. Wang, Aaron K. LaViolette, Leo Moon, Chris Xu, Mert R. Sabuncu

    Abstract: Compressed sensing fluorescence microscopy (CS-FM) proposes a scheme whereby less measurements are collected during sensing and reconstruction is performed to recover the image. Much work has gone into optimizing the sensing and reconstruction portions separately. We propose a method of jointly optimizing both sensing and reconstruction end-to-end under a total measurement constraint, enabling lea… ▽ More

    Submitted 9 July, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Accepted at MICCAI 2021

  47. arXiv:2105.07153  [pdf, other

    eess.IV cs.CV

    Window-Level is a Strong Denoising Surrogate

    Authors: Ayaan Haque, Adam Wang, Abdullah-Al-Zubaer Imran

    Abstract: CT image quality is heavily reliant on radiation dose, which causes a trade-off between radiation dose and image quality that affects the subsequent image-based diagnostic performance. However, high radiation can be harmful to both patients and operators. Several (deep learning-based) approaches have been attempted to denoise low dose images. However, those approaches require access to large train… ▽ More

    Submitted 15 May, 2021; originally announced May 2021.

    Comments: 11 pages, 4 figures

  48. arXiv:2104.04627  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Accented Speech Recognition Inspired by Human Perception

    Authors: Xiangyun Chu, Elizabeth Combs, Amber Wang, Michael Picheny

    Abstract: While improvements have been made in automatic speech recognition performance over the last several years, machines continue to have significantly lower performance on accented speech than humans. In addition, the most significant improvements on accented speech primarily arise by overwhelming the problem with hundreds or even thousands of hours of data. Humans typically require much less data to… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Submitted to INTERSPEECH 2021

  49. arXiv:2101.12490  [pdf, other

    eess.SY cs.RO math.OC

    Moment-Based Exact Uncertainty Propagation Through Nonlinear Stochastic Autonomous Systems

    Authors: Ashkan Jasour, Allen Wang, Brian C. Williams

    Abstract: In this paper, we address the problem of uncertainty propagation through nonlinear stochastic dynamical systems. More precisely, given a discrete-time continuous-state probabilistic nonlinear dynamical system, we aim at finding the sequence of the moments of the probability distributions of the system states up to any desired order over the given planning horizon. Moments of uncertain states can b… ▽ More

    Submitted 29 January, 2021; originally announced January 2021.

    Comments: This work has been submitted to the IEEE Transactions on Automatic Control

  50. arXiv:2101.08136  [pdf

    eess.IV physics.med-ph

    High-throughput fast full-color digital pathology based on Fourier ptychographic microscopy via color transfer

    Authors: Yuting Gao, Jiurun Chen, Aiye Wang, An Pan, Caiwen Ma, Baoli Yao

    Abstract: Full-color imaging is significant in digital pathology. Compared with a grayscale image or a pseudo-color image that only contains the contrast information, it can identify and detect the target object better with color texture information. Fourier ptychographic microscopy (FPM) is a high-throughput computational imaging technique that breaks the tradeoff between high resolution (HR) and large fie… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: 24 pages, 8 figures