Skip to main content

Showing 1–38 of 38 results for author: Ma, Q

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.12650  [pdf, other

    eess.IV

    Weakly Supervised Learning of Cortical Surface Reconstruction from Segmentations

    Authors: Qiang Ma, Liu Li, Emma C. Robinson, Bernhard Kainz, Daniel Rueckert

    Abstract: Existing learning-based cortical surface reconstruction approaches heavily rely on the supervision of pseudo ground truth (pGT) cortical surfaces for training. Such pGT surfaces are generated by traditional neuroimage processing pipelines, which are time consuming and difficult to generalize well to low-resolution brain MRI, e.g., from fetuses and neonates. In this work, we present CoSeg, a learni… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by the 27th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024)

  2. arXiv:2405.20336  [pdf, other

    cs.CV cs.SD eess.AS

    RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text

    Authors: Jiaben Chen, Xin Yan, Yihang Chen, Siyuan Cen, Qinwei Ma, Haoyu Zhen, Kaizhi Qian, Lie Lu, Chuang Gan

    Abstract: In this work, we introduce a challenging task for simultaneously generating 3D holistic body motions and singing vocals directly from textual lyrics inputs, advancing beyond existing works that typically address these two modalities in isolation. To facilitate this, we first collect the RapVerse dataset, a large dataset containing synchronous rap** vocals, lyrics, and high-quality 3D holistic bo… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Project website: https://vis-www.cs.umass.edu/RapVerse

  3. arXiv:2405.08783  [pdf, other

    eess.IV

    The Develo** Human Connectome Project: A Fast Deep Learning-based Pipeline for Neonatal Cortical Surface Reconstruction

    Authors: Qiang Ma, Kaili Liang, Liu Li, Saga Masui, Yourong Guo, Chiara Nosarti, Emma C. Robinson, Bernhard Kainz, Daniel Rueckert

    Abstract: The Develo** Human Connectome Project (dHCP) aims to explore developmental patterns of the human brain during the perinatal period. An automated processing pipeline has been developed to extract high-quality cortical surfaces from structural brain magnetic resonance (MR) images for the dHCP neonatal dataset. However, the current implementation of the pipeline requires more than 6.5 hours to proc… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  4. arXiv:2403.16062  [pdf

    eess.SP

    Holography inspired self-controlled reconfigurable intelligent surface

    Authors: Jieao Zhu, Ze Gu, Qian Ma, Linglong Dai, Tie Jun Cui

    Abstract: Among various promising candidate technologies for the sixth-generation (6G) wireless communications, recent advances in microwave metasurfaces have sparked a new research area of reconfigurable intelligent surfaces (RISs). By controllably reprogramming the wireless propagation channel, RISs are envisioned to achieve low-cost wireless capacity boosting, coverage extension, and enhanced energy effi… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Traditional BS-controlled RISs suffer from complicated control cables. To "cut" the control cables, we propose a self-controlled RIS by leveraging the holographic interference principle, thus realizing autonomous RIS beamforming

  5. arXiv:2403.03736  [pdf, other

    cs.CV cs.LG eess.IV

    Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer

    Authors: Naifu Xue, Qi Mao, Zijian Wang, Yuan Zhang, Siwei Ma

    Abstract: Recent progress in generative compression technology has significantly improved the perceptual quality of compressed data. However, these advancements primarily focus on producing high-frequency details, often overlooking the ability of generative models to capture the prior distribution of image content, thus impeding further bitrate reduction in extreme compression scenarios (<0.05 bpp). Motivat… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  6. arXiv:2402.03492  [pdf, other

    eess.IV cs.CV

    Beyond Strong labels: Weakly-supervised Learning Based on Gaussian Pseudo Labels for The Segmentation of Ellipse-like Vascular Structures in Non-contrast CTs

    Authors: Qixiang Ma, Antoine Łucas, Huazhong Shu, Adrien Kaladji, Pascal Haigron

    Abstract: Deep-learning-based automated segmentation of vascular structures in preoperative CT scans contributes to computer-assisted diagnosis and intervention procedure in vascular diseases. While CT angiography (CTA) is the common standard, non-contrast CT imaging is significant as a contrast-risk-free alternative, avoiding complications associated with contrast agents. However, the challenges of labor-i… ▽ More

    Submitted 10 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  7. arXiv:2402.02514  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Supervision by Gaussian Pseudo-label-based Morphological Attention for Abdominal Aorta Segmentation in Non-Contrast CTs

    Authors: Qixiang Ma, Antoine Lucas, Adrien Kaladji, Pascal Haigron

    Abstract: The segmentation of the abdominal aorta in non-contrast CT images is a non-trivial task for computer-assisted endovascular navigation, particularly in scenarios where contrast agents are unsuitable. While state-of-the-art deep learning segmentation models have been proposed recently for this task, they are trained on manually annotated strong labels. However, the inherent ambiguity in the boundary… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by 21st IEEE International Symposium on Biomedical Imaging

  8. arXiv:2401.07422  [pdf, other

    eess.SP

    Multiperson Detection and Vital-Sign Sensing Empowered by Space-Time-Coding RISs

    Authors: Xinyu Li, Jian Wei You, Ze Gu, Qian Ma, **gyuan Zhang, Long Chen, Tie Jun Cui

    Abstract: Passive human sensing using wireless signals has attracted increasing attention due to its superiorities of non-contact and robustness in various lighting conditions. However, when multiple human individuals are present, their reflected signals could be intertwined in the time, frequency and spatial domains, making it challenging to separate them. To address this issue, this paper proposes a novel… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  9. arXiv:2311.11460  [pdf, other

    math.OC eess.SY

    Classical Stability Margins by PID Control

    Authors: Qi Mao, Yong Xu, Jianqi Chen, Jie Chen, Tryphon Georgiou

    Abstract: Proportional-Integral-Derivative (PID) control has been the workhorse of control technology for about a century. Yet to this day, designing and tuning PID controllers relies mostly on either tabulated rules (Ziegler-Nichols) or on classical graphical techniques (Bode). Our goal in this paper is to take a fresh look on PID control in the context of optimizing stability margins for low-order (first-… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

  10. arXiv:2311.07873  [pdf, other

    eess.SP

    Passive Human Sensing Enhanced by Reconfigurable Intelligent Surface: Opportunities and Challenges

    Authors: Xinyu Li, Jian Wei You, Ze Gu, Qian Ma, Long Chen, **gyuan Zhang, Shi **, Tie Jun Cui

    Abstract: Reconfigurable intelligent surfaces (RISs) have flexible and exceptional performance in manipulating electromagnetic waves and customizing wireless channels. These capabilities enable them to provide a plethora of valuable activity-related information for promoting wireless human sensing. In this article, we present a comprehensive review of passive human sensing using radio frequency signals with… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  11. arXiv:2307.11870  [pdf, other

    eess.IV q-bio.NC

    Conditional Temporal Attention Networks for Neonatal Cortical Surface Reconstruction

    Authors: Qiang Ma, Liu Li, Vanessa Kyriakopoulou, Joseph Hajnal, Emma C. Robinson, Bernhard Kainz, Daniel Rueckert

    Abstract: Cortical surface reconstruction plays a fundamental role in modeling the rapid brain development during the perinatal period. In this work, we propose Conditional Temporal Attention Network (CoTAN), a fast end-to-end framework for diffeomorphic neonatal cortical surface reconstruction. CoTAN predicts multi-resolution stationary velocity fields (SVF) from neonatal brain magnetic resonance images (M… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: Accepted by the 26th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2023

  12. arXiv:2307.08265  [pdf, other

    cs.CV eess.IV

    Extreme Image Compression using Fine-tuned VQGANs

    Authors: Qi Mao, Tinghan Yang, Yinuo Zhang, Zijian Wang, Meng Wang, Shiqi Wang, Siwei Ma

    Abstract: Recent advances in generative compression methods have demonstrated remarkable progress in enhancing the perceptual quality of compressed data, especially in scenarios with low bitrates. However, their efficacy and applicability to achieve extreme compression ratios ($<0.05$ bpp) remain constrained. In this work, we propose a simple yet effective coding framework by introducing vector quantization… ▽ More

    Submitted 15 December, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Generative Compression, Extreme Compression, VQGANs, Low Bitrate

  13. arXiv:2305.00837  [pdf, other

    eess.IV cs.CV cs.LG

    LCAUnet: A skin lesion segmentation network with enhanced edge and body fusion

    Authors: Qisen Ma, Keming Mao, Gao Wang, Lisheng Xu, Yuhai Zhao

    Abstract: Accurate segmentation of skin lesions in dermatoscopic images is crucial for the early diagnosis of skin cancer and improving the survival rate of patients. However, it is still a challenging task due to the irregularity of lesion areas, the fuzziness of boundaries, and other complex interference factors. In this paper, a novel LCAUnet is proposed to improve the ability of complementary representa… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 14 pages, 10 figures

  14. arXiv:2304.13471  [pdf, other

    eess.IV cs.CV

    OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

    Authors: Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

    Abstract: 360° omnidirectional images have gained research attention due to their immersive and interactive experience, particularly in AR/VR applications. However, they suffer from lower angular resolution due to being captured by fisheye lenses with the same sensor size for capturing planar images. To solve the above issues, we propose a two-stage framework for 360° omnidirectional image superresolution.… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPRW 2023

  15. Synthetic Datasets for Autonomous Driving: A Survey

    Authors: Zhihang Song, Zimin He, Xingyu Li, Qiming Ma, Ruibo Ming, Zhiqi Mao, Huaxin Pei, Lihui Peng, Jianming Hu, Danya Yao, Yi Zhang

    Abstract: Autonomous driving techniques have been flourishing in recent years while thirsting for huge amounts of high-quality data. However, it is difficult for real-world datasets to keep up with the pace of changing requirements due to their expensive and time-consuming experimental and labeling costs. Therefore, more and more researchers are turning to synthetic datasets to easily generate rich and chan… ▽ More

    Submitted 27 February, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 19 pages, 5 figures

    Journal ref: in IEEE Transactions on Intelligent Vehicles, vol. 9, no. 1, pp. 1847-1864, Jan. 2024

  16. arXiv:2303.00155  [pdf, other

    eess.SY

    Exponential Consensus of Multiple Agents over Dynamic Network Topology: Controllability, Connectivity, and Compactness

    Authors: Qichao Ma, Jiahu Qin, Brian D. O. Anderson, Long Wang

    Abstract: This paper investigates the problem of securing exponentially fast consensus (exponential consensus for short) for identical agents with finite-dimensional linear system dynamics over dynamic network topology. Our aim is to find the weakest possible conditions that guarantee exponentially fast consensus using a Lyapunov function consisting of a sum of terms of the same functional form. We first in… ▽ More

    Submitted 28 February, 2023; originally announced March 2023.

  17. arXiv:2210.16197  [pdf

    eess.SP

    Dimensionality Reduced Antenna Array for Beamforming/steering

    Authors: Shiyi Xia, Mingyang Zhao, Qian Ma, Xunnan Zhang, Ling Yang, Yazhi Pi, Hyunchul Chung, Ad Reniers, A. M. J. Koonen, Zizheng Cao

    Abstract: Beamforming makes possible a focused communication method. It is extensively employed in many disciplines involving electromagnetic waves, including arrayed ultrasonic, optical, and high-speed wireless communication. Conventional beam steering often requires the addition of separate active amplitude phase control units after each radiating element. The high power consumption and complexity of larg… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

  18. arXiv:2209.06434  [pdf, other

    cs.SD cs.CL eess.AS

    ConvNeXt Based Neural Network for Audio Anti-Spoofing

    Authors: Qiaowei Ma, **ghui Zhong, Yitao Yang, Weiheng Liu, Ying Gao, Wing W. Y. Ng

    Abstract: With the rapid development of speech conversion and speech synthesis algorithms, automatic speaker verification (ASV) systems are vulnerable to spoofing attacks. In recent years, researchers had proposed a number of anti-spoofing methods based on hand-crafted features. However, using hand-crafted features rather than raw waveform will lose implicit information for anti-spoofing. Inspired by the pr… ▽ More

    Submitted 21 December, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: 6 pages

  19. arXiv:2209.05482  [pdf, ps, other

    eess.SY

    Improved Fuzzy $H_{\infty}$ Filter Design Method for Nonlinear Systems with Time-Varing Delay

    Authors: Qianqian Ma, Li Li, Junhui Shen, Haowei Guan, Guangcheng Ma, Hongwei Xia

    Abstract: This paper investigates the fuzzy $H_{\infty}$ filter design issue for nonlinear systems with time-varying delay. In order to obtain less conservative fuzzy $H_{\infty}$ filter design method, a novel integral inequality is employed to replace the conventional Lebniz-Newton formula to analyze the stability conditions of the filtering error system. Besides, the information of the membership function… ▽ More

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: This paper was published in 2017 IEEE SMC. arXiv admin note: text overlap with arXiv:2209.04989. text overlap with arXiv:2209.04989

  20. arXiv:2209.04989  [pdf, ps, other

    eess.SY

    A New Fuzzy $H_{\infty}$ Filter Design for Nonlinear Time-Delay Systems with Mismatched Premise Membership Functions

    Authors: Qianqian Ma, Hongwei Xia, Li Li, Guangcheng Ma

    Abstract: This paper is concerned with the fuzzy $H_{\infty}$ filter design issue for nonlinear systems with time-varying delay. To overcome the shortcomings of the conventional methods with matched preconditions, the fuzzy $H_{\infty}$ filter to be designed and the T-S fuzzy model are assumed to have different premise membership functions and number of rules, thus, greater design flexibility and robustness… ▽ More

    Submitted 13 September, 2022; v1 submitted 11 September, 2022; originally announced September 2022.

    Comments: This paper was published at IFAC 2017

  21. arXiv:2208.14022  [pdf, other

    eess.IV cs.CV

    Stabilize, Decompose, and Denoise: Self-Supervised Fluoroscopy Denoising

    Authors: Ruizhou Liu, Qiang Ma, Zhiwei Cheng, Yuanyuan Lyu, Jianji Wang, S. Kevin Zhou

    Abstract: Fluoroscopy is an imaging technique that uses X-ray to obtain a real-time 2D video of the interior of a 3D object, hel** surgeons to observe pathological structures and tissue functions especially during intervention. However, it suffers from heavy noise that mainly arises from the clinical use of a low dose X-ray, thereby necessitating the technology of fluoroscopy denoising. Such denoising is… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 11 pages, 18 figures

  22. arXiv:2206.03173  [pdf, other

    cs.CL cs.SD eess.AS

    Speaker-Guided Encoder-Decoder Framework for Emotion Recognition in Conversation

    Authors: Yinan Bao, Qianwen Ma, Lingwei Wei, Wei Zhou, Songlin Hu

    Abstract: The emotion recognition in conversation (ERC) task aims to predict the emotion label of an utterance in a conversation. Since the dependencies between speakers are complex and dynamic, which consist of intra- and inter-speaker dependencies, the modeling of speaker-specific information is a vital role in ERC. Although existing researchers have proposed various methods of speaker interaction modelin… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: Accepted by IJCAI-ECAI 2022

  23. arXiv:2205.08239  [pdf, other

    eess.IV cs.CV

    CAS-Net: Conditional Atlas Generation and Brain Segmentation for Fetal MRI

    Authors: Liu Li, Qiang Ma, Matthew Sinclair, Antonios Makropoulos, Joseph Hajnal, A. David Edwards, Bernhard Kainz, Daniel Rueckert, Amir Alansary

    Abstract: Fetal Magnetic Resonance Imaging (MRI) is used in prenatal diagnosis and to assess early brain development. Accurate segmentation of the different brain tissues is a vital step in several brain analysis tasks, such as cortical surface reconstruction and tissue thickness measurements. Fetal MRI scans, however, are prone to motion artifacts that can affect the correctness of both manual and automati… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  24. arXiv:2202.08329  [pdf, other

    eess.IV cs.CV

    CortexODE: Learning Cortical Surface Reconstruction by Neural ODEs

    Authors: Qiang Ma, Liu Li, Emma C. Robinson, Bernhard Kainz, Daniel Rueckert, Amir Alansary

    Abstract: We present CortexODE, a deep learning framework for cortical surface reconstruction. CortexODE leverages neural ordinary differential equations (ODEs) to deform an input surface into a target shape by learning a diffeomorphic flow. The trajectories of the points on the surface are modeled as ODEs, where the derivatives of their coordinates are parameterized via a learnable Lipschitz-continuous def… ▽ More

    Submitted 10 September, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted by IEEE Transactions on Medical Imaging

  25. arXiv:2111.13923  [pdf, other

    eess.IV cs.CV

    Learning A 3D-CNN and Transformer Prior for Hyperspectral Image Super-Resolution

    Authors: Qing Ma, Junjun Jiang, Xianming Liu, Jiayi Ma

    Abstract: To solve the ill-posed problem of hyperspectral image super-resolution (HSISR), an usually method is to use the prior information of the hyperspectral images (HSIs) as a regularization term to constrain the objective function. Model-based methods using hand-crafted priors cannot fully characterize the properties of HSIs. Learning-based methods usually use a convolutional neural network (CNN) to le… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: 10 pages, 5 figures

  26. arXiv:2109.03693  [pdf, other

    eess.IV

    PialNN: A Fast Deep Learning Framework for Cortical Pial Surface Reconstruction

    Authors: Qiang Ma, Emma C. Robinson, Bernhard Kainz, Daniel Rueckert, Amir Alansary

    Abstract: Traditional cortical surface reconstruction is time consuming and limited by the resolution of brain Magnetic Resonance Imaging (MRI). In this work, we introduce Pial Neural Network (PialNN), a 3D deep learning framework for pial surface reconstruction. PialNN is trained end-to-end to deform an initial white matter surface to a target pial surface by a sequence of learned deformation blocks. A loc… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: Accepted in The 4th International Workshop on Machine Learning in Clinical Neuroimaging (MLCN2021)

  27. arXiv:2106.05905  [pdf, other

    eess.SY cs.AI math.OC

    Multiple Dynamic Pricing for Demand Response with Adaptive Clustering-based Customer Segmentation in Smart Grids

    Authors: Fanlin Meng, Qian Ma, Zixu Liu, Xiao-Jun Zeng

    Abstract: In this paper, we propose a realistic multiple dynamic pricing approach to demand response in the retail market. First, an adaptive clustering-based customer segmentation framework is proposed to categorize customers into different groups to enable the effective identification of usage patterns. Second, customized demand models with important market constraints which capture the price-demand relat… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

  28. arXiv:2105.06887  [pdf

    eess.IV cs.CV cs.LG

    A Frequency Domain Constraint for Synthetic and Real X-ray Image Super Resolution

    Authors: Qing Ma, Jae Chul Koh, WonSook Lee

    Abstract: Synthetic X-ray images are simulated X-ray images projected from CT data. High-quality synthetic X-ray images can facilitate various applications such as surgical image guidance systems and VR training simulations. However, it is difficult to produce high-quality arbitrary view synthetic X-ray images in real-time due to different CT slice thickness, high computational cost, and the complexity of a… ▽ More

    Submitted 10 August, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

  29. arXiv:2011.04976  [pdf, other

    cs.CV eess.IV

    Conceptual Compression via Deep Structure and Texture Synthesis

    Authors: Jianhui Chang, Zhenghui Zhao, Chuanmin Jia, Shiqi Wang, Lingbo Yang, Qi Mao, Jian Zhang, Siwei Ma

    Abstract: Existing compression methods typically focus on the removal of signal-level redundancies, while the potential and versatility of decomposing visual data into compact conceptual components still lack further study. To this end, we propose a novel conceptual compression framework that encodes visual data into compact structure and texture representations, then decodes in a deep synthesis fashion, ai… ▽ More

    Submitted 10 March, 2022; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 15 pages, 14 figures

  30. arXiv:2007.13975  [pdf, other

    eess.AS cs.SD

    Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation

    Authors: **g**g Chen, Qirong Mao, Dong Liu

    Abstract: The dominant speech separation models are based on complex recurrent or convolution neural network that model speech sequences indirectly conditioning on context, such as passing information through many intermediate states in recurrent neural network, leading to suboptimal separation performance. In this paper, we propose a dual-path transformer network (DPTNet) for end-to-end speech separation,… ▽ More

    Submitted 14 August, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: 5 pages. Accepted by INTERSPEECH 2020

  31. arXiv:2004.07442  [pdf, other

    cs.CR cs.SD eess.AS

    Voice-Indistinguishability: Protecting Voiceprint in Privacy-Preserving Speech Data Release

    Authors: Yaowei Han, Sheng Li, Yang Cao, Qiang Ma, Masatoshi Yoshikawa

    Abstract: With the development of smart devices, such as the Amazon Echo and Apple's HomePod, speech data have become a new dimension of big data. However, privacy and security concerns may hinder the collection and sharing of real-world speech data, which contain the speaker's identifiable information, i.e., voiceprint, which is considered a type of biometric identifier. Current studies on voiceprint priva… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: The paper has been accepted by the IEEE International Conference on Multimedia & Expo 2020(ICME 2020)

  32. arXiv:2003.10916  [pdf, other

    cs.NI eess.SP

    Age of Processing: Age-driven Status Sampling and Processing Offloading for Edge Computing-enabled Real-time IoT Applications

    Authors: Rui Li, Qian Ma, Jie Gong, Zhi Zhou, Xu Chen

    Abstract: The freshness of status information is of great importance for time-critical Internet of Things (IoT) applications. A metric measuring status freshness is the age-of-information (AoI), which captures the time elapsed from the status being generated at the source node (e.g., a sensor) to the latest status update.However, in intelligent IoT applications such as video surveillance, the status informa… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: submitted for review

  33. arXiv:1912.00191  [pdf, other

    cs.AI cs.RO eess.SP

    Learning a Decision Module by Imitating Driver's Control Behaviors

    Authors: Junning Huang, Sirui Xie, Jiankai Sun, Qiurui Ma, Chunxiao Liu, Jian** Shi, Dahua Lin, Bolei Zhou

    Abstract: Autonomous driving systems have a pipeline of perception, decision, planning, and control. The decision module processes information from the perception module and directs the execution of downstream planning and control modules. On the other hand, the recent success of deep learning suggests that this pipeline could be replaced by end-to-end neural control policies, however, safety cannot be well… ▽ More

    Submitted 5 May, 2021; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: Proceedings of the Conference on Robot Learning (CoRL) 2020

  34. arXiv:1910.10202  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Complex Transformer: A Framework for Modeling Complex-Valued Sequence

    Authors: Muqiao Yang, Martin Q. Ma, Dongyu Li, Yao-Hung Hubert Tsai, Ruslan Salakhutdinov

    Abstract: While deep learning has received a surge of interest in a variety of fields in recent years, major deep learning models barely use complex numbers. However, speech, signal and audio data are naturally complex-valued after Fourier Transform, and studies have shown a potentially richer representation of complex nets. In this paper, we propose a Complex Transformer, which incorporates the transformer… ▽ More

    Submitted 6 August, 2021; v1 submitted 22 October, 2019; originally announced October 2019.

  35. arXiv:1910.00497  [pdf

    physics.app-ph eess.SP

    Intelligent Metasurface Imager and Recognizer

    Authors: Lianlin Li, Ya Shuang, Qian Ma, Haoyang Li, Hanting Zhao, Menglin Wei1, Che Liu, Chenglong Hao, Cheng-Wei Qiu, Tie Jun Cui

    Abstract: It is ever-increasingly demanded to remotely monitor people in daily life using radio-frequency probing signals. However, conventional systems can hardly be deployed in real-world settings since they typically require objects to either deliberately cooperate or carry a wireless active device or identification tag. To accomplish the complicated successive tasks using a single device in real time, w… ▽ More

    Submitted 2 September, 2019; originally announced October 2019.

  36. arXiv:1907.09320  [pdf

    cs.CV cs.NE eess.IV

    An Efficient Target Detection and Recognition Method in Aerial Remote-sensing Images Based on Multiangle Regions-of-Interest

    Authors: Guangcun Shan, Hongyu Wang, Wei Liang, Congcong Liu, Qizi Ma, Quan Quan

    Abstract: Recently, deep learning technology have been extensively used in the field of image recognition. However, its main application is the recognition and detection of ordinary pictures and common scenes. It is challenging to effectively and expediently analyze remote-sensing images obtained by the image acquisition systems on unmanned aerial vehicles (UAVs), which includes the identification of the ta… ▽ More

    Submitted 7 June, 2022; v1 submitted 22 July, 2019; originally announced July 2019.

    Comments: 5 pages, 3 figures

  37. arXiv:1907.03548  [pdf, ps, other

    cs.CV eess.IV

    Unified Attentional Generative Adversarial Network for Brain Tumor Segmentation From Multimodal Unpaired Images

    Authors: Wenguang Yuan, Jia Wei, Jiabing Wang, Qianli Ma, Tolga Tasdizen

    Abstract: In medical applications, the same anatomical structures may be observed in multiple modalities despite the different image characteristics. Currently, most deep models for multimodal segmentation rely on paired registered images. However, multimodal paired registered images are difficult to obtain in many cases. Therefore, develo** a model that can segment the target objects from different modal… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: 9 pages, 4 figures, Accepted by MICCAI2019

  38. arXiv:1807.07840  [pdf, other

    eess.SY

    On Synchronization of Dynamical Systems over Directed Switching Topologies: An Algebraic and Geometric Perspective

    Authors: Jiahu Qin, Qichao Ma, Xinghuo Yu, Long Wang

    Abstract: In this paper, we aim to investigate the synchronization problem of dynamical systems, which can be of generic linear or Lipschitz nonlinear type, communicating over directed switching network topologies. A mild connectivity assumption on the switching topologies is imposed, which allows them to be directed and jointly connected. We propose a novel analysis framework from both algebraic and geomet… ▽ More

    Submitted 20 July, 2018; originally announced July 2018.

    Comments: 17 pages, 11 figures