Skip to main content

Showing 1–50 of 79 results for author: Ha, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18414  [pdf, other

    cs.CV cs.AI

    BiTrack: Bidirectional Offline 3D Multi-Object Tracking Using Camera-LiDAR Data

    Authors: Kemiao Huang, Meiying Zhang, Qi Hao

    Abstract: Compared with real-time multi-object tracking (MOT), offline multi-object tracking (OMOT) has the advantages to perform 2D-3D detection fusion, erroneous link correction, and full track optimization but has to deal with the challenges from bounding box misalignment and track evaluation, editing, and refinement. This paper proposes "BiTrack", a 3D OMOT framework that includes modules of 2D-3D detec… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.18129  [pdf, other

    cs.CV cs.LG

    CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection

    Authors: Meiying Zhang, Weiyuan Peng, Guangyao Ding, Chenyang Lei, Chunlin Ji, Qi Hao

    Abstract: Simulation data can be accurately labeled and have been expected to improve the performance of data-driven algorithms, including object detection. However, due to the various domain inconsistencies from simulation to reality (sim-to-real), cross-domain object detection algorithms usually suffer from dramatic performance drops. While numerous unsupervised domain adaptation (UDA) methods have been d… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2404.19218  [pdf

    cs.LG

    Flight Trajectory Prediction Using an Enhanced CNN-LSTM Network

    Authors: Qinzhi Hao, Jiali Zhang, Tengyu **g, Wei Wang

    Abstract: Aiming at the problem of low accuracy of flight trajectory prediction caused by the high speed of fighters, the diversity of tactical maneuvers, and the transient nature of situational change in close range air combat, this paper proposes an enhanced CNN-LSTM network as a fighter flight trajectory prediction method. Firstly, we extract spatial features from fighter trajectory data using CNN, aggre… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  4. "Are Adversarial Phishing Webpages a Threat in Reality?" Understanding the Users' Perception of Adversarial Webpages

    Authors: Ying Yuan, Qingying Hao, Giovanni Apruzzese, Mauro Conti, Gang Wang

    Abstract: Machine learning based phishing website detectors (ML-PWD) are a critical part of today's anti-phishing solutions in operation. Unfortunately, ML-PWD are prone to adversarial evasions, evidenced by both academic studies and analyses of real-world adversarial phishing webpages. However, existing works mostly focused on assessing adversarial phishing webpages against ML-PWD, while neglecting a cruci… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  5. arXiv:2403.17392  [pdf, other

    cs.RO eess.SY nlin.AO

    Natural-artificial hybrid swarm: Cyborg-insect group navigation in unknown obstructed soft terrain

    Authors: Yang Bai, Phuoc Thanh Tran Ngoc, Huu Duoc Nguyen, Duc Long Le, Quang Huy Ha, Kazuki Kai, Yu Xiang See To, Yaosheng Deng, Jie Song, Naoki Wakamiya, Hirotaka Sato, Masaki Ogura

    Abstract: Navigating multi-robot systems in complex terrains has always been a challenging task. This is due to the inherent limitations of traditional robots in collision avoidance, adaptation to unknown environments, and sustained energy efficiency. In order to overcome these limitations, this research proposes a solution by integrating living insects with miniature electronic controllers to enable roboti… ▽ More

    Submitted 27 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  6. arXiv:2403.12970  [pdf

    eess.IV cs.CV physics.bio-ph physics.optics

    Hybrid deep learning and physics-based neural network for programmable illumination computational microscopy

    Authors: Ruiqing Sun, Delong Yang, Shaohui Zhang, Qun Hao

    Abstract: Relying on either deep models or physical models are two mainstream approaches for solving inverse sample reconstruction problems in programmable illumination computational microscopy. Solutions based on physical models possess strong generalization capabilities while struggling with global optimization of inverse problems due to a lack of insufficient physical constraints. In contrast, deep learn… ▽ More

    Submitted 17 January, 2024; originally announced March 2024.

  7. arXiv:2403.06828  [pdf, other

    cs.RO cs.AI

    NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning

    Authors: Ruihua Han, Shuai Wang, Shuaijun Wang, Zeqing Zhang, Jianjun Chen, Shijie Lin, Chengyang Li, Chengzhong Xu, Yonina C. Eldar, Qi Hao, Jia Pan

    Abstract: Navigating a nonholonomic robot in a cluttered environment requires extremely accurate perception and locomotion for collision avoidance. This paper presents NeuPAN: a real-time, highly-accurate, map-free, robot-agnostic, and environment-invariant robot navigation solution. Leveraging a tightly-coupled perception-locomotion framework, NeuPAN has two key innovations compared to existing approaches:… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: submit to TRO

  8. arXiv:2403.03541  [pdf, other

    cs.RO

    Seamless Virtual Reality with Integrated Synchronizer and Synthesizer for Autonomous Driving

    Authors: He Li, Ruihua Han, Zirui Zhao, Wei Xu, Qi Hao, Shuai Wang, Chengzhong Xu

    Abstract: Virtual reality (VR) is a promising data engine for autonomous driving (AD). However, data fidelity in this paradigm is often degraded by VR inconsistency, for which the existing VR approaches become ineffective, as they ignore the inter-dependency between low-level VR synchronizer designs (i.e., data collector) and high-level VR synthesizer designs (i.e., data processor). This paper presents a se… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  9. arXiv:2402.17269  [pdf, other

    cs.LG

    Curriculum Learning Meets Directed Acyclic Graph for Multimodal Emotion Recognition

    Authors: Cam-Van Thi Nguyen, Cao-Bach Nguyen, Quang-Thuy Ha, Duc-Trong Le

    Abstract: Emotion recognition in conversation (ERC) is a crucial task in natural language processing and affective computing. This paper proposes MultiDAG+CL, a novel approach for Multimodal Emotion Recognition in Conversation (ERC) that employs Directed Acyclic Graph (DAG) to integrate textual, acoustic, and visual features within a unified framework. The model is enhanced by Curriculum Learning (CL) to ad… ▽ More

    Submitted 8 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by LREC-COLING 2024

  10. arXiv:2401.08658  [pdf, other

    cs.RO cs.AI

    End-To-End Planning of Autonomous Driving in Industry and Academia: 2022-2023

    Authors: Gong** Lan, Qi Hao

    Abstract: This paper aims to provide a quick review of the methods including the technologies in detail that are currently reported in industry and academia. Specifically, this paper reviews the end-to-end planning, including Tesla FSD V12, Momenta 2023, Horizon Robotics 2023, Motional RoboTaxi 2022, Woven Planet (Toyota): Urban Driver, and Nvidia. In addition, we review the state-of-the-art academic studie… ▽ More

    Submitted 26 December, 2023; originally announced January 2024.

    Comments: 8 pages, 14 figures

  11. arXiv:2401.08100  [pdf, other

    cs.CV cs.AI

    KTVIC: A Vietnamese Image Captioning Dataset on the Life Domain

    Authors: Anh-Cuong Pham, Van-Quang Nguyen, Thi-Hong Vuong, Quang-Thuy Ha

    Abstract: Image captioning is a crucial task with applications in a wide range of domains, including healthcare and education. Despite extensive research on English image captioning datasets, the availability of such datasets for Vietnamese remains limited, with only two existing datasets. In this study, we introduce KTVIC, a comprehensive Vietnamese Image Captioning dataset focused on the life domain, cove… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  12. arXiv:2312.16971  [pdf, other

    cs.NI

    High Throughput Inter-Layer Connecting Strategy for Multi-Layer Ultra-Dense Satellite Networks

    Authors: Qi Hao, Di Zhou, Min Sheng, Yan Shi, Jiandong Li

    Abstract: Multi-layer ultra-dense satellite networks (MLUDSNs) have soared this meteoric to provide vast throughputd for globally diverse services. Differing from traditional monolayer constellations, MLUDSNs emphasize the spatial integration among layers, and its throughput may not be simply the sum of throughput of each layer. The hop-count of cross-layer communication paths can be reduced by deploying in… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  13. arXiv:2312.15751  [pdf, other

    cs.CL

    Solving Label Variation in Scientific Information Extraction via Multi-Task Learning

    Authors: Dong Pham, Xanh Ho, Quang-Thuy Ha, Akiko Aizawa

    Abstract: Scientific Information Extraction (ScientificIE) is a critical task that involves the identification of scientific entities and their relationships. The complexity of this task is compounded by the necessity for domain-specific knowledge and the limited availability of annotated data. Two of the most popular datasets for ScientificIE are SemEval-2018 Task-7 and SciERC. They have overlap** sample… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 14 pages, 7 figures, PACLIC 37

  14. arXiv:2311.08747  [pdf, other

    cs.CV

    Improved Dense Nested Attention Network Based on Transformer for Infrared Small Target Detection

    Authors: Chun Bao, Jie Cao, Yaqian Ning, Tianhua Zhao, Zhijun Li, Zechen Wang, Li Zhang, Qun Hao

    Abstract: Infrared small target detection based on deep learning offers unique advantages in separating small targets from complex and dynamic backgrounds. However, the features of infrared small targets gradually weaken as the depth of convolutional neural network (CNN) increases. To address this issue, we propose a novel method for detecting infrared small targets called improved dense nested attention ne… ▽ More

    Submitted 17 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  15. arXiv:2311.03785  [pdf, other

    cs.CV cs.MM

    Self-MI: Efficient Multimodal Fusion via Self-Supervised Multi-Task Learning with Auxiliary Mutual Information Maximization

    Authors: Cam-Van Thi Nguyen, Ngoc-Hoa Thi Nguyen, Duc-Trong Le, Quang-Thuy Ha

    Abstract: Multimodal representation learning poses significant challenges in capturing informative and distinct features from multiple modalities. Existing methods often struggle to exploit the unique characteristics of each modality due to unified multimodal annotations. In this study, we propose Self-MI in the self-supervised learning fashion, which also leverage Contrastive Predictive Coding (CPC) as an… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted at The 37th Pacific Asia Conference on Language, Information and Computation (PACLIC 37)

  16. arXiv:2311.02108  [pdf, other

    cs.HC cs.AI

    A Virtual Reality Training System for Automotive Engines Assembly and Disassembly

    Authors: Gong** Lan, Qiangqiang Lai, Bing Bai, Zirui Zhao, Qi Hao

    Abstract: Automotive engine assembly and disassembly are common and crucial programs in the automotive industry. Traditional education trains students to learn automotive engine assembly and disassembly in lecture courses and then to operate with physical engines, which are generally low effectiveness and high cost. In this work, we developed a multi-layer structured Virtual Reality (VR) system to provide s… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 10 pages, 9 figures

  17. Vision-Based Human Pose Estimation via Deep Learning: A Survey

    Authors: Gong** Lan, Yu Wu, Fei Hu, Qi Hao

    Abstract: Human pose estimation (HPE) has attracted a significant amount of attention from the computer vision community in the past decades. Moreover, HPE has been applied to various domains, such as human-computer interaction, sports analysis, and human tracking via images and videos. Recently, deep learning-based approaches have shown state-of-the-art performance in HPE-based applications. Although deep… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 16 pages, 4 figures

  18. arXiv:2304.10167  [pdf

    physics.optics cs.ET eess.IV

    Adaptive coded illumination Fourier ptychography microscopy based on physical neural network

    Authors: Ruiqing Sun, Delong Yang, Yao Hu, Qun Hao, Xin Li, Shaohui Zhang

    Abstract: Fourier Ptychographic Microscopy (FPM) is a computational technique that achieves a large space-bandwidth product imaging. It addresses the challenge of balancing a large field of view and high resolution by fusing information from multiple images taken with varying illumination angles. Nevertheless, conventional FPM framework always suffers from long acquisition time and a heavy computational bur… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

  19. arXiv:2302.14643  [pdf, other

    cs.LG

    Graph-based Knowledge Distillation: A survey and experimental evaluation

    Authors: **g Liu, Tongya Zheng, Guanzheng Zhang, Qinfen Hao

    Abstract: Graph, such as citation networks, social networks, and transportation networks, are prevalent in the real world. Graph Neural Networks (GNNs) have gained widespread attention for their robust expressiveness and exceptional performance in various graph applications. However, the efficacy of GNNs is heavily reliant on sufficient data labels and complex network models, with the former obtaining hardl… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 25 pages,7 figures, 11 tables

  20. arXiv:2302.08055  [pdf

    cs.AR

    CXL over Ethernet: A Novel FPGA-based Memory Disaggregation Design in Data Centers

    Authors: Chenjiu Wang, Ke He, Ruiqi Fan, Xiaonan Wang, Yang Kong, Wei Wang, Qinfen Hao

    Abstract: Memory resources in data centers generally suffer from low utilization and lack of dynamics. Memory disaggregation solves these problems by decoupling CPU and memory, which currently includes approaches based on RDMA or interconnection protocols such as Compute Express Link (CXL). However, the RDMA-based approach involves code refactoring and higher latency. The CXL-based approach supports native… ▽ More

    Submitted 22 February, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

  21. Rega-Net:Retina Gabor Attention for Deep Convolutional Neural Networks

    Authors: Chun Bao, Jie Cao, Yaqian Ning, Yang Cheng, Qun Hao

    Abstract: Extensive research works demonstrate that the attention mechanism in convolutional neural networks (CNNs) effectively improves accuracy. Nevertheless, few works design attention mechanisms using large receptive fields. In this work, we propose a novel attention method named Rega-net to increase CNN accuracy by enlarging the receptive field. Inspired by the mechanism of the human retina, we design… ▽ More

    Submitted 3 March, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

  22. RDA: An Accelerated Collision Free Motion Planner for Autonomous Navigation in Cluttered Environments

    Authors: Ruihua Han, Shuai Wang, Shuaijun Wang, Zeqing Zhang, Qianru Zhang, Yonina C. Eldar, Qi Hao, Jia Pan

    Abstract: Autonomous motion planning is challenging in multi-obstacle environments due to nonconvex collision avoidance constraints. Directly applying numerical solvers to these nonconvex formulations fails to exploit the constraint structures, resulting in excessive computation time. In this paper, we present an accelerated collision-free motion planner, namely regularized dual alternating direction method… ▽ More

    Submitted 4 April, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: Published in: IEEE Robotics and Automation Letters ( Volume: 8, Issue: 3, March 2023) (https://ieeexplore.ieee.org/document/10036019)

  23. Stag hunt game-based approach for cooperative UAVs

    Authors: L. V. Nguyen, I. Torres Herrera, T. H. Le, M. D. Phung, R. P. Aguilera, Q. P. Ha

    Abstract: Unmanned aerial vehicles (UAVs) are being employed in many areas such as photography, emergency, entertainment, defence, agriculture, forestry, mining and construction. Over the last decade, UAV technology has found applications in numerous construction project phases, ranging from site map**, progress monitoring, building inspection, damage assessments, and material delivery. While extensive st… ▽ More

    Submitted 28 August, 2022; originally announced August 2022.

    Comments: in 2022 Proceedings of 39th International Symposium on Automation and Robotics in Construction, Pages 367-374, Bogotá, Colombia, ISBN 978-952-69524-2-0, ISSN 2413-5844

  24. arXiv:2208.03945  [pdf, other

    cs.RO

    SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty

    Authors: Shuai Zhang, Liang Zhao, Shoudong Huang, Hua Wang, Qi Luo, Qi Hao

    Abstract: Total knee arthroplasty (TKA) is a common orthopaedic surgery to replace a damaged knee joint with artificial implants. The inaccuracy of achieving the planned implant position can result in the risk of implant component aseptic loosening, wear out, and even a joint revision, and those failures most of the time occur on the tibial side in the conventional jig-based TKA (CON-TKA). This study aims t… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: 10 pages, 4 figures, The 25th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2022

  25. arXiv:2208.00593  [pdf, other

    cs.IR cs.LG

    Long Short-Term Preference Modeling for Continuous-Time Sequential Recommendation

    Authors: Huixuan Chi, Hao Xu, Hao Fu, Mengya Liu, Mengdi Zhang, Yuji Yang, Qinfen Hao, Wei Wu

    Abstract: Modeling the evolution of user preference is essential in recommender systems. Recently, dynamic graph-based methods have been studied and achieved SOTA for recommendation, majority of which focus on user's stable long-term preference. However, in real-world scenario, user's short-term preference evolves over time dynamically. Although there exists sequential methods that attempt to capture it, ho… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: 9 pages, 4 figures

  26. arXiv:2207.11887  [pdf, other

    cs.LG

    HIRE: Distilling High-order Relational Knowledge From Heterogeneous Graph Neural Networks

    Authors: **g Liu, Tongya Zheng, Qinfen Hao

    Abstract: Researchers have recently proposed plenty of heterogeneous graph neural networks (HGNNs) due to the ubiquity of heterogeneous graphs in both academic and industrial areas. Instead of pursuing a more powerful HGNN model, in this paper, we are interested in devising a versatile plug-and-play module, which accounts for distilling relational knowledge from pre-trained HGNNs. To the best of our knowl… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

    Comments: 22 pages, 15 figures, submitted to Neurocomputing

  27. arXiv:2206.01748  [pdf, other

    cs.RO cs.LG

    Federated Deep Learning Meets Autonomous Vehicle Perception: Design and Verification

    Authors: Shuai Wang, Chengyang Li, Derrick Wing Kwan Ng, Yonina C. Eldar, H. Vincent Poor, Qi Hao, Chengzhong Xu

    Abstract: Realizing human-like perception is a challenge in open driving scenarios due to corner cases and visual occlusions. To gather knowledge of rare and occluded instances, federated learning assisted connected autonomous vehicle (FLCAV) has been proposed, which leverages vehicular networks to establish federated deep neural networks (DNNs) from distributed data captured by vehicles and road sensors. W… ▽ More

    Submitted 5 December, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: 10 pages, 6 figures, IEEE Network, accepted from open call

  28. arXiv:2203.10229  [pdf, other

    cs.RO

    Reinforcement Learned Distributed Multi-Robot Navigation with Reciprocal Velocity Obstacle Shaped Rewards

    Authors: Ruihua Han, Shengduo Chen, Shuaijun Wang, Zeqing Zhang, Rui Gao, Qi Hao, Jia Pan

    Abstract: The challenges to solving the collision avoidance problem lie in adaptively choosing optimal robot velocities in complex scenarios full of interactive obstacles. In this paper, we propose a distributed approach for multi-robot navigation which combines the concept of reciprocal velocity obstacle (RVO) and the scheme of deep reinforcement learning (DRL) to solve the reciprocal collision avoidance p… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  29. arXiv:2203.07709  [pdf, other

    cs.RO

    Adaptive Environment Modeling Based Reinforcement Learning for Collision Avoidance in Complex Scenes

    Authors: Shuaijun Wang, Rui Gao, Ruihua Han, Shengduo Chen, Chengyang Li, Qi Hao

    Abstract: The major challenges of collision avoidance for robot navigation in crowded scenes lie in accurate environment modeling, fast perceptions, and trustworthy motion planning policies. This paper presents a novel adaptive environment model based collision avoidance reinforcement learning (i.e., AEMCARL) framework for an unmanned robot to achieve collision-free motions in challenging navigation scenari… ▽ More

    Submitted 27 October, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: accepted by IROS2022

  30. arXiv:2201.12702  [pdf, ps, other

    cs.RO cs.IT eess.SY

    Robotic Wireless Energy Transfer in Dynamic Environments: System Design and Experimental Validation

    Authors: Shuai Wang, Ruihua Han, Yuncong Hong, Qi Hao, Miaowen Wen, Leila Musavian, Shahid Mumtaz, Derrick Wing Kwan Ng

    Abstract: Wireless energy transfer (WET) is a ground-breaking technology for cutting the last wire between mobile sensors and power grids in smart cities. Yet, WET only offers effective transmission of energy over a short distance. Robotic WET is an emerging paradigm that mounts the energy transmitter on a mobile robot and navigates the robot through different regions in a large area to charge remote energy… ▽ More

    Submitted 10 February, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

    Comments: single column, 18 pages, 6 figures, to appear in IEEE Communications Magazine

    Journal ref: IEEE Communications Magazine, Mar. 2022

  31. arXiv:2201.09048  [pdf, other

    cs.CV cs.RO

    Phase-SLAM: Phase Based Simultaneous Localization and Map** for Mobile Structured Light Illumination Systems

    Authors: Xi Zheng, Rui Ma, Rui Gao, Qi Hao

    Abstract: Structured Light Illumination (SLI) systems have been used for reliable indoor dense 3D scanning via phase triangulation. However, mobile SLI systems for 360 degree 3D reconstruction demand 3D point cloud registration, involving high computational complexity. In this paper, we propose a phase based Simultaneous Localization and Map** (Phase-SLAM) framework for fast and accurate SLI sensor pose e… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

  32. arXiv:2112.07621  [pdf, other

    cs.IR cs.LG

    Re-ranking With Constraints on Diversified Exposures for Homepage Recommender System

    Authors: Qi Hao, Tianze Luo, Guangda Huzhang

    Abstract: The homepage recommendation on most E-commerce applications places items in a hierarchical manner, where different channels display items in different styles. Existing algorithms usually optimize the performance of a single channel. So designing the model to achieve the optimal recommendation list which maximize the Click-Through Rate (CTR) of whole homepage is a challenge problem. Other than the… ▽ More

    Submitted 12 December, 2021; originally announced December 2021.

    Comments: 8pages,7figures

  33. arXiv:2112.05301  [pdf, other

    cs.CV

    Self-Ensemling for 3D Point Cloud Domain Adaption

    Authors: Qing Li, Xiaojiang Peng, Chuan Yan, Pan Gao, Qi Hao

    Abstract: Recently 3D point cloud learning has been a hot topic in computer vision and autonomous driving. Due to the fact that it is difficult to manually annotate a qualitative large-scale 3D point cloud dataset, unsupervised domain adaptation (UDA) is popular in 3D point cloud learning which aims to transfer the learned knowledge from the labeled source domain to the unlabeled target domain. However, the… ▽ More

    Submitted 24 March, 2023; v1 submitted 9 December, 2021; originally announced December 2021.

  34. arXiv:2110.04619  [pdf, ps, other

    cs.CV

    Google Landmark Retrieval 2021 Competition Third Place Solution

    Authors: Qishen Ha, Bo Liu, Hongwei Zhang

    Abstract: We present our solutions to the Google Landmark Challenges 2021, for both the retrieval and the recognition tracks. Both solutions are ensembles of transformers and ConvNet models based on Sub-center ArcFace with dynamic margins. Since the two tracks share the same training data, we used the same pipeline and training approach, but with different model selections for the ensemble and different pos… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

  35. arXiv:2109.13446  [pdf, other

    cs.RO eess.SY

    Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles

    Authors: Shengduo Chen, Yaowei Sun, Dachuan Li, Qiang Wang, Qi Hao, Joseph Sifakis

    Abstract: Providing safety guarantees for Autonomous Vehicle (AV) systems with machine-learning-based controllers remains a challenging issue. In this work, we propose Simplex-Drive, a framework that can achieve runtime safety assurance for machine-learning enabled controllers of AVs. The proposed Simplex-Drive consists of an unverified Deep Reinforcement Learning (DRL)-based advanced controller (AC) that a… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    MSC Class: 68T40 ACM Class: I.2.9

  36. arXiv:2108.13669  [pdf, ps, other

    eess.SP cs.LG

    Unit-Modulus Wireless Federated Learning Via Penalty Alternating Minimization

    Authors: Shuai Wang, Dachuan Li, Rui Wang, Qi Hao, Yik-Chung Wu, Derrick Wing Kwan Ng

    Abstract: Wireless federated learning (FL) is an emerging machine learning paradigm that trains a global parametric model from distributed datasets via wireless communications. This paper proposes a unit-modulus wireless FL (UMWFL) framework, which simultaneously uploads local model parameters and computes global model parameters via optimized phase shifting. The proposed framework avoids sophisticated base… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: IEEE Global Communications Conference 2021. arXiv admin note: substantial text overlap with arXiv:2101.12051

  37. arXiv:2108.05118  [pdf

    cs.RO cs.AI eess.SY

    Capture Uncertainties in Deep Neural Networks for Safe Operation of Autonomous Driving Vehicles

    Authors: Liuhui Ding, Dachuan Li, Bowen Liu, Wenxing Lan, Bing Bai, Qi Hao, Weipeng Cao, Ke Pei

    Abstract: Uncertainties in Deep Neural Network (DNN)-based perception and vehicle's motion pose challenges to the development of safe autonomous driving vehicles. In this paper, we propose a safe motion planning framework featuring the quantification and propagation of DNN-based perception uncertainties and motion uncertainties. Contributions of this work are twofold: (1) A Bayesian Deep Neural network mode… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

    Comments: To appear in the 19th IEEE International Symposium on Parallel and Distributed Processing with Applications (IEEE ISPA 2021)

    MSC Class: 68T40 ACM Class: I.2.9

  38. arXiv:2108.04602  [pdf, other

    cs.CV

    Joint Multi-Object Detection and Tracking with Camera-LiDAR Fusion for Autonomous Driving

    Authors: Kemiao Huang, Qi Hao

    Abstract: Multi-object tracking (MOT) with camera-LiDAR fusion demands accurate results of object detection, affinity computation and data association in real time. This paper presents an efficient multi-modal MOT framework with online joint detection and tracking schemes and robust data association for autonomous driving applications. The novelty of this work includes: (1) development of an end-to-end deep… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: accepted by IROS 2021

  39. Residual Network and Embedding Usage: New Tricks of Node Classification with Graph Convolutional Networks

    Authors: Huixuan Chi, Yuying Wang, Qinfen Hao, Hong Xia

    Abstract: Graph Convolutional Networks (GCNs) and subsequent variants have been proposed to solve tasks on graphs, especially node classification tasks. In the literature, however, most tricks or techniques are either briefly mentioned as implementation details or only visible in source code. In this paper, we first summarize some existing effective tricks used in GCNs mini-batch training. Based on this, tw… ▽ More

    Submitted 21 May, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: This work is still working in process. (14 pages, 6 figures)

  40. arXiv:2104.14447  [pdf, other

    math.NA cs.DC cs.MS cs.PF math.AP

    Parallel implementation of a compatible high-order meshless method for the Stokes' equations

    Authors: Quang-Thinh Ha, Paul A. Kuberry, Nathaniel A. Trask, Emily M. Ryan

    Abstract: A parallel implementation of a compatible discretization scheme for steady-state Stokes problems is presented in this work. The scheme uses generalized moving least squares to generate differential operators and apply boundary conditions. This meshless scheme allows a high-order convergence for both the velocity and pressure, while also incorporates finite-difference-like sparse discretization. Ad… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  41. Hierarchical Convolutional Neural Network with Feature Preservation and Autotuned Thresholding for Crack Detection

    Authors: Qiuchen Zhu, Tran Hiep Dinh, Manh Duong Phung, Quang Phuc Ha

    Abstract: Drone imagery is increasingly used in automated inspection for infrastructure surface defects, especially in hazardous or unreachable environments. In machine vision, the key to crack detection rests with robust and accurate algorithms for image processing. To this end, this paper proposes a deep learning approach using hierarchical convolutional neural networks with feature preservation (HCNNFP)… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Journal ref: IEEE Access, 2021

  42. arXiv:2104.10033  [pdf, other

    cs.NE cs.AI cs.RO eess.SY

    Safety-enhanced UAV Path Planning with Spherical Vector-based Particle Swarm Optimization

    Authors: Manh Duong Phung, Quang Phuc Ha

    Abstract: This paper presents a new algorithm named spherical vector-based particle swarm optimization (SPSO) to deal with the problem of path planning for unmanned aerial vehicles (UAVs) in complicated environments subjected to multiple threats. A cost function is first formulated to convert the path planning into an optimization problem that incorporates requirements and constraints for the feasible and s… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

    Journal ref: Applied Soft Computing, Volume 107, August 2021, 107376

  43. arXiv:2103.04580  [pdf, other

    cs.CV

    Unsupervised Person Re-Identification with Multi-Label Learning Guided Self-Paced Clustering

    Authors: Qing Li, Xiaojiang Peng, Yu Qiao, Qi Hao

    Abstract: Although unsupervised person re-identification (Re-ID) has drawn increasing research attention recently, it remains challenging to learn discriminative features without annotations across disjoint camera views. In this paper, we address the unsupervised person Re-ID with a conceptually novel yet simple framework, termed as Multi-label Learning guided self-paced Clustering (MLC). MLC mainly learns… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

  44. Distributed Dynamic Map Fusion via Federated Learning for Intelligent Networked Vehicles

    Authors: Zijian Zhang, Shuai Wang, Yuncong Hong, Liangkai Zhou, Qi Hao

    Abstract: The technology of dynamic map fusion among networked vehicles has been developed to enlarge sensing ranges and improve sensing accuracies for individual vehicles. This paper proposes a federated learning (FL) based dynamic map fusion framework to achieve high map quality despite unknown numbers of objects in fields of view (FoVs), various sensing and model uncertainties, and missing data labels fo… ▽ More

    Submitted 21 September, 2022; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: 7 pages, 5 figures, 2021 IEEE International Conference on Robotics and Automation (ICRA)

  45. Edge Federated Learning Via Unit-Modulus Over-The-Air Computation

    Authors: Shuai Wang, Yuncong Hong, Rui Wang, Qi Hao, Yik-Chung Wu, Derrick Wing Kwan Ng

    Abstract: Edge federated learning (FL) is an emerging paradigm that trains a global parametric model from distributed datasets based on wireless communications. This paper proposes a unit-modulus over-the-air computation (UMAirComp) framework to facilitate efficient edge federated learning, which simultaneously uploads local model parameters and updates global model parameters via analog beamforming. The pr… ▽ More

    Submitted 11 April, 2022; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: IEEE Transactions on Communications, vol. 70, no. 5, 2022

  46. arXiv:2010.15371  [pdf, ps, other

    cs.IT cs.AI

    Learning Centric Wireless Resource Allocation for Edge Computing: Algorithm and Experiment

    Authors: Liangkai Zhou, Yuncong Hong, Shuai Wang, Ruihua Han, Dachuan Li, Rui Wang, Qi Hao

    Abstract: Edge intelligence is an emerging network architecture that integrates sensing, communication, computing components, and supports various machine learning applications, where a fundamental communication question is: how to allocate the limited wireless resources (such as time, energy) to the simultaneous model training of heterogeneous learning tasks? Existing methods ignore two important facts: 1)… ▽ More

    Submitted 22 December, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: 8 pages, 4 figures, to appear in IEEE Transactions on Vehicular Technology

  47. QBSUM: a Large-Scale Query-Based Document Summarization Dataset from Real-world Applications

    Authors: Mingjun Zhao, Shengli Yan, Bang Liu, Xinwang Zhong, Qian Hao, Haolan Chen, Di Niu, Bowei Long, Weidong Guo

    Abstract: Query-based document summarization aims to extract or generate a summary of a document which directly answers or is relevant to the search query. It is an important technique that can be beneficial to a variety of applications such as search engines, document-level machine reading comprehension, and chatbots. Currently, datasets designed for query-based summarization are short in numbers and exist… ▽ More

    Submitted 28 October, 2020; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: accepted by Computer Speech & Language

  48. arXiv:2010.05351  [pdf, other

    cs.CV

    Identifying Melanoma Images using EfficientNet Ensemble: Winning Solution to the SIIM-ISIC Melanoma Classification Challenge

    Authors: Qishen Ha, Bo Liu, Fuxu Liu

    Abstract: We present our winning solution to the SIIM-ISIC Melanoma Classification Challenge. It is an ensemble of convolutions neural network (CNN) models with different backbones and input sizes, most of which are image-only models while a few of them used image-level and patient-level metadata. The keys to our winning are: (1) stable validation scheme (2) good choice of model target (3) carefully tuned p… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: 6 pages, 2 figures

  49. arXiv:2010.05350  [pdf, other

    cs.CV

    Google Landmark Recognition 2020 Competition Third Place Solution

    Authors: Qishen Ha, Bo Liu, Fuxu Liu, Peiyuan Liao

    Abstract: We present our third place solution to the Google Landmark Recognition 2020 competition. It is an ensemble of global features only Sub-center ArcFace models. We introduce dynamic margins for ArcFace loss, a family of tune-able margin functions of class size, designed to deal with the extreme imbalance in GLDv2 dataset. Progressive finetuning and careful postprocessing are also key to the solution.… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: 5 pages, 2 figures

  50. Motion-Encoded Particle Swarm Optimization for Moving Target Search Using UAVs

    Authors: Manh Duong Phung, Quang Phuc Ha

    Abstract: This paper presents a novel algorithm named the motion-encoded particle swarm optimization (MPSO) for finding a moving target with unmanned aerial vehicles (UAVs). From the Bayesian theory, the search problem can be converted to the optimization of a cost function that represents the probability of detecting the target. Here, the proposed MPSO is developed to solve that problem by encoding the sea… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Applied Soft Computing, 2020