Showing 1–22 of 22 results for author: Ho, E S L

  1. arXiv:2406.18691  [pdf, other]

    cs.CV

    Geometric Features Enhanced Human-Object Interaction Detection

    Authors: Manli Zhu, Edmond S. L. Ho, Shuang Chen, Longzhi Yang, Hubert P. H. Shum

    Abstract: Cameras are essential vision instruments to capture images for pattern detection and measurement. Human-object interaction (HOI) detection is one of the most popular pattern detection approaches for captured human-centric visual scenes. Recently, Transformer-based models have become the dominant approach for HOI detection due to their advanced network architectures and thus promising results. Howe…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted to IEEE TIM

  2. arXiv:2404.05490  [pdf, other]

    cs.CV

    Two-Person Interaction Augmentation with Skeleton Priors

    Authors: Baiyi Li, Edmond S. L. Ho, Hubert P. H. Shum, He Wang

    Abstract: Close and continuous interaction with rich contacts is a crucial aspect of human activities (e.g. hugging, dancing) and of interest in many domains like activity recognition, motion prediction, character animation, etc. However, acquiring such skeletal motion is challenging. While direct motion capture is expensive and slow, motion editing/generation is also non-trivial, as complex contact pattern…

    Submitted 9 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  3. arXiv:2312.13776  [pdf, other]

    cs.CV

    Pose-based Tremor Type and Level Analysis for Parkinson's Disease from Video

    Authors: Haozheng Zhang, Edmond S. L. Ho, Xiatian Zhang, Silvia Del Din, Hubert P. H. Shum

    Abstract: Purpose: Current methods for diagnosis of PD rely on clinical examination. The accuracy of diagnosis ranges between 73% and 84%, and is influenced by the experience of the clinical assessor. Hence, an automatic, effective and interpretable supporting system for PD symptom identification would support clinicians in making more robust PD diagnostic decisions. Methods: We propose to analyze Parkinson'…

    Submitted 21 December, 2023; originally announced December 2023.

  4. arXiv:2310.18891  [pdf, other]

    cs.HC cs.CY cs.RO eess.SY

    Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles

    Authors: Luca Crosato, Kai Tian, Hubert P. H. Shum, Edmond S. L. Ho, Yafei Wang, Chongfeng Wei

    Abstract: Interaction-aware Autonomous Driving (IAAD) is a rapidly growing field of research that focuses on the development of autonomous vehicles (AVs) that are capable of interacting safely and efficiently with human road users. This is a challenging task, as it requires the autonomous vehicle to be able to understand and predict the behaviour of human road users. In this literature review, the current s…

    Submitted 30 October, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

  5. arXiv:2305.10589  [pdf, other]

    cs.CV

    INCLG: Inpainting for Non-Cleft Lip Generation with a Multi-Task Image Processing Network

    Authors: Shuang Chen, Amir Atapour-Abarghouei, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: We present software that predicts non-cleft facial images for patients with cleft lip, thereby facilitating the understanding, awareness and discussion of cleft lip surgeries. To protect patients' privacy, we design a software framework using image inpainting, which does not require cleft lip images for training, thereby mitigating the risk of model leakage. We implement a novel multi-task archit…

    Submitted 17 May, 2023; originally announced May 2023.

  6. arXiv:2304.00858  [pdf, other]

    cs.CV

    Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition

    Authors: Qianhui Men, Edmond S. L. Ho, Hubert P. H. Shum, Howard Leung

    Abstract: Learning view-invariant representation is a key to improving feature discrimination power for skeleton-based action recognition. Existing approaches cannot effectively remove the impact of viewpoint due to the implicit view-dependent representations. In this work, we propose a self-supervised framework called Focalized Contrastive View-invariant Learning (FoCoViL), which significantly suppresses t…

    Submitted 3 April, 2023; originally announced April 2023.

  7. arXiv:2209.02824  [pdf, other]

    cs.CV cs.LG eess.IV

    CP-AGCN: Pytorch-based Attention Informed Graph Convolutional Network for Identifying Infants at Risk of Cerebral Palsy

    Authors: Haozheng Zhang, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: Early prediction is clinically considered one of the essential parts of cerebral palsy (CP) treatment. We propose to implement a low-cost and interpretable classification system for supporting CP prediction based on General Movement Assessment (GMA). We design a Pytorch-based attention-informed graph convolutional network to early identify infants at risk of CP from skeletal data extracted from RG…

    Submitted 6 September, 2022; originally announced September 2022.

  8. arXiv:2208.08848  [pdf, other]

    cs.CV

    A Two-stream Convolutional Network for Musculoskeletal and Neurological Disorders Prediction

    Authors: Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum

    Abstract: Musculoskeletal and neurological disorders are the most common causes of walking problems among older people, and they often lead to diminished quality of life. Analyzing walking motion data manually requires trained professionals and the evaluations may not always be objective. To facilitate early diagnosis, recent deep learning-based methods have shown promising results for automated analysis, w…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Journal of Medical Systems

  9. arXiv:2208.01149  [pdf, other]

    cs.CV

    A Feasibility Study on Image Inpainting for Non-cleft Lip Generation from Patients with Cleft Lip

    Authors: Shuang Chen, Amir Atapour-Abarghouei, Jane Kerby, Edmond S. L. Ho, David C. G. Sainsbury, Sophie Butterworth, Hubert P. H. Shum

    Abstract: A cleft lip is a congenital abnormality requiring surgical repair by a specialist. The surgeon must have extensive experience and theoretical knowledge to perform surgery, and Artificial Intelligence (AI) methods have been proposed to guide surgeons in improving surgical outcomes. If AI can be used to predict what a repaired cleft lip would look like, surgeons could use it as an adjunct to adjust th…

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 4 pages, 2 figures, BHI 2022

  10. arXiv:2208.00774  [pdf, other]

    cs.GR cs.CV

    Interaction Mix and Match: Synthesizing Close Interaction using Conditional Hierarchical GAN with Multi-Hot Class Embedding

    Authors: Aman Goel, Qianhui Men, Edmond S. L. Ho

    Abstract: Synthesizing multi-character interactions is a challenging task due to the complex and varied interactions between the characters. In particular, precise spatiotemporal alignment between characters is required in generating close interactions such as dancing and fighting. Existing work in generating multi-character interactions focuses on generating a single type of reactive motion for a given seq…

    Submitted 4 August, 2022; v1 submitted 23 July, 2022; originally announced August 2022.

    Comments: Accepted to SCA 2022 (will be published in CGF)

  11. arXiv:2207.06828  [pdf, other]

    cs.CV cs.LG

    Pose-based Tremor Classification for Parkinson's Disease Diagnosis from Video

    Authors: Haozheng Zhang, Edmond S. L. Ho, Xiatian Zhang, Hubert P. H. Shum

    Abstract: Parkinson's disease (PD) is a progressive neurodegenerative disorder that results in a variety of motor dysfunction symptoms, including tremors, bradykinesia, rigidity and postural instability. The diagnosis of PD mainly relies on clinical experience rather than a definite medical test, and the diagnostic accuracy is only about 73-84% since it is challenged by the subjective opinions or experience…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: MICCAI 2022

  12. Interaction-aware Decision-making for Automated Vehicles using Social Value Orientation

    Authors: Luca Crosato, Hubert P. H. Shum, Edmond S. L. Ho, Chongfeng Wei

    Abstract: Motion control algorithms in the presence of pedestrians are critical for the development of safe and reliable Autonomous Vehicles (AVs). Traditional motion control algorithms rely on manually designed decision-making policies which neglect the mutual interactions between AVs and pedestrians. On the other hand, recent advances in Deep Reinforcement Learning allow for the automatic learning of poli…

    Submitted 12 July, 2022; originally announced July 2022.

  13. arXiv:2207.05733  [pdf, other]

    cs.CV cs.AI

    A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection

    Authors: Manli Zhu, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: Detecting human-object interactions is essential for comprehensive understanding of visual scenes. In particular, spatial connections between humans and objects are important cues for reasoning interactions. To this end, we propose a skeleton-aware graph convolutional network for human-object interaction detection, named SGCN4HOI. Our network exploits the spatial connections between human keypoint…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE SMC 2022

  14. arXiv:2204.13584  [pdf, ps, other]

    eess.SP cs.AI cs.CV cs.LG

    Predicting Sleeping Quality using Convolutional Neural Networks

    Authors: Vidya Rohini Konanur Sathish, Wai Lok Woo, Edmond S. L. Ho

    Abstract: Identifying sleep stages and patterns is an essential part of diagnosing and treating sleep disorders. With the advancement of smart technologies, sensor data related to sleeping patterns can be captured easily. In this paper, we propose a Convolutional Neural Network (CNN) architecture that improves the classification performance. In particular, we benchmark the classification performance from diff…

    Submitted 24 April, 2022; originally announced April 2022.

    ACM Class: I.2.10

  15. arXiv:2204.11357  [pdf, ps, other]

    cs.LG cs.CR cs.NI

    Improving Deep Learning Model Robustness Against Adversarial Attack by Increasing the Network Capacity

    Authors: Marco Marchetti, Edmond S. L. Ho

    Abstract: Nowadays, we are more and more reliant on Deep Learning (DL) models and thus it is essential to safeguard the security of these systems. This paper explores the security issues in Deep Learning and analyses, through the use of experiments, the way forward to build more resilient models. Experiments are conducted to identify the strengths and weaknesses of a new approach to improve the robustness o…

    Submitted 24 April, 2022; originally announced April 2022.

    ACM Class: I.2.10

  16. arXiv:2204.10997  [pdf, other]

    cs.CV cs.LG

    Cerebral Palsy Prediction with Frequency Attention Informed Graph Convolutional Networks

    Authors: Haozheng Zhang, Hubert P. H. Shum, Edmond S. L. Ho

    Abstract: Early diagnosis and intervention are clinically considered the paramount part of treating cerebral palsy (CP), so it is essential to design an efficient and interpretable automatic prediction system for CP. We highlight a significant difference between CP infants' frequency of human movement and that of the healthy group, which improves prediction performance. However, the existing deep learning-b…

    Submitted 28 March, 2023; v1 submitted 23 April, 2022; originally announced April 2022.

  17. arXiv:2110.00380  [pdf, other]

    cs.GR cs.CV

    GAN-based Reactive Motion Synthesis with Class-aware Discriminators for Human-human Interaction

    Authors: Qianhui Men, Hubert P. H. Shum, Edmond S. L. Ho, Howard Leung

    Abstract: Creating realistic characters that can react to the users' or another character's movement can benefit computer graphics, games and virtual reality hugely. However, synthesizing such reactive motions in human-human interactions is a challenging task due to the many different ways two humans can interact. While there are a number of successful studies in adapting the generative adversarial netwo…

    Submitted 1 October, 2021; originally announced October 2021.

  18. arXiv:2106.04966  [pdf, other]

    cs.CV cs.LG

    Towards Explainable Abnormal Infant Movements Identification: A Body-part Based Prediction and Visualisation Framework

    Authors: Kevin D. McCay, Edmond S. L. Ho, Dimitrios Sakkos, Wai Lok Woo, Claire Marcroft, Patricia Dulson, Nicholas D. Embleton

    Abstract: Providing early diagnosis of cerebral palsy (CP) is key to enhancing the developmental outcomes for those affected. Diagnostic tools such as the General Movements Assessment (GMA) have produced promising results in early diagnosis; however, these manual methods can be laborious. In this paper, we propose a new framework for the automated classification of infant body movements, based upon the GM…

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Proceedings of the 2021 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), accepted, 2021

    ACM Class: I.4.9; I.5.0; J.3; I.2.1

  19. arXiv:2106.04471  [pdf, other]

    cs.CV cs.LG eess.IV

    Interpreting Deep Learning based Cerebral Palsy Prediction with Channel Attention

    Authors: Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum

    Abstract: Early prediction of cerebral palsy is essential as it leads to early treatment and monitoring. Deep learning has shown promising results in biomedical engineering thanks to its capacity of modelling complicated data with its non-linear architecture. However, due to their complex structure, deep learning models are generally not interpretable by humans, making it difficult for clinicians to rely on…

    Submitted 8 June, 2021; originally announced June 2021.

  20. arXiv:2103.09184  [pdf, other]

    cs.RO cs.MA eess.SY

    Formation Control for UAVs Using a Flux Guided Approach

    Authors: John Hartley, Hubert P. H. Shum, Edmond S. L. Ho, He Wang, Subramanian Ramamoorthy

    Abstract: Existing studies on formation control for unmanned aerial vehicles (UAV) have not considered encircling targets where an optimum coverage of the target is required at all times. Such coverage plays a critical role in many real-world applications such as tracking hostile UAVs. This paper proposes a new path planning approach called the Flux Guided (FG) method, which generates collision-free traject…

    Submitted 31 May, 2022; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: 37 pages, 9 figures, 3 tables

  21. arXiv:1910.08470  [pdf, other]

    cs.CV cs.GR

    Illumination-Based Data Augmentation for Robust Background Subtraction

    Authors: Dimitrios Sakkos, Hubert P. H. Shum, Edmond S. L. Ho

    Abstract: A core challenge in background subtraction (BGS) is handling videos with sudden illumination changes in consecutive frames. In this paper, we tackle the problem from a data point-of-view using data augmentation. Our method performs data augmentation that not only creates endless data on the fly, but also features semantic transformations of illumination which enhance the generalisation of the mode…

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: SKIMA 2019 - Best Paper Award

  22. arXiv:1908.07214  [pdf, other]

    cs.GR cs.LG

    Spatio-temporal Manifold Learning for Human Motions via Long-horizon Modeling

    Authors: He Wang, Edmond S. L. Ho, Hubert P. H. Shum, Zhanxing Zhu

    Abstract: Data-driven modeling of human motions is ubiquitous in computer graphics and computer vision applications, such as synthesizing realistic motions or recognizing actions. Recent research has shown that such problems can be approached by learning a natural motion manifold using deep learning to address the shortcomings of traditional data-driven approaches. However, previous methods can be sub-optim…

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: 12 pages, Accepted in IEEE Transactions on Visualization and Computer Graphics

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, 2019