Skip to main content

Showing 1–47 of 47 results for author: Shum, H P H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00917  [pdf, other

    cs.CV

    From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos

    Authors: Tanqiu Qiao, Ruochen Li, Frederick W. B. Li, Hubert P. H. Shum

    Abstract: Video-based Human-Object Interaction (HOI) recognition explores the intricate dynamics between humans and objects, which are essential for a comprehensive understanding of human behavior and intentions. While previous work has made significant strides, effectively integrating geometric and visual features to model dynamic relationships between humans and objects in a graph framework remains a chal… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: Accepted by ICPR 2024

  2. arXiv:2406.18691  [pdf, other

    cs.CV

    Geometric Features Enhanced Human-Object Interaction Detection

    Authors: Manli Zhu, Edmond S. L. Ho, Shuang Chen, Longzhi Yang, Hubert P. H. Shum

    Abstract: Cameras are essential vision instruments to capture images for pattern detection and measurement. Human-object interaction (HOI) detection is one of the most popular pattern detection approaches for captured human-centric visual scenes. Recently, Transformer-based models have become the dominant approach for HOI detection due to their advanced network architectures and thus promising results. Howe… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted to IEEE TIM

  3. arXiv:2406.18422  [pdf, other

    cs.CV eess.IV

    Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling

    Authors: Abril Corona-Figueroa, Hubert P. H. Shum, Chris G. Willcocks

    Abstract: This paper investigates a 2D to 3D image translation method with a straightforward technique, enabling correlated 2D X-ray to 3D CT-like reconstruction. We observe that existing approaches, which integrate information across multiple 2D views in the latent space, lose valuable signal information during latent encoding. Instead, we simply repeat and concatenate the 2D views into higher-channel 3D v… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: CVPRW 2024 - DCA in MI; Best Paper Award

  4. DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications

    Authors: Li Li, Khalid N. Ismail, Hubert P. H. Shum, Toby P. Breckon

    Abstract: We present DurLAR, a high-fidelity 128-channel 3D LiDAR dataset with panoramic ambient (near infrared) and reflectivity imagery, as well as a sample benchmark task using depth estimation for autonomous driving applications. Our driving platform is equipped with a high resolution 128 channel LiDAR, a 2MPix stereo camera, a lux meter and a GNSS/INS system. Ambient and reflectivity images are made av… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted by 3DV 2021; 13 pages, 14 figures; Dataset at https://github.com/l1997i/durlar

    Journal ref: Proc. Int. Conf. on 3D Vision (3DV 2021)

  5. arXiv:2404.05490  [pdf, other

    cs.CV

    Two-Person Interaction Augmentation with Skeleton Priors

    Authors: Baiyi Li, Edmond S. L. Ho, Hubert P. H. Shum, He Wang

    Abstract: Close and continuous interaction with rich contacts is a crucial aspect of human activities (e.g. hugging, dancing) and of interest in many domains like activity recognition, motion prediction, character animation, etc. However, acquiring such skeletal motion is challenging. While direct motion capture is expensive and slow, motion editing/generation is also non-trivial, as complex contact pattern… ▽ More

    Submitted 9 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  6. arXiv:2403.04398  [pdf, other

    cs.CV

    MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment

    Authors: Kanglei Zhou, Liyuan Wang, Xingxing Zhang, Hubert P. H. Shum, Frederick W. B. Li, Jianguo Li, Xiaohui Liang

    Abstract: Action Quality Assessment (AQA) evaluates diverse skills but models struggle with non-stationary data. We propose Continual AQA (CAQA) to refine models using sparse new data. Feature replay preserves memory without storing raw inputs. However, the misalignment between static old features and the dynamically changing feature manifold causes severe catastrophic forgetting. To address this novel prob… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  7. arXiv:2402.14185  [pdf, other

    cs.CV

    HINT: High-quality INPainting Transformer with Mask-Aware Encoding and Enhanced Attention

    Authors: Shuang Chen, Amir Atapour-Abarghouei, Hubert P. H. Shum

    Abstract: Existing image inpainting methods leverage convolution-based downsampling approaches to reduce spatial dimensions. This may result in information loss from corrupted images where the available information is inherently sparse, especially for the scenario of large missing regions. Recent advances in self-attention mechanisms within transformers have led to significant improvements in many computer… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  8. arXiv:2402.11288  [pdf

    cs.CV

    Enhancing Surgical Performance in Cardiothoracic Surgery with Innovations from Computer Vision and Artificial Intelligence: A Narrative Review

    Authors: Merryn D. Constable, Hubert P. H. Shum, Stephen Clark

    Abstract: When technical requirements are high, and patient outcomes are critical, opportunities for monitoring and improving surgical skills via objective motion analysis feedback may be particularly beneficial. This narrative review synthesises work on technical and non-technical surgical skills, collaborative task performance, and pose estimation to illustrate new opportunities to advance cardiothoracic… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  9. arXiv:2312.13776  [pdf, other

    cs.CV

    Pose-based Tremor Type and Level Analysis for Parkinson's Disease from Video

    Authors: Haozheng Zhang, Edmond S. L. Ho, Xiatian Zhang, Silvia Del Din, Hubert P. H. Shum

    Abstract: Purpose:Current methods for diagnosis of PD rely on clinical examination. The accuracy of diagnosis ranges between 73% and 84%, and is influenced by the experience of the clinical assessor. Hence, an automatic, effective and interpretable supporting system for PD symptom identification would support clinicians in making more robust PD diagnostic decisions. Methods: We propose to analyze Parkinson'… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  10. arXiv:2311.10463  [pdf, other

    eess.IV cs.CV

    Correlation-Distance Graph Learning for Treatment Response Prediction from rs-fMRI

    Authors: Xiatian Zhang, Sisi Zheng, Hubert P. H. Shum, Haozheng Zhang, Nan Song, Mingkang Song, Hongxiao Jia

    Abstract: Resting-state fMRI (rs-fMRI) functional connectivity (FC) analysis provides valuable insights into the relationships between different brain regions and their potential implications for neurological or psychiatric disorders. However, specific design efforts to predict treatment response from rs-fMRI remain limited due to difficulties in understanding the current brain state and the underlying mech… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Proceedings of the 2023 International Conference on Neural Information Processing (ICONIP)

  11. arXiv:2311.06018  [pdf, other

    cs.CV

    U3DS$^3$: Unsupervised 3D Semantic Scene Segmentation

    Authors: Jiaxu Liu, Zhengdi Yu, Toby P. Breckon, Hubert P. H. Shum

    Abstract: Contemporary point cloud segmentation approaches largely rely on richly annotated 3D training data. However, it is both time-consuming and challenging to obtain consistently accurate annotations for such 3D scene data. Moreover, there is still a lack of investigation into fully unsupervised scene segmentation for point clouds, especially for holistic 3D scenes. This paper presents U3DS$^3$, as a s… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 10 Pages, 4 figures, accepted to IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  12. arXiv:2310.18891  [pdf, other

    cs.HC cs.CY cs.RO eess.SY

    Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles

    Authors: Luca Crosato, Kai Tian, Hubert P. H Shum, Edmond S. L. Ho, Yafei Wang, Chongfeng Wei

    Abstract: Interaction-aware Autonomous Driving (IAAD) is a rapidly growing field of research that focuses on the development of autonomous vehicles (AVs) that are capable of interacting safely and efficiently with human road users. This is a challenging task, as it requires the autonomous vehicle to be able to understand and predict the behaviour of human road users. In this literature review, the current s… ▽ More

    Submitted 30 October, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

  13. arXiv:2308.14152  [pdf, other

    cs.CV

    Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code Diffusion using Transformers

    Authors: Abril Corona-Figueroa, Sam Bond-Taylor, Neelanjan Bhowmik, Yona Falinie A. Gaus, Toby P. Breckon, Hubert P. H. Shum, Chris G. Willcocks

    Abstract: Generating 3D images of complex objects conditionally from a few 2D views is a difficult synthesis problem, compounded by issues such as domain gap and geometric misalignment. For instance, a unified framework such as Generative Adversarial Networks cannot achieve this unless they explicitly define both a domain-invariant and geometric-invariant joint latent distribution, whereas Neural Radiance F… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: Camera-ready version for ICCV 2023

  14. arXiv:2308.13042  [pdf, other

    cs.CV cs.HC

    Enhancing Perception and Immersion in Pre-Captured Environments through Learning-Based Eye Height Adaptation

    Authors: Qi Feng, Hubert P. H. Shum, Shigeo Morishima

    Abstract: Pre-captured immersive environments using omnidirectional cameras provide a wide range of virtual reality applications. Previous research has shown that manipulating the eye height in egocentric virtual environments can significantly affect distance perception and immersion. However, the influence of eye height in pre-captured real environments has received less attention due to the difficulty of… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 10 pages, 13 figures, 3 tables, submitted to ISMAR 2023

  15. arXiv:2308.05681  [pdf, other

    cs.CV cs.AI cs.LG

    Hard No-Box Adversarial Attack on Skeleton-Based Human Action Recognition with Skeleton-Motion-Informed Gradient

    Authors: Zhengzhi Lu, He Wang, Ziyi Chang, Guoan Yang, Hubert P. H. Shum

    Abstract: Recently, methods for skeleton-based human activity recognition have been shown to be vulnerable to adversarial attacks. However, these attack methods require either the full knowledge of the victim (i.e. white-box attacks), access to training data (i.e. transfer-based attacks) or frequent model queries (i.e. black-box attacks). All their requirements are highly restrictive, raising the question o… ▽ More

    Submitted 18 August, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: Camera-ready version for ICCV 2023

  16. arXiv:2306.04542  [pdf, other

    cs.LG cs.AI cs.CV

    On the Design Fundamentals of Diffusion Models: A Survey

    Authors: Ziyi Chang, George Alex Koulieris, Hubert P. H. Shum

    Abstract: Diffusion models are generative models, which gradually add and remove noise to learn the underlying distribution of training data for data generation. The components of diffusion models have gained significant attention with many design choices proposed. Existing reviews have primarily focused on higher-level solutions, thereby covering less on the design fundamentals of components. This study se… ▽ More

    Submitted 19 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

  17. arXiv:2305.10589  [pdf, other

    cs.CV

    INCLG: Inpainting for Non-Cleft Lip Generation with a Multi-Task Image Processing Network

    Authors: Shuang Chen, Amir Atapour-Abarghouei, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: We present a software that predicts non-cleft facial images for patients with cleft lip, thereby facilitating the understanding, awareness and discussion of cleft lip surgeries. To protect patients privacy, we design a software framework using image inpainting, which does not require cleft lip images for training, thereby mitigating the risk of model leakage. We implement a novel multi-task archit… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  18. arXiv:2304.00858  [pdf, other

    cs.CV

    Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition

    Authors: Qianhui Men, Edmond S. L. Ho, Hubert P. H. Shum, Howard Leung

    Abstract: Learning view-invariant representation is a key to improving feature discrimination power for skeleton-based action recognition. Existing approaches cannot effectively remove the impact of viewpoint due to the implicit view-dependent representations. In this work, we propose a self-supervised framework called Focalized Contrastive View-invariant Learning (FoCoViL), which significantly suppresses t… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  19. arXiv:2303.11203  [pdf, other

    cs.CV

    Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation

    Authors: Li Li, Hubert P. H. Shum, Toby P. Breckon

    Abstract: Whilst the availability of 3D LiDAR point cloud data has significantly grown in recent years, annotation remains expensive and time-consuming, leading to a demand for semi-supervised semantic segmentation methods with application domains such as autonomous driving. Existing work very often employs relatively large segmentation backbone networks to improve segmentation accuracy, at the expense of c… ▽ More

    Submitted 28 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023; 11 pages, 8 figures; Code at https://github.com/l1997i/lim3d

  20. arXiv:2301.02524  [pdf, other

    cs.CV

    Tackling Data Bias in Painting Classification with Style Transfer

    Authors: Mridula Vijendran, Frederick W. B. Li, Hubert P. H. Shum

    Abstract: It is difficult to train classifiers on paintings collections due to model bias from domain gaps and data bias from the uneven distribution of artistic styles. Previous techniques like data distillation, traditional data augmentation and style transfer improve classifier training using task specific training datasets or domain adaptation. We propose a system to handle data bias in small paintings… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: International Conference on Computer Vision Theory and Applications (VISAPP), 2023 ,12 pages, 9 figures

  21. arXiv:2212.08526  [pdf, ps, other

    cs.CV cs.AI cs.GR

    Unifying Human Motion Synthesis and Style Transfer with Denoising Diffusion Probabilistic Models

    Authors: Ziyi Chang, Edmund J. C. Findlay, Haozheng Zhang, Hubert P. H. Shum

    Abstract: Generating realistic motions for digital humans is a core but challenging part of computer animations and games, as human motions are both diverse in content and rich in styles. While the latest deep learning approaches have made significant advancements in this domain, they mostly consider motion synthesis and style manipulation as two separate problems. This is mainly due to the challenge of lea… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  22. arXiv:2210.04265  [pdf, other

    cs.CV

    3D Reconstruction of Sculptures from Single Images via Unsupervised Domain Adaptation on Implicit Models

    Authors: Ziyi Chang, George Alex Koulieris, Hubert P. H. Shum

    Abstract: Acquiring the virtual equivalent of exhibits, such as sculptures, in virtual reality (VR) museums, can be labour-intensive and sometimes infeasible. Deep learning based 3D reconstruction approaches allow us to recover 3D shapes from 2D observations, among which single-view-based approaches can reduce the need for human intervention and specialised equipment in acquiring 3D sculptures for VR museum… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  23. arXiv:2209.14828  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Denoising Diffusion Probabilistic Models for Styled Walking Synthesis

    Authors: Edmund J. C. Findlay, Haozheng Zhang, Ziyi Chang, Hubert P. H. Shum

    Abstract: Generating realistic motions for digital humans is time-consuming for many graphics applications. Data-driven motion synthesis approaches have seen solid progress in recent years through deep generative models. These results offer high-quality motions but typically suffer in motion style diversity. For the first time, we propose a framework using the denoising diffusion probabilistic model (DDPM)… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  24. arXiv:2209.02824  [pdf, other

    cs.CV cs.LG eess.IV

    CP-AGCN: Pytorch-based Attention Informed Graph Convolutional Network for Identifying Infants at Risk of Cerebral Palsy

    Authors: Haozheng Zhang, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: Early prediction is clinically considered one of the essential parts of cerebral palsy (CP) treatment. We propose to implement a low-cost and interpretable classification system for supporting CP prediction based on General Movement Assessment (GMA). We design a Pytorch-based attention-informed graph convolutional network to early identify infants at risk of CP from skeletal data extracted from RG… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  25. arXiv:2208.08848  [pdf, other

    cs.CV

    A Two-stream Convolutional Network for Musculoskeletal and Neurological Disorders Prediction

    Authors: Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum

    Abstract: Musculoskeletal and neurological disorders are the most common causes of walking problems among older people, and they often lead to diminished quality of life. Analyzing walking motion data manually requires trained professionals and the evaluations may not always be objective. To facilitate early diagnosis, recent deep learning-based methods have shown promising results for automated analysis, w… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Journal of Medical Systems

  26. arXiv:2208.03824  [pdf, other

    cs.CV cs.LG

    Towards Graph Representation Learning Based Surgical Workflow Anticipation

    Authors: Xiatian Zhang, Noura Al Moubayed, Hubert P. H. Shum

    Abstract: Surgical workflow anticipation can give predictions on what steps to conduct or what instruments to use next, which is an essential part of the computer-assisted intervention system for surgery, e.g. workflow reasoning in robotic surgery. However, current approaches are limited to their insufficient expressive power for relationships between instruments. Hence, we propose a graph representation le… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: Proceedings of the 2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), 2022

  27. arXiv:2208.01149  [pdf, other

    cs.CV

    A Feasibility Study on Image Inpainting for Non-cleft Lip Generation from Patients with Cleft Lip

    Authors: Shuang Chen, Amir Atapour-Abarghouei, Jane Kerby, Edmond S. L. Ho, David C. G. Sainsbury, Sophie Butterworth, Hubert P. H. Shum

    Abstract: A Cleft lip is a congenital abnormality requiring surgical repair by a specialist. The surgeon must have extensive experience and theoretical knowledge to perform surgery, and Artificial Intelligence (AI) method has been proposed to guide surgeons in improving surgical outcomes. If AI can be used to predict what a repaired cleft lip would look like, surgeons could use it as an adjunct to adjust th… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 4 pages, 2 figures, BHI 2022

  28. arXiv:2207.09425  [pdf, other

    cs.CV

    Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos

    Authors: Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima, Hubert P. H. Shum

    Abstract: Human-Object Interaction (HOI) recognition in videos is important for analyzing human activity. Most existing work focusing on visual features usually suffer from occlusion in the real-world scenarios. Such a problem will be further complicated when multiple people and objects are involved in HOIs. Consider that geometric features such as human pose and object position provide meaningful informati… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV 2022

  29. arXiv:2207.06828  [pdf, other

    cs.CV cs.LG

    Pose-based Tremor Classification for Parkinson's Disease Diagnosis from Video

    Authors: Haozheng Zhang, Edmond S. L. Ho, Xiatian Zhang, Hubert P. H. Shum

    Abstract: Parkinson's disease (PD) is a progressive neurodegenerative disorder that results in a variety of motor dysfunction symptoms, including tremors, bradykinesia, rigidity and postural instability. The diagnosis of PD mainly relies on clinical experience rather than a definite medical test, and the diagnostic accuracy is only about 73-84% since it is challenged by the subjective opinions or experience… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: MICCAI 2022

  30. Interaction-aware Decision-making for Automated Vehicles using Social Value Orientation

    Authors: Luca Crosato, Hubert P. H. Shum, Edmond S. L. Ho, Chongfeng Wei

    Abstract: Motion control algorithms in the presence of pedestrians are critical for the development of safe and reliable Autonomous Vehicles (AVs). Traditional motion control algorithms rely on manually designed decision-making policies which neglect the mutual interactions between AVs and pedestrians. On the other hand, recent advances in Deep Reinforcement Learning allow for the automatic learning of poli… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  31. arXiv:2207.05733  [pdf, other

    cs.CV cs.AI

    A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection

    Authors: Manli Zhu, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: Detecting human-object interactions is essential for comprehensive understanding of visual scenes. In particular, spatial connections between humans and objects are important cues for reasoning interactions. To this end, we propose a skeleton-aware graph convolutional network for human-object interaction detection, named SGCN4HOI. Our network exploits the spatial connections between human keypoint… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE SMC 2022

  32. arXiv:2206.15275  [pdf, other

    cs.CV

    Multiclass-SGCN: Sparse Graph-based Trajectory Prediction with Agent Class Embedding

    Authors: Ruochen Li, Stamos Katsigiannis, Hubert P. H. Shum

    Abstract: Trajectory prediction of road users in real-world scenarios is challenging because their movement patterns are stochastic and complex. Previous pedestrian-oriented works have been successful in modelling the complex interactions among pedestrians, but fail in predicting trajectories when other types of road users are involved (e.g., cars, cyclists, etc.), because they ignore user types. Although a… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

  33. arXiv:2204.10997  [pdf, other

    cs.CV cs.LG

    Cerebral Palsy Prediction with Frequency Attention Informed Graph Convolutional Networks

    Authors: Haozheng Zhang, Hubert P. H. Shum, Edmond S. L. Ho

    Abstract: Early diagnosis and intervention are clinically considered the paramount part of treating cerebral palsy (CP), so it is essential to design an efficient and interpretable automatic prediction system for CP. We highlight a significant difference between CP infants' frequency of human movement and that of the healthy group, which improves prediction performance. However, the existing deep learning-b… ▽ More

    Submitted 28 March, 2023; v1 submitted 23 April, 2022; originally announced April 2022.

  34. arXiv:2203.17085  [pdf, other

    cs.LG

    RobIn: A Robust Interpretable Deep Network for Schizophrenia Diagnosis

    Authors: Daniel Organisciak, Hubert P. H. Shum, Ephraim Nwoye, Wai Lok Woo

    Abstract: Schizophrenia is a severe mental health condition that requires a long and complicated diagnostic process. However, early diagnosis is vital to control symptoms. Deep learning has recently become a popular way to analyse and interpret medical data. Past attempts to use deep learning for schizophrenia diagnosis from brain-imaging data have shown promise but suffer from a large training-application… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

  35. arXiv:2202.08010  [pdf, other

    cs.CV

    360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse Network

    Authors: Qi Feng, Hubert P. H. Shum, Shigeo Morishima

    Abstract: Single-view depth estimation from omnidirectional images has gained popularity with its wide range of applications such as autonomous driving and scene reconstruction. Although data-driven learning-based methods demonstrate significant potential in this field, scarce training data and ineffective 360 estimation algorithms are still two key limitations hindering accurate estimation across diverse d… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: 10 pages, 10 figures, 5 tables, submitted to IEEE VR 2022

    ACM Class: I.2.10

  36. arXiv:2202.01020  [pdf, other

    eess.IV cs.CV

    MedNeRF: Medical Neural Radiance Fields for Reconstructing 3D-aware CT-Projections from a Single X-ray

    Authors: Abril Corona-Figueroa, Jonathan Frawley, Sam Bond-Taylor, Sarath Bethapudi, Hubert P. H. Shum, Chris G. Willcocks

    Abstract: Computed tomography (CT) is an effective medical imaging modality, widely used in the field of clinical medicine for the diagnosis of various pathologies. Advances in Multidetector CT imaging technology have enabled additional functionalities, including generation of thin slice multiplanar cross-sectional body imaging and 3D reconstructions. However, this involves patients being exposed to a consi… ▽ More

    Submitted 8 April, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: 6 pages, 4 figures, accepted at IEEE EMBC 2022

    ACM Class: I.4; J.7

  37. Spoofing Detection on Hand Images Using Quality Assessment

    Authors: Asish Bera, Ratnadeep Dey, Debotosh Bhattacharjee, Mita Nasipuri, Hubert P. H. Shum

    Abstract: Recent research on biometrics focuses on achieving a high success rate of authentication and addressing the concern of various spoofing attacks. Although hand geometry recognition provides adequate security over unauthorized access, it is susceptible to presentation attack. This paper presents an anti-spoofing method toward hand biometrics. A presentation attack detection approach is addressed by… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    Journal ref: Multimedia Tools and Applications, Springer. 2021

  38. arXiv:2110.00380  [pdf, other

    cs.GR cs.CV

    GAN-based Reactive Motion Synthesis with Class-aware Discriminators for Human-human Interaction

    Authors: Qianhui Men, Hubert P. H. Shum, Edmond S. L. Ho, Howard Leung

    Abstract: Creating realistic characters that can react to the users' or another character's movement can benefit computer graphics, games and virtual reality hugely. However, synthesizing such reactive motions in human-human interactions is a challenging task due to the many different ways two humans can interact. While there are a number of successful researches in adapting the generative adversarial netwo… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

  39. arXiv:2108.13969  [pdf, other

    cs.CV cs.LG

    Semi-Supervised Crowd Counting from Unlabeled Data

    Authors: Haoran Duan, Fan Wan, Rui Sun, Zeyu Wang, Varun Ojha, Yu Guan, Hubert P. H. Shum, Bingzhang Hu, Yang Long

    Abstract: Automatic Crowd behavior analysis can be applied to effectively help the daily transportation statistics and planning, which helps the smart city construction. As one of the most important keys, crowd counting has drawn increasing attention. Recent works achieved promising performance but relied on the supervised paradigm with expensive crowd annotations. To alleviate the annotation cost in real-w… ▽ More

    Submitted 26 March, 2024; v1 submitted 31 August, 2021; originally announced August 2021.

  40. arXiv:2108.04740  [pdf, other

    cs.CV

    Semantics-STGCNN: A Semantics-guided Spatial-Temporal Graph Convolutional Network for Multi-class Trajectory Prediction

    Authors: Ben A. Rainbow, Qianhui Men, Hubert P. H. Shum

    Abstract: Predicting the movement trajectories of multiple classes of road users in real-world scenarios is a challenging task due to the diverse trajectory patterns. While recent works of pedestrian trajectory prediction successfully modelled the influence of surrounding neighbours based on the relative distances, they are ineffective on multi-class trajectory prediction. This is because they ignore the im… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

  41. arXiv:2106.04471  [pdf, other

    cs.CV cs.LG eess.IV

    Interpreting Deep Learning based Cerebral Palsy Prediction with Channel Attention

    Authors: Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum

    Abstract: Early prediction of cerebral palsy is essential as it leads to early treatment and monitoring. Deep learning has shown promising results in biomedical engineering thanks to its capacity of modelling complicated data with its non-linear architecture. However, due to their complex structure, deep learning models are generally not interpretable by humans, making it difficult for clinicians to rely on… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  42. arXiv:2104.06219  [pdf, other

    cs.CV cs.LG cs.RO

    UAV-ReID: A Benchmark on Unmanned Aerial Vehicle Re-identification in Video Imagery

    Authors: Daniel Organisciak, Matthew Poyser, Aishah Alsehaim, Shanfeng Hu, Brian K. S. Isaac-Medina, Toby P. Breckon, Hubert P. H. Shum

    Abstract: As unmanned aerial vehicles (UAVs) become more accessible with a growing range of applications, the potential risk of UAV disruption increases. Recent development in deep learning allows vision-based counter-UAV systems to detect and track UAVs with a single camera. However, the coverage of a single camera is limited, necessitating the need for multicamera configurations to match UAVs across camer… ▽ More

    Submitted 2 December, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

  43. Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark

    Authors: Brian K. S. Isaac-Medina, Matt Poyser, Daniel Organisciak, Chris G. Willcocks, Toby P. Breckon, Hubert P. H. Shum

    Abstract: Unmanned Aerial Vehicles (UAV) can pose a major risk for aviation safety, due to both negligent and malicious use. For this reason, the automated detection and tracking of UAV is a fundamental task in aerial security systems. Common technologies for UAV detection include visible-band and thermal infrared imaging, radio frequency and radar. Recent advances in deep neural networks (DNNs) for image-b… ▽ More

    Submitted 18 August, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

  44. arXiv:2103.09184  [pdf, other

    cs.RO cs.MA eess.SY

    Formation Control for UAVs Using a Flux Guided Approach

    Authors: John Hartley, Hubert P. H. Shum, Edmond S. L. Ho, He Wang, Subramanian Ramamoorthy

    Abstract: Existing studies on formation control for unmanned aerial vehicles (UAV) have not considered encircling targets where an optimum coverage of the target is required at all times. Such coverage plays a critical role in many real-world applications such as tracking hostile UAVs. This paper proposes a new path planning approach called the Flux Guided (FG) method, which generates collision-free traject… ▽ More

    Submitted 31 May, 2022; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: 37 pages, 9 figures, 3 table

  45. arXiv:2006.11620  [pdf, other

    cs.GR

    Technical Note: Generating Realistic Fighting Scenes by Game Tree

    Authors: Hubert P. H. Shum, Taku Komura

    Abstract: Recently, there have been a lot of researches to synthesize / edit the motion of a single avatar in the virtual environment. However, there has not been so much work of simulating continuous interactions of multiple avatars such as fighting. In this paper, we propose a new method to generate a realistic fighting scene based on motion capture data. We propose a new algorithm called the temporal exp… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

    Comments: 7 pages, 7 figures

    ACM Class: I.3.3

  46. arXiv:1910.08470  [pdf, other

    cs.CV cs.GR

    Illumination-Based Data Augmentation for Robust Background Subtraction

    Authors: Dimitrios Sakkos, Hubert P. H. Shum, Edmond S. L. Ho

    Abstract: A core challenge in background subtraction (BGS) is handling videos with sudden illumination changes in consecutive frames. In this paper, we tackle the problem from a data point-of-view using data augmentation. Our method performs data augmentation that not only creates endless data on the fly, but also features semantic transformations of illumination which enhance the generalisation of the mode… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: SKIMA 2019 - Best Paper Award

  47. arXiv:1908.07214  [pdf, other

    cs.GR cs.LG

    Spatio-temporal Manifold Learning for Human Motions via Long-horizon Modeling

    Authors: He Wang, Edmond S. L. Ho, Hubert P. H. Shum, Zhanxing Zhu

    Abstract: Data-driven modeling of human motions is ubiquitous in computer graphics and computer vision applications, such as synthesizing realistic motions or recognizing actions. Recent research has shown that such problems can be approached by learning a natural motion manifold using deep learning to address the shortcomings of traditional data-driven approaches. However, previous methods can be sub-optim… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: 12 pages, Accepted in IEEE Transaction on Visualization and Computer Graphics

    Journal ref: IEEE Transaction on Visualization and Computer Graphics, 2019