Skip to main content

Showing 1–14 of 14 results for author: Gilani, S Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.12006  [pdf, other

    cs.CV

    SCOL: Supervised Contrastive Ordinal Loss for Abdominal Aortic Calcification Scoring on Vertebral Fracture Assessment Scans

    Authors: Afsah Saleem, Zaid Ilyas, David Suter, Ghulam Mubashar Hassan, Siobhan Reid, John T. Schousboe, Richard Prince, William D. Leslie, Joshua R. Lewis, Syed Zulqarnain Gilani

    Abstract: Abdominal Aortic Calcification (AAC) is a known marker of asymptomatic Atherosclerotic Cardiovascular Diseases (ASCVDs). AAC can be observed on Vertebral Fracture Assessment (VFA) scans acquired using Dual-Energy X-ray Absorptiometry (DXA) machines. Thus, the automatic quantification of AAC on VFA DXA scans may be used to screen for CVD risks, allowing early interventions. In this research, we for… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: Accepted in conference MICCAI 2023

  2. arXiv:2202.10579  [pdf, other

    cs.CV

    Fast Semantic-Assisted Outlier Removal for Large-scale Point Cloud Registration

    Authors: Giang Truong, Huu Le, Alvaro Parra, Syed Zulqarnain Gilani, Syed M. S. Islam, David Suter

    Abstract: With current trends in sensors (cheaper, more volume of data) and applications (increasing affordability for new tasks, new ideas in what 3D data could be useful for); there is corresponding increasing interest in the ability to automatically, reliably, and cheaply, register together individual point clouds. The volume of data to handle, and still elusive need to have the registration occur fully… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

  3. arXiv:2112.00953  [pdf, other

    cs.CV cs.LG

    Maximum Consensus by Weighted Influences of Monotone Boolean Functions

    Authors: Erchuan Zhang, David Suter, Ruwan Tennakoon, Tat-Jun Chin, Alireza Bab-Hadiashar, Giang Truong, Syed Zulqarnain Gilani

    Abstract: Robust model fitting is a fundamental problem in computer vision: used to pre-process raw data in the presence of outliers. Maximisation of Consensus (MaxCon) is one of the most popular robust criteria and widely used. Recently (Tennakoon et al. CVPR2021), a connection has been made between MaxCon and estimation of influences of a Monotone Boolean function. Equip** the Boolean cube with differen… ▽ More

    Submitted 6 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  4. arXiv:2109.08043  [pdf

    cs.CV

    Generating Dataset For Large-scale 3D Facial Emotion Recognition

    Authors: Faizan Farooq Khan, Syed Zulqarnain Gilani

    Abstract: The tremendous development in deep learning has led facial expression recognition (FER) to receive much attention in the past few years. Although 3D FER has an inherent edge over its 2D counterpart, work on 2D images has dominated the field. The main reason for the slow development of 3D FER is the unavailability of large training and large test datasets. Recognition accuracies have already satura… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  5. arXiv:2103.03501  [pdf, other

    cs.CV cs.AI

    Unsupervised Learning for Robust Fitting:A Reinforcement Learning Approach

    Authors: Giang Truong, Huu Le, David Suter, Erchuan Zhang, Syed Zulqarnain Gilani

    Abstract: Robust model fitting is a core algorithm in a large number of computer vision applications. Solving this problem efficiently for datasets highly contaminated with outliers is, however, still challenging due to the underlying computational complexity. Recent literature has focused on learning-based algorithms. However, most approaches are supervised which require a large amount of labelled training… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: The preprint of paper accepted to CVPR 2021

  6. arXiv:1912.00202  [pdf, other

    cs.CV

    Relation Graph Network for 3D Object Detection in Point Clouds

    Authors: Mingtao Feng, Syed Zulqarnain Gilani, Yaonan Wang, Liang Zhang, Ajmal Mian

    Abstract: Convolutional Neural Networks (CNNs) have emerged as a powerful strategy for most object detection tasks on 2D images. However, their power has not been fully realised for detecting 3D objects in point clouds directly without converting them to regular grids. Existing state-of-art 3D object detection methods aim to recognize 3D objects individually without exploiting their relationships during lea… ▽ More

    Submitted 30 November, 2019; originally announced December 2019.

    Comments: Manuscript

  7. arXiv:1909.12663  [pdf, other

    cs.CV

    Point Attention Network for Semantic Segmentation of 3D Point Clouds

    Authors: Mingtao Feng, Liang Zhang, Xuefei Lin, Syed Zulqarnain Gilani, Ajmal Mian

    Abstract: Convolutional Neural Networks (CNNs) have performed extremely well on data represented by regularly arranged grids such as images. However, directly leveraging the classic convolution kernels or parameter sharing mechanisms on sparse 3D point clouds is inefficient due to their irregular and unordered nature. We propose a point attention network that learns rich local shape features and their conte… ▽ More

    Submitted 27 September, 2019; originally announced September 2019.

    Comments: Submitted to a journal

  8. arXiv:1902.10322  [pdf, other

    cs.CV

    Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning

    Authors: Nayyer Aafaq, Naveed Akhtar, Wei Liu, Syed Zulqarnain Gilani, Ajmal Mian

    Abstract: Automatic generation of video captions is a fundamental challenge in computer vision. Recent techniques typically employ a combination of Convolutional Neural Networks (CNNs) and Recursive Neural Networks (RNNs) for video captioning. These methods mainly focus on tailoring sequence learning through RNNs for better caption generation, whereas off-the-shelf visual features are borrowed from CNNs. We… ▽ More

    Submitted 29 April, 2019; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: Accepted in CVPR-2019 (Camera Ready)

  9. arXiv:1806.07272  [pdf, other

    cs.CV

    Unsupervised Deep Multi-focus Image Fusion

    Authors: Xiang Yan, Syed Zulqarnain Gilani, Hanlin Qin, Ajmal Mian

    Abstract: Convolutional neural networks have recently been used for multi-focus image fusion. However, due to the lack of labeled data for supervised training of such networks, existing methods have resorted to adding Gaussian blur in focused images to simulate defocus and generate synthetic training data with ground-truth for supervised learning. Moreover, they classify pixels as focused or defocused and l… ▽ More

    Submitted 19 June, 2018; originally announced June 2018.

  10. Video Description: A Survey of Methods, Datasets and Evaluation Metrics

    Authors: Nayyer Aafaq, Ajmal Mian, Wei Liu, Syed Zulqarnain Gilani, Mubarak Shah

    Abstract: Video description is the automatic generation of natural language sentences that describe the contents of a given video. It has applications in human-robot interaction, hel** the visually impaired and video subtitling. The past few years have seen a surge of research in this area due to the unprecedented success of deep learning in computer vision and natural language processing. Numerous method… ▽ More

    Submitted 2 March, 2020; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: Accepted by ACM Computing Surveys

    Journal ref: ACM Computing Surveys (CSUR) 52(6), 115 (2019)

  11. arXiv:1804.10021  [pdf, other

    cs.CV

    Deep Keyframe Detection in Human Action Videos

    Authors: Xiang Yan, Syed Zulqarnain Gilani, Hanlin Qin, Mingtao Feng, Liang Zhang, Ajmal Mian

    Abstract: Detecting representative frames in videos based on human actions is quite challenging because of the combined factors of human pose in action and the background. This paper addresses this problem and formulates the key frame detection as one of finding the video frames that optimally maximally contribute to differentiating the underlying action category from all other categories. To this end, we i… ▽ More

    Submitted 26 April, 2018; originally announced April 2018.

  12. arXiv:1711.05953  [pdf, other

    cs.CV

    3D Face Reconstruction from Light Field Images: A Model-free Approach

    Authors: Mingtao Feng, Syed Zulqarnain Gilani, Yaonan Wang, Ajmal Mian

    Abstract: Reconstructing 3D facial geometry from a single RGB image has recently instigated wide research interest. However, it is still an ill-posed problem and most methods rely on prior models hence undermining the accuracy of the recovered 3D faces. In this paper, we exploit the Epipolar Plane Images (EPI) obtained from light field cameras and learn CNN models that recover horizontal and vertical 3D fac… ▽ More

    Submitted 5 July, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

    Journal ref: European Conference on Computer Vision (ECCV), 2018

  13. Learning from Millions of 3D Scans for Large-scale 3D Face Recognition

    Authors: Syed Zulqarnain Gilani, Ajmal Mian

    Abstract: Deep networks trained on millions of facial images are believed to be closely approaching human-level performance in face recognition. However, open world face recognition still remains a challenge. Although, 3D face recognition has an inherent edge over its 2D counterpart, it has not benefited from the recent developments in deep learning due to the unavailability of large training as well as lar… ▽ More

    Submitted 5 July, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: 11 pages

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition, 2018

  14. Dense 3D Face Correspondence

    Authors: Syed Zulqarnain Gilani, Ajmal Mian, Faisal Shafait, Ian Reid

    Abstract: We present an algorithm that automatically establishes dense correspondences between a large number of 3D faces. Starting from automatically detected sparse correspondences on the outer boundary of 3D faces, the algorithm triangulates existing correspondences and expands them iteratively by matching points of distinctive surface curvature along the triangle edges. After exhausting keypoint matches… ▽ More

    Submitted 15 August, 2019; v1 submitted 19 October, 2014; originally announced October 2014.

    Comments: 24 Pages, 12 Figures, 6 Tables and 3 Algorithms

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(7), 2017