Showing 1–50 of 77 results for author: Barnes, N

Searching in archive cs.

  1. arXiv:2405.15622  [pdf, other]

    cs.CV

    LAM3D: Large Image-Point-Cloud Alignment Model for 3D Reconstruction from Single Image

    Authors: Ruikai Cui, Xibin Song, Weixuan Sun, Senbo Wang, Weizhe Liu, Shenzhou Chen, Taizhang Shang, Yang Li, Nick Barnes, Hongdong Li, Pan Ji

    Abstract: Large Reconstruction Models have made significant strides in the realm of automated 3D content generation from single or multiple input images. Despite their success, these models often produce 3D meshes with geometric inaccuracies, stemming from the inherent challenges of deducing 3D shapes solely from image data. In this work, we introduce a novel framework, the Large Image and Point Cloud Align…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 19 pages, 10 figures

  2. arXiv:2403.14821  [pdf, other]

    cs.CV

    Learning Gaussian Representation for Eye Fixation Prediction

    Authors: Peipei Song, Jing Zhang, Piotr Koniusz, Nick Barnes

    Abstract: Existing eye fixation prediction methods perform the mapping from input images to the corresponding dense fixation maps generated from raw fixation points. However, due to the stochastic nature of human fixation, the generated dense fixation maps may be a less-than-ideal representation of human fixation. To provide a robust fixation model, we introduce Gaussian Representation for eye fixation mode…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 11 pages, 7 figures

  3. arXiv:2308.04321  [pdf, other]

    cs.CV

    All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation

    Authors: Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, Nick Barnes

    Abstract: In this work, we propose a new transformer-based regularization to better localize objects for Weakly supervised semantic segmentation (WSSS). In image-level WSSS, Class Activation Map (CAM) is adopted to generate object localization as pseudo segmentation labels. To address the partial activation issue of the CAMs, consistency regularization is employed to maintain activation intensity invariance…

    Submitted 24 September, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: ICCV 2023 workshop, code released at: https://github.com/OpenNLPLab/ACR_WSSS

  4. arXiv:2307.16572  [pdf, other]

    cs.CV

    Transferable Attack for Semantic Segmentation

    Authors: Mengqi He, Jing Zhang, Zhaoyuan Yang, Mingyi He, Nick Barnes, Yuchao Dai

    Abstract: We analyze the performance of semantic segmentation models with respect to adversarial attacks, and observe that the adversarial examples generated from a source model fail to attack the target models, i.e., the conventional attack methods, such as PGD and FGSM, do not transfer well to target models, making it necessary to study the transferable attacks, especially transferable attacks for semantic segmentation.…

    Submitted 21 August, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: Source code is available at: https://github.com/anucvers/TASS

  5. arXiv:2307.14726  [pdf, other]

    cs.CV cs.GR

    P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds

    Authors: Ruikai Cui, Shi Qiu, Saeed Anwar, Jiawei Liu, Chaoyue Xing, Jing Zhang, Nick Barnes

    Abstract: Point cloud completion aims to recover the complete shape based on a partial observation. Existing methods require either complete point clouds or multiple partial observations of the same object for learning. In contrast to previous approaches, we present Partial2Complete (P2C), the first self-supervised framework that completes point cloud objects using training samples consisting of only a sing…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV 2023

  6. arXiv:2307.13539  [pdf, other]

    cs.CV cs.LG

    Model Calibration in Dense Classification with Adaptive Label Perturbation

    Authors: Jiawei Liu, Changkun Ye, Shan Wang, Ruikai Cui, Jing Zhang, Kaihao Zhang, Nick Barnes

    Abstract: For safety-related applications, it is crucial to produce trustworthy deep neural networks whose prediction is associated with confidence that can represent the likelihood of correctness for subsequent decision-making. Existing dense binary classification models are prone to being over-confident. To improve model calibration, we propose Adaptive Stochastic Label Perturbation (ASLP) which learns a…

    Submitted 2 August, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023

  7. arXiv:2307.09929  [pdf, other]

    cs.CV

    Measuring and Modeling Uncertainty Degree for Monocular Depth Estimation

    Authors: Mochu Xiang, Jing Zhang, Nick Barnes, Yuchao Dai

    Abstract: Effectively measuring and modeling the reliability of a trained model is essential to the real-world deployment of monocular depth estimation (MDE) models. However, the intrinsic ill-posedness and ordinal-sensitive nature of MDE pose major challenges to the estimation of uncertainty degree of the trained models. On the one hand, utilizing current uncertainty modeling methods may increase memory co…

    Submitted 19 July, 2023; originally announced July 2023.

  8. arXiv:2307.06435  [pdf, other]

    cs.CL

    A Comprehensive Overview of Large Language Models

    Authors: Humza Naveed, Asad Ullah Khan, Shi Qiu, Muhammad Saqib, Saeed Anwar, Muhammad Usman, Naveed Akhtar, Nick Barnes, Ajmal Mian

    Abstract: Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success of LLMs has led to a large influx of research contributions in this direction. These works encompass diverse topics such as architectural innovations, better training strategies, context length improvements, fine-tuning, multi-modal LLMs, robotics, datasets,…

    Submitted 9 April, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

  9. arXiv:2307.03376  [pdf, other]

    cs.CV

    Weakly-supervised Contrastive Learning for Unsupervised Object Discovery

    Authors: Yunqiu Lv, Jing Zhang, Nick Barnes, Yuchao Dai

    Abstract: Unsupervised object discovery (UOD) refers to the task of discriminating the whole region of objects from the background within a scene without relying on labeled datasets, which benefits the task of bounding-box-level localization and pixel-level segmentation. This task is promising due to its ability to discover objects in a generic manner. We roughly categorise existing techniques into two main…

    Submitted 7 July, 2023; originally announced July 2023.

  10. Rethinking Polyp Segmentation from an Out-of-Distribution Perspective

    Authors: Ge-Peng Ji, Jing Zhang, Dylan Campbell, Huan Xiong, Nick Barnes

    Abstract: Unlike existing fully-supervised approaches, we rethink colorectal polyp segmentation from an out-of-distribution perspective with a simple but effective self-supervised learning approach. We leverage the ability of masked autoencoders -- self-supervised vision transformers trained on a reconstruction task -- to learn in-distribution representations; here, the distribution of healthy colon images.…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: Technical report

    Journal ref: Machine Intelligence Research (2024)

  11. arXiv:2305.01586  [pdf, other]

    cs.CV

    An Alternative to WSSS? An Empirical Study of the Segment Anything Model (SAM) on Weakly-Supervised Semantic Segmentation Problems

    Authors: Weixuan Sun, Zheyuan Liu, Yanhao Zhang, Yiran Zhong, Nick Barnes

    Abstract: The Segment Anything Model (SAM) has demonstrated exceptional performance and versatility, making it a promising tool for various related tasks. In this report, we explore the application of SAM in Weakly-Supervised Semantic Segmentation (WSSS). Particularly, we adapt SAM as the pseudo-label generation pipeline given only the image-level class labels. While we observed impressive results in most c…

    Submitted 18 June, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: Technical report

  12. arXiv:2303.11302  [pdf, other]

    cs.CV

    Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning

    Authors: Weixuan Sun, Jiayi Zhang, Jianyuan Wang, Zheyuan Liu, Yiran Zhong, Tianpeng Feng, Yandong Guo, Yanhao Zhang, Nick Barnes

    Abstract: Self-supervised audio-visual source localization aims to locate sound-source objects in video frames without extra annotations. Recent methods often approach this goal with the help of contrastive learning, which assumes only the audio and visual contents from the same video are positive samples for each other. However, this assumption would suffer from false negative samples in real-world trainin…

    Submitted 25 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: CVPR2023

  13. arXiv:2303.00900  [pdf, other]

    cs.CV

    Transmission-Guided Bayesian Generative Model for Smoke Segmentation

    Authors: Siyuan Yan, Jing Zhang, Nick Barnes

    Abstract: Smoke segmentation is essential to precisely localize wildfire so that it can be extinguished in an early phase. Although deep neural networks have achieved promising results on image segmentation tasks, they are prone to be overconfident for smoke segmentation due to its non-rigid shape and transparent appearance. This is caused by both knowledge level uncertainty due to limited training data for…

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted by AAAI2022

  14. arXiv:2212.02011  [pdf, other]

    cs.CV

    PointCaM: Cut-and-Mix for Open-Set Point Cloud Learning

    Authors: Jie Hong, Shi Qiu, Weihao Li, Saeed Anwar, Mehrtash Harandi, Nick Barnes, Lars Petersson

    Abstract: Point cloud learning is receiving increasing attention; however, most existing point cloud models lack the practical ability to deal with the unavoidable presence of unknown objects. This paper mainly discusses point cloud learning under open-set settings, where we train the model without data from unknown classes and identify them in the inference stage. Basically, we propose to solve open-set po…

    Submitted 24 August, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

  15. arXiv:2211.06820  [pdf, other]

    cs.CV

    Energy-Based Residual Latent Transport for Unsupervised Point Cloud Completion

    Authors: Ruikai Cui, Shi Qiu, Saeed Anwar, Jing Zhang, Nick Barnes

    Abstract: Unsupervised point cloud completion aims to infer the whole geometry of a partial object observation without requiring partial-complete correspondence. Differing from existing deterministic approaches, we advocate generative modeling based unsupervised point cloud completion to explore the missing correspondence. Specifically, we propose a novel framework that performs completion by transforming a…

    Submitted 13 November, 2022; originally announced November 2022.

    Comments: BMVC 2022 paper

  16. arXiv:2210.10340  [pdf, other]

    cs.CL cs.LG

    The Devil in Linear Transformer

    Authors: Zhen Qin, XiaoDong Han, Weixuan Sun, Dongxu Li, Lingpeng Kong, Nick Barnes, Yiran Zhong

    Abstract: Linear transformers aim to reduce the quadratic space-time complexity of vanilla transformers. However, they usually suffer from degraded performance on various tasks and corpora. In this paper, we examine existing kernel-based linear transformers and identify two key issues that lead to such performance gaps: 1) unbounded gradients in the attention computation adversely impact the convergence of…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: accepted to EMNLP2022

  17. arXiv:2210.06120  [pdf, other]

    cs.CV

    Efficient Gaussian Process Model on Class-Imbalanced Datasets for Generalized Zero-Shot Learning

    Authors: Changkun Ye, Nick Barnes, Lars Petersson, Russell Tsuchida

    Abstract: Zero-Shot Learning (ZSL) models aim to classify object classes that are not seen during the training process. However, the problem of class imbalance is rarely discussed, despite its presence in several ZSL datasets. In this paper, we propose a Neural Network model that learns a latent feature embedding and a Gaussian Process (GP) regression model that predicts latent feature prototypes of unseen…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Paper Accepted in ICPR 2022

  18. arXiv:2208.09668  [pdf, other]

    cs.CV

    Generalised Co-Salient Object Detection

    Authors: Jiawei Liu, Jing Zhang, Ruikai Cui, Kaihao Zhang, Weihao Li, Nick Barnes

    Abstract: We propose a new setting that relaxes an assumption in the conventional Co-Salient Object Detection (CoSOD) setting by allowing the presence of "noisy images" which do not show the shared co-salient object. We call this new setting Generalised Co-Salient Object Detection (GCoSOD). We propose a novel random sampling based Generalised CoSOD Training (GCT) strategy to distill the awareness of inter-i…

    Submitted 11 August, 2023; v1 submitted 20 August, 2022; originally announced August 2022.

  19. arXiv:2206.10552  [pdf, other]

    cs.CV

    Vicinity Vision Transformer

    Authors: Weixuan Sun, Zhen Qin, Hui Deng, Jianyuan Wang, Yi Zhang, Kaihao Zhang, Nick Barnes, Stan Birchfield, Lingpeng Kong, Yiran Zhong

    Abstract: Vision transformers have shown great success on numerous computer vision tasks. However, their central component, softmax attention, prohibits vision transformers from scaling up to high-resolution images, due to both the computational complexity and memory footprint being quadratic. Although linear attention was introduced in natural language processing (NLP) tasks to mitigate a similar issue, dire…

    Submitted 20 July, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: code: https://github.com/OpenNLPLab/Vicinity-Vision-Transformer

  20. arXiv:2205.11333  [pdf, other]

    cs.CV

    Towards Deeper Understanding of Camouflaged Object Detection

    Authors: Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Nick Barnes, Deng-Ping Fan

    Abstract: Prey in the wild evolve to be camouflaged to avoid being recognized by predators. In this way, camouflage acts as a key defence mechanism across species that is critical to survival. To detect and segment the whole scope of a camouflaged object, camouflaged object detection (COD) is introduced as a binary segmentation task, with the binary ground truth camouflage map indicating the exact regions…

    Submitted 2 January, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: IEEE Transactions on Circuits and Systems for Video Technology 2023

  21. arXiv:2204.08803  [pdf, other]

    cs.CV

    An Energy-Based Prior for Generative Saliency

    Authors: Jing Zhang, Jianwen Xie, Nick Barnes, Ping Li

    Abstract: We propose a novel generative saliency prediction framework that adopts an informative energy-based model as a prior distribution. The energy-based prior model is defined on the latent space of a saliency generator network that generates the saliency map based on continuous latent variables and an observed image. Both the parameters of saliency generator and the energy-based prior are jointly tr…

    Submitted 27 June, 2023; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence 2023. arXiv admin note: text overlap with arXiv:2112.13528

  22. arXiv:2204.05604  [pdf, other]

    cs.CV

    Towards Open-Set Object Detection and Discovery

    Authors: Jiyang Zheng, Weihao Li, Jie Hong, Lars Petersson, Nick Barnes

    Abstract: With the human pursuit of knowledge, open-set object detection (OSOD) has been designed to identify unknown objects in a dynamic world. However, an issue with the current setting is that all the predicted unknown objects share the same category as "unknown", which requires incremental learning via a human-in-the-loop approach to label novel classes. In order to address this problem, we present a ne…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: CVPRW 2022

  23. arXiv:2112.14019  [pdf, other]

    cs.CV

    Semi-supervised Salient Object Detection with Effective Confidence Estimation

    Authors: Jiawei Liu, Jing Zhang, Nick Barnes

    Abstract: The success of existing salient object detection models relies on a large pixel-wise labeled training dataset, which is time-consuming and expensive to obtain. We study semi-supervised salient object detection, with access to a small number of labeled samples and a large number of unlabeled samples. Specifically, we present a pseudo label based learning framework with a Conditional Energy-based M…

    Submitted 25 November, 2023; v1 submitted 28 December, 2021; originally announced December 2021.

  24. arXiv:2112.13528  [pdf, other]

    cs.CV

    Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction

    Authors: Jing Zhang, Jianwen Xie, Nick Barnes, Ping Li

    Abstract: Vision transformer networks have shown superiority in many computer vision tasks. In this paper, we take a step further by proposing a novel generative vision transformer with latent variables following an informative energy-based prior for salient object detection. Both the vision transformer network and the energy-based prior model are jointly trained via Markov chain Monte Carlo-based maximum l…

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: NeurIPS 2021

  25. arXiv:2112.02841  [pdf, other]

    cs.CV

    GETAM: Gradient-weighted Element-wise Transformer Attention Map for Weakly-supervised Semantic segmentation

    Authors: Weixuan Sun, Jing Zhang, Zheyuan Liu, Yiran Zhong, Nick Barnes

    Abstract: Weakly Supervised Semantic Segmentation (WSSS) is challenging, particularly when image-level labels are used to supervise pixel level prediction. To bridge their gap, a Class Activation Map (CAM) is usually generated to provide pixel level pseudo labels. CAMs in Convolutional Neural Networks suffer from partial activation, i.e., only the most discriminative regions are activated. Transformer based me…

    Submitted 10 May, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  26. arXiv:2111.14160  [pdf, other]

    cs.CV

    Learning To Segment Dominant Object Motion From Watching Videos

    Authors: Sahir Shrestha, Mohammad Ali Armin, Hongdong Li, Nick Barnes

    Abstract: Existing deep learning based unsupervised video object segmentation methods still rely on ground-truth segmentation masks to train. Unsupervised in this context only means that no annotated frames are used during inference. As obtaining ground-truth segmentation masks for real image scenes is a laborious task, we envision a simple framework for dominant moving object segmentation that neither requ…

    Submitted 28 November, 2021; originally announced November 2021.

    Comments: DICTA 2021

  27. arXiv:2111.12242  [pdf, other]

    cs.CV

    PU-Transformer: Point Cloud Upsampling Transformer

    Authors: Shi Qiu, Saeed Anwar, Nick Barnes

    Abstract: Given the rapid development of 3D scanners, point clouds are becoming popular in AI-driven machines. However, point cloud data is inherently sparse and irregular, causing significant difficulties for machine perception. In this work, we focus on the point cloud upsampling task that intends to generate dense high-fidelity point clouds from sparse input data. Specifically, to activate the transforme…

    Submitted 3 October, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: ACCV 2022

  28. arXiv:2111.11055  [pdf, other]

    cs.CV

    Dense Uncertainty Estimation via an Ensemble-based Conditional Latent Variable Model

    Authors: Jing Zhang, Yuchao Dai, Mehrtash Harandi, Yiran Zhong, Nick Barnes, Richard Hartley

    Abstract: Uncertainty estimation has been extensively studied in recent literature, which can usually be classified as aleatoric uncertainty and epistemic uncertainty. In current aleatoric uncertainty estimation frameworks, it is often neglected that the aleatoric uncertainty is an inherent attribute of the data and can only be correctly estimated with an unbiased oracle model. Since the oracle model is ina…

    Submitted 22 November, 2021; originally announced November 2021.

  29. arXiv:2110.14309  [pdf, other]

    cs.CV

    Inferring the Class Conditional Response Map for Weakly Supervised Semantic Segmentation

    Authors: Weixuan Sun, Jing Zhang, Nick Barnes

    Abstract: Image-level weakly supervised semantic segmentation (WSSS) relies on class activation maps (CAMs) for pseudo labels generation. As CAMs only highlight the most discriminative regions of objects, the generated pseudo labels are usually unsatisfactory to serve directly as supervision. To solve this, most existing approaches follow a multi-training pipeline to refine CAMs for better pseudo-labels, wh…

    Submitted 27 October, 2021; originally announced October 2021.

    Journal ref: WACV 2022

  30. arXiv:2110.06427  [pdf, other]

    cs.LG cs.CV

    Dense Uncertainty Estimation

    Authors: Jing Zhang, Yuchao Dai, Mochu Xiang, Deng-Ping Fan, Peyman Moghadam, Mingyi He, Christian Walder, Kaihao Zhang, Mehrtash Harandi, Nick Barnes

    Abstract: Deep neural networks can be roughly divided into deterministic neural networks and stochastic neural networks. The former is usually trained to achieve a mapping from input space to output space via maximum likelihood estimation for the weights, which leads to deterministic predictions during testing. In this way, a specific weights set is estimated while ignoring any uncertainty that may occur in…

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: Technical Report

  31. arXiv:2109.07246  [pdf, other]

    cs.CV

    RGB-D Saliency Detection via Cascaded Mutual Information Minimization

    Authors: Jing Zhang, Deng-Ping Fan, Yuchao Dai, Xin Yu, Yiran Zhong, Nick Barnes, Ling Shao

    Abstract: Existing RGB-D saliency detection models do not explicitly encourage RGB and depth to achieve effective multi-modal learning. In this paper, we introduce a novel multi-stage cascaded learning framework via mutual information minimization to "explicitly" model the multi-modal information between RGB image and depth data. Specifically, we first map the feature of each mode to a lower dimensional fea…

    Submitted 5 January, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted as ICCV2021 paper

  32. arXiv:2108.07378  [pdf, other]

    cs.CV

    PnP-3D: A Plug-and-Play for 3D Point Clouds

    Authors: Shi Qiu, Saeed Anwar, Nick Barnes

    Abstract: With the help of the deep learning paradigm, many point cloud networks have been invented for visual analysis. However, there is great potential for development of these networks since the given information of point cloud data has not been fully exploited. To improve the effectiveness of existing networks in analyzing point cloud data, we propose a plug-and-play module, PnP-3D, aiming to refine th…

    Submitted 16 December, 2021; v1 submitted 16 August, 2021; originally announced August 2021.

  33. arXiv:2106.13389  [pdf, other]

    cs.CV

    Energy-Based Generative Cooperative Saliency Prediction

    Authors: Jing Zhang, Jianwen Xie, Zilong Zheng, Nick Barnes

    Abstract: Conventional saliency prediction models typically learn a deterministic mapping from an image to its saliency map, and thus fail to explain the subjective nature of human attention. In this paper, to model the uncertainty of visual saliency, we study the saliency prediction problem from the perspective of generative models by learning a conditional probability distribution over the saliency map gi…

    Submitted 23 June, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

    Journal ref: The Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI) 2022

  34. arXiv:2106.11641  [pdf, other]

    cs.CV

    Confidence-Aware Learning for Camouflaged Object Detection

    Authors: Jiawei Liu, Jing Zhang, Nick Barnes

    Abstract: Confidence-aware learning is proven as an effective solution to prevent networks becoming overconfident. We present a confidence-aware camouflaged object detection framework using dynamic supervision to produce both accurate camouflage map and meaningful "confidence" representing model awareness about the current prediction. A camouflaged object detection network is designed to produce our camoufl…

    Submitted 22 June, 2021; originally announced June 2021.

  35. arXiv:2104.10127  [pdf, other]

    cs.CV

    Generative Transformer for Accurate and Reliable Salient Object Detection

    Authors: Yuxin Mao, Jing Zhang, Zhexiong Wan, Yuchao Dai, Aixuan Li, Yunqiu Lv, Xinyu Tian, Deng-Ping Fan, Nick Barnes

    Abstract: Transformer, which originates from machine translation, is particularly powerful at modeling long-range dependencies. Currently, the transformer is making revolutionary progress in various vision tasks, leading to significant performance improvements compared with the convolutional neural network (CNN) based frameworks. In this paper, we conduct extensive research on exploiting the contributions o…

    Submitted 30 December, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: Technical report, 18 pages, 17 figures

  36. Learning structure-aware semantic segmentation with image-level supervision

    Authors: Jiawei Liu, Jing Zhang, Yicong Hong, Nick Barnes

    Abstract: Compared with expensive pixel-wise annotations, image-level labels make it possible to learn semantic segmentation in a weakly-supervised manner. Within this pipeline, the class activation map (CAM) is obtained and further processed to serve as a pseudo label to train the semantic segmentation model in a fully-supervised manner. In this paper, we argue that the lost structure information in CAM li…

    Submitted 14 April, 2021; originally announced April 2021.

    Journal ref: 2021 International Joint Conference on Neural Networks (IJCNN)

  37. arXiv:2104.02391  [pdf, other]

    cs.CV

    Weakly Supervised Video Salient Object Detection

    Authors: Wangbo Zhao, Jing Zhang, Long Li, Nick Barnes, Nian Liu, Junwei Han

    Abstract: Significant performance improvement has been achieved for fully-supervised video salient object detection with the pixel-wise labeled training datasets, which are time-consuming and expensive to obtain. To relieve the burden of data annotation, we present the first weakly supervised video salient object detection model based on relabeled "fixation guided scribble annotations". Specifically, an "Ap…

    Submitted 6 April, 2021; originally announced April 2021.

    Journal ref: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

  38. arXiv:2103.07074  [pdf, other]

    cs.CV

    Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion

    Authors: Shi Qiu, Saeed Anwar, Nick Barnes

    Abstract: Given the prominence of current 3D sensors, a fine-grained analysis on the basic point cloud data is worthy of further investigation. Particularly, real point cloud scenes can intuitively capture complex surroundings in the real world, but due to 3D data's raw nature, it is very challenging for machine perception. In this work, we concentrate on the essential visual task, semantic segmentation, fo…

    Submitted 13 April, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted in CVPR2021

  39. arXiv:2103.04011  [pdf, other]

    cs.CV

    Simultaneously Localize, Segment and Rank the Camouflaged Objects

    Authors: Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, Deng-Ping Fan

    Abstract: Camouflage is a key defence mechanism across species that is critical to survival. Common strategies for camouflage include background matching, imitating the color and pattern of the environment, and disruptive coloration, disguising body outlines [35]. Camouflaged object detection (COD) aims to segment camouflaged objects hiding in their surroundings. Existing COD models are built upon binary gr…

    Submitted 13 April, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: Accepted to IEEE/CVF CVPR 2021. Our code and dataset are publicly available at https://github.com/JingZhang617/COD-Rank-Localize-and-Segment

  40. Recursive Training for Zero-Shot Semantic Segmentation

    Authors: Ce Wang, Moshiur Farazi, Nick Barnes

    Abstract: General purpose semantic segmentation relies on a backbone CNN network to extract discriminative features that help classify each image pixel into a 'seen' object class (i.e., the object classes available during training) or a background class. Zero-shot semantic segmentation is a challenging task that requires a computer vision model to identify image pixels belonging to an object class which it h…

    Submitted 26 February, 2021; originally announced March 2021.

    Journal ref: 2021 International Joint Conference on Neural Networks (IJCNN)

  41. arXiv:2102.03509  [pdf, other]

    cs.LG stat.ML

    Robust normalizing flows using Bernstein-type polynomials

    Authors: Sameera Ramasinghe, Kasun Fernando, Salman Khan, Nick Barnes

    Abstract: Modeling real-world distributions can often be challenging due to sample data that are subjected to perturbations, e.g., instrumentation errors, or added random noise. Since flow models are typically nonlinear algorithms, they amplify these initial errors, leading to poor generalizations. This paper proposes a framework to construct Normalizing Flows (NF), which demonstrates higher robustness agai…

    Submitted 9 October, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

  42. arXiv:2012.06020  [pdf, other]

    cs.CV

    Uncertainty-Aware Deep Calibrated Salient Object Detection

    Authors: Jing Zhang, Yuchao Dai, Xin Yu, Mehrtash Harandi, Nick Barnes, Richard Hartley

    Abstract: Existing deep neural network based salient object detection (SOD) methods mainly focus on pursuing high network accuracy. However, those methods overlook the gap between network accuracy and prediction confidence, known as the confidence uncalibration problem. Thus, state-of-the-art SOD networks are prone to be overconfident. In other words, the predicted confidence of the networks does not reflec…

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: Completed in 2019

  43. arXiv:2012.00242  [pdf, other]

    cs.CV

    3D Guided Weakly Supervised Semantic Segmentation

    Authors: Weixuan Sun, Jing Zhang, Nick Barnes

    Abstract: Pixel-wise clean annotation is necessary for fully-supervised semantic segmentation, which is laborious and expensive to obtain. In this paper, we propose a weakly supervised 2D semantic segmentation model by incorporating sparse bounding box labels with available 3D information, which is much easier to obtain with advanced sensors. We manually labeled a subset of the 2D-3D Semantics (2D-3D-S) data…

    Submitted 30 November, 2020; originally announced December 2020.

  44. arXiv:2011.13055  [pdf, other

    cs.CV

    Rethinking conditional GAN training: An approach using geometrically structured latent manifolds

    Authors: Sameera Ramasinghe, Moshiur Farazi, Salman Khan, Nick Barnes, Stephen Gould

    Abstract: Conditional GANs (cGAN), in their rudimentary form, suffer from critical drawbacks such as the lack of diversity in generated outputs and distortion between the latent and output manifolds. Although efforts have been made to improve results, they can suffer from unpleasant side-effects such as the topology mismatch between latent and output spaces. In contrast, we tackle this problem from a geomet…

    Submitted 2 June, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

  45. arXiv:2010.03132  [pdf, other

    cs.LG cs.CV

    Conditional Generative Modeling via Learning the Latent Space

    Authors: Sameera Ramasinghe, Kanchana Ranasinghe, Salman Khan, Nick Barnes, Stephen Gould

    Abstract: Although deep learning has achieved appealing results on several machine learning tasks, most of the models are deterministic at inference, limiting their application to single-modal settings. We propose a novel general-purpose framework for conditional generation in multimodal spaces, that uses latent variables to model generalizable learning patterns while minimizing a family of regression cost…

    Submitted 8 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

  46. arXiv:2010.01725  [pdf, other

    cs.CV cs.AI

    Attention Guided Semantic Relationship Parsing for Visual Question Answering

    Authors: Moshiur Farazi, Salman Khan, Nick Barnes

    Abstract: Humans explain inter-object relationships with semantic labels that demonstrate a high-level understanding required to perform complex Vision-Language tasks such as Visual Question Answering (VQA). However, existing VQA models represent relationships as a combination of object-level visual features which constrain a model to express interactions between objects in a single domain, while the model…

    Submitted 4 October, 2020; originally announced October 2020.

  47. arXiv:2009.03075  [pdf, other

    cs.CV

    Uncertainty Inspired RGB-D Saliency Detection

    Authors: Jing Zhang, Deng-Ping Fan, Yuchao Dai, Saeed Anwar, Fatemeh Saleh, Sadegh Aliakbarian, Nick Barnes

    Abstract: We propose the first stochastic framework to employ uncertainty for RGB-D saliency detection by learning from the data labeling process. Existing RGB-D saliency detection models treat this task as a point estimation problem by predicting a single saliency map following a deterministic learning pipeline. We argue that, however, the deterministic solution is relatively ill-posed. Inspired by the sal…

    Submitted 7 September, 2020; originally announced September 2020.

  48. arXiv:2007.12211  [pdf, other

    cs.CV

    Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection

    Authors: Jing Zhang, Jianwen Xie, Nick Barnes

    Abstract: In this paper, we propose a noise-aware encoder-decoder framework to disentangle a clean saliency predictor from noisy training examples, where the noisy labels are generated by unsupervised handcrafted feature-based methods. The proposed model consists of two sub-models parameterized by neural networks: (1) a saliency predictor that maps input images to clean saliency maps, and (2) a noise genera…

    Submitted 23 July, 2020; originally announced July 2020.

    Comments: ECCV2020

  49. arXiv:2005.06734  [pdf, other

    cs.CV

    Dense-Resolution Network for Point Cloud Classification and Segmentation

    Authors: Shi Qiu, Saeed Anwar, Nick Barnes

    Abstract: Point cloud analysis is attracting attention from Artificial Intelligence research since it can be widely used in applications such as robotics, Augmented Reality, self-driving. However, it is always challenging due to irregularities, unorderedness, and sparsity. In this article, we propose a novel network named Dense-Resolution Network (DRNet) for point cloud analysis. Our DRNet is designed to le…

    Submitted 17 November, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

    Comments: To appear in WACV2021. Codes and models are available at: https://github.com/ShiQiu0419/DRNet

  50. arXiv:2004.13524  [pdf, other

    cs.CV cs.LG eess.IV

    Attention Based Real Image Restoration

    Authors: Saeed Anwar, Nick Barnes, Lars Petersson

    Abstract: Deep convolutional neural networks perform better on images containing spatially invariant degradations, also known as synthetic degradations; however, their performance is limited on real-degraded photographs and requires multiple-stage network modeling. To advance the practicability of restoration algorithms, this paper proposes a novel single-stage blind real image restoration network (R$^2$Net…

    Submitted 1 October, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1904.07396