Skip to main content

Showing 1–4 of 4 results for author: Zhang, F Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02880  [pdf, other

    cs.LG cs.AI cs.CV

    Knowledge Composition using Task Vectors with Learned Anisotropic Scaling

    Authors: Frederic Z. Zhang, Paul Albert, Cristian Rodriguez-Opazo, Anton van den Hengel, Ehsan Abbasnejad

    Abstract: Pre-trained models produce strong generic representations that can be adapted via fine-tuning. The learned weight difference relative to the pre-trained model, known as a task vector, characterises the direction and stride of fine-tuning. The significance of task vectors is such that simple arithmetic operations on them can be used to combine diverse representations from different domains. This pa… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2308.06202  [pdf, other

    cs.CV cs.AI cs.LG

    Exploring Predicate Visual Context in Detecting Human-Object Interactions

    Authors: Frederic Z. Zhang, Yuhui Yuan, Dylan Campbell, Zhuoyao Zhong, Stephen Gould

    Abstract: Recently, the DETR framework has emerged as the dominant approach for human--object interaction (HOI) research. In particular, two-stage transformer-based HOI detectors are amongst the most performant and training-efficient approaches. However, these often condition HOI classification on object features that lack fine-grained contextual information, eschewing pose and orientation information in fa… ▽ More

    Submitted 7 November, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  3. arXiv:2112.01838  [pdf, other

    cs.CV cs.AI cs.LG

    Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer

    Authors: Frederic Z. Zhang, Dylan Campbell, Stephen Gould

    Abstract: Recent developments in transformer models for visual data have led to significant improvements in recognition and detection tasks. In particular, using learnable queries in place of region proposals has given rise to a new class of one-stage detection models, spearheaded by the Detection Transformer (DETR). Variations on this one-stage approach have since dominated human-object interaction (HOI) d… ▽ More

    Submitted 26 March, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR2022. 14 pages, 14 figures and 5 tables

  4. arXiv:2012.06060  [pdf, other

    cs.CV cs.AI cs.LG

    Spatially Conditioned Graphs for Detecting Human-Object Interactions

    Authors: Frederic Z. Zhang, Dylan Campbell, Stephen Gould

    Abstract: We address the problem of detecting human-object interactions in images using graphical neural networks. Unlike conventional methods, where nodes send scaled but otherwise identical messages to each of their neighbours, we propose to condition messages between pairs of nodes on their spatial relationships, resulting in different messages going to neighbours of the same node. To this end, we explor… ▽ More

    Submitted 17 August, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: Accepted to ICCV 2021