Skip to main content

Showing 1–50 of 66 results for author: Hospedales, T M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.03879  [pdf, other

    cs.LG cs.CV

    Better Practices for Domain Adaptation

    Authors: Linus Ericsson, Da Li, Timothy M. Hospedales

    Abstract: Distribution shifts are all too common in real-world applications of machine learning. Domain adaptation (DA) aims to address this by providing various frameworks for adapting models to the deployment data without using labels. However, the domain shift scenario raises a second more subtle challenge: the difficulty of performing hyperparameter optimisation (HPO) for these adaptation algorithms wit… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: AutoML 2023 (Best paper award)

  2. arXiv:2303.14411  [pdf, other

    cs.CV cs.AI

    Fairness meets Cross-Domain Learning: a new perspective on Models and Metrics

    Authors: Leonardo Iurada, Silvia Bucci, Timothy M. Hospedales, Tatiana Tommasi

    Abstract: Deep learning-based recognition systems are deployed at scale for several real-world applications that inevitably involve our social life. Although being of great support when making complex decisions, they might capture spurious data correlations and leverage sensitive attributes (e.g. age, gender, ethnicity). How to factor out this information while kee** a high prediction performance is a tas… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

  3. arXiv:2212.05611  [pdf, other

    cs.CV

    Accelerating Self-Supervised Learning via Efficient Training Strategies

    Authors: Mustafa Taha Koçyiğit, Timothy M. Hospedales, Hakan Bilen

    Abstract: Recently the focus of the computer vision community has shifted from expensive supervised learning towards self-supervised learning of visual representations. While the performance gap between supervised and self-supervised has been narrowing, the time for training self-supervised deep networks remains an order of magnitude larger than its supervised counterparts, which hinders progress, imposes c… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  4. arXiv:2206.04920  [pdf, other

    cs.LG

    Fisher SAM: Information Geometry and Sharpness Aware Minimisation

    Authors: Minyoung Kim, Da Li, Shell Xu Hu, Timothy M. Hospedales

    Abstract: Recent sharpness-aware minimisation (SAM) is known to find flat minima which is beneficial for better generalisation with improved robustness. SAM essentially modifies the loss function by reporting the maximum loss value within the small neighborhood around the current iterate. However, it uses the Euclidean ball to define the neighborhood, which can be inaccurate since loss functions for neural… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  5. arXiv:2204.07305  [pdf, other

    cs.CV cs.LG

    Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference

    Authors: Shell Xu Hu, Da Li, Jan Stühmer, Minyoung Kim, Timothy M. Hospedales

    Abstract: Few-shot learning (FSL) is an important and topical problem in computer vision that has motivated extensive research into numerous methods spanning from sophisticated meta-learning methods to simple transfer learning baselines. We seek to push the limits of a simple-but-effective pipeline for more realistic and practical settings of few-shot image classification. To this end, we explore few-shot l… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR2022

  6. arXiv:2203.02711  [pdf, other

    cs.LG math.OC

    Meta Mirror Descent: Optimiser Learning for Fast Convergence

    Authors: Boyan Gao, Henry Gouk, Hae Beom Lee, Timothy M. Hospedales

    Abstract: Optimisers are an essential component for training machine learning models, and their design influences learning speed and generalisation. Several studies have attempted to learn more effective gradient-descent optimisers via solving a bi-level optimisation problem where generalisation error is minimised with respect to optimiser parameters. However, most existing optimiser learning methods are in… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

  7. arXiv:2111.11398  [pdf, other

    cs.CV

    Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks

    Authors: Linus Ericsson, Henry Gouk, Timothy M. Hospedales

    Abstract: Self-supervised learning is a powerful paradigm for representation learning on unlabelled images. A wealth of effective new methods based on instance matching rely on data-augmentation to drive learning, and these have reached a rough agreement on an augmentation scheme that optimises popular recognition benchmarks. However, there is strong reason to suspect that different tasks in computer vision… ▽ More

    Submitted 10 October, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Code available at https://github.com/linusericsson/ssl-invariances

  8. arXiv:2110.09327  [pdf, other

    cs.LG cs.CV stat.ML

    Self-Supervised Representation Learning: Introduction, Advances and Challenges

    Authors: Linus Ericsson, Henry Gouk, Chen Change Loy, Timothy M. Hospedales

    Abstract: Self-supervised representation learning methods aim to provide powerful deep feature learning without the requirement of large annotated datasets, thus alleviating the annotation bottleneck that is one of the main barriers to practical deployment of deep learning today. These methods have advanced rapidly in recent years, with their efficacy approaching and sometimes surpassing fully supervised pr… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  9. arXiv:2105.08237  [pdf, other

    cs.CV

    Towards Unsupervised Sketch-based Image Retrieval

    Authors: Conghui Hu, Yongxin Yang, Yunpeng Li, Timothy M. Hospedales, Yi-Zhe Song

    Abstract: The practical value of existing supervised sketch-based image retrieval (SBIR) algorithms is largely limited by the requirement for intensive data collection and labeling. In this paper, we present the first attempt at unsupervised SBIR to remove the labeling cost (both category annotations and sketch-photo pairings) that is conventionally needed for training. Existing single-domain unsupervised r… ▽ More

    Submitted 18 November, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

  10. arXiv:2103.13716  [pdf, other

    cs.CV

    Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting

    Authors: Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song

    Abstract: Self-supervised learning has gained prominence due to its efficacy at learning powerful representations from unlabelled data that achieve excellent performance on many challenging downstream tasks. However supervision-free pre-text tasks are challenging to design and usually modality specific. Although there is a rich literature of self-supervised methods for either spatial (such as images) or tem… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021 Code : https://github.com/AyanKumarBhunia/Self-Supervised-Learning-for-Sketch

  11. arXiv:2103.00243  [pdf, other

    cs.LG

    Searching for Robustness: Loss Learning for Noisy Classification Tasks

    Authors: Boyan Gao, Henry Gouk, Timothy M. Hospedales

    Abstract: We present a "learning to learn" approach for automatically constructing white-box classification loss functions that are robust to label noise in the training data. We parameterize a flexible family of loss functions using Taylor polynomials, and apply evolutionary strategies to search for noise-robust losses in this space. To learn re-usable loss functions that can apply to new tasks, our fitnes… ▽ More

    Submitted 27 February, 2021; originally announced March 2021.

  12. arXiv:2101.11296  [pdf, other

    cs.LG cs.AI

    FedH2L: Federated Learning with Model and Statistical Heterogeneity

    Authors: Yiying Li, Wei Zhou, Huaimin Wang, Haibo Mi, Timothy M. Hospedales

    Abstract: Federated learning (FL) enables distributed participants to collectively learn a strong global model without sacrificing their individual data privacy. Mainstream FL approaches require each participant to share a common network architecture and further assume that data are are sampled IID across participants. However, in real-world deployments participants may require heterogeneous network archite… ▽ More

    Submitted 27 July, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

  13. arXiv:2012.05473  [pdf, other

    cs.CV

    Tensor Composition Net for Visual Relationship Prediction

    Authors: Yuting Qiang, Yongxin Yang, Xueting Zhang, Yanwen Guo, Timothy M. Hospedales

    Abstract: We present a novel Tensor Composition Net (TCN) to predict visual relationships in images. Visual Relationship Prediction (VRP) provides a more challenging test of image understanding than conventional image tagging and is difficult to learn due to a large label-space and incomplete annotation. The key idea of our TCN is to exploit the low-rank property of the visual relationship tensor, so as to… ▽ More

    Submitted 9 February, 2022; v1 submitted 10 December, 2020; originally announced December 2020.

    Journal ref: BMVC 2021

  14. arXiv:2011.13377  [pdf, other

    cs.CV

    How Well Do Self-Supervised Models Transfer?

    Authors: Linus Ericsson, Henry Gouk, Timothy M. Hospedales

    Abstract: Self-supervised visual representation learning has seen huge progress recently, but no large scale evaluation has compared the many models now available. We evaluate the transfer performance of 13 top self-supervised models on 40 downstream tasks, including many-shot and few-shot recognition, object detection, and dense prediction. We compare their performance to a supervised baseline and show tha… ▽ More

    Submitted 29 March, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: CVPR 2021. Code available at https://github.com/linusericsson/ssl-transfer

  15. arXiv:2007.04101  [pdf, other

    cs.CV

    On Learning Semantic Representations for Million-Scale Free-Hand Sketches

    Authors: Peng Xu, Yongye Huang, Tongtong Yuan, Tao Xiang, Timothy M. Hospedales, Yi-Zhe Song, Liang Wang

    Abstract: In this paper, we study learning semantic representations for million-scale free-hand sketches. This is highly challenging due to the domain-unique traits of sketches, e.g., diverse, sparse, abstract, noisy. We propose a dual-branch CNNRNN network architecture to represent sketches, which simultaneously encodes both the static and temporal patterns of sketch strokes. Based on this architecture, we… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1804.01401

  16. arXiv:2006.12360  [pdf, other

    cs.LG stat.ML

    Don't Wait, Just Weight: Improving Unsupervised Representations by Learning Goal-Driven Instance Weights

    Authors: Linus Ericsson, Henry Gouk, Timothy M. Hospedales

    Abstract: In the absence of large labelled datasets, self-supervised learning techniques can boost performance by learning useful representations from unlabelled data, which is often more readily available. However, there is often a domain shift between the unlabelled collection and the downstream target problem data. We show that by learning Bayesian instance weights for the unlabelled data, we can improve… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

  17. arXiv:2003.05334  [pdf, other

    cs.LG stat.ML

    Online Meta-Critic Learning for Off-Policy Actor-Critic Methods

    Authors: Wei Zhou, Yiying Li, Yongxin Yang, Huaimin Wang, Timothy M. Hospedales

    Abstract: Off-Policy Actor-Critic (Off-PAC) methods have proven successful in a variety of continuous control tasks. Normally, the critic's action-value function is updated using temporal-difference, and the critic in turn provides a loss for the actor that trains it to take actions with higher expected return. In this paper, we introduce a novel and flexible meta-critic that observes the learning process a… ▽ More

    Submitted 1 November, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: NeurIPS 2020

  18. arXiv:2003.01063  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Unlimited Resolution Image Generation with R2D2-GANs

    Authors: Marija Jegorova, Antti Ilari Karjalainen, Jose Vazquez, Timothy M. Hospedales

    Abstract: In this paper we present a novel simulation technique for generating high quality images of any predefined resolution. This method can be used to synthesize sonar scans of size equivalent to those collected during a full-length mission, with across track resolutions of any chosen magnitude. In essence, our model extends Generative Adversarial Networks (GANs) based architecture into a conditional r… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: Accepted to 2020 IEEE OCEANS (Singapore)

  19. arXiv:2002.10310  [pdf, other

    cs.CV

    Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval

    Authors: Ayan Kumar Bhunia, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song

    Abstract: Fine-grained sketch-based image retrieval (FG-SBIR) addresses the problem of retrieving a particular photo instance given a user's query sketch. Its widespread applicability is however hindered by the fact that drawing a sketch takes time, and most people struggle to draw a complete and faithful sketch. In this paper, we reformulate the conventional FG-SBIR framework to tackle these challenges, wi… ▽ More

    Submitted 11 May, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2020 [Oral Presentation] Code: https://github.com/AyanKumarBhunia/on-the-fly-FGSBIR

  20. arXiv:2002.09461  [pdf, other

    cs.CV cs.MM

    Fine-Grained Instance-Level Sketch-Based Video Retrieval

    Authors: Peng Xu, Kun Liu, Tao Xiang, Timothy M. Hospedales, Zhanyu Ma, Jun Guo, Yi-Zhe Song

    Abstract: Existing sketch-analysis work studies sketches depicting static objects or scenes. In this work, we propose a novel cross-modal retrieval problem of fine-grained instance-level sketch-based video retrieval (FG-SBVR), where a sketch sequence is used as a query to retrieve a specific target video instance. Compared with sketch-based still image retrieval, and coarse-grained category-level video retr… ▽ More

    Submitted 21 February, 2020; originally announced February 2020.

  21. arXiv:2002.08253  [pdf, ps, other

    stat.ML cs.LG

    Distance-Based Regularisation of Deep Networks for Fine-Tuning

    Authors: Henry Gouk, Timothy M. Hospedales, Massimiliano Pontil

    Abstract: We investigate approaches to regularisation during fine-tuning of deep neural networks. First we provide a neural network generalisation bound based on Rademacher complexity that uses the distance the weights have moved from their initial values. This bound has no direct dependence on the number of weights and compares favourably to other bounds when applied to convolutional networks. Our bound is… ▽ More

    Submitted 15 January, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

  22. arXiv:2001.02600  [pdf, other

    cs.CV cs.GR cs.LG

    Deep Learning for Free-Hand Sketch: A Survey

    Authors: Peng Xu, Timothy M. Hospedales, Qiyue Yin, Yi-Zhe Song, Tao Xiang, Liang Wang

    Abstract: Free-hand sketches are highly illustrative, and have been widely used by humans to depict objects or stories from ancient times to the present. The recent prevalence of touchscreen devices has made sketch creation a much easier task than ever and consequently made sketch-oriented applications increasingly popular. The progress of deep learning has immensely benefited free-hand sketch research and… ▽ More

    Submitted 1 February, 2022; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: This paper is accepted by IEEE TPAMI

  23. arXiv:1910.08031  [pdf, ps, other

    cs.LG stat.ML

    Deep clustering with concrete k-means

    Authors: Boyan Gao, Yongxin Yang, Henry Gouk, Timothy M. Hospedales

    Abstract: We address the problem of simultaneously learning a k-means clustering and deep feature representation from unlabelled data, which is of interest due to the potential of deep k-means to outperform traditional two-step feature extraction and shallow-clustering strategies. We achieve this by develo** a gradient-estimator for the non-differentiable k-means objective via the Gumbel-Softmax reparamet… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

  24. arXiv:1907.12336  [pdf, other

    cs.CV

    Goal-Driven Sequential Data Abstraction

    Authors: Umar Riaz Muhammad, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song

    Abstract: Automatic data abstraction is an important capability for both benchmarking machine intelligence and supporting summarization applications. In the former one asks whether a machine can `understand' enough about the meaning of input data to produce a meaningful but more compact abstraction. In the latter this capability is exploited for saving space or human time by summarizing the essence of input… ▽ More

    Submitted 8 August, 2019; v1 submitted 29 July, 2019; originally announced July 2019.

    Comments: Accepted at ICCV 2019

  25. arXiv:1907.06291  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Measuring the Transferability of Adversarial Examples

    Authors: Deyan Petrov, Timothy M. Hospedales

    Abstract: Adversarial examples are of wide concern due to their impact on the reliability of contemporary machine learning systems. Effective adversarial examples are mostly found via white-box attacks. However, in some cases they can be transferred across models, thus enabling them to attack black-box models. In this work we evaluate the transferability of three adversarial attacks - the Fast Gradient Sign… ▽ More

    Submitted 14 July, 2019; originally announced July 2019.

  26. arXiv:1905.03422  [pdf, ps, other

    cs.CV

    Frustratingly Easy Person Re-Identification: Generalizing Person Re-ID in Practice

    Authors: Jieru Jia, Qiuqi Ruan, Timothy M. Hospedales

    Abstract: Contemporary person re-identification (\reid) methods usually require access to data from the deployment camera network during training in order to perform well. This is because contemporary \reid{} models trained on one dataset do not generalise to other camera networks due to the domain-shift between datasets. This requirement is often the bottleneck for deploying \reid{} systems in practical se… ▽ More

    Submitted 22 July, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

    Comments: 14 pages,2 figures

  27. arXiv:1902.07015  [pdf, other

    cs.LG cs.AI stat.ML

    Investigating Generalisation in Continuous Deep Reinforcement Learning

    Authors: Chenyang Zhao, Olivier Sigaud, Freek Stulp, Timothy M. Hospedales

    Abstract: Deep Reinforcement Learning has shown great success in a variety of control tasks. However, it is unclear how close we are to the vision of putting Deep RL into practice to solve real world problems. In particular, common practice in the field is to train policies on largely deterministic simulators and to evaluate algorithms through training performance alone, without a train/test distinction to… ▽ More

    Submitted 20 February, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

  28. arXiv:1902.00113  [pdf, other

    cs.CV

    Episodic Training for Domain Generalization

    Authors: Da Li, Jianshu Zhang, Yongxin Yang, Cong Liu, Yi-Zhe Song, Timothy M. Hospedales

    Abstract: Domain generalization (DG) is the challenging and topical problem of learning models that generalize to novel testing domains with different statistics than a set of known training domains. The simple approach of aggregating data from all source domains and training a single deep neural network end-to-end on all the data provides a surprisingly strong baseline that surpasses many prior published m… ▽ More

    Submitted 6 December, 2019; v1 submitted 31 January, 2019; originally announced February 2019.

    Comments: ICCV'19 CR version and fix Table 5. Code is now available at https://github.com/HAHA-DL/Episodic-DG

  29. arXiv:1901.11448  [pdf, other

    cs.LG stat.ML

    Feature-Critic Networks for Heterogeneous Domain Generalization

    Authors: Yiying Li, Yongxin Yang, Wei Zhou, Timothy M. Hospedales

    Abstract: The well known domain shift issue causes model performance to degrade when deployed to a new target domain with different statistics to training. Domain adaptation techniques alleviate this, but need some instances from the target domain to drive adaptation. Domain generalisation is the recently topical problem of learning a model that generalises to unseen domains out of the box, and various appr… ▽ More

    Submitted 8 June, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: Presented at ICML 2019

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97, 2019

  30. TuckER: Tensor Factorization for Knowledge Graph Completion

    Authors: Ivana Balažević, Carl Allen, Timothy M. Hospedales

    Abstract: Knowledge graphs are structured representations of real world facts. However, they typically contain only a small subset of all possible facts. Link prediction is a task of inferring missing facts based on existing ones. We propose TuckER, a relatively straightforward but powerful linear model based on Tucker decomposition of the binary tensor representation of knowledge graph triples. TuckER outp… ▽ More

    Submitted 24 August, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

  31. arXiv:1812.02605  [pdf, other

    cs.CV cs.LG

    Disjoint Label Space Transfer Learning with Common Factorised Space

    Authors: Xiaobin Chang, Yongxin Yang, Tao Xiang, Timothy M. Hospedales

    Abstract: In this paper, a unified approach is presented to transfer learning that addresses several source and target domain label-space and annotation assumptions with a single model. It is particularly effective in handling a challenging case, where source and target label-spaces are disjoint, and outperforms alternatives in both unsupervised and semi-supervised settings. The key ingredient is a common r… ▽ More

    Submitted 6 December, 2018; originally announced December 2018.

    Comments: AAAI-19

  32. arXiv:1811.07100  [pdf, other

    cs.CV

    RelationNet2: Deep Comparison Columns for Few-Shot Learning

    Authors: Xueting Zhang, Yuting Qiang, Flood Sung, Yongxin Yang, Timothy M. Hospedales

    Abstract: Few-shot deep learning is a topical challenge area for scaling visual recognition to open ended growth of unseen new classes with limited labeled examples. A promising approach is based on metric learning, which trains a deep embedding to support image similarity matching. Our insight is that effective general purpose matching requires non-linear comparison of features at multiple abstraction leve… ▽ More

    Submitted 27 April, 2020; v1 submitted 16 November, 2018; originally announced November 2018.

    Comments: 10 pages, 5 figures, Published in IJCNN 2020

  33. arXiv:1810.07778  [pdf, other

    cs.LG cs.AI stat.ML

    Dynamic Ensemble Active Learning: A Non-Stationary Bandit with Expert Advice

    Authors: Kunkun Pang, Mingzhi Dong, Yang Wu, Timothy M. Hospedales

    Abstract: Active learning aims to reduce annotation cost by predicting which samples are useful for a human teacher to label. However it has become clear there is no best active learning algorithm. Inspired by various philosophies about what constitutes a good criteria, different algorithms perform well on different datasets. This has motivated research into ensembles of active learners that learn what cons… ▽ More

    Submitted 29 September, 2018; originally announced October 2018.

    Comments: This work has been accepted at ICPR2018 and won Piero Zamperoni Best Student Paper Award

  34. Hypernetwork Knowledge Graph Embeddings

    Authors: Ivana Balažević, Carl Allen, Timothy M. Hospedales

    Abstract: Knowledge graphs are graphical representations of large databases of facts, which typically suffer from incompleteness. Inferring missing relations (links) between entities (nodes) is the task of link prediction. A recent state-of-the-art approach to link prediction, ConvE, implements a convolutional neural network to extract features from concatenated subject and relation vectors. Whilst results… ▽ More

    Submitted 15 July, 2019; v1 submitted 21 August, 2018; originally announced August 2018.

  35. arXiv:1808.02313  [pdf, other

    cs.CV

    Deep Factorised Inverse-Sketching

    Authors: Kaiyue Pang, Da Li, Jifei Song, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales

    Abstract: Modelling human free-hand sketches has become topical recently, driven by practical applications such as fine-grained sketch based image retrieval (FG-SBIR). Sketches are clearly related to photo edge-maps, but a human free-hand sketch of a photo is not simply a clean rendering of that photo's edge map. Instead there is a fundamental process of abstraction and iconic rendering, where overall geome… ▽ More

    Submitted 7 August, 2018; originally announced August 2018.

    Comments: Accepted to ECCV 2018

  36. arXiv:1808.02312  [pdf, other

    cs.CV

    Universal Perceptual Grou**

    Authors: Ke Li, Kaiyue Pang, Jifei Song, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Honggang Zhang

    Abstract: In this work we aim to develop a universal sketch grouper. That is, a grouper that can be applied to sketches of any category in any domain to group constituent strokes/segments into semantically meaningful object parts. The first obstacle to this goal is the lack of large-scale datasets with grou** annotation. To overcome this, we contribute the largest sketch perceptual grou** (SPG) dataset… ▽ More

    Submitted 7 August, 2018; originally announced August 2018.

    Comments: Accepted ECCV 2018

  37. arXiv:1806.06988  [pdf, other

    cs.LG stat.ML

    Deep Neural Decision Trees

    Authors: Yongxin Yang, Irene Garcia Morillo, Timothy M. Hospedales

    Abstract: Deep neural networks have been proven powerful at processing perceptual data, such as images and audio. However for tabular data, tree-based models are more popular. A nice property of tree-based models is their natural interpretability. In this work, we present Deep Neural Decision Trees (DNDT) -- tree models realised by neural networks. A DNDT is intrinsically interpretable, as it is a tree. Yet… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: presented at 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), Stockholm, Sweden

  38. arXiv:1804.11182  [pdf, other

    cs.CV

    Sketch-a-Classifier: Sketch-based Photo Classifier Generation

    Authors: Conghui Hu, Da Li, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales

    Abstract: Contemporary deep learning techniques have made image recognition a reasonably reliable technology. However training effective photo classifiers typically takes numerous examples which limits image recognition's scalability and applicability to scenarios where images may not be available. This has motivated investigation into zero-shot learning, which addresses the issue via knowledge transfer fro… ▽ More

    Submitted 30 April, 2018; originally announced April 2018.

    Comments: published in CVPR2018 as spotlight

  39. arXiv:1804.04804  [pdf, other

    cs.CV

    Learning Deep Sketch Abstraction

    Authors: Umar Riaz Muhammad, Yongxin Yang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales

    Abstract: Human free-hand sketches have been studied in various contexts including sketch recognition, synthesis and fine-grained sketch-based image retrieval (FG-SBIR). A fundamental challenge for sketch analysis is to deal with drastically different human drawing styles, particularly in terms of abstraction level. In this work, we propose the first stroke-level sketch abstraction model based on the insigh… ▽ More

    Submitted 13 April, 2018; originally announced April 2018.

    Comments: This paper is accepted at CVPR 2018 as poster

  40. arXiv:1804.01401  [pdf, ps, other

    cs.CV

    SketchMate: Deep Hashing for Million-Scale Human Sketch Retrieval

    Authors: Peng Xu, Yongye Huang, Tongtong Yuan, Kaiyue Pang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Zhanyu Ma, Jun Guo

    Abstract: We propose a deep hashing framework for sketch retrieval that, for the first time, works on a multi-million scale human sketch dataset. Leveraging on this large dataset, we explore a few sketch-specific traits that were otherwise under-studied in prior literature. Instead of following the conventional sketch recognition task, we introduce the novel problem of sketch hashing retrieval which is not… ▽ More

    Submitted 4 April, 2018; originally announced April 2018.

    Comments: Accepted by CVPR2018

  41. arXiv:1803.09132  [pdf, other

    cs.CV

    Multi-Level Factorisation Net for Person Re-Identification

    Authors: Xiaobin Chang, Timothy M. Hospedales, Tao Xiang

    Abstract: Key to effective person re-identification (Re-ID) is modelling discriminative and view-invariant factors of person appearance at both high and low semantic levels. Recently developed deep Re-ID models either learn a holistic single semantic level feature representation and/or require laborious human annotation of these factors as attributes. We propose Multi-Level Factorisation Net (MLFN), a novel… ▽ More

    Submitted 17 April, 2018; v1 submitted 24 March, 2018; originally announced March 2018.

    Comments: To Appear at CVPR2018

  42. arXiv:1803.06936  [pdf, other

    cs.CV

    Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool

    Authors: Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun

    Abstract: In recent years, visual question answering (VQA) has become topical. The premise of VQA's significance as a benchmark in AI, is that both the image and textual question need to be well understood and mutually grounded in order to infer the correct answer. However, current VQA models perhaps `understand' less than initially hoped, and instead master the easier task of exploiting cues given away in… ▽ More

    Submitted 16 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1710.03370

  43. arXiv:1711.08106  [pdf, other

    cs.CV

    The Devil is in the Middle: Exploiting Mid-level Representations for Cross-Domain Instance Matching

    Authors: Qian Yu, Xiaobin Chang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales

    Abstract: Many vision problems require matching images of object instances across different domains. These include fine-grained sketch-based image retrieval (FG-SBIR) and Person Re-identification (person ReID). Existing approaches attempt to learn a joint embedding space where images from different domains can be directly compared. In most cases, this space is defined by the output of the final layer of a d… ▽ More

    Submitted 4 April, 2018; v1 submitted 21 November, 2017; originally announced November 2017.

    Comments: Reference updated

  44. arXiv:1711.06047  [pdf, other

    cs.CV stat.ML

    Deep Matching Autoencoders

    Authors: Tanmoy Mukherjee, Makoto Yamada, Timothy M. Hospedales

    Abstract: Increasingly many real world tasks involve data in multiple modalities or views. This has motivated the development of many effective algorithms for learning a common latent space to relate multiple domains. However, most existing cross-view learning algorithms assume access to paired data for training. Their applicability is thus limited as the paired data assumption is often violated in practice… ▽ More

    Submitted 16 November, 2017; originally announced November 2017.

    Comments: 10 pages

  45. arXiv:1711.06025  [pdf, other

    cs.CV

    Learning to Compare: Relation Network for Few-Shot Learning

    Authors: Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip H. S. Torr, Timothy M. Hospedales

    Abstract: We present a conceptually simple, flexible, and general framework for few-shot learning, where a classifier must learn to recognise new classes given only few examples from each. Our method, called the Relation Network (RN), is trained end-to-end from scratch. During meta-learning, it learns to learn a deep distance metric to compare a small number of images within episodes, each of which is desig… ▽ More

    Submitted 27 March, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: To appear in CVPR2018

  46. arXiv:1710.03463  [pdf, other

    cs.LG

    Learning to Generalize: Meta-Learning for Domain Generalization

    Authors: Da Li, Yongxin Yang, Yi-Zhe Song, Timothy M. Hospedales

    Abstract: Domain shift refers to the well known problem that a model trained in one source domain performs poorly when applied to a target domain with different statistics. {Domain Generalization} (DG) techniques attempt to alleviate this issue by producing models which by design generalize well to novel testing domains. We propose a novel {meta-learning} method for domain generalization. Rather than design… ▽ More

    Submitted 10 October, 2017; originally announced October 2017.

    Comments: 8 pages, 2 figures, under review of AAAI 2018

  47. arXiv:1710.03370  [pdf, other

    cs.CV

    iVQA: Inverse Visual Question Answering

    Authors: Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun

    Abstract: We propose the inverse problem of Visual question answering (iVQA), and explore its suitability as a benchmark for visuo-linguistic understanding. The iVQA task is to generate a question that corresponds to a given image and answer pair. Since the answers are less informative than the questions, and the questions have less learnable bias, an iVQA model needs to better understand the image to be su… ▽ More

    Submitted 16 March, 2018; v1 submitted 9 October, 2017; originally announced October 2017.

    Comments: CVPR18 Spotlight

  48. arXiv:1710.03077  [pdf, other

    cs.CV

    Deeper, Broader and Artier Domain Generalization

    Authors: Da Li, Yongxin Yang, Yi-Zhe Song, Timothy M. Hospedales

    Abstract: The problem of domain generalization is to learn from multiple training domains, and extract a domain-agnostic model that can then be applied to an unseen domain. Domain generalization (DG) has a clear motivation in contexts where there are target domains with distinct characteristics, yet sparse data for training. For example recognition in sketch images, which are distinctly more abstract and ra… ▽ More

    Submitted 9 October, 2017; originally announced October 2017.

    Comments: 9 pages, 4 figures, ICCV 2017

  49. arXiv:1708.02459  [pdf, other

    cs.CV

    Weakly Supervised Image Annotation and Segmentation with Objects and Attributes

    Authors: Zhiyuan Shi, Yongxin Yang, Timothy M. Hospedales, Tao Xiang

    Abstract: We propose to model complex visual scenes using a non-parametric Bayesian model learned from weakly labelled images abundant on media sharing sites such as Flickr. Given weak image-level annotations of objects and attributes without locations or associations between them, our model aims to learn the appearance of object and attribute classes as well as their association on each object instance. On… ▽ More

    Submitted 8 August, 2017; originally announced August 2017.

    Comments: Accepted in IEEE Transaction on Pattern Analysis and Machine Intelligence

  50. arXiv:1707.09669  [pdf, other

    cs.CV

    Scalable and Effective Deep CCA via Soft Decorrelation

    Authors: Xiaobin Chang, Tao Xiang, Timothy M. Hospedales

    Abstract: Recently the widely used multi-view learning model, Canonical Correlation Analysis (CCA) has been generalised to the non-linear setting via deep neural networks. Existing deep CCA models typically first decorrelate the feature dimensions of each view before the different views are maximally correlated in a common latent space. This feature decorrelation is achieved by enforcing an exact decorrelat… ▽ More

    Submitted 24 March, 2018; v1 submitted 30 July, 2017; originally announced July 2017.

    Comments: To Appear at CVPR2018