Showing 1–50 of 206 results for author: Torr, P H S

Searching in archive cs.
  1. arXiv:2406.10288  [pdf, other]

    cs.CL cs.LG

    Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models

    Authors: Francisco Eiras, Aleksandar Petrov, Philip H. S. Torr, M. Pawan Kumar, Adel Bibi

    Abstract: Fine-tuning large language models on small, high-quality datasets can enhance their performance on specific downstream tasks. Recent research shows that fine-tuning on benign, instruction-following data can inadvertently undo the safety alignment process and increase a model's propensity to comply with harmful queries. Although critical, understanding and mitigating safety risks in well-defined ta…

    Submitted 1 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.01424  [pdf, other]

    cs.LG cs.AI cs.CL

    Universal In-Context Approximation By Prompting Fully Recurrent Models

    Authors: Aleksandar Petrov, Tom A. Lamb, Alasdair Paren, Philip H. S. Torr, Adel Bibi

    Abstract: Zero-shot and in-context learning enable solving tasks without model fine-tuning, making them essential for developing generative model solutions. Therefore, it is crucial to understand whether a pretrained model can be prompted to approximate any function, i.e., whether it is a universal in-context approximator. While it was recently shown that transformer models do possess this property, these r…

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2405.13922  [pdf, other]

    cs.LG stat.ML

    Towards Certification of Uncertainty Calibration under Adversarial Attacks

    Authors: Cornelius Emde, Francesco Pinto, Thomas Lukasiewicz, Philip H. S. Torr, Adel Bibi

    Abstract: Since neural classifiers are known to be sensitive to adversarial perturbations that alter their accuracy, certification methods have been developed to provide provable guarantees on the insensitivity of their predictions to such perturbations. Furthermore, in safety-critical applications, the frequentist interpretation of the confidence of a classifier (also known as model calibration) c…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 11 pages main paper, appendix included

  4. arXiv:2405.08597  [pdf, other]

    cs.LG

    Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Aaron Purewal, Csaba Botos, Fabro Steibel, Fazel Keshtkar, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Imperial, Juan Arturo Nolazco, Lori Landay, Matthew Jackson, Philip H. S. Torr, Trevor Darrell, Yong Lee, Jakob Foerster

    Abstract: Applications of Generative AI (Gen AI) are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about the potential risks of the technology, and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This reg…

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: Extension of arXiv:2404.17047

  5. arXiv:2404.17047  [pdf, other]

    cs.LG

    Near to Mid-term Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Botos Csaba, Fabro Steibel, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan A. Nolazco-Flores, Lori Landay, Matthew Jackson, Paul Röttger, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob Foerster

    Abstract: In the next few years, applications of Generative AI are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about potential risks and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This regulation i…

    Submitted 24 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to ICML'24 as a position paper

  6. arXiv:2404.12766  [pdf, other]

    cs.LG cs.CV

    Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation

    Authors: Wenxuan Zhang, Youssef Mohamed, Bernard Ghanem, Philip H. S. Torr, Adel Bibi, Mohamed Elhoseiny

    Abstract: We propose and study a realistic Continual Learning (CL) setting where learning algorithms are granted a restricted computational budget per time step while training. We apply this setting to large-scale semi-supervised Continual Learning scenarios with sparse label rates. Previous proficient CL methods perform very poorly in this challenging setting. Overfitting to the sparse labeled data and ins…

    Submitted 8 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  7. arXiv:2404.04125  [pdf, other]

    cs.CV cs.CL cs.LG

    No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

    Authors: Vishaal Udandarao, Ameya Prabhu, Adhiraj Ghosh, Yash Sharma, Philip H. S. Torr, Adel Bibi, Samuel Albanie, Matthias Bethge

    Abstract: Web-crawled pretraining datasets underlie the impressive "zero-shot" evaluation performance of multimodal models, such as CLIP for classification/retrieval and Stable-Diffusion for image generation. However, it is unclear how meaningful the notion of "zero-shot" generalization is for such multimodal models, as it is not known to what extent their pretraining datasets encompass the downstream conce…

    Submitted 8 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Extended version of the short paper accepted at DPFM, ICLR'24

  8. arXiv:2403.01325  [pdf, other]

    cs.CV

    NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning

    Authors: Linsheng Chen, Guangrun Wang, Liuchun Yuan, Keze Wang, Ken Deng, Philip H. S. Torr

    Abstract: Neural Radiance Fields (NeRF) have garnered remarkable success in novel view synthesis. Nonetheless, the task of generating high-quality images for novel views persists as a critical challenge. While the existing efforts have exhibited commendable progress, capturing intricate details, enhancing textures, and achieving superior Peak Signal-to-Noise Ratio (PSNR) metrics warrant further focused atte…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: AAAI 2024

  9. arXiv:2402.16392  [pdf, other]

    cs.CV

    Placing Objects in Context via Inpainting for Out-of-distribution Segmentation

    Authors: Pau de Jorge, Riccardo Volpi, Puneet K. Dokania, Philip H. S. Torr, Gregory Rogez

    Abstract: When deploying a semantic segmentation model into the real world, it will inevitably be confronted with semantic classes unseen during training. Thus, to safely deploy such systems, it is crucial to accurately evaluate and improve their anomaly segmentation capabilities. However, acquiring and labelling semantic segmentation data is expensive and unanticipated conditions are long-tail and potentia…

    Submitted 26 February, 2024; originally announced February 2024.

  10. arXiv:2402.14753  [pdf, other]

    cs.LG cs.AI math.FA

    Prompting a Pretrained Transformer Can Be a Universal Approximator

    Authors: Aleksandar Petrov, Philip H. S. Torr, Adel Bibi

    Abstract: Despite the widespread adoption of prompting, prompt tuning and prefix-tuning of transformer models, our theoretical understanding of these fine-tuning methods remains limited. A key question is whether one can arbitrarily modify the behavior of pretrained model by prompting or prefix-tuning it. Formally, whether prompting and prefix-tuning a pretrained model can universally approximate sequence-t…

    Submitted 22 February, 2024; originally announced February 2024.

  11. arXiv:2402.10186  [pdf, other]

    cs.LG physics.chem-ph physics.comp-ph

    Self-consistent Validation for Machine Learning Electronic Structure

    Authors: Gengyuan Hu, Gengchen Wei, Zekun Lou, Philip H. S. Torr, Wanli Ouyang, Han-sen Zhong, Chen Lin

    Abstract: Machine learning has emerged as a significant approach to efficiently tackle electronic structure problems. Despite its potential, there is less guarantee for the model to generalize to unseen data that hinders its application in real-world scenarios. To address this issue, a technique has been proposed to estimate the accuracy of the predictions. This method integrates machine learning with self-…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures

  12. arXiv:2402.08823  [pdf, other]

    cs.CV cs.LG

    RanDumb: A Simple Approach that Questions the Efficacy of Continual Representation Learning

    Authors: Ameya Prabhu, Shiven Sinha, Ponnurangam Kumaraguru, Philip H. S. Torr, Ozan Sener, Puneet K. Dokania

    Abstract: We propose RanDumb to examine the efficacy of continual representation learning. RanDumb embeds raw pixels using a fixed random transform which approximates an RBF-Kernel, initialized before seeing any data, and learns a simple linear classifier on top. We present a surprising and consistent finding: RanDumb significantly outperforms the continually learned representations using deep networks acro…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Tech Report

  13. arXiv:2402.08480  [pdf, other]

    cs.LG math.DG

    Revealing Decurve Flows for Generalized Graph Propagation

    Authors: Chen Lin, Liheng Ma, Yiyang Chen, Wanli Ouyang, Michael M. Bronstein, Philip H. S. Torr

    Abstract: This study addresses the limitations of the traditional analysis of message-passing, central to graph learning, by defining generalized propagation with directed and weighted graphs. The significance manifests in two ways. Firstly, we propose Generalized Propagation Neural Networks (GPNNs), a framework that unifies most propagation-based graph neural networks.…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 15 pages, 4 figures

  14. arXiv:2402.07510  [pdf, other]

    cs.AI cs.CR

    Secret Collusion Among Generative AI Agents

    Authors: Sumeet Ramesh Motwani, Mikhail Baranchuk, Martin Strohmeier, Vijay Bolina, Philip H. S. Torr, Lewis Hammond, Christian Schroeder de Witt

    Abstract: Recent capability increases in large language models (LLMs) open up applications in which teams of communicating generative AI agents solve joint tasks. This poses privacy and security challenges concerning the unauthorised sharing of information, or other unwanted forms of agent coordination. Modern steganographic techniques could render such dynamics hard to detect. In this paper, we comprehensi…

    Submitted 12 February, 2024; originally announced February 2024.

  15. arXiv:2311.11293  [pdf, other]

    cs.LG

    From Categories to Classifier: Name-Only Continual Learning by Exploring the Web

    Authors: Ameya Prabhu, Hasan Abed Al Kader Hammoud, Ser-Nam Lim, Bernard Ghanem, Philip H. S. Torr, Adel Bibi

    Abstract: Continual Learning (CL) often relies on the availability of extensive annotated datasets, an assumption that is unrealistically time-consuming and costly in practice. We explore a novel paradigm termed name-only continual learning where time and cost constraints prohibit manual annotation. In this scenario, learners adapt to new category shifts using only category names without the luxury of annot…

    Submitted 19 November, 2023; originally announced November 2023.

  16. arXiv:2310.19698  [pdf, other]

    cs.LG cs.CL

    When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations

    Authors: Aleksandar Petrov, Philip H. S. Torr, Adel Bibi

    Abstract: Context-based fine-tuning methods, including prompting, in-context learning, soft prompting (also known as prompt tuning), and prefix-tuning, have gained popularity due to their ability to often match the performance of full fine-tuning with a fraction of the parameters. Despite their empirical successes, there is little theoretical understanding of how these techniques influence the internal comp…

    Submitted 9 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted at ICLR 2024

  17. arXiv:2310.19252  [pdf, other]

    cs.CV cs.AI cs.LG

    Revisiting Evaluation Metrics for Semantic Segmentation: Optimization and Evaluation of Fine-grained Intersection over Union

    Authors: Zifu Wang, Maxim Berman, Amal Rannen-Triki, Philip H. S. Torr, Devis Tuia, Tinne Tuytelaars, Luc Van Gool, Jiaqian Yu, Matthew B. Blaschko

    Abstract: Semantic segmentation datasets often exhibit two types of imbalance: class imbalance, where some classes appear more frequently than others and size imbalance, where some objects occupy more pixels than others. This causes traditional evaluation metrics to be biased towards majority classes (e.g. overall pixel-wise accuracy) and large objects (e.g. mean pixel-wi…

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  18. arXiv:2310.13479  [pdf, other]

    cs.CV cs.LG

    Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation

    Authors: Francisco Eiras, Kemal Oksuz, Adel Bibi, Philip H. S. Torr, Puneet K. Dokania

    Abstract: Referring Image Segmentation (RIS) - the problem of identifying objects in images through natural language sentences - is a challenging task currently mostly solved through supervised learning. However, while collecting referred annotation masks is a time-consuming process, the few existing weakly-supervised and zero-shot approaches fall significantly short in performance compared to fully-supervi…

    Submitted 23 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

  19. arXiv:2308.13320  [pdf, other]

    cs.LG cs.CV

    Fine-tuning can cripple your foundation model; preserving features may be the solution

    Authors: Jishnu Mukhoti, Yarin Gal, Philip H. S. Torr, Puneet K. Dokania

    Abstract: Pre-trained foundation models, due to their enormous capacity and exposure to vast amounts of data during pre-training, are known to have learned plenty of real-world concepts. An important step in making these pre-trained models effective on downstream tasks is to fine-tune them on related datasets. While various fine-tuning methods have been devised and have been shown to be highly effective, we…

    Submitted 1 July, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Published in TMLR: https://openreview.net/forum?id=kfhoeZCeW7

  20. arXiv:2308.08414  [pdf, other]

    cs.CV

    Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

    Authors: Guangyi Chen, Xiao Liu, Guangrun Wang, Kun Zhang, Philip H. S. Torr, Xiao-Ping Zhang, Yansong Tang

    Abstract: Video-language pre-trained models have shown remarkable success in guiding video question-answering (VideoQA) tasks. However, due to the length of video sequences, training large-scale video-based models incurs considerably higher costs than training image-based ones. This motivates us to leverage the knowledge from image-based pretraining, despite the obvious gaps between image and video domains.…

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  21. arXiv:2306.04431  [pdf, other]

    cs.LG

    Faithful Knowledge Distillation

    Authors: Tom A. Lamb, Rudy Brunel, Krishnamurthy DJ Dvijotham, M. Pawan Kumar, Philip H. S. Torr, Francisco Eiras

    Abstract: Knowledge distillation (KD) has received much attention due to its success in compressing networks to allow for their deployment in resource-constrained systems. While the problem of adversarial robustness has been studied before in the KD setting, previous works overlook what we term the relative calibration of the student network with respect to its teacher in terms of soft confidences. In parti…

    Submitted 11 August, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 7pgs (main content), 4 figures

  22. arXiv:2305.15425  [pdf]

    cs.CL cs.LG

    Language Model Tokenizers Introduce Unfairness Between Languages

    Authors: Aleksandar Petrov, Emanuele La Malfa, Philip H. S. Torr, Adel Bibi

    Abstract: Recent language models have shown impressive multilingual performance, even when not explicitly trained for it. Despite this, there are concerns about the quality of their outputs across different languages. In this paper, we show how disparity in the treatment of different languages arises at the tokenization stage, well before a model is even invoked. The same text translated into different lang…

    Submitted 20 October, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Published at NeurIPS 2023, Project webpage: https://aleksandarpetrov.github.io/tokenization-fairness, Code: https://github.com/AleksandarPetrov/tokenization-fairness

  23. arXiv:2305.09275  [pdf, other]

    cs.LG cs.AI cs.CV

    Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?

    Authors: Hasan Abed Al Kader Hammoud, Ameya Prabhu, Ser-Nam Lim, Philip H. S. Torr, Adel Bibi, Bernard Ghanem

    Abstract: We revisit the common practice of evaluating adaptation of Online Continual Learning (OCL) algorithms through the metric of online accuracy, which measures the accuracy of the model on the immediate next few samples. However, we show that this metric is unreliable, as even vacuous blind classifiers, which do not use input images for prediction, can achieve unrealistically high online accuracy by e…

    Submitted 16 May, 2023; originally announced May 2023.

  24. arXiv:2304.13019  [pdf, other]

    cs.LG

    Certifying Ensembles: A General Certification Theory with S-Lipschitzness

    Authors: Aleksandar Petrov, Francisco Eiras, Amartya Sanyal, Philip H. S. Torr, Adel Bibi

    Abstract: Improving and guaranteeing the robustness of deep learning models has been a topic of intense research. Ensembling, which combines several classifiers to provide a better model, has shown to be beneficial for generalisation, uncertainty estimation, calibration, and mitigating the effects of concept drift. However, the impact of ensembling on certified robustness is less well understood. In this wo…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted to ICML 2023

  25. arXiv:2304.09826  [pdf]

    cs.CY cs.AI cs.CL cs.CV cs.LG

    Fairness in AI and Its Long-Term Implications on Society

    Authors: Ondrej Bohdal, Timothy Hospedales, Philip H. S. Torr, Fazl Barez

    Abstract: Successful deployment of artificial intelligence (AI) in various settings has led to numerous positive outcomes for individuals and society. However, AI systems have also been shown to harm parts of the population due to biased predictions. AI fairness focuses on mitigating such biases to ensure AI decision making is not discriminatory towards certain groups. We take a closer look at AI fairness a…

    Submitted 19 July, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: Stanford Existential Risks Conference 2023

  26. arXiv:2303.13211  [pdf, other]

    cs.CR cs.CV cs.LG

    Don't FREAK Out: A Frequency-Inspired Approach to Detecting Backdoor Poisoned Samples in DNNs

    Authors: Hasan Abed Al Kader Hammoud, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

    Abstract: In this paper we investigate the frequency sensitivity of Deep Neural Networks (DNNs) when presented with clean samples versus poisoned samples. Our analysis shows significant disparities in frequency sensitivity between these two types of samples. Building on these findings, we propose FREAK, a frequency-based poisoned sample detection algorithm that is simple yet effective. Our experimental resu…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPRW (The Art of Robustness)

  27. arXiv:2303.11165  [pdf, other]

    cs.LG cs.CV

    Computationally Budgeted Continual Learning: What Does Matter?

    Authors: Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet Dokania, Philip H. S. Torr, Ser-Nam Lim, Bernard Ghanem, Adel Bibi

    Abstract: Continual Learning (CL) aims to sequentially train models on streams of incoming data that vary in distribution by preserving previous knowledge while adapting to new data. Current CL literature focuses on restricted access to previously seen data, while imposing no constraints on the computational budget for training. This is unreasonable for applications in-the-wild, where systems are primarily…

    Submitted 14 July, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  28. arXiv:2303.06345  [pdf, other]

    cs.CV

    Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation

    Authors: Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr

    Abstract: Referring image segmentation segments an image from a language expression. With the aim of producing high-quality masks, existing methods often adopt iterative learning approaches that rely on RNNs or stacked attention layers to refine vision-language features. Despite their complexity, RNN-based methods are subject to specific encoder choices, while attention-based methods offer limited gains. In…

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: AAAI 2023. 11 pages. 7 figures

  29. arXiv:2303.01932  [pdf, other]

    cs.CV cs.LG

    MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices

    Authors: Kejie Li, Jia-Wang Bian, Robert Castle, Philip H. S. Torr, Victor Adrian Prisacariu

    Abstract: High-quality 3D ground-truth shapes are critical for 3D object reconstruction evaluation. However, it is difficult to create a replica of an object in reality, and even 3D reconstructions generated by 3D scanners have artefacts that cause biases in evaluation. To address this issue, we introduce a novel multi-view RGBD dataset captured using a mobile device, which includes highly precise 3D ground…

    Submitted 9 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: To appear at CVPR 2023

  30. arXiv:2302.01872  [pdf, other]

    cs.CV

    MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

    Authors: Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai

    Abstract: Video object segmentation (VOS) aims at segmenting a particular object throughout the entire video clip sequence. The state-of-the-art VOS methods have achieved excellent performance (e.g., 90+% J&F) on existing datasets. However, since the target objects in these existing datasets are usually relatively salient, dominant, and isolated, VOS under complex scenes has rarely been studied. To revisit…

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: MOSE Dataset Report

    Journal ref: ICCV 2023

  31. arXiv:2302.01047  [pdf, other]

    cs.LG cs.AI cs.CV

    Real-Time Evaluation in Online Continual Learning: A New Hope

    Authors: Yasir Ghunaim, Adel Bibi, Kumail Alhamoud, Motasem Alfarra, Hasan Abed Al Kader Hammoud, Ameya Prabhu, Philip H. S. Torr, Bernard Ghanem

    Abstract: Current evaluations of Continual Learning (CL) methods typically assume that there is no constraint on training time and computation. This is an unrealistic assumption for any real-world setting, which motivates us to propose: a practical real-time evaluation of continual learning, in which the stream does not wait for the model to complete training before revealing the next data for predictions.…

    Submitted 24 March, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Accepted at CVPR'23 as Highlight (Top 2.5%)

  32. arXiv:2212.04994  [pdf, other]

    cs.CV

    Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

    Authors: Jishnu Mukhoti, Tsung-Yu Lin, Omid Poursaeed, Rui Wang, Ashish Shah, Philip H. S. Torr, Ser-Nam Lim

    Abstract: We introduce Patch Aligned Contrastive Learning (PACL), a modified compatibility function for CLIP's contrastive loss, intending to train an alignment between the patch tokens of the vision encoder and the CLS token of the text encoder. With such an alignment, a model can identify regions of an image corresponding to a given text input, and therefore transfer seamlessly to the task of open vocabul…

    Submitted 9 December, 2022; originally announced December 2022.

  33. arXiv:2211.14960  [pdf, other]

    cs.LG stat.ML

    Label Alignment Regularization for Distribution Shift

    Authors: Ehsan Imani, Guojun Zhang, Runjia Li, Jun Luo, Pascal Poupart, Philip H. S. Torr, Yangchen Pan

    Abstract: Recent work has highlighted the label alignment property (LAP) in supervised learning, where the vector of all labels in the dataset is mostly in the span of the top few singular vectors of the data matrix. Drawing inspiration from this observation, we propose a regularization method for unsupervised domain adaptation that encourages alignment between the predictions in the target domain and its t…

    Submitted 11 June, 2024; v1 submitted 27 November, 2022; originally announced November 2022.

  34. arXiv:2211.14794  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM stat.ML

    Traditional Classification Neural Networks are Good Generators: They are Competitive with DDPMs and GANs

    Authors: Guangrun Wang, Philip H. S. Torr

    Abstract: Classifiers and generators have long been separated. We break down this separation and showcase that conventional neural network classifiers can generate high-quality images of a large number of categories, being comparable to the state-of-the-art generative models (e.g., DDPMs and GANs). We achieve this by computing the partial derivative of the classification loss function with respect to the in…

    Submitted 8 December, 2022; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: This paper has 29 pages with 22 figures, including rich supplementary information. Project page is at https://classifier-as-generator.github.io/

  35. arXiv:2211.06719  [pdf, other]

    cs.CV cs.AI

    Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis

    Authors: Hao Tang, Ling Shao, Philip H. S. Torr, Nicu Sebe

    Abstract: We present a novel bipartite graph reasoning Generative Adversarial Network (BiGraphGAN) for two challenging tasks: person pose and facial image synthesis. The proposed graph generator consists of two novel blocks that aim to model the pose-to-pose and pose-to-image relations, respectively. Specifically, the proposed bipartite graph reasoning (BGR) block aims to reason the long-range cross relatio…

    Submitted 12 November, 2022; originally announced November 2022.

    Comments: Accepted to IJCV, an extended version of a paper published in BMVC 2020. arXiv admin note: substantial text overlap with arXiv:2008.04381

  36. arXiv:2211.06701  [pdf, other]

    cs.CV

    Structure-Preserving 3D Garment Modeling with Neural Sewing Machines

    Authors: Xipeng Chen, Guangrun Wang, Dizhong Zhu, Xiaodan Liang, Philip H. S. Torr, Liang Lin

    Abstract: 3D Garment modeling is a critical and challenging topic in the area of computer vision and graphics, with increasing attention focused on garment representation learning, garment reconstruction, and controllable garment manipulation, whereas existing methods were constrained to model garments under specific categories or with relatively simple topologies. In this paper, we propose a novel Neural S…

    Submitted 12 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  37. Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning

    Authors: Nan Xue, Tianfu Wu, Song Bai, Fu-Dong Wang, Gui-Song Xia, Liangpei Zhang, Philip H. S. Torr

    Abstract: This article presents Holistically-Attracted Wireframe Parsing (HAWP), a method for geometric analysis of 2D images containing wireframes formed by line segments and junctions. HAWP utilizes a parsimonious Holistic Attraction (HAT) field representation that encodes line segments using a closed-form 4D geometric vector field. The proposed HAWP consists of three sequential components empowered by en…

    Submitted 5 September, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Journal extension of arXiv:2003.01663; Accepted by IEEE TPAMI; Code is available at https://github.com/cherubicxn/hawp

  38. arXiv:2209.12093  [pdf, other]

    cs.AI

    Learn what matters: cross-domain imitation learning with task-relevant embeddings

    Authors: Tim Franzmeyer, Philip H. S. Torr, João F. Henriques

    Abstract: We study how an autonomous agent learns to perform a task from demonstrations in a different domain, such as a different environment or different agent. Such cross-domain imitation learning is required to, for example, train an artificial agent from demonstrations of a human expert. We propose a scalable framework that enables cross-domain imitation learning without access to additional demonstrat…

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022

  39. arXiv:2209.11960  [pdf, other]

    cs.CV cs.LG

    Raising the Bar on the Evaluation of Out-of-Distribution Detection

    Authors: Jishnu Mukhoti, Tsung-Yu Lin, Bor-Chun Chen, Ashish Shah, Philip H. S. Torr, Puneet K. Dokania, Ser-Nam Lim

    Abstract: In image classification, a lot of development has happened in detecting out-of-distribution (OoD) data. However, most OoD detection methods are evaluated on a standard set of datasets, arbitrarily different from training data. There is no clear definition of what forms a "good" OoD dataset. Furthermore, the state-of-the-art OoD detection methods already achieve near perfect results on these stand…

    Submitted 24 September, 2022; originally announced September 2022.

  40. arXiv:2209.09760  [pdf, other]

    cs.CV cs.LG

    Dynamic Graph Message Passing Networks for Visual Recognition

    Authors: Li Zhang, Mohan Chen, Anurag Arnab, Xiangyang Xue, Philip H. S. Torr

    Abstract: Modelling long-range dependencies is critical for scene understanding tasks in computer vision. Although convolution neural networks (CNNs) have excelled in many vision tasks, they are still limited in capturing long-range structured relationships as they typically consist of layers of local kernels. A fully-connected graph, such as the self-attention operation in Transformers, is beneficial for s…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: PAMI extension of CVPR 2020 oral work arXiv:1908.06955

  41. arXiv:2208.07022  [pdf, other]

    cs.CV cs.CL cs.LG

    Memory-Driven Text-to-Image Generation

    Authors: Bowen Li, Philip H. S. Torr, Thomas Lukasiewicz

    Abstract: We introduce a memory-driven semi-parametric approach to text-to-image generation, which is based on both parametric and non-parametric techniques. The non-parametric component is a memory bank of image features constructed from a training set of images. The parametric component is a generative adversarial network. Given a new text description at inference time, the memory bank is used to selectiv…

    Submitted 15 August, 2022; originally announced August 2022.

  42. arXiv:2207.11860  [pdf, other]

    cs.CV cs.RO eess.IV

    Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation

    Authors: Jiaming Zhang, Kailun Yang, Hao Shi, Simon Reiß, Kunyu Peng, Chaoxiang Ma, Haodong Fu, Philip H. S. Torr, Kaiwei Wang, Rainer Stiefelhagen

    Abstract: In this paper, we address panoramic semantic segmentation which is under-explored due to two critical challenges: (1) image distortions and object deformations on panoramas; (2) lack of semantic annotations in the 360° imagery. To tackle these problems, first, we propose the upgraded Transformer for Panoramic Semantic Segmentation, i.e., Trans4PASS+, equipped with Deformable Patch Embedding (DPE)…

    Submitted 31 May, 2024; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). Extended version of CVPR 2022 paper arXiv:2203.01452. Code is available at https://github.com/jamycheung/Trans4PASS

  43. arXiv:2207.11347  [pdf, other]

    cs.CV cs.LG

    An Impartial Take to the CNN vs Transformer Robustness Contest

    Authors: Francesco Pinto, Philip H. S. Torr, Puneet K. Dokania

    Abstract: Following the surge of popularity of Transformers in Computer Vision, several studies have attempted to determine whether they could be more robust to distribution shifts and provide better uncertainty estimates than Convolutional Neural Networks (CNNs). The almost unanimous conclusion is that they are, and it is often conjectured more or less explicitly that the reason of this supposed superiorit…

    Submitted 22 July, 2022; originally announced July 2022.

    Journal ref: ECCV 2022

  44. arXiv:2207.10170  [pdf, other]

    cs.AI

    Illusory Attacks: Information-Theoretic Detectability Matters in Adversarial Attacks

    Authors: Tim Franzmeyer, Stephen McAleer, João F. Henriques, Jakob N. Foerster, Philip H. S. Torr, Adel Bibi, Christian Schroeder de Witt

    Abstract: Autonomous agents deployed in the real world need to be robust against adversarial attacks on sensory inputs. Robustifying agent policies requires anticipating the strongest attacks possible. We demonstrate that existing observation-space attacks on reinforcement learning agents have a common weakness: while effective, their lack of information-theoretic detectability constraints makes them detect…

    Submitted 6 May, 2024; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: ICLR 2024 Spotlight (top 5%)

  45. arXiv:2207.09339  [pdf, other]

    cs.CV

    Vision Transformers: From Semantic Segmentation to Dense Prediction

    Authors: Li Zhang, Jiachen Lu, Sixiao Zheng, Xinxuan Zhao, Xiatian Zhu, Yanwei Fu, Tao Xiang, Jianfeng Feng, Philip H. S. Torr

    Abstract: The emergence of vision transformers (ViTs) in image classification has shifted the methodologies for visual representation learning. In particular, ViTs learn visual representation at full receptive field per layer across all the image patches, in comparison to the increasing receptive fields of CNNs across layers and other alternatives (e.g., large kernels and atrous convolution). In this work,…

    Submitted 12 October, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Extended version of CVPR 2021 paper arXiv:2012.15840

  46. arXiv:2207.06211  [pdf, other]

    cs.CV

    Sample-dependent Adaptive Temperature Scaling for Improved Calibration

    Authors: Tom Joy, Francesco Pinto, Ser-Nam Lim, Philip H. S. Torr, Puneet K. Dokania

    Abstract: It is now well known that neural networks can be wrong with high confidence in their predictions, leading to poor calibration. The most common post-hoc approach to compensate for this is to perform temperature scaling, which adjusts the confidences of the predictions on any input by scaling the logits by a fixed value. Whilst this approach typically improves the average calibration across the whol…

    Submitted 22 July, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

  47. arXiv:2207.02088  [pdf, other]

    cs.CV

    SiamMask: A Framework for Fast Online Object Tracking and Segmentation

    Authors: Weiming Hu, Qiang Wang, Li Zhang, Luca Bertinetto, Philip H. S. Torr

    Abstract: In this paper we introduce SiamMask, a framework to perform both visual object tracking and video object segmentation, in real-time, with the same simple method. We improve the offline training procedure of popular fully-convolutional Siamese approaches by augmenting their losses with a binary segmentation task. Once the offline training is completed, SiamMask only requires a single bounding box f…

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: 17 pages, Accepted by TPAMI 2022. arXiv admin note: substantial text overlap with arXiv:1812.05050

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2022

  48. arXiv:2206.14502  [pdf, other]

    cs.LG cs.CV

    RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness

    Authors: Francesco Pinto, Harry Yang, Ser-Nam Lim, Philip H. S. Torr, Puneet K. Dokania

    Abstract: We show that the effectiveness of the well celebrated Mixup [Zhang et al., 2018] can be further improved if instead of using it as the sole learning objective, it is utilized as an additional regularizer to the standard cross-entropy loss. This simple change not only provides much improved accuracy but also significantly improves the quality of the predictive uncertainty estimation of Mixup in mos…

    Submitted 6 February, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: 22 pages, 18 figures

    ACM Class: I.4.0; I.2.6

  49. arXiv:2206.08871  [pdf, other]

    cs.LG stat.ML

    How Robust is Unsupervised Representation Learning to Distribution Shift?

    Authors: Yuge Shi, Imant Daunhawer, Julia E. Vogt, Philip H. S. Torr, Amartya Sanyal

    Abstract: The robustness of machine learning algorithms to distribution shift is primarily discussed in the context of supervised learning (SL). As such, there is a lack of insight on the robustness of the representations learned from unsupervised methods, such as self-supervised learning (SSL) and auto-encoder based algorithms (AE), to distribution shift. We posit that the input-driven objectives of unsup…

    Submitted 16 December, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  50. arXiv:2206.08242  [pdf, other]

    cs.LG cs.AI cs.CV

    Catastrophic overfitting can be induced with discriminative non-robust features

    Authors: Guillermo Ortiz-Jiménez, Pau de Jorge, Amartya Sanyal, Adel Bibi, Puneet K. Dokania, Pascal Frossard, Gregory Rogéz, Philip H. S. Torr

    Abstract: Adversarial training (AT) is the de facto method for building robust neural networks, but it can be computationally expensive. To mitigate this, fast single-step attacks can be used, but this may lead to catastrophic overfitting (CO). This phenomenon appears when networks gain non-trivial robustness during the first stages of AT, but then reach a breaking point where they become vulnerable in just…

    Submitted 15 August, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR)