Showing 1–12 of 12 results for author: Ioannou, Y

Searching in archive cs.
  1. Meta-GCN: A Dynamically Weighted Loss Minimization Method for Dealing with the Data Imbalance in Graph Neural Networks

    Authors: Mahdi Mohammadizadeh, Arash Mozhdehi, Yani Ioannou, Xin Wang

    Abstract: Although many real-world applications, such as disease prediction, and fault detection suffer from class imbalance, most existing graph-based classification methods ignore the skewness of the distribution of classes; therefore, tend to be biased towards the majority class(es). Conventional methods typically tackle this problem through the assignment of weights to each one of the class samples base…

    Submitted 27 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2305.02299  [pdf, other]

    cs.LG cs.CV

    Dynamic Sparse Training with Structured Sparsity

    Authors: Mike Lasby, Anna Golubeva, Utku Evci, Mihai Nica, Yani Ioannou

    Abstract: Dynamic Sparse Training (DST) methods achieve state-of-the-art results in sparse neural network training, matching the generalization of dense models while enabling sparse training and inference. Although the resulting models are highly sparse and theoretically less computationally expensive, achieving speedups with unstructured sparsity on real-world hardware is challenging. In this work, we prop…

    Submitted 21 February, 2024; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: ICLR 2024, 29 pages, 22 figures

  3. arXiv:2207.09408  [pdf, other]

    cs.LG cs.AI

    Bounding generalization error with input compression: An empirical study with infinite-width networks

    Authors: Angus Galloway, Anna Golubeva, Mahmoud Salem, Mihai Nica, Yani Ioannou, Graham W. Taylor

    Abstract: Estimating the Generalization Error (GE) of Deep Neural Networks (DNNs) is an important task that often relies on availability of held-out data. The ability to better predict GE based on a single training set may yield overarching DNN design principles to reduce a reliance on trial-and-error, along with other performance assessment advantages. In search of a quantity relevant to GE, we investigate…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: 12 pages main content, 26 pages total

  4. arXiv:2206.13034  [pdf, other]

    cs.LG cs.AI

    Monitoring Shortcut Learning using Mutual Information

    Authors: Mohammed Adnan, Yani Ioannou, Chuan-Yung Tsai, Angus Galloway, H. R. Tizhoosh, Graham W. Taylor

    Abstract: The failure of deep neural networks to generalize to out-of-distribution data is a well-known problem and raises concerns about the deployment of trained networks in safety-critical domains such as healthcare, finance and autonomous vehicles. We study a particular kind of distribution shift – shortcuts or spurious correlations in the training data. Shortcut learning is often only e…

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: Accepted at ICML 2022 Workshop on Spurious Correlations, Invariance, and Stability

  5. arXiv:2111.12170  [pdf, other]

    cs.LG cs.AI cs.CV

    Domain-Agnostic Clustering with Self-Distillation

    Authors: Mohammed Adnan, Yani A. Ioannou, Chuan-Yung Tsai, Graham W. Taylor

    Abstract: Recent advancements in self-supervised learning have reduced the gap between supervised and unsupervised representation learning. However, most self-supervised and deep clustering techniques rely heavily on data augmentation, rendering them ineffective for many learning tasks where insufficient domain knowledge exists for performing augmentation. We propose a new self-distillation based algorithm…

    Submitted 20 December, 2021; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021 Workshop: Self-Supervised Learning - Theory and Practice

  6. arXiv:2010.03533  [pdf, other]

    cs.LG cs.CV

    Gradient Flow in Sparse Neural Networks and How Lottery Tickets Win

    Authors: Utku Evci, Yani A. Ioannou, Cem Keskin, Yann Dauphin

    Abstract: Sparse Neural Networks (NNs) can match the generalization of dense NNs using a fraction of the compute/storage for inference, and also have the potential to enable efficient training. However, naively training unstructured sparse NNs from random initialization results in significantly worse generalization, with the notable exceptions of Lottery Tickets (LTs) and Dynamic Sparse Training (DST). Thro…

    Submitted 15 March, 2022; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: Published in AAAI 2022. Code can be found at https://github.com/google-research/rigl/tree/master/rigl/rigl_tf2

    MSC Class: 68T07

  7. arXiv:1605.07262  [pdf, other]

    cs.LG cs.CV cs.NE

    Measuring Neural Net Robustness with Constraints

    Authors: Osbert Bastani, Yani Ioannou, Leonidas Lampropoulos, Dimitrios Vytiniotis, Aditya Nori, Antonio Criminisi

    Abstract: Despite having high accuracy, neural nets have been shown to be susceptible to adversarial examples, where a small perturbation to an input can cause it to become mislabeled. We propose metrics for measuring the robustness of a neural net and devise a novel algorithm for approximating these metrics based on an encoding of robustness as a linear program. We show how our metrics can be used to evalu…

    Submitted 16 June, 2017; v1 submitted 23 May, 2016; originally announced May 2016.

  8. arXiv:1605.06489  [pdf, other]

    cs.NE cs.CV cs.LG

    Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

    Authors: Yani Ioannou, Duncan Robertson, Roberto Cipolla, Antonio Criminisi

    Abstract: We propose a new method for creating computationally efficient and compact convolutional neural networks (CNNs) using a novel sparse connection structure that resembles a tree root. This allows a significant reduction in computational cost and number of parameters compared to state-of-the-art deep CNNs, without compromising accuracy, by exploiting the sparsity of inter-layer filter dependencies. W…

    Submitted 30 November, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: Updated full version of paper, in full letter paper two-column paper. Includes many textual changes, updated CIFAR10 results, and new analysis of inter/intra-layer correlation

  9. arXiv:1604.06832  [pdf, ps, other]

    cs.CV

    Refining Architectures of Deep Convolutional Neural Networks

    Authors: Sukrit Shankar, Duncan Robertson, Yani Ioannou, Antonio Criminisi, Roberto Cipolla

    Abstract: Deep Convolutional Neural Networks (CNNs) have recently evinced immense success for various image recognition tasks. However, a question of paramount importance is somewhat unanswered in deep learning research - is the selected CNN optimal for the dataset in terms of accuracy and model size? In this paper, we intend to answer this question and introduce a novel strategy that alters the architectur…

    Submitted 22 April, 2016; originally announced April 2016.

    Comments: 9 pages, 6 figures, CVPR 2016

  10. arXiv:1603.01250  [pdf, other]

    cs.CV cs.AI

    Decision Forests, Convolutional Networks and the Models in-Between

    Authors: Yani Ioannou, Duncan Robertson, Darko Zikic, Peter Kontschieder, Jamie Shotton, Matthew Brown, Antonio Criminisi

    Abstract: This paper investigates the connections between two state of the art classifiers: decision forests (DFs, including decision jungles) and convolutional neural networks (CNNs). Decision forests are computationally efficient thanks to their conditional computation property (computation is confined to only a small region of the tree, the nodes along a single branch). CNNs achieve state of the art accu…

    Submitted 3 March, 2016; originally announced March 2016.

    Comments: Microsoft Research Technical Report

    Report number: MSR-TR-2015-58

  11. arXiv:1511.06744  [pdf, other]

    cs.CV cs.LG cs.NE

    Training CNNs with Low-Rank Filters for Efficient Image Classification

    Authors: Yani Ioannou, Duncan Robertson, Jamie Shotton, Roberto Cipolla, Antonio Criminisi

    Abstract: We propose a new method for creating computationally efficient convolutional neural networks (CNNs) by using low-rank representations of convolutional filters. Rather than approximating filters in previously-trained networks with more efficient versions, we learn a set of small basis filters from scratch; during training, the network learns to combine these basis filters into more complex filters…

    Submitted 7 February, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: Published as a conference paper at ICLR 2016. v3: updated ICLR status. v2: Incorporated reviewer's feedback including: Amend Fig. 2 and 5 descriptions to explain that there are no ReLUs within the figures. Fix headings of Table 5 - Fix typo in the sentence at bottom of page 6. Add ref. to Predicting Parameters in Deep Learning. Fix Table 6, GMP-LR and GMP-LR-2x had incorrect numbers of filters

    Journal ref: International Conference on Learning Representations (ICLR), San Juan, Puerto Rico, 2-4 May 2016

  12. Difference of Normals as a Multi-Scale Operator in Unorganized Point Clouds

    Authors: Yani Ioannou, Babak Taati, Robin Harrap, Michael Greenspan

    Abstract: A novel multi-scale operator for unorganized 3D point clouds is introduced. The Difference of Normals (DoN) provides a computationally efficient, multi-scale approach to processing large unorganized 3D point clouds. The application of DoN in the multi-scale filtering of two different real-world outdoor urban LIDAR scene datasets is quantitatively and qualitatively demonstrated. In both datasets th…

    Submitted 8 September, 2012; originally announced September 2012.

    Comments: To be published in proceedings of 3DIMPVT 2012

    Journal ref: Proceedings of the 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission (3DIMPVT)