Skip to main content

Showing 1–20 of 20 results for author: Oramas, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.18130  [pdf, other

    cs.LG cs.CV

    The Trifecta: Three simple techniques for training deeper Forward-Forward networks

    Authors: Thomas Dooms, Ing Jyh Tsang, Jose Oramas

    Abstract: Modern machine learning models are able to outperform humans on a variety of non-trivial tasks. However, as the complexity of the models increases, they consume significant amounts of power and still struggle to generalize effectively to unseen data. Local learning, which focuses on updating subsets of a model's parameters at a time, has emerged as a promising technique to address these issues. Re… ▽ More

    Submitted 12 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    MSC Class: 68T07

  2. arXiv:2305.10121  [pdf, other

    cs.CV

    FICNN: A Framework for the Interpretation of Deep Convolutional Neural Networks

    Authors: Hamed Behzadi-Khormouji, José Oramas

    Abstract: With the continue development of Convolutional Neural Networks (CNNs), there is a growing concern regarding representations that they encode internally. Analyzing these internal representations is referred to as model interpretation. While the task of model explanation, justifying the predictions of such models, has been studied extensively; the task of model interpretation has received less atten… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  3. arXiv:2305.05349  [pdf, other

    cs.LG cs.CV

    Towards the Characterization of Representations Learned via Capsule-based Network Architectures

    Authors: Saja AL-Tawalbeh, José Oramas

    Abstract: Capsule Networks (CapsNets) have been re-introduced as a more compact and interpretable alternative to standard deep neural networks. While recent efforts have proved their compression capabilities, to date, their interpretability properties have not been fully assessed. Here, we conduct a systematic and principled study towards assessing the interpretability of these types of networks. Moreover,… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: This paper consist of 7 pages including 8 figures. This paper concern about interpretation of capsule network

    MSC Class: ACM-class

  4. arXiv:2302.11244  [pdf, other

    cs.CV cs.LG

    Considering Layerwise Importance in the Lottery Ticket Hypothesis

    Authors: Benjamin Vandersmissen, Jose Oramas

    Abstract: The Lottery Ticket Hypothesis (LTH) showed that by iteratively training a model, removing connections with the lowest global weight magnitude and rewinding the remaining connections, sparse networks can be extracted. This global comparison removes context information between connections within a layer. Here we study means for recovering some of this layer distributional context and generalise th… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  5. On The Coherence of Quantitative Evaluation of Visual Explanations

    Authors: Benjamin Vandersmissen, Jose Oramas

    Abstract: Recent years have shown an increased development of methods for justifying the predictions of neural networks through visual explanations. These explanations usually take the form of heatmaps which assign a saliency (or relevance) value to each pixel of the input image that expresses how relevant the pixel is for the prediction of a label. Complementing this development, evaluation methods have… ▽ More

    Submitted 19 February, 2024; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: Accepted at CVIU

  6. arXiv:2301.06874  [pdf, ps, other

    cs.CV

    Training Methods of Multi-label Prediction Classifiers for Hyperspectral Remote Sensing Images

    Authors: Salma Haidar, José Oramas

    Abstract: With their combined spectral depth and geometric resolution, hyperspectral remote sensing images embed a wealth of complex, non-linear information that challenges traditional computer vision techniques. Yet, deep learning methods known for their representation learning capabilities prove more suitable for handling such complexities. Unlike applications that focus on single-label, pixel-level class… ▽ More

    Submitted 26 October, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: 1- Added references. 2- updated methodology figure and added new figures to visualise the different training schemes and 3- Correcting typos 4- Revised introduction, no change in results or discussion

  7. Deep set conditioned latent representations for action recognition

    Authors: Akash Singh, Tom De Schepper, Kevin Mets, Peter Hellinckx, Jose Oramas, Steven Latre

    Abstract: In recent years multi-label, multi-class video action recognition has gained significant popularity. While reasoning over temporally connected atomic actions is mundane for intelligent species, standard artificial neural networks (ANN) still struggle to classify them. In the real world, atomic actions often temporally connect to form more complex composite actions. The challenge lies in recognisin… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Conference VISAPP 2022, 11 pages,5 figures, 2 Tables, 6 plots

    Journal ref: In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, ISBN 978-989-758-555-5; ISSN 2184-4321, year 2022, pages 456-466

  8. arXiv:2104.14375  [pdf, other

    cs.CV

    MinMaxCAM: Improving object coverage for CAM-basedWeakly Supervised Object Localization

    Authors: Kaili Wang, Jose Oramas, Tinne Tuytelaars

    Abstract: One of the most common problems of weakly supervised object localization is that of inaccurate object coverage. In the context of state-of-the-art methods based on Class Activation Map**, this is caused either by localization maps which focus, exclusively, on the most discriminative region of the objects of interest or by activations occurring in background regions. To address these two problems… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  9. arXiv:2104.07954  [pdf, other

    cs.CV

    Towards Human-Understandable Visual Explanations:Imperceptible High-frequency Cues Can Better Be Removed

    Authors: Kaili Wang, Jose Oramas, Tinne Tuytelaars

    Abstract: Explainable AI (XAI) methods focus on explaining what a neural network has learned - in other words, identifying the features that are the most influential to the prediction. In this paper, we call them "distinguishing features". However, whether a human can make sense of the generated explanation also depends on the perceptibility of these features to humans. To make sure an explanation is human-… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  10. arXiv:2010.15974  [pdf, other

    cs.CV cs.CR cs.LG

    Can the state of relevant neurons in a deep neural networks serve as indicators for detecting adversarial attacks?

    Authors: Roger Granda, Tinne Tuytelaars, Jose Oramas

    Abstract: We present a method for adversarial attack detection based on the inspection of a sparse set of neurons. We follow the hypothesis that adversarial attacks introduce imperceptible perturbations in the input and that these perturbations change the state of neurons relevant for the concepts modelled by the attacked model. Therefore, monitoring the status of these neurons would enable the detection of… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

  11. arXiv:2009.07827  [pdf, other

    cs.CV

    Multiple Exemplars-based Hallucinationfor Face Super-resolution and Editing

    Authors: Kaili Wang, Jose Oramas, Tinne Tuytelaars

    Abstract: Given a really low-resolution input image of a face (say 16x16 or 8x8 pixels), the goal of this paper is to reconstruct a high-resolution version thereof. This, by itself, is an ill-posed problem, as the high-frequency information is missing in the low-resolution input and needs to be hallucinated, based on prior knowledge about the image content. Rather than relying on a generic face prior, in th… ▽ More

    Submitted 14 January, 2021; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: accepted in ACCV 2020

  12. arXiv:2001.08559   

    cs.LG cs.CV stat.ML

    Information Compensation for Deep Conditional Generative Networks

    Authors: Zehao Wang, Kaili Wang, Tinne Tuytelaars, Jose Oramas

    Abstract: In recent years, unsupervised/weakly-supervised conditional generative adversarial networks (GANs) have achieved many successes on the task of modeling and generating data. However, one of their weaknesses lies in their poor ability to separate, or disentangle, the different factors that characterize the representation encoded in their latent space. To address this issue, we propose a novel struct… ▽ More

    Submitted 6 March, 2022; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: I think my previous work during master study is too naive

  13. arXiv:1909.05690  [pdf, other

    cs.CV cs.AI

    In Defense of LSTMs for Addressing Multiple Instance Learning Problems

    Authors: Kaili Wang, Jose Oramas, Tinne Tuytelaars

    Abstract: LSTMs have a proven track record in analyzing sequential data. But what about unordered instance bags, as found under a Multiple Instance Learning (MIL) setting? While not often used for this, we show LSTMs excell under this setting too. In addition, we show thatLSTMs are capable of indirectly capturing instance-level information us-ing only bag-level annotations. Thus, they can be used to learn i… ▽ More

    Submitted 14 January, 2021; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: accepted in ACCV 2020 (oral)

  14. arXiv:1812.02134  [pdf, other

    cs.CV

    An Unpaired Shape Transforming Method for Image Translation and Cross-Domain Retrieval

    Authors: Kaili Wang, Liqian Ma, Jose Oramas, Luc Van Gool, Tinne Tuytelaars

    Abstract: We address the problem of unpaired geometric image-to-image translation. Rather than transferring the style of an image as a whole, our goal is to translate the geometry of an object as depicted in different domains while preserving its appearance characteristics. Our model is trained in an unpaired fashion, i.e. without the need of paired images during training. It performs all steps of the shape… ▽ More

    Submitted 18 August, 2021; v1 submitted 5 December, 2018; originally announced December 2018.

    Comments: This manuscript is a pre-print currently under review at the Elsevier Journal Computer Vision and Image Under-standing (CVIU)

  15. arXiv:1712.06302  [pdf, other

    cs.CV cs.LG stat.ML

    Visual Explanation by Interpretation: Improving Visual Feedback Capabilities of Deep Neural Networks

    Authors: Jose Oramas, Kaili Wang, Tinne Tuytelaars

    Abstract: Interpretation and explanation of deep models is critical towards wide adoption of systems that rely on them. In this paper, we propose a novel scheme for both interpretation as well as explanation in which, given a pretrained model, we automatically identify internal features relevant for the set of classes considered by the model, without relying on additional annotations. We interpret the model… ▽ More

    Submitted 8 March, 2019; v1 submitted 18 December, 2017; originally announced December 2017.

    Comments: Accepted at International Conference on Learning Representations (ICLR) 2019. Project website: http://homes.esat.kuleuven.be/~joramas/projects/visualExplanationByInterpretation

  16. arXiv:1707.02905  [pdf, other

    cs.CV

    An Analysis of Human-centered Geolocation

    Authors: Kaili Wang, Yu-Hui Huang, Jose Oramas, Luc Van Gool, Tinne Tuytelaars

    Abstract: Online social networks contain a constantly increasing amount of images - most of them focusing on people. Due to cultural and climate factors, fashion trends and physical appearance of individuals differ from city to city. In this paper we investigate to what extent such cues can be exploited in order to infer the geographic location, i.e. the city, where a picture was taken. We conduct a user st… ▽ More

    Submitted 31 January, 2018; v1 submitted 10 July, 2017; originally announced July 2017.

    Comments: WACV'18

  17. Context-based Object Viewpoint Estimation: A 2D Relational Approach

    Authors: Jose Oramas, Luc De Raedt, Tinne Tuytelaars

    Abstract: The task of object viewpoint estimation has been a challenge since the early days of computer vision. To estimate the viewpoint (or pose) of an object, people have mostly looked at object intrinsic features, such as shape or appearance. Surprisingly, informative features provided by other, extrinsic elements in the scene, have so far mostly been ignored. At the same time, contextual cues have been… ▽ More

    Submitted 21 April, 2017; originally announced April 2017.

    Comments: Computer Vision and Image Understanding (CVIU)

  18. arXiv:1607.06356  [pdf, other

    cs.CV

    Reasoning about Body-Parts Relations for Sign Language Recognition

    Authors: Marc Martínez-Camarena, Jose Oramas, Mario Montagud-Climent, Tinne Tuytelaars

    Abstract: Over the years, hand gesture recognition has been mostly addressed considering hand trajectories in isolation. However, in most sign languages, hand gestures are defined on a particular context (body region). We propose a pipeline to perform sign language recognition which models hand movements in the context of other parts of the body captured in the 3D space using the MS Kinect sensor. In additi… ▽ More

    Submitted 21 July, 2016; originally announced July 2016.

    Comments: Under Review ( 15 Pages: 13 Figures, 6 Tables )

  19. arXiv:1604.00036  [pdf, other

    cs.CV

    Modeling Visual Compatibility through Hierarchical Mid-level Elements

    Authors: Jose Oramas, Tinne Tuytelaars

    Abstract: In this paper we present a hierarchical method to discover mid-level elements with the objective of modeling visual compatibility between objects. At the base-level, our method identifies patterns of CNN activations with the aim of modeling different variations/styles in which objects of the classes of interest may occur. At the top-level, the proposed method discovers patterns of co-occurring act… ▽ More

    Submitted 31 March, 2016; originally announced April 2016.

    Comments: 29 pages, 19 Figures

  20. Rank Pooling for Action Recognition

    Authors: Basura Fernando, Efstratios Gavves, Jose Oramas, Amir Ghodrati, Tinne Tuytelaars

    Abstract: We propose a function-based temporal pooling method that captures the latent structure of the video sequence data - e.g. how frame-level features evolve over time in a video. We show how the parameters of a function that has been fit to the video data can serve as a robust new video representation. As a specific example, we learn a pooling function via ranking machines. By learning to rank the fra… ▽ More

    Submitted 15 May, 2016; v1 submitted 6 December, 2015; originally announced December 2015.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence