Skip to main content

Showing 1–17 of 17 results for author: Van Noord, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.06486  [pdf, other

    cs.LG cs.CV

    GO4Align: Group Optimization for Multi-Task Alignment

    Authors: Jiayi Shen, Cheems Wang, Zehao Xiao, Nanne Van Noord, Marcel Worring

    Abstract: This paper proposes \textit{GO4Align}, a multi-task optimization approach that tackles task imbalance by explicitly aligning the optimization across tasks. To achieve this, we design an adaptive group risk minimization strategy, compromising two crucial techniques in implementation: (i) dynamical group assignment, which clusters similar tasks based on task interactions; (ii) risk-guided group indi… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  2. Find the Cliffhanger: Multi-Modal Trailerness in Soap Operas

    Authors: Carlo Bretti, Pascal Mettes, Hendrik Vincent Koops, Daan Odijk, Nanne van Noord

    Abstract: Creating a trailer requires carefully picking out and piecing together brief enticing moments out of a longer video, making it a challenging and time-consuming task. This requires selecting moments based on both visual and dialogue information. We introduce a multi-modal method for predicting the trailerness to assist editors in selecting trailer-worthy moments from long-form videos. We present re… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: MMM24

  3. arXiv:2310.06633  [pdf, other

    cs.CV cs.CY

    Blind Dates: Examining the Expression of Temporality in Historical Photographs

    Authors: Alexandra Barancová, Melvin Wevers, Nanne van Noord

    Abstract: This paper explores the capacity of computer vision models to discern temporal information in visual content, focusing specifically on historical photographs. We investigate the dating of images using OpenCLIP, an open-source implementation of CLIP, a multi-modal language and vision model. Our experiment consists of three steps: zero-shot classification, fine-tuning, and analysis of visual content… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  4. arXiv:2309.02401  [pdf, other

    cs.CV cs.MM

    Prototype-based Dataset Comparison

    Authors: Nanne van Noord

    Abstract: Dataset summarisation is a fruitful approach to dataset inspection. However, when applied to a single dataset the discovery of visual concepts is restricted to those most prominent. We argue that a comparative approach can expand upon this paradigm to enable richer forms of dataset inspection that go beyond the most prominent concepts. To enable dataset comparison we present a module that learns c… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: To be presented at ICCV 2023

  5. arXiv:2301.00436  [pdf, other

    cs.CV cs.AI cs.LG

    Hierarchical Explanations for Video Action Recognition

    Authors: Sadaf Gulshad, Teng Long, Nanne van Noord

    Abstract: To interpret deep neural networks, one main approach is to dissect the visual input and find the prototypical parts responsible for the classification. However, existing methods often ignore the hierarchical relationship between these prototypes, and thus can not explain semantic concepts at both higher level (e.g., water sports) and lower level (e.g., swimming). In this paper inspired by human co… ▽ More

    Submitted 3 April, 2023; v1 submitted 1 January, 2023; originally announced January 2023.

  6. arXiv:2211.07460  [pdf, ps, other

    cs.CY cs.AI

    An Analytics of Culture: Modeling Subjectivity, Scalability, Contextuality, and Temporality

    Authors: Nanne van Noord, Melvin Wevers, Tobias Blanke, Julia Noordegraaf, Marcel Worring

    Abstract: There is a bidirectional relationship between culture and AI; AI models are increasingly used to analyse culture, thereby sha** our understanding of culture. On the other hand, the models are trained on collections of cultural artifacts thereby implicitly, and not always correctly, encoding expressions of culture. This creates a tension that both limits the use of AI for analysing culture and le… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: To be presented at Cultures in AI/AI in Culture workshop at NeurIPS 2022

  7. arXiv:2203.05898  [pdf, other

    cs.CV

    Hyperbolic Image Segmentation

    Authors: Mina GhadimiAtigh, Julian Schoep, Erman Acar, Nanne van Noord, Pascal Mettes

    Abstract: For image segmentation, the current standard is to perform pixel-level optimization and inference in Euclidean output embedding spaces through linear hyperplanes. In this work, we show that hyperbolic manifolds provide a valuable alternative for image segmentation and propose a tractable formulation of hierarchical pixel-level classification in hyperbolic space. Hyperbolic Image Segmentation opens… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: accepted to CVPR 2022

  8. arXiv:2202.01747  [pdf, other

    cs.CV

    The Met Dataset: Instance-level Recognition for Artworks

    Authors: Nikolaos-Antonios Ypsilantis, Noa Garcia, Guangxing Han, Sarah Ibrahimi, Nanne Van Noord, Giorgos Tolias

    Abstract: This work introduces a dataset for large-scale instance-level recognition in the domain of artworks. The proposed benchmark exhibits a number of different challenges such as large inter-class similarity, long tail distribution, and many classes. We rely on the open access collection of The Met museum to form a large training set of about 224k classes, where each class corresponds to a museum exhib… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  9. arXiv:2112.11294  [pdf, other

    cs.IR cs.LG cs.MM

    Extending CLIP for Category-to-image Retrieval in E-commerce

    Authors: Mariya Hendriksen, Maurits Bleeker, Svitlana Vakulenko, Nanne van Noord, Ernst Kuiper, Maarten de Rijke

    Abstract: E-commerce provides rich multimodal data that is barely leveraged in practice. One aspect of this data is a category tree that is being used in search and recommendation. However, in practice, during a user's session there is often a mismatch between a textual and a visual representation of a given category. Motivated by the problem, we introduce the task of category-to-image retrieval in e-commer… ▽ More

    Submitted 4 January, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: 15 pages, accepted as a full paper at ECIR 2022

  10. arXiv:2111.13546  [pdf, other

    cs.CV

    Inside Out Visual Place Recognition

    Authors: Sarah Ibrahimi, Nanne van Noord, Tim Alpherts, Marcel Worring

    Abstract: Visual Place Recognition (VPR) is generally concerned with localizing outdoor images. However, localizing indoor scenes that contain part of an outdoor scene can be of large value for a wide range of applications. In this paper, we introduce Inside Out Visual Place Recognition (IOVPR), a task aiming to localize images based on outdoor scenes visible through windows. For this task we present the ne… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: Accepted at British Machine Vision Conference (BMVC) 2021

  11. arXiv:1909.01218  [pdf, other

    cs.CV cs.HC cs.LG cs.SD eess.AS

    Translating Visual Art into Music

    Authors: Maximilian Müller-Eberstein, Nanne van Noord

    Abstract: The Synesthetic Variational Autoencoder (SynVAE) introduced in this research is able to learn a consistent map** between visual and auditive sensory modalities in the absence of paired datasets. A quantitative evaluation on MNIST as well as the Behance Artistic Media dataset (BAM) shows that SynVAE is capable of retaining sufficient information content during the translation while maintaining cr… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: Accepted for ICCV 2019 Workshop on Fashion, Art and Design

  12. arXiv:1908.02711  [pdf, other

    cs.CV

    I Bet You Are Wrong: Gambling Adversarial Networks for Structured Semantic Segmentation

    Authors: Laurens Samson, Nanne van Noord, Olaf Booij, Michael Hofmann, Efstratios Gavves, Mohsen Ghafoorian

    Abstract: Adversarial training has been recently employed for realizing structured semantic segmentation, in which the aim is to preserve higher-level scene structural consistencies in dense predictions. However, as we show, value-based discrimination between the predictions from the segmentation network and ground-truth annotations can hinder the training process from learning to improve structural qualiti… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: 13 pages, 8 figures

  13. arXiv:1904.03011  [pdf, other

    cs.CV

    Learning Task Relatedness in Multi-Task Learning for Images in Context

    Authors: Gjorgji Strezoski, Nanne van Noord, Marcel Worring

    Abstract: Multimedia applications often require concurrent solutions to multiple tasks. These tasks hold clues to each-others solutions, however as these relations can be complex this remains a rarely utilized property. When task relations are explicitly defined based on domain knowledge multi-task learning (MTL) offers such concurrent solutions, while exploiting relatedness between multiple tasks performed… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: To appear in ICMR 2019 (Oral + Lightning Talk + Poster)

  14. arXiv:1903.12117  [pdf, other

    cs.CV

    Many Task Learning with Task Routing

    Authors: Gjorgji Strezoski, Nanne van Noord, Marcel Worring

    Abstract: Typical multi-task learning (MTL) methods rely on architectural adjustments and a large trainable parameter set to jointly optimize over several tasks. However, when the number of tasks increases so do the complexity of the architectural adjustments and resource requirements. In this paper, we introduce a method which applies a conditional feature-wise transformation over the convolutional activat… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: 8 Pages, 5 Figures, 2 Tables

  15. arXiv:1801.05585  [pdf, other

    cs.CV

    Light-weight pixel context encoders for image inpainting

    Authors: Nanne van Noord, Eric Postma

    Abstract: In this work we propose Pixel Content Encoders (PCE), a light-weight image inpainting model, capable of generating novel con-tent for large missing regions in images. Unlike previously presented convolutional neural network based models, our PCE model has an order of magnitude fewer trainable parameters. Moreover, by incorporating dilated convolutions we are able to preserve fine grained spatial i… ▽ More

    Submitted 17 January, 2018; originally announced January 2018.

  16. arXiv:1602.01255  [pdf, other

    cs.CV

    Learning scale-variant and scale-invariant features for deep image classification

    Authors: Nanne van Noord, Eric Postma

    Abstract: Convolutional Neural Networks (CNNs) require large image corpora to be trained on classification tasks. The variation in image resolutions, sizes of objects and patterns depicted, and image scales, hampers CNN training and performance, because the task-relevant information varies over spatial scales. Previous work attempting to deal with such scale variations focused on encouraging scale-invariant… ▽ More

    Submitted 13 May, 2016; v1 submitted 3 February, 2016; originally announced February 2016.

  17. arXiv:1506.05929  [pdf, other

    cs.CV

    Exploring the influence of scale on artist attribution

    Authors: Nanne van Noord, Eric Postma

    Abstract: Previous work has shown that the artist of an artwork can be identified by use of computational methods that analyse digital images. However, the digitised artworks are often investigated at a coarse scale discarding many of the important details that may define an artist's style. In recent years high resolution images of artworks have become available, which, combined with increased processing po… ▽ More

    Submitted 19 June, 2015; originally announced June 2015.