Skip to main content

Showing 1–13 of 13 results for author: Rudinac, S

.
  1. arXiv:2405.13372  [pdf, other

    cs.LG

    Ada-HGNN: Adaptive Sampling for Scalable Hypergraph Neural Networks

    Authors: Shuai Wang, David W. Zhang, Jia-Hong Huang, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring

    Abstract: Hypergraphs serve as an effective model for depicting complex connections in various real-world scenarios, from social to biological networks. The development of Hypergraph Neural Networks (HGNNs) has emerged as a valuable method to manage the intricate associations in data, though scalability is a notable challenge due to memory limitations. In this study, we introduce a new adaptive sampling str… ▽ More

    Submitted 14 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  2. Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models

    Authors: Hongyi Zhu, Jia-Hong Huang, Stevan Rudinac, Evangelos Kanoulas

    Abstract: Image search stands as a pivotal task in multimedia and computer vision, finding applications across diverse domains, ranging from internet search to medical diagnostics. Conventional image search systems operate by accepting textual or visual queries, retrieving the top-relevant candidate results from the database. However, prevalent methods often rely on single-turn procedures, introducing poten… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  3. arXiv:2309.13092  [pdf, other

    cs.LG cs.SI

    Prototype-Enhanced Hypergraph Learning for Heterogeneous Information Networks

    Authors: Shuai Wang, Jiayi Shen, Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring

    Abstract: The variety and complexity of relations in multimedia data lead to Heterogeneous Information Networks (HINs). Capturing the semantics from such networks requires approaches capable of utilizing the full richness of the HINs. Existing methods for modeling HINs employ techniques originally designed for graph neural networks, and HINs decomposition analysis, like using manually predefined metapaths.… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  4. arXiv:2307.02578  [pdf, other

    cs.LG

    Multimodal Temporal Fusion Transformers Are Good Product Demand Forecasters

    Authors: Maarten Sukel, Stevan Rudinac, Marcel Worring

    Abstract: Multimodal demand forecasting aims at predicting product demand utilizing visual, textual, and contextual information. This paper proposes a method for multimodal product demand forecasting using convolutional, graph-based, and transformer-based architectures. Traditional approaches to demand forecasting rely on historical demand, product categories, and additional contextual information such as s… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  5. arXiv:2208.14295  [pdf, other

    cs.CV cs.MM

    PanorAMS: Automatic Annotation for Detecting Objects in Urban Context

    Authors: Inske Groenen, Stevan Rudinac, Marcel Worring

    Abstract: Large collections of geo-referenced panoramic images are freely available for cities across the globe, as well as detailed maps with location and meta-data on a great variety of urban objects. They provide a potentially rich source of information on urban objects, but manual annotation for object detection is costly, laborious and difficult. Can we utilize such multimedia sources to automatically… ▽ More

    Submitted 31 August, 2022; v1 submitted 30 August, 2022; originally announced August 2022.

  6. arXiv:2109.10683  [pdf, other

    cs.LG cs.MM

    Adaptive Neural Message Passing for Inductive Learning on Hypergraphs

    Authors: Devanshu Arya, Deepak K. Gupta, Stevan Rudinac, Marcel Worring

    Abstract: Graphs are the most ubiquitous data structures for representing relational datasets and performing inferences in them. They model, however, only pairwise relations between nodes and are not designed for encoding the higher-order relations. This drawback is mitigated by hypergraphs, in which an edge can connect an arbitrary number of nodes. Most hypergraph learning approaches convert the hypergraph… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

  7. arXiv:2105.08190  [pdf, other

    cs.CV cs.LG

    Graph Neural Networks for Knowledge Enhanced Visual Representation of Paintings

    Authors: Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Marcel Worring, Nachoem Wijnberg

    Abstract: We propose ArtSAGENet, a novel multimodal architecture that integrates Graph Neural Networks (GNNs) and Convolutional Neural Networks (CNNs), to jointly learn visual and semantic-based artistic representations. First, we illustrate the significant advantages of multi-task learning for fine art analysis and argue that it is conceptually a much more appropriate setting in the fine art domain than th… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

  8. arXiv:2010.04558  [pdf, other

    cs.LG stat.ML

    HyperSAGE: Generalizing Inductive Representation Learning on Hypergraphs

    Authors: Devanshu Arya, Deepak K. Gupta, Stevan Rudinac, Marcel Worring

    Abstract: Graphs are the most ubiquitous form of structured data representation used in machine learning. They model, however, only pairwise relations between nodes and are not designed for encoding the higher-order relations found in many real-world datasets. To model such complex relations, hypergraphs have proven to be a natural representation. Learning the node representations in a hypergraph is more co… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

  9. arXiv:2001.11461  [pdf

    cs.SI stat.AP

    Echo Chambers Exist! (But They're Full of Opposing Views)

    Authors: Jonathan Bright, Nahema Marchal, Bharath Ganesh, Stevan Rudinac

    Abstract: The theory of echo chambers, which suggests that online political discussions take place in conditions of ideological homogeneity, has recently gained popularity as an explanation for patterns of political polarization and radicalization observed in many democratic countries. However, while micro-level experimental work has shown evidence that individuals may gravitate towards information that sup… ▽ More

    Submitted 30 January, 2020; originally announced January 2020.

  10. arXiv:1909.09252  [pdf, other

    cs.LG cs.CV cs.DC stat.ML

    HyperLearn: A Distributed Approach for Representation Learning in Datasets With Many Modalities

    Authors: Devanshu Arya, Stevan Rudinac, Marcel Worring

    Abstract: Multimodal datasets contain an enormous amount of relational information, which grows exponentially with the introduction of new modalities. Learning representations in such a scenario is inherently complex due to the presence of multiple heterogeneous information channels. These channels can encode both (a) inter-relations between the items of different modalities and (b) intra-relations between… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

  11. arXiv:1905.02430  [pdf, other

    cs.IR cs.CL cs.SI

    Interactive Search and Exploration in Online Discussion Forums Using Multimodal Embeddings

    Authors: Iva Gornishka, Stevan Rudinac, Marcel Worring

    Abstract: In this paper we present a novel interactive multimodal learning system, which facilitates search and exploration in large networks of social multimedia users. It allows the analyst to identify and select users of interest, and to find similar users in an interactive learning setting. Our approach is based on novel multimodal representations of users, words and concepts, which we simultaneously le… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

  12. arXiv:1904.13349  [pdf, other

    cs.LG stat.ML

    Multimodal Classification of Urban Micro-Events

    Authors: Maarten Sukel, Stevan Rudinac, Marcel Worring

    Abstract: In this paper we seek methods to effectively detect urban micro-events. Urban micro-events are events which occur in cities, have limited geographical coverage and typically affect only a small group of citizens. Because of their scale these are difficult to identify in most data sources. However, by using citizen sensing to gather data, detecting them becomes feasible. The data gathered by citize… ▽ More

    Submitted 30 April, 2019; originally announced April 2019.

  13. arXiv:1904.08689  [pdf, other

    cs.MM cs.IR

    Exquisitor: Interactive Learning at Large

    Authors: Björn Þór Jónsson, Omar Shahbaz Khan, Hanna Ragnarsdóttir, Þórhildur Þorleiksdóttir, Jan Zahálka, Stevan Rudinac, Gylfi Þór Guðmundsson, Laurent Amsaleg, Marcel Worring

    Abstract: Increasing scale is a dominant trend in today's multimedia collections, which especially impacts interactive applications. To facilitate interactive exploration of large multimedia collections, new approaches are needed that are capable of learning on the fly new analytic categories based on the visual and textual content. To facilitate general use on standard desktops, laptops, and mobile devices… ▽ More

    Submitted 17 July, 2019; v1 submitted 18 April, 2019; originally announced April 2019.