Showing 1–37 of 37 results for author: van Gemert, J C

  1. arXiv:2402.01557

    cs.CV

    Deep Continuous Networks

    Authors: Nergis Tomen, Silvia L. Pintea, Jan C. van Gemert

    Abstract: CNNs and computational models of biological vision share some fundamental principles, which opened new avenues of research. However, fruitful cross-field research is hampered by conventional CNN architectures being based on spatially and depthwise discrete representations, which cannot accommodate certain aspects of biological complexity such as continuously varying receptive field sizes and dynam…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Presented at ICML 2021

    Journal ref: In International Conference on Machine Learning 2021 Jul 1 (pp. 10324-10335). PMLR

  2. arXiv:2308.10603

    cs.CV

    A step towards understanding why classification helps regression

    Authors: Silvia L. Pintea, Yancong Lin, Jouke Dijkstra, Jan C. van Gemert

    Abstract: A number of computer vision deep regression approaches report improved results when adding a classification loss to the regression loss. Here, we explore why this is useful in practice and when it is beneficial. To do so, we start from precisely controlled dataset variations and data samplings and find that the effect of adding a classification loss is the most pronounced for regression with imbal…

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV-2023

  3. arXiv:2308.05533

    cs.CV

    Is there progress in activity progress prediction?

    Authors: Frans de Boer, Jan C. van Gemert, Jouke Dijkstra, Silvia L. Pintea

    Abstract: Activity progress prediction aims to estimate what percentage of an activity has been completed. Currently this is done with machine learning approaches, trained and evaluated on complicated and realistic video datasets. The videos in these datasets vary drastically in length and appearance. And some of the activities have unanticipated developments, making activity progression difficult to estima…

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCVw-2023 (AI for Creative Video Editing and Understanding, ICCV workshop 2023)

  4. arXiv:2308.04770

    cs.CV

    Objects do not disappear: Video object detection by single-frame object location anticipation

    Authors: Xin Liu, Fatemeh Karimi Nejadasl, Jan C. van Gemert, Olaf Booij, Silvia L. Pintea

    Abstract: Objects in videos are typically characterized by continuous smooth motion. We exploit continuous smooth motion in three ways. 1) Improved accuracy by using object motion as an additional source of supervision, which we obtain by anticipating object locations from a static keyframe. 2) Improved efficiency by only doing the expensive feature computations on a small subset of all frames. Because neig…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  5. arXiv:2307.08483

    cs.CV

    Differentiable Transportation Pruning

    Authors: Yunqiang Li, Jan C. van Gemert, Torsten Hoefler, Bert Moons, Evangelos Eleftheriou, Bram-Ernst Verhoef

    Abstract: Deep learning algorithms are increasingly employed at the edge. However, edge devices are resource constrained and thus require efficient deployment of deep neural networks. Pruning methods are a key tool for edge deployment as they can improve storage, compute, memory bandwidth, and energy usage. In this paper we propose a novel accurate pruning technique that allows precise control over the outp…

    Submitted 31 July, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: ICCV 2023

  6. arXiv:2205.02887

    cs.CV cs.AI cs.LG

    Evaluating Context for Deep Object Detectors

    Authors: Osman Semih Kayhan, Jan C. van Gemert

    Abstract: Which object detector is suitable for your context sensitive task? Deep object detectors exploit scene context for recognition differently. In this paper, we group object detectors into 3 categories in terms of context use: no context by cropping the input (RCNN), partial context by cropping the featuremap (two-stage methods) and full context without any cropping (single-stage methods). We systema…

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: 4 pages, 5 figures

  7. arXiv:2203.08586

    cs.CV

    Deep vanishing point detection: Geometric priors make dataset variations vanish

    Authors: Yancong Lin, Ruben Wiersma, Silvia L. Pintea, Klaus Hildebrandt, Elmar Eisemann, Jan C. van Gemert

    Abstract: Deep learning has improved vanishing point detection in images. Yet, deep networks require expensive annotated datasets trained on costly hardware and do not generalize to even slightly different domains, and minor problem variants. Here, we address these issues by injecting deep vanishing point detection networks with prior knowledge. This prior knowledge no longer needs to be learned from data,…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: CVPR2022, code available at https://github.com/yanconglin/VanishingPoint_HoughTransform_GaussianSphere

  8. arXiv:2112.03406

    cs.LG cs.CV

    Equal Bits: Enforcing Equally Distributed Binary Network Weights

    Authors: Yunqiang Li, Silvia L. Pintea, Jan C. van Gemert

    Abstract: Binary networks are extremely efficient as they use only two symbols to define the network: $\{+1,-1\}$. One can make the prior distribution of these symbols a design choice. The recent IR-Net of Qin et al. argues that imposing a Bernoulli distribution with equal priors (equal bit ratios) over the binary weights leads to maximum entropy and thus minimizes information loss. However, prior work cann…

    Submitted 6 March, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

  9. arXiv:2111.06660

    cs.CV

    Frequency learning for structured CNN filters with Gaussian fractional derivatives

    Authors: Nikhil Saldanha, Silvia L. Pintea, Jan C. van Gemert, Nergis Tomen

    Abstract: Frequency information lies at the base of discriminating between textures, and therefore between different objects. Classical CNN architectures limit the frequency learning through fixed filter sizes, and lack a way of explicitly controlling it. Here, we build on the structured receptive field filters with Gaussian derivative basis. Yet, rather than using predetermined derivative orders, which typ…

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: Accepted at BMVC 2021

  10. arXiv:2110.08059

    cs.CV cs.LG

    FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes

    Authors: David W. Romero, Robert-Jan Bruintjes, Jakub M. Tomczak, Erik J. Bekkers, Mark Hoogendoorn, Jan C. van Gemert

    Abstract: When designing Convolutional Neural Networks (CNNs), one must select the size of the convolutional kernels before training. Recent works show CNNs benefit from different kernel sizes at different layers, but exploring all possible combinations is unfeasible in practice. A more efficient approach is to learn the kernel size during training. However, existing works that learn the kernel size h…

    Submitted 17 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: First two authors contributed equally to this work

  11. arXiv:2108.07533

    cs.CV

    Investigating transformers in the decomposition of polygonal shapes as point collections

    Authors: Andrea Alfieri, Yancong Lin, Jan C. van Gemert

    Abstract: Transformers can generate predictions in two ways: 1. auto-regressively, by conditioning each sequence element on the previous ones, or 2. by directly producing an output sequence in parallel. While research has mostly explored this difference on sequential tasks in NLP, we study the difference between auto-regressive and parallel prediction on visual set prediction tasks, and in particular…

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: DLGC@ICCVW 2021

  12. arXiv:2108.05137

    cs.CV

    Zero-Shot Day-Night Domain Adaptation with a Physics Prior

    Authors: Attila Lengyel, Sourav Garg, Michael Milford, Jan C. van Gemert

    Abstract: We explore the zero-shot setting for day-night domain adaptation. The traditional domain adaptation setting is to train on one domain and adapt to the target domain by exploiting unlabeled data samples from the test set. As gathering relevant test data is expensive and sometimes even impossible, we remove any reliance on test data imagery and instead exploit a visual inductive prior derived from p…

    Submitted 11 October, 2021; v1 submitted 11 August, 2021; originally announced August 2021.

    Comments: ICCV 2021 Oral presentation. Code, datasets and supplementary material: https://github.com/Attila94/CIConv

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 4399-4409

  13. arXiv:2106.11280

    cs.CV

    The Arm-Swing Is Discriminative in Video Gait Recognition for Athlete Re-Identification

    Authors: Yapkan Choi, Yeshwanth Napolean, Jan C. van Gemert

    Abstract: In this paper we evaluate running gait as an attribute for video person re-identification in a long-distance running event. We show that running gait recognition achieves competitive performance compared to appearance-based approaches in the cross-camera retrieval task and that gait and appearance features are complementary to each other. For gait, the arm swing during running is less distinguisha…

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: ICIP 2021

  14. arXiv:2106.04914

    cs.CV

    Exploiting Learned Symmetries in Group Equivariant Convolutions

    Authors: Attila Lengyel, Jan C. van Gemert

    Abstract: Group Equivariant Convolutions (GConvs) enable convolutional neural networks to be equivariant to various transformation groups, but at an additional parameter and compute cost. We investigate the filter parameters learned by GConvs and find certain conditions under which they become highly redundant. We show that GConvs can be efficiently decomposed into depthwise separable convolutions while pre…

    Submitted 9 June, 2021; originally announced June 2021.

  15. Resolution learning in deep convolutional networks using scale-space theory

    Authors: Silvia L. Pintea, Nergis Tomen, Stanley F. Goes, Marco Loog, Jan C. van Gemert

    Abstract: Resolution in deep convolutional neural networks (CNNs) is typically bounded by the receptive field size through filter sizes, and subsampling layers or strided convolutions on feature maps. The optimal resolution may vary significantly depending on the dataset. Modern CNNs hard-code their resolution hyper-parameters in the network architecture which makes tuning such hyper-parameters cumbersome.…

    Submitted 24 October, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Preprint accepted by IEEE Transactions on Image Processing, 2021 (TIP). Link to final published article: https://ieeexplore.ieee.org/abstract/document/9552550

    Journal ref: IEEE Transactions on Image Processing, vol. 30, pp. 8342-8353, 2021

  16. arXiv:2106.02523

    cs.CV cs.AI cs.LG

    Hallucination In Object Detection -- A Study In Visual Part Verification

    Authors: Osman Semih Kayhan, Bart Vredebregt, Jan C. van Gemert

    Abstract: We show that object detectors can hallucinate and detect missing objects; potentially even accurately localized at their expected, but non-existing, position. This is particularly problematic for applications that rely on visual part verification: detecting if an object part is present or absent. We show how popular object detectors hallucinate objects in a visual part verification task and introd…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: ICIP 2021

  17. arXiv:2103.15395

    cs.CV

    No frame left behind: Full Video Action Recognition

    Authors: Xin Liu, Silvia L. Pintea, Fatemeh Karimi Nejadasl, Olaf Booij, Jan C. van Gemert

    Abstract: Not all video frames are equally informative for recognizing an action. It is computationally infeasible to train deep networks on all video frames when actions develop over hundreds of frames. A common heuristic is uniformly sampling a small number of video frames and using these to recognize the action. Instead, here we propose full video action recognition and consider all video frames. To make…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021

  18. arXiv:2011.13202

    cs.CV cs.GR cs.LG eess.IV

    t-EVA: Time-Efficient t-SNE Video Annotation

    Authors: Soroosh Poorgholi, Osman Semih Kayhan, Jan C. van Gemert

    Abstract: Video understanding has received more attention in the past few years due to the availability of several large-scale video datasets. However, annotating large-scale video datasets is cost-intensive. In this work, we propose a time-efficient video annotation method using spatio-temporal feature similarity and t-SNE dimensionality reduction to speed up the annotation process massively. Placing the…

    Submitted 26 November, 2020; originally announced November 2020.

    Comments: ICPR 2020 (HCAU)

  19. arXiv:2010.10451

    cs.CV cs.LG

    Tilting at windmills: Data augmentation for deep pose estimation does not help with occlusions

    Authors: Rafal Pytel, Osman Semih Kayhan, Jan C. van Gemert

    Abstract: Occlusion degrades the performance of human pose estimation. In this paper, we introduce targeted keypoint and body part occlusion attacks. The effects of the attacks are systematically analyzed on the best performing methods. In addition, we propose occlusion specific data augmentation techniques against keypoint and part attacks. Our extensive experiments show that human pose estimation methods…

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: ICPR 2020

  20. arXiv:2007.09493

    cs.CV

    Deep Hough-Transform Line Priors

    Authors: Yancong Lin, Silvia L. Pintea, Jan C. van Gemert

    Abstract: Classical work on line segment detection is knowledge-based; it uses carefully designed geometric priors using either image gradients, pixel groupings, or Hough transform variants. Instead, current deep learning methods do away with all prior knowledge and replace priors by training deep networks on large manually annotated datasets. Here, we reduce the dependency on labeled data by building on th…

    Submitted 18 July, 2020; originally announced July 2020.

    Comments: ECCV 2020, code online: https://github.com/yanconglin/Deep-Hough-Transform-Line-Priors

  21. arXiv:2004.07629

    cs.CV cs.LG

    Top-Down Networks: A coarse-to-fine reimagination of CNNs

    Authors: Ioannis Lelekas, Nergis Tomen, Silvia L. Pintea, Jan C. van Gemert

    Abstract: Biological vision adopts a coarse-to-fine information processing pathway, from initial visual detection and binding of salient features of a visual scene, to the enhanced and preferential processing given relevant stimuli. On the contrary, CNNs employ a fine-to-coarse processing, moving from local, edge-detecting filters to more global ones extracting abstract representations of the input. In this…

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: CVPR Workshop Deep Vision 2020

  22. arXiv:2003.07064

    cs.CV cs.LG eess.IV

    On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location

    Authors: Osman Semih Kayhan, Jan C. van Gemert

    Abstract: In this paper we challenge the common assumption that convolutional layers in modern CNNs are translation invariant. We show that CNNs can and will exploit the absolute spatial location by learning filters that respond exclusively to particular absolute locations by exploiting image boundary effects. Because modern CNN filters have a huge receptive field, these boundary effects operate even far f…

    Submitted 30 May, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

    Comments: CVPR 2020. (Minor revision on Figure 4, arguments unchanged.)

  23. arXiv:1909.03552

    cs.CV

    Cross Domain Image Matching in Presence of Outliers

    Authors: Xin Liu, Seyran Khademi, Jan C. van Gemert

    Abstract: Cross domain image matching between image collections from different source and target domains is challenging in times of deep learning due to i) limited variation of image conditions in a training set, ii) lack of paired-image labels during training, iii) the existence of outliers, which means the image matching domains do not fully overlap. To this end, we propose an end-to-end architecture that can match…

    Submitted 8 September, 2019; originally announced September 2019.

    Comments: ICCV Workshop on Transferring and Adaptive Source Knowledge in Computer Vision (TASK-CV) 2019

  24. arXiv:1809.03258

    cs.CV

    Using phase instead of optical flow for action recognition

    Authors: Omar Hommos, Silvia L. Pintea, Pascal S. M. Mettes, Jan C. van Gemert

    Abstract: Currently, the most common motion representation for action recognition is optical flow. Optical flow is based on particle tracking which adheres to a Lagrangian perspective on dynamics. In contrast to the Lagrangian perspective, the Eulerian model of dynamics does not track, but describes local changes. For video, an Eulerian phase-based motion representation, using complex steerable filters, has…

    Submitted 14 September, 2018; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: ECCV-2018 Workshop on "What is Optical Flow for?"

  25. arXiv:1809.03218

    cs.CV

    Hand-tremor frequency estimation in videos

    Authors: Silvia L. Pintea, Jian Zheng, Xilin Li, Paulina J. M. Bank, Jacobus J. van Hilten, Jan C. van Gemert

    Abstract: We focus on the problem of estimating human hand-tremor frequency from input RGB video data. Estimating tremors from video is important for non-invasive monitoring, analyzing and diagnosing patients suffering from motor-disorders such as Parkinson's disease. We consider two approaches for hand-tremor frequency estimation: (a) a Lagrangian approach where we detect the hand at every frame in the vid…

    Submitted 10 September, 2018; originally announced September 2018.

    Comments: Best paper at ECCV-2018 Workshop on Observing and Understanding Hands in Action

  26. arXiv:1805.07170

    cs.CV

    Recurrent knowledge distillation

    Authors: Silvia L. Pintea, Yue Liu, Jan C. van Gemert

    Abstract: Knowledge distillation compacts deep networks by letting a small student network learn from a large teacher network. The accuracy of knowledge distillation recently benefited from adding residual layers. We propose to reduce the size of the student network even further by recasting multiple residual layers in the teacher network into a single recurrent student layer. We propose three variants of a…

    Submitted 18 May, 2018; originally announced May 2018.

    Comments: International Conference on Image Processing (ICIP), 2018

  27. arXiv:1803.06962

    cs.CV

    Featureless: Bypassing feature extraction in action categorization

    Authors: Silvia L. Pintea, Pascal S. Mettes, Jan C. van Gemert, Arnold W. M. Smeulders

    Abstract: This method introduces an efficient manner of learning action categories without the need of feature estimation. The approach starts from low-level values, in a similar style to the successful CNN methods. However, rather than extracting general image features, we learn to predict specific video representations from raw video data. The benefit of such an approach is that at the same computational…

    Submitted 19 March, 2018; originally announced March 2018.

    Comments: Published in the proceedings of the International Conference on Image Processing (ICIP), 2016

  28. arXiv:1803.06952

    cs.LG cs.CV stat.ML

    Asymmetric kernel in Gaussian Processes for learning target variance

    Authors: Silvia L. Pintea, Jan C. van Gemert, Arnold W. M. Smeulders

    Abstract: This work incorporates the multi-modality of the data distribution into a Gaussian Process regression model. We approach the problem from a discriminative perspective by learning, jointly over the training data, the target space variance in the neighborhood of a certain sample through metric learning. We start by using data centers rather than all training samples. Subsequently, each center select…

    Submitted 19 March, 2018; originally announced March 2018.

    Comments: Accepted in Pattern Recognition Letters, 2018

  29. arXiv:1803.06951

    cs.CV

    Deja Vu: Motion Prediction in Static Images

    Authors: Silvia L. Pintea, Jan C. van Gemert, Arnold W. M. Smeulders

    Abstract: This paper proposes motion prediction in single still images by learning it from a set of videos. The building assumption is that similar motion is characterized by similar appearance. The proposed method learns local motion patterns given a specific appearance and adds the predicted motion in a number of applications. This work (i) introduces a novel method to predict motion from appearance in a…

    Submitted 21 March, 2018; v1 submitted 19 March, 2018; originally announced March 2018.

    Comments: Published in the proceedings of the European Conference on Computer Vision (ECCV), 2014

  30. arXiv:1706.09556

    cs.NE cs.MM cs.SD

    Vision-based Detection of Acoustic Timed Events: a Case Study on Clarinet Note Onsets

    Authors: A. Bazzica, J. C. van Gemert, C. C. S. Liem, A. Hanjalic

    Abstract: Acoustic events often have a visual counterpart. Knowledge of visual information can aid the understanding of complex auditory scenes, even when only a stereo mixdown is available in the audio domain, e.g., identifying which musicians are playing in large musical ensembles. In this paper, we consider a vision-based approach to note onset detection. As a case study we focus on challenging, real-world…

    Submitted 28 June, 2017; originally announced June 2017.

    Comments: Proceedings of the First International Conference on Deep Learning and Music, Anchorage, US, May, 2017 (arXiv:1706.08675v1 [cs.NE])

    Report number: DLM/2017/8

    MSC Class: 68Txx

    ACM Class: C.1.3; H.5.1

    Journal ref: Proc of the First Int Workshop on Deep Learning and Music. Anchorage, US. 1(1). pp 31-36 (2017)

  31. arXiv:1704.04186

    cs.CV

    Video Acceleration Magnification

    Authors: Yichao Zhang, Silvia L. Pintea, Jan C. van Gemert

    Abstract: The ability to amplify or reduce subtle image changes over time is useful in contexts such as video editing, medical video analysis, product quality control and sports. In these contexts there is often large motion present which severely distorts current video amplification methods that magnify change linearly. In this work we propose a method to cope with large motions while still magnifying smal…

    Submitted 22 April, 2017; v1 submitted 13 April, 2017; originally announced April 2017.

    Comments: Accepted paper at CVPR 2017. Project webpage: http://acceleration-magnification.github.io/

  32. arXiv:1703.06971

    cs.CV cs.LG

    Active Decision Boundary Annotation with Deep Generative Models

    Authors: Miriam W. Huijser, Jan C. van Gemert

    Abstract: This paper is on active learning where the goal is to reduce the data annotation burden by interacting with a (human) oracle during training. Standard active learning methods ask the oracle to annotate data samples. Instead, we take a profoundly different approach: we ask for annotations of the decision boundary. We achieve this using a deep generative model to create novel instances along a 1d li…

    Submitted 2 August, 2017; v1 submitted 20 March, 2017; originally announced March 2017.

    Comments: ICCV 2017

  33. arXiv:1610.01801

    cs.CV

    Searching Scenes by Abstracting Things

    Authors: Svetlana Kordumova, Jan C. van Gemert, Cees G. M. Snoek, Arnold W. M. Smeulders

    Abstract: In this paper we propose to represent a scene as an abstraction of 'things'. We start from 'things' as generated by modern object proposals, and we investigate their immediately observable properties: position, size, aspect ratio and color, and those only. Where the recent successes and excitement of the field lie in object identification, we represent the scene composition independent of object i…

    Submitted 6 October, 2016; originally announced October 2016.

  34. arXiv:1609.01693

    cs.CV

    Making a Case for Learning Motion Representations with Phase

    Authors: S. L. Pintea, J. C. van Gemert

    Abstract: This work advocates Eulerian motion representation learning over the current standard Lagrangian optical flow model. Eulerian motion is well captured by using phase, as obtained by decomposing the image through a complex-steerable pyramid. We discuss the gain of Eulerian motion in a set of practical use cases: (i) action recognition, (ii) motion prediction in static images, (iii) motion transfer i…

    Submitted 8 September, 2016; v1 submitted 6 September, 2016; originally announced September 2016.

    Comments: ECCV 2016 Workshop on Brave new ideas for motion representations in videos

  35. arXiv:1604.07602

    cs.CV

    Spot On: Action Localization from Pointly-Supervised Proposals

    Authors: Pascal Mettes, Jan C. van Gemert, Cees G. M. Snoek

    Abstract: We strive for spatio-temporal localization of actions in videos. The state-of-the-art relies on action proposals at test time and selects the best one with a classifier trained on carefully annotated box annotations. Annotating action boxes in video is cumbersome, tedious, and error prone. Rather than annotating boxes, we propose to annotate actions in video with points on a sparse subset of frame…

    Submitted 25 July, 2016; v1 submitted 26 April, 2016; originally announced April 2016.

    Report number: ECCV/2016/10

  36. arXiv:1510.06939

    cs.CV

    Objects2action: Classifying and localizing actions without any video example

    Authors: Mihir Jain, Jan C. van Gemert, Thomas Mensink, Cees G. M. Snoek

    Abstract: The goal of this paper is to recognize actions in video without the need for examples. Different from traditional zero-shot approaches we do not demand the design and specification of attribute classifiers and class-to-attribute mappings to allow for transfer from seen classes to unseen classes. Our key contribution is objects2action, a semantic word embedding that is spanned by a skip-gram model…

    Submitted 23 October, 2015; originally announced October 2015.

  37. arXiv:1510.04908

    cs.CV

    No Spare Parts: Sharing Part Detectors for Image Categorization

    Authors: Pascal Mettes, Jan C. van Gemert, Cees G. M. Snoek

    Abstract: This work aims for image categorization using a representation of distinctive parts. Different from existing part-based work, we argue that parts are naturally shared between image categories and should be modeled as such. We motivate our approach with a quantitative and qualitative analysis by backtracking where selected parts come from. Our analysis shows that in addition to the category parts d…

    Submitted 12 July, 2016; v1 submitted 16 October, 2015; originally announced October 2015.