Skip to main content

Showing 1–19 of 19 results for author: Pintea, S L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.01557  [pdf, other

    cs.CV

    Deep Continuous Networks

    Authors: Nergis Tomen, Silvia L. Pintea, Jan C. van Gemert

    Abstract: CNNs and computational models of biological vision share some fundamental principles, which opened new avenues of research. However, fruitful cross-field research is hampered by conventional CNN architectures being based on spatially and depthwise discrete representations, which cannot accommodate certain aspects of biological complexity such as continuously varying receptive field sizes and dynam… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Presented at ICML 2021

    Journal ref: In International Conference on Machine Learning 2021 Jul 1 (pp. 10324-10335). PMLR

  2. arXiv:2308.10603  [pdf, other

    cs.CV

    A step towards understanding why classification helps regression

    Authors: Silvia L. Pintea, Yancong Lin, Jouke Dijkstra, Jan C. van Gemert

    Abstract: A number of computer vision deep regression approaches report improved results when adding a classification loss to the regression loss. Here, we explore why this is useful in practice and when it is beneficial. To do so, we start from precisely controlled dataset variations and data samplings and find that the effect of adding a classification loss is the most pronounced for regression with imbal… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV-2023

  3. arXiv:2308.05533  [pdf, other

    cs.CV

    Is there progress in activity progress prediction?

    Authors: Frans de Boer, Jan C. van Gemert, Jouke Dijkstra, Silvia L. Pintea

    Abstract: Activity progress prediction aims to estimate what percentage of an activity has been completed. Currently this is done with machine learning approaches, trained and evaluated on complicated and realistic video datasets. The videos in these datasets vary drastically in length and appearance. And some of the activities have unanticipated developments, making activity progression difficult to estima… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCVw-2023 (AI for Creative Video Editing and Understanding, ICCV workshop 2023)

  4. arXiv:2308.04770  [pdf, other

    cs.CV

    Objects do not disappear: Video object detection by single-frame object location anticipation

    Authors: Xin Liu, Fatemeh Karimi Nejadasl, Jan C. van Gemert, Olaf Booij, Silvia L. Pintea

    Abstract: Objects in videos are typically characterized by continuous smooth motion. We exploit continuous smooth motion in three ways. 1) Improved accuracy by using object motion as an additional source of supervision, which we obtain by anticipating object locations from a static keyframe. 2) Improved efficiency by only doing the expensive feature computations on a small subset of all frames. Because neig… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  5. arXiv:2203.08586  [pdf, other

    cs.CV

    Deep vanishing point detection: Geometric priors make dataset variations vanish

    Authors: Yancong Lin, Ruben Wiersma, Silvia L. Pintea, Klaus Hildebrandt, Elmar Eisemann, Jan C. van Gemert

    Abstract: Deep learning has improved vanishing point detection in images. Yet, deep networks require expensive annotated datasets trained on costly hardware and do not generalize to even slightly different domains, and minor problem variants. Here, we address these issues by injecting deep vanishing point detection networks with prior knowledge. This prior knowledge no longer needs to be learned from data,… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: CVPR2022, code available at https://github.com/yanconglin/VanishingPoint_HoughTransform_GaussianSphere

  6. arXiv:2112.03406  [pdf, other

    cs.LG cs.CV

    Equal Bits: Enforcing Equally Distributed Binary Network Weights

    Authors: Yunqiang Li, Silvia L. Pintea, Jan C. van Gemert

    Abstract: Binary networks are extremely efficient as they use only two symbols to define the network: $\{+1,-1\}$. One can make the prior distribution of these symbols a design choice. The recent IR-Net of Qin et al. argues that imposing a Bernoulli distribution with equal priors (equal bit ratios) over the binary weights leads to maximum entropy and thus minimizes information loss. However, prior work cann… ▽ More

    Submitted 6 March, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

  7. arXiv:2111.06660  [pdf, other

    cs.CV

    Frequency learning for structured CNN filters with Gaussian fractional derivatives

    Authors: Nikhil Saldanha, Silvia L. Pintea, Jan C. van Gemert, Nergis Tomen

    Abstract: Frequency information lies at the base of discriminating between textures, and therefore between different objects. Classical CNN architectures limit the frequency learning through fixed filter sizes, and lack a way of explicitly controlling it. Here, we build on the structured receptive field filters with Gaussian derivative basis. Yet, rather than using predetermined derivative orders, which typ… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: Accepted at BMVC 2021

  8. Resolution learning in deep convolutional networks using scale-space theory

    Authors: Silvia L. Pintea, Nergis Tomen, Stanley F. Goes, Marco Loog, Jan C. van Gemert

    Abstract: Resolution in deep convolutional neural networks (CNNs) is typically bounded by the receptive field size through filter sizes, and subsampling layers or strided convolutions on feature maps. The optimal resolution may vary significantly depending on the dataset. Modern CNNs hard-code their resolution hyper-parameters in the network architecture which makes tuning such hyper-parameters cumbersome.… ▽ More

    Submitted 24 October, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Preprint accepted by IEEE Transactions on Image Processing, 2021 (TIP). Link to final published article: https://ieeexplore.ieee.org/abstract/document/9552550

    Journal ref: IEEE Transactions on Image Processing, vol. 30, pp. 8342-8353, 2021

  9. arXiv:2103.15395  [pdf, other

    cs.CV

    No frame left behind: Full Video Action Recognition

    Authors: Xin Liu, Silvia L. Pintea, Fatemeh Karimi Nejadasl, Olaf Booij, Jan C. van Gemert

    Abstract: Not all video frames are equally informative for recognizing an action. It is computationally infeasible to train deep networks on all video frames when actions develop over hundreds of frames. A common heuristic is uniformly sampling a small number of video frames and using these to recognize the action. Instead, here we propose full video action recognition and consider all video frames. To make… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021

  10. arXiv:2007.09493  [pdf, other

    cs.CV

    Deep Hough-Transform Line Priors

    Authors: Yancong Lin, Silvia L. Pintea, Jan C. van Gemert

    Abstract: Classical work on line segment detection is knowledge-based; it uses carefully designed geometric priors using either image gradients, pixel grou**s, or Hough transform variants. Instead, current deep learning methods do away with all prior knowledge and replace priors by training deep networks on large manually annotated datasets. Here, we reduce the dependency on labeled data by building on th… ▽ More

    Submitted 18 July, 2020; originally announced July 2020.

    Comments: ECCV 2020, code online: https://github.com/yanconglin/Deep-Hough-Transform-Line-Priors

  11. arXiv:2004.07629  [pdf, other

    cs.CV cs.LG

    Top-Down Networks: A coarse-to-fine reimagination of CNNs

    Authors: Ioannis Lelekas, Nergis Tomen, Silvia L. Pintea, Jan C. van Gemert

    Abstract: Biological vision adopts a coarse-to-fine information processing pathway, from initial visual detection and binding of salient features of a visual scene, to the enhanced and preferential processing given relevant stimuli. On the contrary, CNNs employ a fine-to-coarse processing, moving from local, edge-detecting filters to more global ones extracting abstract representations of the input. In this… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: CVPR Workshop Deep Vision 2020

  12. arXiv:1809.03258  [pdf, other

    cs.CV

    Using phase instead of optical flow for action recognition

    Authors: Omar Hommos, Silvia L. Pintea, Pascal S. M. Mettes, Jan C. van Gemert

    Abstract: Currently, the most common motion representation for action recognition is optical flow. Optical flow is based on particle tracking which adheres to a Lagrangian perspective on dynamics. In contrast to the Lagrangian perspective, the Eulerian model of dynamics does not track, but describes local changes. For video, an Eulerian phase-based motion representation, using complex steerable filters, has… ▽ More

    Submitted 14 September, 2018; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: ECCV-2018 Workshop on "What is Optical Flow for?"

  13. arXiv:1809.03218  [pdf, other

    cs.CV

    Hand-tremor frequency estimation in videos

    Authors: Silvia L. Pintea, Jian Zheng, Xilin Li, Paulina J. M. Bank, Jacobus J. van Hilten, Jan C. van Gemert

    Abstract: We focus on the problem of estimating human hand-tremor frequency from input RGB video data. Estimating tremors from video is important for non-invasive monitoring, analyzing and diagnosing patients suffering from motor-disorders such as Parkinson's disease. We consider two approaches for hand-tremor frequency estimation: (a) a Lagrangian approach where we detect the hand at every frame in the vid… ▽ More

    Submitted 10 September, 2018; originally announced September 2018.

    Comments: Best paper at ECCV-2018 Workshop on Observing and Understanding Hands in Action

  14. arXiv:1805.07170  [pdf, other

    cs.CV

    Recurrent knowledge distillation

    Authors: Silvia L. Pintea, Yue Liu, Jan C. van Gemert

    Abstract: Knowledge distillation compacts deep networks by letting a small student network learn from a large teacher network. The accuracy of knowledge distillation recently benefited from adding residual layers. We propose to reduce the size of the student network even further by recasting multiple residual layers in the teacher network into a single recurrent student layer. We propose three variants of a… ▽ More

    Submitted 18 May, 2018; originally announced May 2018.

    Comments: International Conference on Image Processing (ICIP), 2018

  15. arXiv:1803.06962  [pdf, other

    cs.CV

    Featureless: Bypassing feature extraction in action categorization

    Authors: Silvia L. Pintea, Pascal S. Mettes, Jan C. van Gemert, Arnold W. M. Smeulders

    Abstract: This method introduces an efficient manner of learning action categories without the need of feature estimation. The approach starts from low-level values, in a similar style to the successful CNN methods. However, rather than extracting general image features, we learn to predict specific video representations from raw video data. The benefit of such an approach is that at the same computational… ▽ More

    Submitted 19 March, 2018; originally announced March 2018.

    Comments: Published in the proceedings of the International Conference on Image Processing (ICIP), 2016

  16. arXiv:1803.06952  [pdf, other

    cs.LG cs.CV stat.ML

    Asymmetric kernel in Gaussian Processes for learning target variance

    Authors: Silvia L. Pintea, Jan C. van Gemert, Arnold W. M. Smeulders

    Abstract: This work incorporates the multi-modality of the data distribution into a Gaussian Process regression model. We approach the problem from a discriminative perspective by learning, jointly over the training data, the target space variance in the neighborhood of a certain sample through metric learning. We start by using data centers rather than all training samples. Subsequently, each center select… ▽ More

    Submitted 19 March, 2018; originally announced March 2018.

    Comments: Accepted in Pattern Recognition Letters, 2018

  17. arXiv:1803.06951  [pdf, other

    cs.CV

    Deja Vu: Motion Prediction in Static Images

    Authors: Silvia L. Pintea, Jan C. van Gemert, Arnold W. M. Smeulders

    Abstract: This paper proposes motion prediction in single still images by learning it from a set of videos. The building assumption is that similar motion is characterized by similar appearance. The proposed method learns local motion patterns given a specific appearance and adds the predicted motion in a number of applications. This work (i) introduces a novel method to predict motion from appearance in a… ▽ More

    Submitted 21 March, 2018; v1 submitted 19 March, 2018; originally announced March 2018.

    Comments: Published in the proceedings of the European Conference on Computer Vision (ECCV), 2014

  18. arXiv:1704.04186  [pdf, other

    cs.CV

    Video Acceleration Magnification

    Authors: Yichao Zhang, Silvia L. Pintea, Jan C. van Gemert

    Abstract: The ability to amplify or reduce subtle image changes over time is useful in contexts such as video editing, medical video analysis, product quality control and sports. In these contexts there is often large motion present which severely distorts current video amplification methods that magnify change linearly. In this work we propose a method to cope with large motions while still magnifying smal… ▽ More

    Submitted 22 April, 2017; v1 submitted 13 April, 2017; originally announced April 2017.

    Comments: Accepted paper at CVPR 2017. Project webpage: http://acceleration-magnification.github.io/

  19. arXiv:1609.01693  [pdf, other

    cs.CV

    Making a Case for Learning Motion Representations with Phase

    Authors: S. L. Pintea, J. C. van Gemert

    Abstract: This work advocates Eulerian motion representation learning over the current standard Lagrangian optical flow model. Eulerian motion is well captured by using phase, as obtained by decomposing the image through a complex-steerable pyramid. We discuss the gain of Eulerian motion in a set of practical use cases: (i) action recognition, (ii) motion prediction in static images, (iii) motion transfer i… ▽ More

    Submitted 8 September, 2016; v1 submitted 6 September, 2016; originally announced September 2016.

    Comments: ECCV 2016 Workshop on Brave new ideas for motion representations in videos