Skip to main content

Showing 1–31 of 31 results for author: Keller, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.13414  [pdf, other

    cs.LG cs.AI cs.GT

    From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?

    Authors: Yannik Keller, Jannis Blüml, Gopika Sudhakaran, Kristian Kersting

    Abstract: The gameplay of strategic board games such as chess, Go and Hex is often characterized by combinatorial, relational structures -- capturing distinct interactions and non-local patterns -- and not just images. Nonetheless, most common self-play reinforcement learning (RL) approaches simply approximate policy and value functions using convolutional neural networks (CNN). A key feature of CNNs is the… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  2. arXiv:2308.14075  [pdf, other

    cs.CV

    FaceCoresetNet: Differentiable Coresets for Face Set Recognition

    Authors: Gil Shapira, Yosi Keller

    Abstract: In set-based face recognition, we aim to compute the most discriminative descriptor from an unbounded set of images and videos showing a single person. A discriminative descriptor balances two policies when aggregating information from a given set. The first is a quality-based policy: emphasizing high-quality and down-weighting low-quality images. The second is a diversity-based policy: emphasizin… ▽ More

    Submitted 13 December, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: Accepted to AAAI-24

  3. arXiv:2308.11783  [pdf, other

    cs.CV cs.LG

    Coarse-to-Fine Multi-Scene Pose Regression with Transformers

    Authors: Yoli Shavit, Ron Ferens, Yosi Keller

    Abstract: Absolute camera pose regressors estimate the position and orientation of a camera given the captured image alone. Typically, a convolutional backbone with a multi-layer perceptron (MLP) head is trained using images and pose labels to embed a single reference scene at a time. Recently, this scheme was extended to learn multiple scenes by replacing the MLP head with a set of fully connected layers.… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). arXiv admin note: substantial text overlap with arXiv:2103.11468

  4. Image Segmentation via Probabilistic Graph Matching

    Authors: Ayelet Heimowitz, Yosi Keller

    Abstract: This work presents an unsupervised and semi-automatic image segmentation approach where we formulate the segmentation as a inference problem based on unary and pairwise assignment probabilities computed using low-level image cues. The inference is solved via a probabilistic graph matching scheme, which allows rigorous incorporation of low level image cues and automatic tuning of parameters. The pr… ▽ More

    Submitted 13 May, 2023; originally announced May 2023.

    Journal ref: IEEE Transactions on Image Processing, vol. 25, no. 10, pp. 4743-4752, Oct. 2016

  5. arXiv:2305.02745  [pdf, other

    cs.CV

    Age-Invariant Face Embedding using the Wasserstein Distance

    Authors: Eran Dahan, Yosi Keller

    Abstract: In this work, we study face verification in datasets where images of the same individuals exhibit significant age differences. This poses a major challenge for current face recognition and verification techniques. To address this issue, we propose a novel approach that utilizes multitask learning and a Wasserstein distance discriminator to disentangle age and identity embeddings of facial images.… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  6. arXiv:2304.11706  [pdf, other

    cs.CV

    Deep Convolutional Tables: Deep Learning without Convolutions

    Authors: Shay Dekel, Yosi Keller, Aharon Bar-Hillel

    Abstract: We propose a novel formulation of deep networks that do not use dot-product neurons and rely on a hierarchy of voting tables instead, denoted as Convolutional Tables (CT), to enable accelerated CPU-based inference. Convolutional layers are the most time-consuming bottleneck in contemporary deep learning techniques, severely limiting their use in Internet of Things and CPU-based devices. The propos… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: Accepted for publication. IEEE Transactions on Neural Networks and Learning Systems

  7. arXiv:2303.02717  [pdf, other

    cs.CV

    Learning to Localize in Unseen Scenes with Relative Pose Regressors

    Authors: Ofer Idan, Yoli Shavit, Yosi Keller

    Abstract: Relative pose regressors (RPRs) localize a camera by estimating its relative translation and rotation to a pose-labelled reference. Unlike scene coordinate regression and absolute pose regression methods, which learn absolute scene parameters, RPRs can (theoretically) localize in unseen environments, since they only learn the residual pose between camera pairs. In practice, however, the performanc… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

  8. arXiv:2303.02615  [pdf, other

    cs.CV

    Estimating Extreme 3D Image Rotation with Transformer Cross-Attention

    Authors: Shay Dekel, Yosi Keller, Martin Cadik

    Abstract: The estimation of large and extreme image rotation plays a key role in multiple computer vision domains, where the rotated images are related by a limited or a non-overlap** field of view. Contemporary approaches apply convolutional neural networks to compute a 4D correlation volume to estimate the relative rotation between image pairs. In this work, we propose a cross-attention-based approach t… ▽ More

    Submitted 8 March, 2024; v1 submitted 5 March, 2023; originally announced March 2023.

    Journal ref: CVPR 2024

  9. arXiv:2303.02610  [pdf, other

    cs.CV

    HyperPose: Camera Pose Localization using Attention Hypernetworks

    Authors: Ron Ferens, Yosi Keller

    Abstract: In this study, we propose the use of attention hypernetworks in camera pose localization. The dynamic nature of natural scenes, including changes in environment, perspective, and lighting, creates an inherent domain gap between the training and test sets that limits the accuracy of contemporary localization networks. To overcome this issue, we suggest a camera pose regressor that integrates a hype… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

  10. Learning to embed semantic similarity for joint image-text retrieval

    Authors: Noam Malali, Yosi Keller

    Abstract: We present a deep learning approach for learning the joint semantic embeddings of images and captions in a Euclidean space, such that the semantic similarity is approximated by the L2 distances in the embedding space. For that, we introduce a metric learning scheme that utilizes multitask learning to learn the embedding of identical semantic concepts using a center loss. By introducing a different… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023

  11. arXiv:2207.05530  [pdf, other

    cs.CV cs.AI

    Camera Pose Auto-Encoders for Improving Pose Regression

    Authors: Yoli Shavit, Yosi Keller

    Abstract: Absolute pose regressor (APR) networks are trained to estimate the pose of the camera given a captured image. They compute latent image representations from which the camera position and orientation are regressed. APRs provide a different tradeoff between localization accuracy, runtime, and memory, compared to structure-based localization schemes that provide state-of-the-art accuracy. In this wor… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV22

  12. arXiv:2202.12972  [pdf, other

    cs.CV cs.GR cs.LG

    FSGANv2: Improved Subject Agnostic Face Swap** and Reenactment

    Authors: Yuval Nirkin, Yosi Keller, Tal Hassner

    Abstract: We present Face Swap** GAN (FSGAN) for face swap** and reenactment. Unlike previous work, we offer a subject agnostic swap** scheme that can be applied to pairs of faces without requiring training on those faces. We derive a novel iterative deep learning--based approach for face reenactment which adjusts significant pose and expression variations that can be applied to a single image or a vi… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:1908.05932

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2022

  13. arXiv:2201.06164  [pdf, other

    cs.CV

    Synthesis and Reconstruction of Fingerprints using Generative Adversarial Networks

    Authors: Rafael Bouzaglo, Yosi Keller

    Abstract: Deep learning-based models have been shown to improve the accuracy of fingerprint recognition. While these algorithms show exceptional performance, they require large-scale fingerprint datasets for training and evaluation. In this work, we propose a novel fingerprint synthesis and reconstruction framework based on the StyleGan2 architecture, to address the privacy issues related to the acquisition… ▽ More

    Submitted 12 March, 2023; v1 submitted 16 January, 2022; originally announced January 2022.

  14. arXiv:2106.01452  [pdf, other

    cs.CL cs.LG

    BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks

    Authors: Yannik Keller, Jan Mackensen, Steffen Eger

    Abstract: Adversarial attacks expose important blind spots of deep learning systems. While word- and sentence-level attack scenarios mostly deal with finding semantic paraphrases of the input that fool NLP models, character-level attacks typically insert typos into the input stream. It is commonly thought that these are easier to defend via spelling correction modules. In this work, we show that both a stan… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Findings of ACL 2021

  15. AdaptiFont: Increasing Individuals' Reading Speed with a Generative Font Model and Bayesian Optimization

    Authors: Florian Kadner, Yannik Keller, Constantin A. Rothkopf

    Abstract: Digital text has become one of the primary ways of exchanging knowledge, but text needs to be rendered to a screen to be read. We present AdaptiFont, a human-in-the-loop system that is aimed at interactively increasing readability of text displayed on a monitor. To this end, we first learn a generative font space with non-negative matrix factorization from a set of classic fonts. In this space we… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: 18 pages, 11 figures

    Journal ref: In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI '21). Association for Computing Machinery, New York, NY, USA, Article 585, 1-11

  16. arXiv:2103.11477  [pdf, other

    cs.CV cs.AI

    Paying Attention to Activation Maps in Camera Pose Regression

    Authors: Yoli Shavit, Ron Ferens, Yosi Keller

    Abstract: Camera pose regression methods apply a single forward pass to the query image to estimate the camera pose. As such, they offer a fast and light-weight alternative to traditional localization schemes based on image retrieval. Pose regression approaches simultaneously learn two regression tasks, aiming to jointly estimate the camera position and orientation using a single embedding vector computed b… ▽ More

    Submitted 11 April, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

  17. arXiv:2103.11468  [pdf, other

    cs.CV

    Learning Multi-Scene Absolute Pose Regression with Transformers

    Authors: Yoli Shavit, Ron Ferens, Yosi Keller

    Abstract: Absolute camera pose regressors estimate the position and orientation of a camera from the captured image alone. Typically, a convolutional backbone with a multi-layer perceptron head is trained with images and pose labels to embed a single reference scene at a time. Recently, this scheme was extended for learning multiple scenes by replacing the MLP head with a set of fully connected layers. In t… ▽ More

    Submitted 26 July, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

  18. arXiv:2103.11247  [pdf, other

    cs.CV cs.LG

    Attention-Based Multimodal Image Matching

    Authors: Aviad Moreshet, Yosi Keller

    Abstract: We propose an attention-based approach for multimodal image patch matching using a Transformer encoder attending to the feature maps of a multiscale Siamese CNN. Our encoder is shown to efficiently aggregate multiscale image embeddings while emphasizing task-specific appearance-invariant image cues. We also introduce an attention-residual architecture, using a residual connection bypassing the enc… ▽ More

    Submitted 24 September, 2023; v1 submitted 20 March, 2021; originally announced March 2021.

  19. arXiv:2103.09882  [pdf, other

    cs.CV

    Hierarchical Attention-based Age Estimation and Bias Estimation

    Authors: Shakediel Hiba, Yosi Keller

    Abstract: In this work we propose a novel deep-learning approach for age estimation based on face images. We first introduce a dual image augmentation-aggregation approach based on attention. This allows the network to jointly utilize multiple face image augmentations whose embeddings are aggregated by a Transformer-Encoder. The resulting aggregated embedding is shown to better encode the face image attribu… ▽ More

    Submitted 27 September, 2023; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: 11 pages, 7 figures

  20. arXiv:2009.05871  [pdf, other

    cs.CV

    A Unified Approach to Kinship Verification

    Authors: Eran Dahan, Yosi Keller

    Abstract: In this work, we propose a deep learning-based approach for kin verification using a unified multi-task learning scheme where all kinship classes are jointly learned. This allows us to better utilize small training sets that are typical of kin verification. We introduce a novel approach for fusing the embeddings of kin images, to avoid overfitting, which is a common issue in training such networks… ▽ More

    Submitted 12 September, 2020; originally announced September 2020.

  21. arXiv:2008.12262  [pdf, other

    cs.CV cs.LG

    DeepFake Detection Based on the Discrepancy Between the Face and its Context

    Authors: Yuval Nirkin, Lior Wolf, Yosi Keller, Tal Hassner

    Abstract: We propose a method for detecting face swap** and other identity manipulations in single images. Face swap** methods, such as DeepFake, manipulate the face region, aiming to adjust the face to the appearance of its context, while leaving the context unchanged. We show that this modus operandi produces discrepancies between the two regions. These discrepancies offer exploitable telltale signs o… ▽ More

    Submitted 27 August, 2020; originally announced August 2020.

  22. arXiv:1908.05932  [pdf, other

    cs.CV cs.GR cs.LG

    FSGAN: Subject Agnostic Face Swap** and Reenactment

    Authors: Yuval Nirkin, Yosi Keller, Tal Hassner

    Abstract: We present Face Swap** GAN (FSGAN) for face swap** and reenactment. Unlike previous work, FSGAN is subject agnostic and can be applied to pairs of faces without requiring training on those faces. To this end, we describe a number of technical contributions. We derive a novel recurrent neural network (RNN)-based approach for face reenactment which adjusts for both pose and expression variations… ▽ More

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: 2019 IEEE/CVF International Conference on Computer Vision (ICCV)

  23. arXiv:1811.01290  [pdf, ps, other

    cs.CV

    Auto-ML Deep Learning for Rashi Scripts OCR

    Authors: Shahar Mahpod, Yosi Keller

    Abstract: In this work we propose an OCR scheme for manuscripts printed in Rashi font that is an ancient Hebrew font and corresponding dialect used in religious Jewish literature, for more than 600 years. The proposed scheme utilizes a convolution neural network (CNN) for visual inference and Long-Short Term Memory (LSTM) to learn the Rashi scripts dialect. In particular, we derive an AutoML scheme to optim… ▽ More

    Submitted 22 February, 2020; v1 submitted 3 November, 2018; originally announced November 2018.

    Comments: The paper is under consideration at Pattern Recognition Letters

  24. arXiv:1810.12941  [pdf, other

    cs.CV

    Joint detection and matching of feature points in multimodal images

    Authors: Elad Ben Baruch, Yosi Keller

    Abstract: In this work, we propose a novel Convolutional Neural Network (CNN) architecture for the joint detection and matching of feature points in images acquired by different sensors using a single forward pass. The resulting feature detector is tightly coupled with the feature descriptor, in contrast to classical approaches (SIFT, etc.), where the detection phase precedes and differs from computing the… ▽ More

    Submitted 16 June, 2021; v1 submitted 30 October, 2018; originally announced October 2018.

  25. arXiv:1809.08493  [pdf, ps, other

    cs.CV

    SelfKin: Self Adjusted Deep Model For Kinship Verification

    Authors: Eran Dahan, Yosi Keller

    Abstract: One of the unsolved challenges in the field of biometrics and face recognition is Kinship Verification. This problem aims to understand if two people are family-related and how (sisters, brothers, etc.) Solving this problem can give rise to varied tasks and applications. In the area of homeland security (HLS) it is crucial to auto-detect if the person questioned is related to a wanted suspect, In… ▽ More

    Submitted 22 September, 2018; originally announced September 2018.

  26. Facial Landmarks Localization using Cascaded Neural Networks

    Authors: Shahar Mahpod, Rig Das, Emanuele Maiorana, Yosi Keller, Patrizio Campisi

    Abstract: The accurate localization of facial landmarks is at the core of face analysis tasks, such as face recognition and facial expression analysis, to name a few. In this work, we propose a novel localization approach based on a deep learning architecture that utilizes cascaded subnetworks with convolutional neural network units. The cascaded units of the first subnetwork estimate heatmap-based encoding… ▽ More

    Submitted 19 July, 2021; v1 submitted 3 May, 2018; originally announced May 2018.

  27. Multi-scale Processing of Noisy Images using Edge Preservation Losses

    Authors: Nati Ofir, Yosi Keller

    Abstract: Noisy images processing is a fundamental task of computer vision. The first example is the detection of faint edges in noisy images, a challenging problem studied in the last decades. A recent study introduced a fast method to detect faint edges in the highest accuracy among all the existing approaches. Their complexity is nearly linear in the image's pixels and their runtime is seconds for a nois… ▽ More

    Submitted 21 March, 2019; v1 submitted 26 March, 2018; originally announced March 2018.

  28. Deep Multi-Spectral Registration Using Invariant Descriptor Learning

    Authors: Nati Ofir, Shai Silberstein, Hila Levi, Dani Rozenbaum, Yosi Keller, Sharon Duvdevani Bar

    Abstract: In this paper, we introduce a novel deep-learning method to align cross-spectral images. Our approach relies on a learned descriptor which is invariant to different spectra. Multi-modal images of the same scene capture different signals and therefore their registration is challenging and it is not solved by classic approaches. To that end, we developed a feature-based approach that solves the visi… ▽ More

    Submitted 23 May, 2018; v1 submitted 16 January, 2018; originally announced January 2018.

  29. Registration and Fusion of Multi-Spectral Images Using a Novel Edge Descriptor

    Authors: Nati Ofir, Shai Silberstein, Dani Rozenbaum, Yosi Keller, Sharon Duvdevani Bar

    Abstract: In this paper we introduce a fully end-to-end approach for multi-spectral image registration and fusion. Our method for fusion combines images from different spectral channels into a single fused image by different approaches for low and high frequency signals. A prerequisite of fusion is a stage of geometric alignment between the spectral bands, commonly referred to as registration. Unfortunately… ▽ More

    Submitted 28 May, 2018; v1 submitted 5 November, 2017; originally announced November 2017.

  30. arXiv:1412.2067  [pdf, ps, other

    cs.CV math.GM

    An algorithm for improving Non-Local Means operators via low-rank approximation

    Authors: Victor May, Yosi Keller, Nir Sharon, Yoel Shkolnisky

    Abstract: We present a method for improving a Non Local Means operator by computing its low-rank approximation. The low-rank operator is constructed by applying a filter to the spectrum of the original Non Local Means operator. This results in an operator which is less sensitive to noise while preserving important properties of the original operator. The method is efficiently implemented based on Chebyshev… ▽ More

    Submitted 20 November, 2014; originally announced December 2014.

  31. arXiv:1102.4258  [pdf, other

    cs.CV

    SHREC 2011: robust feature detection and description benchmark

    Authors: E. Boyer, A. M. Bronstein, M. M. Bronstein, B. Bustos, T. Darom, R. Horaud, I. Hotz, Y. Keller, J. Keustermans, A. Kovnatsky, R. Litman, J. Reininghaus, I. Sipiran, D. Smeets, P. Suetens, D. Vandermeulen, A. Zaharescu, V. Zobel

    Abstract: Feature-based approaches have recently become very popular in computer vision and image analysis applications, and are becoming a promising direction in shape retrieval. SHREC'11 robust feature detection and description benchmark simulates the feature detection and description stages of feature-based shape retrieval algorithms. The benchmark tests the performance of shape feature detectors and des… ▽ More

    Submitted 21 February, 2011; originally announced February 2011.

    Comments: This is a full version of the SHREC'11 report published in 3DOR