Showing 1–45 of 45 results for author: Avidan, S

Searching in archive cs.
  1. arXiv:2406.00384

    cs.CV

    CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation

    Authors: Matan Rusanovsky, Or Hirschorn, Shai Avidan

    Abstract: Conventional 2D pose estimation models are constrained by their design to specific object categories. This limits their applicability to predefined objects. To overcome these limitations, category-agnostic pose estimation (CAPE) emerged as a solution. CAPE aims to facilitate keypoint localization for diverse object categories using a unified model, which can generalize from minimal annotated suppo…

    Submitted 1 June, 2024; originally announced June 2024.

  2. arXiv:2404.03349

    cs.CV

    VF-NeRF: Viewshed Fields for Rigid NeRF Registration

    Authors: Leo Segre, Shai Avidan

    Abstract: 3D scene registration is a fundamental problem in computer vision that seeks the best 6-DoF alignment between two scenes. This problem was extensively investigated in the case of point clouds and meshes, but there has been relatively limited work regarding Neural Radiance Fields (NeRF). In this paper, we consider the problem of rigid registration between two NeRFs when the position of the original…

    Submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2312.12540

    cs.CV

    Regularized Newton Raphson Inversion for Text-to-Image Diffusion Models

    Authors: Dvir Samuel, Barak Meiri, Nir Darshan, Shai Avidan, Gal Chechik, Rami Ben-Ari

    Abstract: Diffusion inversion is the problem of taking an image and a text prompt that describes it and finding a noise latent that would generate the image. Most current inversion techniques operate by approximately solving an implicit equation and may converge slowly or yield poor reconstructed images. Here, we formulate the problem as finding the roots of an implicit equation and design a method to solve…

    Submitted 27 June, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  4. arXiv:2312.11334

    cs.CV cs.GR

    Optimize and Reduce: A Top-Down Approach for Image Vectorization

    Authors: Or Hirschorn, Amir Jevnisek, Shai Avidan

    Abstract: Vector image representation is a popular choice when editability and flexibility in resolution are desired. However, most images are only available in raster form, making raster-to-vector image conversion (vectorization) an important task. Classical methods for vectorization are either domain-specific or yield an abundance of shapes which limits editability and interpretability. Learning-based met…

    Submitted 18 December, 2023; originally announced December 2023.

  5. arXiv:2311.17891

    cs.CV

    Pose Anything: A Graph-Based Approach for Category-Agnostic Pose Estimation

    Authors: Or Hirschorn, Shai Avidan

    Abstract: Traditional 2D pose estimation models are limited by their category-specific design, making them suitable only for predefined object categories. This restriction becomes particularly challenging when dealing with novel objects due to the lack of relevant training data. To address this limitation, category-agnostic pose estimation (CAPE) was introduced. CAPE aims to enable keypoint localization f…

    Submitted 29 November, 2023; originally announced November 2023.

  6. arXiv:2304.10442

    cs.CV

    Securing Neural Networks with Knapsack Optimization

    Authors: Yakir Gorski, Amir Jevnisek, Shai Avidan

    Abstract: MLaaS Service Providers (SPs) holding a Neural Network would like to keep the Neural Network weights secret. On the other hand, users wish to utilize the SPs' Neural Network for inference without revealing their data. Multi-Party Computation (MPC) offers a solution to achieve this. Computations in MPC involve communication, as the parties send data back and forth. Non-linear operations are usually…

    Submitted 29 December, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  7. arXiv:2304.07099

    cs.CV

    Prior based Sampling for Adaptive LiDAR

    Authors: Amit Shomer, Shai Avidan

    Abstract: We propose SampleDepth, a Convolutional Neural Network (CNN), that is suited for an adaptive LiDAR. Typically, LiDAR sampling strategy is pre-defined, constant and independent of the observed scene. Instead of letting a LiDAR sample the scene in this agnostic fashion, SampleDepth determines, adaptively, where it is best to sample the current frame. To do that, SampleDepth uses depth samples from pr…

    Submitted 14 April, 2023; originally announced April 2023.

  8. arXiv:2211.16488

    cs.CV

    Taming Normalizing Flows

    Authors: Shimon Malnick, Shai Avidan, Ohad Fried

    Abstract: We propose an algorithm for taming Normalizing Flow models - changing the probability that the model will produce a specific image or image category. We focus on Normalizing Flows because they can calculate the exact generation probability likelihood for a given image. We demonstrate taming using models that generate human faces, a subdomain with many interesting privacy and bias considerations. O…

    Submitted 3 April, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

  9. arXiv:2211.14020

    cs.CV

    SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow

    Authors: Itai Lang, Dror Aiger, Forrester Cole, Shai Avidan, Michael Rubinstein

    Abstract: Scene flow estimation is a long-standing problem in computer vision, where the goal is to find the 3D motion of a scene from its consecutive observations. Recently, there have been efforts to compute the scene flow from 3D point clouds. A common approach is to train a regression model that consumes source and target point clouds and outputs the per-point translation vector. An alternative is to le…

    Submitted 13 April, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: CVPR 2023. Project page: https://itailang.github.io/SCOOP/

  10. arXiv:2211.13775

    cs.CV

    SAGA: Spectral Adversarial Geometric Attack on 3D Meshes

    Authors: Tomer Stolik, Itai Lang, Shai Avidan

    Abstract: A triangular mesh is one of the most popular 3D data representations. As such, the deployment of deep neural networks for mesh processing is widely spread and is increasingly attracting more attention. However, neural networks are prone to adversarial attacks, where carefully crafted inputs impair the model's functionality. The need to explore these vulnerabilities is a fundamental factor in the f…

    Submitted 25 September, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: Presented at ICCV 2023. Project page: https://stoliktomer.github.io/SAGA/

  11. arXiv:2211.10946

    cs.CV cs.AI eess.IV

    Normalizing Flows for Human Pose Anomaly Detection

    Authors: Or Hirschorn, Shai Avidan

    Abstract: Video anomaly detection is an ill-posed problem because it relies on many parameters such as appearance, pose, camera angle, background, and more. We distill the problem to anomaly detection of human pose, thus decreasing the risk of nuisance parameters such as appearance affecting the result. Focusing on pose alone also has the side benefit of reducing bias against distinct minority groups. Our m…

    Submitted 16 August, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

  12. arXiv:2211.04442

    cs.LG

    Algorithmic Bias in Machine Learning Based Delirium Prediction

    Authors: Sandhya Tripathi, Bradley A Fritz, Michael S Avidan, Yixin Chen, Christopher R King

    Abstract: Although prediction models for delirium, a commonly occurring condition during general hospitalization or post-surgery, have not gained huge popularity, their algorithmic bias evaluation is crucial due to the existing association between social determinants of health and delirium risk. In this context, using MIMIC-III and another academic hospital dataset, we present some initial experimental evid…

    Submitted 26 November, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 14 pages

  13. arXiv:2210.05478

    cs.CV cs.LG

    Aggregating Layers for Deepfake Detection

    Authors: Amir Jevnisek, Shai Avidan

    Abstract: The increasing popularity of facial manipulation (Deepfakes) and synthetic face creation raises the need to develop robust forgery detection solutions. Crucially, most work in this domain assume that the Deepfakes in the test set come from the same Deepfake algorithms that were used for training the network. This is not how things work in practice. Instead, we consider the case where the network i…

    Submitted 11 October, 2022; originally announced October 2022.

  14. arXiv:2207.03536

    cs.DB cs.LG

    Deep Learning to Jointly Schema Match, Impute, and Transform Databases

    Authors: Sandhya Tripathi, Bradley A. Fritz, Mohamed Abdelhack, Michael S. Avidan, Yixin Chen, Christopher R. King

    Abstract: An applied problem facing all areas of data science is harmonizing data sources. Joining data from multiple origins with unmapped and only partially overlapping features is a prerequisite to developing and testing robust, generalizable algorithms, especially in health care. We approach this issue in the common but difficult case of numeric features such as nearly Gaussian and binary features, wher…

    Submitted 22 June, 2022; originally announced July 2022.

  15. arXiv:2204.07719

    cs.CV cs.LG

    Stress-Testing Point Cloud Registration on Automotive LiDAR

    Authors: Amnon Drory, Shai Avidan, Raja Giryes

    Abstract: Rigid Point Cloud Registration (PCR) algorithms aim to estimate the 6-DOF relative motion between two point clouds, which is important in various fields, including autonomous driving. Recent years have seen a significant improvement in global PCR algorithms, i.e. algorithms that can handle a large relative motion. This has been demonstrated in various scenarios, including indoor scenes, but has on…

    Submitted 25 November, 2022; v1 submitted 16 April, 2022; originally announced April 2022.

    Comments: Accepted to the NeurIPS 2022 workshop on Machine Learning for Autonomous Driving. Project Page: https://github.com/AmnonDrory/LidarRegistration

  16. arXiv:2201.10448

    cs.CV

    How Low Can We Go? Pixel Annotation for Semantic Segmentation

    Authors: Daniel Kigli, Ariel Shamir, Shai Avidan

    Abstract: How many labeled pixels are needed to segment an image, without any prior knowledge? We conduct an experiment to answer this question. In our experiment, an Oracle is using Active Learning to train a network from scratch. The Oracle has access to the entire label map of the image, but the goal is to reveal as little pixel labels to the network as possible. We find that, on average, the Oracle ne…

    Submitted 13 February, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Paper and Supplementary

  17. arXiv:2112.04185

    cs.CV cs.LG

    Transformaly -- Two (Feature Spaces) Are Better Than One

    Authors: Matan Jacob Cohen, Shai Avidan

    Abstract: Anomaly detection is a well-established research area that seeks to identify samples outside of a predetermined distribution. An anomaly detection pipeline is comprised of two main stages: (1) feature extraction and (2) normality score assignment. Recent papers used pre-trained networks for feature extraction achieving state-of-the-art results. However, the use of pre-trained networks does not ful…

    Submitted 17 July, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: CVPR Workshop, 2022

  18. arXiv:2111.10759

    cs.CV cs.CR cs.LG

    Adversarial Mask: Real-World Universal Adversarial Attack on Face Recognition Model

    Authors: Alon Zolfi, Shai Avidan, Yuval Elovici, Asaf Shabtai

    Abstract: Deep learning-based facial recognition (FR) models have demonstrated state-of-the-art performance in the past few years, even when wearing protective medical face masks became commonplace during the COVID-19 pandemic. Given the outstanding performance of these models, the machine learning research community has shown increasing interest in challenging their robustness. Initially, researchers prese…

    Submitted 7 September, 2022; v1 submitted 21 November, 2021; originally announced November 2021.

    Comments: 16 pages, 9 figures

  19. DPC: Unsupervised Deep Point Correspondence via Cross and Self Construction

    Authors: Itai Lang, Dvir Ginzburg, Shai Avidan, Dan Raviv

    Abstract: We present a new method for real-time non-rigid dense correspondence between point clouds based on structured shape construction. Our method, termed Deep Point Correspondence (DPC), requires a fraction of the training data compared to previous techniques and presents better generalization capabilities. Until now, two main approaches have been suggested for the dense correspondence problem. The fir…

    Submitted 16 October, 2021; originally announced October 2021.

    Comments: 3DV 2021

  20. arXiv:2110.03016

    cs.CV

    DeepBBS: Deep Best Buddies for Point Cloud Registration

    Authors: Itan Hezroni, Amnon Drory, Raja Giryes, Shai Avidan

    Abstract: Recently, several deep learning approaches have been proposed for point cloud registration. These methods train a network to generate a representation that helps finding matching points in two 3D point clouds. Finding good matches allows them to calculate the transformation between the point clouds accurately. Two challenges of these techniques are dealing with occlusions and generalizing to objec…

    Submitted 16 October, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: Accepted to 3DV 2021

  21. arXiv:2107.09735

    cs.LG cs.CC cs.PF

    kNet: A Deep kNN Network To Handle Label Noise

    Authors: Itzik Mizrahi, Shai Avidan

    Abstract: Deep Neural Networks require large amounts of labeled data for their training. Collecting this data at scale inevitably causes label noise. Hence, the need to develop learning algorithms that are robust to label noise. In recent years, k Nearest Neighbors (kNN) emerged as a viable solution to this problem. Despite its success, kNN is not without its problems. Mainly, it requires a huge memory footpr…

    Submitted 20 July, 2021; originally announced July 2021.

  22. arXiv:2107.08574

    cs.LG cs.AI

    A Modulation Layer to Increase Neural Network Robustness Against Data Quality Issues

    Authors: Mohamed Abdelhack, Jiaming Zhang, Sandhya Tripathi, Bradley A Fritz, Daniel Felsky, Michael S Avidan, Yixin Chen, Christopher R King

    Abstract: Data missingness and quality are common problems in machine learning, especially for high-stakes applications such as healthcare. Developers often train machine learning models on carefully curated datasets using only high quality data; however, this reduces the utility of such models in production environments. We propose a novel neural network modification to mitigate the impacts of low quality…

    Submitted 22 April, 2023; v1 submitted 18 July, 2021; originally announced July 2021.

    Journal ref: Transactions on Machine Learning Research 2023

  23. arXiv:2101.11835

    cs.CV cs.CR

    Reducing ReLU Count for Privacy-Preserving CNN Speedup

    Authors: Inbar Helbitz, Shai Avidan

    Abstract: Privacy-Preserving Machine Learning algorithms must balance classification accuracy with data privacy. This can be done using a combination of cryptographic and machine learning tools such as Convolutional Neural Networks (CNN). CNNs typically consist of two types of operations: a convolutional or linear layer, followed by a non-linear function such as ReLU. Each of these types can be implemented…

    Submitted 28 January, 2021; originally announced January 2021.

  24. arXiv:2012.05657

    cs.CV

    Geometric Adversarial Attacks and Defenses on 3D Point Clouds

    Authors: Itai Lang, Uriel Kotlicki, Shai Avidan

    Abstract: Deep neural networks are prone to adversarial examples that maliciously alter the network's outcome. Due to the increasing popularity of 3D sensors in safety-critical systems and the vast deployment of deep learning models for 3D point sets, there is a growing interest in adversarial attacks and defenses for such models. So far, the research has focused on the semantic level, namely, deep point cl…

    Submitted 16 October, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 3DV 2021

  25. arXiv:2012.03357

    cs.CV

    Rethinking FUN: Frequency-Domain Utilization Networks

    Authors: Kfir Goldberg, Stav Shapiro, Elad Richardson, Shai Avidan

    Abstract: The search for efficient neural network architectures has gained much focus in recent years, where modern architectures focus not only on accuracy but also on inference time and model size. Here, we present FUN, a family of novel Frequency-domain Utilization Networks. These networks utilize the inherent efficiency of the frequency-domain by working directly in that domain, represented with the Dis…

    Submitted 6 December, 2020; originally announced December 2020.

    Comments: 9 pages, 7 figures

  26. arXiv:2011.02036

    cs.LG cs.AI

    (Un)fairness in Post-operative Complication Prediction Models

    Authors: Sandhya Tripathi, Bradley A. Fritz, Mohamed Abdelhack, Michael S. Avidan, Yixin Chen, Christopher R. King

    Abstract: With the current ongoing debate about fairness, explainability and transparency of machine learning models, their application in high-impact clinical decision-making systems must be scrutinized. We consider a real-life example of risk estimation before surgery and investigate the potential for bias or unfairness of a variety of algorithms. Our approach creates transparent documentation of potentia…

    Submitted 3 November, 2020; originally announced November 2020.

  27. arXiv:2010.01912

    cs.CV

    Best Buddies Registration for Point Clouds

    Authors: Amnon Drory, Tal Shomer, Shai Avidan, Raja Giryes

    Abstract: We propose new, and robust, loss functions for the point cloud registration problem. Our loss functions are inspired by the Best Buddies Similarity (BBS) measure that counts the number of mutual nearest neighbors between two point sets. This measure has been shown to be robust to outliers and missing data in the case of template matching for images. We present several algorithms, collectively name…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to ACCV 2020

  28. arXiv:2005.08186

    cs.CV

    Co-occurrence Based Texture Synthesis

    Authors: Anna Darzi, Itai Lang, Ashutosh Taklikar, Hadar Averbuch-Elor, Shai Avidan

    Abstract: As image generation techniques mature, there is a growing interest in explainable representations that are easy to understand and intuitive to manipulate. In this work, we turn to co-occurrence statistics, which have long been used for texture analysis, to learn a controllable texture synthesis model. We propose a fully convolutional generative adversarial network, conditioned locally on co-occurr…

    Submitted 22 July, 2020; v1 submitted 17 May, 2020; originally announced May 2020.

  29. arXiv:2002.01793

    cs.LG cs.CV cs.IR stat.ML

    Proximity Preserving Binary Code using Signed Graph-Cut

    Authors: Inbal Lav, Shai Avidan, Yoram Singer, Yacov Hel-Or

    Abstract: We introduce a binary embedding framework, called Proximity Preserving Code (PPC), which learns similarity and dissimilarity between data points to create a compact and affinity-preserving binary code. This code can be used to apply fast and memory-efficient approximation to nearest-neighbor searches. Our framework is flexible, enabling different proximity definitions between data points. In contr…

    Submitted 5 February, 2020; originally announced February 2020.

    Journal ref: AAAI Conference on Artificial Intelligence, Feb. 2020

  30. arXiv:2001.04753

    cs.CV eess.IV

    Deep Image Compression using Decoder Side Information

    Authors: Sharon Ayzik, Shai Avidan

    Abstract: We present a Deep Image Compression neural network that relies on side information, which is only available to the decoder. We base our algorithm on the assumption that the image available to the encoder and the image available to the decoder are correlated, and we let the network learn these correlations in the training phase. Then, at run time, the encoder side encodes the input image without…

    Submitted 29 July, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

  31. arXiv:1912.11850

    cs.CV

    Graph Embedded Pose Clustering for Anomaly Detection

    Authors: Amir Markovitz, Gilad Sharir, Itamar Friedman, Lihi Zelnik-Manor, Shai Avidan

    Abstract: We propose a new method for anomaly detection of human actions. Our method works directly on human pose graphs that can be computed from an input video sequence. This makes the analysis independent of nuisance parameters such as viewpoint or illumination. We map these graphs to a latent space and cluster them. Each action is then represented by its soft-assignment to each of the clusters. This giv…

    Submitted 10 April, 2020; v1 submitted 26 December, 2019; originally announced December 2019.

    Comments: Code is available at https://github.com/amirmk89/gepc. CVPR 2020

  32. arXiv:1912.03663

    cs.CV

    SampleNet: Differentiable Point Cloud Sampling

    Authors: Itai Lang, Asaf Manor, Shai Avidan

    Abstract: There is a growing number of tasks that work directly on point clouds. As the size of the point cloud grows, so do the computational demands of these tasks. A possible solution is to sample the point cloud first. Classic sampling approaches, such as farthest point sampling (FPS), do not consider the downstream task. A recent work showed that learning a task-specific sampling can improve results si…

    Submitted 4 April, 2020; v1 submitted 8 December, 2019; originally announced December 2019.

    Comments: CVPR 2020

  33. arXiv:1907.12596

    cs.LG stat.ML

    A Factored Generalized Additive Model for Clinical Decision Support in the Operating Room

    Authors: Zhicheng Cui, Bradley A Fritz, Christopher R King, Michael S Avidan, Yixin Chen

    Abstract: Logistic regression (LR) is widely used in clinical prediction because it is simple to deploy and easy to interpret. Nevertheless, being a linear model, LR has limited expressive capability and often has unsatisfactory performance. Generalized additive models (GAMs) extend the linear model with transformations of input features, though feature interaction is not allowed for all GAM variants. In th…

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: Accepted for publication in AMIA 2019 Annual Symposium

  34. arXiv:1905.00401

    cs.CV

    Learn Stereo, Infer Mono: Siamese Networks for Self-Supervised, Monocular, Depth Estimation

    Authors: Matan Goldman, Tal Hassner, Shai Avidan

    Abstract: The field of self-supervised monocular depth estimation has seen huge advancements in recent years. Most methods assume stereo data is available during training but usually under-utilize it and only treat it as a reference signal. We propose a novel self-supervised approach which uses both left and right images equally during training, but can still be used with a single input image at test time,…

    Submitted 1 May, 2019; originally announced May 2019.

  35. arXiv:1812.01659

    cs.CV

    Learning to Sample

    Authors: Oren Dovrat, Itai Lang, Shai Avidan

    Abstract: Processing large point clouds is a challenging task. Therefore, the data is often sampled to a size that can be processed more easily. The question is how to sample the data? A popular sampling technique is Farthest Point Sampling (FPS). However, FPS is agnostic to a downstream application (classification, retrieval, etc.). The underlying assumption seems to be that minimizing the farthest point d…

    Submitted 1 April, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: CVPR 2019

  36. arXiv:1811.11455

    cs.CV

    CrowdCam: Dynamic Region Segmentation

    Authors: Nir Zarrabi, Shai Avidan, Yael Moses

    Abstract: We consider the problem of segmenting dynamic regions in CrowdCam images, where a dynamic region is the projection of a moving 3D object on the image plane. Quite often, these regions are the most interesting parts of an image. CrowdCam images is a set of images of the same dynamic event, captured by a group of non-collaborating users. Almost every event of interest today is captured this way. Thi…

    Submitted 23 June, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

  37. arXiv:1811.01343

    cs.CV

    Underwater Single Image Color Restoration Using Haze-Lines and a New Quantitative Dataset

    Authors: Dana Berman, Deborah Levy, Shai Avidan, Tali Treibitz

    Abstract: Underwater images suffer from color distortion and low contrast, because light is attenuated while it propagates through water. Attenuation under water varies with wavelength, unlike terrestrial images where attenuation is assumed to be spectrally uniform. The attenuation depends both on the water body and the 3D structure of the scene, making color restoration difficult. Unlike existing single…

    Submitted 24 March, 2019; v1 submitted 4 November, 2018; originally announced November 2018.

  38. arXiv:1803.11410

    cs.LG cs.CV cs.NE stat.ML

    The Resistance to Label Noise in K-NN and DNN Depends on its Concentration

    Authors: Amnon Drory, Oria Ratzon, Shai Avidan, Raja Giryes

    Abstract: We investigate the classification performance of K-nearest neighbors (K-NN) and deep neural networks (DNNs) in the presence of label noise. We first show empirically that a DNN's prediction for a given test example depends on the labels of the training examples in its local neighborhood. This motivates us to derive a realizable analytic expression that approximates the multi-class K-NN classificat…

    Submitted 3 December, 2020; v1 submitted 30 March, 2018; originally announced March 2018.

  39. arXiv:1703.04111

    cs.CV

    Co-occurrence Filter

    Authors: Roy J Jevnisek, Shai Avidan

    Abstract: Co-occurrence Filter (CoF) is a boundary preserving filter. It is based on the Bilateral Filter (BF) but instead of using a Gaussian on the range values to preserve edges it relies on a co-occurrence matrix. Pixel values that co-occur frequently in the image (i.e., inside textured regions) will have a high weight in the co-occurrence matrix. This, in turn, means that such pixel pairs will be avera…

    Submitted 24 December, 2017; v1 submitted 12 March, 2017; originally announced March 2017.

    Comments: accepted to CVPR 2017

  40. arXiv:1611.03270

    cs.CV

    Detecting Moving Regions in CrowdCam Images

    Authors: Adi Dafni, Yael Moses, Shai Avidan

    Abstract: We address the novel problem of detecting dynamic regions in CrowdCam images, a set of still images captured by a group of people. These regions capture the most interesting parts of the scene, and detecting them plays an important role in the analysis of visual data. Our method is based on the observation that matching static points must satisfy the epipolar geometry constraints, but computing ex…

    Submitted 10 November, 2016; originally announced November 2016.

  41. arXiv:1611.00148

    cs.CV

    Best-Buddies Tracking

    Authors: Shaul Oron, Denis Suhanov, Shai Avidan

    Abstract: Best-Buddies Tracking (BBT) applies the Best-Buddies Similarity measure (BBS) to the problem of model-free online tracking. BBS was introduced as a similarity measure between two point sets and was shown to be very effective for template matching. Originally, BBS was designed to work with point sets of equal size, and we propose a modification that lets it handle point sets of different size. The…

    Submitted 1 November, 2016; originally announced November 2016.

  42. arXiv:1609.01571

    cs.CV

    Best-Buddies Similarity - Robust Template Matching using Mutual Nearest Neighbors

    Authors: Shaul Oron, Tali Dekel, Tianfan Xue, William T. Freeman, Shai Avidan

    Abstract: We propose a novel method for template matching in unconstrained environments. Its essence is the Best-Buddies Similarity (BBS), a useful, robust, and parameter-free similarity measure between two sets of points. BBS is based on counting the number of Best-Buddies Pairs (BBPs)--pairs of points in source and target sets, where each point is the nearest neighbor of the other. BBS has several key fea…

    Submitted 6 September, 2016; originally announced September 2016.

  43. arXiv:1511.02911

    cs.CV

    Spatially Coherent Random Forests

    Authors: Tal Remez, Shai Avidan

    Abstract: Spatially Coherent Random Forest (SCRF) extends Random Forest to create spatially coherent labeling. Each split function in SCRF is evaluated based on a traditional information gain measure that is regularized by a spatial coherency term. This way, SCRF is encouraged to choose split functions that cluster pixels both in appearance space and in image space. In particular, we use SCRF to detect cont…

    Submitted 5 December, 2015; v1 submitted 9 November, 2015; originally announced November 2015.

  44. arXiv:1404.7059

    cs.CV

    Stereo on a budget

    Authors: Dana Menaker, Shai Avidan

    Abstract: We propose an algorithm for recovering depth using less than two images. Instead of having both cameras send their entire image to the host computer, the left camera sends its image to the host while the right camera sends only a fraction $ε$ of its image. The key aspect is that the cameras send the information without communicating at all. Hence, the required communication bandwidth is significan…

    Submitted 29 April, 2014; v1 submitted 28 April, 2014; originally announced April 2014.

    Comments: update flowchart in Fig. 2

  45. Probably Approximately Symmetric: Fast rigid Symmetry Detection with Global Guarantees

    Authors: Simon Korman, Roee Litman, Shai Avidan, Alex Bronstein

    Abstract: We present a fast algorithm for global rigid symmetry detection with approximation guarantees. The algorithm is guaranteed to find the best approximate symmetry of a given shape, to within a user-specified threshold, with very high probability. Our method uses a carefully designed sampling of the transformation space, where each transformation is efficiently evaluated using a sub-linear algorithm.…

    Submitted 6 October, 2014; v1 submitted 26 March, 2014; originally announced March 2014.

    MSC Class: 65D18; 68U05

    ACM Class: I.3.5

    Journal ref: Comput. Graph. Forum 34, 1 (February 2015), 2-13