Showing 1–50 of 134 results for author: Giryes, R

Searching in archive cs.
  1. arXiv:2406.16109

    eess.IV cs.CV

    X-ray2CTPA: Generating 3D CTPA scans from 2D X-ray conditioning

    Authors: Noa Cahan, Eyal Klang, Galit Aviram, Yiftach Barash, Eli Konen, Raja Giryes, Hayit Greenspan

    Abstract: Chest X-rays or chest radiography (CXR), commonly used for medical diagnostics, typically enables limited imaging compared to computed tomography (CT) scans, which offer more detailed and accurate three-dimensional data, particularly contrast-enhanced scans like CT Pulmonary Angiography (CTPA). However, CT scans entail higher costs, greater radiation exposure, and are less accessible than CXRs. In…

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: preprint, project code: https://github.com/NoaCahan/X-ray2CTPA

  2. arXiv:2406.14528

    cs.LG cs.AI

    DeciMamba: Exploring the Length Extrapolation Potential of Mamba

    Authors: Assaf Ben-Kish, Itamar Zimerman, Shady Abu-Hussein, Nadav Cohen, Amir Globerson, Lior Wolf, Raja Giryes

    Abstract: Long-range sequence processing poses a significant challenge for Transformers due to their quadratic complexity in input length. A promising alternative is Mamba, which demonstrates high performance and achieves Transformer-level capabilities while requiring substantially fewer computational resources. In this paper we explore the length-generalization capabilities of Mamba, which we find to be re…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Link To Official Implementation: https://github.com/assafbk/DeciMamba

  3. arXiv:2406.09240

    cs.CV

    Comparison Visual Instruction Tuning

    Authors: Wei Lin, Muhammad Jehanzeb Mirza, Sivan Doveh, Rogerio Feris, Raja Giryes, Sepp Hochreiter, Leonid Karlinsky

    Abstract: Comparing two images in terms of Commonalities and Differences (CaD) is a fundamental human capability that forms the basis of advanced visual reasoning and interpretation. It is essential for the generation of detailed and contextually relevant descriptions, performing comparative analysis, novelty detection, and making informed decisions based on visual data. However, surprisingly, little attent…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Project page: https://wlin-at.github.io/cad_vi ; Huggingface dataset repo: https://huggingface.co/datasets/wlin21at/CaD-Inst

  4. arXiv:2406.01086

    cs.LG cs.AI cs.CV

    Effective Subset Selection Through The Lens of Neural Network Pruning

    Authors: Noga Bar, Raja Giryes

    Abstract: Having large amounts of annotated data significantly impacts the effectiveness of deep neural networks. However, the annotation task can be very expensive in some domains, such as medical data. Thus, it is important to select the data to be annotated wisely, which is known as the subset selection problem. We investigate the relationship between subset selection and neural network pruning, which is…

    Submitted 3 June, 2024; originally announced June 2024.

  5. arXiv:2405.07232

    cs.CR

    A Flow is a Stream of Packets: A Stream-Structured Data Approach for DDoS Detection

    Authors: Raja Giryes, Lior Shafir, Avishai Wool

    Abstract: Distributed Denial of Service (DDoS) attacks are getting increasingly harmful to the Internet, showing no signs of slowing down. Developing an accurate detection mechanism to thwart DDoS attacks is still a big challenge due to the rich variety of these attacks and the emergence of new attack vectors. In this paper, we propose a new tree-based DDoS detection approach that operates on a flow as a st…

    Submitted 12 May, 2024; originally announced May 2024.

  6. arXiv:2404.03906

    eess.IV cs.CV

    Deep Phase Coded Image Prior

    Authors: Nimrod Shabtay, Eli Schwartz, Raja Giryes

    Abstract: Phase-coded imaging is a computational imaging method designed to tackle tasks such as passive depth estimation and extended depth of field (EDOF) using depth cues inserted during image capture. Most of the current deep learning-based methods for depth estimation or all-in-focus imaging require a training dataset with high-quality depth maps and an optimal focus point at infinity for all-in-focus…

    Submitted 5 April, 2024; originally announced April 2024.

  7. arXiv:2403.03273

    cs.CV cs.LG

    DINOv2 based Self Supervised Learning For Few Shot Medical Image Segmentation

    Authors: Lev Ayzenberg, Raja Giryes, Hayit Greenspan

    Abstract: Deep learning models have emerged as the cornerstone of medical image segmentation, but their efficacy hinges on the availability of extensive manually labeled datasets and their adaptability to unforeseen categories remains a challenge. Few-shot segmentation (FSS) offers a promising solution by endowing models with the capacity to learn novel classes from limited labeled examples. A leading metho…

    Submitted 5 March, 2024; originally announced March 2024.

  8. arXiv:2403.01306

    cs.LG cs.CV

    ICC: Quantifying Image Caption Concreteness for Multimodal Dataset Curation

    Authors: Moran Yanuka, Morris Alper, Hadar Averbuch-Elor, Raja Giryes

    Abstract: Web-scale training on paired text-image data is becoming increasingly central to multimodal learning, but is challenged by the highly noisy nature of datasets in the wild. Standard data filtering approaches succeed in removing mismatched text-image pairs, but permit semantically related but highly abstract or subjective text. These approaches lack the fine-grained ability to isolate the most concr…

    Submitted 11 June, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: Accepted to ACL 2024 (Finding). For Project webpage, see https://moranyanuka.github.io/icc/

  9. arXiv:2402.07875

    cs.LG cs.AI eess.SY stat.ML

    Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States

    Authors: Noam Razin, Yotam Alexander, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen

    Abstract: In modern machine learning, models can often fit training data in numerous ways, some of which perform well on unseen (test) data, while others do not. Remarkably, in such cases gradient descent frequently exhibits an implicit bias that leads to excellent performance on unseen data. This implicit bias was extensively studied in supervised learning, but is far less understood in optimal control (re…

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

  10. arXiv:2402.05787

    stat.ML cs.LG

    How do Transformers perform In-Context Autoregressive Learning?

    Authors: Michael E. Sander, Raja Giryes, Taiji Suzuki, Mathieu Blondel, Gabriel Peyré

    Abstract: Transformers have achieved state-of-the-art performance in language modeling tasks. However, the reasons behind their tremendous success are still unclear. In this paper, towards a better understanding, we train a Transformer model on a simple next token prediction task, where sequences are generated as a first-order autoregressive process $s_{t+1} = W s_t$. We show how a trained Transformer predi…

    Submitted 5 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 20 pages ICML 2024

  11. arXiv:2401.06191

    cs.CV

    TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation

    Authors: Rajaei Khatib, Raja Giryes

    Abstract: In recent years, the neural radiance field (NeRF) model has gained popularity due to its ability to recover complex 3D scenes. Following its success, many approaches proposed different NeRF representations in order to further improve both runtime and performance. One such example is Triplane, in which NeRF is represented using three 2D feature planes. This enables easily using existing 2D neural n…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: webpage link: https://rajaeekh.github.io/trinerflet-web

  12. arXiv:2312.17345

    cs.CV

    3VL: using Trees to teach Vision & Language models compositional concepts

    Authors: Nir Yellinek, Leonid Karlinsky, Raja Giryes

    Abstract: Vision-Language models (VLMs) have proved effective at aligning image and text representations, producing superior zero-shot results when transferred to many downstream tasks. However, these representations suffer some key shortcomings in Compositional Language Concepts (CLC) understanding such as recognizing objects' attributes, states, and relations between different objects. Moreover, VLMs typi…

    Submitted 28 December, 2023; originally announced December 2023.

  13. arXiv:2312.15972

    eess.IV cs.CV cs.LG

    A Self Supervised StyleGAN for Image Annotation and Classification with Extremely Limited Labels

    Authors: Dana Cohen Hochberg, Hayit Greenspan, Raja Giryes

    Abstract: The recent success of learning-based algorithms can be greatly attributed to the immense amount of annotated data used for training. Yet, many datasets lack annotations due to the high costs associated with labeling, resulting in degraded performances of deep learning methods. Self-supervised learning is frequently adopted to mitigate the reliance on massive labeled datasets since it exploits unla…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted to IEEE Transactions on Medical Imaging

    MSC Class: 92C55 ACM Class: J.3; I.5.3

    Journal ref: IEEE Transactions on Medical Imaging, 41(12), Dec. 2022

  14. arXiv:2312.10191

    cs.CV eess.IV

    Tell Me What You See: Text-Guided Real-World Image Denoising

    Authors: Erez Yosef, Raja Giryes

    Abstract: Image reconstruction from noisy sensor measurements is a challenging problem. Many solutions have been proposed for it, where the main approach is learning good natural images prior along with modeling the true statistics of the noise in the scene. In the presence of very low lighting conditions, such approaches are usually not enough, and additional information is required, e.g., in the form of u…

    Submitted 29 May, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  15. arXiv:2312.07425

    cs.LG cs.CV eess.IV eess.SP

    Deep Internal Learning: Deep Learning from a Single Input

    Authors: Tom Tirer, Raja Giryes, Se Young Chun, Yonina C. Eldar

    Abstract: Deep learning, in general, focuses on training a neural network from large labeled datasets. Yet, in many cases there is value in training a network just from the input at hand. This is particularly relevant in many signal and image processing problems where training data is scarce and diversity is large on the one hand, and on the other, there is a lot of structure in the data that can be exploit…

    Submitted 8 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted to IEEE Signal Processing Magazine

  16. arXiv:2312.03631

    cs.CV cs.AI

    Mitigating Open-Vocabulary Caption Hallucinations

    Authors: Assaf Ben-Kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-Elor

    Abstract: While recent years have seen rapid progress in image-conditioned text generation, image captioning still suffers from the fundamental issue of hallucinations, namely, the generation of spurious details that cannot be inferred from the given image. Existing methods largely use closed-vocabulary object lists to mitigate or evaluate hallucinations in image captioning, ignoring the long-tailed nature…

    Submitted 19 April, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Website Link: https://assafbk.github.io/mocha/

  17. On The Relationship Between Universal Adversarial Attacks And Sparse Representations

    Authors: Dana Weitzner, Raja Giryes

    Abstract: The prominent success of neural networks, mainly in computer vision tasks, is increasingly shadowed by their sensitivity to small, barely perceivable adversarial perturbations in image input. In this work, we aim at explaining this vulnerability through the framework of sparsity. We show the connection between adversarial attacks and sparse representations, with a focus on explaining the unive…

    Submitted 14 November, 2023; originally announced November 2023.

  18. arXiv:2306.10001

    cs.CV cs.AI

    Group Orthogonalization Regularization For Vision Models Adaptation and Robustness

    Authors: Yoav Kurtz, Noga Bar, Raja Giryes

    Abstract: As neural networks become deeper, the redundancy within their parameters increases. This phenomenon has led to several methods that attempt to reduce the correlation between convolutional filters. We propose a computationally efficient regularization technique that encourages orthonormality between groups of filters within the same layer. Our experiments show that when incorporated into recent ada…

    Submitted 18 February, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: BMVC 2023

  19. arXiv:2306.06743

    hep-ex cs.LG physics.ins-det

    Trees versus Neural Networks for enhancing tau lepton real-time selection in proton-proton collisions

    Authors: Maayan Yaary, Uriel Barron, Luis Pascual Domínguez, Boping Chen, Liron Barak, Erez Etzion, Raja Giryes

    Abstract: This paper introduces supervised learning techniques for real-time selection (triggering) of hadronically decaying tau leptons in proton-proton colliders. By implementing classic machine learning decision trees and advanced deep learning models, such as Multi-Layer Perceptron or residual neural networks, visible improvements in performance compared to standard threshold tau triggers are observed.…

    Submitted 22 April, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

  20. arXiv:2306.06731

    cs.LG

    An information-Theoretic Approach to Semi-supervised Transfer Learning

    Authors: Daniel Jakubovitz, David Uliel, Miguel Rodrigues, Raja Giryes

    Abstract: Transfer learning is a valuable tool in deep learning as it allows propagating information from one "source dataset" to another "target dataset", especially in the case of a small number of training examples in the latter. Yet, discrepancies between the underlying distributions of the source and target data are commonplace and are known to have a substantial impact on algorithm performance. In thi…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:1904.01670

  21. arXiv:2306.06370

    cs.CV

    AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder

    Authors: Tal Shaharabany, Aviad Dahan, Raja Giryes, Lior Wolf

    Abstract: The recently introduced Segment Anything Model (SAM) combines a clever architecture and large quantities of training data to obtain remarkable image segmentation capabilities. However, it fails to reproduce such results for Out-Of-Distribution (OOD) domains such as medical images. Moreover, while SAM is conditioned on either a mask or a set of points, it may be desirable to have a fully automatic…

    Submitted 10 June, 2023; originally announced June 2023.

  22. arXiv:2306.06088

    cs.GR cs.CV cs.LG

    SENS: Part-Aware Sketch-based Implicit Neural Shape Modeling

    Authors: Alexandre Binninger, Amir Hertz, Olga Sorkine-Hornung, Daniel Cohen-Or, Raja Giryes

    Abstract: We present SENS, a novel method for generating and editing 3D models from hand-drawn sketches, including those of abstract nature. Our method allows users to quickly and easily sketch a shape, and then maps the sketch into the latent space of a part-aware neural implicit shape architecture. SENS analyzes the sketch and encodes its parts into ViT patch encoding, subsequently feeding them into a tra…

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 25 pages, 24 figures

  23. arXiv:2305.19595

    cs.CV

    Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models

    Authors: Sivan Doveh, Assaf Arbelle, Sivan Harary, Roei Herzig, Donghyun Kim, Paola Cascante-bonilla, Amit Alfassy, Rameswar Panda, Raja Giryes, Rogerio Feris, Shimon Ullman, Leonid Karlinsky

    Abstract: Vision and Language (VL) models offer an effective method for aligning representation spaces of images and text, leading to numerous applications such as cross-modal retrieval, visual question answering, captioning, and more. However, the aligned image-text spaces learned by all the popular VL models are still suffering from the so-called `object bias' - their representations behave as `bags of no…

    Submitted 1 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  24. arXiv:2305.17559

    cs.LG cs.CV

    Pruning at Initialization -- A Sketching Perspective

    Authors: Noga Bar, Raja Giryes

    Abstract: The lottery ticket hypothesis (LTH) has increased attention to pruning neural networks at initialization. We study this problem in the linear setting. We show that finding a sparse mask at initialization is equivalent to the sketching problem introduced for efficient matrix multiplication. This gives us tools to analyze the LTH problem and gain insights into it. Specifically, using the mask found…

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: 20 pages

  25. arXiv:2305.16427

    cs.LG cs.AI

    Neural (Tangent Kernel) Collapse

    Authors: Mariia Seleznova, Dana Weitzner, Raja Giryes, Gitta Kutyniok, Hung-Hsu Chou

    Abstract: This work bridges two important concepts: the Neural Tangent Kernel (NTK), which captures the evolution of deep neural networks (DNNs) during training, and the Neural Collapse (NC) phenomenon, which refers to the emergence of symmetry and structure in the last-layer features of well-trained classification DNNs. We adopt the natural assumption that the empirical NTK develops a block structure align…

    Submitted 26 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Journal ref: Proceedings of the 37th Conference on Neural Information Processing Systems, 2023

  26. arXiv:2305.16269

    cs.CV cs.LG eess.IV

    UDPM: Upsampling Diffusion Probabilistic Models

    Authors: Shady Abu-Hussein, Raja Giryes

    Abstract: Denoising Diffusion Probabilistic Models (DDPM) have recently gained significant attention. DDPMs compose a Markovian process that begins in the data domain and gradually adds noise until reaching pure white noise. DDPMs generate high-quality samples from complex data distributions by defining an inverse process and training a deep neural network to learn this mapping. However, these models are in…

    Submitted 27 May, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  27. arXiv:2303.14828

    cs.CV

    VisDA 2022 Challenge: Domain Adaptation for Industrial Waste Sorting

    Authors: Dina Bashkirova, Samarth Mishra, Diala Lteif, Piotr Teterwak, Donghyun Kim, Fadi Alladkani, James Akl, Berk Calli, Sarah Adel Bargal, Kate Saenko, Daehan Kim, Minseok Seo, Young** Jeon, Dong-Geol Choi, Shahaf Ettedgui, Raja Giryes, Shady Abu-Hussein, Binhui Xie, Shuang Li

    Abstract: Label-efficient and reliable semantic segmentation is essential for many real-life applications, especially for industrial settings with high visual diversity, such as waste sorting. In industrial waste sorting, one of the biggest challenges is the extreme diversity of the input stream depending on factors like the location of the sorting facility, the equipment available in the facility, and the…

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Proceedings of Machine Learning Research

  28. arXiv:2303.13450

    cs.CV cs.GR cs.LG

    Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes

    Authors: Dana Cohen-Bar, Elad Richardson, Gal Metzer, Raja Giryes, Daniel Cohen-Or

    Abstract: Recent breakthroughs in text-guided image generation have led to remarkable progress in the field of 3D synthesis from text. By optimizing neural radiance fields (NeRF) directly from text, recent methods are able to produce remarkable results. Yet, these methods are limited in their control of each object's placement or appearance, as they represent the scene as a whole. This can be a major issue…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: project page at https://danacohen95.github.io/Set-the-Scene/

  29. arXiv:2302.01721

    cs.CV cs.GR

    TEXTure: Text-Guided Texturing of 3D Shapes

    Authors: Elad Richardson, Gal Metzer, Yuval Alaluf, Raja Giryes, Daniel Cohen-Or

    Abstract: In this paper, we present TEXTure, a novel method for text-guided generation, editing, and transfer of textures for 3D shapes. Leveraging a pretrained depth-to-image diffusion model, TEXTure applies an iterative scheme that paints a 3D model from different viewpoints. Yet, while depth-to-image models can create plausible textures from a single viewpoint, the stochastic nature of the generation pro…

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: Project page available at https://texturepaper.github.io/TEXTurePaper/

  30. arXiv:2212.03221

    cs.CV eess.IV

    ADIR: Adaptive Diffusion for Image Reconstruction

    Authors: Shady Abu-Hussein, Tom Tirer, Raja Giryes

    Abstract: In recent years, denoising diffusion models have demonstrated outstanding image generation performance. The information on natural images captured by these models is useful for many image reconstruction applications, where the task is to restore a clean image from its degraded observations. In this work, we propose a conditional sampling scheme that exploits the prior learned by diffusion models w…

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Our code and additional results are available online in the project page https://shadyabh.github.io/ADIR/

  31. arXiv:2211.14307

    cs.CV

    MAEDAY: MAE for few and zero shot AnomalY-Detection

    Authors: Eli Schwartz, Assaf Arbelle, Leonid Karlinsky, Sivan Harary, Florian Scheidegger, Sivan Doveh, Raja Giryes

    Abstract: We propose using Masked Auto-Encoder (MAE), a transformer model self-supervisedly trained on image inpainting, for anomaly detection (AD). Assuming anomalous regions are harder to reconstruct compared with normal regions. MAEDAY is the first image-reconstruction-based anomaly detection method that utilizes a pre-trained model, enabling its use for Few-Shot Anomaly Detection (FSAD). We also show th…

    Submitted 15 February, 2024; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: Computer Vision and Image Understanding, 2024

  32. arXiv:2211.14298

    cs.CV cs.AI

    PIP: Positional-encoding Image Prior

    Authors: Nimrod Shabtay, Eli Schwartz, Raja Giryes

    Abstract: In Deep Image Prior (DIP), a Convolutional Neural Network (CNN) is fitted to map a latent space to a degraded (e.g. noisy) image but in the process learns to reconstruct the clean image. This phenomenon is attributed to CNN's internal image-prior. We revisit the DIP framework, examining it from the perspective of a neural implicit representation. Motivated by this perspective, we replace the rando…

    Submitted 3 March, 2024; v1 submitted 25 November, 2022; originally announced November 2022.

  33. arXiv:2211.11733

    cs.CV

    Teaching Structured Vision&Language Concepts to Vision&Language Models

    Authors: Sivan Doveh, Assaf Arbelle, Sivan Harary, Rameswar Panda, Roei Herzig, Eli Schwartz, Donghyun Kim, Raja Giryes, Rogerio Feris, Shimon Ullman, Leonid Karlinsky

    Abstract: Vision and Language (VL) models have demonstrated remarkable zero-shot performance in a variety of tasks. However, some aspects of complex language understanding still remain a challenge. We introduce the collective notion of Structured Vision&Language Concepts (SVLC) which includes object attributes, relations, and states which are present in the text and visible in the image. Recent studies have…

    Submitted 30 May, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Journal ref: CVPR 2023

  34. arXiv:2211.07600

    cs.CV cs.GR

    Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures

    Authors: Gal Metzer, Elad Richardson, Or Patashnik, Raja Giryes, Daniel Cohen-Or

    Abstract: Text-guided image generation has progressed rapidly in recent years, inspiring major breakthroughs in text-guided shape generation. Recently, it has been shown that using score distillation, one can successfully text-guide a NeRF model to generate a 3D object. We adapt the score distillation to the publicly available, and computationally efficient, Latent Diffusion Models, which apply the entire d…

    Submitted 14 November, 2022; originally announced November 2022.

  35. arXiv:2210.14064

    cs.LG

    Learning Low Dimensional State Spaces with Overparameterized Recurrent Neural Nets

    Authors: Edo Cohen-Karlik, Itamar Menuhin-Gruman, Raja Giryes, Nadav Cohen, Amir Globerson

    Abstract: Overparameterization in deep learning typically refers to settings where a trained neural network (NN) has representational capacity to fit the training data in many ways, some of which generalize well, while others do not. In the case of Recurrent Neural Networks (RNNs), there exists an additional layer of overparameterization, in the sense that a model may exhibit many solutions that generalize…

    Submitted 23 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023, 9 pages, 2 figures plus supplementary

  36. arXiv:2208.14125

    cs.CV cs.AI cs.LG stat.ML

    A Diffusion Model Predicts 3D Shapes from 2D Microscopy Images

    Authors: Dominik J. E. Waibel, Ernst Röell, Bastian Rieck, Raja Giryes, Carsten Marr

    Abstract: Diffusion models are a special type of generative model, capable of synthesising new data from a learnt distribution. We introduce DISPR, a diffusion-based model for solving the inverse problem of three-dimensional (3D) cell shape prediction from two-dimensional (2D) single cell microscopy images. Using the 2D microscopy image as a prior, DISPR is conditioned to predict realistic 3D shape reconstr…

    Submitted 14 March, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    MSC Class: 68-06

  37. arXiv:2207.05532

    cs.LG cs.CV

    Utilizing Excess Resources in Training Neural Networks

    Authors: Amit Henig, Raja Giryes

    Abstract: In this work, we suggest Kernel Filtering Linear Overparameterization (KFLO), where a linear cascade of filtering layers is used during training to improve network performance in test time. We implement this cascade in a kernel filtering fashion, which prevents the trained architecture from becoming unnecessarily deeper. This also allows using our approach with almost any network architecture and…

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted to ICIP 2022. Code available at https://github.com/AmitHenig/KFLO

  38. arXiv:2205.13680

    cs.LG cs.CV

    Membership Inference Attack Using Self Influence Functions

    Authors: Gilad Cohen, Raja Giryes

    Abstract: Member inference (MI) attacks aim to determine if a specific data sample was used to train a machine learning model. Thus, MI is a major privacy threat to models trained on private sensitive data, such as medical records. In MI attacks one may consider the black-box settings, where the model's parameters and activations are hidden from the adversary, or the white-box case where they are available…

    Submitted 26 May, 2022; originally announced May 2022.

  39. arXiv:2204.11891

    cs.CV cs.AI cs.LG

    ProCST: Boosting Semantic Segmentation Using Progressive Cyclic Style-Transfer

    Authors: Shahaf Ettedgui, Shady Abu-Hussein, Raja Giryes

    Abstract: Using synthetic data for training neural networks that achieve good performance on real-world data is an important task as it can reduce the need for costly data annotation. Yet, synthetic and real world data have a domain gap. Reducing this gap, also known as domain adaptation, has been widely studied in recent years. Closing the domain gap between the source (synthetic) and target (real) data by…

    Submitted 11 August, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Code available at https://github.com/shahaf1313/ProCST

  40. arXiv:2204.07719

    cs.CV cs.LG

    Stress-Testing Point Cloud Registration on Automotive LiDAR

    Authors: Amnon Drory, Shai Avidan, Raja Giryes

    Abstract: Rigid Point Cloud Registration (PCR) algorithms aim to estimate the 6-DOF relative motion between two point clouds, which is important in various fields, including autonomous driving. Recent years have seen a significant improvement in global PCR algorithms, i.e. algorithms that can handle a large relative motion. This has been demonstrated in various scenarios, including indoor scenes, but has on…

    Submitted 25 November, 2022; v1 submitted 16 April, 2022; originally announced April 2022.

    Comments: Accepted to the NeurIPS 2022 workshop on Machine Learning for Autonomous Driving. Project Page: https://github.com/AmnonDrory/LidarRegistration

  41. arXiv:2203.00667

    cs.CV cs.LG

    Generative Adversarial Networks

    Authors: Gilad Cohen, Raja Giryes

    Abstract: Generative Adversarial Networks (GANs) are very popular frameworks for generating high-quality data, and are immensely used in both the academia and industry in many domains. Arguably, their most substantial impact has been in the area of computer vision, where they achieve state-of-the-art image generation. This chapter gives an introduction to GANs, by discussing their principle mechanism and pr…

    Submitted 1 March, 2022; originally announced March 2022.

  42. arXiv:2201.13168

    cs.GR cs.CV cs.LG

    SPAGHETTI: Editing Implicit Shapes Through Part Aware Generation

    Authors: Amir Hertz, Or Perel, Raja Giryes, Olga Sorkine-Hornung, Daniel Cohen-Or

    Abstract: Neural implicit fields are quickly emerging as an attractive representation for learning based techniques. However, adopting them for 3D shape modeling and editing is challenging. We introduce a method for $\mathbf{E}$diting $\mathbf{I}$mplicit $\mathbf{S}$hapes $\mathbf{T}$hrough $\mathbf{P}$art $\mathbf{A}$ware $\mathbf{G}$enera$\mathbf{T}$ion, permuted in short as SPAGHETTI. Our architecture al…

    Submitted 31 January, 2022; originally announced January 2022.

  43. arXiv:2201.07288

    cs.CL

    Extending the Vocabulary of Fictional Languages using Neural Networks

    Authors: Thomas Zacharias, Ashutosh Taklikar, Raja Giryes

    Abstract: Fictional languages have become increasingly popular over the recent years appearing in novels, movies, TV shows, comics, and video games. While some of these fictional languages have a complete vocabulary, most do not. We propose a deep learning solution to the problem. Using style transfer and machine translation tools, we generate new words for a given target fictional language, while maintaini…

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: 10 pages, 1 figure, NeurIPS Workshop on Machine Learning for Creativity and Design 2021

  44. arXiv:2201.01873  [pdf, other]

    cs.GR cs.CV cs.LG

    NeuralMLS: Geometry-Aware Control Point Deformation

    Authors: Meitar Shechter, Rana Hanocka, Gal Metzer, Raja Giryes, Daniel Cohen-Or

    Abstract: We introduce NeuralMLS, a space-based deformation technique, guided by a set of displaced control points. We leverage the power of neural networks to inject the underlying shape geometry into the deformation parameters. The goal of our technique is to enable a realistic and intuitive shape deformation. Our method is built upon moving least-squares (MLS), since it minimizes a weighted sum of the gi…

    Submitted 11 June, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

    Comments: Eurographics 2022 Short Papers

  45. arXiv:2112.14768  [pdf, other]

    eess.IV cs.CV

    Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

    Authors: Erez Yosef, Shay Elmalem, Raja Giryes

    Abstract: Video reconstruction from a single motion-blurred image is a challenging problem, which can enhance the capabilities of existing cameras. Recently, several works addressed this task using conventional imaging and deep learning. Yet, such purely-digital methods are inherently limited, due to direction ambiguity and noise sensitivity. Some works proposed to address these limitations using non-conven…

    Submitted 18 December, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

  46. arXiv:2112.02300  [pdf, other]

    cs.CV

    Unsupervised Domain Generalization by Learning a Bridge Across Domains

    Authors: Sivan Harary, Eli Schwartz, Assaf Arbelle, Peter Staar, Shady Abu-Hussein, Elad Amrani, Roei Herzig, Amit Alfassy, Raja Giryes, Hilde Kuehne, Dina Katabi, Kate Saenko, Rogerio Feris, Leonid Karlinsky

    Abstract: The ability to generalize learned representations across significantly different visual domains, such as between real photos, clipart, paintings, and sketches, is a fundamental capacity of the human visual system. In this paper, different from most cross-domain works that utilize some (or full) source domain supervision, we approach a relatively new and very practical Unsupervised Domain Generaliz…

    Submitted 17 May, 2022; v1 submitted 4 December, 2021; originally announced December 2021.

  47. arXiv:2110.05433  [pdf, other]

    cs.GR cs.CV cs.LG

    Mesh Draping: Parametrization-Free Neural Mesh Transfer

    Authors: Amir Hertz, Or Perel, Raja Giryes, Olga Sorkine-Hornung, Daniel Cohen-Or

    Abstract: Despite recent advances in geometric modeling, 3D mesh modeling still involves a considerable amount of manual labor by experts. In this paper, we introduce Mesh Draping: a neural method for transferring existing mesh structure from one shape to another. The method drapes the source mesh over the target geometry and at the same time seeks to preserve the carefully designed characteristics of the s…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: 12 pages. Portions of this work previously appeared as arXiv:2104.09125v1 which has been split into two works: arXiv:2104.09125v2+ and this work

  48. arXiv:2110.03016  [pdf, other]

    cs.CV

    DeepBBS: Deep Best Buddies for Point Cloud Registration

    Authors: Itan Hezroni, Amnon Drory, Raja Giryes, Shai Avidan

    Abstract: Recently, several deep learning approaches have been proposed for point cloud registration. These methods train a network to generate a representation that helps finding matching points in two 3D point clouds. Finding good matches allows them to calculate the transformation between the point clouds accurately. Two challenges of these techniques are dealing with occlusions and generalizing to objec…

    Submitted 16 October, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: Accepted to 3DV 2021

  49. arXiv:2109.08191  [pdf, other]

    cs.CV cs.LG

    Simple Post-Training Robustness Using Test Time Augmentations and Random Forest

    Authors: Gilad Cohen, Raja Giryes

    Abstract: Although Deep Neural Networks (DNNs) achieve excellent performance on many real-world tasks, they are highly vulnerable to adversarial attacks. A leading defense against such attacks is adversarial training, a technique in which a DNN is trained to be robust to adversarial attacks by introducing adversarial noise to its input. This procedure is effective but must be done during the training phase.…

    Submitted 25 November, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

  50. arXiv:2108.05693  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    MISS GAN: A Multi-IlluStrator Style Generative Adversarial Network for image to illustration translation

    Authors: Noa Barzilay, Tal Berkovitz Shalev, Raja Giryes

    Abstract: Unsupervised style transfer that supports diverse input styles using only one trained generator is a challenging and interesting task in computer vision. This paper proposes a Multi-IlluStrator Style Generative Adversarial Network (MISS GAN) that is a multi-style framework for unsupervised image-to-illustration translation, which can generate styled yet content preserving images. The illustrations…

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: Accepted to Pattern Recognition Letters