Skip to main content

Showing 1–50 of 96 results for author: Cohen, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16048  [pdf, other

    cs.IR

    Evaluating D-MERIT of Partial-annotation on Information Retrieval

    Authors: Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg

    Abstract: Retrieval models are often evaluated on partially-annotated datasets. Each query is mapped to a few relevant texts and the remaining corpus is assumed to be irrelevant. As a result, models that successfully retrieve false negatives are punished in evaluation. Unfortunately, completely annotating all texts for every query is not resource efficient. In this work, we show that using partially-annotat… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Our dataset can be downloaded from https://D-MERIT.github.io

  2. arXiv:2406.14528  [pdf, other

    cs.LG cs.AI

    DeciMamba: Exploring the Length Extrapolation Potential of Mamba

    Authors: Assaf Ben-Kish, Itamar Zimerman, Shady Abu-Hussein, Nadav Cohen, Amir Globerson, Lior Wolf, Raja Giryes

    Abstract: Long-range sequence processing poses a significant challenge for Transformers due to their quadratic complexity in input length. A promising alternative is Mamba, which demonstrates high performance and achieves Transformer-level capabilities while requiring substantially fewer computational resources. In this paper we explore the length-generalization capabilities of Mamba, which we find to be re… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Link To Official Implementation: https://github.com/assafbk/DeciMamba

  3. arXiv:2406.14027  [pdf, other

    cs.AI

    How to design a dataset compliant with an ML-based system ODD?

    Authors: Cyril Cappi, Noémie Cohen, Mélanie Ducoffe, Christophe Gabreau, Laurent Gardes, Adrien Gauffriau, Jean-Brice Ginestet, Franck Mamalet, Vincent Mussot, Claire Pagetti, David Vigouroux

    Abstract: This paper focuses on a Vision-based Landing task and presents the design and the validation of a dataset that would comply with the Operational Design Domain (ODD) of a Machine-Learning (ML) system. Relying on emerging certification standards, we describe the process for establishing ODDs at both the system and image levels. In the process, we present the translation of high-level system constrai… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 12th European Congress on Embedded Real Time Software and Systems, Jun 2024, Toulouse, France. arXiv admin note: text overlap with arXiv:2304.09938

  4. arXiv:2406.07954  [pdf, other

    cs.CR cs.AI

    Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

    Authors: Edoardo Debenedetti, Javier Rando, Daniel Paleka, Silaghi Fineas Florin, Dragos Albastroiu, Niv Cohen, Yuval Lemberg, Reshmi Ghosh, Rui Wen, Ahmed Salem, Giovanni Cherubin, Santiago Zanella-Beguelin, Robin Schmid, Victor Klemm, Takahiro Miki, Chenhao Li, Stefan Kraft, Mario Fritz, Florian Tramèr, Sahar Abdelnabi, Lea Schönherr

    Abstract: Large language model systems face important security risks from maliciously crafted messages that aim to overwrite the system's original instructions or leak private data. To study this problem, we organized a capture-the-flag competition at IEEE SaTML 2024, where the flag is a secret string in the LLM system prompt. The competition was organized in two phases. In the first phase, teams developed… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  5. arXiv:2406.05904  [pdf, other

    cs.DC cs.CR

    Aegis: A Decentralized Expansion Blockchain

    Authors: Yogev Bar-On, Roi Bar-Zur, Omer Ben-Porat, Nimrod Cohen, Ittay Eyal, Matan Sitbon

    Abstract: Blockchains implement monetary systems operated by committees of nodes. The robustness of established blockchains presents an opportunity to leverage their infrastructure for creating expansion chains. Expansion chains can provide additional functionality to the primary chain they leverage or implement separate functionalities, while benefiting from the primary chain's security and the stability o… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  6. arXiv:2405.12211  [pdf, other

    cs.CV

    Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices

    Authors: Nathaniel Cohen, Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas, Tomer Michaeli

    Abstract: Text-to-image (T2I) diffusion models achieve state-of-the-art results in image synthesis and editing. However, leveraging such pretrained models for video editing is considered a major challenge. Many existing works attempt to enforce temporal consistency in the edited video through explicit correspondence mechanisms, either in pixel space or between deep features. These methods, however, struggle… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Code and examples are available at https://matankleiner.github.io/slicedit/

  7. arXiv:2404.13742  [pdf, other

    cs.RO cs.AI eess.SY

    Seamless Underwater Navigation with Limited Doppler Velocity Log Measurements

    Authors: Nadav Cohen, Itzik Klein

    Abstract: Autonomous Underwater Vehicles (AUVs) commonly utilize an inertial navigation system (INS) and a Doppler velocity log (DVL) for underwater navigation. To that end, their measurements are integrated through a nonlinear filter such as the extended Kalman filter (EKF). The DVL velocity vector estimate depends on retrieving reflections from the seabed, ensuring that at least three out of its four tran… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  8. arXiv:2404.06017  [pdf, other

    cs.CL

    Identifying Shop** Intent in Product QA for Proactive Recommendations

    Authors: Besnik Fetahu, Nachshon Cohen, Elad Haramaty, Liane Lewin-Eytan, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Voice assistants have become ubiquitous in smart devices allowing users to instantly access information via voice questions. While extensive research has been conducted in question answering for voice search, little attention has been paid on how to enable proactive recommendations from a voice assistant to its users. This is a highly challenging problem that often leads to user friction, mainly d… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted at IronGraphs@ECIR'2024

  9. arXiv:2404.03631  [pdf, other

    cs.CV

    Robust Concept Erasure Using Task Vectors

    Authors: Minh Pham, Kelly O. Marshall, Chinmay Hegde, Niv Cohen

    Abstract: With the rapid growth of text-to-image models, a variety of techniques have been suggested to prevent undesirable image generations. Yet, these methods often only protect against specific user prompts and have been shown to allow unsafe generations with other inputs. Here we focus on unconditionally erasing a concept from a text-to-image model rather than conditioning the erasure on the user's pro… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  10. arXiv:2403.08788  [pdf, other

    cs.CV cs.AI cs.NE

    Verification for Object Detection -- IBP IoU

    Authors: Noémie Cohen, Mélanie Ducoffe, Ryma Boumazouza, Christophe Gabreau, Claire Pagetti, Xavier Pucel, Audrey Galametz

    Abstract: We introduce a novel Interval Bound Propagation (IBP) approach for the formal verification of object detection models, specifically targeting the Intersection over Union (IoU) metric. The approach has been implemented in an open source code, named IBP IoU, compatible with popular abstract interpretation based verification tools. The resulting verifier is evaluated on landing approach runway detect… ▽ More

    Submitted 30 January, 2024; originally announced March 2024.

  11. arXiv:2402.11137  [pdf, other

    cs.LG

    TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks

    Authors: Benjamin Feuer, Robin Tibor Schirrmeister, Valeriia Cherepanova, Chinmay Hegde, Frank Hutter, Micah Goldblum, Niv Cohen, Colin White

    Abstract: While tabular classification has traditionally relied on from-scratch training, a recent breakthrough called prior-data fitted networks (PFNs) challenges this approach. Similar to large language models, PFNs make use of pretraining and in-context learning to achieve strong performance on new tasks in a single forward pass. However, current PFNs have limitations that prohibit their widespread adopt… ▽ More

    Submitted 18 March, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  12. arXiv:2402.07875  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States

    Authors: Noam Razin, Yotam Alexander, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen

    Abstract: In modern machine learning, models can often fit training data in numerous ways, some of which perform well on unseen (test) data, while others do not. Remarkably, in such cases gradient descent frequently exhibits an implicit bias that leads to excellent performance on unseen data. This implicit bias was extensively studied in supervised learning, but is far less understood in optimal control (re… ▽ More

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

  13. arXiv:2402.07025  [pdf, other

    stat.ML cs.IT cs.LG

    Generalization Error of Graph Neural Networks in the Mean-field Regime

    Authors: Gholamali Aminian, Yixuan He, Gesine Reinert, Łukasz Szpruch, Samuel N. Cohen

    Abstract: This work provides a theoretical framework for assessing the generalization error of graph neural networks in the over-parameterized regime, where the number of parameters surpasses the quantity of data points. We explore two widely utilized types of graph neural networks: graph convolutional neural networks and message passing graph neural networks. Prior to this study, existing bounds on the gen… ▽ More

    Submitted 1 July, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted in ICML 2024

  14. arXiv:2402.05934  [pdf, other

    cs.LG cs.SI

    Classifying Nodes in Graphs without GNNs

    Authors: Daniel Winter, Niv Cohen, Yedid Hoshen

    Abstract: Graph neural networks (GNNs) are the dominant paradigm for classifying nodes in a graph, but they have several undesirable attributes stemming from their message passing architecture. Recently, distillation methods succeeded in eliminating the use of GNNs at test time but they still require them during training. We perform a careful analysis of the role that GNNs play in distillation methods. This… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  15. arXiv:2402.00035  [pdf, other

    cs.CV cs.LG cs.LO

    Robustness Assessment of a Runway Object Classifier for Safe Aircraft Taxiing

    Authors: Yizhak Elboher, Raya Elsaleh, Omri Isac, Mélanie Ducoffe, Audrey Galametz, Guillaume Povéda, Ryma Boumazouza, Noémie Cohen, Guy Katz

    Abstract: As deep neural networks (DNNs) are becoming the prominent solution for many computational problems, the aviation industry seeks to explore their potential in alleviating pilot workload and in improving operational safety. However, the use of DNNs in this type of safety-critical applications requires a thorough certification process. This need can be addressed through formal verification, which pro… ▽ More

    Submitted 28 June, 2024; v1 submitted 8 January, 2024; originally announced February 2024.

    Comments: This is a preprint version of the paper in the proceedings of 43rd Digital Avionics Systems Conference (DASC)

  16. arXiv:2401.15620  [pdf, other

    cs.RO cs.AI eess.SP eess.SY

    Data-Driven Strategies for Co** with Incomplete DVL Measurements

    Authors: Nadav Cohen, Itzik Klein

    Abstract: Autonomous underwater vehicles are specialized platforms engineered for deep underwater operations. Critical to their functionality is autonomous navigation, typically relying on an inertial navigation system and a Doppler velocity log. In real-world scenarios, incomplete Doppler velocity log measurements occur, resulting in positioning errors and mission aborts. To cope with such situations, a mo… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  17. arXiv:2401.09987  [pdf, other

    cs.RO cs.AI eess.SY

    A-KIT: Adaptive Kalman-Informed Transformer

    Authors: Nadav Cohen, Itzik Klein

    Abstract: The extended Kalman filter (EKF) is a widely adopted method for sensor fusion in navigation applications. A crucial aspect of the EKF is the online determination of the process noise covariance matrix reflecting the model uncertainty. While common EKF implementation assumes a constant process noise, in real-world scenarios, the process noise varies, leading to inaccuracies in the estimated state a… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  18. arXiv:2311.14773  [pdf, other

    cs.CV cs.LG

    Set Features for Anomaly Detection

    Authors: Niv Cohen, Issar Tzachor, Yedid Hoshen

    Abstract: This paper proposes to use set features for detecting anomalies in samples that consist of unusual combinations of normal elements. Many leading methods discover anomalies by detecting an unusual part of a sample. For example, state-of-the-art segmentation-based approaches, first classify each element of the sample (e.g., image patch) as normal or anomalous and then classify the entire sample as a… ▽ More

    Submitted 9 June, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.12245

  19. arXiv:2311.10609  [pdf, other

    cs.LG cs.DB

    Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks

    Authors: Benjamin Feuer, Chinmay Hegde, Niv Cohen

    Abstract: Tabular classification has traditionally relied on supervised algorithms, which estimate the parameters of a prediction model using its training data. Recently, Prior-Data Fitted Networks (PFNs) such as TabPFN have successfully learned to classify tabular data in-context: the model parameters are designed to classify new samples based on labelled training samples given after the model training. Wh… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 2nd Table Representation Learning Workshop: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  20. arXiv:2310.16047  [pdf, other

    cs.CV cs.LG eess.IV

    From Posterior Sampling to Meaningful Diversity in Image Restoration

    Authors: Noa Cohen, Hila Manor, Yuval Bahat, Tomer Michaeli

    Abstract: Image restoration problems are typically ill-posed in the sense that each degraded image can be restored in infinitely many valid ways. To accommodate this, many works generate a diverse set of outputs by attempting to randomly sample from the posterior distribution of natural images given the degraded input. Here we argue that this strategy is commonly of limited practical value because of the he… ▽ More

    Submitted 11 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted for ICLR 2024. Code and examples are available at https://noa-cohen.github.io/MeaningfulDiversityInIR

  21. arXiv:2309.14568  [pdf, other

    cs.CL

    Introducing DictaLM -- A Large Generative Language Model for Modern Hebrew

    Authors: Shaltiel Shmidman, Avi Shmidman, Amir David Nissan Cohen, Moshe Koppel

    Abstract: We present DictaLM, a large-scale language model tailored for Modern Hebrew. Boasting 7B parameters, this model is predominantly trained on Hebrew-centric data. As a commitment to promoting research and development in the Hebrew language, we release both the foundation model and the instruct-tuned model under a Creative Commons license. Concurrently, we introduce DictaLM-Rab, another foundation mo… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  22. arXiv:2308.01508  [pdf, other

    cs.LG cs.CR cs.CV

    Circumventing Concept Erasure Methods For Text-to-Image Generative Models

    Authors: Minh Pham, Kelly O. Marshall, Niv Cohen, Govind Mittal, Chinmay Hegde

    Abstract: Text-to-image generative models can produce photo-realistic images for an extremely broad range of concepts, and their usage has proliferated widely among the general public. On the flip side, these models have numerous drawbacks, including their potential to generate images featuring sexually explicit content, mirror artistic styles without permission, or even hallucinate (or deepfake) the likene… ▽ More

    Submitted 8 October, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  23. arXiv:2307.00014  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Inertial Navigation Meets Deep Learning: A Survey of Current Trends and Future Directions

    Authors: Nadav Cohen, Itzik Klein

    Abstract: Inertial sensing is used in many applications and platforms, ranging from day-to-day devices such as smartphones to very complex ones such as autonomous vehicles. In recent years, the development of machine learning and deep learning techniques has increased significantly in the field of inertial sensing and sensor fusion. This is due to the development of efficient computing hardware and the acce… ▽ More

    Submitted 25 February, 2024; v1 submitted 22 June, 2023; originally announced July 2023.

  24. arXiv:2306.11623  [pdf, ps, other

    stat.ML cs.LG math.ST

    Mean-field Analysis of Generalization Errors

    Authors: Gholamali Aminian, Samuel N. Cohen, Łukasz Szpruch

    Abstract: We propose a novel framework for exploring weak and $L_2$ generalization errors of algorithms through the lens of differential calculus on the space of probability measures. Specifically, we consider the KL-regularized empirical risk minimization problem and establish generic conditions under which the generalization error convergence rate, when training on a sample of size $n$, is… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 49 pages

    MSC Class: 62B10; 60F99; 49N80; 46N30

  25. arXiv:2306.07284  [pdf, other

    cs.LG cs.CV

    No Free Lunch: The Hazards of Over-Expressive Representations in Anomaly Detection

    Authors: Tal Reiss, Niv Cohen, Yedid Hoshen

    Abstract: Anomaly detection methods, powered by deep learning, have recently been making significant progress, mostly due to improved representations. It is tempting to hypothesize that anomaly detection can improve indefinitely by increasing the scale of our networks, making their representations more expressive. In this paper, we provide theoretical and empirical evidence to the contrary. In fact, we empi… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  26. arXiv:2305.06000  [pdf, ps, other

    math.NA cs.LG

    Global Convergence of Deep Galerkin and PINNs Methods for Solving Partial Differential Equations

    Authors: Deqing Jiang, Justin Sirignano, Samuel N. Cohen

    Abstract: Numerically solving high-dimensional partial differential equations (PDEs) is a major challenge. Conventional methods, such as finite difference methods, are unable to solve high-dimensional PDEs due to the curse-of-dimensionality. A variety of deep learning methods have been recently developed to try and solve high-dimensional PDEs by approximating the solution using a neural network. In this pap… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

  27. arXiv:2304.14841  [pdf, other

    cs.CV

    3D shape reconstruction of semi-transparent worms

    Authors: Thomas P. Ilett, Omer Yuval, Thomas Ranner, Netta Cohen, David C. Hogg

    Abstract: 3D shape reconstruction typically requires identifying object features or textures in multiple images of a subject. This approach is not viable when the subject is semi-transparent and moving in and out of focus. Here we overcome these challenges by rendering a candidate shape with adaptive blurring and transparency for comparison with the images. We use the microscopic nematode Caenorhabditis ele… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: 18 pages, 10 figures, published at CVPR'23

  28. arXiv:2303.11249  [pdf, other

    cs.LG cs.AI quant-ph

    What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement

    Authors: Yotam Alexander, Nimrod De La Vega, Noam Razin, Nadav Cohen

    Abstract: The question of what makes a data distribution suitable for deep learning is a fundamental open problem. Focusing on locally connected neural networks (a prevalent family of architectures that includes convolutional and recurrent neural networks as well as local self-attention models), we address this problem by adopting theoretical tools from quantum physics. Our main theoretical result states th… ▽ More

    Submitted 21 January, 2024; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted to NeurIPS 2023

  29. arXiv:2302.12245  [pdf, other

    cs.CV cs.LG

    Set Features for Fine-grained Anomaly Detection

    Authors: Niv Cohen, Issar Tzachor, Yedid Hoshen

    Abstract: Fine-grained anomaly detection has recently been dominated by segmentation based approaches. These approaches first classify each element of the sample (e.g., image patch) as normal or anomalous and then classify the entire sample as anomalous if it contains anomalous elements. However, such approaches do not extend to scenarios where the anomalies are expressed by an unusual combination of normal… ▽ More

    Submitted 2 March, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

  30. arXiv:2212.11671  [pdf, other

    cs.RO cs.LG eess.SP

    Set-Transformer BeamsNet for AUV Velocity Forecasting in Complete DVL Outage Scenarios

    Authors: Nadav Cohen, Zeev Yampolsky, Itzik Klein

    Abstract: Autonomous underwater vehicles (AUVs) are regularly used for deep ocean applications. Commonly, the autonomous navigation task is carried out by a fusion between two sensors: the inertial navigation system and the Doppler velocity log (DVL). The DVL operates by transmitting four acoustic beams to the sea floor, and once reflected back, the AUV velocity vector can be estimated. However, in real-lif… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

  31. arXiv:2212.00784  [pdf, other

    cs.CV cs.LG

    Improving Zero-Shot Models with Label Distribution Priors

    Authors: Jonathan Kahana, Niv Cohen, Yedid Hoshen

    Abstract: Labeling large image datasets with attributes such as facial age or object type is tedious and sometimes infeasible. Supervised machine learning methods provide a highly accurate solution, but require manual labels which are often unavailable. Zero-shot models (e.g., CLIP) do not require manual labels but are not as accurate as supervised ones, particularly when the attribute is numeric. We propos… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  32. arXiv:2211.16494  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    On the Ability of Graph Neural Networks to Model Interactions Between Vertices

    Authors: Noam Razin, Tom Verbin, Nadav Cohen

    Abstract: Graph neural networks (GNNs) are widely used for modeling complex interactions between entities represented as vertices of a graph. Despite recent efforts to theoretically analyze the expressive power of GNNs, a formal characterization of their ability to model interactions is lacking. The current paper aims to address this gap. Formalizing strength of interactions through an established measure k… ▽ More

    Submitted 23 October, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS 2023

  33. arXiv:2211.12904  [pdf

    cs.AI cs.HC

    Implementation and Evaluation of a System for Assessment of The Quality of Long-Term Management of Patients at a Geriatric Hospital

    Authors: Erez Shalom, Ayelet Goldstein, Roni Wais, Maya Slivanova, Nogah Melamed Cohen, Yuval Shahar

    Abstract: Background The use of a clinical decision support system for assessing the quality of care, based on computerized clinical guidelines (GLs), is likely to improve care, reduce costs, save time, and enhance the staff's capabilities. Objectives Implement and evaluate a system for assessment of the quality of the care, in the domain of management of pressure ulcers, by investigating the level of… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  34. arXiv:2211.11540  [pdf, other

    cs.CR

    A Framework for Auditable Synthetic Data Generation

    Authors: Florimond Houssiau, Samuel N. Cohen, Lukasz Szpruch, Owen Daniel, Michaela G. Lawrence, Robin Mitra, Henry Wilde, Callum Mole

    Abstract: Synthetic data has gained significant momentum thanks to sophisticated machine learning tools that enable the synthesis of high-dimensional datasets. However, many generation techniques do not give the data controller control over what statistical patterns are captured, leading to concerns over privacy protection. While synthetic records are not linked to a particular real-world individual, they c… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  35. arXiv:2211.06550  [pdf, other

    cs.CR cs.AI cs.LG

    TAPAS: a Toolbox for Adversarial Privacy Auditing of Synthetic Data

    Authors: Florimond Houssiau, James Jordon, Samuel N. Cohen, Owen Daniel, Andrew Elliott, James Geddes, Callum Mole, Camila Rangel-Smith, Lukasz Szpruch

    Abstract: Personal data collected at scale promises to improve decision-making and accelerate innovation. However, sharing and using such data raises serious privacy concerns. A promising solution is to produce synthetic data, artificial records to share instead of real data. Since synthetic records are not linked to real persons, this intuitively prevents classical re-identification attacks. However, this… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: Published at the SyntheticData4ML Neurips workshop

  36. arXiv:2210.14064  [pdf, other

    cs.LG

    Learning Low Dimensional State Spaces with Overparameterized Recurrent Neural Nets

    Authors: Edo Cohen-Karlik, Itamar Menuhin-Gruman, Raja Giryes, Nadav Cohen, Amir Globerson

    Abstract: Overparameterization in deep learning typically refers to settings where a trained neural network (NN) has representational capacity to fit the training data in many ways, some of which generalize well, while others do not. In the case of Recurrent Neural Networks (RNNs), there exists an additional layer of overparameterization, in the sense that a model may exhibit many solutions that generalize… ▽ More

    Submitted 23 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023, 9 pages, 2 figures plus supplementary

  37. arXiv:2210.12497  [pdf, other

    math.DS cs.LG

    Deep Linear Networks for Matrix Completion -- An Infinite Depth Limit

    Authors: Nadav Cohen, Govind Menon, Zsolt Veraszto

    Abstract: The deep linear network (DLN) is a model for implicit regularization in gradient based optimization of overparametrized learning architectures. Training the DLN corresponds to a Riemannian gradient flow, where the Riemannian metric is defined by the architecture of the network and the loss function is defined by the learning task. We extend this geometric framework, obtaining explicit expressions… ▽ More

    Submitted 10 May, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

    MSC Class: 68T07; 58D17; 37N40

  38. LiBeamsNet: AUV Velocity Vector Estimation in Situations of Limited DVL Beam Measurements

    Authors: Nadav Cohen, Itzik Klein

    Abstract: Autonomous underwater vehicles (AUVs) are employed for marine applications and can operate in deep underwater environments beyond human reach. A standard solution for the autonomous navigation problem can be obtained by fusing the inertial navigation system and the Doppler velocity log sensor (DVL). The latter measures four beam velocities to estimate the vehicle's velocity vector. In real-world s… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  39. arXiv:2210.10773  [pdf, other

    cs.LG cs.CV

    Anomaly Detection Requires Better Representations

    Authors: Tal Reiss, Niv Cohen, Eliahu Horwitz, Ron Abutbul, Yedid Hoshen

    Abstract: Anomaly detection seeks to identify unusual phenomena, a central task in science and industry. The task is inherently unsupervised as anomalies are unexpected and unknown during training. Recent advances in self-supervised representation learning have directly driven improvements in anomaly detection. In this position paper, we first explain how self-supervised representations can be easily used t… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to ECCV SSLWIN Workshop (2022)

  40. arXiv:2207.03478  [pdf, other

    cs.CV cs.LG

    Red PANDA: Disambiguating Anomaly Detection by Removing Nuisance Factors

    Authors: Niv Cohen, Jonathan Kahana, Yedid Hoshen

    Abstract: Anomaly detection methods strive to discover patterns that differ from the norm in a semantic way. This goal is ambiguous as a data point differing from the norm by an attribute e.g., age, race or gender, may be considered anomalous by some operators while others may consider this attribute irrelevant. Breaking from previous research, we present a new anomaly detection method that allows operators… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  41. arXiv:2206.13603  [pdf, other

    cs.RO cs.LG eess.SP eess.SY

    BeamsNet: A data-driven Approach Enhancing Doppler Velocity Log Measurements for Autonomous Underwater Vehicle Navigation

    Authors: Nadav Cohen, Itzik Klein

    Abstract: Autonomous underwater vehicles (AUV) perform various applications such as seafloor map** and underwater structure health monitoring. Commonly, an inertial navigation system aided by a Doppler velocity log (DVL) is used to provide the vehicle's navigation solution. In such fusion, the DVL provides the velocity vector of the AUV, which determines the navigation solution's accuracy and helps estima… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Report number: ISSN 0952-1976

    Journal ref: Engineering Applications of Artificial Intelligence, Volume 114, 2022, 105216

  42. arXiv:2205.03257  [pdf, other

    cs.LG

    Synthetic Data -- what, why and how?

    Authors: James Jordon, Lukasz Szpruch, Florimond Houssiau, Mirko Bottarelli, Giovanni Cherubin, Carsten Maple, Samuel N. Cohen, Adrian Weller

    Abstract: This explainer document aims to provide an overview of the current state of the rapidly expanding work on synthetic data technologies, with a particular focus on privacy. The article is intended for a non-technical audience, though some formal definitions have been given to provide clarity to specialists. This article is intended to enable the reader to quickly become familiar with the notion of s… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: Commissioned by the Royal Society. 57 pages 2 figures

  43. arXiv:2204.01694  [pdf, other

    cs.CV cs.LG

    "This is my unicorn, Fluffy": Personalizing frozen vision-language representations

    Authors: Niv Cohen, Rinon Gal, Eli A. Meirom, Gal Chechik, Yuval Atzmon

    Abstract: Large Vision & Language models pretrained on web-scale data provide representations that are invaluable for numerous V&L problems. However, it is unclear how they can be used for reasoning about user-specific visual concepts in unstructured language. This problem arises in multiple domains, from personalized image retrieval to personalized interaction with smart devices. We introduce a new learnin… ▽ More

    Submitted 2 August, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: Accepted to ECCV (Oral). Compared to the ECCV camera ready version, we moved the ablation study to the main text, and updated the related work

  44. arXiv:2203.17128  [pdf, other

    math.NA cs.LG math.AP math.PR stat.ML

    Neural Q-learning for solving PDEs

    Authors: Samuel N. Cohen, Deqing Jiang, Justin Sirignano

    Abstract: Solving high-dimensional partial differential equations (PDEs) is a major challenge in scientific computing. We develop a new numerical method for solving elliptic-type PDEs by adapting the Q-learning algorithm in reinforcement learning. Our "Q-PDE" algorithm is mesh-free and therefore has the potential to overcome the curse of dimensionality. Using a neural tangent kernel (NTK) approach, we prove… ▽ More

    Submitted 24 June, 2023; v1 submitted 31 March, 2022; originally announced March 2022.

    MSC Class: 65N12; 62M45; 37N30; 60H30

  45. arXiv:2203.03238  [pdf, other

    cs.CV cs.GR cs.LG

    Semantic Segmentation in Art Paintings

    Authors: Nadav Cohen, Yael Newman, Ariel Shamir

    Abstract: Semantic segmentation is a difficult task even when trained in a supervised manner on photographs. In this paper, we tackle the problem of semantic segmentation of artistic paintings, an even more challenging task because of a much larger diversity in colors, textures, and shapes and because there are no ground truth annotations available for segmentation. We propose an unsupervised method for sem… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Published as a conference paper at EuroGraphics 2022

  46. arXiv:2202.04302  [pdf, other

    cs.LG

    On the Implicit Bias of Gradient Descent for Temporal Extrapolation

    Authors: Edo Cohen-Karlik, Avichai Ben David, Nadav Cohen, Amir Globerson

    Abstract: When using recurrent neural networks (RNNs) it is common practice to apply trained models to sequences longer than those seen in training. This "extrapolating" usage deviates from the traditional statistical learning setup where guarantees are provided under the assumption that train and test distributions are identical. Here we set out to understand when RNNs can extrapolate, focusing on a simple… ▽ More

    Submitted 24 March, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: 8 pages, 5 figures (plus appendix), AISTATS2022

  47. arXiv:2201.11729  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks

    Authors: Noam Razin, Asaf Maman, Nadav Cohen

    Abstract: In the pursuit of explaining implicit regularization in deep learning, prominent focus was given to matrix and tensor factorizations, which correspond to simplified neural networks. It was shown that these models exhibit an implicit tendency towards low matrix and tensor ranks, respectively. Drawing closer to practical deep learning, the current paper theoretically analyzes the implicit regulariza… ▽ More

    Submitted 18 September, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted to ICML 2022

  48. arXiv:2112.07662  [pdf, other

    cs.CV cs.LG

    Out-of-Distribution Detection Without Class Labels

    Authors: Niv Cohen, Ron Abutbul, Yedid Hoshen

    Abstract: Out-of-distribution detection seeks to identify novelties, samples that deviate from the norm. The task has been found to be quite challenging, particularly in the case where the normal data distribution consists of multiple semantic classes (e.g., multiple object categories). To overcome this challenge, current approaches require manual labeling of the normal images provided during training. In t… ▽ More

    Submitted 22 September, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted to ECCV L2ID Workshop (2022)

  49. arXiv:2112.07661  [pdf, other

    cs.CV

    Approaches Toward Physical and General Video Anomaly Detection

    Authors: Laura Kart, Niv Cohen

    Abstract: In recent years, many works have addressed the problem of finding never-seen-before anomalies in videos. Yet, most work has been focused on detecting anomalous frames in surveillance videos taken from security cameras. Meanwhile, the task of anomaly detection (AD) in videos exhibiting anomalous mechanical behavior, has been mostly overlooked. Anomaly detection in such videos is both of academic an… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  50. arXiv:2111.01726  [pdf, other

    cs.AI

    Instructive artificial intelligence (AI) for human training, assistance, and explainability

    Authors: Nicholas Kantack, Nina Cohen, Nathan Bos, Corey Lowman, James Everett, Timothy Endres

    Abstract: We propose a novel approach to explainable AI (XAI) based on the concept of "instruction" from neural networks. In this case study, we demonstrate how a superhuman neural network might instruct human trainees as an alternative to traditional approaches to XAI. Specifically, an AI examines human actions and calculates variations on the human strategy that lead to better performance. Experiments wit… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 10 pages, 6 figures, to be published in SPIE Defense & Commercial Sensing (Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications IV) proceedings (April 2022)

    ACM Class: I.2.6