Showing 1–28 of 28 results for author: Weinshall, D

  1. arXiv:2407.01804  [pdf, other]

    cs.LG

    DCoM: Active Learning for All Learners

    Authors: Inbal Mishal, Daphna Weinshall

    Abstract: Deep Active Learning (AL) techniques can be effective in reducing annotation costs for training deep models. However, their effectiveness in low- and high-budget scenarios seems to require different strategies, and achieving optimal results across varying budget scenarios remains a challenge. In this study, we introduce Dynamic Coverage & Margin mix (DCoM), a novel active learning approach designe…

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2407.00673  [pdf, other]

    cs.LG

    TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learning

    Authors: Shahar Shaul-Ariel, Daphna Weinshall

    Abstract: Continual Learning is an unresolved challenge, whose relevance increases when considering modern applications. Unlike the human brain, trained deep neural networks suffer from a phenomenon called Catastrophic Forgetting, where they progressively lose previously acquired knowledge upon learning new tasks. To mitigate this problem, numerous methods have been developed, many relying on replaying past…

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2310.11094  [pdf, other]

    cs.LG

    Relearning Forgotten Knowledge: on Forgetting, Overfit and Training-Free Ensembles of DNNs

    Authors: Uri Stern, Daphna Weinshall

    Abstract: The infrequent occurrence of overfit in deep neural networks is perplexing. On the one hand, theory predicts that as models get larger they should eventually become too specialized for a specific training set, with ensuing decrease in generalization. In contrast, empirical results in image classification indicate that increasing the training time of deep models or using bigger models almost never…

    Submitted 28 December, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

  4. arXiv:2310.11077  [pdf, other]

    cs.LG cs.CV

    United We Stand: Using Epoch-wise Agreement of Ensembles to Combat Overfit

    Authors: Uri Stern, Daniel Shwartz, Daphna Weinshall

    Abstract: Deep neural networks have become the method of choice for solving many classification tasks, largely because they can fit very complex functions defined over raw data. The downside of such powerful learners is the danger of overfit. In this paper, we introduce a novel ensemble classifier for deep networks that effectively overcomes overfitting by combining models generated at specific intermediate…

    Submitted 28 December, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Journal ref: Proceedings: 38th Annual AAAI Conference on Artificial Intelligence, Feb 2024

  5. arXiv:2308.14119  [pdf, other]

    cs.CV cs.LG

    Semi-Supervised Learning in the Few-Shot Zero-Shot Scenario

    Authors: Noam Fluss, Guy Hacohen, Daphna Weinshall

    Abstract: Semi-Supervised Learning (SSL) is a framework that utilizes both labeled and unlabeled data to enhance model performance. Conventional SSL methods operate under the assumption that labeled and unlabeled data share the same label space. However, in practical real-world scenarios, especially when the labeled training dataset is limited in size, some classes may be totally absent from the labeled set…

    Submitted 15 November, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

  6. arXiv:2308.14058  [pdf, other]

    cs.LG cs.AI cs.CV

    Pruning the Unlabeled Data to Improve Semi-Supervised Learning

    Authors: Guy Hacohen, Daphna Weinshall

    Abstract: In the domain of semi-supervised learning (SSL), the conventional approach involves training a learner with a limited amount of labeled data alongside a substantial volume of unlabeled data, both drawn from the same underlying distribution. However, for deep learning models, this standard practice may not yield optimal results. In this research, we propose an alternative perspective, suggesting th…

    Submitted 27 August, 2023; originally announced August 2023.

  7. arXiv:2306.03543  [pdf, other]

    cs.LG

    How to Select Which Active Learning Strategy is Best Suited for Your Specific Problem and Budget

    Authors: Guy Hacohen, Daphna Weinshall

    Abstract: In the domain of Active Learning (AL), a learner actively selects which unlabeled examples to seek labels from an oracle, while operating within predefined budget constraints. Importantly, it has been recently shown that distinct query strategies are better suited for different conditions and budgetary constraints. In practice, the determination of the most appropriate AL strategy for a given situ…

    Submitted 23 October, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Journal ref: Proceedings: 37th Conference on Neural Information Processing Systems (NeurIPS), Dec 2023

  8. arXiv:2210.00583  [pdf, other]

    cs.LG cs.AI

    The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels

    Authors: Daniel Shwartz, Uri Stern, Daphna Weinshall

    Abstract: Deep neural networks have incredible capacity and expressibility, and can seemingly memorize any training set. This introduces a problem when training in the presence of noisy labels, as the noisy examples cannot be distinguished from clean examples by the end of training. Recent research has dealt with this challenge by utilizing the fact that deep networks seem to memorize clean examples much ea…

    Submitted 2 October, 2022; originally announced October 2022.

  9. arXiv:2205.11320  [pdf, other]

    cs.LG

    Active Learning Through a Covering Lens

    Authors: Ofer Yehuda, Avihu Dekel, Guy Hacohen, Daphna Weinshall

    Abstract: Deep active learning aims to reduce the annotation cost for the training of deep models, which is notoriously data-hungry. Until recently, deep active learning methods were ineffectual in the low-budget regime, where only a small number of examples are annotated. The situation has been alleviated by recent advances in representation and self-supervised learning, which impart the geometry of the da…

    Submitted 29 December, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Journal ref: Proceedings: 36th Conference on Neural Information Processing Systems (NeurIPS), Dec 2022

  10. arXiv:2202.02794  [pdf, other]

    cs.LG

    Active Learning on a Budget: Opposite Strategies Suit High and Low Budgets

    Authors: Guy Hacohen, Avihu Dekel, Daphna Weinshall

    Abstract: Investigating active learning, we focus on the relation between the number of labeled examples (budget size), and suitable querying strategies. Our theoretical analysis shows a behavior reminiscent of phase transition: typical examples are best queried when the budget is low, while unrepresentative examples are best queried when the budget is large. Combined evidence shows that a similar phenomeno…

    Submitted 16 June, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: ICML 2022

    Journal ref: 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

  11. arXiv:2109.06096  [pdf, other]

    cs.CL cs.AI cs.LG

    The Grammar-Learning Trajectories of Neural Language Models

    Authors: Leshem Choshen, Guy Hacohen, Daphna Weinshall, Omri Abend

    Abstract: The learning trajectories of linguistic phenomena in humans provide insight into linguistic representation, beyond what can be gleaned from inspecting the behavior of an adult speaker. To apply a similar approach to analyze neural language models (NLM), it is first necessary to establish that different models are similar enough in the generalizations they make. In this paper, we show that NLMs wit…

    Submitted 6 April, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: ACL camera-ready

  12. arXiv:2105.05553  [pdf, other]

    cs.LG

    Principal Components Bias in Over-parameterized Linear Models, and its Manifestation in Deep Neural Networks

    Authors: Guy Hacohen, Daphna Weinshall

    Abstract: Recent work suggests that convolutional neural networks of different architectures learn to classify images in the same order. To understand this phenomenon, we revisit the over-parametrized deep linear network model. Our analysis reveals that, when the hidden layers are wide enough, the convergence rate of this model's parameters is exponentially faster along the directions of the larger principa…

    Submitted 8 June, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: JMLR 2022

    Journal ref: Journal of Machine Learning Research, 23(155):1-46, 2022

  13. arXiv:2102.04924  [pdf, other]

    cs.LG cs.CV

    More Is More -- Narrowing the Generalization Gap by Adding Classification Heads

    Authors: Roee Cates, Daphna Weinshall

    Abstract: Overfit is a fundamental problem in machine learning in general, and in deep learning in particular. In order to reduce overfit and improve generalization in the classification of images, some employ invariance to a group of transformations, such as rotations and reflections. However, since not all objects exhibit necessarily the same invariance, it seems desirable to allow the network to learn th…

    Submitted 11 February, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

  14. arXiv:2012.00504  [pdf, other]

    cs.CV cs.LG

    Boosting the Performance of Semi-Supervised Learning with Unsupervised Clustering

    Authors: Boaz Lerner, Guy Shiran, Daphna Weinshall

    Abstract: Recently, Semi-Supervised Learning (SSL) has shown much promise in leveraging unlabeled data while being provided with very few labels. In this paper, we show that ignoring the labels altogether for whole epochs intermittently during training can significantly improve performance in the small sample regime. More specifically, we propose to train a network on two tasks jointly. The primary classifi…

    Submitted 1 December, 2020; originally announced December 2020.

  15. arXiv:2011.12942  [pdf, other]

    cs.CV cs.LG

    Multiclass non-Adversarial Image Synthesis, with Application to Classification from Very Small Sample

    Authors: Itamar Winter, Daphna Weinshall

    Abstract: The generation of synthetic images is currently being dominated by Generative Adversarial Networks (GANs). Despite their outstanding success in generating realistic looking images, they still suffer from major drawbacks, including an unstable and highly sensitive training procedure, mode-collapse and mode-mixture, and dependency on large training sets. In this work we present a novel non-adversari…

    Submitted 1 December, 2020; v1 submitted 25 November, 2020; originally announced November 2020.

  16. arXiv:2003.14297  [pdf, other]

    cs.LG cs.CV stat.ML

    Generative Latent Implicit Conditional Optimization when Learning from Small Sample

    Authors: Idan Azuri, Daphna Weinshall

    Abstract: We revisit the long-standing problem of learning from a small sample, to which end we propose a novel method called GLICO (Generative Latent Implicit Conditional Optimization). GLICO learns a mapping from the training examples to a latent space and a generator that generates images from vectors in the latent space. Unlike most recent works, which rely on access to large amounts of unlabeled data,…

    Submitted 15 December, 2020; v1 submitted 31 March, 2020; originally announced March 2020.

    Comments: Published at ICPR 2020

    Journal ref: Proc. ICPR, January 2021

  17. arXiv:1912.02678  [pdf, other]

    cs.CV

    Multi-Modal Deep Clustering: Unsupervised Partitioning of Images

    Authors: Guy Shiran, Daphna Weinshall

    Abstract: The clustering of unlabeled raw images is a daunting task, which has recently been approached with some success by deep learning methods. Here we propose an unsupervised clustering framework, which learns a deep neural network in an end-to-end fashion, providing direct cluster assignments of images without additional processing. Multi-Modal Deep Clustering (MMDC) trains a deep network to align it…

    Submitted 15 December, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Accepted to ICPR 2020

  18. arXiv:1905.10854  [pdf, other]

    cs.LG stat.ML

    Let's Agree to Agree: Neural Networks Share Classification Order on Real Datasets

    Authors: Guy Hacohen, Leshem Choshen, Daphna Weinshall

    Abstract: We report a series of robust empirical observations, demonstrating that deep Neural Networks learn the examples in both the training and test sets in a similar order. This phenomenon is observed in all the commonly used benchmarks we evaluated, including many image classification benchmarks, and one text classification benchmark. While this phenomenon is strongest for models of the same architectu…

    Submitted 20 July, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: Published at ICML 2020

    Journal ref: Proceedings: 37th International Conference on Machine Learning (ICML), Vienna, Austria, July 2020

  19. arXiv:1904.03626  [pdf, other]

    cs.LG stat.ML

    On The Power of Curriculum Learning in Training Deep Networks

    Authors: Guy Hacohen, Daphna Weinshall

    Abstract: Training neural networks is traditionally done by providing a sequence of random mini-batches sampled uniformly from the entire training data. In this work, we analyze the effect of curriculum learning, which involves the non-uniform sampling of mini-batches, on the training of deep networks, and specifically CNNs trained for image recognition. To employ curriculum learning, the training algorithm…

    Submitted 29 May, 2019; v1 submitted 7 April, 2019; originally announced April 2019.

    Comments: In proceedings, ICML 2019

    Journal ref: Proc. ICML, 2019

  20. arXiv:1901.10788  [pdf, other]

    cs.CV

    Blurred Images Lead to Bad Local Minima

    Authors: Gal Katzhendler, Daphna Weinshall

    Abstract: Blurred Images Lead to Bad Local Minima

    Submitted 30 January, 2019; originally announced January 2019.

  21. arXiv:1812.03472  [pdf, other]

    cs.LG stat.ML

    Theory of Curriculum Learning, with Convex Loss Functions

    Authors: Daphna Weinshall, Dan Amir

    Abstract: Curriculum Learning - the idea of teaching by gradually exposing the learner to examples in a meaningful order, from easy to hard, has been investigated in the context of machine learning long ago. Although methods based on this concept have been empirically shown to improve performance of several learning algorithms, no theoretical analysis has been provided even for simple cases. To address this…

    Submitted 9 December, 2018; originally announced December 2018.

    Journal ref: Journal of Machine Learning Research (JMLR), 21:1-19, Nov 2020

  22. arXiv:1808.10356  [pdf, other]

    cs.LG cs.CV stat.ML

    Gaussian Mixture Generative Adversarial Networks for Diverse Datasets, and the Unsupervised Clustering of Images

    Authors: Matan Ben-Yosef, Daphna Weinshall

    Abstract: Generative Adversarial Networks (GANs) have been shown to produce realistically looking synthetic images with remarkable success, yet their performance seems less impressive when the training set is highly diverse. In order to provide a better fit to the target data distribution when the dataset includes many different classes, we propose a variant of the basic GAN model, called Gaussian Mixture G…

    Submitted 30 August, 2018; originally announced August 2018.

    Comments: 20 pages, 8 figures

  23. arXiv:1802.03796  [pdf, other]

    cs.LG

    Curriculum Learning by Transfer Learning: Theory and Experiments with Deep Networks

    Authors: Daphna Weinshall, Gad Cohen, Dan Amir

    Abstract: We provide theoretical investigation of curriculum learning in the context of stochastic gradient descent when optimizing the convex linear regression loss. We prove that the rate of convergence of an ideal curriculum learning method is monotonically increasing with the difficulty of the examples. Moreover, among all equally difficult points, convergence is faster when using points which incur hig…

    Submitted 8 June, 2018; v1 submitted 11 February, 2018; originally announced February 2018.

    Comments: ICML 2018

    Journal ref: Proceedings: 35th International Conference on Machine Learning (ICML), oral, Stockholm Sweden, July 2018

  24. arXiv:1709.09844  [pdf, other]

    cs.AI cs.CV cs.LG stat.ML

    Distance-based Confidence Score for Neural Network Classifiers

    Authors: Amit Mandelbaum, Daphna Weinshall

    Abstract: The reliable measurement of confidence in classifiers' predictions is very important for many applications and is, therefore, an important part of classifier design. Yet, although deep learning has received tremendous attention in recent years, not much progress has been made in quantifying the prediction confidence of neural network classifiers. Bayesian models offer a mathematically grounded fra…

    Submitted 28 September, 2017; originally announced September 2017.

  25. arXiv:1704.06062  [pdf, other]

    cs.LG stat.ML

    Every Untrue Label is Untrue in its Own Way: Controlling Error Type with the Log Bilinear Loss

    Authors: Yehezkel S. Resheff, Amit Mandelbaum, Daphna Weinshall

    Abstract: Deep learning has become the method of choice in many application domains of machine learning in recent years, especially for multi-class classification tasks. The most common loss function used in this context is the cross-entropy loss, which reduces to the log loss in the typical case when there is a single correct response label. While this loss is insensitive to the identity of the assigned cl…

    Submitted 20 April, 2017; originally announced April 2017.

  26. arXiv:1604.06242  [pdf, other]

    cs.CV

    Novelty Detection in MultiClass Scenarios with Incomplete Set of Class Labels

    Authors: Nomi Vinokurov, Daphna Weinshall

    Abstract: We address the problem of novelty detection in multiclass scenarios where some class labels are missing from the training set. Our method is based on the initial assignment of confidence values, which measure the affinity between a new test point and each known class. We first compare the values of the two top elements in this vector of confidence values. In the heart of our method lies the traini…

    Submitted 15 May, 2016; v1 submitted 21 April, 2016; originally announced April 2016.

    Comments: 10 pages

  27. arXiv:1511.05082  [pdf, other]

    cs.LG

    Topic Modeling of Behavioral Modes Using Sensor Data

    Authors: Yehezkel S. Resheff, Shay Rotics, Ran Nathan, Daphna Weinshall

    Abstract: The field of Movement Ecology, like so many other fields, is experiencing a period of rapid growth in availability of data. As the volume rises, traditional methods are giving way to machine learning and data science, which are playing an increasingly large part in turning this data into science-driving insights. One rich and interesting source is the bio-logger. These small electronic wearable de…

    Submitted 16 November, 2015; originally announced November 2015.

    Comments: Invited extended version of a paper presented at the international conference Data Science and Advanced Analytics, Paris, France, 19-21 October 2015

    Journal ref: International Journal of Data Science and Analytics 1.1 (2016): 51-60

  28. arXiv:1501.06751  [pdf, other]

    cs.CV

    A Cheap System for Vehicle Speed Detection

    Authors: Chaim Ginzburg, Amit Raphael, Daphna Weinshall

    Abstract: The reliable detection of speed of moving vehicles is considered key to traffic law enforcement in most countries, and is seen by many as an important tool to reduce the number of traffic accidents and fatalities. Many automatic systems and different methods are employed in different countries, but as a rule they tend to be expensive and/or labor intensive, often employing outdated technology due…

    Submitted 27 January, 2015; originally announced January 2015.