Skip to main content

Showing 1–22 of 22 results for author: Yousefzadeh, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.06077  [pdf, other

    cs.LG cs.AI stat.ML

    An Ambiguity Measure for Recognizing the Unknowns in Deep Learning

    Authors: Roozbeh Yousefzadeh

    Abstract: We study the understanding of deep neural networks from the scope in which they are trained on. While the accuracy of these models is usually impressive on the aggregate level, they still make mistakes, sometimes on cases that appear to be trivial. Moreover, these models are not reliable in realizing what they do not know leading to failures such as adversarial vulnerability and out-of-distributio… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  2. arXiv:2311.07618  [pdf, other

    cs.LG cs.AI cs.CL math.HO

    Large Language Models' Understanding of Math: Source Criticism and Extrapolation

    Authors: Roozbeh Yousefzadeh, Xuenan Cao

    Abstract: It has been suggested that large language models such as GPT-4 have acquired some form of understanding beyond the correlations among the words in text including some understanding of mathematics as well. Here, we perform a critical inquiry into this claim by evaluating the mathematical understanding of the GPT-4 model. Considering that GPT-4's training set is a secret, it is not straightforward t… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  3. arXiv:2203.12131  [pdf, ps, other

    cs.LG cs.CY

    Should Machine Learning Models Report to Us When They Are Clueless?

    Authors: Roozbeh Yousefzadeh, Xuenan Cao

    Abstract: The right to AI explainability has consolidated as a consensus in the research community and policy-making. However, a key component of explainability has been missing: extrapolation, which describes the extent to which AI models can be clueless when they encounter unfamiliar samples (i.e., samples outside the convex hull of their training sets, as we will explain). We report that AI models extrap… ▽ More

    Submitted 28 April, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

  4. arXiv:2203.10447  [pdf, ps, other

    cs.LG math.AG

    Over-parameterization: A Necessary Condition for Models that Extrapolate

    Authors: Roozbeh Yousefzadeh

    Abstract: In this work, we study over-parameterization as a necessary condition for having the ability for the models to extrapolate outside the convex hull of training set. We specifically, consider classification models, e.g., image classification and other applications of deep learning. Such models are classification functions that partition their domain and assign a class to each partition \cite{strang2… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

  5. arXiv:2203.10366  [pdf, ps, other

    cs.LG

    Deep Learning Generalization, Extrapolation, and Over-parameterization

    Authors: Roozbeh Yousefzadeh

    Abstract: We study the generalization of over-parameterized deep networks (for image classification) in relation to the convex hull of their training sets. Despite their great success, generalization of deep networks is considered a mystery. These models have orders of magnitude more parameters than their training samples, and they can achieve perfect accuracy on their training sets, even when training imag… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: Abstract accepted and presented at the Workshop on Theory of Over-parameterized Machine Learning, April 2021. arXiv admin note: text overlap with arXiv:2101.09849

  6. arXiv:2202.04052  [pdf, other

    cs.CV cs.AI cs.LG math.OC

    Decision boundaries and convex hulls in the feature space that deep learning functions learn from images

    Authors: Roozbeh Yousefzadeh

    Abstract: The success of deep neural networks in image classification and learning can be partly attributed to the features they extract from images. It is often speculated about the properties of a low-dimensional manifold that models extract and learn from images. However, there is not sufficient understanding about this low-dimensional space based on theory or empirical evidence. For image classification… ▽ More

    Submitted 3 May, 2022; v1 submitted 5 February, 2022; originally announced February 2022.

  7. arXiv:2201.11260  [pdf, other

    cs.LG cs.AI math.OC

    To what extent should we trust AI models when they extrapolate?

    Authors: Roozbeh Yousefzadeh, Xuenan Cao

    Abstract: Many applications affecting human lives rely on models that have come to be known under the umbrella of machine learning and artificial intelligence. These AI models are usually complicated mathematical functions that map from an input space to an output space. Stakeholders are interested to know the rationales behind models' decisions and functional behavior. We study this functional behavior in… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

  8. arXiv:2112.12021  [pdf, other

    eess.IV cs.CV cs.LG

    Community Detection in Medical Image Datasets: Using Wavelets and Spectral Methods

    Authors: Roozbeh Yousefzadeh

    Abstract: Medical image datasets can have large number of images representing patients with different health conditions and various disease severity. When dealing with raw unlabeled image datasets, the large number of samples often makes it hard for experts and non-experts to understand the variety of images present in a dataset. Supervised learning methods rely on labeled images which requires a considerab… ▽ More

    Submitted 26 April, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

  9. arXiv:2112.06763  [pdf, other

    math.NA cs.LG

    A Homotopy Algorithm for Optimal Transport

    Authors: Roozbeh Yousefzadeh

    Abstract: The optimal transport problem has many applications in machine learning, physics, biology, economics, etc. Although its goal is very clear and mathematically well-defined, finding its optimal solution can be challenging for large datasets in high-dimensional space. Here, we propose a homotopy algorithm that first transforms the problem into an easy form, by changing the target distribution. It the… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: OPT2020: 12th Annual Workshop on Optimization for Machine Learning

  10. arXiv:2112.03411  [pdf, other

    cs.LG cs.AI

    Extrapolation Frameworks in Cognitive Psychology Suitable for Study of Image Classification Models

    Authors: Roozbeh Yousefzadeh, Jessica A. Mollick

    Abstract: We study the functional task of deep learning image classification models and show that image classification requires extrapolation capabilities. This suggests that new theories have to be developed for the understanding of deep learning as the current theory assumes models are solely interpolating, leaving many questions about them unanswered. We investigate the pixel space and also the feature s… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 1st Workshop on Human and Machine Decisions (WHMD 2021) at NeurIPS 2021

  11. arXiv:2103.00695  [pdf, ps, other

    cs.LG cs.CR

    Federated Learning without Revealing the Decision Boundaries

    Authors: Roozbeh Yousefzadeh

    Abstract: We consider the recent privacy preserving methods that train the models not on original images, but on mixed images that look like noise and hard to trace back to the original images. We explain that those mixed images will be samples on the decision boundaries of the trained model, and although such methods successfully hide the contents of images from the entity in charge of federated learning,… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

  12. arXiv:2102.10502  [pdf, ps, other

    math.DG cs.LG math.OC

    A Sketching Method for Finding the Closest Point on a Convex Hull

    Authors: Roozbeh Yousefzadeh

    Abstract: We develop a sketching algorithm to find the point on the convex hull of a dataset, closest to a query point outside it. Studying the convex hull of datasets can provide useful information about their geometric structure and their distribution. Many machine learning datasets have large number of samples with large number of features, but exact algorithms in computational geometry are usually not d… ▽ More

    Submitted 29 March, 2022; v1 submitted 20 February, 2021; originally announced February 2021.

  13. arXiv:2101.09849  [pdf, other

    cs.LG math.DG

    Deep Learning Generalization and the Convex Hull of Training Sets

    Authors: Roozbeh Yousefzadeh

    Abstract: We study the generalization of deep learning models in relation to the convex hull of their training sets. A trained image classifier basically partitions its domain via decision boundaries and assigns a class to each of those partitions. The location of decision boundaries inside the convex hull of training set can be investigated in relation to the training samples. However, our analysis shows t… ▽ More

    Submitted 24 January, 2021; originally announced January 2021.

  14. arXiv:2006.09879  [pdf, other

    cs.CV cs.LG eess.IV math.NA

    Using Wavelets and Spectral Methods to Study Patterns in Image-Classification Datasets

    Authors: Roozbeh Yousefzadeh, Furong Huang

    Abstract: Deep learning models extract, before a final classification layer, features or patterns which are key for their unprecedented advantageous performance. However, the process of complex nonlinear feature extraction is not well understood, a major reason why interpretation, adversarial robustness, and generalization of deep neural nets are all open research problems. In this paper, we use wavelet tra… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  15. arXiv:2002.10257  [pdf, other

    eess.IV cs.LG stat.ML

    Using Wavelets to Analyze Similarities in Image-Classification Datasets

    Authors: Roozbeh Yousefzadeh

    Abstract: Deep learning image classifiers usually rely on huge training sets and their training process can be described as learning the similarities and differences among training images. But, images in large training sets are not usually studied from this perspective and fine-level similarities and differences among images is usually overlooked. This is due to lack of fast and efficient computational meth… ▽ More

    Submitted 17 May, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

  16. arXiv:2001.00682  [pdf, other

    cs.LG cs.AI stat.ML

    Auditing and Debugging Deep Learning Models via Decision Boundaries: Individual-level and Group-level Analysis

    Authors: Roozbeh Yousefzadeh, Dianne P. O'Leary

    Abstract: Deep learning models have been criticized for their lack of easy interpretation, which undermines confidence in their use for important applications. Nevertheless, they are consistently utilized in many applications, consequential to humans' lives, mostly because of their better performance. Therefore, there is a great need for computational methods that can explain, audit, and debug such models.… ▽ More

    Submitted 2 January, 2020; originally announced January 2020.

  17. arXiv:1908.02802  [pdf, other

    cs.LG stat.ML

    Investigating Decision Boundaries of Trained Neural Networks

    Authors: Roozbeh Yousefzadeh, Dianne P O'Leary

    Abstract: Deep learning models have been the subject of study from various perspectives, for example, their training process, interpretation, generalization error, robustness to adversarial attacks, etc. A trained model is defined by its decision boundaries, and therefore, many of the studies about deep learning models speculate about the decision boundaries, and sometimes make simplifying assumptions about… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

  18. arXiv:1908.02400  [pdf, other

    cs.LG math.NA stat.ML

    Refining the Structure of Neural Networks Using Matrix Conditioning

    Authors: Roozbeh Yousefzadeh, Dianne P O'Leary

    Abstract: Deep learning models have proven to be exceptionally useful in performing many machine learning tasks. However, for each new dataset, choosing an effective size and structure of the model can be a time-consuming process of trial and error. While a small network with few neurons might not be able to capture the intricacies of a given task, having too many neurons can lead to overfitting and poor ge… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

  19. arXiv:1903.08789  [pdf, other

    cs.LG stat.ML

    Interpreting Neural Networks Using Flip Points

    Authors: Roozbeh Yousefzadeh, Dianne P. O'Leary

    Abstract: Neural networks have been criticized for their lack of easy interpretation, which undermines confidence in their use for important applications. Here, we introduce a novel technique, interpreting a trained neural network by investigating its flip points. A flip point is any point that lies on the boundary between two output classes: e.g. for a neural network with a binary yes/no output, a flip poi… ▽ More

    Submitted 20 March, 2019; originally announced March 2019.

  20. arXiv:1806.07754  [pdf, other

    cs.CV

    Spatio-Temporal Channel Correlation Networks for Action Classification

    Authors: Ali Diba, Mohsen Fayyaz, Vivek Sharma, M. Mahdi Arzani, Rahman Yousefzadeh, Juergen Gall, Luc Van Gool

    Abstract: The work in this paper is driven by the question if spatio-temporal correlations are enough for 3D convolutional neural networks (CNN)? Most of the traditional 3D networks use local spatio-temporal features. We introduce a new block that models correlations between channels of a 3D CNN with respect to temporal and spatial features. This new block can be added as a residual unit to different parts… ▽ More

    Submitted 7 February, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: Accepted in ECCV 2018. arXiv admin note: substantial text overlap with arXiv:1711.08200

  21. arXiv:1711.08200  [pdf, other

    cs.CV

    Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification

    Authors: Ali Diba, Mohsen Fayyaz, Vivek Sharma, Amir Hossein Karami, Mohammad Mahdi Arzani, Rahman Yousefzadeh, Luc Van Gool

    Abstract: The work in this paper is driven by the question how to exploit the temporal cues available in videos for their accurate classification, and for human action recognition in particular? Thus far, the vision community has focused on spatio-temporal approaches with fixed temporal convolution kernel depths. We introduce a new temporal layer that models variable temporal convolution kernel depths. We e… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.

  22. RSCM Technology for Develo** Runtime-Reconfigurable Telecommunication Applications

    Authors: Sayed Kamaledin Ghiasi-Shirazi, Mahdi Mohseni, Majid Darvishan, Reza Yousefzadeh

    Abstract: Runtime reconfiguration is a fundamental requirement of many telecommunication applications which also has been addressed by management standards like CMIP, 3GPP TS 32.602, and NETCONF. Two basic commands considered by these standards are CREATE and DELETE which operate on managed objects inside an application. The available configuration management technologies, like JMX, OSGi, and Fractal, do no… ▽ More

    Submitted 5 November, 2016; v1 submitted 23 September, 2015; originally announced September 2015.