Skip to main content

Showing 1–13 of 13 results for author: Rahnama, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.03466  [pdf, other

    cs.LG

    The Blame Problem in Evaluating Local Explanations, and How to Tackle it

    Authors: Amir Hossein Akhavan Rahnama

    Abstract: The number of local model-agnostic explanation techniques proposed has grown rapidly recently. One main reason is that the bar for develo** new explainability techniques is low due to the lack of optimal evaluation measures. Without rigorous measures, it is hard to have concrete evidence of whether the new explanation techniques can significantly outperform their predecessors. Our study proposes… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: Accepted at Workshop: XAI methods, challenges and applications, 26th European Conference on Artificial Intelligence (ECAI 2023)

  2. arXiv:2203.02295  [pdf, ps, other

    stat.ML cs.LG

    Evaluating Local Model-Agnostic Explanations of Learning to Rank Models with Decision Paths

    Authors: Amir Hossein Akhavan Rahnama, Judith Butepage

    Abstract: Local explanations of learning-to-rank (LTR) models are thought to extract the most important features that contribute to the ranking predicted by the LTR model for a single data point. Evaluating the accuracy of such explanations is challenging since the ground truth feature importance scores are not available for most modern LTR models. In this work, we propose a systematic evaluation technique… ▽ More

    Submitted 16 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: 16 pages, 6 Figures, 3 Tables, Submitted to ECML PKDD 2022

  3. arXiv:2111.12906  [pdf, ps, other

    cs.LG cs.AI eess.SY

    Robustness against Adversarial Attacks in Neural Networks using Incremental Dissipativity

    Authors: Bernardo Aquino, Arash Rahnama, Peter Seiler, Lizhen Lin, Vijay Gupta

    Abstract: Adversarial examples can easily degrade the classification performance in neural networks. Empirical methods for promoting robustness to such examples have been proposed, but often lack both analytical insights and formal guarantees. Recently, some robustness certificates have appeared in the literature based on system theoretic notions. This work proposes an incremental dissipativity-based robust… ▽ More

    Submitted 13 February, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  4. arXiv:2111.05972  [pdf, other

    cs.LG cs.AI cs.DC

    Amazon SageMaker Model Parallelism: A General and Flexible Framework for Large Model Training

    Authors: Can Karakus, Rahul Huilgol, Fei Wu, Anirudh Subramanian, Cade Daniel, Derya Cavdar, Teng Xu, Haohan Chen, Arash Rahnama, Luis Quintela

    Abstract: With deep learning models rapidly growing in size, systems-level solutions for large-model training are required. We present Amazon SageMaker model parallelism, a software library that integrates with PyTorch, and enables easy training of large models using model parallelism and other memory-saving features. In contrast to existing solutions, the implementation of the SageMaker library is much mor… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: 24 pages. Submitted for review

  5. arXiv:2106.02488  [pdf, other

    cs.LG stat.ML

    Evaluating Local Explanations using White-box Models

    Authors: Amir Hossein Akhavan Rahnama, Judith Butepage, Pierre Geurts, Henrik Bostrom

    Abstract: Evaluating explanation techniques using human subjects is costly, time-consuming and can lead to subjectivity in the assessments. To evaluate the accuracy of local explanations, we require access to the true feature importance scores for a given instance. However, the prediction function of a model usually does not decompose into linear additive terms that indicate how much a feature contributes t… ▽ More

    Submitted 28 January, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Submitted to ACM FaCCT 2022 Jan 21 2022, 13 pages, 4 Figures

  6. arXiv:2005.10284  [pdf, other

    cs.LG cs.AI stat.ML

    An Adversarial Approach for Explaining the Predictions of Deep Neural Networks

    Authors: Arash Rahnama, Andrew Tseng

    Abstract: Machine learning models have been successfully applied to a wide range of applications including computer vision, natural language processing, and speech recognition. A successful implementation of these models however, usually relies on deep neural networks (DNNs) which are treated as opaque black-box systems due to their incomprehensible complexity and intricate internal mechanism. In this work,… ▽ More

    Submitted 28 September, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

  7. arXiv:1911.04636  [pdf, other

    cs.LG eess.SY stat.ML

    Robust Design of Deep Neural Networks against Adversarial Attacks based on Lyapunov Theory

    Authors: Arash Rahnama, Andre T. Nguyen, Edward Raff

    Abstract: Deep neural networks (DNNs) are vulnerable to subtle adversarial perturbations applied to the input. These adversarial perturbations, though imperceptible, can easily mislead the DNN. In this work, we take a control theoretic approach to the problem of robustness in DNNs. We treat each individual layer of the DNN as a nonlinear dynamical system and use Lyapunov theory to prove stability and robust… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

  8. arXiv:1910.14421  [pdf, other

    stat.ML cs.LG

    A study of data and label shift in the LIME framework

    Authors: Amir Hossein Akhavan Rahnama, Henrik Boström

    Abstract: LIME is a popular approach for explaining a black-box prediction through an interpretable model that is trained on instances in the vicinity of the predicted instance. To generate these instances, LIME randomly selects a subset of the non-zero features of the predicted instance. After that, the perturbed instances are fed into the black-box model to obtain labels for these, which are then used for… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: Accepted at the Neurip 2019 Workshop "Human-Centric Machine Learning" (poster + spotlight talk)

  9. arXiv:1907.07732  [pdf, other

    cs.CR cs.LG

    Connecting Lyapunov Control Theory to Adversarial Attacks

    Authors: Arash Rahnama, Andre T. Nguyen, Edward Raff

    Abstract: Significant work is being done to develop the math and tools necessary to build provable defenses, or at least bounds, against adversarial attacks of neural networks. In this work, we argue that tools from control theory could be leveraged to aid in defending against such attacks. We do this by example, building a provable defense against a weaker adversary. This is done so we can focus on the mec… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: 8 pages, 3 figures, AdvML'19: Workshop on Adversarial Learning Methods for Machine Learning and Data Mining at KDD

  10. arXiv:1804.04725  [pdf, ps, other

    q-bio.MN cs.LG stat.ML

    Network-based protein structural classification

    Authors: Khalique Newaz, Mahboobeh Ghalehnovi, Arash Rahnama, Panos J. Antsaklis, Tijana Milenkovic

    Abstract: Experimental determination of protein function is resource-consuming. As an alternative, computational prediction of protein function has received attention. In this context, protein structural classification (PSC) can help, by allowing for determining structural classes of currently unclassified proteins based on their features, and then relying on the fact that proteins with similar structures h… ▽ More

    Submitted 15 March, 2020; v1 submitted 12 April, 2018; originally announced April 2018.

  11. arXiv:1803.10927  [pdf

    stat.ML cs.LG math.OC

    An LP-based hyperparameter optimization model for language modeling

    Authors: Amir Hossein Akhavan Rahnama, Mehdi Toloo, Nezer Jacob Zaidenberg

    Abstract: In order to find hyperparameters for a machine learning model, algorithms such as grid search or random search are used over the space of possible values of the models hyperparameters. These search algorithms opt the solution that minimizes a specific cost function. In language models, perplexity is one of the most popular cost functions. In this study, we propose a fractional nonlinear programmin… ▽ More

    Submitted 29 March, 2018; originally announced March 2018.

    Journal ref: The Journal of Supercomputing (2018)

  12. arXiv:1708.04232  [pdf, other

    stat.ML cs.CV q-bio.NC

    Encoding Multi-Resolution Brain Networks Using Unsupervised Deep Learning

    Authors: Arash Rahnama, Abdullah Alchihabi, Vijay Gupta, Panos Antsaklis, Fatos T. Yarman Vural

    Abstract: The main goal of this study is to extract a set of brain networks in multiple time-resolutions to analyze the connectivity patterns among the anatomic regions for a given cognitive task. We suggest a deep architecture which learns the natural grou**s of the connectivity patterns of human brain in multiple time-resolutions. The suggested architecture is tested on task data set of Human Connectome… ▽ More

    Submitted 12 August, 2017; originally announced August 2017.

    Comments: 6 pages, 3 figures, submitted to The 17th annual IEEE International Conference on BioInformatics and BioEngineering

  13. arXiv:1612.08543  [pdf

    stat.ML cs.CL cs.DB cs.DC cs.IR

    Distributed Real-Time Sentiment Analysis for Big Data Social Streams

    Authors: Amir Hossein Akhavan Rahnama

    Abstract: Big data trend has enforced the data-centric systems to have continuous fast data streams. In recent years, real-time analytics on stream data has formed into a new research field, which aims to answer queries about what-is-happening-now with a negligible delay. The real challenge with real-time stream data processing is that it is impossible to store instances of data, and therefore online analyt… ▽ More

    Submitted 27 December, 2016; originally announced December 2016.

    Journal ref: IEEE 2014 International Conference on Control, Decision and Information Technologies (CoDIT)