Skip to main content

Showing 1–7 of 7 results for author: Ismail, A A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.02107  [pdf, other

    cs.LG

    Interpretable Mixture of Experts

    Authors: Aya Abdelsalam Ismail, Sercan Ö. Arik, **sung Yoon, Ankur Taly, Soheil Feizi, Tomas Pfister

    Abstract: The need for reliable model explanations is prominent for many machine learning applications, particularly for tabular and time-series data as their use cases often involve high-stakes decision making. Towards this goal, we introduce a novel interpretable modeling framework, Interpretable Mixture of Experts (IME), that yields high accuracy, comparable to `black-box' Deep Neural Networks (DNNs) in… ▽ More

    Submitted 25 May, 2023; v1 submitted 5 June, 2022; originally announced June 2022.

  2. arXiv:2111.14338  [pdf, other

    cs.CV cs.AI cs.LG

    Improving Deep Learning Interpretability by Saliency Guided Training

    Authors: Aya Abdelsalam Ismail, Héctor Corrada Bravo, Soheil Feizi

    Abstract: Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in unfaithful feature attributions. In this paper, we tackle this issue and introduce a {\it saliency guided training}procedure for neural networks to reduce noisy gra… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Journal ref: Thirty-fifth Conference on Neural Information Processing Systems 2021

  3. arXiv:2011.06102  [pdf, other

    cs.AI

    Improving Multimodal Accuracy Through Modality Pre-training and Attention

    Authors: Aya Abdelsalam Ismail, Mahmudul Hasan, Faisal Ishtiaq

    Abstract: Training a multimodal network is challenging and it requires complex architectures to achieve reasonable performance. We show that one reason for this phenomena is the difference between the convergence rate of various modalities. We address this by pre-training modality-specific sub-networks in multimodal architectures independently before end-to-end training of the entire network. Furthermore, w… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

  4. arXiv:2010.13924  [pdf, other

    cs.LG stat.ML

    Benchmarking Deep Learning Interpretability in Time Series Predictions

    Authors: Aya Abdelsalam Ismail, Mohamed Gunady, Héctor Corrada Bravo, Soheil Feizi

    Abstract: Saliency methods are used extensively to highlight the importance of input features in model predictions. These methods are mostly used in vision and language tasks, and their applications to time series data is relatively unexplored. In this paper, we set out to extensively compare the performance of various saliency-based interpretability methods across diverse neural architectures, including Re… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Journal ref: NeurIPS 2020

  5. arXiv:1911.06816  [pdf

    eess.IV cs.CV cs.LG

    QC-Automator: Deep Learning-based Automated Quality Control for Diffusion MR Images

    Authors: Zahra Riahi Samani, Jacob Antony Alappatt, Drew Parker, Abdol Aziz Ould Ismail, Ragini Verma

    Abstract: Quality assessment of diffusion MRI (dMRI) data is essential prior to any analysis, so that appropriate pre-processing can be used to improve data quality and ensure that the presence of MRI artifacts do not affect the results of subsequent image analysis. Manual quality assessment of the data is subjective, possibly error-prone, and infeasible, especially considering the growing number of consort… ▽ More

    Submitted 15 November, 2019; originally announced November 2019.

  6. arXiv:1910.12370  [pdf, other

    cs.LG stat.ML

    Input-Cell Attention Reduces Vanishing Saliency of Recurrent Neural Networks

    Authors: Aya Abdelsalam Ismail, Mohamed Gunady, Luiz Pessoa, Héctor Corrada Bravo, Soheil Feizi

    Abstract: Recent efforts to improve the interpretability of deep neural networks use saliency to characterize the importance of input features to predictions made by models. Work on interpretability using saliency-based methods on Recurrent Neural Networks (RNNs) has mostly targeted language tasks, and their applicability to time series data is less understood. In this work we analyze saliency-based methods… ▽ More

    Submitted 27 October, 2019; originally announced October 2019.

    Journal ref: Neurips 2019

  7. arXiv:1804.06776  [pdf, other

    cs.LG stat.ML

    Improving Long-Horizon Forecasts with Expectation-Biased LSTM Networks

    Authors: Aya Abdelsalam Ismail, Timothy Wood, Héctor Corrada Bravo

    Abstract: State-of-the-art forecasting methods using Recurrent Neural Net- works (RNN) based on Long-Short Term Memory (LSTM) cells have shown exceptional performance targeting short-horizon forecasts, e.g given a set of predictor features, forecast a target value for the next few time steps in the future. However, in many applica- tions, the performance of these methods decays as the forecasting horizon ex… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.