Skip to main content

Showing 1–11 of 11 results for author: Bashivan, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14343  [pdf, other

    cs.AI

    iWISDM: Assessing instruction following in multimodal models at scale

    Authors: Xiaoxuan Lei, Lucas Gomez, Hao Yuan Bai, Pouya Bashivan

    Abstract: The ability to perform complex tasks from detailed instructions is a key to many remarkable achievements of our species. As humans, we are not only capable of performing a wide variety of tasks but also very complex ones that may entail hundreds or thousands of steps to complete. Large language models and their more recent multimodal counterparts that integrate textual and visual inputs have achie… ▽ More

    Submitted 25 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2210.03150  [pdf, other

    cs.LG cs.AI

    Towards Out-of-Distribution Adversarial Robustness

    Authors: Adam Ibrahim, Charles Guille-Escuret, Ioannis Mitliagkas, Irina Rish, David Krueger, Pouya Bashivan

    Abstract: Adversarial robustness continues to be a major challenge for deep learning. A core issue is that robustness to one type of attack often fails to transfer to other attacks. While prior work establishes a theoretical trade-off in robustness against different $L_p$ norms, we show that there is potential for improvement against many commonly used attacks by adopting a domain generalisation approach. C… ▽ More

    Submitted 26 June, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Version of NeurIPS 2023 submission

  3. arXiv:2210.00062  [pdf, other

    cs.LG cs.AI cs.NE

    Learning Robust Kernel Ensembles with Kernel Average Pooling

    Authors: Pouya Bashivan, Adam Ibrahim, Amirozhan Dehghani, Yifei Ren

    Abstract: Model ensembles have long been used in machine learning to reduce the variance in individual model predictions, making them more robust to input perturbations. Pseudo-ensemble methods like dropout have also been commonly used in deep learning models to improve generalization. However, the application of these techniques to improve neural networks' robustness against input perturbations remains und… ▽ More

    Submitted 30 May, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

  4. arXiv:2006.04621  [pdf, other

    cs.LG stat.ML

    Adversarial Feature Desensitization

    Authors: Pouya Bashivan, Reza Bayat, Adam Ibrahim, Kartik Ahuja, Mojtaba Faramarzi, Touraj Laleh, Blake Aaron Richards, Irina Rish

    Abstract: Neural networks are known to be vulnerable to adversarial attacks -- slight but carefully constructed perturbations of the inputs which can drastically impair the network's performance. Many defense methods have been proposed for improving robustness of deep networks by training them on adversarially perturbed inputs. However, these models often remain vulnerable to new types of attacks not seen d… ▽ More

    Submitted 4 January, 2022; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Accepted at Neurips 2021

  5. arXiv:1909.06161  [pdf, other

    cs.CV cs.LG cs.NE eess.IV q-bio.NC

    Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs

    Authors: Jonas Kubilius, Martin Schrimpf, Kohitij Kar, Ha Hong, Najib J. Majaj, Rishi Rajalingham, Elias B. Issa, Pouya Bashivan, Jonathan Prescott-Roy, Kailyn Schmidt, Aran Nayebi, Daniel Bear, Daniel L. K. Yamins, James J. DiCarlo

    Abstract: Deep convolutional artificial neural networks (ANNs) are the leading class of candidate models of the mechanisms of visual processing in the primate ventral stream. While initially inspired by brain anatomy, over the past years, these ANNs have evolved from a simple eight-layer architecture in AlexNet to extremely deep and branching architectures, demonstrating increasingly better object categoriz… ▽ More

    Submitted 28 October, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: NeurIPS 2019 (Oral). Code available at https://github.com/dicarlolab/neurips2019

  6. arXiv:1904.09330  [pdf, other

    cs.NE

    Continual Learning with Self-Organizing Maps

    Authors: Pouya Bashivan, Martin Schrimpf, Robert Ajemian, Irina Rish, Matthew Riemer, Yuhai Tu

    Abstract: Despite remarkable successes achieved by modern neural networks in a wide range of applications, these networks perform best in domain-specific stationary environments where they are trained only once on large-scale controlled data repositories. When exposed to non-stationary learning environments, current neural networks tend to forget what they had previously learned, a phenomena known as catast… ▽ More

    Submitted 19 April, 2019; originally announced April 2019.

    Comments: Continual Learning Workshop - NeurIPS 2018

  7. arXiv:1808.01405  [pdf, other

    cs.CV

    Teacher Guided Architecture Search

    Authors: Pouya Bashivan, Mark Tensen, James J DiCarlo

    Abstract: Much of the recent improvement in neural networks for computer vision has resulted from discovery of new networks architectures. Most prior work has used the performance of candidate models following limited training to automatically guide the search in a feasible way. Could further gains in computational efficiency be achieved by guiding the search via measurements of a high performing network wi… ▽ More

    Submitted 6 September, 2019; v1 submitted 3 August, 2018; originally announced August 2018.

    Comments: Accepted to ICCV 2019

  8. arXiv:1805.10726  [pdf, other

    cs.CV

    A Neurobiological Evaluation Metric for Neural Network Model Search

    Authors: Nathaniel Blanchard, Jeffery Kinnison, Brandon RichardWebster, Pouya Bashivan, Walter J. Scheirer

    Abstract: Neuroscience theory posits that the brain's visual system coarsely identifies broad object categories via neural activation patterns, with similar objects producing similar neural responses. Artificial neural networks also have internal activation behavior in response to stimuli. We hypothesize that networks exhibiting brain-like activation behavior will demonstrate brain-like characteristics, e.g… ▽ More

    Submitted 26 November, 2018; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: Under review

  9. arXiv:1712.00512  [pdf, other

    cs.CV

    Learning Neural Markers of Schizophrenia Disorder Using Recurrent Neural Networks

    Authors: Jumana Dakka, Pouya Bashivan, Mina Gheiratmand, Irina Rish, Shantenu Jha, Russell Greiner

    Abstract: Smart systems that can accurately diagnose patients with mental disorders and identify effective treatments based on brain functional imaging data are of great applicability and are gaining much attention. Most previous machine learning studies use hand-designed features, such as functional connectivity, which does not maintain the potential useful information in the spatial relationship between b… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: To be published as a workshop paper at NIPS 2017 Machine Learning for Health (ML4H)

  10. arXiv:1602.00985  [pdf

    cs.CV cs.HC

    Mental State Recognition via Wearable EEG

    Authors: Pouya Bashivan, Irina Rish, Steve Heisig

    Abstract: The increasing quality and affordability of consumer electroencephalogram (EEG) headsets make them attractive for situations where medical grade devices are impractical. Predicting and tracking cognitive states is possible for tasks that were previously not conducive to EEG monitoring. For instance, monitoring operators for states inappropriate to the task (e.g. drowsy drivers), tracking mental he… ▽ More

    Submitted 5 June, 2016; v1 submitted 2 February, 2016; originally announced February 2016.

    Comments: Presented at MLINI-2015 workshop, 2015 (arXiv:cs/0101200)

    Report number: MLINI/2015/20

    Journal ref: Proceedings of 5th NIPS workshop on Machine Learning and Interpretation in Neuroimaging (MLINI15) (2015) 5-1

  11. arXiv:1511.06448  [pdf, other

    cs.LG cs.CV

    Learning Representations from EEG with Deep Recurrent-Convolutional Neural Networks

    Authors: Pouya Bashivan, Irina Rish, Mohammed Yeasin, Noel Codella

    Abstract: One of the challenges in modeling cognitive events from electroencephalogram (EEG) data is finding representations that are invariant to inter- and intra-subject differences, as well as to inherent noise associated with such data. Herein, we propose a novel approach for learning such representations from multi-channel EEG time-series, and demonstrate its advantages in the context of mental load cl… ▽ More

    Submitted 29 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: To be published as a conference paper at ICLR 2016