Skip to main content

Showing 1–50 of 51 results for author: Abbasnejad, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17139  [pdf, other

    cs.CV cs.AI cs.LG

    Synergy and Diversity in CLIP: Enhancing Performance Through Adaptive Backbone Ensembling

    Authors: Cristian Rodriguez-Opazo, Ehsan Abbasnejad, Damien Teney, Edison Marrese-Taylor, Hamed Damirchi, Anton van den Hengel

    Abstract: Contrastive Language-Image Pretraining (CLIP) stands out as a prominent method for image representation learning. Various architectures, from vision transformers (ViTs) to convolutional networks (ResNets) have been trained with CLIP to serve as general solutions to diverse vision tasks. This paper explores the differences across various CLIP-trained vision backbones. Despite using the same data an… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.14400

  2. arXiv:2404.05311  [pdf, other

    cs.LG cs.CR

    BruSLeAttack: A Query-Efficient Score-Based Black-Box Sparse Adversarial Attack

    Authors: Viet Quoc Vo, Ehsan Abbasnejad, Damith C. Ranasinghe

    Abstract: We study the unique, less-well understood problem of generating sparse adversarial samples simply by observing the score-based replies to model queries. Sparse attacks aim to discover a minimum number-the l0 bounded-perturbations to model inputs to craft adversarial examples and misguide model decisions. But, in contrast to query-based dense attack counterparts against black-box models, constructi… ▽ More

    Submitted 1 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Published as a conference paper at the International Conference on Learning Representations (ICLR 2024). Code is available at https://brusliattack.github.io/

  3. arXiv:2403.18309  [pdf, other

    cs.CR

    Bayesian Learned Models Can Detect Adversarial Malware For Free

    Authors: Bao Gia Doan, Dang Quang Nguyen, Paul Montague, Tamas Abraham, Olivier De Vel, Seyit Camtepe, Salil S. Kanhere, Ehsan Abbasnejad, Damith C. Ranasinghe

    Abstract: The vulnerability of machine learning-based malware detectors to adversarial attacks has prompted the need for robust solutions. Adversarial training is an effective method but is computationally expensive to scale up to large datasets and comes at the cost of sacrificing model performance for robustness. We hypothesize that adversarial malware exploits the low-confidence regions of models and can… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted to the 29th European Symposium on Research in Computer Security (ESORICS) 2024 Conference

  4. arXiv:2403.07968  [pdf, other

    cs.LG cs.AI

    Do Deep Neural Network Solutions Form a Star Domain?

    Authors: Ankit Sonthalia, Alexander Rubinstein, Ehsan Abbasnejad, Seong Joon Oh

    Abstract: It has recently been conjectured that neural network solution sets reachable via stochastic gradient descent (SGD) are convex, considering permutation invariances (Entezari et al., 2022). This means that a linear path can connect two independent solutions with low loss, given the weights of one of the models are appropriately permuted. However, current methods to test this theory often require ver… ▽ More

    Submitted 9 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  5. arXiv:2403.07356  [pdf, other

    cs.CV cs.LG

    Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning

    Authors: Mark D. McDonnell, Dong Gong, Ehsan Abbasnejad, Anton van den Hengel

    Abstract: Continual learning requires a model to adapt to ongoing changes in the data distribution, and often to the set of tasks to be performed. It is rare, however, that the data and task changes are completely unpredictable. Given a description of an overarching goal or data theme, which we call a realm, humans can often guess what concepts are associated with it. We show here that the combination of a… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 31 pages total (14 main paper, 5 references, 12 appendices)

  6. arXiv:2403.02241  [pdf, other

    cs.LG cs.AI cs.CV

    Neural Redshift: Random Networks are not Random Functions

    Authors: Damien Teney, Armand Nicolicioiu, Valentin Hartmann, Ehsan Abbasnejad

    Abstract: Our understanding of the generalization capabilities of neural networks (NNs) is still incomplete. Prevailing explanations are based on implicit biases of gradient descent (GD) but they cannot account for the capabilities of models from gradient-free methods nor the simplicity bias recently observed in untrained networks. This paper seeks other sources of generalization in NNs. Findings. To unde… ▽ More

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  7. arXiv:2312.14400  [pdf, other

    cs.CV

    Unveiling Backbone Effects in CLIP: Exploring Representational Synergies and Variances

    Authors: Cristian Rodriguez-Opazo, Edison Marrese-Taylor, Ehsan Abbasnejad, Hamed Damirchi, Ignacio M. Jara, Felipe Bravo-Marquez, Anton van den Hengel

    Abstract: Contrastive Language-Image Pretraining (CLIP) stands out as a prominent method for image representation learning. Various neural architectures, spanning Transformer-based models like Vision Transformers (ViTs) to Convolutional Networks (ConvNets) like ResNets, are trained with CLIP and serve as universal backbones across diverse vision tasks. Despite utilizing the same data and training objectives… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  8. arXiv:2311.17949  [pdf, other

    cs.CV

    Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines

    Authors: Hamed Damirchi, Cristian Rodríguez-Opazo, Ehsan Abbasnejad, Damien Teney, Javen Qinfeng Shi, Stephen Gould, Anton van den Hengel

    Abstract: Large pre-trained models can dramatically reduce the amount of task-specific data required to solve a problem, but they often fail to capture domain-specific nuances out of the box. The Web likely contains the information necessary to excel on any specific application, but identifying the right data a priori is challenging. This paper shows how to leverage recent advances in NLP and multi-modal le… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  9. arXiv:2311.03866  [pdf, other

    cs.CV

    SCONE-GAN: Semantic Contrastive learning-based Generative Adversarial Network for an end-to-end image translation

    Authors: Iman Abbasnejad, Fabio Zambetta, Flora Salim, Timothy Wiley, Jeffrey Chan, Russell Gallagher, Ehsan Abbasnejad

    Abstract: SCONE-GAN presents an end-to-end image translation, which is shown to be effective for learning to generate realistic and diverse scenery images. Most current image-to-image translation approaches are devised as two map**s: a translation from the source to target domain and another to represent its inverse. While successful in many applications, these approaches may suffer from generating trivia… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 9 pages, 5 figures

  10. arXiv:2309.04659  [pdf, other

    cs.CV

    Progressive Feature Adjustment for Semi-supervised Learning from Pretrained Models

    Authors: Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix

    Abstract: As an effective way to alleviate the burden of data annotation, semi-supervised learning (SSL) provides an attractive solution due to its ability to leverage both labeled and unlabeled data to build a predictive model. While significant progress has been made recently, SSL algorithms are often evaluated and developed under the assumption that the network is randomly initialized. This is in sharp c… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: to appear at ICCVW2023 (Workshop on Visual Continual Learning)

  11. arXiv:2307.02251  [pdf, other

    cs.LG cs.CV

    RanPAC: Random Projections and Pre-trained Models for Continual Learning

    Authors: Mark D. McDonnell, Dong Gong, Amin Parveneh, Ehsan Abbasnejad, Anton van den Hengel

    Abstract: Continual learning (CL) aims to incrementally learn different tasks (such as classification) in a non-stationary data stream without forgetting old ones. Most CL works focus on tackling catastrophic forgetting under a learning-from-scratch paradigm. However, with the increasing prominence of foundation models, pre-trained models equipped with informative representations have become available for v… ▽ More

    Submitted 15 January, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 32 pages, 11 figures

    Journal ref: 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023), Dec 2023, New Orleans, United States

  12. arXiv:2305.18026  [pdf, other

    cs.CL

    Semantic Role Labeling Guided Out-of-distribution Detection

    Authors: **an Zou, Maihao Guo, Yu Tian, Yuhao Lin, Haiyao Cao, Lingqiao Liu, Ehsan Abbasnejad, Javen Qinfeng Shi

    Abstract: Identifying unexpected domain-shifted instances in natural language processing is crucial in real-world applications. Previous works identify the out-of-distribution (OOD) instance by leveraging a single global feature embedding to represent the sentence, which cannot characterize subtle OOD patterns well. Another major challenge current OOD methods face is learning effective low-dimensional sente… ▽ More

    Submitted 18 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: accepted by COLING 2024

  13. arXiv:2305.16817  [pdf, other

    cs.LG

    Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup

    Authors: Damien Teney, **dong Wang, Ehsan Abbasnejad

    Abstract: Mixup is a highly successful technique to improve generalization of neural networks by augmenting the training data with combinations of random pairs. Selective mixup is a family of methods that apply mixup to specific pairs, e.g. only combining examples across classes or domains. These methods have claimed remarkable improvements on benchmarks with distribution shifts, but their mechanisms and li… ▽ More

    Submitted 2 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  14. arXiv:2303.13556  [pdf, other

    cs.CV

    ProtoCon: Pseudo-label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-supervised Learning

    Authors: Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Gholamreza Haffari

    Abstract: Confidence-based pseudo-labeling is among the dominant approaches in semi-supervised learning (SSL). It relies on including high-confidence predictions made on unlabeled data as additional targets to train the model. We propose ProtoCon, a novel SSL method aimed at the less-explored label-scarce SSL where such methods usually underperform. ProtoCon refines the pseudo-labels by leveraging their nea… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR2023 (highlight)

  15. arXiv:2301.12680  [pdf, other

    cs.CR

    Feature-Space Bayesian Adversarial Learning Improved Malware Detector Robustness

    Authors: Bao Gia Doan, Shuiqiao Yang, Paul Montague, Olivier De Vel, Tamas Abraham, Seyit Camtepe, Salil S. Kanhere, Ehsan Abbasnejad, Damith C. Ranasinghe

    Abstract: We present a new algorithm to train a robust malware detector. Modern malware detectors rely on machine learning algorithms. Now, the adversarial objective is to devise alterations to the malware code to decrease the chance of being detected whilst preserving the functionality and realism of the malware. Adversarial learning is effective in improving robustness but generating functional and realis… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI 2023 conference

  16. arXiv:2212.02003  [pdf, other

    cs.LG cs.CR cs.CV

    Bayesian Learning with Information Gain Provably Bounds Risk for a Robust Adversarial Defense

    Authors: Bao Gia Doan, Ehsan Abbasnejad, Javen Qinfeng Shi, Damith C. Ranasinghe

    Abstract: We present a new algorithm to learn a deep neural network model robust against adversarial attacks. Previous algorithms demonstrate an adversarially trained Bayesian Neural Network (BNN) provides improved robustness. We recognize the adversarial learning approach for approximating the multi-modal posterior distribution of a Bayesian model can lead to mode collapse; consequently, the model's achiev… ▽ More

    Submitted 1 December, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: Published at ICML 2022. Code is available at https://github.com/baogiadoan/IG-BNN

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:5309-5323, 2022

  17. arXiv:2210.10317  [pdf, other

    cs.CV

    LAVA: Label-efficient Visual Learning and Adaptation

    Authors: Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Mehrtash Harandi, Gholamreza Haffari

    Abstract: We present LAVA, a simple yet effective method for multi-domain visual transfer learning with limited data. LAVA builds on a few recent innovations to enable adapting to partially labelled datasets with class and domain shifts. First, LAVA learns self-supervised visual representations on the source dataset and ground them using class label semantics to overcome transfer collapse problems associate… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted in WACV2023

  18. arXiv:2209.00613  [pdf, other

    cs.LG cs.CV

    ID and OOD Performance Are Sometimes Inversely Correlated on Real-world Datasets

    Authors: Damien Teney, Yong Lin, Seong Joon Oh, Ehsan Abbasnejad

    Abstract: Several studies have compared the in-distribution (ID) and out-of-distribution (OOD) performance of models in computer vision and NLP. They report a frequent positive correlation and some surprisingly never even observe an inverse correlation indicative of a necessary trade-off. The possibility of inverse patterns is important to determine whether ID performance can serve as a proxy for OOD genera… ▽ More

    Submitted 19 May, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

  19. arXiv:2208.14571  [pdf, other

    cs.LG cs.AI stat.ML

    Truncated Matrix Power Iteration for Differentiable DAG Learning

    Authors: Zhen Zhang, Ignavier Ng, Dong Gong, Yuhang Liu, Ehsan M Abbasnejad, Mingming Gong, Kun Zhang, Javen Qinfeng Shi

    Abstract: Recovering underlying Directed Acyclic Graph (DAG) structures from observational data is highly challenging due to the combinatorial nature of the DAG-constrained optimization problem. Recently, DAG learning has been cast as a continuous optimization problem by characterizing the DAG constraint as a smooth equality one, generally based on polynomials over adjacency matrices. Existing methods place… ▽ More

    Submitted 20 December, 2022; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: Published in NeurIPS 2022

  20. arXiv:2207.02598  [pdf, other

    cs.LG cs.CV

    Predicting is not Understanding: Recognizing and Addressing Underspecification in Machine Learning

    Authors: Damien Teney, Maxime Peyrard, Ehsan Abbasnejad

    Abstract: Machine learning (ML) models are typically optimized for their accuracy on a given dataset. However, this predictive criterion rarely captures all desirable properties of a model, in particular how well it matches a domain expert's understanding of a task. Underspecification refers to the existence of multiple models that are indistinguishable in their in-domain accuracy, even though they differ i… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Long version of a paper accepted at the 2022 European Conference on Computer Vision (ECCV)

  21. arXiv:2206.14355  [pdf, other

    cs.CV cs.CL cs.LG

    EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering

    Authors: Violetta Shevchenko, Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel, Damien Teney

    Abstract: The availability of clean and diverse labeled data is a major roadblock for training models on complex tasks such as visual question answering (VQA). The extensive work on large vision-and-language models has shown that self-supervised learning is effective for pretraining multimodal interactions. In this technical report, we focus on visual representations. We review and evaluate self-supervised… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

  22. arXiv:2206.06606  [pdf, other

    cs.CL cs.LG

    Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing Model

    Authors: **an Zou, Haiyao Cao, Lingqiao Liu, Yuhao Lin, Ehsan Abbasnejad, Javen Qinfeng Shi

    Abstract: Natural Language Processing(NLP) demonstrates a great potential to support financial decision-making by analyzing the text from social media or news outlets. In this work, we build a platform to study the NLP-aided stock auto-trading algorithms systematically. In contrast to the previous work, our platform is characterized by three features: (1) We provide financial news for each specific stock. (… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  23. arXiv:2205.10189  [pdf, other

    cs.CL cs.AI

    Progressive Class Semantic Matching for Semi-supervised Text Classification

    Authors: Hai-Ming Xu, Lingqiao Liu, Ehsan Abbasnejad

    Abstract: Semi-supervised learning is a promising way to reduce the annotation cost for text-classification. Combining with pre-trained language models (PLMs), e.g., BERT, recent semi-supervised learning methods achieved impressive performance. In this work, we further investigate the marriage between semi-supervised learning and a pre-trained language model. Unlike existing approaches that utilize PLMs onl… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: NAACL2022 (oral)

  24. arXiv:2203.07034  [pdf, other

    cs.CV

    Active Learning by Feature Mixing

    Authors: Amin Parvaneh, Ehsan Abbasnejad, Damien Teney, Reza Haffari, Anton van den Hengel, Javen Qinfeng Shi

    Abstract: The promise of active learning (AL) is to reduce labelling costs by selecting the most valuable examples to annotate from a pool of unlabelled data. Identifying these examples is especially challenging with high-dimensional data (e.g. images, videos) and in low-data regimes. In this paper, we propose a novel method for batch AL called ALFA-Mix. We identify unlabelled instances with sufficiently-di… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  25. arXiv:2202.00091  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    Query Efficient Decision Based Sparse Attacks Against Black-Box Deep Learning Models

    Authors: Viet Quoc Vo, Ehsan Abbasnejad, Damith C. Ranasinghe

    Abstract: Despite our best efforts, deep learning models remain highly vulnerable to even tiny adversarial perturbations applied to the inputs. The ability to extract information from solely the output of a machine learning model to craft adversarial perturbations to black-box models is a practical threat against real-world systems, such as autonomous cars or machine learning models exposed as a service (ML… ▽ More

    Submitted 23 March, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: Published as a conference paper at the International Conference on Learning Representations (ICLR 2022). Code is available at https://sparseevoattack.github.io/

  26. arXiv:2112.05282  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    RamBoAttack: A Robust Query Efficient Deep Neural Network Decision Exploit

    Authors: Viet Quoc Vo, Ehsan Abbasnejad, Damith C. Ranasinghe

    Abstract: Machine learning models are critically susceptible to evasion attacks from adversarial examples. Generally, adversarial examples, modified inputs deceptively similar to the original input, are constructed under whitebox settings by adversaries with full access to the model. However, recent attacks have shown a remarkable reduction in query numbers to craft adversarial examples using blackbox attac… ▽ More

    Submitted 23 March, 2023; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Published in Network and Distributed System Security (NDSS) Symposium 2022. Code is available at https://ramboattack.github.io/

  27. arXiv:2111.09999  [pdf, other

    cs.CV cs.CR

    TnT Attacks! Universal Naturalistic Adversarial Patches Against Deep Neural Network Systems

    Authors: Bao Gia Doan, Minhui Xue, Shiqing Ma, Ehsan Abbasnejad, Damith C. Ranasinghe

    Abstract: Deep neural networks are vulnerable to attacks from adversarial inputs and, more recently, Trojans to misguide or hijack the model's decision. We expose the existence of an intriguing class of spatially bounded, physically realizable, adversarial examples -- Universal NaTuralistic adversarial paTches -- we call TnTs, by exploring the superset of the spatially bounded adversarial example space and… ▽ More

    Submitted 25 July, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: Accepted for publication in the IEEE Transactions on Information Forensics & Security (TIFS)

  28. arXiv:2106.10784  [pdf, other

    cs.LG

    iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients

    Authors: Miao Zhang, Steven Su, Shirui Pan, Xiaojun Chang, Ehsan Abbasnejad, Reza Haffari

    Abstract: \textit{Differentiable ARchiTecture Search} (DARTS) has recently become the mainstream of neural architecture search (NAS) due to its efficiency and simplicity. With a gradient-based bi-level optimization, DARTS alternately optimizes the inner model weights and the outer architecture parameter in a weight-sharing supernet. A key challenge to the scalability and quality of the learned architectures… ▽ More

    Submitted 20 June, 2021; originally announced June 2021.

    Comments: ICML2021

  29. arXiv:2105.05612  [pdf, other

    cs.LG cs.CV

    Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization

    Authors: Damien Teney, Ehsan Abbasnejad, Simon Lucey, Anton van den Hengel

    Abstract: Neural networks trained with SGD were recently shown to rely preferentially on linearly-predictive features and can ignore complex, equally-predictive ones. This simplicity bias can explain their lack of robustness out of distribution (OOD). The more complex the task to learn, the more likely it is that statistical artifacts (i.e. selection biases, spurious correlations) are simpler than the mecha… ▽ More

    Submitted 11 September, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: CVPR 2022

  30. arXiv:2104.05248  [pdf, other

    cs.CV cs.LG

    All Labels Are Not Created Equal: Enhancing Semi-supervision via Label Grou** and Co-training

    Authors: Islam Nassar, Samitha Herath, Ehsan Abbasnejad, Wray Buntine, Gholamreza Haffari

    Abstract: Pseudo-labeling is a key component in semi-supervised learning (SSL). It relies on iteratively using the model to generate artificial labels for the unlabeled data to train against. A common property among its various methods is that they only rely on the model's prediction to make labeling decisions without considering any prior knowledge about the visual similarity among the classes. In this pap… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted in CVPR2021

  31. arXiv:2103.00446  [pdf, other

    cs.CV

    Learning for Visual Navigation by Imagining the Success

    Authors: Mahdi Kazemi Moghaddam, Ehsan Abbasnejad, Qi Wu, Javen Shi, Anton Van Den Hengel

    Abstract: Visual navigation is often cast as a reinforcement learning (RL) problem. Current methods typically result in a suboptimal policy that learns general obstacle avoidance and search behaviours. For example, in the target-object navigation setting, the policies learnt by traditional methods often fail to complete the task, even when the target is clearly within reach from a human perspective. In orde… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

  32. arXiv:2005.09241  [pdf, other

    cs.CV cs.LG

    On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law

    Authors: Damien Teney, Kushal Kafle, Robik Shrestha, Ehsan Abbasnejad, Christopher Kanan, Anton van den Hengel

    Abstract: Out-of-distribution (OOD) testing is increasingly popular for evaluating a machine learning system's ability to generalize beyond the biases of a training set. OOD benchmarks are designed to present a different joint distribution of data and labels between training and test time. VQA-CP has become the standard OOD benchmark for visual question answering, but we discovered three troubling practices… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

  33. arXiv:2004.12794  [pdf, other

    cs.NE cs.LG eess.SP

    Hybrid Neuro-Evolutionary Method for Predicting Wind Turbine Power Output

    Authors: Mehdi Neshat, Meysam Majidi Nezhad, Ehsan Abbasnejad, Daniele Groppi, Azim Heydari, Lina Bertling Tjernberg, Davide Astiaso Garcia, Bradley Alexander, Markus Wagner

    Abstract: Reliable wind turbine power prediction is imperative to the planning, scheduling and control of wind energy farms for stable power production. In recent years Machine Learning (ML) methods have been successfully applied in a wide range of domains, including renewable energy. However, due to the challenging nature of power prediction in wind farms, current models are far short of the accuracy requi… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

  34. arXiv:2004.03222  [pdf, other

    cs.CV

    Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation

    Authors: Mahdi Kazemi Moghaddam, Qi Wu, Ehsan Abbasnejad, Javen Qinfeng Shi

    Abstract: We humans can impeccably search for a target object, given its name only, even in an unseen environment. We argue that this ability is largely due to three main reasons: the incorporation of prior knowledge (or experience), the adaptation of it to the new environment using the observed visual cues and most importantly optimistically searching without giving up early. This is currently missing in t… ▽ More

    Submitted 6 December, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted for publication at WACV 2021

  35. arXiv:2002.11894  [pdf, other

    cs.CV

    Unshuffling Data for Improved Generalization

    Authors: Damien Teney, Ehsan Abbasnejad, Anton van den Hengel

    Abstract: Generalization beyond the training distribution is a core challenge in machine learning. The common practice of mixing and shuffling examples when training neural networks may not be optimal in this regard. We show that partitioning the data into well-chosen, non-i.i.d. subsets treated as multiple training environments can guide the learning of models with better out-of-distribution generalization… ▽ More

    Submitted 20 November, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

  36. arXiv:2002.09106  [pdf, other

    cs.NE cs.LG eess.SP

    An Evolutionary Deep Learning Method for Short-term Wind Speed Prediction: A Case Study of the Lillgrund Offshore Wind Farm

    Authors: Mehdi Neshat, Meysam Majidi Nezhad, Ehsan Abbasnejad, Lina Bertling Tjernberg, Davide Astiaso Garcia, Bradley Alexander, Markus Wagner

    Abstract: Accurate short-term wind speed forecasting is essential for large-scale integration of wind power generation. However, the seasonal and stochastic characteristics of wind speed make forecasting a challenging task. This study uses a new hybrid evolutionary approach that uses a popular evolutionary search algorithm, CMA-ES, to tune the hyper-parameters of two Long short-term memory(LSTM) ANN models… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

  37. arXiv:1909.13471  [pdf, other

    cs.CV cs.LG

    On Incorporating Semantic Prior Knowledge in Deep Learning Through Embedding-Space Constraints

    Authors: Damien Teney, Ehsan Abbasnejad, Anton van den Hengel

    Abstract: The knowledge that humans hold about a problem often extends far beyond a set of training data and output labels. While the success of deep learning mostly relies on supervised training, important properties cannot be inferred efficiently from end-to-end annotations alone, for example causal relations or domain-specific invariances. We present a general technique to supplement supervised training… ▽ More

    Submitted 16 November, 2019; v1 submitted 30 September, 2019; originally announced September 2019.

  38. Februus: Input Purification Defense Against Trojan Attacks on Deep Neural Network Systems

    Authors: Bao Gia Doan, Ehsan Abbasnejad, Damith C. Ranasinghe

    Abstract: We propose Februus; a new idea to neutralize highly potent and insidious Trojan attacks on Deep Neural Network (DNN) systems at run-time. In Trojan attacks, an adversary activates a backdoor crafted in a deep neural network model using a secret trigger, a Trojan, applied to any input to alter the model's decision to a target prediction---a target determined by and only known to the attacker. Febru… ▽ More

    Submitted 28 September, 2020; v1 submitted 9 August, 2019; originally announced August 2019.

    Comments: 16 pages, to appear in the 36th Annual Computer Security Applications Conference (ACSAC 2020)

    Journal ref: In the 36th Annual Computer Security Applications Conference (ACSAC 2020)

  39. Adaptive Neuro-Surrogate-Based Optimisation Method for Wave Energy Converters Placement Optimisation

    Authors: Mehdi Neshat, Ehsan Abbasnejad, Qinfeng Shi, Bradley Alexander, Markus Wagner

    Abstract: The installed amount of renewable energy has expanded massively in recent years. Wave energy, with its high capacity factors has great potential to complement established sources of solar and wind energy. This study explores the problem of optimising the layout of advanced, three-tether wave energy converters in a size-constrained farm in a numerically modelled ocean environment. Simulating and co… ▽ More

    Submitted 8 July, 2019; v1 submitted 6 July, 2019; originally announced July 2019.

    Comments: 12 pages, 2 tables, 6 figures

    Journal ref: International Conference on Neural Information Processing,2019,pp. 353-366. Springer

  40. arXiv:1906.09433  [pdf, other

    cs.CV

    Deep Single Image Deraining Via Estimating Transmission and Atmospheric Light in rainy Scenes

    Authors: Yinglong Wang, Qinfeng Shi, Ehsan Abbasnejad, Chao Ma, ** Ma, Bing Zeng

    Abstract: Rain removal in images/videos is still an important task in computer vision field and attracting attentions of more and more people. Traditional methods always utilize some incomplete priors or filters (e.g. guided filter) to remove rain effect. Deep learning gives more probabilities to better solve this task. However, they remove rain either by evaluating background from rainy image directly or l… ▽ More

    Submitted 22 June, 2019; originally announced June 2019.

    Comments: 10 pages, 9 figures, 3 tables

  41. arXiv:1905.04430  [pdf, other

    cs.CV cs.LG

    Follow the Attention: Combining Partial Pose and Object Motion for Fine-Grained Action Detection

    Authors: Mohammad Mahdi Kazemi Moghaddam, Ehsan Abbasnejad, Javen Shi

    Abstract: Retailers have long been searching for ways to effectively understand their customers' behaviour in order to provide a smooth and pleasant shop** experience that attracts more customers everyday and maximises their revenue, consequently. Humans can flawlessly understand others' behaviour by combining different visual cues from activity to gestures and facial expressions. Empowering the computer… ▽ More

    Submitted 26 June, 2019; v1 submitted 10 May, 2019; originally announced May 2019.

  42. Show, Price and Negotiate: A Negotiator with Online Value Look-Ahead

    Authors: Amin Parvaneh, Ehsan Abbasnejad, Qi Wu, Javen Qinfeng Shi, Anton van den Hengel

    Abstract: Negotiation, as an essential and complicated aspect of online shop**, is still challenging for an intelligent agent. To that end, we propose the Price Negotiator, a modular deep neural network that addresses the unsolved problems in recent studies by (1) considering images of the items as a crucial, though neglected, source of information in a negotiation, (2) heuristically finding the most simi… ▽ More

    Submitted 12 March, 2021; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: published in IEEE Transactions on Multimedia

  43. arXiv:1904.03367  [pdf, other

    cs.LG stat.ML

    Reinforcement Learning with Attention that Works: A Self-Supervised Approach

    Authors: Anthony Manchin, Ehsan Abbasnejad, Anton van den Hengel

    Abstract: Attention models have had a significant positive impact on deep learning across a range of tasks. However previous attempts at integrating attention with reinforcement learning have failed to produce significant improvements. We propose the first combination of self attention and reinforcement learning that is capable of producing significant improvements, including new state of the art results in… ▽ More

    Submitted 6 April, 2019; originally announced April 2019.

  44. arXiv:1812.06401  [pdf, other

    cs.AI cs.CL cs.LG

    What's to know? Uncertainty as a Guide to Asking Goal-oriented Questions

    Authors: Ehsan Abbasnejad, Qi Wu, Javen Shi, Anton van den Hengel

    Abstract: One of the core challenges in Visual Dialogue problems is asking the question that will provide the most useful information towards achieving the required objective. Encouraging an agent to ask the right questions is difficult because we don't know a-priori what information the agent will need to achieve its task, and we don't have an explicit model of what it knows already. We propose a solution… ▽ More

    Submitted 16 December, 2018; originally announced December 2018.

  45. arXiv:1812.06398  [pdf, other

    cs.LG stat.ML

    Gold Seeker: Information Gain from Policy Distributions for Goal-oriented Vision-and-Langauge Reasoning

    Authors: Ehsan Abbasnejad, Iman Abbasnejad, Qi Wu, Javen Shi, Anton van den Hengel

    Abstract: As Computer Vision moves from a passive analysis of pixels to active analysis of semantics, the breadth of information algorithms need to reason over has expanded significantly. One of the key challenges in this vein is the ability to identify the information required to make a decision, and select an action that will recover it. We propose a reinforcement-learning approach that maintains a distri… ▽ More

    Submitted 29 March, 2020; v1 submitted 16 December, 2018; originally announced December 2018.

  46. arXiv:1811.08127  [pdf, other

    cs.LG cs.HC stat.ML

    Deep Auto-Set: A Deep Auto-Encoder-Set Network for Activity Recognition Using Wearables

    Authors: Alireza Abedin Varamin, Ehsan Abbasnejad, Qinfeng Shi, Damith Ranasinghe, Hamid Rezatofighi

    Abstract: Automatic recognition of human activities from time-series sensor data (referred to as HAR) is a growing area of research in ubiquitous computing. Most recent research in the field adopts supervised deep learning paradigms to automate extraction of intrinsic features from raw signal inputs and addresses HAR as a multi-class classification problem where detecting a single activity class within the… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

    Comments: Accepted at MobiQuitous 2018

  47. arXiv:1807.05380  [pdf, other

    cs.CV

    3D Hand Pose Estimation using Simulation and Partial-Supervision with a Shared Latent Space

    Authors: Masoud Abdi, Ehsan Abbasnejad, Chee Peng Lim, Saeid Nahavandi

    Abstract: Tremendous amounts of expensive annotated data are a vital ingredient for state-of-the-art 3d hand pose estimation. Therefore, synthetic data has been popularized as annotations are automatically available. However, models trained only with synthetic samples do not generalize to real data, mainly due to the gap between the distribution of synthetic and real data. In this paper, we propose a novel… ▽ More

    Submitted 14 July, 2018; originally announced July 2018.

    Comments: Oral presentation at British Machine Vision Conference (BMVC) 2018

  48. arXiv:1706.05477  [pdf, other

    cs.LG cs.AI stat.ML

    Bayesian Conditional Generative Adverserial Networks

    Authors: M. Ehsan Abbasnejad, Qinfeng Shi, Iman Abbasnejad, Anton van den Hengel, Anthony Dick

    Abstract: Traditional GANs use a deterministic generator function (typically a neural network) to transform a random noise input $z$ to a sample $\mathbf{x}$ that the discriminator seeks to distinguish. We propose a new GAN called Bayesian Conditional Generative Adversarial Networks (BC-GANs) that use a random generator function to transform a deterministic input $y'$ to a sample $\mathbf{x}$. Our BC-GANs e… ▽ More

    Submitted 17 June, 2017; originally announced June 2017.

  49. arXiv:1611.08998  [pdf, other

    cs.CV cs.AI cs.LG

    DeepSetNet: Predicting Sets with Deep Neural Networks

    Authors: S. Hamid Rezatofighi, Vijay Kumar B G, Anton Milan, Ehsan Abbasnejad, Anthony Dick, Ian Reid

    Abstract: This paper addresses the task of set prediction using deep learning. This is important because the output of many computer vision tasks, including image tagging and object detection, are naturally expressed as sets of entities rather than vectors. As opposed to a vector, the size of a set is not fixed in advance, and it is invariant to the ordering of entities within it. We define a likelihood for… ▽ More

    Submitted 10 August, 2017; v1 submitted 28 November, 2016; originally announced November 2016.

    Comments: Accepted in IEEE International Conference on Computer Vision (ICCV), Venice, 2017, (Spotlight)

  50. arXiv:1611.07800  [pdf, other

    cs.LG stat.ML

    Infinite Variational Autoencoder for Semi-Supervised Learning

    Authors: Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel

    Abstract: This paper presents an infinite variational autoencoder (VAE) whose capacity adapts to suit the input data. This is achieved using a mixture model where the mixing coefficients are modeled by a Dirichlet process, allowing us to integrate over the coefficients when performing inference. Critically, this then allows us to automatically vary the number of autoencoders in the mixture based on the data… ▽ More

    Submitted 23 November, 2016; v1 submitted 23 November, 2016; originally announced November 2016.