Skip to main content

Showing 1–50 of 55 results for author: Arani, E

.
  1. arXiv:2406.16231  [pdf, other

    cs.LG cs.AI cs.CV

    Gradual Divergence for Seamless Adaptation: A Novel Domain Incremental Learning Method

    Authors: Kishaan Jeeveswaran, Elahe Arani, Bahram Zonooz

    Abstract: Domain incremental learning (DIL) poses a significant challenge in real-world scenarios, as models need to be sequentially trained on diverse domains over time, all the while avoiding catastrophic forgetting. Mitigating representation drift, which refers to the phenomenon of learned representations undergoing changes as the model adapts to new tasks, can help alleviate catastrophic forgetting. In… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Accepted at 41st International Conference on Machine Learning (ICML 2024)

  2. arXiv:2406.10165  [pdf, other

    cs.CV cs.RO

    CarLLaVA: Vision language models for camera-only closed-loop driving

    Authors: Katrin Renz, Long Chen, Ana-Maria Marcu, Jan Hünermann, Benoit Hanotte, Alice Karnsund, Jamie Shotton, Elahe Arani, Oleg Sinavski

    Abstract: In this technical report, we present CarLLaVA, a Vision Language Model (VLM) for autonomous driving, developed for the CARLA Autonomous Driving Challenge 2.0. CarLLaVA uses the vision encoder of the LLaVA VLM and the LLaMA architecture as backbone, achieving state-of-the-art closed-loop driving performance with only camera input and without the need for complex or expensive labels. Additionally, w… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Outstanding Champion & Innovation Award @ CARLA Autonomous Driving Challenge 2024; Project video: https://youtu.be/E1nsEgcHRuc

  3. arXiv:2405.13978  [pdf, other

    cs.LG cs.AI cs.CV

    Mitigating Interference in the Knowledge Continuum through Attention-Guided Incremental Learning

    Authors: Prashant Bhat, Bharath Renjith, Elahe Arani, Bahram Zonooz

    Abstract: Continual learning (CL) remains a significant challenge for deep neural networks, as it is prone to forgetting previously acquired knowledge. Several approaches have been proposed in the literature, such as experience rehearsal, regularization, and parameter isolation, to address this problem. Although almost zero forgetting can be achieved in task-incremental learning, class-incremental learning… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Published at 3rd Conference on Lifelong Learning Agents (CoLLAs 2024)

  4. arXiv:2405.02766  [pdf, other

    cs.LG cs.AI cs.CV

    Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning

    Authors: Fahad Sarfraz, Bahram Zonooz, Elahe Arani

    Abstract: While humans excel at continual learning (CL), deep neural networks (DNNs) exhibit catastrophic forgetting. A salient feature of the brain that allows effective CL is that it utilizes multiple modalities for learning and inference, which is underexplored in DNNs. Therefore, we study the role and interactions of multiple modalities in mitigating forgetting and introduce a benchmark for multimodal c… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted at 3rd Conference on Lifelong Learning Agents (CoLLAs), 2024

  5. arXiv:2404.18161  [pdf, other

    cs.LG cs.AI cs.CV

    IMEX-Reg: Implicit-Explicit Regularization in the Function Space for Continual Learning

    Authors: Prashant Bhat, Bharath Renjith, Elahe Arani, Bahram Zonooz

    Abstract: Continual learning (CL) remains one of the long-standing challenges for deep neural networks due to catastrophic forgetting of previously acquired knowledge. Although rehearsal-based approaches have been fairly successful in mitigating catastrophic forgetting, they suffer from overfitting on buffered samples and prior information loss, hindering generalization under low-buffer regimes. Inspired by… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Published in Transactions on Machine Learning Research

  6. arXiv:2404.09752  [pdf, other

    cs.CV cs.AI cs.LG

    Can We Break Free from Strong Data Augmentations in Self-Supervised Learning?

    Authors: Shruthi Gowda, Elahe Arani, Bahram Zonooz

    Abstract: Self-supervised learning (SSL) has emerged as a promising solution for addressing the challenge of limited labeled data in deep neural networks (DNNs), offering scalability potential. However, the impact of design dependencies within the SSL framework remains insufficiently investigated. In this study, we comprehensively explore SSL behavior across a spectrum of augmentations, revealing their cruc… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  7. arXiv:2402.11733  [pdf, other

    cs.LG cs.AI cs.CV

    The Effectiveness of Random Forgetting for Robust Generalization

    Authors: Vijaya Raghavan T Ramkumar, Bahram Zonooz, Elahe Arani

    Abstract: Deep neural networks are susceptible to adversarial attacks, which can compromise their performance and accuracy. Adversarial Training (AT) has emerged as a popular approach for protecting neural networks against such attacks. However, a key challenge of AT is robust overfitting, where the network's robust performance on test data deteriorates with further training, thus hindering generalization.… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Published as a conference paper at ICLR 2024

  8. arXiv:2401.14948  [pdf, other

    cs.LG cs.AI cs.CV

    Conserve-Update-Revise to Cure Generalization and Robustness Trade-off in Adversarial Training

    Authors: Shruthi Gowda, Bahram Zonooz, Elahe Arani

    Abstract: Adversarial training improves the robustness of neural networks against adversarial attacks, albeit at the expense of the trade-off between standard and robust generalization. To unveil the underlying factors driving this phenomenon, we examine the layer-wise learning capabilities of neural networks during the transition from a standard to an adversarial setting. Our empirical findings demonstrate… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: Accepted as a conference paper at ICLR 2024

  9. arXiv:2312.14115  [pdf, other

    cs.RO cs.AI cs.CV

    LingoQA: Video Question Answering for Autonomous Driving

    Authors: Ana-Maria Marcu, Long Chen, Jan Hünermann, Alice Karnsund, Benoit Hanotte, Prajwal Chidananda, Saurabh Nair, Vijay Badrinarayanan, Alex Kendall, Jamie Shotton, Elahe Arani, Oleg Sinavski

    Abstract: Autonomous driving has long faced a challenge with public acceptance due to the lack of explainability in the decision-making process. Video question-answering (QA) in natural language provides the opportunity for bridging this gap. Nonetheless, evaluating the performance of Video QA models has proved particularly tough due to the absence of comprehensive benchmarks. To fill this gap, we introduce… ▽ More

    Submitted 19 March, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Benchmark and dataset are available at https://github.com/wayveai/LingoQA/

  10. Transformers in Unsupervised Structure-from-Motion

    Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

    Abstract: Transformers have revolutionized deep learning based computer vision with improved performance as well as robustness to natural corruptions and adversarial attacks. Transformers are used predominantly for 2D vision tasks, including image classification, semantic segmentation, and object detection. However, robots and advanced driver assistance systems also require 3D scene understanding for decisi… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: International Joint Conference on Computer Vision, Imaging and Computer Graphics. Cham: Springer Nature Switzerland, 2022. Published at "Communications in Computer and Information Science, vol 1815. Springer Nature". arXiv admin note: text overlap with arXiv:2202.03131

  11. arXiv:2311.02393  [pdf, other

    cs.CV cs.AI

    Continual Learning of Unsupervised Monocular Depth from Videos

    Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

    Abstract: Spatial scene understanding, including monocular depth estimation, is an important problem in various applications, such as robotics and autonomous driving. While improvements in unsupervised monocular depth estimation have potentially allowed models to be trained on diverse crowdsourced videos, this remains underexplored as most methods utilize the standard training protocol, wherein the models a… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: Accepted at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

  12. arXiv:2310.11341  [pdf, other

    cs.CV cs.AI cs.LG

    Dual Cognitive Architecture: Incorporating Biases and Multi-Memory Systems for Lifelong Learning

    Authors: Shruthi Gowda, Bahram Zonooz, Elahe Arani

    Abstract: Artificial neural networks (ANNs) exhibit a narrow scope of expertise on stationary independent data. However, the data in the real world is continuous and dynamic, and ANNs must adapt to novel scenarios while also retaining the learned knowledge to become lifelong learners. The ability of humans to excel at these tasks can be attributed to multiple factors ranging from cognitive computational str… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  13. arXiv:2310.08217  [pdf, other

    cs.AI cs.CV cs.LG

    TriRE: A Multi-Mechanism Learning Paradigm for Continual Knowledge Retention and Promotion

    Authors: Preetha Vijayan, Prashant Bhat, Elahe Arani, Bahram Zonooz

    Abstract: Continual learning (CL) has remained a persistent challenge for deep neural networks due to catastrophic forgetting (CF) of previously learned tasks. Several techniques such as weight regularization, experience rehearsal, and parameter isolation have been proposed to alleviate CF. Despite their relative success, these research directions have predominantly remained orthogonal and suffer from sever… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted at 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  14. arXiv:2307.00039  [pdf, other

    cs.NE cs.AI cs.CV cs.LG

    Towards Brain Inspired Design for Addressing the Shortcomings of ANNs

    Authors: Fahad Sarfraz, Elahe Arani, Bahram Zonooz

    Abstract: As our understanding of the mechanisms of brain function is enhanced, the value of insights gained from neuroscience to the development of AI algorithms deserves further consideration. Here, we draw parallels with an existing tree-based ANN architecture and a recent neuroscience study[27] arguing that the error-based organization of neurons in the cerebellum that share a preference for a personali… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

    Comments: 11 pages, 7 figures, and 4 tables

  15. arXiv:2305.08551  [pdf, other

    cs.CV cs.AI

    Enhancing Performance of Vision Transformers on Small Datasets through Local Inductive Bias Incorporation

    Authors: Ibrahim Batuhan Akkaya, Senthilkumar S. Kathiresan, Elahe Arani, Bahram Zonooz

    Abstract: Vision transformers (ViTs) achieve remarkable performance on large datasets, but tend to perform worse than convolutional neural networks (CNNs) when trained from scratch on smaller datasets, possibly due to a lack of local inductive bias in the architecture. Recent studies have therefore added locality to the architecture and demonstrated that it can help ViTs achieve performance comparable to CN… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  16. arXiv:2305.04769  [pdf, other

    cs.CV cs.LG cs.NE

    BiRT: Bio-inspired Replay in Vision Transformers for Continual Learning

    Authors: Kishaan Jeeveswaran, Prashant Bhat, Bahram Zonooz, Elahe Arani

    Abstract: The ability of deep neural networks to continually learn and adapt to a sequence of tasks has remained challenging due to catastrophic forgetting of previously learned tasks. Humans, on the other hand, have a remarkable ability to acquire, assimilate, and transfer knowledge across tasks throughout their lifetime without catastrophic forgetting. The versatility of the brain can be attributed to the… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted at 40th International Conference on Machine Learning (ICML 2023)

  17. arXiv:2305.00441  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Multi-Task Structural Learning using Local Task Similarity induced Neuron Creation and Removal

    Authors: Naresh Kumar Gurulingan, Bahram Zonooz, Elahe Arani

    Abstract: Multi-task learning has the potential to improve generalization by maximizing positive transfer between tasks while reducing task interference. Fully achieving this potential is hindered by manually designed architectures that remain static throughout training. On the contrary, learning in the brain occurs through structural changes that are in tandem with changes in synaptic strength. Thus, we pr… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: Accepted at 40th International Conference on Machine Learning (ICML)

  18. arXiv:2304.06738  [pdf, other

    cs.NE cs.AI cs.CV cs.LG

    A Study of Biologically Plausible Neural Network: The Role and Interactions of Brain-Inspired Mechanisms in Continual Learning

    Authors: Fahad Sarfraz, Elahe Arani, Bahram Zonooz

    Abstract: Humans excel at continually acquiring, consolidating, and retaining information from an ever-changing environment, whereas artificial neural networks (ANNs) exhibit catastrophic forgetting. There are considerable differences in the complexity of synapses, the processing of information, and the learning mechanisms in biological neural networks and their artificial counterparts, which may explain th… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  19. arXiv:2304.06672  [pdf, other

    cs.CV cs.AI

    LSFSL: Leveraging Shape Information in Few-shot Learning

    Authors: Deepan Chakravarthi Padmanabhan, Shruthi Gowda, Elahe Arani, Bahram Zonooz

    Abstract: Few-shot learning (FSL) techniques seek to learn the underlying patterns in data using fewer samples, analogous to how humans learn from limited experience. In this limited-data scenario, the challenges associated with deep neural networks, such as shortcut learning and texture bias behaviors, are further exacerbated. Moreover, the significance of addressing shortcut learning is not yet fully expl… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR 2023 (2nd Workshop on Learning with Limited Labelled Data for Image and Video Understanding)

  20. arXiv:2303.10455  [pdf, other

    cs.LG cs.AI cs.CV

    Learn, Unlearn and Relearn: An Online Learning Paradigm for Deep Neural Networks

    Authors: Vijaya Raghavan T. Ramkumar, Elahe Arani, Bahram Zonooz

    Abstract: Deep neural networks (DNNs) are often trained on the premise that the complete training data set is provided ahead of time. However, in real-world scenarios, data often arrive in chunks over time. This leads to important considerations about the optimal strategy for training DNNs, such as whether to fine-tune them with each chunk of incoming data (warm-start) or to retrain them from scratch with t… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  21. arXiv:2302.11346  [pdf, other

    cs.LG cs.AI cs.CV

    Task-Aware Information Routing from Common Representation Space in Lifelong Learning

    Authors: Prashant Bhat, Bahram Zonooz, Elahe Arani

    Abstract: Intelligent systems deployed in the real world suffer from catastrophic forgetting when exposed to a sequence of tasks. Humans, on the other hand, acquire, consolidate, and transfer knowledge between tasks that rarely interfere with the consolidated knowledge. Accompanied by self-regulated neurogenesis, continual learning in the brain is governed by a rich set of neurophysiological processes that… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: Accepted as a conference paper at ICLR 2023

  22. arXiv:2302.11344  [pdf, other

    cs.LG cs.AI cs.CV

    Error Sensitivity Modulation based Experience Replay: Mitigating Abrupt Representation Drift in Continual Learning

    Authors: Fahad Sarfraz, Elahe Arani, Bahram Zonooz

    Abstract: Humans excel at lifelong learning, as the brain has evolved to be robust to distribution shifts and noise in our ever-changing environment. Deep neural networks (DNNs), however, exhibit catastrophic forgetting and the learned representations drift drastically as they encounter a new task. This alludes to a different error-based learning mechanism in the brain. Unlike DNNs, where learning scales li… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: Accepted as a conference paper at ICLR 2023

  23. arXiv:2301.05058  [pdf, other

    cs.NE cs.AI cs.CV cs.LG

    Sparse Coding in a Dual Memory System for Lifelong Learning

    Authors: Fahad Sarfraz, Elahe Arani, Bahram Zonooz

    Abstract: Efficient continual learning in humans is enabled by a rich set of neurophysiological mechanisms and interactions between multiple memory systems. The brain efficiently encodes information in non-overlap** sparse codes, which facilitates the learning of new associations faster with controlled interference with previous associations. To mimic sparse coding in DNNs, we enforce activation sparsity… ▽ More

    Submitted 28 December, 2022; originally announced January 2023.

    Comments: Camera ready version - "Thirty-Seventh AAAI Conference on Artificial Intelligence" (AAAI-2023)

  24. arXiv:2301.00620  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Dynamically Modular and Sparse General Continual Learning

    Authors: Arnav Varma, Elahe Arani, Bahram Zonooz

    Abstract: Real-world applications often require learning continuously from a stream of data under ever-changing conditions. When trying to learn from such non-stationary data, deep neural networks (DNNs) undergo catastrophic forgetting of previously learned information. Among the common approaches to avoid catastrophic forgetting, rehearsal-based methods have proven effective. However, they are still prone… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: Camera ready version - 18th International Conference on Computer Vision Theory and Applications (VISAPP 2023)

  25. arXiv:2210.03570  [pdf

    cs.CV

    AI-Driven Road Maintenance Inspection v2: Reducing Data Dependency & Quantifying Road Damage

    Authors: Haris Iqbal, Hemang Chawla, Arnav Varma, Terence Brouns, Ahmed Badar, Elahe Arani, Bahram Zonooz

    Abstract: Road infrastructure maintenance inspection is typically a labor-intensive and critical task to ensure the safety of all road users. Existing state-of-the-art techniques in Artificial Intelligence (AI) for object detection and segmentation help automate a huge chunk of this task given adequate annotated data. However, annotating videos from scratch is cost-prohibitive. For instance, it can take an… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Accepted at IRF Global R2T Conference & Exhibition 2022

  26. arXiv:2210.02357  [pdf, other

    cs.CV

    Image Masking for Robust Self-Supervised Monocular Depth Estimation

    Authors: Hemang Chawla, Kishaan Jeeveswaran, Elahe Arani, Bahram Zonooz

    Abstract: Self-supervised monocular depth estimation is a salient task for 3D scene understanding. Learned jointly with monocular ego-motion estimation, several methods have been proposed to predict accurate pixel-wise depth without using labeled data. Nevertheless, these methods focus on improving performance under ideal conditions without natural or digital corruptions. The general absence of occlusions i… ▽ More

    Submitted 1 February, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at 2023 IEEE International Conference on Robotics and Automation (ICRA)

  27. arXiv:2208.10895  [pdf, other

    cs.CV cs.AI

    A Comprehensive Study of Real-Time Object Detection Networks Across Multiple Domains: A Survey

    Authors: Elahe Arani, Shruthi Gowda, Ratnajit Mukherjee, Omar Magdy, Senthilkumar Kathiresan, Bahram Zonooz

    Abstract: Deep neural network based object detectors are continuously evolving and are used in a multitude of applications, each having its own set of requirements. While safety-critical applications need high accuracy and reliability, low-latency tasks need resource and energy-efficient networks. Real-time detectors, which are a necessity in high-impact real-world applications, are continuously proposed, b… ▽ More

    Submitted 14 February, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR) with Survey Certification

    Journal ref: Transactions on Machine Learning Research, 2022

  28. arXiv:2208.09427  [pdf, other

    cs.CV cs.AI

    Curbing Task Interference using Representation Similarity-Guided Multi-Task Feature Sharing

    Authors: Naresh Kumar Gurulingan, Elahe Arani, Bahram Zonooz

    Abstract: Multi-task learning of dense prediction tasks, by sharing both the encoder and decoder, as opposed to sharing only the encoder, provides an attractive front to increase both accuracy and computational efficiency. When the tasks are similar, sharing the decoder serves as an additional inductive bias providing more room for tasks to share complementary information among themselves. However, increase… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: Published at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)

  29. arXiv:2208.05838  [pdf, other

    cs.CV

    Differencing based Self-supervised pretraining for Scene Change Detection

    Authors: Vijaya Raghavan T. Ramkumar, Elahe Arani, Bahram Zonooz

    Abstract: Scene change detection (SCD), a crucial perception task, identifies changes by comparing scenes captured at different times. SCD is challenging due to noisy changes in illumination, seasonal variations, and perspective differences across a pair of views. Deep neural network based solutions require a large quantity of annotated data which is tedious and expensive to obtain. On the other hand, trans… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: Published at Conference on Lifelong Learning Agents (CoLLAs 2022)

  30. Adversarial Attacks on Monocular Pose Estimation

    Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

    Abstract: Advances in deep learning have resulted in steady progress in computer vision with improved accuracy on tasks such as object detection and semantic segmentation. Nevertheless, deep neural networks are vulnerable to adversarial attacks, thus presenting a challenge in reliable deployment. Two of the prominent tasks in 3D scene-understanding for robotics and advanced drive assistance systems are mono… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted at the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  31. arXiv:2207.06267  [pdf, other

    cs.LG cs.AI cs.CV

    Task Agnostic Representation Consolidation: a Self-supervised based Continual Learning Approach

    Authors: Prashant Bhat, Bahram Zonooz, Elahe Arani

    Abstract: Continual learning (CL) over non-stationary data streams remains one of the long-standing challenges in deep neural networks (DNNs) as they are prone to catastrophic forgetting. CL models can benefit from self-supervised pre-training as it enables learning more generalizable task-agnostic features. However, the effect of self-supervised pre-training diminishes as the length of task sequences incre… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at Conference on Lifelong Learning Agents (CoLLAs 2022)

  32. arXiv:2207.04998  [pdf, other

    cs.LG cs.AI cs.CV

    Consistency is the key to further mitigating catastrophic forgetting in continual learning

    Authors: Prashant Bhat, Bahram Zonooz, Elahe Arani

    Abstract: Deep neural networks struggle to continually learn multiple sequential tasks due to catastrophic forgetting of previously learned tasks. Rehearsal-based methods which explicitly store previous task samples in the buffer and interleave them with the current task samples have proven to be the most effective in mitigating forgetting. However, Experience Replay (ER) does not perform well under low-buf… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted at Conference on Lifelong Learning Agents (CoLLAs 2022)

  33. arXiv:2206.05846  [pdf, other

    cs.CV cs.AI cs.LG

    InBiaseD: Inductive Bias Distillation to Improve Generalization and Robustness through Shape-awareness

    Authors: Shruthi Gowda, Bahram Zonooz, Elahe Arani

    Abstract: Humans rely less on spurious correlations and trivial cues, such as texture, compared to deep neural networks which lead to better generalization and robustness. It can be attributed to the prior knowledge or the high-level cognitive inductive bias present in the brain. Therefore, introducing meaningful inductive bias to neural networks can help learn more generic and high-level representations an… ▽ More

    Submitted 12 June, 2022; originally announced June 2022.

    Comments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)

  34. arXiv:2206.04016  [pdf, other

    cs.NE cs.AI cs.CV cs.LG

    SYNERgy between SYNaptic consolidation and Experience Replay for general continual learning

    Authors: Fahad Sarfraz, Elahe Arani, Bahram Zonooz

    Abstract: Continual learning (CL) in the brain is facilitated by a complex set of mechanisms. This includes the interplay of multiple memory systems for consolidating information as posited by the complementary learning systems (CLS) theory and synaptic consolidation for protecting the acquired knowledge from erasure. Thus, we propose a general CL method that creates a synergy between SYNaptic consolidation… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)

  35. Transformers in Self-Supervised Monocular Depth Estimation with Unknown Camera Intrinsics

    Authors: Arnav Varma, Hemang Chawla, Bahram Zonooz, Elahe Arani

    Abstract: The advent of autonomous driving and advanced driver assistance systems necessitates continuous developments in computer vision for 3D scene understanding. Self-supervised monocular depth estimation, a method for pixel-wise distance estimation of objects from a single camera without the use of ground truth labels, is an important task in 3D scene understanding. However, existing methods for this t… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: Published in 17th International Conference on Computer Vision Theory and Applications (VISAP, 2022)

  36. arXiv:2201.12604  [pdf, other

    cs.LG cs.AI cs.CV

    Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning System

    Authors: Elahe Arani, Fahad Sarfraz, Bahram Zonooz

    Abstract: Humans excel at continually learning from an ever-changing environment whereas it remains a challenge for deep neural networks which exhibit catastrophic forgetting. The complementary learning system (CLS) theory suggests that the interplay between rapid instance-based learning and slow structured learning in the brain is crucial for accumulating and retaining knowledge. Here, we propose CLS-ER, a… ▽ More

    Submitted 10 May, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

    Comments: Published as a conference paper at ICLR 2022 (camera-ready version)

  37. arXiv:2201.08683  [pdf, other

    cs.CV

    A Comprehensive Study of Vision Transformers on Dense Prediction Tasks

    Authors: Kishaan Jeeveswaran, Senthilkumar Kathiresan, Arnav Varma, Omar Magdy, Bahram Zonooz, Elahe Arani

    Abstract: Convolutional Neural Networks (CNNs), architectures consisting of convolutional layers, have been the standard choice in vision tasks. Recent studies have shown that Vision Transformers (VTs), architectures based on self-attention modules, achieve comparable performance in challenging tasks such as object detection and semantic segmentation. However, the image processing mechanism of VTs is differ… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: 17th International Conference on Computer Vision Theory and Applications (VISAP, 2022)

  38. arXiv:2111.05191  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Does Thermal data make the detection systems more reliable?

    Authors: Shruthi Gowda, Bahram Zonooz, Elahe Arani

    Abstract: Deep learning-based detection networks have made remarkable progress in autonomous driving systems (ADS). ADS should have reliable performance across a variety of ambient lighting and adverse weather conditions. However, luminance degradation and visual obstructions (such as glare, fog) result in poor quality images by the visual camera which leads to performance decline. To overcome these challen… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Accepted at NeurIPS 2021 - ML4AD workshop (The code for this research is available at: https://github.com/NeurAI-Lab/MMC)

  39. arXiv:2108.04584  [pdf, other

    cs.CV cs.AI cs.LG

    UniNet: A Unified Scene Understanding Network and Exploring Multi-Task Relationships through the Lens of Adversarial Attacks

    Authors: Naresh Kumar Gurulingan, Elahe Arani, Bahram Zonooz

    Abstract: Scene understanding is crucial for autonomous systems which intend to operate in the real world. Single task vision networks extract information only based on some aspects of the scene. In multi-task learning (MTL), on the other hand, these single tasks are jointly learned, thereby providing an opportunity for tasks to share information and obtain a more comprehensive understanding. To this end, w… ▽ More

    Submitted 12 August, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: Accepted at DeepMTL workshop, ICCV 2021

  40. arXiv:2106.16006  [pdf, other

    cs.LG cs.CV

    Improving the Efficiency of Transformers for Resource-Constrained Devices

    Authors: Hamid Tabani, Ajay Balasubramaniam, Shabbir Marzban, Elahe Arani, Bahram Zonooz

    Abstract: Transformers provide promising accuracy and have become popular and used in various domains such as natural language processing and computer vision. However, due to their massive number of model parameters, memory and computation requirements, they are not suitable for resource-constrained low-power devices. Even with high-performance and specialized devices, the memory bandwidth can become a perf… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: This paper is accepted as a full paper at 24th Euromicro Conference on Digital System Design (DSD)

  41. arXiv:2106.03242  [pdf, other

    cs.CV cs.AI

    Highlighting the Importance of Reducing Research Bias and Carbon Emissions in CNNs

    Authors: Ahmed Badar, Arnav Varma, Adrian Staniec, Mahmoud Gamal, Omar Magdy, Haris Iqbal, Elahe Arani, Bahram Zonooz

    Abstract: Convolutional neural networks (CNNs) have become commonplace in addressing major challenges in computer vision. Researchers are not only coming up with new CNN architectures but are also researching different techniques to improve the performance of existing architectures. However, there is a tendency to over-emphasize performance improvement while neglecting certain important variables such as si… ▽ More

    Submitted 6 June, 2021; originally announced June 2021.

  42. arXiv:2106.02567  [pdf

    cs.CV cs.AI cs.LG

    AI Driven Road Maintenance Inspection

    Authors: Ratnajit Mukherjee, Haris Iqbal, Shabbir Marzban, Ahmed Badar, Terence Brouns, Shruthi Gowda, Elahe Arani, Bahram Zonooz

    Abstract: Road infrastructure maintenance inspection is typically a labour-intensive and critical task to ensure the safety of all the road users. In this work, we propose a detailed methodology to use state-of-the-art techniques in artificial intelligence and computer vision to automate a sizeable portion of the maintenance inspection subtasks and reduce the labour costs. The proposed methodology uses stat… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: accepted at 27th ITS World Congress, 2021

  43. arXiv:2105.02613  [pdf, other

    cs.LG cs.AR

    Challenges and Obstacles Towards Deploying Deep Learning Models on Mobile Devices

    Authors: Hamid Tabani, Ajay Balasubramaniam, Elahe Arani, Bahram Zonooz

    Abstract: From computer vision and speech recognition to forecasting trajectories in autonomous vehicles, deep learning approaches are at the forefront of so many domains. Deep learning models are developed using plethora of high-level, generic frameworks and libraries. Running those models on the mobile devices require hardware-aware optimizations and in most cases converting the models to other formats or… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

  44. arXiv:2104.10011  [pdf, other

    cs.CV cs.AI cs.LG

    Perceptual Loss for Robust Unsupervised Homography Estimation

    Authors: Daniel Koguciuk, Elahe Arani, Bahram Zonooz

    Abstract: Homography estimation is often an indispensable step in many computer vision tasks. The existing approaches, however, are not robust to illumination and/or larger viewpoint changes. In this paper, we propose bidirectional implicit Homography Estimation (biHomE) loss for unsupervised homography estimation. biHomE minimizes the distance in the feature space between the warped image from the source v… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted at Image Matching: Local Features & Beyond (CVPR 2021 Workshop)

  45. arXiv:2104.09866  [pdf, other

    cs.CV cs.AI cs.LG

    Distill on the Go: Online knowledge distillation in self-supervised learning

    Authors: Prashant Bhat, Elahe Arani, Bahram Zonooz

    Abstract: Self-supervised learning solves pretext prediction tasks that do not require annotations to learn feature representations. For vision tasks, pretext tasks such as predicting rotation, solving jigsaw are solely created from the input data. Yet, predicting this known information helps in learning representations useful for downstream tasks. However, recent works have shown that wider and deeper mode… ▽ More

    Submitted 30 June, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: Spotlight @ Learning from Limited or Imperfect Data (L2ID) Workshop - CVPR 2021

  46. Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation

    Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

    Abstract: Dense depth estimation is essential to scene-understanding for autonomous driving. However, recent self-supervised approaches on monocular videos suffer from scale-inconsistency across long sequences. Utilizing data from the ubiquitously copresent global positioning systems (GPS), we tackle this challenge by proposing a dynamically-weighted GPS-to-Scale (g2s) loss to complement the appearance-base… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: Accepted at 2021 IEEE International Conference on Robotics and Automation (ICRA)

  47. Practical Auto-Calibration for Spatial Scene-Understanding from Crowdsourced Dashcamera Videos

    Authors: Hemang Chawla, Matti Jukola, Shabbir Marzban, Elahe Arani, Bahram Zonooz

    Abstract: Spatial scene-understanding, including dense depth and ego-motion estimation, is an important problem in computer vision for autonomous vehicles and advanced driver assistance systems. Thus, it is beneficial to design perception modules that can utilize crowdsourced videos collected from arbitrary vehicular onboard or dashboard cameras. However, the intrinsic parameters corresponding to such camer… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: Accepted at 16th International Conference on Computer Vision Theory and Applications (VISAP, 2021)

  48. arXiv:2009.08325  [pdf, other

    cs.CV cs.LG

    Noisy Concurrent Training for Efficient Learning under Label Noise

    Authors: Fahad Sarfraz, Elahe Arani, Bahram Zonooz

    Abstract: Deep neural networks (DNNs) fail to learn effectively under label noise and have been shown to memorize random labels which affect their generalization performance. We consider learning in isolation, using one-hot encoded labels as the sole source of supervision, and a lack of regularization to discourage memorization as the major shortcomings of the standard training procedure. Thus, we propose N… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted at IEEE Winter Conference on Applications of Computer Vision (WACV, 2021)

  49. arXiv:2008.07015  [pdf, other

    cs.CV cs.LG

    Adversarial Concurrent Training: Optimizing Robustness and Accuracy Trade-off of Deep Neural Networks

    Authors: Elahe Arani, Fahad Sarfraz, Bahram Zonooz

    Abstract: Adversarial training has been proven to be an effective technique for improving the adversarial robustness of models. However, there seems to be an inherent trade-off between optimizing the model for accuracy and robustness. To this end, we propose Adversarial Concurrent Training (ACT), which employs adversarial training in a collaborative learning framework whereby we train a robust model in conj… ▽ More

    Submitted 18 August, 2020; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: Accepted at 31st British Machine Vision Conference (BMVC) 2020

  50. Crowdsourced 3D Map**: A Combined Multi-View Geometry and Self-Supervised Learning Approach

    Authors: Hemang Chawla, Matti Jukola, Terence Brouns, Elahe Arani, Bahram Zonooz

    Abstract: The ability to efficiently utilize crowdsourced visual data carries immense potential for the domains of large scale dynamic map** and autonomous driving. However, state-of-the-art methods for crowdsourced 3D map** assume prior knowledge of camera intrinsics. In this work, we propose a framework that estimates the 3D positions of semantically meaningful landmarks such as traffic signs without… ▽ More

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: Accepted at 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)