Skip to main content

Showing 1–17 of 17 results for author: Akagunduz, E

.
  1. arXiv:2405.18451  [pdf, other

    eess.SP

    Deep Learning-based Epicenter Localization using Single-Station Strong Motion Records

    Authors: Melek Türkmen, Sanem Meral, Baris Yilmaz, Melis Cikis, Erdem Akagündüz, Salih Tileylioglu

    Abstract: This paper explores the application of deep learning (DL) techniques to strong motion records for single-station epicenter localization. Often underutilized in seismology-related studies, strong motion records offer a potential wealth of information about seismic events. We investigate whether DL-based methods can effectively leverage this data for accurate epicenter localization. Our study introd… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2405.17901  [pdf, other

    cs.CV cs.AI

    Near-Infrared and Low-Rank Adaptation of Vision Transformers in Remote Sensing

    Authors: Irem Ulku, O. Ozgur Tanriover, Erdem Akagündüz

    Abstract: Plant health can be monitored dynamically using multispectral sensors that measure Near-Infrared reflectance (NIR). Despite this potential, obtaining and annotating high-resolution NIR images poses a significant challenge for training deep neural networks. Typically, large networks pre-trained on the RGB domain are utilized to fine-tune infrared images. This practice introduces a domain shift issu… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 7 pages, 3 figures, 3 tables

  3. arXiv:2405.06383  [pdf, other

    cs.CV

    How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models?

    Authors: Engin Uzun, Erdem Akagunduz

    Abstract: Atmospheric turbulence poses a significant challenge to the performance of object detection models. Turbulence causes distortions, blurring, and noise in images by bending and scattering light rays due to variations in the refractive index of air. This results in non-rigid geometric distortions and temporal fluctuations in the electromagnetic radiation received by optical systems. This paper explo… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  4. arXiv:2404.08589  [pdf, other

    cs.CV cs.AI

    Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts

    Authors: Övgü Özdemir, Erdem Akagündüz

    Abstract: Visual question answering (VQA) is known as an AI-complete task as it requires understanding, reasoning, and inferring about the vision and the language content. Over the past few years, numerous neural architectures have been suggested for the VQA problem. However, achieving success in zero-shot VQA remains a challenge due to its requirement for advanced generalization and reasoning skills. This… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: The paper has been accepted for presentation at CVPR 2024 Workshop on Prompting in Vision

  5. arXiv:2403.07569  [pdf, other

    eess.SP cs.CV cs.LG

    Exploring Challenges in Deep Learning of Single-Station Ground Motion Records

    Authors: Ümit Mert Çağlar, Baris Yilmaz, Melek Türkmen, Erdem Akagündüz, Salih Tileylioglu

    Abstract: Contemporary deep learning models have demonstrated promising results across various applications within seismology and earthquake engineering. These models rely primarily on utilizing ground motion records for tasks such as earthquake event classification, localization, earthquake early warning systems, and structural health monitoring. However, the extent to which these models effectively learn… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 9 Pages, 12 Figures, 5 Tables

  6. arXiv:2402.00971  [pdf, other

    cs.CV

    FuseFormer: A Transformer for Visual and Thermal Image Fusion

    Authors: Aytekin Erdogan, Erdem Akagündüz

    Abstract: Due to the lack of a definitive ground truth for the image fusion problem, the loss functions are structured based on evaluation metrics, such as the structural similarity index measure (SSIM). However, in doing so, a bias is introduced toward the SSIM and, consequently, the input visual band image. The objective of this study is to propose a novel methodology for the image fusion problem that mit… ▽ More

    Submitted 24 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 8 pages, 6 figures, 6 tables

  7. arXiv:2312.02194  [pdf, other

    cs.CV

    Local Masking Meets Progressive Freezing: Crafting Efficient Vision Transformers for Self-Supervised Learning

    Authors: Utku Mert Topcuoglu, Erdem Akagündüz

    Abstract: In this paper, we present an innovative approach to self-supervised learning for Vision Transformers (ViTs), integrating local masked image modeling with progressive layer freezing. This method focuses on enhancing the efficiency and speed of initial layer training in ViTs. By systematically freezing specific layers at strategic points during training, we reduce computational demands while maintai… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  8. arXiv:2307.01893  [pdf, other

    cs.CV

    EANet: Enhanced Attribute-based RGBT Tracker Network

    Authors: Abbas Türkoğlu, Erdem Akagündüz

    Abstract: Tracking objects can be a difficult task in computer vision, especially when faced with challenges such as occlusion, changes in lighting, and motion blur. Recent advances in deep learning have shown promise in challenging these conditions. However, most deep learning-based object trackers only use visible band (RGB) images. Thermal infrared electromagnetic waves (TIR) can provide additional infor… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  9. arXiv:2209.05269  [pdf, other

    cs.CV

    Detecting Driver Drowsiness as an Anomaly Using LSTM Autoencoders

    Authors: Gülin Tüfekci, Alper Kayabaşi, Erdem Akagündüz, İlkay Ulusoy

    Abstract: In this paper, an LSTM autoencoder-based architecture is utilized for drowsiness detection with ResNet-34 as feature extractor. The problem is considered as anomaly detection for a single subject; therefore, only the normal driving representations are learned and it is expected that drowsiness representations, yielding higher reconstruction losses, are to be distinguished according to the knowledg… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: Accepted to ECCV 2022 In-Vehicle Sensing and Monitorization (ISM) Workshop

  10. arXiv:2207.10409  [pdf, other

    cs.CV cs.LG

    Sequence Models for Drone vs Bird Classification

    Authors: Fatih Cagatay Akyon, Erdem Akagunduz, Sinan Onur Altinuc, Alptekin Temizel

    Abstract: Drone detection has become an essential task in object detection as drone costs have decreased and drone technology has improved. It is, however, difficult to detect distant drones when there is weak contrast, long range, and low visibility. In this work, we propose several sequence classification architectures to reduce the detected false-positive ratio of drone tracks. Moreover, we propose a new… ▽ More

    Submitted 19 December, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

  11. arXiv:2204.08745  [pdf, other

    cs.CV

    Augmentation of Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models

    Authors: Engin Uzun, Ahmet Anil Dursun, Erdem Akagunduz

    Abstract: Atmospheric turbulence has a degrading effect on the image quality of long-range observation systems. As a result of various elements such as temperature, wind velocity, humidity, etc., turbulence is characterized by random fluctuations in the refractive index of the atmosphere. It is a phenomenon that may occur in various imaging spectra such as the visible or the infrared bands. In this paper, w… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022 Perception Beyond the Visible Spectrum (PBVS) Workshop

  12. arXiv:2203.08581  [pdf, other

    cs.CV cs.AI

    A Survey on Infrared Image and Video Sets

    Authors: Kevser Irem Danaci, Erdem Akagunduz

    Abstract: In this survey, we compile a list of publicly available infrared image and video sets for artificial intelligence and computer vision researchers. We mainly focus on IR image and video sets which are collected and labelled for computer vision applications such as object detection, object segmentation, classification, and motion detection. We categorize 92 different publicly available or private se… ▽ More

    Submitted 16 January, 2023; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: Updated with recent sets

  13. arXiv:2107.02427  [pdf, other

    cs.LG cs.AI eess.SY

    Dynamical System Parameter Identification using Deep Recurrent Cell Networks

    Authors: Erdem Akagündüz, Oguzhan Cifdaloz

    Abstract: In this paper, we investigate the parameter identification problem in dynamical systems through a deep learning approach. Focusing mainly on second-order, linear time-invariant dynamical systems, the topic of dam** factor identification is studied. By utilizing a six-layer deep neural network with different recurrent cells, namely GRUs, LSTMs or BiLSTMs; and by feeding input-output sequence pair… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: Final version published in Journal of Neural Computing and Applications

  14. Filter design for small target detection on infrared imagery using normalized-cross-correlation layer

    Authors: H. Seçkin Demir, Erdem Akagunduz

    Abstract: In this paper, we introduce a machine learning approach to the problem of infrared small target detection filter design. For this purpose, similarly to a convolutional layer of a neural network, the normalized-cross-correlational (NCC) layer, which we utilize for designing a target detection/recognition filter bank, is proposed. By employing the NCC layer in a neural network structure, we introduc… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    Journal ref: published in Turkish Journal of Electrical Engineering and Computer Sciences, vol.28, 302:317, 2020

  15. arXiv:2006.05355  [pdf, other

    cs.CV

    A Hybrid Framework for Matching Printing Design Files to Product Photos

    Authors: Alper Kaplan, Erdem Akagunduz

    Abstract: We propose a real-time image matching framework, which is hybrid in the sense that it uses both hand-crafted features and deep features obtained from a well-tuned deep convolutional network. The matching problem, which we concentrate on, is specific to a certain application, that is, printing design to product photo matching. Printing designs are any kind of template image files, created using a d… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Journal ref: published in Balkan Journal of Electrical and Computer Engineering, Volume 8 - Issue 2 - Apr 30, 2020

  16. A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D images

    Authors: Irem Ulku, Erdem Akagunduz

    Abstract: Semantic segmentation is the pixel-wise labelling of an image. Since the problem is defined at the pixel level, determining image class labels only is not acceptable, but localising them at the original image pixel resolution is necessary. Boosted by the extraordinary ability of convolutional neural networks (CNN) in creating semantic, high level and hierarchical image features; several deep learn… ▽ More

    Submitted 16 March, 2022; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: published in the J. of Applied Artificial Intelligence (09 Feb 2022)

  17. arXiv:1903.02056  [pdf, other

    cs.CV

    Defining Image Memorability using the Visual Memory Schema

    Authors: Erdem Akagunduz, Adrian G. Bors, Karla K. Evans

    Abstract: Memorability of an image is a characteristic determined by the human observers' ability to remember images they have seen. Yet recent work on image memorability defines it as an intrinsic property that can be obtained independent of the observer. {The current study aims to enhance our understanding and prediction of image memorability, improving upon existing approaches by incorporating the proper… ▽ More

    Submitted 5 March, 2019; originally announced March 2019.

    Comments: Submitted to TPAMI on Aug 4, 2017