Skip to main content

Showing 1–32 of 32 results for author: Kalkan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08653  [pdf, other

    cs.RO

    BaSeNet: A Learning-based Mobile Manipulator Base Pose Sequence Planning for Pickup Tasks

    Authors: Lakshadeep Naik, Sinan Kalkan, Sune L. Sørensen, Mikkel B. Kjærgaard, Norbert Krüger

    Abstract: In many applications, a mobile manipulator robot is required to grasp a set of objects distributed in space. This may not be feasible from a single base pose and the robot must plan the sequence of base poses for gras** all objects, minimizing the total navigation and gras** time. This is a Combinatorial Optimization problem that can be solved using exact methods, which provide optimal solutio… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Submitted to IROS 2024

  2. arXiv:2405.13264  [pdf, other

    cs.LG cs.AI cs.CV

    Part-based Quantitative Analysis for Heatmaps

    Authors: Osman Tursun, Sinan Kalkan, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Heatmaps have been instrumental in hel** understand deep network decisions, and are a common approach for Explainable AI (XAI). While significant progress has been made in enhancing the informativeness and accessibility of heatmaps, heatmap analysis is typically very subjective and limited to domain experts. As such, develo** automatic, scalable, and numerical analysis methods to make heatmap-… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2404.09692  [pdf, other

    cs.CV

    XoFTR: Cross-modal Feature Matching Transformer

    Authors: Önder Tuzcuoğlu, Aybora Köksal, Buğra Sofu, Sinan Kalkan, A. Aydın Alatan

    Abstract: We introduce, XoFTR, a cross-modal cross-view method for local feature matching between thermal infrared (TIR) and visible images. Unlike visible images, TIR images are less susceptible to adverse lighting and weather conditions but present difficulties in matching due to significant texture and intensity differences. Current hand-crafted and learning-based methods for visible-TIR matching fall sh… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: CVPR Image Matching Workshop, 2024. 12 pages, 7 figures, 5 tables. Codes and dataset are available at https://github.com/OnderT/XoFTR

  4. arXiv:2403.01795  [pdf, other

    cs.CV

    RankED: Addressing Imbalance and Uncertainty in Edge Detection Using Ranking-based Losses

    Authors: Bedrettin Cetinkaya, Sinan Kalkan, Emre Akbas

    Abstract: Detecting edges in images suffers from the problems of (P1) heavy imbalance between positive and negative classes as well as (P2) label uncertainty owing to disagreement between different annotators. Existing solutions address P1 using class-balanced cross-entropy loss and dice loss and P2 by only predicting edges agreed upon by most annotators. In this paper, we propose RankED, a unified ranking-… ▽ More

    Submitted 7 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: accepted to CVPR 2024

  5. arXiv:2312.17031  [pdf, other

    cs.CV

    Generalized Mask-aware IoU for Anchor Assignment for Real-time Instance Segmentation

    Authors: Barış Can Çam, Kemal Öksüz, Fehmi Kahraman, Zeynep Sonat Baltacı, Sinan Kalkan, Emre Akbaş

    Abstract: This paper introduces Generalized Mask-aware Intersection-over-Union (GmaIoU) as a new measure for positive-negative assignment of anchor boxes during training of instance segmentation methods. Unlike conventional IoU measure or its variants, which only consider the proximity of anchor and ground-truth boxes; GmaIoU additionally takes into account the segmentation mask. This enables GmaIoU to prov… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 28 pages, 4 figures

  6. arXiv:2312.11299  [pdf, other

    cs.LG cs.CY stat.ML

    Uncertainty-based Fairness Measures

    Authors: Selim Kuzucu, Jiaee Cheong, Hatice Gunes, Sinan Kalkan

    Abstract: Unfair predictions of machine learning (ML) models impede their broad acceptance in real-world settings. Tackling this arduous challenge first necessitates defining what it means for an ML model to be fair. This has been addressed by the ML community with various measures of fairness that depend on the prediction outcomes of the ML models, either at the group level or the individual level. These f… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  7. arXiv:2311.14090  [pdf, other

    cs.LG cs.CV

    Class Uncertainty: A Measure to Mitigate Class Imbalance

    Authors: Z. S. Baltaci, K. Oksuz, S. Kuzucu, K. Tezoren, B. K. Konar, A. Ozkan, E. Akbas, S. Kalkan

    Abstract: Class-wise characteristics of training examples affect the performance of deep classifiers. A well-studied example is when the number of training examples of classes follows a long-tailed distribution, a situation that is likely to yield sub-optimal performance for under-represented classes. This class imbalance problem is conventionally addressed by approaches relying on the class-wise cardinalit… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  8. arXiv:2301.01019  [pdf, other

    cs.CV

    Correlation Loss: Enforcing Correlation between Classification and Localization

    Authors: Fehmi Kahraman, Kemal Oksuz, Sinan Kalkan, Emre Akbas

    Abstract: Object detectors are conventionally trained by a weighted sum of classification and localization losses. Recent studies (e.g., predicting IoU with an auxiliary head, Generalized Focal Loss, Rank & Sort Loss) have shown that forcing these two loss terms to interact with each other in non-conventional ways creates a useful inductive bias and improves performance. Inspired by these works, we focus on… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI 2023

  9. arXiv:2209.07268  [pdf, other

    cs.RO cs.AI

    AssembleRL: Learning to Assemble Furniture from Their Point Clouds

    Authors: Özgür Aslan, Burak Bolat, Batuhan Bal, Tuğba Tümer, Erol Şahin, Sinan Kalkan

    Abstract: The rise of simulation environments has enabled learning-based approaches for assembly planning, which is otherwise a labor-intensive and daunting task. Assembling furniture is especially interesting since furniture are intricate and pose challenges for learning-based approaches. Surprisingly, humans can solve furniture assembly mostly given a 2D snapshot of the assembled product. Although recent… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 6 pages, 6 figures, iros2022

  10. arXiv:2209.02482  [pdf, other

    cs.CV

    Segment Augmentation and Differentiable Ranking for Logo Retrieval

    Authors: Feyza Yavuz, Sinan Kalkan

    Abstract: Logo retrieval is a challenging problem since the definition of similarity is more subjective compared to image retrieval tasks and the set of known similarities is very scarce. To tackle this challenge, in this paper, we propose a simple but effective segment-based augmentation strategy to introduce artificially similar logos for training deep networks for logo retrieval. In this novel augmentati… ▽ More

    Submitted 13 September, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: ICPR2022, Poster Presentation

  11. arXiv:2204.06512  [pdf, other

    cs.CV

    Does depth estimation help object detection?

    Authors: Bedrettin Cetinkaya, Sinan Kalkan, Emre Akbas

    Abstract: Ground-truth depth, when combined with color data, helps improve object detection accuracy over baseline models that only use color. However, estimated depth does not always yield improvements. Many factors affect the performance of object detection when estimated depth is used. In this paper, we comprehensively investigate these factors with detailed experiments, such as using ground-truth vs. es… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: Accepted to Image and Vision Computing

  12. arXiv:2110.09734  [pdf, other

    cs.CV

    Mask-aware IoU for Anchor Assignment in Real-time Instance Segmentation

    Authors: Kemal Oksuz, Baris Can Cam, Fehmi Kahraman, Zeynep Sonat Baltaci, Sinan Kalkan, Emre Akbas

    Abstract: This paper presents Mask-aware Intersection-over-Union (maIoU) for assigning anchor boxes as positives and negatives during training of instance segmentation methods. Unlike conventional IoU or its variants, which only considers the proximity of two boxes; maIoU consistently measures the proximity of an anchor box with not only a ground truth box but also its associated ground truth mask. Thus, ad… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: BMVC 2021, camera ready version

  13. arXiv:2107.11669  [pdf, other

    cs.CV

    Rank & Sort Loss for Object Detection and Instance Segmentation

    Authors: Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan

    Abstract: We propose Rank & Sort (RS) Loss, a ranking-based loss function to train deep object detection and instance segmentation methods (i.e. visual detectors). RS Loss supervises the classifier, a sub-network of these methods, to rank each positive above all negatives as well as to sort positives among themselves with respect to (wrt.) their localisation qualities (e.g. Intersection-over-Union - IoU). T… ▽ More

    Submitted 30 August, 2021; v1 submitted 24 July, 2021; originally announced July 2021.

    Comments: ICCV 2021, oral presentation

  14. arXiv:2011.10772  [pdf, other

    cs.CV

    One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks

    Authors: Kemal Oksuz, Baris Can Cam, Sinan Kalkan, Emre Akbas

    Abstract: Despite being widely used as a performance measure for visual detection tasks, Average Precision (AP) is limited in (i) reflecting localisation quality, (ii) interpretability and (iii) robustness to the design choices regarding its computation, and its applicability to outputs without confidence scores. Panoptic Quality (PQ), a measure proposed for evaluating panoptic segmentation (Kirillov et al.… ▽ More

    Submitted 21 November, 2021; v1 submitted 21 November, 2020; originally announced November 2020.

    Comments: Accepted to TPAMI

  15. arXiv:2011.08819  [pdf, other

    cs.CV cs.LG

    Spatio-Temporal Analysis of Facial Actions using Lifecycle-Aware Capsule Networks

    Authors: Nikhil Churamani, Sinan Kalkan, Hatice Gunes

    Abstract: Most state-of-the-art approaches for Facial Action Unit (AU) detection rely upon evaluating facial expressions from static frames, encoding a snapshot of heightened facial activity. In real-world interactions, however, facial expressions are usually more subtle and evolve in a temporal manner requiring AU detection models to learn spatial as well as temporal information. In this paper, we focus on… ▽ More

    Submitted 3 March, 2021; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: Updated Figure 6 and the Acknowledgements. Corrected typos. 11 pages, 6 figures, 3 tables

  16. arXiv:2011.06978  [pdf, other

    cs.CV

    Transformer-Encoder Detector Module: Using Context to Improve Robustness to Adversarial Attacks on Object Detection

    Authors: Faisal Alamri, Sinan Kalkan, Nicolas Pugeault

    Abstract: Deep neural network approaches have demonstrated high performance in object recognition (CNN) and detection (Faster-RCNN) tasks, but experiments have shown that such architectures are vulnerable to adversarial attacks (FFF, UAP): low amplitude perturbations, barely perceptible by the human eye, can lead to a drastic reduction in labeling performance. This article proposes a new context module, cal… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: Accepted for the 25th International Conference on Pattern Recognition (ICPR'2020)

  17. arXiv:2009.13592  [pdf, other

    cs.CV

    A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection

    Authors: Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan

    Abstract: We propose average Localisation-Recall-Precision (aLRP), a unified, bounded, balanced and ranking-based loss function for both classification and localisation tasks in object detection. aLRP extends the Localisation-Recall-Precision (LRP) performance metric (Oksuz et al., 2018) inspired from how Average Precision (AP) Loss extends precision to a ranking-based loss function for classification (Chen… ▽ More

    Submitted 7 January, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: NeurIPS 2020 spotlight paper

  18. arXiv:2008.01232  [pdf, other

    cs.CV

    Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition

    Authors: M. Esat Kalfaoglu, Sinan Kalkan, A. Aydin Alatan

    Abstract: In this work, we combine 3D convolution with late temporal modeling for action recognition. For this aim, we replace the conventional Temporal Global Average Pooling (TGAP) layer at the end of 3D convolutional architecture with the Bidirectional Encoder Representations from Transformers (BERT) layer in order to better utilize the temporal information with BERT's attention mechanism. We show that t… ▽ More

    Submitted 17 September, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: Presented on the 2nd Workshop on Video Turing Test: Toward Human-Level Video Story Understanding, ECCV 2020

  19. arXiv:2007.12506  [pdf, other

    cs.RO cs.HC

    Mind Your Manners! A Dataset and A Continual Learning Approach for Assessing Social Appropriateness of Robot Actions

    Authors: Jonas Tjomsland, Sinan Kalkan, Hatice Gunes

    Abstract: To date, endowing robots with an ability to assess social appropriateness of their actions has not been possible. This has been mainly due to (i) the lack of relevant and labelled data, and (ii) the lack of formulations of this as a lifelong learning problem. In this paper, we address these two issues. We first introduce the Socially Appropriate Domestic Robot Actions dataset (MANNERS-DB), which c… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: Human-Robot Interaction; Social Robotics; Social Appropriateness; Continual Learning. Submitted to the RO-MAN 2020 Workshop on Lifelong Learning for Long-term Human-Robot Interaction (LL4LHRI)

  20. arXiv:2007.10075  [pdf, other

    cs.CV

    Investigating Bias and Fairness in Facial Expression Recognition

    Authors: Tian Xu, Jennifer White, Sinan Kalkan, Hatice Gunes

    Abstract: Recognition of expressions of emotions and affect from facial images is a well-studied research problem in the fields of affective computing and computer vision with a large number of datasets available containing facial images and corresponding expression labels. However, virtually none of these datasets have been acquired with consideration of fair distribution across the human population. There… ▽ More

    Submitted 21 August, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

  21. arXiv:1910.11713  [pdf, other

    cs.CV

    ALET (Automated Labeling of Equipment and Tools): A Dataset, a Baseline and a Usecase for Tool Detection in the Wild

    Authors: Fatih Can Kurnaz, Burak Hocaoğlu, Mert Kaan Yılmaz, İdil Sülo, Sinan Kalkan

    Abstract: Robots collaborating with humans in realistic environments will need to be able to detect the tools that can be used and manipulated. However, there is no available dataset or study that addresses this challenge in real settings. In this paper, we fill this gap by providing an extensive dataset (METU-ALET) for detecting farming, gardening, office, stonemasonry, vehicle, woodworking and workshop to… ▽ More

    Submitted 13 December, 2020; v1 submitted 25 October, 2019; originally announced October 2019.

    Comments: 7 pages, 4 figures

  22. arXiv:1909.09777  [pdf, other

    cs.CV

    Generating Positive Bounding Boxes for Balanced Training of Object Detectors

    Authors: Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan

    Abstract: Two-stage deep object detectors generate a set of regions-of-interest (RoI) in the first stage, then, in the second stage, identify objects among the proposed RoIs that sufficiently overlap with a ground truth (GT) box. The second stage is known to suffer from a bias towards RoIs that have low intersection-over-union (IoU) with the associated GT boxes. To address this issue, we first propose a sam… ▽ More

    Submitted 19 June, 2020; v1 submitted 21 September, 2019; originally announced September 2019.

    Comments: To appear in WACV 20

  23. arXiv:1909.00169  [pdf, other

    cs.CV

    Imbalance Problems in Object Detection: A Review

    Authors: Kemal Oksuz, Baris Can Cam, Sinan Kalkan, Emre Akbas

    Abstract: In this paper, we present a comprehensive review of the imbalance problems in object detection. To analyze the problems in a systematic manner, we introduce a problem-based taxonomy. Following this taxonomy, we discuss each problem in depth and present a unifying yet critical perspective on the solutions in the literature. In addition, we identify major open issues regarding the existing imbalance… ▽ More

    Submitted 11 March, 2020; v1 submitted 31 August, 2019; originally announced September 2019.

    Comments: Accepted to IEEE TPAMI; currently in press

  24. arXiv:1908.01189  [pdf, other

    cs.CV

    Searching for Ambiguous Objects in Videos using Relational Referring Expressions

    Authors: Hazan Anayurt, Sezai Artun Ozyegin, Ulfet Cetin, Utku Aktas, Sinan Kalkan

    Abstract: Humans frequently use referring (identifying) expressions to refer to objects. Especially in ambiguous settings, humans prefer expressions (called relational referring expressions) that describe an object with respect to a distinguishing, unique object. Unlike studies on video object search using referring expressions, in this paper, our focus is on (i) relational referring expressions in highly a… ▽ More

    Submitted 20 August, 2019; v1 submitted 3 August, 2019; originally announced August 2019.

    Comments: BMVC 2019 camera ready

  25. Learning to Generate Unambiguous Spatial Referring Expressions for Real-World Environments

    Authors: Fethiye Irmak Doğan, Sinan Kalkan, Iolanda Leite

    Abstract: Referring to objects in a natural and unambiguous manner is crucial for effective human-robot interaction. Previous research on learning-based referring expressions has focused primarily on comprehension tasks, while generating referring expressions is still mostly limited to rule-based methods. In this work, we propose a two-stage approach that relies on deep learning for estimating spatial relat… ▽ More

    Submitted 5 August, 2019; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: International Conference on Intelligent Robots and Systems (IROS 2019), Demo 1: Finding the described object (https://youtu.be/BE6-F6chW0w), Demo 2: Referring to the pointed object (https://youtu.be/nmmv6JUpy8M), Supplementary Video (https://youtu.be/sFjBa_MHS98)

    Journal ref: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (2019) 4992-4999

  26. arXiv:1807.01696  [pdf, other

    cs.CV

    Localization Recall Precision (LRP): A New Performance Metric for Object Detection

    Authors: Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan

    Abstract: Average precision (AP), the area under the recall-precision (RP) curve, is the standard performance measure for object detection. Despite its wide acceptance, it has a number of shortcomings, the most important of which are (i) the inability to distinguish very different RP curves, and (ii) the lack of directly measuring bounding box localization accuracy. In this paper, we propose 'Localization R… ▽ More

    Submitted 5 July, 2018; v1 submitted 4 July, 2018; originally announced July 2018.

    Comments: to appear in ECCV 2018

  27. arXiv:1807.00511  [pdf, other

    cs.RO cs.CV cs.LG

    COSMO: Contextualized Scene Modeling with Boltzmann Machines

    Authors: Ilker Bozcan, Sinan Kalkan

    Abstract: Scene modeling is very crucial for robots that need to perceive, reason about and manipulate the objects in their environments. In this paper, we adapt and extend Boltzmann Machines (BMs) for contextualized scene modeling. Although there are many models on the subject, ours is the first to bring together objects, relations, and affordances in a highly-capable generative model. For this end, we int… ▽ More

    Submitted 19 December, 2018; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: 40 pages, 15 figures, 9 tables, accepted to the Robotics and Autonomous Systems (RAS) special issue on Semantic Policy and Action Representations for Autonomous Robots (SPAR)

  28. arXiv:1710.05664  [pdf, other

    cs.CV cs.RO

    What is (missing or wrong) in the scene? A Hybrid Deep Boltzmann Machine For Contextualized Scene Modeling

    Authors: İlker Bozcan, Yağmur Oymak, İdil Zeynep Alemdar, Sinan Kalkan

    Abstract: Scene models allow robots to reason about what is in the scene, what else should be in it, and what should not be in it. In this paper, we propose a hybrid Boltzmann Machine (BM) for scene modeling where relations between objects are integrated. To be able to do that, we extend BM to include tri-way edges between visible (object) nodes and make the network to share the relations across different o… ▽ More

    Submitted 20 August, 2018; v1 submitted 16 October, 2017; originally announced October 2017.

    Comments: 6 pages, 7 figures, submitted to ICRA 2018

  29. arXiv:1710.04981  [pdf, other

    cs.RO cs.LG

    CINet: A Learning Based Approach to Incremental Context Modeling in Robots

    Authors: Fethiye Irmak Doğan, İlker Bozcan, Mehmet Çelik, Sinan Kalkan

    Abstract: There have been several attempts at modeling context in robots. However, either these attempts assume a fixed number of contexts or use a rule-based approach to determine when to increment the number of contexts. In this paper, we pose the task of when to increment as a learning problem, which we solve using a Recurrent Neural Network. We show that the network successfully (with 98\% testing accur… ▽ More

    Submitted 29 July, 2018; v1 submitted 13 October, 2017; originally announced October 2017.

    Comments: The first two authors have contributed equally, 6 pages, 8 figures, International Conference on Intelligent Robots (IROS 2018)

  30. arXiv:1710.04975  [pdf, other

    cs.RO cs.LG

    A Deep Incremental Boltzmann Machine for Modeling Context in Robots

    Authors: Fethiye Irmak Doğan, Hande Çelikkanat, Sinan Kalkan

    Abstract: Context is an essential capability for robots that are to be as adaptive as possible in challenging environments. Although there are many context modeling efforts, they assume a fixed structure and number of contexts. In this paper, we propose an incremental deep model that extends Restricted Boltzmann Machines. Our model gets one scene at a time, and gradually extends the contextual model when ne… ▽ More

    Submitted 2 March, 2018; v1 submitted 13 October, 2017; originally announced October 2017.

    Comments: 6 pages, 5 figures, International Conference on Robotics and Automation (ICRA 2018)

  31. arXiv:1706.05726  [pdf, other

    cs.CV

    Using Deep Networks for Drone Detection

    Authors: Cemal Aker, Sinan Kalkan

    Abstract: Drone detection is the problem of finding the smallest rectangle that encloses the drone(s) in a video sequence. In this study, we propose a solution using an end-to-end object detection model based on convolutional neural networks. To solve the scarce data problem for training the network, we propose an algorithm for creating an extensive artificial dataset by combining background-subtracted real… ▽ More

    Submitted 18 June, 2017; originally announced June 2017.

    Comments: To appear in International Workshop on Small-Drone Surveillance, Detection and Counteraction Techniques organised within AVSS 2017

  32. arXiv:1701.05766  [pdf, other

    cs.CV

    A Large-scale Dataset and Benchmark for Similar Trademark Retrieval

    Authors: Osman Tursun, Cemal Aker, Sinan Kalkan

    Abstract: Trademark retrieval (TR) has become an important yet challenging problem due to an ever increasing trend in trademark applications and infringement incidents. There have been many promising attempts for the TR problem, which, however, fell impracticable since they were evaluated with limited and mostly trivial datasets. In this paper, we provide a large-scale dataset with benchmark queries with wh… ▽ More

    Submitted 14 October, 2017; v1 submitted 20 January, 2017; originally announced January 2017.