Search | arXiv e-print repository

DeepMIF: Deep Monotonic Implicit Fields for Large-Scale LiDAR 3D Map**

Authors: Kutay Yılmaz, Matthias Nießner, Anastasiia Kornilova, Alexey Artemov

Abstract: Recently, significant progress has been achieved in sensing real large-scale outdoor 3D environments, particularly by using modern acquisition equipment such as LiDAR sensors. Unfortunately, they are fundamentally limited in their ability to produce dense, complete 3D scenes. To address this issue, recent learning-based methods integrate neural implicit representations and optimizable feature grid… ▽ More Recently, significant progress has been achieved in sensing real large-scale outdoor 3D environments, particularly by using modern acquisition equipment such as LiDAR sensors. Unfortunately, they are fundamentally limited in their ability to produce dense, complete 3D scenes. To address this issue, recent learning-based methods integrate neural implicit representations and optimizable feature grids to approximate surfaces of 3D scenes. However, naively fitting samples along raw LiDAR rays leads to noisy 3D map** results due to the nature of sparse, conflicting LiDAR measurements. Instead, in this work we depart from fitting LiDAR data exactly, instead letting the network optimize a non-metric monotonic implicit field defined in 3D space. To fit our field, we design a learning system integrating a monotonicity loss that enables optimizing neural monotonic fields and leverages recent progress in large-scale 3D map**. Our algorithm achieves high-quality dense 3D map** performance as captured by multiple quantitative and perceptual measures and visual results obtained for Mai City, Newer College, and KITTI benchmarks. The code of our approach will be made publicly available. △ Less

Submitted 26 March, 2024; originally announced March 2024.

Comments: 8 pages, 6 figures

arXiv:2312.05391 [pdf, other]

Loss Functions in the Era of Semantic Segmentation: A Survey and Outlook

Authors: Reza Azad, Moein Heidary, Kadir Yilmaz, Michael Hüttemann, Sanaz Karimijafarbigloo, Yuli Wu, Anke Schmeink, Dorit Merhof

Abstract: Semantic image segmentation, the process of classifying each pixel in an image into a particular class, plays an important role in many visual understanding systems. As the predominant criterion for evaluating the performance of statistical models, loss functions are crucial for sha** the development of deep learning-based segmentation algorithms and improving their overall performance. To aid r… ▽ More Semantic image segmentation, the process of classifying each pixel in an image into a particular class, plays an important role in many visual understanding systems. As the predominant criterion for evaluating the performance of statistical models, loss functions are crucial for sha** the development of deep learning-based segmentation algorithms and improving their overall performance. To aid researchers in identifying the optimal loss function for their particular application, this survey provides a comprehensive and unified review of $25$ loss functions utilized in image segmentation. We provide a novel taxonomy and thorough review of how these loss functions are customized and leveraged in image segmentation, with a systematic categorization emphasizing their significant features and applications. Furthermore, to evaluate the efficacy of these methods in real-world scenarios, we propose unbiased evaluations of some distinct and renowned loss functions on established medical and natural image datasets. We conclude this review by identifying current challenges and unveiling future research opportunities. Finally, we have compiled the reviewed studies that have open-source implementations on our GitHub page. △ Less

Submitted 8 December, 2023; originally announced December 2023.

arXiv:2309.16133 [pdf, other]

Mask4Former: Mask Transformer for 4D Panoptic Segmentation

Authors: Kadir Yilmaz, Jonas Schult, Alexey Nekrasov, Bastian Leibe

Abstract: Accurately perceiving and tracking instances over time is essential for the decision-making processes of autonomous agents interacting safely in dynamic environments. With this intention, we propose Mask4Former for the challenging task of 4D panoptic segmentation of LiDAR point clouds. Mask4Former is the first transformer-based approach unifying semantic instance segmentation and tracking of spars… ▽ More Accurately perceiving and tracking instances over time is essential for the decision-making processes of autonomous agents interacting safely in dynamic environments. With this intention, we propose Mask4Former for the challenging task of 4D panoptic segmentation of LiDAR point clouds. Mask4Former is the first transformer-based approach unifying semantic instance segmentation and tracking of sparse and irregular sequences of 3D point clouds into a single joint model. Our model directly predicts semantic instances and their temporal associations without relying on hand-crafted non-learned association strategies such as probabilistic clustering or voting-based center prediction. Instead, Mask4Former introduces spatio-temporal instance queries that encode the semantic and geometric properties of each semantic tracklet in the sequence. In an in-depth study, we find that promoting spatially compact instance predictions is critical as spatio-temporal instance queries tend to merge multiple semantically similar instances, even if they are spatially distant. To this end, we regress 6-DOF bounding box parameters from spatio-temporal instance queries, which are used as an auxiliary task to foster spatially compact predictions. Mask4Former achieves a new state-of-the-art on the SemanticKITTI test set with a score of 68.4 LSTQ. △ Less

Submitted 10 April, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

Comments: Renamed from MASK4D to Mask4Former. ICRA 2024. Project page: https://vision.rwth-aachen.de/Mask4Former

arXiv:2309.02196 [pdf, other]

Finite dimensional backstep** controller design

Authors: Varga Kalantarov, Türker Özsarı, Kemal Cem Yılmaz

Abstract: We introduce a finite dimensional version of backstep** controller design for stabilizing solutions of PDEs from boundary. Our controller uses only a finite number of Fourier modes of the state of solution, as opposed to the classical backstep** controller which uses all (infinitely many) modes. We apply our method to the reaction-diffusion equation, which serves only as a canonical example bu… ▽ More We introduce a finite dimensional version of backstep** controller design for stabilizing solutions of PDEs from boundary. Our controller uses only a finite number of Fourier modes of the state of solution, as opposed to the classical backstep** controller which uses all (infinitely many) modes. We apply our method to the reaction-diffusion equation, which serves only as a canonical example but the method is applicable also to other PDEs whose solutions can be decomposed into a slow finite-dimensional part and a fast tail, where the former dominates the evolution in large time. One of the main goals is to estimate the sufficient number of modes needed to stabilize the plant at a prescribed rate. In addition, we find the minimal number of modes that guarantee the stabilization at a certain (unprescribed) decay rate. Theoretical findings are supported with numerical solutions. △ Less

Submitted 5 September, 2023; originally announced September 2023.

Comments: 28 pages, 2 figures

MSC Class: 35B40; 35K57; 93C20; 93D15; 93D20; 93D23

arXiv:2211.04184 [pdf, other]

On the Past, Present, and Future of the Diebold-Yilmaz Approach to Dynamic Network Connectedness

Authors: Francis X. Diebold, Kamil Yilmaz

Abstract: We offer retrospective and prospective assessments of the Diebold-Yilmaz connectedness research program, combined with personal recollections of its development. Its centerpiece in many respects is Diebold and Yilmaz (2014), around which our discussion is organized. We offer retrospective and prospective assessments of the Diebold-Yilmaz connectedness research program, combined with personal recollections of its development. Its centerpiece in many respects is Diebold and Yilmaz (2014), around which our discussion is organized. △ Less

Submitted 9 January, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

arXiv:2204.03120 [pdf]

AutoCOR: Autonomous Condylar Offset Ratio Calculator on TKA-Postoperative Lateral Knee X-ray

Authors: Gulsade Rabia Cakmak, Ibrahim Ethem Hamamci, Mehmet Kursat Yilmaz, Reda Alhajj, Ibrahim Azboy, Mehmet Kemal Ozdemir

Abstract: The postoperative range of motion is one of the crucial factors indicating the outcome of Total Knee Arthroplasty (TKA). Although the correlation between range of knee flexion and posterior condylar offset (PCO) is controversial in the literature, PCO maintains its importance on evaluation of TKA. Due to limitations on PCO measurement, two novel parameters, posterior condylar offset ratio (PCOR) a… ▽ More The postoperative range of motion is one of the crucial factors indicating the outcome of Total Knee Arthroplasty (TKA). Although the correlation between range of knee flexion and posterior condylar offset (PCO) is controversial in the literature, PCO maintains its importance on evaluation of TKA. Due to limitations on PCO measurement, two novel parameters, posterior condylar offset ratio (PCOR) and anterior condylar offset ratio (ACOR), were introduced. Nowadays, the calculation of PCOR and ACOR on plain lateral radiographs is done manually by orthopedic surgeons. In this regard, we developed a software, AutoCOR, to calculate PCOR and ACOR autonomously, utilizing unsupervised machine learning algorithm (k-means clustering) and digital image processing techniques. The software AutoCOR is capable of detecting the anterior/posterior edge points and anterior/posterior cortex of the femoral shaft on true postoperative lateral conventional radiographs. To test the algorithm, 50 postoperative true lateral radiographs from Istanbul Kosuyolu Medipol Hospital Database were used (32 patients). The mean PCOR was 0.984 (SD 0.235) in software results and 0.972 (SD 0.164) in ground truth values. It shows strong and significant correlation between software and ground truth values (Pearson r=0.845 p<0.0001). The mean ACOR was 0.107 (SD 0.092) in software results and 0.107 (SD 0.070) in ground truth values. It shows moderate and significant correlation between software and ground truth values (Spearman's rs=0.519 p=0.0001412). We suggest that AutoCOR is a useful tool that can be used in clinical practice. △ Less

Submitted 6 April, 2022; originally announced April 2022.

Comments: 9 pages

MSC Class: 92C55 (Primary)

arXiv:2102.02095 [pdf, other]

Stabilization of higher order Schrödinger equations on a finite interval: Part II

Authors: Türker Özsarı, Kemal Cem Yılmaz

Abstract: Backstep** based controller and observer models were designed for higher order linear and nonlinear Schrödinger equations on a finite interval in Part I of this study where the controller was assumed to be acting from the left endpoint of the medium. In this companion paper, we further the analysis by considering boundary controller(s) acting at the right endpoint of the domain. It turns out tha… ▽ More Backstep** based controller and observer models were designed for higher order linear and nonlinear Schrödinger equations on a finite interval in Part I of this study where the controller was assumed to be acting from the left endpoint of the medium. In this companion paper, we further the analysis by considering boundary controller(s) acting at the right endpoint of the domain. It turns out that the problem is more challenging in this scenario as the associated boundary value problem for the backstep** kernel becomes overdetermined and lacks a smooth solution. The latter is essential to switch back and forth between the original plant and the so called target system. To overcome this difficulty we rely on the strategy of using an imperfect kernel, namely one of the boundary conditions in kernel PDE model is disregarded. The drawback is that one loses rapid stabilization in comparison with the left endpoint controllability. Nevertheless, the exponential decay of the $L^2$-norm with a certain rate still holds. The observer design is associated with new challenges from the point of view of wellposedness and one has to prove smoothing properties for an associated initial boundary value problem with inhomogeneous boundary data. This problem is solved by using Laplace transform in time. However, the Bromwich integral that inverts the transformed solution is associated with certain analyticity issues which are treated through a subtle analysis. Numerical algorithms and simulations verifying the theoretical results are given. △ Less

Submitted 3 February, 2021; originally announced February 2021.

Comments: 78 pages, 16 figures, 1 table

MSC Class: 35Q93; 93B52; 93C20; 93D15; 93D20; 93D23 (primary) and 35A01; 35A02; 35Q55; 35Q60 (secondary)

arXiv:2007.03948 [pdf, other]

doi 10.3390/ai2020010

A Study of Learning Search Approximation in Mixed Integer Branch and Bound: Node Selection in SCIP

Authors: Kaan Yilmaz, Neil Yorke-Smith

Abstract: In line with the growing trend of using machine learning to help solve combinatorial optimisation problems, one promising idea is to improve node selection within a mixed integer programming (MIP) branch-and-bound tree by using a learned policy. Previous work using imitation learning indicates the feasibility of acquiring a node selection policy, by learning an adaptive node searching order. In co… ▽ More In line with the growing trend of using machine learning to help solve combinatorial optimisation problems, one promising idea is to improve node selection within a mixed integer programming (MIP) branch-and-bound tree by using a learned policy. Previous work using imitation learning indicates the feasibility of acquiring a node selection policy, by learning an adaptive node searching order. In contrast, our imitation learning policy is focused solely on learning which of a node's children to select. We present an offline method to learn such a policy in two settings: one that comprises a heuristic by committing to pruning of nodes; one that is exact and backtracks from a leaf to guarantee finding the optimal integer solution. The former setting corresponds to a child selector during plunging, while the latter is akin to a diving heuristic. We apply the policy within the popular open-source solver SCIP, in both heuristic and exact settings. Empirical results on five MIP datasets indicate that our node selection policy leads to solutions significantly more quickly than the state-of-the-art precedent in the literature. While we do not beat the highly-optimised SCIP state-of-practice baseline node selector in terms of solving time on exact solutions, our heuristic policies have a consistently better optimality gap than all baselines, if the accuracy of the predictive model is sufficient. Further, the results also indicate that, when a time limit is applied, our heuristic method finds better solutions than all baselines in the majority of problems tested. We explain the results by showing that the learned policies have imitated the SCIP baseline, but without the latter's early plunge abort. Our recommendation is that, despite the clear improvements over the literature, this kind of MIP child selector is better seen in a broader approach using learning in MIP branch-and-bound tree decisions. △ Less

Submitted 3 January, 2022; v1 submitted 8 July, 2020; originally announced July 2020.

Comments: Authors' version, not publisher's final version which is available at DOI

MSC Class: 90C11 ACM Class: I.2.6; I.2.8

Journal ref: AI, volume 2, number 2, pages 150-178, 2021

arXiv:1910.11713 [pdf, other]

ALET (Automated Labeling of Equipment and Tools): A Dataset, a Baseline and a Usecase for Tool Detection in the Wild

Authors: Fatih Can Kurnaz, Burak Hocaoğlu, Mert Kaan Yılmaz, İdil Sülo, Sinan Kalkan

Abstract: Robots collaborating with humans in realistic environments will need to be able to detect the tools that can be used and manipulated. However, there is no available dataset or study that addresses this challenge in real settings. In this paper, we fill this gap by providing an extensive dataset (METU-ALET) for detecting farming, gardening, office, stonemasonry, vehicle, woodworking and workshop to… ▽ More Robots collaborating with humans in realistic environments will need to be able to detect the tools that can be used and manipulated. However, there is no available dataset or study that addresses this challenge in real settings. In this paper, we fill this gap by providing an extensive dataset (METU-ALET) for detecting farming, gardening, office, stonemasonry, vehicle, woodworking and workshop tools. The scenes correspond to sophisticated environments with or without humans using the tools. The scenes we consider introduce several challenges for object detection, including the small scale of the tools, their articulated nature, occlusion, inter-class invariance, etc. Moreover, we train and compare several state of the art deep object detectors (including Faster R-CNN, Cascade R-CNN, RepPoint and RetinaNet) on our dataset. We observe that the detectors have difficulty in detecting especially small-scale tools or tools that are visually similar to parts of other tools. This in turn supports the importance of our dataset and paper. With the dataset, the code and the trained models, our work provides a basis for further research into tools and their use in robotics applications. △ Less

Submitted 13 December, 2020; v1 submitted 25 October, 2019; originally announced October 2019.

Comments: 7 pages, 4 figures

arXiv:1908.11180 [pdf, other]

Stabilization of higher order Schrödinger equations on a finite interval: Part I

Authors: Ahmet Batal, Türker Özsarı, Kemal Cem Yılmaz

Abstract: We study the backstep** stabilization of higher order linear and nonlinear Schrödinger equations on a finite interval, where the boundary feedback acts from the left Dirichlet boundary condition. The plant is stabilized with a prescribed rate of decay. The construction of the backstep** kernel is based on a challenging successive approximation analysis. This contrasts with the case of second o… ▽ More We study the backstep** stabilization of higher order linear and nonlinear Schrödinger equations on a finite interval, where the boundary feedback acts from the left Dirichlet boundary condition. The plant is stabilized with a prescribed rate of decay. The construction of the backstep** kernel is based on a challenging successive approximation analysis. This contrasts with the case of second order pdes. Second, we consider the case where the full state of the system cannot be measured at all times but some partial information such as measurements of a boundary trace are available. For this problem, we simultaneously construct an observer and the associated backstep** controller which is capable of stabilizing the original plant. Wellposedness and regularity results are provided for all pde models. Although the linear part of the model is similar to the KdV equation, the power type nonlinearity brings additional difficulties. We give two examples of boundary conditions and partial measurements. We also present numerical algorithms and simulations verifying our theoretical results to the fullest extent. Our numerical approach is novel in the sense that we solve the target systems first and obtain the solution to the feedback system by using the bounded invertibility of the backstep** transformation. △ Less

Submitted 13 September, 2020; v1 submitted 29 August, 2019; originally announced August 2019.

Comments: 62 pages, 17 figures

MSC Class: 35Q93; 93B52; 93C20; 93D15; 93D20; 93D23 (primary); 35A01; 35A02; 35Q55; 35Q60 (secondary)

arXiv:1809.05699 [pdf]

Inferring Political Alignments of Twitter Users: A case study on 2017 Turkish constitutional referendum

Authors: Kutlu Emre Yilmaz, Osman Abul

Abstract: Increasing popularity of Twitter in politics is subject to commercial and academic interest. To fully exploit the merits of this platform, reaching the target audience with desired political leanings is critical. This paper extends the research on inferring political orientations of Twitter users to the case of 2017 Turkish constitutional referendum. After constructing a targeted dataset of tweets… ▽ More Increasing popularity of Twitter in politics is subject to commercial and academic interest. To fully exploit the merits of this platform, reaching the target audience with desired political leanings is critical. This paper extends the research on inferring political orientations of Twitter users to the case of 2017 Turkish constitutional referendum. After constructing a targeted dataset of tweets, we explore several types of potential features to build accurate machine learning based predictive models. In our experiments, a three-class support vector machine (SVM) classifier trained on semantic features achieves the best accuracy score of 89.9%. Moreover, an SVM classifier trained on full-text features performs better than an SVM classifier trained on hashtags, with respective accuracy scores of 89.05% and 85.9%. Relatively high accuracy scores obtained by full-text features may point to differences in language use, which deserves further research. △ Less

Submitted 15 September, 2018; originally announced September 2018.

arXiv:1409.8083 [pdf, other]

Variational Inference For Probabilistic Latent Tensor Factorization with KL Divergence

Authors: Beyza Ermis, Y. Kenan Yılmaz, A. Taylan Cemgil, Evrim Acar

Abstract: Probabilistic Latent Tensor Factorization (PLTF) is a recently proposed probabilistic framework for modelling multi-way data. Not only the common tensor factorization models but also any arbitrary tensor factorization structure can be realized by the PLTF framework. This paper presents full Bayesian inference via variational Bayes that facilitates more powerful modelling and allows more sophistica… ▽ More Probabilistic Latent Tensor Factorization (PLTF) is a recently proposed probabilistic framework for modelling multi-way data. Not only the common tensor factorization models but also any arbitrary tensor factorization structure can be realized by the PLTF framework. This paper presents full Bayesian inference via variational Bayes that facilitates more powerful modelling and allows more sophisticated inference on the PLTF framework. We illustrate our approach on model order selection and link prediction. △ Less

Submitted 29 September, 2014; originally announced September 2014.

arXiv:1405.1740 [pdf]

Turkish Text Retrieval Experiments Using Lemur Toolkit

Authors: Kutlu Emre Yılmaz, Ahmet Arslan, Ozgur Yilmazel

Abstract: We used Lemur Toolkit, an open source toolkit designed for Information Retrieval (IR) research, for our automated indexing and retrieval experiments on a TREC-like test collection for Turkish. We study and compare three retrieval models Lemur supports, especially Language modeling approach to IR, combined with language specific preprocessing techniques. Our experiments show that all retrieval mode… ▽ More We used Lemur Toolkit, an open source toolkit designed for Information Retrieval (IR) research, for our automated indexing and retrieval experiments on a TREC-like test collection for Turkish. We study and compare three retrieval models Lemur supports, especially Language modeling approach to IR, combined with language specific preprocessing techniques. Our experiments show that all retrieval models benefits from language specific preprocessing in terms of retrieval quality. Also Language Modeling approach is the best performing retrieval model when language specific preprocessing applied. △ Less

Submitted 7 May, 2014; originally announced May 2014.

Comments: 3 pages

Journal ref: IADIS AC 2009: Rome, Italy

arXiv:1405.1717 [pdf, other]

Entropy Based Cartoon Texture Separation

Authors: Kutlu Emre Yilmaz

Abstract: Separating an image into cartoon and texture components comes useful in image processing applications, such as image compression, image segmentation, image inpainting. Yves Meyer's influential cartoon texture decomposition model involves deriving an energy functional by choosing appropriate spaces and functionals. Minimizers of the derived energy functional are cartoon and texture components of an… ▽ More Separating an image into cartoon and texture components comes useful in image processing applications, such as image compression, image segmentation, image inpainting. Yves Meyer's influential cartoon texture decomposition model involves deriving an energy functional by choosing appropriate spaces and functionals. Minimizers of the derived energy functional are cartoon and texture components of an image. In this study, cartoon part of an image is separated, by reconstructing it from pixels of multi scale Total-Variation filtered versions of the original image which is sought to be decomposed into cartoon and texture parts. An information theoretic pixel by pixel selection criteria is employed to choose the contributing pixels and their scales. △ Less

Submitted 7 May, 2014; originally announced May 2014.

Comments: 12 pages

arXiv:1306.3530 [pdf, other]

Generalized Beta Divergence

Authors: Y. Kenan Yilmaz

Abstract: This paper generalizes beta divergence beyond its classical form associated with power variance functions of Tweedie models. Generalized form is represented by a compact definite integral as a function of variance function of the exponential dispersion model. This compact integral form simplifies derivations of many properties such as scaling, translation and expectation of the beta divergence. Fu… ▽ More This paper generalizes beta divergence beyond its classical form associated with power variance functions of Tweedie models. Generalized form is represented by a compact definite integral as a function of variance function of the exponential dispersion model. This compact integral form simplifies derivations of many properties such as scaling, translation and expectation of the beta divergence. Further, we show that beta divergence and (half of) the statistical deviance are equivalent measures. △ Less

Submitted 18 June, 2013; v1 submitted 14 June, 2013; originally announced June 2013.

arXiv:1209.4280 [pdf, ps, other]

Alpha/Beta Divergences and Tweedie Models

Authors: Y. Kenan Yilmaz, A. Taylan Cemgil

Abstract: We describe the underlying probabilistic interpretation of alpha and beta divergences. We first show that beta divergences are inherently tied to Tweedie distributions, a particular type of exponential family, known as exponential dispersion models. Starting from the variance function of a Tweedie model, we outline how to get alpha and beta divergences as special cases of Csiszár's $f$ and Bregman… ▽ More We describe the underlying probabilistic interpretation of alpha and beta divergences. We first show that beta divergences are inherently tied to Tweedie distributions, a particular type of exponential family, known as exponential dispersion models. Starting from the variance function of a Tweedie model, we outline how to get alpha and beta divergences as special cases of Csiszár's $f$ and Bregman divergences. This result directly generalizes the well-known relationship between the Gaussian distribution and least squares estimation to Tweedie models and beta divergence minimization. △ Less

Submitted 19 September, 2012; originally announced September 2012.

Showing 1–16 of 16 results for author: Yılmaz, K