-
Calibrating Bayesian UNet++ for Sub-Seasonal Forecasting
Authors:
Busra Asan,
Abdullah Akgül,
Alper Unal,
Melih Kandemir,
Gozde Unal
Abstract:
Seasonal forecasting is a crucial task when it comes to detecting the extreme heat and colds that occur due to climate change. Confidence in the predictions should be reliable since a small increase in the temperatures in a year has a big impact on the world. Calibration of the neural networks provides a way to ensure our confidence in the predictions. However, calibrating regression models is an…
▽ More
Seasonal forecasting is a crucial task when it comes to detecting the extreme heat and colds that occur due to climate change. Confidence in the predictions should be reliable since a small increase in the temperatures in a year has a big impact on the world. Calibration of the neural networks provides a way to ensure our confidence in the predictions. However, calibrating regression models is an under-researched topic, especially in forecasters. We calibrate a UNet++ based architecture, which was shown to outperform physics-based models in temperature anomalies. We show that with a slight trade-off between prediction error and calibration error, it is possible to get more reliable and sharper forecasts. We believe that calibration should be an important part of safety-critical machine learning applications such as weather forecasters.
△ Less
Submitted 4 April, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
PCLD: Point Cloud Layerwise Diffusion for Adversarial Purification
Authors:
Mert Gulsen,
Batuhan Cengiz,
Yusuf H. Sahin,
Gozde Unal
Abstract:
Point clouds are extensively employed in a variety of real-world applications such as robotics, autonomous driving and augmented reality. Despite the recent success of point cloud neural networks, especially for safety-critical tasks, it is essential to also ensure the robustness of the model. A typical way to assess a model's robustness is through adversarial attacks, where test-time examples are…
▽ More
Point clouds are extensively employed in a variety of real-world applications such as robotics, autonomous driving and augmented reality. Despite the recent success of point cloud neural networks, especially for safety-critical tasks, it is essential to also ensure the robustness of the model. A typical way to assess a model's robustness is through adversarial attacks, where test-time examples are generated based on gradients to deceive the model. While many different defense mechanisms are studied in 2D, studies on 3D point clouds have been relatively limited in the academic field. Inspired from PointDP, which denoises the network inputs by diffusion, we propose Point Cloud Layerwise Diffusion (PCLD), a layerwise diffusion based 3D point cloud defense strategy. Unlike PointDP, we propagated the diffusion denoising after each layer to incrementally enhance the results. We apply our defense method to different types of commonly used point cloud models and adversarial attacks to evaluate its robustness. Our experiments demonstrate that the proposed defense method achieved results that are comparable to or surpass those of existing methodologies, establishing robustness through a novel technique. Code is available at https://github.com/batuceng/diffusion-layer-robustness-pc.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
epsilon-Mesh Attack: A Surface-based Adversarial Point Cloud Attack for Facial Expression Recognition
Authors:
Batuhan Cengiz,
Mert Gulsen,
Yusuf H. Sahin,
Gozde Unal
Abstract:
Point clouds and meshes are widely used 3D data structures for many computer vision applications. While the meshes represent the surfaces of an object, point cloud represents sampled points from the surface which is also the output of modern sensors such as LiDAR and RGB-D cameras. Due to the wide application area of point clouds and the recent advancements in deep neural networks, studies focusin…
▽ More
Point clouds and meshes are widely used 3D data structures for many computer vision applications. While the meshes represent the surfaces of an object, point cloud represents sampled points from the surface which is also the output of modern sensors such as LiDAR and RGB-D cameras. Due to the wide application area of point clouds and the recent advancements in deep neural networks, studies focusing on robust classification of the 3D point cloud data emerged. To evaluate the robustness of deep classifier networks, a common method is to use adversarial attacks where the gradient direction is followed to change the input slightly. The previous studies on adversarial attacks are generally evaluated on point clouds of daily objects. However, considering 3D faces, these adversarial attacks tend to affect the person's facial structure more than the desired amount and cause malformation. Specifically for facial expressions, even a small adversarial attack can have a significant effect on the face structure. In this paper, we suggest an adversarial attack called $ε$-Mesh Attack, which operates on point cloud data via limiting perturbations to be on the mesh surface. We also parameterize our attack by $ε$ to scale the perturbation mesh. Our surface-based attack has tighter perturbation bounds compared to $L_2$ and $L_\infty$ norm bounded attacks that operate on unit-ball. Even though our method has additional constraints, our experiments on CoMA, Bosphorus and FaceWarehouse datasets show that $ε$-Mesh Attack (Perpendicular) successfully confuses trained DGCNN and PointNet models $99.72\%$ and $97.06\%$ of the time, with indistinguishable facial deformations. The code is available at https://github.com/batuceng/e-mesh-attack.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders
Authors:
Gulcin Baykal,
Melih Kandemir,
Gozde Unal
Abstract:
Codebook collapse is a common problem in training deep generative models with discrete representation spaces like Vector Quantized Variational Autoencoders (VQ-VAEs). We observe that the same problem arises for the alternatively designed discrete variational autoencoders (dVAEs) whose encoder directly learns a distribution over the codebook embeddings to represent the data. We hypothesize that usi…
▽ More
Codebook collapse is a common problem in training deep generative models with discrete representation spaces like Vector Quantized Variational Autoencoders (VQ-VAEs). We observe that the same problem arises for the alternatively designed discrete variational autoencoders (dVAEs) whose encoder directly learns a distribution over the codebook embeddings to represent the data. We hypothesize that using the softmax function to obtain a probability distribution causes the codebook collapse by assigning overconfident probabilities to the best matching codebook elements. In this paper, we propose a novel way to incorporate evidential deep learning (EDL) instead of softmax to combat the codebook collapse problem of dVAE. We evidentially monitor the significance of attaining the probability distribution over the codebook embeddings, in contrast to softmax usage. Our experiments using various datasets show that our model, called EdVAE, mitigates codebook collapse while improving the reconstruction performance, and enhances the codebook usage compared to dVAE and VQ-VAE based models. Our code can be found at https://github.com/ituvisionlab/EdVAE .
△ Less
Submitted 12 December, 2023; v1 submitted 9 October, 2023;
originally announced October 2023.
-
ProtoDiffusion: Classifier-Free Diffusion Guidance with Prototype Learning
Authors:
Gulcin Baykal,
Halil Faruk Karagoz,
Taha Binhuraib,
Gozde Unal
Abstract:
Diffusion models are generative models that have shown significant advantages compared to other generative models in terms of higher generation quality and more stable training. However, the computational need for training diffusion models is considerably increased. In this work, we incorporate prototype learning into diffusion models to achieve high generation quality faster than the original dif…
▽ More
Diffusion models are generative models that have shown significant advantages compared to other generative models in terms of higher generation quality and more stable training. However, the computational need for training diffusion models is considerably increased. In this work, we incorporate prototype learning into diffusion models to achieve high generation quality faster than the original diffusion model. Instead of randomly initialized class embeddings, we use separately learned class prototypes as the conditioning information to guide the diffusion process. We observe that our method, called ProtoDiffusion, achieves better performance in the early stages of training compared to the baseline method, signifying that using the learned prototypes shortens the training time. We demonstrate the performance of ProtoDiffusion using various datasets and experimental settings, achieving the best performance in shorter times across all settings.
△ Less
Submitted 4 July, 2023;
originally announced July 2023.
-
Non-Abelian Magnetic Field and Curvature Effects on Pair Production
Authors:
S. Kürkçüoğlu,
B. Özcan,
G. Ünal
Abstract:
We calculate the Schwinger pair production rates in $\mathbb{R}^{3,1}$ as well as in the positively curved space $S^2 \times \mathbb{R}^{1,1}$ for both spin-$0$ and spin-$\frac{1}{2}$ particles under the influence of an external $SU(2) \times U(1)$ gauge field producing an additional uniform non-abelian magnetic field besides the usual uniform abelian electric field. To this end, we determine and…
▽ More
We calculate the Schwinger pair production rates in $\mathbb{R}^{3,1}$ as well as in the positively curved space $S^2 \times \mathbb{R}^{1,1}$ for both spin-$0$ and spin-$\frac{1}{2}$ particles under the influence of an external $SU(2) \times U(1)$ gauge field producing an additional uniform non-abelian magnetic field besides the usual uniform abelian electric field. To this end, we determine and subsequently make use of the spectrum of the gauged Laplace and Dirac operators on both the flat and the curved geometries. We find that there are regimes in which the purely non-abelian and the abelian parts of the gauge field strength have either a counterplaying or reinforcing role, whose overall effect may be to enhance or suppress the pair production rates. Positive curvature tends to enhance the latter for spin-$0$ and suppress it for spin-$\frac{1}{2}$ fields, while the details of the couplings to the purely abelian and the non-abelian parts of the magnetic field, which are extracted from the spectrum of the Laplace and Dirac operators on $S^2$, determine the cumulative effect on the pair production rates. These features are elaborated in detail.
△ Less
Submitted 30 July, 2023; v1 submitted 7 June, 2023;
originally announced June 2023.
-
Textile Pattern Generation Using Diffusion Models
Authors:
Halil Faruk Karagoz,
Gulcin Baykal,
Irem Arikan Eksi,
Gozde Unal
Abstract:
The problem of text-guided image generation is a complex task in Computer Vision, with various applications, including creating visually appealing artwork and realistic product images. One popular solution widely used for this task is the diffusion model, a generative model that generates images through an iterative process. Although diffusion models have demonstrated promising results for various…
▽ More
The problem of text-guided image generation is a complex task in Computer Vision, with various applications, including creating visually appealing artwork and realistic product images. One popular solution widely used for this task is the diffusion model, a generative model that generates images through an iterative process. Although diffusion models have demonstrated promising results for various image generation tasks, they may only sometimes produce satisfactory results when applied to more specific domains, such as the generation of textile patterns based on text guidance. This study presents a fine-tuned diffusion model specifically trained for textile pattern generation by text guidance to address this issue. The study involves the collection of various textile pattern images and their captioning with the help of another AI model. The fine-tuned diffusion model is trained with this newly created dataset, and its results are compared with the baseline models visually and numerically. The results demonstrate that the proposed fine-tuned diffusion model outperforms the baseline models in terms of pattern quality and efficiency in textile pattern generation by text guidance. This study presents a promising solution to the problem of text-guided textile pattern generation and has the potential to simplify the design process within the textile industry.
△ Less
Submitted 2 April, 2023;
originally announced April 2023.
-
GaussianMLR: Learning Implicit Class Significance via Calibrated Multi-Label Ranking
Authors:
V. Bugra Yesilkaynak,
Emine Dari,
Alican Mertan,
Gozde Unal
Abstract:
Existing multi-label frameworks only exploit the information deduced from the bipartition of the labels into a positive and negative set. Therefore, they do not benefit from the ranking order between positive labels, which is the concept we introduce in this paper. We propose a novel multi-label ranking method: GaussianMLR, which aims to learn implicit class significance values that determine the…
▽ More
Existing multi-label frameworks only exploit the information deduced from the bipartition of the labels into a positive and negative set. Therefore, they do not benefit from the ranking order between positive labels, which is the concept we introduce in this paper. We propose a novel multi-label ranking method: GaussianMLR, which aims to learn implicit class significance values that determine the positive label ranks instead of treating them as of equal importance, by following an approach that unifies ranking and classification tasks associated with multi-label ranking. Due to the scarcity of public datasets, we introduce eight synthetic datasets generated under varying importance factors to provide an enriched and controllable experimental environment for this study. On both real-world and synthetic datasets, we carry out extensive comparisons with relevant baselines and evaluate the performance on both of the two sub-tasks. We show that our method is able to accurately learn a representation of the incorporated positive rank order, which is not only consistent with the ground truth but also proportional to the underlying information. We strengthen our claims empirically by conducting comprehensive experimental studies. Code is available at https://github.com/MrGranddy/GaussianMLR.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Climate Model Driven Seasonal Forecasting Approach with Deep Learning
Authors:
Alper Unal,
Busra Asan,
Ismail Sezen,
Bugra Yesilkaynak,
Yusuf Aydin,
Mehmet Ilicak,
Gozde Unal
Abstract:
Understanding seasonal climatic conditions is critical for better management of resources such as water, energy and agriculture. Recently, there has been a great interest in utilizing the power of artificial intelligence methods in climate studies. This paper presents a cutting-edge deep learning model (UNet++) trained by state-of-the-art global CMIP6 models to forecast global temperatures a month…
▽ More
Understanding seasonal climatic conditions is critical for better management of resources such as water, energy and agriculture. Recently, there has been a great interest in utilizing the power of artificial intelligence methods in climate studies. This paper presents a cutting-edge deep learning model (UNet++) trained by state-of-the-art global CMIP6 models to forecast global temperatures a month ahead using the ERA5 reanalysis dataset. ERA5 dataset was also used for finetuning as well performance analysis in the validation dataset. Three different setups (CMIP6; CMIP6 + elevation; CMIP6 + elevation + ERA5 finetuning) were used with both UNet and UNet++ algorithms resulting in six different models. For each model 14 different sequential and non-sequential temporal settings were used. The Mean Absolute Error (MAE) analysis revealed that UNet++ with CMIP6 with elevation and ERA5 finetuning model with "Year 3 Month 2" temporal case provided the best outcome with an MAE of 0.7. Regression analysis over the validation dataset between the ERA5 data values and the corresponding AI model predictions revealed slope and $R^2$ values close to 1 suggesting a very good agreement. The AI model predicts significantly better than the mean CMIP6 ensemble between 2016 and 2021. Both models predict the summer months more accurately than the winter months.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
RLSEP: Learning Label Ranks for Multi-label Classification
Authors:
Emine Dari,
V. Bugra Yesilkaynak,
Alican Mertan,
Gozde Unal
Abstract:
Multi-label ranking maps instances to a ranked set of predicted labels from multiple possible classes. The ranking approach for multi-label learning problems received attention for its success in multi-label classification, with one of the well-known approaches being pairwise label ranking. However, most existing methods assume that only partial information about the preference relation is known,…
▽ More
Multi-label ranking maps instances to a ranked set of predicted labels from multiple possible classes. The ranking approach for multi-label learning problems received attention for its success in multi-label classification, with one of the well-known approaches being pairwise label ranking. However, most existing methods assume that only partial information about the preference relation is known, which is inferred from the partition of labels into a positive and negative set, then treat labels with equal importance. In this paper, we focus on the unique challenge of ranking when the order of the true label set is provided. We propose a novel dedicated loss function to optimize models by incorporating penalties for incorrectly ranked pairs, and make use of the ranking information present in the input. Our method achieves the best reported performance measures on both synthetic and real world ranked datasets and shows improvements on overall ranking of labels. Our experimental results demonstrate that our approach is generalizable to a variety of multi-label classification and ranking tasks, while revealing a calibration towards a certain ranking ordering.
△ Less
Submitted 7 December, 2022;
originally announced December 2022.
-
Symmetry and Variance: Generative Parametric Modelling of Historical Brick Wall Patterns
Authors:
Sevgi Altun,
Mustafa Cem Gunes,
Yusuf H. Sahin,
Alican Mertan,
Gozde Unal,
Mine Ozkar
Abstract:
This study integrates artificial intelligence and computational design tools to extract information from architectural heritage. Photogrammetry-based point cloud models of brick walls from the Anatolian Seljuk period are analysed in terms of the interrelated units of construction, simultaneously considering both the inherent symmetries and irregularities. The real-world data is used as input for a…
▽ More
This study integrates artificial intelligence and computational design tools to extract information from architectural heritage. Photogrammetry-based point cloud models of brick walls from the Anatolian Seljuk period are analysed in terms of the interrelated units of construction, simultaneously considering both the inherent symmetries and irregularities. The real-world data is used as input for acquiring the stochastic parameters of spatial relations and a set of parametric shape rules to recreate designs of existing and hypothetical brick walls within the style. The motivation is to be able to generate large data sets for machine learning of the style and to devise procedures for robotic production of such designs with repetitive units.
△ Less
Submitted 23 October, 2022;
originally announced October 2022.
-
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning
Authors:
Doğay Kamar,
Nazım Kemal Üre,
Gözde Ünal
Abstract:
In this study, we address the problem of efficient exploration in reinforcement learning. Most common exploration approaches depend on random action selection, however these approaches do not work well in environments with sparse or no rewards. We propose Generative Adversarial Network-based Intrinsic Reward Module that learns the distribution of the observed states and sends an intrinsic reward t…
▽ More
In this study, we address the problem of efficient exploration in reinforcement learning. Most common exploration approaches depend on random action selection, however these approaches do not work well in environments with sparse or no rewards. We propose Generative Adversarial Network-based Intrinsic Reward Module that learns the distribution of the observed states and sends an intrinsic reward that is computed as high for states that are out of distribution, in order to lead agent to unexplored states. We evaluate our approach in Super Mario Bros for a no reward setting and in Montezuma's Revenge for a sparse reward setting and show that our approach is indeed capable of exploring efficiently. We discuss a few weaknesses and conclude by discussing future works.
△ Less
Submitted 28 June, 2022;
originally announced June 2022.
-
How to Combine Variational Bayesian Networks in Federated Learning
Authors:
Atahan Ozer,
Kadir Burak Buldu,
Abdullah Akgül,
Gozde Unal
Abstract:
Federated Learning enables multiple data centers to train a central model collaboratively without exposing any confidential data. Even though deterministic models are capable of performing high prediction accuracy, their lack of calibration and capability to quantify uncertainty is problematic for safety-critical applications. Different from deterministic models, probabilistic models such as Bayes…
▽ More
Federated Learning enables multiple data centers to train a central model collaboratively without exposing any confidential data. Even though deterministic models are capable of performing high prediction accuracy, their lack of calibration and capability to quantify uncertainty is problematic for safety-critical applications. Different from deterministic models, probabilistic models such as Bayesian neural networks are relatively well-calibrated and able to quantify uncertainty alongside their competitive prediction accuracy. Both of the approaches appear in the federated learning framework; however, the aggregation scheme of deterministic models cannot be directly applied to probabilistic models since weights correspond to distributions instead of point estimates. In this work, we study the effects of various aggregation schemes for variational Bayesian neural networks. With empirical results on three image classification datasets, we observe that the degree of spread for an aggregated distribution is a significant factor in the learning process. Hence, we present an investigation on the question of how to combine variational Bayesian networks in federated learning, while providing benchmarks for different aggregation settings.
△ Less
Submitted 23 November, 2022; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Continual Learning of Multi-modal Dynamics with External Memory
Authors:
Abdullah Akgül,
Gozde Unal,
Melih Kandemir
Abstract:
We study the problem of fitting a model to a dynamical environment when new modes of behavior emerge sequentially. The learning model is aware when a new mode appears, but it cannot access the true modes of individual training sequences. The state-of-the-art continual learning approaches cannot handle this setup, because parameter transfer suffers from catastrophic interference and episodic memory…
▽ More
We study the problem of fitting a model to a dynamical environment when new modes of behavior emerge sequentially. The learning model is aware when a new mode appears, but it cannot access the true modes of individual training sequences. The state-of-the-art continual learning approaches cannot handle this setup, because parameter transfer suffers from catastrophic interference and episodic memory design requires the knowledge of the ground-truth modes of sequences. We devise a novel continual learning method that overcomes both limitations by maintaining a \textit{descriptor} of the mode of an encountered sequence in a neural episodic memory. We employ a Dirichlet Process prior on the attention weights of the memory to foster efficient storage of the mode descriptors. Our method performs continual learning by transferring knowledge across tasks by retrieving the descriptors of similar modes of past tasks to the mode of a current sequence and feeding this descriptor into its transition kernel as control input. We observe the continual learning performance of our method to compare favorably to the mainstream parameter transfer approach.
△ Less
Submitted 9 May, 2024; v1 submitted 2 March, 2022;
originally announced March 2022.
-
The Phase-I Trigger Readout Electronics Upgrade of the ATLAS Liquid Argon Calorimeters
Authors:
G. Aad,
A. V. Akimov,
K. Al Khoury,
M. Aleksa,
T. Andeen,
C. Anelli,
N. Aranzabal,
C. Armijo,
A. Bagulia,
J. Ban,
T. Barillari,
F. Bellachia,
M. Benoit,
F. Bernon,
A. Berthold,
H. Bervas,
D. Besin,
A. Betti,
Y. Bianga,
M. Biaut,
D. Boline,
J. Boudreau,
T. Bouedo,
N. Braam,
M. Cano Bret
, et al. (173 additional authors not shown)
Abstract:
The Phase-I trigger readout electronics upgrade of the ATLAS Liquid Argon calorimeters enhances the physics reach of the experiment during the upcoming operation at increasing Large Hadron Collider luminosities. The new system, installed during the second Large Hadron Collider Long Shutdown, increases the trigger readout granularity by up to a factor of ten as well as its precision and range. Cons…
▽ More
The Phase-I trigger readout electronics upgrade of the ATLAS Liquid Argon calorimeters enhances the physics reach of the experiment during the upcoming operation at increasing Large Hadron Collider luminosities. The new system, installed during the second Large Hadron Collider Long Shutdown, increases the trigger readout granularity by up to a factor of ten as well as its precision and range. Consequently, the background rejection at trigger level is improved through enhanced filtering algorithms utilizing the additional information for topological discrimination of electromagnetic and hadronic shower shapes. This paper presents the final designs of the new electronic elements, their custom electronic devices, the procedures used to validate their proper functioning, and the performance achieved during the commissioning of this system.
△ Less
Submitted 16 May, 2022; v1 submitted 15 February, 2022;
originally announced February 2022.
-
Detecting Visual Design Principles in Art and Architecture through Deep Convolutional Neural Networks
Authors:
Gozdenur Demir,
Asli Cekmis,
Vahit Bugra Yesilkaynak,
Gozde Unal
Abstract:
Visual design is associated with the use of some basic design elements and principles. Those are applied by the designers in the various disciplines for aesthetic purposes, relying on an intuitive and subjective process. Thus, numerical analysis of design visuals and disclosure of the aesthetic value embedded in them are considered as hard. However, it has become possible with emerging artificial…
▽ More
Visual design is associated with the use of some basic design elements and principles. Those are applied by the designers in the various disciplines for aesthetic purposes, relying on an intuitive and subjective process. Thus, numerical analysis of design visuals and disclosure of the aesthetic value embedded in them are considered as hard. However, it has become possible with emerging artificial intelligence technologies. This research aims at a neural network model, which recognizes and classifies the design principles over different domains. The domains include artwork produced since the late 20th century; professional photos; and facade pictures of contemporary buildings. The data collection and curation processes, including the production of computationally-based synthetic dataset, is genuine. The proposed model learns from the knowledge of myriads of original designs, by capturing the underlying shared patterns. It is expected to consolidate design processes by providing an aesthetic evaluation of the visual compositions with objectivity.
△ Less
Submitted 9 August, 2021;
originally announced August 2021.
-
Uncertainty-Based Dynamic Graph Neighborhoods For Medical Segmentation
Authors:
Ufuk Demir,
Atahan Ozer,
Yusuf H. Sahin,
Gozde Unal
Abstract:
In recent years, deep learning based methods have shown success in essential medical image analysis tasks such as segmentation. Post-processing and refining the results of segmentation is a common practice to decrease the misclassifications originating from the segmentation network. In addition to widely used methods like Conditional Random Fields (CRFs) which focus on the structure of the segment…
▽ More
In recent years, deep learning based methods have shown success in essential medical image analysis tasks such as segmentation. Post-processing and refining the results of segmentation is a common practice to decrease the misclassifications originating from the segmentation network. In addition to widely used methods like Conditional Random Fields (CRFs) which focus on the structure of the segmented volume/area, a graph-based recent approach makes use of certain and uncertain points in a graph and refines the segmentation according to a small graph convolutional network (GCN). However, there are two drawbacks of the approach: most of the edges in the graph are assigned randomly and the GCN is trained independently from the segmentation network. To address these issues, we define a new neighbor-selection mechanism according to feature distances and combine the two networks in the training procedure. According to the experimental results on pancreas segmentation from Computed Tomography (CT) images, we demonstrate improvement in the quantitative measures. Also, examining the dynamic neighbors created by our method, edges between semantically similar image parts are observed. The proposed method also shows qualitative enhancements in the segmentation maps, as demonstrated in the visual results.
△ Less
Submitted 6 August, 2021;
originally announced August 2021.
-
Evidential Turing Processes
Authors:
Melih Kandemir,
Abdullah Akgül,
Manuel Haussmann,
Gozde Unal
Abstract:
A probabilistic classifier with reliable predictive uncertainties i) fits successfully to the target domain data, ii) provides calibrated class probabilities in difficult regions of the target domain (e.g.\ class overlap), and iii) accurately identifies queries coming out of the target domain and rejects them. We introduce an original combination of Evidential Deep Learning, Neural Processes, and…
▽ More
A probabilistic classifier with reliable predictive uncertainties i) fits successfully to the target domain data, ii) provides calibrated class probabilities in difficult regions of the target domain (e.g.\ class overlap), and iii) accurately identifies queries coming out of the target domain and rejects them. We introduce an original combination of Evidential Deep Learning, Neural Processes, and Neural Turing Machines capable of providing all three essential properties mentioned above for total uncertainty quantification. We observe our method on five classification tasks to be the only one that can excel all three aspects of total calibration with a single standalone predictor. Our unified solution delivers an implementation-friendly and compute efficient recipe for safety clearance and provides intellectual economy to an investigation of algorithmic roots of epistemic awareness in deep neural nets.
△ Less
Submitted 8 March, 2022; v1 submitted 2 June, 2021;
originally announced June 2021.
-
Single Image Depth Estimation: An Overview
Authors:
Alican Mertan,
Damien Jade Duff,
Gozde Unal
Abstract:
We review solutions to the problem of depth estimation, arguably the most important subtask in scene understanding. We focus on the single image depth estimation problem. Due to its properties, the single image depth estimation problem is currently best tackled with machine learning methods, most successfully with convolutional neural networks. We provide an overview of the field by examining key…
▽ More
We review solutions to the problem of depth estimation, arguably the most important subtask in scene understanding. We focus on the single image depth estimation problem. Due to its properties, the single image depth estimation problem is currently best tackled with machine learning methods, most successfully with convolutional neural networks. We provide an overview of the field by examining key works. We examine non-deep learning approaches that mostly predate deep learning and utilize hand-crafted features and assumptions, and more recent works that mostly use deep learning techniques. The single image depth estimation problem is tackled first in a supervised fashion with absolute or relative depth information acquired from human or sensor-labeled data, or in an unsupervised way using unlabelled stereo images or video datasets. We also study multitask approaches that combine the depth estimation problem with related tasks such as semantic segmentation and surface normal estimation. Finally, we discuss investigations into the mechanisms, principles, and failure cases of contemporary solutions.
△ Less
Submitted 13 April, 2021;
originally announced April 2021.
-
ODFNet: Using orientation distribution functions to characterize 3D point clouds
Authors:
Yusuf H. Sahin,
Alican Mertan,
Gozde Unal
Abstract:
Learning new representations of 3D point clouds is an active research area in 3D vision, as the order-invariant point cloud structure still presents challenges to the design of neural network architectures. Recent works explored learning either global or local features or both for point clouds, however none of the earlier methods focused on capturing contextual shape information by analysing local…
▽ More
Learning new representations of 3D point clouds is an active research area in 3D vision, as the order-invariant point cloud structure still presents challenges to the design of neural network architectures. Recent works explored learning either global or local features or both for point clouds, however none of the earlier methods focused on capturing contextual shape information by analysing local orientation distribution of points. In this paper, we leverage on point orientation distributions around a point in order to obtain an expressive local neighborhood representation for point clouds. We achieve this by dividing the spherical neighborhood of a given point into predefined cone volumes, and statistics inside each volume are used as point features. In this way, a local patch can be represented by not only the selected point's nearest neighbors, but also considering a point density distribution defined along multiple orientations around the point. We are then able to construct an orientation distribution function (ODF) neural network that involves an ODFBlock which relies on mlp (multi-layer perceptron) layers. The new ODFNet model achieves state-of the-art accuracy for object classification on ModelNet40 and ScanObjectNN datasets, and segmentation on ShapeNet S3DIS datasets.
△ Less
Submitted 15 July, 2022; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Exploring DeshuffleGANs in Self-Supervised Generative Adversarial Networks
Authors:
Gulcin Baykal,
Furkan Ozcelik,
Gozde Unal
Abstract:
Generative Adversarial Networks (GANs) have become the most used networks towards solving the problem of image generation. Self-supervised GANs are later proposed to avoid the catastrophic forgetting of the discriminator and to improve the image generation quality without needing the class labels. However, the generalizability of the self-supervision tasks on different GAN architectures is not stu…
▽ More
Generative Adversarial Networks (GANs) have become the most used networks towards solving the problem of image generation. Self-supervised GANs are later proposed to avoid the catastrophic forgetting of the discriminator and to improve the image generation quality without needing the class labels. However, the generalizability of the self-supervision tasks on different GAN architectures is not studied before. To that end, we extensively analyze the contribution of a previously proposed self-supervision task, deshuffling of the DeshuffleGANs in the generalizability context. We assign the deshuffling task to two different GAN discriminators and study the effects of the task on both architectures. We extend the evaluations compared to the previously proposed DeshuffleGANs on various datasets. We show that the DeshuffleGAN obtains the best FID results for several datasets compared to the other self-supervised GANs. Furthermore, we compare the deshuffling with the rotation prediction that is firstly deployed to the GAN training and demonstrate that its contribution exceeds the rotation prediction. We design the conditional DeshuffleGAN called cDeshuffleGAN to evaluate the quality of the learnt representations. Lastly, we show the contribution of the self-supervision tasks to the GAN training on the loss landscape and present that the effects of these tasks may not be cooperative to the adversarial training in some settings. Our code can be found at https://github.com/gulcinbaykal/DeshuffleGAN.
△ Less
Submitted 1 September, 2021; v1 submitted 3 November, 2020;
originally announced November 2020.
-
A New Distributional Ranking Loss With Uncertainty: Illustrated in Relative Depth Estimation
Authors:
Alican Mertan,
Yusuf Huseyin Sahin,
Damien Jade Duff,
Gozde Unal
Abstract:
We propose a new approach for the problem of relative depth estimation from a single image. Instead of directly regressing over depth scores, we formulate the problem as estimation of a probability distribution over depth and aim to learn the parameters of the distributions which maximize the likelihood of the given data. To train our model, we propose a new ranking loss, Distributional Loss, whic…
▽ More
We propose a new approach for the problem of relative depth estimation from a single image. Instead of directly regressing over depth scores, we formulate the problem as estimation of a probability distribution over depth and aim to learn the parameters of the distributions which maximize the likelihood of the given data. To train our model, we propose a new ranking loss, Distributional Loss, which tries to increase the probability of farther pixel's depth being greater than the closer pixel's depth. Our proposed approach allows our model to output confidence in its estimation in the form of standard deviation of the distribution. We achieve state of the art results against a number of baselines while providing confidence in our estimations. Our analysis show that estimated confidence is actually a good indicator of accuracy. We investigate the usage of confidence information in a downstream task of metric depth estimation, to increase its performance.
△ Less
Submitted 14 October, 2020;
originally announced October 2020.
-
Relative Depth Estimation as a Ranking Problem
Authors:
Alican Mertan,
Damien Jade Duff,
Gozde Unal
Abstract:
We present a formulation of the relative depth estimation from a single image problem, as a ranking problem. By reformulating the problem this way, we were able to utilize literature on the ranking problem, and apply the existing knowledge to achieve better results. To this end, we have introduced a listwise ranking loss borrowed from ranking literature, weighted ListMLE, to the relative depth est…
▽ More
We present a formulation of the relative depth estimation from a single image problem, as a ranking problem. By reformulating the problem this way, we were able to utilize literature on the ranking problem, and apply the existing knowledge to achieve better results. To this end, we have introduced a listwise ranking loss borrowed from ranking literature, weighted ListMLE, to the relative depth estimation problem. We have also brought a new metric which considers pixel depth ranking accuracy, on which our method is stronger.
△ Less
Submitted 14 October, 2020;
originally announced October 2020.
-
EfficientSeg: An Efficient Semantic Segmentation Network
Authors:
Vahit Bugra Yesilkaynak,
Yusuf H. Sahin,
Gozde Unal
Abstract:
Deep neural network training without pre-trained weights and few data is shown to need more training iterations. It is also known that, deeper models are more successful than their shallow counterparts for semantic segmentation task. Thus, we introduce EfficientSeg architecture, a modified and scalable version of U-Net, which can be efficiently trained despite its depth. We evaluated EfficientSeg…
▽ More
Deep neural network training without pre-trained weights and few data is shown to need more training iterations. It is also known that, deeper models are more successful than their shallow counterparts for semantic segmentation task. Thus, we introduce EfficientSeg architecture, a modified and scalable version of U-Net, which can be efficiently trained despite its depth. We evaluated EfficientSeg architecture on Minicity dataset and outperformed U-Net baseline score (40% mIoU) using the same parameter count (51.5% mIoU). Our most successful model obtained 58.1% mIoU score and got the fourth place in semantic segmentation track of ECCV 2020 VIPriors challenge.
△ Less
Submitted 9 October, 2020; v1 submitted 14 September, 2020;
originally announced September 2020.
-
Rethinking CNN-Based Pansharpening: Guided Colorization of Panchromatic Images via GANs
Authors:
Furkan Ozcelik,
Ugur Alganci,
Elif Sertel,
Gozde Unal
Abstract:
Convolutional Neural Networks (CNN)-based approaches have shown promising results in pansharpening of satellite images in recent years. However, they still exhibit limitations in producing high-quality pansharpening outputs. To that end, we propose a new self-supervised learning framework, where we treat pansharpening as a colorization problem, which brings an entirely novel perspective and soluti…
▽ More
Convolutional Neural Networks (CNN)-based approaches have shown promising results in pansharpening of satellite images in recent years. However, they still exhibit limitations in producing high-quality pansharpening outputs. To that end, we propose a new self-supervised learning framework, where we treat pansharpening as a colorization problem, which brings an entirely novel perspective and solution to the problem compared to existing methods that base their solution solely on producing a super-resolution version of the multispectral image. Whereas CNN-based methods provide a reduced resolution panchromatic image as input to their model along with reduced resolution multispectral images, hence learn to increase their resolution together, we instead provide the grayscale transformed multispectral image as input, and train our model to learn the colorization of the grayscale input. We further address the fixed downscale ratio assumption during training, which does not generalize well to the full-resolution scenario. We introduce a noise injection into the training by randomly varying the downsampling ratios. Those two critical changes, along with the addition of adversarial training in the proposed PanColorization Generative Adversarial Networks (PanColorGAN) framework, help overcome the spatial detail loss and blur problems that are observed in CNN-based pansharpening. The proposed approach outperforms the previous CNN-based and traditional methods as demonstrated in our experiments.
△ Less
Submitted 30 June, 2020;
originally announced June 2020.
-
DeshuffleGAN: A Self-Supervised GAN to Improve Structure Learning
Authors:
Gulcin Baykal,
Gozde Unal
Abstract:
Generative Adversarial Networks (GANs) triggered an increased interest in problem of image generation due to their improved output image quality and versatility for expansion towards new methods. Numerous GAN-based works attempt to improve generation by architectural and loss-based extensions. We argue that one of the crucial points to improve the GAN performance in terms of realism and similarity…
▽ More
Generative Adversarial Networks (GANs) triggered an increased interest in problem of image generation due to their improved output image quality and versatility for expansion towards new methods. Numerous GAN-based works attempt to improve generation by architectural and loss-based extensions. We argue that one of the crucial points to improve the GAN performance in terms of realism and similarity to the original data distribution is to be able to provide the model with a capability to learn the spatial structure in data. To that end, we propose the DeshuffleGAN to enhance the learning of the discriminator and the generator, via a self-supervision approach. Specifically, we introduce a deshuffling task that solves a puzzle of randomly shuffled image tiles, which in turn helps the DeshuffleGAN learn to increase its expressive capacity for spatial structure and realistic appearance. We provide experimental evidence for the performance improvement in generated images, compared to the baseline methods, which is consistently observed over two different datasets.
△ Less
Submitted 15 June, 2020;
originally announced June 2020.
-
CHAOS Challenge -- Combined (CT-MR) Healthy Abdominal Organ Segmentation
Authors:
A. Emre Kavur,
N. Sinem Gezer,
Mustafa Barış,
Sinem Aslan,
Pierre-Henri Conze,
Vladimir Groza,
Duc Duy Pham,
Soumick Chatterjee,
Philipp Ernst,
Savaş Özkan,
Bora Baydar,
Dmitry Lachinov,
Shuo Han,
Josef Pauli,
Fabian Isensee,
Matthias Perkonigg,
Rachana Sathish,
Ronnie Rajan,
Debdoot Sheet,
Gurbandurdy Dovletov,
Oliver Speck,
Andreas Nürnberger,
Klaus H. Maier-Hein,
Gözde Bozdağı Akar,
Gözde Ünal
, et al. (2 additional authors not shown)
Abstract:
Segmentation of abdominal organs has been a comprehensive, yet unresolved, research field for many years. In the last decade, intensive developments in deep learning (DL) have introduced new state-of-the-art segmentation systems. In order to expand the knowledge on these topics, the CHAOS - Combined (CT-MR) Healthy Abdominal Organ Segmentation challenge has been organized in conjunction with IEEE…
▽ More
Segmentation of abdominal organs has been a comprehensive, yet unresolved, research field for many years. In the last decade, intensive developments in deep learning (DL) have introduced new state-of-the-art segmentation systems. In order to expand the knowledge on these topics, the CHAOS - Combined (CT-MR) Healthy Abdominal Organ Segmentation challenge has been organized in conjunction with IEEE International Symposium on Biomedical Imaging (ISBI), 2019, in Venice, Italy. CHAOS provides both abdominal CT and MR data from healthy subjects for single and multiple abdominal organ segmentation. Five different but complementary tasks have been designed to analyze the capabilities of current approaches from multiple perspectives. The results are investigated thoroughly, compared with manual annotations and interactive methods. The analysis shows that the performance of DL models for single modality (CT / MR) can show reliable volumetric analysis performance (DICE: 0.98 $\pm$ 0.00 / 0.95 $\pm$ 0.01) but the best MSSD performance remain limited (21.89 $\pm$ 13.94 / 20.85 $\pm$ 10.63 mm). The performances of participating models decrease significantly for cross-modality tasks for the liver (DICE: 0.88 $\pm$ 0.15 MSSD: 36.33 $\pm$ 21.97 mm) and all organs (DICE: 0.85 $\pm$ 0.21 MSSD: 33.17 $\pm$ 38.93 mm). Despite contrary examples on different applications, multi-tasking DL models designed to segment all organs seem to perform worse compared to organ-specific ones (performance drop around 5\%). Besides, such directions of further research for cross-modality segmentation would significantly support real-world clinical applications. Moreover, having more than 1500 participants, another important contribution of the paper is the analysis on shortcomings of challenge organizations such as the effects of multiple submissions and peeking phenomena.
△ Less
Submitted 7 January, 2021; v1 submitted 17 January, 2020;
originally announced January 2020.
-
Charting the European Course to the High-Energy Frontier
Authors:
U. Amaldi,
E. Aslanides,
R. Barate,
C. Benvenuti,
P. Bloch,
T. Camporesi,
A. David,
D. Denegri,
M. Diemoz,
L. Di Lella,
G. Dissertori,
N. Doble,
J. Dumarchez,
J. Ellis,
J. Engelen,
C. Fabjan,
B. Fuks,
P. Gavillet,
A. Hoecker,
J. Iliopoulos,
P. Innocenti,
W. Kozanecki,
P. Lebrun,
C. Llewellyn Smith,
C. Lourenço
, et al. (28 additional authors not shown)
Abstract:
We review the capabilities of two projects that have been proposed as the next major European facility, for consideration in the upcoming update of the European Strategy for Particle Physics: CLIC and FCC. We focus on their physics potentials and emphasise the key differences between the linear or circular approaches. We stress the uniqueness of the FCC-ee programme for precision electroweak physi…
▽ More
We review the capabilities of two projects that have been proposed as the next major European facility, for consideration in the upcoming update of the European Strategy for Particle Physics: CLIC and FCC. We focus on their physics potentials and emphasise the key differences between the linear or circular approaches. We stress the uniqueness of the FCC-ee programme for precision electroweak physics at the $Z$ peak and the $WW$ threshold, as well as its unequalled statistics for Higgs physics and high accuracy for observing possible new phenomena in Higgs and $Z$ decays, whereas CLIC and FCC-ee offer similar capabilities near the $t \overline t$ threshold. Whilst CLIC offers the possibility of energy upgrades to 1500 and 3000 GeV, FCC-ee paves the way for FCC-hh. The latter offers unique capabilities for making direct or indirect discoveries in a new energy range, and has the highest sensitivity to the self-couplings of the Higgs boson and any anomalous couplings. We consider the FCC programme to be the best option to maintain Europe's place at the high-energy frontier during the coming decades.
△ Less
Submitted 31 December, 2019;
originally announced December 2019.
-
Extended Dynamical Symmetries of Landau Levels in Higher Dimensions
Authors:
S. Kurkcuoglu,
G. Unal,
I. Yurdusen
Abstract:
Continuum models for time-reversal (TR) invariant topological insulators (TIs) in $d \geq 3$ dimensions are provided by harmonic oscillators coupled to certain $SO(d)$ gauge fields. These models are equivalent to the presence of spin-orbit (SO) interaction in the oscillator Hamiltonians at a critical coupling strength (equivalent to the harmonic oscillator frequency) and leads to flat Landau Level…
▽ More
Continuum models for time-reversal (TR) invariant topological insulators (TIs) in $d \geq 3$ dimensions are provided by harmonic oscillators coupled to certain $SO(d)$ gauge fields. These models are equivalent to the presence of spin-orbit (SO) interaction in the oscillator Hamiltonians at a critical coupling strength (equivalent to the harmonic oscillator frequency) and leads to flat Landau Level (LL) spectra and therefore to infinite degeneracy of either the positive or the negative helicity states depending on the sign of the SO coupling. Generalizing the results of Haaker et al. to $d \geq 4$, we construct vector operators commuting with these Hamiltonians and show that $SO(d,2)$ emerges as the non-compact extended dynamical symmetry. Focusing on the model in four dimensions, we demonstrate that the infinite degeneracy of the flat spectra can be fully explained in terms of the discrete unitary representations of $SO(4,2)$, i.e. the {\it doubletons}. The degeneracy in the opposite helicity branch is finite, but can still be explained exploiting the complex conjugate {\it doubleton} representations. Subsequently, the analysis is generalized to $d$ dimensions, distinguishing the cases of odd and even $d$. We also determine the spectrum generating algebra in these models and briefly comment on the algebraic organization of the LL states w.r.t to an underlying "deformed" AdS geometry as well as on the organization of the surface states under open boundary conditions in view of our results.
△ Less
Submitted 23 September, 2019;
originally announced September 2019.
-
Medical Imaging with Deep Learning: MIDL 2019 -- Extended Abstract Track
Authors:
M. Jorge Cardoso,
Aasa Feragen,
Ben Glocker,
Ender Konukoglu,
Ipek Oguz,
Gozde Unal,
Tom Vercauteren
Abstract:
This compendium gathers all the accepted extended abstracts from the Second International Conference on Medical Imaging with Deep Learning (MIDL 2019), held in London, UK, 8-10 July 2019. Note that only accepted extended abstracts are listed here, the Proceedings of the MIDL 2019 Full Paper Track are published as Volume 102 of the Proceedings of Machine Learning Research (PMLR) http://proceedings.…
▽ More
This compendium gathers all the accepted extended abstracts from the Second International Conference on Medical Imaging with Deep Learning (MIDL 2019), held in London, UK, 8-10 July 2019. Note that only accepted extended abstracts are listed here, the Proceedings of the MIDL 2019 Full Paper Track are published as Volume 102 of the Proceedings of Machine Learning Research (PMLR) http://proceedings.mlr.press/v102/.
△ Less
Submitted 22 July, 2019; v1 submitted 21 May, 2019;
originally announced July 2019.
-
Higgs boson measurements at the LHC
Authors:
Guillaume Unal
Abstract:
Measurements of the Higgs boson production and decay performed at the Large Hadron Collider by the ATLAS and CMS experiments are reviewed. These measurements are based on proton-proton collision data at $\sqrt{s}$=~13~TeV, corresponding to integrated luminosities ranging from 35 to 80~fb$^{-1}$. With these datasets, the associated production of the Higgs boson with a $t\bar{t}$ pair is observed an…
▽ More
Measurements of the Higgs boson production and decay performed at the Large Hadron Collider by the ATLAS and CMS experiments are reviewed. These measurements are based on proton-proton collision data at $\sqrt{s}$=~13~TeV, corresponding to integrated luminosities ranging from 35 to 80~fb$^{-1}$. With these datasets, the associated production of the Higgs boson with a $t\bar{t}$ pair is observed and the decay of the Higgs boson to $b\bar{b}$ pairs is established. Measurements involving leptonic and bosonic final states are described. The combined constraints on the Higgs boson coupling properties are summarized.
△ Less
Submitted 26 November, 2018;
originally announced November 2018.
-
Multi Modal Convolutional Neural Networks for Brain Tumor Segmentation
Authors:
Mehmet Aygün,
Yusuf Hüseyin Şahin,
Gözde Ünal
Abstract:
In this work, we propose a multi-modal Convolutional Neural Network (CNN) approach for brain tumor segmentation. We investigate how to combine different modalities efficiently in the CNN framework.We adapt various fusion methods, which are previously employed on video recognition problem, to the brain tumor segmentation problem,and we investigate their efficiency in terms of memory and performance…
▽ More
In this work, we propose a multi-modal Convolutional Neural Network (CNN) approach for brain tumor segmentation. We investigate how to combine different modalities efficiently in the CNN framework.We adapt various fusion methods, which are previously employed on video recognition problem, to the brain tumor segmentation problem,and we investigate their efficiency in terms of memory and performance.Our experiments, which are performed on BRATS dataset, lead us to the conclusion that learning separate representations for each modality and combining them for brain tumor segmentation could increase the performance of CNN systems.
△ Less
Submitted 20 September, 2018; v1 submitted 17 September, 2018;
originally announced September 2018.
-
Yang-Mills solutions on de Sitter space of any dimension
Authors:
Olaf Lechtenfeld,
Gönül Ünal
Abstract:
For gauge groups SO$(n{+}1)$, SU$(m{+}1)$ and Sp$(\ell{+}1)$, we construct equivariant Yang-Mills solutions on de Sitter space in $n{+}1$, $2(m{+}1)$ and $4(\ell{+}1)$ spacetime dimensions. The latter is conformally mapped to a finite cylinder over a coset space realizing an appropriate unit sphere. The equivariance condition reduces the Yang-Mills system to an analog Newtonian particle in one or…
▽ More
For gauge groups SO$(n{+}1)$, SU$(m{+}1)$ and Sp$(\ell{+}1)$, we construct equivariant Yang-Mills solutions on de Sitter space in $n{+}1$, $2(m{+}1)$ and $4(\ell{+}1)$ spacetime dimensions. The latter is conformally mapped to a finite cylinder over a coset space realizing an appropriate unit sphere. The equivariance condition reduces the Yang-Mills system to an analog Newtonian particle in one or two dimensions subject to a time-dependent friction and a particular potential. We analyze some properties of the solutions such as their action and energy and display all analytic ones. Beyond dS$_4$ all such configurations have finite energy but infinite action.
△ Less
Submitted 10 July, 2018;
originally announced July 2018.
-
Chaos from Equivariant Fields on Fuzzy $S^4$
Authors:
U. H. Coskun,
S. Kurkcuoglu,
G. C. Toga,
G. Unal
Abstract:
We examine the $5d$ Yang-Mills matrix model in $0+1$-dimensions with $U(4N)$ gauge symmetry and a mass deformation term. We determine the explicit $SU(4)\approx SO(6)$ equivariant parametrizations of the gauge field and the fluctuations about the classical four concentric fuzzy four sphere configuration and obtain the low energy reduced actions(LEAs) by tracing over the $S_F^4$s for the first five…
▽ More
We examine the $5d$ Yang-Mills matrix model in $0+1$-dimensions with $U(4N)$ gauge symmetry and a mass deformation term. We determine the explicit $SU(4)\approx SO(6)$ equivariant parametrizations of the gauge field and the fluctuations about the classical four concentric fuzzy four sphere configuration and obtain the low energy reduced actions(LEAs) by tracing over the $S_F^4$s for the first five lowest matrix levels. The LEA's so obtained have potentials bounded from below indicating that the equivariant fluctuations about the $S_F^4$ do not lead to any instabilities. These reduced systems exhibit chaotic dynamics, which we reveal by computing their Lyapunov exponents.Using our numerical results, we explore various aspects of chaotic dynamics emerging from the LEAs. In particular, we model how the largest Lyapunov exponents change as a function of the energy. We also show that, in the Euclidean signature, the LEAs support the usual kink type soliton solutions, i.e. instantons in $1+0$-dimensions, which may be seen as the imprints of the topological fluxes penetrating the concentric $S_F^4$s due to the equivariance conditions, and preventing them to shrink to zero radius.
△ Less
Submitted 13 February, 2019; v1 submitted 27 June, 2018;
originally announced June 2018.
-
Generative Adversarial Training for MRA Image Synthesis Using Multi-Contrast MRI
Authors:
Sahin Olut,
Yusuf Huseyin Sahin,
Ugur Demir,
Gozde Unal
Abstract:
Magnetic Resonance Angiography (MRA) has become an essential MR contrast for imaging and evaluation of vascular anatomy and related diseases. MRA acquisitions are typically ordered for vascular interventions, whereas in typical scenarios, MRA sequences can be absent in the patient scans. This motivates the need for a technique that generates inexistent MRA from existing MR multi-contrast, which co…
▽ More
Magnetic Resonance Angiography (MRA) has become an essential MR contrast for imaging and evaluation of vascular anatomy and related diseases. MRA acquisitions are typically ordered for vascular interventions, whereas in typical scenarios, MRA sequences can be absent in the patient scans. This motivates the need for a technique that generates inexistent MRA from existing MR multi-contrast, which could be a valuable tool in retrospective subject evaluations and imaging studies. In this paper, we present a generative adversarial network (GAN) based technique to generate MRA from T1-weighted and T2-weighted MRI images, for the first time to our knowledge. To better model the representation of vessels which the MRA inherently highlights, we design a loss term dedicated to a faithful reproduction of vascularities. To that end, we incorporate steerable filter responses of the generated and reference images inside a Huber function loss term. Extending the well- established generator-discriminator architecture based on the recent PatchGAN model with the addition of steerable filter loss, the proposed steerable GAN (sGAN) method is evaluated on the large public database IXI. Experimental results show that the sGAN outperforms the baseline GAN method in terms of an overlap score with similar PSNR values, while it leads to improved visual perceptual quality.
△ Less
Submitted 12 April, 2018;
originally announced April 2018.
-
Patch-Based Image Inpainting with Generative Adversarial Networks
Authors:
Ugur Demir,
Gozde Unal
Abstract:
Area of image inpainting over relatively large missing regions recently advanced substantially through adaptation of dedicated deep neural networks. However, current network solutions still introduce undesired artifacts and noise to the repaired regions. We present an image inpainting method that is based on the celebrated generative adversarial network (GAN) framework. The proposed PGGAN method i…
▽ More
Area of image inpainting over relatively large missing regions recently advanced substantially through adaptation of dedicated deep neural networks. However, current network solutions still introduce undesired artifacts and noise to the repaired regions. We present an image inpainting method that is based on the celebrated generative adversarial network (GAN) framework. The proposed PGGAN method includes a discriminator network that combines a global GAN (G-GAN) architecture with a patchGAN approach. PGGAN first shares network layers between G-GAN and patchGAN, then splits paths to produce two adversarial losses that feed the generator network in order to capture both local continuity of image texture and pervasive global features in images. The proposed framework is evaluated extensively, and the results including comparison to recent state-of-the-art demonstrate that it achieves considerable improvements on both visual and quantitative evaluations.
△ Less
Submitted 20 March, 2018;
originally announced March 2018.
-
Deep Stacked Networks with Residual Polishing for Image Inpainting
Authors:
Ugur Demir,
Gozde Unal
Abstract:
Deep neural networks have shown promising results in image inpainting even if the missing area is relatively large. However, most of the existing inpainting networks introduce undesired artifacts and noise to the repaired regions. To solve this problem, we present a novel framework which consists of two stacked convolutional neural networks that inpaint the image and remove the artifacts, respecti…
▽ More
Deep neural networks have shown promising results in image inpainting even if the missing area is relatively large. However, most of the existing inpainting networks introduce undesired artifacts and noise to the repaired regions. To solve this problem, we present a novel framework which consists of two stacked convolutional neural networks that inpaint the image and remove the artifacts, respectively. The first network considers the global structure of the damaged image and coarsely fills the blank area. Then the second network modifies the repaired image to cancel the noise introduced by the first network. The proposed framework splits the problem into two distinct partitions that can be optimized separately, therefore it can be applied to any inpainting algorithm by changing the first network. Second stage in our framework which aims at polishing the inpainted images can be treated as a denoising problem where a wide range of algorithms can be employed. Our results demonstrate that the proposed framework achieves significant improvement on both visual and quantitative evaluations.
△ Less
Submitted 31 December, 2017;
originally announced January 2018.
-
A $U(3)$ Gauge Theory on Fuzzy Extra Dimensions
Authors:
Seckin Kurkcuoglu,
Gonul Unal
Abstract:
In this article, we explore the low energy structure of a $U(3)$ gauge theory over spaces with fuzzy sphere(s) as extra dimensions. In particular, we determine the equivariant parametrization of the gauge fields, which transform either invariantly or as vectors under the combined action of $SU(2)$ rotations of the fuzzy spheres and those $U(3)$ gauge transformations generated by…
▽ More
In this article, we explore the low energy structure of a $U(3)$ gauge theory over spaces with fuzzy sphere(s) as extra dimensions. In particular, we determine the equivariant parametrization of the gauge fields, which transform either invariantly or as vectors under the combined action of $SU(2)$ rotations of the fuzzy spheres and those $U(3)$ gauge transformations generated by $SU(2) \subset U(3)$ carrying the spin $1$ irreducible representation of $SU(2)$. The cases of a single fuzzy sphere $S_F^2$ and a particular direct sum of concentric fuzzy spheres, $S_F^{2 \, Int}$, covering the monopole bundle sectors with windings $\pm 1$ are treated in full and the low energy degrees of freedom for the gauge fields are obtained. Employing the parametrizations of the fields in the former case, we determine a low energy action by tracing over the fuzzy sphere and show that the emerging model is abelian Higgs type with $U(1) \times U(1)$ gauge symmetry and possess vortex solutions on ${\mathbb R}^2$, which we discuss in some detail. Generalization of our formulation to the equivariant parametrization of gauge fields in $U(n)$ theories is also briefly addressed.
△ Less
Submitted 30 June, 2016;
originally announced July 2016.
-
Multiple Wavelet Coherency Analysis and Forecasting of Metal Prices
Authors:
Emre Kahraman,
Gazanfer Ünal
Abstract:
The assessment of co-movement among metals is crucial to better understand the behaviors of the metal prices and the interactions with others that affect the changes in prices. In this study, both Wavelet Analysis and VARMA (Vector Autoregressive Moving Average) models are utilized. First, Multiple Wavelet Coherence (MWC), where Wavelet Analysis is needed, is utilized to determine dynamic correlat…
▽ More
The assessment of co-movement among metals is crucial to better understand the behaviors of the metal prices and the interactions with others that affect the changes in prices. In this study, both Wavelet Analysis and VARMA (Vector Autoregressive Moving Average) models are utilized. First, Multiple Wavelet Coherence (MWC), where Wavelet Analysis is needed, is utilized to determine dynamic correlation time interval and scales. VARMA is then used for forecasting which results in reduced errors.
The daily prices of steel, aluminium, copper and zinc between 10.05.2010 and 29.05.2014 are analyzed via wavelet analysis to highlight the interactions. Results uncover interesting dynamics between mentioned metals in the time-frequency space. VARMA (1,1) model forecasting is carried out considering the daily prices between 14.11.2011 and 16.11.2012 where the interactions are quite high and prediction errors are found quite limited with respect to ARMA(1.1). It is shown that dynamic co-movement detection via four variables wavelet coherency analysis in the determination of VARMA time interval enables to improve forecasting power of ARMA by decreasing forecasting errors.
△ Less
Submitted 5 February, 2016;
originally announced February 2016.
-
Chaos in Fractionally Integrated Generalized Autoregressive Conditional Heteroskedastic Processes
Authors:
Adil Yilmaz,
Gazanfer Unal
Abstract:
Fractionally integrated generalized autoregressive conditional heteroskedasticity (FIGARCH) arises in modeling of financial time series. FIGARCH is essentially governed by a system of nonlinear stochastic difference equations ${u_t}$ = ${z_t}$ $(1-\sum\limits_{j=1}^q β_j L^j)σ_{t}^2 = ω+(1-\sum\limits_{j=1}^q β_j L^j - (\sum\limits_{k=1}^p \varphi_k L^k) (1-L)^d) u_t^2$, where $ω\in$ R, and…
▽ More
Fractionally integrated generalized autoregressive conditional heteroskedasticity (FIGARCH) arises in modeling of financial time series. FIGARCH is essentially governed by a system of nonlinear stochastic difference equations ${u_t}$ = ${z_t}$ $(1-\sum\limits_{j=1}^q β_j L^j)σ_{t}^2 = ω+(1-\sum\limits_{j=1}^q β_j L^j - (\sum\limits_{k=1}^p \varphi_k L^k) (1-L)^d) u_t^2$, where $ω\in$ R, and $β_j\in$ R are constant parameters, $\{u_t\}_{{t\in}^+}$ and $\{σ_t\}_{{t\in}^+}$ are the discrete time real valued stochastic processes which represent FIGARCH (p,d,q) and stochastic volatility, respectively. Moreover, L is the backward shift operator, i.e. $L^d u_t \equiv u_{t-d}$ (d is the fractional differencing parameter 0$<$d$<$1).
In this work, we have studied the chaoticity properties of FIGARCH (p,d,q) processes by computing mutual information, correlation dimensions, FNNs (False Nearest Neighbour), the Lyapunov exponents, and for both the stochastic difference equation given above and for the financial time series. We have observed that maximal Lyapunov exponents are negative, therefore, it can be suggested that FIGARCH (p,d,q) is not deterministic chaotic process.
△ Less
Submitted 12 February, 2016; v1 submitted 29 January, 2016;
originally announced January 2016.
-
Equivariant Fields in an $SU({\cal N})$ Gauge Theory with new Spontaneously Generated Fuzzy Extra Dimensions
Authors:
S. Kurkcuoglu,
G. Unal
Abstract:
We find new spontaneously generated fuzzy extra dimensions emerging from a certain deformation of $N=4$ supersymmetric Yang-Mills (SYM) theory with cubic soft supersymmetry breaking and mass deformation terms. First, we determine a particular four dimensional fuzzy vacuum that may be expressed in terms of a direct sum of product of two fuzzy spheres, and denote it in short as…
▽ More
We find new spontaneously generated fuzzy extra dimensions emerging from a certain deformation of $N=4$ supersymmetric Yang-Mills (SYM) theory with cubic soft supersymmetry breaking and mass deformation terms. First, we determine a particular four dimensional fuzzy vacuum that may be expressed in terms of a direct sum of product of two fuzzy spheres, and denote it in short as $S_F^{2\, Int}\times S_F^{2\, Int}$. The direct sum structure of the vacuum is revealed by a suitable splitting of the scalar fields in the model in a manner that generalizes our approach in \cite{Seckinson}. Fluctuations around this vacuum have the structure of gauge fields over $S_F^{2\, Int}\times S_F^{2\, Int}$, and this enables us to conjecture the spontaneous broken model as an effective $U(n)$ $(n < {\cal N})$ gauge theory on the product manifold $M^4 \times S_F^{2\, Int} \times S_F^{2\, Int}$. We support this interpretation by examining the $U(4)$ theory and determining all of the $SU(2)\times SU(2)$ equivariant fields in the model, characterizing its low energy degrees of freedom. Monopole sectors with winding numbers $(\pm 1,0),\,(0,\pm1),\,(\pm1,\pm 1)$ are accessed from $S_F^{2\, Int}\times S_F^{2\, Int}$ after suitable projections and subsequently equivariant fields in these sectors are obtained. We indicate how Abelian Higgs type models with vortex solutions emerge after dimensionally reducing over the fuzzy monopole sectors as well. A family of fuzzy vacua is determined by giving a systematic treatment for the splitting of the scalar fields and it is made manifest that suitable projections of these vacuum solutions yield all higher winding number fuzzy monopole sectors. We observe that the vacuum configuration $S_F^{2\, Int}\times S_F^{2\, Int}$ identifies with the bosonic part of the product of two fuzzy superspheres with $OSP(2,2)\times OSP(2,2)$ supersymmetry and elaborate on this feature.
△ Less
Submitted 24 May, 2016; v1 submitted 13 June, 2015;
originally announced June 2015.
-
Quantum Hall Effect on the Grassmannians $\mathbf{Gr}_2(\mathbb{C}^N)$
Authors:
F. Balli,
A. Behtash,
S. Kurkcuoglu,
G. Unal
Abstract:
Quantum Hall Effects (QHEs) on the complex Grassmann manifolds $\mathbf{Gr}_2(\mathbb{C}^N)$ are formulated. We set up the Landau problem in $\mathbf{Gr}_2(\mathbb{C}^N)$ and solve it using group theoretical techniques and provide the energy spectrum and the eigenstates in terms of the $SU(N)$ Wigner ${\cal D}$-functions for charged particles on $\mathbf{Gr}_2(\mathbb{C}^N)$ under the influence of…
▽ More
Quantum Hall Effects (QHEs) on the complex Grassmann manifolds $\mathbf{Gr}_2(\mathbb{C}^N)$ are formulated. We set up the Landau problem in $\mathbf{Gr}_2(\mathbb{C}^N)$ and solve it using group theoretical techniques and provide the energy spectrum and the eigenstates in terms of the $SU(N)$ Wigner ${\cal D}$-functions for charged particles on $\mathbf{Gr}_2(\mathbb{C}^N)$ under the influence of abelian and non-abelian background magnetic monopoles or a combination of these thereof. In particular, for the simplest case of $\mathbf{Gr}_2(\mathbb{C}^4)$ we explicitly write down the $U(1)$ background gauge field as well as the single and many-particle eigenstates by introducing the Plücker coordinates and show by calculating the two-point correlation function that the Lowest Landau Level (LLL) at filling factor $ν=1$ forms an incompressible fluid. Our results are in agreement with the previous results in the literature for QHE on ${\mathbb C}P^N$ and generalize them to all $\mathbf{Gr}_2(\mathbb{C}^N)$ in a suitable manner. Finally, we heuristically identify a relation between the $U(1)$ Hall effect on $\mathbf{Gr}_2(\mathbb{C}^4)$ and the Hall effect on the odd sphere $S^5$, which is yet to be investigated in detail, by appealing to the already known analogous relations between the Hall effects on ${\mathbb C}P^3$ and ${\mathbb C}P^7$ and those on the spheres $S^4$ and $S^8$, respectively.
△ Less
Submitted 2 April, 2014; v1 submitted 15 March, 2014;
originally announced March 2014.
-
A Layer Correlation technique for pion energy calibration at the 2004 ATLAS Combined Beam Test
Authors:
E. Abat,
J. M. Abdallah,
T. N. Addy,
P. Adragna,
M. Aharrouche,
A. Ahmad,
T. P. A. Akesson,
M. Aleksa,
C. Alexa,
K. Anderson,
A. Andreazza,
F. Anghinolfi,
A. Antonaki,
G. Arabidze,
E. Arik,
T. Atkinson,
J. Baines,
O. K. Baker,
D. Banfi,
S. Baron,
A. J. Barr,
R. Beccherle,
H. P. Beck,
B. Belhorma,
P. J. Bell
, et al. (460 additional authors not shown)
Abstract:
A new method for calibrating the hadron response of a segmented calorimeter is developed and successfully applied to beam test data. It is based on a principal component analysis of energy deposits in the calorimeter layers, exploiting longitudinal shower development information to improve the measured energy resolution. Corrections for invisible hadronic energy and energy lost in dead material in…
▽ More
A new method for calibrating the hadron response of a segmented calorimeter is developed and successfully applied to beam test data. It is based on a principal component analysis of energy deposits in the calorimeter layers, exploiting longitudinal shower development information to improve the measured energy resolution. Corrections for invisible hadronic energy and energy lost in dead material in front of and between the calorimeters of the ATLAS experiment were calculated with simulated Geant4 Monte Carlo events and used to reconstruct the energy of pions im**ing on the calorimeters during the 2004 Barrel Combined Beam Test at the CERN H8 area. For pion beams with energies between 20 GeV and 180 GeV, the particle energy is reconstructed within 3% and the energy resolution is improved by between 11% and 25% compared to the resolution at the electromagnetic scale.
△ Less
Submitted 12 May, 2011; v1 submitted 20 December, 2010;
originally announced December 2010.
-
Expected Performance of the ATLAS Experiment - Detector, Trigger and Physics
Authors:
The ATLAS Collaboration,
G. Aad,
E. Abat,
B. Abbott,
J. Abdallah,
A. A. Abdelalim,
A. Abdesselam,
O. Abdinov,
B. Abi,
M. Abolins,
H. Abramowicz,
B. S. Acharya,
D. L. Adams,
T. N. Addy,
C. Adorisio,
P. Adragna,
T. Adye,
J. A. Aguilar-Saavedra,
M. Aharrouche,
S. P. Ahlen,
F. Ahles,
A. Ahmad,
H. Ahmed,
G. Aielli,
T. Akdogan
, et al. (2587 additional authors not shown)
Abstract:
A detailed study is presented of the expected performance of the ATLAS detector. The reconstruction of tracks, leptons, photons, missing energy and jets is investigated, together with the performance of b-tagging and the trigger. The physics potential for a variety of interesting physics processes, within the Standard Model and beyond, is examined. The study comprises a series of notes based on…
▽ More
A detailed study is presented of the expected performance of the ATLAS detector. The reconstruction of tracks, leptons, photons, missing energy and jets is investigated, together with the performance of b-tagging and the trigger. The physics potential for a variety of interesting physics processes, within the Standard Model and beyond, is examined. The study comprises a series of notes based on simulations of the detector and physics processes, with particular emphasis given to the data expected from the first years of operation of the LHC at CERN.
△ Less
Submitted 14 August, 2009; v1 submitted 28 December, 2008;
originally announced January 2009.
-
Response Uniformity of the ATLAS Liquid Argon Electromagnetic Calorimeter
Authors:
M. Aharrouche,
J. Colas,
L. Di Ciaccio,
M. El Kacimi,
O. Gaumer,
M. Gouanere,
D. Goujdami,
R. Lafaye,
S. Laplace,
C. Le Maner,
L. Neukermans,
P. Perrodo,
L. Poggioli,
D. Prieur,
H. Przysiezniak,
G. Sauvage,
I. Wingerter-Seez,
R. Zitoun,
F. Lanni,
L. Lu,
H. Ma,
S. Rajago palan,
H. Takai,
A. Belymam,
D. Benchekroun
, et al. (77 additional authors not shown)
Abstract:
The construction of the ATLAS electromagnetic liquid argon calorimeter modules is completed and all the modules are assembled and inserted in the cryostats. During the production period four barrel and three endcap modules were exposed to test beams in order to assess their performance, ascertain the production quality and reproducibility, and to scrutinize the complete energy reconstruction cha…
▽ More
The construction of the ATLAS electromagnetic liquid argon calorimeter modules is completed and all the modules are assembled and inserted in the cryostats. During the production period four barrel and three endcap modules were exposed to test beams in order to assess their performance, ascertain the production quality and reproducibility, and to scrutinize the complete energy reconstruction chain from the readout and calibration electronics to the signal and energy reconstruction. It was also possible to check the full Monte Carlo simulation of the calorimeter. The analysis of the uniformity, resolution and extraction of constant term is presented. Typical non-uniformities of 0.5% and typical global constant terms of 0.6% are measured for the barrel and end-cap modules.
△ Less
Submitted 7 September, 2007;
originally announced September 2007.
-
The final measurement of $ε'ε$ by NA48
Authors:
G. Unal
Abstract:
The direct CP violation parameter Re($ε'ε$) has been measured from the decay rates of neutral kaons into two pions using the NA48 detector at the CERN SPS. The 2001 running period was devoted to collecting additional data under varied conditions compared to earlier years (1997-99). The 2001 data yield the result: Re($εε'$)=$(13.7\pm3.1)\times10^{-4}$. Combining this result with that published fr…
▽ More
The direct CP violation parameter Re($ε'ε$) has been measured from the decay rates of neutral kaons into two pions using the NA48 detector at the CERN SPS. The 2001 running period was devoted to collecting additional data under varied conditions compared to earlier years (1997-99). The 2001 data yield the result: Re($εε'$)=$(13.7\pm3.1)\times10^{-4}$. Combining this result with that published from the 1997,98 and 99 data, an overall value of Re($ε'ε$)=$(14.7\pm2.2)\times10^{-4}$ is obtained from the NA48 experiment.
△ Less
Submitted 24 September, 2002;
originally announced September 2002.
-
Performances of the NA48 Liquid Krypton calorimeter
Authors:
Guillaume Unal
Abstract:
The NA48 experiments aims at a precise measurement of direct CP violation in the neutral Kaon system. This puts stringent requirements on the electromagnetic calorimeter used to detect photons of average energy 25 GeV. The choice of NA48 is a quasi homogeneous Liquid Krypton calorimeter with fast readout. The operation of this device and the performances achieved are described.
The NA48 experiments aims at a precise measurement of direct CP violation in the neutral Kaon system. This puts stringent requirements on the electromagnetic calorimeter used to detect photons of average energy 25 GeV. The choice of NA48 is a quasi homogeneous Liquid Krypton calorimeter with fast readout. The operation of this device and the performances achieved are described.
△ Less
Submitted 5 December, 2000;
originally announced December 2000.