-
Denoising OCT Images Using Steered Mixture of Experts with Multi-Model Inference
Authors:
Aytaç Özkan,
Elena Stoykova,
Thomas Sikora,
Violeta Madjarova
Abstract:
In Optical Coherence Tomography (OCT), speckle noise significantly hampers image quality, affecting diagnostic accuracy. Current methods, including traditional filtering and deep learning techniques, have limitations in noise reduction and detail preservation. Addressing these challenges, this study introduces a novel denoising algorithm, Block-Matching Steered-Mixture of Experts with Multi-Model…
▽ More
In Optical Coherence Tomography (OCT), speckle noise significantly hampers image quality, affecting diagnostic accuracy. Current methods, including traditional filtering and deep learning techniques, have limitations in noise reduction and detail preservation. Addressing these challenges, this study introduces a novel denoising algorithm, Block-Matching Steered-Mixture of Experts with Multi-Model Inference and Autoencoder (BM-SMoE-AE). This method combines block-matched implementation of the SMoE algorithm with an enhanced autoencoder architecture, offering efficient speckle noise reduction while retaining critical image details. Our method stands out by providing improved edge definition and reduced processing time. Comparative analysis with existing denoising techniques demonstrates the superior performance of BM-SMoE-AE in maintaining image integrity and enhancing OCT image usability for medical diagnostics.
△ Less
Submitted 23 February, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
TURead: An eye movement dataset of Turkish reading
Authors:
Cengiz Acarturk,
Aysegul Ozkan,
Tugce Nur Pekcetin,
Zuhal Ormanoglu,
Bilal Kirkici
Abstract:
In this study, we present TURead, an eye movement dataset of silent and oral sentence reading in Turkish, an agglutinative language with a shallow orthography understudied in reading research. TURead provides empirical data to investigate the relationship between morphology and oculomotor control. We employ a target-word approach in which target words are manipulated by word length and by the addi…
▽ More
In this study, we present TURead, an eye movement dataset of silent and oral sentence reading in Turkish, an agglutinative language with a shallow orthography understudied in reading research. TURead provides empirical data to investigate the relationship between morphology and oculomotor control. We employ a target-word approach in which target words are manipulated by word length and by the addition of two commonly used suffixes in Turkish. The dataset contains well-established eye movement variables; prelexical characteristics such as vowel harmony and bigram-trigram frequencies and word features, such as word length, predictability, frequency, eye voice span measures, Cloze test scores of the root word and suffix predictabilities, as well as the scores obtained from two working memory tests. Our findings on fixation parameters and word characteristics are in line with the patterns reported in the relevant literature.
△ Less
Submitted 2 December, 2023;
originally announced December 2023.
-
Class Uncertainty: A Measure to Mitigate Class Imbalance
Authors:
Z. S. Baltaci,
K. Oksuz,
S. Kuzucu,
K. Tezoren,
B. K. Konar,
A. Ozkan,
E. Akbas,
S. Kalkan
Abstract:
Class-wise characteristics of training examples affect the performance of deep classifiers. A well-studied example is when the number of training examples of classes follows a long-tailed distribution, a situation that is likely to yield sub-optimal performance for under-represented classes. This class imbalance problem is conventionally addressed by approaches relying on the class-wise cardinalit…
▽ More
Class-wise characteristics of training examples affect the performance of deep classifiers. A well-studied example is when the number of training examples of classes follows a long-tailed distribution, a situation that is likely to yield sub-optimal performance for under-represented classes. This class imbalance problem is conventionally addressed by approaches relying on the class-wise cardinality of training examples, such as data resampling. In this paper, we demonstrate that considering solely the cardinality of classes does not cover all issues causing class imbalance. To measure class imbalance, we propose "Class Uncertainty" as the average predictive uncertainty of the training examples, and we show that this novel measure captures the differences across classes better than cardinality. We also curate SVCI-20 as a novel dataset in which the classes have equal number of training examples but they differ in terms of their hardness; thereby causing a type of class imbalance which cannot be addressed by the approaches relying on cardinality. We incorporate our "Class Uncertainty" measure into a diverse set of ten class imbalance mitigation methods to demonstrate its effectiveness on long-tailed datasets as well as on our SVCI-20. Code and datasets will be made available.
△ Less
Submitted 23 November, 2023;
originally announced November 2023.
-
A Decentralized Resource Management System Proposal For Disasters: NGO-RMSD (STK-AKYS)
Authors:
Arzu Özkan,
Umutcan Korkmaz,
Cemal Dak,
Enis Karaarslan
Abstract:
Disaster and emergency management are under the responsibility of many organizations and there are serious coordination problems in post-disaster crisis management. This paper proposes a decentralized non-governmental organization resource management system for disasters (NGO-RMSD / STK-AKYS). This system is based on blockchain technology and it will enable the non-governmental organizations (NGO)…
▽ More
Disaster and emergency management are under the responsibility of many organizations and there are serious coordination problems in post-disaster crisis management. This paper proposes a decentralized non-governmental organization resource management system for disasters (NGO-RMSD / STK-AKYS). This system is based on blockchain technology and it will enable the non-governmental organizations (NGO) and public institutions to manage and coordinate the resources in a trusted environment in the case of disasters. A proof of concept implementation is developed by using the Quorum blockchain framework which is more energy-efficient than crypto currency-based blockchain solutions. Smart contracts are developed for the autonomous working of the system. These smart contacts are used for the verification of the needs of the one who is in need, delivering resources to the right people, and identifying the urgent needs. The system aims to reach more disaster victims in a more timely manner. NGO-RMSD is designed according to the needs of the NGOs in the field. The application is shared with the free software license and further development with the community is aimed.
△ Less
Submitted 12 April, 2022;
originally announced April 2022.
-
Efficient Hardware Acceleration of Sparsely Active Convolutional Spiking Neural Networks
Authors:
Jan Sommer,
M. Akif Özkan,
Oliver Keszocze,
Jürgen Teich
Abstract:
Spiking Neural Networks (SNNs) compute in an event-based matter to achieve a more efficient computation than standard Neural Networks. In SNNs, neuronal outputs (i.e. activations) are not encoded with real-valued activations but with sequences of binary spikes. The motivation of using SNNs over conventional neural networks is rooted in the special computational aspects of SNNs, especially the very…
▽ More
Spiking Neural Networks (SNNs) compute in an event-based matter to achieve a more efficient computation than standard Neural Networks. In SNNs, neuronal outputs (i.e. activations) are not encoded with real-valued activations but with sequences of binary spikes. The motivation of using SNNs over conventional neural networks is rooted in the special computational aspects of SNNs, especially the very high degree of sparsity of neural output activations. Well established architectures for conventional Convolutional Neural Networks (CNNs) feature large spatial arrays of Processing Elements (PEs) that remain highly underutilized in the face of activation sparsity. We propose a novel architecture that is optimized for the processing of Convolutional SNNs (CSNNs) that feature a high degree of activation sparsity. In our architecture, the main strategy is to use less but highly utilized PEs. The PE array used to perform the convolution is only as large as the kernel size, allowing all PEs to be active as long as there are spikes to process. This constant flow of spikes is ensured by compressing the feature maps (i.e. the activations) into queues that can then be processed spike by spike. This compression is performed in run-time using dedicated circuitry, leading to a self-timed scheduling. This allows the processing time to scale directly with the number of spikes. A novel memory organization scheme called memory interlacing is used to efficiently store and retrieve the membrane potentials of the individual neurons using multiple small parallel on-chip RAMs. Each RAM is hardwired to its PE, reducing switching circuitry and allowing RAMs to be located in close proximity to the respective PE. We implemented the proposed architecture on an FPGA and achieved a significant speedup compared to other implementations while needing less hardware resources and maintaining a lower energy consumption.
△ Less
Submitted 23 March, 2022;
originally announced March 2022.
-
DSP-Packing: Squeezing Low-precision Arithmetic into FPGA DSP Blocks
Authors:
Jan Sommer,
M. Akif Özkan,
Oliver Keszocze,
Jürgen Teich
Abstract:
The number of Digital Signal Processor (DSP) resources available in Field Programmable Gate Arrays (FPGAs) is often quite limited. Therefore, full utilization of available DSP resources for the computationally intensive parts of an algorithm is paramount for optimizing the non-functional properties of an implementation (i.e., performance, power, and area). The DSPs available in Xilinx devices impl…
▽ More
The number of Digital Signal Processor (DSP) resources available in Field Programmable Gate Arrays (FPGAs) is often quite limited. Therefore, full utilization of available DSP resources for the computationally intensive parts of an algorithm is paramount for optimizing the non-functional properties of an implementation (i.e., performance, power, and area). The DSPs available in Xilinx devices implement large bit width operators (i.e. a 48-bit accumulator or a $18 \times 27$ multiplier). However, using such a DSP for low-precision quantized data (as is common in image processing or machine learning applications) leaves the DSP resources underutilized. As a remedy, A method has been proposed to pack and compute four 4-bit multiplications on a single DSP in a single clock cycle. This paper presents a generalization of this scheme to arbitrary bit widths and number of multiplications. We also demonstrate that the previously proposed approach leads to errors (Mean Absolute Error (MAE) = 0.37). Furthermore, we explain where these errors come from and how they can be corrected. On top, we introduce a novel approximate method called "Overpacking" which allows to squeeze even more multiplications into a single DSP at the cost of small errors (MAE = 0.47). Overpacking allows to squeeze six 4-bit multiplications into a single DSP compared to just four in the literature. Finally, we introduce an alternative method for packing multiple small-bit width additions into a single 48-bit accumulator for use in applications such as Spiking Neural Networks.
△ Less
Submitted 21 March, 2022;
originally announced March 2022.
-
Environmental stress level to model tumor cell growth and survival
Authors:
Sabrina Schönfeld,
Alican Ozkan,
Laura Scarabosio,
Marissa Nichole Rylander,
Christina Kuttler
Abstract:
Survival of living tumor cells underlies many influences such as nutrient saturation, oxygen level, drug concentrations or mechanical forces. Data-supported mathematical modeling can be a powerful tool to get a better understanding of cell behavior in different settings. However, under consideration of numerous environmental factors mathematical modeling can get challenging. We present an approach…
▽ More
Survival of living tumor cells underlies many influences such as nutrient saturation, oxygen level, drug concentrations or mechanical forces. Data-supported mathematical modeling can be a powerful tool to get a better understanding of cell behavior in different settings. However, under consideration of numerous environmental factors mathematical modeling can get challenging. We present an approach to model the separate influences of each environmental quantity on the cells in a collective manner by introducing the "environmental stress level". It is an immeasurable auxiliary variable, which quantifies to what extent viable cells would get in a stressed state, if exposed to certain conditions. A high stress level can inhibit cell growth, promote cell death and influence cell movement. As a proof of concept, we compare two systems of ordinary differential equations, which model tumor cell dynamics under various nutrient saturations respectively with and without considering an environmental stress level. Particle-based Bayesian inversion methods are used to quantify uncertainties and calibrate unknown model parameters with time resolved measurements of in vitro populations of liver cancer cells. The calibration results of both models are compared and the quality of fit is quantified. While predictions of both models show good agreement with the data, there is indication that the model considering the stress level yields a better fitting. The proposed modeling approach offers a flexible and extendable framework for considering systems with additional environmental factors affecting the cell dynamics.
△ Less
Submitted 23 March, 2022; v1 submitted 18 January, 2022;
originally announced January 2022.
-
HipaccVX: Wedding of OpenVX and DSL-based Code Generation
Authors:
M. Akif Özkan,
Burak Ok,
Bo Qiao,
Jürgen Teich,
Frank Hannig
Abstract:
Writing programs for heterogeneous platforms optimized for high performance is hard since this requires the code to be tuned at a low level with architecture-specific optimizations that are most times based on fundamentally differing programming paradigms and languages. OpenVX promises to solve this issue for computer vision applications with a royalty-free industry standard that is based on a gra…
▽ More
Writing programs for heterogeneous platforms optimized for high performance is hard since this requires the code to be tuned at a low level with architecture-specific optimizations that are most times based on fundamentally differing programming paradigms and languages. OpenVX promises to solve this issue for computer vision applications with a royalty-free industry standard that is based on a graph-execution model. Yet, the OpenVX' algorithm space is constrained to a small set of vision functions. This hinders accelerating computations that are not included in the standard.
In this paper, we analyze OpenVX vision functions to find an orthogonal set of computational abstractions. Based on these abstractions, we couple an existing Domain-Specific Language (DSL) back end to the OpenVX environment and provide language constructs to the programmer for the definition of user-defined nodes. In this way, we enable optimizations that are not possible to detect with OpenVX graph implementations using the standard computer vision functions. These optimizations can double the throughput on an Nvidia GTX GPU and decrease the resource usage of a Xilinx Zynq FPGA by 50% for our benchmarks. Finally, we show that our proposed compiler framework, called HipaccVX, can achieve better results than the state-of-the-art approaches Nvidia VisionWorks and Halide-HLS.
△ Less
Submitted 26 August, 2020;
originally announced August 2020.
-
AnyHLS: High-Level Synthesis with Partial Evaluation
Authors:
M. Akif Özkan,
Arsène Pérard-Gayot,
Richard Membarth,
Philipp Slusallek,
Roland Leissa,
Sebastian Hack,
Jürgen Teich,
Frank Hannig
Abstract:
FPGAs excel in low power and high throughput computations, but they are challenging to program. Traditionally, developers rely on hardware description languages like Verilog or VHDL to specify the hardware behavior at the register-transfer level. High-Level Synthesis (HLS) raises the level of abstraction, but still requires FPGA design knowledge. Programmers usually write pragma-annotated C/C++ pr…
▽ More
FPGAs excel in low power and high throughput computations, but they are challenging to program. Traditionally, developers rely on hardware description languages like Verilog or VHDL to specify the hardware behavior at the register-transfer level. High-Level Synthesis (HLS) raises the level of abstraction, but still requires FPGA design knowledge. Programmers usually write pragma-annotated C/C++ programs to define the hardware architecture of an application. However, each hardware vendor extends its own C dialect using its own vendor-specific set of pragmas. This prevents portability across different vendors. Furthermore, pragmas are not first-class citizens in the language. This makes it hard to use them in a modular way or design proper abstractions. In this paper, we present AnyHLS, an approach to synthesize FPGA designs in a modular and abstract way. AnyHLS is able to raise the abstraction level of existing HLS tools by resorting to programming language features such as types and higher-order functions as follows: It relies on partial evaluation to specialize and to optimize the user application based on a library of abstractions. Then, vendor-specific HLS code is generated for Intel and Xilinx FPGAs. Portability is obtained by avoiding any vendor-specific pragmas at the source code. In order to validate achievable gains in productivity, a library for the domain of image processing is introduced as a case study, and its synthesis results are compared with several state-of-theart Domain-Specific Language (DSL) approaches for this domain.
△ Less
Submitted 21 July, 2020; v1 submitted 13 February, 2020;
originally announced February 2020.
-
Supervised Segmentation of Retinal Vessel Structures Using ANN
Authors:
Esra Kaya,
İsmail Sarıtaş,
Ilker Ali Ozkan
Abstract:
In this study, a supervised retina blood vessel segmentation process was performed on the green channel of the RGB image using artificial neural network (ANN). The green channel is preferred because the retinal vessel structures can be distinguished most clearly from the green channel of the RGB image. The study was performed using 20 images in the DRIVE data set which is one of the most common re…
▽ More
In this study, a supervised retina blood vessel segmentation process was performed on the green channel of the RGB image using artificial neural network (ANN). The green channel is preferred because the retinal vessel structures can be distinguished most clearly from the green channel of the RGB image. The study was performed using 20 images in the DRIVE data set which is one of the most common retina data sets known. The images went through some preprocessing stages like contrastlimited adaptive histogram equalization (CLAHE), color intensity adjustment, morphological operations and median and Gaussian filtering to obtain a good segmentation. Retinal vessel structures were highlighted with top-hat and bot-hat morphological operations and converted to binary image by using global thresholding. Then, the network was trained by the binary version of the images specified as training images in the dataset and the targets are the images segmented manually by a specialist. The average segmentation accuracy for 20 images was found as 0.9492.
△ Less
Submitted 15 January, 2020;
originally announced January 2020.
-
KinshipGAN: Synthesizing of Kinship Faces From Family Photos by Regularizing a Deep Face Network
Authors:
Savas Ozkan,
Akin Ozkan
Abstract:
In this paper, we propose a kinship generator network that can synthesize a possible child face by analyzing his/her parent's photo. For this purpose, we focus on to handle the scarcity of kinship datasets throughout the paper by proposing novel solutions in particular. To extract robust features, we integrate a pre-trained face model to the kinship face generator. Moreover, the generator network…
▽ More
In this paper, we propose a kinship generator network that can synthesize a possible child face by analyzing his/her parent's photo. For this purpose, we focus on to handle the scarcity of kinship datasets throughout the paper by proposing novel solutions in particular. To extract robust features, we integrate a pre-trained face model to the kinship face generator. Moreover, the generator network is regularized with an additional face dataset and adversarial loss to decrease the overfitting of the limited samples. Lastly, we adapt cycle-domain transformation to attain a more stable results. Experiments are conducted on Families in the Wild (FIW) dataset. The experimental results show that the contributions presented in the paper provide important performance improvements compared to the baseline architecture and our proposed method yields promising perceptual results.
△ Less
Submitted 17 July, 2018; v1 submitted 22 June, 2018;
originally announced June 2018.
-
Efficient Atlasing and Search of Configuration Spaces of Point-Sets Constrained by Distance Intervals
Authors:
Aysegul Ozkan,
Rahul Prabhu,
Troy Baker,
James Pence,
Jorg Peters,
Meera Sitharam
Abstract:
For configurations of point-sets that are pairwise constrained by distance intervals, the EASAL software implements a suite of algorithms that characterize the structure and geometric properties of the configuration space. The algorithms generate, describe and explore these configuration spaces using generic rigidity properties, classical results for stratification of semi-algebraic sets, and new…
▽ More
For configurations of point-sets that are pairwise constrained by distance intervals, the EASAL software implements a suite of algorithms that characterize the structure and geometric properties of the configuration space. The algorithms generate, describe and explore these configuration spaces using generic rigidity properties, classical results for stratification of semi-algebraic sets, and new results for efficient sampling by convex parametrization. The paper reviews the key theoretical underpinnings, major algorithms and their implementation. The paper outlines the main applications such as the computation of free energy and kinetics of assembly of supramolecular structures or of clusters in colloidal and soft materials. In addition, the paper surveys select experimental results and comparisons.
△ Less
Submitted 4 June, 2018; v1 submitted 18 May, 2018;
originally announced May 2018.
-
Atlasing of Assembly Landscapes using Distance Geometry and Graph Rigidity
Authors:
Rahul Prabhu,
Meera Sitharam,
Aysegul Ozkan,
Rui** Wu
Abstract:
We describe a novel geometric methodology for analyzing free-energy and kinetics of assembly driven by short-range pair-potentials in an implicit solvent, and provides illustrations of its unique capabilities. An atlas is a labeled partition of the assembly landscape into a topological roadmap of maximal, contiguous, nearly-equipotential-energy conformational regions or macrostates, together with…
▽ More
We describe a novel geometric methodology for analyzing free-energy and kinetics of assembly driven by short-range pair-potentials in an implicit solvent, and provides illustrations of its unique capabilities. An atlas is a labeled partition of the assembly landscape into a topological roadmap of maximal, contiguous, nearly-equipotential-energy conformational regions or macrostates, together with their neighborhood relationships. The new methodology decouples the roadmap generation from sampling and produces: (1) a query-able atlas of local potential energy minima, their basin structure, energy barriers, and neighboring basins; (2) paths between a specified pair of basins; and (3) approximations of relative path lengths, basin volumes (configurational entropy), and path probabilities. Results demonstrating the core algorithm's capabilities have been generated by a resource-light, opensource software implementation EASAL. EASAL atlases several hundred thousand macrostates in minutes on a standard laptop. Subsequent path and basin computations each take seconds. The core algorithm's correctness, time complexity, and efficiency-accuracy tradeoffs are formally guaranteed using modern geometric constraint systems. The methodology further links geometric variables of the input assembling units to a type of intuitive topological bar-code of the output atlas, which in turn determine stable assembled structures and kinetics. This succinct input-output relationship facilitates reverse analysis, and control towards design. We use the novel convex Cayley (distance-based) parametrization that is unique to assembly, as opposed to folding. Sampling microstates with macrostate-specific Cayley parameters avoids gradient-descent search used by all prevailing methods. This increases sampling efficiency, significantly reduces the number of repeated and discarded samples.
△ Less
Submitted 16 October, 2020; v1 submitted 16 March, 2012;
originally announced March 2012.