-
$L^p-L^q$ estimates for solutions to the plate equation with mass term
Authors:
Alexandre Arias Junior,
Halit Sevki Aslan,
Antonio Lagioia,
Marcelo Rempel Ebert
Abstract:
In this paper, we study the Cauchy problem for the linear plate equation with mass term and its applications to semilinear models. For the linear problem we obtain $L^p-L^q$ estimates for the solutions in the full range $1\leq p\leq q\leq \infty$, and we show that such estimates are optimal. In the sequel, we discuss the global in time existence of solutions to the associated semilinear problem wi…
▽ More
In this paper, we study the Cauchy problem for the linear plate equation with mass term and its applications to semilinear models. For the linear problem we obtain $L^p-L^q$ estimates for the solutions in the full range $1\leq p\leq q\leq \infty$, and we show that such estimates are optimal. In the sequel, we discuss the global in time existence of solutions to the associated semilinear problem with power nonlinearity $|u|^α$. For low dimension space $n\leq 4$, and assuming $L^1$ regularity on the second datum, we were able to prove global existence for $α> \max\{α_c(n), \tildeα_c(n)\}$ where $α_c = 1+4/n$ and $\tilde α_c = 2+2/n$. However, assuming initial data in $H^2(\mathbb{R}^n)\times L^2(\mathbb{R}^n)$, the presence of the mass term allows us to obtain global in time existence for all $1<α\leq (n+4)/[n-4]_+$. We also show that the latter upper bound is optimal, since we prove that there exist data such that a non-existence result for local weak solutions holds when $α> (n+4)/[n-4]_+$.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Combinatorial Curve Neighborhood of the Affine Flag Manifold of Type $A_{n-1}^1$
Authors:
Songul Aslan
Abstract:
Let $\mathscr{X}$ be the affine flag manifold of Lie type $A_{n-1}^{(1)}$ where $n \geq 3$ and let $W_{\text{aff}}$ be the associated affine Weyl group. The moment graph for $\mathscr{X}$ encodes the torus fixed points (corresponding to elements of the affine Weyl group $W_{\text{aff}}$) and the torus stable curves in $\mathscr{X}$. Given a fixed point $u\in W_{\text{aff}}$ and a degree…
▽ More
Let $\mathscr{X}$ be the affine flag manifold of Lie type $A_{n-1}^{(1)}$ where $n \geq 3$ and let $W_{\text{aff}}$ be the associated affine Weyl group. The moment graph for $\mathscr{X}$ encodes the torus fixed points (corresponding to elements of the affine Weyl group $W_{\text{aff}}$) and the torus stable curves in $\mathscr{X}$. Given a fixed point $u\in W_{\text{aff}}$ and a degree $\mathbf{d}=(d_0,d_1,...,d_{n-1})\in \mathbb{Z}_{\geq 0}^{n}$, the combinatorial curve neighborhood is the set of maximal elements in the moment graph of $\mathscr{X}$ which can be reached from $u'\leq u$ by a chain of curves of total degree $\leq \mathbf{d}$. In this paper we give combinatorial formulas and algorithms for calculating these elements in $\mathscr{X}$.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
A Practical Approach for Exploring Granger Connectivity in High-Dimensional Networks of Time Series
Authors:
Sipan Aslan,
Hernando Ombao
Abstract:
This manuscript presents a novel method for discovering effective connectivity between specified pairs of nodes in a high-dimensional network of time series. To accurately perform Granger causality analysis from the first node to the second node, it is essential to eliminate the influence of all other nodes within the network. The approach proposed is to create a low-dimensional representation of…
▽ More
This manuscript presents a novel method for discovering effective connectivity between specified pairs of nodes in a high-dimensional network of time series. To accurately perform Granger causality analysis from the first node to the second node, it is essential to eliminate the influence of all other nodes within the network. The approach proposed is to create a low-dimensional representation of all other nodes in the network using frequency-domain-based dynamic principal component analysis (spectral DPCA). The resulting scores are subsequently removed from the first and second nodes of interest, thus eliminating the confounding effect of other nodes within the high-dimensional network. To conduct hypothesis testing on Granger causality, we propose a permutation-based causality test. This test enhances the accuracy of our findings when the error structures are non-Gaussian. The approach has been validated in extensive simulation studies, which demonstrate the efficacy of the methodology as a tool for causality analysis in complex time series networks. The proposed methodology has also been demonstrated to be both expedient and viable on real datasets, with particular success observed on multichannel EEG networks.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Visco-elastic damped wave models with time-dependent coefficient
Authors:
Halit Sevki Aslan,
Michael Reissig
Abstract:
In this paper, we study the following Cauchy problem for linear visco-elastic damped wave models with a general time-dependent coefficient $g=g(t)$: \begin{equation} \label{EqAbstract} \tag{$\star$} \begin{cases} u_{tt}- Δu + g(t)(-Δ)u_t=0, &(t,x) \in (0,\infty) \times \mathbb{R}^n, \\ u(0,x)= u_0(x),\quad u_t(0,x)= u_1(x), &x \in \mathbb{R}^n. \end{cases} \end{equation} We are interested to study…
▽ More
In this paper, we study the following Cauchy problem for linear visco-elastic damped wave models with a general time-dependent coefficient $g=g(t)$: \begin{equation} \label{EqAbstract} \tag{$\star$} \begin{cases} u_{tt}- Δu + g(t)(-Δ)u_t=0, &(t,x) \in (0,\infty) \times \mathbb{R}^n, \\ u(0,x)= u_0(x),\quad u_t(0,x)= u_1(x), &x \in \mathbb{R}^n. \end{cases} \end{equation} We are interested to study the influence of the dam** term $g(t)(-Δ)u_t$ on qualitative properties of solutions to \eqref{EqAbstract} as decay estimates for energies of higher order and the parabolic effect. The main tools are related to WKB-analysis. We apply elliptic as well as hyperbolic WKB-analysis in different parts of the extended phase space.
△ Less
Submitted 23 July, 2023;
originally announced July 2023.
-
Reassembling Broken Objects using Breaking Curves
Authors:
Ali Alagrami,
Luca Palmieri,
Sinem Aslan,
Marcello Pelillo,
Sebastiano Vascon
Abstract:
Reassembling 3D broken objects is a challenging task. A robust solution that generalizes well must deal with diverse patterns associated with different types of broken objects. We propose a method that tackles the pairwise assembly of 3D point clouds, that is agnostic on the type of object, and that relies solely on their geometrical information, without any prior information on the shape of the r…
▽ More
Reassembling 3D broken objects is a challenging task. A robust solution that generalizes well must deal with diverse patterns associated with different types of broken objects. We propose a method that tackles the pairwise assembly of 3D point clouds, that is agnostic on the type of object, and that relies solely on their geometrical information, without any prior information on the shape of the reconstructed object. The method receives two point clouds as input and segments them into regions using detected closed boundary contours, known as breaking curves. Possible alignment combinations of the regions of each broken object are evaluated and the best one is selected as the final alignment. Experiments were carried out both on available 3D scanned objects and on a recent benchmark for synthetic broken objects. Results show that our solution performs well in reassembling different kinds of broken objects.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
View Correspondence Network for Implicit Light Field Representation
Authors:
Süleyman Aslan,
Brandon Yushan Feng,
Amitabh Varshney
Abstract:
We present a novel technique for implicit neural representation of light fields at continuously defined viewpoints with high quality and fidelity. Our implicit neural representation maps 4D coordinates defining two-plane parameterization of the light fields to the corresponding color values. We leverage periodic activations to achieve high expressivity and accurate reconstruction for complex data…
▽ More
We present a novel technique for implicit neural representation of light fields at continuously defined viewpoints with high quality and fidelity. Our implicit neural representation maps 4D coordinates defining two-plane parameterization of the light fields to the corresponding color values. We leverage periodic activations to achieve high expressivity and accurate reconstruction for complex data manifolds while kee** low storage and inference time requirements. However, naïvely trained non-3D structured networks do not adequately satisfy the multi-view consistency; instead, they perform alpha blending of nearby viewpoints. In contrast, our View Correspondence Network, or VICON, leverages stereo matching, optimization by automatic differentiation with respect to the input space, and multi-view pixel correspondence to provide a novel implicit representation of the light fields faithful to the novel views that are unseen during the training. Experimental results show VICON superior to the state-of-the-art non-3D implicit light field representations both qualitatively and quantitatively. Moreover, our implicit representation captures a larger field of view (FoV), surpassing the extent of the observable scene by the cameras of the ground truth renderings.
△ Less
Submitted 10 May, 2023;
originally announced May 2023.
-
CodEx: A Modular Framework for Joint Temporal De-blurring and Tomographic Reconstruction
Authors:
Soumendu Majee,
Selin Aslan,
Doga Gursoy,
Charles A. Bouman
Abstract:
In many computed tomography (CT) imaging applications, it is important to rapidly collect data from an object that is moving or changing with time. Tomographic acquisition is generally assumed to be step-and-shoot, where the object is rotated to each desired angle, and a view is taken. However, step-and-shoot acquisition is slow and can waste photons, so in practice fly-scanning is done where the…
▽ More
In many computed tomography (CT) imaging applications, it is important to rapidly collect data from an object that is moving or changing with time. Tomographic acquisition is generally assumed to be step-and-shoot, where the object is rotated to each desired angle, and a view is taken. However, step-and-shoot acquisition is slow and can waste photons, so in practice fly-scanning is done where the object is continuously rotated while collecting data. However, this can result in motion-blurred views and consequently reconstructions with severe motion artifacts.
In this paper, we introduce CodEx, a modular framework for joint de-blurring and tomographic reconstruction that can effectively invert the motion blur introduced in sparse view fly-scanning. The method is a synergistic combination of a novel acquisition method with a novel non-convex Bayesian reconstruction algorithm. CodEx works by encoding the acquisition with a known binary code that the reconstruction algorithm then inverts. Using a well chosen binary code to encode the measurements can improve the accuracy of the inversion process. The CodEx reconstruction method uses the alternating direction method of multipliers (ADMM) to split the inverse problem into iterative deblurring and reconstruction sub-problems, making reconstruction practical to implement. We present reconstruction results on both simulated and binned experimental data to demonstrate the effectiveness of our method.
△ Less
Submitted 30 July, 2022; v1 submitted 11 November, 2021;
originally announced November 2021.
-
Scalable and accurate multi-GPU based image reconstruction of large-scale ptychography data
Authors:
Xiaodong Yu,
Viktor Nikitin,
Daniel J. Ching,
Selin Aslan,
Doga Gursoy,
Tekin Bicer
Abstract:
While the advances in synchrotron light sources, together with the development of focusing optics and detectors, allow nanoscale ptychographic imaging of materials and biological specimens, the corresponding experiments can yield terabyte-scale large volumes of data that can impose a heavy burden on the computing platform. While Graphical Processing Units (GPUs) provide high performance for such l…
▽ More
While the advances in synchrotron light sources, together with the development of focusing optics and detectors, allow nanoscale ptychographic imaging of materials and biological specimens, the corresponding experiments can yield terabyte-scale large volumes of data that can impose a heavy burden on the computing platform. While Graphical Processing Units (GPUs) provide high performance for such large-scale ptychography datasets, a single GPU is typically insufficient for analysis and reconstruction. Several existing works have considered leveraging multiple GPUs to accelerate the ptychographic reconstruction. However, they utilize only Message Passing Interface (MPI) to handle the communications between GPUs. It poses inefficiency for the configuration that has multiple GPUs in a single node, especially while processing a single large projection, since it provides no optimizations to handle the heterogeneous GPU interconnections containing both low-speed links, e.g., PCIe, and high-speed links, e.g., NVLink. In this paper, we provide a multi-GPU implementation that can effectively solve large-scale ptychographic reconstruction problem with optimized performance on intra-node multi-GPU. We focus on the conventional maximum-likelihood reconstruction problem using conjugate-gradient (CG) for the solution and propose a novel hybrid parallelization model to address the performance bottlenecks in CG solver. Accordingly, we develop a tool called PtyGer (Ptychographic GPU(multiple)-based reconstruction), implementing our hybrid parallelization model design. The comprehensive evaluation verifies that PtyGer can fully preserve the original algorithm's accuracy while achieving outstanding intra-node GPU scalability.
△ Less
Submitted 14 June, 2021;
originally announced June 2021.
-
An Improved Real-Time Face Recognition System at Low Resolution Based on Local Binary Pattern Histogram Algorithm and CLAHE
Authors:
Kamal Chandra Paul,
Semih Aslan
Abstract:
This research presents an improved real-time face recognition system at a low resolution of 15 pixels with pose and emotion and resolution variations. We have designed our datasets named LRD200 and LRD100, which have been used for training and classification. The face detection part uses the Viola-Jones algorithm, and the face recognition part receives the face image from the face detection part t…
▽ More
This research presents an improved real-time face recognition system at a low resolution of 15 pixels with pose and emotion and resolution variations. We have designed our datasets named LRD200 and LRD100, which have been used for training and classification. The face detection part uses the Viola-Jones algorithm, and the face recognition part receives the face image from the face detection part to process it using the Local Binary Pattern Histogram (LBPH) algorithm with preprocessing using contrast limited adaptive histogram equalization (CLAHE) and face alignment. The face database in this system can be updated via our custom-built standalone android app and automatic restarting of the training and recognition process with an updated database. Using our proposed algorithm, a real-time face recognition accuracy of 78.40% at 15 px and 98.05% at 45 px have been achieved using the LRD200 database containing 200 images per person. With 100 images per person in the database (LRD100) the achieved accuracies are 60.60% at 15 px and 95% at 45 px respectively. A facial deflection of about 30 degrees on either side from the front face showed an average face recognition precision of 72.25% - 81.85%. This face recognition system can be employed for law enforcement purposes, where the surveillance camera captures a low-resolution image because of the distance of a person from the camera. It can also be used as a surveillance system in airports, bus stations, etc., to reduce the risk of possible criminal threats.
△ Less
Submitted 15 April, 2021;
originally announced April 2021.
-
Identifying centres of interest in paintings using alignment and edge detection: Case studies on works by Luc Tuymans
Authors:
Sinem Aslan,
Luc Steels
Abstract:
What is the creative process through which an artist goes from an original image to a painting? Can we examine this process using techniques from computer vision and pattern recognition? Here we set the first preliminary steps to algorithmically deconstruct some of the transformations that an artist applies to an original image in order to establish centres of interest, which are focal areas of a…
▽ More
What is the creative process through which an artist goes from an original image to a painting? Can we examine this process using techniques from computer vision and pattern recognition? Here we set the first preliminary steps to algorithmically deconstruct some of the transformations that an artist applies to an original image in order to establish centres of interest, which are focal areas of a painting that carry meaning. We introduce a comparative methodology that first cuts out the minimal segment from the original image on which the painting is based, then aligns the painting with this source, investigates micro-differences to identify centres of interest and attempts to understand their role. In this paper we focus exclusively on micro-differences with respect to edges. We believe that research into where and how artists create centres of interest in paintings is valuable for curators, art historians, viewers, and art educators, and might even help artists to understand and refine their own artistic method.
△ Less
Submitted 4 January, 2021;
originally announced January 2021.
-
Transductive Visual Verb Sense Disambiguation
Authors:
Sebastiano Vascon,
Sinem Aslan,
Gianluca Bigaglia,
Lorenzo Giudice,
Marcello Pelillo
Abstract:
Verb Sense Disambiguation is a well-known task in NLP, the aim is to find the correct sense of a verb in a sentence. Recently, this problem has been extended in a multimodal scenario, by exploiting both textual and visual features of ambiguous verbs leading to a new problem, the Visual Verb Sense Disambiguation (VVSD). Here, the sense of a verb is assigned considering the content of an image paire…
▽ More
Verb Sense Disambiguation is a well-known task in NLP, the aim is to find the correct sense of a verb in a sentence. Recently, this problem has been extended in a multimodal scenario, by exploiting both textual and visual features of ambiguous verbs leading to a new problem, the Visual Verb Sense Disambiguation (VVSD). Here, the sense of a verb is assigned considering the content of an image paired with it rather than a sentence in which the verb appears. Annotating a dataset for this task is more complex than textual disambiguation, because assigning the correct sense to a pair of $<$image, verb$>$ requires both non-trivial linguistic and visual skills. In this work, differently from the literature, the VVSD task will be performed in a transductive semi-supervised learning (SSL) setting, in which only a small amount of labeled information is required, reducing tremendously the need for annotated data. The disambiguation process is based on a graph-based label propagation method which takes into account mono or multimodal representations for $<$image, verb$>$ pairs. Experiments have been carried out on the recently published dataset VerSe, the only available dataset for this task. The achieved results outperform the current state-of-the-art by a large margin while using only a small fraction of labeled samples per sense. Code available: https://github.com/GiBg1aN/TVVSD.
△ Less
Submitted 19 December, 2020;
originally announced December 2020.
-
EEG based Major Depressive disorder and Bipolar disorder detection using Neural Networks: A review
Authors:
Sana Yasin,
Syed Asad Hussain,
Sinem Aslan,
Imran Raza,
Muhammad Muzammel,
Alice Othmani
Abstract:
Mental disorders represent critical public health challenges as they are leading contributors to the global burden of disease and intensely influence social and financial welfare of individuals. The present comprehensive review concentrate on the two mental disorders: Major depressive Disorder (MDD) and Bipolar Disorder (BD) with noteworthy publications during the last ten years. There is a big ne…
▽ More
Mental disorders represent critical public health challenges as they are leading contributors to the global burden of disease and intensely influence social and financial welfare of individuals. The present comprehensive review concentrate on the two mental disorders: Major depressive Disorder (MDD) and Bipolar Disorder (BD) with noteworthy publications during the last ten years. There is a big need nowadays for phenotypic characterization of psychiatric disorders with biomarkers. Electroencephalography (EEG) signals could offer a rich signature for MDD and BD and then they could improve understanding of pathophysiological mechanisms underling these mental disorders. In this review, we focus on the literature works adopting neural networks fed by EEG signals. Among those studies using EEG and neural networks, we have discussed a variety of EEG based protocols, biomarkers and public datasets for depression and bipolar disorder detection. We conclude with a discussion and valuable recommendations that will help to improve the reliability of developed models and for more accurate and more deterministic computational intelligence based systems in psychiatry. This review will prove to be a structured and valuable initial point for the researchers working on depression and bipolar disorders recognition by using EEG signals.
△ Less
Submitted 4 February, 2021; v1 submitted 28 September, 2020;
originally announced September 2020.
-
Joint ptycho-tomography with deep generative priors
Authors:
Selin Aslan,
Zhengchun Liu,
Viktor Nikitin,
Tekin Bicer,
Sven Leyffer,
Doga Gursoy
Abstract:
Joint ptycho-tomography is a powerful computational imaging framework to recover the refractive properties of a 3D object while relaxing the requirements for probe overlap that is common in conventional phase retrieval. We use an augmented Lagrangian scheme for formulating the constrained optimization problem and employ an alternating direction method of multipliers (ADMM) for the joint solution.…
▽ More
Joint ptycho-tomography is a powerful computational imaging framework to recover the refractive properties of a 3D object while relaxing the requirements for probe overlap that is common in conventional phase retrieval. We use an augmented Lagrangian scheme for formulating the constrained optimization problem and employ an alternating direction method of multipliers (ADMM) for the joint solution. ADMM allows the problem to be split into smaller and computationally more efficient subproblems: ptychographic phase retrieval, tomographic reconstruction, and regularization of the solution. We extend our ADMM framework with plug-and-play (PnP) denoisers by replacing the regularization subproblem with a general denoising operator based on machine learning. While the PnP framework enables integrating such learned priors as denoising operators, tuning of the denoiser prior remains challenging. To overcome this challenge, we propose a denoiser parameter to control the effect of the denoiser and to accelerate the solution. In our simulations, we demonstrate that our proposed framework with parameter tuning and learned priors generates high-quality reconstructions under limited and noisy measurement data.
△ Less
Submitted 27 August, 2021; v1 submitted 20 September, 2020;
originally announced September 2020.
-
Randomization for the Efficient Computation of Parametric Reduced Order Models for Inversion
Authors:
Selin Aslan,
Eric de Sturler,
Serkan Gugercin
Abstract:
Nonlinear parametric inverse problems appear in many applications. Here, we focus on diffuse optical tomography (DOT) in medical imaging to recover unknown images of interest, such as cancerous tissue in a given medium, using a mathematical (forward) model. The forward model in DOT is a diffusion-absorption model for the photon flux. The main bottleneck in these problems is the repeated evaluation…
▽ More
Nonlinear parametric inverse problems appear in many applications. Here, we focus on diffuse optical tomography (DOT) in medical imaging to recover unknown images of interest, such as cancerous tissue in a given medium, using a mathematical (forward) model. The forward model in DOT is a diffusion-absorption model for the photon flux. The main bottleneck in these problems is the repeated evaluation of the large-scale forward model. For DOT, this corresponds to solving large linear systems for each source and frequency at each optimization step. Moreover, Newton-type methods, often the method of choice, require additional linear solves with the adjoint to compute derivative information. Emerging technology allows for large numbers of sources and detectors, making these problems prohibitively expensive. Reduced order models (ROM) have been used to drastically reduce the system size in each optimization step, while solving the inverse problem accurately. However, for large numbers of sources and detectors, just the construction of the candidate basis for the ROM projection space incurs a substantial cost, as matching the full parameter gradient matrix in interpolatory model reduction requires large linear solves for all sources and frequencies and all detectors and frequencies for each parameter interpolation point. As this candidate basis numerically has low rank, this construction is followed by a rank-revealing factorization that typically reduces the number of vectors in the candidate basis substantially. We propose to use randomization to approximate this basis with a drastically reduced number of large linear solves. We also provide a detailed analysis for the low-rank structure of the candidate basis for our problem of interest. Even though we focus on the DOT problem, the ideas presented are relevant to many other large scale inverse problems and optimization problems.
△ Less
Submitted 12 July, 2020;
originally announced July 2020.
-
Evaluation of modified uniformly redundant arrays as structured illuminations for ptychography
Authors:
Daniel J. Ching,
Selin Aslan,
Viktor Nikitin,
Michael J. Wojcik,
Doga Gursoy
Abstract:
Previous studies have shown that the frequency content of an illumination affects the convergence rate and reconstruction quality of ptychographic reconstructions. In this numerical study, we demonstrate that structuring a large illumination as a modified uniformly redundant array (MURA) can yield higher resolution and faster convergence for ptychography by improving the signal-to-noise ratio of h…
▽ More
Previous studies have shown that the frequency content of an illumination affects the convergence rate and reconstruction quality of ptychographic reconstructions. In this numerical study, we demonstrate that structuring a large illumination as a modified uniformly redundant array (MURA) can yield higher resolution and faster convergence for ptychography by improving the signal-to-noise ratio of high spatial frequencies in the far-field diffraction pattern.
△ Less
Submitted 3 April, 2020;
originally announced April 2020.
-
CHAOS Challenge -- Combined (CT-MR) Healthy Abdominal Organ Segmentation
Authors:
A. Emre Kavur,
N. Sinem Gezer,
Mustafa Barış,
Sinem Aslan,
Pierre-Henri Conze,
Vladimir Groza,
Duc Duy Pham,
Soumick Chatterjee,
Philipp Ernst,
Savaş Özkan,
Bora Baydar,
Dmitry Lachinov,
Shuo Han,
Josef Pauli,
Fabian Isensee,
Matthias Perkonigg,
Rachana Sathish,
Ronnie Rajan,
Debdoot Sheet,
Gurbandurdy Dovletov,
Oliver Speck,
Andreas Nürnberger,
Klaus H. Maier-Hein,
Gözde Bozdağı Akar,
Gözde Ünal
, et al. (2 additional authors not shown)
Abstract:
Segmentation of abdominal organs has been a comprehensive, yet unresolved, research field for many years. In the last decade, intensive developments in deep learning (DL) have introduced new state-of-the-art segmentation systems. In order to expand the knowledge on these topics, the CHAOS - Combined (CT-MR) Healthy Abdominal Organ Segmentation challenge has been organized in conjunction with IEEE…
▽ More
Segmentation of abdominal organs has been a comprehensive, yet unresolved, research field for many years. In the last decade, intensive developments in deep learning (DL) have introduced new state-of-the-art segmentation systems. In order to expand the knowledge on these topics, the CHAOS - Combined (CT-MR) Healthy Abdominal Organ Segmentation challenge has been organized in conjunction with IEEE International Symposium on Biomedical Imaging (ISBI), 2019, in Venice, Italy. CHAOS provides both abdominal CT and MR data from healthy subjects for single and multiple abdominal organ segmentation. Five different but complementary tasks have been designed to analyze the capabilities of current approaches from multiple perspectives. The results are investigated thoroughly, compared with manual annotations and interactive methods. The analysis shows that the performance of DL models for single modality (CT / MR) can show reliable volumetric analysis performance (DICE: 0.98 $\pm$ 0.00 / 0.95 $\pm$ 0.01) but the best MSSD performance remain limited (21.89 $\pm$ 13.94 / 20.85 $\pm$ 10.63 mm). The performances of participating models decrease significantly for cross-modality tasks for the liver (DICE: 0.88 $\pm$ 0.15 MSSD: 36.33 $\pm$ 21.97 mm) and all organs (DICE: 0.85 $\pm$ 0.21 MSSD: 33.17 $\pm$ 38.93 mm). Despite contrary examples on different applications, multi-tasking DL models designed to segment all organs seem to perform worse compared to organ-specific ones (performance drop around 5\%). Besides, such directions of further research for cross-modality segmentation would significantly support real-world clinical applications. Moreover, having more than 1500 participants, another important contribution of the paper is the analysis on shortcomings of challenge organizations such as the effects of multiple submissions and peeking phenomena.
△ Less
Submitted 7 January, 2021; v1 submitted 17 January, 2020;
originally announced January 2020.
-
Multimodal Video-based Apparent Personality Recognition Using Long Short-Term Memory and Convolutional Neural Networks
Authors:
Süleyman Aslan,
Uğur Güdükbay
Abstract:
Personality computing and affective computing, where the recognition of personality traits is essential, have gained increasing interest and attention in many research areas recently. We propose a novel approach to recognize the Big Five personality traits of people from videos. Personality and emotion affect the speaking style, facial expressions, body movements, and linguistic factors in social…
▽ More
Personality computing and affective computing, where the recognition of personality traits is essential, have gained increasing interest and attention in many research areas recently. We propose a novel approach to recognize the Big Five personality traits of people from videos. Personality and emotion affect the speaking style, facial expressions, body movements, and linguistic factors in social contexts, and they are affected by environmental elements. We develop a multimodal system to recognize apparent personality based on various modalities such as the face, environment, audio, and transcription features. We use modality-specific neural networks that learn to recognize the traits independently and we obtain a final prediction of apparent personality with a feature-level fusion of these networks. We employ pre-trained deep convolutional neural networks such as ResNet and VGGish networks to extract high-level features and Long Short-Term Memory networks to integrate temporal information. We train the large model consisting of modality-specific subnetworks using a two-stage training process. We first train the subnetworks separately and then fine-tune the overall model using these trained networks. We evaluate the proposed method using ChaLearn First Impressions V2 challenge dataset. Our approach obtains the best overall "mean accuracy" score, averaged over five personality traits, compared to the state-of-the-art.
△ Less
Submitted 1 November, 2019;
originally announced November 2019.
-
Weakly Supervised Semantic Segmentation Using Constrained Dominant Sets
Authors:
Sinem Aslan,
Marcello Pelillo
Abstract:
The availability of large-scale data sets is an essential pre-requisite for deep learning based semantic segmentation schemes. Since obtaining pixel-level labels is extremely expensive, supervising deep semantic segmentation networks using low-cost weak annotations has been an attractive research problem in recent years. In this work, we explore the potential of Constrained Dominant Sets (CDS) for…
▽ More
The availability of large-scale data sets is an essential pre-requisite for deep learning based semantic segmentation schemes. Since obtaining pixel-level labels is extremely expensive, supervising deep semantic segmentation networks using low-cost weak annotations has been an attractive research problem in recent years. In this work, we explore the potential of Constrained Dominant Sets (CDS) for generating multi-labeled full mask predictions to train a fully convolutional network (FCN) for semantic segmentation. Our experimental results show that using CDS's yields higher-quality mask predictions compared to methods that have been adopted in the literature for the same purpose.
△ Less
Submitted 20 September, 2019;
originally announced September 2019.
-
Unsupervised Domain Adaptation using Graph Transduction Games
Authors:
Sebastiano Vascon,
Sinem Aslan,
Alessandro Torcinovich,
Twan van Laarhoven,
Elena Marchiori,
Marcello Pelillo
Abstract:
Unsupervised domain adaptation (UDA) amounts to assigning class labels to the unlabeled instances of a dataset from a target domain, using labeled instances of a dataset from a related source domain. In this paper, we propose to cast this problem in a game-theoretic setting as a non-cooperative game and introduce a fully automatized iterative algorithm for UDA based on graph transduction games (GT…
▽ More
Unsupervised domain adaptation (UDA) amounts to assigning class labels to the unlabeled instances of a dataset from a target domain, using labeled instances of a dataset from a related source domain. In this paper, we propose to cast this problem in a game-theoretic setting as a non-cooperative game and introduce a fully automatized iterative algorithm for UDA based on graph transduction games (GTG). The main advantages of this approach are its principled foundation, guaranteed termination of the iterative algorithms to a Nash equilibrium (which corresponds to a consistent labeling condition) and soft labels quantifying the uncertainty of the label assignment process. We also investigate the beneficial effect of using pseudo-labels from linear classifiers to initialize the iterative process. The performance of the resulting methods is assessed on publicly available object recognition benchmark datasets involving both shallow and deep features. Results of experiments demonstrate the suitability of the proposed game-theoretic approach for solving UDA tasks.
△ Less
Submitted 6 May, 2019;
originally announced May 2019.
-
Global existence and blow-up for weakly coupled systems of semilinear thermoelastic plate equations
Authors:
Halit Sevki Aslan,
Wenhui Chen
Abstract:
We derive $L^p-L^r$ estimates away of the conjugate line for linear thermoelastic plate equations. Up to now, the authors think the results for the semilinear problem are not good enough.
We derive $L^p-L^r$ estimates away of the conjugate line for linear thermoelastic plate equations. Up to now, the authors think the results for the semilinear problem are not good enough.
△ Less
Submitted 2 July, 2019; v1 submitted 1 April, 2019;
originally announced April 2019.
-
Deep Convolutional Generative Adversarial Networks Based Flame Detection in Video
Authors:
Süleyman Aslan,
Uğur Güdükbay,
B. Uğur Töreyin,
A. Enis Çetin
Abstract:
Real-time flame detection is crucial in video based surveillance systems. We propose a vision-based method to detect flames using Deep Convolutional Generative Adversarial Neural Networks (DCGANs). Many existing supervised learning approaches using convolutional neural networks do not take temporal information into account and require substantial amount of labeled data. In order to have a robust r…
▽ More
Real-time flame detection is crucial in video based surveillance systems. We propose a vision-based method to detect flames using Deep Convolutional Generative Adversarial Neural Networks (DCGANs). Many existing supervised learning approaches using convolutional neural networks do not take temporal information into account and require substantial amount of labeled data. In order to have a robust representation of sequences with and without flame, we propose a two-stage training of a DCGAN exploiting spatio-temporal flame evolution. Our training framework includes the regular training of a DCGAN with real spatio-temporal images, namely, temporal slice images, and noise vectors, and training the discriminator separately using the temporal flame images without the generator. Experimental results show that the proposed method effectively detects flame in video with negligible false positive rates in real-time.
△ Less
Submitted 5 February, 2019;
originally announced February 2019.
-
Detecting Behavioral Engagement of Students in the Wild Based on Contextual and Visual Data
Authors:
Eda Okur,
Nese Alyuz,
Sinem Aslan,
Utku Genc,
Cagri Tanriover,
Asli Arslan Esme
Abstract:
To investigate the detection of students' behavioral engagement (On-Task vs. Off-Task), we propose a two-phase approach in this study. In Phase 1, contextual logs (URLs) are utilized to assess active usage of the content platform. If there is active use, the appearance information is utilized in Phase 2 to infer behavioral engagement. Incorporating the contextual information improved the overall F…
▽ More
To investigate the detection of students' behavioral engagement (On-Task vs. Off-Task), we propose a two-phase approach in this study. In Phase 1, contextual logs (URLs) are utilized to assess active usage of the content platform. If there is active use, the appearance information is utilized in Phase 2 to infer behavioral engagement. Incorporating the contextual information improved the overall F1-scores from 0.77 to 0.82. Our cross-classroom and cross-platform experiments showed the proposed generic and multi-modal behavioral engagement models' applicability to a different set of students or different subject areas.
△ Less
Submitted 15 January, 2019;
originally announced January 2019.
-
Unobtrusive and Multimodal Approach for Behavioral Engagement Detection of Students
Authors:
Nese Alyuz,
Eda Okur,
Utku Genc,
Sinem Aslan,
Cagri Tanriover,
Asli Arslan Esme
Abstract:
We propose a multimodal approach for detection of students' behavioral engagement states (i.e., On-Task vs. Off-Task), based on three unobtrusive modalities: Appearance, Context-Performance, and Mouse. Final behavioral engagement states are achieved by fusing modality-specific classifiers at the decision level. Various experiments were conducted on a student dataset collected in an authentic class…
▽ More
We propose a multimodal approach for detection of students' behavioral engagement states (i.e., On-Task vs. Off-Task), based on three unobtrusive modalities: Appearance, Context-Performance, and Mouse. Final behavioral engagement states are achieved by fusing modality-specific classifiers at the decision level. Various experiments were conducted on a student dataset collected in an authentic classroom.
△ Less
Submitted 15 January, 2019;
originally announced January 2019.
-
The Importance of Socio-Cultural Differences for Annotating and Detecting the Affective States of Students
Authors:
Eda Okur,
Sinem Aslan,
Nese Alyuz,
Asli Arslan Esme,
Ryan S. Baker
Abstract:
The development of real-time affect detection models often depends upon obtaining annotated data for supervised learning by employing human experts to label the student data. One open question in annotating affective data for affect detection is whether the labelers (i.e., human experts) need to be socio-culturally similar to the students being labeled, as this impacts the cost feasibility of obta…
▽ More
The development of real-time affect detection models often depends upon obtaining annotated data for supervised learning by employing human experts to label the student data. One open question in annotating affective data for affect detection is whether the labelers (i.e., human experts) need to be socio-culturally similar to the students being labeled, as this impacts the cost feasibility of obtaining the labels. In this study, we investigate the following research questions: For affective state annotation, how does the socio-cultural background of human expert labelers, compared to the subjects, impact the degree of consensus and distribution of affective states obtained? Secondly, how do differences in labeler background impact the performance of affect detection models that are trained using these labels?
△ Less
Submitted 11 January, 2019;
originally announced January 2019.
-
Compressively Sensed Image Recognition
Authors:
Aysen Degerli,
Sinem Aslan,
Mehmet Yamac,
Bulent Sankur,
Moncef Gabbouj
Abstract:
Compressive Sensing (CS) theory asserts that sparse signal reconstruction is possible from a small number of linear measurements. Although CS enables low-cost linear sampling, it requires non-linear and costly reconstruction. Recent literature works show that compressive image classification is possible in CS domain without reconstruction of the signal. In this work, we introduce a DCT base method…
▽ More
Compressive Sensing (CS) theory asserts that sparse signal reconstruction is possible from a small number of linear measurements. Although CS enables low-cost linear sampling, it requires non-linear and costly reconstruction. Recent literature works show that compressive image classification is possible in CS domain without reconstruction of the signal. In this work, we introduce a DCT base method that extracts binary discriminative features directly from CS measurements. These CS measurements can be obtained by using (i) a random or a pseudo-random measurement matrix, or (ii) a measurement matrix whose elements are learned from the training data to optimize the given classification task. We further introduce feature fusion by concatenating Bag of Words (BoW) representation of our binary features with one of the two state-of-the-art CNN-based feature vectors. We show that our fused feature outperforms the state-of-the-art in both cases.
△ Less
Submitted 15 October, 2018;
originally announced October 2018.
-
Ancient Coin Classification Using Graph Transduction Games
Authors:
Sinem Aslan,
Sebastiano Vascon,
Marcello Pelillo
Abstract:
Recognizing the type of an ancient coin requires theoretical expertise and years of experience in the field of numismatics. Our goal in this work is automatizing this time consuming and demanding task by a visual classification framework. Specifically, we propose to model ancient coin image classification using Graph Transduction Games (GTG). GTG casts the classification problem as a non-cooperati…
▽ More
Recognizing the type of an ancient coin requires theoretical expertise and years of experience in the field of numismatics. Our goal in this work is automatizing this time consuming and demanding task by a visual classification framework. Specifically, we propose to model ancient coin image classification using Graph Transduction Games (GTG). GTG casts the classification problem as a non-cooperative game where the players (the coin images) decide their strategies (class labels) according to the choices made by the others, which results with a global consensus at the final labeling. Experiments are conducted on the only publicly available dataset which is composed of 180 images of 60 types of Roman coins. We demonstrate that our approach outperforms the literature work on the same dataset with the classification accuracy of 73.6% and 87.3% when there are one and two images per class in the training set, respectively.
△ Less
Submitted 2 October, 2018;
originally announced October 2018.
-
The influence of oscillations on energy estimates for damped wave models with time-dependent propagation speed and dissipation
Authors:
Halit Sevki Aslan,
Michael Reissig
Abstract:
The aim of this paper is to derive higher order energy estimates for solutions to the Cauchy problem for damped wave models with time-dependent propagation speed and dissipation. The model of interest is \begin{equation*} u_{tt}-λ^2(t)ω^2(t)Δu +ρ(t)ω(t)u_t=0, \quad u(0,x)=u_0(x), \,\, u_t(0,x)=u_1(x). \end{equation*} The coefficients $λ=λ(t)$ and $ρ=ρ(t)$ are shape functions and $ω=ω(t)$ is an osc…
▽ More
The aim of this paper is to derive higher order energy estimates for solutions to the Cauchy problem for damped wave models with time-dependent propagation speed and dissipation. The model of interest is \begin{equation*} u_{tt}-λ^2(t)ω^2(t)Δu +ρ(t)ω(t)u_t=0, \quad u(0,x)=u_0(x), \,\, u_t(0,x)=u_1(x). \end{equation*} The coefficients $λ=λ(t)$ and $ρ=ρ(t)$ are shape functions and $ω=ω(t)$ is an oscillating function. If $ω(t)\equiv1$ and $ρ(t)u_t$ is an "effective" dissipation term, then $L^2-L^2$ energy estimates are proved in [2]. In contrast, the main goal of the present paper is to generalize the previous results to coefficients including an oscillating function in the time-dependent coefficients. We will explain how the interplay between the shape functions and oscillating behavior of the coefficient will influence energy estimates.
△ Less
Submitted 28 August, 2019; v1 submitted 26 September, 2018;
originally announced September 2018.
-
Randomized Approach to Nonlinear Inversion Combining Simultaneous Random and Optimized Sources and Detectors
Authors:
Selin Aslan,
Eric de Sturler,
Misha E. Kilmer
Abstract:
In partial differential equations-based (PDE-based) inverse problems with many measurements, many large-scale discretized PDEs must be solved for each evaluation of the misfit or objective function. In the nonlinear case, evaluating the Jacobian requires solving an additional set of systems. This leads to a tremendous computational cost, and this is by far the dominant cost for these problems. Sev…
▽ More
In partial differential equations-based (PDE-based) inverse problems with many measurements, many large-scale discretized PDEs must be solved for each evaluation of the misfit or objective function. In the nonlinear case, evaluating the Jacobian requires solving an additional set of systems. This leads to a tremendous computational cost, and this is by far the dominant cost for these problems. Several authors have proposed randomization and stochastic programming techniques to drastically reduce the number of system solves by estimating the objective function using only a few appropriately chosen random linear combinations of the sources. While some have reported good solution quality at a greatly reduced cost, for our problem of interest, diffuse optical tomography, the approach often does not lead to sufficiently accurate solutions.
We propose two improvements. First, to efficiently exploit Newton-type methods, we modify the stochastic estimates to include random linear combinations of detectors, drastically reducing the number of adjoint solves. Second, after solving to a modest tolerance, we compute a few simultaneous sources and detectors that maximize the Frobenius norm of the sampled Jacobian to improve the rate of convergence and obtain more accurate solutions. We complement these optimized simultaneous sources and detectors by random simultaneous sources and detectors constrained to a complementary subspace. Our approach leads to solutions of the same quality as obtained using all sources and detectors but at a greatly reduced computational cost, as the number of large-scale linear systems to be solved is significantly reduced.
△ Less
Submitted 17 July, 2018; v1 submitted 17 June, 2017;
originally announced June 2017.
-
Temporal Clustering of Time Series via Threshold Autoregressive Models: Application to Commodity Prices
Authors:
Sipan Aslan,
Ceylan Yozgatligil,
Cem Iyigun
Abstract:
This study aimed to find temporal clusters for several commodity prices using the threshold non-linear autoregressive model. It is expected that the process of determining the commodity groups that are time-dependent will advance the current knowledge about the dynamics of co-moving and coherent prices, and can serve as a basis for multivariate time series analyses. The clustering of commodity pri…
▽ More
This study aimed to find temporal clusters for several commodity prices using the threshold non-linear autoregressive model. It is expected that the process of determining the commodity groups that are time-dependent will advance the current knowledge about the dynamics of co-moving and coherent prices, and can serve as a basis for multivariate time series analyses. The clustering of commodity prices was examined using the proposed clustering approach based on time series models to incorporate the time varying properties of price series into the clustering scheme. Accordingly, the primary aim in this study was grou** time series according to the similarity between their Data Generating Mechanisms (DGMs) rather than comparing pattern similarities in the time series traces. The approximation to the DGM of each series was accomplished using threshold autoregressive models, which are recognized for their ability to represent nonlinear features in time series, such as abrupt changes, time-irreversibility and regime-shifting behavior. Through the use of the proposed approach, one can determine and monitor the set of co-moving time series variables across the time dimension. Furthermore, generating a time varying commodity price index and sub-indexes can become possible. Consequently, we conducted a simulation study to assess the effectiveness of the proposed clustering approach and the results are presented for both the simulated and real data sets.
△ Less
Submitted 3 May, 2016;
originally announced May 2016.