-
Enhancing Predictive Accuracy in Pharmaceutical Sales Through An Ensemble Kernel Gaussian Process Regression Approach
Authors:
Shahin Mirshekari,
Mohammadreza Moradi,
Hossein Jafari,
Mehdi Jafari,
Mohammad Ensaf
Abstract:
This research employs Gaussian Process Regression (GPR) with an ensemble kernel, integrating Exponential Squared, Revised Matérn, and Rational Quadratic kernels to analyze pharmaceutical sales data. Bayesian optimization was used to identify optimal kernel weights: 0.76 for Exponential Squared, 0.21 for Revised Matérn, and 0.13 for Rational Quadratic. The ensemble kernel demonstrated superior perf…
▽ More
This research employs Gaussian Process Regression (GPR) with an ensemble kernel, integrating Exponential Squared, Revised Matérn, and Rational Quadratic kernels to analyze pharmaceutical sales data. Bayesian optimization was used to identify optimal kernel weights: 0.76 for Exponential Squared, 0.21 for Revised Matérn, and 0.13 for Rational Quadratic. The ensemble kernel demonstrated superior performance in predictive accuracy, achieving an \( R^2 \) score near 1.0, and significantly lower values in Mean Squared Error (MSE), Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE). These findings highlight the efficacy of ensemble kernels in GPR for predictive analytics in complex pharmaceutical sales datasets.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet
Authors:
Hossein Jafari,
Karim Faez,
Hamidreza Amindavar
Abstract:
Lung cancer is highly lethal, emphasizing the critical need for early detection. However, identifying lung nodules poses significant challenges for radiologists, who rely heavily on their expertise for accurate diagnosis. To address this issue, computer-aided diagnosis (CAD) systems based on machine learning techniques have emerged to assist doctors in identifying lung nodules from computed tomogr…
▽ More
Lung cancer is highly lethal, emphasizing the critical need for early detection. However, identifying lung nodules poses significant challenges for radiologists, who rely heavily on their expertise for accurate diagnosis. To address this issue, computer-aided diagnosis (CAD) systems based on machine learning techniques have emerged to assist doctors in identifying lung nodules from computed tomography (CT) scans. Unfortunately, existing networks in this domain often suffer from computational complexity, leading to high rates of false negatives and false positives, limiting their effectiveness. To address these challenges, we present an innovative model that harnesses the strengths of both convolutional neural networks and vision transformers. Inspired by object detection in videos, we treat each 3D CT image as a video, individual slices as frames, and lung nodules as objects, enabling a time-series application. The primary objective of our work is to overcome hardware limitations during model training, allowing for efficient processing of 2D data while utilizing inter-slice information for accurate identification based on 3D image context. We validated the proposed network by applying a 10-fold cross-validation technique to the publicly available Lung Nodule Analysis 2016 dataset. Our proposed architecture achieves an average sensitivity criterion of 97.84% and a competition performance metrics (CPM) of 96.0% with few parameters. Comparative analysis with state-of-the-art advancements in lung nodule identification demonstrates the significant accuracy achieved by our proposed model.
△ Less
Submitted 14 October, 2023; v1 submitted 5 October, 2023;
originally announced October 2023.
-
Multi-Stage NMPC for a MAV based Collision Free Navigation under Varying Communication Delays
Authors:
Andreas Papadimitriou,
Hedyeh Jafari,
Sina Sharif Mansouri,
George Nikolakopoulos
Abstract:
Time delays in communication networks are one of the main concerns in deploying robots with computation boards on the edge. This article proposes a multi-stage Nonlinear Model Predictive Control (NMPC) that is capable of handling varying network-induced time delays for establishing a control framework being able to guarantee collision-free Micro Aerial Vehicles (MAVs) navigation. This study introd…
▽ More
Time delays in communication networks are one of the main concerns in deploying robots with computation boards on the edge. This article proposes a multi-stage Nonlinear Model Predictive Control (NMPC) that is capable of handling varying network-induced time delays for establishing a control framework being able to guarantee collision-free Micro Aerial Vehicles (MAVs) navigation. This study introduces a novel approach that considers different sampling times by a tree of discretization scenarios contrary to the existing typical multi-stage NMPC where system uncertainties are modeled by a tree of scenarios. Additionally, the proposed method considers adaptive weights for the multi-stage NMPC scenarios based on the probability of time delays in the communication link. As a result of the multi-stage NMPC, the obtained optimal control action is valid for multiple sampling times. Finally, the overall effectiveness of the proposed novel control framework is demonstrated in various tests and different simulation environments.
△ Less
Submitted 7 August, 2022;
originally announced August 2022.
-
Cyrus 2D Simulation Team Description Paper 2016
Authors:
Nader Zare,
Ashkan Keshavarzi,
Seyed Ehsan Beheshtian,
Hadi Mowla,
Aryan Akbarpour,
Hossein Jafari,
Keyvan Arab Baraghi,
Mohammad Amin Zarifi,
Reza Javidan
Abstract:
This description includes some explanation about algorithms and also algorithms that are being implemented by Cyrus team members. The objectives of this description are to express a brief explanation about shoot, block, mark and defensive decision will be given. It also explained about the parts that has been implemented. The base code that Cyrus used is agent 3.11.
This description includes some explanation about algorithms and also algorithms that are being implemented by Cyrus team members. The objectives of this description are to express a brief explanation about shoot, block, mark and defensive decision will be given. It also explained about the parts that has been implemented. The base code that Cyrus used is agent 3.11.
△ Less
Submitted 8 February, 2022;
originally announced February 2022.
-
COfEE: A Comprehensive Ontology for Event Extraction from text
Authors:
Ali Balali,
Masoud Asadpour,
Seyed Hossein Jafari
Abstract:
Data is published on the web over time in great volumes, but majority of the data is unstructured, making it hard to understand and difficult to interpret. Information Extraction (IE) methods obtain structured information from unstructured data. One of the challenging IE tasks is Event Extraction (EE) which seeks to derive information about specific incidents and their actors from the text. EE is…
▽ More
Data is published on the web over time in great volumes, but majority of the data is unstructured, making it hard to understand and difficult to interpret. Information Extraction (IE) methods obtain structured information from unstructured data. One of the challenging IE tasks is Event Extraction (EE) which seeks to derive information about specific incidents and their actors from the text. EE is useful in many domains such as building a knowledge base, information retrieval and summarization. In the past decades, some event ontologies like ACE, CAMEO and ICEWS were developed to define event forms, actors and dimensions of events observed in the text. These event ontologies still have some shortcomings such as covering only a few topics like political events, having inflexible structure in defining argument roles and insufficient gold-standard data. To address these concerns, we propose an event ontology, namely COfEE, that incorporates both expert domain knowledge and a data-driven approach for identifying events from text. COfEE consists of two hierarchy levels (event types and event sub-types) that include new categories relating to environmental issues, cyberspace and criminal activity which need to be monitored instantly. Also, dynamic roles according to each event sub-type are defined to capture various dimensions of events. In a follow-up experiment, the proposed ontology is evaluated on Wikipedia events, and it is shown to be general and comprehensive. Moreover, in order to facilitate the preparation of gold-standard data for event extraction, a language-independent online tool is presented based on COfEE. A gold-standard dataset annotated by 10 human experts is also prepared consisting 24K news articles in Persian language. Finally, we present a supervised method based on deep learning techniques to automatically extract relevant events and corresponding actors.
△ Less
Submitted 10 November, 2021; v1 submitted 21 July, 2021;
originally announced July 2021.
-
U-LanD: Uncertainty-Driven Video Landmark Detection
Authors:
Mohammad H. Jafari,
Christina Luong,
Michael Tsang,
Ang Nan Gu,
Nathan Van Woudenberg,
Robert Rohling,
Teresa Tsang,
Purang Abolmaesumi
Abstract:
This paper presents U-LanD, a framework for joint detection of key frames and landmarks in videos. We tackle a specifically challenging problem, where training labels are noisy and highly sparse. U-LanD builds upon a pivotal observation: a deep Bayesian landmark detector solely trained on key video frames, has significantly lower predictive uncertainty on those frames vs. other frames in videos. W…
▽ More
This paper presents U-LanD, a framework for joint detection of key frames and landmarks in videos. We tackle a specifically challenging problem, where training labels are noisy and highly sparse. U-LanD builds upon a pivotal observation: a deep Bayesian landmark detector solely trained on key video frames, has significantly lower predictive uncertainty on those frames vs. other frames in videos. We use this observation as an unsupervised signal to automatically recognize key frames on which we detect landmarks. As a test-bed for our framework, we use ultrasound imaging videos of the heart, where sparse and noisy clinical labels are only available for a single frame in each video. Using data from 4,493 patients, we demonstrate that U-LanD can exceedingly outperform the state-of-the-art non-Bayesian counterpart by a noticeable absolute margin of 42% in R2 score, with almost no overhead imposed on the model size. Our approach is generic and can be potentially applied to other challenging data with noisy and sparse training labels.
△ Less
Submitted 2 February, 2021;
originally announced February 2021.
-
Enhanced Balancing GAN: Minority-class Image Generation
Authors:
Gaofeng Huang,
Amir H. Jafari
Abstract:
Generative adversarial networks (GANs) are one of the most powerful generative models, but always require a large and balanced dataset to train. Traditional GANs are not applicable to generate minority-class images in a highly imbalanced dataset. Balancing GAN (BAGAN) is proposed to mitigate this problem, but it is unstable when images in different classes look similar, e.g. flowers and cells. In…
▽ More
Generative adversarial networks (GANs) are one of the most powerful generative models, but always require a large and balanced dataset to train. Traditional GANs are not applicable to generate minority-class images in a highly imbalanced dataset. Balancing GAN (BAGAN) is proposed to mitigate this problem, but it is unstable when images in different classes look similar, e.g. flowers and cells. In this work, we propose a supervised autoencoder with an intermediate embedding model to disperse the labeled latent vectors. With the improved autoencoder initialization, we also build an architecture of BAGAN with gradient penalty (BAGAN-GP). Our proposed model overcomes the unstable issue in original BAGAN and converges faster to high quality generations. Our model achieves high performance on the imbalanced scale-down version of MNIST Fashion, CIFAR-10, and one small-scale medical image dataset.
△ Less
Submitted 31 October, 2020;
originally announced November 2020.
-
Split-Merge Pooling
Authors:
Omid Hosseini Jafari,
Carsten Rother
Abstract:
There are a variety of approaches to obtain a vast receptive field with convolutional neural networks (CNNs), such as pooling or striding convolutions. Most of these approaches were initially designed for image classification and later adapted to dense prediction tasks, such as semantic segmentation. However, the major drawback of this adaptation is the loss of spatial information. Even the popula…
▽ More
There are a variety of approaches to obtain a vast receptive field with convolutional neural networks (CNNs), such as pooling or striding convolutions. Most of these approaches were initially designed for image classification and later adapted to dense prediction tasks, such as semantic segmentation. However, the major drawback of this adaptation is the loss of spatial information. Even the popular dilated convolution approach, which in theory is able to operate with full spatial resolution, needs to subsample features for large image sizes in order to make the training and inference tractable. In this work, we introduce Split-Merge pooling to fully preserve the spatial information without any subsampling. By applying Split-Merge pooling to deep networks, we achieve, at the same time, a very large receptive field. We evaluate our approach for dense semantic segmentation of large image sizes taken from the Cityscapes and GTA-5 datasets. We demonstrate that by replacing max-pooling and striding convolutions with our split-merge pooling, we are able to improve the accuracy of different variations of ResNet significantly.
△ Less
Submitted 13 June, 2020;
originally announced June 2020.
-
A Study into Echocardiography View Conversion
Authors:
Amir H. Abdi,
Mohammad H. Jafari,
Sidney Fels,
Theresa Tsang,
Purang Abolmaesumi
Abstract:
Transthoracic echo is one of the most common means of cardiac studies in the clinical routines. During the echo exam, the sonographer captures a set of standard cross sections (echo views) of the heart. Each 2D echo view cuts through the 3D cardiac geometry via a unique plane. Consequently, different views share some limited information. In this work, we investigate the feasibility of generating a…
▽ More
Transthoracic echo is one of the most common means of cardiac studies in the clinical routines. During the echo exam, the sonographer captures a set of standard cross sections (echo views) of the heart. Each 2D echo view cuts through the 3D cardiac geometry via a unique plane. Consequently, different views share some limited information. In this work, we investigate the feasibility of generating a 2D echo view using another view based on adversarial generative models. The objective optimized to train the view-conversion model is based on the ideas introduced by LSGAN, PatchGAN and Conditional GAN (cGAN). The size and length of the left ventricle in the generated target echo view is compared against that of the target ground-truth to assess the validity of the echo view conversion. Results show that there is a correlation of 0.50 between the LV areas and 0.49 between the LV lengths of the generated target frames and the real target frames.
△ Less
Submitted 5 December, 2019;
originally announced December 2019.
-
Growth Dynamics of Value and Cost Trade-off in Temporal Networks
Authors:
Sheida Hasani,
Razieh Masoomi,
Jamshid Ardalankia,
Mohammadbashir Sedighi,
Hamid Jafari
Abstract:
The question is: What does happen to the real-world networks which cause them not to grow permanently? The idea here is that real-world networks have to pay the cost of growth. We investigate the growth and trade-off between value and cost in the networks with cost and preferential attachment together. Since the preferential attachment in the BA model does not consider any stop against the infinit…
▽ More
The question is: What does happen to the real-world networks which cause them not to grow permanently? The idea here is that real-world networks have to pay the cost of growth. We investigate the growth and trade-off between value and cost in the networks with cost and preferential attachment together. Since the preferential attachment in the BA model does not consider any stop against the infinite growth of networks, we introduce a modified version of preferential attachment of the BA model. This idea makes sense because the growth of real networks may be finite. In the present study, by combining preferential attachment in the science of temporal networks (interval graphs), and, the first-order differential equations of value and cost of making links, the future equilibrium of an evolving network is illustrated. During the process of achieving a winning position, the variables against growth such as the competition cost, besides the internally structural cost may emerge. In the end, by applying this modified model, we found the circumstances in which a trade-off between value and cost emerges.
△ Less
Submitted 14 August, 2020; v1 submitted 29 August, 2019;
originally announced August 2019.
-
SimBins: An information-theoretic approach to link prediction in real multiplex networks
Authors:
Seyed Hossein Jafari,
Amir Mahdi Abdolhosseini-Qomi,
Maseud Rahgozar,
Masoud Asadpour,
Naser Yazdani
Abstract:
The entities of real-world networks are connected via different types of connections (i.e. layers). The task of link prediction in multiplex networks is about finding missing connections based on both intra-layer and inter-layer correlations. Our observations confirm that that in a wide range of real-world multiplex networks, from social to biological and technological, a positive correlation exis…
▽ More
The entities of real-world networks are connected via different types of connections (i.e. layers). The task of link prediction in multiplex networks is about finding missing connections based on both intra-layer and inter-layer correlations. Our observations confirm that that in a wide range of real-world multiplex networks, from social to biological and technological, a positive correlation exists between connection probability in one layer and similarity in other layers. Accordingly, a similarity-based automatic general-purpose multiplex link prediction method -- SimBins -- is devised that quantifies the amount of connection uncertainty based on observed inter-layer correlations in a multiplex network. Moreover, SimBins enhances the prediction quality in the target layer by incorporating the effect of link overlap across layers. Applied to various datasets from different domains, SimBins proves to be robust and superior than compared methods in majority of experimented cases in terms of accuracy of link prediction. Furthermore, it is discussed that SimBins imposes minor computational overhead to the base similarity measures making it a potentially fast method, suitable for large-scale multiplex networks.
△ Less
Submitted 4 December, 2020; v1 submitted 27 August, 2019;
originally announced August 2019.
-
Link Prediction in Real-World Multiplex Networks via Layer Reconstruction Method
Authors:
Amir Mahdi Abdolhosseini-Qomi,
Seyed Hossein Jafari,
Amirheckmat Taghizadeh,
Naser Yazdani,
Masoud Asadpour,
Masoud Rahgozar
Abstract:
A large body of research on link prediction problem is devoted to finding missing links in single-layer (simplex) networks. The proposed link prediction methods compute a similarity measure between unconnected node pairs based on the observed structure of the network. However, extension of notion of similarity to multiplex networks is a two-fold challenge. The layers of real-world multiplex networ…
▽ More
A large body of research on link prediction problem is devoted to finding missing links in single-layer (simplex) networks. The proposed link prediction methods compute a similarity measure between unconnected node pairs based on the observed structure of the network. However, extension of notion of similarity to multiplex networks is a two-fold challenge. The layers of real-world multiplex networks do not have the same organization yet are not of totally different organizations. So, it should be determined that how similar are the layers of a multiplex network. On the other hand, it is needed to be known that how similar layers can contribute in link prediction task on a target layer with missing links. Eigenvectors are known to well reflect the structural features of networks. Therefore, two layers of a multiplex network are similar w.r.t. structural features if they share similar eigenvectors. Experiments show that layers of real-world multiplex networks are similar w.r.t. structural features and the value of similarity is far beyond their randomized counterparts. Furthermore, it is shown that missing links are highly predictable if their addition or removal do not significantly change the network structural features. Otherwise, if the change is significant a similar copy of structural features may come to help. Based on this concept, Layer Reconstruction Method (LRM) finds the best reconstruction of the observed structure of the target layer with structural features of other similar layers. Experiments on real multiplex networks from different disciplines show that this method benefits from information redundancy in the networks and helps the performance of link prediction to stay robust even under high fraction of missing links.
△ Less
Submitted 22 June, 2019;
originally announced June 2019.
-
Deep Object Co-Segmentation
Authors:
Weihao Li,
Omid Hosseini Jafari,
Carsten Rother
Abstract:
This work presents a deep object co-segmentation (DOCS) approach for segmenting common objects of the same class within a pair of images. This means that the method learns to ignore common, or uncommon, background stuff and focuses on objects. If multiple object classes are presented in the image pair, they are jointly extracted as foreground. To address this task, we propose a CNN-based Siamese e…
▽ More
This work presents a deep object co-segmentation (DOCS) approach for segmenting common objects of the same class within a pair of images. This means that the method learns to ignore common, or uncommon, background stuff and focuses on objects. If multiple object classes are presented in the image pair, they are jointly extracted as foreground. To address this task, we propose a CNN-based Siamese encoder-decoder architecture. The encoder extracts high-level semantic features of the foreground objects, a mutual correlation layer detects the common objects, and finally, the decoder generates the output foreground masks for each image. To train our model, we compile a large object co-segmentation dataset consisting of image pairs from the PASCAL VOC dataset with common objects masks. We evaluate our approach on commonly used datasets for co-segmentation tasks and observe that our approach consistently outperforms competing methods, for both seen and unseen object classes.
△ Less
Submitted 28 May, 2019; v1 submitted 17 April, 2018;
originally announced April 2018.
-
Dense Pooling layers in Fully Convolutional Network for Skin Lesion Segmentation
Authors:
Ebrahim Nasr-Esfahani,
Shima Rafiei,
Mohammad H. Jafari,
Nader Karimi,
James S. Wrobel,
S. M. Reza Soroushmehr,
Shadrokh Samavi,
Kayvan Najarian
Abstract:
One of the essential tasks in medical image analysis is segmentation and accurate detection of borders. Lesion segmentation in skin images is an essential step in the computerized detection of skin cancer. However, many of the state-of-the-art segmentation methods have deficiencies in their border detection phase. In this paper, a new class of fully convolutional network is proposed, with new dens…
▽ More
One of the essential tasks in medical image analysis is segmentation and accurate detection of borders. Lesion segmentation in skin images is an essential step in the computerized detection of skin cancer. However, many of the state-of-the-art segmentation methods have deficiencies in their border detection phase. In this paper, a new class of fully convolutional network is proposed, with new dense pooling layers for segmentation of lesion regions in skin images. This network leads to highly accurate segmentation of lesions on skin lesion datasets which outperforms state-of-the-art algorithms in the skin lesion segmentation.
△ Less
Submitted 31 August, 2019; v1 submitted 29 December, 2017;
originally announced December 2017.
-
iPose: Instance-Aware 6D Pose Estimation of Partly Occluded Objects
Authors:
Omid Hosseini Jafari,
Siva Karthik Mustikovela,
Karl Pertsch,
Eric Brachmann,
Carsten Rother
Abstract:
We address the task of 6D pose estimation of known rigid objects from single input images in scenarios where the objects are partly occluded. Recent RGB-D-based methods are robust to moderate degrees of occlusion. For RGB inputs, no previous method works well for partly occluded objects. Our main contribution is to present the first deep learning-based system that estimates accurate poses for part…
▽ More
We address the task of 6D pose estimation of known rigid objects from single input images in scenarios where the objects are partly occluded. Recent RGB-D-based methods are robust to moderate degrees of occlusion. For RGB inputs, no previous method works well for partly occluded objects. Our main contribution is to present the first deep learning-based system that estimates accurate poses for partly occluded objects from RGB-D and RGB input. We achieve this with a new instance-aware pipeline that decomposes 6D object pose estimation into a sequence of simpler steps, where each step removes specific aspects of the problem. The first step localizes all known objects in the image using an instance segmentation network, and hence eliminates surrounding clutter and occluders. The second step densely maps pixels to 3D object surface positions, so called object coordinates, using an encoder-decoder network, and hence eliminates object appearance. The third, and final, step predicts the 6D pose using geometric optimization. We demonstrate that we significantly outperform the state-of-the-art for pose estimation of partly occluded objects for both RGB and RGB-D input.
△ Less
Submitted 18 June, 2018; v1 submitted 5 December, 2017;
originally announced December 2017.
-
Performance Evaluation of Spatial Complementary Code Keying Modulation in MIMO Systems
Authors:
A. H. Jafari,
T. O'Farrell
Abstract:
Spatial complementary code keying modulation (SCCKM) is proposed as a novel block coding modulation scheme. An input binary sequence is modulated based on the different lengths of complementary code keying (CCK) modulation and then spread across the transmit antennas (spatial domain) in a multiple input multiple output (MIMO) system exploiting orthogonal frequency division multiplexing (OFDM). At…
▽ More
Spatial complementary code keying modulation (SCCKM) is proposed as a novel block coding modulation scheme. An input binary sequence is modulated based on the different lengths of complementary code keying (CCK) modulation and then spread across the transmit antennas (spatial domain) in a multiple input multiple output (MIMO) system exploiting orthogonal frequency division multiplexing (OFDM). At the receiver side, zero forcing equalization is applied to the OFDM modulated data to mitigate the effect of the multipath fast fading channel and then followed by maximum likelihood (ML) detection to retrieve the input sequence. The performance of SCCKM in different MIMO systems is compared to that of spatial modulation (SM) as a baseline scheme. Simulation results show that for the same spectral efficiency, SCCKM is able to substantially improve the bit error rate (BER).
△ Less
Submitted 16 September, 2017;
originally announced September 2017.
-
Ultra-Dense Networks: A New Look at the Proportional Fair Scheduler
Authors:
Ming Ding,
David Lopez Perez,
Amir H. Jafari,
Guoqiang Mao,
Zihuai Lin
Abstract:
In this paper, we theoretically study the proportional fair (PF) scheduler in the context of ultra-dense networks (UDNs). Analytical results are obtained for the coverage probability and the area spectral efficiency (ASE) performance of dense small cell networks (SCNs) with the PF scheduler employed at base stations (BSs). The key point of our analysis is that the typical user is no longer a rando…
▽ More
In this paper, we theoretically study the proportional fair (PF) scheduler in the context of ultra-dense networks (UDNs). Analytical results are obtained for the coverage probability and the area spectral efficiency (ASE) performance of dense small cell networks (SCNs) with the PF scheduler employed at base stations (BSs). The key point of our analysis is that the typical user is no longer a random user as assumed in most studies in the literature. Instead, a user with the maximum PF metric is chosen by its serving BS as the typical user. By comparing the previous results of the round-robin (RR) scheduler with our new results of the PF scheduler, we quantify the loss of the multi-user diversity of the PF scheduler with the network densification, which casts a new look at the role of the PF scheduler in UDNs. Our conclusion is that the RR scheduler should be used in UDNs to simplify the radio resource management (RRM).
△ Less
Submitted 26 September, 2017; v1 submitted 26 August, 2017;
originally announced August 2017.
-
Analyzing Modular CNN Architectures for Joint Depth Prediction and Semantic Segmentation
Authors:
Omid Hosseini Jafari,
Oliver Groth,
Alexander Kirillov,
Michael Ying Yang,
Carsten Rother
Abstract:
This paper addresses the task of designing a modular neural network architecture that jointly solves different tasks. As an example we use the tasks of depth estimation and semantic segmentation given a single RGB image. The main focus of this work is to analyze the cross-modality influence between depth and semantic prediction maps on their joint refinement. While most previous works solely focus…
▽ More
This paper addresses the task of designing a modular neural network architecture that jointly solves different tasks. As an example we use the tasks of depth estimation and semantic segmentation given a single RGB image. The main focus of this work is to analyze the cross-modality influence between depth and semantic prediction maps on their joint refinement. While most previous works solely focus on measuring improvements in accuracy, we propose a way to quantify the cross-modality influence. We show that there is a relationship between final accuracy and cross-modality influence, although not a simple linear one. Hence a larger cross-modality influence does not necessarily translate into an improved accuracy. We find that a beneficial balance between the cross-modality influences can be achieved by network architecture and conjecture that this relationship can be utilized to understand different network design choices. Towards this end we propose a Convolutional Neural Network (CNN) architecture that fuses the state of the state-of-the-art results for depth estimation and semantic labeling. By balancing the cross-modality influences between depth and semantic prediction, we achieve improved results for both tasks using the NYU-Depth v2 benchmark.
△ Less
Submitted 26 February, 2017;
originally announced February 2017.
-
Hand Gesture Recognition for Contactless Device Control in Operating Rooms
Authors:
Ebrahim Nasr-Esfahani,
Nader Karimi,
S. M. Reza Soroushmehr,
M. Hossein Jafari,
M. Amin Khorsandi,
Shadrokh Samavi,
Kayvan Najarian
Abstract:
Hand gesture is one of the most important means of touchless communication between human and machines. There is a great interest for commanding electronic equipment in surgery rooms by hand gesture for reducing the time of surgery and the potential for infection. There are challenges in implementation of a hand gesture recognition system. It has to fulfill requirements such as high accuracy and fa…
▽ More
Hand gesture is one of the most important means of touchless communication between human and machines. There is a great interest for commanding electronic equipment in surgery rooms by hand gesture for reducing the time of surgery and the potential for infection. There are challenges in implementation of a hand gesture recognition system. It has to fulfill requirements such as high accuracy and fast response. In this paper we introduce a system of hand gesture recognition based on a deep learning approach. Deep learning is known as an accurate detection model, but its high complexity prevents it from being fabricated as an embedded system. To cope with this problem, we applied some changes in the structure of our work to achieve low complexity. As a result, the proposed method could be implemented on a naive embedded system. Our experiments show that the proposed system results in higher accuracy while having less complexity in comparison with the existing comparable methods.
△ Less
Submitted 13 November, 2016;
originally announced November 2016.
-
Diversity Pulse Shaped Transmission in Ultra-Dense Small Cell Networks
Authors:
Amir H. Jafari,
Vijay Venkateswaran,
David Lopez-Perez,
Jie Zhang
Abstract:
In ultra-dense small cell networks, spatial multiplexing gain is a challenge because of the different propagation conditions. The channels associated with different transmitreceive pairs can be highly correlated due to the i) high probability of line-of-sight (LOS) communication between user equipment (UE) and base station (BS), and ii) insufficient spacing between antenna elements at both UE and…
▽ More
In ultra-dense small cell networks, spatial multiplexing gain is a challenge because of the different propagation conditions. The channels associated with different transmitreceive pairs can be highly correlated due to the i) high probability of line-of-sight (LOS) communication between user equipment (UE) and base station (BS), and ii) insufficient spacing between antenna elements at both UE and BS. In this paper, we propose a novel transmission technique titled Diversity Pulse Shaped Transmission (DPST) to enhance the throughput over the correlated MIMO channels in an ultra-dense small cell network. The fundamental of DPST is to shape transmit signals at adjacent antennas with distinct interpolating filters, introducing pulse sha** diversity. In DPST, each antenna transmits its own data stream with a relative deterministic time offset-which must be a fraction of the symbol period-with respect to the adjacent antenna. The delay is interpolated with the pulse shaped signal generating a virtual MIMO channel that benefits from increased diversity from the receiver perspective. To extract the diversity, the receiver must operate in an over-sampled domain and hence a fractionally spaced equaliser (FSE) is proposed. The joint impact of DPST and FSE helps the receiver to sense a less correlated channel, eventually enhancing the UE's throughput. Moreover, in order to minimise the spatial correlation, we aim to optimise the deterministic fractional delay. Simulation results show that applying DPST to a correlated channel can approximately enhance the UE throughput by 1.93x and 3.76x in 2x2 and 4x4 MIMO systems, respectively.
△ Less
Submitted 4 November, 2016;
originally announced November 2016.
-
Performance Impact of LOS and NLOS Transmissions in Dense Cellular Networks under Rician Fading
Authors:
Amir H. Jafari,
Ming Ding,
David Lopez-Perez,
Jie Zhang
Abstract:
In this paper, we analyse the performance of dense small cell network (SCNs). We derive analytical expressions for both their coverage probability and their area spectral efficiency (ASE) using a path loss model that considers both line-of-sight (LOS) and non-LOS (NLOS) components. Due to the close proximity of small cell base stations (BSs) and user equipments (UEs) in such dense SCNs, we also co…
▽ More
In this paper, we analyse the performance of dense small cell network (SCNs). We derive analytical expressions for both their coverage probability and their area spectral efficiency (ASE) using a path loss model that considers both line-of-sight (LOS) and non-LOS (NLOS) components. Due to the close proximity of small cell base stations (BSs) and user equipments (UEs) in such dense SCNs, we also consider Rician fading as the multi-path fading channel model for both the LOS and NLOS fading transmissions. The Rayleigh fading used in most of existing works analysing dense SCNs is not accurate enough. Then, we compare the performance impact of LOS and NLOS transmissions in dense SCNs under Rician fading with that based on Rayleigh fading. The analysis and the simulation results show that in dense SCNs where LOS transmissions dominate the performance, the impact of Rician fading on the overall system performance is minor, and does not help to address the performance losses brought by the transition of many interfering signals from NLOS to LOS.
△ Less
Submitted 28 October, 2016;
originally announced October 2016.
-
Real-Time RGB-D based Template Matching Pedestrian Detection
Authors:
Omid Hosseini jafari,
Michael Ying Yang
Abstract:
Pedestrian detection is one of the most popular topics in computer vision and robotics. Considering challenging issues in multiple pedestrian detection, we present a real-time depth-based template matching people detector. In this paper, we propose different approaches for training the depth-based template. We train multiple templates for handling issues due to various upper-body orientations of t…
▽ More
Pedestrian detection is one of the most popular topics in computer vision and robotics. Considering challenging issues in multiple pedestrian detection, we present a real-time depth-based template matching people detector. In this paper, we propose different approaches for training the depth-based template. We train multiple templates for handling issues due to various upper-body orientations of the pedestrians and different levels of detail in depth-map of the pedestrians with various distances from the camera. And, we take into account the degree of reliability for different regions of sliding window by proposing the weighted template approach. Furthermore, we combine the depth-detector with an appearance based detector as a verifier to take advantage of the appearance cues for dealing with the limitations of depth data. We evaluate our method on the challenging ETH dataset sequence. We show that our method outperforms the state-of-the-art approaches.
△ Less
Submitted 3 October, 2016;
originally announced October 2016.
-
Extraction of Skin Lesions from Non-Dermoscopic Images Using Deep Learning
Authors:
Mohammad H. Jafari,
Ebrahim Nasr-Esfahani,
Nader Karimi,
S. M. Reza Soroushmehr,
Shadrokh Samavi,
Kayvan Najarian
Abstract:
Melanoma is amongst most aggressive types of cancer. However, it is highly curable if detected in its early stages. Prescreening of suspicious moles and lesions for malignancy is of great importance. Detection can be done by images captured by standard cameras, which are more preferable due to low cost and availability. One important step in computerized evaluation of skin lesions is accurate dete…
▽ More
Melanoma is amongst most aggressive types of cancer. However, it is highly curable if detected in its early stages. Prescreening of suspicious moles and lesions for malignancy is of great importance. Detection can be done by images captured by standard cameras, which are more preferable due to low cost and availability. One important step in computerized evaluation of skin lesions is accurate detection of lesion region, i.e. segmentation of an image into two regions as lesion and normal skin. Accurate segmentation can be challenging due to burdens such as illumination variation and low contrast between lesion and healthy skin. In this paper, a method based on deep neural networks is proposed for accurate extraction of a lesion region. The input image is preprocessed and then its patches are fed to a convolutional neural network (CNN). Local texture and global structure of the patches are processed in order to assign pixels to lesion or normal classes. A method for effective selection of training patches is used for more accurate detection of a lesion border. The output segmentation mask is refined by some post processing operations. The experimental results of qualitative and quantitative evaluations demonstrate that our method can outperform other state-of-the-art algorithms exist in the literature.
△ Less
Submitted 8 September, 2016;
originally announced September 2016.
-
Pulse Sha** Diversity to Enhance Throughput in Ultra-Dense Small Cell Networks
Authors:
Amir H. Jafari,
Vijay Venkateswaran,
David Lopez-Perez,
Jie Zhang
Abstract:
Spatial multiplexing (SM) gains in multiple input multiple output (MIMO) cellular networks are limited when used in combination with ultra-dense small cell networks. This limitation is due to large spatial correlation among channel pairs. More specifically, it is due to i) line-of-sight (LOS) communication between user equipment (UE) and base station (BS) and ii) in-sufficient spacing between ante…
▽ More
Spatial multiplexing (SM) gains in multiple input multiple output (MIMO) cellular networks are limited when used in combination with ultra-dense small cell networks. This limitation is due to large spatial correlation among channel pairs. More specifically, it is due to i) line-of-sight (LOS) communication between user equipment (UE) and base station (BS) and ii) in-sufficient spacing between antenna elements. We propose to shape transmit signals at adjacent antennas with distinct interpolating filters which introduces pulse sha** diversity eventually leading to improved SINR and throughput at the UEs. In this technique, each antenna transmits its own data stream with a relative offset with respect to adjacent antenna. The delay which must be a fraction of symbol period is interpolated with the pulse shaped signal and generates a virtual MIMO channel that leads to improved diversity and SINR at the receiver. Note that non-integral sampling periods with inter-symbol interference (ISI) should be mitigated at the receiver. For this, we propose to use a fractionally spaced equalizer (FSE) designed based on the minimum mean squared error (MMSE) criterion. Simulation results show that for a 2x2 MIMO and with inter-site-distance (ISD) of 50 m, the median received SINR and throughput at the UE improves by a factor of 11 dB and 2x, respectively, which verifies that pulse sha** can overcome poor SM gains in ultra-dense small cell networks.
△ Less
Submitted 12 May, 2016;
originally announced May 2016.
-
Study on Scheduling Techniques for Ultra Dense Small Cell Networks
Authors:
Amir H. Jafari,
David Lopez-Perez,
Ming Ding,
Jie Zhang
Abstract:
The most promising approach to enhance network capacity for the next generation of wireless cellular networks (5G) is densification, which benefits from the extensive spatial reuse of the spectrum and the reduced distance between transmitters and receivers. In this paper, we examine the performance of different schedulers in ultra dense small cell deployments. Due to the stronger line of sight (LO…
▽ More
The most promising approach to enhance network capacity for the next generation of wireless cellular networks (5G) is densification, which benefits from the extensive spatial reuse of the spectrum and the reduced distance between transmitters and receivers. In this paper, we examine the performance of different schedulers in ultra dense small cell deployments. Due to the stronger line of sight (LOS) at low inter-site distances (ISDs), we discuss that the Rician fading channel model is more suitable to study network performance than the Rayleigh one, and model the Rician K factor as a function of distance between the user equipment (UE) and its serving base station (BS). We also construct a cross-correlation shadowing model that takes into account the ISD, and finally investigate potential multi-user diversity gains in ultra dense small cell deployments by comparing the performances of proportional fair (PF) and round robin (RR) schedulers. Our study shows that as network becomes denser, the LOS component starts to dominate the path loss model which significantly increases the interference. Simulation results also show that multi-user diversity is considerably reduced at low ISDs, and thus the PF scheduling gain over the RR one is small, around 10% in terms of cell throughput. As a result, the RR scheduling may be preferred for dense small cell deployments due to its simplicity. Despite both the interference aggravation as well as the multi-user diversity loss, network densification is still worth it from a capacity view point.
△ Less
Submitted 21 June, 2015;
originally announced June 2015.
-
Towards 1 Gbps/UE in Cellular Systems: Understanding Ultra-Dense Small Cell Deployments
Authors:
David Lopez-Perez,
Ming Ding,
Holger Claussen,
Amir H. Jafari
Abstract:
Todays heterogeneous networks comprised of mostly macrocells and indoor small cells will not be able to meet the upcoming traffic demands. Indeed, it is forecasted that at least a 100x network capacity increase will be required to meet the traffic demands in 2020. As a result, vendors and operators are now looking at using every tool at hand to improve network capacity. In this epic campaign, thre…
▽ More
Todays heterogeneous networks comprised of mostly macrocells and indoor small cells will not be able to meet the upcoming traffic demands. Indeed, it is forecasted that at least a 100x network capacity increase will be required to meet the traffic demands in 2020. As a result, vendors and operators are now looking at using every tool at hand to improve network capacity. In this epic campaign, three paradigms are noteworthy, i.e., network densification, the use of higher frequency bands and spectral efficiency enhancement techniques. This paper aims at bringing further common understanding and analysing the potential gains and limitations of these three paradigms, together with the impact of idle mode capabilities at the small cells as well as the user equipment density and distribution in outdoor scenarios. Special attention is paid to network densification and its implications when transitioning to ultra-dense small cell deployments. Simulation results show that network densification with an average inter site distance of 35 m can increase the cell- edge UE throughput by up to 48x, while the use of the 10GHz band with a 500MHz bandwidth can increase the network capacity up to 5x. The use of beamforming with up to 4 antennas per small cell base station lacks behind with cell-edge throughput gains of up to 1.49x. Our study also shows how network densifications reduces multi-user diversity, and thus proportional fair alike schedulers start losing their advantages with respect to round robin ones. The energy efficiency of these ultra-dense small cell deployments is also analysed, indicating the need for energy harvesting approaches to make these deployments energy- efficient. Finally, the top ten challenges to be addressed to bring ultra-dense small cell deployments to reality are also discussed.
△ Less
Submitted 12 March, 2015;
originally announced March 2015.