Search | arXiv e-print repository

Leveraging SPD Matrices on Riemannian Manifolds in Quantum Classical Hybrid Models for Structural Health Monitoring

Authors: Azadeh Alavi, Sanduni Jayasinghe

Abstract: Realtime finite element modeling of bridges assists modern structural health monitoring systems by providing comprehensive insights into structural integrity. This capability is essential for ensuring the safe operation of bridges and preventing sudden catastrophic failures. However, FEM computational cost and the need for realtime analysis pose significant challenges. Additionally, the input data… ▽ More Realtime finite element modeling of bridges assists modern structural health monitoring systems by providing comprehensive insights into structural integrity. This capability is essential for ensuring the safe operation of bridges and preventing sudden catastrophic failures. However, FEM computational cost and the need for realtime analysis pose significant challenges. Additionally, the input data is a 7 dimensional vector, while the output is a 1017 dimensional vector, making accurate and efficient analysis particularly difficult. In this study, we propose a novel hybrid quantum classical Multilayer Perceptron pipeline leveraging Symmetric Positive Definite matrices and Riemannian manifolds for effective data representation. To maintain the integrity of the qubit structure, we utilize SPD matrices, ensuring data representation is well aligned with the quantum computational framework. Additionally, the method leverages polynomial feature expansion to capture nonlinear relationships within the data. The proposed pipeline combines classical fully connected neural network layers with quantum circuit layers to enhance model performance and efficiency. Our experiments focused on various configurations of such hybrid models to identify the optimal structure for accurate and efficient realtime analysis. The best performing model achieved a Mean Squared Error of 0.00031, significantly outperforming traditional methods. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: 3 pages, 1 figure

arXiv:2406.00544 [pdf, other]

Leveraging Knowlegde Graphs for Interpretable Feature Generation

Authors: Mohamed Bouadi, Arta Alavi, Salima Benbernou, Mourad Ouziri

Abstract: The quality of Machine Learning (ML) models strongly depends on the input data, as such Feature Engineering (FE) is often required in ML. In addition, with the proliferation of ML-powered systems, especially in critical contexts, the need for interpretability and explainability becomes increasingly important. Since manual FE is time-consuming and requires case specific knowledge, we propose KRAFT,… ▽ More The quality of Machine Learning (ML) models strongly depends on the input data, as such Feature Engineering (FE) is often required in ML. In addition, with the proliferation of ML-powered systems, especially in critical contexts, the need for interpretability and explainability becomes increasingly important. Since manual FE is time-consuming and requires case specific knowledge, we propose KRAFT, an AutoFE framework that leverages a knowledge graph to guide the generation of interpretable features. Our hybrid AI approach combines a neural generator to transform raw features through a series of transformations and a knowledge-based reasoner to evaluate features interpretability using Description Logics (DL). The generator is trained through Deep Reinforcement Learning (DRL) to maximize the prediction accuracy and the interpretability of the generated features. Extensive experiments on real datasets demonstrate that KRAFT significantly improves accuracy while ensuring a high level of interpretability. △ Less

Submitted 1 June, 2024; originally announced June 2024.

arXiv:2403.09809 [pdf, other]

Self-Supervised Learning for Time Series: Contrastive or Generative?

Authors: Ziyu Liu, Azadeh Alavi, Minyi Li, Xiang Zhang

Abstract: Self-supervised learning (SSL) has recently emerged as a powerful approach to learning representations from large-scale unlabeled data, showing promising results in time series analysis. The self-supervised representation learning can be categorized into two mainstream: contrastive and generative. In this paper, we will present a comprehensive comparative study between contrastive and generative m… ▽ More Self-supervised learning (SSL) has recently emerged as a powerful approach to learning representations from large-scale unlabeled data, showing promising results in time series analysis. The self-supervised representation learning can be categorized into two mainstream: contrastive and generative. In this paper, we will present a comprehensive comparative study between contrastive and generative methods in time series. We first introduce the basic frameworks for contrastive and generative SSL, respectively, and discuss how to obtain the supervision signal that guides the model optimization. We then implement classical algorithms (SimCLR vs. MAE) for each type and conduct a comparative analysis in fair settings. Our results provide insights into the strengths and weaknesses of each approach and offer practical recommendations for choosing suitable SSL methods. We also discuss the implications of our findings for the broader field of representation learning and propose future research directions. All the code and data are released at \url{https://github.com/DL4mHealth/SSL_Comparison}. △ Less

Submitted 14 March, 2024; originally announced March 2024.

Comments: Published at the AI4TS Workshop, IJCAI 2023

arXiv:2310.05229 [pdf, other]

Design Verification of the Quantum Control Stack

Authors: Seyed Amir Alavi, Samin Ishtiaq, Nick Johnson, Rojalin Mishra, Dwaraka Oruganti Nagalakshmi, Asher Pearl, Jan Snoeijs

Abstract: This paper describes the verification of the classical software and hardware stack that is used to control cold atom- and superconducting-based quantum computing hardware. The paper serves both as an introduction to quantum computing and to how classical device verification techniques can be employed there. Two main challenges in building a quantum control stack are generating precise deterministi… ▽ More This paper describes the verification of the classical software and hardware stack that is used to control cold atom- and superconducting-based quantum computing hardware. The paper serves both as an introduction to quantum computing and to how classical device verification techniques can be employed there. Two main challenges in building a quantum control stack are generating precise deterministic-timing operations at the edge and scaled-out processing in the middle layer. Both challenges are to do with a certain kind of functional performance correctness. And, as usual, the design lives under tight power, memory and latency constraints. The quantum control stack is a complex interaction of algorithms, software runtimes and digital hardware. We take inspiration from modern software approaches to engineering, such as continuous integration and hardware automation, to quickly ship experimental features to customers in the field. △ Less

Submitted 8 October, 2023; originally announced October 2023.

Comments: In DVCon Europe 2023

ACM Class: D.1; C.1

arXiv:2110.06340 [pdf]

doi 10.1155/2022/4694567

A novel framework based on deep learning and ANOVA feature selection method for diagnosis of COVID-19 cases from chest X-ray Images

Authors: Hamid Nasiri, Seyyed Ali Alavi

Abstract: The new coronavirus (known as COVID-19) was first identified in Wuhan and quickly spread worldwide, wreaking havoc on the economy and people's everyday lives. Fever, cough, sore throat, headache, exhaustion, muscular aches, and difficulty breathing are all typical symptoms of COVID-19. A reliable detection technique is needed to identify affected individuals and care for them in the early stages o… ▽ More The new coronavirus (known as COVID-19) was first identified in Wuhan and quickly spread worldwide, wreaking havoc on the economy and people's everyday lives. Fever, cough, sore throat, headache, exhaustion, muscular aches, and difficulty breathing are all typical symptoms of COVID-19. A reliable detection technique is needed to identify affected individuals and care for them in the early stages of COVID-19 and reduce the virus's transmission. The most accessible method for COVID-19 identification is RT-PCR; however, due to its time commitment and false-negative results, alternative options must be sought. Indeed, compared to RT-PCR, chest CT scans and chest X-ray images provide superior results. Because of the scarcity and high cost of CT scan equipment, X-ray images are preferable for screening. In this paper, a pre-trained network, DenseNet169, was employed to extract features from X-ray images. Features were chosen by a feature selection method (ANOVA) to reduce computations and time complexity while overcoming the curse of dimensionality to improve predictive accuracy. Finally, selected features were classified by XGBoost. The ChestX-ray8 dataset, which was employed to train and evaluate the proposed method. This method reached 98.72% accuracy for two-class classification (COVID-19, healthy) and 92% accuracy for three-class classification (COVID-19, healthy, pneumonia). △ Less

Submitted 30 September, 2021; originally announced October 2021.

Journal ref: Comput. Intell. Neurosci., vol. 2022, p. 4694567, 2022

arXiv:2110.02222 [pdf, other]

Hybrid Classical-Quantum method for Diabetic Foot Ulcer Classification

Authors: Azadeh Alavi, Hossein Akhoundi

Abstract: Diabetes is a raising problem that affects many people globally. Diabetic patients are at risk of develo** foot ulcer that usually leads to limb amputation, causing significant morbidity, and psychological distress. In order to develop a self monitoring mobile application, it is necessary to be able to classify such ulcers into either of the following classes: Infection, Ischaemia, None, or Both… ▽ More Diabetes is a raising problem that affects many people globally. Diabetic patients are at risk of develo** foot ulcer that usually leads to limb amputation, causing significant morbidity, and psychological distress. In order to develop a self monitoring mobile application, it is necessary to be able to classify such ulcers into either of the following classes: Infection, Ischaemia, None, or Both. In this work, we compare the performance of a classical transfer-learning-based method, with the performance of a hybrid classical-quantum Classifier on diabetic foot ulcer classification task. As such, we merge the pre-trained Xception network with a multi-class variational classifier. Thus, after modifying and re-training the Xception network, we extract the output of a mid-layer and employ it as deep-features presenters of the given images. Finally, we use those deep-features to train multi-class variational classifier, where each classifier is implemented on an individual variational circuit. The method is then evaluated on the blind test set DFUC2021. The results proves that our proposed hybrid classical-quantum Classifier leads to considerable improvement compared to solely relying on transfer learning concept through training the modified version of Xception network. △ Less

Submitted 5 October, 2021; originally announced October 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:2110.01795

arXiv:2110.01795 [pdf, other]

Deep Subspace analysing for Semi-Supervised multi-label classification of Diabetic Foot Ulcer

Authors: Azadeh Alavi

Abstract: Diabetes is a global raising pandemic. Diabetes patients are at risk of develo** foot ulcer that usually leads to limb amputation. In order to develop a self monitoring mobile application, in this work, we propose a novel deep subspace analysis pipeline for semi-supervised diabetic foot ulcer mulit-label classification. To avoid any chance of over-fitting, unlike recent state of the art deep sem… ▽ More Diabetes is a global raising pandemic. Diabetes patients are at risk of develo** foot ulcer that usually leads to limb amputation. In order to develop a self monitoring mobile application, in this work, we propose a novel deep subspace analysis pipeline for semi-supervised diabetic foot ulcer mulit-label classification. To avoid any chance of over-fitting, unlike recent state of the art deep semi-supervised methods, the proposed pipeline dose not include any data augmentation. Whereas, after extracting deep features, in order to make the representation shift invariant, we employ variety of data augmentation methods on each image and generate an image-sets, which is then mapped into a linear subspace. Moreover, the proposed pipeline reduces the cost of retraining when more new unlabelled data become available. Thus, the first stage of the pipeline employs the concept of transfer learning for feature extraction purpose through modifying and retraining a deep convolutional network architect known as Xception. Then, the output of a mid-layer is extracted to generate an image set representer of any given image with help of data augmentation methods. At this stage, each image is transferred to a linear subspace which is a point on a Grassmann Manifold topological space. Hence, to perform analyse them, the geometry of such manifold must be considered. As such, each labelled image is represented as a vector of distances to number of unlabelled images using geodesic distance on Grassmann manifold. Finally, Random Forest is trained for multi-label classification of diabetic foot ulcer images. The method is then evaluated on the blind test set provided by DFU2021 competition, and the result considerable improvement compared to using classical transfer learning with data augmentation. △ Less

Submitted 4 October, 2021; originally announced October 2021.

Comments: 10 pages

arXiv:2011.09853 [pdf]

A Deep Learning Approach to Predict Hamburg Rutting Curve

Authors: Hamed Majidifard, Behnam Jahangiri, Punyaslok Rath, Amir H. Alavi, William G. Buttlar

Abstract: Rutting continues to be one of the principal distresses in asphalt pavements worldwide. This type of distress is caused by permanent deformation and shear failure of the asphalt mix under the repetition of heavy loads. The Hamburg wheel tracking test (HWTT) is a widely used testing procedure designed to accelerate, and to simulate the rutting phenomena in the laboratory. Rut depth, as one of the o… ▽ More Rutting continues to be one of the principal distresses in asphalt pavements worldwide. This type of distress is caused by permanent deformation and shear failure of the asphalt mix under the repetition of heavy loads. The Hamburg wheel tracking test (HWTT) is a widely used testing procedure designed to accelerate, and to simulate the rutting phenomena in the laboratory. Rut depth, as one of the outputs of the HWTT, is dependent on a number of parameters related to mix design and testing conditions. This study introduces a new model for predicting the rutting depth of asphalt mixtures using a deep learning technique - the convolution neural network (CNN). A database containing a comprehensive collection of HWTT results was used to develop a CNN-based machine learning prediction model. The database includes 10,000 rutting depth data points measured across a large variety of asphalt mixtures. The model has been formulated in terms of known influencing mixture variables such as asphalt binder high temperature performance grade, mixture type, aggregate size, aggregate gradation, asphalt content, total asphalt binder recycling content, and testing parameters, including testing temperature and number of wheel passes. A rigorous validation process was used to assess the accuracy of the model to predict total rut depth and the HWTT rutting curve. A sensitivity analysis is presented, which evaluates the effect of the investigated variables on rutting depth predictions by the CNN model. The model can be used as a tool to estimate the rut depth in asphalt mixtures when laboratory testing is not feasible, or for cost saving, pre-design trials. △ Less

Submitted 12 November, 2020; originally announced November 2020.

arXiv:2011.06934 [pdf, ps, other]

Neural network for estimation of optical characteristics of optically active and turbid scattering media

Authors: Ali Alavi

Abstract: One native source of quality deterioration in medical imaging, and especially in our case optical coherence tomography (OCT), is the turbid biological media in which photon does not take a predictable path and many scattering events would influence the effective path length and change the polarization of polarized light. This inherent problem would cause imaging errors even in the case of high res… ▽ More One native source of quality deterioration in medical imaging, and especially in our case optical coherence tomography (OCT), is the turbid biological media in which photon does not take a predictable path and many scattering events would influence the effective path length and change the polarization of polarized light. This inherent problem would cause imaging errors even in the case of high resolution of interferometric methods. To address this problem and considering the inherent random nature of this problem, in the last decades some methods including Monte Carlo simulation for OCT was proposed. In this approach simulation would give us a one on one comparison of underlying physical structure and its OCT imaging counterpart. Although its goal was to give the practitioners a better understanding of underlying structure, it lacks in providing a comprehensive approach to increase the accuracy and imaging quality of OCT imaging and would only provide a set of examples on how imaging method might falter. To mitigate this problem and to demonstrate a new approach to improve the medical imaging without changing any hardware, we introduce a new pipeline consisting of Monte Carlo simulation followed by a deep neural network. △ Less

Submitted 25 November, 2020; v1 submitted 11 November, 2020; originally announced November 2020.

Comments: 12 pages, presubmission

arXiv:2010.03341 [pdf, other]

doi 10.1016/j.compbiomed.2021.104596

Deep Learning in Diabetic Foot Ulcers Detection: A Comprehensive Evaluation

Authors: Moi Hoon Yap, Ryo Hachiuma, Azadeh Alavi, Raphael Brungel, Bill Cassidy, Manu Goyal, Hongtao Zhu, Johannes Ruckert, Moshe Olshansky, Xiao Huang, Hideo Saito, Saeed Hassanpour, Christoph M. Friedrich, David Ascher, An** Song, Hiroki Kajita, David Gillespie, Neil D. Reeves, Joseph Pappachan, Claire O'Shea, Eibe Frank

Abstract: There has been a substantial amount of research involving computer methods and technology for the detection and recognition of diabetic foot ulcers (DFUs), but there is a lack of systematic comparisons of state-of-the-art deep learning object detection frameworks applied to this problem. DFUC2020 provided participants with a comprehensive dataset consisting of 2,000 images for training and 2,000 i… ▽ More There has been a substantial amount of research involving computer methods and technology for the detection and recognition of diabetic foot ulcers (DFUs), but there is a lack of systematic comparisons of state-of-the-art deep learning object detection frameworks applied to this problem. DFUC2020 provided participants with a comprehensive dataset consisting of 2,000 images for training and 2,000 images for testing. This paper summarises the results of DFUC2020 by comparing the deep learning-based algorithms proposed by the winning teams: Faster R-CNN, three variants of Faster R-CNN and an ensemble method; YOLOv3; YOLOv5; EfficientDet; and a new Cascade Attention Network. For each deep learning method, we provide a detailed description of model architecture, parameter settings for training and additional stages including pre-processing, data augmentation and post-processing. We provide a comprehensive evaluation for each method. All the methods required a data augmentation stage to increase the number of images available for training and a post-processing stage to remove false positives. The best performance was obtained from Deformable Convolution, a variant of Faster R-CNN, with a mean average precision (mAP) of 0.6940 and an F1-Score of 0.7434. Finally, we demonstrate that the ensemble method based on different deep learning methods can enhanced the F1-Score but not the mAP. △ Less

Submitted 24 May, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

Comments: 19 pages, 18 figures, 10 tables

Journal ref: Computers in Biology and Medicine, Volume 135, 2021, 104596, ISSN 0010-4825,

arXiv:2006.00887 [pdf]

doi 10.1007/s44150-021-00015-8

Insights into Performance Fitness and Error Metrics for Machine Learning

Authors: M. Z. Naser, Amir Alavi

Abstract: Machine learning (ML) is the field of training machines to achieve high level of cognition and perform human-like analysis. Since ML is a data-driven approach, it seemingly fits into our daily lives and operations as well as complex and interdisciplinary fields. With the rise of commercial, open-source and user-catered ML tools, a key question often arises whenever ML is applied to explore a pheno… ▽ More Machine learning (ML) is the field of training machines to achieve high level of cognition and perform human-like analysis. Since ML is a data-driven approach, it seemingly fits into our daily lives and operations as well as complex and interdisciplinary fields. With the rise of commercial, open-source and user-catered ML tools, a key question often arises whenever ML is applied to explore a phenomenon or a scenario: what constitutes a good ML model? Kee** in mind that a proper answer to this question depends on a variety of factors, this work presumes that a good ML model is one that optimally performs and best describes the phenomenon on hand. From this perspective, identifying proper assessment metrics to evaluate performance of ML models is not only necessary but is also warranted. As such, this paper examines a number of the most commonly-used performance fitness and error metrics for regression and classification algorithms, with emphasis on engineering applications. △ Less

Submitted 17 May, 2020; originally announced June 2020.

Comments: 18 pages, 2 tables

Journal ref: 2021

arXiv:1906.03623 [pdf, other]

doi 10.1109/TSG.2018.2856893

A Distributed Event-Triggered Control Strategy for DC Microgrids Based on Publish-Subscribe Model Over Industrial Wireless Sensor Networks

Authors: Seyed Amir Alavi, Kamyar Mehran, Yang Hao, Ardavan Rahimian, Hamed Mirsaeedi, Vahid Vahidinasab

Abstract: This paper presents a complete design, analysis, and performance evaluation of a novel distributed event-triggered control and estimation strategy for DC microgrids. The primary objective of this work is to efficiently stabilize the grid voltage, and to further balance the energy level of the energy storage (ES) systems. The locally-installed distributed controllers are utilised to reduce the numb… ▽ More This paper presents a complete design, analysis, and performance evaluation of a novel distributed event-triggered control and estimation strategy for DC microgrids. The primary objective of this work is to efficiently stabilize the grid voltage, and to further balance the energy level of the energy storage (ES) systems. The locally-installed distributed controllers are utilised to reduce the number of transmitted packets and battery usage of the installed sensors, based on a proposed event-triggered communication scheme. Also, to reduce the network traffic, an optimal observer is employed which utilizes a modified Kalman consensus filter (KCF) to estimate the state of the DC microgrid via the distributed sensors. Furthermore, in order to effectively provide an intelligent data exchange mechanism for the proposed event-triggered controller, the publish-subscribe communication model is employed to setup a distributed control infrastructure in industrial wireless sensor networks (WSNs). The performance of the proposed control and estimation strategy is validated via the simulations of a DC microgrid composed of renewable energy sources (RESs). The results confirm the appropriateness of the implemented strategy for the optimal utilization of the advanced industrial network architectures in the smart grids. △ Less

Submitted 9 June, 2019; originally announced June 2019.

arXiv:1906.00437 [pdf, other]

State Monitoring for Situational Awareness in Rural Microgrids Using the IoT Infrastructure

Authors: Seyed Amir Alavi, Mehrnaz Javadipour, Kamyar Mehran

Abstract: This paper presents an event-triggered estimation strategy and a data collection architecture for situational awareness (SA) in microgrids. An estimation agent structure based on the event-triggered Kalman filter is proposed and implemented for state estimation layer of the SA using long range wide area network (LoRAWAN) protocol. A setup has been developed which can provide enormous data collecti… ▽ More This paper presents an event-triggered estimation strategy and a data collection architecture for situational awareness (SA) in microgrids. An estimation agent structure based on the event-triggered Kalman filter is proposed and implemented for state estimation layer of the SA using long range wide area network (LoRAWAN) protocol. A setup has been developed which can provide enormous data collection capabilities from smart meters, in order to realise an adequate SA level in microgrids. Thingsboard Internet of things (IoT) platform is used for the SA visualisation with a customised dashboard. It is shown by using the developed estimation strategy, an adequate level of SA can be achieved with a minimum installation and communication cost to have an accurate average state estimation of the microgrid. △ Less

Submitted 2 June, 2019; originally announced June 2019.

arXiv:1804.04687 [pdf, other]

Cross-Domain Visual Recognition via Domain Adaptive Dictionary Learning

Authors: Hongyu Xu, **g**g Zheng, Azadeh Alavi, Rama Chellappa

Abstract: In real-world visual recognition problems, the assumption that the training data (source domain) and test data (target domain) are sampled from the same distribution is often violated. This is known as the domain adaptation problem. In this work, we propose a novel domain-adaptive dictionary learning framework for cross-domain visual recognition. Our method generates a set of intermediate domains.… ▽ More In real-world visual recognition problems, the assumption that the training data (source domain) and test data (target domain) are sampled from the same distribution is often violated. This is known as the domain adaptation problem. In this work, we propose a novel domain-adaptive dictionary learning framework for cross-domain visual recognition. Our method generates a set of intermediate domains. These intermediate domains form a smooth path and bridge the gap between the source and target domains. Specifically, we not only learn a common dictionary to encode the domain-shared features, but also learn a set of domain-specific dictionaries to model the domain shift. The separation of the common and domain-specific dictionaries enables us to learn more compact and reconstructive dictionaries for domain adaptation. These dictionaries are learned by alternating between domain-adaptive sparse coding and dictionary updating steps. Meanwhile, our approach gradually recovers the feature representations of both source and target data along the domain path. By aligning all the recovered domain data, we derive the final domain-adaptive features for cross-domain visual recognition. Extensive experiments on three public datasets demonstrates that our approach outperforms most state-of-the-art methods. △ Less

Submitted 15 April, 2018; v1 submitted 12 April, 2018; originally announced April 2018.

Comments: Submitted to IEEE TIP Journal

arXiv:1702.05085 [pdf, other]

KEPLER: Keypoint and Pose Estimation of Unconstrained Faces by Learning Efficient H-CNN Regressors

Authors: Amit Kumar, Azadeh Alavi, Rama Chellappa

Abstract: Keypoint detection is one of the most important pre-processing steps in tasks such as face modeling, recognition and verification. In this paper, we present an iterative method for Keypoint Estimation and Pose prediction of unconstrained faces by Learning Efficient H-CNN Regressors (KEPLER) for addressing the face alignment problem. Recent state of the art methods have shown improvements in face k… ▽ More Keypoint detection is one of the most important pre-processing steps in tasks such as face modeling, recognition and verification. In this paper, we present an iterative method for Keypoint Estimation and Pose prediction of unconstrained faces by Learning Efficient H-CNN Regressors (KEPLER) for addressing the face alignment problem. Recent state of the art methods have shown improvements in face keypoint detection by employing Convolution Neural Networks (CNNs). Although a simple feed forward neural network can learn the map** between input and output spaces, it cannot learn the inherent structural dependencies. We present a novel architecture called H-CNN (Heatmap-CNN) which captures structured global and local features and thus favors accurate keypoint detecion. HCNN is jointly trained on the visibility, fiducials and 3D-pose of the face. As the iterations proceed, the error decreases making the gradients small and thus requiring efficient training of DCNNs to mitigate this. KEPLER performs global corrections in pose and fiducials for the first four iterations followed by local corrections in the subsequent stage. As a by-product, KEPLER also provides 3D pose (pitch, yaw and roll) of the face accurately. In this paper, we show that without using any 3D information, KEPLER outperforms state of the art methods for alignment on challenging datasets such as AFW and AFLW. △ Less

Submitted 16 February, 2017; originally announced February 2017.

Comments: Accept as Oral FG'17

arXiv:1606.04232 [pdf, other]

DCNNs on a Diet: Sampling Strategies for Reducing the Training Set Size

Authors: Maya Kabkab, Azadeh Alavi, Rama Chellappa

Abstract: Large-scale supervised classification algorithms, especially those based on deep convolutional neural networks (DCNNs), require vast amounts of training data to achieve state-of-the-art performance. Decreasing this data requirement would significantly speed up the training process and possibly improve generalization. Motivated by this objective, we consider the task of adaptively finding concise t… ▽ More Large-scale supervised classification algorithms, especially those based on deep convolutional neural networks (DCNNs), require vast amounts of training data to achieve state-of-the-art performance. Decreasing this data requirement would significantly speed up the training process and possibly improve generalization. Motivated by this objective, we consider the task of adaptively finding concise training subsets which will be iteratively presented to the learner. We use convex optimization methods, based on an objective criterion and feedback from the current performance of the classifier, to efficiently identify informative samples to train on. We propose an algorithm to decompose the optimization problem into smaller per-class problems, which can be solved in parallel. We test our approach on standard classification tasks and demonstrate its effectiveness in decreasing the training set size without compromising performance. We also show that our approach can make the classifier more robust in the presence of label noise and class imbalance. △ Less

Submitted 14 June, 2016; originally announced June 2016.

arXiv:1604.05417 [pdf, other]

doi 10.1109/BTAS.2016.7791205

Triplet Probabilistic Embedding for Face Verification and Clustering

Authors: Swami Sankaranarayanan, Azadeh Alavi, Carlos Castillo, Rama Chellappa

Abstract: Despite significant progress made over the past twenty five years, unconstrained face verification remains a challenging problem. This paper proposes an approach that couples a deep CNN-based approach with a low-dimensional discriminative embedding learned using triplet probability constraints to solve the unconstrained face verification problem. Aside from yielding performance improvements, this… ▽ More Despite significant progress made over the past twenty five years, unconstrained face verification remains a challenging problem. This paper proposes an approach that couples a deep CNN-based approach with a low-dimensional discriminative embedding learned using triplet probability constraints to solve the unconstrained face verification problem. Aside from yielding performance improvements, this embedding provides significant advantages in terms of memory and for post-processing operations like subject specific clustering. Experiments on the challenging IJB-A dataset show that the proposed algorithm performs comparably or better than the state of the art methods in verification and identification metrics, while requiring much less training data and training time. The superior performance of the proposed method on the CFP dataset shows that the representation learned by our deep CNN is robust to extreme pose variation. Furthermore, we demonstrate the robustness of the deep features to challenges including age, pose, blur and clutter by performing simple clustering experiments on both IJB-A and LFW datasets. △ Less

Submitted 17 January, 2017; v1 submitted 18 April, 2016; originally announced April 2016.

Comments: Oral Paper in BTAS 2016; NVIDIA Best paper Award (http://ieee-biometrics.org/btas2016/awards.html)

arXiv:1602.03570 [pdf, other]

Optimized Kernel-based Projection Space of Riemannian Manifolds

Authors: Azadeh Alavi, Vishal M Patel, Rama Chellappa

Abstract: It is proven that encoding images and videos through Symmetric Positive Definite (SPD) matrices, and considering the Riemannian geometry of the resulting space, can lead to increased classification performance. Taking into account manifold geometry is typically done via embedding the manifolds in tangent spaces, or Reproducing Kernel Hilbert Spaces (RKHS). Recently, it was shown that embedding suc… ▽ More It is proven that encoding images and videos through Symmetric Positive Definite (SPD) matrices, and considering the Riemannian geometry of the resulting space, can lead to increased classification performance. Taking into account manifold geometry is typically done via embedding the manifolds in tangent spaces, or Reproducing Kernel Hilbert Spaces (RKHS). Recently, it was shown that embedding such manifolds into a Random Projection Spaces (RPS), rather than RKHS or tangent space, leads to higher classification and clustering performance. However, based on structure and dimensionality of the randomly generated hyperplanes, the classification performance over RPS may vary significantly. In addition, fine-tuning RPS is data expensive (as it requires validation-data), time consuming, and resource demanding. In this paper, we introduce an approach to learn an optimized kernel-based projection (with fixed dimensionality), by employing the concept of subspace clustering. As such, we encode the association of data points to the underlying subspace of each point, to generate meaningful hyperplanes. Further, we adopt the concept of dictionary learning and sparse coding, and discriminative analysis, for the optimized kernel-based projection space (OPS) on SPD manifolds. We validate our algorithm on several classification tasks. The experiment results also demonstrate that the proposed method outperforms state-of-the-art methods on such manifolds. △ Less

Submitted 15 March, 2016; v1 submitted 10 February, 2016; originally announced February 2016.

Comments: 14 pages, 6 figures, conference

arXiv:1602.03418 [pdf, ps, other]

Triplet Similarity Embedding for Face Verification

Authors: Swami Sankaranarayanan, Azadeh Alavi, Rama Chellappa

Abstract: In this work, we present an unconstrained face verification algorithm and evaluate it on the recently released IJB-A dataset that aims to push the boundaries of face verification methods. The proposed algorithm couples a deep CNN-based approach with a low-dimensional discriminative embedding learnt using triplet similarity constraints in a large margin fashion. Aside from yielding performance impr… ▽ More In this work, we present an unconstrained face verification algorithm and evaluate it on the recently released IJB-A dataset that aims to push the boundaries of face verification methods. The proposed algorithm couples a deep CNN-based approach with a low-dimensional discriminative embedding learnt using triplet similarity constraints in a large margin fashion. Aside from yielding performance improvement, this embedding provides significant advantages in terms of memory and post-processing operations like hashing and visualization. Experiments on the IJB-A dataset show that the proposed algorithm outperforms state of the art methods in verification and identification metrics, while requiring less training time. △ Less

Submitted 13 March, 2016; v1 submitted 10 February, 2016; originally announced February 2016.

arXiv:1509.05536 [pdf, other]

Efficient Clustering on Riemannian Manifolds: A Kernelised Random Projection Approach

Authors: Kun Zhao, Azadeh Alavi, Arnold Wiliem, Brian C. Lovell

Abstract: Reformulating computer vision problems over Riemannian manifolds has demonstrated superior performance in various computer vision applications. This is because visual data often forms a special structure lying on a lower dimensional space embedded in a higher dimensional space. However, since these manifolds belong to non-Euclidean topological spaces, exploiting their structures is computationally… ▽ More Reformulating computer vision problems over Riemannian manifolds has demonstrated superior performance in various computer vision applications. This is because visual data often forms a special structure lying on a lower dimensional space embedded in a higher dimensional space. However, since these manifolds belong to non-Euclidean topological spaces, exploiting their structures is computationally expensive, especially when one considers the clustering analysis of massive amounts of data. To this end, we propose an efficient framework to address the clustering problem on Riemannian manifolds. This framework implements random projections for manifold points via kernel space, which can preserve the geometric structure of the original space, but is computationally efficient. Here, we introduce three methods that follow our framework. We then validate our framework on several computer vision applications by comparing against popular clustering methods on Riemannian manifolds. Experimental results demonstrate that our framework maintains the performance of the clustering whilst massively reducing computational complexity by over two orders of magnitude in some cases. △ Less

Submitted 18 September, 2015; originally announced September 2015.

arXiv:1403.0700 [pdf, other]

doi 10.1109/WACV.2014.6836085

Random Projections on Manifolds of Symmetric Positive Definite Matrices for Image Classification

Authors: Azadeh Alavi, Arnold Wiliem, Kun Zhao, Brian C. Lovell, Conrad Sanderson

Abstract: Recent advances suggest that encoding images through Symmetric Positive Definite (SPD) matrices and then interpreting such matrices as points on Riemannian manifolds can lead to increased classification performance. Taking into account manifold geometry is typically done via (1) embedding the manifolds in tangent spaces, or (2) embedding into Reproducing Kernel Hilbert Spaces (RKHS). While embeddi… ▽ More Recent advances suggest that encoding images through Symmetric Positive Definite (SPD) matrices and then interpreting such matrices as points on Riemannian manifolds can lead to increased classification performance. Taking into account manifold geometry is typically done via (1) embedding the manifolds in tangent spaces, or (2) embedding into Reproducing Kernel Hilbert Spaces (RKHS). While embedding into tangent spaces allows the use of existing Euclidean-based learning algorithms, manifold shape is only approximated which can cause loss of discriminatory information. The RKHS approach retains more of the manifold structure, but may require non-trivial effort to kernelise Euclidean-based learning algorithms. In contrast to the above approaches, in this paper we offer a novel solution that allows SPD matrices to be used with unmodified Euclidean-based learning algorithms, with the true manifold shape well-preserved. Specifically, we propose to project SPD matrices using a set of random projection hyperplanes over RKHS into a random projection space, which leads to representing each matrix as a vector of projection coefficients. Experiments on face recognition, person re-identification and texture classification show that the proposed approach outperforms several recent methods, such as Tensor Sparse Coding, Histogram Plus Epitome, Riemannian Locality Preserving Projection and Relational Divergence Classification. △ Less

Submitted 4 March, 2014; originally announced March 2014.

Comments: IEEE Winter Conference on Applications of Computer Vision (WACV), 2014

ACM Class: I.4.7; I.4.10; I.5.1; I.5.4

arXiv:1403.0699 [pdf, other]

doi 10.1109/ICIP.2013.6738731

Multi-Shot Person Re-Identification via Relational Stein Divergence

Authors: Azadeh Alavi, Yan Yang, Mehrtash Harandi, Conrad Sanderson

Abstract: Person re-identification is particularly challenging due to significant appearance changes across separate camera views. In order to re-identify people, a representative human signature should effectively handle differences in illumination, pose and camera parameters. While general appearance-based methods are modelled in Euclidean spaces, it has been argued that some applications in image and vid… ▽ More Person re-identification is particularly challenging due to significant appearance changes across separate camera views. In order to re-identify people, a representative human signature should effectively handle differences in illumination, pose and camera parameters. While general appearance-based methods are modelled in Euclidean spaces, it has been argued that some applications in image and video analysis are better modelled via non-Euclidean manifold geometry. To this end, recent approaches represent images as covariance matrices, and interpret such matrices as points on Riemannian manifolds. As direct classification on such manifolds can be difficult, in this paper we propose to represent each manifold point as a vector of similarities to class representers, via a recently introduced form of Bregman matrix divergence known as the Stein divergence. This is followed by using a discriminative map** of similarity vectors for final classification. The use of similarity vectors is in contrast to the traditional approach of embedding manifolds into tangent spaces, which can suffer from representing the manifold structure inaccurately. Comparative evaluations on benchmark ETHZ and iLIDS datasets for the person re-identification task show that the proposed approach obtains better performance than recent techniques such as Histogram Plus Epitome, Partial Least Squares, and Symmetry-Driven Accumulation of Local Features. △ Less

Submitted 4 March, 2014; originally announced March 2014.

Comments: IEEE International Conference on Image Processing (ICIP), 2013

ACM Class: I.5.1; I.5.4; I.2.10; I.4.7; I.4.8; I.4.10

Showing 1–22 of 22 results for author: Alavi, A