-
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
Authors:
Moein Heidari,
Sina Ghorbani Kolahi,
Sanaz Karimijafarbigloo,
Bobby Azad,
Afshin Bozorgpour,
Soheila Hatami,
Reza Azad,
Ali Diba,
Ulas Bagci,
Dorit Merhof,
Ilker Hacihaliloglu
Abstract:
Sequence modeling plays a vital role across various domains, with recurrent neural networks historically being the predominant method for these tasks. However, the emergence of transformers has altered this paradigm due to their superior performance. Building on these advances, transformers have joined CNNs as the two leading foundational models for learning visual representations. However, transformers are hindered by the $\mathcal{O}(N^2)$ complexity of their attention mechanisms, while CNNs lack global receptive fields and dynamic weight allocation. State Space Models (SSMs), specifically the Mamba model with selection mechanisms and hardware-aware architecture, have garnered immense interest lately in sequential modeling and visual representation learning, challenging the dominance of transformers by providing infinite context lengths and offering substantial efficiency while maintaining linear complexity in the input sequence. Capitalizing on the advances in computer vision, medical imaging has heralded a new epoch with Mamba models. To help researchers navigate the surge, this survey offers an encyclopedic review of Mamba models in medical imaging. Specifically, we start with a comprehensive theoretical review forming the basis of SSMs, including the Mamba architecture and its alternatives for sequence modeling paradigms in this context. Next, we offer a structured classification of Mamba models in the medical field and introduce a diverse categorization scheme based on their application, imaging modalities, and targeted organs. Finally, we summarize key challenges, discuss future research directions of SSMs in the medical domain, and propose several directions to fulfill the demands of this field. In addition, we have compiled the studies discussed in this paper along with their open-source implementations on our GitHub repository.
Submitted 5 June, 2024;
originally announced June 2024.
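The linear-complexity claim in the abstract above can be illustrated with a toy recurrence. This is only a minimal sketch under stated assumptions: a scalar hidden state and fixed, hypothetical parameters `a`, `b`, `c`. It is not Mamba's selective, hardware-aware implementation, which makes the parameters input-dependent.

```python
def ssm_scan(xs, a=0.5, b=1.0, c=2.0):
    """Toy discrete state space recurrence (parameters a, b, c are illustrative).

    h_t = a * h_{t-1} + b * x_t
    y_t = c * h_t
    One pass over the sequence, so the cost grows linearly with len(xs).
    """
    h = 0.0          # hidden state, initialised to zero
    ys = []
    for x in xs:
        h = a * h + b * x   # state update
        ys.append(c * h)    # readout
    return ys


if __name__ == "__main__":
    # An impulse at t=0 decays geometrically through the state.
    print(ssm_scan([1.0, 0.0, 0.0]))
```

Each step touches the state once, in contrast to attention, where every token is compared with every other token.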
-
Radial perturbations of Ellis-Bronnikov wormholes in slow rotation up to second order
Authors:
Bahareh Azad,
Jose Luis Blázquez-Salcedo,
Fech Scen Khoo,
Jutta Kunz
Abstract:
We consider slowly rotating Ellis-Bronnikov wormholes and investigate their radial perturbations ($\mathrm{l}=0$), expanding up to second order in rotation. We present the detailed derivations in the general case, including symmetric and non-symmetric wormholes. The calculations show that the unstable mode present in the static case becomes less unstable with increasing rotation, until it reaches zero and then disappears. This indicates that wormhole solutions may become linearly mode stable at sufficiently fast rotation.
Submitted 13 March, 2024;
originally announced March 2024.
-
Quasinormal modes of rapidly rotating Ellis-Bronnikov wormholes
Authors:
Fech Scen Khoo,
Bahareh Azad,
Jose Luis Blázquez-Salcedo,
Luis Manuel González-Romero,
Burkhard Kleihaus,
Jutta Kunz,
Francisco Navarro-Lérida
Abstract:
We present for the first time a study of the quasinormal modes of rapidly rotating Ellis-Bronnikov wormholes in General Relativity. We compute the spectrum of the wormholes using a spectral decomposition of the metric perturbations on a numerical background. We focus on the $M_z=2,3$ sector of the perturbations, and show that the triple isospectrality of the symmetric and static Ellis-Bronnikov wormhole is broken due to rotation, giving rise to a much richer spectrum than the spectrum of Kerr black holes. We do not find any instabilities for $M_z=2,3$ perturbations.
Submitted 8 April, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
HCA-Net: Hierarchical Context Attention Network for Intervertebral Disc Semantic Labeling
Authors:
Afshin Bozorgpour,
Bobby Azad,
Reza Azad,
Yury Velichko,
Ulas Bagci,
Dorit Merhof
Abstract:
Accurate and automated segmentation of intervertebral discs (IVDs) in medical images is crucial for assessing spine-related disorders, such as osteoporosis, vertebral fractures, or IVD herniation. We present HCA-Net, a novel contextual attention network architecture for semantic labeling of IVDs, with a special focus on exploiting prior geometric information. Our approach excels at processing features across different scales and effectively consolidating them to capture the intricate spatial relationships within the spinal cord. To achieve this, HCA-Net models IVD labeling as a pose estimation problem, aiming to minimize the discrepancy between each predicted IVD location and its corresponding actual joint location. In addition, we introduce a skeletal loss term to reinforce the model's geometric dependence on the spine. This loss function is designed to constrain the model's predictions to a range that matches the general structure of the human vertebral skeleton. As a result, the network learns to reduce the occurrence of false predictions and adaptively improves the accuracy of IVD location estimation. Through extensive experimental evaluation on multi-center spine datasets, our approach consistently outperforms previous state-of-the-art methods on both MRI T1w and T2w modalities. The codebase is accessible to the public on GitHub: https://github.com/xmindflow/HCA-Net.
Submitted 21 November, 2023;
originally announced November 2023.
-
Foundational Models in Medical Imaging: A Comprehensive Survey and Future Vision
Authors:
Bobby Azad,
Reza Azad,
Sania Eskandari,
Afshin Bozorgpour,
Amirhossein Kazerouni,
Islem Rekik,
Dorit Merhof
Abstract:
Foundation models, which are large-scale, pre-trained deep-learning models adapted to a wide range of downstream tasks, have gained significant interest lately, and various deep-learning problems are undergoing a paradigm shift with the rise of these models. Trained on large-scale datasets to bridge the gap between different modalities, foundation models facilitate contextual reasoning, generalization, and prompt capabilities at test time. The predictions of these models can be adjusted for new tasks by augmenting the model input with task-specific hints called prompts, without requiring extensive labeled data and retraining. Capitalizing on the advances in computer vision, medical imaging has also shown a growing interest in these models. To assist researchers in navigating this direction, this survey provides a comprehensive overview of foundation models in the domain of medical imaging. Specifically, we begin with an exposition of the fundamental concepts forming the basis of foundation models. Subsequently, we offer a methodical taxonomy of foundation models within the medical domain, proposing a classification system primarily structured around training strategies, while also incorporating additional facets such as application domains, imaging modalities, specific organs of interest, and the algorithms integral to these models. Furthermore, we emphasize the practical use cases of some selected approaches and then discuss the opportunities, applications, and future directions of these large-scale pre-trained models for analyzing medical images. In the same vein, we address the prevailing challenges and research pathways associated with foundation models in medical imaging. These encompass the areas of interpretability, data management, computational requirements, and the nuanced issue of contextual comprehension.
Submitted 28 October, 2023;
originally announced October 2023.
-
The simple $\mathscr{B}_ψ$-groups
Authors:
Morteza Baniasad Azad
Abstract:
In a finite group $ G $, $ ψ(G) $ denotes the sum of element orders of $ G $.
A finite group $ G $ is said to be a $\mathscr{B}_ψ$-group if $ ψ(H) < |G| $ for any proper subgroup $ H $ of $ G $.
Lazorec asked: "what can be said about the $\mathscr{B}_ψ$ property of the finite simple groups $ \operatorname{PSL}(2, q) $?" In this paper, we answer this question not only for the finite simple groups $ \operatorname{PSL}(2, q) $ but also for all other finite simple groups. We show that if $ S $ is a finite simple group, such that $ S \neq Alt(n) $ for any $ n \geq 14 $, then $S$ is a $\mathscr{B}_ψ$-group.
Submitted 7 September, 2023;
originally announced September 2023.
-
Laplacian-Former: Overcoming the Limitations of Vision Transformers in Local Texture Detection
Authors:
Reza Azad,
Amirhossein Kazerouni,
Babak Azad,
Ehsan Khodapanah Aghdam,
Yury Velichko,
Ulas Bagci,
Dorit Merhof
Abstract:
Vision Transformer (ViT) models have demonstrated a breakthrough in a wide range of computer vision tasks. However, compared to Convolutional Neural Network (CNN) models, ViT models have been observed to struggle to capture high-frequency components of images, which can limit their ability to detect local textures and edge information. As abnormalities in human tissue, such as tumors and lesions, may vary greatly in structure, texture, and shape, high-frequency information such as texture is crucial for effective semantic segmentation. To address this limitation in ViT models, we propose a new technique, Laplacian-Former, that enhances the self-attention map by adaptively re-calibrating the frequency information in a Laplacian pyramid. More specifically, our proposed method utilizes a dual attention mechanism via efficient attention and frequency attention. The efficient attention mechanism reduces the complexity of self-attention to linear while producing the same output and selectively intensifying the contribution of shape and texture features. Furthermore, we introduce a novel efficient enhancement multi-scale bridge that effectively transfers spatial information from the encoder to the decoder while preserving the fundamental features. We demonstrate the efficacy of Laplacian-Former on multi-organ and skin lesion segmentation tasks with +1.87\% and +0.76\% dice scores compared to SOTA approaches, respectively. Our implementation is publicly available at https://github.com/mindflow-institue/Laplacian-Former
Submitted 31 August, 2023;
originally announced September 2023.
-
Improving FHB Screening in Wheat Breeding Using an Efficient Transformer Model
Authors:
Babak Azad,
Ahmed Abdalla,
Kwanghee Won,
Ali Mirzakhani Nafchi
Abstract:
Fusarium head blight (FHB) is a devastating disease that causes significant annual economic losses in small grains. Efficient, accurate, and timely detection of FHB in resistance screening is critical for wheat and barley breeding programs. In recent years, various image processing techniques have been developed using supervised machine learning algorithms for the early detection of FHB. State-of-the-art convolutional neural network-based methods, such as U-Net, employ a series of encoding blocks to create a local representation and a series of decoding blocks to capture the semantic relations. However, these methods are often not capable of modeling long-range dependencies inside the input data, and their ability to model multi-scale objects with significant variations in texture and shape is limited. Vision transformers, as alternative architectures with innate global self-attention mechanisms for sequence-to-sequence prediction, may also have limited localization capabilities due to insufficient low-level details. To overcome these limitations, a new Context Bridge is proposed to integrate the local representation capability of the U-Net network into the transformer model. In addition, the standard attention mechanism of the original transformer is replaced with Efficient Self-attention, which is less complicated than other state-of-the-art methods. To train the proposed network, 12,000 wheat images from an FHB-inoculated wheat field at the SDSU research farm in Volga, SD, were captured. In addition to healthy and unhealthy plants, these images encompass various stages of the disease. A team of expert pathologists annotated the images for training and evaluating the developed model. As a result, the effectiveness of the transformer-based method for FHB-disease detection is demonstrated through extensive experiments across typical tasks for plant image segmentation.
Submitted 7 August, 2023;
originally announced August 2023.
-
Implicit Neural Representation in Medical Imaging: A Comparative Survey
Authors:
Amirali Molaei,
Amirhossein Aminimehr,
Armin Tavakoli,
Amirhossein Kazerouni,
Bobby Azad,
Reza Azad,
Dorit Merhof
Abstract:
Implicit neural representations (INRs) have gained prominence as a powerful paradigm in scene reconstruction and computer graphics, demonstrating remarkable results. By utilizing neural networks to parameterize data through implicit continuous functions, INRs offer several benefits. Recognizing the potential of INRs beyond these domains, this survey aims to provide a comprehensive overview of INR models in the field of medical imaging. In medical settings, numerous challenging and ill-posed problems exist, making INRs an attractive solution. The survey explores the application of INRs in various medical imaging tasks, such as image reconstruction, segmentation, registration, novel view synthesis, and compression. It discusses the advantages and limitations of INRs, highlighting their resolution-agnostic nature, memory efficiency, ability to avoid locality biases, and differentiability, enabling adaptation to different tasks. Furthermore, the survey addresses the challenges and considerations specific to medical imaging data, such as data availability, computational complexity, and dynamic clinical scene analysis. It also identifies future research directions and opportunities, including integration with multi-modal imaging, real-time and interactive systems, and domain adaptation for clinical decision support. To facilitate further exploration and implementation of INRs in medical image analysis, we have provided a compilation of cited studies along with their available open-source implementations at https://github.com/mindflow-institue/Awesome-Implicit-Neural-Representations-in-Medical-imaging. Finally, we aim to regularly incorporate the most recent and relevant papers.
Submitted 30 July, 2023;
originally announced July 2023.
-
Are slowly rotating Ellis-Bronnikov wormholes stable?
Authors:
Bahareh Azad,
Jose Luis Blázquez-Salcedo,
Fech Scen Khoo,
Jutta Kunz
Abstract:
We investigate the radial perturbations of Ellis-Bronnikov wormholes ($\mathrm{l}=0$) in a slowly rotating background expanded up to second order in rotation. We find indications that simple wormhole solutions such as Ellis-Bronnikov in General Relativity can be stabilized by rotation, thus favoring a viable traversable wormhole. This opens up the intriguing question whether the many other wormhole solutions with or without the support of exotic matter can become linearly mode stable when the wormhole rotates.
Submitted 23 November, 2023; v1 submitted 12 January, 2023;
originally announced January 2023.
-
Polar modes and isospectrality of Ellis-Bronnikov wormholes
Authors:
Bahareh Azad,
Jose Luis Blázquez-Salcedo,
Xiao Yan Chew,
Jutta Kunz,
Dong-han Yeom
Abstract:
We consider polar perturbations of static Ellis-Bronnikov wormholes and derive the coupled set of perturbation equations for the gravitational and the scalar field. For massless wormholes the perturbations decouple, and we obtain two identical master equations for the scalar and gravitational modes, which moreover agree with the master equation for the axial modes. Consequently there is isospectrality with threefold degenerate modes. For a finite mass of the background wormhole solutions, the equations are coupled. We then obtain two distinct branches of polar quasinormal modes for a given multipole number l, associated with the presence of the two types of fields. We calculate the quasinormal mode frequencies and decay rates for the branches with l=2, 3 and 4. The real frequencies of the two branches get closer as the multipole number l increases.
Submitted 23 December, 2022;
originally announced December 2022.
-
Safe Imitation Learning of Nonlinear Model Predictive Control for Flexible Robots
Authors:
Shamil Mamedov,
Rudolf Reiter,
Seyed Mahdi Basiri Azad,
Joschka Boedecker,
Moritz Diehl,
Jan Swevers
Abstract:
Flexible robots may overcome some of the industry's major challenges, such as enabling intrinsically safe human-robot collaboration and achieving a higher load-to-mass ratio. However, controlling flexible robots is complicated due to their complex dynamics, which include oscillatory behavior and a high-dimensional state space. Nonlinear model predictive control (NMPC) offers an effective means to control such robots, but its extensive computational demands often limit its application in real-time scenarios. To enable fast control of flexible robots, we propose a framework for a safe approximation of NMPC using imitation learning and a predictive safety filter. Our framework significantly reduces computation time while incurring a slight loss in performance. Compared to NMPC, our framework shows more than an eightfold improvement in computation time when controlling a three-dimensional flexible robot arm in simulation, all while guaranteeing safety constraints. Notably, our approach outperforms conventional reinforcement learning methods. The development of fast and safe approximate NMPC holds the potential to accelerate the adoption of flexible robots in industry.
Submitted 28 September, 2023; v1 submitted 6 December, 2022;
originally announced December 2022.
-
T3VIP: Transformation-based 3D Video Prediction
Authors:
Iman Nematollahi,
Erick Rosete-Beas,
Seyed Mahdi B. Azad,
Raghu Rajan,
Frank Hutter,
Wolfram Burgard
Abstract:
For autonomous skill acquisition, robots have to learn about the physical rules governing the 3D world dynamics from their own past experience to predict and reason about plausible future outcomes. To this end, we propose a transformation-based 3D video prediction (T3VIP) approach that explicitly models the 3D motion by decomposing a scene into its object parts and predicting their corresponding rigid transformations. Our model is fully unsupervised, captures the stochastic nature of the real world, and the observational cues in image and point cloud domains constitute its learning signals. To fully leverage all the 2D and 3D observational signals, we equip our model with automatic hyperparameter optimization (HPO) to interpret the best way of learning from them. To the best of our knowledge, our model is the first generative model that provides an RGB-D video prediction of the future for a static camera. Our extensive evaluation with simulated and real-world datasets demonstrates that our formulation leads to interpretable 3D models that predict future depth videos while achieving on-par performance with 2D models on RGB video prediction. Moreover, we demonstrate that our model outperforms 2D baselines on visuomotor control. Videos, code, dataset, and pre-trained models are available at http://t3vip.cs.uni-freiburg.de.
Submitted 19 September, 2022;
originally announced September 2022.
-
Densely-Populated Traffic Detection using YOLOv5 and Non-Maximum Suppression Ensembling
Authors:
Raian Rahman,
Zadid Bin Azad,
Md. Bakhtiar Hasan
Abstract:
Vehicular object detection is the heart of any intelligent traffic system. It is essential for urban traffic management. R-CNN, Fast R-CNN, Faster R-CNN and YOLO were some of the earlier state-of-the-art models. Region-based CNN methods suffer from higher inference time, which makes it unrealistic to use them in real time. YOLO, on the other hand, struggles to detect small objects that appear in groups. In this paper, we propose a method that can locate and classify vehicular objects in a given densely crowded image using YOLOv5. The shortcoming of YOLO was addressed by ensembling 4 different models. Our proposed model performs well on images taken from both top view and side view of the street in both day and night. The performance of our proposed model was measured on the Dhaka AI dataset, which contains densely crowded vehicular images. Our experiment shows that our model achieved mAP@0.5 of 0.458 with inference time of 0.75 sec, which outperforms other state-of-the-art models on performance. Hence, the model can be deployed on the street for real-time traffic detection, which can be used for traffic control and data collection.
Submitted 27 August, 2021;
originally announced August 2021.
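The abstract above mentions ensembling several detectors with non-maximum suppression (NMS). The following is a minimal sketch of generic greedy NMS applied to detections pooled from several models. It assumes boxes in `(x1, y1, x2, y2)` form and a hypothetical IoU threshold of 0.5; it is not the authors' implementation, and the function names are illustrative.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, iou_threshold=0.5):
    """Greedy NMS: keep the highest-scoring box, drop boxes overlapping it too much."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


def ensemble_nms(per_model_detections, iou_threshold=0.5):
    """Pool (box, score) pairs from several models, then run NMS on the pool."""
    pooled = [det for dets in per_model_detections for det in dets]
    boxes = [box for box, _ in pooled]
    scores = [score for _, score in pooled]
    return [pooled[i] for i in nms(boxes, scores, iou_threshold)]
```

For example, two models that both detect the same vehicle with slightly different boxes contribute one surviving box after pooling, while a vehicle found by only one model is still retained.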
-
Transmission of low-energy scalar waves through a traversable wormhole
Authors:
Bahareh Azad,
Farhang Loran,
Ali Mostafazadeh
Abstract:
We study the scattering of low-energy massless and massive minimally coupled scalar fields by an asymptotically flat traversable wormhole. We provide a comprehensive treatment of this problem offering analytic expressions for the transmission and reflection amplitudes of the corresponding effective potential and the absorption cross section of the wormhole. Our results, which are based on a recently developed dynamical formulation of time-independent scattering theory, apply to a large class of wormhole spacetimes including a wormhole with a sharp transition, the Ellis wormhole, and a family of its generalizations.
Submitted 14 November, 2020; v1 submitted 28 October, 2020;
originally announced October 2020.
-
On Two Conjectures about the Sum of Element Orders
Authors:
Morteza Baniasad Azad,
Behrooz Khosravi
Abstract:
Let $G$ be a finite group and $ψ(G) = \sum_{g \in G} o(g)$, where $o(g)$ denotes the order of $g \in G$. First, we prove that if $G$ is a group of order $n$ and $ψ(G) > 31ψ(C_n)/77$, where $C_n$ is the cyclic group of order $n$, then $G$ is supersolvable. This proves a conjecture of M. Tărnăuceanu. Moreover, M. Herzog, P. Longobardi and M. Maj put forward the following conjecture: If $H\leq G$, then $ψ(G) \leqslant ψ(H) |G:H|^2$. In the sequel, we show by an example that this conjecture does not hold in general.
Submitted 2 May, 2019;
originally announced May 2019.
-
A Criterion for Solvability of a Finite Group by the Sum of Element Orders
Authors:
Morteza Baniasad Azad,
Behrooz Khosravi
Abstract:
Let $G$ be a finite group and $ψ(G) = \sum_{g \in G} o(g)$, where $o(g)$ denotes the order of $g \in G$. In [M. Herzog, et al., Two new criteria for solvability of finite groups, J. Algebra, 2018], the authors put forward the following conjecture: If $G$ is a group of order $n$ and $ψ(G) > 211ψ(C_n)/1617$, where $C_n$ is the cyclic group of order $n$, then $G$ is solvable. In this paper we prove the validity of this conjecture.
Submitted 1 August, 2018;
originally announced August 2018.
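The quantity $ψ(G)$ defined in the abstract above can be computed by brute force for small groups. The sketch below is illustrative only (function names are hypothetical and nothing here comes from the paper): it evaluates $ψ(C_n)$ using the fact that the element $k$ of $\mathbb{Z}_n$ has order $n/\gcd(k, n)$, and evaluates $ψ$ for the alternating group on five points by enumerating even permutations. The resulting pair of values, 211 and 1617, matches the constants appearing in the conjecture.

```python
from itertools import permutations
from math import gcd


def psi_cyclic(n):
    """Sum of element orders of the cyclic group Z_n."""
    return sum(n // gcd(k, n) for k in range(n))


def cycle_lengths(perm):
    """Cycle lengths of a permutation given as a tuple mapping i -> perm[i]."""
    seen = [False] * len(perm)
    lengths = []
    for start in range(len(perm)):
        if not seen[start]:
            length, j = 0, start
            while not seen[j]:
                seen[j] = True
                j = perm[j]
                length += 1
            lengths.append(length)
    return lengths


def perm_order(perm):
    """Order of a permutation: least common multiple of its cycle lengths."""
    order = 1
    for length in cycle_lengths(perm):
        order = order * length // gcd(order, length)
    return order


def is_even(perm):
    """A permutation is even when the sum of (cycle length - 1) is even."""
    return sum(length - 1 for length in cycle_lengths(perm)) % 2 == 0


def psi_alternating(n):
    """Sum of element orders of the alternating group on n points (small n only)."""
    return sum(perm_order(p) for p in permutations(range(n)) if is_even(p))


if __name__ == "__main__":
    print(psi_alternating(5), psi_cyclic(60))
```

The alternating group on five points is the smallest non-solvable group and has order 60, which is why it is the natural boundary case for a solvability threshold of this form.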
-
Real-Time and Robust Method for Hand Gesture Recognition System Based on Cross-Correlation Coefficient
Authors:
Reza Azad,
Babak Azad,
Iman Tavakoli Kazerooni
Abstract:
Hand gesture recognition has extensive applications in virtual reality, sign language recognition, and computer games. The direct interface of hand gestures provides a new way of communicating with the virtual environment. In this paper, a novel real-time approach to hand gesture recognition is presented. In the suggested method, the hand gesture is first extracted from the main image by image segmentation and morphological operations and then sent to the feature extraction stage. In the feature extraction stage, the cross-correlation coefficient is applied to the gesture to recognize it. In the results, the proposed approach is applied to an American Sign Language (ASL) database and an accuracy rate of 98.34% is obtained.
Submitted 8 August, 2014;
originally announced August 2014.
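The abstract above recognizes gestures with a cross-correlation coefficient. As a rough sketch of that kind of matching, the code below computes the normalized (Pearson-style) correlation coefficient between two equal-size grayscale images stored as nested lists and picks the best-matching template. The template dictionary, the function names, and the nearest-template decision rule are assumptions for illustration and are not taken from the paper.

```python
from math import sqrt


def correlation_coefficient(a, b):
    """Normalized cross-correlation coefficient of two equal-size 2D images."""
    xs = [v for row in a for v in row]
    ys = [v for row in b for v in row]
    if len(xs) != len(ys):
        raise ValueError("images must have the same number of pixels")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sqrt(sum((x - mean_x) ** 2 for x in xs) * sum((y - mean_y) ** 2 for y in ys))
    return num / den if den else 0.0   # constant images carry no pattern


def classify(image, templates):
    """Return the label of the template most correlated with the image."""
    return max(templates, key=lambda label: correlation_coefficient(image, templates[label]))
```

The coefficient lies in [-1, 1] and is unchanged by uniform brightness or contrast shifts, which is one reason it is a common choice for template matching on segmented binary or grayscale shapes.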
-
Real-Time Human-Computer Interaction Based on Face and Hand Gesture Recognition
Authors:
Reza Azad,
Babak Azad,
Nabil Belhaj Khalifa,
Shahram Jamali
Abstract:
Hand gesture recognition systems can serve as a more natural and usable approach to human-computer interaction. Automatic hand gesture recognition provides a new way of interacting with the virtual environment. In this paper, a face and hand gesture recognition system that is able to control a computer media player is presented. Hand gestures and the human face are the key elements for interacting with the smart system. We use face recognition for viewer verification and hand gesture recognition to operate the media player, for instance, volume down/up, next track, etc. In the proposed technique, the hand gesture and face location are first extracted from the main image by a combination of skin and cascade detectors and then sent to the recognition stage. In the recognition stage, the threshold condition is first checked, and then the extracted face and gesture are recognized. In the results stage, the proposed technique is applied to a video dataset and a high precision ratio is achieved. Additionally, the proposed hand gesture recognition method is applied to a static American Sign Language (ASL) database, achieving a correct recognition rate of nearly 99.40%. The proposed method could also be used in gesture-based computer games and virtual reality.
Submitted 7 August, 2014;
originally announced August 2014.
-
Real-Time and Efficient Method for Accuracy Enhancement of Edge Based License Plate Recognition System
Authors:
Reza Azad,
Babak Azad,
Hamid Reza Shayegh
Abstract:
License plate recognition plays an important role in traffic monitoring and parking management. Administering and regulating vehicles to provide better service has become essential. In this paper, a fast, real-time method is presented that is well suited to locating plates that are tilted and captured in poor-quality images. In the proposed method, the image is first converted to binary using an adaptive threshold. Then, using edge detection and morphological operations, the plate location is determined, and if the plate is tilted, the tilt is corrected. Its characters are then segmented using image processing techniques. Finally, a K Nearest Neighbour (KNN) classifier is used for character recognition. The method was tested on an available data set containing images with different backgrounds, distances, and angles of view; the correct plate extraction rate reached 98% and the character recognition rate reached 99.12%. Furthermore, we tested our character recognition stage on a Persian vehicle data set and achieved a 99% correct recognition rate.
Submitted 24 July, 2014;
originally announced July 2014.
-
Optimized Method for Iranian Road Signs Detection and recognition system
Authors:
Reza Azad,
Babak Azad,
Iman Tavakoli Kazerooni
Abstract:
Road sign recognition is one of the core technologies in Intelligent Transport Systems. In the current study, a robust, real-time method is presented to detect and identify road speed signs in road images under different conditions. In our proposed method, the connected components are first created in the main image using edge detection and mathematical morphology, and the locations of the road signs are extracted using geometric and color data; the characters are then segmented and recognized by multiclass Support Vector Machine (SVM) classifiers. Because the geometric and color features are properly used to detect the locations of the road signs, the method is not sensitive to distance and noise and offers higher speed and efficiency. In the results, the proposed approach is applied to an Iranian road speed sign database, and the detection and recognition accuracy rates reach 98.66% and 100%, respectively.
Submitted 20 July, 2014;
originally announced July 2014.
-
Classifiers fusion method to recognize handwritten persian numerals
Authors:
Reza Azad,
Babak Azad,
Iraj Mogharreb,
Shahram Jamali
Abstract:
Recognition of Persian handwritten characters has been a significant field of research in pattern analysis for the last few years. In this paper, a new approach to robust handwritten Persian numeral recognition, using a strong feature set and a classifier fusion method, is investigated to increase the recognition rate. To implement the classifier fusion technique, we considered k nearest neighbour (KNN), linear classifier (LC), and support vector machine (SVM) classifiers. The novelty of this approach is that it attains better precision with few features by using classifier fusion. To evaluate the proposed method, we considered a Persian numerals database with 20,000 handwritten samples. Using 15,000 samples for training, we verified our technique on the other 5,000 samples, and the correct recognition rate reached approximately 99.90%. Additionally, we obtained 99.97% accuracy using four-fold cross-validation on the 20,000-sample database.
Submitted 15 August, 2014; v1 submitted 9 July, 2014;
originally announced July 2014.