Search | arXiv e-print repository

Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

Authors: Moein Heidari, Sina Ghorbani Kolahi, Sanaz Karimijafarbigloo, Bobby Azad, Afshin Bozorgpour, Soheila Hatami, Reza Azad, Ali Diba, Ulas Bagci, Dorit Merhof, Ilker Hacihaliloglu

Abstract: Sequence modeling plays a vital role across various domains, with recurrent neural networks being historically the predominant method of performing these tasks. However, the emergence of transformers has altered this paradigm due to their superior performance. Built upon these advances, transformers have conjoined CNNs as two leading foundational models for learning visual representations. However… ▽ More Sequence modeling plays a vital role across various domains, with recurrent neural networks being historically the predominant method of performing these tasks. However, the emergence of transformers has altered this paradigm due to their superior performance. Built upon these advances, transformers have conjoined CNNs as two leading foundational models for learning visual representations. However, transformers are hindered by the $\mathcal{O}(N^2)$ complexity of their attention mechanisms, while CNNs lack global receptive fields and dynamic weight allocation. State Space Models (SSMs), specifically the \textit{\textbf{Mamba}} model with selection mechanisms and hardware-aware architecture, have garnered immense interest lately in sequential modeling and visual representation learning, challenging the dominance of transformers by providing infinite context lengths and offering substantial efficiency maintaining linear complexity in the input sequence. Capitalizing on the advances in computer vision, medical imaging has heralded a new epoch with Mamba models. Intending to help researchers navigate the surge, this survey seeks to offer an encyclopedic review of Mamba models in medical imaging. Specifically, we start with a comprehensive theoretical review forming the basis of SSMs, including Mamba architecture and its alternatives for sequence modeling paradigms in this context. Next, we offer a structured classification of Mamba models in the medical field and introduce a diverse categorization scheme based on their application, imaging modalities, and targeted organs. Finally, we summarize key challenges, discuss different future research directions of the SSMs in the medical domain, and propose several directions to fulfill the demands of this field. In addition, we have compiled the studies discussed in this paper along with their open-source implementations on our GitHub repository. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: This is the first version of our survey, and the paper is currently under review

arXiv:2403.08387 [pdf, ps, other]

doi 10.1103/PhysRevD.109.124051

Radial perturbations of Ellis-Bronnikov wormholes in slow rotation up to second order

Authors: Bahareh Azad, Jose Luis Blázquez-Salcedo, Fech Scen Khoo, Jutta Kunz

Abstract: We consider slowly rotating Ellis-Bronnikov wormholes and investigate their radial perturbations ($\mathrm{l}=0$), expanding up to second order in rotation. We present the detailed derivations in the general case, including symmetric and non-symmetric wormholes. The calculations show that the unstable mode present in the static case becomes less unstable with increasing rotation, until it reaches… ▽ More We consider slowly rotating Ellis-Bronnikov wormholes and investigate their radial perturbations ($\mathrm{l}=0$), expanding up to second order in rotation. We present the detailed derivations in the general case, including symmetric and non-symmetric wormholes. The calculations show that the unstable mode present in the static case becomes less unstable with increasing rotation, until it reaches zero and then disappears. This indicates that wormhole solutions may become linearly mode stable at sufficiently fast rotation. △ Less

Submitted 13 March, 2024; originally announced March 2024.

Comments: 27 pages, 8 figures

Journal ref: Phys. Rev. D 109, 124051 (2024)

arXiv:2401.02898 [pdf, other]

doi 10.1103/PhysRevD.109.084013

Quasinormal modes of rapidly rotating Ellis-Bronnikov wormholes

Authors: Fech Scen Khoo, Bahareh Azad, Jose Luis Blázquez-Salcedo, Luis Manuel González-Romero, Burkhard Kleihaus, Jutta Kunz, Francisco Navarro-Lérida

Abstract: We present for the first time a study of the quasinormal modes of rapidly rotating Ellis-Bronnikov wormholes in General Relativity. We compute the spectrum of the wormholes using a spectral decomposition of the metric perturbations on a numerical background. We focus on the $M_z=2,3$ sector of the perturbations, and show that the triple isospectrality of the symmetric and static Ellis-Bronnikov wo… ▽ More We present for the first time a study of the quasinormal modes of rapidly rotating Ellis-Bronnikov wormholes in General Relativity. We compute the spectrum of the wormholes using a spectral decomposition of the metric perturbations on a numerical background. We focus on the $M_z=2,3$ sector of the perturbations, and show that the triple isospectrality of the symmetric and static Ellis-Bronnikov wormhole is broken due to rotation, giving rise to a much richer spectrum than the spectrum of Kerr black holes. We do not find any instabilities for $M_z=2,3$ perturbations. △ Less

Submitted 8 April, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

Comments: 19 pages, 4 figures, 3 tables; v2: equations added, results unchanged, matches published version

Journal ref: Phys. Rev. D 109, 084013 (2024)

arXiv:2311.12486 [pdf, other]

HCA-Net: Hierarchical Context Attention Network for Intervertebral Disc Semantic Labeling

Authors: Afshin Bozorgpour, Bobby Azad, Reza Azad, Yury Velichko, Ulas Bagci, Dorit Merhof

Abstract: Accurate and automated segmentation of intervertebral discs (IVDs) in medical images is crucial for assessing spine-related disorders, such as osteoporosis, vertebral fractures, or IVD herniation. We present HCA-Net, a novel contextual attention network architecture for semantic labeling of IVDs, with a special focus on exploiting prior geometric information. Our approach excels at processing feat… ▽ More Accurate and automated segmentation of intervertebral discs (IVDs) in medical images is crucial for assessing spine-related disorders, such as osteoporosis, vertebral fractures, or IVD herniation. We present HCA-Net, a novel contextual attention network architecture for semantic labeling of IVDs, with a special focus on exploiting prior geometric information. Our approach excels at processing features across different scales and effectively consolidating them to capture the intricate spatial relationships within the spinal cord. To achieve this, HCA-Net models IVD labeling as a pose estimation problem, aiming to minimize the discrepancy between each predicted IVD location and its corresponding actual joint location. In addition, we introduce a skeletal loss term to reinforce the model's geometric dependence on the spine. This loss function is designed to constrain the model's predictions to a range that matches the general structure of the human vertebral skeleton. As a result, the network learns to reduce the occurrence of false predictions and adaptively improves the accuracy of IVD location estimation. Through extensive experimental evaluation on multi-center spine datasets, our approach consistently outperforms previous state-of-the-art methods on both MRI T1w and T2w modalities. The codebase is accessible to the public on \href{https://github.com/xmindflow/HCA-Net}{GitHub}. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2310.18689 [pdf, other]

Foundational Models in Medical Imaging: A Comprehensive Survey and Future Vision

Authors: Bobby Azad, Reza Azad, Sania Eskandari, Afshin Bozorgpour, Amirhossein Kazerouni, Islem Rekik, Dorit Merhof

Abstract: Foundation models, large-scale, pre-trained deep-learning models adapted to a wide range of downstream tasks have gained significant interest lately in various deep-learning problems undergoing a paradigm shift with the rise of these models. Trained on large-scale dataset to bridge the gap between different modalities, foundation models facilitate contextual reasoning, generalization, and prompt c… ▽ More Foundation models, large-scale, pre-trained deep-learning models adapted to a wide range of downstream tasks have gained significant interest lately in various deep-learning problems undergoing a paradigm shift with the rise of these models. Trained on large-scale dataset to bridge the gap between different modalities, foundation models facilitate contextual reasoning, generalization, and prompt capabilities at test time. The predictions of these models can be adjusted for new tasks by augmenting the model input with task-specific hints called prompts without requiring extensive labeled data and retraining. Capitalizing on the advances in computer vision, medical imaging has also marked a growing interest in these models. To assist researchers in navigating this direction, this survey intends to provide a comprehensive overview of foundation models in the domain of medical imaging. Specifically, we initiate our exploration by providing an exposition of the fundamental concepts forming the basis of foundation models. Subsequently, we offer a methodical taxonomy of foundation models within the medical domain, proposing a classification system primarily structured around training strategies, while also incorporating additional facets such as application domains, imaging modalities, specific organs of interest, and the algorithms integral to these models. Furthermore, we emphasize the practical use case of some selected approaches and then discuss the opportunities, applications, and future directions of these large-scale pre-trained models, for analyzing medical images. In the same vein, we address the prevailing challenges and research pathways associated with foundational models in medical imaging. These encompass the areas of interpretability, data management, computational requirements, and the nuanced issue of contextual comprehension. △ Less

Submitted 28 October, 2023; originally announced October 2023.

Comments: The paper is currently in the process of being prepared for submission to MIA

arXiv:2309.03881 [pdf, ps, other]

The simple $\mathscr{B}_ψ$-groups

Authors: Morteza Baniasad Azad

Abstract: In a finite group $ G $, $ ψ(G) $ denotes the sum of element orders of $ G $. A finite group $ G $ is said to be a $\mathscr{B}_ψ$-group if $ ψ(H) < |G| $ for any proper subgroup $ H $ of $ G $. In \cite{Lazorec} Lazorec asked: "what can be said about the $\mathscr{B}_ψ$ property of the finite simple groups $ \operatorname{PSL}(2, q) $?" In this paper, we answer this question for the case of no… ▽ More In a finite group $ G $, $ ψ(G) $ denotes the sum of element orders of $ G $. A finite group $ G $ is said to be a $\mathscr{B}_ψ$-group if $ ψ(H) < |G| $ for any proper subgroup $ H $ of $ G $. In \cite{Lazorec} Lazorec asked: "what can be said about the $\mathscr{B}_ψ$ property of the finite simple groups $ \operatorname{PSL}(2, q) $?" In this paper, we answer this question for the case of not only the finite simple groups $ \operatorname{PSL}(2, q) $ but also all other finite simple groups. We show that if $ S $ is a finite simple group, such that $ S \neq Alt(n) $ for any $ n \geq 14 $, then $S$ is a $\mathscr{B}_ψ$-group. △ Less

Submitted 7 September, 2023; originally announced September 2023.

MSC Class: 20D05; 20D60; 20D06; 20D08

arXiv:2309.00108 [pdf, other]

Laplacian-Former: Overcoming the Limitations of Vision Transformers in Local Texture Detection

Authors: Reza Azad, Amirhossein Kazerouni, Babak Azad, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof

Abstract: Vision Transformer (ViT) models have demonstrated a breakthrough in a wide range of computer vision tasks. However, compared to the Convolutional Neural Network (CNN) models, it has been observed that the ViT models struggle to capture high-frequency components of images, which can limit their ability to detect local textures and edge information. As abnormalities in human tissue, such as tumors a… ▽ More Vision Transformer (ViT) models have demonstrated a breakthrough in a wide range of computer vision tasks. However, compared to the Convolutional Neural Network (CNN) models, it has been observed that the ViT models struggle to capture high-frequency components of images, which can limit their ability to detect local textures and edge information. As abnormalities in human tissue, such as tumors and lesions, may greatly vary in structure, texture, and shape, high-frequency information such as texture is crucial for effective semantic segmentation tasks. To address this limitation in ViT models, we propose a new technique, Laplacian-Former, that enhances the self-attention map by adaptively re-calibrating the frequency information in a Laplacian pyramid. More specifically, our proposed method utilizes a dual attention mechanism via efficient attention and frequency attention while the efficient attention mechanism reduces the complexity of self-attention to linear while producing the same output, selectively intensifying the contribution of shape and texture features. Furthermore, we introduce a novel efficient enhancement multi-scale bridge that effectively transfers spatial information from the encoder to the decoder while preserving the fundamental features. We demonstrate the efficacy of Laplacian-former on multi-organ and skin lesion segmentation tasks with +1.87\% and +0.76\% dice scores compared to SOTA approaches, respectively. Our implementation is publically available at https://github.com/mindflow-institue/Laplacian-Former △ Less

Submitted 31 August, 2023; originally announced September 2023.

Comments: Accepted in the main conference MICCAI 2023

arXiv:2308.03670 [pdf]

doi 10.13031/aim.202300569

Improving FHB Screening in Wheat Breeding Using an Efficient Transformer Model

Authors: Babak Azad, Ahmed Abdalla, Kwanghee Won, Ali Mirzakhani Nafchi

Abstract: Fusarium head blight is a devastating disease that causes significant economic losses annually on small grains. Efficiency, accuracy, and timely detection of FHB in the resistance screening are critical for wheat and barley breeding programs. In recent years, various image processing techniques have been developed using supervised machine learning algorithms for the early detection of FHB. The sta… ▽ More Fusarium head blight is a devastating disease that causes significant economic losses annually on small grains. Efficiency, accuracy, and timely detection of FHB in the resistance screening are critical for wheat and barley breeding programs. In recent years, various image processing techniques have been developed using supervised machine learning algorithms for the early detection of FHB. The state-of-the-art convolutional neural network-based methods, such as U-Net, employ a series of encoding blocks to create a local representation and a series of decoding blocks to capture the semantic relations. However, these methods are not often capable of long-range modeling dependencies inside the input data, and their ability to model multi-scale objects with significant variations in texture and shape is limited. Vision transformers as alternative architectures with innate global self-attention mechanisms for sequence-to-sequence prediction, due to insufficient low-level details, may also limit localization capabilities. To overcome these limitations, a new Context Bridge is proposed to integrate the local representation capability of the U-Net network in the transformer model. In addition, the standard attention mechanism of the original transformer is replaced with Efficient Self-attention, which is less complicated than other state-of-the-art methods. To train the proposed network, 12,000 wheat images from an FHB-inoculated wheat field at the SDSU research farm in Volga, SD, were captured. In addition to healthy and unhealthy plants, these images encompass various stages of the disease. A team of expert pathologists annotated the images for training and evaluating the developed model. As a result, the effectiveness of the transformer-based method for FHB-disease detection, through extensive experiments across typical tasks for plant image segmentation, is demonstrated. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: 10 pages, 5 figures, 1 table. Presented at the 2023 ASABE Annual International Meeting conference in Omaha, Nebraska. Also available at https://elibrary.asabe.org/abstract.asp?aid=54149

MSC Class: 68T07; 68T10

Journal ref: In 2023 ASABE Annual International Meeting (p. 1). American Society of Agricultural and Biological Engineers

arXiv:2307.16142 [pdf, other]

Implicit Neural Representation in Medical Imaging: A Comparative Survey

Authors: Amirali Molaei, Amirhossein Aminimehr, Armin Tavakoli, Amirhossein Kazerouni, Bobby Azad, Reza Azad, Dorit Merhof

Abstract: Implicit neural representations (INRs) have gained prominence as a powerful paradigm in scene reconstruction and computer graphics, demonstrating remarkable results. By utilizing neural networks to parameterize data through implicit continuous functions, INRs offer several benefits. Recognizing the potential of INRs beyond these domains, this survey aims to provide a comprehensive overview of INR… ▽ More Implicit neural representations (INRs) have gained prominence as a powerful paradigm in scene reconstruction and computer graphics, demonstrating remarkable results. By utilizing neural networks to parameterize data through implicit continuous functions, INRs offer several benefits. Recognizing the potential of INRs beyond these domains, this survey aims to provide a comprehensive overview of INR models in the field of medical imaging. In medical settings, numerous challenging and ill-posed problems exist, making INRs an attractive solution. The survey explores the application of INRs in various medical imaging tasks, such as image reconstruction, segmentation, registration, novel view synthesis, and compression. It discusses the advantages and limitations of INRs, highlighting their resolution-agnostic nature, memory efficiency, ability to avoid locality biases, and differentiability, enabling adaptation to different tasks. Furthermore, the survey addresses the challenges and considerations specific to medical imaging data, such as data availability, computational complexity, and dynamic clinical scene analysis. It also identifies future research directions and opportunities, including integration with multi-modal imaging, real-time and interactive systems, and domain adaptation for clinical decision support. To facilitate further exploration and implementation of INRs in medical image analysis, we have provided a compilation of cited studies along with their available open-source implementations on \href{https://github.com/mindflow-institue/Awesome-Implicit-Neural-Representations-in-Medical-imaging}. Finally, we aim to consistently incorporate the most recent and relevant papers regularly. △ Less

Submitted 30 July, 2023; originally announced July 2023.

arXiv:2301.05243 [pdf, ps, other]

doi 10.1016/j.physletb.2023.138349

Are slowly rotating Ellis-Bronnikov wormholes stable?

Authors: Bahareh Azad, Jose Luis Blázquez-Salcedo, Fech Scen Khoo, Jutta Kunz

Abstract: We investigate the radial perturbations of Ellis-Bronnikov wormholes ($\mathrm{l}=0$) in a slowly rotating background expanded up to second order in rotation. We find indications that simple wormhole solutions such as Ellis-Bronnikov in General Relativity can be stabilized by rotation, thus favoring a viable traversable wormhole. This opens up the intriguing question whether the many other wormhol… ▽ More We investigate the radial perturbations of Ellis-Bronnikov wormholes ($\mathrm{l}=0$) in a slowly rotating background expanded up to second order in rotation. We find indications that simple wormhole solutions such as Ellis-Bronnikov in General Relativity can be stabilized by rotation, thus favoring a viable traversable wormhole. This opens up the intriguing question whether the many other wormhole solutions with or without the support of exotic matter can become linearly mode stable when the wormhole rotates. △ Less

Submitted 23 November, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

Comments: 6 pages, 3 figures; v2: equations added, minor typos corrected, results unchanged, matches published version

arXiv:2212.12601 [pdf, ps, other]

doi 10.1103/PhysRevD.107.084024

Polar modes and isospectrality of Ellis-Bronnikov wormholes

Authors: Bahareh Azad, Jose Luis Blázquez-Salcedo, Xiao Yan Chew, Jutta Kunz, Dong-han Yeom

Abstract: We consider polar perturbations of static Ellis-Bronnikov wormholes and derive the coupled set of perturbation equations for the gravitational and the scalar field. For massless wormholes the perturbations decouple, and we obtain two identical master equations for the scalar and gravitational modes, which moreover agree with the master equation for the axial modes. Consequently there is isospectra… ▽ More We consider polar perturbations of static Ellis-Bronnikov wormholes and derive the coupled set of perturbation equations for the gravitational and the scalar field. For massless wormholes the perturbations decouple, and we obtain two identical master equations for the scalar and gravitational modes, which moreover agree with the master equation for the axial modes. Consequently there is isospectrality with threefold degenerate modes. For a finite mass of the background wormhole solutions, the equations are coupled. We then obtain two distinct branches of polar quasinormal modes for a given multipole number l, associated with the presence of the two types of fields. We calculate the quasi-normal mode frequencies and decay rates for the branches with l=2, 3 and 4. For a given l the real frequencies of the two branchesget the closer, the higher the multipole number gets. △ Less

Submitted 23 December, 2022; originally announced December 2022.

Comments: 22 pages, 7 figures

arXiv:2212.02941 [pdf, other]

Safe Imitation Learning of Nonlinear Model Predictive Control for Flexible Robots

Authors: Shamil Mamedov, Rudolf Reiter, Seyed Mahdi Basiri Azad, Joschka Boedecker, Moritz Diehl, Jan Swevers

Abstract: Flexible robots may overcome some of the industry's major challenges, such as enabling intrinsically safe human-robot collaboration and achieving a higher load-to-mass ratio. However, controlling flexible robots is complicated due to their complex dynamics, which include oscillatory behavior and a high-dimensional state space. NMPC offers an effective means to control such robots, but its extensiv… ▽ More Flexible robots may overcome some of the industry's major challenges, such as enabling intrinsically safe human-robot collaboration and achieving a higher load-to-mass ratio. However, controlling flexible robots is complicated due to their complex dynamics, which include oscillatory behavior and a high-dimensional state space. NMPC offers an effective means to control such robots, but its extensive computational demands often limit its application in real-time scenarios. To enable fast control of flexible robots, we propose a framework for a safe approximation of NMPC using imitation learning and a predictive safety filter. Our framework significantly reduces computation time while incurring a slight loss in performance. Compared to NMPC, our framework shows more than a eightfold improvement in computation time when controlling a three-dimensional flexible robot arm in simulation, all while guaranteeing safety constraints. Notably, our approach outperforms conventional reinforcement learning methods. The development of fast and safe approximate NMPC holds the potential to accelerate the adoption of flexible robots in industry. △ Less

Submitted 28 September, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

Comments: Submitted to ICRA 2024

arXiv:2209.11693 [pdf, other]

T3VIP: Transformation-based 3D Video Prediction

Authors: Iman Nematollahi, Erick Rosete-Beas, Seyed Mahdi B. Azad, Raghu Rajan, Frank Hutter, Wolfram Burgard

Abstract: For autonomous skill acquisition, robots have to learn about the physical rules governing the 3D world dynamics from their own past experience to predict and reason about plausible future outcomes. To this end, we propose a transformation-based 3D video prediction (T3VIP) approach that explicitly models the 3D motion by decomposing a scene into its object parts and predicting their corresponding r… ▽ More For autonomous skill acquisition, robots have to learn about the physical rules governing the 3D world dynamics from their own past experience to predict and reason about plausible future outcomes. To this end, we propose a transformation-based 3D video prediction (T3VIP) approach that explicitly models the 3D motion by decomposing a scene into its object parts and predicting their corresponding rigid transformations. Our model is fully unsupervised, captures the stochastic nature of the real world, and the observational cues in image and point cloud domains constitute its learning signals. To fully leverage all the 2D and 3D observational signals, we equip our model with automatic hyperparameter optimization (HPO) to interpret the best way of learning from them. To the best of our knowledge, our model is the first generative model that provides an RGB-D video prediction of the future for a static camera. Our extensive evaluation with simulated and real-world datasets demonstrates that our formulation leads to interpretable 3D models that predict future depth videos while achieving on-par performance with 2D models on RGB video prediction. Moreover, we demonstrate that our model outperforms 2D baselines on visuomotor control. Videos, code, dataset, and pre-trained models are available at http://t3vip.cs.uni-freiburg.de. △ Less

Submitted 19 September, 2022; originally announced September 2022.

Comments: Accepted at the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

arXiv:2108.12118 [pdf, other]

doi 10.1007/978-981-16-6636-0_43

Densely-Populated Traffic Detection using YOLOv5 and Non-Maximum Suppression Ensembling

Authors: Raian Rahman, Zadid Bin Azad, Md. Bakhtiar Hasan

Abstract: Vehicular object detection is the heart of any intelligent traffic system. It is essential for urban traffic management. R-CNN, Fast R-CNN, Faster R-CNN and YOLO were some of the earlier state-of-the-art models. Region based CNN methods have the problem of higher inference time which makes it unrealistic to use the model in real-time. YOLO on the other hand struggles to detect small objects that a… ▽ More Vehicular object detection is the heart of any intelligent traffic system. It is essential for urban traffic management. R-CNN, Fast R-CNN, Faster R-CNN and YOLO were some of the earlier state-of-the-art models. Region based CNN methods have the problem of higher inference time which makes it unrealistic to use the model in real-time. YOLO on the other hand struggles to detect small objects that appear in groups. In this paper, we propose a method that can locate and classify vehicular objects from a given densely crowded image using YOLOv5. The shortcoming of YOLO was solved my ensembling 4 different models. Our proposed model performs well on images taken from both top view and side view of the street in both day and night. The performance of our proposed model was measured on Dhaka AI dataset which contains densely crowded vehicular images. Our experiment shows that our model achieved [email protected] of 0.458 with inference time of 0.75 sec which outperforms other state-of-the-art models on performance. Hence, the model can be implemented in the street for real-time traffic detection which can be used for traffic control and data collection. △ Less

Submitted 27 August, 2021; originally announced August 2021.

Comments: 13 pages, 4 figures, conference: International Conference on Big Data, IoT and Machine Learning 2021 (BIM 2021)

arXiv:2010.15023 [pdf, ps, other]

doi 10.1140/epjc/s10052-020-08668-3

Transmission of low-energy scalar waves through a traversable wormhole

Authors: Bahareh Azad, Farhang Loran, Ali Mostafazadeh

Abstract: We study the scattering of low-energy massless and massive minimally coupled scalar fields by an asymptotically flat traversable wormhole. We provide a comprehensive treatment of this problem offering analytic expressions for the transmission and reflection amplitudes of the corresponding effective potential and the absorption cross section of the wormhole. Our results, which are based on a recent… ▽ More We study the scattering of low-energy massless and massive minimally coupled scalar fields by an asymptotically flat traversable wormhole. We provide a comprehensive treatment of this problem offering analytic expressions for the transmission and reflection amplitudes of the corresponding effective potential and the absorption cross section of the wormhole. Our results, which are based on a recently developed dynamical formulation of time-independent scattering theory, apply to a large class of wormhole spacetimes including a wormhole with a sharp transition, the Ellis wormhole, and a family of its generalizations. △ Less

Submitted 14 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

Comments: 21 pages, 3 figures, references added, accepted for publication in European Physical Journal C

arXiv:1905.00815 [pdf, ps, other]

doi 10.4153/S0008439521000047

On Two Conjectures about the Sum of Element Orders

Authors: Morteza Baniasad Azad, Behrooz Khosravi

Abstract: Let $G$ be a finite group and $ψ(G) = \sum_{g \in G} o(g)$, where $o(g)$ denotes the order of $g \in G$. First, we prove that if $G$ is a group of order $n$ and $ψ(G) >31ψ(C_n)/77$, where $C_n$ is the cyclic group of order $n$, then $G$ is supersolvable. This proves a conjecture of M.~{Tărnăuceanu}. Moreover, M. Herzog, P. Longobardi and M. Maj put forward the following conjecture: If $H\leq G$, t… ▽ More Let $G$ be a finite group and $ψ(G) = \sum_{g \in G} o(g)$, where $o(g)$ denotes the order of $g \in G$. First, we prove that if $G$ is a group of order $n$ and $ψ(G) >31ψ(C_n)/77$, where $C_n$ is the cyclic group of order $n$, then $G$ is supersolvable. This proves a conjecture of M.~{Tărnăuceanu}. Moreover, M. Herzog, P. Longobardi and M. Maj put forward the following conjecture: If $H\leq G$, then $ψ(G) \leqslant ψ(H) |G:H|^2$. In the sequel, by an example we show that this conjecture is not satisfied in general. △ Less

Submitted 2 May, 2019; originally announced May 2019.

Comments: 8 pages

MSC Class: 20D60; 20F16

arXiv:1808.00253 [pdf, ps, other]

A Criterion for Solvability of a Finite Group by the Sum of Element Orders

Authors: Morteza Baniasad Azad, Behrooz Khosravi

Abstract: Let $G$ be a finite group and $ψ(G) = \sum_{g \in G} o(g)$, where $o(g)$ denotes the order of $g \in G$. In [M. Herzog, et. al., Two new criteria for solvability of finite groups, J. Algebra, 2018], the authors put forward the following conjecture: \textbf{Conjecture.} \textit{If $G$ is a group of order $n$ and $ψ(G)>211ψ(C_n)/1617 $, where $C_n$ is the cyclic group of order $n$, then $G$ is solva… ▽ More Let $G$ be a finite group and $ψ(G) = \sum_{g \in G} o(g)$, where $o(g)$ denotes the order of $g \in G$. In [M. Herzog, et. al., Two new criteria for solvability of finite groups, J. Algebra, 2018], the authors put forward the following conjecture: \textbf{Conjecture.} \textit{If $G$ is a group of order $n$ and $ψ(G)>211ψ(C_n)/1617 $, where $C_n$ is the cyclic group of order $n$, then $G$ is solvable.} In this paper we prove the validity of this conjecture. △ Less

Submitted 1 August, 2018; originally announced August 2018.

arXiv:1408.1759 [pdf]

Real-Time and Robust Method for Hand Gesture Recognition System Based on Cross-Correlation Coefficient

Authors: Reza Azad, Babak Azad, Iman Tavakoli Kazerooni

Abstract: Hand gesture recognition possesses extensive applications in virtual reality, sign language recognition, and computer games. The direct interface of hand gestures provides us a new way for communicating with the virtual environment. In this paper a novel and real-time approach for hand gesture recognition system is presented. In the suggested method, first, the hand gesture is extracted from the m… ▽ More Hand gesture recognition possesses extensive applications in virtual reality, sign language recognition, and computer games. The direct interface of hand gestures provides us a new way for communicating with the virtual environment. In this paper a novel and real-time approach for hand gesture recognition system is presented. In the suggested method, first, the hand gesture is extracted from the main image by the image segmentation and morphological operation and then is sent to feature extraction stage. In feature extraction stage the Cross-correlation coefficient is applied on the gesture to recognize it. In the result part, the proposed approach is applied on American Sign Language (ASL) database and the accuracy rate obtained 98.34%. △ Less

Submitted 8 August, 2014; originally announced August 2014.

Comments: arXiv admin note: substantial text overlap with http://dx.doi.org/10.1109/ICCCA.2012.6179213 by other author

Journal ref: Advances in Computer Science: an International Journal, Vol. 2, Issue 5, No.6, pp. 121-125, 2013

arXiv:1408.1549 [pdf]

doi 10.5121/ijfcst.2014.4403

Real-Time Human-Computer Interaction Based on Face and Hand Gesture Recognition

Authors: Reza Azad, Babak Azad, Nabil Belhaj Khalifa, Shahram Jamali

Abstract: At the present time, hand gestures recognition system could be used as a more expected and useable approach for human computer interaction. Automatic hand gesture recognition system provides us a new tactic for interactive with the virtual environment. In this paper, a face and hand gesture recognition system which is able to control computer media player is offered. Hand gesture and human face ar… ▽ More At the present time, hand gestures recognition system could be used as a more expected and useable approach for human computer interaction. Automatic hand gesture recognition system provides us a new tactic for interactive with the virtual environment. In this paper, a face and hand gesture recognition system which is able to control computer media player is offered. Hand gesture and human face are the key element to interact with the smart system. We used the face recognition scheme for viewer verification and the hand gesture recognition in mechanism of computer media player, for instance, volume down/up, next music and etc. In the proposed technique, first, the hand gesture and face location is extracted from the main image by combination of skin and cascade detector and then is sent to recognition stage. In recognition stage, first, the threshold condition is inspected then the extracted face and gesture will be recognized. In the result stage, the proposed technique is applied on the video dataset and the high precision ratio acquired. Additional the recommended hand gesture recognition method is applied on static American Sign Language (ASL) database and the correctness rate achieved nearby 99.40%. also the planned method could be used in gesture based computer games and virtual reality. △ Less

Submitted 7 August, 2014; originally announced August 2014.

Journal ref: International Journal in Foundations of Computer Science & Technology 07/2014; 4(4):37-48

arXiv:1407.6498 [pdf]

Real-Time and Efficient Method for Accuracy Enhancement of Edge Based License Plate Recognition System

Authors: Reza Azad, Babak Azad, Hamid Reza Shayegh

Abstract: License Plate Recognition plays an important role on the traffic monitoring and parking management. Administration and restriction of those transportation tools for their better service becomes very essential. In this paper, a fast and real time method has an appropriate application to find plates that the plat has tilt and the picture quality is poor. In the proposed method, at the beginning, the… ▽ More License Plate Recognition plays an important role on the traffic monitoring and parking management. Administration and restriction of those transportation tools for their better service becomes very essential. In this paper, a fast and real time method has an appropriate application to find plates that the plat has tilt and the picture quality is poor. In the proposed method, at the beginning, the image is converted into binary mode with use of adaptive threshold. And with use of edge detection and morphology operation, plate number location has been specified and if the plat has tilt; its tilt is removed away. Then its characters are distinguished using image processing techniques. Finally, K Nearest Neighbour (KNN) classifier was used for character recognition. This method has been tested on available data set that has different images of the background, considering distance, and angel of view so that the correct extraction rate of plate reached at 98% and character recognition rate achieved at 99.12%. Further we tested our character recognition stage on Persian vehicle data set and we achieved 99% correct recognition rate. △ Less

Submitted 24 July, 2014; originally announced July 2014.

Comments: 2013 First International Conference on computer, Information Technology and Digital Media. arXiv admin note: substantial text overlap with arXiv:1407.6321

arXiv:1407.5324 [pdf]

doi 10.7815/ijorcs.41.2014.077

Optimized Method for Iranian Road Signs Detection and recognition system

Authors: Reza Azad, Babak Azad, Iman Tavakoli Kazerooni

Abstract: Road sign recognition is one of the core technologies in Intelligent Transport Systems. In the current study, a robust and real-time method is presented to identify and detect the roads speed signs in road image in different situations. In our proposed method, first, the connected components are created in the main image using the edge detection and mathematical morphology and the location of the… ▽ More Road sign recognition is one of the core technologies in Intelligent Transport Systems. In the current study, a robust and real-time method is presented to identify and detect the roads speed signs in road image in different situations. In our proposed method, first, the connected components are created in the main image using the edge detection and mathematical morphology and the location of the road signs extracted by the geometric and color data; then the letters are segmented and recognized by Multiclass Support Vector Machine (SVMs) classifiers. Regarding that the geometric and color features ate properly used in detection the location of the road signs, so it is not sensitive to the distance and noise and has higher speed and efficiency. In the result part, the proposed approach is applied on Iranian road speed sign database and the detection and recognition accuracy rate achieved 98.66% and 100% respectively. △ Less

Submitted 20 July, 2014; originally announced July 2014.

Journal ref: International Journal of Research in Computer Science, 4 (1): pp. 19-26, January 2014

arXiv:1407.2572

doi 10.5121/ijci.2014.3301

Classifiers fusion method to recognize handwritten persian numerals

Authors: Reza Azad, Babak Azad, Iraj Mogharreb, Shahram Jamali

Abstract: Recognition of Persian handwritten characters has been considered as a significant field of research for the last few years under pattern analysing technique. In this paper, a new approach for robust handwritten Persian numerals recognition using strong feature set and a classifier fusion method is scrutinized to increase the recognition percentage. For implementing the classifier fusion technique… ▽ More Recognition of Persian handwritten characters has been considered as a significant field of research for the last few years under pattern analysing technique. In this paper, a new approach for robust handwritten Persian numerals recognition using strong feature set and a classifier fusion method is scrutinized to increase the recognition percentage. For implementing the classifier fusion technique, we have considered k nearest neighbour (KNN), linear classifier (LC) and support vector machine (SVM) classifiers. The innovation of this tactic is to attain better precision with few features using classifier fusion method. For evaluation of the proposed method we considered a Persian numerals database with 20,000 handwritten samples. Spending 15,000 samples for training stage, we verified our technique on other 5,000 samples, and the correct recognition ratio achieved approximately 99.90%. Additional, we got 99.97% exactness using four-fold cross validation procedure on 20,000 databases. △ Less

Submitted 15 August, 2014; v1 submitted 9 July, 2014; originally announced July 2014.

Comments: This paper has been withdrawn by the author due to a crucial sign error in equation 5 and 6, and some mistake in Table 1 information. please let me for changing this information and updating this paper

Journal ref: International Journal on Cybernetics & Informatics (IJCI) Vol. 3, No. 3, June 2014

Showing 1–22 of 22 results for author: Azad, B