-
Koopman-LQR Controller for Quadrotor UAVs from Data
Authors:
Zeyad M. Manaa,
Ayman M. Abdallah,
Mohammad A. Abido,
Syed S. Azhar Ali
Abstract:
Quadrotor systems are common and beneficial for many fields, but their intricate behavior often makes it challenging to design effective and optimal control strategies. Some traditional approaches to nonlinear control often rely on local linearizations or complex nonlinear models, which can be inaccurate or computationally expensive. We present a data-driven approach to identify the dynamics of a…
▽ More
Quadrotor systems are common and beneficial for many fields, but their intricate behavior often makes it challenging to design effective and optimal control strategies. Some traditional approaches to nonlinear control often rely on local linearizations or complex nonlinear models, which can be inaccurate or computationally expensive. We present a data-driven approach to identify the dynamics of a given quadrotor system using Koopman operator theory. Koopman theory offers a framework for representing nonlinear dynamics as linear operators acting on observable functions of the state space. This allows to approximate nonlinear systems with globally linear models in a higher dimensional space, which can be analyzed and controlled using standard linear optimal control techniques. We leverage the method of extended dynamic mode decomposition (EDMD) to identify Koopman operator from data with total least squares. We demonstrate that the identified model can be stabilized and controllable by designing a controller using linear quadratic regulator (LQR).
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
ComplexTempQA: A Large-Scale Dataset for Complex Temporal Question Answering
Authors:
Raphael Gruber,
Abdelrahman Abdallah,
Michael Färber,
Adam Jatowt
Abstract:
We introduce ComplexTempQA,a large-scale dataset consisting of over 100 million question-answer pairs designed to tackle the challenges in temporal question answering. ComplexTempQA significantly surpasses existing benchmarks like HOTPOTQA, TORQUE, and TEQUILA in scale and scope. Utilizing data from Wikipedia and Wikidata, the dataset covers questions spanning over two decades and offers an unmatc…
▽ More
We introduce ComplexTempQA,a large-scale dataset consisting of over 100 million question-answer pairs designed to tackle the challenges in temporal question answering. ComplexTempQA significantly surpasses existing benchmarks like HOTPOTQA, TORQUE, and TEQUILA in scale and scope. Utilizing data from Wikipedia and Wikidata, the dataset covers questions spanning over two decades and offers an unmatched breadth of topics. We introduce a unique taxonomy that categorizes questions as attributes, comparisons, and counting questions, each revolving around events, entities, and time periods. One standout feature of ComplexTempQA is the high complexity of its questions, which demand effective capabilities for answering such as across-time comparison, temporal aggregation, and multi-hop reasoning involving temporal event ordering and entity recognition. Additionally, each question is accompanied by detailed metadata, including specific time scopes, allowing for comprehensive evaluation and enhancement of the temporal reasoning abilities of large language models. ComplexTempQA serves both as a testing ground for develo** sophisticated AI models and as a foundation for advancing research in question answering, information retrieval, and language understanding. Dataset and code are freely available at: https://github.com/DataScienceUIBK/ComplexTempQA.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset
Authors:
Abdelrahman Abdallah,
Mahmoud Abdalla,
Mahmoud SalahEldin Kasem,
Mohamed Mahmoud,
Ibrahim Abdelhalim,
Mohamed Elkasaby,
Yasser ElBendary,
Adam Jatowt
Abstract:
In the fields of Optical Character Recognition (OCR) and Natural Language Processing (NLP), integrating multilingual capabilities remains a critical challenge, especially when considering languages with complex scripts such as Arabic. This paper introduces the Comprehensive Post-OCR Parsing and Receipt Understanding Dataset (CORU), a novel dataset specifically designed to enhance OCR and informati…
▽ More
In the fields of Optical Character Recognition (OCR) and Natural Language Processing (NLP), integrating multilingual capabilities remains a critical challenge, especially when considering languages with complex scripts such as Arabic. This paper introduces the Comprehensive Post-OCR Parsing and Receipt Understanding Dataset (CORU), a novel dataset specifically designed to enhance OCR and information extraction from receipts in multilingual contexts involving Arabic and English. CORU consists of over 20,000 annotated receipts from diverse retail settings, including supermarkets and clothing stores, alongside 30,000 annotated images for OCR that were utilized to recognize each detected line, and 10,000 items annotated for detailed information extraction. These annotations capture essential details such as merchant names, item descriptions, total prices, receipt numbers, and dates. They are structured to support three primary computational tasks: object detection, OCR, and information extraction. We establish the baseline performance for a range of models on CORU to evaluate the effectiveness of traditional methods, like Tesseract OCR, and more advanced neural network-based approaches. These baselines are crucial for processing the complex and noisy document layouts typical of real-world receipts and for advancing the state of automated multilingual document processing. Our datasets are publicly accessible (https://github.com/Update-For-Integrated-Business-AI/CORU).
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
ArabicaQA: A Comprehensive Dataset for Arabic Question Answering
Authors:
Abdelrahman Abdallah,
Mahmoud Kasem,
Mahmoud Abdalla,
Mohamed Mahmoud,
Mohamed Elkasaby,
Yasser Elbendary,
Adam Jatowt
Abstract:
In this paper, we address the significant gap in Arabic natural language processing (NLP) resources by introducing ArabicaQA, the first large-scale dataset for machine reading comprehension and open-domain question answering in Arabic. This comprehensive dataset, consisting of 89,095 answerable and 3,701 unanswerable questions created by crowdworkers to look similar to answerable ones, along with…
▽ More
In this paper, we address the significant gap in Arabic natural language processing (NLP) resources by introducing ArabicaQA, the first large-scale dataset for machine reading comprehension and open-domain question answering in Arabic. This comprehensive dataset, consisting of 89,095 answerable and 3,701 unanswerable questions created by crowdworkers to look similar to answerable ones, along with additional labels of open-domain questions marks a crucial advancement in Arabic NLP resources. We also present AraDPR, the first dense passage retrieval model trained on the Arabic Wikipedia corpus, specifically designed to tackle the unique challenges of Arabic text retrieval. Furthermore, our study includes extensive benchmarking of large language models (LLMs) for Arabic question answering, critically evaluating their performance in the Arabic language context. In conclusion, ArabicaQA, AraDPR, and the benchmarking of LLMs in Arabic question answering offer significant advancements in the field of Arabic NLP. The dataset and code are publicly accessible for further research https://github.com/DataScienceUIBK/ArabicaQA.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis
Authors:
Abdelrahman Abdallah,
Daniel Eberharter,
Zoe Pfister,
Adam Jatowt
Abstract:
This paper presents a comprehensive survey of research works on the topic of form understanding in the context of scanned documents. We delve into recent advancements and breakthroughs in the field, highlighting the significance of language models and transformers in solving this challenging task. Our research methodology involves an in-depth analysis of popular documents and forms of understandin…
▽ More
This paper presents a comprehensive survey of research works on the topic of form understanding in the context of scanned documents. We delve into recent advancements and breakthroughs in the field, highlighting the significance of language models and transformers in solving this challenging task. Our research methodology involves an in-depth analysis of popular documents and forms of understanding of trends over the last decade, enabling us to offer valuable insights into the evolution of this domain. Focusing on cutting-edge models, we showcase how transformers have propelled the field forward, revolutionizing form-understanding techniques. Our exploration includes an extensive examination of state-of-the-art language models designed to effectively tackle the complexities of noisy scanned documents. Furthermore, we present an overview of the latest and most relevant datasets, which serve as essential benchmarks for evaluating the performance of selected models. By comparing and contrasting the capabilities of these models, we aim to provide researchers and practitioners with useful guidance in choosing the most suitable solutions for their specific form understanding tasks.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Recurrent Neural Networks for Multivariate Loss Reserving and Risk Capital Analysis
Authors:
Pengfei Cai,
Anas Abdallah,
Pratheepa Jeganathan
Abstract:
Reserves comprise most of the liabilities of a property and casualty (P&C) company and are actuaries' best estimate for unpaid future claims. Notably, the reserves for different lines of business (LOB) are related, as there may be dependence between events related to claims. There have been parametric and non-parametric methods in the actuarial industry for loss reserving; only a few tools have be…
▽ More
Reserves comprise most of the liabilities of a property and casualty (P&C) company and are actuaries' best estimate for unpaid future claims. Notably, the reserves for different lines of business (LOB) are related, as there may be dependence between events related to claims. There have been parametric and non-parametric methods in the actuarial industry for loss reserving; only a few tools have been developed to use the recurrent neural network (RNN) for multivariate loss reserving and risk capital analyses. This paper aims to study RNN methods to model dependence between loss triangles and develop predictive distribution for reserves using machine learning. Thus, we create an RNN model to capture dependence between LOBs by extending the Deep Triangle (DT) model from Kuo (2019). In the extended Deep Triangle (EDT), we use the incremental paid loss from two LOBs as input and the symmetric squared loss of two LOBs as the loss function. Then, we extend generative adversarial networks (GANs) by transforming the two loss triangles into a tabular format and generating synthetic loss triangles to obtain the predictive distribution for reserves. To illustrate our method, we apply and calibrate these methods on personal and commercial automobile lines from a large US P&C insurance company and compare the results with copula regression models. The results show that the EDT model performs better than the copula regression models in predicting total loss reserve. In addition, with the obtained predictive distribution for reserves, we show that risk capitals calculated from EDT combined with GAN are smaller than that of the copula regression models, which implies a more considerable diversification benefit. Finally, these findings are also confirmed in a simulation study.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Leveraging Data Collection and Unsupervised Learning for Code-switched Tunisian Arabic Automatic Speech Recognition
Authors:
Ahmed Amine Ben Abdallah,
Ata Kabboudi,
Amir Kanoun,
Salah Zaiem
Abstract:
Crafting an effective Automatic Speech Recognition (ASR) solution for dialects demands innovative approaches that not only address the data scarcity issue but also navigate the intricacies of linguistic diversity. In this paper, we address the aforementioned ASR challenge, focusing on the Tunisian dialect. First, textual and audio data is collected and in some cases annotated. Second, we explore s…
▽ More
Crafting an effective Automatic Speech Recognition (ASR) solution for dialects demands innovative approaches that not only address the data scarcity issue but also navigate the intricacies of linguistic diversity. In this paper, we address the aforementioned ASR challenge, focusing on the Tunisian dialect. First, textual and audio data is collected and in some cases annotated. Second, we explore self-supervision, semi-supervision and few-shot code-switching approaches to push the state-of-the-art on different Tunisian test sets; covering different acoustic, linguistic and prosodic conditions. Finally, and given the absence of conventional spelling, we produce a human evaluation of our transcripts to avoid the noise coming from spelling inadequacies in our testing references. Our models, allowing to transcribe audio samples in a linguistic mix involving Tunisian Arabic, English and French, and all the data used during training and testing are released for public use and further improvements.
△ Less
Submitted 25 September, 2023; v1 submitted 20 September, 2023;
originally announced September 2023.
-
AMuRD: Annotated Arabic-English Receipt Dataset for Key Information Extraction and Classification
Authors:
Abdelrahman Abdallah,
Mahmoud Abdalla,
Mohamed Elkasaby,
Yasser Elbendary,
Adam Jatowt
Abstract:
The extraction of key information from receipts is a complex task that involves the recognition and extraction of text from scanned receipts. This process is crucial as it enables the retrieval of essential content and organizing it into structured documents for easy access and analysis. In this paper, we present AMuRD, a novel multilingual human-annotated dataset specifically designed for informa…
▽ More
The extraction of key information from receipts is a complex task that involves the recognition and extraction of text from scanned receipts. This process is crucial as it enables the retrieval of essential content and organizing it into structured documents for easy access and analysis. In this paper, we present AMuRD, a novel multilingual human-annotated dataset specifically designed for information extraction from receipts. This dataset comprises $47,720$ samples and addresses the key challenges in information extraction and item classification - the two critical aspects of data analysis in the retail industry. Each sample includes annotations for item names and attributes such as price, brand, and more. This detailed annotation facilitates a comprehensive understanding of each item on the receipt. Furthermore, the dataset provides classification into $44$ distinct product categories. This classification feature allows for a more organized and efficient analysis of the items, enhancing the usability of the dataset for various applications. In our study, we evaluated various language model architectures, e.g., by fine-tuning LLaMA models on the AMuRD dataset. Our approach yielded exceptional results, with an F1 score of 97.43\% and accuracy of 94.99\% in information extraction and classification, and an even higher F1 score of 98.51\% and accuracy of 97.06\% observed in specific tasks. The dataset and code are publicly accessible for further researchhttps://github.com/Update-For-Integrated-Business-AI/AMuRD.
△ Less
Submitted 26 March, 2024; v1 submitted 18 September, 2023;
originally announced September 2023.
-
ICU Mortality Prediction Using Long Short-Term Memory Networks
Authors:
Manel Mili,
Asma Kerkeni,
Asma Ben Abdallah,
Mohamed Hedi Bedoui
Abstract:
Extensive bedside monitoring in Intensive Care Units (ICUs) has resulted in complex temporal data regarding patient physiology, which presents an upscale context for clinical data analysis. In the other hand, identifying the time-series patterns within these data may provide a high aptitude to predict clinical events. Hence, we investigate, during this work, the implementation of an automatic data…
▽ More
Extensive bedside monitoring in Intensive Care Units (ICUs) has resulted in complex temporal data regarding patient physiology, which presents an upscale context for clinical data analysis. In the other hand, identifying the time-series patterns within these data may provide a high aptitude to predict clinical events. Hence, we investigate, during this work, the implementation of an automatic data-driven system, which analyzes large amounts of multivariate temporal data derived from Electronic Health Records (EHRs), and extracts high-level information so as to predict in-hospital mortality and Length of Stay (LOS) early. Practically, we investigate the applicability of LSTM network by reducing the time-frame to 6-hour so as to enhance clinical tasks. The experimental results highlight the efficiency of LSTM model with rigorous multivariate time-series measurements for building real-world prediction engines.
△ Less
Submitted 18 August, 2023;
originally announced August 2023.
-
Generator-Retriever-Generator Approach for Open-Domain Question Answering
Authors:
Abdelrahman Abdallah,
Adam Jatowt
Abstract:
Open-domain question answering (QA) tasks usually require the retrieval of relevant information from a large corpus to generate accurate answers. We propose a novel approach called Generator-Retriever-Generator (GRG) that combines document retrieval techniques with a large language model (LLM), by first prompting the model to generate contextual documents based on a given question. In parallel, a…
▽ More
Open-domain question answering (QA) tasks usually require the retrieval of relevant information from a large corpus to generate accurate answers. We propose a novel approach called Generator-Retriever-Generator (GRG) that combines document retrieval techniques with a large language model (LLM), by first prompting the model to generate contextual documents based on a given question. In parallel, a dual-encoder network retrieves documents that are relevant to the question from an external corpus. The generated and retrieved documents are then passed to the second LLM, which generates the final answer. By combining document retrieval and LLM generation, our approach addresses the challenges of open-domain QA, such as generating informative and contextually relevant answers. GRG outperforms the state-of-the-art generate-then-read and retrieve-then-read pipelines (GENREAD and RFiD) improving their performance by at least by +5.2, +4.2, and +1.6 on TriviaQA, NQ, and WebQ datasets, respectively. We provide code, datasets, and checkpoints at https://github.com/abdoelsayed2016/GRG.
△ Less
Submitted 26 March, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Antenna Selection With Beam Squint Compensation for Integrated Sensing and Communications
Authors:
Ahmet M. Elbir,
Asmaa Abdallah,
Abdulkadir Celik,
Ahmed M. Eltawil
Abstract:
Next-generation wireless networks strive for higher communication rates, ultra-low latency, seamless connectivity, and high-resolution sensing capabilities. To meet these demands, terahertz (THz)-band signal processing is envisioned as a key technology offering wide bandwidth and sub-millimeter wavelength. Furthermore, THz integrated sensing and communications (ISAC) paradigm has emerged jointly a…
▽ More
Next-generation wireless networks strive for higher communication rates, ultra-low latency, seamless connectivity, and high-resolution sensing capabilities. To meet these demands, terahertz (THz)-band signal processing is envisioned as a key technology offering wide bandwidth and sub-millimeter wavelength. Furthermore, THz integrated sensing and communications (ISAC) paradigm has emerged jointly access spectrum and reduced hardware costs through a unified platform. To address the challenges in THz propagation, THz-ISAC systems employ extremely large antenna arrays to improve the beamforming gain for communications with high data rates and sensing with high resolution. However, the cost and power consumption of implementing fully digital beamformers are prohibitive. While hybrid analog/digital beamforming can be a potential solution, the use of subcarrier-independent analog beamformers leads to the beam-squint phenomenon where different subcarriers observe distinct directions because of adopting the same analog beamformer across all subcarriers. In this paper, we develop a sparse array architecture for THz-ISAC with hybrid beamforming to provide a cost-effective solution. We analyze the antenna selection problem under beam-squint influence and introduce a manifold optimization approach for hybrid beamforming design. To reduce computational and memory costs, we propose novel algorithms leveraging grouped subarrays, quantized performance metrics, and sequential optimization. These approaches yield a significant reduction in the number of possible subarray configurations, which enables us to devise a neural network with classification model to accurately perform antenna selection.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Data-driven Discovery of The Quadrotor Equations of Motion Via Sparse Identification of Nonlinear Dynamics
Authors:
Zeyad M. Manaa,
Mohammed R. Elbalshy,
Ayman M. Abdallah
Abstract:
Dynamical systems provide a mathematical framework for understanding complex physical phenomena. The mathematical formulation of these systems plays a crucial role in numerous applications; however, it often proves to be quite intricate. Fortunately, data can be readily available through sensor measurements or numerical simulations. In this study, we employ the Sparse Identification of Nonlinear D…
▽ More
Dynamical systems provide a mathematical framework for understanding complex physical phenomena. The mathematical formulation of these systems plays a crucial role in numerous applications; however, it often proves to be quite intricate. Fortunately, data can be readily available through sensor measurements or numerical simulations. In this study, we employ the Sparse Identification of Nonlinear Dynamics (SINDy) algorithm to extract a mathematical model solely from data. The influence of the hyperparameter $λ$ on the sparsity of the identified dynamics is discussed. Additionally, we investigate the impact of data size and the time step between snapshots on the discovered model. To serve as a data source, a ground truth mathematical model was derived from the first principals, we focus on modeling the dynamics of a generic 6 Degrees of Freedom (DOF) quadrotor. For the scope of this initial manuscript and for simplicity and algorithm validation purposes, we specifically consider a sub-case of the 6 DOF system for simulation, restricting the quadrotor's motion to a 2-dimensional plane (i.e. 3 DOF). To evaluate the efficacy of the SINDy algorithm, we simulate three cases employing a Proportional-Derivative (PD) controller for the 3 DOF case including different trajectories. The performance of SINDy model is assessed through the evaluation of absolute error metrics and root mean squared error (RMSE). Interestingly, the predicted states exhibit at most a RMSE of order of magnitude approximately $10^{-4}$, manifestation of the algorithm's effectiveness. This research highlights the application of the SINDy algorithm in extracting the quadrotor mathematical models from data.
△ Less
Submitted 4 November, 2023; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Exploring the State of the Art in Legal QA Systems
Authors:
Abdelrahman Abdallah,
Bhawna Piryani,
Adam Jatowt
Abstract:
Answering questions related to the legal domain is a complex task, primarily due to the intricate nature and diverse range of legal document systems. Providing an accurate answer to a legal query typically necessitates specialized knowledge in the relevant domain, which makes this task all the more challenging, even for human experts. Question answering (QA) systems are designed to generate answer…
▽ More
Answering questions related to the legal domain is a complex task, primarily due to the intricate nature and diverse range of legal document systems. Providing an accurate answer to a legal query typically necessitates specialized knowledge in the relevant domain, which makes this task all the more challenging, even for human experts. Question answering (QA) systems are designed to generate answers to questions asked in human languages. QA uses natural language processing to understand questions and search through information to find relevant answers. QA has various practical applications, including customer service, education, research, and cross-lingual communication. However, QA faces challenges such as improving natural language understanding and handling complex and ambiguous questions. Answering questions related to the legal domain is a complex task, primarily due to the intricate nature and diverse range of legal document systems. Providing an accurate answer to a legal query typically necessitates specialized knowledge in the relevant domain, which makes this task all the more challenging, even for human experts. At this time, there is a lack of surveys that discuss legal question answering. To address this problem, we provide a comprehensive survey that reviews 14 benchmark datasets for question-answering in the legal field as well as presents a comprehensive review of the state-of-the-art Legal Question Answering deep learning models. We cover the different architectures and techniques used in these studies and the performance and limitations of these models. Moreover, we have established a public GitHub repository where we regularly upload the most recent articles, open data, and source code. The repository is available at: \url{https://github.com/abdoelsayed2016/Legal-Question-Answering-Review}.
△ Less
Submitted 15 September, 2023; v1 submitted 13 April, 2023;
originally announced April 2023.
-
Spatial Path Index Modulation in mmWave/THz-Band Integrated Sensing and Communications
Authors:
Ahmet M. Elbir,
Kumar Vijay Mishra,
Asmaa Abdallah,
Abdulkadir Celik,
Ahmed M. Eltawil
Abstract:
As the demand for wireless connectivity continues to soar, the fifth generation and beyond wireless networks are exploring new ways to efficiently utilize the wireless spectrum and reduce hardware costs. One such approach is the integration of sensing and communications (ISAC) paradigms to jointly access the spectrum. Recent ISAC studies have focused on upper millimeter-wave and low terahertz band…
▽ More
As the demand for wireless connectivity continues to soar, the fifth generation and beyond wireless networks are exploring new ways to efficiently utilize the wireless spectrum and reduce hardware costs. One such approach is the integration of sensing and communications (ISAC) paradigms to jointly access the spectrum. Recent ISAC studies have focused on upper millimeter-wave and low terahertz bands to exploit ultrawide bandwidths. At these frequencies, hybrid beamformers that employ fewer radio-frequency chains are employed to offset expensive hardware but at the cost of lower multiplexing gains. Wideband hybrid beamforming also suffers from the beam-split effect arising from the subcarrier-independent (SI) analog beamformers. To overcome these limitations, this paper introduces a spatial path index modulation (SPIM) ISAC architecture, which transmits additional information bits via modulating the spatial paths between the base station and communications users. We design the SPIM-ISAC beamformers by first estimating both radar and communications parameters by develo** beam-split-aware algorithms. Then, we propose to employ a family of hybrid beamforming techniques such as hybrid, SI, and subcarrier-dependent analog-only, and beam-split-aware beamformers. Numerical experiments demonstrate that the proposed SPIM-ISAC approach exhibits significantly improved spectral efficiency performance in the presence of beam-split than that of even fully digital non-SPIM beamformers.
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
Low-rank LQR Optimal Control Design over Wireless Communication Networks
Authors:
Myung Cho,
Abdallah Abdallah,
Mohammad Rasouli
Abstract:
This paper considers a LQR optimal control design problem for distributed control systems with multi-agents. To control large-scale distributed systems such as smart-grid and multi-agent robotic systems over wireless communication networks, it is desired to design a feedback controller by considering various constraints on communication such as limited power, limited energy, or limited communicati…
▽ More
This paper considers a LQR optimal control design problem for distributed control systems with multi-agents. To control large-scale distributed systems such as smart-grid and multi-agent robotic systems over wireless communication networks, it is desired to design a feedback controller by considering various constraints on communication such as limited power, limited energy, or limited communication bandwidth, etc. In this paper, we focus on the reduction of communication energy in an LQR optimal control design problem on wireless communication networks. By considering the characteristic of wireless communication, i.e., Radio Frequency (RF) signal can spread in all directions in a broadcast way, we formulate a low-rank LQR optimal control model to reduce the communication energy in the distributed feedback control system. To solve the problem, we propose an Alternating Direction Method of Multipliers (ADMM) based algorithm. Through various numerical experiments, we demonstrate that a feedback controller designed using low-rank structure can outperform the previous work on sparse LQR optimal control design, which focuses on reducing the number of communication links in a network, in terms of energy consumption, system stability margin against noise and error in communication.
△ Less
Submitted 31 January, 2023;
originally announced January 2023.
-
Deep learning for table detection and structure recognition: A survey
Authors:
Mahmoud Kasem,
Abdelrahman Abdallah,
Alexander Berendeyev,
Ebrahem Elkady,
Mahmoud Abdalla,
Mohamed Mahmoud,
Mohamed Hamada,
Daniyar Nurseitov,
Islam Taj-Eddin
Abstract:
Tables are everywhere, from scientific journals, papers, websites, and newspapers all the way to items we buy at the supermarket. Detecting them is thus of utmost importance to automatically understanding the content of a document. The performance of table detection has substantially increased thanks to the rapid development of deep learning networks. The goals of this survey are to provide a prof…
▽ More
Tables are everywhere, from scientific journals, papers, websites, and newspapers all the way to items we buy at the supermarket. Detecting them is thus of utmost importance to automatically understanding the content of a document. The performance of table detection has substantially increased thanks to the rapid development of deep learning networks. The goals of this survey are to provide a profound comprehension of the major developments in the field of Table Detection, offer insight into the different methodologies, and provide a systematic taxonomy of the different approaches. Furthermore, we provide an analysis of both classic and new applications in the field. Lastly, the datasets and source code of the existing models are organized to provide the reader with a compass on this vast literature. Finally, we go over the architecture of utilizing various object detection and table structure recognition methods to create an effective and efficient system, as well as a set of development trends to keep up with state-of-the-art algorithms and future research. We have also set up a public GitHub repository where we will be updating the most recent publications, open data, and source code. The GitHub repository is available at https://github.com/abdoelsayed2016/table-detection-structure-recognition.
△ Less
Submitted 15 November, 2022;
originally announced November 2022.
-
RIS-Assisted Grant-Free NOMA
Authors:
Recep Akif Tasci,
Fatih Kilinc,
Abdulkadir Celik,
Asmaa Abdallah,
Ahmed M. Eltawil,
Ertugrul Basar
Abstract:
This paper introduces a reconfigurable intelligent surface (RIS)-assisted grant-free non-orthogonal multiple-access (GF-NOMA) scheme. To ensure the power reception disparity required by the power domain NOMA (PD-NOMA), we propose a joint user clustering and RIS assignment/alignment approach that maximizes the network sum rate by judiciously pairing user equipments (UEs) with distinct channel gains…
▽ More
This paper introduces a reconfigurable intelligent surface (RIS)-assisted grant-free non-orthogonal multiple-access (GF-NOMA) scheme. To ensure the power reception disparity required by the power domain NOMA (PD-NOMA), we propose a joint user clustering and RIS assignment/alignment approach that maximizes the network sum rate by judiciously pairing user equipments (UEs) with distinct channel gains, assigning RISs to proper clusters, and aligning RIS phase shifts to the cluster members yielding the highest cluster sum rate. Once UEs are acknowledged with the cluster index, they are allowed to access their resource blocks (RBs) at any time requiring neither further grant acquisitions from the base station (BS) nor power control as all UEs are requested to transmit at the same power. In this way, the proposed approach performs an implicit over-the-air power control with minimal control signaling between BS and UEs, which has shown to deliver up to 20% higher network sum rate than benchmark GF-NOMA and optimal grant-based PD-NOMA schemes depending on the network parameters. The given numerical results also investigate the impact of UE density, RIS deployment, and RIS hardware specifications on the overall performance of the proposed RIS-aided GF-NOMA scheme.
△ Less
Submitted 15 June, 2023; v1 submitted 23 July, 2022;
originally announced July 2022.
-
Enhancing Core Image Classification Using Generative Adversarial Networks (GANs)
Authors:
Galymzhan Abdimanap,
Kairat Bostanbekov,
Abdelrahman Abdallah,
Anel Alimova,
Darkhan Kurmangaliyev,
Daniyar Nurseitov
Abstract:
In the thrilling world of oil exploration, drill core samples are key to unlocking geological information critical to finding lucrative oil deposits. Despite the importance of these samples, traditional core logging techniques are known to be laborious and, worse still, subjective. Thankfully, the industry has embraced an innovative solution core imaging that allows for nondestructive and noninvas…
▽ More
In the thrilling world of oil exploration, drill core samples are key to unlocking geological information critical to finding lucrative oil deposits. Despite the importance of these samples, traditional core logging techniques are known to be laborious and, worse still, subjective. Thankfully, the industry has embraced an innovative solution core imaging that allows for nondestructive and noninvasive rapid characterization of large quantities of drill cores. Our preeminent research paper aims to tackle the pressing problem of core detection and classification. Using state-of-the-art techniques, we present a groundbreaking solution that will transform the industry. Our first challenge is detecting the cores and segmenting the holes in images, which we will achieve using the Faster RCNN and Mask RCNN models, respectively. Then, we will address the problem of filling the hole in the core image, utilizing the powerful Generative Adversarial Networks (GANs) and employing Contextual Residual Aggregation (CRA) to create high-frequency residuals for missing contents in images. Finally, we will apply sophisticated texture recognition models for the classification of core images, revealing crucial information to oil companies in their quest to uncover valuable oil deposits. Our research paper presents an innovative and groundbreaking approach to tackling the complex issues surrounding core detection and classification. By harnessing cutting-edge techniques and technologies, we are poised to revolutionize the industry and make significant contributions to the field of oil exploration.
△ Less
Submitted 25 August, 2023; v1 submitted 21 April, 2022;
originally announced April 2022.
-
KOHTD: Kazakh Offline Handwritten Text Dataset
Authors:
Nazgul Toiganbayeva,
Mahmoud Kasem,
Galymzhan Abdimanap,
Kairat Bostanbekov,
Abdelrahman Abdallah,
Anel Alimova,
Daniyar Nurseitov
Abstract:
Despite the transition to digital information exchange, many documents, such as invoices, taxes, memos and questionnaires, historical data, and answers to exam questions, still require handwritten inputs. In this regard, there is a need to implement Handwritten Text Recognition (HTR) which is an automatic way to decrypt records using a computer. Handwriting recognition is challenging because of th…
▽ More
Despite the transition to digital information exchange, many documents, such as invoices, taxes, memos and questionnaires, historical data, and answers to exam questions, still require handwritten inputs. In this regard, there is a need to implement Handwritten Text Recognition (HTR) which is an automatic way to decrypt records using a computer. Handwriting recognition is challenging because of the virtually infinite number of ways a person can write the same message. For this proposal we introduce Kazakh handwritten text recognition research, a comprehensive dataset of Kazakh handwritten texts is necessary. This is particularly true given the lack of a dataset for handwritten Kazakh text. In this paper, we proposed our extensive Kazakh offline Handwritten Text dataset (KOHTD), which has 3000 handwritten exam papers and more than 140335 segmented images and there are approximately 922010 symbols. It can serve researchers in the field of handwriting recognition tasks by using deep and machine learning. We used a variety of popular text recognition methods for word and line recognition in our studies, including CTC-based and attention-based methods. The findings demonstrate KOHTD's diversity. Also, we proposed a Genetic Algorithm (GA) for line and word segmentation based on random enumeration of a parameter. The dataset and GA code are available at https://github.com/abdoelsayed2016/KOHTD.
△ Less
Submitted 22 September, 2021;
originally announced October 2021.
-
TNCR: Table Net Detection and Classification Dataset
Authors:
Abdelrahman Abdallah,
Alexander Berendeyev,
Islam Nuradin,
Daniyar Nurseitov
Abstract:
We present TNCR, a new table dataset with varying image quality collected from free websites. The TNCR dataset can be used for table detection in scanned document images and their classification into 5 different classes. TNCR contains 9428 high-quality labeled images. In this paper, we have implemented state-of-the-art deep learning-based methods for table detection to create several strong baseli…
▽ More
We present TNCR, a new table dataset with varying image quality collected from free websites. The TNCR dataset can be used for table detection in scanned document images and their classification into 5 different classes. TNCR contains 9428 high-quality labeled images. In this paper, we have implemented state-of-the-art deep learning-based methods for table detection to create several strong baselines. Cascade Mask R-CNN with ResNeXt-101-64x4d Backbone Network achieves the highest performance compared to other methods with a precision of 79.7%, recall of 89.8%, and f1 score of 84.4% on the TNCR dataset. We have made TNCR open source in the hope of encouraging more deep learning approaches to table detection, classification, and structure recognition. The dataset and trained model checkpoints are available at https://github.com/abdoelsayed2016/TNCR_Dataset.
△ Less
Submitted 19 June, 2021;
originally announced June 2021.
-
Terahertz-Band MIMO-NOMA: Adaptive Superposition Coding and Subspace Detection
Authors:
Hadi Sarieddeen,
Asmaa Abdallah,
Mohammad M. Mansour,
Mohamed-Slim Alouini,
Tareq Y. Al-Naffouri
Abstract:
We consider the problem of efficient ultra-massive multiple-input multiple-output (UM-MIMO) data detection in terahertz (THz)-band non-orthogonal multiple access (NOMA) systems. We argue that the most common THz NOMA configuration is power-domain superposition coding over quasi-optical doubly-massive MIMO channels. We propose spatial tuning techniques that modify antenna subarray arrangements to e…
▽ More
We consider the problem of efficient ultra-massive multiple-input multiple-output (UM-MIMO) data detection in terahertz (THz)-band non-orthogonal multiple access (NOMA) systems. We argue that the most common THz NOMA configuration is power-domain superposition coding over quasi-optical doubly-massive MIMO channels. We propose spatial tuning techniques that modify antenna subarray arrangements to enhance channel conditions. Towards recovering the superposed data at the receiver side, we propose a family of data detectors based on low-complexity channel matrix puncturing, in which higher-order detectors are dynamically formed from lower-order component detectors. We first detail the proposed solutions for the case of superposition coding of multiple streams in point-to-point THz MIMO links. We then extend the study to multi-user NOMA, in which randomly distributed users get grouped into narrow cell sectors and are allocated different power levels depending on their proximity to the base station. We show that successive interference cancellation is carried with minimal performance and complexity costs under spatial tuning. We derive approximate bit error rate (BER) equations, and we propose an architectural design to illustrate complexity reductions. Under typical THz conditions, channel puncturing introduces more than an order of magnitude reduction in BER at high signal-to-noise ratios while reducing complexity by approximately 90%.
△ Less
Submitted 3 March, 2021;
originally announced March 2021.
-
Deep Learning Based Frequency-Selective Channel Estimation for Hybrid mmWave MIMO Systems
Authors:
Asmaa Abdallah,
Abdulkadir Celik,
Mohammad M. Mansour,
Ahmed M. Eltawil
Abstract:
Millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) systems typically employ hybrid mixed signal processing to avoid expensive hardware and high training overheads. {However, the lack of fully digital beamforming at mmWave bands imposes additional challenges in channel estimation. Prior art on hybrid architectures has mainly focused on greedy optimization algorithms to estimate…
▽ More
Millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) systems typically employ hybrid mixed signal processing to avoid expensive hardware and high training overheads. {However, the lack of fully digital beamforming at mmWave bands imposes additional challenges in channel estimation. Prior art on hybrid architectures has mainly focused on greedy optimization algorithms to estimate frequency-flat narrowband mmWave channels, despite the fact that in practice, the large bandwidth associated with mmWave channels results in frequency-selective channels. In this paper, we consider a frequency-selective wideband mmWave system and propose two deep learning (DL) compressive sensing (CS) based algorithms for channel estimation.} The proposed algorithms learn critical apriori information from training data to provide highly accurate channel estimates with low training overhead. In the first approach, a DL-CS based algorithm simultaneously estimates the channel supports in the frequency domain, which are then used for channel reconstruction. The second approach exploits the estimated supports to apply a low-complexity multi-resolution fine-tuning method to further enhance the estimation performance. Simulation results demonstrate that the proposed DL-based schemes significantly outperform conventional orthogonal matching pursuit (OMP) techniques in terms of the normalized mean-squared error (NMSE), computational complexity, and spectral efficiency, particularly in the low signal-to-noise ratio regime. When compared to OMP approaches that achieve an NMSE gap of \$\unit[\{4-10\}]{dB}\$ with respect to the Cramer Rao Lower Bound (CRLB), the proposed algorithms reduce the CRLB gap to only \$\unit[\{1-1.5\}]{dB}\$, while significantly reducing complexity by two orders of magnitude.
△ Less
Submitted 22 February, 2021;
originally announced February 2021.
-
Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models
Authors:
Daniyar Nurseitov,
Kairat Bostanbekov,
Maksat Kanatov,
Anel Alimova,
Abdelrahman Abdallah,
Galymzhan Abdimanap
Abstract:
This article discusses the problem of handwriting recognition in Kazakh and Russian languages. This area is poorly studied since in the literature there are almost no works in this direction. We have tried to describe various approaches and achievements of recent years in the development of handwritten recognition models in relation to Cyrillic graphics. The first model uses deep convolutional neu…
▽ More
This article discusses the problem of handwriting recognition in Kazakh and Russian languages. This area is poorly studied since in the literature there are almost no works in this direction. We have tried to describe various approaches and achievements of recent years in the development of handwritten recognition models in relation to Cyrillic graphics. The first model uses deep convolutional neural networks (CNNs) for feature extraction and a fully connected multilayer perceptron neural network (MLP) for word classification. The second model, called SimpleHTR, uses CNN and recurrent neural network (RNN) layers to extract information from images. We also proposed the Bluechet and Puchserver models to compare the results. Due to the lack of available open datasets in Russian and Kazakh languages, we carried out work to collect data that included handwritten names of countries and cities from 42 different Cyrillic words, written more than 500 times in different handwriting. We also used a handwritten database of Kazakh and Russian languages (HKR). This is a new database of Cyrillic words (not only countries and cities) for the Russian and Kazakh languages, created by the authors of this work.
△ Less
Submitted 9 February, 2021;
originally announced February 2021.
-
Estimate The Efficiency Of Multiprocessor's Cash Memory Work Algorithms
Authors:
Mohamed A. Hamada,
Abdelrahman Abdallah
Abstract:
Many computer systems for calculating the proper organization of memory are among the most critical issues. Using a tier cache memory (along with branching prediction) is an effective means of increasing modern multi-core processors' performance. Designing high-performance processors is a complex task and requires preliminary verification and analysis of the model level, usually used in analytical…
▽ More
Many computer systems for calculating the proper organization of memory are among the most critical issues. Using a tier cache memory (along with branching prediction) is an effective means of increasing modern multi-core processors' performance. Designing high-performance processors is a complex task and requires preliminary verification and analysis of the model level, usually used in analytical and simulation modeling. The refinement of extreme programming is an unfortunate challenge. Few experts disagree with the synthesis of access points. This article demonstrates that Internet QoS and 16-bit architectures are always incompatible, but it's the same situation for write-back caches. The solution to this problem can be implemented by analyzing simulation models of different complexity in combination with the analytical evaluation of individual algorithms. This work is devoted to designing a multi-parameter simulation model of a multi-process for evaluating the performance of cache memory algorithms and the optimality of the structure. Optimization of the structures and algorithms of the cache memory allows you to accelerate the interaction of the memory process and improve the performance of the entire system.
△ Less
Submitted 20 May, 2021; v1 submitted 7 February, 2021;
originally announced February 2021.
-
An Efficient Paradigm for Feasibility Guarantees in Legged Locomotion
Authors:
Abdelrahman Abdallah,
Michele Focchi,
Romeo Orsolino,
Claudio Semini
Abstract:
Develo** feasible body trajectories for legged systems on arbitrary terrains is a challenging task. In this paper, we present a paradigm that allows to design feasible Center of Mass (CoM) and body trajectories in an efficient manner. In our previous work [1], we introduced the notion of the 2D feasible region, where static balance and the satisfaction of joint torque limits were guaranteed, whe…
▽ More
Develo** feasible body trajectories for legged systems on arbitrary terrains is a challenging task. In this paper, we present a paradigm that allows to design feasible Center of Mass (CoM) and body trajectories in an efficient manner. In our previous work [1], we introduced the notion of the 2D feasible region, where static balance and the satisfaction of joint torque limits were guaranteed, whenever the projection of the CoM lied inside the proposed admissible region. In this work we propose a general formulation of the improved feasible region that guarantees dynamic balance alongside the satisfaction of both joint-torque and kinematic limits in an efficient manner. To incorporate the feasibility of the kinematic limits, we introduce an algorithm that computes the reachable region of the CoM. Furthermore, we propose an efficient planning strategy that utilizes the improved feasible region to design feasible CoM and body orientation trajectories. Finally, we validate the capabilities of the improved feasible region and the effectiveness of the proposed planning strategy, using simulations and experiments on the 90 kg Hydraulically actuated Quadruped (HyQ) and the 21 kg Aliengo robots.
△ Less
Submitted 30 May, 2023; v1 submitted 16 November, 2020;
originally announced November 2020.
-
Towards Spiral Brick Column Building Robots
Authors:
Yaseer Ashraf,
Ahmed Abdallah,
Abdelhaleem Osman,
Victor Parque,
Samy Assal
Abstract:
Automation in construction has the potential to expand the technological landscape of labor intensive tasks, and bring gains in efficiency and productivity to sustain global competitiveness. In this paper we propose a task-level approach for assembly of spiral brick columns. Our extensive computational simulations using the generalized models of spiral brick columns show the feasibility, the effec…
▽ More
Automation in construction has the potential to expand the technological landscape of labor intensive tasks, and bring gains in efficiency and productivity to sustain global competitiveness. In this paper we propose a task-level approach for assembly of spiral brick columns. Our extensive computational simulations using the generalized models of spiral brick columns show the feasibility, the effectiveness and efficiency of our proposed approach. Our results offer the potential to use robots in automated construction of spiral brick columns with utmost efficiency.
△ Less
Submitted 6 November, 2020;
originally announced November 2020.
-
Training Transformers for Information Security Tasks: A Case Study on Malicious URL Prediction
Authors:
Ethan M. Rudd,
Ahmed Abdallah
Abstract:
Machine Learning (ML) for information security (InfoSec) utilizes distinct data types and formats which require different treatments during optimization/training on raw data. In this paper, we implement a malicious/benign URL predictor based on a transformer architecture that is trained from scratch. We show that in contrast to conventional natural language processing (NLP) transformers, this mode…
▽ More
Machine Learning (ML) for information security (InfoSec) utilizes distinct data types and formats which require different treatments during optimization/training on raw data. In this paper, we implement a malicious/benign URL predictor based on a transformer architecture that is trained from scratch. We show that in contrast to conventional natural language processing (NLP) transformers, this model requires a different training approach to work well. Specifically, we show that 1) pre-training on a massive corpus of unlabeled URL data for an auto-regressive task does not readily transfer to malicious/benign prediction but 2) that using an auxiliary auto-regressive loss improves performance when training from scratch. We introduce a method for mixed objective optimization, which dynamically balances contributions from both loss terms so that neither one of them dominates. We show that this method yields performance comparable to that of several top-performing benchmark classifiers.
△ Less
Submitted 5 November, 2020;
originally announced November 2020.
-
Neural Network-Based Ranging with LTE Channel Impulse Response for Localization in Indoor Environments
Authors:
Halim Lee,
Ali A. Abdallah,
Jongmin Park,
Jiwon Seo,
Zaher M. Kassas
Abstract:
A neural network (NN)-based approach for indoor localization via cellular long-term evolution (LTE) signals is proposed. The approach estimates, from the channel impulse response (CIR), the range between an LTE eNodeB and a receiver. A software-defined radio (SDR) extracts the CIR, which is fed to a long short-term memory model (LSTM) recurrent neural network (RNN) to estimate the range. Experimen…
▽ More
A neural network (NN)-based approach for indoor localization via cellular long-term evolution (LTE) signals is proposed. The approach estimates, from the channel impulse response (CIR), the range between an LTE eNodeB and a receiver. A software-defined radio (SDR) extracts the CIR, which is fed to a long short-term memory model (LSTM) recurrent neural network (RNN) to estimate the range. Experimental results are presented comparing the proposed approach against a baseline RNN without LSTM. The results show a receiver navigating for 100 m in an indoor environment, while receiving signals from one LTE eNodeB. The ranging root-mean squared error (RMSE) and ranging maximum error along the receiver's trajectory were reduced from 13.11 m and 55.68 m, respectively, in the baseline RNN to 9.02 m and 27.40 m, respectively, with the proposed RNN-LSTM.
△ Less
Submitted 24 September, 2020;
originally announced September 2020.
-
Attention-based Fully Gated CNN-BGRU for Russian Handwritten Text
Authors:
Abdelrahman Abdallah,
Mohamed Hamada,
Daniyar Nurseitov
Abstract:
This research approaches the task of handwritten text with attention encoder-decoder networks that are trained on Kazakh and Russian language. We developed a novel deep neural network model based on Fully Gated CNN, supported by Multiple bidirectional GRU and Attention mechanisms to manipulate sophisticated features that achieve 0.045 Character Error Rate (CER), 0.192 Word Error Rate (WER) and 0.2…
▽ More
This research approaches the task of handwritten text with attention encoder-decoder networks that are trained on Kazakh and Russian language. We developed a novel deep neural network model based on Fully Gated CNN, supported by Multiple bidirectional GRU and Attention mechanisms to manipulate sophisticated features that achieve 0.045 Character Error Rate (CER), 0.192 Word Error Rate (WER) and 0.253 Sequence Error Rate (SER) for the first test dataset and 0.064 CER, 0.24 WER and 0.361 SER for the second test dataset. Also, we propose fully gated layers by taking the advantage of multiple the output feature from Tahn and input feature, this proposed work achieves better results and We experimented with our model on the Handwritten Kazakh & Russian Database (HKR). Our research is the first work on the HKR dataset and demonstrates state-of-the-art results to most of the other existing models.
△ Less
Submitted 20 August, 2020; v1 submitted 12 August, 2020;
originally announced August 2020.
-
HKR For Handwritten Kazakh & Russian Database
Authors:
Daniyar Nurseitov,
Kairat Bostanbekov,
Daniyar Kurmankhojayev,
Anel Alimova,
Abdelrahman Abdallah
Abstract:
In this paper, we present a new Russian and Kazakh database (with about 95% of Russian and 5% of Kazakh words/sentences respectively) for offline handwriting recognition. A few pre-processing and segmentation procedures have been developed together with the database. The database is written in Cyrillic and shares the same 33 characters. Besides these characters, the Kazakh alphabet also contains 9…
▽ More
In this paper, we present a new Russian and Kazakh database (with about 95% of Russian and 5% of Kazakh words/sentences respectively) for offline handwriting recognition. A few pre-processing and segmentation procedures have been developed together with the database. The database is written in Cyrillic and shares the same 33 characters. Besides these characters, the Kazakh alphabet also contains 9 additional specific characters. This dataset is a collection of forms. The sources of all the forms in the datasets were generated by \LaTeX which subsequently was filled out by persons with their handwriting. The database consists of more than 1400 filled forms. There are approximately 63000 sentences, more than 715699 symbols produced by approximately 200 different writers. It can serve researchers in the field of handwriting recognition tasks by using deep and machine learning.
△ Less
Submitted 8 July, 2020; v1 submitted 7 July, 2020;
originally announced July 2020.
-
Automated Question Answer medical model based on Deep Learning Technology
Authors:
Abdelrahman Abdallah,
Mahmoud Kasem,
Mohamed Hamada,
Shaymaa Sdeek
Abstract:
Artificial intelligence can now provide more solutions for different problems, especially in the medical field. One of those problems the lack of answers to any given medical/health-related question. The Internet is full of forums that allow people to ask some specific questions and get great answers for them. Nevertheless, browsing these questions in order to locate one similar to your own, also…
▽ More
Artificial intelligence can now provide more solutions for different problems, especially in the medical field. One of those problems the lack of answers to any given medical/health-related question. The Internet is full of forums that allow people to ask some specific questions and get great answers for them. Nevertheless, browsing these questions in order to locate one similar to your own, also finding a satisfactory answer is a difficult and time-consuming task. This research will introduce a solution to this problem by automating the process of generating qualified answers to these questions and creating a kind of digital doctor. Furthermore, this research will train an end-to-end model using the framework of RNN and the encoder-decoder to generate sensible and useful answers to a small set of medical/health issues. The proposed model was trained and evaluated using data from various online services, such as WebMD, HealthTap, eHealthForums, and iCliniq.
△ Less
Submitted 20 May, 2020;
originally announced May 2020.
-
A low-overhead soft-hard fault-tolerant architecture, design and management scheme for reliable high-performance many-core 3D-NoC systems
Authors:
Khanh N Dang,
Michael Meyer,
Yuichi Okuyama,
Abderazek Ben Abdallah
Abstract:
The Network-on-Chip (NoC) paradigm has been proposed as a favorable solution to handle the strict communication requirements between the increasingly large number of cores on a single chip. However, NoC systems are exposed to the aggressive scaling down of transistors, low operating voltages, and high integration and power densities, making them vulnerable to permanent (hard) faults and transient…
▽ More
The Network-on-Chip (NoC) paradigm has been proposed as a favorable solution to handle the strict communication requirements between the increasingly large number of cores on a single chip. However, NoC systems are exposed to the aggressive scaling down of transistors, low operating voltages, and high integration and power densities, making them vulnerable to permanent (hard) faults and transient (soft) errors. A hard fault in a NoC can lead to external blocking, causing congestion across the whole network. A soft error is more challenging because of its silent data corruption, which leads to a large area of erroneous data due to error propagation, packet re-transmission, and deadlock. In this paper, we present the architecture and design of a comprehensive soft error and hard fault-tolerant 3D-NoC system, named 3D-Hard-Fault-Soft-Error-Tolerant-OASIS-NoC (3D-FETO). With the aid of efficient mechanisms and algorithms, 3D-FETO is capable of detecting and recovering from soft errors which occur in the routing pipeline stages and leverages reconfigurable components to handle permanent faults in links, input buffers, and crossbars. In-depth evaluation results show that the 3D-FETO system is able to work around different kinds of hard faults and soft errors, ensuring graceful performance degradation, while minimizing additional hardware complexity and remaining power efficient.
△ Less
Submitted 21 March, 2020;
originally announced March 2020.
-
An Efficient Software-Hardware Design Framework for Spiking Neural Network Systems
Authors:
Khanh N. Dang,
Abderazek Ben Abdallah
Abstract:
Spiking Neural Network (SNN) is the third generation of Neural Network (NN) mimicking the natural behavior of the brain. By processing based on binary input/output, SNNs offer lower complexity, higher density and lower power consumption. This work presents an efficient software-hardware design framework for develo** SNN systems in hardware. In addition, a design of low-cost neurosynaptic core is…
▽ More
Spiking Neural Network (SNN) is the third generation of Neural Network (NN) mimicking the natural behavior of the brain. By processing based on binary input/output, SNNs offer lower complexity, higher density and lower power consumption. This work presents an efficient software-hardware design framework for develo** SNN systems in hardware. In addition, a design of low-cost neurosynaptic core is presented based on packet-switching communication approach. The evaluation results show that the ANN to SNN conversion method with the size 784:1200:1200:10 performs 99% accuracy for MNIST while the unsupervised STDP archives 89% with the size 784:400 with recurrent connections. The design of 256-neurons and 65k synapses is also implemented in ASIC 45nm technology with an area cost of 0.205 $m m^2$.
△ Less
Submitted 22 March, 2020;
originally announced March 2020.
-
Reliability Assessment and Quantitative Evaluation of Soft-Error Resilient 3D Network-on-Chip Systems
Authors:
Khanh N Dang,
Michael Meyer,
Yuichi Okuyama,
Abderazek Ben Abdallah
Abstract:
Three-Dimensional Networks-on-Chips (3D-NoCs) have been proposed as an auspicious solution, merging the high parallelism of the Network-on-Chip (NoC) paradigm with the high-performance and low-power cost of 3D-ICs. However, as technology scales down, the reliability issues are becoming more crucial, especially for complex 3D-NoC which provides the communication requirements of multi and many-core…
▽ More
Three-Dimensional Networks-on-Chips (3D-NoCs) have been proposed as an auspicious solution, merging the high parallelism of the Network-on-Chip (NoC) paradigm with the high-performance and low-power cost of 3D-ICs. However, as technology scales down, the reliability issues are becoming more crucial, especially for complex 3D-NoC which provides the communication requirements of multi and many-core systems-on-chip. Reliability assessment is prominent for early stages of the manufacturing process to prevent costly redesigns of a target system. In this paper, we present an accurate reliability assessment and quantitative evaluation of a soft-error resilient 3D-NoC based on a soft-error resilient mechanism. The system can recover from transient errors occurring in different pipeline stages of the router. Based on this analysis, the effects of failures in the network's principal components are determined.
△ Less
Submitted 21 March, 2020;
originally announced March 2020.
-
Soft-Error and Hard-fault Tolerant Architecture and Routing Algorithm for Reliable 3D-NoC Systems
Authors:
Khanh N. Dang,
Yuichi Okuyama,
Abderazek Ben Abdallah
Abstract:
Network-on-Chip (NoC) paradigm has been proposed as an auspicious solution to handle the strict communication requirements between the increasingly large number of cores on a single multi and many-core chips. However, NoC systems are exposed to a variety of manufacturing, design and energetic particles factors making them vulnerable to permanent (hard) faults and transient (soft) errors. In this p…
▽ More
Network-on-Chip (NoC) paradigm has been proposed as an auspicious solution to handle the strict communication requirements between the increasingly large number of cores on a single multi and many-core chips. However, NoC systems are exposed to a variety of manufacturing, design and energetic particles factors making them vulnerable to permanent (hard) faults and transient (soft) errors. In this paper, we present a comprehensive soft error and hard fault tolerant 3D-NoC architecture, named 3D-Hard-Fault-Soft-Error-Tolerant-OASIS-NoC (3D-FETO). With the aid of adaptive algorithms, 3D-FETO is capable of detecting and recovering from soft errors occurring in the routing pipeline stages and is leveraging on reconfigurable components to handle permanent faults occurrence in links, input buffers, and crossbar. In-depth evaluation results show that the 3D-FETO system is able to work around different kinds of hard faults and soft errors while ensuring graceful performance degradation, minimizing the additional hardware complexity and remaining power-efficient.
△ Less
Submitted 21 March, 2020;
originally announced March 2020.
-
Report on power, thermal and reliability prediction for 3D Networks-on-Chip
Authors:
Khanh N. Dang,
Akram Ben Ahmed,
Abderazek Ben Abdallah,
Xuan-Tu Tran
Abstract:
By combining Three Dimensional Integrated Circuits with the Network-on-Chip infrastructure to obtain 3D Networks-on-Chip (3D-NoCs), the new on-chip communication paradigm brings several advantages on lower power, smaller footprint and lower latency. However, thermal dissipation is one of the most critical challenges for 3D-ICs where the heat cannot easily transfer through several layers of silicon…
▽ More
By combining Three Dimensional Integrated Circuits with the Network-on-Chip infrastructure to obtain 3D Networks-on-Chip (3D-NoCs), the new on-chip communication paradigm brings several advantages on lower power, smaller footprint and lower latency. However, thermal dissipation is one of the most critical challenges for 3D-ICs where the heat cannot easily transfer through several layers of silicon. Consequently, the high-temperature area also confronts the reliability threat as the Mean Time to Failure (MTTF) decreases exponentially with the operating temperature. Apparently, 3D-NoCs must tackle this fundamental problem in order to be widely used. Therefore, in this work, we investigate the thermal distribution and reliability prediction of 3D-NoCs. We first present a new method to help simulate the temperature (both steady and transient) using traffics value from realistic and synthetic benchmarks and the power consumption from standard VLSI design flow. Then, based on the proposed method, we further predict the relative reliability between different parts of the network. Experimental results show that the method has an extremely fast execution time in comparison to the acceleration lifetime test. Furthermore, we compare the thermal behavior and reliability between Monolithic design and TSV-based TSV. We also explorer the ability to implement the thermal via a mechanism to help reduce the operating temperature.
△ Less
Submitted 19 March, 2020;
originally announced March 2020.
-
Efficient Angle-Domain Processing for FDD-based Cell-free Massive MIMO Systems
Authors:
Asmaa Abdallah,
Mohammad M. Mansour
Abstract:
Cell-free massive MIMO communications is an emerging network technology for 5G wireless communications wherein distributed multi-antenna access points (APs) serve many users simultaneously. Most prior work on cell-free massive MIMO systems assume time-division duplexing mode, although frequency-division duplexing (FDD) systems dominate current wireless standards. The key challenges in FDD massive…
▽ More
Cell-free massive MIMO communications is an emerging network technology for 5G wireless communications wherein distributed multi-antenna access points (APs) serve many users simultaneously. Most prior work on cell-free massive MIMO systems assume time-division duplexing mode, although frequency-division duplexing (FDD) systems dominate current wireless standards. The key challenges in FDD massive MIMO systems are channel-state information (CSI) acquisition and feedback overhead. To address these challenges, we exploit the so-called angle reciprocity of multipath components in the uplink and downlink, so that the required CSI acquisition overhead scales only with the number of served users, and not the number of AP antennas nor APs. We propose a low complexity multipath component estimation technique and present linear angle-of-arrival (AoA)-based beamforming/combining schemes for FDD-based cell-free massive MIMO systems. We analyze the performance of these schemes by deriving closed-form expressions for the mean-square-error of the estimated multipath components, as well as expressions for the uplink and downlink spectral efficiency. Using semi-definite programming, we solve a max-min power allocation problem that maximizes the minimum user rate under per-user power constraints. Furthermore, we present a user-centric (UC) AP selection scheme in which each user chooses a subset of APs to improve the overall energy efficiency of the system. Simulation results demonstrate that the proposed multipath component estimation technique outperforms conventional subspace-based and gradient-descent based techniques. We also show that the proposed beamforming and combining techniques along with the proposed power control scheme substantially enhance the spectral and energy efficiencies with an adequate number of antennas at the APs.
△ Less
Submitted 21 January, 2020;
originally announced January 2020.
-
Improved Self-cleaning Properties of an Efficient and Easy to Scale up TiO2 Thin Films Prepared by Adsorptive Self-Assembly
Authors:
Rima J. Isaifan,
Ayman Samara,
Wafa Suwaileh,
Daniel Johnson,
Wubulikasimu Yiming,
Amir A. Abdallah,
Brahim Aïssa
Abstract:
Transparent titania coatings have self-cleaning and anti-reflection properties (AR) that are of great importance to minimize soiling effect on photovoltaic modules. In this work, TiO2 nanocolloids prepared by polyol reduction method were successfully used as coating thin films onto borosilicate glass substrates via adsorptive self-assembly process. The nanocolloids were characterized by transmissi…
▽ More
Transparent titania coatings have self-cleaning and anti-reflection properties (AR) that are of great importance to minimize soiling effect on photovoltaic modules. In this work, TiO2 nanocolloids prepared by polyol reduction method were successfully used as coating thin films onto borosilicate glass substrates via adsorptive self-assembly process. The nanocolloids were characterized by transmission electron microscopy and x-ray diffraction. The average particle size was around 2.6 nm. The films which have an average thickness of 76.2 nm and refractive index of 1.51 showed distinctive anti soiling properties under desert environment. The film surface topography, uniformity, wettability, thickness and refractive index were characterized using x-ray diffraction, atomic force microscopy, scanning electron microscopy, water contact angle measurements and ellipsometry. The self-cleaning properties were investigated by optical microscopy and UV-Vis spectroscopy. The optical images show 56% reduction of dust deposition rate over the coated surfaces compared with bare glass substrates after 7 days of soiling. The transmission optical spectra of these films collected at normal incidence angle show high anti-reflection properties with the coated substrates having transmission loss of less than 6% compared to bare clean glass.
△ Less
Submitted 10 April, 2019;
originally announced July 2019.
-
Power Control and Channel Allocation for D2D Underlaid Cellular Networks
Authors:
Asmaa Abdallah,
Mohammad M. Mansour,
Ali Chehab
Abstract:
Device-to-Device (D2D) communications underlaying cellular networks is a viable network technology that can potentially increase spectral utilization and improve power efficiency for proximitybased wireless applications and services. However, a major challenge in such deployment scenarios is the interference caused by D2D links when sharing the same resources with cellular users. In this work, we…
▽ More
Device-to-Device (D2D) communications underlaying cellular networks is a viable network technology that can potentially increase spectral utilization and improve power efficiency for proximitybased wireless applications and services. However, a major challenge in such deployment scenarios is the interference caused by D2D links when sharing the same resources with cellular users. In this work, we propose a channel allocation (CA) scheme together with a set of three power control (PC) schemes to mitigate interference in a D2D underlaid cellular system modeled as a random network using the mathematical tool of stochastic geometry. The novel aspect of the proposed CA scheme is that it enables D2D links to share resources with multiple cellular users as opposed to one as previously considered in the literature. Moreover, the accompanying distributed PC schemes further manage interference during link establishment and maintenance. The first two PC schemes compensate for large-scale path-loss effects and maximize the D2D sum rate by employing distance-dependent pathloss parameters of the D2D link and the base station, including an error estimation margin. The third scheme is an adaptive PC scheme based on a variable target signal-to-interference-plus-noise ratio, which limits the interference caused by D2D users and provides sufficient coverage probability for cellular users. Closed-form expressions for the coverage probability of cellular links, D2D links, and sum rate of D2D links are derived in terms of the allocated power, density of D2D links, and path-loss exponent. The impact of these key system parameters on network performance is analyzed and compared with previous work. Simulation results demonstrate an enhancement in cellular and D2D coverage probabilities, and an increase in spectral and power efficiency.
△ Less
Submitted 2 March, 2018;
originally announced March 2018.
-
Thermocompression Bonding Technology for Multilayer Superconducting Quantum Circuits
Authors:
C. R. H. McRae,
J. H. Béjanin,
Z. Pagel,
A. O. Abdallah,
T. G. McConkey,
C. T. Earnest,
J. R. Rinehart,
M. Mariantoni
Abstract:
Extensible quantum computing architectures require a large array of quantum devices operating with low error rates. A quantum processor based on superconducting quantum bits can be scaled up by stacking microchips that each perform different computational functions. In this article, we experimentally demonstrate a thermocompression bonding technology that utilizes indium films as a welding agent t…
▽ More
Extensible quantum computing architectures require a large array of quantum devices operating with low error rates. A quantum processor based on superconducting quantum bits can be scaled up by stacking microchips that each perform different computational functions. In this article, we experimentally demonstrate a thermocompression bonding technology that utilizes indium films as a welding agent to attach pairs of lithographically-patterned chips. We perform chip-to-chip indium bonding in vacuum at $190^{\circ}C$ with indium film thicknesses of $150 nm$. We characterize the dc and microwave performance of bonded devices at room and cryogenic temperatures. At $10 mK$, we find a dc bond resistance of $515 nΩmm^2$. Additionally, we show minimal microwave reflections and good transmission up to $6.8 GHz$ in a tunnel-capped, bonded device as compared to a similar uncapped device. As a proof of concept, we fabricate and measure a set of tunnel-capped superconducting resonators, demonstrating that our bonding technology can be used in quantum computing applications.
△ Less
Submitted 5 May, 2017;
originally announced May 2017.
-
Public key cryptography based on some extensions of group
Authors:
Ali Abdallah
Abstract:
Bogopolski, Martino and Ventura in [BMV10] introduced a general criteria to construct groups extensions with unsolvable conjugacy problem using short exact sequences. We prove that such extensions have always solvable word problem. This makes the proposed construction a systematic way to obtain finitely presented groups with solvable word problem and unsolvable conjugacy problem. It is believed th…
▽ More
Bogopolski, Martino and Ventura in [BMV10] introduced a general criteria to construct groups extensions with unsolvable conjugacy problem using short exact sequences. We prove that such extensions have always solvable word problem. This makes the proposed construction a systematic way to obtain finitely presented groups with solvable word problem and unsolvable conjugacy problem. It is believed that such groups are important in cryptography. For this, and as an example, we provide an explicit construction of an extension of Thompson group F and we propose it as a base for a public key cryptography protocol.
△ Less
Submitted 15 April, 2016;
originally announced April 2016.
-
Survey on Feature Selection
Authors:
Tarek Amr Abdallah,
Beatriz de La Iglesia
Abstract:
Feature selection plays an important role in the data mining process. It is needed to deal with the excessive number of features, which can become a computational burden on the learning algorithms. It is also necessary, even when computational resources are not scarce, since it improves the accuracy of the machine learning tasks, as we will see in the upcoming sections. In this review, we discuss…
▽ More
Feature selection plays an important role in the data mining process. It is needed to deal with the excessive number of features, which can become a computational burden on the learning algorithms. It is also necessary, even when computational resources are not scarce, since it improves the accuracy of the machine learning tasks, as we will see in the upcoming sections. In this review, we discuss the different feature selection approaches, and the relation between them and the various machine learning algorithms.
△ Less
Submitted 10 October, 2015;
originally announced October 2015.
-
Design and Performance Study of Smart Antenna Systems for WIMAX Applications
Authors:
Ayman Abdallah,
Seifedine Kadry,
Chibli Joumaa
Abstract:
In this paper we propose an approach that uses homodyne receivers to design smart antenna systems. The receivers functions are to detect angles of arrivals of seven incoming RF signals using MUSIC or ESPRIT algorithms. The characteristics of each algorithm are critical for the systems precision as well as receivers types. Results are deduced from the simulation of each system, using the Advanced D…
▽ More
In this paper we propose an approach that uses homodyne receivers to design smart antenna systems. The receivers functions are to detect angles of arrivals of seven incoming RF signals using MUSIC or ESPRIT algorithms. The characteristics of each algorithm are critical for the systems precision as well as receivers types. Results are deduced from the simulation of each system, using the Advanced Design System (ADS) and MATLAB. These are compared to results deduced from real systems in the WIMAX (3.5GHz) domains.
△ Less
Submitted 25 December, 2012;
originally announced December 2012.
-
On The Optimization of Dijkstras Algorithm
Authors:
Seifedine Kadry,
Ayman Abdallah,
Chibli Joumaa
Abstract:
In this paper, we propose some amendment on Dijkstras algorithm in order to optimize it by reducing the number of iterations. The main idea is to solve the problem where more than one node satisfies the condition of the second step in the traditional Dijkstras algorithm. After application of the proposed modifications, the maximum number of iterations of Dijkstras algorithm is less than the number…
▽ More
In this paper, we propose some amendment on Dijkstras algorithm in order to optimize it by reducing the number of iterations. The main idea is to solve the problem where more than one node satisfies the condition of the second step in the traditional Dijkstras algorithm. After application of the proposed modifications, the maximum number of iterations of Dijkstras algorithm is less than the number of the graphs nodes.
△ Less
Submitted 25 December, 2012;
originally announced December 2012.
-
Predictive Information Rate in Discrete-time Gaussian Processes
Authors:
Samer A. Abdallah,
Mark D. Plumbley
Abstract:
We derive expressions for the predicitive information rate (PIR) for the class of autoregressive Gaussian processes AR(N), both in terms of the prediction coefficients and in terms of the power spectral density. The latter result suggests a duality between the PIR and the multi-information rate for processes with mutually inverse power spectra (i.e. with poles and zeros of the transfer function ex…
▽ More
We derive expressions for the predicitive information rate (PIR) for the class of autoregressive Gaussian processes AR(N), both in terms of the prediction coefficients and in terms of the power spectral density. The latter result suggests a duality between the PIR and the multi-information rate for processes with mutually inverse power spectra (i.e. with poles and zeros of the transfer function exchanged). We investigate the behaviour of the PIR in relation to the multi-information rate for some simple examples, which suggest, somewhat counter-intuitively, that the PIR is maximised for very `smooth' AR processes whose power spectra have multiple poles at zero frequency. We also obtain results for moving average Gaussian processes which are consistent with the duality conjectured earlier. One consequence of this is that the PIR is unbounded for MA(N) processes.
△ Less
Submitted 14 August, 2012; v1 submitted 1 June, 2012;
originally announced June 2012.
-
Constraint Propagation as Information Maximization
Authors:
A. Nait Abdallah,
M. H. van Emden
Abstract:
This paper draws on diverse areas of computer science to develop a unified view of computation:
(1) Optimization in operations research, where a numerical objective function is maximized under constraints, is generalized from the numerical total order to a non-numerical partial order that can be interpreted in terms of information. (2) Relations are generalized so that there are relations of whi…
▽ More
This paper draws on diverse areas of computer science to develop a unified view of computation:
(1) Optimization in operations research, where a numerical objective function is maximized under constraints, is generalized from the numerical total order to a non-numerical partial order that can be interpreted in terms of information. (2) Relations are generalized so that there are relations of which the constituent tuples have numerical indexes, whereas in other relations these indexes are variables. The distinction is essential in our definition of constraint satisfaction problems. (3) Constraint satisfaction problems are formulated in terms of semantics of conjunctions of atomic formulas of predicate logic. (4) Approximation structures, which are available for several important domains, are applied to solutions of constraint satisfaction problems.
As application we treat constraint satisfaction problems over reals. These cover a large part of numerical analysis, most significantly nonlinear equations and inequalities. The chaotic algorithm analyzed in the paper combines the efficiency of floating-point computation with the correctness guarantees of arising from our logico-mathematical model of constraint-satisfaction problems.
△ Less
Submitted 7 February, 2013; v1 submitted 25 January, 2012;
originally announced January 2012.
-
A measure of statistical complexity based on predictive information
Authors:
Samer A. Abdallah,
Mark D. Plumbley
Abstract:
We introduce an information theoretic measure of statistical structure, called 'binding information', for sets of random variables, and compare it with several previously proposed measures including excess entropy, Bialek et al.'s predictive information, and the multi-information. We derive some of the properties of the binding information, particularly in relation to the multi-information, and sh…
▽ More
We introduce an information theoretic measure of statistical structure, called 'binding information', for sets of random variables, and compare it with several previously proposed measures including excess entropy, Bialek et al.'s predictive information, and the multi-information. We derive some of the properties of the binding information, particularly in relation to the multi-information, and show that, for finite sets of binary random variables, the processes which maximises binding information are the 'parity' processes. Finally we discuss some of the implications this has for the use of the binding information as a measure of complexity.
△ Less
Submitted 8 December, 2010;
originally announced December 2010.
-
Formal Modelling of a Usable Identity Management Solution for Virtual Organisations
Authors:
Ali N. Haidar,
P. V. Coveney,
Ali E. Abdallah,
P. Y. A Ryan,
B. Beckles,
J. M. Brooke,
M . A. S. Jones
Abstract:
This paper attempts to accurately model security requirements for computational grid environments with particular focus on authentication. We introduce the Audited Credential Delegation (ACD) architecture as a solution to some of the virtual organisations identity management usability problems. The approach uses two complementary models: one is state based, described in Z notation, and the other…
▽ More
This paper attempts to accurately model security requirements for computational grid environments with particular focus on authentication. We introduce the Audited Credential Delegation (ACD) architecture as a solution to some of the virtual organisations identity management usability problems. The approach uses two complementary models: one is state based, described in Z notation, and the other is event-based, expressed in the Process Algebra of Hoare's Communicating Sequential Processes (CSP). The former will be used to capture the state of the WS and to model back-end operations on it whereas the latter will be used to model behavior, and in particular, front-end interactions and communications. The modelling helps to clearly and precisely understand functional and security requirements and provide a basis for verifying that the system meets its intended requirements.
△ Less
Submitted 27 January, 2010;
originally announced January 2010.
-
One million year old groundwater in the Sahara revealed by krypton-81 and chlorine-36
Authors:
N. C. Sturchio,
X. Du,
R. Purtschert,
B. E. Lehmann,
M. Sultan,
L. J. Patterson,
Z. -T. Lu,
P. Mueller,
T. Bigler,
K. Bailey,
T. P. O'Connor,
L. Young,
R. Lorenzo,
R. Becker,
Z. El Alfy,
B. El Kaliouby,
Y. Dawood,
A. M. A. Abdallah
Abstract:
Measurements of 81Kr/Kr in deep groundwater from the Nubian Aquifer (Egypt) were performed by a new laser-based atom-counting method. 81Kr ages range from \~2x10^5 to ~1x10^6 yr, correlate with 36Cl/Cl ratios, and are consistent with lateral flow of groundwater from a recharge area near the Uweinat Uplift in SW Egypt. Low delta-2H values of the 81Kr-dated groundwater reveal a recurrent Atlantic…
▽ More
Measurements of 81Kr/Kr in deep groundwater from the Nubian Aquifer (Egypt) were performed by a new laser-based atom-counting method. 81Kr ages range from \~2x10^5 to ~1x10^6 yr, correlate with 36Cl/Cl ratios, and are consistent with lateral flow of groundwater from a recharge area near the Uweinat Uplift in SW Egypt. Low delta-2H values of the 81Kr-dated groundwater reveal a recurrent Atlantic moisture source during Pleistocene pluvial periods. These results indicate that the 81Kr method for dating old groundwater is robust and such measurements can now be applied to a wide range of hydrologic problems.
△ Less
Submitted 18 February, 2004;
originally announced February 2004.