-
Dynamics and solar wind control of the recovery of strong geomagnetic storms
Authors:
O. Ahmed,
B. Badruddin,
M. Derouich
Abstract:
In this work, we studied the characteristics and dynamical changes during the recovery time of moderate and strong geomagnetic storms (Dst $<-50$ nT). Investigating 57 storms triggered by CMEs/CIRs, we focused on the solar wind's influence on their decay phases. Selected storms were classified into distinct groups based on their recovery characteristics. Using superposed epoch analysis and best fi…
▽ More
In this work, we studied the characteristics and dynamical changes during the recovery time of moderate and strong geomagnetic storms (Dst $<-50$ nT). Investigating 57 storms triggered by CMEs/CIRs, we focused on the solar wind's influence on their decay phases. Selected storms were classified into distinct groups based on their recovery characteristics. Using superposed epoch analysis and best fit methods, we scrutinized several interplanetary solar wind plasma and field parameters/functions. The analysis included single, dual, and multiple interplanetary plasma and field parameters/functions. We determined the most representative characteristic time for the storm's recovery profile by fitting an exponential curve. A correlation analysis between Dst and solar wind parameters/functions isolated a coupling function ($ρ^{1/2}$Ey) best describing the decay rate of the ring current. This shows that the electric field term (Ey) coupled with a viscous term ($ρ^{1/2}$) plays a pivotal role in determining the recovery rate of geomagnetic storms. Additionally, we modeled the complex patterns of Dst recovery in relation to solar wind parameters/functions using a second-order polynomial. During the recovery phase, a dynamic correlation between Dst and solar wind parameters/functions was revealed. The three-parameter solar wind-magnetosphere electrodynamical coupling function, combining the viscous term ($ρ^{1/2}$) and the electric field-related function (v$^{4/3}$B) ($ρ^{1/2}$v$^{4/3}$B), significantly impacts the recovery phase of geomagnetic disturbances. Our investigation extended to the relationship between main and recovery phase durations, providing valuable insights into the solar wind's control over the decay of geomagnetic disturbances. These findings advance our comprehension of the complex relationship between solar wind dynamics and the evolution of geomagnetic disturbances.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Attention as an RNN
Authors:
Leo Feng,
Frederick Tung,
Hossein Hajimirsadeghi,
Mohamed Osama Ahmed,
Yoshua Bengio,
Greg Mori
Abstract:
The advent of Transformers marked a significant breakthrough in sequence modelling, providing a highly performant architecture capable of leveraging GPU parallelism. However, Transformers are computationally expensive at inference time, limiting their applications, particularly in low-resource settings (e.g., mobile and embedded devices). Addressing this, we (1) begin by showing that attention can…
▽ More
The advent of Transformers marked a significant breakthrough in sequence modelling, providing a highly performant architecture capable of leveraging GPU parallelism. However, Transformers are computationally expensive at inference time, limiting their applications, particularly in low-resource settings (e.g., mobile and embedded devices). Addressing this, we (1) begin by showing that attention can be viewed as a special Recurrent Neural Network (RNN) with the ability to compute its \textit{many-to-one} RNN output efficiently. We then (2) show that popular attention-based models such as Transformers can be viewed as RNN variants. However, unlike traditional RNNs (e.g., LSTMs), these models cannot be updated efficiently with new tokens, an important property in sequence modelling. Tackling this, we (3) introduce a new efficient method of computing attention's \textit{many-to-many} RNN output based on the parallel prefix scan algorithm. Building on the new attention formulation, we (4) introduce \textbf{Aaren}, an attention-based module that can not only (i) be trained in parallel (like Transformers) but also (ii) be updated efficiently with new tokens, requiring only constant memory for inferences (like traditional RNNs). Empirically, we show Aarens achieve comparable performance to Transformers on $38$ datasets spread across four popular sequential problem settings: reinforcement learning, event forecasting, time series classification, and time series forecasting tasks while being more time and memory-efficient.
△ Less
Submitted 28 May, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
Prediction of chaotic dynamics and extreme events: A recurrence-free quantum reservoir computing approach
Authors:
Osama Ahmed,
Felix Tennie,
Luca Magri
Abstract:
In chaotic dynamical systems, extreme events manifest in time series as unpredictable large-amplitude peaks. Although deterministic, extreme events appear seemingly randomly, which makes their forecasting difficult. By learning the dynamics from observables (data), reservoir computers can time-accurately predict extreme events and chaotic dynamics, but they may require many degrees of freedom (lar…
▽ More
In chaotic dynamical systems, extreme events manifest in time series as unpredictable large-amplitude peaks. Although deterministic, extreme events appear seemingly randomly, which makes their forecasting difficult. By learning the dynamics from observables (data), reservoir computers can time-accurately predict extreme events and chaotic dynamics, but they may require many degrees of freedom (large reservoirs). In this paper, by exploiting quantum-computer ansätze and entanglement, we design reservoir computers with compact reservoirs and accurate prediction capabilities. First, we propose the recurrence-free quantum reservoir computer (RF-QRC) architecture. By develo** ad-hoc quantum feature maps and removing recurrent connections, the RF-QRC has quantum circuits with small depths. This allows the RF-QRC to scale well with higher-dimensional chaotic systems, which makes it suitable for hardware implementation. Second, we forecast the temporal chaotic dynamics and their long-term statistics of low- and higher-dimensional dynamical systems. We find that RF-QRC requires smaller reservoirs than classical reservoir computers. Third, we apply the RF-QRC to the time prediction of extreme events in a model of a turbulent shear flow with turbulent bursts. We find that the RF-QRC has a longer predictability than the classical reservoir computer. The results and analyses indicate that quantum-computer ansätze offer nonlinear expressivity and computational scalability, which are useful for forecasting chaotic dynamics and extreme events. This work opens new opportunities for using quantum machine learning on near-term quantum computers.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Taxonomic classification with maximal exact matches in KATKA kernels and minimizer digests
Authors:
Dominika Draesslerová,
Omar Ahmed,
Travis Gagie,
Jan Holub,
Ben Langmead,
Giovanni Manzini,
Gonzalo Navarro
Abstract:
For taxonomic classification, we are asked to index the genomes in a phylogenetic tree such that later, given a DNA read, we can quickly choose a small subtree likely to contain the genome from which that read was drawn. Although popular classifiers such as Kraken use $k$-mers, recent research indicates that using maximal exact matches (MEMs) can lead to better classifications. For example, we can…
▽ More
For taxonomic classification, we are asked to index the genomes in a phylogenetic tree such that later, given a DNA read, we can quickly choose a small subtree likely to contain the genome from which that read was drawn. Although popular classifiers such as Kraken use $k$-mers, recent research indicates that using maximal exact matches (MEMs) can lead to better classifications. For example, we can build an augmented FM-index over the the genomes in the tree concatenated in left-to-right order; for each MEM in a read, find the interval in the suffix array containing the starting positions of that MEM's occurrences in those genomes; find the minimum and maximum values stored in that interval; take the lowest common ancestor (LCA) of the genomes containing the characters at those positions. This solution is practical, however, only when the total size of the genomes in the tree is fairly small. In this paper we consider applying the same solution to three lossily compressed representations of the genomes' concatenation: a KATKA kernel, which discards characters that are not in the first or last occurrence of any $k_{\max}$-tuple, for a parameter $k_{\max}$; a minimizer digest; a KATKA kernel of a minimizer digest. With a test dataset and these three representations of it, simulated reads and various parameter settings, we checked how many reads' longest MEMs occurred only in the sequences from which those reads were generated ("true positive" reads). For some parameter settings we achieved significant compression while only slightly decreasing the true-positive rate.
△ Less
Submitted 4 April, 2024; v1 submitted 10 February, 2024;
originally announced February 2024.
-
Characteristics and development of the main phase disturbance in geomagnetic storms (Dst $\le$ -50 nT)
Authors:
Osman M. Ahmed,
Badruudin Zaheer Ahmad,
Moncef Derouich
Abstract:
We present geomagnetic storms (GSs) selected from three solar cycles, spanning the years 1995 to 2022. We studied the development of the main phase of storms within disturbance storm time (Dst) amplitudes ranging from Dst =-64 nT to Dst=- 422 nT. In order to determine the solar wind (SW) parameters that mainly influence the main phase development of a GS, which can best describe the SW-magnetosphe…
▽ More
We present geomagnetic storms (GSs) selected from three solar cycles, spanning the years 1995 to 2022. We studied the development of the main phase of storms within disturbance storm time (Dst) amplitudes ranging from Dst =-64 nT to Dst=- 422 nT. In order to determine the solar wind (SW) parameters that mainly influence the main phase development of a GS, which can best describe the SW-magnetosphere coupling, we divided our selected GSs into four groups based on main phase duration. Superposed epoch analysis was performed on the selected geomagnetic indices, SW plasma and field parameters, and their derivatives separately for each group. To that end, the dynamics of GS main phase development is mainly guided by interplanetary driver magnetic field southward component, (-Bz). It has been determined that there is a temporal difference between the peak values of Bz and Dst. As a result, Dst is delayed from Bz by 1-4 hours, which is crucial for space weather forecasting. The peak of Dst has a direct relationship with the amplitude of storm sudden commencement (SSC) and an inverse relationship with the duration of SSC. The inter-relationship between the peaks of the three indices (Dst, AE, and ap) during GS, is also obtained. Dst is found to be more closely related to ap than AE. To determine the best fit SW parameter to the geomagnetic activity indices, we used a linear correlation between the peak values of individual geomagnetic indices and SW plasma and field parameters and their derivatives. An electric field related function involving speed and IMF (v$^{4/3}$Bz) when coupled with a viscous term ($ρ^{1/2}$) correlates very well with the intensity of the GS (Dst$ _{min} $ or $ Δ$Dst) and the magnitude of (ap$_{max}$) and (AE$_{max}$) during storms. However, a related function (v$^{4/3}$B$ρ^{1/2}$) represents slightly better the peak of AE$_{max}$ during the storms.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Adaptive Prognostic Malfunction Based Processor for Autonomous Landing Guidance Assistance System Using FPGA
Authors:
Hossam O. Ahmed,
David Wyatt
Abstract:
The demand for more developed and agile urban taxi drones is increasing rapidly nowadays to sustain crowded cities and their traffic issues. The critical factor for spreading such technology could be related to the safety criteria that must be considered. One of the most critical safety aspects for such vertical and/or Short Take-Off and Landing (V/STOL) drones is related to safety during the land…
▽ More
The demand for more developed and agile urban taxi drones is increasing rapidly nowadays to sustain crowded cities and their traffic issues. The critical factor for spreading such technology could be related to the safety criteria that must be considered. One of the most critical safety aspects for such vertical and/or Short Take-Off and Landing (V/STOL) drones is related to safety during the landing stage, in which most of the recent flight accidents have occurred. This paper focused on solving this issue by proposing decentralized processing cores that could improve the landing failure rate by depending on a Fuzzy Logic System (FLS) and additional Digital Signal Processing (DSP) elements. Also, the proposed system will enhance the safety factor during the landing stages by adding a self-awareness feature in case a certain sensor malfunction occurs using the proposed Adaptive Prognostic Malfunction Unit (APMU). This proposed coarse-grained Autonomous Landing Guidance Assistance System (ALGAS4) processing architecture has been optimized using different optimization techniques. The ALGAS4 architecture has been designed completely using VHDL, and the targeted FPGA was the INTEL Cyclone V 5CGXFC9D6F27C7 chip. According to the synthesis findings of the INTEL Quartus Prime software, the maximum working frequency of the ALGAS4 system is 278.24 MHz. In addition, the proposed ALGAS4 system could maintain a maximum computing performance of approximately 74.85 GOPS while using just 166.56 mW for dynamic and I/O power dissipation.
△ Less
Submitted 13 January, 2024;
originally announced January 2024.
-
AdaFlood: Adaptive Flood Regularization
Authors:
Wonho Bae,
Yi Ren,
Mohamad Osama Ahmed,
Frederick Tung,
Danica J. Sutherland,
Gabriel L. Oliveira
Abstract:
Although neural networks are conventionally optimized towards zero training loss, it has been recently learned that targeting a non-zero training loss threshold, referred to as a flood level, often enables better test time generalization. Current approaches, however, apply the same constant flood level to all training samples, which inherently assumes all the samples have the same difficulty. We p…
▽ More
Although neural networks are conventionally optimized towards zero training loss, it has been recently learned that targeting a non-zero training loss threshold, referred to as a flood level, often enables better test time generalization. Current approaches, however, apply the same constant flood level to all training samples, which inherently assumes all the samples have the same difficulty. We present AdaFlood, a novel flood regularization method that adapts the flood level of each training sample according to the difficulty of the sample. Intuitively, since training samples are not equal in difficulty, the target training loss should be conditioned on the instance. Experiments on datasets covering four diverse input modalities - text, images, asynchronous event sequences, and tabular - demonstrate the versatility of AdaFlood across data domains and noise levels.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Cloud-Connected Wireless Holter Monitor Machine with Neural Networks Based ECG Analysis for Remote Health Monitoring
Authors:
Azlaan Ranjha,
Laiba Jabbar,
Osaid Ahmed
Abstract:
This study describes the creation of a wireless, transportable Holter monitor to improve the accuracy of cardiac disease diagnosis. The main goal of this study is to develop a low-cost cardiac screening system suited explicitly for underprivileged areas, addressing the rising rates of cardiovascular death. The suggested system includes a wireless Electrocardiogram (ECG) module for real-time cardia…
▽ More
This study describes the creation of a wireless, transportable Holter monitor to improve the accuracy of cardiac disease diagnosis. The main goal of this study is to develop a low-cost cardiac screening system suited explicitly for underprivileged areas, addressing the rising rates of cardiovascular death. The suggested system includes a wireless Electrocardiogram (ECG) module for real-time cardiac signal gathering using attached electrodes, with data transfer made possible by WiFi to a cloud server for archival and analysis. The system uses a neural network model for automated ECG classification, concentrating on the identification of cardiac anomalies. The diagnostic performance of cardiologist-level ECG analysis is surpassed by our upgraded deep neural network architecture, which underwent thorough evaluation and showed a stunning accuracy rate of more than 88\%. A quick, accurate, and reasonably priced option for cardiac screening is provided by this ground-breaking technology, which smoothly merges wireless data transfer with AI-assisted diagnostics. In addition to providing a thorough overview of the development process, this paper also highlights methods used to improve model accuracy, such as data preparation, class imbalance correction using oversampling, and model fine-tuning. The work shows the viability of a comprehensive remote cardiac screening system powered by AI and maximising the use of wearable and cloud computing resources. Such cutting-edge remote health monitoring technologies have great promise for improved health outcomes and early identification, especially in resource-constrained countries.
△ Less
Submitted 21 October, 2023;
originally announced October 2023.
-
Tree Cross Attention
Authors:
Leo Feng,
Frederick Tung,
Hossein Hajimirsadeghi,
Yoshua Bengio,
Mohamed Osama Ahmed
Abstract:
Cross Attention is a popular method for retrieving information from a set of context tokens for making predictions. At inference time, for each prediction, Cross Attention scans the full set of $\mathcal{O}(N)$ tokens. In practice, however, often only a small subset of tokens are required for good performance. Methods such as Perceiver IO are cheap at inference as they distill the information to a…
▽ More
Cross Attention is a popular method for retrieving information from a set of context tokens for making predictions. At inference time, for each prediction, Cross Attention scans the full set of $\mathcal{O}(N)$ tokens. In practice, however, often only a small subset of tokens are required for good performance. Methods such as Perceiver IO are cheap at inference as they distill the information to a smaller-sized set of latent tokens $L < N$ on which cross attention is then applied, resulting in only $\mathcal{O}(L)$ complexity. However, in practice, as the number of input tokens and the amount of information to distill increases, the number of latent tokens needed also increases significantly. In this work, we propose Tree Cross Attention (TCA) - a module based on Cross Attention that only retrieves information from a logarithmic $\mathcal{O}(\log(N))$ number of tokens for performing inference. TCA organizes the data in a tree structure and performs a tree search at inference time to retrieve the relevant tokens for prediction. Leveraging TCA, we introduce ReTreever, a flexible architecture for token-efficient inference. We show empirically that Tree Cross Attention (TCA) performs comparable to Cross Attention across various classification and uncertainty regression tasks while being significantly more token-efficient. Furthermore, we compare ReTreever against Perceiver IO, showing significant gains while using the same number of tokens for inference.
△ Less
Submitted 1 March, 2024; v1 submitted 29 September, 2023;
originally announced September 2023.
-
MVD:A Novel Methodology and Dataset for Acoustic Vehicle Type Classification
Authors:
Mohd Ashhad,
Omar Ahmed,
Sooraj K. Ambat,
Zeeshan Ali Haq,
Mansaf Alam
Abstract:
Rising urban populations have led to a surge in vehicle use and made traffic monitoring and management indispensable. Acoustic traffic monitoring (ATM) offers a cost-effective and efficient alternative to more computationally expensive methods of monitoring traffic such as those involving computer vision technologies. In this paper, we present MVD and MVDA: two open datasets for the development of…
▽ More
Rising urban populations have led to a surge in vehicle use and made traffic monitoring and management indispensable. Acoustic traffic monitoring (ATM) offers a cost-effective and efficient alternative to more computationally expensive methods of monitoring traffic such as those involving computer vision technologies. In this paper, we present MVD and MVDA: two open datasets for the development of acoustic traffic monitoring and vehicle-type classification algorithms, which contain audio recordings of moving vehicles. The dataset contain four classes- Trucks, Cars, Motorbikes, and a No-vehicle class. Additionally, we propose a novel and efficient way to accurately classify these acoustic signals using cepstrum and spectrum based local and global audio features, and a multi-input neural network. Experimental results show that our methodology improves upon the established baselines of previous works and achieves an accuracy of 91.98% and 96.66% on MVD and MVDA Datasets, respectively. Finally, the proposed model was deployed through an Android application to make it accessible for testing and demonstrate its efficacy.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
ANER: Arabic and Arabizi Named Entity Recognition using Transformer-Based Approach
Authors:
Abdelrahman "Boda" Sadallah,
Omar Ahmed,
Shimaa Mohamed,
Omar Hatem,
Doaa Hesham,
Ahmed H. Yousef
Abstract:
One of the main tasks of Natural Language Processing (NLP), is Named Entity Recognition (NER). It is used in many applications and also can be used as an intermediate step for other tasks. We present ANER, a web-based named entity recognizer for the Arabic, and Arabizi languages. The model is built upon BERT, which is a transformer-based encoder. It can recognize 50 different entity classes, cover…
▽ More
One of the main tasks of Natural Language Processing (NLP), is Named Entity Recognition (NER). It is used in many applications and also can be used as an intermediate step for other tasks. We present ANER, a web-based named entity recognizer for the Arabic, and Arabizi languages. The model is built upon BERT, which is a transformer-based encoder. It can recognize 50 different entity classes, covering various fields. We trained our model on the WikiFANE\_Gold dataset which consists of Wikipedia articles. We achieved an F1 score of 88.7\%, which beats CAMeL Tools' F1 score of 83\% on the ANERcorp dataset, which has only 4 classes. We also got an F1 score of 77.7\% on the NewsFANE\_Gold dataset which contains out-of-domain data from News articles. The system is deployed on a user-friendly web interface that accepts users' inputs in Arabic, or Arabizi. It allows users to explore the entities in the text by highlighting them. It can also direct users to get information about entities through Wikipedia directly. We added the ability to do NER using our model, or CAMeL Tools' model through our website. ANER is publicly accessible at \url{http://www.aner.online}. We also deployed our model on HuggingFace at https://huggingface.co/boda/ANER, to allow developers to test and use it.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Constant Memory Attention Block
Authors:
Leo Feng,
Frederick Tung,
Hossein Hajimirsadeghi,
Yoshua Bengio,
Mohamed Osama Ahmed
Abstract:
Modern foundation model architectures rely on attention mechanisms to effectively capture context. However, these methods require linear or quadratic memory in terms of the number of inputs/datapoints, limiting their applicability in low-compute domains. In this work, we propose Constant Memory Attention Block (CMAB), a novel general-purpose attention block that computes its output in constant mem…
▽ More
Modern foundation model architectures rely on attention mechanisms to effectively capture context. However, these methods require linear or quadratic memory in terms of the number of inputs/datapoints, limiting their applicability in low-compute domains. In this work, we propose Constant Memory Attention Block (CMAB), a novel general-purpose attention block that computes its output in constant memory and performs updates in constant computation. Highlighting CMABs efficacy, we introduce methods for Neural Processes and Temporal Point Processes. Empirically, we show our proposed methods achieve results competitive with state-of-the-art while being significantly more memory efficient.
△ Less
Submitted 21 June, 2023;
originally announced June 2023.
-
Segregated FLS Processing Cores for V/STOL Autonomous Landing Guidance Assistant System using FPGA
Authors:
Hossam O. Ahmed
Abstract:
It is highly predicted that the roads and parking areas will be extremely congested with vehicles to the point that searching for a novel solution will not be an optional choice for conserving the sustainability rate of the overall humanity's development growth. Such issue could be overcome by develo** modified generations of the Urban Air Mobility (UAM) vehicles that essentially depend on the V…
▽ More
It is highly predicted that the roads and parking areas will be extremely congested with vehicles to the point that searching for a novel solution will not be an optional choice for conserving the sustainability rate of the overall humanity's development growth. Such issue could be overcome by develo** modified generations of the Urban Air Mobility (UAM) vehicles that essentially depend on the Vertical and/or Short Take-Off and Landing (V/STOL) feature to increase the efficiency of landing capabilities on limited-space parking areas. The complexity of integrating an efficient and safe V/STOL feature in such UAM vehicles is notably difficult comparing with the conventional and normal techniques for landing and take-off. The efficient V/STOL feature should be carried out by a complete and collaborative Cyber-Physical System (CPS) processing architecture, such as the CPS-5C architecture. In this paper, we only proposed two CPS-5C physical layers of a V/STOL Autonomous Landing Guidance Assistant System (ALGAS2) processing unit to increase the reliability of the vertical landing mechanism. The proposed V/STOL-ALGAS2 system depends on Fuzzy Logic System (FLS) as the advanced control unit. Furthermore, the proposed ALGAS2 system depends on four symmetric and segregated processing ALGAS2 cores that processing the data in a fully parallel and independent manner to enhance many essential security and safety factors for the futuristic UAM vehicles. The proposed ALGAS2 digital circuits architecture has been designed using MATLAB and VHDL. Also, it has been further analyzed for the implementation and validation tests using the Intel Altera OpenVINO FPGA board. The proposed ALGAS processing unit attained a maximum computational processing performance of about 21.22 Giga Operations per Seconds (GOPS).
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Memory Efficient Neural Processes via Constant Memory Attention Block
Authors:
Leo Feng,
Frederick Tung,
Hossein Hajimirsadeghi,
Yoshua Bengio,
Mohamed Osama Ahmed
Abstract:
Neural Processes (NPs) are popular meta-learning methods for efficiently modelling predictive uncertainty. Recent state-of-the-art methods, however, leverage expensive attention mechanisms, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant that only requires constant memory. To do so, we f…
▽ More
Neural Processes (NPs) are popular meta-learning methods for efficiently modelling predictive uncertainty. Recent state-of-the-art methods, however, leverage expensive attention mechanisms, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant that only requires constant memory. To do so, we first propose an efficient update operation for Cross Attention. Leveraging the update operation, we propose Constant Memory Attention Block (CMAB), a novel attention block that (i) is permutation invariant, (ii) computes its output in constant memory, and (iii) performs constant computation updates. Finally, building on CMAB, we detail Constant Memory Attentive Neural Processes. Empirically, we show CMANPs achieve state-of-the-art results on popular NP benchmarks while being significantly more memory efficient than prior methods.
△ Less
Submitted 27 May, 2024; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Fault Tolerant Processing Unit Using Gamma Distribution Sliding Window For Autonomous Landing Guidance System
Authors:
Hossam O. Ahmed
Abstract:
To keep up with today's dense metropolitan areas and their accompanying traffic problems, a growing number of towns are looking for more advanced and swift urban taxi drones. The safety parameters that must be taken into consideration may be the most important element in the widespread use of such technology. Most recent aviation mishaps have happened during the landing phase, making this a partic…
▽ More
To keep up with today's dense metropolitan areas and their accompanying traffic problems, a growing number of towns are looking for more advanced and swift urban taxi drones. The safety parameters that must be taken into consideration may be the most important element in the widespread use of such technology. Most recent aviation mishaps have happened during the landing phase, making this a particularly important safety consideration for Vertical and/or Short Take-Off and Landing (V/STOL) drones. In this study, we focused on improving the fault tolerance of the processor architectures used by the predecessors of Autonomous Landing Guidance Assistance Systems (ALGAS), which in turn improves their decision-making capabilities. Furthermore, this is achieved by proposing a fault-tolerant processing architecture that depends on the Gamma Distribution Sliding Window Unit (GDSWU). This proposed GDSWU has been designed completely using VHDL, and the targeted FPFA was the Intel Cyclone V 5CGXFC9D6F27C7 chip. The GDSWU could operate at a maximum frequency of 369.96 MHz, as calculated by the synthesis results of the INTEL Quartus Prime program. The suggested GDSWU core only requires 20.36 mW for dynamic core and I/O power consumption.
△ Less
Submitted 3 June, 2023; v1 submitted 2 May, 2023;
originally announced May 2023.
-
Coarse Grained FLS-based Processor with Prognostic Malfunction Feature for UAM Drones using FPGA
Authors:
Hossam O. Ahmed
Abstract:
Many overall safety factors need to be considered in the next generation of Urban Air Mobility (UAM) systems and addressing these can become the anchor point for such technology to reach consent for worldwide application. On the other hand, fulfilling the safety requirements from an exponential increase of prolific UAM systems, is extremely complicated, and requires careful consideration of a vari…
▽ More
Many overall safety factors need to be considered in the next generation of Urban Air Mobility (UAM) systems and addressing these can become the anchor point for such technology to reach consent for worldwide application. On the other hand, fulfilling the safety requirements from an exponential increase of prolific UAM systems, is extremely complicated, and requires careful consideration of a variety of issues. One of the key goals of these Unmanned Air Systems (UAS) is the requirement to support the launch and control of hundreds of thousands of these advanced drones in the air simultaneously. Given the impracticalities of training the corresponding number of expert pilots, achieving this goal can only be realized through safe operation in either fullautonomous or semi-autonomous modes. According to many recent studies, the majority of flight accidents are concentrated on the last three stages of a flight trip, which include the Initial Approach, Final Approach, and Landing Phases of an airplane trip. Therefore, this paper proposes a novel decentralized processing system for enhancing the safety factors during the critical phases of Vertical and/or Short Take-Off and Landing (V/STOL) drones. This has been achieved by adopting several processing and control algorithms such as an Open Fuzzy Logic System (FLS) integrated with a Flight Rules Unit (FRU), FIR filters, and a novel Prognostic Malfunction processing unit. After applying several optimization techniques, this novel coarse-grained Autonomous Landing Guidance Assistance System (ALGAS3) processing architecture has been optimized to achieve a maximum computational processing performance of 70.82 Giga Operations per Second (GOPS). Also, the proposed ALGAS3 system shows an ultra-low dynamic thermal power dissipation (I/O and core) of 145.4 mW which is ideal for mobile avionic systems using INTEL 5CGXFC9D6F27C7 FPGA chip.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Numerical Investigation of a Rotating Double Compression Ramp Intake
Authors:
Lubna Margha,
Ahmed A. Hamada,
Othman Ahmed,
Ahmed Eltaweel
Abstract:
The intakes of air-breathing high-speed flying vehicles produce a large share of the thrust propulsion. Furthermore, the propulsion performance of these engines increases when the single-ramp intake is replaced with a multiple-ramps intake. Many scholars numerically and experimentally studied the high-speed engine performance over static single and multiple compression ramps. However, the transien…
▽ More
The intakes of air-breathing high-speed flying vehicles produce a large share of the thrust propulsion. Furthermore, the propulsion performance of these engines increases when the single-ramp intake is replaced with a multiple-ramps intake. Many scholars numerically and experimentally studied the high-speed engine performance over static single and multiple compression ramps. However, the transient behavior of the flow during the rotation of the double compression ramp from a single ramp is not fully investigated. The present paper aims to numerically investigate the transient shock reflection phenomenon over a rotating double wedge. The problem will start with a 3-Mach number inviscid flow over a single wedge. Then, a portion of the wedge will be rotated upstream at a quite low trailing Mach number to avoid the significant lag effect in the shock waves system. This idea could be applied in the supersonic intake or extensionally in the hypersonic intake of scramjets with a somehow complex mechanism. Further, the length of the rotating portion of the wedge will be changed three times to study its effect on the shock system. The results show a high gain in the pressure due to the rotation of the wedge. Moreover, the wave angles were larger at the low chord ratio value of $w_2/w_i= 0.25$ than at the high values of $w_2/w_i$ at the same second wedge rotating angle, $θ_2$, resulting in a higher pressure distribution.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
Meta Temporal Point Processes
Authors:
Wonho Bae,
Mohamed Osama Ahmed,
Frederick Tung,
Gabriel L. Oliveira
Abstract:
A temporal point process (TPP) is a stochastic process where its realization is a sequence of discrete events in time. Recent work in TPPs model the process using a neural network in a supervised learning framework, where a training set is a collection of all the sequences. In this work, we propose to train TPPs in a meta learning framework, where each sequence is treated as a different task, via…
▽ More
A temporal point process (TPP) is a stochastic process where its realization is a sequence of discrete events in time. Recent work in TPPs model the process using a neural network in a supervised learning framework, where a training set is a collection of all the sequences. In this work, we propose to train TPPs in a meta learning framework, where each sequence is treated as a different task, via a novel framing of TPPs as neural processes (NPs). We introduce context sets to model TPPs as an instantiation of NPs. Motivated by attentive NP, we also introduce local history matching to help learn more informative features. We demonstrate the potential of the proposed method on popular public benchmark datasets and tasks, and compare with state-of-the-art TPP methods.
△ Less
Submitted 27 January, 2023;
originally announced January 2023.
-
Gumbel-Softmax Selective Networks
Authors:
Mahmoud Salem,
Mohamed Osama Ahmed,
Frederick Tung,
Gabriel Oliveira
Abstract:
ML models often operate within the context of a larger system that can adapt its response when the ML model is uncertain, such as falling back on safe defaults or a human in the loop. This commonly encountered operational context calls for principled techniques for training ML models with the option to abstain from predicting when uncertain. Selective neural networks are trained with an integrated…
▽ More
ML models often operate within the context of a larger system that can adapt its response when the ML model is uncertain, such as falling back on safe defaults or a human in the loop. This commonly encountered operational context calls for principled techniques for training ML models with the option to abstain from predicting when uncertain. Selective neural networks are trained with an integrated option to abstain, allowing them to learn to recognize and optimize for the subset of the data distribution for which confident predictions can be made. However, optimizing selective networks is challenging due to the non-differentiability of the binary selection function (the discrete decision of whether to predict or abstain). This paper presents a general method for training selective networks that leverages the Gumbel-softmax reparameterization trick to enable selection within an end-to-end differentiable training framework. Experiments on public datasets demonstrate the potential of Gumbel-softmax selective networks for selective regression and classification.
△ Less
Submitted 18 November, 2022;
originally announced November 2022.
-
Latent Bottlenecked Attentive Neural Processes
Authors:
Leo Feng,
Hossein Hajimirsadeghi,
Yoshua Bengio,
Mohamed Osama Ahmed
Abstract:
Neural Processes (NPs) are popular methods in meta-learning that can estimate predictive uncertainty on target datapoints by conditioning on a context dataset. Previous state-of-the-art method Transformer Neural Processes (TNPs) achieve strong performance but require quadratic computation with respect to the number of context datapoints, significantly limiting its scalability. Conversely, existing…
▽ More
Neural Processes (NPs) are popular methods in meta-learning that can estimate predictive uncertainty on target datapoints by conditioning on a context dataset. Previous state-of-the-art method Transformer Neural Processes (TNPs) achieve strong performance but require quadratic computation with respect to the number of context datapoints, significantly limiting its scalability. Conversely, existing sub-quadratic NP variants perform significantly worse than that of TNPs. Tackling this issue, we propose Latent Bottlenecked Attentive Neural Processes (LBANPs), a new computationally efficient sub-quadratic NP variant, that has a querying computational complexity independent of the number of context datapoints. The model encodes the context dataset into a constant number of latent vectors on which self-attention is performed. When making predictions, the model retrieves higher-order information from the context dataset via multiple cross-attention mechanisms on the latent vectors. We empirically show that LBANPs achieve results competitive with the state-of-the-art on meta-regression, image completion, and contextual multi-armed bandits. We demonstrate that LBANPs can trade-off the computational cost and performance according to the number of latent vectors. Finally, we show LBANPs can scale beyond existing attention-based NP variants to larger dataset settings.
△ Less
Submitted 1 March, 2023; v1 submitted 15 November, 2022;
originally announced November 2022.
-
An Optical Parametric Amplifier via $ χ^{(2)} $ in AlGaAs Waveguides
Authors:
Zhizhong Yan,
Haoyu He,
Han Liu,
Meng Iu,
Osman Ahmed,
Eric Chen,
Youichi Akasaka,
Tadashi Ikeuchi,
Amr S. Helmy
Abstract:
We report parametric gain by utilizing $ χ^{(2)} $ non-linearities in a semiconductor Bragg Reflection Waveguide (BRW) waveguide chip. Under the two-mode degenerate type II phase matching, it can be shown that more than 18 dBs of parametric gain for both TE and TM modes is tenable in 100s of micrometers of device length. Polarization insensitive parametric gain can be attained within the 1550 nm r…
▽ More
We report parametric gain by utilizing $ χ^{(2)} $ non-linearities in a semiconductor Bragg Reflection Waveguide (BRW) waveguide chip. Under the two-mode degenerate type II phase matching, it can be shown that more than 18 dBs of parametric gain for both TE and TM modes is tenable in 100s of micrometers of device length. Polarization insensitive parametric gain can be attained within the 1550 nm region of the spectrum. These AlGaAs BRW waveguides exhibit sub-photon per pulse sensitivity. This is in sharp contrast to other types of parametric gain devices which utilize $ χ^{(3)} $, where the pump wavelength is in the vicinity of the signal wavelength. This sensitivity, which reached 0.1~photon/pulse, can usher a new era for on-chip quantum information processing using compact, micrometer-scale devices.
△ Less
Submitted 21 June, 2022;
originally announced June 2022.
-
Towards Better Selective Classification
Authors:
Leo Feng,
Mohamed Osama Ahmed,
Hossein Hajimirsadeghi,
Amir Abdi
Abstract:
We tackle the problem of Selective Classification where the objective is to achieve the best performance on a predetermined ratio (coverage) of the dataset. Recent state-of-the-art selective methods come with architectural changes either via introducing a separate selection head or an extra abstention logit. In this paper, we challenge the aforementioned methods. The results suggest that the super…
▽ More
We tackle the problem of Selective Classification where the objective is to achieve the best performance on a predetermined ratio (coverage) of the dataset. Recent state-of-the-art selective methods come with architectural changes either via introducing a separate selection head or an extra abstention logit. In this paper, we challenge the aforementioned methods. The results suggest that the superior performance of state-of-the-art methods is owed to training a more generalizable classifier rather than their proposed selection mechanisms. We argue that the best performing selection mechanism should instead be rooted in the classifier itself. Our proposed selection strategy uses the classification scores and achieves better results by a significant margin, consistently, across all coverages and all datasets, without any added compute cost. Furthermore, inspired by semi-supervised learning, we propose an entropy-based regularizer that improves the performance of selective classification methods. Our proposed selection mechanism with the proposed entropy-based regularizer achieves new state-of-the-art results.
△ Less
Submitted 1 March, 2023; v1 submitted 17 June, 2022;
originally announced June 2022.
-
Quantifying the dust in SN 2012aw and iPTF14hls with ORBYTS
Authors:
Maria Niculescu-Duvaz,
M. J. Barlow,
W. Dunn,
A. Bevan,
Omar Ahmed,
David Arkless,
Jon Barker,
Sidney Bartolotta,
Liam Brockway,
Daniel Browne,
Ubaid Esmail,
Max Garner,
Wiktoria Guz,
Scarlett King,
Hayri Kose,
Madeline Lampstaes-Capes,
Joseph Magen,
Nicole Morrison,
Kyaw Oo,
Balvinder Paik,
Joanne Primrose,
Danny Quick,
Anais Radeka,
Anthony Rodney,
Eleanor Sandeman
, et al. (10 additional authors not shown)
Abstract:
Core-collapse supernovae (CCSNe) are potentially capable of producing large quantities of dust, with strong evidence that ejecta dust masses can grow significantly over extended periods of time. Red-blue asymmetries in the broad emission lines of CCSNe can be modelled using the Monte Carlo radiative transfer code DAMOCLES, to determine ejecta dust masses. To facilitate easier use of DAMOCLES, we p…
▽ More
Core-collapse supernovae (CCSNe) are potentially capable of producing large quantities of dust, with strong evidence that ejecta dust masses can grow significantly over extended periods of time. Red-blue asymmetries in the broad emission lines of CCSNe can be modelled using the Monte Carlo radiative transfer code DAMOCLES, to determine ejecta dust masses. To facilitate easier use of DAMOCLES, we present a Tkinter graphical user interface (GUI) running DAMOCLES. The GUI was tested by high school students as part of the Original Research By Young Twinkle Students (ORBYTS) programme, who used it to measure the dust masses formed at two epochs in two Type IIP CCSNe: SN 2012aw and iPTF14hls, demonstrating that a wide range of people can contribute significantly to scientific advancement. Bayesian methods were used to quantify uncertainties on our model parameters. From the presence of a red scattering wing in the day 1863 H$α$ profile of SN 2012aw, we were able to constrain the dust composition to large (radius $>0.1 μ$m) silicate grains, with a dust mass of $6.0^{+21.9}_{-3.6}\times10^{-4} M_\odot$. From the day 1158 H$α$ profile of SN 2012aw, we found a dust mass of $3.0^{+14}_{-2.5}\times10^{-4}$ M$_\odot$. For iPTF14hls, we found a day 1170 dust mass of 8.1 $^{+81}_{-7.6}\times10^{-5}$ M$_{\odot}$ for a dust composition consisting of 50% amorphous carbon and 50% astronomical silicate. At 1000 days post explosion, SN 2012aw and iPTF14hls have formed less dust than SN 1987A, suggesting that SN 1987A could form larger dust masses than other Type IIP's.
△ Less
Submitted 4 January, 2023; v1 submitted 1 June, 2022;
originally announced June 2022.
-
Monotonicity Regularization: Improved Penalties and Novel Applications to Disentangled Representation Learning and Robust Classification
Authors:
Joao Monteiro,
Mohamed Osama Ahmed,
Hossein Hajimirsadeghi,
Greg Mori
Abstract:
We study settings where gradient penalties are used alongside risk minimization with the goal of obtaining predictors satisfying different notions of monotonicity. Specifically, we present two sets of contributions. In the first part of the paper, we show that different choices of penalties define the regions of the input space where the property is observed. As such, previous methods result in mo…
▽ More
We study settings where gradient penalties are used alongside risk minimization with the goal of obtaining predictors satisfying different notions of monotonicity. Specifically, we present two sets of contributions. In the first part of the paper, we show that different choices of penalties define the regions of the input space where the property is observed. As such, previous methods result in models that are monotonic only in a small volume of the input space. We thus propose an approach that uses mixtures of training instances and random points to populate the space and enforce the penalty in a much larger region. As a second set of contributions, we introduce regularization strategies that enforce other notions of monotonicity in different settings. In this case, we consider applications, such as image classification and generative modeling, where monotonicity is not a hard constraint but can help improve some aspects of the model. Namely, we show that inducing monotonicity can be beneficial in applications such as: (1) allowing for controllable data generation, (2) defining strategies to detect anomalous data, and (3) generating explanations for predictions. Our proposed approaches do not introduce relevant computational overhead while leading to efficient procedures that provide extra benefits over baseline models.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
AI-Bind: Improving Binding Predictions for Novel Protein Targets and Ligands
Authors:
Ayan Chatterjee,
Robin Walters,
Zohair Shafi,
Omair Shafi Ahmed,
Michael Sebek,
Deisy Gysi,
Rose Yu,
Tina Eliassi-Rad,
Albert-László Barabási,
Giulia Menichetti
Abstract:
Identifying novel drug-target interactions (DTI) is a critical and rate limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We first unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortc…
▽ More
Identifying novel drug-target interactions (DTI) is a critical and rate limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We first unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Then, we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training, allowing us to limit the annotation imbalance and improve binding predictions for novel proteins and ligands. We illustrate the value of AI-Bind by predicting drugs and natural compounds with binding affinity to SARS-CoV-2 viral proteins and the associated human proteins. We also validate these predictions via docking simulations and comparison with recent experimental evidence, and step up the process of interpreting machine learning prediction of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. Overall, AI-Bind offers a powerful high-throughput approach to identify drug-target combinations, with the potential of becoming a powerful tool in drug discovery.
△ Less
Submitted 9 November, 2022; v1 submitted 24 December, 2021;
originally announced December 2021.
-
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
Authors:
Ossama Ahmed,
Frederik Träuble,
Anirudh Goyal,
Alexander Neitz,
Yoshua Bengio,
Bernhard Schölkopf,
Manuel Wüthrich,
Stefan Bauer
Abstract:
Despite recent successes of reinforcement learning (RL), it remains a challenge for agents to transfer learned skills to related environments. To facilitate research addressing this problem, we propose CausalWorld, a benchmark for causal structure and transfer learning in a robotic manipulation environment. The environment is a simulation of an open-source robotic platform, hence offering the poss…
▽ More
Despite recent successes of reinforcement learning (RL), it remains a challenge for agents to transfer learned skills to related environments. To facilitate research addressing this problem, we propose CausalWorld, a benchmark for causal structure and transfer learning in a robotic manipulation environment. The environment is a simulation of an open-source robotic platform, hence offering the possibility of sim-to-real transfer. Tasks consist of constructing 3D shapes from a given set of blocks - inspired by how children learn to build complex structures. The key strength of CausalWorld is that it provides a combinatorial family of such tasks with common causal structure and underlying factors (including, e.g., robot and object masses, colors, sizes). The user (or the agent) may intervene on all causal variables, which allows for fine-grained control over how similar different tasks (or task distributions) are. One can thus easily define training and evaluation distributions of a desired difficulty level, targeting a specific form of generalization (e.g., only changes in appearance or object mass). Further, this common parametrization facilitates defining curricula by interpolating between an initial and a target task. While users may define their own task distributions, we present eight meaningful distributions as concrete benchmarks, ranging from simple to very challenging, all of which require long-horizon planning as well as precise low-level motor control. Finally, we provide baseline results for a subset of these tasks on distinct training curricula and corresponding evaluation protocols, verifying the feasibility of the tasks in this benchmark.
△ Less
Submitted 24 November, 2020; v1 submitted 8 October, 2020;
originally announced October 2020.
-
The Effect of Pitch Distance on the Statistics and Morphology of Through-Silicon Via Extrusion
Authors:
Golareh Jalilvand,
Omar Ahmed,
Nicolas Dube,
Tengfei Jiang
Abstract:
In this work, we investigated the effect of pitch distance on the statistical variation and morphology of extrusion in Cu TSVs and the underlying mechanisms. Extrusion statistics were obtained from TSV samples with two different pitch distances. A notable increase in the magnitude of extrusion was observed in vias with smaller pitch, yet the extrusion spread was largely unaffected. The morphologie…
▽ More
In this work, we investigated the effect of pitch distance on the statistical variation and morphology of extrusion in Cu TSVs and the underlying mechanisms. Extrusion statistics were obtained from TSV samples with two different pitch distances. A notable increase in the magnitude of extrusion was observed in vias with smaller pitch, yet the extrusion spread was largely unaffected. The morphologies of the extruded vias were characterized and categorized, and finite element analysis was carried out to study the effect of pitch distance on stress and deformation. The results suggested that the overlap** of stress fields from neighboring vias resulted in larger stress in the small-pitch vias, which subsequently led to higher extrusion. The morphologies observed in the extruded vias were related to the operation of different deformation mechanisms under the combined effect of stress and microstructure. The statistical spread of via extrusion, which was similar in both groups of vias, was related to the stochastic nature of the via microstructure. By using a thin cap layer of Ta to suppress the vacancy sources at the via top surface, the adverse effect of the pitch distance was minimized and a pronounced reduction of extrusion was achieved in vias of both pitch distances.
△ Less
Submitted 25 September, 2020;
originally announced September 2020.
-
Point Process Flows
Authors:
Nazanin Mehrasa,
Ruizhi Deng,
Mohamed Osama Ahmed,
Bo Chang,
Jiawei He,
Thibaut Durand,
Marcus Brubaker,
Greg Mori
Abstract:
Event sequences can be modeled by temporal point processes (TPPs) to capture their asynchronous and probabilistic nature. We propose an intensity-free framework that directly models the point process distribution by utilizing normalizing flows. This approach is capable of capturing highly complex temporal distributions and does not rely on restrictive parametric forms. Comparisons with state-of-th…
▽ More
Event sequences can be modeled by temporal point processes (TPPs) to capture their asynchronous and probabilistic nature. We propose an intensity-free framework that directly models the point process distribution by utilizing normalizing flows. This approach is capable of capturing highly complex temporal distributions and does not rely on restrictive parametric forms. Comparisons with state-of-the-art baseline models on both synthetic and challenging real-life datasets show that the proposed framework is effective at modeling the stochasticity of discrete event sequences.
△ Less
Submitted 22 December, 2019; v1 submitted 18 October, 2019;
originally announced October 2019.
-
Human Intracranial EEG Quantitative Analysis and Automatic Feature Learning for Epileptic Seizure Prediction
Authors:
Ramy Hussein,
Mohamed Osama Ahmed,
Rabab Ward,
Z. Jane Wang,
Levin Kuhlmann,
Yi Guo
Abstract:
Objective: The aim of this study is to develop an efficient and reliable epileptic seizure prediction system using intracranial EEG (iEEG) data, especially for people with drug-resistant epilepsy. The prediction procedure should yield accurate results in a fast enough fashion to alert patients of impending seizures. Methods: We quantitatively analyze the human iEEG data to obtain insights into how…
▽ More
Objective: The aim of this study is to develop an efficient and reliable epileptic seizure prediction system using intracranial EEG (iEEG) data, especially for people with drug-resistant epilepsy. The prediction procedure should yield accurate results in a fast enough fashion to alert patients of impending seizures. Methods: We quantitatively analyze the human iEEG data to obtain insights into how the human brain behaves before and between epileptic seizures. We then introduce an efficient pre-processing method for reducing the data size and converting the time-series iEEG data into an image-like format that can be used as inputs to convolutional neural networks (CNNs). Further, we propose a seizure prediction algorithm that uses cooperative multi-scale CNNs for automatic feature learning of iEEG data. Results: 1) iEEG channels contain complementary information and excluding individual channels is not advisable to retain the spatial information needed for accurate prediction of epileptic seizures. 2) The traditional PCA is not a reliable method for iEEG data reduction in seizure prediction. 3) Hand-crafted iEEG features may not be suitable for reliable seizure prediction performance as the iEEG data varies between patients and over time for the same patient. 4) Seizure prediction results show that our algorithm outperforms existing methods by achieving an average sensitivity of 87.85% and AUC score of 0.84. Conclusion: Understanding how the human brain behaves before seizure attacks and far from them facilitates better designs of epileptic seizure predictors. Significance: Accurate seizure prediction algorithms can warn patients about the next seizure attack so they could avoid dangerous activities. Medications could then be administered to abort the impending seizure and minimize the risk of injury.
△ Less
Submitted 7 April, 2019;
originally announced April 2019.
-
Combining Bayesian Optimization and Lipschitz Optimization
Authors:
Mohamed Osama Ahmed,
Sharan Vaswani,
Mark Schmidt
Abstract:
Bayesian optimization and Lipschitz optimization have developed alternative techniques for optimizing black-box functions. They each exploit a different form of prior about the function. In this work, we explore strategies to combine these techniques for better global optimization. In particular, we propose ways to use the Lipschitz continuity assumption within traditional BO algorithms, which we…
▽ More
Bayesian optimization and Lipschitz optimization have developed alternative techniques for optimizing black-box functions. They each exploit a different form of prior about the function. In this work, we explore strategies to combine these techniques for better global optimization. In particular, we propose ways to use the Lipschitz continuity assumption within traditional BO algorithms, which we call Lipschitz Bayesian optimization (LBO). This approach does not increase the asymptotic runtime and in some cases drastically improves the performance (while in the worst-case the performance is similar). Indeed, in a particular setting, we prove that using the Lipschitz information yields the same or a better bound on the regret compared to using Bayesian optimization on its own. Moreover, we propose a simple heuristics to estimate the Lipschitz constant, and prove that a growing estimate of the Lipschitz constant is in some sense ``harmless''. Our experiments on 15 datasets with 4 acquisition functions show that in the worst case LBO performs similar to the underlying BO method while in some cases it performs substantially better. Thompson sampling in particular typically saw drastic improvements (as the Lipschitz information corrected for its well-known ``over-exploration'' phenomenon) and its LBO variant often outperformed other acquisition functions.
△ Less
Submitted 28 July, 2020; v1 submitted 9 October, 2018;
originally announced October 2018.
-
$(α,β)$-A-Normal Operators in Semi-Hilbertian Spaces
Authors:
Abdelkader Benali,
Ould Ahmed Mahmoud Sid Ahmed
Abstract:
In this paper we introduce and prove some properties of $(α;β)$-normal operators according to semi-Hilbertian space structures. Furthermore we s,ate various inequalities between the A-operator norm and A-numerical radius of $(α,β)$-normal operators in semi Hilbertian spaces.
In this paper we introduce and prove some properties of $(α;β)$-normal operators according to semi-Hilbertian space structures. Furthermore we s,ate various inequalities between the A-operator norm and A-numerical radius of $(α,β)$-normal operators in semi Hilbertian spaces.
△ Less
Submitted 11 October, 2016;
originally announced October 2016.
-
On a joint $(m, (q_1, ..., q_d))$-partial isometries and a joint $m$-invertible $d$-tuple of operators on a Hilbert space
Authors:
Ould Ahmed Mahmoud Sid Ahmed
Abstract:
The aim of the present paper is, firstly we study the concepts of (m, (q_1, ..., q_d))- partial isometries on a Hilbert space, secondly, we introduce the notion of m- invertibility of tuples of operators as a natural generalization of the m-invertibility in single variable operators.
The aim of the present paper is, firstly we study the concepts of (m, (q_1, ..., q_d))- partial isometries on a Hilbert space, secondly, we introduce the notion of m- invertibility of tuples of operators as a natural generalization of the m-invertibility in single variable operators.
△ Less
Submitted 26 February, 2016;
originally announced February 2016.
-
Generalizations of Kaplansky Theorem for some (p,k)-Quasihyponormal Operators
Authors:
Abdelkader Benali,
Ould Ahmed Mahmoud Sid Ahmed
Abstract:
In the present paper, we generalized some notions of bounded operators to un- bounded operators on Hilbert space such as k-quasihyponormal and k-paranormal unbounded operators. Furthermore, we extend the Kaplansky theorem for normal operators to some (p; k)-quasihyponormal operators. Namely the (p; k)-quasihyponormality of the product AB and BA for two operators.
In the present paper, we generalized some notions of bounded operators to un- bounded operators on Hilbert space such as k-quasihyponormal and k-paranormal unbounded operators. Furthermore, we extend the Kaplansky theorem for normal operators to some (p; k)-quasihyponormal operators. Namely the (p; k)-quasihyponormality of the product AB and BA for two operators.
△ Less
Submitted 8 February, 2016;
originally announced February 2016.
-
Stop Wasting My Gradients: Practical SVRG
Authors:
Reza Babanezhad,
Mohamed Osama Ahmed,
Alim Virani,
Mark Schmidt,
Jakub Konečný,
Scott Sallinen
Abstract:
We present and analyze several strategies for improving the performance of stochastic variance-reduced gradient (SVRG) methods. We first show that the convergence rate of these methods can be preserved under a decreasing sequence of errors in the control variate, and use this to derive variants of SVRG that use growing-batch strategies to reduce the number of gradient calculations required in the…
▽ More
We present and analyze several strategies for improving the performance of stochastic variance-reduced gradient (SVRG) methods. We first show that the convergence rate of these methods can be preserved under a decreasing sequence of errors in the control variate, and use this to derive variants of SVRG that use growing-batch strategies to reduce the number of gradient calculations required in the early iterations. We further (i) show how to exploit support vectors to reduce the number of gradient computations in the later iterations, (ii) prove that the commonly-used regularized SVRG iteration is justified and improves the convergence rate, (iii) consider alternate mini-batch selection strategies, and (iv) consider the generalization error of the method.
△ Less
Submitted 5 November, 2015;
originally announced November 2015.
-
Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields
Authors:
Mark Schmidt,
Reza Babanezhad,
Mohamed Osama Ahmed,
Aaron Defazio,
Ann Clifton,
Anoop Sarkar
Abstract:
We apply stochastic average gradient (SAG) algorithms for training conditional random fields (CRFs). We describe a practical implementation that uses structure in the CRF gradient to reduce the memory requirement of this linearly-convergent stochastic gradient method, propose a non-uniform sampling scheme that substantially improves practical performance, and analyze the rate of convergence of the…
▽ More
We apply stochastic average gradient (SAG) algorithms for training conditional random fields (CRFs). We describe a practical implementation that uses structure in the CRF gradient to reduce the memory requirement of this linearly-convergent stochastic gradient method, propose a non-uniform sampling scheme that substantially improves practical performance, and analyze the rate of convergence of the SAGA variant under non-uniform sampling. Our experimental results reveal that our method often significantly outperforms existing methods in terms of the training objective, and performs as well or better than optimally-tuned stochastic gradient methods in terms of test error.
△ Less
Submitted 16 April, 2015;
originally announced April 2015.
-
Decentralized, Adaptive, Look-Ahead Particle Filtering
Authors:
Mohamed Osama Ahmed,
Pouyan T. Bibalan,
Nando de Freitas,
Simon Fauvel
Abstract:
The decentralized particle filter (DPF) was proposed recently to increase the level of parallelism of particle filtering. Given a decomposition of the state space into two nested sets of variables, the DPF uses a particle filter to sample the first set and then conditions on this sample to generate a set of samples for the second set of variables. The DPF can be understood as a variant of the popu…
▽ More
The decentralized particle filter (DPF) was proposed recently to increase the level of parallelism of particle filtering. Given a decomposition of the state space into two nested sets of variables, the DPF uses a particle filter to sample the first set and then conditions on this sample to generate a set of samples for the second set of variables. The DPF can be understood as a variant of the popular Rao-Blackwellized particle filter (RBPF), where the second step is carried out using Monte Carlo approximations instead of analytical inference. As a result, the range of applications of the DPF is broader than the one for the RBPF. In this paper, we improve the DPF in two ways. First, we derive a Monte Carlo approximation of the optimal proposal distribution and, consequently, design and implement a more efficient look-ahead DPF. Although the decentralized filters were initially designed to capitalize on parallel implementation, we show that the look-ahead DPF can outperform the standard particle filter even on a single machine. Second, we propose the use of bandit algorithms to automatically configure the state space decomposition of the DPF.
△ Less
Submitted 11 March, 2012;
originally announced March 2012.