Skip to main content

Showing 1–36 of 36 results for author: Ahmed, O

.
  1. arXiv:2407.02312  [pdf, other

    physics.space-ph astro-ph.EP astro-ph.SR

    Dynamics and solar wind control of the recovery of strong geomagnetic storms

    Authors: O. Ahmed, B. Badruddin, M. Derouich

    Abstract: In this work, we studied the characteristics and dynamical changes during the recovery time of moderate and strong geomagnetic storms (Dst $<-50$ nT). Investigating 57 storms triggered by CMEs/CIRs, we focused on the solar wind's influence on their decay phases. Selected storms were classified into distinct groups based on their recovery characteristics. Using superposed epoch analysis and best fi… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at Astrophysics and Space Science journal

  2. arXiv:2405.13956  [pdf, other

    cs.LG

    Attention as an RNN

    Authors: Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Mohamed Osama Ahmed, Yoshua Bengio, Greg Mori

    Abstract: The advent of Transformers marked a significant breakthrough in sequence modelling, providing a highly performant architecture capable of leveraging GPU parallelism. However, Transformers are computationally expensive at inference time, limiting their applications, particularly in low-resource settings (e.g., mobile and embedded devices). Addressing this, we (1) begin by showing that attention can… ▽ More

    Submitted 28 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2405.03390  [pdf, other

    quant-ph

    Prediction of chaotic dynamics and extreme events: A recurrence-free quantum reservoir computing approach

    Authors: Osama Ahmed, Felix Tennie, Luca Magri

    Abstract: In chaotic dynamical systems, extreme events manifest in time series as unpredictable large-amplitude peaks. Although deterministic, extreme events appear seemingly randomly, which makes their forecasting difficult. By learning the dynamics from observables (data), reservoir computers can time-accurately predict extreme events and chaotic dynamics, but they may require many degrees of freedom (lar… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  4. arXiv:2402.06935  [pdf, other

    cs.DS q-bio.GN q-bio.PE

    Taxonomic classification with maximal exact matches in KATKA kernels and minimizer digests

    Authors: Dominika Draesslerová, Omar Ahmed, Travis Gagie, Jan Holub, Ben Langmead, Giovanni Manzini, Gonzalo Navarro

    Abstract: For taxonomic classification, we are asked to index the genomes in a phylogenetic tree such that later, given a DNA read, we can quickly choose a small subtree likely to contain the genome from which that read was drawn. Although popular classifiers such as Kraken use $k$-mers, recent research indicates that using maximal exact matches (MEMs) can lead to better classifications. For example, we can… ▽ More

    Submitted 4 April, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  5. arXiv:2402.03261  [pdf, other

    astro-ph.SR astro-ph.IM physics.space-ph

    Characteristics and development of the main phase disturbance in geomagnetic storms (Dst $\le$ -50 nT)

    Authors: Osman M. Ahmed, Badruudin Zaheer Ahmad, Moncef Derouich

    Abstract: We present geomagnetic storms (GSs) selected from three solar cycles, spanning the years 1995 to 2022. We studied the development of the main phase of storms within disturbance storm time (Dst) amplitudes ranging from Dst =-64 nT to Dst=- 422 nT. In order to determine the solar wind (SW) parameters that mainly influence the main phase development of a GS, which can best describe the SW-magnetosphe… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in Advances in Space Research

  6. Adaptive Prognostic Malfunction Based Processor for Autonomous Landing Guidance Assistance System Using FPGA

    Authors: Hossam O. Ahmed, David Wyatt

    Abstract: The demand for more developed and agile urban taxi drones is increasing rapidly nowadays to sustain crowded cities and their traffic issues. The critical factor for spreading such technology could be related to the safety criteria that must be considered. One of the most critical safety aspects for such vertical and/or Short Take-Off and Landing (V/STOL) drones is related to safety during the land… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: Published in: IEEE Access ( Volume: 12) - Page(s): 2113 - 2122

  7. arXiv:2311.02891  [pdf, other

    cs.LG

    AdaFlood: Adaptive Flood Regularization

    Authors: Wonho Bae, Yi Ren, Mohamad Osama Ahmed, Frederick Tung, Danica J. Sutherland, Gabriel L. Oliveira

    Abstract: Although neural networks are conventionally optimized towards zero training loss, it has been recently learned that targeting a non-zero training loss threshold, referred to as a flood level, often enables better test time generalization. Current approaches, however, apply the same constant flood level to all training samples, which inherently assumes all the samples have the same difficulty. We p… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  8. arXiv:2310.13965  [pdf, other

    eess.SP

    Cloud-Connected Wireless Holter Monitor Machine with Neural Networks Based ECG Analysis for Remote Health Monitoring

    Authors: Azlaan Ranjha, Laiba Jabbar, Osaid Ahmed

    Abstract: This study describes the creation of a wireless, transportable Holter monitor to improve the accuracy of cardiac disease diagnosis. The main goal of this study is to develop a low-cost cardiac screening system suited explicitly for underprivileged areas, addressing the rising rates of cardiovascular death. The suggested system includes a wireless Electrocardiogram (ECG) module for real-time cardia… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  9. arXiv:2309.17388  [pdf, other

    cs.LG

    Tree Cross Attention

    Authors: Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed

    Abstract: Cross Attention is a popular method for retrieving information from a set of context tokens for making predictions. At inference time, for each prediction, Cross Attention scans the full set of $\mathcal{O}(N)$ tokens. In practice, however, often only a small subset of tokens are required for good performance. Methods such as Perceiver IO are cheap at inference as they distill the information to a… ▽ More

    Submitted 1 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted by ICLR 2024

  10. arXiv:2309.03544  [pdf, other

    cs.SD cs.LG eess.AS

    MVD:A Novel Methodology and Dataset for Acoustic Vehicle Type Classification

    Authors: Mohd Ashhad, Omar Ahmed, Sooraj K. Ambat, Zeeshan Ali Haq, Mansaf Alam

    Abstract: Rising urban populations have led to a surge in vehicle use and made traffic monitoring and management indispensable. Acoustic traffic monitoring (ATM) offers a cost-effective and efficient alternative to more computationally expensive methods of monitoring traffic such as those involving computer vision technologies. In this paper, we present MVD and MVDA: two open datasets for the development of… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  11. ANER: Arabic and Arabizi Named Entity Recognition using Transformer-Based Approach

    Authors: Abdelrahman "Boda" Sadallah, Omar Ahmed, Shimaa Mohamed, Omar Hatem, Doaa Hesham, Ahmed H. Yousef

    Abstract: One of the main tasks of Natural Language Processing (NLP), is Named Entity Recognition (NER). It is used in many applications and also can be used as an intermediate step for other tasks. We present ANER, a web-based named entity recognizer for the Arabic, and Arabizi languages. The model is built upon BERT, which is a transformer-based encoder. It can recognize 50 different entity classes, cover… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  12. arXiv:2306.12599  [pdf, other

    cs.LG

    Constant Memory Attention Block

    Authors: Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed

    Abstract: Modern foundation model architectures rely on attention mechanisms to effectively capture context. However, these methods require linear or quadratic memory in terms of the number of inputs/datapoints, limiting their applicability in low-compute domains. In this work, we propose Constant Memory Attention Block (CMAB), a novel general-purpose attention block that computes its output in constant mem… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Workshop version of arXiv:2305.14567

  13. Segregated FLS Processing Cores for V/STOL Autonomous Landing Guidance Assistant System using FPGA

    Authors: Hossam O. Ahmed

    Abstract: It is highly predicted that the roads and parking areas will be extremely congested with vehicles to the point that searching for a novel solution will not be an optional choice for conserving the sustainability rate of the overall humanity's development growth. Such issue could be overcome by develo** modified generations of the Urban Air Mobility (UAM) vehicles that essentially depend on the V… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 2021 Integrated Communications Navigation and Surveillance Conference (ICNS), Dulles, VA, USA

  14. arXiv:2305.14567  [pdf, other

    cs.LG cs.CV

    Memory Efficient Neural Processes via Constant Memory Attention Block

    Authors: Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed

    Abstract: Neural Processes (NPs) are popular meta-learning methods for efficiently modelling predictive uncertainty. Recent state-of-the-art methods, however, leverage expensive attention mechanisms, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant that only requires constant memory. To do so, we f… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  15. arXiv:2305.01771  [pdf

    eess.SY cs.DC

    Fault Tolerant Processing Unit Using Gamma Distribution Sliding Window For Autonomous Landing Guidance System

    Authors: Hossam O. Ahmed

    Abstract: To keep up with today's dense metropolitan areas and their accompanying traffic problems, a growing number of towns are looking for more advanced and swift urban taxi drones. The safety parameters that must be taken into consideration may be the most important element in the widespread use of such technology. Most recent aviation mishaps have happened during the landing phase, making this a partic… ▽ More

    Submitted 3 June, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 21st IEEE Interregional NEWCAS Conference, Edinburgh, Scotland

  16. arXiv:2304.02099  [pdf

    eess.SY cs.AR cs.DC eess.SP

    Coarse Grained FLS-based Processor with Prognostic Malfunction Feature for UAM Drones using FPGA

    Authors: Hossam O. Ahmed

    Abstract: Many overall safety factors need to be considered in the next generation of Urban Air Mobility (UAM) systems and addressing these can become the anchor point for such technology to reach consent for worldwide application. On the other hand, fulfilling the safety requirements from an exponential increase of prolific UAM systems, is extremely complicated, and requires careful consideration of a vari… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: The paper is accepted

    Journal ref: The 23rd Integrated Communications, Navigation, and Surveillance Conference, 2023, USA

  17. Numerical Investigation of a Rotating Double Compression Ramp Intake

    Authors: Lubna Margha, Ahmed A. Hamada, Othman Ahmed, Ahmed Eltaweel

    Abstract: The intakes of air-breathing high-speed flying vehicles produce a large share of the thrust propulsion. Furthermore, the propulsion performance of these engines increases when the single-ramp intake is replaced with a multiple-ramps intake. Many scholars numerically and experimentally studied the high-speed engine performance over static single and multiple compression ramps. However, the transien… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 8 pages, 9 figures, Proceedings of the ASME 2022 Fluids Engineering Division Summer Meeting, ASME, https://doi.org/10.1115/FEDSM2022-87753

  18. arXiv:2301.12023  [pdf, other

    cs.LG

    Meta Temporal Point Processes

    Authors: Wonho Bae, Mohamed Osama Ahmed, Frederick Tung, Gabriel L. Oliveira

    Abstract: A temporal point process (TPP) is a stochastic process where its realization is a sequence of discrete events in time. Recent work in TPPs model the process using a neural network in a supervised learning framework, where a training set is a collection of all the sequences. In this work, we propose to train TPPs in a meta learning framework, where each sequence is treated as a different task, via… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: Accepted to ICLR2023

  19. arXiv:2211.10564  [pdf, other

    cs.LG cs.CV

    Gumbel-Softmax Selective Networks

    Authors: Mahmoud Salem, Mohamed Osama Ahmed, Frederick Tung, Gabriel Oliveira

    Abstract: ML models often operate within the context of a larger system that can adapt its response when the ML model is uncertain, such as falling back on safe defaults or a human in the loop. This commonly encountered operational context calls for principled techniques for training ML models with the option to abstain from predicting when uncertain. Selective neural networks are trained with an integrated… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

  20. arXiv:2211.08458  [pdf, other

    cs.LG cs.AI

    Latent Bottlenecked Attentive Neural Processes

    Authors: Leo Feng, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed

    Abstract: Neural Processes (NPs) are popular methods in meta-learning that can estimate predictive uncertainty on target datapoints by conditioning on a context dataset. Previous state-of-the-art method Transformer Neural Processes (TNPs) achieve strong performance but require quadratic computation with respect to the number of context datapoints, significantly limiting its scalability. Conversely, existing… ▽ More

    Submitted 1 March, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  21. arXiv:2206.10678  [pdf, other

    physics.optics quant-ph

    An Optical Parametric Amplifier via $ χ^{(2)} $ in AlGaAs Waveguides

    Authors: Zhizhong Yan, Haoyu He, Han Liu, Meng Iu, Osman Ahmed, Eric Chen, Youichi Akasaka, Tadashi Ikeuchi, Amr S. Helmy

    Abstract: We report parametric gain by utilizing $ χ^{(2)} $ non-linearities in a semiconductor Bragg Reflection Waveguide (BRW) waveguide chip. Under the two-mode degenerate type II phase matching, it can be shown that more than 18 dBs of parametric gain for both TE and TM modes is tenable in 100s of micrometers of device length. Polarization insensitive parametric gain can be attained within the 1550 nm r… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 9 pages, 9 figures

  22. arXiv:2206.09034  [pdf, other

    cs.LG cs.AI cs.CV

    Towards Better Selective Classification

    Authors: Leo Feng, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Amir Abdi

    Abstract: We tackle the problem of Selective Classification where the objective is to achieve the best performance on a predetermined ratio (coverage) of the dataset. Recent state-of-the-art selective methods come with architectural changes either via introducing a separate selection head or an extra abstention logit. In this paper, we challenge the aforementioned methods. The results suggest that the super… ▽ More

    Submitted 1 March, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

  23. arXiv:2206.00643  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.HE

    Quantifying the dust in SN 2012aw and iPTF14hls with ORBYTS

    Authors: Maria Niculescu-Duvaz, M. J. Barlow, W. Dunn, A. Bevan, Omar Ahmed, David Arkless, Jon Barker, Sidney Bartolotta, Liam Brockway, Daniel Browne, Ubaid Esmail, Max Garner, Wiktoria Guz, Scarlett King, Hayri Kose, Madeline Lampstaes-Capes, Joseph Magen, Nicole Morrison, Kyaw Oo, Balvinder Paik, Joanne Primrose, Danny Quick, Anais Radeka, Anthony Rodney, Eleanor Sandeman , et al. (10 additional authors not shown)

    Abstract: Core-collapse supernovae (CCSNe) are potentially capable of producing large quantities of dust, with strong evidence that ejecta dust masses can grow significantly over extended periods of time. Red-blue asymmetries in the broad emission lines of CCSNe can be modelled using the Monte Carlo radiative transfer code DAMOCLES, to determine ejecta dust masses. To facilitate easier use of DAMOCLES, we p… ▽ More

    Submitted 4 January, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Accepted by MNRAS on 28/11/2022, 10 pages, 6 figures. Author accepted manuscript

  24. arXiv:2205.08247  [pdf, other

    cs.LG cs.AI

    Monotonicity Regularization: Improved Penalties and Novel Applications to Disentangled Representation Learning and Robust Classification

    Authors: Joao Monteiro, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Greg Mori

    Abstract: We study settings where gradient penalties are used alongside risk minimization with the goal of obtaining predictors satisfying different notions of monotonicity. Specifically, we present two sets of contributions. In the first part of the paper, we show that different choices of penalties define the regions of the input space where the property is observed. As such, previous methods result in mo… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: Accepted to UAI 2022

  25. arXiv:2112.13168  [pdf, other

    q-bio.QM cs.LG

    AI-Bind: Improving Binding Predictions for Novel Protein Targets and Ligands

    Authors: Ayan Chatterjee, Robin Walters, Zohair Shafi, Omair Shafi Ahmed, Michael Sebek, Deisy Gysi, Rose Yu, Tina Eliassi-Rad, Albert-László Barabási, Giulia Menichetti

    Abstract: Identifying novel drug-target interactions (DTI) is a critical and rate limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We first unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortc… ▽ More

    Submitted 9 November, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

    Comments: 83 pages, 26 figures, all references moved to a single section, new results added on AI interpretability, added comparison with MolTrans, added validation using gold standard experimental data

  26. arXiv:2010.04296  [pdf, other

    cs.RO cs.LG stat.ML

    CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

    Authors: Ossama Ahmed, Frederik Träuble, Anirudh Goyal, Alexander Neitz, Yoshua Bengio, Bernhard Schölkopf, Manuel Wüthrich, Stefan Bauer

    Abstract: Despite recent successes of reinforcement learning (RL), it remains a challenge for agents to transfer learned skills to related environments. To facilitate research addressing this problem, we propose CausalWorld, a benchmark for causal structure and transfer learning in a robotic manipulation environment. The environment is a simulation of an open-source robotic platform, hence offering the poss… ▽ More

    Submitted 24 November, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: The first two authors contributed equally, the last two authors avised jointly

  27. arXiv:2009.12388  [pdf, other

    physics.app-ph

    The Effect of Pitch Distance on the Statistics and Morphology of Through-Silicon Via Extrusion

    Authors: Golareh Jalilvand, Omar Ahmed, Nicolas Dube, Tengfei Jiang

    Abstract: In this work, we investigated the effect of pitch distance on the statistical variation and morphology of extrusion in Cu TSVs and the underlying mechanisms. Extrusion statistics were obtained from TSV samples with two different pitch distances. A notable increase in the magnitude of extrusion was observed in vias with smaller pitch, yet the extrusion spread was largely unaffected. The morphologie… ▽ More

    Submitted 25 September, 2020; originally announced September 2020.

  28. arXiv:1910.08281  [pdf, other

    cs.LG stat.ML

    Point Process Flows

    Authors: Nazanin Mehrasa, Ruizhi Deng, Mohamed Osama Ahmed, Bo Chang, Jiawei He, Thibaut Durand, Marcus Brubaker, Greg Mori

    Abstract: Event sequences can be modeled by temporal point processes (TPPs) to capture their asynchronous and probabilistic nature. We propose an intensity-free framework that directly models the point process distribution by utilizing normalizing flows. This approach is capable of capturing highly complex temporal distributions and does not rely on restrictive parametric forms. Comparisons with state-of-th… ▽ More

    Submitted 22 December, 2019; v1 submitted 18 October, 2019; originally announced October 2019.

  29. arXiv:1904.03603  [pdf, other

    cs.NE q-bio.NC

    Human Intracranial EEG Quantitative Analysis and Automatic Feature Learning for Epileptic Seizure Prediction

    Authors: Ramy Hussein, Mohamed Osama Ahmed, Rabab Ward, Z. Jane Wang, Levin Kuhlmann, Yi Guo

    Abstract: Objective: The aim of this study is to develop an efficient and reliable epileptic seizure prediction system using intracranial EEG (iEEG) data, especially for people with drug-resistant epilepsy. The prediction procedure should yield accurate results in a fast enough fashion to alert patients of impending seizures. Methods: We quantitatively analyze the human iEEG data to obtain insights into how… ▽ More

    Submitted 7 April, 2019; originally announced April 2019.

  30. arXiv:1810.04336  [pdf, other

    cs.LG stat.ML

    Combining Bayesian Optimization and Lipschitz Optimization

    Authors: Mohamed Osama Ahmed, Sharan Vaswani, Mark Schmidt

    Abstract: Bayesian optimization and Lipschitz optimization have developed alternative techniques for optimizing black-box functions. They each exploit a different form of prior about the function. In this work, we explore strategies to combine these techniques for better global optimization. In particular, we propose ways to use the Lipschitz continuity assumption within traditional BO algorithms, which we… ▽ More

    Submitted 28 July, 2020; v1 submitted 9 October, 2018; originally announced October 2018.

  31. arXiv:1610.03462  [pdf, ps, other

    math.FA

    $(α,β)$-A-Normal Operators in Semi-Hilbertian Spaces

    Authors: Abdelkader Benali, Ould Ahmed Mahmoud Sid Ahmed

    Abstract: In this paper we introduce and prove some properties of $(α;β)$-normal operators according to semi-Hilbertian space structures. Furthermore we s,ate various inequalities between the A-operator norm and A-numerical radius of $(α,β)$-normal operators in semi Hilbertian spaces.

    Submitted 11 October, 2016; originally announced October 2016.

  32. arXiv:1602.08485  [pdf, ps, other

    math.FA math.OA

    On a joint $(m, (q_1, ..., q_d))$-partial isometries and a joint $m$-invertible $d$-tuple of operators on a Hilbert space

    Authors: Ould Ahmed Mahmoud Sid Ahmed

    Abstract: The aim of the present paper is, firstly we study the concepts of (m, (q_1, ..., q_d))- partial isometries on a Hilbert space, secondly, we introduce the notion of m- invertibility of tuples of operators as a natural generalization of the m-invertibility in single variable operators.

    Submitted 26 February, 2016; originally announced February 2016.

    MSC Class: Primary: 17A13. Secondary: 47A16

  33. arXiv:1602.02748  [pdf, ps, other

    math.FA

    Generalizations of Kaplansky Theorem for some (p,k)-Quasihyponormal Operators

    Authors: Abdelkader Benali, Ould Ahmed Mahmoud Sid Ahmed

    Abstract: In the present paper, we generalized some notions of bounded operators to un- bounded operators on Hilbert space such as k-quasihyponormal and k-paranormal unbounded operators. Furthermore, we extend the Kaplansky theorem for normal operators to some (p; k)-quasihyponormal operators. Namely the (p; k)-quasihyponormality of the product AB and BA for two operators.

    Submitted 8 February, 2016; originally announced February 2016.

    MSC Class: 47B15 (Primary); 46L10 (Secondary)

  34. arXiv:1511.01942  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    Stop Wasting My Gradients: Practical SVRG

    Authors: Reza Babanezhad, Mohamed Osama Ahmed, Alim Virani, Mark Schmidt, Jakub Konečný, Scott Sallinen

    Abstract: We present and analyze several strategies for improving the performance of stochastic variance-reduced gradient (SVRG) methods. We first show that the convergence rate of these methods can be preserved under a decreasing sequence of errors in the control variate, and use this to derive variants of SVRG that use growing-batch strategies to reduce the number of gradient calculations required in the… ▽ More

    Submitted 5 November, 2015; originally announced November 2015.

  35. arXiv:1504.04406  [pdf, other

    stat.ML cs.LG math.OC stat.CO

    Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields

    Authors: Mark Schmidt, Reza Babanezhad, Mohamed Osama Ahmed, Aaron Defazio, Ann Clifton, Anoop Sarkar

    Abstract: We apply stochastic average gradient (SAG) algorithms for training conditional random fields (CRFs). We describe a practical implementation that uses structure in the CRF gradient to reduce the memory requirement of this linearly-convergent stochastic gradient method, propose a non-uniform sampling scheme that substantially improves practical performance, and analyze the rate of convergence of the… ▽ More

    Submitted 16 April, 2015; originally announced April 2015.

    Comments: AI/Stats 2015, 24 pages

  36. arXiv:1203.2394  [pdf, other

    stat.ML cs.LG stat.CO

    Decentralized, Adaptive, Look-Ahead Particle Filtering

    Authors: Mohamed Osama Ahmed, Pouyan T. Bibalan, Nando de Freitas, Simon Fauvel

    Abstract: The decentralized particle filter (DPF) was proposed recently to increase the level of parallelism of particle filtering. Given a decomposition of the state space into two nested sets of variables, the DPF uses a particle filter to sample the first set and then conditions on this sample to generate a set of samples for the second set of variables. The DPF can be understood as a variant of the popu… ▽ More

    Submitted 11 March, 2012; originally announced March 2012.

    Comments: 16 pages, 11 figures, Authorship in alphabetical order