Showing 1–29 of 29 results for author: Attia, A

Searching in archive cs.
  1. arXiv:2406.05830  [pdf, other]

    math.OC cs.CE cs.LG math.CO stat.AP

    Probabilistic Approach to Black-Box Binary Optimization with Budget Constraints: Application to Sensor Placement

    Authors: Ahmed Attia

    Abstract: We present a fully probabilistic approach for solving binary optimization problems with black-box objective functions and with budget constraints. In the probabilistic approach, the optimization variable is viewed as a random variable and is associated with a parametric probability distribution. The original optimization problem is replaced with an optimization over the expected value of the origi…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 54 pages, 20 figures, 6 sections, 2 appendices

    MSC Class: 90C27; 60C05; 62K05; 35R30; 35Q93; 65C60; 93E35

  2. arXiv:2405.13018  [pdf, other]

    cs.CL cs.AI eess.AS

    Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings

    Authors: Ahmed Adel Attia, Dorottya Demszky, Tolulope Ogunremi, Jing Liu, Carol Espy-Wilson

    Abstract: Creating Automatic Speech Recognition (ASR) systems that are robust and resilient to classroom conditions is paramount to the development of AI tools to aid teachers and students. In this work, we study the efficacy of continued pretraining (CPT) in adapting Wav2vec2.0 to the classroom domain. We show that CPT is a powerful tool in that regard and reduces the Word Error Rate (WER) of Wav2vec2.0-ba…

    Submitted 15 May, 2024; originally announced May 2024.

  3. arXiv:2404.03661  [pdf, other]

    cs.SE cs.CE

    Benchmarking formalisms for dynamic structure system Modeling and Simulation

    Authors: Aya Attia, Clément Foucher, Luiz Fernando Lavado Villa

    Abstract: Modeling and simulation of complex systems is key to explore systems dynamics. Many scientific approaches were developed to represent dynamic structure systems but most of these approaches are efficient for some kinds of systems and inefficient for others. Which approach can be adopted for different dynamic structure systems categories is a topic of interest for many researchers and until now has…

    Submitted 25 January, 2024; originally announced April 2024.

  4. arXiv:2403.02873  [pdf, other]

    cs.LG cs.DS math.PR

    A Note on High-Probability Analysis of Algorithms with Exponential, Sub-Gaussian, and General Light Tails

    Authors: Amit Attia, Tomer Koren

    Abstract: This short note describes a simple technique for analyzing probabilistic algorithms that rely on a light-tailed (but not necessarily bounded) source of randomization. We show that the analysis of such an algorithm can be reduced, in a black-box manner and with only a small loss in logarithmic factors, to an analysis of a simpler variant of the same algorithm that uses bounded random variables and…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 9 pages

  5. arXiv:2402.03126  [pdf, other]

    cs.LG math.OC stat.ML

    How Free is Parameter-Free Stochastic Optimization?

    Authors: Amit Attia, Tomer Koren

    Abstract: We study the problem of parameter-free stochastic optimization, inquiring whether, and under what conditions, do fully parameter-free methods exist: these are methods that achieve convergence rates competitive with optimally tuned methods, without requiring significant knowledge of the true problem parameters. Existing parameter-free methods can only be considered ``partially'' parameter-free, as…

    Submitted 18 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 28 pages

  6. arXiv:2311.07676  [pdf, other]

    math.OC cs.CE

    Centralized calibration of power system dynamic models using variational data assimilation

    Authors: Ahmed Attia, D. Adrian Maldonado, Emil Constantinescu, Mihai Anitescu

    Abstract: This paper presents a novel centralized, variational data assimilation approach for calibrating transient dynamic models in electrical power systems, focusing on load model parameters. With the increasing importance of inverter-based resources, assessing power systems' dynamic performance under disturbances has become challenging, necessitating robust model calibration methods. The proposed approa…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 9 pages, 8 figures, and 1 table

  7. arXiv:2309.09220  [pdf, other]

    eess.AS cs.AI cs.SD

    Improving Speech Inversion Through Self-Supervised Embeddings and Enhanced Tract Variables

    Authors: Ahmed Adel Attia, Yashish M. Siriwardena, Carol Espy-Wilson

    Abstract: The performance of deep learning models depends significantly on their capacity to encode input features efficiently and decode them into meaningful outputs. Better input and output representation has the potential to boost models' performance and generalization. In the context of acoustic-to-articulatory speech inversion (SI) systems, we study the impact of utilizing speech representations acquir…

    Submitted 17 September, 2023; originally announced September 2023.

  8. arXiv:2309.07927  [pdf, ps, other]

    eess.AS cs.CL cs.SD

    Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults

    Authors: Ahmed Adel Attia, Jing Liu, Wei Ai, Dorottya Demszky, Carol Espy-Wilson

    Abstract: Recent advancements in Automatic Speech Recognition (ASR) systems, exemplified by Whisper, have demonstrated the potential of these systems to approach human-level performance given sufficient data. However, this progress doesn't readily extend to ASR for children due to the limited availability of suitable child-specific databases and the distinct characteristics of children's speech. A recent st…

    Submitted 15 May, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  9. Enhancing Speech Articulation Analysis using a Geometric Transformation of the X-ray Microbeam Dataset

    Authors: Ahmed Adel Attia, Mark Tiede, Carol Y. Espy-Wilson

    Abstract: Accurate analysis of speech articulation is crucial for speech analysis. However, X-Y coordinates of articulators strongly depend on the anatomy of the speakers and the variability of pellet placements, and existing methods for mapping anatomical landmarks in the X-ray Microbeam Dataset (XRMB) fail to capture the entire anatomy of the vocal tract. In this paper, we propose a new geometric transfor…

    Submitted 28 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  10. arXiv:2305.03855  [pdf, other]

    math.OC cs.LG

    Robust A-Optimal Experimental Design for Bayesian Inverse Problems

    Authors: Ahmed Attia, Sven Leyffer, Todd Munson

    Abstract: Optimal design of experiments for Bayesian inverse problems has recently gained wide popularity and attracted much attention, especially in the computational science and Bayesian inversion communities. An optimal design maximizes a predefined utility function that is formulated in terms of the elements of an inverse problem, an example being optimal sensor placement for parameter identification. T…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 25 pages, 11 figures

    MSC Class: 62K05; 35Q62; 62F15; 35R30; 35Q93; 65C60; 93E35

  11. arXiv:2302.08783  [pdf, ps, other]

    cs.LG math.OC stat.ML

    SGD with AdaGrad Stepsizes: Full Adaptivity with High Probability to Unknown Parameters, Unbounded Gradients and Affine Variance

    Authors: Amit Attia, Tomer Koren

    Abstract: We study Stochastic Gradient Descent with AdaGrad stepsizes: a popular adaptive (self-tuning) method for first-order stochastic optimization. Despite being well studied, existing analyses of this method suffer from various shortcomings: they either assume some knowledge of the problem parameters, impose strong global Lipschitz conditions, or fail to give bounds that hold with high probability. We…

    Submitted 11 June, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: 27 pages

  12. arXiv:2301.08336  [pdf, other]

    cs.MS

    PyOED: An Extensible Suite for Data Assimilation and Model-Constrained Optimal Design of Experiments

    Authors: Abhijit Chowdhary, Shady E. Ahmed, Ahmed Attia

    Abstract: This paper describes PyOED, a highly extensible scientific package that enables developing and testing model-constrained optimal experimental design (OED) for inverse problems. Specifically, PyOED aims to be a comprehensive Python toolkit for model-constrained OED. The package targets scientists and researchers interested in understanding the details of OED formulations and approaches. It is also…

    Submitted 19 December, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: 22 pages, 8 figures

    MSC Class: 68Vxx

  13. Masked Autoencoders Are Articulatory Learners

    Authors: Ahmed Adel Attia, Carol Espy-Wilson

    Abstract: Articulatory recordings track the positions and motion of different articulators along the vocal tract and are widely used to study speech production and to develop speech technologies such as articulatory based speech synthesizers and speech inversion systems. The University of Wisconsin X-Ray microbeam (XRMB) dataset is one of various datasets that provide articulatory recordings synced with aud…

    Submitted 18 May, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

  14. arXiv:2207.08257  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Uniform Stability for First-Order Empirical Risk Minimization

    Authors: Amit Attia, Tomer Koren

    Abstract: We consider the problem of designing uniformly stable first-order optimization algorithms for empirical risk minimization. Uniform stability is often used to obtain generalization error bounds for optimization algorithms, and we are interested in a general approach to achieve it. For Euclidean geometry, we suggest a black-box conversion which given a smooth optimization algorithm, produces a unifo…

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: 18 pages, Proceedings of Thirty Fifth Conference on Learning Theory, PMLR 178:3313-3332, 2022

  15. arXiv:2102.02167  [pdf, other]

    cs.LG math.OC stat.ML

    Algorithmic Instabilities of Accelerated Gradient Descent

    Authors: Amit Attia, Tomer Koren

    Abstract: We study the algorithmic stability of Nesterov's accelerated gradient method. For convex quadratic objectives, Chen et al. (2018) proved that the uniform stability of the method grows quadratically with the number of optimization steps, and conjectured that the same is true for the general convex and smooth case. We disprove this conjecture and show, for two notions of algorithmic stability (inclu…

    Submitted 19 June, 2021; v1 submitted 3 February, 2021; originally announced February 2021.

    Comments: 37 pages

  16. arXiv:2101.05958  [pdf, other]

    math.OC cs.LG

    Stochastic Learning Approach to Binary Optimization for Optimal Design of Experiments

    Authors: Ahmed Attia, Sven Leyffer, Todd Munson

    Abstract: We present a novel stochastic approach to binary optimization for optimal experimental design (OED) for Bayesian inverse problems governed by mathematical models such as partial differential equations. The OED utility function, namely, the regularized optimality criterion, is cast into a stochastic objective function in the form of an expectation over a multivariate Bernoulli distribution. The pro…

    Submitted 14 January, 2021; originally announced January 2021.

    Comments: 34 pages, 12 figures

  17. arXiv:2006.03048  [pdf, other]

    cs.IT cs.DB cs.IR

    Asymmetric Leaky Private Information Retrieval

    Authors: Islam Samy, Mohamed A. Attia, Ravi Tandon, Loukas Lazos

    Abstract: Information-theoretic formulations of the private information retrieval (PIR) problem have been investigated under a variety of scenarios. Symmetric private information retrieval (SPIR) is a variant where a user is able to privately retrieve one out of $K$ messages from $N$ non-colluding replicated databases without learning anything about the remaining $K-1$ messages. However, the goal of perfect…

    Submitted 4 June, 2020; originally announced June 2020.

  18. arXiv:2001.05998  [pdf, other]

    cs.IT cs.DB cs.IR

    Latent-variable Private Information Retrieval

    Authors: Islam Samy, Mohamed A. Attia, Ravi Tandon, Loukas Lazos

    Abstract: In many applications, content accessed by users (movies, videos, news articles, etc.) can leak sensitive latent attributes, such as religious and political views, sexual orientation, ethnicity, gender, and others. To prevent such information leakage, the goal of classical PIR is to hide the identity of the content/message being accessed, which subsequently also hides the latent attributes. This so…

    Submitted 14 May, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

  19. arXiv:1806.10655  [pdf, other]

    cs.CE stat.AP

    An Optimal Experimental Design Framework for Adaptive Inflation and Covariance Localization for Ensemble Filters

    Authors: Ahmed Attia, Emil Constantinescu

    Abstract: We develop an optimal experimental design framework for adapting the covariance inflation and localization in data assimilation problems. Covariance inflation and localization are ubiquitously employed to alleviate the effect of using ensembles of finite sizes in all practical data assimilation systems. The choice of both the inflation factor and the localization radius can have a significant impa…

    Submitted 24 March, 2019; v1 submitted 27 June, 2018; originally announced June 2018.

    Comments: 31 pages, 15 figures

  20. arXiv:1805.04104  [pdf, other]

    cs.IT cs.CR cs.DB cs.IR

    The Capacity of Private Information Retrieval from Uncoded Storage Constrained Databases

    Authors: Mohamed Adel Attia, Deepak Kumar, Ravi Tandon

    Abstract: Private information retrieval (PIR) allows a user to retrieve a desired message from a set of databases without revealing the identity of the desired message. The replicated databases scenario was considered by Sun and Jafar, 2016, where $N$ databases can store the same $K$ messages completely. A PIR scheme was developed to achieve the optimal download cost given by…

    Submitted 23 October, 2018; v1 submitted 10 May, 2018; originally announced May 2018.

  21. arXiv:1802.06517  [pdf, other]

    cs.CE math.NA math.OC stat.AP

    Goal-Oriented Optimal Design of Experiments for Large-Scale Bayesian Linear Inverse Problems

    Authors: Ahmed Attia, Alen Alexanderian, Arvind K. Saibaba

    Abstract: We develop a framework for goal-oriented optimal design of experiments (GOODE) for large-scale Bayesian linear inverse problems governed by PDEs. This framework differs from classical Bayesian optimal design of experiments (ODE) in the following sense: we seek experimental designs that minimize the posterior uncertainty in the experiment end-goal, e.g., a quantity of interest (QoI), rather than th…

    Submitted 11 June, 2018; v1 submitted 18 February, 2018; originally announced February 2018.

    Comments: 25 pages, 13 figures

  22. arXiv:1801.06504  [pdf, other]

    cs.CV stat.ML

    Detecting and counting tiny faces

    Authors: Alexandre Attia, Sharone Dayan

    Abstract: Finding Tiny Faces (by Hu and Ramanan) proposes a novel approach to find small objects in an image. Our contribution consists in deeply understanding the choices of the paper together with applying and extending a similar method to a real world subject which is the counting of people in a public demonstration.

    Submitted 24 January, 2018; v1 submitted 19 January, 2018; originally announced January 2018.

    Comments: 4 pages, 10 figures, 2 appendix page

  23. arXiv:1801.06503  [pdf, other]

    stat.ML cs.LG

    Global overview of Imitation Learning

    Authors: Alexandre Attia, Sharone Dayan

    Abstract: Imitation Learning is a sequential task where the learner tries to mimic an expert's action in order to achieve the best performance. Several algorithms have been proposed recently for this task. In this project, we aim at proposing a wide review of these algorithms, presenting their main features and comparing them on their performance and their regret bounds.

    Submitted 19 January, 2018; originally announced January 2018.

    Comments: 9 pages, 5 figures, 5 appendix pages

  24. arXiv:1801.02171  [pdf, other]

    cs.CV stat.ML

    Detection and segmentation of the Left Ventricle in Cardiac MRI using Deep Learning

    Authors: Alexandre Attia, Sharone Dayan

    Abstract: Manual segmentation of the Left Ventricle (LV) is a tedious and meticulous task that can vary depending on the patient, the Magnetic Resonance Images (MRI) cuts and the experts. Still today, we consider manual delineation done by experts as being the ground truth for cardiac diagnosticians. Thus, we are reviewing the paper - written by Avendi and al. - who presents a combined approach with Convolu…

    Submitted 7 January, 2018; originally announced January 2018.

  25. arXiv:1801.01875  [pdf, other]

    cs.IT cs.DC cs.LG

    Near Optimal Coded Data Shuffling for Distributed Learning

    Authors: Mohamed A. Attia, Ravi Tandon

    Abstract: Data shuffling between distributed cluster of nodes is one of the critical steps in implementing large-scale learning algorithms. Randomly shuffling the data-set among a cluster of workers allows different nodes to obtain fresh data assignments at each learning epoch. This process has been shown to provide improvements in the learning process. However, the statistical benefits of distributed data…

    Submitted 5 January, 2018; originally announced January 2018.

  26. arXiv:1801.00548  [pdf, other]

    stat.ME cs.LG

    A Machine Learning Approach to Adaptive Covariance Localization

    Authors: Azam Moosavi, Ahmed Attia, Adrian Sandu

    Abstract: Data assimilation plays a key role in large-scale atmospheric weather forecasting, where the state of the physical system is estimated from model outputs and observations, and is then used as initial condition to produce accurate future forecasts. The Ensemble Kalman Filter (EnKF) provides a practical implementation of the statistical solution of the data assimilation problem and has gained wide p…

    Submitted 10 February, 2018; v1 submitted 1 January, 2018; originally announced January 2018.

    Comments: 23 pages, 12 figures

    Report number: CSTR-01

  27. arXiv:1711.08452  [pdf, other]

    cs.DC cs.IT

    Combating Computational Heterogeneity in Large-Scale Distributed Computing via Work Exchange

    Authors: Mohamed A. Attia, Ravi Tandon

    Abstract: Owing to data-intensive large-scale applications, distributed computation systems have gained significant recent interest, due to their ability of running such tasks over a large number of commodity nodes in a time efficient manner. One of the major bottlenecks that adversely impacts the time efficiency is the computational heterogeneity of distributed nodes, often limiting the task completion tim…

    Submitted 22 November, 2017; originally announced November 2017.

  28. arXiv:1704.05594  [pdf, other]

    cs.MS

    DATeS: A Highly-Extensible Data Assimilation Testing Suite v1.0

    Authors: Ahmed Attia, Adrian Sandu

    Abstract: A flexible and highly-extensible data assimilation testing suite, named DATeS, is described in this paper. DATeS aims to offer a unified testing environment that allows researchers to compare different data assimilation methodologies and understand their performance in various settings. The core of DATeS is implemented in Python and takes advantage of its object-oriented capabilities. The main com…

    Submitted 1 July, 2018; v1 submitted 18 April, 2017; originally announced April 2017.

    Report number: CSTR-5/2017

  29. arXiv:1403.7137  [pdf, other]

    cs.CE stat.CO

    A Sampling Filter for Non-Gaussian Data Assimilation

    Authors: Ahmed Attia, Adrian Sandu

    Abstract: Data assimilation combines information from models, measurements, and priors to estimate the state of a dynamical system such as the atmosphere. The Ensemble Kalman filter (EnKF) is a family of ensemble-based data assimilation approaches that has gained wide popularity due to its simple formulation, ease of implementation, and good practical results. Most EnKF algorithms assume that the underlying pr…

    Submitted 5 December, 2014; v1 submitted 27 March, 2014; originally announced March 2014.

    Comments: 52 pages, 24 figures, 4 tables

    Report number: CSTR-4/2014