Showing 1–4 of 4 results for author: Ghaderi, R

Searching in archive cs.
  1. arXiv:2310.12680

    cs.LG math.OC stat.ML

    On the Optimization and Generalization of Multi-head Attention

    Authors: Puneesh Deora, Rouzbeh Ghaderi, Hossein Taheri, Christos Thrampoulidis

    Abstract: The training and generalization dynamics of the Transformer's core mechanism, namely the Attention mechanism, remain under-explored. Besides, existing analyses primarily focus on single-head attention. Inspired by the demonstrated benefits of overparameterization when training fully-connected networks, we investigate the potential optimization and generalization advantages of using multiple attent…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 48 pages; presented at the Workshop on High-dimensional Learning Dynamics, ICML 2023

  2. arXiv:2205.02121

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Accelerating phase-field-based simulation via machine learning

    Authors: Iman Peivaste, Nima H. Siboni, Ghasem Alahyarizadeh, Reza Ghaderi, Bob Svendsen, Dierk Raabe, Jaber R. Mianroodi

    Abstract: Phase-field-based models have become common in material science, mechanics, physics, biology, chemistry, and engineering for the simulation of microstructure evolution. Yet, they suffer from the drawback of being computationally very costly when applied to large, complex systems. To reduce such computational costs, a Unet-based artificial neural network is developed as a surrogate model in the cur…

    Submitted 4 May, 2022; originally announced May 2022.

  3. ECOC-Based Training of Neural Networks for Face Recognition

    Authors: Nima Hatami, Reza Ebrahimpour, Reza Ghaderi

    Abstract: Error Correcting Output Codes, ECOC, is an output representation method capable of discovering some of the errors produced in classification tasks. This paper describes the application of ECOC to the training of feed forward neural networks, FFNN, for improving the overall accuracy of classification systems. Indeed, to improve the generalization of FFNN classifiers, this paper proposes an ECOC-Bas…

    Submitted 13 December, 2013; originally announced December 2013.

    Journal ref: Cybernetics and Intelligent Systems, IEEE Conference on, 450-454, 2008

  4. arXiv:1206.2027

    nlin.AO cs.RO

    Adaptive Fractional PID Controller for Robot Manipulator

    Authors: H. Delavari, R. Ghaderi, N. A. Ranjbar, S. H. HosseinNia, S. Momani

    Abstract: A Fractional adaptive PID (FPID) controller for a robot manipulator will be proposed. The PID parameters have been optimized by Genetic algorithm. The proposed controller is found robust by means of simulation in a tracking job. The validity of the proposed controller is shown by simulation of two-link robot manipulator. The result then is compared with integer type adaptive PID controller. It is…

    Submitted 10 June, 2012; originally announced June 2012.

    Comments: Proceedings of FDA'10, the 4th IFAC Workshop Fractional Differentiation and its Applications. Article no. FDA10-038. Badajoz, Spain, October 18-20, 2010