Skip to main content

Showing 1–7 of 7 results for author: Galatolo, F A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.15698  [pdf, other

    cs.CL cs.AI

    Cerbero-7B: A Leap Forward in Language-Specific LLMs Through Enhanced Chat Corpus Generation and Evaluation

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino

    Abstract: This study introduces a novel approach for generating high-quality, language-specific chat corpora using a self-chat mechanism. We combine a generator LLM for creating new samples and an embedder LLM to ensure diversity. A new Masked Language Modelling (MLM) model-based quality assessment metric is proposed for evaluating and filtering the corpora. Utilizing the llama2-70b as the generator and a m… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  2. arXiv:2212.07839  [pdf, other

    cs.CV cs.CL cs.LG

    TeTIm-Eval: a novel curated evaluation data set for comparing text-to-image models

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Edoardo Cogotti

    Abstract: Evaluating and comparing text-to-image models is a challenging problem. Significant advances in the field have recently been made, piquing interest of various industrial sectors. As a consequence, a gold standard in the field should cover a variety of tasks and application contexts. In this paper a novel evaluation approach is experimented, on the basis of: (i) a curated data set, made by high-qua… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  3. arXiv:2102.01645  [pdf, other

    cs.NE cs.AI cs.LG

    Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: In this research work we present CLIP-GLaSS, a novel zero-shot framework to generate an image (or a caption) corresponding to a given caption (or image). CLIP-GLaSS is based on the CLIP neural network, which, given an image and a descriptive caption, provides similar embeddings. Differently, CLIP-GLaSS takes a caption (or an image) as an input, and generates the image (or the caption) whose CLIP e… ▽ More

    Submitted 1 October, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Journal ref: IMPROVE, ISBN 978-989-758-511-1, pages 166-174 (2021)

  4. Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: In this research, some of the issues that arise from the scalarization of the multi-objective optimization problem in the Advantage Actor Critic (A2C) reinforcement learning algorithm are investigated. The paper shows how a naive scalarization can lead to gradients overlap**. Furthermore, the possibility that the entropy regularization term can be a source of uncontrolled noise is discussed. Wit… ▽ More

    Submitted 1 October, 2021; v1 submitted 8 April, 2020; originally announced April 2020.

    Journal ref: Computers & Electrical Engineering, 92, 107117 (2021)

  5. arXiv:1905.06684  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Formal derivation of Mesh Neural Networks with their Forward-Only gradient Propagation

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: This paper proposes the Mesh Neural Network (MNN), a novel architecture which allows neurons to be connected in any topology, to efficiently route information. In MNNs, information is propagated between neurons throughout a state transition function. State and error gradients are then directly computed from state updates without backward computation. The MNN architecture and the error propagation… ▽ More

    Submitted 30 September, 2021; v1 submitted 16 May, 2019; originally announced May 2019.

    Journal ref: Galatolo, F. A., Cimino, M. G., & Vaglini, G. (2021). Formal Derivation of Mesh Neural Networks with Their Forward-Only Gradient Propagation. Neural Processing Letters, 1-16

  6. arXiv:1903.01341  [pdf

    cs.NE cs.LG stat.ML

    Using stigmergy as a computational memory in the design of recurrent neural networks

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: In this paper, a novel architecture of Recurrent Neural Network (RNN) is designed and experimented. The proposed RNN adopts a computational memory based on the concept of stigmergy. The basic principle of a Stigmergic Memory (SM) is that the activity of deposit/removal of a quantity in the SM stimulates the next activities of deposit/removal. Accordingly, subsequent SM activities tend to reinforce… ▽ More

    Submitted 9 January, 2019; originally announced March 2019.

  7. arXiv:1811.10574  [pdf

    cs.NE cs.LG stat.ML

    Using stigmergy to incorporate the time into artificial neural networks

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: A current research trend in neurocomputing involves the design of novel artificial neural networks incorporating the concept of time into their operating model. In this paper, a novel architecture that employs stigmergy is proposed. Computational stigmergy is used to dynamically increase (or decrease) the strength of a connection, or the activation level, of an artificial neuron when stimulated (o… ▽ More

    Submitted 25 October, 2018; originally announced November 2018.