Skip to main content

Showing 1–20 of 20 results for author: Gallo, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.11678  [pdf, other

    cs.ET

    Design and simulation of memristor-based neural networks

    Authors: Pablo Alex Lázaro, Ignacio Jiménez Gallo, Juan Roldán Aranda, Alberto del Barrio García, Guillermo Botella Juan, Francisco Jiménez Molinos

    Abstract: In recent times, neural networks have been gaining increasing importance in fields such as pattern recognition and computer vision. However, their usage entails significant energy and hardware costs, limiting the domains in which this technology can be employed. In this context, the feasibility of utilizing analog circuits based on memristors as efficient alternatives in neural network inference… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  2. arXiv:2011.08042  [pdf, other

    cs.LG math.OC

    Mixing ADAM and SGD: a Combined Optimization Method

    Authors: Nicola Landro, Ignazio Gallo, Riccardo La Grassa

    Abstract: Optimization methods (optimizers) get special attention for the efficient training of neural networks in the field of deep learning. In literature there are many papers that compare neural models trained with the use of different optimizers. Each paper demonstrates that for a particular problem an optimizer is better than the others but as the problem changes this type of result is no longer valid… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

  3. arXiv:2009.08796  [pdf, other

    cs.CV cs.LG

    $σ^2$R Loss: a Weighted Loss by Multiplicative Factors using Sigmoidal Functions

    Authors: Riccardo La Grassa, Ignazio Gallo, Nicola Landro

    Abstract: In neural networks, the loss function represents the core of the learning process that leads the optimizer to an approximation of the optimal convergence error. Convolutional neural networks (CNN) use the loss function as a supervisory signal to train a deep model and contribute significantly to achieving the state of the art in some fields of artificial vision. Cross-entropy and Center loss funct… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: 9 pages

  4. arXiv:2005.08622  [pdf, other

    cs.CV cs.LG

    Learn Class Hierarchy using Convolutional Neural Networks

    Authors: Riccardo La Grassa, Ignazio Gallo, Nicola Landro

    Abstract: A large amount of research on Convolutional Neural Networks has focused on flat Classification in the multi-class domain. In the real world, many problems are naturally expressed as problems of hierarchical classification, in which the classes to be predicted are organized in a hierarchy of classes. In this paper, we propose a new architecture for hierarchical classification of images, introducing… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 7 pages

  5. arXiv:2005.00393  [pdf, other

    cs.LG stat.ML

    Can a powerful neural network be a teacher for a weaker neural network?

    Authors: Nicola Landro, Ignazio Gallo, Riccardo La Grassa

    Abstract: The transfer learning technique is widely used to learning in one context and applying it to another, i.e. the capacity to apply acquired knowledge and skills to new situations. But is it possible to transfer the learning from a deep neural network to a weaker neural network? Is it possible to improve the performance of a weak neural network using the knowledge acquired by a more powerful neural n… ▽ More

    Submitted 7 May, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

  6. arXiv:2004.13780  [pdf, other

    cs.CV cs.CL cs.SD eess.AS

    Cross-modal Speaker Verification and Recognition: A Multilingual Perspective

    Authors: Muhammad Saad Saeed, Shah Nawaz, Pietro Morerio, Arif Mahmood, Ignazio Gallo, Muhammad Haroon Yousaf, Alessio Del Bue

    Abstract: Recent years have seen a surge in finding association between faces and voices within a cross-modal biometric application along with speaker recognition. Inspired from this, we introduce a challenging task in establishing association between faces and voices across multiple languages spoken by the same set of persons. The aim of this paper is to answer two closely related questions: "Is face-voice… ▽ More

    Submitted 22 April, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: Accepted: CVPRW

  7. arXiv:2004.02273  [pdf, other

    cs.LG cs.CV stat.ML

    Dynamic Decision Boundary for One-class Classifiers applied to non-uniformly Sampled Data

    Authors: Riccardo La Grassa, Ignazio Gallo, Nicola Landro

    Abstract: A typical issue in Pattern Recognition is the non-uniformly sampled data, which modifies the general performance and capability of machine learning algorithms to make accurate predictions. Generally, the data is considered non-uniformly sampled when in a specific area of data space, they are not enough, leading us to misclassification problems. This issue cut down the goal of the one-class classif… ▽ More

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: 7 pages

  8. arXiv:2003.13524  [pdf, other

    cs.LG cs.CV stat.ML

    OCmst: One-class Novelty Detection using Convolutional Neural Network and Minimum Spanning Trees

    Authors: Riccardo La Grassa, Ignazio Gallo, Nicola Landro

    Abstract: We present a novel model called One Class Minimum Spanning Tree (OCmst) for novelty detection problem that uses a Convolutional Neural Network (CNN) as deep feature extractor and graph-based model based on Minimum Spanning Tree (MST). In a novelty detection scenario, the training data is no polluted by outliers (abnormal class) and the goal is to recognize if a test instance belongs to the normal… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: 16 pages

  9. arXiv:1909.08685  [pdf, ps, other

    cs.CV cs.SD eess.AS

    Deep Latent Space Learning for Cross-modal Map** of Audio and Visual Signals

    Authors: Shah Nawaz, Muhammad Kamran Janjua, Ignazio Gallo, Arif Mahmood, Alessandro Calefati

    Abstract: We propose a novel deep training algorithm for joint representation of audio and visual information which consists of a single stream network (SSNet) coupled with a novel loss function to learn a shared deep latent space representation of multimodal information. The proposed framework characterizes the shared latent space by leveraging the class centers which helps to eliminate the need for pairwi… ▽ More

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: Accepted to DICTA 2019

  10. arXiv:1909.05663  [pdf, other

    cs.CV cs.AI

    Picture What you Read

    Authors: Ignazio Gallo, Shah Nawaz, Alessandro Calefati, Riccardo La Grassa, Nicola Landro

    Abstract: Visualization refers to our ability to create an image in our head based on the text we read or the words we hear. It is one of the many skills that makes reading comprehension possible. Convolutional Neural Networks (CNN) are an excellent tool for recognizing and classifying text documents. In addition, it can generate images conditioned on natural language. In this work, we utilize CNNs capabili… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: 7 pages, Dicta2019 conference

  11. arXiv:1909.04078  [pdf, other

    cs.LG cs.AI stat.ML

    A Classification Methodology based on Subspace Graphs Learning

    Authors: Riccardo La Grassa, Ignazio Gallo, Alessandro Calefati, Dimitri Ognibene

    Abstract: In this paper, we propose a design methodology for one-class classifiers using an ensemble-of-classifiers approach. The objective is to select the best structures created during the training phase using an ensemble of spanning trees. It takes the best classifier, partitioning the area near a pattern into $γ^{γ-2}$ sub-spaces and combining all possible spanning trees that can be created starting fr… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: 8 pages, Dicta Conference

  12. arXiv:1909.01976  [pdf, other

    cs.CV

    Do Cross Modal Systems Leverage Semantic Relationships?

    Authors: Shah Nawaz, Muhammad Kamran Janjua, Ignazio Gallo, Arif Mahmood, Alessandro Calefati, Faisal Shafait

    Abstract: Current cross-modal retrieval systems are evaluated using R@K measure which does not leverage semantic relationships rather strictly follows the manually marked image text query pairs. Therefore, current systems do not generalize well for the unseen data in the wild. To handle this, we propose a new measure, SemanticMap, to evaluate the performance of cross-modal systems. Our proposed measure eval… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: Accepted to cross modal learning in real world in conjunction with ICCV 2019. arXiv admin note: text overlap with arXiv:1807.07364

  13. arXiv:1906.06090  [pdf, other

    cs.LG stat.ML

    Binary Classification using Pairs of Minimum Spanning Trees or N-ary Trees

    Authors: Riccardo La Grassa, Ignazio Gallo, Alessandro Calefati, Dimitri Ognibene

    Abstract: One-class classifiers are trained with target class only samples. Intuitively, their conservative modelling of the class description may benefit classical classification tasks where classes are difficult to separate due to overlap** and data imbalance. In this work, three methods are proposed which leverage on the combination of one-class classifiers based on non-parametric models, N-ary Trees a… ▽ More

    Submitted 25 June, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  14. arXiv:1904.01356  [pdf, other

    cs.CV cs.CL

    Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition

    Authors: Omer Arshad, Ignazio Gallo, Shah Nawaz, Alessandro Calefati

    Abstract: With massive explosion of social media such as Twitter and Instagram, people daily share billions of multimedia posts, containing images and text. Typically, text in these posts is short, informal and noisy, leading to ambiguities which can be resolved using images. In this paper we explore text-centric Named Entity Recognition task on these multimedia posts. We propose an end to end model which l… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

  15. arXiv:1810.07037  [pdf, other

    cs.CV

    Learning Inward Scaled Hypersphere Embedding: Exploring Projections in Higher Dimensions

    Authors: Muhammad Kamran Janjua, Shah Nawaz, Alessandro Calefati, Ignazio Gallo

    Abstract: Majority of the current dimensionality reduction or retrieval techniques rely on embedding the learned feature representations onto a computable metric space. Once the learned features are mapped, a distance metric aids the bridging of gaps between similar instances. Since the scaled projection is not exploited in these methods, discriminative embedding onto a hyperspace becomes a challenge. In th… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

  16. arXiv:1810.02001  [pdf, ps, other

    cs.CV

    Image and Encoded Text Fusion for Multi-Modal Classification

    Authors: Ignazio Gallo, Alessandro Calefati, Shah Nawaz, Muhammad Kamran Janjua

    Abstract: Multi-modal approaches employ data from multiple input streams such as textual and visual domains. Deep neural networks have been successfully employed for these approaches. In this paper, we present a novel multi-modal approach that fuses images and text descriptions to improve multi-modal classification performance in real-world scenarios. The proposed approach embeds an encoded text onto an ima… ▽ More

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: Accepted to DICTA 2018

  17. arXiv:1808.10822  [pdf, other

    cs.CV

    Seeing Colors: Learning Semantic Text Encoding for Classification

    Authors: Shah Nawaz, Alessandro Calefati, Muhammad Kamran Janjua, Ignazio Gallo

    Abstract: The question we answer with this work is: can we convert a text document into an image to exploit best image classification models to classify documents? To answer this question we present a novel text classification method which converts a text document into an encoded image, using word embedding and capabilities of Convolutional Neural Networks (CNNs), successfully employed in image classificati… ▽ More

    Submitted 31 August, 2018; originally announced August 2018.

    Comments: 9 pages. Under review at IJDAR

  18. arXiv:1807.08512  [pdf, other

    cs.CV

    Git Loss for Deep Face Recognition

    Authors: Alessandro Calefati, Muhammad Kamran Janjua, Shah Nawaz, Ignazio Gallo

    Abstract: Convolutional Neural Networks (CNNs) have been widely used in computer vision tasks, such as face recognition and verification, and have achieved state-of-the-art results due to their ability to capture discriminative deep features. Conventionally, CNNs have been trained with softmax as supervision signal to penalize the classification loss. In order to further enhance the discriminative capabilit… ▽ More

    Submitted 28 July, 2018; v1 submitted 23 July, 2018; originally announced July 2018.

    Comments: 12 pages. Accepted at BMVC2018

  19. arXiv:1807.07364  [pdf, other

    cs.CV

    Revisiting Cross Modal Retrieval

    Authors: Shah Nawaz, Muhammad Kamran Janjua, Alessandro Calefati, Ignazio Gallo

    Abstract: This paper proposes a cross-modal retrieval system that leverages on image and text encoding. Most multimodal architectures employ separate networks for each modality to capture the semantic relationship between them. However, in our work image-text encoding can achieve comparable results in terms of cross-modal retrieval without having to use a separate network for each modality. We show that tex… ▽ More

    Submitted 19 July, 2018; originally announced July 2018.

    Comments: 14 pages. Under review at ECCVW (MULA 2018)

  20. arXiv:1711.02661  [pdf, other

    cs.CE

    e-Fair: Aggregation in e-Commerce for Exploiting Economies of Scale

    Authors: Pierluigi Gallo, Francesco Randazzo, Ignazio Gallo

    Abstract: In recent years, many new and interesting models of successful online business have been developed, including competitive models such as auctions, where the product price tends to rise, and group-buying, where users cooperate obtaining a dynamic price that tends to go down. We propose the e-fair as a business model for social commerce, where both sellers and buyers are grouped to maximize benefits… ▽ More

    Submitted 7 November, 2017; originally announced November 2017.