Skip to main content

Showing 1–7 of 7 results for author: Polito, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2111.09785  [pdf, other

    cs.LG

    DIVA: Dataset Derivative of a Learning Task

    Authors: Yonatan Dukler, Alessandro Achille, Giovanni Paolini, Avinash Ravichandran, Marzia Polito, Stefano Soatto

    Abstract: We present a method to compute the derivative of a learning task with respect to a dataset. A learning task is a function from a training set to the validation error, which can be represented by a trained deep neural network (DNN). The "dataset derivative" is a linear operator, computed around the trained model, that informs how perturbations of the weight of each training sample affect the valida… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

  2. arXiv:2107.08039  [pdf, other

    cs.CV cs.LG

    Representation Consolidation for Training Expert Students

    Authors: Zhizhong Li, Avinash Ravichandran, Charless Fowlkes, Marzia Polito, Rahul Bhotika, Stefano Soatto

    Abstract: Traditionally, distillation has been used to train a student model to emulate the input/output functionality of a teacher. A more useful goal than emulation, yet under-explored, is for the student to learn feature representations that transfer well to future tasks. However, we observe that standard distillation of task-specific teachers actually *reduces* the transferability of student representat… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

  3. arXiv:2101.11058  [pdf, other

    cs.CV cs.LG

    Supervised Momentum Contrastive Learning for Few-Shot Classification

    Authors: Orchid Majumder, Avinash Ravichandran, Subhransu Maji, Alessandro Achille, Marzia Polito, Stefano Soatto

    Abstract: Few-shot learning aims to transfer information from one task to enable generalization on novel tasks given a few examples. This information is present both in the domain and the class labels. In this work we investigate the complementary roles of these two sources of information by combining instance-discriminative contrastive learning and supervised learning in a single framework called Supervise… ▽ More

    Submitted 21 June, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: V2 version; updated with new experiments and figures

  4. arXiv:2012.13431  [pdf, other

    cs.LG cs.AI cs.CV

    Mixed-Privacy Forgetting in Deep Networks

    Authors: Aditya Golatkar, Alessandro Achille, Avinash Ravichandran, Marzia Polito, Stefano Soatto

    Abstract: We show that the influence of a subset of the training samples can be removed -- or "forgotten" -- from the weights of a network trained on large-scale image classification tasks, and we provide strong computable bounds on the amount of remaining information after forgetting. Inspired by real-world applications of forgetting techniques, we introduce a novel notion of forgetting in mixed-privacy se… ▽ More

    Submitted 20 June, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

    Comments: CVPR 2021

  5. arXiv:2012.11140  [pdf, other

    cs.LG cs.CV stat.ML

    LQF: Linear Quadratic Fine-Tuning

    Authors: Alessandro Achille, Aditya Golatkar, Avinash Ravichandran, Marzia Polito, Stefano Soatto

    Abstract: Classifiers that are linear in their parameters, and trained by optimizing a convex loss function, have predictable behavior with respect to changes in the training data, initial conditions, and optimization. Such desirable properties are absent in deep neural networks (DNNs), typically trained by non-linear fine-tuning of a pre-trained model. Previous attempts to linearize DNNs have led to intere… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

  6. arXiv:2002.00757  [pdf

    cs.CL

    Similarità per la ricerca del dominio di una frase

    Authors: Massimiliano Morrelli, Giacomo Pansini, Massimiliano Polito, Arturo Vitale

    Abstract: English. This document aims to study the best algorithms to verify the belonging of a specific document to a related domain by comparing different methods for calculating the distance between two vectors. This study has been made possible with the help of the structures made available by the Apache Spark framework. Starting from the study illustrated in the publication "New frontier of textual cla… ▽ More

    Submitted 31 January, 2020; originally announced February 2020.

    Comments: in Italian

  7. arXiv:1902.00555  [pdf

    cs.CL

    Riconoscimento ortografico per apostrofo ed espressioni polirematiche

    Authors: Massimiliano Polito

    Abstract: The work presents two algorithms of manipulation and comparison between strings whose purpose is the orthographic recognition of the apostrophe and of the compound expressions. The theory supporting general reasoning refers to the basic concept of EditDistance, the improvements that ensure the achievement of the objective are achieved with the aid of tools borrowed from the use of techniques for p… ▽ More

    Submitted 31 January, 2019; originally announced February 2019.

    Comments: in Italian