Skip to main content

Showing 1–4 of 4 results for author: Trani, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.08960  [pdf, other

    cs.CV cs.LG

    Neural Network Compression using Binarization and Few Full-Precision Weights

    Authors: Franco Maria Nardini, Cosimo Rulli, Salvatore Trani, Rossano Venturini

    Abstract: Quantization and pruning are two effective Deep Neural Networks model compression methods. In this paper, we propose Automatic Prune Binarization (APB), a novel compression technique combining quantization with pruning. APB enhances the representational capability of binary networks using a few full-precision weights. Our technique jointly maximizes the accuracy of the network while minimizing its… ▽ More

    Submitted 15 September, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: 15 pages, 6 figures, 3 tables

    ACM Class: I.2.6

  2. arXiv:2202.10728  [pdf, other

    cs.LG cs.AI cs.IR cs.PF

    Distilled Neural Networks for Efficient Learning to Rank

    Authors: F. M. Nardini, C. Rulli, S. Trani, R. Venturini

    Abstract: Recent studies in Learning to Rank have shown the possibility to effectively distill a neural network from an ensemble of regression trees. This result leads neural networks to become a natural competitor of tree-based ensembles on the ranking task. Nevertheless, ensembles of regression trees outperform neural models both in terms of efficiency and effectiveness, particularly when scoring on CPU.… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. Learning Early Exit Strategies for Additive Ranking Ensembles

    Authors: Francesco Busolin, Claudio Lucchese, Franco Maria Nardini, Salvatore Orlando, Raffaele Perego, Salvatore Trani

    Abstract: Modern search engine ranking pipelines are commonly based on large machine-learned ensembles of regression trees. We propose LEAR, a novel - learned - technique aimed to reduce the average number of trees traversed by documents to accumulate the scores, thus reducing the overall query response time. LEAR exploits a classifier that predicts whether a document can early exit the ensemble because it… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 5 pages, 3 figures, ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 21)

    ACM Class: H.3.3

    Journal ref: 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Association for Computing Machinery, 2021, 2217-2221

  4. arXiv:2004.14641  [pdf, other

    cs.IR cs.LG

    Query-level Early Exit for Additive Learning-to-Rank Ensembles

    Authors: Claudio Lucchese, Franco Maria Nardini, Salvatore Orlando, Raffaele Perego, Salvatore Trani

    Abstract: Search engine ranking pipelines are commonly based on large ensembles of machine-learned decision trees. The tight constraints on query response time recently motivated researchers to investigate algorithms to make faster the traversal of the additive ensemble or to early terminate the evaluation of documents that are unlikely to be ranked among the top-k. In this paper, we investigate the novel p… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: Accepted at SIGIR 2020 (short paper)

    MSC Class: 68P20