Showing 1–2 of 2 results for author: Vitagliano, G

  1. arXiv:2405.14696

    cs.CL cs.AI cs.DB

    A Declarative System for Optimizing AI Workloads

    Authors: Chunwei Liu, Matthew Russo, Michael Cafarella, Lei Cao, Peter Baille Chen, Zui Chen, Michael Franklin, Tim Kraska, Samuel Madden, Gerardo Vitagliano

    Abstract: A long-standing goal of data management systems has been to build systems which can compute quantitative insights over large corpora of unstructured data in a cost-effective manner. Until recently, it was difficult and expensive to extract facts from company documents, data from scientific papers, or metrics from image and video corpora. Today's models can accomplish these tasks with high accuracy…

    Submitted 29 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 29 pages, 9 figures

    ACM Class: H.2.3; I.2.5

  2. Detecting Layout Templates in Complex Multiregion Files

    Authors: Gerardo Vitagliano, Lan Jiang, Felix Naumann

    Abstract: Spreadsheets are among the most commonly used file formats for data management, distribution, and analysis. Their widespread employment makes it easy to gather large collections of data, but their flexible canvas-based structure makes automated analysis difficult without heavy preparation. One of the common problems that practitioners face is the presence of multiple, independent regions in a sing…

    Submitted 15 September, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

    Journal ref: Proceedings of the VLDB Endowment, Volume 15, Issue 3, November 2021