Skip to main content

Showing 1–8 of 8 results for author: Chirigati, F

.
  1. arXiv:2104.03353  [pdf, other

    cs.DB cs.DS cs.IR

    Correlation Sketches for Approximate Join-Correlation Queries

    Authors: Aécio Santos, Aline Bessa, Fernando Chirigati, Christopher Musco, Juliana Freire

    Abstract: The increasing availability of structured datasets, from Web tables and open-data portals to enterprise data, opens up opportunities~to enrich analytics and improve machine learning models through relational data augmentation. In this paper, we introduce a new class of data augmentation queries: join-correlation queries. Given a column $Q$ and a join column $K_Q$ from a query table… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: Proceedings of the 2021 International Conference on Management of Data (SIGMOD '21)

  2. arXiv:2102.05716  [pdf, other

    cs.IR cs.DB

    Auctus: A Dataset Search Engine for Data Augmentation

    Authors: Sonia Castelo, Rémi Rampin, Aécio Santos, Aline Bessa, Fernando Chirigati, Juliana Freire

    Abstract: The large volumes of structured data currently available, from Web tables to open-data portals and enterprise data, open up new opportunities for progress in answering many important scientific, societal, and business questions. However, finding relevant data is difficult. While search engines have addressed this problem for Web documents, there are many new challenges involved in supporting the d… ▽ More

    Submitted 31 August, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

  3. arXiv:1808.01406  [pdf, other

    cs.SE cs.DL

    ReproServer: Making Reproducibility Easier and Less Intensive

    Authors: Remi Rampin, Fernando Chirigati, Vicky Steeves, Juliana Freire

    Abstract: Reproducibility in the computational sciences has been stymied because of the complex and rapidly changing computational environments in which modern research takes place. While many will espouse reproducibility as a value, the challenge of making it happen (both for themselves and testing the reproducibility of others' work) often outweigh the benefits. There have been a few reproducibility solut… ▽ More

    Submitted 3 August, 2018; originally announced August 2018.

  4. A Collaborative Approach to Computational Reproducibility

    Authors: Fernando Chirigati, Rebecca Capone, Dennis Shasha, Remi Rampin, Juliana Freire

    Abstract: Although a standard in natural science, reproducibility has been only episodically applied in experimental computer science. Scientific papers often present a large number of tables, plots and pictures that summarize the obtained results, but then loosely describe the steps taken to derive them. Not only can the methods and the implementation be complex, but also their configuration may require se… ▽ More

    Submitted 9 August, 2017; originally announced September 2017.

    Journal ref: The Journal of Information Systems, Volume 59, Pages 95-97, ISSN 0306-4379 (2016)

  5. Reproducible experiments on dynamic resource allocation in cloud data centers

    Authors: Andreas Wolke, Martin Bichler, Fernando Chirigati, Victoria Steeves

    Abstract: In Wolke et al. [1] we compare the efficiency of different resource allocation strategies experimentally. We focused on dynamic environments where virtual machines need to be allocated and deallocated to servers over time. In this companion paper, we describe the simulation framework and how to run simulations to replicate experiments or run new experiments within the framework.

    Submitted 28 February, 2017; originally announced March 2017.

    Journal ref: Information Systems, Volume 59, July 2016, Pages 98-101, ISSN 0306-4379

  6. Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets

    Authors: Fernando Chirigati, Harish Doraiswamy, Theodoros Damoulas, Juliana Freire

    Abstract: The increasing ability to collect data from urban environments, coupled with a push towards openness by governments, has resulted in the availability of numerous spatio-temporal data sets covering diverse aspects of a city. Discovering relationships between these data sets can produce new insights by enabling domain experts to not only test but also generate hypotheses. However, discovering these… ▽ More

    Submitted 21 October, 2016; originally announced October 2016.

    Journal ref: Proceedings of the 2016 International Conference on Management of Data (SIGMOD '16), pp. 1011-1025

  7. arXiv:1502.02403  [pdf, other

    cs.SE

    YesWorkflow: A User-Oriented, Language-Independent Tool for Recovering Workflow Information from Scripts

    Authors: Timothy McPhillips, Tianhong Song, Tyler Kolisnik, Steve Aulenbach, Khalid Belhajjame, Kyle Bocinsky, Yang Cao, Fernando Chirigati, Saumen Dey, Juliana Freire, Deborah Huntzinger, Christopher Jones, David Koop, Paolo Missier, Mark Schildhauer, Christopher Schwalm, Yaxing Wei, James Cheney, Mark Bieda, Bertram Ludaescher

    Abstract: Scientific workflow management systems offer features for composing complex computational pipelines from modular building blocks, for executing the resulting automated workflows, and for recording the provenance of data products resulting from workflow runs. Despite the advantages such features provide, many automated workflows continue to be implemented and executed outside of scientific workflow… ▽ More

    Submitted 9 February, 2015; originally announced February 2015.

  8. arXiv:1401.2000  [pdf, other

    cs.CE cond-mat.stat-mech physics.comp-ph

    A model project for reproducible papers: critical temperature for the Ising model on a square lattice

    Authors: M. Dolfi, J. Gukelberger, A. Hehn, J. Imriška, K. Pakrouski, T. F. Rønnow, M. Troyer, I. Zintchenko, F. Chirigati, J. Freire, D. Shasha

    Abstract: In this paper we present a simple, yet typical simulation in statistical physics, consisting of large scale Monte Carlo simulations followed by an involved statistical analysis of the results. The purpose is to provide an example publication to explore tools for writing reproducible papers. The simulation estimates the critical temperature where the Ising model on the square lattice becomes magnet… ▽ More

    Submitted 9 January, 2014; originally announced January 2014.

    Comments: Authors are listed in alphabetical order by institution and name. 5 pages, 4 figures