Skip to main content

Showing 1–1 of 1 results for author: De La Torre, M F

.
  1. arXiv:1912.07747  [pdf

    cs.IR cs.CL cs.LG

    Pipelines for Procedural Information Extraction from Scientific Literature: Towards Recipes using Machine Learning and Data Science

    Authors: Huichen Yang, Carlos A. Aguirre, Maria F. De La Torre, Derek Christensen, Luis Bobadilla, Emily Davich, Jordan Roth, Lei Luo, Yihong Theis, Alice Lam, T. Yong-** Han, David Buttler, William H. Hsu

    Abstract: This paper describes a machine learning and data science pipeline for structured information extraction from documents, implemented as a suite of open-source tools and extensions to existing tools. It centers around a methodology for extracting procedural information in the form of recipes, stepwise procedures for creating an artifact (in this case synthesizing a nanomaterial), from published scie… ▽ More

    Submitted 16 December, 2019; originally announced December 2019.

    Comments: 15th International Conference on Document Analysis and Recognition Workshops (ICDARW 2019)

    Report number: 2019-1 MSC Class: I.2.7; I.2.6; H.3.3; H.3.4; I.2.10; I.5.4 ACM Class: I.2.7; I.2.6; H.3.3; H.3.4; I.2.10; I.5.4