Skip to main content

Showing 1–3 of 3 results for author: Štěpánek, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.15166  [pdf, other

    cs.CL cs.AI physics.ao-ph

    Pixels and Predictions: Potential of GPT-4V in Meteorological Imagery Analysis and Forecast Communication

    Authors: John R. Lawson, Montgomery L. Flora, Kevin H. Goebbert, Seth N. Lyman, Corey K. Potvin, David M. Schultz, Adam J. Stepanek, Joseph E. Trujillo-Falcón

    Abstract: Generative AI, such as OpenAI's GPT-4V large-language model, has rapidly entered mainstream discourse. Novel capabilities in image processing and natural-language communication may augment existing forecasting methods. Large language models further display potential to better communicate weather hazards in a style honed for diverse communities and different languages. This study evaluates GPT-4V's… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Supplementary material PDF attached. Submitted to Artificial Intelligence for the Earth Systems (American Meteorological Society) on 18 April 2024

  2. arXiv:2306.09307  [pdf, other

    cs.CL

    Quality and Efficiency of Manual Annotation: Pre-annotation Bias

    Authors: Marie Mikulová, Milan Straka, Jan Štěpánek, Barbora Štěpánková, Jan Hajič

    Abstract: This paper presents an analysis of annotation using an automatic pre-annotation for a mid-level annotation complexity task -- dependency syntax annotation. It compares the annotation efforts made by annotators using a pre-annotated version (with a high-accuracy parser) and those made by fully manual annotation. The aim of the experiment is to judge the final annotation quality when pre-annotation… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Published in LREC 2022

  3. arXiv:2006.03679  [pdf, other

    cs.CL

    Prague Dependency Treebank -- Consolidated 1.0

    Authors: Jan Hajič, Eduard Bejček, Jaroslava Hlaváčová, Marie Mikulová, Milan Straka, Jan Štěpánek, Barbora Štěpánková

    Abstract: We present a richly annotated and genre-diversified language resource, the Prague Dependency Treebank-Consolidated 1.0 (PDT-C 1.0), the purpose of which is - as it always been the case for the family of the Prague Dependency Treebanks - to serve both as a training data for various types of NLP tasks as well as for linguistically-oriented research. PDT-C 1.0 contains four different datasets of Czec… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: Accepted at LREC 2020 (Proceedings of Language Resources and Evaluation, Marseille, France)