Skip to main content

Showing 1–4 of 4 results for author: Özateş, Ş B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.14743  [pdf, other

    cs.CL

    Dependency Annotation of Ottoman Turkish with Multilingual BERT

    Authors: Şaziye Betül Özateş, Tarık Emre Tıraş, Efe Eren Genç, Esma Fatıma Bilgin Taşdemir

    Abstract: This study introduces a pretrained large language model-based annotation methodology for the first dependency treebank in Ottoman Turkish. Our experimental results show that, iteratively, i) pseudo-annotating data using a multilingual BERT-based parsing model, ii) manually correcting the pseudo-annotations, and iii) fine-tuning the parsing model with the corrected annotations, we speed up and simp… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 9 pages, 5 figures. Accepted to LAW-XVIII

  2. arXiv:2207.11782  [pdf, other

    cs.CL

    Enhancements to the BOUN Treebank Reflecting the Agglutinative Nature of Turkish

    Authors: Büşra Marşan, Salih Furkan Akkurt, Muhammet Şen, Merve Gürbüz, Onur Güngör, Şaziye Betül Özateş, Suzan Üsküdarlı, Arzucan Özgür, Tunga Güngör, Balkız Öztürk

    Abstract: In this study, we aim to offer linguistically motivated solutions to resolve the issues of the lack of representation of null morphemes, highly productive derivational processes, and syncretic morphemes of Turkish in the BOUN Treebank without diverging from the Universal Dependencies framework. In order to tackle these issues, new annotation conventions were introduced by splitting certain lemma… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

    Comments: This is a peer reviewed article that has been presented in The International Conference on Agglutinative Language Technologies as a challenge of Natural Language Processing (ALTNLP) 2022

  3. arXiv:2002.10416  [pdf, other

    cs.CL

    Resources for Turkish Dependency Parsing: Introducing the BOUN Treebank and the BoAT Annotation Tool

    Authors: Utku Türk, Furkan Atmaca, Şaziye Betül Özateş, Gözde Berk, Seyyit Talha Bedir, Abdullatif Köksal, Balkız Öztürk Başaran, Tunga Güngör, Arzucan Özgür

    Abstract: In this paper, we introduce the resources that we developed for Turkish dependency parsing, which include a novel manually annotated treebank (BOUN Treebank), along with the guidelines we adopted, and a new annotation tool (BoAT). The manual annotation process we employed was shaped and implemented by a team of four linguists and five Natural Language Processing (NLP) specialists. Decisions regard… ▽ More

    Submitted 16 September, 2021; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Language Resource and Evaluation

  4. A Hybrid Approach to Dependency Parsing: Combining Rules and Morphology with Deep Learning

    Authors: Şaziye Betül Özateş, Arzucan Özgür, Tunga Güngör, Balkız Öztürk

    Abstract: Fully data-driven, deep learning-based models are usually designed as language-independent and have been shown to be successful for many natural language processing tasks. However, when the studied language is low-resourced and the amount of training data is insufficient, these models can benefit from the integration of natural language grammar-based information. We propose two approaches to depen… ▽ More

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: 25 pages, 7 figures

    ACM Class: I.2.7