Showing 1–17 of 17 results for author: Güngör, T

  1. A Comprehensive Analysis of Static Word Embeddings for Turkish

    Authors: Karahan Sarıtaş, Cahid Arda Öz, Tunga Güngör

    Abstract: Word embeddings are fixed-length, dense and distributed word representations that are used in natural language processing (NLP) applications. There are basically two types of word embedding models which are non-contextual (static) models and contextual models. The former method generates a single embedding for a word regardless of its context, while the latter method produces distinct embeddings f…

    Submitted 13 May, 2024; originally announced May 2024.

    Journal ref: Expert Systems with Applications Volume 252, Part A, 15 October 2024, 124123

  2. arXiv:2402.13067  [pdf, ps, other]

    physics.flu-dyn

    Turbulent boundary layer response to uniform changes of the pressure force contribution

    Authors: Taygun R. Gungor, Ayse G. Gungor, Yvan Maciel

    Abstract: We investigate a turbulent boundary layer (TBL) with uniform pressure force variations, focusing on understanding its response to local pressure force, local pressure force variation (local disequilibrating effect), and upstream history. The studied flow starts as a zero-pressure-gradient (ZPG) TBL, followed by a uniform increase in the ratio of pressure force to turbulent force in the outer regio…

    Submitted 22 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  3. arXiv:2402.10328  [pdf, other]

    physics.flu-dyn

    Turbulent activity in the near-wall region of adverse pressure gradient turbulent boundary layers

    Authors: Taygun R. Gungor, Yvan Maciel, Ayse G. Gungor

    Abstract: Two direct numerical simulation (DNS) databases are investigated to understand the effect of the outer-layer turbulence on the inner layer's structures and energy transfer mechanisms. The first DNS database is the non-equilibrium adverse-pressure-gradient (APG) turbulent boundary layer (TBL) of Gungor et al. (2022). Its Reynolds number and the inner-layer pressure gradient parameter reach above 8…

    Submitted 15 February, 2024; originally announced February 2024.

  4. arXiv:2401.03590  [pdf, other]

    cs.CL

    Building Efficient and Effective OpenQA Systems for Low-Resource Languages

    Authors: Emrah Budur, Rıza Özçelik, Dilara Soylu, Omar Khattab, Tunga Güngör, Christopher Potts

    Abstract: Question answering (QA) is the task of answering questions posed in natural language with free-form natural language answers extracted from a given passage. In the OpenQA variant, only a question text is given, and the system must retrieve relevant passages from an unstructured knowledge source and use them to provide answers, which is the case in the mainstream QA systems on the Web. QA systems c…

    Submitted 4 June, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  5. arXiv:2307.11457  [pdf, ps, other]

    cs.CL cs.AI

    Incorporating Human Translator Style into English-Turkish Literary Machine Translation

    Authors: Zeynep Yirmibeşoğlu, Olgun Dursun, Harun Dallı, Mehmet Şahin, Ena Hodzik, Sabri Gürses, Tunga Güngör

    Abstract: Although machine translation systems are mostly designed to serve in the general domain, there is a growing tendency to adapt these systems to other domains like literary translation. In this paper, we focus on English-Turkish literary translation and develop machine translation models that take into account the stylistic features of translators. We fine-tune a pre-trained machine translation mode…

    Submitted 21 July, 2023; originally announced July 2023.

    Journal ref: 24th Annual Conference of the European Association of Machine Translation (EAMT), June 2023, Tampere, Finland

  6. arXiv:2207.11782  [pdf, other]

    cs.CL

    Enhancements to the BOUN Treebank Reflecting the Agglutinative Nature of Turkish

    Authors: Büşra Marşan, Salih Furkan Akkurt, Muhammet Şen, Merve Gürbüz, Onur Güngör, Şaziye Betül Özateş, Suzan Üsküdarlı, Arzucan Özgür, Tunga Güngör, Balkız Öztürk

    Abstract: In this study, we aim to offer linguistically motivated solutions to resolve the issues of the lack of representation of null morphemes, highly productive derivational processes, and syncretic morphemes of Turkish in the BOUN Treebank without diverging from the Universal Dependencies framework. In order to tackle these issues, new annotation conventions were introduced by splitting certain lemma…

    Submitted 24 July, 2022; originally announced July 2022.

    Comments: This is a peer-reviewed article that was presented at the International Conference on Agglutinative Language Technologies as a Challenge of Natural Language Processing (ALTNLP) 2022

  7. arXiv:2112.02980  [pdf, other]

    physics.flu-dyn

    Energy transfer mechanisms in adverse pressure gradient turbulent boundary layers

    Authors: Taygun R. Gungor, Yvan Maciel, Ayse G. Gungor

    Abstract: The energy transfer mechanisms and structures playing a role in these mechanisms in adverse-pressure-gradient (APG) turbulent boundary layers (TBLs) with small and large velocity defects are investigated. We examine the wall-normal and spectral distributions of energy, production and pressure-strain in APG TBLs and compare these distributions with those in canonical flows. It is found that the spe…

    Submitted 6 December, 2021; originally announced December 2021.

  8. arXiv:2011.04451  [pdf, other]

    cs.CL cs.LG

    Hierarchical Multitask Learning Approach for BERT

    Authors: Çağla Aksoy, Alper Ahmetoğlu, Tunga Güngör

    Abstract: Recent works show that learning contextualized embeddings for words is beneficial for downstream tasks. BERT is one successful example of this approach. It learns embeddings by solving two tasks, which are masked language model (masked LM) and the next sentence prediction (NSP). The pre-training of BERT can also be framed as a multitask learning problem. In this work, we adopt hierarchical multita…

    Submitted 17 October, 2020; originally announced November 2020.

    Comments: 9 pages, 3 figures

  9. arXiv:2004.14963  [pdf, other]

    cs.CL

    Data and Representation for Turkish Natural Language Inference

    Authors: Emrah Budur, Rıza Özçelik, Tunga Güngör, Christopher Potts

    Abstract: Large annotated datasets in NLP are overwhelmingly in English. This is an obstacle to progress in other languages. Unfortunately, obtaining new annotated resources for each task in each language would be prohibitively expensive. At the same time, commercial machine translation systems are now robust. Can we leverage these systems to translate English-language datasets automatically? In this paper,…

    Submitted 20 October, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: Accepted to EMNLP 2020

  10. arXiv:2004.12247  [pdf, other]

    cs.CL cs.IR cs.LG

    Hierarchical Multi Task Learning with Subword Contextual Embeddings for Languages with Rich Morphology

    Authors: Arda Akdemir, Tetsuo Shibuya, Tunga Güngör

    Abstract: Morphological information is important for many sequence labeling tasks in Natural Language Processing (NLP). Yet, existing approaches rely heavily on manual annotations or external software to capture this information. In this study, we propose using subword contextual embeddings to capture the morphological information for languages with rich morphology. In addition, we incorporate these embeddi…

    Submitted 25 April, 2020; originally announced April 2020.

  11. arXiv:2002.10416  [pdf, other]

    cs.CL

    Resources for Turkish Dependency Parsing: Introducing the BOUN Treebank and the BoAT Annotation Tool

    Authors: Utku Türk, Furkan Atmaca, Şaziye Betül Özateş, Gözde Berk, Seyyit Talha Bedir, Abdullatif Köksal, Balkız Öztürk Başaran, Tunga Güngör, Arzucan Özgür

    Abstract: In this paper, we introduce the resources that we developed for Turkish dependency parsing, which include a novel manually annotated treebank (BOUN Treebank), along with the guidelines we adopted, and a new annotation tool (BoAT). The manual annotation process we employed was shaped and implemented by a team of four linguists and five Natural Language Processing (NLP) specialists. Decisions regard…

    Submitted 16 September, 2021; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Language Resource and Evaluation

  12. A Hybrid Approach to Dependency Parsing: Combining Rules and Morphology with Deep Learning

    Authors: Şaziye Betül Özateş, Arzucan Özgür, Tunga Güngör, Balkız Öztürk

    Abstract: Fully data-driven, deep learning-based models are usually designed as language-independent and have been shown to be successful for many natural language processing tasks. However, when the studied language is low-resourced and the amount of training data is insufficient, these models can benefit from the integration of natural language grammar-based information. We propose two approaches to depen…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: 25 pages, 7 figures

    ACM Class: I.2.7

  13. arXiv:2002.05606  [pdf, ps, other]

    cs.CL cs.IR

    Sentiment Analysis Using Averaged Weighted Word Vector Features

    Authors: Ali Erkan, Tunga Gungor

    Abstract: People use the world wide web heavily to share their experience with entities such as products, services, or travel destinations. Texts that provide online feedback in the form of reviews and comments are essential to make consumer decisions. These comments create a valuable source that may be used to measure satisfaction related to products or services. Sentiment analysis is the task of identifyi…

    Submitted 15 October, 2023; v1 submitted 13 February, 2020; originally announced February 2020.

  14. arXiv:2001.01269  [pdf, other]

    cs.CL

    Generating Word and Document Embeddings for Sentiment Analysis

    Authors: Cem Rıfkı Aydın, Tunga Güngör, Ali Erkan

    Abstract: Sentiments of words differ from one corpus to another. Inducing general sentiment lexicons for languages and using them cannot, in general, produce meaningful results for different domains. In this paper, we combine contextual and supervised information with the general semantic representations of words occurring in the dictionary. Contexts of words help us capture the domain-specific information…

    Submitted 7 December, 2020; v1 submitted 5 January, 2020; originally announced January 2020.

    Comments: Accepted and presented as a full paper at the 20th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2019), April 7-13, 2019, La Rochelle, France

    Journal ref: Springer LNCS Proceedings for CICLing 2019

  15. arXiv:1807.06683  [pdf, other]

    cs.CL

    Improving Named Entity Recognition by Jointly Learning to Disambiguate Morphological Tags

    Authors: Onur Güngör, Suzan Üsküdarlı, Tunga Güngör

    Abstract: Previous studies have shown that linguistic features of a word such as possession, genitive or other grammatical cases can be employed in word representations of a named entity recognition (NER) tagger to improve the performance for morphologically rich languages. However, these taggers require external morphological disambiguation (MD) tools to function which are hard to obtain or non-existent fo…

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: COLING 2018 (accepted)

    Journal ref: Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018). pp. 2082-2092

  16. arXiv:1706.00506  [pdf, other]

    cs.CL

    Morphological Embeddings for Named Entity Recognition in Morphologically Rich Languages

    Authors: Onur Gungor, Eray Yildiz, Suzan Uskudarli, Tunga Gungor

    Abstract: In this work, we present new state-of-the-art results of 93.59% and 79.59% for Turkish and Czech named entity recognition based on the model of (Lample et al., 2016). We contribute by proposing several schemes for representing the morphological analysis of a word in the context of named entity recognition. We show that a concatenation of this representation with the word and character embeddings…

    Submitted 1 June, 2017; originally announced June 2017.

    Comments: Working draft

  17. arXiv:1401.2663  [pdf]

    cs.CL

    Dictionary-Based Concept Mining: An Application for Turkish

    Authors: Cem Rıfkı Aydın, Ali Erkan, Tunga Güngör, Hidayet Takçı

    Abstract: In this study, a dictionary-based method is used to extract expressive concepts from documents. So far, there have been many studies concerning concept mining in English, but this area of study for Turkish, an agglutinative language, is still immature. We used dictionary instead of WordNet, a lexical database grouping words into synsets that is widely used for concept extraction. The dictionaries…

    Submitted 12 January, 2014; originally announced January 2014.

    Comments: 12 pages with 3 figures, to be published in "International Conference on Foundations of Computer Science & Technology (CST 2014), Zurich, Switzerland - January 2014 Proceedings, AIRCC"

    ACM Class: I.2.7