Skip to main content

Showing 1–5 of 5 results for author: Outsios, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.00869  [pdf, other

    cs.CL

    GreekBART: The First Pretrained Greek Sequence-to-Sequence Model

    Authors: Iakovos Evdaimon, Hadi Abdine, Christos Xypolopoulos, Stamatis Outsios, Michalis Vazirgiannis, Giorgos Stamou

    Abstract: The era of transfer learning has revolutionized the fields of Computer Vision and Natural Language Processing, bringing powerful pretrained models with exceptional performance across a variety of tasks. Specifically, Natural Language Processing tasks have been dominated by transformer-based language models. In Natural Language Inference and Natural Language Generation tasks, the BERT model and its… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  2. arXiv:2112.00566  [pdf, ps, other

    cs.CL

    NLP Research and Resources at DaSciM, Ecole Polytechnique

    Authors: Hadi Abdine, Yanzhu Guo, Moussa Kamal Eddine, Giannis Nikolentzos, Stamatis Outsios, Guokan Shang, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: DaSciM (Data Science and Mining) part of LIX at Ecole Polytechnique, established in 2013 and since then producing research results in the area of large scale data analysis via methods of machine and deep learning. The group has been specifically active in the area of NLP and text mining with interesting results at methodological and resources level. Here follow our different contributions of inter… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  3. arXiv:1912.04965  [pdf, other

    cs.CL

    An Ensemble Method for Producing Word Representations focusing on the Greek Language

    Authors: Michalis Lioudakis, Stamatis Outsios, Michalis Vazirgiannis

    Abstract: In this paper we present a new ensemble method, Continuous Bag-of-Skip-grams (CBOS), that produces high-quality word representations putting emphasis on the modern Greek language. The CBOS method combines the pioneering approaches for learning word representations: Continuous Bag-of-Words (CBOW) and Continuous Skip-gram. These methods are compared through intrinsic and extrinsic evaluation tasks o… ▽ More

    Submitted 11 November, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

  4. arXiv:1904.04032  [pdf, ps, other

    cs.CL

    Evaluation of Greek Word Embeddings

    Authors: Stamatis Outsios, Christos Karatsalos, Konstantinos Skianis, Michalis Vazirgiannis

    Abstract: Since word embeddings have been the most popular input for many NLP tasks, evaluating their quality is of critical importance. Most research efforts are focusing on English word embeddings. This paper addresses the problem of constructing and evaluating such models for the Greek language. We created a new word analogy corpus considering the original English Word2vec word analogy corpus and some sp… ▽ More

    Submitted 4 April, 2020; v1 submitted 8 April, 2019; originally announced April 2019.

  5. arXiv:1810.06694  [pdf, ps, other

    cs.CL

    Word Embeddings from Large-Scale Greek Web Content

    Authors: Stamatis Outsios, Konstantinos Skianis, Polykarpos Meladianos, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: Word embeddings are undoubtedly very useful components in many NLP tasks. In this paper, we present word embeddings and other linguistic resources trained on the largest to date digital Greek language corpus. We also present a live web tool for testing the Greek word embeddings, by offering "analogy", "similarity score" and "most similar words" functions. Through our explorer, one could interact w… ▽ More

    Submitted 26 October, 2018; v1 submitted 8 October, 2018; originally announced October 2018.