Showing 1–9 of 9 results for author: Alikaniotis, D

  1. arXiv:2405.14057  [pdf, other]

    cs.CL cs.AI

    Your Large Language Models Are Leaving Fingerprints

    Authors: Hope McGovern, Rickard Stureborg, Yoshi Suhara, Dimitris Alikaniotis

    Abstract: It has been shown that finetuned transformers and other supervised detectors effectively distinguish between human and machine-generated text in some situations (arXiv:2305.13242), but we find that even simple classifiers on top of n-gram and part-of-speech features can achieve very robust performance on both in- and out-of-domain data. To understand how this is possible, we analyze machine-generate…

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2405.01724  [pdf, other]

    cs.CL cs.AI

    Large Language Models are Inconsistent and Biased Evaluators

    Authors: Rickard Stureborg, Dimitris Alikaniotis, Yoshi Suhara

    Abstract: The zero-shot capability of Large Language Models (LLMs) has enabled highly flexible, reference-free metrics for various tasks, making LLM evaluators common tools in NLP. However, the robustness of these LLM evaluators remains relatively understudied; existing work mainly pursued optimal performance in terms of correlating LLM scores with human expert scores. In this paper, we conduct a series of…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 9 pages, 7 figures

    MSC Class: 68T50 (Primary) 68T01; 68T37; 91F20 (Secondary)

    ACM Class: I.2; I.2.7; I.7

  3. arXiv:2402.16472  [pdf, other]

    cs.CL cs.AI

    mEdIT: Multilingual Text Editing via Instruction Tuning

    Authors: Vipul Raheja, Dimitris Alikaniotis, Vivek Kulkarni, Bashar Alhafni, Dhruv Kumar

    Abstract: We introduce mEdIT, a multi-lingual extension to CoEdIT -- the recent state-of-the-art text editing models for writing assistance. mEdIT models are trained by fine-tuning multi-lingual large, pre-trained language models (LLMs) via instruction tuning. They are designed to take instructions from the user specifying the attributes of the desired text in the form of natural language instructions, such…

    Submitted 17 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to NAACL 2024 (Main). 23 pages, 8 tables, 11 figures

    ACM Class: I.2.7

  4. arXiv:2402.04677  [pdf, other]

    cs.CL

    Source Identification in Abstractive Summarization

    Authors: Yoshi Suhara, Dimitris Alikaniotis

    Abstract: Neural abstractive summarization models make summaries in an end-to-end manner, and little is known about how the source information is actually converted into summaries. In this paper, we define input sentences that contain essential information in the generated summary as $\textit{source sentences}$ and study how abstractive summaries are made by analyzing the source sentences. To this end, we a…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: EACL 2024

  5. arXiv:2010.02407  [pdf, other]

    cs.CL cs.AI cs.LG

    Adversarial Grammatical Error Correction

    Authors: Vipul Raheja, Dimitrios Alikaniotis

    Abstract: Recent works in Grammatical Error Correction (GEC) have leveraged the progress in Neural Machine Translation (NMT), to learn rewrites from parallel corpora of grammatically incorrect and corrected sentences, achieving state-of-the-art results. At the same time, Generative Adversarial Networks (GANs) have been successful in generating realistic texts across many different tasks by learning to direc…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 13 Pages, EMNLP 2020

  6. arXiv:1906.01733  [pdf, ps, other]

    cs.CL cs.LG cs.NE

    The Unreasonable Effectiveness of Transformer Language Models in Grammatical Error Correction

    Authors: Dimitrios Alikaniotis, Vipul Raheja

    Abstract: Recent work on Grammatical Error Correction (GEC) has highlighted the importance of language modeling in that it is certainly possible to achieve good performance by comparing the probabilities of the proposed edits. At the same time, advancements in language modeling have managed to generate linguistic output, which is almost indistinguishable from that of human-generated text. In this paper, we…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: 7 pages, 3 tables, accepted at the 14th Workshop on Innovative Use of NLP for Building Educational Applications

  7. arXiv:1606.09058  [pdf, other]

    cs.CL cs.LG

    A Distributional Semantics Approach to Implicit Language Learning

    Authors: Dimitrios Alikaniotis, John N. Williams

    Abstract: In the present paper we show that distributional information is particularly important when considering concept availability under implicit language learning conditions. Based on results from different behavioural experiments we argue that the implicit learnability of semantic regularities depends on the degree to which the relevant concept is reflected in language use. In our simulations, we trai…

    Submitted 29 June, 2016; originally announced June 2016.

    Comments: 5 pages, 7 figures, NetWords 2015

    ACM Class: I.5.1; I.2.6; I.2.7

  8. arXiv:1606.06996  [pdf, other]

    cs.CL

    The word entropy of natural languages

    Authors: Christian Bentz, Dimitrios Alikaniotis

    Abstract: The average uncertainty associated with words is an information-theoretic concept at the heart of quantitative and computational linguistics. The entropy has been established as a measure of this average uncertainty - also called average information content. We here use parallel texts of 21 languages to establish the number of tokens at which word entropies converge to stable values. These converg…

    Submitted 22 June, 2016; originally announced June 2016.

  9. arXiv:1606.04289  [pdf, other]

    cs.CL cs.LG cs.NE

    Automatic Text Scoring Using Neural Networks

    Authors: Dimitrios Alikaniotis, Helen Yannakoudakis, Marek Rei

    Abstract: Automated Text Scoring (ATS) provides a cost-effective and consistent alternative to human marking. However, in order to achieve good performance, the predictive features of the system need to be manually engineered by human experts. We introduce a model that forms word representations by learning the extent to which specific words contribute to the text's score. Using Long-Short Term Memory netwo…

    Submitted 16 June, 2016; v1 submitted 14 June, 2016; originally announced June 2016.

    Comments: 11 pages, 3 figures, 2 tables, ACL-2016

    ACM Class: I.5.1; I.2.6; I.2.7