Skip to main content

Showing 1–9 of 9 results for author: Gkatzia, D

.
  1. arXiv:2305.01633  [pdf, other

    cs.CL

    Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

    Authors: Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M. Alonso-Moral, Mohammad Arvan, Anouck Braggaar, Mark Cieliebak, Elizabeth Clark, Kees van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Mingqi Gao, Albert Gatt, Dimitra Gkatzia, Javier González-Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D. Kelleher, Filip Klubicka, Emiel Krahmer, Huiyuan Lai , et al. (17 additional authors not shown)

    Abstract: We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible. We present our results and findings, which include that just 13\% of papers had (i) sufficiently low barriers to reproduction, and (ii) enough obtainable information, to be considered for reproduction, a… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 5 pages plus appendix, 4 tables, 1 figure. To appear at "Workshop on Insights from Negative Results in NLP" (co-located with EACL2023). Updated author list and acknowledgements

    MSC Class: 68 ACM Class: I.2.7

  2. arXiv:2206.11249  [pdf, other

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di **, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter , et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an… ▽ More

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  3. arXiv:2204.09391  [pdf, other

    cs.CL

    You Are What You Write: Preserving Privacy in the Era of Large Language Models

    Authors: Richard Plant, Valerio Giuffrida, Dimitra Gkatzia

    Abstract: Large scale adoption of large language models has introduced a new era of convenient knowledge transfer for a slew of natural language processing tasks. However, these models also run the risk of undermining user trust by exposing unwanted information about the data subjects, which may be extracted by a malicious party, e.g. through adversarial attacks. We present an empirical investigation into t… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

  4. arXiv:2204.01061  [pdf, other

    cs.CL cs.AI

    Task2Dial: A Novel Task and Dataset for Commonsense enhanced Task-based Dialogue Grounded in Documents

    Authors: Carl Strathearn, Dimitra Gkatzia

    Abstract: This paper proposes a novel task on commonsense-enhanced task-based dialogue grounded in documents and describes the Task2Dial dataset, a novel dataset of document-grounded task-based dialogues, where an Information Giver (IG) provides instructions (by consulting a document) to an Information Follower (IF), so that the latter can successfully complete the task. In this unique setting, the IF can a… ▽ More

    Submitted 3 April, 2022; originally announced April 2022.

    Journal ref: Proceedings of The Fourth International Conference on Natural Language and Speech Processing (ICNLSP 2021)

  5. arXiv:2108.12318  [pdf, other

    cs.CL cs.LG

    CAPE: Context-Aware Private Embeddings for Private Language Learning

    Authors: Richard Plant, Dimitra Gkatzia, Valerio Giuffrida

    Abstract: Deep learning-based language models have achieved state-of-the-art results in a number of applications including sentiment analysis, topic labelling, intent classification and others. Obtaining text representations or embeddings using these models presents the possibility of encoding personally identifiable information learned from language and context cues that may present a risk to reputation or… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Accepted into EMNLP21 main conference

  6. arXiv:2108.01182  [pdf, other

    cs.CL

    Underreporting of errors in NLG output, and what to do about it

    Authors: Emiel van Miltenburg, Miruna-Adriana Clinciu, Ondřej Dušek, Dimitra Gkatzia, Stephanie Inglis, Leo Leppänen, Saad Mahamood, Emma Manning, Stephanie Schoch, Craig Thomson, Luou Wen

    Abstract: We observe a severe under-reporting of the different kinds of errors that Natural Language Generation systems make. This is a problem, because mistakes are an important indicator of where systems should still be improved. If authors only report overall performance metrics, the research community is left in the dark about the specific weaknesses that are exhibited by `state-of-the-art' research. Ne… ▽ More

    Submitted 8 August, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Prefinal version, accepted for publication in the Proceedings of the 14th International Conference on Natural Language Generation (INLG 2021, Aberdeen). Comments welcome

  7. arXiv:1610.08375  [pdf, other

    cs.CL

    Content Selection in Data-to-Text Systems: A Survey

    Authors: Dimitra Gkatzia

    Abstract: Data-to-text systems are powerful in generating reports from data automatically and thus they simplify the presentation of complex data. Rather than presenting data using visualisation techniques, data-to-text systems use natural (human) language, which is the most common way for human-human communication. In addition, data-to-text systems can adapt their output content to users' preferences, back… ▽ More

    Submitted 26 October, 2016; originally announced October 2016.

  8. arXiv:1606.03254  [pdf, other

    cs.CL cs.AI

    Natural Language Generation enhances human decision-making with uncertain information

    Authors: Dimitra Gkatzia, Oliver Lemon, Verena Rieser

    Abstract: Decision-making is often dependent on uncertain data, e.g. data associated with confidence scores or probabilities. We present a comparison of different information presentations for uncertain data and, for the first time, measure their effects on human decision-making. We show that the use of Natural Language Generation (NLG) improves decision-making under uncertainty, compared to state-of-the-ar… ▽ More

    Submitted 15 August, 2016; v1 submitted 10 June, 2016; originally announced June 2016.

    Comments: 54th annual meeting of the Association for Computational Linguistics (ACL), Berlin 2016

  9. arXiv:1506.02922  [pdf, other

    cs.CL cs.AI

    An Ensemble method for Content Selection for Data-to-text Systems

    Authors: Dimitra Gkatzia, Helen Hastie

    Abstract: We present a novel approach for automatic report generation from time-series data, in the context of student feedback generation. Our proposed methodology treats content selection as a multi-label classification (MLC) problem, which takes as input time-series data (students' learning data) and outputs a summary of these data (feedback). Unlike previous work, this method considers all data simultan… ▽ More

    Submitted 9 June, 2015; originally announced June 2015.

    Comments: 3 pages, 2 figures, 1st International Workshop on Data-to-text Generation