Skip to main content

Showing 1–18 of 18 results for author: Kochkina, E

.
  1. arXiv:2403.18152  [pdf, other

    cs.CL

    Large Language Models as Financial Data Annotators: A Study on Effectiveness and Efficiency

    Authors: Toyin Aguda, Suchetha Siddagangappa, Elena Kochkina, Simerjot Kaur, Dongsheng Wang, Charese Smiley, Sameena Shah

    Abstract: Collecting labeled datasets in finance is challenging due to scarcity of domain experts and higher cost of employing them. While Large Language Models (LLMs) have demonstrated remarkable performance in data annotation tasks on general domain datasets, their effectiveness on domain specific datasets remains underexplored. To address this gap, we investigate the potential of LLMs as efficient data a… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

  2. arXiv:2312.03523  [pdf, other

    cs.CL

    Sig-Networks Toolkit: Signature Networks for Longitudinal Language Modelling

    Authors: Talia Tseriotou, Ryan Sze-Yin Chan, Adam Tsakalidis, Iman Munire Bilal, Elena Kochkina, Terry Lyons, Maria Liakata

    Abstract: We present an open-source, pip installable toolkit, Sig-Networks, the first of its kind for longitudinal language modelling. A central focus is the incorporation of Signature-based Neural Network models, which have recently shown success in temporal tasks. We apply and extend published research providing a full suite of signature-based models. Their components can be used as PyTorch building block… ▽ More

    Submitted 6 February, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: To appear in EACL 2024: System Demonstrations

  3. arXiv:2305.02224  [pdf, ps, other

    cs.HC

    Some Observations on Fact-Checking Work with Implications for Computational Support

    Authors: Rob Procter, Miguel Arana-Catania, Yulan He, Maria Liakata, Arkaitz Zubiaga, Elena Kochkina, Runcong Zhao

    Abstract: Social media and user-generated content (UGC) have become increasingly important features of journalistic work in a number of different ways. However, the growth of misinformation means that news organisations have had devote more and more resources to determining its veracity and to publishing corrections if it is found to be misleading. In this work, we present the results of interviews with eig… ▽ More

    Submitted 6 July, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: 11 pages. International AAAI Conference on Web and Social Media, Mediate 2023: News Media and Computational Journalism Workshop

    ACM Class: H.1.2; H.5.2

  4. arXiv:2303.01241  [pdf, other

    cs.CL cs.LG

    PANACEA: An Automated Misinformation Detection System on COVID-19

    Authors: Runcong Zhao, Miguel Arana-Catania, Lixing Zhu, Elena Kochkina, Lin Gui, Arkaitz Zubiaga, Rob Procter, Maria Liakata, Yulan He

    Abstract: In this demo, we introduce a web-based misinformation detection system PANACEA on COVID-19 related claims, which has two modules, fact-checking and rumour detection. Our fact-checking module, which is supported by novel natural language inference methods with a self-attention network, outperforms state-of-the-art approaches. It is also able to give automated veracity assessment and ranked supporti… ▽ More

    Submitted 28 February, 2023; originally announced March 2023.

  5. arXiv:2207.13970  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    PHEMEPlus: Enriching Social Media Rumour Verification with External Evidence

    Authors: John Dougrez-Lewis, Elena Kochkina, M. Arana-Catania, Maria Liakata, Yulan He

    Abstract: Work on social media rumour verification utilises signals from posts, their propagation and users involved. Other lines of work target identifying and fact-checking claims based on information from Wikipedia, or trustworthy news articles without considering social media context. However works combining the information from social media with external evidence from the wider web are lacking. To faci… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: 10 pages, 1 figure, 5 tables, presented in the Fifth Fact Extraction and VERification Workshop (FEVER). 2022

  6. arXiv:2205.05435  [pdf

    cs.CL cs.AI

    Building for Tomorrow: Assessing the Temporal Persistence of Text Classifiers

    Authors: Rabab Alkhalifa, Elena Kochkina, Arkaitz Zubiaga

    Abstract: Performance of text classification models tends to drop over time due to changes in data, which limits the lifetime of a pretrained model. Therefore an ability to predict a model's ability to persist over time can help design models that can be effectively used over a longer period of time. In this paper, we provide a thorough discussion into the problem, establish an evaluation setup for the task… ▽ More

    Submitted 19 November, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

  7. arXiv:2205.02596  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims

    Authors: M. Arana-Catania, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata, Rob Procter, Yulan He

    Abstract: We present a comprehensive work on automated veracity assessment from dataset creation to develo** novel methods based on Natural Language Inference (NLI), focusing on misinformation related to the COVID-19 pandemic. We first describe the construction of the novel PANACEA dataset consisting of heterogeneous claims on COVID-19 and their respective information sources. The dataset construction inc… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: 16 pages, 1 figure, 8 tables, presented in NAACL 2022

  8. Opinions are Made to be Changed: Temporally Adaptive Stance Classification

    Authors: Rabab Alkhalifa, Elena Kochkina, Arkaitz Zubiaga

    Abstract: Given the rapidly evolving nature of social media and people's views, word usage changes over time. Consequently, the performance of a classifier trained on old textual data can drop dramatically when tested on newer data. While research in stance classification has advanced in recent years, no effort has been invested in making these classifiers have persistent performance over time. To study thi… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

  9. arXiv:2102.08366  [pdf, other

    cs.CL cs.IR cs.LG

    Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies

    Authors: Gabriele Pergola, Elena Kochkina, Lin Gui, Maria Liakata, Yulan He

    Abstract: Biomedical question-answering (QA) has gained increased attention for its capability to provide users with high-quality information from a vast scientific literature. Although an increasing number of biomedical QA datasets has been recently made available, those resources are still rather limited and expensive to produce. Transfer learning via pre-trained language models (LMs) has been shown as a… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: EACL 2021 - Short Paper - European Chapter of the Association for Computational Linguistics

  10. arXiv:2008.13160  [pdf, other

    cs.CL cs.LG cs.SI

    QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

    Authors: Rabab Alkhalifa, Theodore Yoong, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata

    Abstract: This paper describes the participation of the QMUL-SDS team for Task 1 of the CLEF 2020 CheckThat! shared task. The purpose of this task is to determine the check-worthiness of tweets about COVID-19 to identify and prioritise tweets that need fact-checking. The overarching aim is to further support ongoing efforts to protect the public from fake news and help people find reliable information. We d… ▽ More

    Submitted 30 August, 2020; originally announced August 2020.

  11. arXiv:2005.07174  [pdf, other

    cs.CL cs.LG

    Estimating predictive uncertainty for rumour verification models

    Authors: Elena Kochkina, Maria Liakata

    Abstract: The inability to correctly resolve rumours circulating online can have harmful real-world consequences. We present a method for incorporating model and data uncertainty estimates into natural language processing models for automatic rumour verification. We show that these estimates can be used to filter out model predictions likely to be erroneous, so that these difficult instances can be prioriti… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: Accepted to the Annual Conference of the Association for Computational Linguistics (ACL) 2020

  12. arXiv:2003.11563  [pdf, other

    cs.CL cs.LG stat.ML

    Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data

    Authors: Harish Tayyar Madabushi, Elena Kochkina, Michael Castelle

    Abstract: The automatic identification of propaganda has gained significance in recent years due to technological and social changes in the way news is generated and consumed. That this task can be addressed effectively using BERT, a powerful new architecture which can be fine-tuned for text classification tasks, is not surprising. However, propaganda detection, like other tasks that deal with news document… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: NLP4IF 2019

  13. arXiv:1809.06683  [pdf, other

    cs.CL

    RumourEval 2019: Determining Rumour Veracity and Support for Rumours

    Authors: Genevieve Gorrell, Kalina Bontcheva, Leon Derczynski, Elena Kochkina, Maria Liakata, Arkaitz Zubiaga

    Abstract: This is the proposal for RumourEval-2019, which will run in early 2019 as part of that year's SemEval event. Since the first RumourEval shared task in 2017, interest in automated claim validation has greatly increased, as the dangers of "fake news" have become a mainstream concern. Yet automated support for rumour checking remains in its infancy. For this reason, it is important that a shared task… ▽ More

    Submitted 18 September, 2018; originally announced September 2018.

  14. arXiv:1806.03713  [pdf, other

    cs.CL

    All-in-one: Multi-task Learning for Rumour Verification

    Authors: Elena Kochkina, Maria Liakata, Arkaitz Zubiaga

    Abstract: Automatic resolution of rumours is a challenging task that can be broken down into smaller components that make up a pipeline, including rumour detection, rumour tracking and stance classification, leading to the final outcome of determining the veracity of a rumour. In previous work, these steps in the process of rumour verification have been developed as separate components where the output of o… ▽ More

    Submitted 10 June, 2018; originally announced June 2018.

  15. Discourse-Aware Rumour Stance Classification in Social Media Using Sequential Classifiers

    Authors: Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik, Kalina Bontcheva, Trevor Cohn, Isabelle Augenstein

    Abstract: Rumour stance classification, defined as classifying the stance of specific social media posts into one of supporting, denying, querying or commenting on an earlier post, is becoming of increasing interest to researchers. While most previous work has focused on using individual tweets as classifier inputs, here we report on the performance of sequential classifiers that exploit the discourse featu… ▽ More

    Submitted 6 December, 2017; originally announced December 2017.

    Journal ref: Information Processing & Management, Volume 54, Issue 2, March 2018, Pages 273-290

  16. arXiv:1704.07221  [pdf, other

    cs.CL cs.AI

    Turing at SemEval-2017 Task 8: Sequential Approach to Rumour Stance Classification with Branch-LSTM

    Authors: Elena Kochkina, Maria Liakata, Isabelle Augenstein

    Abstract: This paper describes team Turing's submission to SemEval 2017 RumourEval: Determining rumour veracity and support for rumours (SemEval 2017 Task 8, Subtask A). Subtask A addresses the challenge of rumour stance classification, which involves identifying the attitude of Twitter users towards the truthfulness of the rumour they are discussing. Stance classification is considered to be an important s… ▽ More

    Submitted 24 April, 2017; originally announced April 2017.

    Comments: SemEval 2017 RumourEval: Determining rumour veracity and support for rumours (SemEval 2017 Task 8, Subtask A)

  17. arXiv:1609.09028  [pdf, other

    cs.CL cs.SI

    Stance Classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations

    Authors: Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik

    Abstract: Rumour stance classification, the task that determines if each tweet in a collection discussing a rumour is supporting, denying, questioning or simply commenting on the rumour, has been attracting substantial interest. Here we introduce a novel approach that makes use of the sequence of transitions observed in tree-structured conversation threads in Twitter. The conversation threads are formed by… ▽ More

    Submitted 11 October, 2016; v1 submitted 28 September, 2016; originally announced September 2016.

    Comments: COLING 2016

  18. arXiv:1305.5720  [pdf

    astro-ph.CO gr-qc

    The Gravitational Universe

    Authors: The eLISA Consortium, :, P. Amaro Seoane, S. Aoudia, H. Audley, G. Auger, S. Babak, J. Baker, E. Barausse, S. Barke, M. Bassan, V. Beckmann, M. Benacquista, P. L. Bender, E. Berti, P. Binétruy, J. Bogenstahl, C. Bonvin, D. Bortoluzzi, N. C. Brause, J. Brossard, S. Buchman, I. Bykov, J. Camp, C. Caprini , et al. (136 additional authors not shown)

    Abstract: The last century has seen enormous progress in our understanding of the Universe. We know the life cycles of stars, the structure of galaxies, the remnants of the big bang, and have a general understanding of how the Universe evolved. We have come remarkably far using electromagnetic radiation as our tool for observing the Universe. However, gravity is the engine behind many of the processes in th… ▽ More

    Submitted 24 May, 2013; originally announced May 2013.

    Comments: 20 pages; submitted to the European Space Agency on May 24th, 2013 for the L2/L3 selection of ESA's Cosmic Vision program