Skip to main content

Showing 1–10 of 10 results for author: Weeds, J

.
  1. arXiv:2303.08652  [pdf, other

    cs.CL cs.IR

    Automated Query Generation for Evidence Collection from Web Search Engines

    Authors: Nestor Prieto-Chavana, Julie Weeds, David Weir

    Abstract: It is widely accepted that so-called facts can be checked by searching for information on the Internet. This process requires a fact-checker to formulate a search query based on the fact and to present it to a search engine. Then, relevant and believable passages need to be identified in the search results before a decision is made. This process is carried out by sub-editors at many news and media… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  2. arXiv:2302.14828  [pdf, other

    cs.CL

    Automatic Scoring of Dream Reports' Emotional Content with Large Language Models

    Authors: Lorenzo Bertolini, Valentina Elce, Adriana Michalak, Giulio Bernardi, Julie Weeds

    Abstract: In the field of dream research, the study of dream content typically relies on the analysis of verbal reports provided by dreamers upon awakening from their sleep. This task is classically performed through manual scoring provided by trained annotators, at a great time expense. While a consistent body of work suggests that natural language processing (NLP) tools can support the automatic analysis… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

  3. arXiv:2210.05302  [pdf, other

    cs.CL

    Towards Structure-aware Paraphrase Identification with Phrase Alignment Using Sentence Encoders

    Authors: Qiwei Peng, David Weir, Julie Weeds

    Abstract: Previous works have demonstrated the effectiveness of utilising pre-trained sentence encoders based on their sentence representations for meaning comparison tasks. Though such representations are shown to capture hidden syntax structures, the direct similarity comparison between them exhibits weak sensitivity to word order and structural differences in given sentences. A single similarity score fu… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: COLING 2022 Oral

  4. arXiv:2106.01904  [pdf, other

    cs.CL

    Representing Syntax and Composition with Geometric Transformations

    Authors: Lorenzo Bertolini, Julie Weeds, David Weir, Qiwei Peng

    Abstract: The exploitation of syntactic graphs (SyGs) as a word's context has been shown to be beneficial for distributional semantic models (DSMs), both at the level of individual word representations and in deriving phrasal representations via composition. However, notwithstanding the potential performance benefit, the syntactically-aware DSMs proposed to date have huge numbers of parameters (compared to… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: to appear in Findings of ACL 2021

  5. arXiv:2005.01854  [pdf, other

    cs.CL cs.LG

    Data Augmentation for Hypernymy Detection

    Authors: Thomas Kober, Julie Weeds, Lorenzo Bertolini, David Weir

    Abstract: The automatic detection of hypernymy relationships represents a challenging problem in NLP. The successful application of state-of-the-art supervised approaches using distributed representations has generally been impeded by the limited availability of high quality training data. We have developed two novel data augmentation techniques which generate new training examples from existing ones. First… ▽ More

    Submitted 21 January, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: to appear at EACL 2021

  6. arXiv:2001.11268  [pdf, other

    cs.CL cs.LG

    Data Mining in Clinical Trial Text: Transformers for Classification and Question Answering Tasks

    Authors: Lena Schmidt, Julie Weeds, Julian P. T. Higgins

    Abstract: This research on data extraction methods applies recent advances in natural language processing to evidence synthesis based on medical texts. Texts of interest include abstracts of clinical trials in English and in multilingual contexts. The main focus is on information characterized via the Population, Intervention, Comparator, and Outcome (PICO) framework, but data extraction is not limited to t… ▽ More

    Submitted 30 January, 2020; originally announced January 2020.

    Journal ref: HEALTHINF 2020

  7. arXiv:1704.06692  [pdf, other

    cs.CL

    Improving Semantic Composition with Offset Inference

    Authors: Thomas Kober, Julie Weeds, Jeremy Reffin, David Weir

    Abstract: Count-based distributional semantic models suffer from sparsity due to unobserved but plausible co-occurrences in any text collection. This problem is amplified for models like Anchored Packed Trees (APTs), that take the grammatical type of a co-occurrence into account. We therefore introduce a novel form of distributional inference that exploits the rich type structure in APTs and infers missing… ▽ More

    Submitted 21 April, 2017; originally announced April 2017.

    Comments: to appear at ACL 2017 (short papers)

  8. arXiv:1702.06696  [pdf, other

    cs.CL

    One Representation per Word - Does it make Sense for Composition?

    Authors: Thomas Kober, Julie Weeds, John Wilkie, Jeremy Reffin, David Weir

    Abstract: In this paper, we investigate whether an a priori disambiguation of word senses is strictly necessary or whether the meaning of a word in context can be disambiguated through composition alone. We evaluate the performance of off-the-shelf single-vector and multi-sense vector models on a benchmark phrase similarity task and a novel task for word-sense discrimination. We find that single-sense vecto… ▽ More

    Submitted 22 February, 2017; originally announced February 2017.

    Comments: to appear at the EACL 2017 workshop on Sense, Concept and Entity Representations and their Applications

  9. arXiv:1608.07115  [pdf, other

    cs.CL

    Aligning Packed Dependency Trees: a theory of composition for distributional semantics

    Authors: David Weir, Julie Weeds, Jeremy Reffin, Thomas Kober

    Abstract: We present a new framework for compositional distributional semantics in which the distributional contexts of lexemes are expressed in terms of anchored packed dependency trees. We show that these structures have the potential to capture the full sentential contexts of a lexeme and provide a uniform basis for the composition of distributional knowledge in a way that captures both mutual disambigua… ▽ More

    Submitted 25 August, 2016; originally announced August 2016.

    Comments: To appear in Special issue of Computational Linguistics - Formal Distributional Semantics

  10. arXiv:1608.06794  [pdf, other

    cs.CL

    Improving Sparse Word Representations with Distributional Inference for Semantic Composition

    Authors: Thomas Kober, Julie Weeds, Jeremy Reffin, David Weir

    Abstract: Distributional models are derived from co-occurrences in a corpus, where only a small proportion of all possible plausible co-occurrences will be observed. This results in a very sparse vector space, requiring a mechanism for inferring missing knowledge. Most methods face this challenge in ways that render the resulting word representations uninterpretable, with the consequence that semantic compo… ▽ More

    Submitted 24 August, 2016; originally announced August 2016.

    Comments: To appear at EMNLP 2016