Skip to main content

Showing 1–6 of 6 results for author: Micklem, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.07309  [pdf, other

    cs.LG cs.CL stat.ML

    HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: Hypergraphs are characterized by complex topological structure, representing higher-order interactions among multiple entities through hyperedges. Lately, hypergraph-based deep learning methods to learn informative data representations for the problem of node classification on text-attributed hypergraphs have garnered increasing research attention. However, existing methods struggle to simultaneou… ▽ More

    Submitted 25 May, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures

  2. arXiv:2312.04193  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Language Model Knowledge Distillation for Efficient Question Answering in Spanish

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: Recent advances in the development of pre-trained Spanish language models has led to significant progress in many Natural Language Processing (NLP) tasks, such as question answering. However, the lack of efficient models imposes a barrier for the adoption of such models in resource-constrained environments. Therefore, smaller distilled models for the Spanish language could be proven to be highly s… ▽ More

    Submitted 16 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: ICLR 2024 Tiny Paper (6 pages, 2 tables)

  3. arXiv:2310.18376  [pdf, other

    cs.CL cs.LG

    SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: In recent years, the task of text-to-SQL translation, which converts natural language questions into executable SQL queries, has gained significant attention for its potential to democratize data access. Despite its promise, challenges such as adapting to unseen databases and aligning natural language with SQL syntax have hindered widespread adoption. To overcome these issues, we introduce SQLform… ▽ More

    Submitted 27 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 13 pages, 4 figures, 11 tables

  4. arXiv:2309.16540  [pdf, other

    cs.CL cs.LG stat.ML

    Unsupervised Pretraining for Fact Verification by Language Model Distillation

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: Fact verification aims to verify a claim using evidence from a trustworthy knowledge base. To address this challenge, algorithms must produce features for every claim that are both semantically meaningful, and compact enough to find a semantic alignment with the source information. In contrast to previous work, which tackled the alignment problem by learning over annotated corpora of claims and th… ▽ More

    Submitted 6 March, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: ICLR 2024 Camera Ready

  5. arXiv:2111.13786  [pdf, other

    cs.LG cs.AI

    Learning from learning machines: a new generation of AI technology to meet the needs of science

    Authors: Luca Pion-Tonachini, Kristofer Bouchard, Hector Garcia Martin, Sean Peisert, W. Bradley Holtz, Anil Aswani, Dipankar Dwivedi, Haruko Wainwright, Ghanshyam Pilania, Benjamin Nachman, Babetta L. Marrone, Nicola Falco, Prabhat, Daniel Arnold, Alejandro Wolf-Yadlin, Sarah Powers, Sharlee Climer, Quinn Jackson, Ty Carlson, Michael Sohn, Petrus Zwart, Neeraj Kumar, Amy Justice, Claire Tomlin, Daniel Jacobson , et al. (11 additional authors not shown)

    Abstract: We outline emerging opportunities and challenges to enhance the utility of AI for scientific discovery. The distinct goals of AI for industry versus the goals of AI for science create tension between identifying patterns in data versus discovering patterns in the world from data. If we address the fundamental challenges associated with "bridging the gap" between domain-driven scientific models and… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

  6. arXiv:2104.07010  [pdf, other

    cs.CL cs.DB cs.LG

    Translating synthetic natural language to database queries: a polyglot deep learning framework

    Authors: Adrián Bazaga, Nupur Gunwant, Gos Micklem

    Abstract: The number of databases as well as their size and complexity is increasing. This creates a barrier to use especially for non-experts, who have to come to grips with the nature of the data, the way it has been represented in the database, and the specific query languages or user interfaces by which data are accessed. These difficulties worsen in research settings, where it is common to work with ma… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.