Skip to main content

Showing 1–9 of 9 results for author: Bazaga, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04501  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    FLUID-LLM: Learning Computational Fluid Dynamics with Spatiotemporal-aware Large Language Models

    Authors: Max Zhu, Adrián Bazaga, Pietro Liò

    Abstract: Learning computational fluid dynamics (CFD) traditionally relies on computationally intensive simulations of the Navier-Stokes equations. Recently, large language models (LLMs) have shown remarkable pattern recognition and reasoning abilities in natural language processing (NLP) and computer vision (CV). However, these models struggle with the complex geometries inherent in fluid dynamics. We intr… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2406.01805  [pdf, other

    cs.LG cs.AI

    TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context Subsetting

    Authors: Andrei Margeloiu, Adrián Bazaga, Nikola Simidjievski, Pietro Liò, Mateja Jamnik

    Abstract: Tabular data is prevalent in many critical domains, yet it is often challenging to acquire in large quantities. This scarcity usually results in poor performance of machine learning models on such data. Data augmentation, a common strategy for performance improvement in vision and language tasks, typically underperforms for tabular data due to the lack of explicit symmetries in the input space. To… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2402.07309  [pdf, other

    cs.LG cs.CL stat.ML

    HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: Hypergraphs are characterized by complex topological structure, representing higher-order interactions among multiple entities through hyperedges. Lately, hypergraph-based deep learning methods to learn informative data representations for the problem of node classification on text-attributed hypergraphs have garnered increasing research attention. However, existing methods struggle to simultaneou… ▽ More

    Submitted 25 May, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures

  4. arXiv:2312.04193  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Language Model Knowledge Distillation for Efficient Question Answering in Spanish

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: Recent advances in the development of pre-trained Spanish language models has led to significant progress in many Natural Language Processing (NLP) tasks, such as question answering. However, the lack of efficient models imposes a barrier for the adoption of such models in resource-constrained environments. Therefore, smaller distilled models for the Spanish language could be proven to be highly s… ▽ More

    Submitted 16 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: ICLR 2024 Tiny Paper (6 pages, 2 tables)

  5. arXiv:2310.18376  [pdf, other

    cs.CL cs.LG

    SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: In recent years, the task of text-to-SQL translation, which converts natural language questions into executable SQL queries, has gained significant attention for its potential to democratize data access. Despite its promise, challenges such as adapting to unseen databases and aligning natural language with SQL syntax have hindered widespread adoption. To overcome these issues, we introduce SQLform… ▽ More

    Submitted 27 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 13 pages, 4 figures, 11 tables

  6. arXiv:2309.16540  [pdf, other

    cs.CL cs.LG stat.ML

    Unsupervised Pretraining for Fact Verification by Language Model Distillation

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: Fact verification aims to verify a claim using evidence from a trustworthy knowledge base. To address this challenge, algorithms must produce features for every claim that are both semantically meaningful, and compact enough to find a semantic alignment with the source information. In contrast to previous work, which tackled the alignment problem by learning over annotated corpora of claims and th… ▽ More

    Submitted 6 March, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: ICLR 2024 Camera Ready

  7. arXiv:2104.07010  [pdf, other

    cs.CL cs.DB cs.LG

    Translating synthetic natural language to database queries: a polyglot deep learning framework

    Authors: Adrián Bazaga, Nupur Gunwant, Gos Micklem

    Abstract: The number of databases as well as their size and complexity is increasing. This creates a barrier to use especially for non-experts, who have to come to grips with the nature of the data, the way it has been represented in the database, and the specific query languages or user interfaces by which data are accessed. These difficulties worsen in research settings, where it is common to work with ma… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

  8. arXiv:1901.11074  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    A Convolutional Neural Network for the Automatic Diagnosis of Collagen VI related Muscular Dystrophies

    Authors: Adrián Bazaga, Mònica Roldán, Carmen Badosa, Cecilia Jiménez-Mallebrera, Josep M. Porta

    Abstract: The development of machine learning systems for the diagnosis of rare diseases is challenging mainly due the lack of data to study them. Despite this challenge, this paper proposes a system for the Computer Aided Diagnosis (CAD) of low-prevalence, congenital muscular dystrophies from confocal microscopy images. The proposed CAD system relies on a Convolutional Neural Network (CNN) which performs a… ▽ More

    Submitted 30 January, 2019; originally announced January 2019.

    Comments: Submitted for review to Expert Systems With Applications

  9. arXiv:1804.11312  [pdf, other

    cs.DC

    Performance Evaluation of an Algorithm-based Asynchronous Checkpoint-Restart Fault Tolerant Application Using Mixed MPI/GPI-2

    Authors: Adrian Bazaga, Michal Pitonak

    Abstract: One of the hardest challenges of the current Big Data landscape is the lack of ability to process huge volumes of information in an acceptable time. The goal of this work, is to ascertain if it is useful to use typical Big Data tools to solve High Performance Computing problems, by exploring and comparing a distributed computing framework implemented on a commodity cluster architecture: the experi… ▽ More

    Submitted 6 May, 2018; v1 submitted 30 April, 2018; originally announced April 2018.

    Comments: Submitted to conference EuroMPI/USA'18