Skip to main content

Showing 1–50 of 399 results for author: Freitas, A

.
  1. arXiv:2407.02006  [pdf, other

    hep-ph hep-ex

    The three-loop single-mass heavy flavor corrections to deep-inelastic scattering

    Authors: J. Ablinger, A. Behring, J. Blümlein, A. De Freitas, A. von Manteuffel, C. Schneider, K. Schoenwald

    Abstract: We report on the status of the calculation of the massive Wilson coefficients and operator matrix elements for deep-inelastic scatterung to three-loop order. We discuss both the unpolarized and the polarized case, for which all the single-mass and nearly all two-mass contributions have been calculated. Numerical results on the structure function $F_2(x,Q^2)$ are presented. In the polarized case, w… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 12 pages

    Report number: CERN-TH-2024-100, ZU-TH 31/24, RISC Report number 24-04, PoS (LL2024) 047, DESY-24-096,

  2. arXiv:2406.18626  [pdf, other

    q-bio.QM cs.AI cs.CL

    An LLM-based Knowledge Synthesis and Scientific Reasoning Framework for Biomedical Discovery

    Authors: Oskar Wysocki, Magdalena Wysocka, Danilo Carvalho, Alex Teodor Bogatu, Danilo Miranda Gusicuma, Maxime Delmas, Harriet Unsworth, Andre Freitas

    Abstract: We present BioLunar, developed using the Lunar framework, as a tool for supporting biological analyses, with a particular emphasis on molecular-level evidence enrichment for biomarker discovery in oncology. The platform integrates Large Language Models (LLMs) to facilitate complex scientific reasoning across distributed evidence spaces, enhancing the capability for harmonizing and reasoning over h… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: accepted for ACL 2024 System Demonstration Track

  3. arXiv:2406.17837  [pdf, other

    cs.LG cs.AI

    Transformer Normalisation Layers and the Independence of Semantic Subspaces

    Authors: Stephen Menary, Samuel Kaski, Andre Freitas

    Abstract: Recent works have shown that transformers can solve contextual reasoning tasks by internally executing computational graphs called circuits. Circuits often use attention to logically match information from subspaces of the representation, e.g. using position-in-sequence to identify the previous token. In this work, we consider a semantic subspace to be any independent subspace of the latent repres… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.14807  [pdf, other

    math.DS math.PR

    Multivariate extreme values for dynamical systems

    Authors: Romain Aimino, Ana Cristina Moreira Freitas, Jorge Milhazes Freitas, Mike Todd

    Abstract: We establish a theory for multivariate extreme value analysis of dynamical systems. Namely, we provide conditions adapted to the dynamical setting which enable the study of dependence between extreme values of the components of $\R^d$-valued observables evaluated along the orbits of the systems. We study this cross-sectional dependence, which results from the combination of a spatial and a tempora… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    MSC Class: 37A50; 37A25; 37B20; 60G70; 62H05

  5. arXiv:2406.09898  [pdf, other

    cs.LG

    Positive-Unlabelled Learning for Identifying New Candidate Dietary Restriction-related Genes among Ageing-related Genes

    Authors: Jorge Paz-Ruza, Alex A. Freitas, Amparo Alonso-Betanzos, Bertha Guijarro-Berdiñas

    Abstract: Dietary Restriction (DR) is one of the most popular anti-ageing interventions, prompting exhaustive research into genes associated with its mechanisms. Recently, Machine Learning (ML) has been explored to identify potential DR-related genes among ageing-related genes, aiming to minimize costly wet lab experiments needed to expand our knowledge on DR. However, to train a model from positive (DR-rel… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  6. arXiv:2405.17723  [pdf, other

    cs.DB

    TableDC: Deep Clustering for Tabular Data

    Authors: Hafiz Tayyab Rauf, Andre Freitas, Norman W. Paton

    Abstract: Deep clustering (DC), a fusion of deep representation learning and clustering, has recently demonstrated positive results in data science, particularly text processing and computer vision. However, joint optimization of feature learning and data distribution in the multi-dimensional space is domain-specific, so existing DC methods struggle to generalize to other application domains (such as data i… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  7. arXiv:2405.17312  [pdf, ps, other

    math.DG math.AP

    Rigidity results for Serrin's overdetermined problems in Riemannian manifolds

    Authors: Maria Andrade, Allan Freitas, Diego A. Marín

    Abstract: In this work, we are interested in studying Serrin's overdetermined problems in Riemannian manifolds. For manifolds endowed with a conformal vector field, we prove a Pohozoaev-type identity to show a Serrin's type rigidity result using the P-function approach introduced by Weinberger. We proceed with a conformal change to achieve this goal, starting from a geometric Pohozaev identity due to Schoen… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Comments are welcome!

  8. arXiv:2405.01379  [pdf, other

    cs.CL

    Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving

    Authors: Xin Quan, Marco Valentino, Louise A. Dennis, André Freitas

    Abstract: Natural language explanations have become a proxy for evaluating explainable and multi-step Natural Language Inference (NLI) models. However, assessing the validity of explanations for NLI is challenging as it typically involves the crowd-sourcing of apposite datasets, a process that is time-consuming and prone to logical errors. To address existing limitations, this paper investigates the verific… ▽ More

    Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  9. arXiv:2405.00402  [pdf, other

    cs.CL

    Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models

    Authors: Leonardo Ranaldi, Andrè Freitas

    Abstract: The alignments of reasoning abilities between smaller and larger Language Models are largely conducted via Supervised Fine-Tuning (SFT) using demonstrations generated from robust Large Language Models (LLMs). Although these approaches deliver more performant models, they do not show sufficiently strong generalization ability as the training only relies on the provided demonstrations. In this pap… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  10. arXiv:2404.18384  [pdf, other

    cs.CL

    Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions

    Authors: Jordan Meadows, Tamsin James, Andre Freitas

    Abstract: Language models can hallucinate when performing complex and detailed mathematical reasoning. Physics provides a rich domain for assessing mathematical reasoning capabilities where physical context imbues the use of symbols which needs to satisfy complex semantics (\textit{e.g.,} units, tensorial order), leading to instances where inference may be algebraically coherent, yet unphysical. In this wor… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  11. arXiv:2404.12197  [pdf, ps, other

    math.DG

    A congruence theorem for compact embedded hypersurfaces in $\mathbb{S}^{n+1}_+$

    Authors: Allan Freitas, Felippe Guimarães

    Abstract: We prove a codimension reduction and congruence theorem for compact $n$-dimensional submanifolds of $\mathbb{S}^{n+p}$ that admit a mean convex isometric embedding into $\mathbb{S}^{n+1}_+$ using a Reilly type formula for space forms.

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Comments are welcome!

    MSC Class: 53A07; 53C42

  12. arXiv:2404.09369  [pdf, ps, other

    math.DG math.AP

    Perelman singular manifolds

    Authors: Márcio Batista, Allan Freitas, Márcio Santos

    Abstract: On a Riemannian manifold with a smooth function $f: M\to \mathbb{R}$, we consider the linearization of the Perelman scalar curvature $\mathcal{R}$ and its $L^2$-formal adjoint operator $δ\mathcal{R}^*$. A manifold endowed with a metric $g$ whose operator $δ\mathcal{R}^*$ has a nontrivial kernel is called a Perelman singular manifold. In this paper, we present examples and apply general maximum pri… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Comments are welcome!

    MSC Class: 53C21; 53C15; 58JXX

  13. arXiv:2404.04963  [pdf, other

    cs.CL cs.AI

    SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials

    Authors: Mael Jullien, Marco Valentino, André Freitas

    Abstract: Large Language Models (LLMs) are at the forefront of NLP achievements but fall short in dealing with shortcut learning, factual inconsistency, and vulnerability to adversarial inputs.These shortcomings are especially critical in medical contexts, where they can misrepresent actual model capabilities. Addressing this, we present SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Cl… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  14. arXiv:2404.02625  [pdf, other

    cs.CL cs.AI cs.LG

    A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference

    Authors: Mokanarangan Thayaparan, Marco Valentino, André Freitas

    Abstract: Integer Linear Programming (ILP) has been proposed as a formalism for encoding precise structural and semantic constraints for Natural Language Inference (NLI). However, traditional ILP frameworks are non-differentiable, posing critical challenges for the integration of continuous language representations based on deep learning. In this paper, we introduce a novel approach, named Diff-Comb Explain… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024 - Camera Ready. arXiv admin note: substantial text overlap with arXiv:2208.03339

  15. arXiv:2404.02622  [pdf, other

    cs.CL

    Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models

    Authors: Julia Rozanova, Marco Valentino, André Freitas

    Abstract: Rigorous evaluation of the causal effects of semantic features on language model predictions can be hard to achieve for natural language reasoning problems. However, this is such a desirable form of analysis from both an interpretability and model evaluation perspective, that it is valuable to investigate specific patterns of reasoning with enough structure and regularity to identify and quantify… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024 - Camera Ready. arXiv admin note: substantial text overlap with arXiv:2305.08572

  16. The non-first-order-factorizable contributions to the three-loop single-mass operator matrix elements $A_{Qg}^{(3)}$ and $ΔA_{Qg}^{(3)}$

    Authors: J. Ablinger, A. Behring, J. Blümlein, A. De Freitas, A. von Manteuffel, C. Schneider, K. Schönwald

    Abstract: The non-first-order-factorizable contributions (The terms 'first-order-factorizable contributions' and 'non-first-order-factorizable contributions' have been introduced and discussed in Refs. \cite{Behring:2023rlq,Ablinger:2023ahe}. They describe the factorization behaviour of the difference- or differential equations for a subset of master integrals of a given problem.) to the unpolarized and pol… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Report number: DO--TH 23/15. DESY 24--027, RISC Report series 24--02, ZU-TH 13/24, CERN-TH-2024-30

  17. arXiv:2402.10767  [pdf, other

    cs.CL cs.AI

    Inference to the Best Explanation in Large Language Models

    Authors: Dhairya Dalal, Marco Valentino, André Freitas, Paul Buitelaar

    Abstract: While Large Language Models (LLMs) have found success in real-world applications, their underlying explanatory process is still poorly understood. This paper proposes IBE-Eval, a framework inspired by philosophical accounts on Inference to the Best Explanation (IBE) to advance the interpretation and evaluation of LLMs' explanations. IBE-Eval estimates the plausibility of natural language explanati… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    ACM Class: I.2.7

  18. arXiv:2402.00745  [pdf, other

    cs.CL

    Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement

    Authors: Xin Quan, Marco Valentino, Louise A. Dennis, André Freitas

    Abstract: An increasing amount of research in Natural Language Inference (NLI) focuses on the application and evaluation of Large Language Models (LLMs) and their reasoning capabilities. Despite their success, however, LLMs are still prone to factual errors and inconsistencies in their explanations, offering limited control and interpretability for inference in complex domains. In this paper, we focus on et… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Camera-ready for EACL 2024

  19. arXiv:2402.00723  [pdf, other

    cs.CL

    Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders

    Authors: Yingji Zhang, Danilo S. Carvalho, Marco Valentino, Ian Pratt-Hartmann, Andre Freitas

    Abstract: Achieving precise semantic control over the latent spaces of Variational AutoEncoders (VAEs) holds significant value for downstream tasks in NLP as the underlying generative mechanisms could be better localised, explained and improved upon. Recent research, however, has struggled to achieve consistent results, primarily due to the inevitable loss of semantic information in the variational bottlene… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  20. arXiv:2401.07564  [pdf, other

    hep-ph hep-ex

    Focus topics for the ECFA study on Higgs / Top / EW factories

    Authors: Jorge de Blas, Patrick Koppenburg, Jenny List, Fabio Maltoni, Juan Alcaraz Maestre, Juliette Alimena, John Alison, Patrizia Azzi, Paolo Azzurri, Emanuele Bagnaschi, Timothy Barklow, Matthew J. Basso, Josh Bendavid, Martin Beneke, Eli Ben-Haim, Mikael Berggren, Marzia Bordone, Ivanka Bozovic, Valentina Cairo, Nuno Filipe Castro, Marina Cobal, Paula Collins, Mogens Dam, Valerio Dao, Matteo Defranchis , et al. (83 additional authors not shown)

    Abstract: In order to stimulate new engagement and trigger some concrete studies in areas where further work would be beneficial towards fully understanding the physics potential of an $e^+e^-$ Higgs / Top / Electroweak factory, we propose to define a set of focus topics. The general reasoning and the proposed topics are described in this document.

    Submitted 18 January, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: v3: fixed spelling of two authors

  21. arXiv:2401.06452  [pdf, other

    cs.LG

    Automated Machine Learning for Positive-Unlabelled Learning

    Authors: Jack D. Saunders, Alex A. Freitas

    Abstract: Positive-Unlabelled (PU) learning is a growing field of machine learning that aims to learn classifiers from data consisting of labelled positive and unlabelled instances, which can be in reality positive or negative, but whose label is unknown. An extensive number of methods have been proposed to address PU learning over the last two decades, so many so that selecting an optimal method for a give… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: 36 pages, 4 figures

  22. arXiv:2312.13208  [pdf, other

    cs.CL

    LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces

    Authors: Yingji Zhang, Danilo S. Carvalho, Ian Pratt-Hartmann, André Freitas

    Abstract: Deep generative neural networks, such as Variational AutoEncoders (VAEs), offer an opportunity to better understand and control language models from the perspective of sentence-level latent spaces. To combine the controllability of VAE latent spaces with the state-of-the-art performance of recent large language models (LLMs), we present in this work LlaMaVAE, which combines expressive encoder and… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  23. arXiv:2311.08579  [pdf, other

    cs.CL

    Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncoders

    Authors: Yingji Zhang, Marco Valentino, Danilo S. Carvalho, Ian Pratt-Hartmann, André Freitas

    Abstract: The injection of syntactic information in Variational AutoEncoders (VAEs) has been shown to result in an overall improvement of performances and generalisation. An effective strategy to achieve such a goal is to separate the encoding of distributional semantic features and syntactic structures into heterogeneous latent spaces via multi-task learning or dual encoder architectures. However, existing… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  24. arXiv:2311.06364  [pdf, other

    cs.CL

    Relation Extraction in underexplored biomedical domains: A diversity-optimised sampling and synthetic data generation approach

    Authors: Maxime Delmas, Magdalena Wysocka, André Freitas

    Abstract: The sparsity of labelled data is an obstacle to the development of Relation Extraction models and the completion of databases in various biomedical areas. While being of high interest in drug-discovery, the natural-products literature, reporting the identification of potential bioactive compounds from organisms, is a concrete example of such an overlooked topic. To mark the start of this new task,… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  25. arXiv:2311.05330  [pdf, other

    stat.AP cs.CY stat.ME

    A Bayesian framework for measuring association and its application to emotional dynamics in Web discourse

    Authors: Henrique S. Xavier, Diogo Cortiz, Mateus Silvestrin, Ana Luísa Freitas, Letícia Yumi Nakao Morello, Fernanda Naomi Pantaleão, Gabriel Gaudencio do Rêgo

    Abstract: This paper introduces a Bayesian framework designed to measure the degree of association between categorical random variables. The method is grounded in the formal definition of variable independence and is implemented using Markov Chain Monte Carlo (MCMC) techniques. Unlike commonly employed techniques in Association Rule Learning, this approach enables a clear and precise estimation of confidenc… ▽ More

    Submitted 11 March, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 9 pages, 2 tables, 4 figures. Accepted for publication at the Beyond Facts workshop of the Web Conference 2024

  26. Detecting Relevant Information in High-Volume Chat Logs: Keyphrase Extraction for Grooming and Drug Dealing Forensic Analysis

    Authors: Jeovane Honório Alves, Horácio A. C. G. Pedroso, Rafael Honorio Venetikides, Joel E. M. Köster, Luiz Rodrigo Grochocki, Cinthia O. A. Freitas, Jean Paul Barddal

    Abstract: The growing use of digital communication platforms has given rise to various criminal activities, such as grooming and drug dealing, which pose significant challenges to law enforcement and forensic experts. This paper presents a supervised keyphrase extraction approach to detect relevant information in high-volume chat logs involving grooming and drug dealing for forensic analysis. The proposed m… ▽ More

    Submitted 14 September, 2023; originally announced November 2023.

    Comments: Accepted for presentation at the 22nd IEEE International Conference on Machine Learning and Applications (ICMLA) 2023

  27. arXiv:2311.01230  [pdf, other

    cs.LG cs.AI cs.SC

    Multi-Operational Mathematical Derivations in Latent Space

    Authors: Marco Valentino, Jordan Meadows, Lan Zhang, André Freitas

    Abstract: This paper investigates the possibility of approximating multiple mathematical operations in latent space for expression derivation. To this end, we introduce different multi-operational representation paradigms, modelling mathematical operations as explicit geometric transformations. By leveraging a symbolic engine, we construct a large-scale dataset comprising 1.7M derivation steps stemming from… ▽ More

    Submitted 3 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024 - Camera Ready

  28. The first-order factorizable contributions to the three-loop massive operator matrix elements $A_{Qg}^{(3)}$ and $ΔA_{Qg}^{(3)}$

    Authors: J. Ablinger, A. Behring, J. Blümlein, A. De Freitas, A. von Manteuffel, C. Schneider, K. Schönwald

    Abstract: The unpolarized and polarized massive operator matrix elements $A_{Qg}^{(3)}$ and $ΔA_{Qg}^{(3)}$ contain first-order factorizable and non-first-order factorizable contributions in the determining difference or differential equations of their master integrals. We compute their first-order factorizable contributions in the single heavy mass case for all contributing Feynman diagrams. Moreover, we p… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 58 pages, 4 Figures

    Report number: DO-TH 23/12, DESY 23-142, CERN-TH-2023-164,RISC Report series 23-12, ZU-TH 60/23, MSUHEP-23-025

  29. arXiv:2310.02752  [pdf, ps, other

    cs.LG

    Fair Feature Selection: A Comparison of Multi-Objective Genetic Algorithms

    Authors: James Brookhouse, Alex Freitas

    Abstract: Machine learning classifiers are widely used to make decisions with a major impact on people's lives (e.g. accepting or denying a loan, hiring decisions, etc). In such applications,the learned classifiers need to be both accurate and fair with respect to different groups of people, with different values of variables such as sex and race. This paper focuses on fair feature selection for classificat… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 10 pages, 1 figure, 3 tables

  30. arXiv:2310.00978  [pdf, other

    math.DS math.PR

    Convergence to decorated Lévy processes in non-Skorohod topologies for dynamical systems

    Authors: Ana Cristina Moreira Freitas, Jorge Milhazes Freitas, Ian Melbourne, Mike Todd

    Abstract: We present a general framework for weak convergence to decorated Lévy processes in enriched spaces of càdlàg functions for vector-valued processes arising in deterministic systems. Applications include uniformly expanding maps and unbounded observables as well as nonuniformly expanding/hyperbolic maps with bounded observables. The latter includes intermittent maps and dispersing billiards with fla… ▽ More

    Submitted 9 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Slight change to hypothesis (5.2)

    MSC Class: 37A50; 60F17; 37D25; 37C83

  31. arXiv:2309.10405  [pdf, other

    math.DG

    Gap results and existence of CMC free boundary hypersurfaces in rotational domains

    Authors: Allan Freitas, Márcio Santos, J. Sindeaux

    Abstract: In this paper, we work with the existence and uniqueness of free boundary constant mean curvature hypersurfaces in rotational domains. These are domains whose boundary is generated by a rotation of a graph. Under some conditions on the function that generates the graph and a gap condition on the umbilicity tensor, we classify the CMC free boundary hypersurfaces as topological disks or annulus. Als… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Comments are welcome!

  32. arXiv:2308.14186  [pdf, other

    cs.CL cs.AI

    Empowering Cross-lingual Abilities of Instruction-tuned Large Language Models by Translation-following demonstrations

    Authors: Leonardo Ranaldi, Giulia Pucci, Andre Freitas

    Abstract: The language ability of Large Language Models (LLMs) is often unbalanced towards English because of the imbalance in the distribution of the pre-training data. This disparity is demanded in further fine-tuning and affecting the cross-lingual abilities of LLMs. In this paper, we propose to empower Instructiontuned LLMs (It-LLMs) in languages other than English by building semantic alignment between… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

  33. Probing dark sector fermions in Higgs precision studies and direct searches

    Authors: Ayres Freitas, Qian Song

    Abstract: In this paper, we investigate the discovery prospect of simplified fermionic dark sectors models through Higgs precision measurements at $e^+e^-$ colliders and direct searches at hadron colliders. These models extend the Standard Model with two Majorana or Dirac fermions that are singlets, doublets or triplets under the weak SU(2) group. For all models, we consider two scenarios where the lightest… ▽ More

    Submitted 26 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 28 pages, 15 figures; update: add 2 references and 1 table

    Journal ref: JHEP 01 (2024), 137

  34. arXiv:2308.03581  [pdf, other

    cs.CL

    Towards Controllable Natural Language Inference through Lexical Inference Types

    Authors: Yingji Zhang, Danilo S. Carvalho, Ian Pratt-Hartmann, Andre Freitas

    Abstract: Explainable natural language inference aims to provide a mechanism to produce explanatory (abductive) inference chains which ground claims to their supporting premises. A recent corpus called EntailmentBank strives to advance this task by explaining the answer to a question using an entailment tree \cite{dalvi2021explaining}. They employ the T5 model to directly generate the tree, which can explai… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  35. arXiv:2308.00425  [pdf, ps, other

    cs.CL cs.AI

    Discourse-Aware Text Simplification: From Complex Sentences to Linked Propositions

    Authors: Christina Niklaus, Matthias Cetto, André Freitas, Siegfried Handschuh

    Abstract: Sentences that present a complex syntax act as a major stumbling block for downstream Natural Language Processing applications whose predictive quality deteriorates with sentence length and complexity. The task of Text Simplification (TS) may remedy this situation. It aims to modify sentences in order to make them easier to process, using a set of rewriting operations, such as reordering, deletion… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  36. arXiv:2307.09998  [pdf, other

    cs.CL math.HO

    Generating Mathematical Derivations with Large Language Models

    Authors: Jordan Meadows, Marco Valentino, Andre Freitas

    Abstract: The derivation of mathematical results in specialised fields, using Large Language Models (LLMs), is an emerging research direction that can help identify models' limitations, and potentially support mathematical discovery. In this paper, we leverage a symbolic engine to generate derivations of equations at scale, and investigate the capabilities of LLMs when deriving goal equations from premises.… ▽ More

    Submitted 8 August, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 10 pages

  37. arXiv:2307.02983  [pdf, other

    hep-ph math-ph

    Analytic results on the massive three-loop form factors: quarkonic contributions

    Authors: Johannes Blümlein, Abilio De Freitas, Peter Marquard, Narayan Rana, Carsten Schneider

    Abstract: The quarkonic contributions to the three-loop heavy-quark form factors for vector, axial-vector, scalar and pseudoscalar currents are described by closed form difference equations for the expansion coefficients in the limit of small virtualities $q^2/m^2$. A part of the contributions can be solved analytically and expressed in terms of harmonic and cyclotomic harmonic polylogarithms and square-roo… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 92 pages, 14 figures

    Report number: DESY 23--012, DO--TH 23/02, RISC Report Series 23-08

  38. arXiv:2306.16550  [pdf, other

    hep-ph hep-ex

    Recent 3-Loop Heavy Flavor Corrections to Deep-Inelastic Scattering

    Authors: J. Ablinger, A. Behring, J. Blümlein, A. De Freitas, A. Goedicke, A. von Manteuffel, C. Schneider, K. Schönwald

    Abstract: We report on recent progress in calculating the three loop QCD corrections of the heavy flavor contributions in deep--inelastic scattering and the massive operator matrix elements of the variable flavor number scheme. Notably we deal with the operator matrix elements $A_{gg,Q}^{(3)}$ and $A_{Qg}^{(3)}$ and technical steps to their calculation. In particular, a new method to obtain the inverse Mell… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Proc RADCOR 2023, 7 pages, 1 figure

    Report number: DESY-23-089, DO-TH 23/09, CERN-TH-2023-122, ZU-TH 29/23, RISC Report Series 23-09, MSUHEP-23-018

  39. arXiv:2306.08563  [pdf, other

    quant-ph

    Microscopic origin of polarization-entangled Stokes-anti-Stokes photons in diamond

    Authors: Tiago A. Freitas, Paula Machado, Lucas V. de Carvalho, Diego Sier, Raul Corrêa, Riichiro Saito, Marcelo F. Santos, Carlos H. Monken, Ado Jorio

    Abstract: Violation of the Clauser-Horne-Shimony-Holt inequality for the polarization of Stokes-anti-Stokes (SaS) photon pairs near a Raman resonance is demonstrated. The pairs are generated by shining a pulsed laser on a diamond sample, where two photons of the laser are converted into a pair of photons of different frequencies. The generated pairs are collected by standard Bell analyzers and shown to be e… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 12 pages, 3 figures

  40. arXiv:2305.19772  [pdf, ps, other

    math.DG math.AP

    A note on Serrin's type problem on Riemannian manifolds

    Authors: Allan Freitas, Alberto Roncoroni, Márcio Santos

    Abstract: In this paper, we deal with Serrin-type problems in Riemannian manifolds. First, we obtain a Heintze-Karcher inequality and a Soap Bubble result, with its respective rigidity, when the ambient space has a Ricci tensor bounded below. After, we approach a Serrin problem in bounded domains of manifolds endowed with a closed conformal vector field. Our primary tool, in this case, is a new Pohozaev ide… ▽ More

    Submitted 6 March, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: Comments are welcome!

  41. arXiv:2305.17819  [pdf, other

    cs.CL cs.AI

    Large Language Models, scientific knowledge and factuality: A systematic analysis in antibiotic discovery

    Authors: Magdalena Wysocka, Oskar Wysocki, Maxime Delmas, Vincent Mutel, Andre Freitas

    Abstract: Inferring over and extracting information from Large Language Models (LLMs) trained on a large corpus of scientific literature can potentially drive a new era in biomedical research, reducing the barriers for accessing existing medical evidence. This work examines the potential of LLMs for dialoguing with biomedical background knowledge, using the context of antibiotic discovery. The systematic an… ▽ More

    Submitted 5 December, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 28 pages, 3 figures

  42. arXiv:2305.16547  [pdf, other

    hep-ph

    Fermionic Electroweak NNLO Corrections to $e^+ e^- \to ZH$ with Polarized Beams and Different Renormalization Schemes

    Authors: Ayres Freitas, Qian Song, Abstract: Recently, the next-to-next-to-leading order (NNLO) electroweak corrections with fermion loops to the Higgsstrahling process were computed. Here we present numerical results for polarized electron/positron beams, as well as for two input parameter schemes known as the $α(0)$ and $G_μ$ schemes. The size of the NNLO corrections strongly depends on the beam polarization, leading to an increase of the… ▽ More

    Submitted 29 May, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 6 pages, 2 figures and 3 tables

  43. arXiv:2305.13494  [pdf, other

    cs.DB

    Deep Clustering for Data Cleaning and Integration

    Authors: Hafiz Tayyab Rauf, Andre Freitas, Norman W. Paton

    Abstract: Deep Learning (DL) techniques now constitute the state-of-the-art for important problems in areas such as text and image processing, and there have been impactful results that deploy DL in several data management tasks. Deep Clustering (DC) has recently emerged as a sub-discipline of DL, in which data representations are learned in tandem with clustering, with a view to automatically identifying t… ▽ More

    Submitted 22 September, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: The following enhancements have been carried out in the updated version of the manuscript: *Evaluated each data integration problem on additional datasets. *Added more DC and SC methods to the evaluation *Discussed algorithmic-specific observations

  44. arXiv:2305.12563  [pdf, other

    cs.CL cs.LG

    A Symbolic Framework for Evaluating Mathematical Reasoning and Generalisation with Transformers

    Authors: Jordan Meadows, Marco Valentino, Damien Teney, Andre Freitas

    Abstract: This paper proposes a methodology for generating and perturbing detailed derivations of equations at scale, aided by a symbolic engine, to evaluate the generalisability of Transformers to out-of-distribution mathematical reasoning problems. Instantiating the framework in the context of sequence classification tasks, we compare the capabilities of GPT-4, GPT-3.5, and a canon of fine-tuned BERT mode… ▽ More

    Submitted 8 April, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: NAACL 2024

  45. arXiv:2305.11391  [pdf, other

    cs.AI cs.LG

    A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation

    Authors: Xiaowei Huang, Wenjie Ruan, Wei Huang, Gaojie **, Yi Dong, Changshun Wu, Saddek Bensalem, Ronghui Mu, Yi Qi, Xingyu Zhao, Kaiwen Cai, Yanghao Zhang, Sihao Wu, Peipei Xu, Dengyu Wu, Andre Freitas, Mustafa A. Mustafa

    Abstract: Large Language Models (LLMs) have exploded a new heatwave of AI for their ability to engage end-users in human-level conversations with detailed and articulate answers across many knowledge domains. In response to their fast adoption in many industrial applications, this survey concerns their safety and trustworthiness. First, we review known vulnerabilities and limitations of the LLMs, categorisi… ▽ More

    Submitted 27 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  46. arXiv:2305.08572  [pdf, other

    cs.CL

    Estimating the Causal Effects of Natural Logic Features in Neural NLI Models

    Authors: Julia Rozanova, Marco Valentino, Andre Freitas

    Abstract: Rigorous evaluation of the causal effects of semantic features on language model predictions can be hard to achieve for natural language reasoning problems. However, this is such a desirable form of analysis from both an interpretability and model evaluation perspective, that it is valuable to zone in on specific patterns of reasoning with enough structure and regularity to be able to identify and… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  47. arXiv:2305.07303  [pdf, other

    cs.CL cs.LG

    Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions

    Authors: Marco Valentino, Danilo S. Carvalho, André Freitas

    Abstract: Natural language definitions possess a recursive, self-explanatory semantic structure that can support representation learning methods able to preserve explicit conceptual relations and constraints in the latent space. This paper presents a multi-relational model that explicitly leverages such a structure to derive word embeddings from definitions. By automatically extracting the relations linking… ▽ More

    Submitted 16 February, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted at the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), camera-ready

  48. arXiv:2305.04675  [pdf, other

    nucl-th cs.LG

    Predicting nuclear masses with product-unit networks

    Authors: Babette Dellen, Uwe Jaekel, Paulo S. A. Freitas, John W. Clark

    Abstract: Accurate estimation of nuclear masses and their prediction beyond the experimentally explored domains of the nuclear landscape are crucial to an understanding of the fundamental origin of nuclear properties and to many applications of nuclear science, most notably in quantifying the $r$-process of stellar nucleosynthesis. Neural networks have been applied with some success to the prediction of nuc… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  49. arXiv:2305.03598  [pdf, other

    cs.CL cs.AI cs.LG

    NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports

    Authors: Maël Jullien, Marco Valentino, Hannah Frost, Paul O'Regan, Donal Landers, André Freitas

    Abstract: How can we interpret and retrieve medical evidence to support clinical decisions? Clinical trial reports (CTR) amassed over the years contain indispensable information for the development of personalized medicine. However, it is practically infeasible to manually inspect over 400,000+ clinical trial reports in order to find the best evidence for experimental treatments. Natural Language Inference… ▽ More

    Submitted 28 October, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Camera-ready, 15 pages

  50. arXiv:2305.02993  [pdf, other

    cs.CL cs.AI cs.LG

    SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data

    Authors: Maël Jullien, Marco Valentino, Hannah Frost, Paul O'Regan, Donal Landers, André Freitas

    Abstract: This paper describes the results of SemEval 2023 task 7 -- Multi-Evidence Natural Language Inference for Clinical Trial Data (NLI4CT) -- consisting of 2 tasks, a Natural Language Inference (NLI) task, and an evidence selection task on clinical trial data. The proposed challenges require multi-hop biomedical and numerical reasoning, which are of significant importance to the development of systems… ▽ More

    Submitted 11 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.