Skip to main content

Showing 51–100 of 347 results for author: Martins, F

.
  1. arXiv:2301.04653  [pdf, other

    q-bio.GN cs.LG

    Optirank: classification for RNA-Seq data with optimal ranking reference genes

    Authors: Paola Malsot, Filipe Martins, Didier Trono, Guillaume Obozinski

    Abstract: Classification algorithms using RNA-Sequencing (RNA-Seq) data as input are used in a variety of biological applications. By nature, RNA-Seq data is subject to uncontrolled fluctuations both within and especially across datasets, which presents a major difficulty for a trained classifier to generalize to an external dataset. Replacing raw gene counts with the rank of gene counts inside an observati… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

  2. arXiv:2301.02491  [pdf, ps, other

    math.CT math.AT math.GT math.QA

    A categorification of Quinn's finite total homotopy TQFT with application to TQFTs and once-extended TQFTs derived from strict omega-groupoids

    Authors: João Faria Martins, Timothy Porter

    Abstract: We first revisit the construction of Quinn's Finite Total Homotopy TQFT, which depends on the choice of a homotopy finite space, $\boldsymbol{B}$. We build our construction directly from homotopy theoretical techniques, and hence, as in Quinn's original notes from 1995, the construction works in all dimensions. Our aim in this is to provide background for giving in detail the construction of a o… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: 235 pages

  3. arXiv:2212.09885  [pdf, other

    cs.CL

    Python Code Generation by Asking Clarification Questions

    Authors: Haau-Sing Li, Mohsen Mesgar, André F. T. Martins, Iryna Gurevych

    Abstract: Code generation from text requires understanding the user's intent from a natural language description and generating an executable code snippet that satisfies this intent. While recent pretrained language models demonstrate remarkable performance for this task, these models fail when the given natural language description is under-specified. In this work, we introduce a novel and more realistic s… ▽ More

    Submitted 26 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 9 pages (excluding Limitations and Ethics Concerns)

  4. arXiv:2212.09631  [pdf, other

    cs.CL cs.LG

    Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation

    Authors: Nuno M. Guerreiro, Pierre Colombo, Pablo Piantanida, André F. T. Martins

    Abstract: Neural machine translation (NMT) has become the de-facto standard in real-world machine translation applications. However, NMT models can unpredictably produce severely pathological translations, known as hallucinations, that seriously undermine user trust. It becomes thus crucial to implement effective preventive strategies to guarantee their proper functioning. In this paper, we address the prob… ▽ More

    Submitted 19 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted at ACL 2023

  5. arXiv:2211.17186  [pdf, other

    cs.LO

    Linear Rank Intersection Types

    Authors: Fábio Reis, Sandra Alves, Mário Florido

    Abstract: Non-idempotent intersection types provide quantitative information about typed programs, and have been used to obtain time and space complexity measures. Intersection type systems characterize termination, so restrictions need to be made in order to make typability decidable. One such restriction consists in using a notion of finite rank for the idempotent intersection types. In this work, we defi… ▽ More

    Submitted 4 May, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

  6. arXiv:2211.14127  [pdf, other

    cond-mat.mes-hall quant-ph

    A quantum dot-based frequency multiplier

    Authors: G. A. Oakes, L. Peri, L. Cochrane, F. Martins, L. Hutin, B. Bertrand, M. Vinet, A. Gomez Saiz, C. J. B. Ford, C. G. Smith, M. F. Gonzalez-Zalba

    Abstract: Silicon offers the enticing opportunity to integrate hybrid quantum-classical computing systems on a single platform. For qubit control and readout, high-frequency signals are required. Therefore, devices that can facilitate its generation are needed. Here, we present a quantum dot-based radiofrequency multiplier operated at cryogenic temperatures. The device is based on the non-linear capacitance… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 17 pages, 16 figures

  7. arXiv:2210.15553  [pdf, other

    cs.CL cs.AI cs.LG

    Improving abstractive summarization with energy-based re-ranking

    Authors: Diogo Pernes, Afonso Mendes, André F. T. Martins

    Abstract: Current abstractive summarization systems present important weaknesses which prevent their deployment in real-world applications, such as the omission of relevant information and the generation of factual inconsistencies (also known as hallucinations). At the same time, automatic evaluation metrics such as CTC scores have been recently proposed that exhibit a higher correlation with human judgment… ▽ More

    Submitted 7 November, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 2nd Workshop on Natural Language Generation, Evaluation, and Metrics (GEM) at EMNLP 2022

  8. arXiv:2209.06243  [pdf, other

    cs.CL cs.LG

    CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task

    Authors: Ricardo Rei, Marcos Treviso, Nuno M. Guerreiro, Chrysoula Zerva, Ana C. Farinha, Christine Maroti, José G. C. de Souza, Taisiya Glushkova, Duarte M. Alves, Alon Lavie, Luisa Coheur, André F. T. Martins

    Abstract: We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equip** it w… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: WMT 2022 Quality Estimation shared task

  9. arXiv:2209.03171  [pdf, other

    physics.soc-ph cs.LG cs.SI stat.ML

    Machine Learning Partners in Criminal Networks

    Authors: Diego D. Lopes, Bruno R. da Cunha, Alvaro F. Martins, Sebastian Goncalves, Ervin K. Lenzi, Quentin S. Hanley, Matjaz Perc, Haroldo V. Ribeiro

    Abstract: Recent research has shown that criminal networks have complex organizational structures, but whether this can be used to predict static and dynamic properties of criminal networks remains little explored. Here, by combining graph representation learning and machine learning methods, we show that structural properties of political corruption, police intelligence, and money laundering networks can b… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 10 pages, 4 figures, supplementary information; accepted for publication in Scientific Reports

    Journal ref: Sci. Rep. 12, 15746 (2022)

  10. arXiv:2209.00099  [pdf, other

    cs.CL

    Efficient Methods for Natural Language Processing: A Survey

    Authors: Marcos Treviso, Ji-Ung Lee, Tianchu Ji, Betty van Aken, Qingqing Cao, Manuel R. Ciosici, Michael Hassid, Kenneth Heafield, Sara Hooker, Colin Raffel, Pedro H. Martins, André F. T. Martins, Jessica Zosa Forde, Peter Milder, Edwin Simpson, Noam Slonim, Jesse Dodge, Emma Strubell, Niranjan Balasubramanian, Leon Derczynski, Iryna Gurevych, Roy Schwartz

    Abstract: Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources include data, time, storage, or energy, all of which are naturally limited and unevenly distributed. This motivates research into efficient methods that require few… ▽ More

    Submitted 24 March, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

    Comments: Accepted at TACL, pre publication version

  11. arXiv:2208.05309  [pdf, other

    cs.CL cs.LG

    Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation

    Authors: Nuno M. Guerreiro, Elena Voita, André F. T. Martins

    Abstract: Although the problem of hallucinations in neural machine translation (NMT) has received some attention, research on this highly pathological phenomenon lacks solid ground. Previous work has been limited in several ways: it often resorts to artificial settings where the problem is amplified, it disregards some (common) types of hallucinations, and it does not validate adequacy of detection heuristi… ▽ More

    Submitted 5 March, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: Accepted at EACL23 (main)

  12. arXiv:2206.11487  [pdf, ps, other

    math.DG

    Boundedness of geometric invariants near a singularity which is a suspension of a singular curve

    Authors: Luciana F. Martins, Kentaro Saji, Samuel P. dos Santos, Keisuke Teramoto

    Abstract: Near a singular point of a surface or a curve, geometric invariants diverge in general, and the orders of diverge, in particular the boundedness about these invariants represent geometry of the surface and the curve. In this paper, we study boundedness and orders of several geometric invariants near a singular point of a surface which is a suspension of a singular curve in the plane and those of c… ▽ More

    Submitted 21 December, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: 27 pages, 15 figures

    MSC Class: 57R45

  13. arXiv:2205.12230  [pdf, other

    cs.CL

    Chunk-based Nearest Neighbor Machine Translation

    Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

    Abstract: Semi-parametric models, which augment generation with retrieval, have led to impressive results in language modeling and machine translation, due to their ability to retrieve fine-grained information from a datastore of examples. One of the most prominent approaches, $k$NN-MT, exhibits strong domain adaptation capabilities by retrieving tokens from domain-specific datastores \citep{khandelwal2020n… ▽ More

    Submitted 7 November, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  14. arXiv:2205.00978  [pdf, other

    cs.CL

    Quality-Aware Decoding for Neural Machine Translation

    Authors: Patrick Fernandes, António Farinhas, Ricardo Rei, José G. C. de Souza, Perez Ogayo, Graham Neubig, André F. T. Martins

    Abstract: Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to the model (MAP decoding), approximated with beam search. In this paper, we bring together these two lines of research and propose quality-aware decoding for NMT… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: NAACL2022

  15. arXiv:2204.12608  [pdf, other

    cs.CL

    Efficient Machine Translation Domain Adaptation

    Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

    Abstract: Machine translation models struggle when translating out-of-domain text, which makes domain adaptation a topic of critical importance. However, most domain adaptation methods focus on fine-tuning or training the entire or part of the model on every new domain, which can be costly. On the other hand, semi-parametric models have been shown to successfully perform domain adaptation by retrieving exam… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Workshop Semiparametric Methods in NLP: Decoupling Logic from Knowledge

  16. arXiv:2204.10810  [pdf, other

    cs.LG cs.CL cs.CV

    Learning to Scaffold: Optimizing Model Explanations for Teaching

    Authors: Patrick Fernandes, Marcos Treviso, Danish Pruthi, André F. T. Martins, Graham Neubig

    Abstract: Modern machine learning models are opaque, and as a result there is a burgeoning academic subfield on methods that explain these models' behavior. However, what is the precise goal of providing such explanations, and how can we demonstrate that explanations achieve this goal? Some research argues that explanations should help teach a student (either human or machine) to simulate the model being ex… ▽ More

    Submitted 29 November, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: 10 pages. NeurIPS 2022

  17. arXiv:2204.08606  [pdf, other

    math.NT math.CO

    Sharp inequalities for discrete and continuous multi-tiling, using the Bombieri-Siegel approach

    Authors: Michel Faleiros Martins, Sinai Robins

    Abstract: Given a finite subset $F$ of integer points in $\mathbb Z^d$, it is of interest to seek conditions on $F$ that allow it to multi-tile $\mathbb Z^d$ by translations. In addition to the continuous multi-tiling results presented here, we also give analogous discrete applications to arithmetic combinatorics. Namely we give a discretized version of the Bombieri-Siegel formula, namely a finite sum of di… ▽ More

    Submitted 15 December, 2023; v1 submitted 18 April, 2022; originally announced April 2022.

    Comments: 32 pages, 9 figures

    MSC Class: 52C07; 52C22; 11H06; 11P21

  18. arXiv:2204.06546  [pdf, other

    cs.CL

    Disentangling Uncertainty in Machine Translation Evaluation

    Authors: Chrysoula Zerva, Taisiya Glushkova, Ricardo Rei, André F. T. Martins

    Abstract: Trainable evaluation metrics for machine translation (MT) exhibit strong correlation with human judgements, but they are often hard to interpret and might produce unreliable scores under noisy or out-of-domain data. Recent work has attempted to mitigate this with simple uncertainty quantification techniques (Monte Carlo dropout and deep ensembles), however these techniques (as we show) are limited… ▽ More

    Submitted 29 November, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: accepted at EMNLP 2022

  19. arXiv:2204.05097  [pdf, other

    physics.soc-ph cond-mat.stat-mech

    Universality of political corruption networks

    Authors: Alvaro F. Martins, Bruno R. da Cunha, Quentin S. Hanley, Sebastian Goncalves, Matjaz Perc, Haroldo V. Ribeiro

    Abstract: Corruption crimes demand highly coordinated actions among criminal agents to succeed. But research dedicated to corruption networks is still in its infancy and indeed little is known about the properties of these networks. Here we present a comprehensive investigation of corruption networks related to political scandals in Spain and Brazil over nearly three decades. We show that corruption network… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 11 pages, 6 figures, supplementary information; accepted for publication in Scientific Reports

    Journal ref: Sci. Rep. 12, 6858 (2022)

  20. arXiv:2203.15635  [pdf, other

    q-bio.MN cs.CE cs.IT cs.LG q-bio.GN

    BASiNETEntropy: an alignment-free method for classification of biological sequences through complex networks and entropy maximization

    Authors: Murilo Montanini Breve, Matheus Henrique Pimenta-Zanon, Fabrício Martins Lopes

    Abstract: The discovery of nucleic acids and the structure of DNA have brought considerable advances in the understanding of life. The development of next-generation sequencing technologies has led to a large-scale generation of data, for which computational methods have become essential for analysis and knowledge discovery. In particular, RNAs have received much attention because of the diversity of their… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  21. arXiv:2203.06608  [pdf, other

    cond-mat.mes-hall quant-ph

    Fast high-fidelity single-shot readout of spins in silicon using a single-electron box

    Authors: G. A. Oakes, V. N. Ciriano-Tejel, D. Wise, M. A. Fogarty, T. Lundberg, C. Lainé, S. Schaal, F. Martins, D. J. Ibberson, L. Hutin, B. Bertrand, N. Stelmashenko, J. A. W. Robinson, L. Ibberson, A. Hashim, I. Siddiqi, A. Lee, M. Vinet, C. G. Smith, J. J. L. Morton, M. F. Gonzalez-Zalba

    Abstract: Three key metrics for readout systems in quantum processors are measurement speed, fidelity and footprint. Fast high-fidelity readout enables mid-circuit measurements, a necessary feature for many dynamic algorithms and quantum error correction, while a small footprint facilitates the design of scalable, highly-connected architectures with the associated increase in computing performance. Here, we… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: Main: 9 pages, 4 figures, 1 table. Supplementary: 33 pages, 18 figures

  22. arXiv:2203.02336  [pdf, other

    cs.LG stat.ME

    Differentiable Causal Discovery Under Latent Interventions

    Authors: Gonçalo R. A. Faria, André F. T. Martins, Mário A. T. Figueiredo

    Abstract: Recent work has shown promising results in causal discovery by leveraging interventional data with gradient-based methods, even when the intervened variables are unknown. However, previous work assumes that the correspondence between samples and interventions is known, which is often unrealistic. We envision a scenario with an extensive dataset sampled from multiple intervention distributions and… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Journal ref: Proceedings of the First Conference on Causal Learning and Reasoning, PMLR 177:253-274, 2022

  23. arXiv:2202.13703  [pdf, ps, other

    astro-ph.SR astro-ph.GA

    Spectroscopic evolution of very massive stars at Z = 1/2.5 Zsun

    Authors: F. Martins, A. Palacios

    Abstract: Stars with masses in excess of 100 Msun are observed in the Local Universe, but they remain rare objects. Because of the shape of the mass function, they are expected to be present only in the most massive and youngest clusters. They may thus be formed in number in highly star-forming galaxies. Very massive stars (VMSs) experience strong stellar winds that are stronger than those of their less mas… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: 13 pages, 11 figures + appendix. Accepted in Astronomy & Astrophysics

    Journal ref: A&A 659, A163 (2022)

  24. arXiv:2202.08662  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.IM

    The Gaia-ESO Survey: The analysis of the hot-star spectra

    Authors: R. Blomme, S. Daflon, M. Gebran, A. Herrero, A. Lobel, L. Mahy, F. Martins, T. Morel, S. R. Berlanas, A. Blazere, Y. Fremat, E. Gosset, J. Maiz Apellaniz, W. Santos, T. Semaan, S. Simon-Diaz, D. Volpi, G. Holgado, F. Jimenez-Esteban, M. F. Nieva, N. Przybilla, G. Gilmore, S. Randich, I. Negueruela, T. Prusti , et al. (22 additional authors not shown)

    Abstract: The Gaia-ESO Survey (GES) is a large public spectroscopic survey that has collected, over a period of 6 years, spectra of ~ 10^5 stars. This survey provides not only the reduced spectra, but also the stellar parameters and abundances resulting from the analysis of the spectra. The GES dataflow is organised in 19 working groups. Working group 13 (WG13) is responsible for the spectral analysis of th… ▽ More

    Submitted 1 March, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: 26 pages, 14 figures, accepted for publication in Astronomy & Astrophysics; language-edited version; two appendices merged

    Journal ref: A&A 661, A120 (2022)

  25. arXiv:2202.03760  [pdf, other

    cs.LG cs.CL

    Modeling Structure with Undirected Neural Networks

    Authors: Tsvetomila Mihaylova, Vlad Niculae, André F. T. Martins

    Abstract: Neural networks are powerful function estimators, leading to their status as a paradigm of choice for modeling structured data. However, unlike other structured representations that emphasize the modularity of the problem -- e.g., factor graphs -- neural networks are usually monolithic map**s from inputs to outputs, with a fixed computation order. This limitation prevents them from capturing dif… ▽ More

    Submitted 17 June, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  26. arXiv:2112.14532  [pdf, other

    physics.atom-ph

    Multipole-moment effects in ion-molecule reactions at low temperatures: part I -- Ion-dipole enhancement of the rate coefficients of the He$^+$ + NH$_3$ and He$^+$ + ND$_3$ reactions at collision energies near $0$ K

    Authors: Valentina Zhelyazkova, Fernanda B. V. Martins, Josef A. Agner, Hansjürg Schmutz, Frédéric Merkt

    Abstract: The energy dependence of the rates of the reactions between He$^+$ and ammonia (NY$_3$, Y= {H,D}), forming NY$_2^+$, Y and He as well as NY$^+$, Y$_2$ and He, and the corresponding product branching ratios have been measured at low collision energies between 0 and $k_{\mathrm{B}}\cdot40$ K using a recently developed merged-beam technique [Allmendinger {\it et al.}, ChemPhysChem {\bf 17}, 3596 (201… ▽ More

    Submitted 29 December, 2021; originally announced December 2021.

    Comments: 18 pages, 8 figures

    Journal ref: Phys. Chem. Chem. Phys., 2021, 23, 21606-21622

  27. arXiv:2112.12056  [pdf, other

    physics.atom-ph quant-ph

    Ion-molecule reactions below 1~K: Observation of a strong enhancement of the reaction rate of the ion-dipole reaction He$^+$+ CH$_3$F

    Authors: Valentina Zhelyazkova, Fernanda B. V. Martins, Josef A. Agner, Hansjürg Schmutz, Frédéric Merkt

    Abstract: The reaction between He$^+$ and CH$_3$F forming predominantly CH$_2^+$ and CHF$^+$ has been studied at collision energies $E_{\rm coll}$ between 0 and $k_{\rm B}\cdot 10$~K in a merged-beam apparatus. To avoid heating of the ions by stray electric fields, the reaction was observed within the orbit of a highly excited Rydberg electron. Supersonic beams of CH$_3$F and He($n$) Rydberg atoms with prin… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Journal ref: Physical Review Letters, 2020

  28. Dynamics of hole singlet triplet qubits with large g-factor differences

    Authors: Daniel Jirovec, Philipp M. Mutter, Andrea Hofmann, Josip Kukucka, Alessandro Crippa, Frederico Martins, Andrea Ballabio, Daniel Chrastina, Giovanni Isella, Guido Burkard, Georgios Katsaros

    Abstract: The spin-orbit interaction is the key element for electrically tunable spin qubits. Here we probe the effect of cubic Rashba spin-orbit interaction on mixing of the spin states by investigating singlet-triplet oscillations in a planar Ge hole double quantum dot. By varying the magnetic field direction we find an intriguing transformation of the funnel into a butterfly-shaped pattern. Landau-Zener… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

  29. Revisiting Coulomb diamond signatures in quantum Hall interferometers

    Authors: N. Moreau, S. Faniel, F. Martins, L. Desplanque, X. Wallart, S. Melinte, V. Bayot, B. Hackens

    Abstract: Coulomb diamonds are the archetypal signatures of Coulomb blockade, a well-known charging effect mainly observed in nanometer-sized "electronic islands" tunnel-coupled with charge reservoirs. Here, we identify apparent Coulomb diamond features in the scanning gate spectroscopy of a quantum point contact carved out of a semiconductor heterostructure, in the quantum Hall regime. Varying the scanning… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: 8 pages, 4 figures

  30. arXiv:2110.04654  [pdf, other

    eess.AS cs.CV cs.LG cs.SD

    Complex Network-Based Approach for Feature Extraction and Classification of Musical Genres

    Authors: Matheus Henrique Pimenta-Zanon, Glaucia Maria Bressan, Fabrício Martins Lopes

    Abstract: Musical genre's classification has been a relevant research topic. The association between music and genres is fundamental for the media industry, which manages musical recommendation systems, and for music streaming services, which may appear classified by genres. In this context, this work presents a feature extraction method for the automatic classification of musical genres, based on complex n… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

  31. arXiv:2109.12188  [pdf, other

    cs.CL

    Predicting Attention Sparsity in Transformers

    Authors: Marcos Treviso, António Góis, Patrick Fernandes, Erick Fonseca, André F. T. Martins

    Abstract: Transformers' quadratic complexity with respect to the input sequence length has motivated a body of work on efficient sparse approximations to softmax. An alternative path, used by entmax transformers, consists of having built-in exact sparse attention; however this approach still requires quadratic computation. In this paper, we propose Sparsefinder, a simple model trained to identify the sparsi… ▽ More

    Submitted 21 April, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: SPNLP22

  32. arXiv:2109.07446  [pdf, other

    cs.CL cs.LG

    When Does Translation Require Context? A Data-driven, Multilingual Exploration

    Authors: Patrick Fernandes, Kayo Yin, Emmy Liu, André F. T. Martins, Graham Neubig

    Abstract: Although proper handling of discourse significantly contributes to the quality of machine translation (MT), these improvements are not adequately measured in common translation quality metrics. Recent works in context-aware MT attempt to target a small set of discourse phenomena during evaluation, however not in a fully systematic way. In this paper, we develop the Multilingual Discourse-Aware (Mu… ▽ More

    Submitted 27 June, 2023; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted at ACL2023

  33. Uncertainty-Aware Machine Translation Evaluation

    Authors: Taisiya Glushkova, Chrysoula Zerva, Ricardo Rei, André F. T. Martins

    Abstract: Several neural-based metrics have been recently proposed to evaluate machine translation quality. However, all of them resort to point estimates, which provide limited information at segment level. This is made worse as they are trained on noisy, biased and scarce human judgements, often resulting in unreliable quality predictions. In this paper, we introduce uncertainty-aware MT evaluation and an… ▽ More

    Submitted 24 March, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Findings of EMNLP 2021 v2: corrected typos (esp. Tab 5)

  34. arXiv:2109.04552  [pdf, other

    cs.CL cs.LG

    SPECTRA: Sparse Structured Text Rationalization

    Authors: Nuno Miguel Guerreiro, André F. T. Martins

    Abstract: Selective rationalization aims to produce decisions along with rationales (e.g., text highlights or word alignments between two sentences). Commonly, rationales are modeled as stochastic binary masks, requiring sampling-based gradient estimators, which complicates training and requires careful hyperparameter tuning. Sparse attention mechanisms are a deterministic alternative, but they lack a way t… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021 (main conference)

  35. arXiv:2109.03625  [pdf, other

    q-bio.GN cs.CE

    Computational methods for differentially expressed gene analysis from RNA-Seq: an overview

    Authors: Juliana Costa-Silva, Douglas S. Domingues, David Menotti, Mariangela Hungria, Fabricio M Lopes

    Abstract: The analysis of differential gene expression from RNA-Seq data has become a standard for several research areas mainly involving bioinformatics. The steps for the computational analysis of these data include many data types and file formats, and a wide variety of computational tools that can be applied alone or together as pipelines. This paper presents a review of differential expression analysis… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

  36. arXiv:2109.00301  [pdf, other

    cs.CL

    $\infty$-former: Infinite Memory Transformer

    Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

    Abstract: Transformers are unable to model long-term memories effectively, since the amount of computation they need to perform grows with the context length. While variations of efficient transformers have been proposed, they all have a finite memory capacity and are forced to drop old information. In this paper, we propose the $\infty$-former, which extends the vanilla transformer with an unbounded long-t… ▽ More

    Submitted 25 March, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: ACL 2022

  37. arXiv:2108.02658  [pdf, other

    cs.LG

    Sparse Communication via Mixed Distributions

    Authors: António Farinhas, Wilker Aziz, Vlad Niculae, André F. T. Martins

    Abstract: Neural networks and other machine learning models compute continuous representations, while humans communicate mostly through discrete symbols. Reconciling these two forms of communication is desirable for generating human-readable interpretations or learning discrete latent variable models, while maintaining end-to-end differentiability. Some existing approaches (such as the Gumbel-Softmax transf… ▽ More

    Submitted 11 February, 2022; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: Accepted for oral presentation at ICLR 2022

  38. arXiv:2108.01988  [pdf, other

    cs.LG cs.AI stat.ML

    Sparse Continuous Distributions and Fenchel-Young Losses

    Authors: André F. T. Martins, Marcos Treviso, António Farinhas, Pedro M. Q. Aguiar, Mário A. T. Figueiredo, Mathieu Blondel, Vlad Niculae

    Abstract: Exponential families are widely used in machine learning, including many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, recent work on sparse alternatives to softmax (e.g., sparsemax, $α$-entmax, and fused… ▽ More

    Submitted 4 August, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: JMLR 2022 camera ready version. arXiv admin note: text overlap with arXiv:2006.07214

  39. arXiv:2106.12895  [pdf, other

    cs.LG cs.AI cs.RO

    rSoccer: A Framework for Studying Reinforcement Learning in Small and Very Small Size Robot Soccer

    Authors: Felipe B. Martins, Mateus G. Machado, Hansenclever F. Bassani, Pedro H. M. Braga, Edna S. Barros

    Abstract: Reinforcement learning is an active research area with a vast number of applications in robotics, and the RoboCup competition is an interesting environment for studying and evaluating reinforcement learning methods. A known difficulty in applying reinforcement learning to robotics is the high number of experience samples required, being the use of simulated environments for training the agents fol… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

  40. arXiv:2105.06977  [pdf, other

    cs.CL cs.AI cs.LG

    Do Context-Aware Translation Models Pay the Right Attention?

    Authors: Kayo Yin, Patrick Fernandes, Danish Pruthi, Aditi Chaudhary, André F. T. Martins, Graham Neubig

    Abstract: Context-aware machine translation models are designed to leverage contextual information, but often fail to do so. As a result, they inaccurately disambiguate pronouns and polysemous words that require context for resolution. In this paper, we ask several questions: What contexts do human translators use to resolve ambiguous words? Are models paying large amounts of attention to the same context?… ▽ More

    Submitted 7 August, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: Accepted to ACL 2021

  41. arXiv:2105.03482  [pdf, other

    cs.CL

    Measuring and Increasing Context Usage in Context-Aware Machine Translation

    Authors: Patrick Fernandes, Kayo Yin, Graham Neubig, André F. T. Martins

    Abstract: Recent work in neural machine translation has demonstrated both the necessity and feasibility of using inter-sentential context -- context from sentences other than those currently being translated. However, while many current methods present model architectures that theoretically can use this extra context, it is often not clear how much they do actually utilize it at translation time. In this pa… ▽ More

    Submitted 2 June, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: ACL 2021

  42. arXiv:2104.13988  [pdf, ps, other

    astro-ph.SR astro-ph.GA

    On the maximum helium content of multiple populations in the globular cluster NGC6752

    Authors: Fabrice Martins, William Chantereau, Corinne Charbonnel

    Abstract: Multiple populations in globular clusters are usually explained by the formation of stars out of material with a chemical composition that is polluted to different degrees by the ejecta of short-lived, massive stars of various type. Among other things, these polluters differ by the amount of helium they spread in the surrounding medium. In this study we investigate whether the present-day photomet… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: 17 pages, 21 figures + appendix. Accepted in Astronomy & Astrophysics

    Journal ref: A&A 650, A162 (2021)

  43. arXiv:2104.03046  [pdf, other

    cs.CV cs.LG

    Multimodal Continuous Visual Attention Mechanisms

    Authors: António Farinhas, André F. T. Martins, Pedro M. Q. Aguiar

    Abstract: Visual attention mechanisms are a key component of neural network models for computer vision. By focusing on a discrete set of objects or image regions, these mechanisms identify the most relevant features and use them to build more powerful representations. Recently, continuous-domain alternatives to discrete attention models have been proposed, which exploit the continuity of images. These appro… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

  44. arXiv:2104.02766  [pdf, other

    math-ph hep-th math.QA

    Exactly solvable models for 2+1D topological phases derived from crossed modules of semisimple Hopf algebras

    Authors: Vincent Koppen, João Faria Martins, Paul Purdon Martin

    Abstract: We define an exactly solvable model for 2+1D topological phases of matter on a triangulated surface derived from a crossed module of semisimple finite-dimensional Hopf algebras, the `Hopf-algebraic higher Kitaev model'. This model generalizes both the Kitaev quantum double model for a semisimple Hopf algebra and the full higher Kitaev model derived from a 2-group, and can hence be interpreted as a… ▽ More

    Submitted 22 July, 2023; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: v2: minor clarifications and corrections; 71 pages, 3 figures

  45. arXiv:2104.00755  [pdf, other

    cs.LG

    Reconciling the Discrete-Continuous Divide: Towards a Mathematical Theory of Sparse Communication

    Authors: André F. T. Martins

    Abstract: Neural networks and other machine learning models compute continuous representations, while humans communicate with discrete symbols. Reconciling these two forms of communication is desirable to generate human-readable interpretations or to learn discrete latent variable models, while maintaining end-to-end differentiability. Some existing approaches (such as the Gumbel-softmax transformation) bui… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

  46. The effects of surface fossil magnetic fields on massive star evolution: III. The case of $τ$ Sco

    Authors: Z. Keszthelyi, G. Meynet, F. Martins, A. de Koter, A. David-Uraz

    Abstract: $τ… ▽ More

    Submitted 23 April, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted for publication in MNRAS. A full reproduction package is shared on zenodo in accordance with the Research Data Management plan of the Anton Pannekoek Institute for Astronomy at the University of Amsterdam: https://doi.org/10.5281/zenodo.4633408

  47. arXiv:2103.10377  [pdf, other

    math-ph cond-mat.str-el hep-th math.CT math.GT

    Motion groupoids and map** class groupoids

    Authors: Fiona Torzewska, João Faria Martins, Paul Purdon Martin

    Abstract: Here $\underline{M}$ denotes a pair $(M,A)$ of a manifold and a subset (e.g. $A=\partial M$ or $A=\emptyset$). We construct for each $\underline{M}$ its motion groupoid $\mathrm{Mot}_{\underline{M}}$, whose object set is the power set $ {\mathcal P} M$ of $M$, and whose morphisms are certain equivalence classes of continuous flows of the `ambient space' $M$, that fix $A$, acting on… ▽ More

    Submitted 6 September, 2023; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: Version accepted for publication in CMP. 75 pages, 14 figures

    MSC Class: 20F36 Braid groups; Artin groups; 20L05 Groupoids (i.e. small categories in which all morphisms are isomorphisms)

    Journal ref: Comm. Math. Phys. 402, 1621-1705 (2023)

  48. arXiv:2103.10291  [pdf, other

    cs.CL

    Smoothing and Shrinking the Sparse Seq2Seq Search Space

    Authors: Ben Peters, André F. T. Martins

    Abstract: Current sequence-to-sequence models are trained to minimize cross-entropy and use softmax to compute the locally normalized probabilities over target sequences. While this setup has led to strong results in a variety of tasks, one unsatisfying aspect is its length bias: models give high scores to short, inadequate hypotheses and often make the empty string the argmax -- the so-called cat got your… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: NAACL 2021

  49. arXiv:2103.09884  [pdf, other

    physics.flu-dyn

    A Voronoi-tessellation-based approach for detection of coherent structures in sparsely-seeded flows

    Authors: F. A. C. Martins, D. E. Rival

    Abstract: A novel algorithm to detect coherent structures with sparse Lagrangian particle tracking data, using Voronoi tessellation and techniques from spectral graph theory, is tested. Neighbouring tracer particles are naturally identified through the Voronoi tessellation of the tracers' distribution. The method examines the \textit{neighbouring time} of tracer trajectories, defined as the total flow time… ▽ More

    Submitted 28 July, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

  50. arXiv:2102.04088  [pdf, other

    physics.ins-det hep-ex hep-ph

    Study of energy response and resolution of the ATLAS Tile Calorimeter to hadrons of energies from 16 to 30 GeV

    Authors: Jalal Abdallah, Stylianos Angelidakis, Giorgi Arabidze, Nikolay Atanov, Johannes Bernhard, Romeo Bonnefoy, Jonathan Bossio, Ryan Bouabid, Fernando Carrio, Tomas Davidek, Michal Dubovsky, Luca Fiorini, Francisco Brandan Garcia Aparisi, Tancredi Carli, Alexander Gerbershagen, Hazal Goksu, Haleh Hadavand, Siarhei Harkusha, Dingane Hlaluku, Michael James Hibbard, Kevin Hildebrand, Juansher Jejelava, Andrey Kamenshchikov, Stergios Kazakos, Tomas Kello , et al. (46 additional authors not shown)

    Abstract: Three spare modules of the ATLAS Tile Calorimeter were exposed to test beams from the Super Proton Synchrotron accelerator at CERN in 2017. The measurements of the energy response and resolution of the detector to positive pions and kaons and protons with energy in the range 16 to 30 GeV are reported. The results have uncertainties of few percent. They were compared to the predictions of the Geant… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.