Skip to main content

Showing 1–20 of 20 results for author: Finger,, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17569  [pdf, other

    cs.LG cs.AI cs.SD eess.AS

    Discriminant audio properties in deep learning based respiratory insufficiency detection in Brazilian Portuguese

    Authors: Marcelo Matheus Gauy, Larissa Cristina Berti, Arnaldo Cândido Jr, Augusto Camargo Neto, Alfredo Goldman, Anna Sara Shafferman Levin, Marcus Martins, Beatriz Raposo de Medeiros, Marcelo Queiroz, Ester Cerdeira Sabino, Flaviane Romani Fernandes Svartman, Marcelo Finger

    Abstract: This work investigates Artificial Intelligence (AI) systems that detect respiratory insufficiency (RI) by analyzing speech audios, thus treating speech as a RI biomarker. Previous works collected RI data (P1) from COVID-19 patients during the first phase of the pandemic and trained modern AI models, such as CNNs and Transformers, which achieved $96.5\%$ accuracy, showing the feasibility of RI dete… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 5 pages, 2 figures, 1 table. Published in Artificial Intelligence in Medicine (AIME) 2023

    Journal ref: Artificial Intellingence in Medicine Proceedings 2023, page 271-275

  2. arXiv:2402.19204  [pdf, other

    cs.CL

    PeLLE: Encoder-based language models for Brazilian Portuguese based on open data

    Authors: Guilherme Lamartine de Mello, Marcelo Finger, and Felipe Serras, Miguel de Mello Carpi, Marcos Menon Jose, Pedro Henrique Domingues, Paulo Cavalim

    Abstract: In this paper we present PeLLE, a family of large language models based on the RoBERTa architecture, for Brazilian Portuguese, trained on curated, open data from the Carolina corpus. Aiming at reproducible results, we describe details of the pretraining of the models. We also evaluate PeLLE models against a set of existing multilingual and PT-BR refined pretrained Transformer-based LLM encoders, c… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 15 pages

    ACM Class: I.2.7

  3. arXiv:2312.09265  [pdf, ps, other

    cs.SD cs.CL cs.LG eess.AS

    Acoustic models of Brazilian Portuguese Speech based on Neural Transformers

    Authors: Marcelo Matheus Gauy, Marcelo Finger

    Abstract: An acoustic model, trained on a significant amount of unlabeled data, consists of a self-supervised learned speech representation useful for solving downstream tasks, perhaps after a fine-tuning of the model in the respective downstream task. In this work, we build an acoustic model of Brazilian Portuguese Speech through a Transformer neural network. This model was pretrained on more than $800$ ho… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Under review at Journal of Brazilian Computer Society

  4. arXiv:2307.08593  [pdf, other

    physics.acc-ph cs.LG hep-ex nucl-ex nucl-th

    Artificial Intelligence for the Electron Ion Collider (AI4EIC)

    Authors: C. Allaire, R. Ammendola, E. -C. Aschenauer, M. Balandat, M. Battaglieri, J. Bernauer, M. Bondì, N. Branson, T. Britton, A. Butter, I. Chahrour, P. Chatagnon, E. Cisbani, E. W. Cline, S. Dash, C. Dean, W. Deconinck, A. Deshpande, M. Diefenthaler, R. Ent, C. Fanelli, M. Finger, M. Finger, Jr., E. Fol, S. Furletov , et al. (70 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC), a state-of-the-art facility for studying the strong force, is expected to begin commissioning its first experiments in 2028. This is an opportune time for artificial intelligence (AI) to be included from the start at this facility and in all phases that lead up to the experiments. The second annual workshop organized by the AI4EIC working group, which recently took… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 27 pages, 11 figures, AI4EIC workshop, tutorials and hackathon

  5. arXiv:2303.16098  [pdf, other

    cs.CL cs.AI

    Carolina: a General Corpus of Contemporary Brazilian Portuguese with Provenance, Typology and Versioning Information

    Authors: Maria Clara Ramos Morales Crespo, Maria Lina de Souza Jeannine Rocha, Mariana Lourenço Sturzeneker, Felipe Ribas Serras, Guilherme Lamartine de Mello, Aline Silva Costa, Mayara Feliciano Palma, Renata Morais Mesquita, Raquel de Paula Guets, Mariana Marques da Silva, Marcelo Finger, Maria Clara Paixão de Sousa, Cristiane Namiuti, Vanessa Martins do Monte

    Abstract: This paper presents the first publicly available version of the Carolina Corpus and discusses its future directions. Carolina is a large open corpus of Brazilian Portuguese texts under construction using web-as-corpus methodology enhanced with provenance, typology, versioning, and text integrality. The corpus aims at being used both as a reliable source for research in Linguistics and as an import… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 14 pages, 3 figures, 1 appendix

    MSC Class: 68T50 ACM Class: I.2.7

  6. arXiv:2211.14372  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Interpretability Analysis of Deep Models for COVID-19 Detection

    Authors: Daniel Peixoto Pinto da Silva, Edresson Casanova, Lucas Rafael Stefanel Gris, Arnaldo Candido Junior, Marcelo Finger, Flaviane Svartman, Beatriz Raposo, Marcus Vinícius Moreira Martins, Sandra Maria Aluísio, Larissa Cristina Berti, João Paulo Teixeira

    Abstract: During the outbreak of COVID-19 pandemic, several research areas joined efforts to mitigate the damages caused by SARS-CoV-2. In this paper we present an interpretability analysis of a convolutional neural network based model for COVID-19 detection in audios. We investigate which features are important for model decision process, investigating spectrograms, F0, F0 standard deviation, sex and age.… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 14 pages, 4 figures

  7. arXiv:2210.14716  [pdf, other

    cs.SD cs.LG eess.AS

    Pretrained audio neural networks for Speech emotion recognition in Portuguese

    Authors: Marcelo Matheus Gauy, Marcelo Finger

    Abstract: The goal of speech emotion recognition (SER) is to identify the emotional aspects of speech. The SER challenge for Brazilian Portuguese speech was proposed with short snippets of Portuguese which are classified as neutral, non-neutral female and non-neutral male according to paralinguistic elements (laughing, crying, etc). This dataset contains about $50$ minutes of Brazilian Portuguese speech. As… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Journal ref: First Workshop on Automatic Speech Recognition for Spontaneous and Prepared Speech Speech emotion recognition in Portuguese (SER 2022)

  8. arXiv:2210.14085  [pdf, other

    cs.SD cs.LG eess.AS

    Audio MFCC-gram Transformers for respiratory insufficiency detection in COVID-19

    Authors: Marcelo Matheus Gauy, Marcelo Finger

    Abstract: This work explores speech as a biomarker and investigates the detection of respiratory insufficiency (RI) by analyzing speech samples. Previous work \cite{spira2021} constructed a dataset of respiratory insufficiency COVID-19 patient utterances and analyzed it by means of a convolutional neural network achieving an accuracy of $87.04\%$, validating the hypothesis that one can detect RI through spe… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Journal ref: SIMPÓSIO BRASILEIRO DE TECNOLOGIA DA INFORMAÇÃO E DA LINGUAGEM HUMANA (STIL), 13. , 2021, Evento Online. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021 . p. 143-152

  9. arXiv:2205.09185  [pdf, other

    physics.ins-det cs.LG hep-ex nucl-ex physics.comp-ph

    AI-assisted Optimization of the ECCE Tracking System at the Electron Ion Collider

    Authors: C. Fanelli, Z. Papandreou, K. Suresh, J. K. Adkins, Y. Akiba, A. Albataineh, M. Amaryan, I. C. Arsene, C. Ayerbe Gayoso, J. Bae, X. Bai, M. D. Baker, M. Bashkanov, R. Bellwied, F. Benmokhtar, V. Berdnikov, J. C. Bernauer, F. Bock, W. Boeglin, M. Borysova, E. Brash, P. Brindza, W. J. Briscoe, M. Brooks, S. Bueltmann , et al. (258 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC) is a cutting-edge accelerator facility that will study the nature of the "glue" that binds the building blocks of the visible matter in the universe. The proposed experiment will be realized at Brookhaven National Laboratory in approximately 10 years from now, with detector design and R&D currently ongoing. Notably, EIC is one of the first large-scale facilities to… ▽ More

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 16 pages, 18 figures, 2 appendices, 3 tables

  10. arXiv:2205.00361  [pdf, other

    cs.LG cs.AI cs.CR

    Combined Learning of Neural Network Weights for Privacy in Collaborative Tasks

    Authors: Aline R. Ioste, Alan M. Durham, Marcelo Finger

    Abstract: We introduce CoLN, Combined Learning of Neural network weights, a novel method to securely combine Machine Learning models over sensitive data with no sharing of data. With CoLN, local hosts use the same Neural Network architecture and base parameters to train a model using only locally available data. Locally trained models are then submitted to a combining agent, which produces a combined model.… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

  11. verBERT: Automating Brazilian Case Law Document Multi-label Categorization Using BERT

    Authors: Felipe R. Serras, Marcelo Finger

    Abstract: In this work, we carried out a study about the use of attention-based algorithms to automate the categorization of Brazilian case law documents. We used data from the Kollemata Project to produce two distinct datasets with adequate class systems. Then, we implemented a multi-class and multi-label version of BERT and fine-tuned different BERT models with the produced datasets. We evaluated several… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 10 pages, 2 tables

    ACM Class: I.2.7

    Journal ref: SERRAS, F. R.; FINGER, M. verBERT: Automating Brazilian Case Law Document Multi-label Categorization Using BERT. In 13th Brazilian Simposiun on Human Language and Information Technology (STIL), 2021. pp. 237-246

  12. arXiv:2201.00746  [pdf, other

    cs.GT

    Coherence of probabilistic constraints on Nash equilibria

    Authors: Sandro Preto, Eduardo Fermé, Marcelo Finger

    Abstract: Observable games are game situations that reach one of possibly many Nash equilibria. Before an instance of the game starts, an external observer does not know, a priori, what is the exact profile of actions that will occur; thus, he assigns subjective probabilities to players' actions. However, not all probabilistic assignments are coherent with a given game. We study the decision problem of dete… ▽ More

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: Preprint

  13. Text-to-hashtag Generation using Seq2seq Learning

    Authors: Augusto Camargo, Wesley Carvalho, Felipe Peressim, Alan Barzilay, Marcelo Finger

    Abstract: In this paper, we studied whether models based on BiLSTM and BERT can predict hashtags in Brazilian Portuguese for Ecommerce websites. Hashtags have a sizable financial impact on Ecommerce. We processed a corpus of Ecommerce reviews as inputs, and predicted hashtags as outputs. We evaluated the results using four quantitative metrics: NIST, BLEU, METEOR and a crowdsourced score. A word cloud was u… ▽ More

    Submitted 13 July, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

  14. Extending Description Logic EL++ with Linear Constraints on the Probability of Axioms

    Authors: Marcelo Finger

    Abstract: One of the main reasons to employ a description logic such as EL or EL++ is the fact that it has efficient, polynomial-time algorithmic properties such as deciding consistency and inferring subsumption. However, simply by adding negation of concepts to it, we obtain the expressivity of description logics whose decision procedure is {ExpTime}-complete. Similar complexity explosion occurs if we add… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: An earlier version of this work has appeared at Franz Baader's festschrift. Here we detail the column generation method and present a detailed example

    MSC Class: 03B70; 03B48 ACM Class: I.2.4; F.4.1; G.3

    Journal ref: In Lecture Notes in Computer Science 11560, pp. 286--300. Springer (2019)

  15. arXiv:1905.05704  [pdf, other

    cs.CL cs.AI cs.LG

    A logical-based corpus for cross-lingual evaluation

    Authors: Felipe Salvatore, Marcelo Finger, Roberto Hirata Jr

    Abstract: At present, different deep learning models are presenting high accuracy on popular inference datasets such as SNLI, MNLI, and SciTail. However, there are different indicators that those datasets can be exploited by using some simple linguistic patterns. This fact poses difficulties to our understanding of the actual capacity of machine learning models to solve the complex task of textual inference… ▽ More

    Submitted 23 October, 2019; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: To appear in the proceedings of the Deep Learning for low-resource NLP (DeepLo) workshop at EMNLP 2019

  16. arXiv:1905.05665  [pdf, ps, other

    cs.LO cs.AI

    Quantitative Logic Reasoning

    Authors: Marcelo Finger

    Abstract: In this paper we show several similarities among logic systems that deal simultaneously with deductive and quantitative inference. We claim it is appropriate to call the tasks those systems perform as Quantitative Logic Reasoning. Analogous properties hold throughout that class, for whose members there exists a set of linear algebraic techniques applicable in the study of satisfiability decision p… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Appeared as a chapter in Trends in Logic series

    Journal ref: In W. Carnielli and J. Malinowski, editors, Contradictions, from Consistency to Inconsistency, Trends in Logic, pages 241-272. Springer International Publishing, 2018

  17. arXiv:1807.07108  [pdf, other

    cs.CL

    Semantic Parsing: Syntactic assurance to target sentence using LSTM Encoder CFG-Decoder

    Authors: Fabiano Ferreira Luz, Marcelo Finger

    Abstract: Semantic parsing can be defined as the process of map** natural language sentences into a machine interpretable, formal representation of its meaning. Semantic parsing using LSTM encoder-decoder neural networks have become promising approach. However, human automated translation of natural language does not provide grammaticality guarantees for the sentences generate such a guarantee is particul… ▽ More

    Submitted 18 July, 2018; originally announced July 2018.

  18. arXiv:1803.04329  [pdf, other

    cs.CL

    Semantic Parsing Natural Language into SPARQL: Improving Target Language Representation with Neural Attention

    Authors: Fabiano Ferreira Luz, Marcelo Finger

    Abstract: Semantic parsing is the process of map** a natural language sentence into a formal representation of its meaning. In this work we use the neural network approach to transform natural language sentence into a query to an ontology database in the SPARQL language. This method does not rely on handcraft-rules, high-quality lexicons, manually-built templates or other handmade complex structures. Our… ▽ More

    Submitted 12 March, 2018; originally announced March 2018.

  19. arXiv:1203.6161  [pdf, ps, other

    cs.CC quant-ph

    Classical and quantum satisfiability

    Authors: Anderson de Araújo, Marcelo Finger

    Abstract: We present the linear algebraic definition of QSAT and propose a direct logical characterization of such a definition. We then prove that this logical version of QSAT is not an extension of classical satisfiability problem (SAT). This shows that QSAT does not allow a direct comparison between the complexity classes NP and QMA, for which SAT and QSAT are respectively complete.

    Submitted 28 March, 2012; originally announced March 2012.

    Comments: In Proceedings LSFA 2011, arXiv:1203.5423

    Journal ref: EPTCS 81, 2012, pp. 79-84

  20. Towards an efficient prover for the C1 paraconsistent logic

    Authors: Adolfo Neto, Celso A. A. Kaestner, Marcelo Finger

    Abstract: The KE inference system is a tableau method developed by Marco Mondadori which was presented as an improvement, in the computational efficiency sense, over Analytic Tableaux. In the literature, there is no description of a theorem prover based on the KE method for the C1 paraconsistent logic. Paraconsistent logics have several applications, such as in robot control and medicine. These applications… ▽ More

    Submitted 19 February, 2012; originally announced February 2012.

    Comments: 16 pages

    Journal ref: Electronic Notes in Theoretical Computer Science. Volume 256, 2 December 2009, Pages 87-102. Proceedings of the Fourth Workshop on Logical and Semantic Frameworks, with Applications (LSFA 2009)