Showing 1–50 of 325 results for author: Ribeiro, M

  1. arXiv:2407.04858  [pdf, other]

    cs.CL

    Question Answering with Texts and Tables through Deep Reinforcement Learning

    Authors: Marcos M. José, Flávio N. Cação, Maria F. Ribeiro, Rafael M. Cheang, Paulo Pirozelli, Fabio G. Cozman

    Abstract: This paper proposes a novel architecture to generate multi-hop answers to open domain questions that require information from texts and tables, using the Open Table-and-Text Question Answering dataset for validation and training. One of the most common ways to generate answers in this setting is to retrieve information sequentially, where a selected piece of data helps searching for the next piece…

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2405.02150  [pdf, other]

    cs.CY

    The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates

    Authors: Giuseppe Russo Latona, Manoel Horta Ribeiro, Tim R. Davidson, Veniamin Veselovsky, Robert West

    Abstract: Journals and conferences worry that peer reviews assisted by artificial intelligence (AI), in particular, large language models (LLMs), may negatively influence the validity and fairness of the peer-review system, a cornerstone of modern science. In this work, we address this concern with a quasi-experimental study of the prevalence and impact of AI-assisted peer reviews in the context of the 2024…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Manoel Horta Ribeiro, Tim R. Davidson, and Veniamin Veselovsky contributed equally to this work

  3. A Catalog of Transformations to Remove Smells From Natural Language Tests

    Authors: Manoel Aranda, Naelson Oliveira, Elvys Soares, Márcio Ribeiro, Davi Romão, Ullyanne Patriota, Rohit Gheyi, Emerson Souza, Ivan Machado

    Abstract: Test smells can pose difficulties during testing activities, such as poor maintainability, non-deterministic behavior, and incomplete verification. Existing research has extensively addressed test smells in automated software tests but little attention has been given to smells in natural language tests. While some research has identified and catalogued such smells, there is a lack of systematic ap…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Distinguished Paper Award at International Conference on Evaluation and Assessment in Software Engineering (EASE), 2024 edition

    ACM Class: D.2.5

  4. arXiv:2404.11789  [pdf, other]

    physics.flu-dyn

    An Invitation to Resolvent Analysis

    Authors: Laura Victoria Rolandi, Jean Hélder Marques Ribeiro, Chi-An Yeh, Kunihiko Taira

    Abstract: Resolvent analysis is a powerful tool that can reveal the linear amplification mechanisms between the forcing inputs and the response outputs about a base flow. These mechanisms can be revealed in terms of a pair of forcing and response modes and the associated gains (amplification magnitude) in the order of energy contents at a given frequency. The linear relationship that ties the forcing and th…

    Submitted 25 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  5. arXiv:2404.00750  [pdf, other]

    cs.CL cs.CY

    Can Language Models Recognize Convincing Arguments?

    Authors: Paula Rescala, Manoel Horta Ribeiro, Tiancheng Hu, Robert West

    Abstract: The remarkable and ever-increasing capabilities of Large Language Models (LLMs) have raised concerns about their potential misuse for creating personalized, convincing misinformation and propaganda. To gain insights into LLMs' persuasive capabilities without directly engaging in experimentation with humans, we propose studying their performance on the related task of detecting convincing arguments…

    Submitted 31 March, 2024; originally announced April 2024.

  6. arXiv:2403.14380  [pdf, other]

    cs.CY

    On the Conversational Persuasiveness of Large Language Models: A Randomized Controlled Trial

    Authors: Francesco Salvi, Manoel Horta Ribeiro, Riccardo Gallotti, Robert West

    Abstract: The development and popularization of large language models (LLMs) have raised concerns that they will be used to create tailor-made, convincing arguments to push false or misleading narratives online. Early work has found that language models can generate content perceived as at least on par and often more persuasive than human-written messages. However, there is still limited knowledge about LLM…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 33 pages, 10 figures, 7 tables

  7. arXiv:2403.08655  [pdf, ps, other]

    cond-mat.supr-con

    Efficient electronic cooling above 2 K by niobium-based superconducting tunnel junctions

    Authors: J. Hätinen, A. Ronzani, R. P. Loreto, E. Mykkänen, A. Kemppinen, K. Viisanen, T. Rantanen, J. Geisor, J. Lehtinen, M. Ribeiro, J-P. Kaikkonen, O. Prakash, V. Vesterinen, W. Förbom, E. T. Mannila, M. Kervinen, J. Govenius, M. Prunnila

    Abstract: Numerous applications, from industrial non-destructive imaging through ultra-sensitive photon counting to various implementations of solid-state quantum computers require low temperatures for their sensor and processor chips. Replacing the bulky cryo-liquid based cooling stages of cryo-enabled instruments by chip scale refrigeration is envisioned to disruptively reduce the system size similarly as…

    Submitted 13 March, 2024; originally announced March 2024.

  8. arXiv:2402.12553  [pdf, ps, other]

    physics.flu-dyn

    Triglobal resolvent-analysis-based control of separated flows around low-aspect-ratio wings

    Authors: Jean Hélder Marques Ribeiro, Kunihiko Taira

    Abstract: We perform direct numerical simulations (DNS) of actively controlled laminar separated wakes around low-aspect-ratio wings with two primary goals: (i) reducing the size of the separation bubble and (ii) attenuating the wing tip vortex. Instead of preventing separation, we modify the three-dimensional (3-D) dynamics to exploit wake vortices for aerodynamic enhancements. A direct wake modification…

    Submitted 18 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  9. arXiv:2402.01573  [pdf, other]

    cs.SE

    An Actionable Framework for Understanding and Improving Talent Retention as a Competitive Advantage in IT Organizations

    Authors: Luiz Alexandre Costa, Edson Dias, Danilo Monteiro Ribeiro, Awdren Fontão, Gustavo Pinto, Rodrigo Pereira dos Santos, Alexander Serebrenik

    Abstract: In the rapidly evolving global business landscape, the demand for software has intensified competition among organizations, leading to challenges in retaining highly qualified IT members in software organizations. One of the problems faced by IT organizations is the retention of these strategic professionals, also known as talent. This work presents an actionable framework for Talent Retention (TR…

    Submitted 24 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2205.06352 by other authors

  10. arXiv:2401.18060  [pdf, ps, other]

    math.NT cs.DM math.CO

    Rarity of the infinite chains in the tree of numerical semigroups

    Authors: Maria Bras-Amorós, Mariana Rosas Ribeiro

    Abstract: We prove that, for each fixed genus, the portion of semigroups of that genus belonging to infinite chains in the semigroup tree approaches 0 as the genus grows to infinity. This means that most numerical semigroups have a finite number of descendants in the semigroup tree. This problem has been open since 2009.

    Submitted 31 January, 2024; originally announced January 2024.

    MSC Class: 68W30; 06F05; 20M14; 05A99

  11. arXiv:2401.01406  [pdf]

    cond-mat.soft

    Influence of Surface Roughness on Linear Behavior and Mechanical Properties of Three Cyanoacrylate-Based Adhesives Used to Bond Strain Gages

    Authors: L. G. Simao, W. P. Jesus, M. E. A. Ribeiro, H. C. Rangel, R. J. S. Rodriguez, E. A. Carvalho

    Abstract: The challenge of accessing specialized adhesives designed for strain gage applications has been highlighted due to failures in logistic chains, requiring the exploration of local alternatives. A direct simulation of strain gage bonding behavior with two steel plates is infeasible due to the unique construction of strain gages. Therefore, an indirect simulation method, comparing local alternatives…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 44 pages, 24 figures

  12. arXiv:2401.01253  [pdf, other]

    cs.SI cs.CY

    Deplatforming Norm-Violating Influencers on Social Media Reduces Overall Online Attention Toward Them

    Authors: Manoel Horta Ribeiro, Shagun Jhaver, Jordi Cluet i Martinell, Marie Reignier-Tayar, Robert West

    Abstract: From politicians to podcast hosts, online platforms have systematically banned ("deplatformed") influential users for breaking platform guidelines. Previous inquiries on the effectiveness of this intervention are inconclusive because 1) they consider only a few deplatforming events; 2) they consider only overt engagement traces (e.g., likes and posts) but not passive engagement (e.g., views); 3) t…

    Submitted 2 January, 2024; originally announced January 2024.

  13. arXiv:2312.11307  [pdf, other]

    hep-ph

    New Constraint for Isotropic Lorentz Violation from LHC Data

    Authors: David Amram, Killian Bouzoud, Nicolas Chanon, Hubert Hansen, Marcos R. Ribeiro Jr., Marco Schreck

    Abstract: New calculations for the kinematics of photon decay to fermions in vacuo under an isotropic violation of Lorentz invariance (LV), parameterized by the Standard-Model Extension (SME), are presented in this paper and used to interpret prompt photon production in LHC data. The measurement of inclusive prompt photon production at the LHC Run 2, with photons observed up to a transverse energy of 2.5 Te…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 9 pages, 3 figures

  14. arXiv:2312.11240  [pdf, other]

    cs.SD eess.AS

    Evaluation of Barlow Twins and VICReg self-supervised learning for sound patterns of bird and anuran species

    Authors: Fábio Felix Dias, Moacir Antonelli Ponti, Mílton Cezar Ribeiro, Rosane Minghim

    Abstract: Taking advantage of the structure of large datasets to pre-train Deep Learning models is a promising strategy to decrease the need for supervised data. Self-supervised learning methods, such as contrastive and its variation are a promising way towards obtaining better representations in many Deep Learning applications. Soundscape ecology is one application in which annotations are expensive and sc…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 10 pages, 2 figures, 3 tables

  15. arXiv:2310.15683  [pdf, other]

    cs.CL

    Prevalence and prevention of large language model use in crowd work

    Authors: Veniamin Veselovsky, Manoel Horta Ribeiro, Philip Cozzolino, Andrew Gordon, David Rothschild, Robert West

    Abstract: We show that the use of large language models (LLMs) is prevalent among crowd workers, and that targeted mitigation strategies can significantly reduce, but not eliminate, LLM use. On a text summarization task where workers were not directed in any way regarding their LLM use, the estimated prevalence of LLM use was around 30%, but was reduced by about half by asking workers to not use LLMs and by…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: VV and MHR equal contribution. 14 pages, 1 figure, 1 table

  16. arXiv:2310.12696  [pdf, other]

    cs.CY

    Protection from Evil and Good: The Differential Effects of Page Protection on Wikipedia Article Quality

    Authors: Thorsten Ruprechter, Manoel Horta Ribeiro, Robert West, Denis Helic

    Abstract: Wikipedia, the Web's largest encyclopedia, frequently faces content disputes or malicious users seeking to subvert its integrity. Administrators can mitigate such disruptions by enforcing "page protection" that selectively limits contributions to specific articles to help prevent the degradation of content. However, this practice contradicts one of Wikipedia's fundamental principles$-$that it is o…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Under Review, 11 pages

  17. arXiv:2310.12186  [pdf, other]

    cs.SI cs.AI

    Stranger Danger! Cross-Community Interactions with Fringe Users Increase the Growth of Fringe Communities on Reddit

    Authors: Giuseppe Russo, Manoel Horta Ribeiro, Robert West

    Abstract: Fringe communities promoting conspiracy theories and extremist ideologies have thrived on mainstream platforms, raising questions about the mechanisms driving their growth. Here, we hypothesize and study a possible mechanism: new members may be recruited through fringe-interactions: the exchange of comments between members and non-members of fringe communities. We apply text-based causal inference…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 11 Pages, 7 Figures, 3 Tables

  18. arXiv:2309.10190  [pdf, ps, other]

    math.AP math.OC

    Revisited convexity notions for $L^\infty$ variational problems

    Authors: Ana Margarida Ribeiro, Elvira Zappale

    Abstract: We address a deep study of the convexity notions that arise in the study of weak* lower semicontinuity of supremal functionals as well as those raised by the power-law approximation of such functionals. Our quest is motivated by the knowledge we have on the analogous integral functionals and aims at establishing a solid groundwork to ease any research in the $L^\infty$ context.

    Submitted 18 September, 2023; originally announced September 2023.

    MSC Class: 26B25; 49J45

  19. arXiv:2308.16428  [pdf, other]

    math.DG

    On the topology of the Milnor Boundary for real analytic singularities

    Authors: R. Araújo dos Santos, A. Menegon, M. Ribeiro, J. Seade, I. D. Santamaria Guarín

    Abstract: We study the topology of the boundaries of the Milnor fibers of real analytic map-germs $f: (\mathbb{R}^M,0) \to (\mathbb{R}^K,0)$ and $f_{I}:=\Pi_{I}\circ f : (\mathbb{R}^M,0) \to (\mathbb{R}^I,0)$ that admit Milnor's tube fibrations, where $\Pi_{I}:(\mathbb{R}^K,0)\to (\mathbb{R}^{I},0)$ is the canonical projection for $1\leq I<K.$ For each $I$ we prove that the Milnor boundary $\partial F_{I}$ is…

    Submitted 30 August, 2023; originally announced August 2023.

  20. arXiv:2308.14624  [pdf, ps, other]

    math.AC

    Universally injective and integral contractions on relative Lipschitz saturation of algebras

    Authors: Thiago da Silva, Maico Ribeiro

    Abstract: In this work, we obtain contraction results for a class of diagrams of ring morphisms which strictly includes the ones obtained by Lipman. Further, we present some applications on quotient and in the changing of the base ring in the saturation.

    Submitted 28 August, 2023; originally announced August 2023.

  21. arXiv:2308.13359  [pdf, ps, other]

    math.DS math.AP math.DG math.GT

    Topology of first integrals via Milnor fibrations II

    Authors: Fernando Reis, Maico Ribeiro, Euripedes da Silva

    Abstract: This survey is the continuation of a series of works aimed at applying tools from Singularity Theory to Differential Equations. More precisely, we utilize the powerful Milnor's Fibration Theory to give geometric-topological classifications of first integrals of differential systems. In the previous paper, systems of first-order quasilinear partial differential equations were examined, focusing on…

    Submitted 25 August, 2023; originally announced August 2023.

    MSC Class: 14J17; 57R30; 14D06

  22. Some remarks about $\rho$-regularity for real analytic maps

    Authors: Maico Ribeiro, Ivan Santamaria, Thiago da Silva

    Abstract: In this paper, we discuss the concept of $\rho$-regularity of analytic map germs and its close relationship with the existence of locally trivial smooth fibrations, known as the Milnor fibrations. The presence of a Thom regular stratification or the Milnor condition (b) at the origin indicates the transversality of the fibers of the map G with respect to the levels of a function $\rho$, which guarantee…

    Submitted 25 August, 2023; originally announced August 2023.

  23. arXiv:2308.12442  [pdf, other]

    physics.flu-dyn

    Similarities in Massive Separation Across Reynolds Numbers for Swept and Tapered Finite Span Wings

    Authors: Jacob Neal, Anton Burtsev, Jean Helder Marques Ribeiro, Kunihiko Taira, Vassilios Theofilis, Michael Amitay

    Abstract: Experimental investigations were performed to elucidate the features of flow fields occurring over cantilevered finite-aspect ratio NACA 0015 wings at high angles of attack with various sweep angles and taper ratios. Volumetric Stereoscopic Particle Image Velocimetry experiments were performed at mean chord based Reynolds number of 247,500 in a wind tunnel and 600 in a water tunnel. Direct Numeric…

    Submitted 23 August, 2023; originally announced August 2023.

  24. arXiv:2308.10398  [pdf, other]

    cs.SI

    Causally estimating the effect of YouTube's recommender system using counterfactual bots

    Authors: Homa Hosseinmardi, Amir Ghasemian, Miguel Rivera-Lanas, Manoel Horta Ribeiro, Robert West, Duncan J. Watts

    Abstract: In recent years, critics of online platforms have raised concerns about the ability of recommendation algorithms to amplify problematic content, with potentially radicalizing consequences. However, attempts to evaluate the effect of recommenders have suffered from a lack of appropriate counterfactuals -- what a user would have viewed in the absence of algorithmic recommendations -- and hence canno…

    Submitted 1 December, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

  25. arXiv:2308.01386  [pdf, other]

    cs.SE

    Manual Tests Do Smell! Cataloging and Identifying Natural Language Test Smells

    Authors: Elvys Soares, Manoel Aranda, Naelson Oliveira, Márcio Ribeiro, Rohit Gheyi, Emerson Souza, Ivan Machado, André Santos, Baldoino Fonseca, Rodrigo Bonifácio

    Abstract: Background: Test smells indicate potential problems in the design and implementation of automated software tests that may negatively impact test code maintainability, coverage, and reliability. When poorly described, manual tests written in natural language may suffer from related problems, which enable their analysis from the point of view of test smells. Despite the possible prejudice to manuall…

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: The 17th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM), 2023

  26. arXiv:2307.16709  [pdf, other]

    cs.CL eess.AS

    Multilingual context-based pronunciation learning for Text-to-Speech

    Authors: Giulia Comini, Manuel Sam Ribeiro, Fan Yang, Heereen Shim, Jaime Lorenzo-Trueba

    Abstract: Phonetic information and linguistic knowledge are an essential component of a Text-to-speech (TTS) front-end. Given a language, a lexicon can be collected offline and Grapheme-to-Phoneme (G2P) relationships are usually modeled in order to predict the pronunciation for out-of-vocabulary (OOV) words. Additionally, post-lexical phonology, often defined in the form of rule-based systems, is used to co…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 5 pages, 2 figures, 5 tables. Interspeech 2023

  27. arXiv:2307.16696  [pdf, other]

    cs.SE

    Large Language Models for Education: Grading Open-Ended Questions Using ChatGPT

    Authors: Gustavo Pinto, Isadora Cardoso-Pereira, Danilo Monteiro Ribeiro, Danilo Lucena, Alberto de Souza, Kiev Gama

    Abstract: As a way of addressing increasingly sophisticated problems, software professionals face the constant challenge of seeking improvement. However, for these individuals to enhance their skills, their process of studying and training must involve feedback that is both immediate and accurate. In the context of software companies, where the scale of professionals undergoing training is large, but the nu…

    Submitted 1 August, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: 10 pages, 2 figures

    Journal ref: SBES EDU Track, 2023

  28. arXiv:2307.16679  [pdf, other]

    eess.AS cs.CL cs.LG

    Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

    Authors: Guangyan Zhang, Thomas Merritt, Manuel Sam Ribeiro, Biel Tura-Vecino, Kayoko Yanagisawa, Kamil Pokora, Abdelhamid Ezzerg, Sebastian Cygert, Ammar Abbas, Piotr Bilinski, Roberto Barra-Chicote, Daniel Korzekwa, Jaime Lorenzo-Trueba

    Abstract: Neural text-to-speech systems are often optimized on L1/L2 losses, which make strong assumptions about the distributions of the target data space. Aiming to improve those assumptions, Normalizing Flows and Diffusion Probabilistic Models were recently proposed as alternatives. In this paper, we compare traditional L1/L2-based approaches to diffusion and flow-based approaches for the tasks of prosod…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 5 pages, 2 figures, 5 tables. Interspeech 2023

  29. arXiv:2307.16643  [pdf, other]

    eess.AS cs.CL

    Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings

    Authors: Manuel Sam Ribeiro, Giulia Comini, Jaime Lorenzo-Trueba

    Abstract: The Grapheme-to-Phoneme (G2P) task aims to convert orthographic input into a discrete phonetic representation. G2P conversion is beneficial to various speech processing applications, such as text-to-speech and speech recognition. However, these tend to rely on manually-annotated pronunciation dictionaries, which are often time-consuming and costly to acquire. In this paper, we propose a method to…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 5 pages, 2 figures, 4 tables. Interspeech 2023

  30. Enhancing Network Slicing Architectures with Machine Learning, Security, Sustainability and Experimental Networks Integration

    Authors: Joberto S. B. Martins, Tereza C. Carvalho, Rodrigo Moreira, Cristiano Both, Adnei Donatti, João H. Corrêa, José A. Suruagy, Sand L. Corrêa, Antonio J. G. Abelem, Moisés R. N. Ribeiro, Jose-Marcos Nogueira, Luiz C. S. Magalhães, Juliano Wickboldt, Tiago Ferreto, Ricardo Mello, Rafael Pasquini, Marcos Schwarz, Leobino N. Sampaio, Daniel F. Macedo, José F. de Rezende, Kleber V. Cardoso, Flávio O. Silva

    Abstract: Network Slicing (NS) is an essential technique extensively used in 5G networks computing strategies, mobile edge computing, mobile cloud computing, and verticals like the Internet of Vehicles and industrial IoT, among others. NS is foreseen as one of the leading enablers for 6G futuristic and highly demanding applications since it allows the optimization and customization of scarce and disputed re…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 10 pages, 11 figures

    ACM Class: I.2.1; C.2.1; C.2.3

    Journal ref: IEEE ACCESS 2023

  31. arXiv:2307.06954  [pdf, other]

    cs.CL cs.AI

    ACTI at EVALITA 2023: Overview of the Conspiracy Theory Identification Task

    Authors: Giuseppe Russo, Niklas Stoehr, Manoel Horta Ribeiro

    Abstract: The Conspiracy Theory Identification task is a new shared task proposed for the first time at Evalita 2023. The ACTI challenge, based exclusively on comments published on conspiratorial channels of Telegram, is divided into two subtasks: (i) Conspiratorial Content Classification: identifying conspiratorial content and (ii) Conspiratorial Category Classification about specific conspiracy theory class…

    Submitted 2 September, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted at the Evalita Workshop 2023

  32. arXiv:2307.03791  [pdf, other]

    math.AG

    Tameness conditions and the Milnor fibrations for composite singularities

    Authors: R. N. Araújo dos Santos, D. Dreibelbis, M. F. Ribeiro, I. D. Santamaría Guarín

    Abstract: In this paper, we introduce a new regularity condition that characterizes the tameness of a composite singularity $H=G\circ F$ in a sharp way. Our approach provides a natural tool that links the topology of the Milnor tube fibrations through the Milnor fibers of the respective components of the map germs $F$, $G$ and $H = G\circ F$. We also study the invariance of tameness by $\mathcal{L}$-equival…

    Submitted 24 May, 2023; originally announced July 2023.

    MSC Class: 58K15; 14D06; 58K35; 14B05; 32S55; 32S05

  33. arXiv:2307.02903  [pdf]

    physics.chem-ph cs.LG

    PUFFIN: A Path-Unifying Feed-Forward Interfaced Network for Vapor Pressure Prediction

    Authors: Vinicius Viena Santana, Carine Menezes Rebello, Luana P. Queiroz, Ana Mafalda Ribeiro, Nadia Shardt, Idelfonso B. R. Nogueira

    Abstract: Accurately predicting vapor pressure is vital for various industrial and environmental applications. However, obtaining accurate measurements for all compounds of interest is not possible due to the resource and labor intensity of experiments. The demand for resources and labor further multiplies when a temperature-dependent relationship for predicting vapor pressure is desired. In this paper, we…

    Submitted 8 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

  34. arXiv:2306.17298  [pdf, other]

    cs.CY

    Tube2Vec: Social and Semantic Embeddings of YouTube Channels

    Authors: Léopaul Boesinger, Manoel Horta Ribeiro, Veniamin Veselovsky, Robert West

    Abstract: Research using YouTube data often explores social and semantic dimensions of channels and videos. Typically, analyses rely on laborious manual annotation of content and content creators, often found by low-recall methods such as keyword search. Here, we explore an alternative approach, using latent representations (embeddings) obtained via machine learning. Using a large dataset of YouTube links s…

    Submitted 29 June, 2023; originally announced June 2023.

  35. arXiv:2306.07899  [pdf, other]

    cs.CL cs.CY

    Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks

    Authors: Veniamin Veselovsky, Manoel Horta Ribeiro, Robert West

    Abstract: Large language models (LLMs) are remarkable data annotators. They can be used to generate high-fidelity supervised training data, as well as survey and experimental data. With the widespread adoption of LLMs, human gold-standard annotations are key to understanding the capabilities of LLMs and the validity of their results. However, crowdsourcing, an important, inexpensive way to obtain human ann…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 9 pages, 4 figures

  36. arXiv:2306.03280  [pdf, other]

    cs.HC

    AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms

    Authors: Zana Buçinca, Chau Minh Pham, Maurice Jakesch, Marco Tulio Ribeiro, Alexandra Olteanu, Saleema Amershi

    Abstract: While demands for change and accountability for harmful AI consequences mount, foreseeing the downstream effects of deploying AI systems remains a challenging task. We developed AHA! (Anticipating Harms of AI), a generative framework to assist AI practitioners and decision-makers in anticipating potential harms and unintended consequences of AI systems prior to development or deployment. Given an…

    Submitted 5 June, 2023; originally announced June 2023.

  37. arXiv:2305.17804  [pdf, other]

    cs.CL

    Targeted Data Generation: Finding and Fixing Model Weaknesses

    Authors: Zexue He, Marco Tulio Ribeiro, Fereshte Khani

    Abstract: Even when aggregate accuracy is high, state-of-the-art NLP models often fail systematically on specific subgroups of data, resulting in unfair outcomes and eroding user trust. Additional data collection may not help in addressing these weaknesses, as such challenging subgroups may be unknown to users, and underrepresented in the existing and new data. We propose Targeted Data Generation (TDG), a f…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  38. arXiv:2305.17106  [pdf, other]

    cs.SE

    Understanding Self-Efficacy in the Context of Software Engineering: A Qualitative Study in the Industry

    Authors: Danilo Monteiro Ribeiro, Rayfran Rocha Lima, César França, Alberto de Souza, Isadora Cardoso-Pereira, Gustavo Pinto

    Abstract: CONTEXT: Self-efficacy is a concept researched in various areas of knowledge that impacts various factors such as performance, satisfaction, and motivation. In Software Engineering, it has mainly been studied in the academic context, presenting results similar to other areas of knowledge. However, it is also important to understand its impact in the industrial context. OBJECTIVE: Therefore, this s…

    Submitted 2 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 10 pages, 3 figures

    Journal ref: Published at EASE 2023

  39. arXiv:2305.15041  [pdf, other]

    cs.CL

    Generating Faithful Synthetic Data with Large Language Models: A Case Study in Computational Social Science

    Authors: Veniamin Veselovsky, Manoel Horta Ribeiro, Akhil Arora, Martin Josifoski, Ashton Anderson, Robert West

    Abstract: Large Language Models (LLMs) have democratized synthetic data generation, which in turn has the potential to simplify and broaden a wide gamut of NLP tasks. Here, we tackle a pervasive problem in synthetic data generation: its generative distribution often differs from the distribution of real-world data researchers care about (in other words, it is unfaithful). In a case study on sarcasm detectio…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 8 pages

  40. arXiv:2305.12219  [pdf, other]

    cs.LG cs.AI cs.CL

    Collaborative Development of NLP models

    Authors: Fereshte Khani, Marco Tulio Ribeiro

    Abstract: Despite substantial advancements, Natural Language Processing (NLP) models often require post-training adjustments to enforce business rules, rectify undesired behavior, and align with user values. These adjustments involve operationalizing "concepts"--dictating desired model responses to certain inputs. However, it's difficult for a single entity to enumerate and define all possible concepts, ind…

    Submitted 24 May, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  41. arXiv:2304.09991  [pdf, other]

    cs.HC cs.AI cs.CL

    Supporting Human-AI Collaboration in Auditing LLMs with LLMs

    Authors: Charvi Rastogi, Marco Tulio Ribeiro, Nicholas King, Harsha Nori, Saleema Amershi

    Abstract: Large language models are becoming increasingly pervasive and ubiquitous in society via deployment in sociotechnical systems. Yet these language models, be it for classification or generation, have been shown to be biased and behave irresponsibly, causing harm to people at scale. It is crucial to audit these language models rigorously. Existing auditing tools leverage either or both humans and AI…

    Submitted 30 November, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 21 pages, 3 figures

    Journal ref: In Proceedings of the 2023 AAAI and ACM Conference on AI, Ethics, and Society. Association for Computing Machinery, New York, NY, USA, 913-926

  42. arXiv:2304.07587  [pdf, other]

    physics.flu-dyn

    Laminar post-stall wakes of tapered swept wings

    Authors: Jean Hélder Marques Ribeiro, Jacob Neal, Anton Burtsev, Michael Amitay, Vassilios Theofilis, Kunihiko Taira

    Abstract: While tapered swept wings are widely used, the influence of taper on their post-stall wake characteristics remains largely unexplored. To address this issue, we conduct an extensive study using direct numerical simulations to characterize the wing taper and sweep effects on laminar separated wakes. We analyze flows behind NACA 0015 cross-sectional profile wings at post-stall angles of attack…

    Submitted 19 October, 2023; v1 submitted 15 April, 2023; originally announced April 2023.

  43. arXiv:2304.06774  [pdf]

    cond-mat.soft physics.chem-ph physics.flu-dyn

    Confined ionic liquids films under shear: The importance of the chemical nature of the solid surface

    Authors: Kalil Bernardino, Mauro C. C. Ribeiro

    Abstract: Ionic liquids have generated interest in applications as lubricants and as additives to conventional lubricants due to their unique physical properties. In these applications, the liquid thin film can be subjected simultaneously to extremely high shear and loads in addition to nanoconfinement effects. Here, we use molecular dynamics simulations with a coarse grained model to study a nanometric fil…

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 32 pages, 8 figures

    Journal ref: J. Chem. Phys. 158, 094712 (2023)

  44. arXiv:2303.16151  [pdf, other]

    q-fin.ST cs.LG econ.EM stat.ML

    Forecasting Large Realized Covariance Matrices: The Benefits of Factor Models and Shrinkage

    Authors: Rafael Alves, Diego S. de Brito, Marcelo C. Medeiros, Ruy M. Ribeiro

    Abstract: We propose a model to forecast large realized covariance matrices of returns, applying it to the constituents of the S&P 500 daily. To address the curse of dimensionality, we decompose the return covariance matrix using standard firm-level factors (e.g., size, value, and profitability) and use sectoral restrictions in the residual covariance matrix. This restricted model is then estimated using v…

    Submitted 22 March, 2023; originally announced March 2023.

  45. arXiv:2303.13555  [pdf, other]

    cs.CE cs.LG

    Efficient hybrid modeling and sorption model discovery for non-linear advection-diffusion-sorption systems: A systematic scientific machine learning approach

    Authors: Vinicius V. Santana, Erbet Costa, Carine M. Rebello, Ana Mafalda Ribeiro, Chris Rackauckas, Idelfonso B. R. Nogueira

    Abstract: This study presents a systematic machine learning approach for creating efficient hybrid models and discovering sorption uptake models in non-linear advection-diffusion-sorption systems. It demonstrates an effective method to train these complex systems using gradient based optimizers, adjoint sensitivity analysis, and JIT-compiled vector Jacobian products, combined with spatial discretization and…

    Submitted 25 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: GitHub repo made available

  46. arXiv:2303.12712  [pdf, other]

    cs.CL cs.AI

    Sparks of Artificial General Intelligence: Early experiments with GPT-4

    Authors: Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang

    Abstract: Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an earl…

    Submitted 13 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  47. arXiv:2303.11202  [pdf, other]

    cond-mat.supr-con

    Effect of ion irradiation on superconducting thin films

    Authors: Katja Kohopää, Alberto Ronzani, Robab Najafi Jabdaraghi, Arijit Bera, Mário Ribeiro, Dibyendu Hazra, Jorden Senior, Mika Prunnila, Joonas Govenius, Janne S. Lehtinen, Antti Kemppinen

    Abstract: We demonstrate ion irradiation by argon or gallium as a wafer-scale post-processing method to increase disorder in superconducting thin films. We study several widely used superconductors, both single-elements and compounds. We show that ion irradiation increases normal-state resistivity in all our films, which is expected to enable tuning their superconducting properties, for example, toward high…

    Submitted 19 June, 2024; v1 submitted 20 March, 2023; originally announced March 2023.

  48. arXiv:2303.09014  [pdf, other]

    cs.CL

    ART: Automatic multi-step reasoning and tool-use for large language models

    Authors: Bhargavi Paranjape, Scott Lundberg, Sameer Singh, Hannaneh Hajishirzi, Luke Zettlemoyer, Marco Tulio Ribeiro

    Abstract: Large language models (LLMs) can perform complex reasoning in few- and zero-shot settings by generating intermediate chain of thought (CoT) reasoning steps. Further, each reasoning step can rely on external tools to support computation beyond the core LLM capabilities (e.g. search/running code). Prior work on CoT prompting and tool use typically requires hand-crafting task-specific demonstrations…

    Submitted 15 March, 2023; originally announced March 2023.

  49. arXiv:2303.05429  [pdf, other]

    cs.SE

    Supporting the Careers of Developers with Disabilities: Lessons from Zup Innovation

    Authors: Isadora Cardoso-Pereira, Geraldo Gomes, Danilo Monteiro Ribeiro, Alberto de Souza, Danilo Lucena, Gustavo Pinto

    Abstract: People with disabilities still face discrimination, which creates significant obstacles to accessing higher education, ultimately hindering their access to high-skilled occupations. In this study we present Catalisa, an eight-month training camp (developed by Zup Innovation) that hires and trains people with disabilities as software developers. We interviewed 12 Catalisa participants to better understand their…

    Submitted 26 May, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: 5 pages (two columns), 1 figure

  50. arXiv:2303.02655  [pdf, other]

    cs.AI cs.CV cs.LG cs.NE

    On Modifying a Neural Network's Perception

    Authors: Manuel de Sousa Ribeiro, João Leite

    Abstract: Artificial neural networks have proven to be extremely useful models that have allowed for multiple recent breakthroughs in the field of Artificial Intelligence and many others. However, they are typically regarded as black boxes, given how difficult it is for humans to interpret how these models reach their results. In this work, we propose a method which allows one to modify what an artificial n…

    Submitted 5 March, 2023; originally announced March 2023.