Showing 1–8 of 8 results for author: de Paula, R

  1. arXiv:2404.15238  [pdf, other]

    cs.CL cs.AI

    CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies

    Authors: Weiyan Shi, Ryan Li, Yutong Zhang, Caleb Ziems, Chunhua Yu, Raya Horesh, Rogério Abreu de Paula, Diyi Yang

    Abstract: To enhance language models' cultural awareness, we design a generalizable pipeline to construct cultural knowledge bases from different online communities on a massive scale. With the pipeline, we construct CultureBank, a knowledge base built upon users' self-narratives with 12K cultural descriptors sourced from TikTok and 11K from Reddit. Unlike previous cultural knowledge resources, CultureBank…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 32 pages, 7 figures, preprint

  2. arXiv:2403.06009  [pdf, other]

    cs.LG

    Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

    Authors: Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy, et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we presen…

    Submitted 13 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  3. arXiv:2401.17866  [pdf]

    cs.HC cs.AI

    Making Sense of Knowledge Intensive Processes: an Oil & Gas Industry Scenario

    Authors: Juliana Jansen Ferreira, Vinícius Segura, Ana Fucs, Rogério de Paula

    Abstract: Sensemaking is a constant and ongoing process by which people associate meaning to experiences. It can be an individual process, known as abduction, or a group process by which people give meaning to collective experiences. The sensemaking of a group is influenced by the abduction process of each person about the experience. Every collaborative process needs some level of sensemaking to show resul…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 9 pages. This paper was presented at the Sensemaking in a Senseless World workshop during the 2018 ACM CHI Conference on Human Factors in Computing Systems

  4. arXiv:2205.09508  [pdf, other]

    econ.GN cs.CY cs.LG

    Practical Skills Demand Forecasting via Representation Learning of Temporal Dynamics

    Authors: Maysa M. Garcia de Macedo, Wyatt Clarke, Eli Lucherini, Tyler Baldwin, Dilermando Queiroz Neto, Rogerio de Paula, Subhro Das

    Abstract: Rapid technological innovation threatens to leave much of the global workforce behind. Today's economy juxtaposes white-hot demand for skilled labor against stagnant employment prospects for workers unprepared to participate in a digital economy. It is a moment of peril and opportunity for every country, with outcomes measured in long-term capital allocation and the life satisfaction of billions o…

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: 15 pages, 5th AAAI/ACM Conference on AI, Ethics, and Society

  5. arXiv:2204.01489  [pdf, other]

    cs.CY cs.AI cs.SI

    Towards a New Science of Disinformation

    Authors: Claudio S. Pinhanez, German H. Flores, Marisa A. Vasconcelos, Mu Qiao, Nick Linck, Rogério de Paula, Yuya J. Ong

    Abstract: How can we best address the dangerous impact that deep learning-generated fake audios, photographs, and videos (a.k.a. deepfakes) may have in personal and societal life? We foresee that the availability of cheap deepfake technology will create a second wave of disinformation where people will receive specific, personalized disinformation through different channels, making the current approaches to…

    Submitted 17 March, 2022; originally announced April 2022.

  6. arXiv:2008.07363  [pdf, other]

    cs.LG

    Predicting Account Receivables with Machine Learning

    Authors: Ana Paula Appel, Gabriel Louzada Malfatti, Renato Luiz de Freitas Cunha, Bruno Lima, Rogerio de Paula

    Abstract: Being able to predict when invoices will be paid is valuable in multiple industries and supports decision-making processes in most financial workflows. However, due to the complexity of data related to invoices and the fact that the decision-making process is not registered in the accounts receivable system, performing this prediction becomes a challenge. In this paper, we present a prototype able…

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 9 pages, 6 figures, Workshop Machine Learning in Finance. arXiv admin note: substantial text overlap with arXiv:1912.10828

  7. arXiv:1912.10828  [pdf, other]

    cs.LG

    Optimize Cash Collection: Use Machine learning to Predicting Invoice Payment

    Authors: Ana Paula Appel, Victor Oliveira, Bruno Lima, Gabriel Louzada Malfatti, Vagner Figueredo de Santana, Rogerio de Paula

    Abstract: Predicting invoice payment is valuable in multiple industries and supports decision-making processes in most financial workflows. However, the challenge in this realm involves dealing with complex data and the lack of data related to decisions-making processes not registered in the accounts receivable system. This work presents a prototype developed as a solution devised during a partnership with…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: 10 pages, 5 figures, 5 tables

  8. arXiv:1808.06044  [pdf, ps, other]

    cs.CE

    Maximising Throughput in a Complex Coal Export System

    Authors: Mateus Rocha de Paula, Natashia Boland, Andreas Ernst, Alexandre Mendes, Martin Savelsbergh

    Abstract: The Port of Newcastle features three coal export terminals, operating primarily in cargo assembly mode, that share a rail network on their inbound side, and a channel on their outbound side. Maximising throughput at a single coal terminal, taking into account its layout, its equipment, and its operating policies, is already challenging, but maximising throughput of the Hunter Valley coal export sy…

    Submitted 22 August, 2018; v1 submitted 18 August, 2018; originally announced August 2018.