Skip to main content

Showing 1–5 of 5 results for author: Braz, R d S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.03814  [pdf, other

    cs.LG cs.AI

    Pearl: A Production-ready Reinforcement Learning Agent

    Authors: Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, Yonathan Efroni, Liyuan Wang, Ruiyang Xu, Hongbo Guo, Alex Nikulkov, Dmytro Korenkevych, Urun Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu

    Abstract: Reinforcement Learning (RL) offers a versatile framework for achieving long-term goals. Its generality allows us to formalize a wide range of problems that real-world intelligent systems encounter, such as dealing with delayed rewards, handling partial observability, addressing the exploration and exploitation dilemma, utilizing offline data to improve online performance, and ensuring safety const… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  2. arXiv:1709.01122  [pdf, ps, other

    cs.AI cs.SC

    Exact Inference for Relational Graphical Models with Interpreted Functions: Lifted Probabilistic Inference Modulo Theories

    Authors: Rodrigo de Salvo Braz, Ciaran O'Reilly

    Abstract: Probabilistic Inference Modulo Theories (PIMT) is a recent framework that expands exact inference on graphical models to use richer languages that include arithmetic, equalities, and inequalities on both integers and real numbers. In this paper, we expand PIMT to a lifted version that also processes random functions and relations. This enhancement is achieved by adapting Inversion, a method from L… ▽ More

    Submitted 4 September, 2017; originally announced September 2017.

    Comments: Appeared in the Uncertainty in Artificial Intelligence Conference, August 2017

  3. arXiv:1707.08704  [pdf, other

    cs.AI

    Anytime Exact Belief Propagation

    Authors: Gabriel Azevedo Ferreira, Quentin Bertrand, Charles Maussion, Rodrigo de Salvo Braz

    Abstract: Statistical Relational Models and, more recently, Probabilistic Programming, have been making strides towards an integration of logic and probabilistic reasoning. A natural expectation for this project is that a probabilistic logic reasoning algorithm reduces to a logic reasoning algorithm when provided a model that only involves 0-1 probabilities, exhibiting all the advantages of logic reasoning… ▽ More

    Submitted 27 July, 2017; originally announced July 2017.

    Comments: Submission to StaRAI-17 workshop at UAI-17 conference

  4. arXiv:1605.08367  [pdf, other

    cs.AI cs.LO

    Probabilistic Inference Modulo Theories

    Authors: Rodrigo de Salvo Braz, Ciaran O'Reilly, Vibhav Gogate, Rina Dechter

    Abstract: We present SGDPLL(T), an algorithm that solves (among many other problems) probabilistic inference modulo theories, that is, inference problems over probabilistic models defined via a logic theory provided as a parameter (currently, propositional, equalities on discrete sorts, and inequalities, more specifically difference arithmetic, on bounded integers). While many solutions to probabilistic inf… ▽ More

    Submitted 26 May, 2016; v1 submitted 26 May, 2016; originally announced May 2016.

    Comments: Submitted to StarAI-16 workshop as closely revised version of IJCAI-16 paper

  5. arXiv:1203.3464  [pdf

    cs.AI

    Gibbs Sampling in Open-Universe Stochastic Languages

    Authors: Nimar S. Arora, Rodrigo de Salvo Braz, Erik B. Sudderth, Stuart Russell

    Abstract: Languages for open-universe probabilistic models (OUPMs) can represent situations with an unknown number of objects and iden- tity uncertainty. While such cases arise in a wide range of important real-world appli- cations, existing general purpose inference methods for OUPMs are far less efficient than those available for more restricted lan- guages and model classes. This paper goes some way to r… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-30-39