Skip to main content

Showing 1–22 of 22 results for author: Braga, P

.
  1. arXiv:2404.15410  [pdf, ps, other

    cs.RO cs.AI cs.LG

    Planning the path with Reinforcement Learning: Optimal Robot Motion Planning in RoboCup Small Size League Environments

    Authors: Mateus G. Machado, João G. Melo, Cleber Zanchettin, Pedro H. M. Braga, Pedro V. Cunha, Edna N. S. Barros, Hansenclever F. Bassani

    Abstract: This work investigates the potential of Reinforcement Learning (RL) to tackle robot motion planning challenges in the dynamic RoboCup Small Size League (SSL). Using a heuristic control approach, we evaluate RL's effectiveness in obstacle-free and single-obstacle path-planning environments. Ablation studies reveal significant performance improvements. Our method achieved a 60% time gain in obstacle… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 12 pages, 3 figures, 3 tables

  2. arXiv:2402.15849  [pdf, other

    cs.GT econ.TH math.DS

    On the Redistribution of Maximal Extractable Value: A Dynamic Mechanism

    Authors: Pedro Braga, Georgios Chionas, Stefanos Leonardos, Piotr Krysta, Georgios Piliouras, Carmine Ventre

    Abstract: Maximal Extractable Value (MEV) has emerged as a new frontier in the design of blockchain systems. The marriage between decentralization and finance gives the power to block producers (a.k.a., miners) not only to select and add transactions to the blockchain but, crucially, also to order them so as to extract as much financial gain as possible for themselves. Whilst this price may be unavoidable f… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: Extended abstract in the 23rd International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS 2024)

    MSC Class: 37; 65P20; 91; 93A14; 93A16 ACM Class: C.2.4; F.2; I.2.11; J.2; J.4

  3. arXiv:2212.08131  [pdf, other

    cs.LG

    Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies

    Authors: Shivakanth Sujit, Pedro H. M. Braga, Jorg Bornschein, Samira Ebrahimi Kahou

    Abstract: Reinforcement learning (RL) has shown great promise with algorithms learning in environments with large state and action spaces purely from scalar reward signals. A crucial challenge for current deep RL algorithms is that they require a tremendous amount of environment interactions for learning. This can be infeasible in situations where such interactions are expensive; such as in robotics. Offlin… ▽ More

    Submitted 21 November, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: TMLR 2023

  4. arXiv:2208.10483  [pdf, other

    cs.LG cs.AI

    Prioritizing Samples in Reinforcement Learning with Reducible Loss

    Authors: Shivakanth Sujit, Somjit Nath, Pedro H. M. Braga, Samira Ebrahimi Kahou

    Abstract: Most reinforcement learning algorithms take advantage of an experience replay buffer to repeatedly train on samples the agent has observed in the past. Not all samples carry the same amount of significance and simply assigning equal importance to each of the samples is a naïve strategy. In this paper, we propose a method to prioritize samples based on how much we can learn from a sample. We define… ▽ More

    Submitted 1 November, 2023; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: NeurIPS 2023

  5. arXiv:2202.09197  [pdf, other

    cond-mat.mtrl-sci

    Synergism between B and Nb improves fire resistance in microalloyed steels

    Authors: Pedro P. Ferreira, Felipe M. Carvalho, Edwan A. Ariza-Echeverri, Pedro M. Delfino, Luiz F. Bauri, Andrei M. Ferreira, Ana P. V. Braga, Luiz T. F. Eleno, Hélio Goldenstein, André P. Tschiptschin

    Abstract: The development of new fire-resistant steels represents a challenge in materials science and engineering of utmost importance. Alloying elements such as Nb and Mo are generally used to improve the strength at both room- and high-temperatures due to, for example, the formation of precipitates and harder microconstituents. In this study we show alternatively that the addition of small amounts of bor… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

    Comments: 7 pages, 4 figures

  6. arXiv:2106.12895  [pdf, other

    cs.LG cs.AI cs.RO

    rSoccer: A Framework for Studying Reinforcement Learning in Small and Very Small Size Robot Soccer

    Authors: Felipe B. Martins, Mateus G. Machado, Hansenclever F. Bassani, Pedro H. M. Braga, Edna S. Barros

    Abstract: Reinforcement learning is an active research area with a vast number of applications in robotics, and the RoboCup competition is an interesting environment for studying and evaluating reinforcement learning methods. A known difficulty in applying reinforcement learning to robotics is the high number of experience samples required, being the use of simulated environments for training the agents fol… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

  7. arXiv:2012.12342  [pdf, ps, other

    physics.chem-ph

    Inverse classical scattering using fractional derivative

    Authors: F. S. Carvalho, J. P. Braga, N. H. T. Lemes

    Abstract: The fractional calculus framework will be used to invert the potential energy function from the classical scattering angle, which will be related to Riemann-Liouville fractional integral. Numerical solution of this fractional order problem will be applied to the inverse Rutherford scattering and to the inverse scattering of Xe--Rn atoms, in which the potential is given by Lennard-Jones function. P… ▽ More

    Submitted 10 February, 2020; originally announced December 2020.

  8. arXiv:2011.11785  [pdf, other

    cs.RO cs.AI cs.LG

    An analysis of Reinforcement Learning applied to Coach task in IEEE Very Small Size Soccer

    Authors: Carlos H. C. Pena, Mateus G. Machado, Mariana S. Barros, José D. P. Silva, Lucas D. Maciel, Tsang Ing Ren, Edna N. S. Barros, Pedro H. M. Braga, Hansenclever F. Bassani

    Abstract: The IEEE Very Small Size Soccer (VSSS) is a robot soccer competition in which two teams of three small robots play against each other. Traditionally, a deterministic coach agent will choose the most suitable strategy and formation for each adversary's strategy. Therefore, the role of a coach is of great importance to the game. In this sense, this paper proposes an end-to-end approach for the coach… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

    Comments: 6 pages, 9 figures, to be published in Latin American Robotics Symposium

  9. arXiv:2008.12624  [pdf, other

    cs.RO cs.AI cs.LG

    A Framework for Studying Reinforcement Learning and Sim-to-Real in Robot Soccer

    Authors: Hansenclever F. Bassani, Renie A. Delgado, José Nilton de O. Lima Junior, Heitor R. Medeiros, Pedro H. M. Braga, Mateus G. Machado, Lucas H. C. Santos, Alain Tapp

    Abstract: This article introduces an open framework, called VSSS-RL, for studying Reinforcement Learning (RL) and sim-to-real in robot soccer, focusing on the IEEE Very Small Size Soccer (VSSS) league. We propose a simulated environment in which continuous or discrete control policies can be trained to control the complete behavior of soccer agents and a sim-to-real method based on domain adaptation to adap… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

  10. arXiv:2006.13682  [pdf, other

    cs.LG cs.NE stat.ML

    Deep Categorization with Semi-Supervised Self-Organizing Maps

    Authors: Pedro H. M. Braga, Heitor R. Medeiros, Hansenclever F. Bassani

    Abstract: Nowadays, with the advance of technology, there is an increasing amount of unstructured data being generated every day. However, it is a painful job to label and organize it. Labeling is an expensive, time-consuming, and difficult task. It is usually done manually, which collaborates with the incorporation of noise and errors to the data. Hence, it is of great importance to develo** intelligent… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted for publication at the 2020 International Joint Conference on Neural Networks (IJCNN)

  11. arXiv:2003.11102  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Learning to Play Soccer by Reinforcement and Applying Sim-to-Real to Compete in the Real World

    Authors: Hansenclever F. Bassani, Renie A. Delgado, Jose Nilton de O. Lima Junior, Heitor R. Medeiros, Pedro H. M. Braga, Alain Tapp

    Abstract: This work presents an application of Reinforcement Learning (RL) for the complete control of real soccer robots of the IEEE Very Small Size Soccer (VSSS), a traditional league in the Latin American Robotics Competition (LARC). In the VSSS league, two teams of three small robots play against each other. We propose a simulated environment in which continuous or discrete control policies can be train… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Journal ref: LatinX in AI Research Workshop at NeurIPS 2019

  12. arXiv:2001.11853  [pdf, ps, other

    cs.LG cs.NE

    The Gâteaux-Hopfield Neural Network method

    Authors: Felipe Silva Carvalho, João Pedro Braga

    Abstract: In the present work a new set of differential equations for the Hopfield Neural Network (HNN) method were established by means of the Linear Extended Gateaux Derivative (LEGD). This new approach will be referred to as Gâteaux-Hopfiel Neural Network (GHNN). A first order Fredholm integral problem was used to test this new method and it was found to converge 22 times faster to the exact solutions fo… ▽ More

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: 15 pages, 2 figures

  13. MOEA/D with Uniformly Randomly Adaptive Weights

    Authors: Lucas R. C. de Farias, Pedro H. M. Braga, Hansenclever F. Bassani, Aluizio F. R. Araújo

    Abstract: When working with decomposition-based algorithms, an appropriate set of weights might improve quality of the final solution. A set of uniformly distributed weights usually leads to well-distributed solutions on a Pareto front. However, there are two main difficulties with this approach. Firstly, it may fail depending on the problem geometry. Secondly, the population size becomes not flexible as th… ▽ More

    Submitted 14 August, 2019; originally announced August 2019.

    Journal ref: 2018 Genetic and Evolutionary Computation Conference (GECCO)

  14. A Semi-Supervised Self-Organizing Map with Adaptive Local Thresholds

    Authors: Pedro H. M. Braga, Hansenclever F. Bassani

    Abstract: In the recent years, there is a growing interest in semi-supervised learning, since, in many learning tasks, there is a plentiful supply of unlabeled data, but insufficient labeled ones. Hence, Semi-Supervised learning models can benefit from both types of data to improve the obtained performance. Also, it is important to develop methods that are easy to parameterize in a way that is robust to the… ▽ More

    Submitted 25 March, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

    Journal ref: 2019 International Joint Conference on Neural Networks (IJCNN)

  15. A Semi-Supervised Self-Organizing Map for Clustering and Classification

    Authors: Pedro H. M. Braga, Hansenclever F. Bassani

    Abstract: There has been an increasing interest in semi-supervised learning in the recent years because of the great number of datasets with a large number of unlabeled data but only a few labeled samples. Semi-supervised learning algorithms can work with both types of data, combining them to obtain better performance for both clustering and classification. Also, these datasets commonly have a high number o… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Journal ref: 2018 International Joint Conference on Neural Networks (IJCNN)

  16. arXiv:1905.00261  [pdf, other

    cs.CV cs.LG

    Automatic Dataset Augmentation Using Virtual Human Simulation

    Authors: Marcelo C. Ghilardi, Leandro Dihl, Estevão Testa, Pedro Braga, João P. Pianta, Isabel H. Manssour, Soraia R. Musse

    Abstract: Virtual Human Simulation has been widely used for different purposes, such as comfort or accessibility analysis. In this paper, we investigate the possibility of using this type of technique to extend the training datasets of pedestrians to be used with machine learning techniques. Our main goal is to verify if Computer Graphics (CG) images of virtual humans with a simplistic rendering can be effi… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

  17. arXiv:1812.01705  [pdf, ps, other

    hep-th cond-mat.supr-con

    Multivalued fields and monopole operators

    Authors: P. R. Braga, M. S. Guimaraes, M. M. A. Paganelly

    Abstract: In this work, we investigate the role of multivalued fields in the formulation of monopole operators and their connection with topological states of matter. In quantum field theory it is known that certain states describe collective modes of the fundamental fields and are created by operators that are often non-local, being defined over lines or higher-dimensional surfaces. For this reason, they m… ▽ More

    Submitted 1 July, 2020; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: 21 pages. Version accepted for publication in the Annals of Physics

    Journal ref: Annals of Physics 419, 168245 (2020)

  18. arXiv:1612.01195  [pdf, ps, other

    physics.chem-ph cond-mat.stat-mech

    Accurate potential energy curve for helium dimer retrieved from viscosity coefficient data at very low temperatures

    Authors: Éderson D'M. Costa, Nelson H. T. Lemes, João P. Braga

    Abstract: The long range potential of helium-helium interaction, which requires accurate 'ab initio' calculation, due to the small value of the potential depth, approximately 11 K (0.091 kJ/mol) at 2.96 angstrom, will be obtained in this study by an alternative technique. This work presents a robust and consistent procedure that provides the long range potential directly from experimental data. However, it… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

  19. arXiv:1604.02886  [pdf, ps, other

    hep-th cond-mat.supr-con

    Effective field theories for superconducting systems with multiple Fermi surfaces

    Authors: P. R. Braga, D. R. Granado, M. S. Guimaraes, C. Wotzasek

    Abstract: In this work we investigate the description of superconducting systems with multiple Fermi surfaces. For the case of one Fermi surface we re-obtain the result that the superconductor is more precisely described as a topological state of matter. Studying the case of more than one Fermi surface, we obtain the effective theory describing a time reversal symmetric topological superconductor. These res… ▽ More

    Submitted 21 August, 2016; v1 submitted 11 April, 2016; originally announced April 2016.

    Comments: 19 pages. Version accepted for publication in the Annals of Physics

  20. arXiv:1604.02700  [pdf, ps, other

    cs.DC

    GPIC - GPU Power Iteration Cluster

    Authors: Gustavo R. L Silva, Rafael R. Medeiros, Antonio P. Braga, Douglas A. G. Vieira

    Abstract: This work presents a new clustering algorithm, the GPIC, a Graphics Processing Unit (GPU) accelerated algorithm for Power Iteration Clustering (PIC). Our algorithm is based on the original PIC proposal, adapted to take advantage of the GPU architecture, maintining the algorith original properties. The proposed method was compared against the serial and parallel Spark implementation, achieving a co… ▽ More

    Submitted 10 April, 2016; originally announced April 2016.

  21. arXiv:1602.02078  [pdf, ps, other

    physics.chem-ph

    Accurate multireference study of Si3 electronic manifold

    Authors: Cayo Emilio Monteiro Goncalves, Breno Rodrigues Lamaghere Galvao, Joao Pedro Braga

    Abstract: Since it has been shown that the silicon trimer has a highly multi-reference character, accurate multi-reference configuration interaction calculations are performed to elucidate its electronic manifold. Emphasis is given to the long range part of the potential, aiming to understand the atom-diatom collisions dynamical aspects, to describe conical intersections and important saddle points along th… ▽ More

    Submitted 5 February, 2016; originally announced February 2016.

    Comments: 16 pages, 6 figures

  22. arXiv:1408.6869  [pdf, ps, other

    cond-mat.stat-mech

    A generalized Mittag-Leffer function to describe nonexponential chemical effects

    Authors: Nelson H. T. Lemes, José Paulo C. dos Santos, João P. Braga

    Abstract: In this paper a differential equation with noninteger order was used to model an anomalous luminescence decay process. Although this process is in principle an exponential decaying process, recent data indicates that is not the case for longer observation time. The theoretical fractional differential calculus applied in the present work was able to describe this process at short and long time, exp… ▽ More

    Submitted 28 August, 2014; originally announced August 2014.

    Comments: 12 pages, 2 figures