Skip to main content

Showing 1–50 of 244 results for author: Machado, M

.
  1. arXiv:2406.12284  [pdf, other

    cs.LG cs.AI

    Demystifying the Recency Heuristic in Temporal-Difference Learning

    Authors: Brett Daley, Marlos C. Machado, Martha White

    Abstract: The recency heuristic in reinforcement learning is the assumption that stimuli that occurred closer in time to an acquired reward should be more heavily reinforced. The recency heuristic is one of the key assumptions made by TD($λ$), which reinforces recent experiences according to an exponentially decaying weighting. In fact, all other widely used return estimators for TD learning, such as $n$-st… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: RLC 2024. 18 pages, 8 figures, 1 table

  2. arXiv:2406.06811  [pdf, other

    cs.LG

    Learning Continually by Spectral Regularization

    Authors: Alex Lewandowski, Saurabh Kumar, Dale Schuurmans, András György, Marlos C. Machado

    Abstract: Loss of plasticity is a phenomenon where neural networks become more difficult to train during the course of learning. Continual learning algorithms seek to mitigate this effect by sustaining good predictive performance while maintaining network trainability. We develop new techniques for improving continual learning by first reconsidering how initialization can ensure trainability during early ph… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2405.01712  [pdf, other

    hep-ph hep-ex

    Multiplicity dependence of the $p_T$-spectra for identified particles and its relationship with partonic entropy

    Authors: L. S. Moriggi, G. S. Ramos, M. V. T. Machado

    Abstract: We investigate the multiplicity dependence of the transverse momentum $p_T$ spectra of hadrons produced in high-energy collisions. We propose that the partonic distribution be parameterized by its non-extensive entropy and the parton saturation scale $Q_s(x)$. These two variables can be identified from the produced charged hadron distributions and provide important information on the gluon dynamic… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures

  4. arXiv:2404.15410  [pdf, ps, other

    cs.RO cs.AI cs.LG

    Planning the path with Reinforcement Learning: Optimal Robot Motion Planning in RoboCup Small Size League Environments

    Authors: Mateus G. Machado, João G. Melo, Cleber Zanchettin, Pedro H. M. Braga, Pedro V. Cunha, Edna N. S. Barros, Hansenclever F. Bassani

    Abstract: This work investigates the potential of Reinforcement Learning (RL) to tackle robot motion planning challenges in the dynamic RoboCup Small Size League (SSL). Using a heuristic control approach, we evaluate RL's effectiveness in obstacle-free and single-obstacle path-planning environments. Ablation studies reveal significant performance improvements. Our method achieved a 60% time gain in obstacle… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 12 pages, 3 figures, 3 tables

  5. arXiv:2403.10304  [pdf, other

    cs.AI cs.DB

    KIF: A Framework for Virtual Integration of Heterogeneous Knowledge Bases using Wikidata

    Authors: Guilherme Lima, Marcelo Machado, Elton Soares, Sandro R. Fiorini, Raphael Thiago, Leonardo G. Azevedo, Viviane T. da Silva, Renato Cerqueira

    Abstract: We present a knowledge integration framework (called KIF) that uses Wikidata as a lingua franca to integrate heterogeneous knowledge bases. These can be triplestores, relational databases, CSV files, etc., which may or may not use the Wikidata dialect of RDF. KIF leverages Wikidata's data model and vocabulary plus user-defined map**s to expose a unified view of the integrated bases while kee**… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  6. arXiv:2402.12458  [pdf, other

    hep-ph

    Testing the double-logarithm asymptotic gluon density in ultraperipheral heavy ion collisions at the Large Hadron Collider

    Authors: D. A. Fagundes, M. V. T. Machado

    Abstract: In this work we analyze the application of the analytical gluon distribution based on the double asymptotic scaling for the photoproduction of vector mesons in coherent $pp$, $pA$ and $AA$ collisions at the LHC energies using the color dipole formalism. Predictions for the rapidity distribution are presented for $ρ^0$ and $J/ ψ$, $ψ(2S)$ and $Υ(1S)$ photoproduction. An analysis on the uncertaintie… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 12 pages, 12 figures, 2 tables

  7. arXiv:2402.06619  [pdf, other

    cs.CL cs.AI

    Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

    Authors: Shivalika Singh, Freddie Vargus, Daniel Dsouza, Börje F. Karlsson, Abinaya Mahendiran, Wei-Yin Ko, Herumb Shandilya, Jay Patel, Deividas Mataciunas, Laura OMahony, Mike Zhang, Ramith Hettiarachchi, Joseph Wilson, Marina Machado, Luisa Souza Moura, Dominik Krzemiński, Hakimeh Fadaei, Irem Ergün, Ifeoma Okoh, Aisha Alaagib, Oshan Mudannayake, Zaid Alyafeai, Vu Minh Chien, Sebastian Ruder, Surya Guthikonda , et al. (8 additional authors not shown)

    Abstract: Datasets are foundational to many breakthroughs in modern artificial intelligence. Many recent achievements in the space of natural language processing (NLP) can be attributed to the finetuning of pre-trained models on a diverse set of tasks that enables a large language model (LLM) to respond to instructions. Instruction fine-tuning (IFT) requires specifically constructed and annotated datasets.… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  8. arXiv:2402.03903  [pdf, other

    cs.LG

    Averaging $n$-step Returns Reduces Variance in Reinforcement Learning

    Authors: Brett Daley, Martha White, Marlos C. Machado

    Abstract: Multistep returns, such as $n$-step returns and $λ$-returns, are commonly used to improve the sample efficiency of reinforcement learning (RL) methods. The variance of the multistep returns becomes the limiting factor in their length; looking too far into the future increases variance and reverses the benefits of multistep learning. In our work, we demonstrate the ability of compound returns -- we… ▽ More

    Submitted 5 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML 2024. 27 pages, 7 figures, 3 tables

  9. arXiv:2312.01624  [pdf, other

    cs.LG cs.AI

    GVFs in the Real World: Making Predictions Online for Water Treatment

    Authors: Muhammad Kamran Janjua, Haseeb Shah, Martha White, Erfan Miahi, Marlos C. Machado, Adam White

    Abstract: In this paper we investigate the use of reinforcement-learning based prediction approaches for a real drinking-water treatment plant. Develo** such a prediction system is a critical step on the path to optimizing and automating water treatment. Before that, there are many questions to answer about the predictability of the data, suitable neural network architectures, how to overcome partial obse… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Published in Machine Learning (2023)

    Journal ref: Machine Learning (2023): 1-31

  10. arXiv:2312.01203  [pdf, other

    cs.LG cs.AI

    Harnessing Discrete Representations For Continual Reinforcement Learning

    Authors: Edan Meyer, Adam White, Marlos C. Machado

    Abstract: Reinforcement learning (RL) agents make decisions using nothing but observations from the environment, and consequently, heavily rely on the representations of those observations. Though some recent breakthroughs have used vector-based categorical representations of observations, often referred to as discrete representations, there is little work explicitly assessing the significance of such a cho… ▽ More

    Submitted 5 December, 2023; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: 23 pages, 16 figures, submitted to ICLR 2024

  11. arXiv:2312.00246  [pdf, other

    cs.LG

    Directions of Curvature as an Explanation for Loss of Plasticity

    Authors: Alex Lewandowski, Haruto Tanaka, Dale Schuurmans, Marlos C. Machado

    Abstract: Loss of plasticity is a phenomenon in which neural networks lose their ability to learn from new experience. Despite being empirically observed in several problem settings, little is understood about the mechanisms that lead to loss of plasticity. In this paper, we offer a consistent explanation for loss of plasticity: Neural networks lose directions of curvature during training and that loss of p… ▽ More

    Submitted 27 June, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

  12. arXiv:2310.15719  [pdf, other

    cs.LG cs.AI

    Recurrent Linear Transformers

    Authors: Subhojeet Pramanik, Esraa Elelimy, Marlos C. Machado, Adam White

    Abstract: The self-attention mechanism in the transformer architecture is capable of capturing long-range dependencies and it is the main reason behind its effectiveness in processing sequential data. Nevertheless, despite their success, transformers have two significant drawbacks that still limit their broader applicability: (1) In order to remember past information, the self-attention mechanism requires a… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: transformers, reinforcement learning, partial observability

  13. arXiv:2310.10833  [pdf, other

    cs.LG cs.AI

    Proper Laplacian Representation Learning

    Authors: Diego Gomez, Michael Bowling, Marlos C. Machado

    Abstract: The ability to learn good representations of states is essential for solving large reinforcement learning problems, where exploration, generalization, and transfer are particularly challenging. The Laplacian representation is a promising approach to address these problems by inducing informative state encoding and intrinsic rewards for temporally-extended action discovery and reward sha**. To ob… ▽ More

    Submitted 3 April, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  14. arXiv:2310.04252  [pdf, ps, other

    math.PR

    Scaling limit of an equilibrium surface under the Random Average Process

    Authors: Luiz Renato Fontes, Mariela Pentón Machado, Leonel Zuaznábar

    Abstract: We consider the equilibrium surface of the Random Average Process started from an inclined plane, as seen from the height of the origin, obtained in [Ferrari & Fontes, 1998], where its fluctuations were shown to be of order of the square root of the distance to the origin in one dimension, and the square root of the log of that distance in two dimensions (and constant in higher dimensions). Remark… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 23 pages

    MSC Class: 60K35; 82C41

  15. arXiv:2309.15888  [pdf

    q-bio.QM

    Explainable machine learning identifies multi-omics signatures of muscle response to spaceflight in mice

    Authors: Kevin Li, Riya Desai, Ryan T. Scott, Joel Ricky Steele, Meera Machado, Samuel Demharter, Adrienne Hoarfrost, Jessica L. Braun, Val A. Fajardo, Lauren M. Sanders, Sylvain V. Costes

    Abstract: The adverse effects of microgravity exposure on mammalian physiology during spaceflight necessitate a deep understanding of the underlying mechanisms to develop effective countermeasures. One such concern is muscle atrophy, which is partly attributed to the dysregulation of calcium levels due to abnormalities in SERCA pump functioning. To identify potential biomarkers for this condition, multi-omi… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  16. arXiv:2309.07686  [pdf, ps, other

    hep-ph hep-ex

    Double charmed meson production in $pp$ and $pA$ collisions at the LHC within the dipole approach in momentum representation

    Authors: G. Sampaio dos Santos, G. Gil da Silveira, M. V. T. Machado

    Abstract: A study of double charmed meson production in proton-proton and proton-nucleus collisions at the LHC energies is performed. Based on the color dipole formalism developed in the transverse momentum representation and the double parton scattering mechanism, predictions are made for the transverse momentum differential cross section for different pairs of $D$-mesons. The theoretical results consider… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 20 pages, 4 figures

  17. arXiv:2308.10753  [pdf, other

    math.AP math.OC

    The Total Variation-Wasserstein Problem

    Authors: Antonin Chambolle, Vincent Duval, Joao Miguel Machado

    Abstract: In this work we analyze the Total Variation-Wasserstein minimization problem. We propose an alternative form of deriving optimality conditions from the approach of Calier\&Poon'18, and as result obtain further regularity for the quantities involved. In the sequel we propose an algorithm to solve this problem alongside two numerical experiments.

    Submitted 21 August, 2023; originally announced August 2023.

  18. arXiv:2308.00181  [pdf, other

    hep-ph nucl-th

    Study of the azimuthal asymmetry in heavy ion collisions combining initial state momentum orientation and final state collective effects

    Authors: Lucas Soster Moriggi, Érison dos Santos Rocha, Magno Valério Trindade Machado

    Abstract: In the present work we investigate the source of azimuthal asymmetry for nuclear collision using a model that contemplates particles produced in the initial hard collisions and the collective effects described by a Blast-Wave like expansion. The latter is described by the relaxation time approximation of the Boltzmann transport equation. The parameters regarding collective flow and asymmetry are f… ▽ More

    Submitted 22 September, 2023; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: Version to be published in Physical Review D

  19. arXiv:2307.10180  [pdf

    cond-mat.mtrl-sci

    Liquidus temperature nonlinear modeling of silicates $SiO_2-R_2O-RO$

    Authors: Patrick dos Anjos, Lucas A. Quaresma, Marcelo L. P. Machado

    Abstract: The liquidus temperature is an important parameter in understanding the crystalline behavior of materials and in the operation of blast furnaces. Its modeling can be carried out by linear and nonlinear methods through data, considering the artificial neural network a modeling method with high efficiency because it presents the theorem of universal approximation and with that better performances an… ▽ More

    Submitted 21 July, 2023; v1 submitted 19 June, 2023; originally announced July 2023.

    Comments: 11 pages, 8 figures, 3 tables

  20. arXiv:2306.10572  [pdf, ps, other

    quant-ph cs.DS

    Quantum Algorithms for the Shortest Common Superstring and Text Assembling Problems

    Authors: Kamil Khadiev, Carlos Manuel Bosch Machado, Zeyu Chen, Junde Wu

    Abstract: In this paper, we consider two versions of the Text Assembling problem. We are given a sequence of strings $s^1,\dots,s^n$ of total length $L$ that is a dictionary, and a string $t$ of length $m$ that is texts. The first version of the problem is assembling $t$ from the dictionary. The second version is the ``Shortest Superstring Problem''(SSP) or the ``Shortest Common Superstring Problem''(SCS).… ▽ More

    Submitted 31 December, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2112.13319

    Journal ref: In: Qunatum Information & Computation, Vol.24, 2024,pp0267-0294

  21. arXiv:2305.14572  [pdf, ps, other

    hep-ph hep-ex

    The case for an EIC Theory Alliance: Theoretical Challenges of the EIC

    Authors: Raktim Abir, Igor Akushevich, Tolga Altinoluk, Daniele Paolo Anderle, Fatma P. Aslan, Alessandro Bacchetta, Baha Balantekin, Joao Barata, Marco Battaglieri, Carlos A. Bertulani, Guillaume Beuf, Chiara Bissolotti, Daniël Boer, M. Boglione, Radja Boughezal, Eric Braaten, Nora Brambilla, Vladimir Braun, Duane Byer, Francesco Giovanni Celiberto, Yang-Ting Chien, Ian C. Cloët, Martha Constantinou, Wim Cosyn, Aurore Courtoy , et al. (146 additional authors not shown)

    Abstract: We outline the physics opportunities provided by the Electron Ion Collider (EIC). These include the study of the parton structure of the nucleon and nuclei, the onset of gluon saturation, the production of jets and heavy flavor, hadron spectroscopy and tests of fundamental symmetries. We review the present status and future challenges in EIC theory that have to be addressed in order to realize thi… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 44 pages, ReVTeX, White Paper on EIC Theory Alliance

  22. arXiv:2305.13519  [pdf

    stat.AP cs.LG cs.NE

    Development of Non-Linear Equations for Predicting Electrical Conductivity in Silicates

    Authors: Patrick dos Anjos, Lucas A. Quaresma, Marcelo L. P. Machado

    Abstract: Electrical conductivity is of fundamental importance in electric arc furnaces (EAF) and the interaction of this phenomenon with the process slag results in energy losses and low optimization. As mathematical modeling helps in understanding the behavior of phenomena and it was used to predict the electrical conductivity of EAF slags through artificial neural networks. The best artificial neural net… ▽ More

    Submitted 28 May, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 8 pages, 6 figures, 1 table (AISTech 2023 - Presented and Accepted)

  23. arXiv:2304.14781  [pdf, other

    math.AP

    1D approximation of measures in Wasserstein spaces

    Authors: Antonin Chambolle, Vincent Duval, Joao Miguel Machado

    Abstract: We propose a variational approach to approximate measures with measures uniformly distributed over a 1 dimentional set. The problem consists in minimizing a Wasserstein distance as a data term with a regularization given by the length of the support. As it is challenging to prove existence of solutions to this problem, we propose a relaxed formulation, which always admits a solution. In the sequel… ▽ More

    Submitted 26 June, 2024; v1 submitted 28 April, 2023; originally announced April 2023.

  24. arXiv:2304.01117  [pdf, other

    cs.LG cs.AI

    Interpretable Symbolic Regression for Data Science: Analysis of the 2022 Competition

    Authors: F. O. de Franca, M. Virgolin, M. Kommenda, M. S. Majumder, M. Cranmer, G. Espada, L. Ingelse, A. Fonseca, M. Landajuela, B. Petersen, R. Glatt, N. Mundhenk, C. S. Lee, J. D. Hochhalter, D. L. Randall, P. Kamienny, H. Zhang, G. Dick, A. Simon, B. Burlacu, Jaan Kasak, Meera Machado, Casper Wilstrup, W. G. La Cava

    Abstract: Symbolic regression searches for analytic expressions that accurately describe studied phenomena. The main attraction of this approach is that it returns an interpretable model that can be insightful to users. Historically, the majority of algorithms for symbolic regression have been based on evolutionary algorithms. However, there has been a recent surge of new proposals that instead utilize appr… ▽ More

    Submitted 3 July, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 13 pages, 13 figures, submitted to IEEE Transactions on Evolutionary Computation

  25. arXiv:2303.07507  [pdf, other

    cs.LG cs.AI

    Loss of Plasticity in Continual Deep Reinforcement Learning

    Authors: Zaheer Abbas, Rosie Zhao, Joseph Modayil, Adam White, Marlos C. Machado

    Abstract: The ability to learn continually is essential in a complex and changing world. In this paper, we characterize the behavior of canonical value-based deep reinforcement learning (RL) approaches under varying degrees of non-stationarity. In particular, we demonstrate that deep RL agents lose their ability to learn good policies when they cycle through a sequence of Atari 2600 games. This phenomenon i… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

  26. arXiv:2302.07873  [pdf, other

    cs.CY

    Separating Technological and Clinical Safety Assurance for Medical Devices

    Authors: Spencer Deevy, Tiago de Moraes Machado, Amen Modhafar, Wesley O'Beirne, Richard Paige, Alan Wassyng

    Abstract: The safety and clinical effectiveness of medical devices are closely associated with their specific use in clinical treatments. Assuring safety and the desired clinical effectiveness is challenging. Different people may react differently to the same treatment due to variability in their physiology and genetics. Thus, we need to consider the outputs and behaviour of the device itself as well as the… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  27. arXiv:2301.11321  [pdf, other

    cs.LG

    Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning

    Authors: Brett Daley, Martha White, Christopher Amato, Marlos C. Machado

    Abstract: Off-policy learning from multistep returns is crucial for sample-efficient reinforcement learning, but counteracting off-policy bias without exacerbating variance is challenging. Classically, off-policy bias is corrected in a per-decision manner: past temporal-difference errors are re-weighted by the instantaneous Importance Sampling (IS) ratio after each action via eligibility traces. Many off-po… ▽ More

    Submitted 31 May, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: ICML 2023. 8 pages, 2 figures. arXiv admin note: text overlap with arXiv:2112.12281

  28. arXiv:2301.11181  [pdf, other

    cs.LG cs.AI

    Deep Laplacian-based Options for Temporally-Extended Exploration

    Authors: Martin Klissarov, Marlos C. Machado

    Abstract: Selecting exploratory actions that generate a rich stream of experience for better learning is a fundamental challenge in reinforcement learning (RL). An approach to tackle this problem consists in selecting actions according to specific policies for an extended period of time, also known as options. A recent line of work to derive such exploratory options builds upon the eigenfunctions of the gra… ▽ More

    Submitted 9 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  29. Light vector meson photoproduction in ultraperipheral heavy ion collisions at the LHC within the Reggeometric Pomeron approach

    Authors: László Jenkovszky, Érison S. Rocha, Magno V. T. Machado

    Abstract: By using the Reggeometric Pomeron model for vector meson production which successfully describes the high energy lepton-nucleon data, we analyse the light meson production in ultra-peripheral heavy ion collisions at the Large Hadron Collider (LHC). The rapidity distributions for $ρ$ and $φ$ photoproduction in lead-lead, xenon-xenon and oxygen-oxygen collisions are investigated.

    Submitted 12 January, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Proceedings IWARA 2022

  30. arXiv:2211.11533  [pdf

    cond-mat.mtrl-sci cs.PL math.NA

    Linear Modeling of the Glass Transition Temperature of the system $SiO_2-Na_2O-CaO$

    Authors: Patrick dos Anjos, Lucas A. Quaresma, Marcelo L. P. Machado

    Abstract: This work aimed to mathematically model the glass transition temperature (Tg), one of the most important parameters regarding the behavior of slag, responsible for the sudden change in thermomechanical properties of non-crystalline materials, by the chemical composition of the SiO2-Na2O-CaO system, widely applicable in the production of glasses and constituent of iron, magnesium and aluminum metal… ▽ More

    Submitted 21 July, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 5 pages, 3 figures, 2 tables (CONTECC 2022 - Accepted and presented)

  31. arXiv:2211.07805  [pdf, other

    cs.LG cs.AI

    Agent-State Construction with Auxiliary Inputs

    Authors: Ruo Yu Tao, Adam White, Marlos C. Machado

    Abstract: In many, if not every realistic sequential decision-making task, the decision-making agent is not able to model the full complexity of the world. The environment is often much larger and more complex than the agent, a setting also known as partial observability. In such settings, the agent must leverage more than just the current sensory inputs; it must construct an agent state that summarizes pre… ▽ More

    Submitted 5 May, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: Published in Transactions on Machine Learning Research. 13 pages + 2 references + 15 appendix, 12 figures

  32. arXiv:2211.07587  [pdf

    cond-mat.soft cs.LG math.NA stat.AP

    Artificial neural networks for predicting the viscosity of lead-containing glasses

    Authors: Patrick dos Anjos, Lucas A. Quaresma, Marcelo L. P. Machado

    Abstract: The viscosity of lead-containing glasses is of fundamental importance for the manufacturing process, and can be predicted by algorithms such as artificial neural networks. The SciGlass database was used to provide training, validation and test data of chemical composition, temperature and viscosity for the construction of artificial neural networks with node variation in the hidden layer. The best… ▽ More

    Submitted 20 November, 2022; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: 6 pages, 5 figures, 2 tables

  33. Investigating exclusive $ρ^0$ photoproduction within the Regge phenomenology approach

    Authors: László Jenkovszky, Érison dos Santos Rocha, Magno V. T. Machado

    Abstract: The elastic differential and integrated total cross section for the exclusive $ρ^0$ photoproduction in electron-proton ($ep$) collisions are evaluated taking into account nonperturbative Pomeron exchange approach. By using three different models based on Regge phenomenology the results are compared to recent measurements by H1 Collaboration in $ep$ collisions and by the CMS collaboration from ultr… ▽ More

    Submitted 12 December, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

  34. arXiv:2209.14374  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Functional thin films as cathode/electrolyte interlayers: a strategy to enhance the performance and durability of solid oxide fuel cells

    Authors: Marina Machado, Federico Baiutti, Lucile Bernadet, Alex Morata, Marc Nuñez, Jan Pieter Ouweltjes, Fabio Coral Fonseca, Marc Torrell, Albert Tarancónb

    Abstract: Electrochemical devices such as solid oxide fuel cells (SOFC) may greatly benefit from the implementation of nanoengineered thin-film multifunctional layers providing, alongside enhanced electrochemical activity, improved mechanical, and long-term stability. In this study, an ultrathin (400 nm) bilayer of samarium-doped ceria and a self-assembled nanocomposite made of Sm0.2Ce0.8O1.9-La0.8Sr0.2MnO3… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 24 pages, 12 figures

    Journal ref: Journal of Materials Chemistry A, 2022, 10, 17317-17325

  35. arXiv:2207.07794  [pdf, other

    hep-ph hep-ex nucl-ex

    Nuclear Modification Factor in Small System Collisions within Perturbative QCD Including Thermal Effects

    Authors: L. S. Moriggi, M. V. T. Machado

    Abstract: In this paper, dedicated to the memory of the late Prof. Jean Cleymans, the nuclear modification factors, $R_{xA}$, are investigated for pion production in small system collisions, measured by PHENIX experiment at RHIC (Relativistic Heavy Ion Collider). The theoretical framework is the transverse momentum $k_T$-factorization formalism for hard processes at small momentum fraction, $x$. Evidence fo… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: 12 pages, 4 figures. Contribution to MDPI Physics Special Issue "Jean Cleymans: A Life for Physics", dedicated to the memory of Professor Jean Cleymans

  36. Asymptotic gluon density within the color dipole picture in the light of HERA high-precision data

    Authors: D. A. Fagundes, M. V. T. Machado

    Abstract: We present an analysis of the most precise set of HERA data within the color dipole formalism, by using an analytical gluon density, based on the double-logarithm approximation of the DGLAP equations in the asymptotic limit of the scaling variable, $σ=\log{(1/x)}\log{(\log{(Q^2/Q_ 0^2)})}\rightarrow \infty$. Fits to data, including charm and bottom quarks are performed and demonstrate the efficien… ▽ More

    Submitted 5 January, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 17 pages, 9 figures, 6 tables. Matches published version

    Journal ref: Phys.Rev. D 107, 014004 (2023)

  37. $D$-meson production in high energy $pA$ collisions within the QCD color dipole transverse momentum representation

    Authors: G. Sampaio dos Santos, G. Gil da Silveira, M. V. T. Machado

    Abstract: The $D$-meson production is investigated by considering the unintegrated gluon distribution within the dipole approach in the momentum representation. We analyze the $D$-meson spectrum accounting for the effects of nonlinear behavior of the QCD dynamics which can be accordingly addressed in the dipole framework. The unintegrated gluon distribution is obtained by using geometric scaling property an… ▽ More

    Submitted 13 October, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: 23 pages, 9 figures, 1 table

    Journal ref: Eur.Phys.J. C 82 (2022) 795

  38. arXiv:2204.10350  [pdf, other

    hep-ph hep-ex nucl-ex

    Exclusive $Z^0$ production in $ep$ and $eA$ collisions at high energies

    Authors: G. M. Peccini, L. S. Moriggi, M. V. T. Machado

    Abstract: In this work the $k_{\perp}$-factorization formalism is applied to compute the exclusive $Z^0$ boson photoproduction in $ep$ and $eA$ collisions. The study is also extended to $pp$ and $AA$ processes. The nuclear effects are investigated considering heavy and light ions. Analytical models for the unintegrated gluon distribution are taken into account and the corresponding theoretical uncertainty i… ▽ More

    Submitted 26 June, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: 8 pages, 5 figures

  39. arXiv:2204.04700  [pdf, other

    astro-ph.SR astro-ph.EP

    Activity and Rotation of Nearby Field M Dwarfs in the TESS Southern Continuous Viewing Zone

    Authors: Francys Anthony, Alejandro Núñez, Marcel A. Agüeros, Jason L. Curtis, J. -D. do Nascimento, Jr., João M. Machado, Andrew W. Mann, Elisabeth R. Newton, Rayna Rampalli, Pa Chia Thao, Mackenna L. Wood

    Abstract: The evolution of magnetism in late-type dwarfs remains murky, as we can only weakly predict levels of activity for M dwarfs of a given mass and age. We report results from our spectroscopic survey of M dwarfs in the Southern Continuous Viewing Zone (CVZ) of the Transiting Exoplanet Survey Satellite (TESS). As the TESS CVZs overlap with those of the James Webb Space Telescope, our targets constitut… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: Accepted for publication in AJ, 17 pages, 10 figures, 2 tables

  40. arXiv:2203.15955  [pdf, other

    cs.LG

    Investigating the Properties of Neural Network Representations in Reinforcement Learning

    Authors: Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White

    Abstract: In this paper we investigate the properties of representations learned by deep reinforcement learning systems. Much of the early work on representations for reinforcement learning focused on designing fixed-basis architectures to achieve properties thought to be desirable, such as orthogonality and sparsity. In contrast, the idea behind deep reinforcement learning methods is that the agent designe… ▽ More

    Submitted 5 May, 2023; v1 submitted 29 March, 2022; originally announced March 2022.

  41. arXiv:2203.11369  [pdf, other

    cs.LG

    Temporal Abstractions-Augmented Temporally Contrastive Learning: An Alternative to the Laplacian in RL

    Authors: Akram Erraqabi, Marlos C. Machado, Mingde Zhao, Sainbayar Sukhbaatar, Alessandro Lazaric, Ludovic Denoyer, Yoshua Bengio

    Abstract: In reinforcement learning, the graph Laplacian has proved to be a valuable tool in the task-agnostic setting, with applications ranging from skill discovery to reward sha**. Recently, learning the Laplacian representation has been framed as the optimization of a temporally-contrastive objective to overcome its computational limitations in large (or continuous) state spaces. However, this approac… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  42. arXiv:2203.10986  [pdf, ps, other

    hep-ph hep-ex hep-th

    Investigating the QCD dynamical entropy in high-energy hadronic collisions

    Authors: G. S. Ramos, M. V. T. Machado

    Abstract: The dynamical entropy of dense gluonic states in proton-proton collisions at high energies is studied by using phenomenological models for the unintegrated gluon distribution. The corresponding transverse momentum probability distributions are evaluated in terms of rapidity. The dynamical entropy density is obtained in the rapidity range relevant for the collisions at the Large Hadron Collider. Th… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 08 pages, 4 figures

  43. Reward-Respecting Subtasks for Model-Based Reinforcement Learning

    Authors: Richard S. Sutton, Marlos C. Machado, G. Zacharias Holland, David Szepesvari, Finbarr Timbers, Brian Tanner, Adam White

    Abstract: To achieve the ambitious goals of artificial intelligence, reinforcement learning must include planning with a model of the world that is abstract in state and time. Deep learning has made progress with state abstraction, but temporal abstraction has rarely been used, despite extensively developed theory based on the options framework. One reason for this is that the space of possible options is i… ▽ More

    Submitted 16 September, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

    Journal ref: Artificial Intelligence, first published online September 6, 2023

  44. Regge phenomenology and coherent photoproduction of charmonium in peripheral heavy ion collisions

    Authors: Laszlo Jenkovszky, Vladyslav Libov, Magno V. T. Machado

    Abstract: By using models based on Regge phenomenology we analyse the coherent photoproduction of charmonium in peripheral heavy-ion collisions at the Large Hadron Collider (LHC). The centrality dependence is investigated and compared to the experimental results for coherent $J/ψ$ production in lead-lead LHC runs at the energies of 2.76 and 5.02 TeV. Theoretical uncertainties and possible limitations of the… ▽ More

    Submitted 11 March, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 12 pages, 7 figures

    Journal ref: Physics Letters B, Volume 827, 10 April 2022, 137004

  45. arXiv:2201.13432  [pdf, other

    hep-ph hep-ex nucl-th

    Nuclear transverse momentum imbalance in the color dipole approach at the LHC regime

    Authors: F. G. Ben, A. V. Giannini, M. V. T. Machado

    Abstract: Transverse momentum broadening of a parton propagating through a large nucleus is evaluated in the color dipole approach using different models for the dipole cross section or unintegrated gluon distribution, which lead to different values of the coefficient $C_{\mathcal{F}}(0,s)$. Numerical calculations are compared to data extracted from LHCb and ALICE experiments for nuclear broadening of… ▽ More

    Submitted 20 March, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: 6 pages, 2 figure, 2 tables

  46. arXiv:2112.13319  [pdf, ps, other

    quant-ph cs.CL cs.DS

    Quantum Algorithm for the Shortest Superstring Problem

    Authors: Kamil Khadiev, Carlos Manuel Bosch Machado

    Abstract: In this paper, we consider the ``Shortest Superstring Problem''(SSP) or the ``Shortest Common Superstring Problem''(SCS). The problem is as follows. For a positive integer $n$, a sequence of n strings $S=(s^1,\dots,s^n)$ is given. We should construct the shortest string $t$ (we call it superstring) that contains each string from the given sequence as a substring. The problem is connected with the… ▽ More

    Submitted 26 December, 2021; originally announced December 2021.

    Comments: 11 pages

  47. The reggeometric pomeron and exclusive production of $J/ψ$ and $ψ(2S)$ in ultraperipheral collisions at the LHC

    Authors: Laszlo Jenkovszky, Vladyslav Libov, Magno V. T. Machado

    Abstract: By using a Regge-pole model for vector meson production (VMP), successfully describing the HERA data, we analyse the correlation between VMP cross sections in photon-induced reactions at HERA and those in ultra-peripheral collisions at the Large Hadron Collider (LHC). The rapidity distributions of proton-proton collisions at 13~TeV and lead-lead collisions at 2.76 and 5.02 TeV are investigated. Th… ▽ More

    Submitted 22 January, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: 14 pages, 9 figures. arXiv admin note: text overlap with arXiv:1408.0530

    Journal ref: Physics Letters B 824 (2022) 136836

  48. arXiv:2110.05740  [pdf, other

    cs.LG cs.AI

    Temporal Abstraction in Reinforcement Learning with the Successor Representation

    Authors: Marlos C. Machado, Andre Barreto, Doina Precup, Michael Bowling

    Abstract: Reasoning at multiple levels of temporal abstraction is one of the key attributes of intelligence. In reinforcement learning, this is often modeled through temporally extended courses of actions called options. Options allow agents to make predictions and to operate at different levels of abstraction within an environment. Nevertheless, approaches based on the options framework often start with th… ▽ More

    Submitted 11 April, 2023; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: This is the final, published JMLR version

    Journal ref: Journal of Machine Learning Research (JMLR), 24(80):1-69, 2023

  49. arXiv:2109.11052  [pdf, other

    cs.LG

    On Bonus-Based Exploration Methods in the Arcade Learning Environment

    Authors: Adrien Ali Taïga, William Fedus, Marlos C. Machado, Aaron Courville, Marc G. Bellemare

    Abstract: Research on exploration in reinforcement learning, as applied to Atari 2600 game-playing, has emphasized tackling difficult exploration problems such as Montezuma's Revenge (Bellemare et al., 2016). Recently, bonus-based exploration methods, which explore by augmenting the environment reward, have reached above-human average performance on such domains. In this paper we reassess popular bonus-base… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Full version of arXiv:1908.02388

    Journal ref: Published as a conference paper at ICLR 2020

  50. arXiv:2108.05828  [pdf, other

    cs.LG cs.AI stat.ML

    A general class of surrogate functions for stable and efficient reinforcement learning

    Authors: Sharan Vaswani, Olivier Bachem, Simone Totaro, Robert Mueller, Shivam Garg, Matthieu Geist, Marlos C. Machado, Pablo Samuel Castro, Nicolas Le Roux

    Abstract: Common policy gradient methods rely on the maximization of a sequence of surrogate functions. In recent years, many such surrogate functions have been proposed, most without strong theoretical guarantees, leading to algorithms such as TRPO, PPO or MPO. Rather than design yet another surrogate function, we instead propose a general framework (FMA-PG) based on functional mirror ascent that gives ris… ▽ More

    Submitted 30 October, 2023; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: Fixed minor typos