Skip to main content

Showing 1–11 of 11 results for author: Marcolino, L S

.
  1. arXiv:2312.06436  [pdf, other

    cs.LG cs.AI

    Reward Certification for Policy Smoothed Reinforcement Learning

    Authors: Ronghui Mu, Leandro Soriano Marcolino, Tianle Zhang, Yanghao Zhang, Xiaowei Huang, Wenjie Ruan

    Abstract: Reinforcement Learning (RL) has achieved remarkable success in safety-critical areas, but it can be weakened by adversarial attacks. Recent studies have introduced "smoothed policies" in order to enhance its robustness. Yet, it is still challenging to establish a provable guarantee to certify the bound of its total reward. Prior methods relied primarily on computing bounds using Lipschitz continui… ▽ More

    Submitted 12 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: This paper will be presented in AAAI2024

  2. arXiv:2212.11746  [pdf, other

    cs.LG cs.MA

    Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning

    Authors: Ronghui Mu, Wenjie Ruan, Leandro Soriano Marcolino, Gaojie **, Qiang Ni

    Abstract: Cooperative multi-agent reinforcement learning (c-MARL) is widely applied in safety-critical scenarios, thus the analysis of robustness for c-MARL models is profoundly important. However, robustness certification for c-MARLs has not yet been explored in the community. In this paper, we propose a novel certification method, which is the first work to leverage a scalable approach for c-MARLs to dete… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: This paper will appear in AAAI2023

  3. arXiv:2210.05626  [pdf, other

    cs.CV cs.GR cs.LG

    Semantic Segmentation under Adverse Conditions: A Weather and Nighttime-aware Synthetic Data-based Approach

    Authors: Abdulrahman Kerim, Felipe Chamone, Washington Ramos, Leandro Soriano Marcolino, Erickson R. Nascimento, Richard Jiang

    Abstract: Recent semantic segmentation models perform well under standard weather conditions and sufficient illumination but struggle with adverse weather conditions and nighttime. Collecting and annotating training data under these conditions is expensive, time-consuming, error-prone, and not always practical. Usually, synthetic data is used as a feasible data source to increase the amount of training data… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: This paper is accepted by BMVC 2022

  4. arXiv:2208.12763  [pdf, other

    cs.CV cs.GR

    Leveraging Synthetic Data to Learn Video Stabilization Under Adverse Conditions

    Authors: Abdulrahman Kerim, Washington L. S. Ramos, Leandro Soriano Marcolino, Erickson R. Nascimento, Richard Jiang

    Abstract: Video stabilization plays a central role to improve videos quality. However, despite the substantial progress made by these methods, they were, mainly, tested under standard weather and lighting conditions, and may perform poorly under adverse conditions. In this paper, we propose a synthetic-aware adverse weather robust algorithm for video stabilization that does not require real data and can be… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    ACM Class: I.4.0; I.4.1; I.6.0

  5. arXiv:2207.07539  [pdf, other

    cs.CV cs.LG

    3DVerifier: Efficient Robustness Verification for 3D Point Cloud Models

    Authors: Ronghui Mu, Wenjie Ruan, Leandro S. Marcolino, Qiang Ni

    Abstract: 3D point cloud models are widely applied in safety-critical scenes, which delivers an urgent need to obtain more solid proofs to verify the robustness of models. Existing verification method for point cloud model is time-expensive and computationally unattainable on large networks. Additionally, they cannot handle the complete PointNet model with joint alignment network (JANet) that contains multi… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  6. Text-Driven Video Acceleration: A Weakly-Supervised Reinforcement Learning Method

    Authors: Washington Ramos, Michel Silva, Edson Araujo, Victor Moura, Keller Oliveira, Leandro Soriano Marcolino, Erickson R. Nascimento

    Abstract: The growth of videos in our digital age and the users' limited time raise the demand for processing untrimmed videos to produce shorter versions conveying the same information. Despite the remarkable progress that summarization methods have made, most of them can only select a few frames or skims, creating visual gaps and breaking the video context. This paper presents a novel weakly-supervised me… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted to the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2022. arXiv admin note: text overlap with arXiv:2003.14229

  7. arXiv:2201.09337  [pdf, other

    cs.RO cs.MA

    Congestion control algorithms for robotic swarms with a common target based on the throughput of the target area

    Authors: Yuri Tavares dos Passos, Xavier Duquesne, Leandro Soriano Marcolino

    Abstract: When a large number of robots try to reach a common area, congestions happen, causing severe delays. To minimise congestion in a robotic swarm system, traffic control algorithms must be employed in a decentralised manner. Based on strategies aimed to maximise the throughput of the common target area, we developed two novel algorithms for robots using artificial potential fields for obstacle avoida… ▽ More

    Submitted 23 August, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

    Comments: Corrections were made to the TRVF algorithm and the text, and new references were added

  8. arXiv:2201.09335  [pdf, other

    cs.RO cs.MA

    On the throughput of the common target area for robotic swarm strategies -- extended version

    Authors: Yuri Tavares dos Passos, Xavier Duquesne, Leandro Soriano Marcolino

    Abstract: A robotic swarm may encounter traffic congestion when many robots simultaneously attempt to reach the same area. For solving that efficiently, robots must execute decentralised traffic control algorithms. In this work, we propose a measure for evaluating the access efficiency of a common target area as the number of robots in the swarm rises: the common target area throughput. We also employ here… ▽ More

    Submitted 30 March, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

    Comments: The proof for finite throughput was removed

  9. arXiv:2111.05468  [pdf, other

    cs.CV

    Sparse Adversarial Video Attacks with Spatial Transformations

    Authors: Ronghui Mu, Wenjie Ruan, Leandro Soriano Marcolino, Qiang Ni

    Abstract: In recent years, a significant amount of research efforts concentrated on adversarial attacks on images, while adversarial video attacks have seldom been explored. We propose an adversarial attack strategy on videos, called DeepSAVA. Our model includes both additive perturbation and spatial transformation by a unified optimisation framework, where the structural similarity index (SSIM) measure is… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: The short version of this work will appear in the BMVC 2021 conference

  10. The paradox of productivity during quarantine: an agent-based simulation

    Authors: Peter Hardy, Leandro Soriano Marcolino, José F. Fontanari

    Abstract: Economies across the globe were brought to their knees due to lockdowns and social restriction measures to contain the spread of the SARS-CoV-2, despite the quick switch to remote working. This downfall may be partially explained by the "water cooler effect", which holds that higher levels of social interaction lead to higher productivity due to a boost in people's mood. Somewhat paradoxically, ho… ▽ More

    Submitted 19 November, 2020; v1 submitted 21 August, 2020; originally announced August 2020.

    Journal ref: Eur. Phys. J. B (2021) 94: 40

  11. arXiv:2003.14229  [pdf, other

    cs.CV

    Straight to the Point: Fast-forwarding Videos via Reinforcement Learning Using Textual Data

    Authors: Washington Ramos, Michel Silva, Edson Araujo, Leandro Soriano Marcolino, Erickson Nascimento

    Abstract: The rapid increase in the amount of published visual data and the limited time of users bring the demand for processing untrimmed videos to produce shorter versions that convey the same information. Despite the remarkable progress that has been made by summarization methods, most of them can only select a few frames or skims, which creates visual gaps and breaks the video context. In this paper, w… ▽ More

    Submitted 31 March, 2020; originally announced March 2020.

    Comments: CVPR 2020