Skip to main content

Showing 1–22 of 22 results for author: Pinheiro, P O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.03961  [pdf, other

    cs.LG q-bio.BM

    Structure-based drug design by denoising voxel grids

    Authors: Pedro O. Pinheiro, Arian Jamasb, Omar Mahmood, Vishnu Sresht, Saeed Saremi

    Abstract: We present VoxBind, a new score-based generative model for 3D molecules conditioned on protein structures. Our approach represents molecules as 3D atomic density grids and leverages a 3D voxel-denoising network for learning and generation. We extend the neural empirical Bayes formalism (Saremi & Hyvarinen, 2019) to the conditional setting and generate structure-conditioned molecules with a two-ste… ▽ More

    Submitted 2 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  2. arXiv:2306.07473  [pdf, other

    cs.LG q-bio.QM

    3D molecule generation by denoising voxel grids

    Authors: Pedro O. Pinheiro, Joshua Rackers, Joseph Kleinhenz, Michael Maser, Omar Mahmood, Andrew Martin Watkins, Stephen Ra, Vishnu Sresht, Saeed Saremi

    Abstract: We propose a new score-based approach to generate 3D molecules represented as atomic densities on regular grids. First, we train a denoising neural network that learns to map from a smooth distribution of noisy molecules to the distribution of real molecules. Then, we follow the neural empirical Bayes framework (Saremi and Hyvarinen, 19) and generate molecules in two steps: (i) sample noisy densit… ▽ More

    Submitted 8 March, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  3. arXiv:2203.05446  [pdf, other

    cs.DS

    Algorithms for the Maximum Eulerian Cycle Decomposition Problem

    Authors: Pedro O. Pinheiro, Alexsandro Oliveira Alexandrino, Andre R. Oliveira, Cid C. de Souza, Zanoni Dias

    Abstract: Given an Eulerian graph G, in the Maximum Eulerian Cycle Decomposition problem, we are interested in finding a collection of edge-disjoint cycles {E_1, E_2, ..., E_k} in G such that all edges of G are in exactly one cycle and k is maximum. We present an algorithm to solve the pricing problem of a column generation Integer Linear Programming (ILP) model introduced by Lancia and Serafini (2016). Fur… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Journal ref: LIII S. Brasileiro de Pesquisa Operacional (SBPO 2021), Galoa, 2021. v. 53. p. 139228

  4. arXiv:2104.00442  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Touch-based Curiosity for Sparse-Reward Tasks

    Authors: Sai Rajeswar, Cyril Ibrahim, Nitin Surya, Florian Golemo, David Vazquez, Aaron Courville, Pedro O. Pinheiro

    Abstract: Robots in many real-world settings have access to force/torque sensors in their gripper and tactile sensing is often necessary in tasks that involve contact-rich motion. In this work, we leverage surprise from mismatches in touch feedback to guide exploration in hard sparse-reward reinforcement learning tasks. Our approach, Touch-based Curiosity (ToC), learns what visible objects interactions are… ▽ More

    Submitted 26 June, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

  5. arXiv:2011.05499  [pdf, other

    cs.CV

    Unsupervised Learning of Dense Visual Representations

    Authors: Pedro O. Pinheiro, Amjad Almahairi, Ryan Y. Benmalek, Florian Golemo, Aaron Courville

    Abstract: Contrastive self-supervised learning has emerged as a promising approach to unsupervised visual representation learning. In general, these methods learn global (image-level) representations that are invariant to different views (i.e., compositions of data augmentation) of the same image. However, many visual understanding tasks require dense (pixel-level) representations. In this paper, we propose… ▽ More

    Submitted 7 December, 2020; v1 submitted 10 November, 2020; originally announced November 2020.

  6. arXiv:2002.06583  [pdf, other

    cs.CV

    Reinforced active learning for image segmentation

    Authors: Arantxa Casanova, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher J. Pal

    Abstract: Learning-based approaches for semantic segmentation have two inherent challenges. First, acquiring pixel-wise labels is expensive and time-consuming. Second, realistic segmentation datasets are highly unbalanced: some categories are much more abundant than others, biasing the performance to the most represented ones. In this paper, we are interested in focusing human labelling effort on a small su… ▽ More

    Submitted 16 February, 2020; originally announced February 2020.

    Comments: Accepted to ICLR2020

  7. arXiv:1910.02344  [pdf, other

    cs.LG stat.ML

    Neural Multisensory Scene Inference

    Authors: Jae Hyun Lim, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher Pal, Sung** Ahn

    Abstract: For embodied agents to infer representations of the underlying 3D physical world they inhabit, they should efficiently combine multisensory cues from numerous trials, e.g., by looking at and touching objects. Despite its importance, multisensory 3D scene representation learning has received less attention compared to the unimodal setting. In this paper, we propose the Generative Multisensory Netwo… ▽ More

    Submitted 7 November, 2019; v1 submitted 5 October, 2019; originally announced October 2019.

  8. arXiv:1906.11892  [pdf, other

    cs.CV cs.LG stat.ML

    CLAREL: Classification via retrieval loss for zero-shot learning

    Authors: Boris N. Oreshkin, Negar Rostamzadeh, Pedro O. Pinheiro, Christopher Pal

    Abstract: We address the problem of learning fine-grained cross-modal representations. We propose an instance-based deep metric learning approach in joint visual and textual space. The key novelty of this paper is that it shows that using per-image semantic supervision leads to substantial improvement in zero-shot performance over using class-only supervision. On top of that, we provide a probabilistic just… ▽ More

    Submitted 5 April, 2020; v1 submitted 31 May, 2019; originally announced June 2019.

  9. arXiv:1906.06392  [pdf, other

    cs.CV

    Instance Segmentation with Point Supervision

    Authors: Issam H. Laradji, Negar Rostamzadeh, Pedro O. Pinheiro, David Vazquez, Mark Schmidt

    Abstract: Instance segmentation methods often require costly per-pixel labels. We propose a method that only requires point-level annotations. During training, the model only has access to a single pixel label per object, yet the task is to output full segmentation masks. To address this challenge, we construct a network with two branches: (1) a localization network (L-Net) that predicts the location of eac… ▽ More

    Submitted 14 June, 2019; originally announced June 2019.

  10. arXiv:1906.03975  [pdf, other

    eess.IV cs.LG

    Predicting Global Variations in Outdoor PM2.5 Concentrations using Satellite Images and Deep Convolutional Neural Networks

    Authors: Kris Y. Hong, Pedro O. Pinheiro, Scott Weichenthal

    Abstract: Here we present a new method of estimating global variations in outdoor PM$_{2.5}$ concentrations using satellite images combined with ground-level measurements and deep convolutional neural networks. Specifically, new deep learning models were trained over the global PM$_{2.5}$ concentration range ($<$1-436 $μ$g/m$^3$) using a large database of satellite images paired with ground level PM… ▽ More

    Submitted 1 June, 2019; originally announced June 2019.

    Comments: 8 pages, 6 figures, Submitted to Scientific Reports

  11. arXiv:1904.03438  [pdf, other

    cs.LG cs.AI stat.ML

    Reinforced Imitation in Heterogeneous Action Space

    Authors: Konrad Zolna, Negar Rostamzadeh, Yoshua Bengio, Sung** Ahn, Pedro O. Pinheiro

    Abstract: Imitation learning is an effective alternative approach to learn a policy when the reward function is sparse. In this paper, we consider a challenging setting where an agent and an expert use different actions from each other. We assume that the agent has access to a sparse reward function and state-only expert observations. We propose a method which gradually balances between the imitation learni… ▽ More

    Submitted 26 August, 2019; v1 submitted 6 April, 2019; originally announced April 2019.

    Comments: The extended version of the work "Reinforced Imitation Learning from Observations" presented on the NeurIPS workshop "Imitation Learning and its Challenges in Robotics"

  12. arXiv:1902.07104  [pdf, other

    cs.LG stat.ML

    Adaptive Cross-Modal Few-Shot Learning

    Authors: Chen Xing, Negar Rostamzadeh, Boris N. Oreshkin, Pedro O. Pinheiro

    Abstract: Metric-based meta-learning techniques have successfully been applied to few-shot classification problems. In this paper, we propose to leverage cross-modal information to enhance metric-based few-shot learning methods. Visual and semantic feature spaces have different structures by definition. For certain concepts, visual features might be richer and more discriminative than text ones. While for o… ▽ More

    Submitted 17 February, 2020; v1 submitted 19 February, 2019; originally announced February 2019.

  13. arXiv:1812.04599  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Adversarial Framing for Image and Video Classification

    Authors: Konrad Zolna, Michal Zajac, Negar Rostamzadeh, Pedro O. Pinheiro

    Abstract: Neural networks are prone to adversarial attacks. In general, such attacks deteriorate the quality of the input by either slightly modifying most of its pixels, or by occluding it with a patch. In this paper, we propose a method that keeps the image unchanged and only adds an adversarial framing on the border of the image. We show empirically that our method is able to successfully attack state-of… ▽ More

    Submitted 17 October, 2019; v1 submitted 11 December, 2018; originally announced December 2018.

    Comments: This is an extended version of the paper published at 33rd AAAI Conference on Artificial Intelligence (see https://doi.org/10.1609/aaai.v33i01.330110077 )

  14. arXiv:1812.01742  [pdf, other

    cs.CV

    Domain-Adaptive Single-View 3D Reconstruction

    Authors: Pedro O. Pinheiro, Negar Rostamzadeh, Sung** Ahn

    Abstract: Single-view 3D shape reconstruction is an important but challenging problem, mainly for two reasons. First, as shape annotation is very expensive to acquire, current methods rely on synthetic data, in which ground-truth 3D annotation is easy to obtain. However, this results in domain adaptation problem when applied to natural images. The second challenge is that there are multiple shapes that can… ▽ More

    Submitted 26 August, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

  15. arXiv:1807.09856  [pdf, other

    cs.CV

    Where are the Blobs: Counting by Localization with Point Supervision

    Authors: Issam H. Laradji, Negar Rostamzadeh, Pedro O. Pinheiro, David Vazquez, Mark Schmidt

    Abstract: Object counting is an important task in computer vision due to its growing demand in applications such as surveillance, traffic monitoring, and counting everyday objects. State-of-the-art methods use regression-based optimization where they explicitly learn to count the objects of interest. These often perform better than detection-based methods that need to learn the more difficult task of predic… ▽ More

    Submitted 25 July, 2018; originally announced July 2018.

  16. arXiv:1711.08995  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation with Similarity Learning

    Authors: Pedro O. Pinheiro

    Abstract: The objective of unsupervised domain adaptation is to leverage features from a labeled source domain and learn a classifier for an unlabeled target domain, with a similar but different data distribution. Most deep learning approaches to domain adaptation consist of two steps: (i) learn features that preserve a low risk on labeled samples (source domain) and (ii) make the features from both domains… ▽ More

    Submitted 17 April, 2018; v1 submitted 24 November, 2017; originally announced November 2017.

  17. arXiv:1604.02135  [pdf, other

    cs.CV

    A MultiPath Network for Object Detection

    Authors: Sergey Zagoruyko, Adam Lerer, Tsung-Yi Lin, Pedro O. Pinheiro, Sam Gross, Soumith Chintala, Piotr Dollár

    Abstract: The recent COCO object detection dataset presents several new challenges for object detection. In particular, it contains objects at a broad range of scales, less prototypical images, and requires more precise localization. To address these challenges, we test three modifications to the standard Fast R-CNN object detector: (1) skip connections that give the detector access to features at multiple… ▽ More

    Submitted 8 August, 2016; v1 submitted 7 April, 2016; originally announced April 2016.

  18. arXiv:1603.08695  [pdf, other

    cs.CV

    Learning to Refine Object Segments

    Authors: Pedro O. Pinheiro, Tsung-Yi Lin, Ronan Collobert, Piotr Dollàr

    Abstract: Object segmentation requires both object-level information and low-level pixel data. This presents a challenge for feedforward networks: lower layers in convolutional nets capture rich spatial information, while upper layers encode object-level knowledge but are invariant to factors such as pose and appearance. In this work we propose to augment feedforward nets for object segmentation with a nove… ▽ More

    Submitted 26 July, 2016; v1 submitted 29 March, 2016; originally announced March 2016.

    Comments: extended version of ECCV camera-ready (figures 6-9 only in arXiv)

  19. arXiv:1506.06204  [pdf, other

    cs.CV

    Learning to Segment Object Candidates

    Authors: Pedro O. Pinheiro, Ronan Collobert, Piotr Dollar

    Abstract: Recent object detection systems rely on two critical steps: (1) a set of object proposals is predicted as efficiently as possible, and (2) this set of candidate proposals is then passed to an object classifier. Such approaches have been shown they can be fast, while achieving the state of the art in detection performance. In this paper, we propose a new way to generate object proposals, introducin… ▽ More

    Submitted 1 September, 2015; v1 submitted 20 June, 2015; originally announced June 2015.

  20. arXiv:1502.03671  [pdf, other

    cs.CL

    Phrase-based Image Captioning

    Authors: Rémi Lebret, Pedro O. Pinheiro, Ronan Collobert

    Abstract: Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image. This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representation… ▽ More

    Submitted 9 April, 2015; v1 submitted 12 February, 2015; originally announced February 2015.

  21. arXiv:1412.8419  [pdf, other

    cs.CL cs.CV cs.NE

    Simple Image Description Generator via a Linear Phrase-Based Approach

    Authors: Remi Lebret, Pedro O. Pinheiro, Ronan Collobert

    Abstract: Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image. This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representation… ▽ More

    Submitted 10 April, 2015; v1 submitted 29 December, 2014; originally announced December 2014.

    Comments: Accepted as a workshop paper at ICLR 2015

  22. arXiv:1411.6228  [pdf, other

    cs.CV

    From Image-level to Pixel-level Labeling with Convolutional Networks

    Authors: Pedro O. Pinheiro, Ronan Collobert

    Abstract: We are interested in inferring object segmentation by leveraging only object class information, and by considering only minimal priors on the object segmentation task. This problem could be viewed as a kind of weakly supervised segmentation task, and naturally fits the Multiple Instance Learning (MIL) framework: every training image is known to have (or not) at least one pixel corresponding to the… ▽ More

    Submitted 24 April, 2015; v1 submitted 23 November, 2014; originally announced November 2014.

    Comments: CVPR2015