Skip to main content

Showing 1–21 of 21 results for author: Phielipp, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.05495  [pdf, other

    cs.OH

    PARSAC: Fast, Human-quality Floorplanning for Modern SoCs with Complex Design Constraints

    Authors: Hesham Mostafa, Uday Mallappa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

    Abstract: The floorplanning of Systems-on-a-Chip (SoCs) and of chip sub-systems is a crucial step in the physical design flow as it determines the optimal shapes and locations of the blocks that make up the system. Simulated Annealing (SA) has been the method of choice for tackling classical floorplanning problems where the objective is to minimize wire-length and the total placement area. The goal in indus… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 9 pages, 7 figures

  2. arXiv:2405.05480  [pdf, other

    cs.AR cs.AI cs.LG

    FloorSet -- a VLSI Floorplanning Dataset with Design Constraints of Real-World SoCs

    Authors: Uday Mallappa, Hesham Mostafa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

    Abstract: Floorplanning for systems-on-a-chip (SoCs) and its sub-systems is a crucial and non-trivial step of the physical design flow. It represents a difficult combinatorial optimization problem. A typical large scale SoC with 120 partitions generates a search-space of nearly 10E250. As novel machine learning (ML) approaches emerge to tackle such problems, there is a growing need for a modern benchmark th… ▽ More

    Submitted 27 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 10 pages, 11 figures

  3. arXiv:2401.03306  [pdf, other

    cs.LG cs.AI cs.RO

    MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning

    Authors: Rafael Rafailov, Kyle Hatch, Victor Kolev, John D. Martin, Mariano Phielipp, Chelsea Finn

    Abstract: We study the problem of offline pre-training and online fine-tuning for reinforcement learning from high-dimensional observations in the context of realistic robot tasks. Recent offline model-free approaches successfully use online fine-tuning to either improve the performance of the agent over the data collection policy or adapt to novel tasks. At the same time, model-based RL algorithms have ach… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: This is an updated version of a manuscript that originally appeared at CoRL 2023. The project website is here https://sites.google.com/view/mo2o

    Journal ref: Proceedings of The 7th Conference on Robot Learning, PMLR 229:3654-3671, 2023

  4. arXiv:2310.02902  [pdf, other

    cs.LG cond-mat.mtrl-sci cs.AI

    Searching for High-Value Molecules Using Reinforcement Learning and Transformers

    Authors: Raj Ghugare, Santiago Miret, Adriana Hugessen, Mariano Phielipp, Glen Berseth

    Abstract: Reinforcement learning (RL) over text representations can be effective for finding high-value policies that can search over graphs. However, RL requires careful structuring of the search space and algorithm design to be effective in this challenge. Through extensive experiments, we explore how different design choices for text grammar and algorithmic choices for training can affect an RL policy's… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  5. arXiv:2302.14242  [pdf, other

    cs.RO

    Learning Sparse Control Tasks from Pixels by Latent Nearest-Neighbor-Guided Explorations

    Authors: Ruihan Zhao, Ufuk Topcu, Sandeep Chinchali, Mariano Phielipp

    Abstract: Recent progress in deep reinforcement learning (RL) and computer vision enables artificial agents to solve complex tasks, including locomotion, manipulation and video games from high-dimensional pixel observations. However, domain specific reward functions are often engineered to provide sufficient learning signals, requiring expert knowledge. While it is possible to train vision-based RL agents u… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  6. arXiv:2212.04573  [pdf, other

    cs.RO

    Modularity through Attention: Efficient Training and Transfer of Language-Conditioned Policies for Robot Manipulation

    Authors: Yifan Zhou, Shubham Sonawani, Mariano Phielipp, Simon Stepputtis, Heni Ben Amor

    Abstract: Language-conditioned policies allow robots to interpret and execute human instructions. Learning such policies requires a substantial investment with regards to time and compute resources. Still, the resulting controllers are highly device-specific and cannot easily be transferred to a robot with different morphology, capability, appearance or dynamics. In this paper, we propose a sample-efficient… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 2022 Conference on Robot Learning (CoRL)

  7. arXiv:2211.13322  [pdf, other

    cs.LG cs.NE physics.chem-ph

    Group SELFIES: A Robust Fragment-Based Molecular String Representation

    Authors: Austin Cheng, Andy Cai, Santiago Miret, Gustavo Malkomes, Mariano Phielipp, Alán Aspuru-Guzik

    Abstract: We introduce Group SELFIES, a molecular string representation that leverages group tokens to represent functional groups or entire substructures while maintaining chemical robustness guarantees. Molecular string representations, such as SMILES and SELFIES, serve as the basis for molecular generation and optimization in chemical language models, deep generative models, and evolutionary methods. Whi… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 11 pages + references and appendix

    Journal ref: Digital Discovery (2023)

  8. arXiv:2206.12279  [pdf, other

    cs.LG cs.AI cs.RO

    AnyMorph: Learning Transferable Polices By Inferring Agent Morphology

    Authors: Brandon Trabucco, Mariano Phielipp, Glen Berseth

    Abstract: The prototypical approach to reinforcement learning involves training policies tailored to a particular agent from scratch for every new morphology. Recent work aims to eliminate the re-training of policies by investigating whether a morphology-agnostic policy, trained on a diverse set of agents with similar task objectives, can be transferred to new agents with unseen morphologies without re-trai… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: published at ICML 2022

  9. arXiv:2205.10739  [pdf, other

    cs.LG cs.AI

    Offline Policy Comparison with Confidence: Benchmarks and Baselines

    Authors: Anurag Koul, Mariano Phielipp, Alan Fern

    Abstract: Decision makers often wish to use offline historical data to compare sequential-action policies at various world states. Importantly, computational tools should produce confidence values for such offline policy comparison (OPC) to account for statistical variance and limited data coverage. Nevertheless, there is little work that directly evaluates the quality of confidence values for OPC. In this… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

  10. arXiv:2203.15913  [pdf, other

    cs.LG cs.AI

    Pretraining Graph Neural Networks for few-shot Analog Circuit Modeling and Design

    Authors: Kourosh Hakhamaneshi, Marcel Nassar, Mariano Phielipp, Pieter Abbeel, Vladimir Stojanović

    Abstract: Being able to predict the performance of circuits without running expensive simulations is a desired capability that can catalyze automated design. In this paper, we present a supervised pretraining approach to learn circuit representations that can be adapted to new circuit topologies or unseen prediction tasks. We hypothesize that if we train a neural network (NN) that can predict the output DC… ▽ More

    Submitted 1 April, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

  11. arXiv:2201.13357  [pdf, other

    cs.LG

    DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning

    Authors: Hassam Sheikh, Kizza Frisbee, Mariano Phielipp

    Abstract: Application of ensemble of neural networks is becoming an imminent tool for advancing the state-of-the-art in deep reinforcement learning algorithms. However, training these large numbers of neural networks in the ensemble has an exceedingly high computation cost which may become a hindrance in training large-scale systems. In this paper, we propose DNS: a Determinantal Point Process based Neural… ▽ More

    Submitted 17 May, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: Accepted for Publication at ICML 2022

  12. arXiv:2106.08482  [pdf, other

    cs.AI

    Minimizing Communication while Maximizing Performance in Multi-Agent Reinforcement Learning

    Authors: Varun Kumar Vijay, Hassam Sheikh, Somdeb Majumdar, Mariano Phielipp

    Abstract: Inter-agent communication can significantly increase performance in multi-agent tasks that require co-ordination to achieve a shared goal. Prior work has shown that it is possible to learn inter-agent communication protocols using multi-agent reinforcement learning and message-passing network architectures. However, these models use an unconstrained broadcast communication model, in which an agent… ▽ More

    Submitted 8 December, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

  13. arXiv:2106.07611  [pdf

    cs.NE cs.AI

    Neuroevolution-Enhanced Multi-Objective Optimization for Mixed-Precision Quantization

    Authors: Santiago Miret, Vui Seng Chua, Mattias Marder, Mariano Phielipp, Nilesh Jain, Somdeb Majumdar

    Abstract: Mixed-precision quantization is a powerful tool to enable memory and compute savings of neural network workloads by deploying different sets of bit-width precisions on separate compute operations. In this work, we present a flexible and scalable framework for automated mixed-precision quantization that concurrently optimizes task performance, memory compression, and compute savings through multi-o… ▽ More

    Submitted 1 April, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

  14. arXiv:2011.01089  [pdf, other

    cs.LG stat.ML

    Instance based Generalization in Reinforcement Learning

    Authors: Martin Bertran, Natalia Martinez, Mariano Phielipp, Guillermo Sapiro

    Abstract: Agents trained via deep reinforcement learning (RL) routinely fail to generalize to unseen environments, even when these share the same underlying dynamics as the training levels. Understanding the generalization properties of RL is one of the challenges of modern machine learning. Towards this goal, we analyze policy learning in the context of Partially Observable Markov Decision Processes (POMDP… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: Accepted on NeurIPS 2020

  15. arXiv:2010.12083  [pdf, other

    cs.RO cs.CL cs.CV cs.LG

    Language-Conditioned Imitation Learning for Robot Manipulation Tasks

    Authors: Simon Stepputtis, Joseph Campbell, Mariano Phielipp, Stefan Lee, Chitta Baral, Heni Ben Amor

    Abstract: Imitation learning is a popular approach for teaching motor skills to robots. However, most approaches focus on extracting policy parameters from execution traces alone (i.e., motion trajectories and perceptual data). No adequate communication channel exists between the human expert and the robot to describe critical aspects of the task, such as the properties of the target object or the intended… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted to the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada as spotlight presentation

  16. arXiv:2006.00545  [pdf, other

    cs.RO cs.CV

    Motion2Vec: Semi-Supervised Representation Learning from Surgical Videos

    Authors: Ajay Kumar Tanwani, Pierre Sermanet, Andy Yan, Raghav Anand, Mariano Phielipp, Ken Goldberg

    Abstract: Learning meaningful visual representations in an embedding space can facilitate generalization in downstream tasks such as action segmentation and imitation. In this paper, we learn a motion-centric representation of surgical video demonstrations by grou** them into action segments/sub-goals/options in a semi-supervised manner. We present Motion2Vec, an algorithm that learns a deep embedding fea… ▽ More

    Submitted 31 May, 2020; originally announced June 2020.

    Comments: IEEE International Conference on Robotics and Automation (ICRA), 2020

  17. Clone Swarms: Learning to Predict and Control Multi-Robot Systems by Imitation

    Authors: Siyu Zhou, Mariano Phielipp, Jorge A. Sefair, Sara I. Walker, Heni Ben Amor

    Abstract: In this paper, we propose SwarmNet -- a neural network architecture that can learn to predict and imitate the behavior of an observed swarm of agents in a centralized manner. Tested on artificially generated swarm motion data, the network achieves high levels of prediction accuracy and imitation authenticity. We compare our model to previous approaches for modelling interaction systems and show ho… ▽ More

    Submitted 2 November, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

  18. arXiv:1911.11744  [pdf, other

    cs.RO cs.CL cs.CV cs.LG

    Imitation Learning of Robot Policies by Combining Language, Vision and Demonstration

    Authors: Simon Stepputtis, Joseph Campbell, Mariano Phielipp, Chitta Baral, Heni Ben Amor

    Abstract: In this work we propose a novel end-to-end imitation learning approach which combines natural language, vision, and motion information to produce an abstract representation of a task, which in turn is used to synthesize specific motion controllers at run-time. This multimodal approach enables generalization to a wide variety of environmental conditions and allows an end-user to direct a robot poli… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: Accepted to the NeurIPS 2019 Workshop on Robot Learning: Control and Interaction in the Real World, Vancouver, Canada

  19. arXiv:1907.00269  [pdf, other

    cs.RO cs.AI cs.LG

    On Training Flexible Robots using Deep Reinforcement Learning

    Authors: Zach Dwiel, Madhavun Candadai, Mariano Phielipp

    Abstract: The use of robotics in controlled environments has flourished over the last several decades and training robots to perform tasks using control strategies developed from dynamical models of their hardware have proven very effective. However, in many real-world settings, the uncertainties of the environment, the safety requirements and generalized capabilities that are expected of robots make rigid… ▽ More

    Submitted 13 July, 2019; v1 submitted 29 June, 2019; originally announced July 2019.

    Comments: Accepted at the Intelligent Robots and Systems (IRoS) conference, 2019. Camera-ready version coming soon

  20. arXiv:1906.05838  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Goal-conditioned Imitation Learning

    Authors: Yiming Ding, Carlos Florensa, Mariano Phielipp, Pieter Abbeel

    Abstract: Designing rewards for Reinforcement Learning (RL) is challenging because it needs to convey the desired task, be efficient to optimize, and be easy to compute. The latter is particularly problematic when applying RL to robotics, where detecting whether the desired configuration is reached might require considerable supervision and instrumentation. Furthermore, we are often interested in being able… ▽ More

    Submitted 27 May, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: Published at NeurIPS 2019

  21. arXiv:1905.01537  [pdf, other

    cs.LG cs.AI

    Hierarchical Policy Learning is Sensitive to Goal Space Design

    Authors: Zach Dwiel, Madhavun Candadai, Mariano Phielipp, Arjun K. Bansal

    Abstract: Hierarchy in reinforcement learning agents allows for control at multiple time scales yielding improved sample efficiency, the ability to deal with long time horizons and transferability of sub-policies to tasks outside the training distribution. It is often implemented as a master policy providing goals to a sub-policy. Ideally, we would like the goal-spaces to be learned, however, properties of… ▽ More

    Submitted 25 June, 2019; v1 submitted 4 May, 2019; originally announced May 2019.

    Comments: Accepted to be presented at Task-Agnostic Reinforcement Learning (TARL) workshop at ICLR'19