Skip to main content

Showing 1–8 of 8 results for author: Wilcox, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.09530  [pdf, other

    cs.CY cs.CV cs.LG

    A community palm model

    Authors: Nicholas Clinton, Andreas Vollrath, Remi D'annunzio, Desheng Liu, Henry B. Glick, AdriĆ  Descals, Alicia Sullivan, Oliver Guinan, Jacob Abramowitz, Fred Stolle, Chris Goodman, Tanya Birch, David Quinn, Olga Danylo, Tijs Lips, Daniel Coelho, Enikoe Bihari, Bryce Cronkite-Ratcliff, Ate Poortinga, Atena Haghighattalab, Evan Notman, Michael DeWitt, Aaron Yonas, Gennadii Donchyts, Devaja Shah , et al. (5 additional authors not shown)

    Abstract: Palm oil production has been identified as one of the major drivers of deforestation for tropical countries. To meet supply chain objectives, commodity producers and other stakeholders need timely information of land cover dynamics in their supply shed. However, such data are difficult to obtain from suppliers who may lack digital geographic representations of their supply sheds and production loc… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: v0

  2. arXiv:2210.07432  [pdf, other

    cs.LG cs.AI

    Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations

    Authors: Albert Wilcox, Ashwin Balakrishna, Jules Dedieu, Wyame Benslimane, Daniel S. Brown, Ken Goldberg

    Abstract: Providing densely shaped reward functions for RL algorithms is often exceedingly challenging, motivating the development of RL algorithms that can learn from easier-to-specify sparse reward functions. This sparsity poses new exploration challenges. One common way to address this problem is using demonstrations to provide initial signal about regions of the state space with high rewards. However, p… ▽ More

    Submitted 20 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: To be published in the 36th Conference on Neural Information Processing Systems (NeurIPS 2022). 19 pages. 11 figures

  3. arXiv:2209.13042  [pdf, other

    cs.RO

    Self-Supervised Visuo-Tactile Pretraining to Locate and Follow Garment Features

    Authors: Justin Kerr, Huang Huang, Albert Wilcox, Ryan Hoque, Jeffrey Ichnowski, Roberto Calandra, Ken Goldberg

    Abstract: Humans make extensive use of vision and touch as complementary senses, with vision providing global information about the scene and touch measuring local information during manipulation without suffering from occlusions. While prior work demonstrates the efficacy of tactile sensing for precise manipulation of deformables, they typically rely on supervised, human-labeled datasets. We propose Self-S… ▽ More

    Submitted 31 July, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: RSS 2023, site: https://sites.google.com/berkeley.edu/ssvtp

  4. arXiv:2112.04071  [pdf, other

    cs.RO

    Learning to Localize, Grasp, and Hand Over Unmodified Surgical Needles

    Authors: Albert Wilcox, Justin Kerr, Brijen Thananjeyan, Jeffrey Ichnowski, Minho Hwang, Samuel Paradis, Danyal Fer, Ken Goldberg

    Abstract: Robotic Surgical Assistants (RSAs) are commonly used to perform minimally invasive surgeries by expert surgeons. However, long procedures filled with tedious and repetitive tasks such as suturing can lead to surgeon fatigue, motivating the automation of suturing. As visual tracking of a thin reflective needle is extremely challenging, prior work has modified the needle with nonreflective contrasti… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: 8 pages, 7 figures. First two authors contributed equally

  5. arXiv:2109.08273  [pdf, other

    cs.RO cs.AI

    ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning

    Authors: Ryan Hoque, Ashwin Balakrishna, Ellen Novoseller, Albert Wilcox, Daniel S. Brown, Ken Goldberg

    Abstract: Effective robot learning often requires online human feedback and interventions that can cost significant human time, giving rise to the central challenge in interactive imitation learning: is it possible to control the timing and length of interventions to both facilitate learning and limit burden on the human supervisor? This paper presents ThriftyDAgger, an algorithm for actively querying a hum… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: CoRL 2021 Oral

  6. arXiv:2107.04775  [pdf, other

    cs.LG cs.AI cs.RO

    LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks

    Authors: Albert Wilcox, Ashwin Balakrishna, Brijen Thananjeyan, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Reinforcement learning (RL) has shown impressive success in exploring high-dimensional environments to learn complex tasks, but can often exhibit unsafe behaviors and require extensive environment interaction when exploration is unconstrained. A promising strategy for learning in dynamically uncertain environments is requiring that the agent can robustly return to learned safe sets, where task suc… ▽ More

    Submitted 20 September, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

    Comments: Conference on Robot Learning (CoRL) 2021. First two authors contributed equally

    Journal ref: Conference on Robot Learning (CoRL) 2021

  7. arXiv:2012.09156  [pdf, other

    cs.LG cs.RO

    Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning

    Authors: Nathan O. Lambert, Albert Wilcox, Howard Zhang, Kristofer S. J. Pister, Roberto Calandra

    Abstract: Accurately predicting the dynamics of robotic systems is crucial for model-based control and reinforcement learning. The most common way to estimate dynamics is by fitting a one-step ahead prediction model and using it to recursively propagate the predicted state distribution over long horizons. Unfortunately, this approach is known to compound even small prediction errors, making long-term predic… ▽ More

    Submitted 31 August, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: 8 pages, +4 pages appendix

  8. arXiv:1401.2943  [pdf, ps, other

    cs.CL

    ONTS: "Optima" News Translation System

    Authors: Marco Turchi, Martin Atkinson, Alastair Wilcox, Brett Crawley, Stefano Bucci, Ralf Steinberger, Erik Van der Goot

    Abstract: We propose a real-time machine translation system that allows users to select a news category and to translate the related live news articles from Arabic, Czech, Danish, Farsi, French, German, Italian, Polish, Portuguese, Spanish and Turkish into English. The Moses-based system was optimised for the news domain and differs from other available systems in four ways: (1) News items are automatically… ▽ More

    Submitted 13 January, 2014; originally announced January 2014.

    ACM Class: I.2.7; H.3.3; H.3.6

    Journal ref: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 25-30, Avignon, France, April 23 - 27 2012. Association for Computational Linguistics