Skip to main content

Showing 1–23 of 23 results for author: Arenas, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16021  [pdf, other

    cs.RO

    VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

    Authors: Michael Ahn, Montserrat Gonzalez Arenas, Matthew Bennice, Noah Brown, Christine Chan, Byron David, Anthony Francis, Gavin Gonzalez, Rainer Hessmer, Tomas Jackson, Nikhil J Joshi, Daniel Lam, Tsang-Wei Edward Lee, Alex Luong, Sharath Maddineni, Harsh Patel, Jodilyn Peralta, Jornell Quiambao, Diego Reyes, Rosario M Jauregui Ruano, Dorsa Sadigh, Pannag Sanketi, Leila Takayama, Pavel Vodenski, Fei Xia

    Abstract: Robots today can exploit the rich world knowledge of large language models to chain simple behavioral skills into long-horizon tasks. However, robots often get interrupted during long-horizon tasks due to primitive skill failures and dynamic environments. We propose VADER, a plan, execute, detect framework with seeking help as a new skill that enables robots to recover and complete long-horizon ta… ▽ More

    Submitted 30 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures

  2. arXiv:2404.15118  [pdf, other

    cs.NE cond-mat.dis-nn cond-mat.stat-mech hep-lat physics.comp-ph

    Identifying phase transitions in physical systems with neural networks: a neural architecture search perspective

    Authors: Rodrigo Carmo Terin, Zochil González Arenas, Roberto Santana

    Abstract: The use of machine learning algorithms to investigate phase transitions in physical systems is a valuable way to better understand the characteristics of these systems. Neural networks have been used to extract information of phases and phase transitions directly from many-body configurations. However, one limitation of neural networks is that they require the definition of the model architecture… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 9 pages, 7 figures

  3. arXiv:2402.14867  [pdf

    cs.CL cs.AI cs.LG

    Effects of term weighting approach with and without stop words removing on Arabic text classification

    Authors: Esra'a Alhenawi, Ruba Abu Khurma, Pedro A. Castillo, Maribel G. Arenas

    Abstract: Classifying text is a method for categorizing documents into pre-established groups. Text documents must be prepared and represented in a way that is appropriate for the algorithms used for data mining prior to classification. As a result, a number of term weighting strategies have been created in the literature to enhance text categorization algorithms' functionality. This study compares the effe… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  4. arXiv:2402.11450  [pdf, other

    cs.RO

    Learning to Learn Faster from Human Feedback with Language Model Predictive Control

    Authors: Jacky Liang, Fei Xia, Wenhao Yu, Andy Zeng, Montserrat Gonzalez Arenas, Maria Attarian, Maria Bauza, Matthew Bennice, Alex Bewley, Adil Dostmohamed, Chuyuan Kelly Fu, Nimrod Gileadi, Marissa Giustina, Keerthana Gopalakrishnan, Leonard Hasenclever, Jan Humplik, Jasmine Hsu, Nikhil Joshi, Ben Jyenis, Chase Kew, Sean Kirmani, Tsang-Wei Edward Lee, Kuang-Huei Lee, Assaf Hurwitz Michaely, Joss Moore , et al. (25 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to exhibit a wide range of capabilities, such as writing robot code from language commands -- enabling non-experts to direct robot behaviors, modify them based on feedback, or compose them to perform new tasks. However, these capabilities (driven by in-context learning) are limited to short-term interactions, where users' feedback remains relevant for o… ▽ More

    Submitted 31 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  5. Determining the significance and relative importance of parameters of a simulated quenching algorithm using statistical tools

    Authors: Pedro A. Castillo, Maribel García Arenas, Nuria Rico, Antonio Miguel Mora, Pablo García-Sánchez, Juan Luis Jiménez Laredo, Juan Julián Merelo Guervós

    Abstract: When search methods are being designed it is very important to know which parameters have the greatest influence on the behaviour and performance of the algorithm. To this end, algorithm parameters are commonly calibrated by means of either theoretic analysis or intensive experimentation. When undertaking a detailed statistical analysis of the influence of each parameter, the designer should pay a… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  6. arXiv:2401.12963  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents

    Authors: Michael Ahn, Debidatta Dwibedi, Chelsea Finn, Montse Gonzalez Arenas, Keerthana Gopalakrishnan, Karol Hausman, Brian Ichter, Alex Irpan, Nikhil Joshi, Ryan Julian, Sean Kirmani, Isabel Leal, Edward Lee, Sergey Levine, Yao Lu, Isabel Leal, Sharath Maddineni, Kanishka Rao, Dorsa Sadigh, Pannag Sanketi, Pierre Sermanet, Quan Vuong, Stefan Welker, Fei Xia, Ted Xiao , et al. (3 additional authors not shown)

    Abstract: Foundation models that incorporate language, vision, and more recently actions have revolutionized the ability to harness internet scale data to reason about useful tasks. However, one of the key challenges of training embodied foundation models is the lack of data grounded in the physical world. In this paper, we propose AutoRT, a system that leverages existing foundation models to scale up the d… ▽ More

    Submitted 1 July, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 26 pages, 9 figures, ICRA 2024 VLMNM Workshop

  7. arXiv:2311.10678  [pdf, other

    cs.RO cs.AI cs.LG

    Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections

    Authors: Lihan Zha, Yuchen Cui, Li-Heng Lin, Minae Kwon, Montserrat Gonzalez Arenas, Andy Zeng, Fei Xia, Dorsa Sadigh

    Abstract: Today's robot policies exhibit subpar performance when faced with the challenge of generalizing to novel environments. Human corrective feedback is a crucial form of guidance to enable such generalization. However, adapting to and learning from online human corrections is a non-trivial endeavor: not only do robots need to remember human feedback over time to retrieve the right information in new s… ▽ More

    Submitted 21 March, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: 8 pages, 4 figures, videos and code links on website https://sites.google.com/stanford.edu/droc

  8. arXiv:2311.01977  [pdf, other

    cs.RO cs.AI

    RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches

    Authors: Jiayuan Gu, Sean Kirmani, Paul Wohlhart, Yao Lu, Montserrat Gonzalez Arenas, Kanishka Rao, Wenhao Yu, Chuyuan Fu, Keerthana Gopalakrishnan, Zhuo Xu, Priya Sundaresan, Peng Xu, Hao Su, Karol Hausman, Chelsea Finn, Quan Vuong, Ted Xiao

    Abstract: Generalization remains one of the most important desiderata for robust robot learning systems. While recently proposed approaches show promise in generalization to novel objects, semantic concepts, or visual distribution shifts, generalization to new tasks remains challenging. For example, a language-conditioned policy trained on pick-and-place tasks will not be able to generalize to a folding tas… ▽ More

    Submitted 6 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: Evaluation videos can be found at https://rt-trajectory.github.io/

  9. arXiv:2310.13607  [pdf, ps, other

    cs.LG

    Analyzing the contribution of different passively collected data to predict Stress and Depression

    Authors: Irene Bonafonte, Cristina Bustos, Abraham Larrazolo, Gilberto Lorenzo Martinez Luna, Adolfo Guzman Arenas, Xavier Baro, Isaac Tourgeman, Mercedes Balcells, Agata Lapedriza

    Abstract: The possibility of recognizing diverse aspects of human behavior and environmental context from passively captured data motivates its use for mental health assessment. In this paper, we analyze the contribution of different passively collected sensor data types (WiFi, GPS, Social interaction, Phone Log, Physical Activity, Audio, and Academic features) to predict daily selfreport stress and PHQ-9 d… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  10. arXiv:2308.10382  [pdf, other

    cs.CV cs.AI

    False Negative/Positive Control for SAM on Noisy Medical Images

    Authors: Xing Yao, Han Liu, Dewei Hu, Daiwei Lu, Ange Lou, Hao Li, Ruining Deng, Gabriel Arenas, Baris Oguz, Nadav Schwartz, Brett C Byram, Ipek Oguz

    Abstract: The Segment Anything Model (SAM) is a recently developed all-range foundation model for image segmentation. It can use sparse manual prompts such as bounding boxes to generate pixel-level segmentation in natural images but struggles in medical images such as low-contrast, noisy ultrasound images. We propose a refined test-phase prompt augmentation technique designed to improve SAM's performance in… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

  11. arXiv:2307.15818  [pdf, other

    cs.RO cs.CL cs.CV cs.LG

    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

    Authors: Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Xi Chen, Krzysztof Choromanski, Tianli Ding, Danny Driess, Avinava Dubey, Chelsea Finn, Pete Florence, Chuyuan Fu, Montse Gonzalez Arenas, Keerthana Gopalakrishnan, Kehang Han, Karol Hausman, Alexander Herzog, Jasmine Hsu, Brian Ichter, Alex Irpan, Nikhil Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal , et al. (29 additional authors not shown)

    Abstract: We study how vision-language models trained on Internet-scale data can be incorporated directly into end-to-end robotic control to boost generalization and enable emergent semantic reasoning. Our goal is to enable a single end-to-end trained model to both learn to map robot observations to actions and enjoy the benefits of large-scale pretraining on language and vision-language data from the web.… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Website: https://robotics-transformer.github.io/

  12. arXiv:2307.04721  [pdf, other

    cs.AI cs.CL cs.RO

    Large Language Models as General Pattern Machines

    Authors: Suvir Mirchandani, Fei Xia, Pete Florence, Brian Ichter, Danny Driess, Montserrat Gonzalez Arenas, Kanishka Rao, Dorsa Sadigh, Andy Zeng

    Abstract: We observe that pre-trained large language models (LLMs) are capable of autoregressively completing complex token sequences -- from arbitrary ones procedurally generated by probabilistic context-free grammars (PCFG), to more rich spatial patterns found in the Abstraction and Reasoning Corpus (ARC), a general AI benchmark, prompted in the style of ASCII art. Surprisingly, pattern completion profici… ▽ More

    Submitted 25 October, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: 21 pages, 25 figures. To appear at Conference on Robot Learning (CoRL) 2023

  13. arXiv:2307.02358  [pdf

    cs.CL cs.HC

    To be or not to be: a translation reception study of a literary text translated into Dutch and Catalan using machine translation

    Authors: Ana Guerberof Arenas, Antonio Toral

    Abstract: This article presents the results of a study involving the reception of a fictional story by Kurt Vonnegut translated from English into Catalan and Dutch in three conditions: machine-translated (MT), post-edited (PE) and translated from scratch (HT). 223 participants were recruited who rated the reading conditions using three scales: Narrative Engagement, Enjoyment and Translation Reception. The r… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 39 pages, 9 figures, authors' manuscript approved for publication in Target International Journal of Translation published by John Benjamins

  14. arXiv:2306.08647  [pdf, other

    cs.RO cs.AI cs.LG

    Language to Rewards for Robotic Skill Synthesis

    Authors: Wenhao Yu, Nimrod Gileadi, Chuyuan Fu, Sean Kirmani, Kuang-Huei Lee, Montse Gonzalez Arenas, Hao-Tien Lewis Chiang, Tom Erez, Leonard Hasenclever, Jan Humplik, Brian Ichter, Ted Xiao, Peng Xu, Andy Zeng, Tingnan Zhang, Nicolas Heess, Dorsa Sadigh, Jie Tan, Yuval Tassa, Fei Xia

    Abstract: Large language models (LLMs) have demonstrated exciting progress in acquiring diverse new capabilities through in-context learning, ranging from logical reasoning to code-writing. Robotics researchers have also explored using LLMs to advance the capabilities of robotic control. However, since low-level robot actions are hardware-dependent and underrepresented in LLM training corpora, existing effo… ▽ More

    Submitted 16 June, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: https://language-to-reward.github.io/

  15. arXiv:2305.03270  [pdf, other

    cs.RO

    Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

    Authors: Alexander Herzog, Kanishka Rao, Karol Hausman, Yao Lu, Paul Wohlhart, Mengyuan Yan, Jessica Lin, Montserrat Gonzalez Arenas, Ted Xiao, Daniel Kappler, Daniel Ho, Jarek Rettinghouse, Yevgen Chebotar, Kuang-Huei Lee, Keerthana Gopalakrishnan, Ryan Julian, Adrian Li, Chuyuan Kelly Fu, Bob Wei, Sangeetha Ramesh, Khem Holden, Kim Kleiven, David Rendleman, Sean Kirmani, Jeff Bingham , et al. (15 additional authors not shown)

    Abstract: We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Published at Robotics: Science and Systems 2023

  16. DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages

    Authors: Gabriele Sarti, Arianna Bisazza, Ana Guerberof Arenas, Antonio Toral

    Abstract: We introduce DivEMT, the first publicly available post-editing study of Neural Machine Translation (NMT) over a typologically diverse set of target languages. Using a strictly controlled setup, 18 professional translators were instructed to translate or post-edit the same set of English documents into Arabic, Dutch, Italian, Turkish, Ukrainian, and Vietnamese. During the process, their edits, keys… ▽ More

    Submitted 18 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: EMNLP 2022, materials: https://github.com/gsarti/divemt

    Journal ref: Proceedings of EMNLP (2022) 7795-7816

  17. Creativity in translation: machine translation as a constraint for literary texts

    Authors: Ana Guerberof Arenas, Antonio Toral

    Abstract: This article presents the results of a study involving the translation of a short story by Kurt Vonnegut from English to Catalan and Dutch using three modalities: machine-translation (MT), post-editing (PE) and translation without aid (HT). Our aim is to explore creativity, understood to involve novelty and acceptability, from a quantitative perspective. The results show that HT has the highest cr… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 28 pages, 2 figures, 10 tables

    Journal ref: Translation Spaces 2022

  18. The Impact of Post-editing and Machine Translation on Creativity and Reading Experience

    Authors: Ana Guerberof Arenas, Antonio Toral

    Abstract: This article presents the results of a study involving the translation of a fictional story from English into Catalan in three modalities: machine-translated (MT), post-edited (MTPE) and translated without aid (HT). Each translation was analysed to evaluate its creativity. Subsequently, a cohort of 88 Catalan participants read the story in a randomly assigned modality and completed a survey. The r… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

    Comments: 28 pages, 10 tables, 4 figures. Translation Spaces (2020)

  19. arXiv:1603.03185  [pdf, other

    cs.CL cs.LG cs.SD

    Personalized Speech recognition on mobile devices

    Authors: Ian McGraw, Rohit Prabhavalkar, Raziel Alvarez, Montse Gonzalez Arenas, Kanishka Rao, David Rybach, Ouais Alsharif, Hasim Sak, Alexander Gruenstein, Francoise Beaufays, Carolina Parada

    Abstract: We describe a large vocabulary speech recognition system that is accurate, has low latency, and yet has a small enough memory and computational footprint to run faster than real-time on a Nexus 5 Android smartphone. We employ a quantized Long Short-Term Memory (LSTM) acoustic model trained with connectionist temporal classification (CTC) to directly predict phoneme targets, and further reduce its… ▽ More

    Submitted 11 March, 2016; v1 submitted 10 March, 2016; originally announced March 2016.

  20. arXiv:1501.06857  [pdf, other

    cs.SI cs.SE

    Measuring the local GitHub developer community

    Authors: J. J. Merelo, Nuria Rico, Israel Blancas, M. G. Arenas, Fernando Tricas, José Antonio Vacas

    Abstract: Creating rankings might seem like a vain exercise in belly-button gazing, even more so for people so unlike that kind of things as programmers. However, in this paper we will try to prove how creating city (or province) based rankings in Spain has led to all kind of interesting effects, including increased productivity and community building. We describe the methodology we have used to search for… ▽ More

    Submitted 27 January, 2015; originally announced January 2015.

    Comments: Paper supporting presentation at Floss Community Metrics conference

  21. arXiv:1105.4978  [pdf, ps, other

    cs.NE

    SOAP vs REST: Comparing a master-slave GA implementation

    Authors: P. A. Castillo, J. L. Bernier, M. G. Arenas, J. J. Merelo, P. Garcia-Sanchez

    Abstract: In this paper, a high-level comparison of both SOAP (Simple Object Access Protocol) and REST (Representational State Transfer) is made. These are the two main approaches for interfacing to the web with web services. Both approaches are different and present some advantages and disadvantages for interfacing to web services: SOAP is conceptually more difficult (has a steeper learning curve) and more… ▽ More

    Submitted 25 May, 2011; originally announced May 2011.

    Comments: Paper 2 for the First International Workshop of Distributed Evolutionary computation in Informal Environments

  22. arXiv:1105.4971  [pdf, ps, other

    cs.NE

    Distributed Evolutionary Computation using REST

    Authors: P. A. Castillo, M. G. Arenas, A. M. Mora, J. L. J. Laredo, G. Romero, V. M Rivas, J. J. Merelo

    Abstract: This paper analises distributed evolutionary computation based on the Representational State Transfer (REST) protocol, which overlays a farming model on evolutionary computation. An approach to evolutionary distributed optimisation of multilayer perceptrons (MLP) using REST and language Perl has been done. In these experiments, a master-slave based evolutionary algorithm (EA) has been implemented,… ▽ More

    Submitted 25 May, 2011; originally announced May 2011.

    Comments: Paper 3 for the First International Workshop of Distributed Evolutionary computation in Informal Environments

  23. arXiv:cs/0603004  [pdf, ps, other

    cs.NE

    Lamarckian Evolution and the Baldwin Effect in Evolutionary Neural Networks

    Authors: P. A. Castillo, M. G. Arenas, J. G. Castellano, J. J. Merelo, A. Prieto, V. Rivas, G. Romero

    Abstract: Hybrid neuro-evolutionary algorithms may be inspired on Darwinian or Lamarckian evolu- tion. In the case of Darwinian evolution, the Baldwin effect, that is, the progressive incorporation of learned characteristics to the genotypes, can be observed and leveraged to improve the search. The purpose of this paper is to carry out an exper- imental study into how learning can improve G-Prop genetic s… ▽ More

    Submitted 1 March, 2006; originally announced March 2006.

    Comments: Presented in a Spanish conference, MAEB

    ACM Class: C.1.3