Skip to main content

Showing 1–29 of 29 results for author: Gomez, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.11571  [pdf, other

    cs.RO cs.AI cs.CL

    Ain't Misbehavin' -- Using LLMs to Generate Expressive Robot Behavior in Conversations with the Tabletop Robot Haru

    Authors: Zining Wang, Paul Reisert, Eric Nichols, Randy Gomez

    Abstract: Social robots aim to establish long-term bonds with humans through engaging conversation. However, traditional conversational approaches, reliant on scripted interactions, often fall short in maintaining engaging conversations. This paper addresses this limitation by integrating large language models (LLMs) into social robots to achieve more dynamic and expressive conversations. We introduce a ful… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted as Late Breaking Report (LBR) at the 19th Annual ACM/IEEE International Conference on Human Robot Interaction (HRI '24)

    Journal ref: Companion of HRI '24, March 11-14, 2024, Boulder, CO, USA

  2. arXiv:2402.11569  [pdf, other

    cs.RO cs.AI cs.CL

    Develo** Autonomous Robot-Mediated Behavior Coaching Sessions with Haru

    Authors: Matouš Jelínek, Eric Nichols, Randy Gomez

    Abstract: This study presents an empirical investigation into the design and impact of autonomous dialogues in human-robot interaction for behavior change coaching. We focus on the use of Haru, a tabletop social robot, and explore the implementation of the Tiny Habits method for fostering positive behavior change. The core of our study lies in develo** a fully autonomous dialogue system that maximizes Har… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted as Late Breaking Report (LBR) at the 19th Annual ACM/IEEE International Conference on Human Robot Interaction (HRI '24)

    Journal ref: HRI '24 Companion, March 11-14, 2024, Boulder, CO, USA

  3. arXiv:2402.05088  [pdf, other

    math.CO cs.DM

    Domination and packing in graphs

    Authors: Renzo Gómez, Juan Gutiérrez

    Abstract: Given a graph~$G$, the domination number, denoted by~$γ(G)$, is the minimum cardinality of a dominating set in~$G$. Dual to the notion of domination number is the packing number of a graph. A packing of~$G$ is a set of vertices whose pairwise distance is at least three. The packing number~$ρ(G)$ of~$G$ is the maximum cardinality of one such set. Furthermore, the inequality~$ρ(G) \leq γ(G)$ is well… ▽ More

    Submitted 8 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 12 pages, 6 figures

    MSC Class: 05C69 ACM Class: G.2.2

  4. arXiv:2312.12473  [pdf, ps, other

    cs.HC cs.AI

    A Study on Social Robot Behavior in Group Conversation

    Authors: Tung Nguyen, Eric Nichols, Randy Gomez

    Abstract: Recently, research in human-robot interaction began to consider a robot's influence at the group level. Despite the recent growth in research investigating the effects of robots within groups of people, our overall understanding of what happens when robots are placed within groups or teams of people is still limited. This paper investigates several key problems for social robots that manage conver… ▽ More

    Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: 5 pages

  5. arXiv:2310.13508  [pdf, other

    cs.RO cs.HC

    Social Robot Mediator for Multiparty Interaction

    Authors: Manith Adikari, Angelo Cangelosi, Randy Gomez

    Abstract: A social robot acting as a 'mediator' can enhance interactions between humans, for example, in fields such as education and healthcare. A particularly promising area of research is the use of a social robot mediator in a multiparty setting, which tends to be the most applicable in real-world scenarios. However, research in social robot mediation for multiparty interactions is still emerging and fa… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 2023 IEEE International Conference on Robotics and Automation (ICRA 2023) Workshop Towards a Balanced Cyberphysical Society: A Focus on Group Social Dynamics

  6. arXiv:2308.15710  [pdf, ps, other

    cs.AI cs.LG

    Speech Wikimedia: A 77 Language Multilingual Speech Dataset

    Authors: Rafael Mosquera Gómez, Julián Eusse, Juan Ciro, Daniel Galvez, Ryan Hileman, Kurt Bollacker, David Kanter

    Abstract: The Speech Wikimedia Dataset is a publicly available compilation of audio with transcriptions extracted from Wikimedia Commons. It includes 1780 hours (195 GB) of CC-BY-SA licensed transcribed speech from a diverse set of scenarios and speakers, in 77 different languages. Each audio file has one or more transcriptions in different languages, making this dataset suitable for training speech recogni… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Data-Centric Machine Learning Workshop at the International Machine Learning Conference 2023 (ICML)

  7. arXiv:2305.05389  [pdf, other

    cs.LG

    Two to Five Truths in Non-Negative Matrix Factorization

    Authors: John M. Conroy, Neil P Molino, Brian Baughman, Rod Gomez, Ryan Kaliszewski, Nicholas A. Lines

    Abstract: In this paper, we explore the role of matrix scaling on a matrix of counts when building a topic model using non-negative matrix factorization. We present a scaling inspired by the normalized Laplacian (NL) for graphs that can greatly improve the quality of a non-negative matrix factorization. The results parallel those in the spectral graph clustering work of \cite{Priebe:2019}, where the authors… ▽ More

    Submitted 5 September, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

  8. Towards AutoQML: A Cloud-Based Automated Circuit Architecture Search Framework

    Authors: Raúl Berganza Gómez, Corey O'Meara, Giorgio Cortiana, Christian B. Mendl, Juan Bernabé-Moreno

    Abstract: The learning process of classical machine learning algorithms is tuned by hyperparameters that need to be customized to best learn and generalize from an input dataset. In recent years, Quantum Machine Learning (QML) has been gaining traction as a possible application of quantum computing which may provide quantum advantage in the future. However, quantum versions of classical machine learning alg… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: 8 pages, to appear in QSA 2022 (IEEE ICSA 2022)

    Journal ref: 2022 IEEE 19th International Conference on Software Architecture Companion (ICSA-C), 129-136 (2022)

  9. arXiv:2202.02599  [pdf, other

    math.CO cs.DM

    Path eccentricity of graphs

    Authors: Renzo Gómez, Juan Gutiérrez

    Abstract: Let $G$ be a connected graph. The eccentricity of a path $P$, denoted by ecc$_G(P)$, is the maximum distance from $P$ to any vertex in $G$. In the \textsc{Central path} (CP) problem our aim is to find a path of minimum eccentricity. This problem was introduced by Cockayne et al., in 1981, in the study of different centrality measures on graphs. They showed that CP can be solved in linear time in t… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    MSC Class: 05C38 ACM Class: G.2.2

  10. arXiv:2104.06600  [pdf, other

    cs.LG cs.AI cs.RO

    GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback

    Authors: Jie Huang, Rongshun Juan, Randy Gomez, Keisuke Nakamura, Qixin Sha, Bo He, Guangliang Li

    Abstract: Deep reinforcement learning (DRL) has achieved great successes in many simulated tasks. The sample inefficiency problem makes applying traditional DRL methods to real-world robots a great challenge. Generative Adversarial Imitation Learning (GAIL) -- a general model-free imitation learning method, allows robots to directly learn policies from expert trajectories in large environments. However, GAI… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

  11. arXiv:2102.08358  [pdf, ps, other

    cs.LG cs.GT

    Efficient Competitions and Online Learning with Strategic Forecasters

    Authors: Rafael Frongillo, Robert Gomez, Anish Thilagar, Bo Waggoner

    Abstract: Winner-take-all competitions in forecasting and machine-learning suffer from distorted incentives. Witkowski et al. 2018 identified this problem and proposed ELF, a truthful mechanism to select a winner. We show that, from a pool of $n$ forecasters, ELF requires $Θ(n\log n)$ events or test data points to select a near-optimal forecaster with high probability. We then show that standard online lear… ▽ More

    Submitted 10 June, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: This paper will be presented at The Twenty-Second ACM Conference on Economics and Computation (EC '21), July 18-23, 2021, Budapest, Hungary

  12. arXiv:2101.09824  [pdf, other

    cs.HC cs.CY cs.LG

    Beyond Expertise and Roles: A Framework to Characterize the Stakeholders of Interpretable Machine Learning and their Needs

    Authors: Harini Suresh, Steven R. Gomez, Kevin K. Nam, Arvind Satyanarayan

    Abstract: To ensure accountability and mitigate harm, it is critical that diverse stakeholders can interrogate black-box automated systems and find information that is understandable, relevant, and useful to them. In this paper, we eschew prior expertise- and role-based categorizations of interpretability stakeholders in favor of a more granular framework that decouples stakeholders' knowledge from their in… ▽ More

    Submitted 24 January, 2021; originally announced January 2021.

    Comments: In CHI Conference on Human Factors in Computing Systems (CHI '21)

  13. arXiv:2011.10208  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Collaborative Storytelling with Large-scale Neural Language Models

    Authors: Eric Nichols, Leo Gao, Randy Gomez

    Abstract: Storytelling plays a central role in human socializing and entertainment. However, much of the research on automatic storytelling generation assumes that stories will be generated by an agent without any human interaction. In this paper, we introduce the task of collaborative storytelling, where an artificial intelligence agent and a person collaborate to create a unique story by taking turns addi… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: To appear in Proceedings of the 13th Annual ACM SIGGRAPH Conference on Motion, Interaction and Games (MIG 2020)

  14. arXiv:2008.04991  [pdf, other

    cs.CV

    Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation

    Authors: Raul Gomez, Yahui Liu, Marco De Nadai, Dimosthenis Karatzas, Bruno Lepri, Nicu Sebe

    Abstract: Image to image translation aims to learn a map** that transforms an image from one visual domain to another. Recent works assume that images descriptors can be disentangled into a domain-invariant content representation and a domain-specific style representation. Thus, translation models seek to preserve the content of source images while changing the style to a target visual domain. However, sy… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: Submitted to ACM MM '20, October 12-16, 2020, Seattle, WA, USA

  15. arXiv:2007.03375  [pdf, other

    cs.CV

    Location Sensitive Image Retrieval and Tagging

    Authors: Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas

    Abstract: People from different parts of the globe describe objects and concepts in distinct manners. Visual appearance can thus vary across different geographic locations, which makes location a relevant contextual information when analysing visual data. In this work, we address the task of image retrieval related to a given tag conditioned on a certain location on Earth. We present LocSens, a model that l… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    MSC Class: 68T07 ACM Class: I.2.10

    Journal ref: ECCV 2020

  16. arXiv:2005.01600  [pdf, other

    physics.plasm-ph cs.CV

    Quantification of MagLIF morphology using the Mallat Scattering Transformation

    Authors: Michael E. Glinsky, Thomas W. Moore, William E. Lewis, Matthew R. Weis, Christopher A. Jennings, David J. Ampleford, Patrick F. Knapp, Eric C. Harding, Matthew R. Gomez, Adam J. Harvey-Thompson

    Abstract: The morphology of the stagnated plasma resulting from Magnetized Liner Inertial Fusion (MagLIF) is measured by imaging the self-emission x-rays coming from the multi-keV plasma. Equivalent diagnostic response can be generated by integrated radiation-magnetohydrodynamic (rad-MHD) simulations from programs such as HYDRA and GORGON. There have been only limited quantitative ways to compare the image… ▽ More

    Submitted 15 October, 2020; v1 submitted 13 April, 2020; originally announced May 2020.

    Comments: 19 pages, 18 figures, 3 tables, 4 animations, accepted for publication in Physics of Plasmas; arXiv admin note: substantial text overlap with arXiv:1911.02359

    Report number: Sandia National Laboratories Report: SAND2020-10785 J and SAND2020-11336 O

    Journal ref: Phys. Plasmas 27, 112703 (2020)

  17. arXiv:1910.03814  [pdf, other

    cs.CV cs.CL

    Exploring Hate Speech Detection in Multimodal Publications

    Authors: Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas

    Abstract: In this work we target the problem of hate speech detection in multimodal publications formed by a text and an image. We gather and annotate a large scale dataset from Twitter, MMHS150K, and propose different models that jointly analyze textual and visual information for hate speech detection, comparing them with unimodal detection. We provide quantitative and qualitative results and analyze the c… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

  18. arXiv:1907.13275  [pdf, other

    cs.AI

    Towards a Theory of Intentions for Human-Robot Collaboration

    Authors: Rocio Gomez, Mohan Sridharan, Heather Riley

    Abstract: The architecture described in this paper encodes a theory of intentions based on the the key principles of non-procrastination, persistence, and automatically limiting reasoning to relevant knowledge and observations. The architecture reasons with transition diagrams of any given domain at two different resolutions, with the fine-resolution description defined as a refinement of, and hence tightly… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: 25 pages, 4 figures

  19. arXiv:1906.01466  [pdf, other

    cs.CV

    Selective Style Transfer for Text

    Authors: Raul Gomez, Ali Furkan Biten, Lluis Gomez, Jaume Gibert, Marçal Rusiñol, Dimosthenis Karatzas

    Abstract: This paper explores the possibilities of image style transfer applied to text maintaining the original transcriptions. Results on different text domains (scene text, machine printed text and handwritten text) and cross modal results demonstrate that this is feasible, and open different research lines. Furthermore, two architectures for selective style transfer, which means transferring style to on… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted in ICDAR 2019

  20. Software System Design based on Patterns for Newton-Type Methods

    Authors: Ricardo Serrato Barrera, Gustavo Rodríguez Gómez, Julio César Pérez Sansalvador, Saul E. Pomares Hernández, Leticia Flores Pulido, Antonio Muñoz

    Abstract: A wide range of engineering applications uses optimisation techniques as part of their solution process. The researcher uses specialized software that implements well-known optimisation techniques to solve his problem. However, when it comes to develop original optimisation techniques that fit a particular problem the researcher has no option but to implement his own new method from scratch. This… ▽ More

    Submitted 12 May, 2019; originally announced May 2019.

    Comments: 19 pages, 11 Figures

    MSC Class: 68N19

  21. arXiv:1904.08621  [pdf, other

    cs.AI cs.HC cs.LG

    Improving Interactive Reinforcement Agent Planning with Human Demonstration

    Authors: Guangliang Li, Randy Gomez, Keisuke Nakamura, **ying Lin, Qilei Zhang, Bo He

    Abstract: TAMER has proven to be a powerful interactive reinforcement learning method for allowing ordinary people to teach and personalize autonomous agents' behavior by providing evaluative feedback. However, a TAMER agent planning with UCT---a Monte Carlo Tree Search strategy, can only update states along its path and might induce high learning cost especially for a physical robot. In this paper, we prop… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

  22. arXiv:1901.02004  [pdf, other

    cs.CV

    Self-Supervised Learning from Web Data for Multimodal Retrieval

    Authors: Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas

    Abstract: Self-Supervised learning from multimodal image and text data allows deep neural networks to learn powerful features with no need of human annotated data. Web and Social Media platforms provide a virtually unlimited amount of this multimodal data. In this work we propose to exploit this free available data to learn a multimodal image and text embedding, aiming to leverage the semantic knowledge lea… ▽ More

    Submitted 7 January, 2019; originally announced January 2019.

    Comments: Submitted to Multi-Modal Scene Understanding. arXiv admin note: substantial text overlap with arXiv:1808.06368

  23. arXiv:1808.06369  [pdf, other

    cs.CV

    Learning from #Barcelona Instagram data what Locals and Tourists post about its Neighbourhoods

    Authors: Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas

    Abstract: Massive tourism is becoming a big problem for some cities, such as Barcelona, due to its concentration in some neighborhoods. In this work we gather Instagram data related to Barcelona consisting on images-captions pairs and, using the text as a supervisory signal, we learn relations between images, words and neighborhoods. Our goal is to learn which visual elements appear in photos when people is… ▽ More

    Submitted 20 August, 2018; originally announced August 2018.

    Comments: ECCV MULA Workshop 2018

  24. arXiv:1808.06368  [pdf, other

    cs.CV

    Learning to Learn from Web Data through Deep Semantic Embeddings

    Authors: Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas

    Abstract: In this paper we propose to learn a multimodal image and text embedding from Web and Social Media data, aiming to leverage the semantic knowledge learnt in the text domain and transfer it to a visual model for semantic image retrieval. We demonstrate that the pipeline can learn from images with associated text without supervision and perform a thourough analysis of five different text embeddings i… ▽ More

    Submitted 20 August, 2018; originally announced August 2018.

    Comments: ECCV MULA Workshop 2018

  25. arXiv:1807.02110  [pdf, other

    cs.CV

    TextTopicNet - Self-Supervised Learning of Visual Features Through Embedding Images on Semantic Text Spaces

    Authors: Yash Patel, Lluis Gomez, Raul Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

    Abstract: The immense success of deep learning based methods in computer vision heavily relies on large scale training datasets. These richly annotated datasets help the network learn discriminative visual features. Collecting and annotating such datasets requires a tremendous amount of human effort and annotations are limited to popular set of classes. As an alternative, learning visual features by designi… ▽ More

    Submitted 4 July, 2018; originally announced July 2018.

    Comments: arXiv admin note: text overlap with arXiv:1705.08631

  26. arXiv:1712.07086  [pdf, ps, other

    cs.DM math.CO

    Transversals of Longest Paths

    Authors: Márcia R. Cerioli, Cristina G. Fernandes, Renzo Gómez, Juan Gutiérrez, Paloma T. Lima

    Abstract: Let $\lpt(G)$ be the minimum cardinality of a set of vertices that intersects all longest paths in a graph $G$. Let $ω(G)$ be the size of a maximum clique in $G$, and $\tw(G)$ be the treewidth of $G$. We prove that $ \lpt(G) \leq \max\{1,ω(G)-2\}$ when $G$ is a connected chordal graph; that $\lpt(G) =1$ when $G$ is a connected bipartite permutation graph or a connected full substar graph; and that… ▽ More

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: 19 pages, 9 figures

  27. arXiv:1702.05089  [pdf, other

    cs.CV

    Improving Text Proposals for Scene Images with Fully Convolutional Networks

    Authors: Dena Bazazian, Raul Gomez, Anguelos Nicolaou, Lluis Gomez, Dimosthenis Karatzas, Andrew D. Bagdanov

    Abstract: Text Proposals have emerged as a class-dependent version of object proposals - efficient approaches to reduce the search space of possible text object locations in an image. Combined with strong word classifiers, text proposals currently yield top state of the art results in end-to-end scene text recognition. In this paper we propose an improvement over the original Text Proposals algorithm of Gom… ▽ More

    Submitted 16 February, 2017; originally announced February 2017.

    Comments: 6 pages, 8 figures, International Conference on Pattern Recognition (ICPR) - DLPR (Deep Learning for Pattern Recognition) workshop

  28. arXiv:1106.2684  [pdf

    cs.SE cs.ET cs.PL quant-ph

    QIS-XML: An Extensible Markup Language for Quantum Information Science

    Authors: Pascal Heus, Richard Gomez

    Abstract: This Master thesis examines issues of interoperability and integration between the Classic Information Science (CIS) and Quantum Information Science (QIS). It provides a short introduction to the Extensible Markup Language (XML) and proceeds to describe the development steps that have lead to a prototype XML specification for quantum computing (QIS-XML). QIS-XML is a proposed framework, based on t… ▽ More

    Submitted 14 June, 2011; originally announced June 2011.

    Comments: 83 pages, 58 figures

  29. arXiv:0712.3925  [pdf

    cs.SE cs.DB quant-ph

    QIS-XML: A metadata specification for Quantum Information Science

    Authors: Pascal Heus, Richard Gomez

    Abstract: While Quantum Information Science (QIS) is still in its infancy, the ability for quantum based hardware or computers to communicate and integrate with their classical counterparts will be a major requirement towards their success. Little attention however has been paid to this aspect of QIS. To manage and exchange information between systems, today's classic Information Technology (IT) commonly… ▽ More

    Submitted 23 December, 2007; originally announced December 2007.

    Comments: 26 pages, 22 figures