Showing 1–26 of 26 results for author: Murillo, A C
  1. arXiv:2406.09575

    cs.CV

    CARLOR @ Ego4D Step Grounding Challenge: Bayesian temporal-order priors for test time refinement

    Authors: Carlos Plou, Lorenzo Mur-Labadia, Ruben Martinez-Cantin, Ana C. Murillo

    Abstract: The goal of the Step Grounding task is to locate temporal boundaries of activities based on natural language descriptions. This technical report introduces a Bayesian-VSLNet to address the challenge of identifying such temporal segments in lengthy, untrimmed egocentric videos. Our model significantly improves upon traditional models by incorporating a novel Bayesian temporal-order prior during inf…

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2404.01867

    cs.RO cs.LG

    Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation

    Authors: Carlos Plou, Ana C. Murillo, Ruben Martinez-Cantin

    Abstract: Efficiently tackling multiple tasks within complex environments, such as those found in robot manipulation, remains an ongoing challenge in robotics and an opportunity for data-driven solutions, such as reinforcement learning (RL). Model-based RL, by building a dynamic model of the robot, enables data reuse and transfer learning between tasks with the same robot and similar environment. Furthermore…

    Submitted 2 April, 2024; originally announced April 2024.

  3. arXiv:2404.01801

    cs.CV

    EventSleep: Sleep Activity Recognition with Event Cameras

    Authors: Carlos Plou, Nerea Gallego, Alberto Sabater, Eduardo Montijano, Pablo Urcola, Luis Montesano, Ruben Martinez-Cantin, Ana C. Murillo

    Abstract: Event cameras are a promising technology for activity recognition in dark environments due to their unique properties. However, real event camera datasets under low-lighting conditions are still scarce, which also limits the number of approaches to solve this kind of problem, hindering the potential of this technology in many applications. We present EventSleep, a new dataset and methodology to…

    Submitted 2 April, 2024; originally announced April 2024.

  4. arXiv:2403.18033

    cs.CV cs.RO

    SpectralWaste Dataset: Multimodal Data for Waste Sorting Automation

    Authors: Sara Casao, Fernando Peña, Alberto Sabater, Rosa Castillón, Darío Suárez, Eduardo Montijano, Ana C. Murillo

    Abstract: The increase in non-biodegradable waste is a worldwide concern. Recycling facilities play a crucial role, but their automation is hindered by the complex characteristics of waste recycling lines like clutter or object deformation. In addition, the lack of publicly available labeled data for these environments makes developing robust perception systems challenging. Our work explores the benefits of…

    Submitted 26 March, 2024; originally announced March 2024.

  5. arXiv:2403.13467

    cs.RO cs.CV

    CLIPSwarm: Generating Drone Shows from Text Prompts with Vision-Language Models

    Authors: Pablo Pueyo, Eduardo Montijano, Ana C. Murillo, Mac Schwager

    Abstract: This paper introduces CLIPSwarm, a new algorithm designed to automate the modeling of swarm drone formations based on natural language. The algorithm begins by enriching a provided word to compose a text prompt that serves as input to an iterative approach to find the formation that best matches the provided word. The algorithm iteratively refines formations of robots to align with the textual de…

    Submitted 20 March, 2024; originally announced March 2024.

  6. arXiv:2401.05272

    cs.RO

    CineMPC: A Fully Autonomous Drone Cinematography System Incorporating Zoom, Focus, Pose, and Scene Composition

    Authors: Pablo Pueyo, Juan Dendarieta, Eduardo Montijano, Ana C. Murillo, Mac Schwager

    Abstract: We present CineMPC, a complete cinematographic system that autonomously controls a drone to film multiple targets recording user-specified aesthetic objectives. Existing solutions in autonomous cinematography control only the camera extrinsics, namely its position and orientation. In contrast, CineMPC is the first solution that includes the camera intrinsic parameters in the control loop, which a…

    Submitted 10 January, 2024; originally announced January 2024.

  7. arXiv:2311.11047

    cs.RO

    CLIPSwarm: Converting text into formations of robots

    Authors: Pablo Pueyo, Eduardo Montijano, Ana C. Murillo, Mac Schwager

    Abstract: We present CLIPSwarm, an algorithm to generate robot swarm formations from natural language descriptions. CLIPSwarm receives an input text and finds the position of the robots to form a shape that corresponds to the given text. To do so, we implement a variation of the Monte Carlo particle filter to obtain a matching formation iteratively. In every iteration, we generate a set of new formations and…

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Please cite this article as "P. Pueyo, E. Montijano, A. C. Murillo, and M. Schwager, CLIPSwarm: Converting text into formations of robots. ICRA 2023 Workshop on Multi-Robot Learning"

    Journal ref: ICRA 2023, Workshop on Multi-Robot Learning

  8. arXiv:2310.03953

    cs.RO

    CineTransfer: Controlling a Robot to Imitate Cinematographic Style from a Single Example

    Authors: Pablo Pueyo, Eduardo Montijano, Ana C. Murillo, Mac Schwager

    Abstract: This work presents CineTransfer, an algorithmic framework that drives a robot to record a video sequence that mimics the cinematographic style of an input video. We propose features that abstract the aesthetic style of the input video, so the robot can transfer this style to a scene with visual details that are significantly different from the input video. The framework builds upon CineMPC, a tool…

    Submitted 5 October, 2023; originally announced October 2023.

  9. arXiv:2304.07059

    cs.RO

    A Framework for Fast Prototyping of Photo-realistic Environments with Multiple Pedestrians

    Authors: Sara Casao, Andrés Otero, Álvaro Serra-Gómez, Ana C. Murillo, Javier Alonso-Mora, Eduardo Montijano

    Abstract: Robotic applications involving people often require advanced perception systems to better understand complex real-world scenarios. To address this challenge, photo-realistic and physics simulators are gaining popularity as a means of generating accurate data labeling and designing scenarios for evaluating generalization capabilities, e.g., lighting changes, camera movements or different weather co…

    Submitted 14 April, 2023; originally announced April 2023.

  10. Event Transformer+. A multi-purpose solution for efficient event data processing

    Authors: Alberto Sabater, Luis Montesano, Ana C. Murillo

    Abstract: Event cameras record sparse illumination changes with high temporal resolution and high dynamic range. Thanks to their sparse recording and low consumption, they are increasingly used in applications such as AR/VR and autonomous driving. Current top-performing methods often ignore specific event-data properties, leading to the development of generic but computationally expensive algorithms, while e…

    Submitted 3 September, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2204.03355

  11. EndoMapper dataset of complete calibrated endoscopy procedures

    Authors: Pablo Azagra, Carlos Sostres, Ángel Ferrandez, Luis Riazuelo, Clara Tomasini, Oscar León Barbed, Javier Morlana, David Recasens, Victor M. Batlle, Juan J. Gómez-Rodríguez, Richard Elvira, Julia López, Cristina Oriol, Javier Civera, Juan D. Tardós, Ana Cristina Murillo, Angel Lanas, José M. M. Montiel

    Abstract: Computer-assisted systems are becoming broadly used in medicine. In endoscopy, most research focuses on the automatic detection of polyps or other pathologies, but localization and navigation of the endoscope are completely performed manually by physicians. To broaden this research and bring spatial Artificial Intelligence to endoscopies, data from complete procedures is needed. This paper introdu…

    Submitted 10 October, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 17 pages, 14 figures, 8 tables

    Journal ref: Sci Data 10, 671 (2023)

  12. arXiv:2204.03355

    cs.CV

    Event Transformer. A sparse-aware solution for efficient event data processing

    Authors: Alberto Sabater, Luis Montesano, Ana C. Murillo

    Abstract: Event cameras are sensors of great interest for many applications that run in low-resource and challenging environments. They log sparse illumination changes with high temporal resolution and high dynamic range, while they present minimal power consumption. However, top-performing methods often ignore specific event-data properties, leading to the development of generic but computationally expensi…

    Submitted 18 April, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  13. SuperPoint features in endoscopy

    Authors: O. L. Barbed, F. Chadebecq, J. Morlana, J. M. Martínez-Montiel, A. C. Murillo

    Abstract: There is often a significant gap between research results and applicability in routine medical practice. This work studies the performance of well-known local features on a medical dataset captured during routine colonoscopy procedures. Local feature extraction and matching is a key step for many computer vision applications, especially regarding 3D modelling. In the medical domain, handcrafted loc…

    Submitted 9 January, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: 9 pages, 5 figures

  14. arXiv:2104.13415

    cs.CV

    Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank

    Authors: Inigo Alonso, Alberto Sabater, David Ferstl, Luis Montesano, Ana C. Murillo

    Abstract: This work presents a novel approach for semi-supervised semantic segmentation. The key element of this approach is our contrastive learning module that enforces the segmentation network to yield similar pixel-level feature representations for same-class samples across the whole dataset. To achieve this, we maintain a memory bank continuously updated with relevant and high-quality feature vectors f…

    Submitted 6 August, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Journal ref: IEEE International Conference on Computer Vision 2021

  15. arXiv:2104.03634

    cs.RO eess.SY

    CineMPC: Controlling Camera Intrinsics and Extrinsics for Autonomous Cinematography

    Authors: Pablo Pueyo, Eduardo Montijano, Ana C. Murillo, Mac Schwager

    Abstract: We present CineMPC, an algorithm to autonomously control a UAV-borne video camera in a nonlinear Model Predictive Control (MPC) loop. CineMPC controls both the position and orientation of the camera -- the camera extrinsics -- as well as the lens focal length, focal distance, and aperture -- the camera intrinsics. While some existing solutions autonomously control the position and orientation of th…

    Submitted 22 February, 2022; v1 submitted 8 April, 2021; originally announced April 2021.

  16. arXiv:2103.02303

    cs.CV

    Domain and View-point Agnostic Hand Action Recognition

    Authors: Alberto Sabater, Iñigo Alonso, Luis Montesano, Ana C. Murillo

    Abstract: Hand action recognition is a special case of action recognition with applications in human-robot interaction, virtual reality or life-logging systems. Building action classifiers able to work for such heterogeneous action domains is very challenging. There are very subtle changes across different actions from a given application but also large variations across domains (e.g. virtual reality vs lif…

    Submitted 7 October, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  17. arXiv:2102.08997

    cs.CV

    One-shot action recognition in challenging therapy scenarios

    Authors: Alberto Sabater, Laura Santos, Jose Santos-Victor, Alexandre Bernardino, Luis Montesano, Ana C. Murillo

    Abstract: One-shot action recognition aims to recognize new action categories from a single reference example, typically referred to as the anchor example. This work presents a novel approach for one-shot action recognition in the wild that computes motion representations robust to variable kinematic conditions. One-shot action recognition is then performed by evaluating anchor and target motion representat…

    Submitted 29 July, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

  18. arXiv:2010.13701

    cs.CV cs.MA

    Distributed Multi-Target Tracking in Camera Networks

    Authors: Sara Casao, Abel Naya, Ana C. Murillo, Eduardo Montijano

    Abstract: Most recent works on multi-target tracking with multiple cameras focus on centralized systems. In contrast, this paper presents a multi-target tracking approach implemented in a distributed camera network. The advantages of distributed systems lie in lighter communication management, greater robustness to failures and local decision making. On the other hand, data association and information fusio…

    Submitted 16 April, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

  19. arXiv:2010.12239

    cs.CV

    Domain Adaptation in LiDAR Semantic Segmentation by Aligning Class Distributions

    Authors: Inigo Alonso, Luis Riazuelo, Luis Montesano, Ana C. Murillo

    Abstract: LiDAR semantic segmentation provides 3D semantic information about the environment, an essential cue for intelligent systems during their decision making processes. Deep neural networks are achieving state-of-the-art results on large public benchmarks on this task. Unfortunately, finding models that generalize well or adapt to additional domains, where data distribution is different, remains a maj…

    Submitted 3 December, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: 7 pages, 3 figures

  20. arXiv:2009.11050

    cs.CV cs.LG

    Robust and efficient post-processing for video object detection

    Authors: Alberto Sabater, Luis Montesano, Ana C. Murillo

    Abstract: Object recognition in video is an important task for plenty of applications, including autonomous driving perception, surveillance tasks, wearable devices or IoT networks. Object recognition using video data is more challenging than using still images due to blur, occlusions or rare object poses. Specific video detectors with high computational cost or standard image detectors together with a fast…

    Submitted 23 September, 2020; originally announced September 2020.

    Comments: Submitted to the International Conference on Intelligent Robots and Systems, IROS 2020

  21. Performance of object recognition in wearable videos

    Authors: Alberto Sabater, Luis Montesano, Ana C. Murillo

    Abstract: Wearable technologies are enabling plenty of new applications of computer vision, from life logging to health assistance. Many of them are required to recognize the elements of interest in the scene captured by the camera. This work studies the problem of object detection and localization on videos captured by this type of camera. Wearable videos are a much more challenging scenario for object det…

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: Emerging Technologies and Factory Automation, ETFA, 2019

  22. arXiv:2002.10893

    cs.CV

    3D-MiniNet: Learning a 2D Representation from Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation

    Authors: Iñigo Alonso, Luis Riazuelo, Luis Montesano, Ana C. Murillo

    Abstract: LIDAR semantic segmentation, which assigns a semantic label to each 3D point measured by the LIDAR, is becoming an essential task for many robotic applications such as autonomous driving. Fast and efficient semantic segmentation methods are needed to match the strong computational and temporal restrictions of many of these real-world applications. This work presents 3D-MiniNet, a novel approach…

    Submitted 27 April, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: 8 pages, 4 figures

  23. arXiv:1912.09614

    cs.LG stat.ML

    Features or Shape? Tackling the False Dichotomy of Time Series Classification

    Authors: Sara Alaee, Alireza Abdoli, Christian Shelton, Amy C. Murillo, Alec C. Gerry, Eamonn Keogh

    Abstract: Time series classification is an important task in its own right, and it is often a precursor to further downstream analytics. To date, virtually all works in the literature have used either shape-based classification using a distance measure or feature-based classification after finding some suitable features for the domain. It seems to be underappreciated that in many datasets it is the case tha…

    Submitted 19 December, 2019; originally announced December 2019.

  24. Time Series Classification: Lessons Learned in the (Literal) Field while Studying Chicken Behavior

    Authors: Alireza Abdoli, Amy C. Murillo, Alec C. Gerry, Eamonn J. Keogh

    Abstract: Poultry farms are a major contributor to the human food chain. However, around the world, there have been growing concerns about the quality of life for the livestock in poultry farms; and increasingly vocal demands for improved standards of animal welfare. Recent advances in sensing technologies and machine learning allow the possibility of monitoring birds, and employing the lessons learned to i…

    Submitted 20 December, 2019; v1 submitted 21 November, 2019; originally announced December 2019.

    Comments: arXiv admin note: text overlap with arXiv:1811.03149

  25. arXiv:1811.12039

    cs.CV

    EV-SegNet: Semantic Segmentation for Event-based Cameras

    Authors: Iñigo Alonso, Ana C. Murillo

    Abstract: Event cameras, or Dynamic Vision Sensor (DVS), are very promising sensors which have shown several advantages over frame based cameras. However, most recent work on real applications of these cameras is focused on 3D reconstruction and 6-DOF camera tracking. Deep learning based approaches, which are leading the state-of-the-art in visual recognition tasks, could potentially take advantage of the b…

    Submitted 29 November, 2018; originally announced November 2018.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2019

  26. arXiv:1811.03149

    cs.LG stat.ML

    Time Series Classification to Improve Poultry Welfare

    Authors: Alireza Abdoli, Amy C. Murillo, Chin-Chia M. Yeh, Alec C. Gerry, Eamonn J. Keogh

    Abstract: Poultry farms are an important contributor to the human food chain. Worldwide, humankind keeps an enormous number of domesticated birds (e.g. chickens) for their eggs and their meat, providing rich sources of low-fat protein. However, around the world, there have been growing concerns about the quality of life for the livestock in poultry farms; and increasingly vocal demands for improved standard…

    Submitted 7 November, 2018; originally announced November 2018.