Skip to main content

Showing 1–50 of 62 results for author: Isler, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16358  [pdf, other

    cs.RO

    Neural L1 Adaptive Control of Vehicle Lateral Dynamics

    Authors: Pratik Mukherjee, Burak M. Gonultas, O. Goktug Poyrazoglu, Volkan Isler

    Abstract: We address the problem of stable and robust control of vehicles with lateral error dynamics for the application of lane kee**. Lane departure is the primary reason for half of the fatalities in road accidents, making the development of stable, adaptive and robust controllers a necessity. Traditional linear feedback controllers achieve satisfactory tracking performance, however, they exhibit unst… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2405.05372  [pdf, other

    cs.RO

    Learning to Play Pursuit-Evasion with Dynamic and Sensor Constraints

    Authors: Burak M. Gonultas, Volkan Isler

    Abstract: We present a multi-agent reinforcement learning approach to solve a pursuit-evasion game between two players with car-like dynamics and sensing limitations. We develop a curriculum for an existing multi-agent deterministic policy gradient algorithm to simultaneously obtain strategies for both players, and deploy the learned strategies on real robots moving as fast as 2 m/s in indoor environments.… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  3. arXiv:2403.13294  [pdf, other

    cs.RO

    Map-Aware Human Pose Prediction for Robot Follow-Ahead

    Authors: Qingyuan Jiang, Burak Susam, Jun-Jee Chao, Volkan Isler

    Abstract: In the robot follow-ahead task, a mobile robot is tasked to maintain its relative position in front of a moving human actor while kee** the actor in sight. To accomplish this task, it is important that the robot understand the full 3D pose of the human (since the head orientation can be different than the torso) and predict future human poses so as to plan accordingly. This prediction task is es… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  4. arXiv:2312.09252  [pdf, other

    cs.CV

    FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection

    Authors: Hongsuk Choi, Isaac Kasahara, Selim Engin, Moritz Graule, Nikhil Chavan-Dafle, Volkan Isler

    Abstract: Recently introduced ControlNet has the ability to steer the text-driven image generation process with geometric input such as human 2D pose, or edge features. While ControlNet provides control over the geometric form of the instances in the generated image, it lacks the capability to dictate the visual appearance of each instance. We present FineControlNet to provide fine control over each instanc… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Hongsuk Choi and Isaac Kasahara have eqaul contributions. 19 pages, 15 figures, 3 tables

  5. arXiv:2311.04783  [pdf, other

    cs.CV

    VioLA: Aligning Videos to 2D LiDAR Scans

    Authors: Jun-Jee Chao, Selim Engin, Nikhil Chavan-Dafle, Bhoram Lee, Volkan Isler

    Abstract: We study the problem of aligning a video that captures a local portion of an environment to the 2D LiDAR scan of the entire environment. We introduce a method (VioLA) that starts with building a semantic map of the local scene from the image sequence, then extracts points at a fixed height for registering to the LiDAR map. Due to reconstruction errors or partial coverage of the camera scan, the re… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 8 pages

  6. arXiv:2310.20034  [pdf, other

    cs.RO

    GG-LLM: Geometrically Grounding Large Language Models for Zero-shot Human Activity Forecasting in Human-Aware Task Planning

    Authors: Moritz A. Graule, Volkan Isler

    Abstract: A robot in a human-centric environment needs to account for the human's intent and future motion in its task and motion planning to ensure safe and effective operation. This requires symbolic reasoning about probable future actions and the ability to tie these actions to specific locations in the physical environment. While one can train behavioral models capable of predicting human motion from pa… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  7. arXiv:2310.18473  [pdf, other

    cs.RO

    Pouring by Feel: An Analysis of Tactile and Proprioceptive Sensing for Accurate Pouring

    Authors: Pedro Piacenza, Daewon Lee, Volkan Isler

    Abstract: As service robots begin to be deployed to assist humans, it is important for them to be able to perform a skill as ubiquitous as pouring. Specifically, we focus on the task of pouring an exact amount of water without any environmental instrumentation, that is, using only the robot's own sensors to perform this task in a general way robustly. In our approach we use a simple PID controller which use… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  8. arXiv:2310.18459  [pdf, other

    cs.RO

    VFAS-Grasp: Closed Loop Gras** with Visual Feedback and Adaptive Sampling

    Authors: Pedro Piacenza, Jiacheng Yuan, **wook Huh, Volkan Isler

    Abstract: We consider the problem of closed-loop robotic gras** and present a novel planner which uses Visual Feedback and an uncertainty-aware Adaptive Sampling strategy (VFAS) to close the loop. At each iteration, our method VFAS-Grasp builds a set of candidate grasps by generating random perturbations of a seed grasp. The candidates are then scored using a novel metric which combines a learned grasp-qu… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  9. arXiv:2310.09463  [pdf, other

    cs.RO cs.AI

    HIO-SDF: Hierarchical Incremental Online Signed Distance Fields

    Authors: Vasileios Vasilopoulos, Suveer Garg, **wook Huh, Bhoram Lee, Volkan Isler

    Abstract: A good representation of a large, complex mobile robot workspace must be space-efficient yet capable of encoding relevant geometric details. When exploring unknown environments, it needs to be updatable incrementally in an online fashion. We introduce HIO-SDF, a new method that represents the environment as a Signed Distance Field (SDF). State of the art representations of SDFs are based on either… ▽ More

    Submitted 3 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: IEEE International Conference on Robotics and Automation (ICRA 2024) - 7 pages, 7 figures

  10. arXiv:2309.07891  [pdf, other

    cs.CV

    HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image

    Authors: Hongsuk Choi, Nikhil Chavan-Dafle, Jiacheng Yuan, Volkan Isler, Hyunsoo Park

    Abstract: This paper presents a method to learn hand-object interaction prior for reconstructing a 3D hand-object scene from a single RGB image. The inference as well as training-data generation for 3D hand-object scene reconstruction is challenging due to the depth ambiguity of a single image and occlusions by the hand and object. We turn this challenge into an opportunity by utilizing the hand shape to co… ▽ More

    Submitted 11 February, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: In ICRA 2024; 13 pages including the supplementary material, 8 tables, 12 figures

  11. arXiv:2308.03898  [pdf, other

    cs.RO

    System Identification and Control of Front-Steered Ackermann Vehicles through Differentiable Physics

    Authors: Burak M. Gonultas, Pratik Mukherjee, O. Goktug Poyrazoglu, Volkan Isler

    Abstract: In this paper, we address the problem of system identification and control of a front-steered vehicle which abides by the Ackermann geometry constraints. This problem arises naturally for on-road and off-road vehicles that require reliable system identification and basic feedback controllers for various applications such as lane kee** and way-point navigation. Traditional system identification r… ▽ More

    Submitted 8 November, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: Accepted for IROS 2023

  12. arXiv:2308.00134  [pdf, other

    cs.RO

    Onboard View Planning of a Flying Camera for High Fidelity 3D Reconstruction of a Moving Actor

    Authors: Qingyuan Jiang, Volkan Isler

    Abstract: Capturing and reconstructing a human actor's motion is important for filmmaking and gaming. Currently, motion capture systems with static cameras are used for pixel-level high-fidelity reconstructions. Such setups are costly, require installation and calibration and, more importantly, confine the user to a predetermined area. In this work, we present a drone-based motion capture system that can al… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

  13. arXiv:2307.11932  [pdf, other

    cs.CV

    RIC: Rotate-Inpaint-Complete for Generalizable Scene Reconstruction

    Authors: Isaac Kasahara, Shubham Agrawal, Selim Engin, Nikhil Chavan-Dafle, Shuran Song, Volkan Isler

    Abstract: General scene reconstruction refers to the task of estimating the full 3D geometry and texture of a scene containing previously unseen objects. In many practical applications such as AR/VR, autonomous navigation, and robotics, only a single view of the scene may be available, making the scene reconstruction task challenging. In this paper, we present a method for scene reconstruction by structural… ▽ More

    Submitted 4 October, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  14. arXiv:2305.10534  [pdf, other

    cs.RO eess.SY

    RAMP: Hierarchical Reactive Motion Planning for Manipulation Tasks Using Implicit Signed Distance Functions

    Authors: Vasileios Vasilopoulos, Suveer Garg, Pedro Piacenza, **wook Huh, Volkan Isler

    Abstract: We introduce Reactive Action and Motion Planner (RAMP), which combines the strengths of sampling-based and reactive approaches for motion planning. In essence, RAMP is a hierarchical approach where a novel variant of a Model Predictive Path Integral (MPPI) controller is used to generate trajectories which are then followed asynchronously by a local vector field controller. We demonstrate, in the c… ▽ More

    Submitted 31 July, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023) - 8 pages, 6 figures

  15. arXiv:2305.09510  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Real-time Simultaneous Multi-Object 3D Shape Reconstruction, 6DoF Pose Estimation and Dense Grasp Prediction

    Authors: Shubham Agrawal, Nikhil Chavan-Dafle, Isaac Kasahara, Selim Engin, **wook Huh, Volkan Isler

    Abstract: Robotic manipulation systems operating in complex environments rely on perception systems that provide information about the geometry (pose and 3D shape) of the objects in the scene along with other semantic information such as object labels. This information is then used for choosing the feasible grasps on relevant objects. In this paper, we present a novel method to provide this geometric and se… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    ACM Class: I.4.5; I.4.8; I.4.10; I.2.9; I.2.10; I.6.3

  16. EV-Catcher: High-Speed Object Catching Using Low-latency Event-based Neural Networks

    Authors: Ziyun Wang, Fernando Cladera Ojeda, Anthony Bisulco, Daewon Lee, Camillo J. Taylor, Kostas Daniilidis, M. Ani Hsieh, Daniel D. Lee, Volkan Isler

    Abstract: Event-based sensors have recently drawn increasing interest in robotic perception due to their lower latency, higher dynamic range, and lower bandwidth requirements compared to standard CMOS-based imagers. These properties make them ideal tools for real-time perception tasks in highly dynamic environments. In this work, we demonstrate an application where event cameras excel: accurately estimating… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: 8 pages, 6 figures, IEEE Robotics and Automation Letters ( Volume: 7, Issue: 4, October 2022)

  17. arXiv:2304.04100  [pdf, other

    cs.RO

    Pick2Place: Task-aware 6DoF Grasp Estimation via Object-Centric Perspective Affordance

    Authors: Zhanpeng He, Nikhil Chavan-Dafle, **wook Huh, Shuran Song, Volkan Isler

    Abstract: The choice of a grasp plays a critical role in the success of downstream manipulation tasks. Consider a task of placing an object in a cluttered scene; the majority of possible grasps may not be suitable for the desired placement. In this paper, we study the synergy between the picking and placing of an object in a cluttered scene to develop an algorithm for task-aware grasp estimation. We present… ▽ More

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: IEEE International Conference on Robotics and Automation 2023

  18. arXiv:2303.01010  [pdf, other

    cs.RO

    Active Mass Distribution Estimation from Tactile Feedback

    Authors: Jiacheng Yuan, Changhyun Choi, Ellad B. Tadmor, Volkan Isler

    Abstract: In this work, we present a method to estimate the mass distribution of a rigid object through robotic interactions and tactile feedback. This is a challenging problem because of the complexity of physical dynamics modeling and the action dependencies across the model parameters. We propose a sequential estimation strategy combined with a set of robot action selection rules based on the analytical… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  19. arXiv:2302.12883  [pdf, other

    cs.CV

    3D Surface Reconstruction in the Wild by Deforming Shape Priors from Synthetic Data

    Authors: Nicolai Häni, Jun-Jee Chao, Volkan Isler

    Abstract: Reconstructing the underlying 3D surface of an object from a single image is a challenging problem that has received extensive attention from the computer vision community. Many learning-based approaches tackle this problem by learning a 3D shape prior from either ground truth 3D data or multi-view observations. To achieve state-of-the-art results, these methods assume that the objects are specifi… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  20. arXiv:2302.09846  [pdf, other

    cs.RO

    Neural Optimal Control using Learned System Dynamics

    Authors: Selim Engin, Volkan Isler

    Abstract: We study the problem of generating control laws for systems with unknown dynamics. Our approach is to represent the controller and the value function with neural networks, and to train them using loss functions adapted from the Hamilton-Jacobi-Bellman (HJB) equations. In the absence of a known dynamics model, our method first learns the state transitions from data collected by interacting with the… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  21. arXiv:2212.06393  [pdf

    cs.RO

    Predicting Energy Consumption of Ground Robots On Uneven Terrains

    Authors: Minghan Wei, Volkan Isler

    Abstract: Optimizing energy consumption for robot navigation in fields requires energy-cost maps. However, obtaining such a map is still challenging, especially for large, uneven terrains. Physics-based energy models work for uniform, flat surfaces but do not generalize well to these terrains. Furthermore, slopes make the energy consumption at every location directional and add to the complexity of data col… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Journal ref: IEEE Robotics and Automation Letters, 2021

  22. arXiv:2209.14419  [pdf, other

    cs.CV

    Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences

    Authors: Jun-Jee Chao, Selim Engin, Nicolai Häni, Volkan Isler

    Abstract: Correspondence search is an essential step in rigid point cloud registration algorithms. Most methods maintain a single correspondence at each step and gradually remove wrong correspondances. However, building one-to-one correspondence with hard assignments is extremely difficult, especially when matching two point clouds with many locally similar features. This paper proposes an optimization meth… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 8 pages

  23. arXiv:2209.05432  [pdf, other

    cs.RO cs.AI cs.CV

    Self-supervised Wide Baseline Visual Servoing via 3D Equivariance

    Authors: **wook Huh, Jungseok Hong, Suveer Garg, Hyun Soo Park, Volkan Isler

    Abstract: One of the challenging input settings for visual servoing is when the initial and goal camera views are far apart. Such settings are difficult because the wide baseline can cause drastic changes in object appearance and cause occlusions. This paper presents a novel self-supervised visual servoing method for wide baseline images which does not require 3D ground truth supervision. Existing approache… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: Accepted at the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  24. Apple Counting using Convolutional Neural Networks

    Authors: Nicolai Häni, Pravakar Roy, Volkan Isler

    Abstract: Estimating accurate and reliable fruit and vegetable counts from images in real-world settings, such as orchards, is a challenging problem that has received significant recent attention. Estimating fruit counts before harvest provides useful information for logistics planning. While considerable progress has been made toward fruit detection, estimating the actual counts remains challenging. In pra… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Journal ref: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  25. Visual Servoing in Orchard Settings

    Authors: Nicolai Häni, Volkan Isler

    Abstract: We present a general framework for accurate positioning of sensors and end effectors in farm settings using a camera mounted on a robotic manipulator. Our main contribution is a visual servoing approach based on a new and robust feature tracking algorithm. Results from field experiments performed at an apple orchard demonstrate that our approach converges to a given termination criterion even unde… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Journal ref: In 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 2946-2953)

  26. arXiv:2112.00216  [pdf, other

    cs.CV cs.SD eess.AS

    PoseKernelLifter: Metric Lifting of 3D Human Pose using Sound

    Authors: Zhijian Yang, Xiaoran Fan, Volkan Isler, Hyun Soo Park

    Abstract: Reconstructing the 3D pose of a person in metric scale from a single view image is a geometrically ill-posed problem. For example, we can not measure the exact distance of a person to the camera from a single view image without additional scene assumptions (e.g., known height). Existing learning based approaches circumvent this issue by reconstructing the 3D pose up to scale. However, there are ma… ▽ More

    Submitted 2 December, 2021; v1 submitted 30 November, 2021; originally announced December 2021.

  27. arXiv:2111.10462  [pdf, other

    cs.RO

    Online Coverage Planning for an Autonomous Weed Mowing Robot with Curvature Constraints

    Authors: Parikshit Maini, Burak M. Gonultas, Volkan Isler

    Abstract: The land used for grazing cattle takes up about one-third of the land in the United States. These areas can be highly rugged. Yet, they need to be maintained to prevent weeds from taking over the nutritious grassland. This can be a daunting task especially in the case of organic farming since herbicides cannot be used. In this paper, we present the design of Cowbot, an autonomous weed mowing robot… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

  28. arXiv:2109.07134  [pdf, other

    cs.RO

    ROW-SLAM: Under-Canopy Cornfield Semantic SLAM

    Authors: Jiacheng Yuan, Jungseok Hong, Junaed Sattar, Volkan Isler

    Abstract: We study a semantic SLAM problem faced by a robot tasked with autonomous weeding under the corn canopy. The goal is to detect corn stalks and localize them in a global coordinate frame. This is a challenging setup for existing algorithms because there is very little space between the camera and the plants, and the camera motion is primarily restricted to be along the row. To overcome these challen… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: 7 pages, 6 figures

  29. arXiv:2109.06837  [pdf, other

    cs.RO

    Simultaneous Object Reconstruction and Grasp Prediction using a Camera-centric Object Shell Representation

    Authors: Nikhil Chavan-Dafle, Sergiy Popovych, Shubham Agrawal, Daniel D. Lee, Volkan Isler

    Abstract: Being able to grasp objects is a fundamental component of most robotic manipulation systems. In this paper, we present a new approach to simultaneously reconstruct a mesh and a dense grasp quality map of an object from a depth image. At the core of our approach is a novel camera-centric object representation called the "object shell" which is composed of an observed "entry image" and a predicted "… ▽ More

    Submitted 19 December, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: 18 pages, 12 figures, 8 tables

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  30. arXiv:2103.11168  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Continuous Cost-to-Go Functions for Non-holonomic Systems

    Authors: **wook Huh, Daniel D. Lee, Volkan Isler

    Abstract: This paper presents a supervised learning method to generate continuous cost-to-go functions of non-holonomic systems directly from the workspace description. Supervision from informative examples reduces training time and improves network performance. The manifold representing the optimal trajectories of a non-holonomic system has high-curvature regions which can not be efficiently captured with… ▽ More

    Submitted 20 March, 2021; originally announced March 2021.

  31. arXiv:2101.05212  [pdf, other

    cs.CV cs.RO

    Ellipse Regression with Predicted Uncertainties for Accurate Multi-View 3D Object Estimation

    Authors: Wenbo Dong, Volkan Isler

    Abstract: Convolutional neural network (CNN) based architectures, such as Mask R-CNN, constitute the state of the art in object detection and segmentation. Recently, these methods have been extended for model-based segmentation where the network outputs the parameters of a geometric model (e.g. an ellipse) directly. This work considers objects whose three-dimensional models can be represented as ellipsoids.… ▽ More

    Submitted 27 December, 2020; originally announced January 2021.

    Comments: 9 pages, 9 figures

  32. arXiv:2012.06023  [pdf, other

    cs.RO cs.AI cs.LG

    Cost-to-Go Function Generating Networks for High Dimensional Motion Planning

    Authors: **wook Huh, Volkan Isler, Daniel D. Lee

    Abstract: This paper presents c2g-HOF networks which learn to generate cost-to-go functions for manipulator motion planning. The c2g-HOF architecture consists of a cost-to-go function over the configuration space represented as a neural network (c2g-network) as well as a Higher Order Function (HOF) network which outputs the weights of the c2g-network for a given input workspace. Both networks are trained en… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

  33. arXiv:2011.09427  [pdf, other

    cs.CV cs.LG cs.RO

    Fast Motion Understanding with Spatiotemporal Neural Networks and Dynamic Vision Sensors

    Authors: Anthony Bisulco, Fernando Cladera Ojeda, Volkan Isler, Daniel D. Lee

    Abstract: This paper presents a Dynamic Vision Sensor (DVS) based system for reasoning about high speed motion. As a representative scenario, we consider the case of a robot at rest reacting to a small, fast approaching object at speeds higher than 15m/s. Since conventional image sensors at typical frame rates observe such an object for only a few frames, estimating the underlying motion presents a consider… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Journal ref: International Conference on Robotics and Automation (ICRA) 2021

  34. arXiv:2011.08319  [pdf, other

    cs.RO

    Multi-Step Recurrent Q-Learning for Robotic Velcro Peeling

    Authors: Jiacheng Yuan, Nicolai Häni, Volkan Isler

    Abstract: Learning object manipulation is a critical skill for robots to interact with their environment. Even though there has been significant progress in robotic manipulation of rigid objects, interacting with non-rigid objects remains challenging for robots. In this work, we introduce velcro peeling as a representative application for robotic manipulation of non-rigid objects in complex environments. We… ▽ More

    Submitted 22 February, 2022; v1 submitted 16 November, 2020; originally announced November 2020.

  35. arXiv:2010.14597  [pdf, other

    cs.RO

    Learning to Generate Cost-to-Go Functions for Efficient Motion Planning

    Authors: **wook Huh, Galen Xing, Ziyun Wang, Volkan Isler, Daniel D. Lee

    Abstract: Traditional motion planning is computationally burdensome for practical robots, involving extensive collision checking and considerable iterative propagation of cost values. We present a novel neural network architecture which can directly generate the cost-to-go (c2g) function for a given configuration space and a goal configuration. The output of the network is a continuous function whose gradie… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

  36. arXiv:2007.15627  [pdf, other

    cs.CV

    Continuous Object Representation Networks: Novel View Synthesis without Target View Supervision

    Authors: Nicolai Häni, Selim Engin, Jun-Jee Chao, Volkan Isler

    Abstract: Novel View Synthesis (NVS) is concerned with synthesizing views under camera viewpoint transformations from one or multiple input images. NVS requires explicit reasoning about 3D object structure and unseen parts of the scene to synthesize convincing results. As a result, current approaches typically rely on supervised training with either ground truth 3D models or multiple target images. We propo… ▽ More

    Submitted 23 October, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: To appear at Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  37. arXiv:2006.07981  [pdf, other

    cs.CV

    Geodesic-HOF: 3D Reconstruction Without Cutting Corners

    Authors: Ziyun Wang, Eric A. Mitchell, Volkan Isler, Daniel D. Lee

    Abstract: Single-view 3D object reconstruction is a challenging fundamental problem in computer vision, largely due to the morphological diversity of objects in the natural world. In particular, high curvature regions are not always captured effectively by methods trained using only set-based loss functions, resulting in reconstructions short-circuiting the surface or cutting corners. In particular, high cu… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

  38. arXiv:2004.01689  [pdf, other

    cs.CV cs.AR cs.LG eess.IV

    Near-chip Dynamic Vision Filtering for Low-Bandwidth Pedestrian Detection

    Authors: Anthony Bisulco, Fernando Cladera Ojeda, Volkan Isler, Daniel D. Lee

    Abstract: This paper presents a novel end-to-end system for pedestrian detection using Dynamic Vision Sensors (DVSs). We target applications where multiple sensors transmit data to a local processing unit, which executes a detection algorithm. Our system is composed of (i) a near-chip event filter that compresses and denoises the event stream from the DVS, and (ii) a Binary Neural Network (BNN) detection mo… ▽ More

    Submitted 3 April, 2020; originally announced April 2020.

    Comments: 6 pages, 5 figures

  39. arXiv:2003.01649  [pdf, other

    cs.RO

    Robotic Gras** through Combined Image-Based Grasp Proposal and 3D Reconstruction

    Authors: Daniel Yang, Tarik Tosun, Ben Eisner, Volkan Isler, Daniel Lee

    Abstract: We present a novel approach to robotic grasp planning using both a learned grasp proposal network and a learned 3D shape reconstruction network. Our system generates 6-DOF grasps from a single RGB-D image of the target object, which is provided as input to both networks. By using the geometric reconstruction to refine the the candidate grasp produced by the grasp proposal network, our system is ab… ▽ More

    Submitted 6 November, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 7 pages, 7 figures

  40. arXiv:2002.09850  [pdf, other

    cs.RO

    Active localization of multiple targets using noisy relative measurements

    Authors: Selim Engin, Volkan Isler

    Abstract: Consider a mobile robot tasked with localizing targets at unknown locations by obtaining relative measurements. The observations can be bearing or range measurements. How should the robot move so as to localize the targets and minimize the uncertainty in their locations as quickly as possible? Most existing approaches are either greedy in nature or rely on accurate initial estimates. We formulat… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

    Comments: 8 pages, 5 figures

  41. Ellipse R-CNN: Learning to Infer Elliptical Object from Clustering and Occlusion

    Authors: Wenbo Dong, Pravakar Roy, Cheng Peng, Volkan Isler

    Abstract: Images of heavily occluded objects in cluttered scenes, such as fruit clusters in trees, are hard to segment. To further retrieve the 3D size and 6D pose of each individual object in such cases, bounding boxes are not reliable from multiple views since only a little portion of the object's geometry is captured. We introduce the first CNN-based ellipse detector, called Ellipse R-CNN, to represent a… ▽ More

    Submitted 14 November, 2020; v1 submitted 30 January, 2020; originally announced January 2020.

    Comments: 18 pages, 20 figures, 7 tables

  42. arXiv:1912.08852  [pdf, other

    cs.CV

    Surface HOF: Surface Reconstruction from a Single Image Using Higher Order Function Networks

    Authors: Ziyun Wang, Volkan Isler, Daniel D. Lee

    Abstract: We address the problem of generating a high-resolution surface reconstruction from a single image. Our approach is to learn a Higher Order Function (HOF) which takes an image of an object as input and generates a map** function. The map** function takes samples from a canonical domain (e.g. the unit sphere) and maps each sample to a local tangent plane on the 3D reconstruction of the object. E… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

  43. arXiv:1910.05766  [pdf, other

    cs.NI cs.LG

    QoS and Jamming-Aware Wireless Networking Using Deep Reinforcement Learning

    Authors: Nof Abuzainab, Tugba Erpek, Kemal Davaslioglu, Yalin E. Sagduyu, Yi Shi, Sharon J. Mackey, Mitesh Patel, Frank Panettieri, Muhammad A. Qureshi, Volkan Isler, Aylin Yener

    Abstract: The problem of quality of service (QoS) and jamming-aware communications is considered in an adversarial wireless network subject to external eavesdrop** and jamming attacks. To ensure robust communication against jamming, an interference-aware routing protocol is developed that allows nodes to avoid communication holes created by jamming attacks. Then, a distributed cooperation framework, based… ▽ More

    Submitted 13 October, 2019; originally announced October 2019.

  44. arXiv:1910.02066  [pdf, other

    cs.RO cs.CV

    Higher Order Function Networks for View Planning and Multi-View Reconstruction

    Authors: Selim Engin, Eric Mitchell, Daewon Lee, Volkan Isler, Daniel D. Lee

    Abstract: We consider the problem of planning views for a robot to acquire images of an object for visual inspection and reconstruction. In contrast to offline methods which require a 3D model of the object as input or online methods which rely on only local measurements, our method uses a neural network which encodes shape information for a large number of objects. We build on recent deep learning methods… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: 7 pages, 6 figures

  45. MinneApple: A Benchmark Dataset for Apple Detection and Segmentation

    Authors: Nicolai Häni, Pravakar Roy, Volkan Isler

    Abstract: In this work, we present a new dataset to advance the state-of-the-art in fruit detection, segmentation, and counting in orchard environments. While there has been significant recent interest in solving these problems, the lack of a unified dataset has made it difficult to compare results. We hope to enable direct comparisons by providing a large variety of high-resolution images acquired in orcha… ▽ More

    Submitted 3 January, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

  46. Asynchronous Network Formation in Unknown Unbounded Environments

    Authors: Selim Engin, Volkan Isler

    Abstract: In this paper, we study the Online Network Formation Problem (ONFP) for a mobile multi-robot system. Consider a group of robots with a bounded communication range operating in a large open area. One of the robots has a piece of information which has to be propagated to all other robots. What strategy should the robots pursue to disseminate the information to the rest of the robots as quickly as po… ▽ More

    Submitted 2 August, 2019; originally announced August 2019.

  47. arXiv:1907.10388  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    Higher-Order Function Networks for Learning Composable 3D Object Representations

    Authors: Eric Mitchell, Selim Engin, Volkan Isler, Daniel D Lee

    Abstract: We present a new approach to 3D object representation where a neural network encodes the geometry of an object directly into the weights and biases of a second 'map**' network. This map** network can be used to reconstruct an object by applying its encoded transformation to points randomly sampled from a simple geometric space, such as the unit sphere. We study the effectiveness of our method… ▽ More

    Submitted 6 April, 2020; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: To be published in International Conference on Learning Representations (ICLR 2020) [https://openreview.net/forum?id=HJgfDREKDB]; 19 pages

  48. arXiv:1907.06337  [pdf, other

    cs.RO

    Energy-efficient Path Planning for Ground Robots by Combining Air and Ground Measurements

    Authors: Minghan Wei, Volkan Isler

    Abstract: As mobile robots find increasing use in outdoor applications, designing energy-efficient robot navigation algorithms is gaining importance. There are two primary approaches to energy efficient navigation: Offline approaches rely on a previously built energy map as input to a path planner. Obtaining energy maps for large environments is challenging. Alternatively, the robot can navigate in an onlin… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

  49. arXiv:1904.03260  [pdf, other

    cs.RO

    Pixels to Plans: Learning Non-Prehensile Manipulation by Imitating a Planner

    Authors: Tarik Tosun, Eric Mitchell, Ben Eisner, **wook Huh, Bhoram Lee, Daewon Lee, Volkan Isler, H. Sebastian Seung, Daniel Lee

    Abstract: We present a novel method enabling robots to quickly learn to manipulate objects by leveraging a motion planner to generate "expert" training trajectories from a small amount of human-labeled data. In contrast to the traditional sense-plan-act cycle, we propose a deep learning architecture and training regimen called PtPNet that can estimate effective end-effector trajectories for manipulation dir… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: 8 pages

  50. arXiv:1904.02203  [pdf, other

    cs.CV

    Semantics-Aware Image to Image Translation and Domain Transfer

    Authors: Pravakar Roy, Nicolai Häni, Jun-Jee Chao, Volkan Isler

    Abstract: Image to image translation is the problem of transferring an image from a source domain to a different (but related) target domain. We present a new unsupervised image to image translation technique that leverages the underlying semantic information for object transfiguration and domain transfer tasks. Specifically, we present a generative adversarial learning approach that jointly translates imag… ▽ More

    Submitted 1 March, 2021; v1 submitted 3 April, 2019; originally announced April 2019.