Skip to main content

Showing 1–11 of 11 results for author: Engin, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.09252  [pdf, other

    cs.CV

    FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection

    Authors: Hongsuk Choi, Isaac Kasahara, Selim Engin, Moritz Graule, Nikhil Chavan-Dafle, Volkan Isler

    Abstract: Recently introduced ControlNet has the ability to steer the text-driven image generation process with geometric input such as human 2D pose, or edge features. While ControlNet provides control over the geometric form of the instances in the generated image, it lacks the capability to dictate the visual appearance of each instance. We present FineControlNet to provide fine control over each instanc… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Hongsuk Choi and Isaac Kasahara have eqaul contributions. 19 pages, 15 figures, 3 tables

  2. arXiv:2311.04783  [pdf, other

    cs.CV

    VioLA: Aligning Videos to 2D LiDAR Scans

    Authors: Jun-Jee Chao, Selim Engin, Nikhil Chavan-Dafle, Bhoram Lee, Volkan Isler

    Abstract: We study the problem of aligning a video that captures a local portion of an environment to the 2D LiDAR scan of the entire environment. We introduce a method (VioLA) that starts with building a semantic map of the local scene from the image sequence, then extracts points at a fixed height for registering to the LiDAR map. Due to reconstruction errors or partial coverage of the camera scan, the re… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 8 pages

  3. arXiv:2307.11932  [pdf, other

    cs.CV

    RIC: Rotate-Inpaint-Complete for Generalizable Scene Reconstruction

    Authors: Isaac Kasahara, Shubham Agrawal, Selim Engin, Nikhil Chavan-Dafle, Shuran Song, Volkan Isler

    Abstract: General scene reconstruction refers to the task of estimating the full 3D geometry and texture of a scene containing previously unseen objects. In many practical applications such as AR/VR, autonomous navigation, and robotics, only a single view of the scene may be available, making the scene reconstruction task challenging. In this paper, we present a method for scene reconstruction by structural… ▽ More

    Submitted 4 October, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  4. arXiv:2305.09510  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Real-time Simultaneous Multi-Object 3D Shape Reconstruction, 6DoF Pose Estimation and Dense Grasp Prediction

    Authors: Shubham Agrawal, Nikhil Chavan-Dafle, Isaac Kasahara, Selim Engin, **wook Huh, Volkan Isler

    Abstract: Robotic manipulation systems operating in complex environments rely on perception systems that provide information about the geometry (pose and 3D shape) of the objects in the scene along with other semantic information such as object labels. This information is then used for choosing the feasible grasps on relevant objects. In this paper, we present a novel method to provide this geometric and se… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    ACM Class: I.4.5; I.4.8; I.4.10; I.2.9; I.2.10; I.6.3

  5. arXiv:2302.09846  [pdf, other

    cs.RO

    Neural Optimal Control using Learned System Dynamics

    Authors: Selim Engin, Volkan Isler

    Abstract: We study the problem of generating control laws for systems with unknown dynamics. Our approach is to represent the controller and the value function with neural networks, and to train them using loss functions adapted from the Hamilton-Jacobi-Bellman (HJB) equations. In the absence of a known dynamics model, our method first learns the state transitions from data collected by interacting with the… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  6. arXiv:2209.14419  [pdf, other

    cs.CV

    Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences

    Authors: Jun-Jee Chao, Selim Engin, Nicolai Häni, Volkan Isler

    Abstract: Correspondence search is an essential step in rigid point cloud registration algorithms. Most methods maintain a single correspondence at each step and gradually remove wrong correspondances. However, building one-to-one correspondence with hard assignments is extremely difficult, especially when matching two point clouds with many locally similar features. This paper proposes an optimization meth… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 8 pages

  7. arXiv:2007.15627  [pdf, other

    cs.CV

    Continuous Object Representation Networks: Novel View Synthesis without Target View Supervision

    Authors: Nicolai Häni, Selim Engin, Jun-Jee Chao, Volkan Isler

    Abstract: Novel View Synthesis (NVS) is concerned with synthesizing views under camera viewpoint transformations from one or multiple input images. NVS requires explicit reasoning about 3D object structure and unseen parts of the scene to synthesize convincing results. As a result, current approaches typically rely on supervised training with either ground truth 3D models or multiple target images. We propo… ▽ More

    Submitted 23 October, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: To appear at Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  8. arXiv:2002.09850  [pdf, other

    cs.RO

    Active localization of multiple targets using noisy relative measurements

    Authors: Selim Engin, Volkan Isler

    Abstract: Consider a mobile robot tasked with localizing targets at unknown locations by obtaining relative measurements. The observations can be bearing or range measurements. How should the robot move so as to localize the targets and minimize the uncertainty in their locations as quickly as possible? Most existing approaches are either greedy in nature or rely on accurate initial estimates. We formulat… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

    Comments: 8 pages, 5 figures

  9. arXiv:1910.02066  [pdf, other

    cs.RO cs.CV

    Higher Order Function Networks for View Planning and Multi-View Reconstruction

    Authors: Selim Engin, Eric Mitchell, Daewon Lee, Volkan Isler, Daniel D. Lee

    Abstract: We consider the problem of planning views for a robot to acquire images of an object for visual inspection and reconstruction. In contrast to offline methods which require a 3D model of the object as input or online methods which rely on only local measurements, our method uses a neural network which encodes shape information for a large number of objects. We build on recent deep learning methods… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: 7 pages, 6 figures

  10. Asynchronous Network Formation in Unknown Unbounded Environments

    Authors: Selim Engin, Volkan Isler

    Abstract: In this paper, we study the Online Network Formation Problem (ONFP) for a mobile multi-robot system. Consider a group of robots with a bounded communication range operating in a large open area. One of the robots has a piece of information which has to be propagated to all other robots. What strategy should the robots pursue to disseminate the information to the rest of the robots as quickly as po… ▽ More

    Submitted 2 August, 2019; originally announced August 2019.

  11. arXiv:1907.10388  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    Higher-Order Function Networks for Learning Composable 3D Object Representations

    Authors: Eric Mitchell, Selim Engin, Volkan Isler, Daniel D Lee

    Abstract: We present a new approach to 3D object representation where a neural network encodes the geometry of an object directly into the weights and biases of a second 'map**' network. This map** network can be used to reconstruct an object by applying its encoded transformation to points randomly sampled from a simple geometric space, such as the unit sphere. We study the effectiveness of our method… ▽ More

    Submitted 6 April, 2020; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: To be published in International Conference on Learning Representations (ICLR 2020) [https://openreview.net/forum?id=HJgfDREKDB]; 19 pages