Skip to main content

Showing 1–15 of 15 results for author: Armeni, I

.
  1. arXiv:2406.05849  [pdf, other

    cs.RO

    MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps

    Authors: Jianhao Zheng, Daniel Barath, Marc Pollefeys, Iro Armeni

    Abstract: Creating 3D semantic reconstructions of environments is fundamental to many applications, especially when related to autonomous agent operation (e.g., goal-oriented navigation or object interaction and manipulation). Commonly, 3D semantic reconstruction systems capture the entire scene in the same level of detail. However, certain tasks (e.g., object interaction) require a fine-grained and high-re… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  2. arXiv:2404.14565  [pdf, other

    cs.CV

    "Where am I?" Scene Retrieval with Language

    Authors: Jiaqi Chen, Daniel Barath, Iro Armeni, Marc Pollefeys, Hermann Blum

    Abstract: Natural language interfaces to embodied AI are becoming more ubiquitous in our daily lives. This opens further opportunities for language-based interaction with embodied agents, such as a user instructing an agent to execute some task in a specific location. For example, "put the bowls back in the cupboard next to the fridge" or "meet me at the intersection under the red sign." As such, we need me… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  3. arXiv:2404.02838  [pdf, other

    cs.AI

    I-Design: Personalized LLM Interior Designer

    Authors: Ata Çelen, Guo Han, Konrad Schindler, Luc Van Gool, Iro Armeni, Anton Obukhov, Xi Wang

    Abstract: Interior design allows us to be who we are and live how we want - each design is as unique as our distinct personality. However, it is not trivial for non-professionals to express and materialize this since it requires aligning functional and visual expectations with the constraints of physical space; this renders interior design a luxury. To make it more accessible, we present I-Design, a persona… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2404.00429  [pdf, other

    cs.CV

    Multiway Point Cloud Mosaicking with Diffusion and Global Optimization

    Authors: Shengze **, Iro Armeni, Marc Pollefeys, Daniel Barath

    Abstract: We introduce a novel framework for multiway point cloud mosaicking (named Wednesday), designed to co-align sets of partially overlap** point clouds -- typically obtained from 3D scanners or moving RGB-D cameras -- into a unified coordinate system. At the core of our approach is ODIN, a learned pairwise registration algorithm that iteratively identifies overlaps and refines attention scores, empl… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  5. arXiv:2312.09138  [pdf, other

    cs.CV

    Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments

    Authors: Liyuan Zhu, Shengyu Huang, Konrad Schindler, Iro Armeni

    Abstract: Research into dynamic 3D scene understanding has primarily focused on short-term change tracking from dense observations, while little attention has been paid to long-term changes with sparse observations. We address this gap with MoRE, a novel approach for multi-object relocalization and reconstruction in evolving environments. We view these environments as "living scenes" and consider the proble… ▽ More

    Submitted 26 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 camera-ready

  6. arXiv:2311.09346  [pdf, other

    cs.CV cs.LG cs.RO

    Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change

    Authors: Tao Sun, Yan Hao, Shengyu Huang, Silvio Savarese, Konrad Schindler, Marc Pollefeys, Iro Armeni

    Abstract: Building 3D geometric maps of man-made spaces is a well-established and active field that is fundamental to computer vision and robotics. However, considering the evolving nature of built environments, it is essential to question the capabilities of current map** efforts in handling temporal changes. In addition, spatiotemporal map** holds significant potential for achieving sustainability and… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 27 pages, 29 figures. For the project page, see http://nothing-stands-still.com

  7. arXiv:2309.16023  [pdf, other

    cs.CV

    Q-REG: End-to-End Trainable Point Cloud Registration with Surface Curvature

    Authors: Shengze **, Daniel Barath, Marc Pollefeys, Iro Armeni

    Abstract: Point cloud registration has seen recent success with several learning-based methods that focus on correspondence matching and, as such, optimize only for this objective. Following the learning step of correspondence matching, they evaluate the estimated rigid transformation with a RANSAC-like framework. While it is an indispensable component of these methods, it prevents a fully end-to-end traini… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  8. arXiv:2309.14737  [pdf, other

    cs.RO cs.CV

    Volumetric Semantically Consistent 3D Panoptic Map**

    Authors: Yang Miao, Iro Armeni, Marc Pollefeys, Daniel Barath

    Abstract: We introduce an online 2D-to-3D semantic instance map** algorithm aimed at generating comprehensive, accurate, and efficient semantic 3D maps suitable for autonomous agents in unstructured environments. The proposed approach is based on a Voxel-TSDF representation used in recent algorithms. It introduces novel ways of integrating semantic prediction confidence during map**, producing semantic… ▽ More

    Submitted 5 March, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 8 pages, 2 figures

  9. arXiv:2305.02398  [pdf, other

    cs.CV

    Learning-based Relational Object Matching Across Views

    Authors: Cathrin Elich, Iro Armeni, Martin R. Oswald, Marc Pollefeys, Joerg Stueckler

    Abstract: Intelligent robots require object-level scene understanding to reason about possible tasks and interactions with the environment. Moreover, many perception tasks such as scene reconstruction, image retrieval, or place recognition can benefit from reasoning on the level of objects. While keypoint-based matching can yield strong results for finding correspondences for images with small to medium vie… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted for publication in IEEE International Conference on Robotics and Automation (ICRA), 2023

    MSC Class: 68T45 ACM Class: I.2.10; I.4.8

  10. arXiv:2304.14880  [pdf, other

    cs.CV

    SGAligner : 3D Scene Alignment with Scene Graphs

    Authors: Sayan Deb Sarkar, Ondrej Miksik, Marc Pollefeys, Daniel Barath, Iro Armeni

    Abstract: Building 3D scene graphs has recently emerged as a topic in scene representation for several embodied AI applications to represent the world in a structured and rich manner. With their increased use in solving downstream tasks (eg, navigation and room rearrangement), can we leverage and recycle them for creating 3D maps of environments, a pivotal step in agent operation? We focus on the fundamenta… ▽ More

    Submitted 26 September, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

    Comments: Accepted at ICCV 2023

  11. ImpliCity: City Modeling from Satellite Images with Deep Implicit Occupancy Fields

    Authors: Corinne Stucker, Bingxin Ke, Yuanwen Yue, Shengyu Huang, Iro Armeni, Konrad Schindler

    Abstract: High-resolution optical satellite sensors, combined with dense stereo algorithms, have made it possible to reconstruct 3D city models from space. However, these models are, in practice, rather noisy and tend to miss small geometric features that are clearly visible in the images. We argue that one reason for the limited quality may be a too early, heuristic reduction of the triangulated 3D point c… ▽ More

    Submitted 6 May, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: Accepted for publication in the International Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (camera-ready version including keywords + supplementary material)

    Journal ref: ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., V-2-2022, 193-201, 2022

  12. arXiv:2011.06698  [pdf, other

    cs.RO cs.CV cs.LG

    Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation

    Authors: Bryan Chen, Alexander Sax, Gene Lewis, Iro Armeni, Silvio Savarese, Amir Zamir, Jitendra Malik, Lerrel Pinto

    Abstract: Vision-based robotics often separates the control loop into one module for perception and a separate module for control. It is possible to train the whole system end-to-end (e.g. with deep RL), but doing it "from scratch" comes with a high sample complexity cost and the final result is often brittle, failing unexpectedly if the test environment differs from that of training. We study the effects… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Extended version of CoRL 2020 camera ready. Supplementary released separately

  13. arXiv:1910.02527  [pdf, other

    cs.CV cs.RO

    3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

    Authors: Iro Armeni, Zhi-Yang He, JunYoung Gwak, Amir R. Zamir, Martin Fischer, Jitendra Malik, Silvio Savarese

    Abstract: A comprehensive semantic understanding of a scene is important for many applications - but in what space should diverse semantic information (e.g., objects, scene categories, material types, texture, etc.) be grounded and what should be its structure? Aspiring to have one unified structure that hosts diverse types of semantics, we follow the Scene Graph paradigm in 3D, generating a 3D Scene Graph.… ▽ More

    Submitted 6 October, 2019; originally announced October 2019.

    Comments: ICCV 2019

  14. arXiv:1710.07563  [pdf, other

    cs.CV

    SEGCloud: Semantic Segmentation of 3D Point Clouds

    Authors: Lyne P. Tchapmi, Christopher B. Choy, Iro Armeni, JunYoung Gwak, Silvio Savarese

    Abstract: 3D semantic scene labeling is fundamental to agents operating in the real world. In particular, labeling raw 3D point sets from sensors provides fine-grained semantics. Recent works leverage the capabilities of Neural Networks (NNs), but are limited to coarse voxel predictions and do not explicitly enforce global consistency. We present SEGCloud, an end-to-end framework to obtain 3D point-level se… ▽ More

    Submitted 20 October, 2017; originally announced October 2017.

    Comments: Accepted as a spotlight at the International Conference of 3D Vision (3DV 2017)

  15. arXiv:1702.01105  [pdf, other

    cs.CV cs.RO

    Joint 2D-3D-Semantic Data for Indoor Scene Understanding

    Authors: Iro Armeni, Sasha Sax, Amir R. Zamir, Silvio Savarese

    Abstract: We present a dataset of large-scale indoor spaces that provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotations. The dataset covers over 6,000m2 and contains over 70,000 RGB images, along with the corresponding depths, surface normals, semantic annotations, global XYZ images (all in forms of both regular and 360° equi… ▽ More

    Submitted 5 April, 2017; v1 submitted 3 February, 2017; originally announced February 2017.

    Comments: The dataset is available http://3Dsemantics.stanford.edu/