
PuzzleFusion: Unleashing the Power of Diffusion Models for Spatial Puzzle Solving

Authors: Sepidehsadat Hosseini, Mohammad Amin Shabani, Saghar Irandoust, Yasutaka Furukawa

Abstract: This paper presents an end-to-end neural architecture based on Diffusion Models for spatial puzzle solving, particularly jigsaw puzzle and room arrangement tasks. In the latter task, for instance, the proposed system "PuzzleFusion" takes a set of room layouts as polygonal curves in the top-down view and aligns the room layout pieces by estimating their 2D translations and rotations, akin to solving the jigsaw puzzle of room layouts. A surprising discovery of the paper is that the simple use of a Diffusion Model effectively solves these challenging spatial puzzle tasks as a conditional generation process. To enable learning of an end-to-end neural system, the paper introduces new datasets with ground-truth arrangements: 1) 2D Voronoi jigsaw dataset, a synthetic one where pieces are generated by Voronoi diagram of 2D pointset; and 2) MagicPlan dataset, a real one offered by MagicPlan from its production pipeline, where pieces are room layouts constructed by augmented reality App by real-estate consumers. The qualitative and quantitative evaluations demonstrate that our approach outperforms the competing methods by significant margins in all the tasks.

Submitted 3 October, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

arXiv:2211.05187 [pdf, other]

Training a Vision Transformer from scratch in less than 24 hours with 1 GPU

Authors: Saghar Irandoust, Thibaut Durand, Yunduz Rakhmangulova, Wenjie Zi, Hossein Hajimirsadeghi

Abstract: Transformers have become central to recent advances in computer vision. However, training a vision Transformer (ViT) model from scratch can be resource intensive and time consuming. In this paper, we aim to explore approaches to reduce the training costs of ViT models. We introduce some algorithmic improvements to enable training a ViT model from scratch with limited hardware (1 GPU) and time (24 hours) resources. First, we propose an efficient approach to add locality to the ViT architecture. Second, we develop a new image size curriculum learning strategy, which allows to reduce the number of patches extracted from each image at the beginning of the training. Finally, we propose a new variant of the popular ImageNet1k benchmark by adding hardware and time constraints. We evaluate our contributions on this benchmark, and show they can significantly improve performances given the proposed training budget. We will share the code in https://github.com/BorealisAI/efficient-vit-training.

Submitted 9 November, 2022; originally announced November 2022.

Comments: 7 pages, 2 figures, 1 table, published in "Has it Trained Yet? Workshop at the Conference on Neural Information Processing Systems (NeurIPS 2022)"

ACM Class: I.2.10

arXiv:2105.08837 [pdf, other]

Fusion-DHL: WiFi, IMU, and Floorplan Fusion for Dense History of Locations in Indoor Environments

Authors: Sachini Herath, Saghar Irandoust, Bowen Chen, Yiming Qian, Pyojin Kim, Yasutaka Furukawa

Abstract: The paper proposes a multi-modal sensor fusion algorithm that fuses WiFi, IMU, and floorplan information to infer an accurate and dense location history in indoor environments. The algorithm uses 1) an inertial navigation algorithm to estimate a relative motion trajectory from IMU sensor data; 2) a WiFi-based localization API in industry to obtain positional constraints and geo-localize the trajectory; and 3) a convolutional neural network to refine the location history to be consistent with the floorplan. We have developed a data acquisition app to build a new dataset with WiFi, IMU, and floorplan data with ground-truth positions at 4 university buildings and 3 shopping malls. Our qualitative and quantitative evaluations demonstrate that the proposed system is able to produce twice as accurate and a few orders of magnitude denser location history than the current standard, while requiring minimal additional energy consumption. We will publicly share our code, data and models.

Submitted 18 May, 2021; originally announced May 2021.

Comments: To be published in ICRA 2021. Code and data: https://github.com/Sachini/Fusion-DHL

Journal ref: ICRA 2021

arXiv:1907.08277 [pdf]

doi 10.1038/s41598-020-60735-7

The interplay between tissue healing and bone remodeling around immediately loaded tooth replacement implants

Authors: Soroush Irandoust, Sinan Muftu

Abstract: Long-term bone healing/adaptation after a dental implant treatment starts with diffusion of mesenchymal stem cells to the fracture callus and their subsequent differentiation. The healing phase is followed by the bone-remodeling phase. In this work, a mechano-regulatory cellular differentiation model was used to simulate tissue healing around an immediately loaded dental implant. All tissue types were modeled as poroelastic in the healing phase. Material properties of the healing region were updated after each loading cycle for 30 cycles (days). The tissue distribution in the healed state was then used as the initial condition for the remodeling phase during which regions healed into bone adapt their internal density with respect to a homeostatic remodeling stimulus. The short- and long-term effects of micro-motion on bone healing and remodeling were studied. Development of soft tissue was observed both in the coronal region due to high fluid velocity, and on the vertical sides of the healing-callus due to high shear stress. In cases with small implant micromotion, tissue between the implant threads differentiated into bone during the healing phase, but resorbed during remodeling. In cases with large implant micromotion, higher percentage of the healing region differentiated into soft tissue resulting in less volume available for bone remodeling. But, the remaining bone region developed higher density bone tissue. It was concluded that an optimal range of controlled micromotion could be designed for a given implant in order to achieve the desired functional properties.

Submitted 18 July, 2019; originally announced July 2019.

Showing 1–4 of 4 results for author: Irandoust, S