Make-A-Shape: a Ten-Million-scale 3D Shape Model

Authors: Ka-Hei Hui, Aditya Sanghi, Arianna Rampini, Kamal Rahimi Malekshan, Zhengzhe Liu, Hooman Shayani, Chi-Wing Fu

Abstract: Significant progress has been made in training large generative models for natural language and images. Yet, the advancement of 3D generative models is hindered by their substantial resource demands for training, along with inefficient, non-compact, and less expressive representations. This paper introduces Make-A-Shape, a new 3D generative model designed for efficient training on a vast scale, capable of utilizing 10 millions publicly-available shapes. Technical-wise, we first innovate a wavelet-tree representation to compactly encode shapes by formulating the subband coefficient filtering scheme to efficiently exploit coefficient relations. We then make the representation generatable by a diffusion model by devising the subband coefficients packing scheme to layout the representation in a low-resolution grid. Further, we derive the subband adaptive training strategy to train our model to effectively learn to generate coarse and detail wavelet coefficients. Last, we extend our framework to be controlled by additional input conditions to enable it to generate shapes from assorted modalities, e.g., single/multi-view images, point clouds, and low-resolution voxels. In our extensive set of experiments, we demonstrate various applications, such as unconditional generation, shape completion, and conditional generation on a wide range of modalities. Our approach not only surpasses the state of the art in delivering high-quality results but also efficiently generates shapes within a few seconds, often achieving this in just 2 seconds for most conditions.

Submitted 19 January, 2024; originally announced January 2024.

arXiv:2305.17252

Generalizable Pose Estimation Using Implicit Scene Representations

Authors: Vaibhav Saxena, Kamal Rahimi Malekshan, Linh Tran, Yotto Koga

Abstract: 6-DoF pose estimation is an essential component of robotic manipulation pipelines. However, it usually suffers from a lack of generalization to new instances and object types. Most widely used methods learn to infer the object pose in a discriminative setup where the model filters useful information to infer the exact pose of the object. While such methods offer accurate poses, the model does not store enough information to generalize to new objects. In this work, we address the generalization capability of pose estimation using models that contain enough information about the object to render it in different poses. We follow the line of work that inverts neural renderers to infer the pose. We propose i-$σ$SRN to maximize the information flowing from the input pose to the rendered scene and invert them to infer the pose given an input image. Specifically, we extend Scene Representation Networks (SRNs) by incorporating a separate network for density estimation and introduce a new way of obtaining a weighted scene representation. We investigate several ways of initial pose estimates and losses for the neural renderer. Our final evaluation shows a significant improvement in inference performance and speed compared to existing approaches.

Submitted 26 May, 2023; originally announced May 2023.

arXiv:2209.01161

doi 10.1145/3550469.3555424

Reconstructing editable prismatic CAD from rounded voxel models

Authors: Joseph G. Lambourne, Karl D. D. Willis, Pradeep Kumar Jayaraman, Longfei Zhang, Aditya Sanghi, Kamal Rahimi Malekshan

Abstract: Reverse Engineering a CAD shape from other representations is an important geometric processing step for many downstream applications. In this work, we introduce a novel neural network architecture to solve this challenging task and approximate a smoothed signed distance function with an editable, constrained, prismatic CAD model. During training, our method reconstructs the input geometry in the voxel space by decomposing the shape into a series of 2D profile images and 1D envelope functions. These can then be recombined in a differentiable way allowing a geometric loss function to be defined. During inference, we obtain the CAD data by first searching a database of 2D constrained sketches to find curves which approximate the profile images, then extrude them and use Boolean operations to build the final CAD model. Our method approximates the target shape more closely than other methods and outputs highly editable constrained parametric sketches which are compatible with existing CAD software.

Submitted 2 September, 2022; originally announced September 2022.

Comments: SIGGRAPH Asia 2022 Conference Paper

ACM Class: I.2.10

arXiv:2110.02624

CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation

Authors: Aditya Sanghi, Hang Chu, Joseph G. Lambourne, Ye Wang, Chin-Yi Cheng, Marco Fumero, Kamal Rahimi Malekshan

Abstract: Generating shapes using natural language can enable new ways of imagining and creating the things around us. While significant recent progress has been made in text-to-image generation, text-to-shape generation remains a challenging problem due to the unavailability of paired text and shape data at a large scale. We present a simple yet effective method for zero-shot text-to-shape generation that circumvents such data scarcity. Our proposed method, named CLIP-Forge, is based on a two-stage training process, which only depends on an unlabelled shape dataset and a pre-trained image-text network such as CLIP. Our method has the benefits of avoiding expensive inference time optimization, as well as the ability to generate multiple shapes for a given text. We not only demonstrate promising zero-shot generalization of the CLIP-Forge model qualitatively and quantitatively, but also provide extensive comparative evaluations to better understand its behavior.

Submitted 28 April, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

Comments: Accepted by CVPR 2022

MSC Class: 68T07; ACM Class: I.2.10

arXiv:1706.05596

Joint Scheduling and Transmission Power Control in Wireless Ad Hoc Networks

Authors: Kamal Rahimi Malekshan, Weihua Zhuang

Abstract: In this paper, we study how to determine concurrent transmissions and the transmission power level of each link to maximize spectrum efficiency and minimize energy consumption in a wireless ad hoc network. The optimal joint transmission packet scheduling and power control strategy are determined when the node density goes to infinity and the network area is unbounded. Based on the asymptotic analysis, we determine the fundamental capacity limits of a wireless network, subject to an energy consumption constraint. We propose a scheduling and transmission power control mechanism to approach the optimal solution to maximize spectrum and energy efficiencies in a practical network. The distributed implementation of the proposed scheduling and transmission power control scheme is presented based on our MAC framework proposed in [1]. Simulation results demonstrate that the proposed scheme achieves 40% higher throughput than existing schemes. Also, the energy consumption using the proposed scheme is about 20% of the energy consumed using existing power saving MAC protocols.

Submitted 17 June, 2017; originally announced June 2017.

arXiv:1510.05338

doi 10.1109/TWC.2015.2493544

Coordination-based Medium Access Control with Space-reservation for Wireless Ad Hoc Networks

Authors: Kamal Rahimi Malekshan, Weihua Zhuang, Yves Lostanlen

Abstract: Efficient radio spectrum utilization and low energy consumption in mobile devices are essential in developing next generation wireless networks. This paper presents a new medium access control (MAC) mechanism to enhance spectrum efficiency and reduce energy consumption in a wireless ad hoc network. A set of coordinator nodes, distributed in the network area, periodically schedule contention-free time slots for all data transmissions/receptions in the network, based on transmission requests from source nodes. Adjacent coordinators exchange scheduling information to effectively increase spatial spectrum reuse and avoid transmission collisions. Moreover, the proposed MAC scheme allows a node to put its radio interface into a sleep mode when it is not transmitting/receiving a packet, in order to reduce energy consumption. Simulation results demonstrate that the proposed scheme achieves substantially higher throughput and has significantly lower energy consumption in comparison with existing schemes.

Submitted 18 October, 2015; originally announced October 2015.

Comments: 12 pages, 12 figures, IEEE Transactions on Wireless Communications

arXiv:1407.6266

doi 10.1109/TWC.2014.2336801

An Energy Efficient MAC Protocol for Fully Connected Wireless Ad Hoc Networks

Authors: K. Rahimi Malekshan, W. Zhuang, Y. Lostanlen

Abstract: Energy efficiency is an important performance measure of wireless network protocols, especially for battery-powered mobile devices such as smartphones. This paper presents a new energy-efficient medium access control (MAC) scheme for fully connected wireless ad hoc networks. The proposed scheme reduces energy consumption by putting radio interfaces in the sleep state periodically and by reducing transmission collisions, which results in high throughput and low packet transmission delay. The proposed MAC scheme can also address the energy saving in realtime traffics which require very low packet transmission delay. An analytical model is established to evaluate the performance of the proposed MAC scheme. Analytical and simulation results demonstrate that the proposed scheme has a significantly lower power consumption, achieves substantially higher throughput, and has lower packet transmission delay in comparison with existing power saving MAC protocols.

Submitted 23 July, 2014; originally announced July 2014.

Comments: Published in IEEE Transactions on Wireless Communications

arXiv:1407.5718

Distributed Cross-layer Dynamic Route Selection in Wireless Multiuser Multihop Networks

Authors: K. Rahimi Malekshan, F. Lahouti

Abstract: In wireless ad-hoc networks, forwarding data through intermediate relays extends the coverage area and enhances the network throughput. We consider a general wireless multiuser multihop transmission, where each data flow is subject to a constraint on the end-to-end buffering delay and the associated packet drop rate as a quality of service (QoS) requirement. The objective is to maximize the weighted sum-rate between source destination pairs, while the corresponding QoS requirements are satisfied. We introduce two new distributed cross-layer dynamic route selection schemes in this setting that are designed involving physical, MAC, and network layers. In the proposed opportunistic cross-layer dynamic route selection scheme, routes are assigned dynamically based on the state of network nodes' buffers and the instantaneous state of fading channels. In the same setting, the proposed time division cross layer dynamic route selection scheme utilizes the average quality of channels instead for more efficient implementation. Detailed results and comparisons are provided, which demonstrate the superior performance of the proposed cross-layer dynamic route selection schemes.

Submitted 21 July, 2014; originally announced July 2014.

Comments: Submitted to IEEE Transactions on Wireless Communications

Showing 1–8 of 8 results for author: Malekshan, K R