Showing 1–18 of 18 results for author: Azadi, S

Searching in archive cs.
  1. arXiv:2311.18827  [pdf, other]

    cs.GR cs.AI cs.CV cs.LG cs.MM

    Motion-Conditioned Image Animation for Video Editing

    Authors: Wilson Yan, Andrew Brown, Pieter Abbeel, Rohit Girdhar, Samaneh Azadi

    Abstract: We introduce MoCA, a Motion-Conditioned Image Animation approach for video editing. It leverages a simple decomposition of the video editing problem into image editing followed by motion-conditioned image animation. Furthermore, given the lack of robust evaluation datasets for video editing, we introduce a new benchmark that measures edit capability across a wide variety of tasks, such as object r…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Project page: https://facebookresearch.github.io/MoCA

  2. arXiv:2311.10709  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG cs.MM

    Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

    Authors: Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra

    Abstract: We present Emu Video, a text-to-video generation model that factorizes the generation into two steps: first generating an image conditioned on the text, and then generating a video conditioned on the text and the generated image. We identify critical design decisions--adjusted noise schedules for diffusion, and multi-stage training--that enable us to directly generate high quality and high resolut…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Project page: https://emu-video.metademolab.com

  3. arXiv:2310.09243  [pdf, other]

    cs.AI cs.CE

    Augmented Computational Design: Methodical Application of Artificial Intelligence in Generative Design

    Authors: Pirouz Nourian, Shervin Azadi, Roy Uijtendaal, Nan Bai

    Abstract: This chapter presents methodological reflections on the necessity and utility of artificial intelligence in generative design. Specifically, the chapter discusses how generative design processes can be augmented by AI to deliver in terms of a few outcomes of interest or performance indicators while dealing with hundreds or thousands of small decisions. The core of the performance-based generative…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: This is the author's version of the book chapter Augmented Computational Design: Methodical Application of Artificial Intelligence in Generative Design. In Artificial Intelligence in Performance-Driven Design: Theories, Methods, and Tools Towards Sustainability, edited by Narjes Abbasabadi and Mehdi Ashayeri. Wiley, 2023

  4. arXiv:2309.15472  [pdf, other]

    cs.GR cs.CG cs.DM math.AP math.GT

    Voxel Graph Operators: Topological Voxelization, Graph Generation, and Derivation of Discrete Differential Operators from Voxel Complexes

    Authors: Pirouz Nourian, Shervin Azadi

    Abstract: In this paper, we present a novel workflow consisting of algebraic algorithms and data structures for fast and topologically accurate conversion of vector data models such as Boundary Representations into voxels (topological voxelization); spatially indexing them; constructing connectivity graphs from voxels; and constructing a coherent set of multivariate differential and integral operators from…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 23 pages

  5. arXiv:2309.13396  [pdf, other]

    cs.CY cs.HC

    EquiCity Game: A mathematical serious game for participatory design of spatial configurations

    Authors: Pirouz Nourian, Shervin Azadi, Nan Bai, Bruno de Andrade, Nour Abu Zaid, Samaneh Rezvani, Ana Pereira Roders

    Abstract: We propose mechanisms for a mathematical social-choice game that is designed to mediate decision-making processes for city planning, urban area redevelopment, and architectural design (massing) of urban housing complexes. The proposed game is effectively a multi-player generative configurator equipped with automated appraisal/scoring mechanisms for revealing the aggregate impact of alternatives; f…

    Submitted 30 September, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: 16 pages (the paper), 15 pages (supplemental materials), references missing in the supplemental document

  6. arXiv:2305.09662  [pdf, other]

    cs.CV cs.AI

    Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation

    Authors: Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta

    Abstract: Text-guided human motion generation has drawn significant interest because of its impactful applications spanning animation and robotics. Recently, application of diffusion models for motion generation has enabled improvements in the quality of generated motions. However, existing approaches are limited by their reliance on relatively small-scale motion capture data, leading to poor performance on…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2304.07410

  7. arXiv:2304.07410  [pdf, other]

    cs.CV cs.AI

    Text-Conditional Contextualized Avatars For Zero-Shot Personalization

    Authors: Samaneh Azadi, Thomas Hayes, Akbar Shah, Guan Pang, Devi Parikh, Sonal Gupta

    Abstract: Recent large-scale text-to-image generation models have made significant improvements in the quality, realism, and diversity of the synthesized images and enable users to control the created content through language. However, the personalization aspect of these generative models is still challenging and under-explored. In this work, we propose a pipeline that enables personalization of image gener…

    Submitted 14 April, 2023; originally announced April 2023.

  8. arXiv:2212.00210  [pdf, other]

    cs.CV cs.AI cs.LG

    Shape-Guided Diffusion with Inside-Outside Attention

    Authors: Dong Huk Park, Grace Luo, Clayton Toste, Samaneh Azadi, Xihui Liu, Maka Karalashvili, Anna Rohrbach, Trevor Darrell

    Abstract: We introduce precise object silhouette as a new form of user control in text-to-image diffusion models, which we dub Shape-Guided Diffusion. Our training-free method uses an Inside-Outside Attention mechanism during the inversion and generation process to apply a shape constraint to the cross- and self-attention maps. Our mechanism designates which spatial region is the object (inside) vs. backgro…

    Submitted 1 April, 2024; v1 submitted 30 November, 2022; originally announced December 2022.

    Comments: WACV 2024

  9. arXiv:2202.05183  [pdf, other]

    physics.comp-ph cond-mat.other cond-mat.str-el cs.LG

    Discovering Quantum Phase Transitions with Fermionic Neural Networks

    Authors: G. Cassella, H. Sutterud, S. Azadi, N. D. Drummond, D. Pfau, J. S. Spencer, W. M. C. Foulkes

    Abstract: Deep neural networks have been extremely successful as highly accurate wave function ansätze for variational Monte Carlo calculations of molecular ground states. We present an extension of one such ansatz, FermiNet, to calculations of the ground states of periodic Hamiltonians, and study the homogeneous electron gas. FermiNet calculations of the ground-state energies of small electron gas systems…

    Submitted 5 July, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: 12 pages, 3 figures

  10. arXiv:2112.05744  [pdf, other]

    cs.CV cs.GR

    More Control for Free! Image Synthesis with Semantic Diffusion Guidance

    Authors: Xihui Liu, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi, Anna Rohrbach, Trevor Darrell

    Abstract: Controllable image synthesis models allow creation of diverse images based on text instructions or guidance from a reference image. Recently, denoising diffusion probabilistic models have been shown to generate more realistic imagery than prior methods, and have been successfully demonstrated in unconditional and class-conditional settings. We investigate fine-grained, continuous control of this m…

    Submitted 5 December, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: WACV 2023. Project page https://xh-liu.github.io/sdg/

  11. arXiv:2109.11037  [pdf, other]

    cs.DC cs.GR

    A Computational Approach for Checking Compliance with European View and Sunlight Exposure Criteria

    Authors: Eleonora Brembilla, Shervin Azadi, Pirouz Nourian

    Abstract: The paper presents open-source computational workflows for assessing the "Exposure to sunlight" and "View out" criteria as defined in the European standard EN 17037 "Daylight in Buildings", issued by the European Committee for Standardization. In addition to these factors, the standard document also addresses daylight provision and protection from glare, both of which fall out of the scope of this…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 7 pages, 8 figures, accepted and presented in the 17th IBPSA Conference Bruges, Belgium, Sept. 1-3, 2021

    ACM Class: J.6; J.2; I.3

  12. arXiv:1911.11357  [pdf, other]

    cs.LG cs.CV stat.ML

    Semantic Bottleneck Scene Generation

    Authors: Samaneh Azadi, Michael Tschannen, Eric Tzeng, Sylvain Gelly, Trevor Darrell, Mario Lucic

    Abstract: Coupling the high-fidelity generation capabilities of label-conditional image synthesis methods with the flexibility of unconditional generative models, we propose a semantic bottleneck GAN model for unconditional synthesis of complex scenes. We assume pixel-wise segmentation labels are available during training and use them to learn the scene structure. During inference, our model first synthesiz…

    Submitted 26 November, 2019; originally announced November 2019.

  13. Time Distance: A Novel Collision Prediction and Path Planning Method

    Authors: Ali Analooee, Shahram Azadi, Reza Kazemi

    Abstract: In this paper, a new fast algorithm for path planning and a collision prediction framework for two dimensional dynamically changing environments are introduced. The method is called Time Distance (TD) and benefits from the space-time space idea. First, the TD concept is defined as the time interval that must be spent in order for an object to reach another object or a location. Next, TD functions…

    Submitted 6 April, 2023; v1 submitted 7 July, 2019; originally announced July 2019.

    Journal ref: Journal of Applied and Computational Mechanics, Vol. 9, No. 3, (2023), 656-677

  14. arXiv:1810.06758  [pdf, other]

    stat.ML cs.LG

    Discriminator Rejection Sampling

    Authors: Samaneh Azadi, Catherine Olsson, Trevor Darrell, Ian Goodfellow, Augustus Odena

    Abstract: We propose a rejection sampling scheme using the discriminator of a GAN to approximately correct errors in the GAN generator distribution. We show that under quite strict assumptions, this will allow us to recover the data distribution exactly. We then examine where those strict assumptions break down and design a practical algorithm - called Discriminator Rejection Sampling (DRS) - that can be us…

    Submitted 26 February, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: Published as a conference paper at ICLR 2019

  15. arXiv:1807.07560  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Compositional GAN: Learning Image-Conditional Binary Composition

    Authors: Samaneh Azadi, Deepak Pathak, Sayna Ebrahimi, Trevor Darrell

    Abstract: Generative Adversarial Networks (GANs) can produce images of remarkable complexity and realism but are generally structured to sample from a single latent source ignoring the explicit spatial interaction between multiple entities that could be present in a scene. Capturing such complex interactions between different objects in the world, including their relative scaling, spatial layout, occlusion,…

    Submitted 28 March, 2019; v1 submitted 19 July, 2018; originally announced July 2018.

  16. arXiv:1712.00516  [pdf, other]

    cs.CV

    Multi-Content GAN for Few-Shot Font Style Transfer

    Authors: Samaneh Azadi, Matthew Fisher, Vladimir Kim, Zhaowen Wang, Eli Shechtman, Trevor Darrell

    Abstract: In this work, we focus on the challenge of taking partial observations of highly-stylized text and generalizing the observations to generate unobserved glyphs in the ornamented typeface. To generate a set of multi-content images following a consistent style from very few examples, we propose an end-to-end stacked conditional GAN model considering content along channels and style along network laye…

    Submitted 1 December, 2017; originally announced December 2017.

  17. arXiv:1704.03533  [pdf, other]

    cs.CV

    Learning Detection with Diverse Proposals

    Authors: Samaneh Azadi, Jiashi Feng, Trevor Darrell

    Abstract: To predict a set of diverse and informative proposals with enriched representations, this paper introduces a differentiable Determinantal Point Process (DPP) layer that is able to augment the object detection architectures. Most modern object detection architectures, such as Faster R-CNN, learn to localize objects by minimizing deviations from the ground-truth but ignore correlation between multip…

    Submitted 11 April, 2017; originally announced April 2017.

    Comments: Accepted to CVPR 2017

  18. arXiv:1511.07069  [pdf, other]

    cs.CV

    Auxiliary Image Regularization for Deep CNNs with Noisy Labels

    Authors: Samaneh Azadi, Jiashi Feng, Stefanie Jegelka, Trevor Darrell

    Abstract: Precisely-labeled data sets with sufficient amount of samples are very important for training deep convolutional neural networks (CNNs). However, many of the available real-world data sets contain erroneously labeled samples and those errors substantially hinder the learning of very accurate CNN models. In this work, we consider the problem of training a deep CNN model for image classification wit…

    Submitted 2 March, 2016; v1 submitted 22 November, 2015; originally announced November 2015.

    Comments: Published as a conference paper at ICLR 2016