Skip to main content

Showing 1–31 of 31 results for author: Dundar, A

.
  1. arXiv:2406.09368  [pdf, other

    cs.CV

    CLIPAway: Harmonizing Focused Embeddings for Removing Objects via Diffusion Models

    Authors: Yigit Ekin, Ahmet Burak Yildirim, Erdem Eren Caglar, Aykut Erdem, Erkut Erdem, Aysegul Dundar

    Abstract: Advanced image editing techniques, particularly inpainting, are essential for seamlessly removing unwanted elements while preserving visual integrity. Traditional GAN-based methods have achieved notable success, but recent advancements in diffusion models have produced superior results due to their training on large-scale datasets, enabling the generation of remarkably realistic inpainted images.… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Project page: https://yigitekin.github.io/CLIPAway/

  2. arXiv:2404.03632  [pdf, other

    cs.CV

    Reference-Based 3D-Aware Image Editing with Triplane

    Authors: Bahri Batuhan Bilecen, Yigit Yalin, Ning Yu, Aysegul Dundar

    Abstract: Generative Adversarial Networks (GANs) have emerged as powerful tools not only for high-quality image generation but also for real image editing through manipulation of their interpretable latent spaces. Recent advancements in GANs include the development of 3D-aware models such as EG3D, characterized by efficient triplane-based architectures enabling the reconstruction of 3D geometry from single… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2312.11422  [pdf, other

    cs.CV

    War** the Residuals for Image Editing with StyleGAN

    Authors: Ahmet Burak Yildirim, Hamza Pehlivan, Aysegul Dundar

    Abstract: StyleGAN models show editing capabilities via their semantically interpretable latent organizations which require successful GAN inversion methods to edit real images. Many works have been proposed for inverting images into StyleGAN's latent space. However, their results either suffer from low fidelity to the input image or poor editing qualities, especially for edits that require large transforma… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  4. arXiv:2309.13975  [pdf, other

    cs.CV

    Diverse Semantic Image Editing with Style Codes

    Authors: Hakan Sivuk, Aysegul Dundar

    Abstract: Semantic image editing requires inpainting pixels following a semantic map. It is a challenging task since this inpainting requires both harmony with the context and strict compliance with the semantic maps. The majority of the previous methods proposed for this task try to encode the whole information from erased images. However, when an object is added to a scene such as a car, its style cannot… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  5. arXiv:2307.15033  [pdf, other

    cs.CV

    Diverse Inpainting and Editing with GAN Inversion

    Authors: Ahmet Burak Yildirim, Hamza Pehlivan, Bahri Batuhan Bilecen, Aysegul Dundar

    Abstract: Recent inversion methods have shown that real images can be inverted into StyleGAN's latent space and numerous edits can be achieved on those images thanks to the semantically rich feature representations of well-trained GAN models. However, extensive research has also shown that image inversion is challenging due to the trade-off between high-fidelity reconstruction and editability. In this paper… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: ICCV 2023

  6. arXiv:2305.11102  [pdf, other

    cs.CV

    Progressive Learning of 3D Reconstruction Network from 2D GAN Data

    Authors: Aysegul Dundar, Jun Gao, Andrew Tao, Bryan Catanzaro

    Abstract: This paper presents a method to reconstruct high-quality textured 3D models from single images. Current methods rely on datasets with expensive annotations; multi-view images and their camera parameters. Our method relies on GAN generated multi-view image datasets which have a negligible annotation cost. However, they are not strictly multi-view consistent and sometimes GANs output distorted image… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Web-page: https://research.nvidia.com/labs/adlr/progressive-3d-learning. arXiv admin note: text overlap with arXiv:2203.09362

  7. arXiv:2304.03246  [pdf, other

    cs.CV

    Inst-Inpaint: Instructing to Remove Objects with Diffusion Models

    Authors: Ahmet Burak Yildirim, Vedat Baday, Erkut Erdem, Aykut Erdem, Aysegul Dundar

    Abstract: Image inpainting task refers to erasing unwanted pixels from images and filling them in a semantically consistent and realistic way. Traditionally, the pixels that are wished to be erased are defined with binary masks. From the application point of view, a user needs to generate the masks for the objects they would like to remove which can be time-consuming and prone to errors. In this work, we ar… ▽ More

    Submitted 9 August, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  8. arXiv:2303.03471  [pdf, other

    cs.CV

    Refining 3D Human Texture Estimation from a Single Image

    Authors: Said Fahri Altindis, Adil Meric, Yusuf Dalva, Ugur Gudukbay, Aysegul Dundar

    Abstract: Estimating 3D human texture from a single image is essential in graphics and vision. It requires learning a map** function from input images of humans with diverse poses into the parametric (UV) space and reasonably hallucinating invisible parts. To achieve a high-quality 3D human texture estimation, we propose a framework that adaptively samples the input by a deformable convolution where offse… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  9. arXiv:2301.04628  [pdf, other

    cs.CV

    Face Attribute Editing with Disentangled Latent Vectors

    Authors: Yusuf Dalva, Hamza Pehlivan, Cansu Moran, Öykü Irmak Hatipoğlu, Ayşegül Dündar

    Abstract: We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength and disentanglement in the representations of attributes to preserve the other attributes during edits. For this goal, inspired by the latent space factorization… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: See https://yusufdalva.github.io/vecgan for the project webpage. arXiv admin note: substantial text overlap with arXiv:2207.03411

  10. arXiv:2212.14359  [pdf, other

    cs.CV

    StyleRes: Transforming the Residuals for Real Image Editing with StyleGAN

    Authors: Hamza Pehlivan, Yusuf Dalva, Aysegul Dundar

    Abstract: We present a novel image inversion framework and a training pipeline to achieve high-fidelity image inversion with high-quality attribute editing. Inverting real images into StyleGAN's latent space is an extensively studied problem, yet the trade-off between the image reconstruction fidelity and image editing quality remains an open challenge. The low-rate latent spaces are limited in their expres… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

  11. arXiv:2207.03411  [pdf, other

    cs.CV cs.AI cs.LG

    VecGAN: Image-to-Image Translation with Interpretable Latent Directions

    Authors: Yusuf Dalva, Said Fahri Altindis, Aysegul Dundar

    Abstract: We propose VecGAN, an image-to-image translation framework for facial attribute editing with interpretable latent directions. Facial attribute editing task faces the challenges of precise attribute editing with controllable strength and preservation of the other attributes of an image. For this goal, we design the attribute editing by latent space factorization and for each attribute, we learn a l… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

  12. arXiv:2203.09362  [pdf, other

    cs.CV

    Fine Detailed Texture Learning for 3D Meshes with Generative Models

    Authors: Aysegul Dundar, Jun Gao, Andrew Tao, Bryan Catanzaro

    Abstract: This paper presents a method to reconstruct high-quality textured 3D models from both multi-view and single-view images. The reconstruction is posed as an adaptation problem and is done progressively where in the first stage, we focus on learning accurate geometry, whereas in the second stage, we focus on learning the texture with a generative adversarial network. In the generative learning pipeli… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

  13. arXiv:2109.01123  [pdf, other

    cs.CV cs.AI cs.LG

    Benchmarking the Robustness of Instance Segmentation Models

    Authors: Said Fahri Altindis, Yusuf Dalva, Hamza Pehlivan, Aysegul Dundar

    Abstract: This paper presents a comprehensive evaluation of instance segmentation models with respect to real-world image corruptions as well as out-of-domain image collections, e.g. images captured by a different set-up than the training dataset. The out-of-domain image evaluation shows the generalization capability of models, an essential aspect of real-world applications and an extensively studied topic… ▽ More

    Submitted 10 August, 2022; v1 submitted 2 September, 2021; originally announced September 2021.

  14. arXiv:2106.06533  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    View Generalization for Single Image Textured 3D Models

    Authors: Anand Bhattad, Aysegul Dundar, Guilin Liu, Andrew Tao, Bryan Catanzaro

    Abstract: Humans can easily infer the underlying 3D geometry and texture of an object only from a single 2D image. Current computer vision methods can do this, too, but suffer from view generalization problems - the models inferred tend to make poor predictions of appearance in novel views. As for generalization problems in machine learning, the difficulty is balancing single-view accuracy (cf. training err… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: CVPR 2021. Project website: https://nv-adlr.github.io/view-generalization

  15. arXiv:2103.16748  [pdf, other

    cs.CV cs.GR

    Dual Contrastive Loss and Attention for GANs

    Authors: Ning Yu, Guilin Liu, Aysegul Dundar, Andrew Tao, Bryan Catanzaro, Larry Davis, Mario Fritz

    Abstract: Generative Adversarial Networks (GANs) produce impressive results on unconditional image generation when powered with large-scale image datasets. Yet generated images are still easy to spot especially on datasets with high variance (e.g. bedroom, church). In this paper, we propose various improvements to further push the boundaries in image generation. Specifically, we propose a novel dual contras… ▽ More

    Submitted 17 March, 2022; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted to ICCV'21

  16. arXiv:2004.10289  [pdf, other

    cs.CV

    Panoptic-based Image Synthesis

    Authors: Aysegul Dundar, Karan Sapra, Guilin Liu, Andrew Tao, Bryan Catanzaro

    Abstract: Conditional image synthesis for generating photorealistic images serves various applications for content editing to content generation. Previous conditional image synthesis algorithms mostly rely on semantic maps, and often fail in complex environments where multiple instances occlude each other. We propose a panoptic aware image synthesis network to generate high fidelity and photorealistic image… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: CVPR 2020

  17. arXiv:2001.09518  [pdf, other

    cs.CV

    Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos

    Authors: Aysegul Dundar, Kevin J. Shih, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro

    Abstract: Unsupervised landmark learning is the task of learning semantic keypoint-like representations without the use of expensive input keypoint-level annotations. A popular approach is to factorize an image into a pose and appearance data stream, then to reconstruct the image from the factorized components. The pose representation should capture a set of consistent and tightly localized landmarks in ord… ▽ More

    Submitted 26 January, 2020; originally announced January 2020.

  18. arXiv:1909.02749  [pdf, other

    cs.CV cs.LG stat.ML

    Video Interpolation and Prediction with Unsupervised Landmarks

    Authors: Kevin J. Shih, Aysegul Dundar, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro

    Abstract: Prediction and interpolation for long-range video data involves the complex task of modeling motion trajectories for each visible object, occlusions and dis-occlusions, as well as appearance changes due to viewpoint and lighting. Optical flow based techniques generalize but are suitable only for short temporal ranges. Many methods opt to project the video frames to a low dimensional latent space,… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

    Comments: Technical Report

  19. arXiv:1906.05928  [pdf, other

    cs.CV

    Unsupervised Video Interpolation Using Cycle Consistency

    Authors: Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro

    Abstract: Learning to synthesize high frame rate videos via interpolation requires large quantities of high frame rate training videos, which, however, are scarce, especially at high resolutions. Here, we propose unsupervised techniques to synthesize high frame rate videos directly from low frame rate videos using cycle consistency. For a triplet of consecutive frames, we optimize models to minimize the dis… ▽ More

    Submitted 27 March, 2021; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: Published in ICCV 2019. Codes are available at https://github.com/NVIDIA/unsupervised-video-interpolation. Project website https://nv-adlr.github.io/publication/2019-UnsupervisedVideoInterpolation

  20. arXiv:1807.09384  [pdf, other

    cs.CV cs.LG

    Domain Stylization: A Strong, Simple Baseline for Synthetic to Real Image Domain Adaptation

    Authors: Aysegul Dundar, Ming-Yu Liu, Ting-Chun Wang, John Zedlewski, Jan Kautz

    Abstract: Deep neural networks have largely failed to effectively utilize synthetic data when applied to real images due to the covariate shift problem. In this paper, we show that by applying a straightforward modification to an existing photorealistic style transfer algorithm, we achieve state-of-the-art synthetic-to-real domain adaptation results. We conduct extensive experimental validations on four syn… ▽ More

    Submitted 24 July, 2018; originally announced July 2018.

  21. arXiv:1712.01653  [pdf, other

    cs.CV cs.LG

    Context Augmentation for Convolutional Neural Networks

    Authors: Aysegul Dundar, Ignacio Garcia-Dorado

    Abstract: Recent enhancements of deep convolutional neural networks (ConvNets) empowered by enormous amounts of labeled data have closed the gap with human performance for many object recognition tasks. These impressive results have generated interest in understanding and visualization of ConvNets. In this work, we study the effect of background in the task of image classification. Our results show that cha… ▽ More

    Submitted 11 December, 2017; v1 submitted 22 November, 2017; originally announced December 2017.

    Comments: 8 pages, 7 figures

  22. arXiv:1706.05048  [pdf, other

    cs.LG cs.CV

    Human-like Clustering with Deep Convolutional Neural Networks

    Authors: Ali Borji, Aysegul Dundar

    Abstract: Classification and clustering have been studied separately in machine learning and computer vision. Inspired by the recent success of deep learning models in solving various vision problems (e.g., object recognition, semantic segmentation) and the fact that humans serve as the gold standard in assessing clustering algorithms, here, we advocate for a unified treatment of the two problems and sugges… ▽ More

    Submitted 11 December, 2017; v1 submitted 15 June, 2017; originally announced June 2017.

  23. arXiv:1511.06306  [pdf, other

    cs.LG cs.CV

    Robust Convolutional Neural Networks under Adversarial Noise

    Authors: Jonghoon **, Aysegul Dundar, Eugenio Culurciello

    Abstract: Recent studies have shown that Convolutional Neural Networks (CNNs) are vulnerable to a small perturbation of input called "adversarial examples". In this work, we propose a new feedforward CNN that improves robustness in the presence of adversarial noise. Our model uses stochastic additive noise added to the input image and to the CNN models. The proposed model operates in conjunction with a CNN… ▽ More

    Submitted 25 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: 8 pages

  24. arXiv:1511.06241  [pdf, other

    cs.LG cs.CV

    Convolutional Clustering for Unsupervised Learning

    Authors: Aysegul Dundar, Jonghoon **, Eugenio Culurciello

    Abstract: The task of labeling data for training deep neural networks is daunting and tedious, requiring millions of labels to achieve the current state-of-the-art results. Such reliance on large amounts of labeled data can be relaxed by exploiting hierarchical features via unsupervised learning techniques. In this work, we propose to train a deep convolutional network based on an enhanced version of the k-… ▽ More

    Submitted 16 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: 11 pages

  25. arXiv:1412.5474  [pdf, other

    cs.NE cs.LG

    Flattened Convolutional Neural Networks for Feedforward Acceleration

    Authors: Jonghoon **, Aysegul Dundar, Eugenio Culurciello

    Abstract: We present flattened convolutional neural networks that are designed for fast feedforward execution. The redundancy of the parameters, especially weights of the convolutional filters in convolutional neural networks has been extensively studied and different heuristics have been proposed to construct a low rank basis of the filters after training. In this work, we train flattened networks that con… ▽ More

    Submitted 20 November, 2015; v1 submitted 17 December, 2014; originally announced December 2014.

    Comments: International Conference on Learning Representations (ICLR) 2015

  26. arXiv:1306.0152  [pdf, other

    cs.CV

    An Analysis of the Connections Between Layers of Deep Neural Networks

    Authors: Eugenio Culurciello, Jonghoon **, Aysegul Dundar, Jordan Bates

    Abstract: We present an analysis of different techniques for selecting the connection be- tween layers of deep neural networks. Traditional deep neural networks use ran- dom connection tables between layers to keep the number of connections small and tune to different image features. This kind of connection performs adequately in supervised deep networks because their values are refined during the training.… ▽ More

    Submitted 1 June, 2013; originally announced June 2013.

  27. arXiv:1301.2820  [pdf, other

    cs.CV

    Clustering Learning for Robotic Vision

    Authors: Eugenio Culurciello, Jordan Bates, Aysegul Dundar, Jose Carrasco, Clement Farabet

    Abstract: We present the clustering learning technique applied to multi-layer feedforward deep neural networks. We show that this unsupervised learning technique can compute network filters with only a few minutes and a much reduced set of parameters. The goal of this paper is to promote the technique for general-purpose robotic vision systems. We report its use in static image datasets and object tracking… ▽ More

    Submitted 13 March, 2013; v1 submitted 13 January, 2013; originally announced January 2013.

    Comments: Code for this paper is available here: https://github.com/culurciello/CL_paper1_code

  28. arXiv:1209.2696  [pdf, ps, other

    cs.CV cs.RO

    Visual Tracking with Similarity Matching Ratio

    Authors: Aysegul Dundar, Jonghoon **, Eugenio Culurciello

    Abstract: This paper presents a novel approach to visual tracking: Similarity Matching Ratio (SMR). The traditional approach of tracking is minimizing some measures of the difference between the template and a patch from the frame. This approach is vulnerable to outliers and drastic appearance changes and an extensive study is focusing on making the approach more tolerant to them. However, this often result… ▽ More

    Submitted 12 September, 2012; originally announced September 2012.

  29. arXiv:1104.1112  [pdf, ps, other

    physics.optics

    Electromechanical wavelength tuning of double-membrane photonic crystal cavities

    Authors: L. Midolo, P. J. van Veldhoven, M. A. Dundar, R. Nötzel, A. Fiore

    Abstract: We present a method for tuning the resonant wavelength of photonic crystal cavities (PCCs) around 1.55 um. Large tuning of the PCC mode is enabled by electromechanically controlling the separation between two parallel InGaAsP membranes. A fabrication method to avoid sticking between the membranes is discussed. Reversible red/blue shifting of the symmetric/anti-symmetric modes has been observed, wh… ▽ More

    Submitted 6 April, 2011; originally announced April 2011.

    Comments: 9 pages, 3 figures

    Journal ref: Appl. Phys. Lett. 98, 211120 (2011)

  30. arXiv:0705.2637  [pdf

    physics.optics physics.ao-ph

    A method for volume stabilization of single, dye-doped water microdroplets with femtoliter resolution

    Authors: A. Kiraz, A. Kurt, M. A. Dündar, M. Y. Yüce, A. L. Demirel

    Abstract: A self-control mechanism that stabilizes the size of Rhodamine B-doped water microdroplets standing on a superhydrophobic surface is demonstrated. The mechanism relies on the interplay between the condensation rate that was kept constant and evaporation rate induced by laser excitation which critically depends on the size of the microdroplets. The radii of individual water microdroplets (>5 um)… ▽ More

    Submitted 18 May, 2007; originally announced May 2007.

    Comments: to appear in the J. Op. Soc. Am. B

  31. Lasing from single, stationary, dye-doped glycerol/water microdroplets located on a superhydrophobic surface

    Authors: A. Kiraz, A. Sennaroglu, S. Doğanay, M. A. Dündar, A. Kurt, H. Kalaycıoğlu, A. L. Demirel

    Abstract: We report laser emission from single, stationary, Rhodamine B-doped glycerol/water microdroplets located on a superhydrophobic surface. In the experiments, a pulsed, frequency-doubled Nd:YAG laser operating at 532 nm was used as the excitation source. The microdroplets ranged in diameter from a few to 20 um. Lasing was achieved in the red-shifted portion of the dye emission spectrum with thresho… ▽ More

    Submitted 17 May, 2007; originally announced May 2007.

    Comments: to appear in Optics Communications