Showing 1–13 of 13 results for author: Simsar, E

  1. arXiv:2406.14599

    cs.CV

    Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models

    Authors: Matthew Zheng, Enis Simsar, Hidir Yesiltepe, Federico Tombari, Joel Simon, Pinar Yanardag

    Abstract: Text-to-image models are becoming increasingly popular, revolutionizing the landscape of digital art creation by enabling highly detailed and creative visual content generation. These models have been widely employed across various domains, particularly in art generation, where they facilitate a broad spectrum of creative expression and democratize access to artistic creation. In this paper, we in…

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2403.19776

    cs.CV cs.LG

    CLoRA: A Contrastive Approach to Compose Multiple LoRA Models

    Authors: Tuna Han Salih Meral, Enis Simsar, Federico Tombari, Pinar Yanardag

    Abstract: Low-Rank Adaptations (LoRAs) have emerged as a powerful and popular technique in the field of image generation, offering a highly effective way to adapt and refine pre-trained deep learning models for specific tasks without the need for comprehensive retraining. By employing pre-trained LoRA models, such as those representing a specific cat and a particular dog, the objective is to generate an ima…

    Submitted 28 March, 2024; originally announced March 2024.

  3. arXiv:2403.17834

    cs.CV

    A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities

    Authors: Ibrahim Ethem Hamamci, Sezgin Er, Furkan Almas, Ayse Gulnihan Simsek, Sevval Nil Esirgun, Irem Dogan, Muhammed Furkan Dasdelen, Bastian Wittmann, Enis Simsar, Mehmet Simsar, Emine Bensu Erdemir, Abdullah Alanbay, Anjany Sekuboyina, Berkan Lafci, Mehmet K. Ozdemir, Bjoern Menze

    Abstract: A major challenge in computational research in 3D medical imaging is the lack of comprehensive datasets. Addressing this issue, our study introduces CT-RATE, the first 3D medical imaging dataset that pairs images with textual reports. CT-RATE consists of 25,692 non-contrast chest CT volumes, expanded to 50,188 through various reconstructions, from 21,304 unique patients, along with corresponding r…

    Submitted 26 March, 2024; originally announced March 2024.

  4. arXiv:2312.09256

    cs.CV

    LIME: Localized Image Editing via Attention Regularization in Diffusion Models

    Authors: Enis Simsar, Alessio Tonioni, Yongqin Xian, Thomas Hofmann, Federico Tombari

    Abstract: Diffusion models (DMs) have gained prominence due to their ability to generate high-quality, varied images, with recent advancements in text-to-image generation. The research focus is now shifting towards the controllability of DMs. A significant challenge within this domain is localized editing, where specific areas of an image are modified without affecting the rest of the content. This paper in…

    Submitted 14 December, 2023; originally announced December 2023.

  5. arXiv:2312.06059

    cs.CV cs.AI cs.LG

    CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models

    Authors: Tuna Han Salih Meral, Enis Simsar, Federico Tombari, Pinar Yanardag

    Abstract: Images produced by text-to-image diffusion models might not always faithfully represent the semantic intent of the provided text prompt, where the model might overlook or entirely fail to produce certain objects. Existing solutions often require customly tailored functions for each of these problems, leading to sub-optimal results, especially for complex prompts. Our work introduces a novel perspe…

    Submitted 10 December, 2023; originally announced December 2023.

  6. arXiv:2305.19112

    cs.CV

    DENTEX: An Abnormal Tooth Detection with Dental Enumeration and Diagnosis Benchmark for Panoramic X-rays

    Authors: Ibrahim Ethem Hamamci, Sezgin Er, Enis Simsar, Atif Emre Yuksel, Sadullah Gultekin, Serife Damla Ozdemir, Kaiyuan Yang, Hongwei Bran Li, Sarthak Pati, Bernd Stadlinger, Albert Mehl, Mustafa Gundogar, Bjoern Menze

    Abstract: Panoramic X-rays are frequently used in dentistry for treatment planning, but their interpretation can be both time-consuming and prone to error. Artificial intelligence (AI) has the potential to aid in the analysis of these X-rays, thereby improving the accuracy of dental diagnoses and treatment plans. Nevertheless, designing automated algorithms for this purpose poses significant challenges, mai…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: MICCAI 2023 Challenge

  7. arXiv:2305.16037

    cs.CV

    GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes

    Authors: Ibrahim Ethem Hamamci, Sezgin Er, Anjany Sekuboyina, Enis Simsar, Alperen Tezcan, Ayse Gulnihan Simsek, Sevval Nil Esirgun, Furkan Almas, Irem Dogan, Muhammed Furkan Dasdelen, Chinmay Prabhakar, Hadrien Reynaud, Sarthak Pati, Christian Bluethgen, Mehmet Kemal Ozdemir, Bjoern Menze

    Abstract: GenerateCT, the first approach to generating 3D medical imaging conditioned on free-form medical text prompts, incorporates a text encoder and three key components: a novel causal vision transformer for encoding 3D CT volumes, a text-image transformer for aligning CT and text tokens, and a text-conditional super-resolution diffusion model. Given the absence of directly comparable methods in 3D med…

    Submitted 11 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  8. arXiv:2303.06500

    cs.CV

    Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays

    Authors: Ibrahim Ethem Hamamci, Sezgin Er, Enis Simsar, Anjany Sekuboyina, Mustafa Gundogar, Bernd Stadlinger, Albert Mehl, Bjoern Menze

    Abstract: Due to the necessity for precise treatment planning, the use of panoramic X-rays to identify different dental diseases has tremendously increased. Although numerous ML models have been developed for the interpretation of panoramic X-rays, there has not been an end-to-end model developed that can identify problematic teeth with dental enumeration and associated diagnoses at the same time. To develo…

    Submitted 5 June, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: MICCAI 2023

  9. arXiv:2212.01381

    cs.CV

    LatentSwap3D: Semantic Edits on 3D Image GANs

    Authors: Enis Simsar, Alessio Tonioni, Evin Pınar Örnek, Federico Tombari

    Abstract: 3D GANs have the ability to generate latent codes for entire 3D volumes rather than only 2D images. These models offer desirable features like high-quality geometry and multi-view consistency, but, unlike their 2D counterparts, complex semantic image editing tasks for 3D GANs have only been partially explored. To address this problem, we propose LatentSwap3D, a semantic edit approach based on late…

    Submitted 4 September, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: The paper has been accepted by ICCV'23 AI3DCC

  10. arXiv:2203.08516

    cs.CV

    Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs

    Authors: Enis Simsar, Umut Kocasari, Ezgi Gülperi Er, Pinar Yanardag

    Abstract: The discovery of interpretable directions in the latent spaces of pre-trained GAN models has recently become a popular topic. In particular, StyleGAN2 has enabled various image generation and manipulation tasks due to its rich and disentangled latent spaces. The discovery of such directions is typically done either in a supervised manner, which requires annotated data for each desired manipulation…

    Submitted 31 March, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

  11. arXiv:2112.01521

    cs.CV cs.LG

    Object-aware Monocular Depth Prediction with Instance Convolutions

    Authors: Enis Simsar, Evin Pınar Örnek, Fabian Manhardt, Helisa Dhamo, Nassir Navab, Federico Tombari

    Abstract: With the advent of deep learning, estimating depth from a single RGB image has recently received a lot of attention, being capable of empowering many different applications ranging from path planning for robotics to computational cinematography. Nevertheless, while the depth maps are in their entirety fairly reliable, the estimates around object discontinuities are still far from satisfactory. Thi…

    Submitted 24 February, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

  12. arXiv:2108.09752

    cs.CV

    Graph2Pix: A Graph-Based Image to Image Translation Framework

    Authors: Dilara Gokay, Enis Simsar, Efehan Atici, Alper Ahmetoglu, Atif Emre Yuksel, Pinar Yanardag

    Abstract: In this paper, we propose a graph-based image-to-image translation framework for generating images. We use rich data collected from the popular creativity platform Artbreeder (http://artbreeder.com), where users interpolate multiple GAN-generated images to create artworks. This unique approach of creating new images leads to a tree-like structure where one can track historical data about the creat…

    Submitted 22 August, 2021; originally announced August 2021.

  13. arXiv:2104.00820

    cs.LG cs.CV

    LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions

    Authors: Oğuz Kaan Yüksel, Enis Simsar, Ezgi Gülperi Er, Pinar Yanardag

    Abstract: Recent research has shown that it is possible to find interpretable directions in the latent spaces of pre-trained Generative Adversarial Networks (GANs). These directions enable controllable image generation and support a wide range of semantic editing operations, such as zoom or rotation. The discovery of such directions is often done in a supervised or semi-supervised manner and requires manual…

    Submitted 6 October, 2021; v1 submitted 1 April, 2021; originally announced April 2021.