Enis Simsar

Profile
I'm a PhD student in Computer Vision and Machine Learning at ETH Zurich, currently working on generative models and image editing. I previously did research on deep generative models at Bogazici University with Prof. Ali Taylan Cemgil. My work focuses on methods for controllable image generation and editing with diffusion models and GANs. My recent research aims to make image editing more precise and intuitive through approaches such as attention regularization and contrastive learning, including LIME for localized image editing and UIP2P for unsupervised instruction-based editing. I'm particularly interested in making generative models more controllable and more accessible to the people who use them.

Beyond 2D image generation, I've also worked on 3D-aware generative models and medical imaging applications. I've contributed to projects on 3D chest CT generation, dental X-ray analysis, and monocular depth estimation. Across these projects, my aim is to connect advances in generative modeling with practical applications.

Publications

UIP2P: Unsupervised Instruction-based Image Editing via Cycle Edit Consistency

Enis Simsar, A. Tonioni, Yongqin Xian, Thomas Hofmann, Federico Tombari

LoRACLR: Contrastive Adaptation for Customization of Diffusion Models

Enis Simsar, Thomas Hofmann, Federico Tombari, Pinar Yanardag

MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation

Han Yang, Sotiris Anagnostidis, Enis Simsar, Thomas Hofmann

arXiv.org 2024

PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAM

Stefan Stefanache, Lluís Pastor Pérez, Julen Costa Watanabe, Ernesto Sanchez Tejedor, Thomas Hofmann, Enis Simsar

arXiv.org 2024

Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models

Matthew Zheng, Enis Simsar, Hidir Yesiltepe, Federico Tombari, Joel Simon, Pinar Yanardag

arXiv.org 2024

CLoRA: A Contrastive Approach to Compose Multiple LoRA Models

Tuna Han Salih Meral, Enis Simsar, Federico Tombari, Pinar Yanardag

arXiv.org 2024

Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography

Ibrahim Ethem Hamamci, Sezgin Er, Furkan Almas, Ayse Gulnihan Simsek, S. Esirgun, Irem Dogan, Muhammed Furkan Dasdelen, Bastian Wittmann, Enis Simsar, Mehmet Simsar, Emine Bensu Erdemir, Abdullah Alanbay, A. Sekuboyina, Berkan Lafci, M. K. Ozdemir, Bjoern H Menze

LIME: Localized Image Editing via Attention Regularization in Diffusion Models

Enis Simsar, A. Tonioni, Yongqin Xian, Thomas Hofmann, Federico Tombari

arXiv.org 2023

CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models

Tuna Han Salih Meral, Enis Simsar, Federico Tombari, Pinar Yanardag

Computer Vision and Pattern Recognition 2023

DENTEX: An Abnormal Tooth Detection with Dental Enumeration and Diagnosis Benchmark for Panoramic X-rays

Ibrahim Ethem Hamamci, Sezgin Er, Enis Simsar, Atif Emre Yuksel, Sadullah Gultekin, S. Ozdemir, Kai-Ting Yang, Hongwei Li, Sarthak Pati, B. Stadlinger, A. Mehl, Mustafa Gundogar, Bjoern H Menze

arXiv.org 2023

GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes

Ibrahim Ethem Hamamci, Sezgin Er, Enis Simsar, A. Sekuboyina, Chinmay Prabhakar, A. Tezcan, Ayse Gulnihan Simsek, S. Esirgun, Furkan Almas, Irem Dogan, Muhammed Furkan Dasdelen, Hadrien Reynaud, Sarthak Pati, Christian Bluethgen, M. K. Ozdemir, Bjoern H Menze

European Conference on Computer Vision 2023

Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays

Ibrahim Ethem Hamamci, Sezgin Er, Enis Simsar, A. Sekuboyina, Mustafa Gundogar, B. Stadlinger, A. Mehl, Bjoern H Menze

International Conference on Medical Image Computing and Computer-Assisted Intervention 2023

LatentSwap3D: Semantic Edits on 3D Image GANs

Enis Simsar, A. Tonioni, Evin Pınar Örnek, F. Tombari

2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) 2022

Rank in Style: A Ranking-based Approach to Find Interpretable Directions

Umut Kocasari, Kerem Zaman, Mert Tiftikci, Enis Simsar, Pinar Yanardag

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022

Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs

Enis Simsar, Umut Kocasari, Ezgi Gulperi Er, Pinar Yanardag

IEEE Workshop/Winter Conference on Applications of Computer Vision 2022

Object-Aware Monocular Depth Prediction With Instance Convolutions

Enis Simsar, Evin Pınar Örnek, Fabian Manhardt, Helisa Dhamo, Nassir Navab, F. Tombari

IEEE Robotics and Automation Letters 2021

Graph2Pix: A Graph-Based Image to Image Translation Framework

Dilara Gokay, Enis Simsar, Efehan Atici, Alper Ahmetoglu, Atif Emre Yuksel, Pinar Yanardag

2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) 2021

Dental enumeration and multiple treatment detection on panoramic X-rays using deep learning

Atif Emre Yüksel, Sadullah Gültekin, Enis Simsar, Ş. Özdemir, M. Gündoğar, Salih Barkın Tokgöz, Ibrahim Ethem Hamamci

Scientific Reports 2021

LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions

Oğuz Kaan Yüksel, Enis Simsar, Ezgi Gulperi Er, Pinar Yanardag

IEEE International Conference on Computer Vision 2021

Comparison of Deep Generative Models for the Generation of Handwritten Character Images

Ömer Kirbiyik, Enis Simsar, Ali Taylan Cemgil

Signal Processing and Communications Applications Conference 2019

GenerateCT: Text-Guided 3D Chest CT Generation

Ibrahim Ethem Hamamci, Sezgin Er, Enis Simsar, A. Tezcan, Aysegul Simsek, Furkan Almas, S. Esirgun, Hadrien Reynaud, Sarthak Pati, Christian Blüthgen, Bjoern H Menze

arXiv.org 2023