Content

Now You Can Full Fine Tune / DreamBooth Stable Diffusion XL (SDXL) with only 10.3 GB VRAM via OneTrainer - Both U-NET and Text Encoder 1 are trained - Compared 14 GB config vs slower 10.3 GB Config

Full config and instructions are shared here : https://www.patreon.com/posts/96028218

Used SG161222/RealVisXL_V4.0 as a base model and OneTrainer to train on Windows 10 : https://github.com/Nerogar/OneTrainer

The posted example x/y/z checkpoint comparison images are not cherry picked, so even better images can be obtained with multiple tries.

Trained for 150 epochs on 15 images, using my ground truth set of 5200 regularization images : https://www.patreon.com/posts/massive-4k-woman-87700469

In each epoch, only 15 of the regularization images are used, which is what gives the DreamBooth training effect.

As a caption only “ohwx man” is used, for regularization images just “man”

You can download configs and full instructions here : https://www.patreon.com/posts/96028218

Hopefully a full public tutorial is coming within 2 weeks. I will show all of the configuration as well.

The tutorial will be on our channel : https://www.youtube.com/SECourses

Training speeds and the resulting durations are as follows:

RTX 3060, slow preset : 3.72 second / it, thus 15 train images * 150 epochs * 2 (reg images concept) = 4500 steps, and 4500 * 3.72 / 3600 = 4.6 hours

RTX 3090 TI, slow preset : 1.58 second / it, thus 4500 * 1.58 / 3600 = 2 hours

RTX 3090 TI, fast preset : 1.45 second / it, thus 4500 * 1.45 / 3600 = 1.8 hours
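
The duration arithmetic above can be reproduced with a small Python sketch, which may help when estimating your own run from a measured second / it value. The function names (total_steps, training_hours) are made up for illustration and are not part of OneTrainer. It assumes one step per image (which is what 15 * 150 * 2 = 4500 implies) and that the regularization concept doubles the step count.

```python
def total_steps(train_images: int, epochs: int, concepts: int = 2) -> int:
    """Total training steps, assuming one step per image.

    concepts=2 models the setup in this post: the training images plus an
    equal number of regularization images per epoch (an assumption taken
    from the 15 * 150 * 2 figure above).
    """
    return train_images * epochs * concepts


def training_hours(steps: int, seconds_per_step: float) -> float:
    """Convert a step count and a measured second / it speed into hours."""
    return steps * seconds_per_step / 3600


steps = total_steps(train_images=15, epochs=150)

# Speeds reported in this post, in seconds per iteration.
speeds = {
    "RTX 3060, slow preset": 3.72,
    "RTX 3090 TI, slow preset": 1.58,
    "RTX 3090 TI, fast preset": 1.45,
}

for name, seconds_per_step in speeds.items():
    hours = training_hours(steps, seconds_per_step)
    print(f"{name}: {steps} steps, {hours:.2f} hours")
```

To estimate a different card, replace the speed with the second / it value that OneTrainer reports during training; a value far above the ones listed here points to a setup problem rather than to the config itself.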

A quick tutorial for how to use concepts in OneTrainer : https://youtu.be/yPOadldf6bI

Comments

Alex

Thanks, I'll give it a try.

Hey Ooo

Does this only work with diffusers?

C. Jonas

Hi Furkan, would you rather do a full training on SD1.5 with your preset for 12GB cards or a full training on SDXL with your new config? Which training is better for realistic faces whilst still enabling flexibility in the generated pictures?

Doc Snyder

With the latest version of onetrainer I get "AttributeError: 'str' object has no attribute 'is_wuerstchen'" while changing to the added preset and it doesn't load.

Leonardo Chocron

Hi doc! Does more images require more vram? I don't get the consistency that I have with sd1.5 using the same photos, especially in the shape and physiognomy of the face.

Furkan Gözükara

More images require the same VRAM, so it won't make a difference. But more images would require a smaller number of total epochs.

Tom Bloomingdale

Hello! I have this set up for SDXL using the "fast" option for fine tuning. I have a 4060 Ti with 16 GB VRAM. With 150 epochs, I'm looking at around 35 hours of training time. Am I doing something wrong?

Javi dltr

Does this work with SD 1.5?

Furkan Gözükara

For SD 1.5 we have a different config here : https://www.patreon.com/posts/very-best-config-97381002 Please also watch this tutorial : https://youtu.be/0t5l6CP9eBg