Use the HuggingFace ControlNet training script, which has more optimizations built in. I wrote an article about ControlNet training based on that script here: https://civitai.com/articles/2078
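As a rough illustration of the kind of optimizations that script can turn on (a sketch based on the diffusers API; the model ID is illustrative and exact flag/method names can vary between diffusers versions):

```python
# Sketch of optimizations the diffusers ControlNet training script exposes via flags
# such as --enable_xformers_memory_efficient_attention and --gradient_checkpointing.
# The model ID below is illustrative; adjust for your base checkpoint.
import torch
from diffusers import ControlNetModel, UNet2DConditionModel

# Load the frozen SD UNet and initialize the trainable ControlNet from it.
unet = UNet2DConditionModel.from_pretrained(
    "runwayml/stable-diffusion-v1-5", subfolder="unet", torch_dtype=torch.float32
)
controlnet = ControlNetModel.from_unet(unet)

# Memory-efficient attention (requires a working xformers install).
unet.enable_xformers_memory_efficient_attention()
controlnet.enable_xformers_memory_efficient_attention()

# Trade a little compute for memory so larger batches fit on one GPU.
controlnet.enable_gradient_checkpointing()
```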
@lllyasviel
I saw you say that training on the circle dataset is fast: after 4000 steps (batch size 4, learning rate 1e-5, about 50 minutes on an A100 PCIE 40G) it converged.
That's around 1.33 steps/sec.
I tried running the same program on RunPod using an A40, A6000, or A100 GPU, and the speed is much lower (0.55-0.7 steps/sec).
I also installed xformers (and triton) but got an error like the one in #218. The suggestion there was to try float16, but #265 (comment) says that Stable Diffusion doesn't work that well with float16.
I tried float16 with xformers and the iteration speed became roughly 3x faster (1.5 steps/sec), but training doesn't converge; it's the same issue mentioned in #265.
In the end I had to uninstall xformers, go back to float32, and tolerate 0.55-0.7 steps/sec. My problem is that I can't replicate the training speed you reported on the same GPU (A100), and that confuses me.
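For context, this is roughly the setup I'm running, adapted from tutorial_train.py in this repo (paths and hyperparameters are illustrative; the precision argument is the only thing I toggled between runs):

```python
# Adapted from tutorial_train.py; checkpoint path and logger frequency are illustrative.
import pytorch_lightning as pl
from torch.utils.data import DataLoader
from tutorial_dataset import MyDataset
from cldm.logger import ImageLogger
from cldm.model import create_model, load_state_dict

batch_size = 4
learning_rate = 1e-5

model = create_model('./models/cldm_v15.yaml').cpu()
model.load_state_dict(load_state_dict('./models/control_sd15_ini.ckpt', location='cpu'))
model.learning_rate = learning_rate
model.sd_locked = True
model.only_mid_control = False

dataset = MyDataset()
dataloader = DataLoader(dataset, num_workers=0, batch_size=batch_size, shuffle=True)
logger = ImageLogger(batch_frequency=300)

# precision=32 converges but runs at ~0.55-0.7 steps/sec for me;
# precision=16 is ~3x faster but does not converge (same issue as #265).
trainer = pl.Trainer(gpus=1, precision=32, callbacks=[logger])
trainer.fit(model, dataloader)
```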
I wonder whether you used the xformers (and/or triton) packages to help accelerate training. Does environment.yaml fully list the packages used to train the model?