multimodalart HF Staff committed on
Commit c2e944d · 1 Parent(s): 81d1597

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,11 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ image-0.png filter=lfs diff=lfs merge=lfs -text
+ image-1.png filter=lfs diff=lfs merge=lfs -text
+ image-2.png filter=lfs diff=lfs merge=lfs -text
+ image-3.png filter=lfs diff=lfs merge=lfs -text
+ image-4.png filter=lfs diff=lfs merge=lfs -text
+ image-5.png filter=lfs diff=lfs merge=lfs -text
+ image-6.png filter=lfs diff=lfs merge=lfs -text
+ image-7.png filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,98 @@
+ ---
+ tags:
+ - stable-diffusion-xl
+ - stable-diffusion-xl-diffusers
+ - text-to-image
+ - diffusers
+ - lora
+ - template:sd-lora
+ widget:
+ - text: A photo of <s0><s1> a man wearing headphones and a blue shirt
+   output:
+     url: image-0.png
+ - text: A photo of <s0><s1> a bald man wearing glasses and a white t - shirt
+   output:
+     url: image-1.png
+ - text: A photo of <s0><s1> a man with glasses and a beard smiles
+   output:
+     url: image-2.png
+ - text: A photo of <s0><s1> a bald man with glasses and a colorful shirt
+   output:
+     url: image-3.png
+ - text: A photo of <s0><s1> a man with glasses and a hat wearing an orange cap
+   output:
+     url: image-4.png
+ - text: A photo of <s0><s1> a man wearing glasses and a yellow hat taking a selfie
+   output:
+     url: image-5.png
+ - text: A photo of <s0><s1> a man wearing a yellow hat and glasses
+   output:
+     url: image-6.png
+ - text: A photo of <s0><s1> a man with glasses and a beard smiles for the camera
+   output:
+     url: image-7.png
+ base_model: stabilityai/stable-diffusion-xl-base-1.0
+ instance_prompt: A photo of <s0><s1>
+ license: openrail++
+ ---
+
+ # SDXL LoRA DreamBooth - multimodalart/poli-steps-final-face
+
+ <Gallery />
+
+ ## Model description
+
+ ### These are multimodalart/poli-steps-final-face LoRA adaptation weights for stabilityai/stable-diffusion-xl-base-1.0.
+
+ ## Download model
+
+ ### Use it with UIs such as AUTOMATIC1111, ComfyUI, SD.Next, Invoke
+
+ - **LoRA**: download **[`poli-steps-final-face.safetensors` here 💾](/multimodalart/poli-steps-final-face/blob/main/poli-steps-final-face.safetensors)**.
+     - Place it in your `models/Lora` folder.
+     - On AUTOMATIC1111, load the LoRA by adding `<lora:poli-steps-final-face:1>` to your prompt. On ComfyUI, just [load it as a regular LoRA](https://comfyanonymous.github.io/ComfyUI_examples/lora/).
+ - **Embeddings**: download **[`poli-steps-final-face_emb.safetensors` here 💾](/multimodalart/poli-steps-final-face/blob/main/poli-steps-final-face_emb.safetensors)**.
+     - Place it in your `embeddings` folder.
+     - Use it by adding `poli-steps-final-face_emb` to your prompt. For example, `A photo of poli-steps-final-face_emb`.
+     (You need both the LoRA and the embeddings, as they were trained together for this LoRA.)
+
+
+ ## Use it with the [🧨 diffusers library](https://github.com/huggingface/diffusers)
+
+ ```py
+ from diffusers import AutoPipelineForText2Image
+ import torch
+ from huggingface_hub import hf_hub_download
+ from safetensors.torch import load_file
+
+ # Load the SDXL base pipeline in half precision and attach the LoRA weights.
+ pipeline = AutoPipelineForText2Image.from_pretrained('stabilityai/stable-diffusion-xl-base-1.0', torch_dtype=torch.float16).to('cuda')
+ pipeline.load_lora_weights('multimodalart/poli-steps-final-face', weight_name='pytorch_lora_weights.safetensors')
+
+ # Download the trained embeddings and load them into both SDXL text encoders.
+ embedding_path = hf_hub_download(repo_id='multimodalart/poli-steps-final-face', filename='poli-steps-final-face_emb.safetensors', repo_type="model")
+ state_dict = load_file(embedding_path)
+ pipeline.load_textual_inversion(state_dict["clip_l"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder, tokenizer=pipeline.tokenizer)
+ pipeline.load_textual_inversion(state_dict["clip_g"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder_2, tokenizer=pipeline.tokenizer_2)
+
+ # Generate an image using the trigger tokens.
+ image = pipeline('A photo of <s0><s1>').images[0]
+ ```
+
+ For more details, including weighting, merging and fusing LoRAs, check the [documentation on loading LoRAs in diffusers](https://huggingface.co/docs/diffusers/main/en/using-diffusers/loading_adapters).
+
+ ## Trigger words
+
+ To trigger image generation of the trained concept (or concepts), replace each concept identifier in your prompt with the newly inserted tokens:
+
+ to trigger concept `TOK` → use `<s0><s1>` in your prompt
+
+
+
+ ## Details
+ All [Files & versions](/multimodalart/poli-steps-final-face/tree/main).
+
+ The weights were trained using the [🧨 diffusers Advanced Dreambooth Training Script](https://github.com/huggingface/diffusers/blob/main/examples/advanced_diffusion_training/train_dreambooth_lora_sdxl_advanced.py).
+
+ LoRA for the text encoder was enabled: False.
+
+ Pivotal tuning was enabled: True.
+
+ Special VAE used for training: madebyollin/sdxl-vae-fp16-fix.
+
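As a small illustration of the trigger-word substitution described in the README above, the sketch below swaps the placeholder `TOK` for the two inserted tokens. The helper name `expand_trigger` and the `TOKEN_MAP` dictionary are hypothetical and not part of diffusers; only the mapping `TOK` → `<s0><s1>` comes from the README.

```python
import re

# Hypothetical mapping, not a diffusers API: concept identifier -> inserted tokens.
# The single entry is taken from the README's "Trigger words" section.
TOKEN_MAP = {"TOK": "<s0><s1>"}


def expand_trigger(prompt: str, token_map: dict[str, str] = TOKEN_MAP) -> str:
    """Replace each whole-word concept identifier with its inserted tokens."""
    for identifier, tokens in token_map.items():
        pattern = rf"\b{re.escape(identifier)}\b"
        # A function replacement avoids any backslash handling in `tokens`.
        prompt = re.sub(pattern, lambda _match, t=tokens: t, prompt)
    return prompt


prompt = expand_trigger("A photo of TOK wearing a yellow hat")
```

The resulting string can be passed to the pipeline call shown in the README. Matching on word boundaries keeps words that merely contain `TOK` (such as `TOKEN`) untouched.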
image-0.png ADDED

Git LFS Details

  • SHA256: 59c82cac8136acface45cf87d68d95fb0894df87ab1f39ced85bc8656c65f02c
  • Pointer size: 132 Bytes
  • Size of remote file: 1.41 MB
image-1.png ADDED

Git LFS Details

  • SHA256: f13c344f9e4a3e7e6d62bea7ec1ac4a4208663d728bf919f652f508a32cf481d
  • Pointer size: 132 Bytes
  • Size of remote file: 1.47 MB
image-2.png ADDED

Git LFS Details

  • SHA256: 93013354c8d706fc2fe8d7b91387e6d7288e77cb491f1386c4d14d92f5383ba7
  • Pointer size: 132 Bytes
  • Size of remote file: 1.45 MB
image-3.png ADDED

Git LFS Details

  • SHA256: 39b765d9ef98fc5076f7f8cd32ed48357c2cadd1c7f2084ca9dd9ae2163d737d
  • Pointer size: 132 Bytes
  • Size of remote file: 1.42 MB
image-4.png ADDED

Git LFS Details

  • SHA256: 4b477b1e638b5646980bd700ecabe394b961966e998610c8934017ae8a87c24e
  • Pointer size: 132 Bytes
  • Size of remote file: 1.44 MB
image-5.png ADDED

Git LFS Details

  • SHA256: de02d103721791ba0da3ca6b59d5e35943a0080e23eaedd60b798db050017673
  • Pointer size: 132 Bytes
  • Size of remote file: 1.44 MB
image-6.png ADDED

Git LFS Details

  • SHA256: 1c97d1f9c2447e6ca7f6863ef45ef3bf284e1a93b8d98d78c147ef203aa7d5a9
  • Pointer size: 132 Bytes
  • Size of remote file: 1.54 MB
image-7.png ADDED

Git LFS Details

  • SHA256: 815f27993f44f96247b12e9790ebb6afc90b2e393699778be15f2f2c97767224
  • Pointer size: 132 Bytes
  • Size of remote file: 1.43 MB
logs/dreambooth-lora-sd-xl/1703955036.7084467/events.out.tfevents.1703955036.r-multimodalart-autotrain-poli-steps-final-face-pezns-c565bpn88.204.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:60b9a9971d07cd578d5954cb98e6cf8b9bb511714902252fb87801e08b4e3985
+ size 3669
logs/dreambooth-lora-sd-xl/1703955036.7103908/hparams.yml ADDED
@@ -0,0 +1,74 @@
+ adam_beta1: 0.9
+ adam_beta2: 0.999
+ adam_epsilon: 1.0e-08
+ adam_weight_decay: 0.0001
+ adam_weight_decay_text_encoder: null
+ allow_tf32: false
+ cache_dir: null
+ cache_latents: true
+ caption_column: prompt
+ center_crop: false
+ checkpointing_steps: 5000
+ checkpoints_total_limit: null
+ class_data_dir: f8d74f28-ec4d-4f5e-ac00-dd7d80c27e5c
+ class_prompt: a photo of a person
+ crops_coords_top_left_h: 0
+ crops_coords_top_left_w: 0
+ dataloader_num_workers: 0
+ dataset_config_name: null
+ dataset_name: ./8991b3f3-213a-48f4-95cb-9e643bf833dc
+ enable_xformers_memory_efficient_attention: false
+ gradient_accumulation_steps: 1
+ gradient_checkpointing: true
+ hub_model_id: null
+ hub_token: null
+ image_column: image
+ instance_data_dir: null
+ instance_prompt: A photo of <s0><s1>
+ learning_rate: 1.0
+ local_rank: -1
+ logging_dir: logs
+ lr_num_cycles: 1
+ lr_power: 1.0
+ lr_scheduler: constant
+ lr_warmup_steps: 0
+ max_grad_norm: 1.0
+ max_train_steps: 960
+ mixed_precision: bf16
+ num_class_images: 150
+ num_new_tokens_per_abstraction: 2
+ num_train_epochs: 13
+ num_validation_images: 4
+ optimizer: prodigy
+ output_dir: poli-steps-final-face
+ pretrained_model_name_or_path: stabilityai/stable-diffusion-xl-base-1.0
+ pretrained_vae_model_name_or_path: madebyollin/sdxl-vae-fp16-fix
+ prior_generation_precision: null
+ prior_loss_weight: 1.0
+ prodigy_beta3: null
+ prodigy_decouple: true
+ prodigy_safeguard_warmup: true
+ prodigy_use_bias_correction: true
+ push_to_hub: false
+ rank: 32
+ repeats: 3
+ report_to: tensorboard
+ resolution: 1024
+ resume_from_checkpoint: null
+ revision: null
+ sample_batch_size: 4
+ scale_lr: false
+ seed: 42
+ snr_gamma: null
+ text_encoder_lr: 1.0
+ token_abstraction: TOK
+ train_batch_size: 2
+ train_text_encoder: false
+ train_text_encoder_frac: 1.0
+ train_text_encoder_ti: true
+ train_text_encoder_ti_frac: 0.5
+ use_8bit_adam: false
+ validation_epochs: 50
+ validation_prompt: null
+ variant: null
+ with_prior_preservation: true
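The step and epoch counts in `hparams.yml` above can be sanity-checked with a little arithmetic. The sketch below is a rough reading and not the training script's own code. Two things in it are assumptions that the file does not state: that one epoch iterates over `num_class_images` samples, and that `train_text_encoder_ti_frac` is applied as a fraction of the epoch count.

```python
import math

# Values copied from hparams.yml.
max_train_steps = 960
num_train_epochs = 13
train_batch_size = 2
gradient_accumulation_steps = 1
num_class_images = 150
train_text_encoder_ti_frac = 0.5

# ASSUMPTION: one epoch covers num_class_images samples (not recorded in the file).
batches_per_epoch = math.ceil(num_class_images / train_batch_size)
steps_per_epoch = math.ceil(batches_per_epoch / gradient_accumulation_steps)

# Epochs needed to reach max_train_steps under that assumption.
derived_epochs = math.ceil(max_train_steps / steps_per_epoch)

# ASSUMPTION: the textual-inversion phase spans this fraction of the epochs.
ti_epochs = int(train_text_encoder_ti_frac * num_train_epochs)

print(steps_per_epoch, derived_epochs, ti_epochs)
```

Under these assumptions the derived epoch count agrees with the recorded `num_train_epochs: 13`, which suggests the reading is plausible but does not confirm it.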
logs/dreambooth-lora-sd-xl/events.out.tfevents.1703955036.r-multimodalart-autotrain-poli-steps-final-face-pezns-c565bpn88.204.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:60ad515de6600dd8869134a2084cad8c0e1c99700eec4f217f5f797f7e0992fb
+ size 80474
poli-steps-final-face.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:bb110c9ab2eb69cb16fbda35b6c5f18881c5f8e2cf55f3fe6a0ef5606ba9a7ac
+ size 186046568
poli-steps-final-face_emb.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f4b6903c7e01ef2894e322ac85bc4c4e371e2cedaacfa1f9d84dc9c3c644bd15
+ size 8344
pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c9e5394f76c5d05d98aa999e7c6de40641dd2a2693e97dc7c49c10d415926c32
+ size 185963768
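The three-line entries shown for the `.safetensors` and `tfevents` files are Git LFS pointers (a `version`, an `oid`, and a `size`), not the files themselves. A minimal sketch of reading one and checking a downloaded file against it follows. The function names `parse_lfs_pointer` and `matches_pointer` are illustrative and do not belong to any library.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict[str, str]:
    """Split a Git LFS pointer into its space-separated key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer: dict[str, str]) -> bool:
    """Return True if the local file's size and SHA-256 match the pointer."""
    algo, _, expected = pointer["oid"].partition(":")
    if algo != "sha256" or path.stat().st_size != int(pointer["size"]):
        return False
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        # Hash in 1 MiB chunks so large weight files need not fit in memory.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected


# Pointer for poli-steps-final-face_emb.safetensors, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:f4b6903c7e01ef2894e322ac85bc4c4e371e2cedaacfa1f9d84dc9c3c644bd15\n"
    "size 8344\n"
)
```

Calling `matches_pointer(Path(local_file), pointer)` on a downloaded copy would then confirm that the file is the one this commit refers to; `local_file` stands for wherever you saved it.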