multimodalart HF Staff commited on
Commit
b13e71c
·
1 Parent(s): be245a8

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,11 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ image-0.png filter=lfs diff=lfs merge=lfs -text
37
+ image-1.png filter=lfs diff=lfs merge=lfs -text
38
+ image-2.png filter=lfs diff=lfs merge=lfs -text
39
+ image-3.png filter=lfs diff=lfs merge=lfs -text
40
+ image-4.png filter=lfs diff=lfs merge=lfs -text
41
+ image-5.png filter=lfs diff=lfs merge=lfs -text
42
+ image-6.png filter=lfs diff=lfs merge=lfs -text
43
+ image-7.png filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,98 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - stable-diffusion-xl
4
+ - stable-diffusion-xl-diffusers
5
+ - text-to-image
6
+ - diffusers
7
+ - lora
8
+ - template:sd-lora
9
+ widget:
10
+ - text: A photo of <s0><s1>
11
+ output:
12
+ url: image-0.png
13
+ - text: A photo of <s0><s1>
14
+ output:
15
+ url: image-1.png
16
+ - text: A photo of <s0><s1>
17
+ output:
18
+ url: image-2.png
19
+ - text: A photo of <s0><s1>
20
+ output:
21
+ url: image-3.png
22
+ - text: A photo of <s0><s1>
23
+ output:
24
+ url: image-4.png
25
+ - text: A photo of <s0><s1>
26
+ output:
27
+ url: image-5.png
28
+ - text: A photo of <s0><s1>
29
+ output:
30
+ url: image-6.png
31
+ - text: A photo of <s0><s1>
32
+ output:
33
+ url: image-7.png
34
+ base_model: stabilityai/stable-diffusion-xl-base-1.0
35
+ instance_prompt: A photo of <s0><s1>
36
+ license: openrail++
37
+ ---
38
+
39
+ # SDXL LoRA DreamBooth - multimodalart/poli-steps-final-face-no-caps
40
+
41
+ <Gallery />
42
+
43
+ ## Model description
44
+
45
+ ### These are multimodalart/poli-steps-final-face-no-caps LoRA adaption weights for stabilityai/stable-diffusion-xl-base-1.0.
46
+
47
+ ## Download model
48
+
49
+ ### Use it with UIs such as AUTOMATIC1111, Comfy UI, SD.Next, Invoke
50
+
51
+ - **LoRA**: download **[`poli-steps-final-face-no-caps.safetensors` here 💾](/multimodalart/poli-steps-final-face-no-caps/blob/main/poli-steps-final-face-no-caps.safetensors)**.
52
+ - Place it on your `models/Lora` folder.
53
+ - On AUTOMATIC1111, load the LoRA by adding `<lora:poli-steps-final-face-no-caps:1>` to your prompt. On ComfyUI just [load it as a regular LoRA](https://comfyanonymous.github.io/ComfyUI_examples/lora/).
54
+ - *Embeddings*: download **[`poli-steps-final-face-no-caps_emb.safetensors` here 💾](/multimodalart/poli-steps-final-face-no-caps/blob/main/poli-steps-final-face-no-caps_emb.safetensors)**.
55
+ - Place it on it on your `embeddings` folder
56
+ - Use it by adding `poli-steps-final-face-no-caps_emb` to your prompt. For example, `A photo of poli-steps-final-face-no-caps_emb`
57
+ (you need both the LoRA and the embeddings as they were trained together for this LoRA)
58
+
59
+
60
+ ## Use it with the [🧨 diffusers library](https://github.com/huggingface/diffusers)
61
+
62
+ ```py
63
+ from diffusers import AutoPipelineForText2Image
64
+ import torch
65
+ from huggingface_hub import hf_hub_download
66
+ from safetensors.torch import load_file
67
+
68
+ pipeline = AutoPipelineForText2Image.from_pretrained('stabilityai/stable-diffusion-xl-base-1.0', torch_dtype=torch.float16).to('cuda')
69
+ pipeline.load_lora_weights('multimodalart/poli-steps-final-face-no-caps', weight_name='pytorch_lora_weights.safetensors')
70
+ embedding_path = hf_hub_download(repo_id='multimodalart/poli-steps-final-face-no-caps', filename='poli-steps-final-face-no-caps_emb.safetensors' repo_type="model")
71
+ state_dict = load_file(embedding_path)
72
+ pipeline.load_textual_inversion(state_dict["clip_l"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder, tokenizer=pipeline.tokenizer)
73
+ pipeline.load_textual_inversion(state_dict["clip_g"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder_2, tokenizer=pipeline.tokenizer_2)
74
+
75
+ image = pipeline('A photo of <s0><s1>').images[0]
76
+ ```
77
+
78
+ For more details, including weighting, merging and fusing LoRAs, check the [documentation on loading LoRAs in diffusers](https://huggingface.co/docs/diffusers/main/en/using-diffusers/loading_adapters)
79
+
80
+ ## Trigger words
81
+
82
+ To trigger image generation of trained concept(or concepts) replace each concept identifier in you prompt with the new inserted tokens:
83
+
84
+ to trigger concept `TOK` → use `<s0><s1>` in your prompt
85
+
86
+
87
+
88
+ ## Details
89
+ All [Files & versions](/multimodalart/poli-steps-final-face-no-caps/tree/main).
90
+
91
+ The weights were trained using [🧨 diffusers Advanced Dreambooth Training Script](https://github.com/huggingface/diffusers/blob/main/examples/advanced_diffusion_training/train_dreambooth_lora_sdxl_advanced.py).
92
+
93
+ LoRA for the text encoder was enabled. False.
94
+
95
+ Pivotal tuning was enabled: True.
96
+
97
+ Special VAE used for training: madebyollin/sdxl-vae-fp16-fix.
98
+
image-0.png ADDED

Git LFS Details

  • SHA256: 49a87e8cb615086ffb63b486060f1c3b4455f03226709d961bc2b480c45d46c6
  • Pointer size: 132 Bytes
  • Size of remote file: 1.45 MB
image-1.png ADDED

Git LFS Details

  • SHA256: f3bb3801e60270ea537d5c3e2f68b65535b37c70ff2829ea1830952ab3042b5a
  • Pointer size: 132 Bytes
  • Size of remote file: 1.33 MB
image-2.png ADDED

Git LFS Details

  • SHA256: 4d2293919226b89edca69d71a2a89a3a7bb7869597e975787eb7bb3ce014bcb7
  • Pointer size: 132 Bytes
  • Size of remote file: 1.39 MB
image-3.png ADDED

Git LFS Details

  • SHA256: 023f542c216539e4dd2f487bc0c02a1b6ae78c659acddfe70487f499eced96ca
  • Pointer size: 132 Bytes
  • Size of remote file: 1.39 MB
image-4.png ADDED

Git LFS Details

  • SHA256: 19c0a5c4bbe70ef870dfb3b6c0c5cf47ee63ff86027c6110e3ff243d9e1f6841
  • Pointer size: 132 Bytes
  • Size of remote file: 1.43 MB
image-5.png ADDED

Git LFS Details

  • SHA256: 4687f0cf28e814158c2dd04bf32bb2354016670483234ef51671cc2524eb2fb7
  • Pointer size: 132 Bytes
  • Size of remote file: 1.4 MB
image-6.png ADDED

Git LFS Details

  • SHA256: 7d8320e0da345a0e8b0aa04e1f907df99fb03997c24676262de65dcd1ccf93c3
  • Pointer size: 132 Bytes
  • Size of remote file: 1.45 MB
image-7.png ADDED

Git LFS Details

  • SHA256: b64e19ebfeedbef880e48662be95c000859319b3d0401ec3ba38d6ec35162990
  • Pointer size: 132 Bytes
  • Size of remote file: 1.4 MB
logs/dreambooth-lora-sd-xl/1703955016.7583504/events.out.tfevents.1703955016.r-multimodalart-autotrain-poli-steps-final-face-no-ca-843d2jd4v.204.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8e62a38351605db37dc7a734a17f50fa76dad732c5d5b8640b197f84769775a6
3
+ size 3677
logs/dreambooth-lora-sd-xl/1703955016.7603874/hparams.yml ADDED
@@ -0,0 +1,74 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ adam_beta1: 0.9
2
+ adam_beta2: 0.999
3
+ adam_epsilon: 1.0e-08
4
+ adam_weight_decay: 0.0001
5
+ adam_weight_decay_text_encoder: null
6
+ allow_tf32: false
7
+ cache_dir: null
8
+ cache_latents: true
9
+ caption_column: prompt
10
+ center_crop: false
11
+ checkpointing_steps: 5000
12
+ checkpoints_total_limit: null
13
+ class_data_dir: 913cd212-ee0c-48ad-bdce-ee3f63ef034b
14
+ class_prompt: a photo of a person
15
+ crops_coords_top_left_h: 0
16
+ crops_coords_top_left_w: 0
17
+ dataloader_num_workers: 0
18
+ dataset_config_name: null
19
+ dataset_name: ./1fdc41e0-a38e-432a-a1b3-e00844997c55
20
+ enable_xformers_memory_efficient_attention: false
21
+ gradient_accumulation_steps: 1
22
+ gradient_checkpointing: true
23
+ hub_model_id: null
24
+ hub_token: null
25
+ image_column: image
26
+ instance_data_dir: null
27
+ instance_prompt: A photo of <s0><s1>
28
+ learning_rate: 1.0
29
+ local_rank: -1
30
+ logging_dir: logs
31
+ lr_num_cycles: 1
32
+ lr_power: 1.0
33
+ lr_scheduler: constant
34
+ lr_warmup_steps: 0
35
+ max_grad_norm: 1.0
36
+ max_train_steps: 960
37
+ mixed_precision: bf16
38
+ num_class_images: 150
39
+ num_new_tokens_per_abstraction: 2
40
+ num_train_epochs: 13
41
+ num_validation_images: 4
42
+ optimizer: prodigy
43
+ output_dir: poli-steps-final-face-no-caps
44
+ pretrained_model_name_or_path: stabilityai/stable-diffusion-xl-base-1.0
45
+ pretrained_vae_model_name_or_path: madebyollin/sdxl-vae-fp16-fix
46
+ prior_generation_precision: null
47
+ prior_loss_weight: 1.0
48
+ prodigy_beta3: null
49
+ prodigy_decouple: true
50
+ prodigy_safeguard_warmup: true
51
+ prodigy_use_bias_correction: true
52
+ push_to_hub: false
53
+ rank: 32
54
+ repeats: 3
55
+ report_to: tensorboard
56
+ resolution: 1024
57
+ resume_from_checkpoint: null
58
+ revision: null
59
+ sample_batch_size: 4
60
+ scale_lr: false
61
+ seed: 42
62
+ snr_gamma: null
63
+ text_encoder_lr: 1.0
64
+ token_abstraction: TOK
65
+ train_batch_size: 2
66
+ train_text_encoder: false
67
+ train_text_encoder_frac: 1.0
68
+ train_text_encoder_ti: true
69
+ train_text_encoder_ti_frac: 0.5
70
+ use_8bit_adam: false
71
+ validation_epochs: 50
72
+ validation_prompt: null
73
+ variant: null
74
+ with_prior_preservation: true
logs/dreambooth-lora-sd-xl/events.out.tfevents.1703955016.r-multimodalart-autotrain-poli-steps-final-face-no-ca-843d2jd4v.204.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:278439065344e7d2a18dd5bcf576a7a10fa6273c1d09a830b184eea2be38c774
3
+ size 80474
poli-steps-final-face-no-caps.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5ab46f98504faf6c520499d5fa7825eade9f0da008764bb9034ea3e1fc5728fb
3
+ size 186046568
poli-steps-final-face-no-caps_emb.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4b6903c7e01ef2894e322ac85bc4c4e371e2cedaacfa1f9d84dc9c3c644bd15
3
+ size 8344
pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6a1985b99a683a16bc3312953e3e7e4e806e9ff68f4675530e3a2c7c94aa1205
3
+ size 185963768