multimodalart HF Staff committed on
Commit
5534875
·
1 Parent(s): 083123c

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,11 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ image-0.png filter=lfs diff=lfs merge=lfs -text
37
+ image-1.png filter=lfs diff=lfs merge=lfs -text
38
+ image-2.png filter=lfs diff=lfs merge=lfs -text
39
+ image-3.png filter=lfs diff=lfs merge=lfs -text
40
+ image-4.png filter=lfs diff=lfs merge=lfs -text
41
+ image-5.png filter=lfs diff=lfs merge=lfs -text
42
+ image-6.png filter=lfs diff=lfs merge=lfs -text
43
+ image-7.png filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,98 @@
1
+ ---
2
+ tags:
3
+ - stable-diffusion-xl
4
+ - stable-diffusion-xl-diffusers
5
+ - text-to-image
6
+ - diffusers
7
+ - lora
8
+ - template:sd-lora
9
+ widget:
10
+ - text: <s0><s1>
11
+ output:
12
+ url: image-0.png
13
+ - text: <s0><s1>
14
+ output:
15
+ url: image-1.png
16
+ - text: <s0><s1>
17
+ output:
18
+ url: image-2.png
19
+ - text: <s0><s1>
20
+ output:
21
+ url: image-3.png
22
+ - text: <s0><s1>
23
+ output:
24
+ url: image-4.png
25
+ - text: <s0><s1>
26
+ output:
27
+ url: image-5.png
28
+ - text: <s0><s1>
29
+ output:
30
+ url: image-6.png
31
+ - text: <s0><s1>
32
+ output:
33
+ url: image-7.png
34
+ base_model: stabilityai/stable-diffusion-xl-base-1.0
35
+ instance_prompt: <s0><s1>
36
+ license: openrail++
37
+ ---
38
+
39
+ # SDXL LoRA DreamBooth - multimodalart/poli-steps-final-face-token-only
40
+
41
+ <Gallery />
42
+
43
+ ## Model description
44
+
45
+ ### These are multimodalart/poli-steps-final-face-token-only LoRA adaptation weights for stabilityai/stable-diffusion-xl-base-1.0.
46
+
47
+ ## Download model
48
+
49
+ ### Use it with UIs such as AUTOMATIC1111, ComfyUI, SD.Next, Invoke
50
+
51
+ - **LoRA**: download **[`poli-steps-final-face-token-only.safetensors` here 💾](/multimodalart/poli-steps-final-face-token-only/blob/main/poli-steps-final-face-token-only.safetensors)**.
52
+ - Place it in your `models/Lora` folder.
53
+ - On AUTOMATIC1111, load the LoRA by adding `<lora:poli-steps-final-face-token-only:1>` to your prompt. On ComfyUI just [load it as a regular LoRA](https://comfyanonymous.github.io/ComfyUI_examples/lora/).
54
+ - **Embeddings**: download **[`poli-steps-final-face-token-only_emb.safetensors` here 💾](/multimodalart/poli-steps-final-face-token-only/blob/main/poli-steps-final-face-token-only_emb.safetensors)**.
55
+ - Place it in your `embeddings` folder.
56
+ - Use it by adding `poli-steps-final-face-token-only_emb` to your prompt.
57
+ (You need both the LoRA and the embeddings, as they were trained together for this LoRA.)
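The AUTOMATIC1111 steps above come down to writing a prompt that contains both the embedding name and the LoRA tag. A minimal sketch of that composition follows; the helper `build_a1111_prompt` and the example description text are hypothetical, and only the tag and embedding name come from the README.

```python
def build_a1111_prompt(description: str,
                       lora_name: str = "poli-steps-final-face-token-only",
                       weight: float = 1.0) -> str:
    """Compose an AUTOMATIC1111-style prompt string (hypothetical helper).

    The embedding is referenced by its file name without the extension,
    and the LoRA is referenced with a <lora:NAME:WEIGHT> tag.
    """
    embedding_name = f"{lora_name}_emb"
    # ':g' prints 1.0 as "1" and 0.8 as "0.8", matching the README's tag style.
    return f"{description} {embedding_name} <lora:{lora_name}:{weight:g}>"


prompt = build_a1111_prompt("a portrait of")
print(prompt)
```

Lowering `weight` (for example to `0.8`) is the usual way to soften a LoRA's effect in these UIs; whether that helps for this particular model is not stated in the README.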
58
+
59
+
60
+ ## Use it with the [🧨 diffusers library](https://github.com/huggingface/diffusers)
61
+
62
+ ```py
63
+ from diffusers import AutoPipelineForText2Image
64
+ import torch
65
+ from huggingface_hub import hf_hub_download
66
+ from safetensors.torch import load_file
67
+
68
+ pipeline = AutoPipelineForText2Image.from_pretrained('stabilityai/stable-diffusion-xl-base-1.0', torch_dtype=torch.float16).to('cuda')
69
+ pipeline.load_lora_weights('multimodalart/poli-steps-final-face-token-only', weight_name='pytorch_lora_weights.safetensors')
70
+ embedding_path = hf_hub_download(repo_id='multimodalart/poli-steps-final-face-token-only', filename='poli-steps-final-face-token-only_emb.safetensors', repo_type="model")
71
+ state_dict = load_file(embedding_path)
72
+ pipeline.load_textual_inversion(state_dict["clip_l"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder, tokenizer=pipeline.tokenizer)
73
+ pipeline.load_textual_inversion(state_dict["clip_g"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder_2, tokenizer=pipeline.tokenizer_2)
74
+
75
+ image = pipeline('<s0><s1>').images[0]
76
+ ```
77
+
78
+ For more details, including weighting, merging and fusing LoRAs, check the [documentation on loading LoRAs in diffusers](https://huggingface.co/docs/diffusers/main/en/using-diffusers/loading_adapters)
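As a rough illustration of the weighting mentioned above, diffusers pipelines accept a LoRA scale at call time through `cross_attention_kwargs`. The sketch below assumes a `pipeline` object already prepared as in the README snippet (LoRA weights and both textual-inversion embeddings loaded); the wrapper name `generate_with_lora_scale` is hypothetical.

```python
def generate_with_lora_scale(pipeline, prompt: str, lora_scale: float = 0.8):
    """Run one generation with the LoRA contribution scaled (hypothetical helper).

    `pipeline` is assumed to be the SDXL pipeline from the README snippet,
    with the LoRA and the <s0><s1> embeddings already loaded.
    A scale of 0.0 ignores the LoRA; 1.0 applies it at full strength.
    """
    result = pipeline(prompt, cross_attention_kwargs={"scale": lora_scale})
    return result.images[0]


# Usage, assuming `pipeline` was built as in the README:
# image = generate_with_lora_scale(pipeline, "<s0><s1>", lora_scale=0.7)
```

Passing the scale per call keeps the base weights untouched, so different strengths can be compared without reloading anything.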
79
+
80
+ ## Trigger words
81
+
82
+ To trigger image generation of the trained concept (or concepts), replace each concept identifier in your prompt with the newly inserted tokens:
83
+
84
+ to trigger concept `TOK` → use `<s0><s1>` in your prompt
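The substitution described above can be sketched as a plain string replacement. The function name `apply_trigger_tokens` is hypothetical; the defaults mirror `token_abstraction: TOK` and `num_new_tokens_per_abstraction: 2` from this commit's `hparams.yml`, and this is an illustration rather than the training script's own code.

```python
def apply_trigger_tokens(prompt: str,
                         token_abstraction: str = "TOK",
                         num_new_tokens: int = 2) -> str:
    """Replace the concept identifier with the inserted tokens (hypothetical helper)."""
    # Builds "<s0><s1>" for two new tokens.
    tokens = "".join(f"<s{i}>" for i in range(num_new_tokens))
    # Note: str.replace also matches "TOK" inside longer words such as "TOKEN".
    return prompt.replace(token_abstraction, tokens)


print(apply_trigger_tokens("a photo of TOK"))  # prints: a photo of <s0><s1>
```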
85
+
86
+
87
+
88
+ ## Details
89
+ All [Files & versions](/multimodalart/poli-steps-final-face-token-only/tree/main).
90
+
91
+ The weights were trained using [🧨 diffusers Advanced Dreambooth Training Script](https://github.com/huggingface/diffusers/blob/main/examples/advanced_diffusion_training/train_dreambooth_lora_sdxl_advanced.py).
92
+
93
+ LoRA for the text encoder was enabled: False.
94
+
95
+ Pivotal tuning was enabled: True.
96
+
97
+ Special VAE used for training: madebyollin/sdxl-vae-fp16-fix.
98
+
image-0.png ADDED

Git LFS Details

  • SHA256: b26b1fab4af3492b5ee78c75a1e19be77e980ee3632ffad629196db151904695
  • Pointer size: 132 Bytes
  • Size of remote file: 1.48 MB
image-1.png ADDED

Git LFS Details

  • SHA256: 4fd987b353e608b9e4cb2d36cd4286b11d0cda9b324b0f9468406eddff9bc02a
  • Pointer size: 132 Bytes
  • Size of remote file: 1.43 MB
image-2.png ADDED

Git LFS Details

  • SHA256: 833302de201574d8a3a079ba92fae1dccec9c2799bd3deddc988d4e94083396a
  • Pointer size: 132 Bytes
  • Size of remote file: 1.44 MB
image-3.png ADDED

Git LFS Details

  • SHA256: 56f8cc774b6e80068f24e74c7f9dce8035c17d8ca2c0e9589a3ff2533649c59c
  • Pointer size: 132 Bytes
  • Size of remote file: 1.39 MB
image-4.png ADDED

Git LFS Details

  • SHA256: 87ee7e0cb1bc167fd14a5a6764ba5c8db7c302cd494cbc703da074b4485eb6f7
  • Pointer size: 132 Bytes
  • Size of remote file: 1.39 MB
image-5.png ADDED

Git LFS Details

  • SHA256: e57903d9eb15e70c89e2d2eee034e3330e50fe2ecb993a2ef7df2f20ba2d6b4b
  • Pointer size: 132 Bytes
  • Size of remote file: 1.46 MB
image-6.png ADDED

Git LFS Details

  • SHA256: 1be0fb9b074bb3fedd9bde3a8a378e68062b022d7d483873c8ef88d548bf8b35
  • Pointer size: 132 Bytes
  • Size of remote file: 1.47 MB
image-7.png ADDED

Git LFS Details

  • SHA256: 8af747178137de3466ac6d01e21f40f44a1f7bf73a121f112faacd37c8c736c8
  • Pointer size: 132 Bytes
  • Size of remote file: 1.42 MB
logs/dreambooth-lora-sd-xl/1703955068.3748171/events.out.tfevents.1703955068.r-multimodalart-autotrain-poli-steps-final-face-token-fe36gj8j8.204.1 ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd5c988b46187c4740756dc4446e3d7e98d1eb70a5c5a96c4a8a90df862f0dbc
3
+ size 3669
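The three added lines above are a Git LFS pointer: the repository stores this small text file, while the binary lives in LFS storage. A minimal sketch of reading such a pointer is shown below; `parse_lfs_pointer` is a hypothetical helper and assumes the simple `key value` layout seen in this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its fields (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is "key value", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid is written as "<hash algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file, in bytes
    }


pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:cd5c988b46187c4740756dc4446e3d7e98d1eb70a5c5a96c4a8a90df862f0dbc
size 3669
"""
info = parse_lfs_pointer(pointer)
print(info["hash_algorithm"], info["size"])  # prints: sha256 3669
```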
logs/dreambooth-lora-sd-xl/1703955068.376803/hparams.yml ADDED
@@ -0,0 +1,74 @@
1
+ adam_beta1: 0.9
2
+ adam_beta2: 0.999
3
+ adam_epsilon: 1.0e-08
4
+ adam_weight_decay: 0.0001
5
+ adam_weight_decay_text_encoder: null
6
+ allow_tf32: false
7
+ cache_dir: null
8
+ cache_latents: true
9
+ caption_column: prompt
10
+ center_crop: false
11
+ checkpointing_steps: 5000
12
+ checkpoints_total_limit: null
13
+ class_data_dir: 43354837-6197-464a-b8fb-91610f16ea81
14
+ class_prompt: a photo of a person
15
+ crops_coords_top_left_h: 0
16
+ crops_coords_top_left_w: 0
17
+ dataloader_num_workers: 0
18
+ dataset_config_name: null
19
+ dataset_name: ./a04c294d-cd2f-414a-8030-c453d5f60289
20
+ enable_xformers_memory_efficient_attention: false
21
+ gradient_accumulation_steps: 1
22
+ gradient_checkpointing: true
23
+ hub_model_id: null
24
+ hub_token: null
25
+ image_column: image
26
+ instance_data_dir: null
27
+ instance_prompt: <s0><s1>
28
+ learning_rate: 1.0
29
+ local_rank: -1
30
+ logging_dir: logs
31
+ lr_num_cycles: 1
32
+ lr_power: 1.0
33
+ lr_scheduler: constant
34
+ lr_warmup_steps: 0
35
+ max_grad_norm: 1.0
36
+ max_train_steps: 960
37
+ mixed_precision: bf16
38
+ num_class_images: 150
39
+ num_new_tokens_per_abstraction: 2
40
+ num_train_epochs: 13
41
+ num_validation_images: 4
42
+ optimizer: prodigy
43
+ output_dir: poli-steps-final-face-token-only
44
+ pretrained_model_name_or_path: stabilityai/stable-diffusion-xl-base-1.0
45
+ pretrained_vae_model_name_or_path: madebyollin/sdxl-vae-fp16-fix
46
+ prior_generation_precision: null
47
+ prior_loss_weight: 1.0
48
+ prodigy_beta3: null
49
+ prodigy_decouple: true
50
+ prodigy_safeguard_warmup: true
51
+ prodigy_use_bias_correction: true
52
+ push_to_hub: false
53
+ rank: 32
54
+ repeats: 3
55
+ report_to: tensorboard
56
+ resolution: 1024
57
+ resume_from_checkpoint: null
58
+ revision: null
59
+ sample_batch_size: 4
60
+ scale_lr: false
61
+ seed: 42
62
+ snr_gamma: null
63
+ text_encoder_lr: 1.0
64
+ token_abstraction: TOK
65
+ train_batch_size: 2
66
+ train_text_encoder: false
67
+ train_text_encoder_frac: 1.0
68
+ train_text_encoder_ti: true
69
+ train_text_encoder_ti_frac: 0.5
70
+ use_8bit_adam: false
71
+ validation_epochs: 50
72
+ validation_prompt: null
73
+ variant: null
74
+ with_prior_preservation: true
logs/dreambooth-lora-sd-xl/events.out.tfevents.1703955068.r-multimodalart-autotrain-poli-steps-final-face-token-fe36gj8j8.204.0 ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0a032cdcbcf895ccc79a9766d82916085a60089c926fcd02b6e1f46dd67b07ad
3
+ size 80474
poli-steps-final-face-token-only.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c2da1ead39f22c8c57847f16b6763ad7f33bd6a0f0ed258d71fb7365eda210a4
3
+ size 186046568
poli-steps-final-face-token-only_emb.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4b6903c7e01ef2894e322ac85bc4c4e371e2cedaacfa1f9d84dc9c3c644bd15
3
+ size 8344
pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:58e63bc4d360962d098987067db56771ac1f12647b4e883ef2bc8231a8275680
3
+ size 185963768