jimmycarter commited on
Commit
20e4709
·
verified ·
1 Parent(s): 76460c7

Model card auto-generated by SimpleTuner

Browse files
Files changed (1) hide show
  1. README.md +334 -0
README.md ADDED
@@ -0,0 +1,334 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ base_model: "black-forest-labs/FLUX.1-dev"
4
+ tags:
5
+ - flux
6
+ - flux-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - simpletuner
10
+ - not-for-all-audiences
11
+ - lora
12
+ - template:sd-lora
13
+ - lycoris
14
+ inference: true
15
+ widget:
16
+ - text: 'unconditional (blank prompt)'
17
+ parameters:
18
+ negative_prompt: 'blurry, cropped, ugly'
19
+ output:
20
+ url: ./assets/image_0_0.png
21
+ - text: 'In this scene from the animated series "Helluva Boss," Loona, the wolf-like receptionist of the Immediate Murder Professionals (I.M.P), is depicted leaning against a wall outside the office. She is casually engrossed in her phone, displaying her typical aloof and detached demeanor. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.'
22
+ parameters:
23
+ negative_prompt: 'blurry, cropped, ugly'
24
+ output:
25
+ url: ./assets/image_1_0.png
26
+ - text: 'Loona shrugs with an exasperated expression, her red eyes wide and frustrated, as she seemingly questions or challenges something said in the I.M.P office. Still from Helluva boss. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.'
27
+ parameters:
28
+ negative_prompt: 'blurry, cropped, ugly'
29
+ output:
30
+ url: ./assets/image_2_0.png
31
+ - text: 'A scene from the animated series "Helluva Boss," set in the office. Loona, the wolf-like receptionist with white fur, black-tipped ears, and red eyes, is seated on a couch, facing towards the viewer. Loona''s appearance is complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts. She holds a piece of paper that says,"Welcome to Losercity, jerks". In the background, the office has a striped wall pattern and visible damage on the ceiling, indicating a chaotic or rough environment. On the right side of the image, two imp characters appear to be engaged in conversation.'
32
+ parameters:
33
+ negative_prompt: 'blurry, cropped, ugly'
34
+ output:
35
+ url: ./assets/image_3_0.png
36
+ - text: 'Loona from Helluva Boss is dressed in an oversized taco costume, looking visibly irritated and embarrassed. Her red eyes convey her annoyance as she crosses her arms and glares to the side. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes'
37
+ parameters:
38
+ negative_prompt: 'blurry, cropped, ugly'
39
+ output:
40
+ url: ./assets/image_4_0.png
41
+ - text: 'Loona is standing next to Blitzo (Helluva boss)'
42
+ parameters:
43
+ negative_prompt: 'blurry, cropped, ugly'
44
+ output:
45
+ url: ./assets/image_5_0.png
46
+ - text: 'In this "Helluva Boss" scene, Loona, the wolf-like receptionist, stands in an elevator with a tense and irritated expression, her teeth bared in a snarl. Blitzø, the red demon with distinctive black and white horns, leans close and makes an adorable look, as if asking for a favor. The ornate elevator setting hints at a tense moment, possibly involving a challenging mission or conflict within the I.M.P team.'
47
+ parameters:
48
+ negative_prompt: 'blurry, cropped, ugly'
49
+ output:
50
+ url: ./assets/image_6_0.png
51
+ - text: 'a 2D simple drawing of a madeleine cake, with a green cloud drawn next to it'
52
+ parameters:
53
+ negative_prompt: 'blurry, cropped, ugly'
54
+ output:
55
+ url: ./assets/image_7_0.png
56
+ - text: 'a 3D captivating YouTube thumbnail depicting of a full detailed,it''s on a party real people like, on front there is a giant pulling a nose of a black African real like lady down to size of elephant nose,be creative and unique'
57
+ parameters:
58
+ negative_prompt: 'blurry, cropped, ugly'
59
+ output:
60
+ url: ./assets/image_8_0.png
61
+ - text: 'Whiskers the cat. Whiskers becomes a mentor to other animals.Impressed by Whiskers'' intelligence, other animals in the neighborhood seek his guidance. Whiskers sets up a virtual learning platform using AI technology, where animals can ask questions, receive personalized lessons, and acquire knowledge in various subjects. Whiskers becomes a mentor, helping others unlock their potential.'
62
+ parameters:
63
+ negative_prompt: 'blurry, cropped, ugly'
64
+ output:
65
+ url: ./assets/image_9_0.png
66
+ - text: 'As the stock market fluctuates, the investor remains calm and collected at their desk, surrounded by charts and graphs. Their tailored suit and polished briefcase are a symbol of their expertise and experience in the world of finance. '
67
+ parameters:
68
+ negative_prompt: 'blurry, cropped, ugly'
69
+ output:
70
+ url: ./assets/image_10_0.png
71
+ - text: 'loona from helluva boss is eating a donut'
72
+ parameters:
73
+ negative_prompt: 'blurry, cropped, ugly'
74
+ output:
75
+ url: ./assets/image_11_0.png
76
+ ---
77
+
78
+ # flux-training-losercity-next-lycoris8
79
+
80
+ This is a LyCORIS adapter derived from [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev).
81
+
82
+
83
+ The main validation prompt used during training was:
84
+
85
+
86
+
87
+ ```
88
+ loona from helluva boss is eating a donut
89
+ ```
90
+
91
+ ## Validation settings
92
+ - CFG: `3.5`
93
+ - CFG Rescale: `0.0`
94
+ - Steps: `15`
95
+ - Sampler: `None`
96
+ - Seed: `42`
97
+ - Resolution: `1024`
98
+
99
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
100
+
101
+ You can find some example images in the following gallery:
102
+
103
+
104
+ <Gallery />
105
+
106
+ The text encoder **was not** trained.
107
+ You may reuse the base model text encoder for inference.
108
+
109
+
110
+ ## Training settings
111
+
112
+ - Training epochs: 0
113
+ - Training steps: 100
114
+ - Learning rate: 9e-05
115
+ - Effective batch size: 16
116
+ - Micro-batch size: 1
117
+ - Gradient accumulation steps: 16
118
+ - Number of GPUs: 1
119
+ - Prediction type: flow-matching
120
+ - Rescaled betas zero SNR: False
121
+ - Optimizer: adamw_bf16
122
+ - Precision: Pure BF16
123
+ - Quantised: Yes: fp8-quanto
124
+ - Xformers: Not used
125
+ - LyCORIS Config:
126
+ ```json
127
+ {
128
+ "algo": "lokr",
129
+ "multiplier": 1.0,
130
+ "linear_dim": 1000000,
131
+ "linear_alpha": 1,
132
+ "factor": 10,
133
+ "full_matrix": true,
134
+ "apply_preset": {
135
+ "name_algo_map": {
136
+ "transformer_blocks.[0-7].attn..*": {
137
+ "algo": "lokr",
138
+ "factor": 5,
139
+ "linear_dim": 1000000,
140
+ "linear_alpha": 1,
141
+ "full_matrix": true
142
+ },
143
+ "transformer_blocks.(8|9|1[0-5])..*": {
144
+ "algo": "lokr",
145
+ "factor": 6,
146
+ "linear_dim": 1000000,
147
+ "linear_alpha": 1,
148
+ "full_matrix": true
149
+ },
150
+ "transformer_blocks.(1[6-8])..*": {
151
+ "algo": "lokr",
152
+ "factor": 12,
153
+ "linear_dim": 1000000,
154
+ "linear_alpha": 1,
155
+ "full_matrix": true
156
+ },
157
+ "single_transformer_blocks.(0|1[0-5]|[1-9])..*": {
158
+ "algo": "lokr",
159
+ "factor": 10,
160
+ "linear_dim": 1000000,
161
+ "linear_alpha": 1,
162
+ "full_matrix": true
163
+ },
164
+ "single_transformer_blocks.(1[6-9]|2[0-3])..*": {
165
+ "algo": "lokr",
166
+ "factor": 8,
167
+ "linear_dim": 1000000,
168
+ "linear_alpha": 1,
169
+ "full_matrix": true
170
+ },
171
+ "single_transformer_blocks.(2[4-9]|3[0-7])..*": {
172
+ "algo": "lokr",
173
+ "factor": 6,
174
+ "linear_dim": 1000000,
175
+ "linear_alpha": 1,
176
+ "full_matrix": true
177
+ }
178
+ }
179
+ }
180
+ }
181
+ ```
182
+
183
+ ## Datasets
184
+
185
+ ### default_dataset_arb
186
+ - Repeats: 9999
187
+ - Total number of images: 41
188
+ - Total number of aspect buckets: 11
189
+ - Resolution: 1.33 megapixels
190
+ - Cropped: False
191
+ - Crop style: None
192
+ - Crop aspect: None
193
+ ### default_dataset_arb2
194
+ - Repeats: 9999
195
+ - Total number of images: 2565
196
+ - Total number of aspect buckets: 1
197
+ - Resolution: 1.33 megapixels
198
+ - Cropped: False
199
+ - Crop style: None
200
+ - Crop aspect: None
201
+ ### default_dataset_arb3
202
+ - Repeats: 9999
203
+ - Total number of images: 3220
204
+ - Total number of aspect buckets: 24
205
+ - Resolution: 1.33 megapixels
206
+ - Cropped: False
207
+ - Crop style: None
208
+ - Crop aspect: None
209
+ ### default_dataset
210
+ - Repeats: 9999
211
+ - Total number of images: 42
212
+ - Total number of aspect buckets: 1
213
+ - Resolution: 1.048576 megapixels
214
+ - Cropped: True
215
+ - Crop style: center
216
+ - Crop aspect: square
217
+ ### default_dataset_512
218
+ - Repeats: 9999
219
+ - Total number of images: 42
220
+ - Total number of aspect buckets: 1
221
+ - Resolution: 0.262144 megapixels
222
+ - Cropped: True
223
+ - Crop style: center
224
+ - Crop aspect: square
225
+ ### default_dataset_640
226
+ - Repeats: 9999
227
+ - Total number of images: 42
228
+ - Total number of aspect buckets: 1
229
+ - Resolution: 0.4096 megapixels
230
+ - Cropped: True
231
+ - Crop style: center
232
+ - Crop aspect: square
233
+ ### default_dataset_768
234
+ - Repeats: 9999
235
+ - Total number of images: 42
236
+ - Total number of aspect buckets: 1
237
+ - Resolution: 0.589824 megapixels
238
+ - Cropped: True
239
+ - Crop style: center
240
+ - Crop aspect: square
241
+ ### default_dataset_896
242
+ - Repeats: 9999
243
+ - Total number of images: 42
244
+ - Total number of aspect buckets: 1
245
+ - Resolution: 0.802816 megapixels
246
+ - Cropped: True
247
+ - Crop style: center
248
+ - Crop aspect: square
249
+ ### default_dataset_uncaptioned
250
+ - Repeats: 9999
251
+ - Total number of images: 2565
252
+ - Total number of aspect buckets: 1
253
+ - Resolution: 1.048576 megapixels
254
+ - Cropped: True
255
+ - Crop style: center
256
+ - Crop aspect: square
257
+ ### default_dataset_uncaptioned_512
258
+ - Repeats: 9999
259
+ - Total number of images: 2565
260
+ - Total number of aspect buckets: 1
261
+ - Resolution: 0.262144 megapixels
262
+ - Cropped: True
263
+ - Crop style: center
264
+ - Crop aspect: square
265
+ ### default_dataset_art
266
+ - Repeats: 9999
267
+ - Total number of images: 2482
268
+ - Total number of aspect buckets: 1
269
+ - Resolution: 1.048576 megapixels
270
+ - Cropped: True
271
+ - Crop style: center
272
+ - Crop aspect: square
273
+ ### default_dataset_art_512
274
+ - Repeats: 9999
275
+ - Total number of images: 3193
276
+ - Total number of aspect buckets: 1
277
+ - Resolution: 0.262144 megapixels
278
+ - Cropped: True
279
+ - Crop style: center
280
+ - Crop aspect: square
281
+ ### default_dataset_art_640
282
+ - Repeats: 9999
283
+ - Total number of images: 3115
284
+ - Total number of aspect buckets: 1
285
+ - Resolution: 0.4096 megapixels
286
+ - Cropped: True
287
+ - Crop style: random
288
+ - Crop aspect: square
289
+ ### default_dataset_art_768
290
+ - Repeats: 9999
291
+ - Total number of images: 2989
292
+ - Total number of aspect buckets: 1
293
+ - Resolution: 0.589824 megapixels
294
+ - Cropped: True
295
+ - Crop style: random
296
+ - Crop aspect: square
297
+ ### default_dataset_art_896
298
+ - Repeats: 9999
299
+ - Total number of images: 2787
300
+ - Total number of aspect buckets: 1
301
+ - Resolution: 0.802816 megapixels
302
+ - Cropped: True
303
+ - Crop style: random
304
+ - Crop aspect: square
305
+
306
+
307
+ ## Inference
308
+
309
+
310
+ ```python
311
+ import torch
312
+ from diffusers import DiffusionPipeline
313
+ from lycoris import create_lycoris_from_weights
314
+
315
+ model_id = 'black-forest-labs/FLUX.1-dev'
316
+ adapter_id = 'pytorch_lora_weights.safetensors' # you will have to download this manually
317
+ lora_scale = 1.0
318
+ wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_id, pipeline.transformer)
319
+ wrapper.merge_to()
320
+
321
+ prompt = "loona from helluva boss is eating a donut"
322
+
323
+ pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
324
+ image = pipeline(
325
+ prompt=prompt,
326
+ num_inference_steps=15,
327
+ generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
328
+ width=1024,
329
+ height=1024,
330
+ guidance_scale=3.5,
331
+ ).images[0]
332
+ image.save("output.png", format="PNG")
333
+ ```
334
+