You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- The current implementation of ggml_conv_2d is slow and has high memory usage
68
-
-[ ] Continuing to reduce memory usage (quantizing the weights of ggml_conv_2d)
69
-
-[ ] Implement Inpainting support
70
-
71
63
## Usage
72
64
73
65
For most users, you can download the built executable program from the latest [release](https://github.com/leejet/stable-diffusion.cpp/releases/latest).
@@ -307,9 +299,6 @@ arguments:
307
299
--taesd [TAESD_PATH] path to taesd. Using Tiny AutoEncoder for fast decoding (low quality)
308
300
--control-net [CONTROL_PATH] path to control net model
309
301
--embd-dir [EMBEDDING_PATH] path to embeddings
310
-
--stacked-id-embd-dir [DIR] path to PHOTOMAKER stacked id embeddings
311
-
--input-id-images-dir [DIR] path to PHOTOMAKER input id images dir
312
-
--normalize-input normalize PHOTOMAKER input id images
313
302
--upscale-model [ESRGAN_PATH] path to esrgan model. Upscale images after generate, just RealESRGAN_x4plus_anime_6B supported by now
314
303
--upscale-repeats Run the ESRGAN upscaler this many times (default 1)
--high-noise-steps STEPS (high noise) number of sample steps (default: -1 = auto)
354
347
SLG will be enabled at step int([STEPS]*[START]) and disabled at int([STEPS]*[END])
355
348
--strength STRENGTH strength for noising/unnoising (default: 0.75)
356
-
--style-ratio STYLE-RATIO strength for keeping input identity (default: 20)
357
349
--control-strength STRENGTH strength to apply Control Net (default: 0.9)
358
350
1.0 corresponds to full destruction of information in init image
359
351
-H, --height H image height, in pixel space (default: 512)
360
352
-W, --width W image width, in pixel space (default: 512)
361
353
--rng {std_default, cuda} RNG (default: cuda)
362
354
-s SEED, --seed SEED RNG seed (default: 42, use random seed for < 0)
363
355
-b, --batch-count COUNT number of images to generate
364
-
--clip-skip N ignore last_dot_pos layers of CLIP network; 1 ignores none, 2 ignores one layer (default: -1)
356
+
--clip-skip N ignore last layers of CLIP network; 1 ignores none, 2 ignores one layer (default: -1)
365
357
<= 0 represents unspecified, will be 1 for SD1.x, 2 for SD2.x
366
358
--vae-tiling process vae in tiles to reduce memory usage
359
+
--vae-tile-size [X]x[Y] tile size for vae tiling (default: 32x32)
360
+
--vae-relative-tile-size [X]x[Y] relative tile size for vae tiling, in fraction of image size if < 1, in number of tiles per dim if >=1 (overrides --vae-tile-size)
361
+
--vae-tile-overlap OVERLAP tile overlap for vae tiling, in fraction of tile size (default: 0.5)
367
362
--vae-on-cpu keep vae in cpu (for low vram)
368
363
--clip-on-cpu keep clip in cpu (for low vram)
369
364
--diffusion-fa use flash attention in the diffusion model (for low vram)
@@ -384,6 +379,11 @@ arguments:
384
379
--moe-boundary BOUNDARY timestep boundary for Wan2.2 MoE model. (default: 0.875)
385
380
only enabled if `--high-noise-steps` is set to -1
386
381
--flow-shift SHIFT shift value for Flow models like SD3.x or WAN (default: auto)
382
+
--vace-strength wan vace strength
383
+
--photo-maker path to PHOTOMAKER model
384
+
--pm-id-images-dir [DIR] path to PHOTOMAKER input id images dir
385
+
--pm-id-embed-path [PATH] path to PHOTOMAKER v2 id embed
386
+
--pm-style-strength strength for keeping PHOTOMAKER input identity (default: 20)
387
387
-v, --verbose print extra info
388
388
```
389
389
@@ -393,9 +393,9 @@ arguments:
393
393
./bin/sd -m ../models/sd-v1-4.ckpt -p "a lovely cat"
394
394
# ./bin/sd -m ../models/v1-5-pruned-emaonly.safetensors -p "a lovely cat"
0 commit comments