diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2025-12-14 00:14:23 +08:00

Author	SHA1	Message	Date
Sayak Paul	4ace7d0483	[chore] change licensing to 2025 from 2024. (#10615 ) change licensing to 2025 from 2024.	2025-01-20 16:57:27 -10:00
Tolga Cangöz	7071b7461b	Errata: Fix typos & `\s+$` (#9008 ) * Fix typos * chore: Fix typos * chore: Update README.md for promptdiffusion example * Trim trailing white spaces * Fix a typo * update number * chore: update number * Trim trailing white space * Update README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-08-02 21:24:25 -07:00
Sayak Paul	e25e525fde	[LoRA test suite] refactor the test suite and cleanse it (#7316 ) * cleanse and refactor lora testing suite. * more cleanup. * make check_if_lora_correctly_set a utility function * fix: typo * retrigger ci * style	2024-03-20 17:13:52 +05:30
Sayak Paul	30e5e81d58	change to 2024 in the license (#6902 ) change to 2024	2024-02-08 08:19:31 -10:00
Sayak Paul	1835510524	Remove `torch_dtype` in to() to end deprecation (#6886 ) * remove torch_dtype from to() * remove torch_dtype from usage scripts. * remove old lora backend * Revert "remove old lora backend" This reverts commit `adcddf6ba4`.	2024-02-08 09:38:57 +05:30
김태민	5b78141fd3	[FIX BUG] add config_files parser #5114 (#5115 ) * add config_files parser #5114 * add config_files parser_fix #5114	2023-09-20 16:17:47 +02:00
Vladimir Mandic	ef29b24fda	allow loading of sd models from safetensors without online lookups using local config files (#5019 ) finish config_files implementation	2023-09-14 12:30:15 +02:00
Alexsey Shestacov	3eeaf4e041	Fix convert_original_stable_diffusion_to_diffusers script (#4817 ) Fix stable diffusion conversion script	2023-08-29 09:14:45 +02:00
AisingioroHao	1b739e7344	Fixed invalid pipeline_class_name parameter. (#4590 ) * Fixed invalid pipeline_class_name parameter. * Fix the format	2023-08-14 17:21:17 +05:30
YiYi Xu	aef11cbf66	add pipeline_class_name argument to Stable Diffusion conversion script (#4461 ) * add pipeline class * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * style --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-07 06:44:31 -10:00
Patrick von Platen	bc9a8cef6f	[SD-XL] Add new pipelines (#3859 ) * Add new text encoder * add transformers depth * More * Correct conversion script * Fix more * Fix more * Correct more * correct text encoder * Finish all * proof that in works in run local xl * clean up * Get refiner to work * Add red castle * Fix batch size * Improve pipelines more * Finish text2image tests * Add img2img test * Fix more * fix import * Fix embeddings for classic models (#3888) Fix embeddings for classic SD models. * Allow multiple prompts to be passed to the refiner (#3895) * finish more * Apply suggestions from code review * add watermarker * Model offload (#3889) * Model offload. * Model offload for refiner / img2img * Hardcode encoder offload on img2img vae encode Saves some GPU RAM in img2img / refiner tasks so it remains below 8 GB. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * correct * fix * clean print * Update install warning for `invisible-watermark` * add: missing docstrings. * fix and simplify the usage example in img2img. * fix setup for watermarking. * Revert "fix setup for watermarking." This reverts commit `491bc9f5a6`. * fix: watermarking setup. * fix: op. * run make fix-copies. * make sure tests pass * improve convert * make tests pass * make tests pass * better error message * fiinsh * finish * Fix final test --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-07-06 13:37:27 +02:00
Patrick von Platen	49609768b4	make style	2023-03-30 18:26:41 +02:00
Alon Burg	9062b2847d	Support fp16 in conversion from original ckpt (#2733 ) add --half to convert_original_stable_diffusion_to_diffusers.py	2023-03-30 17:26:18 +01:00
Patrick von Platen	d761b58bfc	[From pretrained] Speed-up loading from cache (#2515 ) * [From pretrained] Speed-up loading from cache * up * Fix more * fix one more bug * make style * bigger refactor * factor out function * Improve more * better * deprecate return cache folder * clean up * improve tests * up * upload * add nice tests * simplify * finish * correct * fix version * rename * Apply suggestions from code review Co-authored-by: Lucain <lucainp@gmail.com> * rename * correct doc string * correct more * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * apply code suggestions * finish --------- Co-authored-by: Lucain <lucainp@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-03-10 11:56:10 +01:00
Takuma Mori	8dfff7c015	Add a ControlNet model & pipeline (#2407 ) * add scaffold - copied convert_controlnet_to_diffusers.py from convert_original_stable_diffusion_to_diffusers.py * Add support to load ControlNet (WIP) - this makes Missking Key error on ControlNetModel * Update to convert ControlNet without error msg - init impl for StableDiffusionControlNetPipeline - init impl for ControlNetModel * cleanup of commented out * split create_controlnet_diffusers_config() from create_unet_diffusers_config() - add config: hint_channels * Add input_hint_block, input_zero_conv and middle_block_out - this makes missing key error on loading model * add unet_2d_blocks_controlnet.py - copied from unet_2d_blocks.py as impl CrossAttnDownBlock2D,DownBlock2D - this makes missing key error on loading model * Add loading for input_hint_block, zero_convs and middle_block_out - this makes no error message on model loading * Copy from UNet2DConditionalModel except __init__ * Add ultra primitive test for ControlNetModel inference * Support ControlNetModel inference - without exceptions * copy forward() from UNet2DConditionModel * Impl ControlledUNet2DConditionModel inference - test_controlled_unet_inference passed * Frozen weight & biases for training * Minimized version of ControlNet/ControlledUnet - test_modules_controllnet.py passed * make style * Add support model loading for minimized ver * Remove all previous version files * from_pretrained and inference test passed * copied from pipeline_stable_diffusion.py except `__init__()` * Impl pipeline, pixel match test (almost) passed. * make style * make fix-copies * Fix to add import ControlNet blocks for `make fix-copies` * Remove einops dependency * Support np.ndarray, PIL.Image for controlnet_hint * set default config file as lllyasviel's * Add support grayscale (hw) numpy array * Add and update docstrings * add control_net.mdx * add control_net.mdx to toctree * Update copyright year * Fix to add PIL.Image RGB->BGR conversion - thanks @Mystfit * make fix-copies * add basic fast test for controlnet * add slow test for controlnet/unet * Ignore down/up_block len check on ControlNet * add a copy from test_stable_diffusion.py * Accept controlnet_hint is None * merge pipeline_stable_diffusion.py diff * Update class name to SDControlNetPipeline * make style * Baseline fast test almost passed (w long desc) * still needs investigate. Following didn't passed descriped in TODO comment: - test_stable_diffusion_long_prompt - test_stable_diffusion_no_safety_checker Following didn't passed same as stable_diffusion_pipeline: - test_attention_slicing_forward_pass - test_inference_batch_single_identical - test_xformers_attention_forwardGenerator_pass these seems come from calc accuracy. * Add note comment related vae_scale_factor * add test_stable_diffusion_controlnet_ddim * add assertion for vae_scale_factor != 8 * slow test of pipeline almost passed Failed: test_stable_diffusion_pipeline_with_model_offloading - ImportError: `enable_model_offload` requires `accelerate v0.17.0` or higher but currently latest version == 0.16.0 * test_stable_diffusion_long_prompt passed * test_stable_diffusion_no_safety_checker passed - due to its model size, move to slow test * remove PoC test files * fix num_of_image, prompt length issue add add test * add support List[PIL.Image] for controlnet_hint * wip * all slow test passed * make style * update for slow test * RGB(PIL)->BGR(ctrlnet) conversion * fixes * remove manual num_images_per_prompt test * add document * add `image` argument docstring * make style * Add line to correct conversion * add controlnet_conditioning_scale (aka control_scales strength) * rgb channel ordering by default * image batching logic * Add control image descriptions for each checkpoint * Only save controlnet model in conversion script * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_controlnet.py typo Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/api/pipelines/stable_diffusion/control_net.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/api/pipelines/stable_diffusion/control_net.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/api/pipelines/stable_diffusion/control_net.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/api/pipelines/stable_diffusion/control_net.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/api/pipelines/stable_diffusion/control_net.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/api/pipelines/stable_diffusion/control_net.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/api/pipelines/stable_diffusion/control_net.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/api/pipelines/stable_diffusion/control_net.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/api/pipelines/stable_diffusion/control_net.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * add gerated image example * a depth mask -> a depth map * rename control_net.mdx to controlnet.mdx * fix toc title * add ControlNet abstruct and link * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_controlnet.py Co-authored-by: dqueue <dbyqin@gmail.com> * remove controlnet constructor arguments re: @patrickvonplaten * [integration tests] test canny * test_canny fixes * [integration tests] test_depth * [integration tests] test_hed * [integration tests] test_mlsd * add channel order config to controlnet * [integration tests] test normal * [integration tests] test_openpose test_scribble * change height and width to default to conditioning image * [integration tests] test seg * style * test_depth fix * [integration tests] size fixes * [integration tests] cpu offloading * style * generalize controlnet embedding * fix conversion script * Update docs/source/en/api/pipelines/stable_diffusion/controlnet.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/api/pipelines/stable_diffusion/controlnet.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/api/pipelines/stable_diffusion/controlnet.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/api/pipelines/stable_diffusion/controlnet.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Style adapted to the documentation of pix2pix * merge main by hand * style * [docs] controlling generation doc nits * correct some things * add: controlnetmodel to autodoc. * finish docs * finish * finish 2 * correct images * finish controlnet * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * uP * upload model * up * up --------- Co-authored-by: William Berman <WLBberman@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: dqueue <dbyqin@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-02 15:34:07 +01:00
Patrick von Platen	eadf0e2555	[Copyright] 2023 (#2524 )	2023-03-01 10:31:00 +01:00
Will Berman	62b3c9e06a	unCLIP variant (#2297 ) * pipeline_variant * Add docs for when clip_stats_path is specified * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip_img2img.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip_img2img.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * prepare_latents # Copied from re: @patrickvonplaten * NoiseAugmentor->ImageNormalizer * stable_unclip_prior default to None re: @patrickvonplaten * prepare_prior_extra_step_kwargs * prior denoising scale model input * {DDIM,DDPM}Scheduler -> KarrasDiffusionSchedulers re: @patrickvonplaten * docs * Update docs/source/en/api/pipelines/stable_unclip.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-02-14 11:28:57 -08:00
Will Berman	fd5c3c09af	misc fixes (#2282 ) Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-02-08 09:02:42 -08:00
Damian Stewart	3d2f24b099	Module-ise "original stable diffusion to diffusers" conversion script (#2019 ) * convert __main__ to a function call and call it * add missing type hint * make style check pass * move loading to src/diffusers Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-01-20 17:30:44 +01:00
Patrick von Platen	8a3f0c1f71	[Conversion] Improve safetensors (#1989 )	2023-01-16 14:26:56 +01:00
Katsuya	9147c4c954	Fix unused upcast_attn flag in convert_original_stable_diffusion_to_diffusers script (#1942 ) Fix unused upcast_attn flag in sd to diffusers script	2023-01-12 19:55:40 +01:00
Patrick von Platen	beb932c5d1	[Conversion SD] Make sure weirdly sorted keys work as well (#1959 )	2023-01-10 01:23:14 +01:00
Patrick von Platen	409387889d	[Conversion] Make sure ema weights are extracted correctly (#1937 ) * [Conversion] Make sure ema weights are extracted correctly * up * finish	2023-01-06 07:08:39 +01:00
Patrick von Platen	d67c305120	allow conversion from no state dict checkpoints	2023-01-03 19:48:13 +00:00
camenduru	1f1b6c6544	Device to use (e.g. cpu, cuda:0, cuda:1, etc.) (#1844 ) * Device to use (e.g. cpu, cuda:0, cuda:1, etc.) * "cuda" if torch.cuda.is_available() else "cpu"	2022-12-27 14:42:56 +01:00
Mikołaj Siedlarek	8890758823	Correct help text for scheduler_type flag in scripts. (#1749 )	2022-12-19 11:27:23 +01:00
Patrick von Platen	3ce6380d3a	[SD] Make sure scheduler is correct when converting (#1667 )	2022-12-12 16:57:48 +01:00
Cyberes	d2dc4de303	Handle missing global_step key in scripts/convert_original_stable_diffusion_to_diffusers.py (#1612 ) handle missing global_step key and don't download config if it already exists	2022-12-12 16:10:52 +01:00
lawfordp2017	31444f5790	Add text encoder conversion (#1559 ) * Initial code for attempt at improving SD <--> diffusers conversions for v2.0 * Updates to support round-trip between orig. SD 2.0 and diffusers models * Corrected formatting to Black standard * Correcting import formatting * Fixed imports (properly this time) * add some corrections * remove inference files Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-12-12 10:07:42 +01:00
Patrick von Platen	896c98a2ae	Add paint by example (#1533 ) * add paint by example * mkae loading possibel * up * Update src/diffusers/models/attention.py * up * finalize weight structure * make example work * make it work * up * up * fix * del * add * update * Apply suggestions from code review * correct transformer 2d * finish * up * up * up * up * fix * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Apply suggestions from code review * up * finish Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-12-07 11:06:30 +01:00
Patrick von Platen	922d56a19c	Correct type from int to str in conversion script sd	2022-12-05 18:51:29 +00:00
Patrick von Platen	f21415d1d9	Update conversion script to correctly handle SD 2 (#1511 ) * Conversion SD 2 * finish	2022-12-02 12:28:01 +01:00
Anton Lozhkov	e65b71aba4	Add an explicit `--image_size` to the conversion script (#1509 ) * Add an explicit `--image_size` to the conversion script * style	2022-12-01 19:22:48 +01:00
Patrick von Platen	9f10c545cb	Fix sample size conversion script (#1408 ) up	2022-11-25 11:26:27 +01:00
Patrick von Platen	0248541dea	[Conversion] Improve conversion script (#1218 ) up	2022-11-09 15:46:08 +01:00
Patrick von Platen	d9cfe325a5	CompVis -> diffusers script - allow converting from merged checkpoint to either EMA or non-EMA (#991 ) * improve script * up	2022-10-26 12:32:07 +02:00
Kane Wallmann	b9eea06e9f	Include CLIPTextModel parameters in conversion (#695 )	2022-10-05 12:22:07 +02:00
Suraj Patil	039958eae5	Stable diffusion text2img conversion script. (#154 ) * begin text2img conversion script * add fn to convert config * create config if not provided * update imports and use UNet2DConditionModel * fix imports, layer names * fix unet coversion * add function to convert VAE * fix vae conversion * update main * create text model * update config creating logic for unet * fix config creation * update script to create and save pipeline * remove unused imports * fix checkpoint loading * better name * save progress * finish * up * up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-09-16 00:07:32 +02:00

38 Commits