diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2025-12-06 12:34:13 +08:00

Author	SHA1	Message	Date
Sayak Paul	a519272d97	[ci] revisit the installations in CI. (#12450 ) * revisit the installations in CI. * up * up * up * empty * up * up * up	2025-10-08 19:21:24 +05:30
Lucain	ec5449f3a1	Support both huggingface_hub `v0.x` and `v1.x` (#12389 ) * Support huggingface_hub 0.x and 1.x * httpx	2025-09-25 18:28:54 +02:00
Ishan Modi	4acbfbf13b	[Quantization] Add TRT-ModelOpt as a Backend (#11173 ) * initial commit * update * updates * update * update * update * update * update * update * addressed PR comments * update * addressed PR comments * update * update * update * update * update * update * updates * update * update * addressed PR comments * updates * code formatting * update * addressed PR comments * addressed PR comments * addressed PR comments * addressed PR comments * fix docs and dependencies * fixed dependency test --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-09-03 10:14:52 +05:30
Sayak Paul	7a2b78bf0f	post release v0.35.0 (#12184 ) * post release v0.35.0 * quality	2025-08-19 22:10:08 +05:30
Sayak Paul	a8e47978c6	[lora] adapt new LoRA config injection method (#11999 ) * use state dict when setting up LoRA. * up * up * up * comment * up * up	2025-08-08 09:22:48 +05:30
Álvaro Somoza	edcbe8038b	Fix huggingface-hub failing tests (#11994 ) * login * more logins * uploads * missed login * another missed login * downloads * examples and more logins * fix * setup * Apply style fixes * fix * Apply style fixes	2025-07-29 02:34:58 -04:00
Sayak Paul	86becea77f	Pin k-diffusion for CI (#11894 ) * remove k-diffusion as we don't use it from the core. * Revert "remove k-diffusion as we don't use it from the core." This reverts commit `8bc86925a0`. * pin k-diffusion	2025-07-09 12:17:45 +05:30
Sayak Paul	10c36e0b78	[chore] post release v0.34.0 (#11800 ) * post release v0.34.0 * code quality --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-06-26 06:56:46 +05:30
Aryan	a4df8dbc40	Update more licenses to 2025 (#11746 ) update	2025-06-19 07:46:01 +05:30
Sayak Paul	53f1043cbb	Update setup.py to pin min version of `peft` (#11502 )	2025-05-06 10:23:16 +05:30
Sayak Paul	4b868f14c1	post release 0.33.0 (#11255 ) * post release * update * fix deprecations * remaining * update --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-04-15 06:50:08 -10:00
Sayak Paul	5b27f8aba8	fix consisid imports (#11254 ) * fix consisid imports * fix opencv import * fix	2025-04-09 18:49:32 +05:30
Dhruv Nair	edc154da09	Update Ruff to latest Version (#10919 ) * update * update * update * update	2025-04-09 16:51:34 +05:30
Dhruv Nair	f5edaa7894	[Quantization] Add Quanto backend (#10756 ) * update * updaet * update * update * update * update * update * update * update * update * update * update * Update docs/source/en/quantization/quanto.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * Update src/diffusers/quantizers/quanto/utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-10 08:33:05 +05:30
Sayak Paul	3579cd2bb7	[chore] update notes generation spaces (#10592 ) fix	2025-02-17 09:26:15 +05:30
Yuxuan Zhang	d90cd3621d	CogView4 (supports different length c and uc) (#10649 ) * init * encode with glm * draft schedule * feat(scheduler): Add CogView scheduler implementation * feat(embeddings): add CogView 2D rotary positional embedding * 1 * Update pipeline_cogview4.py * fix the timestep init and sigma * update latent * draft patch(not work) * fix * [WIP][cogview4]: implement initial CogView4 pipeline Implement the basic CogView4 pipeline structure with the following changes: - Add CogView4 pipeline implementation - Implement DDIM scheduler for CogView4 - Add CogView3Plus transformer architecture - Update embedding models Current limitations: - CFG implementation uses padding for sequence length alignment - Need to verify transformer inference alignment with Megatron TODO: - Consider separate forward passes for condition/uncondition instead of padding approach * [WIP][cogview4][refactor]: Split condition/uncondition forward pass in CogView4 pipeline Split the forward pass for conditional and unconditional predictions in the CogView4 pipeline to match the original implementation. The noise prediction is now done separately for each case before combining them for guidance. However, the results still need improvement. This is a work in progress as the generated images are not yet matching expected quality. * use with -2 hidden state * remove text_projector * 1 * [WIP] Add tensor-reload to align input from transformer block * [WIP] for older glm * use with cogview4 transformers forward twice of u and uc * Update convert_cogview4_to_diffusers.py * remove this * use main example * change back * reset * setback * back * back 4 * Fix qkv conversion logic for CogView4 to Diffusers format * back5 * revert to sat to cogview4 version * update a new convert from megatron * [WIP][cogview4]: implement CogView4 attention processor Add CogView4AttnProcessor class for implementing scaled dot-product attention with rotary embeddings for the CogVideoX model. This processor concatenates encoder and hidden states, applies QKV projections and RoPE, but does not include spatial normalization. TODO: - Fix incorrect QKV projection weights - Resolve ~25% error in RoPE implementation compared to Megatron * [cogview4] implement CogView4 transformer block Implement CogView4 transformer block following the Megatron architecture: - Add multi-modulate and multi-gate mechanisms for adaptive layer normalization - Implement dual-stream attention with encoder-decoder structure - Add feed-forward network with GELU activation - Support rotary position embeddings for image tokens The implementation follows the original CogView4 architecture while adapting it to work within the diffusers framework. * with new attn * [bugfix] fix dimension mismatch in CogView4 attention * [cogview4][WIP]: update final normalization in CogView4 transformer Refactored the final normalization layer in CogView4 transformer to use separate layernorm and AdaLN operations instead of combined AdaLayerNormContinuous. This matches the original implementation but needs validation. Needs verification against reference implementation. * 1 * put back * Update transformer_cogview4.py * change time_shift * Update pipeline_cogview4.py * change timesteps * fix * change text_encoder_id * [cogview4][rope] align RoPE implementation with Megatron - Implement apply_rope method in attention processor to match Megatron's implementation - Update position embeddings to ensure compatibility with Megatron-style rotary embeddings - Ensure consistent rotary position encoding across attention layers This change improves compatibility with Megatron-based models and provides better alignment with the original implementation's positional encoding approach. * [cogview4][bugfix] apply silu activation to time embeddings in CogView4 Applied silu activation to time embeddings before splitting into conditional and unconditional parts in CogView4Transformer2DModel. This matches the original implementation and helps ensure correct time conditioning behavior. * [cogview4][chore] clean up pipeline code - Remove commented out code and debug statements - Remove unused retrieve_timesteps function - Clean up code formatting and documentation This commit focuses on code cleanup in the CogView4 pipeline implementation, removing unnecessary commented code and improving readability without changing functionality. * [cogview4][scheduler] Implement CogView4 scheduler and pipeline * now It work * add timestep * batch * change convert scipt * refactor pt. 1; make style * refactor pt. 2 * refactor pt. 3 * add tests * make fix-copies * update toctree.yml * use flow match scheduler instead of custom * remove scheduling_cogview.py * add tiktoken to test dependencies * Update src/diffusers/models/embeddings.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * apply suggestions from review * use diffusers apply_rotary_emb * update flow match scheduler to accept timesteps * fix comment * apply review sugestions * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: 三洋三洋 <1258009915@qq.com> Co-authored-by: OleehyO <leehy0357@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-02-15 21:46:48 +05:30
Marc Sun	fbff43acc9	[FEAT] DDUF format (#10037 ) * load and save dduf archive * style * switch to zip uncompressed * updates * Update src/diffusers/pipelines/pipeline_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update src/diffusers/pipelines/pipeline_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * first draft * remove print * switch to dduf_file for consistency * switch to huggingface hub api * fix log * add a basic test * Update src/diffusers/configuration_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update src/diffusers/pipelines/pipeline_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update src/diffusers/pipelines/pipeline_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * fix * fix variant * change saving logic * DDUF - Load transformers components manually (#10171) * update hfh version * Load transformers components manually * load encoder from_pretrained with state_dict * working version with transformers and tokenizer ! * add generation_config case * fix tests * remove saving for now * typing * need next version from transformers * Update src/diffusers/configuration_utils.py Co-authored-by: Lucain <lucain@huggingface.co> * check path corectly * Apply suggestions from code review Co-authored-by: Lucain <lucain@huggingface.co> * udapte * typing * remove check for subfolder * quality * revert setup changes * oups * more readable condition * add loading from the hub test * add basic docs. * Apply suggestions from code review Co-authored-by: Lucain <lucain@huggingface.co> * add example * add * make functions private * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * minor. * fixes * fix * change the precdence of parameterized. * error out when custom pipeline is passed with dduf_file. * updates * fix * updates * fixes * updates * fix xfail condition. * fix xfail * fixes * sharded checkpoint compat * add test for sharded checkpoint * add suggestions * Update src/diffusers/models/model_loading_utils.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * from suggestions * add class attributes to flag dduf tests * last one * fix logic * remove comment * revert changes --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Lucain <lucain@huggingface.co> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-01-14 13:21:42 +05:30
hlky	c3478a42b9	Fix Nightly AudioLDM2PipelineFastTests (#10556 ) * Fix Nightly AudioLDM2PipelineFastTests * add phonemizer to setup extras test * fix * make style	2025-01-13 13:54:06 +00:00
Sayak Paul	92933ec36a	[chore] post release 0.32.0 (#10361 ) * post release 0.32.0 * stylew	2024-12-23 10:03:34 -10:00
Dhruv Nair	ea40933f36	[CI] Unpin torch<2.5 in CI (#9961 ) * update * update	2024-11-19 18:50:46 +05:30
Sayak Paul	e45c25d03a	post-release 0.31.0 (#9742 ) * post-release * style	2024-10-22 20:42:30 +05:30
Aryan	56d6d21bae	[CI] pin max torch version to fix CI errors (#9709 ) * pin max torch version * update * Update setup.py	2024-10-20 01:50:56 +05:30
Álvaro Somoza	82058a5413	post release 0.30.0 (#9173 ) * post release * fix quality	2024-08-14 12:55:55 +05:30
Sayak Paul	130dd936bb	pin accelerate to 0.31.0 (#8563 ) * pin accelerate to 0.31.0 * update dep table * empty	2024-06-16 08:37:00 -10:00
Sayak Paul	f96e4a16ad	pin transformers to the latest (#8522 ) thanks!	2024-06-13 07:39:24 -10:00
Sayak Paul	2e4841ef1e	post release 0.29.0 (#8492 ) post release	2024-06-13 06:14:20 -10:00
Sayak Paul	7d887118b9	[Core] support saving and loading of sharded checkpoints (#7830 ) * feat: support saving a model in sharded checkpoints. * feat: make loading of sharded checkpoints work. * add tests * cleanse the loading logic a bit more. * more resilience while loading from the Hub. * parallelize shard downloads by using snapshot_download()/ * default to a shard size. * more fix * Empty-Commit * debug * fix * uality * more debugging * fix more * initial comments from Benjamin * move certain methods to loading_utils * add test to check if the correct number of shards are present. * add a test to check if loading of sharded checkpoints from the Hub is okay * clarify the unit when passed as an int. * use hf_hub for sharding. * remove unnecessary code * remove unnecessary function * lucain's comments. * fixes * address high-level comments. * fix test * subfolder shenanigans./ * Update src/diffusers/utils/hub_utils.py Co-authored-by: Lucain <lucainp@gmail.com> * Apply suggestions from code review Co-authored-by: Lucain <lucainp@gmail.com> * remove _huggingface_hub_version as not needed. * address more feedback. * add a test for local_files_only=True/ * need hf hub to be at least 0.23.2 * style * final comment. * clean up subfolder. * deal with suffixes in code. * _add_variant default. * use weights_name_pattern * remove add_suffix_keyword * clean up downloading of sharded ckpts. * don't return something special when using index.json * fix more * don't use bare except * remove comments and catch the errors better * fix a couple of things when using is_file() * empty --------- Co-authored-by: Lucain <lucainp@gmail.com>	2024-06-07 14:49:10 +05:30
Sayak Paul	581d8aacf7	post release v0.28.0 (#8286 ) * post release v0.28.0 * style	2024-05-29 07:13:22 +05:30
Sayak Paul	aa676c641f	change to yiyi's address. (#7981 ) * change to yiyi's address. * update to diffusers@huggingface.co	2024-05-28 08:28:55 -10:00
YiYi Xu	e5674015f3	adding back test_conversion_when_using_device_map (#7704 ) * style * Fix device map nits (#7705) --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-18 19:21:32 -10:00
Sayak Paul	4a34307702	add: utility to format our docs too 📜 (#7314 ) * add: utility to format our docs too 📜 * debugging saga * fix: message * checking * should be fixed. * revert pipeline_fixture * remove empty line * make style * fix: setup.py * style.	2024-04-02 20:49:43 +05:30
Sayak Paul	5d83f50c23	[Release tests] make nightly workflow dispatchable. (#7541 ) * make nightly workflow dispatchable. * add a note about running the release tests to setup.py	2024-04-02 12:21:17 +05:30
Sayak Paul	ab38ddf64f	[chore] make the istructions on fetching all commits clearer. (#7474 ) * make the istructions on fetching all commits clearer. * Update setup.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-03-27 08:16:46 +05:30
Sayak Paul	3c67864c5a	Remove `distutils` (#7455 ) * strtobool * replace Command from setuptools.	2024-03-25 06:44:53 +05:30
Sayak Paul	76de6a09fb	post-release v0.27.0 (#7329 ) * post-release * quality	2024-03-18 10:52:20 +05:30
Dhruv Nair	215e6804d3	Unpin torch versions in CI (#6945 ) * update * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-02-12 16:01:05 +05:30
Sayak Paul	7c8cab313e	post release 0.26.2 (#6885 ) * post release * style * Empty-Commit --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2024-02-09 07:36:38 -10:00
Sayak Paul	30e5e81d58	change to 2024 in the license (#6902 ) change to 2024	2024-02-08 08:19:31 -10:00
Dhruv Nair	f4d3f913f4	Pin torch < 2.2.0 in test runners (#6780 ) * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-31 13:41:18 +05:30
Sayak Paul	cb4b3f0b78	[OmegaConf] replace it with `yaml` (#6488 ) * remove omegaconf from convert_from_ckpt. * remove from single_file. * change to string based ubscription. * style * okay * fix: vae_param * no . indexing. * style * style * turn getattrs into explicit if/else * style * propagate changes to ldm_uncond. * propagate to gligen * propagate to if. * fix: quotes. * propagate to audioldm. * propagate to audioldm2 * propagate to musicldm. * propagate to vq_diffusion * propagate to zero123. * remove omegaconf from diffusers codebase.	2024-01-15 20:02:10 +05:30
Lucain	9a9daee724	Fix offline mode import (#6467 )	2024-01-05 15:34:40 +01:00
Sayak Paul	9d945b2b90	0.25.0 post release (#6358 ) * post release * style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2024-01-05 16:13:27 +05:30
Dhruv Nair	93ea26f272	Add PEFT to training deps (#6148 ) add peft to training deps Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-15 08:39:59 +05:30
Dhruv Nair	bbd3572044	Pin Ruff Version (#6059 ) pinn ruff	2023-12-05 17:51:37 +05:30
Dhruv Nair	b21729225a	Update Tests Fetcher (#5950 ) * update setup and deps table * update * update * update * up * up * update * up * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * quality fix * fix failure reporting --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-12-04 12:59:41 +05:30
Patrick von Platen	dadd55fb36	Post Release: v0.24.0 (#5985 ) * Post Release: v0.24.0 * post pone deprecation * post pone deprecation * Add model_index.json	2023-12-01 18:43:44 +01:00
Sayak Paul	af378c1dd1	[Easy] minor edits to setup.py (#5996 ) minor edits to setup	2023-12-01 20:38:46 +05:30
Kashif Rasul	6b04d61cf6	[Styling] stylify using ruff (#5841 ) * ruff format * not need to use doc-builder's black styling as the doc is styled in ruff * make fix-copies * comment * use run_ruff	2023-11-20 11:48:34 +01:00
Patrick von Platen	c6f90daea6	[PEFT] Unpin peft (#5850 )	2023-11-17 19:15:02 +01:00
Lucain	c896b841e4	Set `usedforsecurity=False` in hashlib methods (FIPS compliance) (#5790 ) * Set usedforsecurity=False in hashlib methods (FIPS compliance) * update version dependency * bump hfh version * bump hfh version	2023-11-17 14:56:58 +01:00

1 2 3 4

173 Commits