yiyi@huggingface.co
c91835c943
u pup
2026-02-05 09:06:00 +00:00
yiyi@huggingface.co
98b3a31259
does not need to specify the params:
2026-02-05 09:05:25 +00:00
yiyi@huggingface.co
4c1a5bcfeb
fix more
2026-02-05 08:40:52 +00:00
yiyi@huggingface.co
027394d392
up up
2026-02-04 19:48:56 +00:00
yiyi@huggingface.co
5c378a9415
text_encoder should not be auto for qwen-image
2026-02-04 19:48:11 +00:00
yiyi@huggingface.co
f34cc7b344
style
2026-02-04 11:31:16 +00:00
yiyi@huggingface.co
24c4b1c47d
add required param tests
2026-02-04 11:30:38 +00:00
yiyi@huggingface.co
13c922972e
more fix
2026-02-04 11:13:58 +00:00
yiyi@huggingface.co
f4d27b9a8a
style
2026-02-04 11:00:12 +00:00
yiyi@huggingface.co
1a2e736166
try to fix modular tests
2026-02-04 10:59:03 +00:00
yiyi@huggingface.co
c293ad7899
fix default_repo_id
2026-02-04 10:07:58 +00:00
YiYi Xu
2c7f5d7421
Merge branch 'main' into modular-test
2026-02-03 22:43:09 -10:00
Alan Ponnachan
430c557b6a
Add support for Magcache ( #12744 )
...
* add magcache
* formatting
* add magcache support with calibration mode
* add imports
* improvements
* Apply style fixes
* fix kandinsky errors
* add tests and documentation
* Apply style fixes
* improvements
* Apply style fixes
* make fix-copies.
* minor fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2026-02-04 13:45:12 +05:30
Sayak Paul
1b8fc6c589
[modular] change the template modular pipeline card ( #13072 )
...
* start better template for modular pipeline card.
* simplify structure.
* refine.
* style.
* up
* add tests
2026-02-04 10:09:10 +05:30
YiYi Xu
6d4fc6baa0
[Modular] mellon doc etc ( #13051 )
...
* add metadata field to input/output param
* refactor mellonparam: move the template outside, add metaclass, define some generic template for custom node
* add from_custom_block
* style
* up up fix
* add mellon guide
* add to toctree
* style
* add mellon_types
* style
* mellon_type -> inpnt_types + output_types
* update doc
* add quant info to components manager
* fix more
* up up
* fix components manager
* update custom block guide
* update
* style
* add a warn for mellon and add new guides to overview
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/modular_diffusers/mellon.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* more update on custom block guide
* Update docs/source/en/modular_diffusers/mellon.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* a few mamual
* apply suggestion: turn into bullets
* support define mellon meta with MellonParam directly, and update doc
* add the video
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal >
2026-02-03 13:38:57 -10:00
YiYi Xu
ebd06f9b11
[Modular] loader related ( #13025 )
...
* tag loader_id from Automodel
* style
* load_components by default only load components that are not already loaded
* by default, skip loading the componeneets does not have the repo id
2026-02-03 05:34:33 -10:00
songkey
b712042da1
[Flux2] Fix LoRA loading for Flux2 Klein by adaptively enumerating transformer blocks ( #13030 )
...
* Resolve Flux2 Klein 4B/9B LoRA loading errors
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2026-02-02 20:36:19 +05:30
Dhruv Nair
0b76728e27
Refactor Model Tests ( #12822 )
...
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2026-02-02 18:51:44 +05:30
DefTruth
973e334443
feat: support Ulysses Anything Attention ( #12996 )
...
* feat: support Ulysses Anything Attention
* feat: support Ulysses Anything Attention
* feat: support Ulysses Anything Attention
* feat: support Ulysses Anything Attention
* fix UAA broken while using joint attn
* update
* post check
* add docs
* add docs
* remove lru cache
* move codes
* update
2026-02-02 17:04:32 +05:30
YiYi Xu
769a1f3a12
[Modular]add a real quick start guide ( #13029 )
...
* add a real quick start guide
* Update docs/source/en/modular_diffusers/quickstart.md
* update a bit more
* fix
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/modular_diffusers/quickstart.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/modular_diffusers/quickstart.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* update more
* Apply suggestions from code review
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* address more feedbacks: move components amnager earlier, explain blocks vs sub-blocks etc
* more
* remove the link to mellon guide, not exist in this PR yet
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2026-01-31 09:43:20 -10:00
Mikko Lauri
ec6b2bcccb
Fix aiter availability check ( #13059 )
...
Update import_utils.py
2026-01-30 19:24:05 +05:30
Jared Wen
6a1904eb06
[bug fix] GLM-Image fit new get_image_features API ( #13052 )
...
change get_image_features API
Signed-off-by: JaredforReal <w13431838023@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2026-01-29 16:16:42 -10:00
Sayak Paul
f5b6b6625a
[wan] fix wan 2.2 when either of the transformers isn't present. ( #13055 )
...
fix wan 2.2 when either of the transformers isn't present.
2026-01-29 08:45:24 -10:00
Olexandr88
1be2f7e8c5
docs: fix grammar in fp16_safetensors CLI warning ( #13040 )
...
* docs: fix grammar in fp16_safetensors CLI warning
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2026-01-29 21:33:09 +05:30
Sayak Paul
314cfddf3a
[ci] uniform run times and wheels for pytorch cuda. ( #13047 )
...
* uniform run times and wheels for pytorch cuda.
* 12.9
* change to 24.04.
* change to 24.04.
2026-01-29 19:22:30 +05:30
Sayak Paul
e7de7d8449
[wan] fix layerwise upcasting tests on CPU ( #13039 )
...
up
2026-01-29 13:16:57 +05:30
Vinh H. Pham
a2ea45a5da
LTX2 distilled checkpoint support ( #12934 )
...
* add constants for distill sigmas values and allow ltx pipeline to pass in sigmas
* add time conditioning conversion and token packing for latents
* make style & quality
* remove prenorm
* add sigma param to ltx2 i2v
* fix copies and add pack latents to i2v
* Apply suggestions from code review
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com >
* Infer latent dims if latents/audio_latents is supplied
* add note for predefined sigmas
* run make style and quality
* revert distill timesteps & set original_state_dict_repo_idd to default None
* add latent normalize
* add create noised state, delete last sigmas
* remove normalize step in latent upsample pipeline and move it to ltx2 pipeline
* add create noise latent to i2v pipeline
* fix copies
* parse none value in weight conversion script
* explicit shape handling
* Apply suggestions from code review
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com >
* make style
* add two stage inference tests
* add ltx2 documentation
* update i2v expected_audio_slice
* Apply suggestions from code review
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com >
* Apply suggestion from @dg845
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com >
* Update ltx2.md to remove one-stage example
Removed one-stage generation example code and added comments for noise scale in two-stage generation.
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com >
Co-authored-by: Daniel Gu <dgu8957@gmail.com >
2026-01-28 19:35:43 -08:00
Jayce
a58d0b9bec
Fix Wan/WanI2V patchification ( #13038 )
...
* Fix Wan/WanI2V patchification
* Apply style fixes
* Apply suggestions from code review
I agree with you for the idea of using `patch_size` instead. Thanks!😊
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com >
* Fix logger warning
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com >
2026-01-28 18:06:58 -08:00
David El Malih
0ab2124958
docs: improve docstring scheduling_dpm_cogvideox.py ( #13044 )
2026-01-28 10:40:08 -08:00
Vasiliy Kuznetsov
74a0f0b694
remove torchao autoquant from diffusers docs ( #13048 )
...
Summary:
Context: https://github.com/pytorch/ao/issues/3739
Test Plan: CI, since this does not change any Python code
2026-01-28 21:10:22 +05:30
Sayak Paul
2c669e8480
change to CUDA 12.9. ( #13045 )
...
* change to CUDA 12.9.
* up
* change runtime base
* FROM
2026-01-28 17:22:27 +05:30
Ita Zaporozhets
2ac39ba664
fast tok update ( #13036 )
...
* v5 tok update
* ruff
* keep pre v5 slow code path
* Apply style fixes
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2026-01-28 17:13:04 +05:30
Sayak Paul
ef913010d4
[QwenImage] fix prompt isolation tests ( #13042 )
...
* up
* up
* up
* fix
2026-01-28 15:44:12 +05:30
YiYi Xu
53d8a1e310
[modular]support klein ( #13002 )
...
* support klein
* style
* copies
* Apply suggestions from code review
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
* Update src/diffusers/modular_pipelines/flux2/encoders.py
* a few fix: unpack latents before decoder etc
* style
* remove guidannce to its own block
* style
* flux2-dev work in modular setting
* up
* up up
* add tests
---------
Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
2026-01-27 15:43:14 -10:00
Kashif Rasul
d54669a73e
[Qwen] avoid creating attention masks when there is no padding ( #12987 )
...
* avoid creating attention masks when there is no padding
* make fix-copies
* torch compile tests
* set all ones mask to none
* fix positional encoding from becoming > 4096
* fix from review
* slice freqs_cis to match the input sequence length
* keep only attenton masking change
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2026-01-27 12:42:48 -10:00
Jared Wen
22ac6fae24
[GLM-Image] Add batch support for GlmImagePipeline ( #13007 )
...
* init
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* change from right padding to left padding
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* try i2i batch
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* fix: revert i2i prior_token_image_ids to original 1D tensor format
* refactor KVCache for per prompt batching
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* fix KVCache
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* fix shape error
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* refactor pipeline
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* fix for left padding
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* insert seed to AR model
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* delete generator, use torch manual_seed
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* add batch processing unit tests for GlmImagePipeline
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* simplify normalize images method
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* fix grids_per_sample
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* fix t2i
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* delete comments, simplify condition statement
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* chage generate_prior_tokens outputs
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* simplify if logic
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* support user provided prior_token_ids directly
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* remove blank lines
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* align with transformers
Signed-off-by: JaredforReal <w13431838023@gmail.com >
* Apply style fixes
---------
Signed-off-by: JaredforReal <w13431838023@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2026-01-27 12:22:02 -10:00
Aditya Borate
71a865b742
Fix: Cosmos2.5 Video2World frame extraction and add default negative prompt ( #13018 )
...
* fix: Extract last frames for conditioning in Cosmos Video2World
* Added default negative prompt
* Apply style fixes
* Added default negative prompt in cosmos2 text2image pipeline
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2026-01-27 12:20:44 -10:00
Sam Edwards
53279ef017
[From Single File] support from_single_file method for WanAnimateTransformer3DModel ( #12691 )
...
* Add `WanAnimateTransformer3DModel` to `SINGLE_FILE_LOADABLE_CLASSES`
* Fixed dtype mismatch when loading a single file
* Fixed a bug that results in white noise for generation
* Update dtype check for time embedder - caused white noise output
* Improve code readability
* Optimize dtype handling
Removed unnecessary dtype conversions for timestep and weight.
* Apply style fixes
* Refactor time embedding dtype handling
Adjust time embedding type conversion for compatibility.
* Apply style fixes
* Modify comment for WanTimeTextImageEmbedding class
---------
Co-authored-by: Sam Edwards <sam.edwards1976@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2026-01-27 11:57:21 +05:30
Salman Chishti
d9959bd53b
Upgrade GitHub Actions to latest versions ( #12866 )
...
* Upgrade GitHub Actions to latest versions
Signed-off-by: Salman Muin Kayser Chishti <13schishti@gmail.com >
* fix: Correct GitHub Actions upgrade (fix branch refs and version formats)
* fix: Correct GitHub Actions upgrade (fix branch refs and version formats)
* fix: Correct GitHub Actions upgrade (fix branch refs and version formats)
---------
Signed-off-by: Salman Muin Kayser Chishti <13schishti@gmail.com >
2026-01-27 11:52:50 +05:30
YiYi Xu
b1c77f67ac
[modular] add auto_docstring & more doc related refactors ( #12958 )
...
* up
* up up
* update outputs
* style
* add modular_auto_docstring!
* more auto docstring
* style
* up up up
* more more
* up
* address feedbacks
* add TODO in the description for empty docstring
* refactor based on dhruv's feedback: remove the class method
* add template method
* up
* up up up
* apply auto docstring
* make style
* rmove space in make docstring
* Apply suggestions from code review
* revert change in z
* fix
* Apply style fixes
* include auto-docstring check in the modular ci. (#13004 )
* Run ruff format after auto docstring generation
* up
* upup
* upup
* style
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2026-01-26 17:12:13 -10:00
David El Malih
956bdcc3ea
Flag Flax schedulers as deprecated ( #13031 )
...
flag flax schedulers as deprecated
2026-01-26 09:41:48 -08:00
Hameer Abbasi
2af7baa040
Remove *pooled_* mentions from Chroma inpaint ( #13026 )
...
Remove `*pooled_*` mentions from Chroma as it has just one TE.
2026-01-26 10:18:29 -03:00
David El Malih
a7cb14efbe
Improve docstrings and type hints in scheduling_ddpm_parallel.py ( #13027 )
...
* docs: improve docstring scheduling_ddpm_parallel.py
* Update scheduling_ddpm_parallel.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2026-01-25 10:43:43 -08:00
David El Malih
e8e88ff2ce
Improve docstrings and type hints in scheduling_ddpm_flax.py ( #13024 )
...
docs: improve docstring scheduling_ddpm_flax.py
2026-01-23 11:51:47 -08:00
David El Malih
6e24cd842c
Improve docstrings and type hints in scheduling_ddim_parallel.py ( #13023 )
...
* docs: improve docstring scheduling_ddim_parallel.py
* docs: improve docstring scheduling_ddim_parallel.py
* Update src/diffusers/schedulers/scheduling_ddim_parallel.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update src/diffusers/schedulers/scheduling_ddim_parallel.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update src/diffusers/schedulers/scheduling_ddim_parallel.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update src/diffusers/schedulers/scheduling_ddim_parallel.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* fix style
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2026-01-23 10:00:32 -08:00
Garry Ling
981eb802c6
feat: add qkv projection fuse for longcat transformers ( #13021 )
...
feat: add qkv fuse for longcat transformers
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2026-01-23 23:02:03 +05:30
jiqing-feng
1eb40c6dbd
Resnet only use contiguous in training mode. ( #12977 )
...
* fix contiguous
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
* update tol
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
* bigger tol
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
* fix tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
* update tol
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2026-01-23 18:40:10 +05:30
Sayak Paul
bff672f47f
fix Dockerfiles for cuda and xformers. ( #13022 )
2026-01-23 16:45:14 +05:30
David El Malih
d4f97d1921
Improve docstrings and type hints in scheduling_ddim_inverse.py ( #13020 )
...
docs: improve docstring scheduling_ddim_inverse.py
2026-01-22 15:42:45 -08:00
David El Malih
1d32b19ad4
Improve docstrings and type hints in scheduling_ddim_flax.py ( #13010 )
...
* docs: improve docstring scheduling_ddim_flax.py
* docs: improve docstring scheduling_ddim_flax.py
* docs: improve docstring scheduling_ddim_flax.py
2026-01-22 09:11:14 -08:00