diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-02-14 23:05:20 +08:00

Author	SHA1	Message	Date
Aryan	f33b233789	[bug] remove unreachable norm_type=ada_norm_continuous from norm3 initialization conditions (#9006 ) remove ada_norm_continuous from norm3 list Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:15 +05:30
Sayak Paul	1616a6372e	[Core] add QKV fusion to AuraFlow and PixArt Sigma (#8952 ) * add fusion support to pixart * add to auraflow. * add tests * apply review feedback. * add back args and kwargs * style	2024-12-23 13:02:15 +05:30
Tolga Cangöz	51f45da25f	Update `CLIPFeatureExtractor` to `CLIPImageProcessor` and `DPTFeatureExtractor` to `DPTImageProcessor` (#9002 ) * fix: update `CLIPFeatureExtractor` to `CLIPImageProcessor` in codebase * `make style && make quality` * Update `DPTFeatureExtractor` to `DPTImageProcessor` in codebase * `make style` --------- Co-authored-by: Aryan <aryan@huggingface.co>	2024-12-23 13:02:15 +05:30
omahs	49544cc1fb	Fix typos (#9077 ) * fix typo	2024-12-23 13:02:15 +05:30
YiYi Xu	627fd46ab8	add sentencepiece as a soft dependency (#9065 ) * add sentencepiece as soft dependency for kolors * up --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:15 +05:30
Sayak Paul	8881fc9872	[Docs] add stable cascade unet doc. (#9066 ) * add stable cascade unet doc. * fix path	2024-12-23 13:02:15 +05:30
Aryan	9dbffc8c60	PAG variant for HunyuanDiT, PAG refactor (#8936 ) * copy hunyuandit pipeline * pag variant of hunyuan dit * add tests * update docs * make style * make fix-copies * Update src/diffusers/pipelines/pag/pag_utils.py * remove incorrect copied from * remove pag hunyuan attn procs to resolve conflicts * add pag attn procs again * new implementation for pag_utils * revert pag changes * add pag refactor back; update pixart sigma * update pixart pag tests * apply suggestions from review Co-Authored-By: yixu310@gmail.com * make style * update docs, fix tests * fix tests * fix test_components_function since list not accepted as valid __init__ param * apply patch to fix broken tests Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com> * make style * fix hunyuan tests --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:15 +05:30
Vinh H. Pham	fa55429b04	[Tests] Improve transformers model test suite coverage - Latte (#8919 ) * add LatteTransformer3DModel model test * change patch_size to 1 * reduce req len * reduce channel dims * increase num_layers * reduce dims further * run make style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-12-23 13:02:15 +05:30
Sayak Paul	499b7d6dde	[FLUX] support LoRA (#9057 ) * feat: lora support for Flux. add tests fix imports major fixes. * fix fixes final fixes? * fix * remove is_peft_available.	2024-12-23 13:02:15 +05:30
Aryan	44a4886771	[refactor] apply qk norm in attention processors (#9071 ) * apply qk norm in attention processors * revert attention processor * qk-norm in only attention proc 2.0 and fused variant	2024-12-23 13:02:15 +05:30
psychedelicious	01829c699a	type `get_attention_scores` as optional in `get_attention_scores` (#9075 ) `None` is valid for `get_attention_scores`, should be typed as such	2024-12-23 13:02:15 +05:30
asfiyab-nvidia	fce5debd8c	Update TensorRT txt2img and inpaint community pipelines (#9037 ) * Update TensorRT txt2img and inpaint community pipelines Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * update tensorrt install instructions Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:15 +05:30
Sayak Paul	0db81141b9	[Flux] minor documentation fixes for flux. (#9048 ) * minor documentation fixes for flux. * clipskip * add gist	2024-12-23 13:02:15 +05:30
Philip Rideout	9c02c40a13	Fix grammar mistake. (#9072 )	2024-12-23 13:02:15 +05:30
Aryan	6e5b374630	[refactor] create modeling blocks specific to AnimateDiff (#8979 ) * animatediff specific transformer model * make style * make fix-copies * move blocks to unet motion model * make style * remove dummy object * fix incorrectly passed param causing test failures * rename model and output class * fix sparsectrl imports * remove todo comments * remove temporal double self attn param from controlnet sparsectrl * add deprecated versions of blocks * apply suggestions from review * update --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:15 +05:30
Tolga Cangöz	c6ac793955	Errata: Fix typos & `\s+$` (#9008 ) * Fix typos * chore: Fix typos * chore: Update README.md for promptdiffusion example * Trim trailing white spaces * Fix a typo * update number * chore: update number * Trim trailing white space * Update README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update README.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:14 +05:30
Frank (Haofan) Wang	9769fae865	Update transformer_flux.py (#9060 )	2024-12-23 13:02:14 +05:30
Dhruv Nair	a615e524e5	Fix Nightly Deps (#9036 ) update	2024-12-23 13:02:14 +05:30
Sayak Paul	fdce85ccf1	[Flux] allow tests to run (#9050 ) * fix tests * fix * float64 skip * remove sample_size. * remove * remove more * default_sample_size. * credit black forest for flux model. * skip * fix: tests * remove OriginalModelMixin * add transformer model test * add: transformer model tests	2024-12-23 13:02:14 +05:30
Sayak Paul	c8a236ba5c	[Core] Add PAG support for PixArtSigma (#8921 ) * feat: add pixart sigma pag. * inits. * fixes * fix * remove print. * copy paste methods to the pixart pag mixin * fix-copies * add documentation. * add tests. * remove correction file. * remove pag_applied_layers * empty	2024-12-23 13:02:14 +05:30
Sayak Paul	7739beb740	Flux pipeline (#9043 ) add flux! Signed-off-by: Adrien <adrien@huggingface.co> Co-authored-by: Adrien <adrien.69740@gmail.com> Co-authored-by: Anatoly Belikov <abelikov@singularitynet.io> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-12-23 13:02:14 +05:30
Aryan	6f90bc1a63	[docs] fix pia example (#9015 ) fix pia example docstring	2024-12-23 13:02:14 +05:30
YiYi Xu	ceeaf1d469	fix load sharded checkpoint from a subfolder (local path) (#8913 ) fix Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
Dhruv Nair	232a967613	Updates deps for pipeline test fetcher (#9033 ) update	2024-12-23 13:02:14 +05:30
Aryan	e28e5373f9	PAG variant for AnimateDiff (#8789 ) * add animatediff pag pipeline * remove unnecessary print * make fix-copies * fix ip-adapter bug * update docs * add fast tests and fix bugs * update * update * address review comments * update ip adapter single test expected slice * implement test_from_pipe_consistent_config; fix expected slice values * LoraLoaderMixin->StableDiffusionLoraLoaderMixin; add latest freeinit test	2024-12-23 13:02:14 +05:30
Yoach Lacombe	8c154daddd	Fix Stable Audio repository id (#9016 ) Fix Stable Audio repo id	2024-12-23 13:02:14 +05:30
Aryan	cf513e4205	[core] Move community AnimateDiff ControlNet to core (#8972 ) * add animatediff controlnet to core * make style; remove unused method * fix copied from comment * add tests * changes to make tests work * add utility function to load videos * update docs * update pipeline example * make style * update docs with example * address review comments * add latest freeinit test from #8969 * LoraLoaderMixin -> StableDiffusionLoraLoaderMixin * fix docs * Update src/diffusers/utils/loading_utils.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * fix: variable out of scope --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Yoach Lacombe	030a134311	Stable Audio integration (#8716 ) * WIP modeling code and pipeline * add custom attention processor + custom activation + add to init * correct ProjectionModel forward * add stable audio to __initèè * add autoencoder and update pipeline and modeling code * add half Rope * add partial rotary v2 * add temporary modfis to scheduler * add EDM DPM Solver * remove TODOs * clean GLU * remove att.group_norm to attn processor * revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py * refactor GLU -> SwiGLU * remove redundant args * add channel multiples in autoencoder docstrings * changes in docsrtings and copyright headers * clean pipeline * further cleaning * remove peft and lora and fromoriginalmodel * Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace * make style * dummy models * fix copied from * add fast oobleck tests * add brownian tree * oobleck autoencoder slow tests * remove TODO * fast stable audio pipeline tests * add slow tests * make style * add first version of docs * wrap is_torchsde_available to the scheduler * fix slow test * test with input waveform * add input waveform * remove some todos * create stableaudio gaussian projection + make style * add pipeline to toctree * fix copied from * make quality * refactor timestep_features->time_proj * refactor joint_attention_kwargs->cross_attention_kwargs * remove forward_chunk * move StableAudioDitModel to transformers folder * correct convert + remove partial rotary embed * apply suggestions from yiyixuxu -> removing attn.kv_heads * remove temb * remove cross_attention_kwargs * further removal of cross_attention_kwargs * remove text encoder autocast to fp16 * continue removing autocast * make style * refactor how text and audio are embedded * add paper * update example code * make style * unify projection model forward + fix device placement * make style * remove fuse qkv * apply suggestions from review * Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * make style * smaller models in fast tests * pass sequential offloading fast tests * add docs for vae and autoencoder * make style and update example * remove useless import * add cosine scheduler * dummy classes * cosine scheduler docs * better description of scheduler --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	1a903d8019	[LoRA] fix: animate diff lora stuff. (#8995 ) * fix: animate diff lora stuff. * fix scaling function for UNetMotionModel * emoty	2024-12-23 13:02:14 +05:30
Anatoly Belikov	c7452308f5	handle lora scale and clip skip in lpw sd and sdxl community pipelines (#8988 ) * handle lora scale and clip skip in lpw sd and sdxl * use StableDiffusionLoraLoaderMixin * use StableDiffusionXLLoraLoaderMixin * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	3566f4b18a	[Docs] credit where it's due for Lumina and Latte. (#9000 ) credit where it's due for Lumina and Latte.	2024-12-23 13:02:14 +05:30
Adrien	f87ef1d061	[CI] Update runner configuration for setup and nightly tests (#9005 ) * [CI] Update runner configuration for setup and nightly tests Signed-off-by: Adrien <adrien@huggingface.co> * fix group Signed-off-by: Adrien <adrien@huggingface.co> * update for t4 Signed-off-by: Adrien <adrien@huggingface.co> --------- Signed-off-by: Adrien <adrien@huggingface.co>	2024-12-23 13:02:14 +05:30
Álvaro Somoza	edddf3d417	[Kolors] Add IP Adapter (#8901 ) * initial draft * apply suggestions * fix failing test * added ipa to img2img * add docs * apply suggestions	2024-12-23 13:02:14 +05:30
Aryan	a9de5cf59a	remove unused code from pag attn procs (#8928 )	2024-12-23 13:02:14 +05:30
Aryan	b7ddd2bb99	[core] AnimateDiff SparseCtrl (#8897 ) * initial sparse control model draft * remove unnecessary implementation * copy animatediff pipeline * remove deprecated callbacks * update * update pipeline implementation progress * make style * make fix-copies * update progress * add partially working pipeline * remove debug prints * add model docs * dummy objects * improve motion lora conversion script * fix bugs * update docstrings * remove unnecessary model params; docs * address review comment * add copied from to zero_module * copy animatediff test * add fast tests * update docs * update * update pipeline docs * fix expected slice values * fix license * remove get_down_block usage * remove temporal_double_self_attention from get_down_block * update * update docs with org and documentation images * make from_unet work in sparsecontrolnetmodel * add latest freeinit test from #8969 * make fix-copies * LoraLoaderMixin -> StableDiffsuionLoraLoaderMixin	2024-12-23 13:02:14 +05:30
Aryan	5e532e58cf	[fix] FreeInit step index out of bounds (#8969 ) * fix step index out of bounds * add test for free_init with different schedulers * add test to vid2vid and pia	2024-12-23 13:02:14 +05:30
Dhruv Nair	ca299f0430	[CI] Nightly Test Runner explicitly set runner for Setup Pipeline Matrix (#8986 ) * update * update * update	2024-12-23 13:02:14 +05:30
Dhruv Nair	e5113511cd	[CI] Fix parallelism in nightly tests (#8983 ) update	2024-12-23 13:02:14 +05:30
RandomGamingDev	13ef7e1b98	Added `accelerator` based gradient accumulation for basic_example (#8966 ) added accelerator based gradient accumulation for basic_example Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	6d11129c5a	[Chore] add `LoraLoaderMixin` to the inits (#8981 ) * introduce to promote reusability. * up * add more tests * up * remove comments. * fix fuse_nan test * clarify the scope of fuse_lora and unfuse_lora * remove space * rewrite fuse_lora a bit. * feedback * copy over load_lora_into_text_encoder. * address dhruv's feedback. * fix-copies * fix issubclass. * num_fused_loras * fix * fix * remove mapping * up * fix * style * fix-copies * change to SD3TransformerLoRALoadersMixin * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * up * handle wuerstchen * up * move lora to lora_pipeline.py * up * fix-copies * fix documentation. * comment set_adapters(). * fix-copies * fix set_adapters() at the model level. * fix? * fix * loraloadermixin. --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	999e8c4496	[Chore] remove all is from auraflow. (#8980 ) remove all is from auraflow.	2024-12-23 13:02:14 +05:30
efwfe	5473b3d475	fix guidance_scale value not equal to the value in comments (#8941 ) fix guidance_scale value not equal with the value in comments	2024-12-23 13:02:14 +05:30
YiYi Xu	a754d9071e	Revert "[LoRA] introduce LoraBaseMixin to promote reusability." (#8976 ) Revert "[LoRA] introduce LoraBaseMixin to promote reusability. (#8774)" This reverts commit `527430d0a4`.	2024-12-23 13:02:14 +05:30
mazharosama	7f74c09107	Enable CivitAI SDXL Inpainting Models Conversion (#8795 ) modify in_channels in network_config params Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
asfiyab-nvidia	d1e1676b9d	Update TensorRT img2img community pipeline (#8899 ) * Update TensorRT img2img pipeline Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * Update TensorRT version installed Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * make style and quality Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * Update examples/community/stable_diffusion_tensorrt_img2img.py Co-authored-by: Tolga Cangöz <46008593+tolgacangoz@users.noreply.github.com> * Update examples/community/README.md Co-authored-by: Tolga Cangöz <46008593+tolgacangoz@users.noreply.github.com> * Apply style and quality using ruff 0.1.5 Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: Tolga Cangöz <46008593+tolgacangoz@users.noreply.github.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	82b37a4cc3	[LoRA] introduce LoraBaseMixin to promote reusability. (#8774 ) * introduce to promote reusability. * up * add more tests * up * remove comments. * fix fuse_nan test * clarify the scope of fuse_lora and unfuse_lora * remove space * rewrite fuse_lora a bit. * feedback * copy over load_lora_into_text_encoder. * address dhruv's feedback. * fix-copies * fix issubclass. * num_fused_loras * fix * fix * remove mapping * up * fix * style * fix-copies * change to SD3TransformerLoRALoadersMixin * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * up * handle wuerstchen * up * move lora to lora_pipeline.py * up * fix-copies * fix documentation. * comment set_adapters(). * fix-copies * fix set_adapters() at the model level. * fix? * fix --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Aryan	41f79e9576	[tests] speed up animatediff tests (#8846 ) * speed up animatediff tests * fix pia test_ip_adapter_single * fix tests/pipelines/pia/test_pia.py::PIAPipelineFastTests::test_dict_tuple_outputs_equivalent * update * fix ip adapter tests * skip test_from_pipe_consistent_config tests * fix prompt_embeds test * update test_from_pipe_consistent_config tests * fix expected_slice values * remove temporal_norm_num_groups from UpBlockMotion --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Dhruv Nair	e42c333819	[CI] Slow Test Updates (#8870 ) * update * update * update	2024-12-23 13:02:14 +05:30
Sayak Paul	015019ab7d	[Tests] fix slices of 26 tests (first half) (#8959 ) * check for assertions. * update with correct slices. * okay * style * get it ready * update * update * update --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Sanchit Gandhi	bc70d92317	[AudioLDM2] Fix cache pos for GPT-2 generation (#8964 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30

... 9 10 11 12 13 ...

4913 Commits