diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-02-15 23:37:08 +08:00

Author	SHA1	Message	Date
Sayak Paul	c8a236ba5c	[Core] Add PAG support for PixArtSigma (#8921 ) * feat: add pixart sigma pag. * inits. * fixes * fix * remove print. * copy paste methods to the pixart pag mixin * fix-copies * add documentation. * add tests. * remove correction file. * remove pag_applied_layers * empty	2024-12-23 13:02:14 +05:30
Sayak Paul	7739beb740	Flux pipeline (#9043 ) add flux! Signed-off-by: Adrien <adrien@huggingface.co> Co-authored-by: Adrien <adrien.69740@gmail.com> Co-authored-by: Anatoly Belikov <abelikov@singularitynet.io> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-12-23 13:02:14 +05:30
Aryan	6f90bc1a63	[docs] fix pia example (#9015 ) fix pia example docstring	2024-12-23 13:02:14 +05:30
YiYi Xu	ceeaf1d469	fix load sharded checkpoint from a subfolder (local path) (#8913 ) fix Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
Dhruv Nair	232a967613	Updates deps for pipeline test fetcher (#9033 ) update	2024-12-23 13:02:14 +05:30
Aryan	e28e5373f9	PAG variant for AnimateDiff (#8789 ) * add animatediff pag pipeline * remove unnecessary print * make fix-copies * fix ip-adapter bug * update docs * add fast tests and fix bugs * update * update * address review comments * update ip adapter single test expected slice * implement test_from_pipe_consistent_config; fix expected slice values * LoraLoaderMixin->StableDiffusionLoraLoaderMixin; add latest freeinit test	2024-12-23 13:02:14 +05:30
Yoach Lacombe	8c154daddd	Fix Stable Audio repository id (#9016 ) Fix Stable Audio repo id	2024-12-23 13:02:14 +05:30
Aryan	cf513e4205	[core] Move community AnimateDiff ControlNet to core (#8972 ) * add animatediff controlnet to core * make style; remove unused method * fix copied from comment * add tests * changes to make tests work * add utility function to load videos * update docs * update pipeline example * make style * update docs with example * address review comments * add latest freeinit test from #8969 * LoraLoaderMixin -> StableDiffusionLoraLoaderMixin * fix docs * Update src/diffusers/utils/loading_utils.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * fix: variable out of scope --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Yoach Lacombe	030a134311	Stable Audio integration (#8716 ) * WIP modeling code and pipeline * add custom attention processor + custom activation + add to init * correct ProjectionModel forward * add stable audio to __initèè * add autoencoder and update pipeline and modeling code * add half Rope * add partial rotary v2 * add temporary modfis to scheduler * add EDM DPM Solver * remove TODOs * clean GLU * remove att.group_norm to attn processor * revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py * refactor GLU -> SwiGLU * remove redundant args * add channel multiples in autoencoder docstrings * changes in docsrtings and copyright headers * clean pipeline * further cleaning * remove peft and lora and fromoriginalmodel * Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace * make style * dummy models * fix copied from * add fast oobleck tests * add brownian tree * oobleck autoencoder slow tests * remove TODO * fast stable audio pipeline tests * add slow tests * make style * add first version of docs * wrap is_torchsde_available to the scheduler * fix slow test * test with input waveform * add input waveform * remove some todos * create stableaudio gaussian projection + make style * add pipeline to toctree * fix copied from * make quality * refactor timestep_features->time_proj * refactor joint_attention_kwargs->cross_attention_kwargs * remove forward_chunk * move StableAudioDitModel to transformers folder * correct convert + remove partial rotary embed * apply suggestions from yiyixuxu -> removing attn.kv_heads * remove temb * remove cross_attention_kwargs * further removal of cross_attention_kwargs * remove text encoder autocast to fp16 * continue removing autocast * make style * refactor how text and audio are embedded * add paper * update example code * make style * unify projection model forward + fix device placement * make style * remove fuse qkv * apply suggestions from review * Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * make style * smaller models in fast tests * pass sequential offloading fast tests * add docs for vae and autoencoder * make style and update example * remove useless import * add cosine scheduler * dummy classes * cosine scheduler docs * better description of scheduler --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	1a903d8019	[LoRA] fix: animate diff lora stuff. (#8995 ) * fix: animate diff lora stuff. * fix scaling function for UNetMotionModel * emoty	2024-12-23 13:02:14 +05:30
Anatoly Belikov	c7452308f5	handle lora scale and clip skip in lpw sd and sdxl community pipelines (#8988 ) * handle lora scale and clip skip in lpw sd and sdxl * use StableDiffusionLoraLoaderMixin * use StableDiffusionXLLoraLoaderMixin * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	3566f4b18a	[Docs] credit where it's due for Lumina and Latte. (#9000 ) credit where it's due for Lumina and Latte.	2024-12-23 13:02:14 +05:30
Adrien	f87ef1d061	[CI] Update runner configuration for setup and nightly tests (#9005 ) * [CI] Update runner configuration for setup and nightly tests Signed-off-by: Adrien <adrien@huggingface.co> * fix group Signed-off-by: Adrien <adrien@huggingface.co> * update for t4 Signed-off-by: Adrien <adrien@huggingface.co> --------- Signed-off-by: Adrien <adrien@huggingface.co>	2024-12-23 13:02:14 +05:30
Álvaro Somoza	edddf3d417	[Kolors] Add IP Adapter (#8901 ) * initial draft * apply suggestions * fix failing test * added ipa to img2img * add docs * apply suggestions	2024-12-23 13:02:14 +05:30
Aryan	a9de5cf59a	remove unused code from pag attn procs (#8928 )	2024-12-23 13:02:14 +05:30
Aryan	b7ddd2bb99	[core] AnimateDiff SparseCtrl (#8897 ) * initial sparse control model draft * remove unnecessary implementation * copy animatediff pipeline * remove deprecated callbacks * update * update pipeline implementation progress * make style * make fix-copies * update progress * add partially working pipeline * remove debug prints * add model docs * dummy objects * improve motion lora conversion script * fix bugs * update docstrings * remove unnecessary model params; docs * address review comment * add copied from to zero_module * copy animatediff test * add fast tests * update docs * update * update pipeline docs * fix expected slice values * fix license * remove get_down_block usage * remove temporal_double_self_attention from get_down_block * update * update docs with org and documentation images * make from_unet work in sparsecontrolnetmodel * add latest freeinit test from #8969 * make fix-copies * LoraLoaderMixin -> StableDiffsuionLoraLoaderMixin	2024-12-23 13:02:14 +05:30
Aryan	5e532e58cf	[fix] FreeInit step index out of bounds (#8969 ) * fix step index out of bounds * add test for free_init with different schedulers * add test to vid2vid and pia	2024-12-23 13:02:14 +05:30
Dhruv Nair	ca299f0430	[CI] Nightly Test Runner explicitly set runner for Setup Pipeline Matrix (#8986 ) * update * update * update	2024-12-23 13:02:14 +05:30
Dhruv Nair	e5113511cd	[CI] Fix parallelism in nightly tests (#8983 ) update	2024-12-23 13:02:14 +05:30
RandomGamingDev	13ef7e1b98	Added `accelerator` based gradient accumulation for basic_example (#8966 ) added accelerator based gradient accumulation for basic_example Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	6d11129c5a	[Chore] add `LoraLoaderMixin` to the inits (#8981 ) * introduce to promote reusability. * up * add more tests * up * remove comments. * fix fuse_nan test * clarify the scope of fuse_lora and unfuse_lora * remove space * rewrite fuse_lora a bit. * feedback * copy over load_lora_into_text_encoder. * address dhruv's feedback. * fix-copies * fix issubclass. * num_fused_loras * fix * fix * remove mapping * up * fix * style * fix-copies * change to SD3TransformerLoRALoadersMixin * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * up * handle wuerstchen * up * move lora to lora_pipeline.py * up * fix-copies * fix documentation. * comment set_adapters(). * fix-copies * fix set_adapters() at the model level. * fix? * fix * loraloadermixin. --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	999e8c4496	[Chore] remove all is from auraflow. (#8980 ) remove all is from auraflow.	2024-12-23 13:02:14 +05:30
efwfe	5473b3d475	fix guidance_scale value not equal to the value in comments (#8941 ) fix guidance_scale value not equal with the value in comments	2024-12-23 13:02:14 +05:30
YiYi Xu	a754d9071e	Revert "[LoRA] introduce LoraBaseMixin to promote reusability." (#8976 ) Revert "[LoRA] introduce LoraBaseMixin to promote reusability. (#8774)" This reverts commit `527430d0a4`.	2024-12-23 13:02:14 +05:30
mazharosama	7f74c09107	Enable CivitAI SDXL Inpainting Models Conversion (#8795 ) modify in_channels in network_config params Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
asfiyab-nvidia	d1e1676b9d	Update TensorRT img2img community pipeline (#8899 ) * Update TensorRT img2img pipeline Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * Update TensorRT version installed Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * make style and quality Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * Update examples/community/stable_diffusion_tensorrt_img2img.py Co-authored-by: Tolga Cangöz <46008593+tolgacangoz@users.noreply.github.com> * Update examples/community/README.md Co-authored-by: Tolga Cangöz <46008593+tolgacangoz@users.noreply.github.com> * Apply style and quality using ruff 0.1.5 Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: Tolga Cangöz <46008593+tolgacangoz@users.noreply.github.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	82b37a4cc3	[LoRA] introduce LoraBaseMixin to promote reusability. (#8774 ) * introduce to promote reusability. * up * add more tests * up * remove comments. * fix fuse_nan test * clarify the scope of fuse_lora and unfuse_lora * remove space * rewrite fuse_lora a bit. * feedback * copy over load_lora_into_text_encoder. * address dhruv's feedback. * fix-copies * fix issubclass. * num_fused_loras * fix * fix * remove mapping * up * fix * style * fix-copies * change to SD3TransformerLoRALoadersMixin * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * up * handle wuerstchen * up * move lora to lora_pipeline.py * up * fix-copies * fix documentation. * comment set_adapters(). * fix-copies * fix set_adapters() at the model level. * fix? * fix --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Aryan	41f79e9576	[tests] speed up animatediff tests (#8846 ) * speed up animatediff tests * fix pia test_ip_adapter_single * fix tests/pipelines/pia/test_pia.py::PIAPipelineFastTests::test_dict_tuple_outputs_equivalent * update * fix ip adapter tests * skip test_from_pipe_consistent_config tests * fix prompt_embeds test * update test_from_pipe_consistent_config tests * fix expected_slice values * remove temporal_norm_num_groups from UpBlockMotion --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Dhruv Nair	e42c333819	[CI] Slow Test Updates (#8870 ) * update * update * update	2024-12-23 13:02:14 +05:30
Sayak Paul	015019ab7d	[Tests] fix slices of 26 tests (first half) (#8959 ) * check for assertions. * update with correct slices. * okay * style * get it ready * update * update * update --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:14 +05:30
Sanchit Gandhi	bc70d92317	[AudioLDM2] Fix cache pos for GPT-2 generation (#8964 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
RandomGamingDev	7f10257852	Added Code for Gradient Accumulation to work for basic_training (#8961 ) added line allowing gradient accumulation to work for basic_training example	2024-12-23 13:02:14 +05:30
Sayak Paul	48ad9a9e93	[AuraFlow] fix long prompt handling (#8937 ) fix	2024-12-23 13:02:14 +05:30
Dhruv Nair	d9a9cf4c49	[CI] Skip flaky download tests in PR CI (#8945 ) update	2024-12-23 13:02:14 +05:30
Sayak Paul	edc20c3199	remove residual i from auraflow. (#8949 ) * remove residual i. * rename to aura_flow in pipeline test	2024-12-23 13:02:14 +05:30
Sayak Paul	c4c822b14b	[Core] fix QKV fusion for attention (#8829 ) * start debugging the problem, * start * fix * fix * fix imports. * handle hunyuan * remove residuals. * add a check for making sure there's appropriate procs. * add more rigor to the tests. * fix test * remove redundant check * fix-copies * move check_qkv_fusion_matches_attn_procs_length and check_qkv_fusion_processors_exist.	2024-12-23 13:02:14 +05:30
Dhruv Nair	df4e3f45c1	Fix name when saving text inversion embeddings in dreambooth advanced scripts (#8927 ) update	2024-12-23 13:02:14 +05:30
Tolga Cangöz	5af8e68d97	Fix Colab and Notebook checks for `diffusers-cli env` (#8408 ) * chore: Update is_google_colab check to use environment variable * Check Colab with all possible COLAB_* env variables * Remove unnecessary word * Make `_is_google_colab` more inclusive * Revert "Make `_is_google_colab` more inclusive" This reverts commit `6406db21ac`. * Make `_is_google_colab` more inclusive. * chore: Update import_utils.py with notebook check improvement * Refactor import_utils.py to improve notebook detection for VS Code's notebook * chore: Remove `is_notebook()` function and related code --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
Vinh H. Pham	0a6b9da6bb	[Tests] Improve transformers model test suite coverage - Temporal Transformer (#8932 ) * add test for temporal transformer * remove unused variable * fix code quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
akbaig	2d738e2c71	fix: checkpoint save issue in advanced dreambooth lora sdxl script (#8926 ) Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-12-23 13:02:14 +05:30
Aritra Roy Gosthipaty	1c550bf64d	[Tests] reduce the model size in the audioldm2 fast test (#7846 ) * chore: initial model size reduction * chore: fixing expected values for failing tests * requested edits --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
Dhruv Nair	3cfd02a4c8	Update pipeline test fetcher (#8931 ) update	2024-12-23 13:02:14 +05:30
Sayak Paul	a151876058	[Benchmarking] check if runner helps to restore benchmarking (#8929 ) * check if runner helps. * remove caching * gpus * update runner group	2024-12-23 13:02:14 +05:30
Vishnu V Jaddipal	ff7925a4de	Add attentionless VAE support (#8769 ) * Add attentionless VAE support * make style and quality, fix-copies --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:14 +05:30
Sayak Paul	94bc157cef	[Tests] proper skipping of request caching test (#8908 ) proper skipping of request caching test	2024-12-23 13:02:14 +05:30
Jiwook Han	eed4531f9c	Reflect few contributions on `ethical_guidelines.md` that were not reflected on #8294 (#8914 ) fix_ethical_guidelines.md	2024-12-23 13:02:14 +05:30
Sayak Paul	8e76f5b6b5	[Docs] small fixes to pag guide. (#8920 ) small fixes to pag guide.	2024-12-23 13:02:14 +05:30
Seongsu Park	0e59db02d9	🌐 [i18n-KO] Translated docs to Korean (added 7 docs and etc) (#8804 ) * remove unused docs * add ko-18n docs * docs typo, edit etc * reorder list, add `in translation` in toctree * fix minor translation * fix docs minor tone, etc	2024-12-23 13:02:14 +05:30
Sayak Paul	1b5d74a9e3	[Training] SD3 training fixes (#8917 ) * SD3 training fixes Co-authored-by: bghira <59658056+bghira@users.noreply.github.com> * rewrite noise addition part to respect the eqn. * styler * Update examples/dreambooth/README_sd3.md Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> --------- Co-authored-by: bghira <59658056+bghira@users.noreply.github.com> Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>	2024-12-23 13:02:14 +05:30
Lucain	26ade526bc	Use model_info.id instead of model_info.modelId (#8912 ) Mention model_info.id instead of model_info.modelId	2024-12-23 13:02:14 +05:30

4394 Commits