diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-04-05 23:32:13 +08:00

Files

dg845 072d15ee42 Add Support for LTX-2.3 Models (#13217)

* Initial implementation of perturbed attn processor for LTX 2.3

* Update DiT block for LTX 2.3 + add self_attention_mask

* Add flag to control using perturbed attn processor for now

* Add support for new video upsampling blocks used by LTX-2.3

* Support LTX-2.3 Big-VGAN V2-style vocoder

* Initial implementation of LTX-2.3 vocoder with bandwidth extender

* Initial support for LTX-2.3 per-modality feature extractor

* Refactor so that text connectors own all text encoder hidden_states normalization logic

* Fix some bugs for inference

* Fix LTX-2.X DiT block forward pass

* Support prompt timestep embeds and prompt cross attn modulation

* Add LTX-2.3 configs to conversion script

* Support converting LTX-2.3 DiT checkpoints

* Support converting LTX-2.3 Video VAE checkpoints

* Support converting LTX-2.3 Vocoder with bandwidth extender

* Support converting LTX-2.3 text connectors

* Don't convert any upsamplers for now

* Support self attention mask for LTX2Pipeline

* Fix some inference bugs

* Support self attn mask and sigmas for LTX-2.3 I2V, Cond pipelines

* Support STG and modality isolation guidance for LTX-2.3

* make style and make quality

* Make audio guidance values default to video values by default

* Update to LTX-2.3 style guidance rescaling

* Support cross timesteps for LTX-2.3 cross attention modulation

* Fix RMS norm bug for LTX-2.3 text connectors

* Perform guidance rescale in sample (x0) space following original code

* Support LTX-2.3 Latent Spatial Upsampler model

* Support LTX-2.3 distilled LoRA

* Support LTX-2.3 Distilled checkpoint

* Support LTX-2.3 prompt enhancement

* Make LTX-2.X processor non-required so that tests pass

* Fix test_components_function tests for LTX2 T2V and I2V

* Fix LTX-2.3 Video VAE configuration bug causing pixel jitter

* Apply suggestions from code review

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Refactor LTX-2.X Video VAE upsampler block init logic

* Refactor LTX-2.X guidance rescaling to use rescale_noise_cfg

* Use generator initial seed to control prompt enhancement if available

* Remove self attention mask logic as it is not used in any current pipelines

* Commit fixes suggested by claude code (guidance in sample (x0) space, denormalize after timestep conditioning)

* Use constant shift following original code

---------

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

2026-03-19 14:58:29 -07:00

__init__.py

Fix conversion script

2022-07-15 17:00:41 +00:00

change_naming_configs_and_checkpoints.py

[chore] change licensing to 2025 from 2024. (#10615)

2025-01-20 16:57:27 -10:00

conversion_ldm_uncond.py

[OmegaConf] replace it with yaml (#6488)

2024-01-15 20:02:10 +05:30

convert_amused.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_animatediff_motion_lora_to_diffusers.py

[core] AnimateDiff SparseCtrl (#8897)

2024-07-26 17:46:05 +05:30

convert_animatediff_motion_module_to_diffusers.py

[Pipeline] AnimateDiff SDXL (#6721)

2024-05-08 21:27:14 +05:30

convert_animatediff_sparsectrl_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524)

2026-02-13 18:16:51 +05:30

convert_asymmetric_vqgan_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524)

2026-02-13 18:16:51 +05:30

convert_aura_flow_to_diffusers.py

[Core] Add AuraFlow (#8796)

2024-07-11 08:50:19 -10:00

convert_blipdiffusion_to_diffusers.py

Fix style (#10478)

2025-01-07 11:06:36 +05:30

convert_cogvideox_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524)

2026-02-13 18:16:51 +05:30

convert_cogview3_to_diffusers.py

[Fix] Syntax error (#10068)

2024-12-02 11:28:00 +05:30

convert_cogview4_to_diffusers_megatron.py

CogView4 Control Block (#10809)

2025-03-15 07:15:56 -10:00

convert_cogview4_to_diffusers.py

CogView4 Control Block (#10809)

2025-03-15 07:15:56 -10:00

convert_consistency_decoder.py

docs: cleanup of runway model (#12503)

2025-10-17 14:10:50 -07:00

convert_consistency_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_cosmos_to_diffusers.py

Cosmos Transfer2.5 Auto-Regressive Inference Pipeline (#13114)

2026-02-25 14:42:29 -10:00

convert_dance_diffusion_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_dcae_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524)

2026-02-13 18:16:51 +05:30

convert_ddpm_original_checkpoint_to_diffusers.py

Ruff: apply same rules as in transformers (#2827)

2023-03-27 16:18:57 +02:00

convert_diffusers_sdxl_lora_to_webui.py

changed positional parameters to named parameters like in docs (#6905)

2024-02-08 21:39:03 +05:30

convert_diffusers_to_original_sdxl.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_diffusers_to_original_stable_diffusion.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_dit_to_diffusers.py

Replace flake8 with ruff and update black (#2279)

2023-02-07 23:46:23 +01:00

convert_flux2_to_diffusers.py

Flux2 klein (#12982)

2026-01-15 09:10:54 -10:00

convert_flux_to_diffusers.py

Fix typos in docs and comments (#11416)

2025-04-30 20:30:53 -10:00

convert_flux_xlabs_ipadapter_to_diffusers.py

Support Flux IP Adapter (#10261)

2024-12-21 17:49:58 +00:00

convert_gligen_to_diffusers.py

Remove torch_dtype in to() to end deprecation (#6886)

2024-02-08 09:38:57 +05:30

convert_hunyuan_image_to_diffusers.py

HunyuanImage21 (#12333)

2025-10-23 22:31:12 -10:00

convert_hunyuan_video1_5_to_diffusers.py

[HunyuanVideo1.5] support step-distilled (#12802)

2025-12-07 21:50:36 -10:00

convert_hunyuan_video_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524)

2026-02-13 18:16:51 +05:30

convert_hunyuandit_controlnet_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_hunyuandit_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_i2vgen_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615)

2025-01-20 16:57:27 -10:00

convert_if.py

Update access of configuration attributes (#7343)

2024-03-18 08:53:29 -10:00

convert_k_upscaler_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_kakao_brain_unclip_to_diffusers.py

[Core] move transformer scripts to transformers modules (#6747)

2024-01-29 22:28:28 +05:30

convert_kandinsky3_unet.py

[@cene555][Kandinsky 3.0] Add Kandinsky 3.0 (#5913)

2023-11-24 17:46:00 +01:00

convert_kandinsky_to_diffusers.py

[Core] move transformer scripts to transformers modules (#6747)

2024-01-29 22:28:28 +05:30

convert_ldm_original_checkpoint_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615)

2025-01-20 16:57:27 -10:00

convert_lora_safetensor_to_diffusers.py

[LoRA test suite] refactor the test suite and cleanse it (#7316)

2024-03-20 17:13:52 +05:30

convert_ltx2_to_diffusers.py

Add Support for LTX-2.3 Models (#13217)

2026-03-19 14:58:29 -07:00

convert_ltx_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524)

2026-02-13 18:16:51 +05:30

convert_lumina_to_diffusers.py

Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline (#10827)

2025-03-13 09:24:21 -10:00

convert_mochi_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_models_diffuser_to_diffusers.py

Ruff: apply same rules as in transformers (#2827)

2023-03-27 16:18:57 +02:00

convert_ms_text_to_video_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615)

2025-01-20 16:57:27 -10:00

convert_music_spectrogram_to_diffusers.py

#7535 Update FloatTensor type hints to Tensor (#7883)

2024-05-10 09:53:31 -10:00

convert_ncsnpp_original_checkpoint_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615)

2025-01-20 16:57:27 -10:00

convert_omnigen_to_diffusers.py

Add OmniGen (#10148)

2025-02-12 02:16:38 +05:30

convert_original_audioldm2_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_original_audioldm_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_original_controlnet_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615)

2025-01-20 16:57:27 -10:00

convert_original_musicldm_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_original_stable_diffusion_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615)

2025-01-20 16:57:27 -10:00

convert_original_t2i_adapter.py

[chore] change licensing to 2025 from 2024. (#10615)

2025-01-20 16:57:27 -10:00

convert_ovis_image_to_diffusers.py

Add support for Ovis-Image (#12740)

2025-12-02 11:48:07 -10:00

convert_pixart_alpha_to_diffusers.py

Fix PixArt 256px inference (#6789)

2024-03-03 10:31:21 +05:30

convert_pixart_sigma_to_diffusers.py

PixArt-Sigma Implementation (#7654)

2024-04-23 22:33:08 -10:00

convert_prx_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524)

2026-02-13 18:16:51 +05:30

convert_rae_to_diffusers.py

feat: implement rae autoencoder. (#13046)

2026-03-05 20:17:14 +05:30

convert_sana_controlnet_to_diffusers.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616)

2025-11-10 14:47:18 +05:30

convert_sana_to_diffusers.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616)

2025-11-10 14:47:18 +05:30

convert_sana_video_to_diffusers.py

add ltx2 vae in sana-video; (#13229)

2026-03-17 18:09:52 -10:00

convert_sd3_controlnet_to_diffusers.py

Sd35 controlnet (#10020)

2024-11-27 10:44:48 -10:00

convert_sd3_to_diffusers.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616)

2025-11-10 14:47:18 +05:30

convert_shap_e_to_diffusers.py

Fix typos in docs and comments (#11416)

2025-04-30 20:30:53 -10:00

convert_skyreelsv2_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524)

2026-02-13 18:16:51 +05:30

convert_stable_audio.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616)

2025-11-10 14:47:18 +05:30

convert_stable_cascade_lite.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616)

2025-11-10 14:47:18 +05:30

convert_stable_cascade.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616)

2025-11-10 14:47:18 +05:30

convert_stable_diffusion_checkpoint_to_onnx.py

Update more licenses to 2025 (#11746)

2025-06-19 07:46:01 +05:30

convert_stable_diffusion_controlnet_to_onnx.py

Convert Stable Diffusion ControlNet to TensorRT (#4465)

2023-08-11 08:12:26 +05:30

convert_stable_diffusion_controlnet_to_tensorrt.py

Convert Stable Diffusion ControlNet to TensorRT (#4465)

2023-08-11 08:12:26 +05:30

convert_svd_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_tiny_autoencoder_to_diffusers.py

Remove code snippets containing is_safetensors_available() (#4521)

2023-08-11 11:05:22 +05:30

convert_unclip_txt2img_to_image_variation.py

Replace flake8 with ruff and update black (#2279)

2023-02-07 23:46:23 +01:00

convert_unidiffuser_to_diffusers.py

[WIP] Refactor UniDiffuser Pipeline and Tests (#4948)

2023-10-02 18:24:55 +02:00

convert_vae_diff_to_onnx.py

make style

2023-03-06 10:40:18 +00:00

convert_vae_pt_to_diffusers.py

[BUG] Fix convert_vae_pt_to_diffusers bug (#11078)

2025-04-10 06:59:45 +01:00

convert_versatile_diffusion_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615)

2025-01-20 16:57:27 -10:00

convert_vq_diffusion_to_diffusers.py

Update Ruff to latest Version (#10919)

2025-04-09 16:51:34 +05:30

convert_wan_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524)

2026-02-13 18:16:51 +05:30

convert_wuerstchen.py

Fix typos in docs and comments (#11416)

2025-04-30 20:30:53 -10:00

convert_zero123_to_diffusers.py

Remove dead code and fix f-string issue (#7720)

2024-05-08 13:15:28 -10:00

extract_lora_from_model.py

[chore] add a script to extract loras from full fine-tuned models (#10631)

2025-01-24 11:50:36 +05:30

generate_logits.py

Use model_info.id instead of model_info.modelId (#8912)

2024-07-20 20:01:21 +05:30