DefTruth
973e334443
feat: support Ulysses Anything Attention ( #12996 )
...
* feat: support Ulysses Anything Attention
* feat: support Ulysses Anything Attention
* feat: support Ulysses Anything Attention
* feat: support Ulysses Anything Attention
* fix UAA broken while using joint attn
* update
* post check
* add docs
* add docs
* remove lru cache
* move codes
* update
2026-02-02 17:04:32 +05:30
Sayak Paul
3c70440d26
Update distributed_inference.md to reposition sections ( #12971 )
2026-01-13 11:07:39 -08:00
Bissmella Bahaduri
9d68742214
Add Unified Sequence Parallel attention ( #12693 )
...
* initial scheme of unified-sp
* initial all_to_all_double
* bug fixes, added cmnts
* unified attention prototype done
* remove raising value error in contextParallelConfig to enable unified attention
* bug fix
* feat: Adds Test for Unified SP Attention and Fixes a bug in Template Ring Attention
* bug fix, lse calculation, testing
bug fixes, lse calculation
-
switched to _all_to_all_single helper in _all_to_all_dim_exchange due contiguity issues
bug fix
bug fix
bug fix
* addressing comments
* sequence parallelsim bug fixes
* code format fixes
* Apply style fixes
* code formatting fix
* added unified attention docs and removed test file
* Apply style fixes
* tip for unified attention in docs at distributed_inference.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* Update distributed_inference.md, adding benchmarks
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* Update docs/source/en/training/distributed_inference.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* function name fix
* fixed benchmark in docs
---------
Co-authored-by: KarthikSundar2002 <karthiksundar30092002@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2026-01-13 09:16:51 +05:30
Sayak Paul
7c6d314549
fix the use of device_map in CP docs ( #12902 )
...
up
2026-01-05 19:42:32 -10:00
Sayak Paul
0eac64c7a6
Update distributed_inference.md to correct syntax ( #12827 )
2025-12-11 08:46:43 -08:00
Sayak Paul
6708f5c76d
[docs] improve distributed inference cp docs. ( #12810 )
...
* improve distributed inference cp docs.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-12-10 08:25:07 -08:00
Ali Imran
1b456bd5d5
docs: cleanup of runway model ( #12503 )
...
* cleanup of runway model
* quality fixes
2025-10-17 14:10:50 -07:00
Steven Liu
b4e6dc3037
[docs] Fix broken links ( #12487 )
...
fix broken links
2025-10-15 06:42:10 +05:30
Steven Liu
cc5b31ffc9
[docs] Migrate syntax ( #12390 )
...
* change syntax
* make style
2025-09-30 10:11:19 -07:00
Steven Liu
d7a1a0363f
[docs] CP ( #12331 )
...
* init
* feedback
* feedback
* feedback
* feedback
* feedback
* feedback
2025-09-30 09:33:41 -07:00
Steven Liu
ccedeca96e
[docs] Distributed inference ( #12285 )
...
* init
* feedback
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-09-29 11:24:26 -07:00
Ziheng Zhang
9e4a75b142
[docs] Fix VAE scale factor calculation in distributed inference docs ( #12259 )
...
docs: Fix VAE scale factor calculation
2025-09-01 16:34:16 -10:00
Steven Liu
5237a82a35
[docs] Remove Flax ( #12244 )
...
* remove flax
* toctree
* feedback
2025-08-27 11:11:07 -07:00
Álvaro Somoza
edcbe8038b
Fix huggingface-hub failing tests ( #11994 )
...
* login
* more logins
* uploads
* missed login
* another missed login
* downloads
* examples and more logins
* fix
* setup
* Apply style fixes
* fix
* Apply style fixes
2025-07-29 02:34:58 -04:00
Aryan
a4df8dbc40
Update more licenses to 2025 ( #11746 )
...
update
2025-06-19 07:46:01 +05:30
Steven Liu
be2fb77dc1
[docs] PyTorch 2.0 ( #11618 )
...
* combine
* Update docs/source/en/optimization/fp16.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-05-28 09:42:41 -07:00
osrm
8705af0914
docs: fix invalid links ( #11505 )
...
* fix invalid link lora.md
* fix invalid link controlnet_sdxl.md
The Hugging Face models page now uses the tags parameter instead of the other parameter for tag-based filtering. Therefore, to simultaneously apply both the "Stable Diffusion XL" and "ControlNet" tags, the following URL should be used: https://huggingface.co/models?tags=stable-diffusion-xl,controlnet
* fix invalid link cosine_dpm.md
"https://github.com/Stability-AI/stable-audio-tool " -> "https://github.com/Stability-AI/stable-audio-tools "
* Update controlnet_sdxl.md
* Update cosine_dpm.md
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-20 08:55:41 -07:00
Quentin Gallouédec
c8bb1ff53e
Use HF Papers ( #11567 )
...
* Use HF Papers
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-05-19 06:22:33 -10:00
co63oc
f00a995753
Fix typos in strings and comments ( #11407 )
2025-04-24 08:53:47 -10:00
Sayak Paul
cefa28f449
[docs] Promote AutoModel usage ( #11300 )
...
* docs: promote the usage of automodel.
* bitsandbytes
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-04-15 09:25:40 +05:30
puhuk
27b90235e4
Update Custom Diffusion Documentation for Multiple Concept Inference to resolve issue #10791 ( #10792 )
...
Update Custom Diffusion Documentation for Multiple Concept Inference
This PR updates the Custom Diffusion documentation to correctly demonstrate multiple concept inference by:
- Initializing the pipeline from a proper foundation model (e.g., "CompVis/stable-diffusion-v1-4") instead of a fine-tuned model.
- Defining model_id explicitly to avoid NameError.
- Correcting method calls for loading attention processors and textual inversion embeddings.
2025-02-14 08:19:11 -08:00
ChG
c44fba8899
fix link in the docs ( #10058 )
...
* fix link in the docs
* fix same issue for ko
2024-12-02 11:45:12 -08:00
Vahid Askari
a03bf4a531
Fix: Remove duplicated comma in distributed_inference.md ( #9868 )
...
Fix: Remove duplicated comma
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-11-05 23:37:11 +01:00
YiYi Xu
d2e5cb3c10
Revert "[LoRA] fix: lora loading when using with a device_mapped mode… ( #9823 )
...
Revert "[LoRA] fix: lora loading when using with a device_mapped model. (#9449 )"
This reverts commit 41e4779d98 .
2024-10-31 08:19:32 -10:00
Sayak Paul
41e4779d98
[LoRA] fix: lora loading when using with a device_mapped model. ( #9449 )
...
* fix: lora loading when using with a device_mapped model.
* better attibutung
* empty
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com >
* Apply suggestions from code review
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com >
* minors
* better error messages.
* fix-copies
* add: tests, docs.
* add hardware note.
* quality
* Update docs/source/en/training/distributed_inference.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* fixes
* skip properly.
* fixes
---------
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com >
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-10-31 21:17:41 +05:30
glide-the
0d935df67d
Docs: CogVideoX ( #9578 )
...
* CogVideoX docs
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-15 14:41:56 -10:00
Sayak Paul
63a5c8742a
Update distributed_inference.md to include transformer.device_map ( #9553 )
...
* Update distributed_inference.md to include `transformer.device_map`
* Update docs/source/en/training/distributed_inference.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-10-08 08:03:51 +05:30
Steven Liu
d9c969172d
[docs] Model sharding ( #9521 )
...
* flux shard
* feedback
2024-09-25 09:33:54 -07:00
suzukimain
b52119ae92
[docs] Replace runwayml/stable-diffusion-v1-5 with Lykon/dreamshaper-8 ( #9428 )
...
* [docs] Replace runwayml/stable-diffusion-v1-5 with Lykon/dreamshaper-8
Updated documentation as runwayml/stable-diffusion-v1-5 has been removed from Huggingface.
* Update docs/source/en/using-diffusers/inpaint.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Replace with stable-diffusion-v1-5/stable-diffusion-v1-5
* Update inpaint.md
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-09-16 10:18:45 -07:00
Sayak Paul
15eb77bc4c
Update distributed_inference.md to include a fuller example on distributed inference ( #9152 )
...
* Update distributed_inference.md
* Update docs/source/en/training/distributed_inference.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-08-12 09:56:03 -07:00
omahs
6d32b29239
Fix typos ( #9077 )
...
* fix typo
2024-08-05 09:00:08 -10:00
Sayak Paul
e8284281c1
add docs on model sharding ( #8658 )
...
* add docs on model sharding
* add entry to _toctree.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* simplify wording
* add a note on transformer library handling
* move device placement section
* Update docs/source/en/training/distributed_inference.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-06-26 07:35:11 +05:30
Tolga Cangöz
468ae09ed8
Errata - Trim trailing white space in the whole repo ( #8575 )
...
* Trim all the trailing white space in the whole repo
* Remove unnecessary empty places
* make style && make quality
* Trim trailing white space
* trim
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-06-24 18:39:15 +05:30
MaoXianXin
10d3220abe
A backslash is missing from the run command ( #8471 )
2024-06-18 16:44:34 +01:00
AmosDinh
eeb70033a6
Syntax error in readme example "pipe" -> "pipeline" ( #8601 )
...
Update controlnet.md
Syntax error pipe -> pipeline
2024-06-17 11:02:07 -07:00
Steven Liu
f7a4626f4b
[docs] DeepFloyd training ( #8224 )
...
deepfloyd training
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-29 09:27:37 -07:00
Bagheera
8edaf3b79c
7879 - adjust documentation to use naruto dataset, since pokemon is now gated ( #7880 )
...
* 7879 - adjust documentation to use naruto dataset, since pokemon is now gated
* replace references to pokemon in docs
* more references to pokemon replaced
* Japanese translation update
---------
Co-authored-by: bghira <bghira@users.github.com >
2024-05-07 09:36:39 -07:00
Sayak Paul
a402431de0
[docs] remove duplicate tip block. ( #7625 )
...
remove duplicate tip block.
2024-04-10 10:31:11 +05:30
Sayak Paul
3e4a6bd2d4
[Core] add "balanced" device_map support to pipelines ( #6857 )
...
* get device <-> component mapping when using multiple gpus.
* condition the device_map bits.
* relax condition
* device_map progress.
* device_map enhancement
* some cleaning up and debugging
* Apply suggestions from code review
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com >
* incorporate suggestions from PR.
* remove multi-gpu condition for now.
* guard check the component -> device mapping
* fix: device_memory variable
* dispatching transformers model to have force_hooks=True
* better guarding for transformers device_map
* introduce support balanced_low_memory and balanced_ultra_low_memory.
* remove device_map patch.
* fix: intermediate variable scoping.
* fix: condition in cpu offload.
* fix: flax class restrictions.
* remove modifications from cpu_offload and model_offload
* incorporate changes.
* add a simple forward pass test
* add: torch_device in get_inputs()
* add: tests
* remove print
* safe-guard to(), model offloading and cpu offloading when balanced is used as a device_map.
* style
* remove .
* safeguard device_map with more checks and remove invalid device_mapping strategues.
* make a class attribute and adjust tests accordingly.
* fix device_map check
* fix test
* adjust comment
* fix: device_map attribute
* fix: dispatching.
* max_memory test for pipeline
* version guard the tests
* fix guard.
* address review feedback.
* reset_device_map method.
* add: test for reset_hf_device_map
* fix a couple things.
* add reset_device_map() in the error message.
* add tests for checking reset_device_map doesn't have unintended consequences.
* fix reset_device_map and offloading tests.
* create _get_final_device_map utility.
* hf_device_map -> _hf_device_map
* add documentation
* add notes suggested by Marc.
* styling.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* move updates within gpu condition.
* other docs related things
* note on ignore a device not specified in .
* provide a suggestion if device mapping errors out.
* fix: typo.
* _hf_device_map -> hf_device_map
* Empty-Commit
* add: example hf_device_map.
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
2024-04-10 08:59:05 +05:30
M. Tolga Cangöz
a51b6cc86a
[Docs] Fix typos ( #7451 )
...
* Fix typos
* Fix typos
* Fix typos
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-03-25 11:48:02 -07:00
M. Tolga Cangöz
f4977abcd8
Fix typos ( #7181 )
...
* Fix typos
* Fix typos
* Fix typos and update documentation in lora.md
2024-03-04 10:28:23 -08:00
Steven Liu
3dd4168d4c
[docs] Minor updates ( #7063 )
...
* updates
* feedback
2024-02-25 09:38:02 -08:00
Sayak Paul
30e5e81d58
change to 2024 in the license ( #6902 )
...
change to 2024
2024-02-08 08:19:31 -10:00
xhedit
7f58a76f48
Update lora.md with a more accurate description of rank ( #6724 )
...
* Update lora.md with a more accurate description of rank
* Update docs/source/en/training/lora.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-01-29 09:41:51 -08:00
Sayak Paul
008d9818a2
fix: t2i apdater paper link ( #6314 )
2023-12-25 10:45:14 +05:30
Abin Thomas
d816bcb5e8
Fix t2i. blog url ( #6205 )
2023-12-18 09:12:28 -08:00
Omar Sanseviero
74558ff65b
Nit fix to training params ( #6200 )
2023-12-18 11:06:16 +01:00
Kuba
1ccbfbb663
[docs] Add missing \ in lora.md ( #6174 )
2023-12-14 16:55:43 -08:00
Steven Liu
ddd8bd53ed
[docs] LCM training ( #5796 )
...
* first draft
* feedback
2023-11-29 16:08:05 +01:00
Steven Liu
bae14c8bcb
[docs] Update training docs ( #5512 )
...
* first draft
* try hfoption syntax
* fix hfoption id
* add text2image
* fix tag
* feedback
* feedbacks
* add textual inversion
* DreamBooth
* lora
* controlnet
* instructpix2pix
* custom diffusion
* t2i
* separate training methods and models
* sdxl
* kandinsky
* wuerstchen
* light edits
2023-11-14 10:29:56 -08:00