diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-02-15 23:37:08 +08:00

Author	SHA1	Message	Date
Dhruv Nair	56735380d8	Fix mistake in Single File Docs page (#8765 ) update	2024-12-23 13:02:13 +05:30
YiYi Xu	ace869b5ac	[doc] add a tip about using SDXL refiner with hunyuan-dit and pixart (#8735 ) * up * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:13 +05:30
YiYi Xu	b078f93d6a	[doc] add more about `from_pipe` API for PAG doc (#8701 ) * add more about from_pipe API * Update docs/source/en/using-diffusers/pag.md * Update docs/source/en/using-diffusers/pag.md --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-12-23 13:02:13 +05:30
YiYi Xu	5efc438c7e	add PAG support (#7944 ) * first draft --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Junhwa Song <ethan9867@gmail.com> Co-authored-by: Ahn Donghoon (안동훈 / suno) <suno.vivid@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:13 +05:30
Tolga Cangöz	1ced1c40d8	Discourage using deprecated `revision` parameter (#8573 ) * Discourage using `revision` * `make style && make quality` * Refactor code to use 'variant' instead of 'revision' * `revision="bf16"` -> `variant="bf16"` --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:13 +05:30
Tolga Cangöz	2c56360222	Errata - Trim trailing white space in the whole repo (#8575 ) * Trim all the trailing white space in the whole repo * Remove unnecessary empty places * make style && make quality * Trim trailing white space * trim --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:13 +05:30
Steven Liu	072b14dea1	[docs] Single file usage (#8412 ) * single file usage * edit	2024-12-23 13:02:12 +05:30
Tolga Cangöz	d027cb4326	Errata (#8322 ) * Fix typos * Trim trailing whitespaces * Remove a trailing whitespace * chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0 * Revert "chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0" This reverts commit `fd742b30b4`. * pokemon -> naruto * `DPMSolverMultistep` -> `DPMSolverMultistepScheduler` * Improve Markdown stylization * Improve style * Improve style * Refactor pipeline variable names for consistency * up style	2024-12-23 13:02:12 +05:30
Anton Obukhov	a495ed3e8b	Fix marigold documentation (#8372 ) * rename prs-eth/marigold-lcm-v1-0 into prs-eth/marigold-depth-lcm-v1-0 * update image paths in https://huggingface.co/datasets/huggingface/documentation-images to use main branch * fix relative paths to other diffusers pages * Update docs/source/en/using-diffusers/marigold_usage.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:12 +05:30
Steven Liu	c62a927ba1	[docs] Files and formats (#7874 ) * files and formats * fix callout * feedback * code sample * feedback	2024-12-23 13:02:12 +05:30
Steven Liu	6e0c2947e7	[docs] Scheduler features (#7990 ) * noise schedule * sigmas and zero snr * feedback * feedback	2024-12-23 13:02:12 +05:30
Álvaro Somoza	eedcdafe25	[docs] Add controlnet example to marigold (#8289 ) * initial doc * fix wrong LCM sentence * implement binary colormap without requiring matplotlib update section about Marigold for ControlNet update formatting of marigold_usage.md * fix indentation --------- Co-authored-by: anton <anton.obukhov@gmail.com>	2024-12-23 13:02:12 +05:30
Anton Obukhov	0be111f3d0	[Pipeline] Marigold depth and normals estimation (#7847 ) * implement marigold depth and normals pipelines in diffusers core * remove bibtex * remove deprecations * remove save_memory argument * remove validate_vae * remove config output * remove batch_size autodetection * remove presets logic move default denoising_steps and processing_resolution into the model config make default ensemble_size 1 * remove no_grad * add fp16 to the example usage * implement is_matplotlib_available use is_matplotlib_available, is_scipy_available for conditional imports in the marigold depth pipeline * move colormap, visualize_depth, and visualize_normals into export_utils.py * make the denoising loop more lucid fix the outputs to always be 4d tensors or lists of pil images support a 4d input_image case attempt to support model_cpu_offload_seq move check_inputs into a separate function change default batch_size to 1, remove any logic to make it bigger implicitly * style * rename denoising_steps into num_inference_steps * rename input_image into image * rename input_latent into latents * remove decode_image change decode_prediction to use the AutoencoderKL.decode method * move clean_latent outside of progress_bar * refactor marigold-reusable image processing bits into MarigoldImageProcessor class * clean up the usage example docstring * make ensemble functions members of the pipelines * add early checks in check_inputs rename E into ensemble_size in depth ensembling * fix vae_scale_factor computation * better compatibility with torch.compile better variable naming * move export_depth_to_png to export_utils * remove encode_prediction * improve visualize_depth and visualize_normals to accept multi-dimensional data and lists remove visualization functions from the pipelines move exporting depth as 16-bit PNGs functionality from the depth pipeline update example docstrings * do not shortcut vae.config variables * change all asserts to raise ValueError * rename 
output_prediction_type to output_type * better variable names clean up variable deletion code * better variable names * pass desc and leave kwargs into the diffusers progress_bar implement nested progress bar for images and steps loops * implement scale_invariant and shift_invariant flags in the ensemble_depth function add scale_invariant and shift_invariant flags readout from the model config further refactor ensemble_depth support ensembling without alignment add ensemble_depth docstring * fix generator device placement checks * move encode_empty_text body into the pipeline call * minor empty text encoding simplifications * adjust pipelines' class docstrings to explain the added construction arguments * improve the scipy failure condition add comments improve docstrings change the default use_full_z_range to True * make input image values range check configurable in the preprocessor refactor load_image_canonical in preprocessor to reject unknown types and return the image in the expected 4D format of tensor and on right device support a list of everything as inputs to the pipeline, change type to PipelineImageInput implement a check that all input list elements have the same dimensions improve docstrings of pipeline outputs remove check_input pipeline argument * remove forgotten print * add prediction_type model config * add uncertainty visualization into export utils fix NaN values in normals uncertainties * change default of output_uncertainty to False better handle the case of an attempt to export or visualize none * fix `output_uncertainty=False` * remove kwargs fix check_inputs according to the new inputs of the pipeline * rename prepare_latent into prepare_latents as in other pipelines annotate prepare_latents in normals pipeline with "Copied from" annotate encode_image in normals pipeline with "Copied from" * move nested-capable `progress_bar` method into the pipelines revert the original `progress_bar` method in pipeline_utils * minor message improvement 
* fix cpu offloading * move colormap, visualize_depth, export_depth_to_16bit_png, visualize_normals, visualize_uncertainty to marigold_image_processing.py update example docstrings * fix missing comma * change torch.FloatTensor to torch.Tensor * fix importing of MarigoldImageProcessor * fix vae offloading fix batched image encoding remove separate encode_image function and use vae.encode instead * implement marigold's intial tests relax generator checks in line with other pipelines implement return_dict __call__ argument in line with other pipelines * fix num_images computation * remove MarigoldImageProcessor and outputs from import structure update tests * update docstrings * update init * update * style * fix * fix * up * up * up * add simple test * up * update expected np input/output to be channel last * move expand_tensor_or_array into the MarigoldImageProcessor * rewrite tests to follow conventions - hardcoded slices instead of image artifacts write more smoke tests * add basic docs. * add anton's contribution statement * remove todos. * fix assertion values for marigold depth slow tests * fix assertion values for depth normals. * remove print * support AutoencoderTiny in the pipelines * update documentation page add Available Pipelines section add Available Checkpoints section add warning about num_inference_steps * fix missing import in docstring fix wrong value in visualize_depth docstring * [doc] add marigold to pipelines overview * [doc] add section "usage examples" * fix an issue with latents check in the pipelines * add "Frame-by-frame Video Processing with Consistency" section * grammarly * replace tables with images with css-styled images (blindly) * style * print * fix the assertions. * take from the github runner. * take the slices from action artifacts * style. * update with the slices from the runner. * remove unnecessary code blocks. * Revert "[doc] add marigold to pipelines overview" This reverts commit a505165150afd8dab23c474d1a054ea505a56a5f. 
* remove invitation for new modalities * split out marigold usage examples * doc cleanup --------- Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2024-12-23 13:02:12 +05:30
Tolga Cangöz	d8d7a0e307	Fix CPU Offloading Usage & Typos (#8230 ) * Fix typos * Fix `pipe.enable_model_cpu_offload()` usage * Fix cpu offloading * Update numbers	2024-12-23 13:02:12 +05:30
Álvaro Somoza	d6de291238	Official callbacks (#7761 )	2024-12-23 13:02:11 +05:30
YiYi Xu	0404c72b15	[scheduler] support custom `timesteps` and `sigmas` (#7817 ) * support custom sigmas and timesteps, dpm euler --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:11 +05:30
Steven Liu	f9c78fc6f8	[docs] Distilled inference (#7834 ) * combine * edits	2024-12-23 13:02:11 +05:30
Steven Liu	e2d7831b8f	[docs] LCM (#7829 ) * lcm * lcm lora * fix * fix hfoption * edits	2024-12-23 13:02:11 +05:30
Steven Liu	18f67e82d8	[docs] Community pipelines (#7819 ) * community pipelines * feedback * consolidate	2024-12-23 13:02:11 +05:30
Jenyuan-Huang	5fcb90f180	Update InstantStyle usage in IP-Adapter documentation (#7806 ) * enable control ip-adapter per-transformer block on-the-fly --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: ResearcherXman <xhs.research@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-12-23 13:02:11 +05:30
Fabio Rigano	ec8ce0c2a0	[Docs] Update image masking and face id example (#7780 ) * [Docs] Update image masking and face id example * Update docs * Fix docs	2024-12-23 13:02:11 +05:30
Steven Liu	de414618ba	[docs] Refactor image quality docs (#7758 ) * refactor * code snippets * fix path * fix path in guide * code outputs * align toctree title * title * fix title	2024-12-23 13:02:11 +05:30
Steven Liu	bebfb61c5c	[docs] Reproducible pipelines (#7769 ) * reproducibility * feedback * feedback * fix path * github link	2024-12-23 13:02:11 +05:30
Steven Liu	ed20a5ac49	[docs] Clean up toctree (#7715 ) * toctree * optim * feedback * improve overview	2024-12-23 13:02:11 +05:30
Jenyuan-Huang	4ec19bc3d5	Support InstantStyle (#7668 ) * enable control ip-adapter per-transformer block on-the-fly --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: ResearcherXman <xhs.research@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-12-23 13:02:10 +05:30
Steven Liu	1cce2c1c25	[docs] AutoPipeline (#7714 ) * autopipeline * edits * feedback --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:10 +05:30
Fabio Rigano	c5a2b97bff	Move IP Adapter Face ID to core (#7186 ) * Switch to peft and multi proj layers * Move Face ID loading and inference to core --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:10 +05:30
Steven Liu	2bed2f4c45	[docs] Pipeline loading (#7684 ) * pipelines * schedulers and models * community pipelines * feedback	2024-12-23 13:02:10 +05:30
Steven Liu	9b5e666e73	[docs] T2I (#7623 ) * refactor t2i * add code snippets	2024-12-23 13:02:10 +05:30
Steven Liu	f32af25416	[docs] Prompt enhancer (#7565 ) * prompt enhance * edits * align titles * feedback * feedback * feedback * link to style	2024-12-23 13:02:10 +05:30
Junjie	14b463902c	[Docs] fix bugs in callback docs (#7594 )	2024-12-23 13:02:10 +05:30
YiYi Xu	aa2f59fd64	add a `from_pipe` method to `DiffusionPipeline` (#7241 ) * add from_pipe --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-12-23 13:02:10 +05:30
UmerHA	16e445a49f	Implements Blockwise lora (#7352 ) * Initial commit * Implemented block lora - implemented block lora - updated docs - added tests * Finishing up * Reverted unrelated changes made by make style * Fixed typo * Fixed bug + Made text_encoder_2 scalable * Integrated some review feedback * Incorporated review feedback * Fix tests * Made every module configurable * Adapter to new lora test structure * Final cleanup * Some more final fixes - Included examples in `using_peft_for_inference.md` - Added hint that only attns are scaled - Removed NoneTypes - Added test to check mismatching lens of adapter names / weights raise error * Update using_peft_for_inference.md * Update using_peft_for_inference.md * Make style, quality, fix-copies * Updated tutorial;Warning if scale/adapter mismatch * floats are forwarded as-is; changed tutorial scale * make style, quality, fix-copies * Fixed typo in tutorial * Moved some warnings into `lora_loader_utils.py` * Moved scale/lora mismatch warnings back * Integrated final review suggestions * Empty commit to trigger CI * Reverted emoty commit to trigger CI --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:10 +05:30
Sayak Paul	a48c41f4c7	add: space for calculating memory usagee. (#7414 ) * add: space for calculating memory usahe. * Update docs/source/en/using-diffusers/loading.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:09 +05:30
sayakpaul	fec596ddf1	Revert "add: space within docs to calculate mememory usage." This reverts commit `78990dd960`.	2024-12-23 13:02:09 +05:30
sayakpaul	634c467193	add: space within docs to calculate mememory usage.	2024-12-23 13:02:09 +05:30
M. Tolga Cangöz	651dac5447	Fix typos (#7411 ) * Fix typos * Fix typo in SVD.md	2024-12-23 13:02:09 +05:30
Sayak Paul	8d9dadaa64	[Custom Pipelines with Custom Components] fix multiple things (#7304 ) * checking to improve pipelines. * more fixes. * add: tip to encourage the usage of revision * Apply suggestions from code review * retrigger ci --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-12-23 13:02:09 +05:30
Michael	f08278b391	Add Intro page of TCD (#7259 ) * add tcd intro * resolve repos * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * revise NFEs related * change inpainting location --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:09 +05:30
UmerHA	c5c113369b	Adds `denoising_end` parameter to ControlNetPipeline for SDXL (#6175 ) * Initial commit * Removed copy hints, as in original SDXLControlNetPipeline Removed copy hints, as in original SDXLControlNetPipeline, as the `make fix-copies` seems to have issues with the @property decorator. * Reverted changes to ControlNetXS * Addendum to: Removed changes to ControlNetXS * Added test+docs for mixture of denoiser * Update docs/source/en/using-diffusers/controlnet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/using-diffusers/controlnet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-12-23 13:02:09 +05:30
Steven Liu	6b2f8109bc	[docs] IP-Adapter image embedding (#7226 ) * update * fix parameter name * feedback * add no mask version --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:09 +05:30
Steven Liu	9d10e629af	[docs] Community tips (#7137 ) * tips * feedback * callback only	2024-12-23 13:02:08 +05:30
Steven Liu	19d9e7d5d9	[docs] Merge LoRAs (#7213 ) * merge loras * feedback * torch.compile * feedback	2024-12-23 13:02:08 +05:30
bimsarapathiraja	e5b4915090	Remove the line. Using it create wrong output (#7075 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:08 +05:30
Vinh H. Pham	eb3c60cbde	[Docs] Update callback.md code example (#7150 ) Update callback.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:08 +05:30
M. Tolga Cangöz	06de6e0b0f	Fix typos (#7181 ) * Fix typos * Fix typos * Fix typos and update documentation in lora.md	2024-12-23 13:02:08 +05:30
YiYi Xu	f38417fe30	[ip-adapter] refactor `prepare_ip_adapter_image_embeds` and skip load `image_encoder` (#7016 ) * add Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-23 13:02:08 +05:30
M. Tolga Cangöz	a3c115949d	[`Docs`] Fix typos (#7131 ) * Add copyright notice to relevant files and fix typos * Set `timestep_spacing` parameter of `StableDiffusionXLPipeline`'s scheduler to `'trailing'`. * Update `StableDiffusionXLPipeline.from_single_file` by including EulerAncestralDiscreteScheduler with `timestep_spacing="trailing"` param. * Update model loading method in SDXL Turbo documentation	2024-12-23 13:02:08 +05:30
M. Tolga Cangöz	59ed616c16	[`Docs`] Fix typos (#7118 ) Fix typos, formatting and remove trailing whitespace	2024-12-23 13:02:08 +05:30
Steven Liu	d6e432c38a	[docs] Minor updates (#7063 ) * updates * feedback	2024-12-23 13:02:08 +05:30

200 Commits