mirror of https://github.com/huggingface/diffusers.git synced 2025-12-16 01:14:47 +08:00

Sayak Paul 30e5e81d58 change to 2024 in the license (#6902)

change to 2024

2024-02-08 08:19:31 -10:00

Overview

Generating high-quality outputs is computationally intensive, especially during the iterative denoising process, where each step goes from a noisy output to a less noisy one. One of 🤗 Diffusers' goals is to make this technology widely accessible, which includes enabling fast inference on consumer and specialized hardware.

This section covers tips and tricks, like half-precision weights and sliced attention, for optimizing inference speed and reducing memory consumption. You'll also learn how to speed up your PyTorch code with torch.compile or ONNX Runtime, and how to enable memory-efficient attention with xFormers. There are also guides for running inference on specific hardware such as Apple Silicon and Intel or Habana processors.
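To give a rough sense of why half-precision weights and sliced attention reduce memory, here is a minimal back-of-the-envelope sketch in plain Python. It is not part of 🤗 Diffusers: the helper names (`weights_bytes`, `attention_scores_bytes`), the 1-billion-parameter model, and the attention shape are all hypothetical, chosen only for illustration, and the arithmetic ignores activations, optimizer state, and framework overhead.

```python
from typing import Optional

# Bytes used by one element of each dtype.
BYTES_PER_ELEMENT = {"float32": 4, "float16": 2}


def weights_bytes(num_params: int, dtype: str) -> int:
    """Memory needed just to hold the model weights in the given dtype."""
    return num_params * BYTES_PER_ELEMENT[dtype]


def attention_scores_bytes(
    batch: int,
    heads: int,
    seq_len: int,
    dtype: str,
    slice_size: Optional[int] = None,
) -> int:
    """Peak memory of the (seq_len x seq_len) attention score matrices.

    Without slicing, all batch * heads matrices are materialized at once.
    With slicing, only `slice_size` of them exist at any given time
    (assumption: slicing runs over the combined batch * heads dimension).
    """
    rows = batch * heads
    if slice_size is not None:
        rows = min(rows, slice_size)
    return rows * seq_len * seq_len * BYTES_PER_ELEMENT[dtype]


if __name__ == "__main__":
    num_params = 1_000_000_000  # hypothetical 1B-parameter model
    for dtype in ("float32", "float16"):
        gib = weights_bytes(num_params, dtype) / 2**30
        print(f"{dtype} weights: {gib:.2f} GiB")

    full = attention_scores_bytes(1, 8, 4096, "float16")
    sliced = attention_scores_bytes(1, 8, 4096, "float16", slice_size=1)
    print(f"attention scores, all heads at once: {full / 2**20:.0f} MiB")
    print(f"attention scores, one head at a time: {sliced / 2**20:.0f} MiB")
```

The trade-off this sketch hints at is that slicing lowers peak memory by doing the same work in more sequential steps, so it can cost some speed. In 🤗 Diffusers itself, these two ideas are exposed by passing `torch_dtype=torch.float16` to `from_pretrained` and by calling `enable_attention_slicing()` on a pipeline.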
