# Using vLLM
First, install vLLM for your chosen device, in either a Python or Docker environment.

Once installed, vLLM supports the following usage patterns:
- Inference and Serving: Run a single instance of a model.
- Deployment: Scale up model instances for production.
- Training: Train or fine-tune a model.