Dhruv Nair
|
edc154da09
|
Update Ruff to latest Version (#10919)
* update
* update
* update
* update
|
2025-04-09 16:51:34 +05:30 |
|
Sayak Paul
|
4ace7d0483
|
[chore] change licensing to 2025 from 2024. (#10615)
change licensing to 2025 from 2024.
|
2025-01-20 16:57:27 -10:00 |
|
Sayak Paul
|
e25e525fde
|
[LoRA test suite] refactor the test suite and cleanse it (#7316)
* cleanse and refactor lora testing suite.
* more cleanup.
* make check_if_lora_correctly_set a utility function
* fix: typo
* retrigger ci
* style
|
2024-03-20 17:13:52 +05:30 |
|
Sayak Paul
|
30e5e81d58
|
change to 2024 in the license (#6902)
change to 2024
|
2024-02-08 08:19:31 -10:00 |
|
Sayak Paul
|
cb4b3f0b78
|
[OmegaConf] replace it with yaml (#6488)
* remove omegaconf from convert_from_ckpt.
* remove from single_file.
* change to string based ubscription.
* style
* okay
* fix: vae_param
* no . indexing.
* style
* style
* turn getattrs into explicit if/else
* style
* propagate changes to ldm_uncond.
* propagate to gligen
* propagate to if.
* fix: quotes.
* propagate to audioldm.
* propagate to audioldm2
* propagate to musicldm.
* propagate to vq_diffusion
* propagate to zero123.
* remove omegaconf from diffusers codebase.
|
2024-01-15 20:02:10 +05:30 |
|
Sanchit Gandhi
|
7a24977ce3
|
Add AudioLDM 2 (#4549)
* from audioldm
* unet down + mid
* vae, clap, flan-t5
* start sequence audio mae
* iterate on audioldm encoder
* finish encoder
* finish weight conversion
* text pre-processing
* gpt2 pre-processing
* fix projection model
* working
* unet equivalence
* finish in base
* add unet cond
* finish unet
* finish custom unet
* start clean-up
* revert base unet changes
* refactor pre-processing
* tests: from audioldm
* fix some tests
* more fixes
* iterate on tests
* make fix copies
* harden fast tests
* slow integration tests
* finish tests
* update checkpoint
* update copyright
* docs
* remove outdated method
* add docstring
* make style
* remove decode latents
* enable cpu offload
* (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer)
* more clean up
* more refactor
* build pr docs
* Update docs/source/en/api/pipelines/audioldm2.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* small clean
* tidy conversion
* update for large checkpoint
* generate -> generate_language_model
* full clap model
* shrink clap-audio in tests
* fix large integration test
* fix fast tests
* use generation config
* make style
* update docs
* finish docs
* finish doc
* update tests
* fix last test
* syntax
* finalise tests
* refactor projection model in prep for TTS
* fix fast tests
* style
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
|
2023-08-21 12:34:21 +01:00 |
|