SDEdit-AudioLDM2

This is the SDEdit implementation for AudioLDM2 used in AP-adapter: "Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music with Lightweight Finetuning," in Proc. Int. Society for Music Information Retrieval Conf. (ISMIR), 2024.

Installation

git clone https://github.com/fundwotsai2001/SDEdit-AudioLDM2.git
cd SDEdit-AudioLDM2
pip install -r requirements.txt

Inference

Edit config.py to change the inference settings. The most important one is noise_scale: noise_scale = 1.0 adds the full number of noise steps, and noise_scale = 0 adds no noise. Values in between add partial noise, so lower values keep the output closer to the input audio. Tune this parameter to your preference.

python inference.py
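
To get a feel for noise_scale before running inference, the sketch below reproduces the step arithmetic used in the pipeline (shallow_reverse_step = int(num_inference_steps * (1 - noise_scale))). The helper name sdedit_schedule and the value of 200 steps are illustrative assumptions, not part of this repository.

```python
from typing import Tuple


def sdedit_schedule(num_inference_steps: int, noise_scale: float) -> Tuple[int, int]:
    """Return (skipped_steps, denoising_steps) for a given noise_scale.

    Hypothetical helper: it mirrors the arithmetic shown in this README,
    it is not a function from the repository.
    """
    if not 0.0 <= noise_scale <= 1.0:
        raise ValueError("noise_scale must be in [0, 1]")
    # Number of leading scheduler timesteps that are skipped.
    skipped = int(num_inference_steps * (1 - noise_scale))
    # The remaining timesteps are the ones actually denoised.
    return skipped, num_inference_steps - skipped


if __name__ == "__main__":
    # 200 steps is an assumed example value.
    for scale in (1.0, 0.5, 0.0):
        skipped, remaining = sdedit_schedule(200, scale)
        print(f"noise_scale={scale}: skip {skipped} steps, denoise for {remaining} steps")
```

A higher noise_scale leaves more denoising steps, which gives the text prompt more room to change the input audio.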

Key change from original AudioLDM2

We add a few lines that encode the input mel spectrogram into latents and add the desired amount of noise. Everything else remains the same.

# Get mel from the input audio
mel = wav_to_mel(audio_path,
                10,
                augment_data=False,
                mix_data=None,
                snr=None)
mel = mel.unsqueeze(0).to(device).to(torch.float16)

# Encode the mel into VAE latents
latents = self.vae.encode(mel).latent_dist.sample()
latents = latents * self.vae.config.scaling_factor
noise = torch.randn_like(latents)

# Decide the noise level: skip the first (1 - noise_scale) fraction of the schedule
shallow_reverse_step = int(num_inference_steps * (1 - noise_scale))
timesteps = timesteps[shallow_reverse_step:]
timesteps_tensor = torch.tensor([timesteps[0]], dtype=torch.int32)

# Add the corresponding noise to the latents
noisy_sample = self.scheduler.add_noise(latents, noise, timesteps_tensor)
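
For intuition about the add_noise call above, here is a dependency-free sketch of the usual DDPM-style forward noising rule, x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * noise. It is only an illustration: the scheduler and beta schedule actually used by this pipeline are not stated here, so the linear schedule, its endpoints, and the helper names are assumptions.

```python
import math
import random


def alphas_cumprod(num_train_timesteps=1000, beta_start=1e-4, beta_end=2e-2):
    """Cumulative product of (1 - beta_t) for an ASSUMED linear beta schedule."""
    out, prod = [], 1.0
    for i in range(num_train_timesteps):
        beta = beta_start + (beta_end - beta_start) * i / (num_train_timesteps - 1)
        prod *= 1.0 - beta
        out.append(prod)
    return out


def add_noise(latents, noise, t, acp):
    """Elementwise x_t = sqrt(acp[t]) * x_0 + sqrt(1 - acp[t]) * noise."""
    signal_weight = math.sqrt(acp[t])
    noise_weight = math.sqrt(1.0 - acp[t])
    return [signal_weight * x + noise_weight * n for x, n in zip(latents, noise)]


if __name__ == "__main__":
    acp = alphas_cumprod()
    rng = random.Random(0)
    latents = [1.0, -0.5, 0.25, 2.0]  # toy stand-in for VAE latents
    noise = [rng.gauss(0.0, 1.0) for _ in latents]
    # Larger timesteps keep less of the original signal.
    for t in (0, 499, 999):
        noisy = add_noise(latents, noise, t, acp)
        print(t, round(math.sqrt(acp[t]), 3), [round(v, 3) for v in noisy])
```

In this picture, a larger noise_scale starts denoising from an earlier position in the schedule, that is, a larger timestep, so less of the input latent survives the noising.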

Acknowledgments

This code is heavily based on AudioLDM2 and Diffusers.
