Higher values increase adherence to prompt, lower values allow more creativity
Number of inference timesteps for generation (higher values may improve quality but slower)
We use wetext library to normalize the input text.