Gradio

Prompt Speech (Optional, or let VoxCPM improvise)

0:00

Prompt Text

CFG Value (Guidance Scale)

Higher values increase adherence to prompt, lower values allow more creativity

1 3

Inference Timesteps

Number of inference timesteps for generation (higher values may improve quality but slower)

4 30

Target Text

We use wetext library to normalize the input text.

Text Normalization

Output Audio