MegaTTS 3

A high-quality multilingual text-to-speech model from ByteDance, capable of generating human-like speech with emotion, prosody, and cross-lingual support.

1 min