Skip to content
Discussion options

You must be logged in to vote

XTTS doesn't use a phonemizer at all, it goes directly from text to speech, so there's no way to manually provide the correct stress position. The model would have to be trained on much more Russian speech to learn it on its own, it has seen "only" 147 hours.

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by Tarken-ai
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants