Skip to content
Discussion options

You must be logged in to vote

speaker embeddings are computed using a speaker embedding layer.

d_vectors are computed externally from a speaker encoder model.

speaker embedding model is harder to expand for more speakers once trained since each new speaker needs to be added to the speaker embedding layer

d_vectors do not have this issue but you need a high-quality pre-trained speaker encoder to make this work well.

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@Ca-ressemble-a-du-fake
Comment options

@jaggukaka
Comment options

Answer selected by Ca-ressemble-a-du-fake
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
3 participants