Skip to content

Question on the design choice of volume-preserving flow in the prior encoder #235

@AnitaLiu98

Description

@AnitaLiu98

Hello, thank you for the excellent work on VITS.

My question is about the volume-preserving design of the normalizing flow in the prior encoder (Section 2.5.2 of the paper).

Why did you choose this design over a more expressive non-volume-preserving flow? Was it primarily for training stability, simplicity, or due to empirical results showing no significant performance gain?

I would greatly appreciate any insights into this design choice. Thank you.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions