1 Comment
User's avatar
Rainbow Roxy's avatar

It's interesting how you highlit the space between stimulus and output as the core difference; could you elaborate on how inductive human bias mite bridge that gap in TTS?