NeurIPS,
[Paper]
[Demo]
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representation for Speech Synthesis
TASLP,
[Paper]
[Demo]
Duration Controllable Voice Conversion via Phoneme-Based Information Bottleneck
ICPR,
[Paper]
[Demo]
StyleVC: Non-parallel Voice Conversion with Adversarial Style Generalization
ICASSP,
[Paper]
[Demo]
EmoQ-TTS: Emotion Intensity Quantization for Fine-grained Controllable Emotional Text-to-Speech
ICASSP,
[Paper]
[Demo]
Fre-GAN 2: Fast and Efficient Frequency-consistent Audio Synthesis
ICASSP,
[Paper]
[Demo]
PVAE-TTS: High-Quality Adaptive Text-to-Speech via Progressive Variational Autoencoder