Prof. David Harel Group Department of Computer Science & Applied Mathematics

Home
Full Thesis (PDF)
CV

Name Yadin Benyamin

Role MSc student

Email yadin.benyamin@weizmann.ac.il

Whisper Emotional Speech Synthesis (WESS)

WESS architecture: T2S → S2A → Vocoder with emotion/dominance prefix and speaker embedding

Interactive Audio Demo

Sentence

Speaker sex

Emotion

WS (before fine-tuning)

WESS (ours)

GPT-4o mini TTS (SOTA)

Emotion-ID accuracy by emotion (95% CI, FDR-adjusted p-values)

© 2025 Yadin Benyamin · Weizmann Institute of Science