Job Description
We are now expanding our team and are looking for skilled, goal-oriented MLE (TTS) to join our teams.
Requirements
3+ years of hands-on experience with Text-to-Speech (TTS) / speech synthesis
Proficiency in Python and deep learning frameworks (especially, PyTorch).
Strong understanding of speech synthesis processing techniques.
Experience with Fast Attention-Based Models: (FastPitch, FastSpeech 2) and modern variative approaches: (e.g., VITS, Glow-TTS).
Strong understanding of techniques to control prosody, rhythm, and emotional tone for expressive speech synthesis.
Knowledge of normalization techniques, FSTs, NN for normalization.
Familiarity with TTS evaluation techniques, including MOS and A/B testing.
Familiarity with vocoder models (e.g. Vocos, HiFi-GAN, mimi).
Knowledge of signal processing, statistical modeling, and language structure.
Responsibilities
Design and optimize TTS models to ensure our voice assistant sounds as natural and accurate ...
Requirements
3+ years of hands-on experience with Text-to-Speech (TTS) / speech synthesis
Proficiency in Python and deep learning frameworks (especially, PyTorch).
Strong understanding of speech synthesis processing techniques.
Experience with Fast Attention-Based Models: (FastPitch, FastSpeech 2) and modern variative approaches: (e.g., VITS, Glow-TTS).
Strong understanding of techniques to control prosody, rhythm, and emotional tone for expressive speech synthesis.
Knowledge of normalization techniques, FSTs, NN for normalization.
Familiarity with TTS evaluation techniques, including MOS and A/B testing.
Familiarity with vocoder models (e.g. Vocos, HiFi-GAN, mimi).
Knowledge of signal processing, statistical modeling, and language structure.
Responsibilities
Design and optimize TTS models to ensure our voice assistant sounds as natural and accurate ...
Ready to Apply?
Take the next step in your AI career. Submit your application to Aiphoria today.
Submit Application