Overcoming Phonetic Underspecification for Hebrew Text-To-Speech
For a further improved Hebrew G2P system, see our follow-up work, Renikud.
Text-to-speech (TTS) for Modern Hebrew is challenging because standard Hebrew text leaves phonetic features such as vowels and stress underspecified. Phonikud is an open-source grapheme-to-phoneme (G2P) system that produces fully specified IPA transcriptions for more accurate Hebrew TTS. The project also introduces ILSpeech, a Hebrew corpus of audio, text, and IPA transcriptions for G2P benchmarking, TTS training, and audio-to-IPA evaluation.
- Works with real-time TTS engines such as Piper by using IPA phonemes.
- Runs locally on Raspberry Pi and other edge devices.
- Fine-tunes TTS with as little as 2 hours of data.
- Handles stress and vocal shva, which other systems miss.
- Supports low-latency screen readers, even offline.
- Provides studio-quality Hebrew speech with IPA annotations.
- Includes weights, TTS models, and training code.
- Lets you edit phonemes directly or leave them to the G2P.
See how Phonikud transforms Hebrew text through each stage.
Comparative evaluation of Phonikud against existing Hebrew TTS approaches
| Text Sample | ElevenLabs (Eleven v3) | Google (Gemini v2.5) | RoboShaul (1st place) | Phonikud (Ours, v1 alpha) |
|---|---|---|---|---|
| הוא צפה בס֫רט וראה חיה שצ֫פה במ֫ים 🐸 | | | | |
| הוא רצה את זה גם אבל היא ר֫צה מהר והקד֫ימה אותו 🏃‍♀️ | | | | |
| בוא תרד לאכול יש בור֫קס עם ת֫רד 🥬 | | | | |
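The text samples above pair words that share the same letters but differ in stress, which is the kind of underspecification the project targets. The following is a minimal sketch of how such pairs can be found in a sentence. It assumes that the combining mark ֫ (U+05AB) in the samples denotes explicit stress placement. The function names `strip_stress` and `stress_homographs` are hypothetical helpers written for this illustration and are not part of Phonikud.

```python
from collections import defaultdict

# Assumption: U+05AB (HEBREW ACCENT OLE) is the mark used in the samples
# above to flag a stressed syllable.
STRESS_MARK = "\u05ab"


def strip_stress(text: str) -> str:
    """Remove stress marks, leaving the plain letters a reader would normally see."""
    return text.replace(STRESS_MARK, "")


def stress_homographs(text: str) -> dict[str, set[str]]:
    """Group words by their unmarked spelling and keep only the groups
    in which the same letters appear with more than one stress marking."""
    spellings: dict[str, set[str]] = defaultdict(set)
    for word in text.split():
        spellings[strip_stress(word)].add(word)
    return {plain: forms for plain, forms in spellings.items() if len(forms) > 1}


# Second sample from the table, with the stress marks written as escapes
# (emoji omitted).
sample = "הוא רצה את זה גם אבל היא ר\u05abצה מהר והקד\u05abימה אותו"

for plain, forms in stress_homographs(sample).items():
    print(plain, sorted(forms))
```

Once the marks are stripped, the two words in each reported group become identical strings, so a TTS front end that receives only plain text has no way to tell them apart without additional context.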
More resources, demos, and tools for Phonikud
@inproceedings{kolani2026phonikud,
title={Phonikud: Overcoming Phonetic Underspecification for Hebrew Text-To-Speech},
author={Yakov Kolani and Maxim Melichov and Cobi Calev and Morris Alper},
booktitle={Proc. Interspeech 2026},
year={2026},
url={https://arxiv.org/abs/2506.12311},
}