Explore Phonikud - Resources, Demos & Tools

ReNikud

Follow-up G2P Audio-supervised

Demonstration of audio-supervised Hebrew G2P as a follow-up to Phonikud. Uses weak supervision from speech and ASR pseudo-labels to model spoken pronunciation.

BlueTTS Hebrew Model

ONNX Multilingual Zero-shot

Fast open-source TTS with ONNX Runtime inference supporting 5 languages including Hebrew. Features zero-shot voice cloning and emotion control, with Phonikud powering Hebrew G2P.

ChatterBox Hebrew Model

ChatterBox Zero-shot

Demonstration of Hebrew Text-to-Speech using ChatterBox AI with Phonikud integration. Features multilingual zero-shot voice cloning and emotion control with performance that outperforms ElevenLabs.

ZipVoice Hebrew Model

ZipVoice Zero-shot

Hebrew TTS with ZipVoice and Phonikud integration. Natural speech, accurate pronunciation, and zero-shot voice cloning with low-latency ONNX inference.

Voice Cloning Integration

Voice Clone AI

Demonstration of seamless integration with voice cloning approaches. Phonikud enables accurate Hebrew pronunciation modeling that works perfectly with voice cloning techniques for personalized speech synthesis.

StyleTTS2 Hebrew Model

TTS StyleTTS2

New Hebrew TTS model based on StyleTTS2 with accurate IPA transcription and stress markers. Optimized for local deployment on simple hardware.

Zonos Hebrew Model

Zonos Multilingual

Advanced Hebrew TTS model based on Zonos architecture, trained on Phonikud IPA and Saspeech datasets. Features zero-shot voice cloning and multilingual support with high-quality 44kHz output.

Whisper-Heb-IPA

Whisper IPA ASR

Fine-tuned from ivrit.ai Whisper Large v3 Turbo model for transcribing Hebrew speech into IPA phonetic representation. Trained on the ILSpeech dataset with ~90% accuracy, providing highly accurate Hebrew phonetic transcription for speech recognition applications.

ILSpeech Dataset

Audio 2 hours

Studio-quality Hebrew speech dataset with two male speakers. Includes clean text and phoneme annotations in LJSpeech format, phonemized using Phonikud.

SASpeech Dataset

Audio 13+ hours

Large-scale Hebrew speech dataset with single-speaker audio at 44.1kHz. Enhanced from OpenSLR with Hebrew diacritics and Phonikud-generated phonemes.

Hebrew Text Dataset

Text 2M lines

Clean Hebrew text dataset based on Common Crawl containing modern Hebrew content from across the internet. Enhanced with diacritics, stress marks, and morphological information.

Phonikud Training Data

Text 5M lines

Training dataset used to create the first version of Phonikud. Contains clean Hebrew sentences with nikud and phonetic marks, with manual corrections for high-frequency words.

Phonikud Phonemes Dataset

Text 7.58M lines G2P

Dataset of text and phonemes that can be used to train G2P models. Contains Hebrew text with diacritics paired with IPA phonetic transcriptions. Includes hedc4-phonemes (2M lines) and knesset_phonemes (5M lines) generated with Phonikud.

Interactive Presentation

Interactive

Visual presentation that explains the challenges of Hebrew writing system and how Phonikud solves the phonetic ambiguity problem. Demonstrates multiple pronunciations of the same Hebrew text.

Hebrew TTS Benchmark

Benchmark WER/CER

Comprehensive Hebrew TTS benchmark comparing 17 models using Word Error Rate vs Character Error Rate metrics. Features interactive scatter plot visualization and uses whisper-heb-ipa for evaluation on hand-annotated SASpeech dataset samples.

Hebrew G2P Benchmark

Benchmark G2P WER/CER

Benchmark comparing Hebrew G2P models using WER and CER metrics on IPA transcriptions. Includes an interactive scatter plot and leaderboard showing how different models perform. Add your model!

HF Space: Hebrew TTS

Interactive

Fast Text-to-Speech in Hebrew with Phonetic Control. Enter unvocalized Hebrew text to generate speech with control over text, diacritics, and phonemes.

Hebrew AI Assistant

Interactive

Local AI assistant powered by Phonikud for natural Hebrew speech synthesis. Wake it up with "Picovoice!" and have conversations with full offline TTS capabilities.

Phonikud-TTS Python Package

Library TTS

Python library for Hebrew text-to-speech using Phonikud with Piper and StyleTTS2 support. Easy pip installation with ONNX models for efficient offline Hebrew speech synthesis. Includes examples and non-commercial license.

Discord Community

Community

Join our Discord community to discuss Text-to-Speech, Grapheme-to-Phoneme conversion, Hebrew linguistics, and collaborate on advancing Hebrew speech technology.

Gemma3-G2P

G2P LLM

Fine-tuned Gemma3 language model for Hebrew grapheme-to-phoneme conversion. Provides training scripts, inference tools, and deployment options including GGUF export for efficient local inference with Ollama and llama.cpp.