Listening interfaces from the ongoing research. Pick a session below.
→Blind A/B — 27 voices
All current candidate voices rendering the same Japanese passage. Engines · blind reveal · pipeline tabs.
→Emotion sweep — instruct mode
Qwen3-TTS vs VoxCPM2 across 5 emotions (calm / sad / happy / angry / anxious).
chatterbox_*: Resemble AI's MIT-licensed 23-language model (runs on CPU on Apple Silicon).
sarashina22_clone: SB Intuitions/SoftBank's 0.8B Japanese-first LLM-based TTS (April 2026, non-commercial license, SilentCipher-watermarked).