JP TTS — Preview

Listening interfaces from the ongoing research. Pick a session below.

Blind A/B — 27 voices

All current candidates rendering the same JP passage. Engines · blind reveal · pipeline tabs.

Round-6 adds: Chatterbox (MIT) ×2 · Sarashina2.2-TTS (SoftBank, NC) ×1 · plus prior Irodori v3 · VoxCPM2 · Qwen3-TTS · Google Chirp3-HD · Supertonic 3 · Fish S2 Pro

→

Emotion sweep — instruct mode

Qwen3-TTS vs VoxCPM2 across 5 emotions (calm / sad / happy / angry / anxious).

Tests soft instruct steering · same speaker · same passage

New Round-6: chatterbox_* — Resemble AI's MIT-licensed 23-lang model (CPU on Apple Silicon). sarashina22_clone — SB Intuitions/SoftBank's 0.8B JP-first LLM-based TTS (April 2026, NC license, SilentCipher watermarked).

Listening order from RESEARCH.md: Sarashina2.2 → Irodori-TTS v3 → VoxCPM2 Voice Design → Qwen3-TTS → Chatterbox → Google Chirp3-HD → Supertonic 3 → Fish S2 Pro (NC ceiling). Round-3 consensus says Irodori-TTS v3 fixes the fluent-foreigner accent that bit earlier rounds.