Add support for Raw Phoneme Input and Other Phonemes #742

kbickar · 2025-03-07T23:51:17Z

These changes add a new phoneme type, "phonemes", to Piper. A model configured with this type accepts raw phonemes as input, so no phonemizer should be run on the text.

Phonemes can now also be defined as multi-character ASCII strings (1-3 characters), which adds support for other phonetic alphabets such as SAMPA and Arpabet.

The changes from #403 have been added to allow the --phoneme-input flag to be used in addition to the model config.
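As a rough illustration of how the model config and the flag could combine, the Python sketch below skips the phonemizer when either one asks for raw phonemes. The `phoneme_type` config key, the function name `text_to_phonemes`, the `phonemizer` callable, and the whitespace-separated input format are all assumptions made for this example, not code taken from the PR.

```python
from typing import Callable, Dict, List, Optional


def text_to_phonemes(
    line: str,
    config: Dict[str, str],
    phoneme_input: bool = False,
    phonemizer: Optional[Callable[[str], List[str]]] = None,
) -> List[str]:
    """Return phonemes for one input line (illustrative sketch only)."""
    # Raw mode is requested either by the model config or by the CLI flag.
    raw_mode = phoneme_input or config.get("phoneme_type") == "phonemes"
    if raw_mode:
        # Assumption: raw phonemes arrive separated by whitespace.
        return line.split()
    if phonemizer is None:
        raise RuntimeError("this model needs a phonemizer for plain text")
    return phonemizer(line)


raw_model = {"phoneme_type": "phonemes"}   # hypothetical config fragment
text_model = {"phoneme_type": "espeak"}    # hypothetical config fragment

from_config = text_to_phonemes("HH AH L OW", raw_model)
from_flag = text_to_phonemes("HH AH L OW", text_model, phoneme_input=True)
```

In this sketch the phonemizer is only touched on the plain-text path, which is what would let a raw-phoneme setup run without espeak being installed.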

Some of the code from the piper_phonemize library relating to converting phonemes to IDs has been moved back into the piper application.
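To make the phoneme-to-ID step concrete, here is a minimal Python sketch of converting multi-character phonemes to IDs. The ID map is invented for illustration (a real model ships its own), and the `^`, `$`, and `_` marker entries, the padding after each phoneme, and the whitespace-separated input are assumptions of this sketch rather than details confirmed by the PR.

```python
from typing import Dict, List

# Hypothetical ID map for illustration only.
PHONEME_ID_MAP: Dict[str, List[int]] = {
    "_": [0],   # pad
    "^": [1],   # beginning of sequence
    "$": [2],   # end of sequence
    "HH": [3],
    "AH": [4],
    "L": [5],
    "OW": [6],
}


def raw_phonemes_to_ids(
    text: str, id_map: Dict[str, List[int]], max_len: int = 3
) -> List[int]:
    """Map whitespace-separated ASCII phonemes (1..max_len chars) to IDs."""
    ids = list(id_map["^"])
    for token in text.split():
        if not token.isascii() or len(token) > max_len:
            raise ValueError(f"not a 1-{max_len} character ASCII phoneme: {token!r}")
        if token not in id_map:
            raise ValueError(f"phoneme missing from id map: {token!r}")
        ids.extend(id_map[token])
        ids.extend(id_map["_"])  # pad after every phoneme (assumed convention)
    ids.extend(id_map["$"])
    return ids


ids = raw_phonemes_to_ids("HH AH L OW", PHONEME_ID_MAP)
print(ids)  # [1, 3, 0, 4, 0, 5, 0, 6, 0, 2]
```

Splitting on whitespace sidesteps the ambiguity that multi-character symbols would otherwise introduce (for example, whether `AH` is one phoneme or two).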

The rationale for these changes is that Piper can be used with a custom phonemizer that preprocesses sentences, without depending on the espeak library. On the Python side, espeak does not even need to be installed; on the C++ side it is still linked, as it's a little complicated to disentangle.

This relates to the piper phonemizer issue: rhasspy/piper-phonemize#17

balloob · 2025-03-08T04:27:36Z

src/cpp/main.cpp

-void rawOutputProc(vector<int16_t> &sharedAudioBuffer, mutex &mutAudio,
-                   condition_variable &cvAudio, bool &audioReady,
-                   bool &audioFinished);
+void parseArgs(int argc, char* argv[], RunConfig& runConfig);


Style changes for unrelated code should not be in this PR. If a formatter should be used, we should put that in a different PR.

Fair enough. I've created another PR containing just the formatting changes: #744

balloob

Please split the style changes out of this PR.

synesthesiam · 2025-07-10T21:21:22Z

Development has moved: https://github.com/OHF-Voice/piper1-gpl

- Added current implementation status of piper-plus
- Noted that Japanese phoneme issues (rhasspy#787) are already resolved
- Listed existing PRs (#96-#100) for TTS improvements
- Updated priorities based on what's already implemented
- Focused recommendations on features not yet in piper-plus:
  * Raw Phoneme Input (PR rhasspy#742)
  * IPA phonemes support (PR rhasspy#401)
  * GPU device designation (PR rhasspy#429)
  * Silence randomness control (Issue rhasspy#817)

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

kbickar added 3 commits March 6, 2025 18:26

Add input that takes raw phonemes as input

b073d33

Incorporate rhasspy#403 changes for command line arg

4cbeef2

Update C++ version to support raw phoneme input

8e9d72c

balloob reviewed Mar 8, 2025

balloob suggested changes Mar 8, 2025

ayutaz mentioned this pull request Jul 19, 2025

feat: Add Raw Phoneme Input support ayutaz/piper-plus#104

Open

6 tasks
