Closed
Description
All Features open to public Contributions ⭐
- -h -help parameter info in different languages
- Notebooks Folder Talked about here
- Make Chinese text splitting not split words and improve pause timing Talked about here
- Get Kaggel Notebook working
- Get Working Google Colab Notebook Talked about here
- Make a ios app
- Make an android app
Wanted Extra Parameters
- Translate from translate the full ebook to X lang and audiobook it, output original ebookfile, audiobook file, and translated ebook file
- Make parameter for specifying output audio file format
- The F5-TTS model referenced here
- Make ebook input parameter accept a list of files for multiple files.
- Make a way for multiple lines to have audio generated at a time using multiple instances of coqui tts running for more beefy hardware.
- Make ebook input parameter accept a folder containing ebook files to auto-run through.
- OCR for PDF files (as a Parameter) Talked about here
- Add a force use device (cpu or GPU) (This will force set the device at the top of the script) Talked about here(currently being added by @ROBERT-MCDOWELL) ref here
- Use Deepfilternet2 to de-noise any reference audio for voice cloning, demo huggingfacespace using it, Talked about here
- Custom model dir input for pointing to a folder containing all of the custom model files if available instead of having to point to each model file individually
- Change voices per chapter parameter Talked about here
My Other Repos I Want to Integrate into the App for Extra Options :)
- Add app parameter that launches the ebook2audiobookpiper-tts GUI.(Piper tts appears to have issues working in ARM (apple silicon)MAC But runs fine in the docker on ARM)
- Add app parameter that launches the ebook2audiobookstyletts2 GUI.
- Add app parameter that launches the ebook2audiobook-espeak GUI.
- Add app parameter that launches the FineTune XTTS GUI.
- Add app parameter for using Barktts documentation from coqui tts
Create a standard function for load_model() and inference_model() for:
- XTTSv2
- Style-TTS2
- Bark
- Fairseq
- VITS
- YourTTS
- Tortoise
- Glow-TTS
- Tacotron2
- Speedy-Speech
- Align-TTS
- FastPitch
- FastSpeech2
- Delightful-TTS
- Zonos
- F5-TTS
- Cozy-Voice
- Spark-TTS
- Orpheus-TTS
- Kokoro
- GPT-SoVITS
Create Readme in these languages
- Arabic (ara)
- Chinese (zho)
- English (eng)
- Spanish (spa)
- French (fra)
- German (deu)
- Italian (ita)
- Portuguese (por)
- Polish (pol)
- Turkish (tur)
- Russian (rus)
- Dutch (nld)
- Czech (ces)
- Japanese (jpn)
- Hindi (hin)
- Bengali (ben)
- Hungarian (hun)
- Korean (kor)
- Vietnamese (vie)
- Swedish (swe)
- Persian (fas)
- Yoruba (yor)
- Swahili (swa)
- Indonesian (ind)
- Slovak (slk)
- Croatian (hrv)
Binary builds Working pyinstaller script for:
- 🍎 Mac Intel x86
- 🪟 Windows x86
- 🐧 Linux x86
- 🖥️🍏 Apple Silicon Mac
- 🪟💪 ARM Windows
- 🐧💪 ARM Linux
🐍 Single pip command install that works for:
- being overseen by @ROBERT-MCDOWELL
- 🍎 Mac Intel x86
- 🪟 Windows x86
- 🐧 Linux x86
- 🖥️🍏 Apple Silicon Mac
- 🪟💪 ARM Windows
- 🐧💪 ARM Linux
Extra Overkill for training models and such (All supported Coqui-tts models and piper-tts in one easy command)
- For info about this @DrewThomasson, he is currently working on the development of this, work-in-progress-repo here
- Make a easy to use training gui for all coqui-tts models in the ljspeech format training recipes here from coqui tts
For higher level developers:
- Integrate VoxNovel experimental functionality into this 🤷 eventually. . .
Wanted Auto-testing scripts for development
- Standard model headless run through every language sample Samples located here
@DrewThomasson if you want to help out at all! 😃