&#128218; [Contribution] ebook2audiobook roadmap

## All Features open to public Contributions &#11088;
- [x] -h -help parameter info in different languages
- [x] Notebooks Folder [Talked about here](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/5#issuecomment-2408773254)
- [x] Make Chinese text splitting not split words and improve pause timing [Talked about here](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/18#issuecomment-2401154894)
- [x] Get Kaggel Notebook working
- [x] Get Working Google Colab Notebook [Talked about here
](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/5#issuecomment-2408773254)
- [ ] [Make a ios app](https://github.com/DrewThomasson/ebook2audiobook/pull/35#issuecomment-2496495212)
- [ ] [Make an android app](https://github.com/DrewThomasson/ebook2audiobook/pull/35#issuecomment-2496495212)
## Wanted Extra Parameters
- [ ] [Translate from translate the full ebook to X lang and audiobook it, output original ebookfile, audiobook file, and translated ebook file](https://github.com/DrewThomasson/ebook2audiobook/pull/35#issuecomment-2496305631)
- [x] Make parameter for specifying output audio file format
- [ ] The F5-TTS model referenced [here](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/38#issuecomment-2453224267)
- [x] Make ebook input parameter accept a list of files for multiple files.
- [x] Make a way for multiple lines to have audio generated at a time using multiple instances of coqui tts running for more beefy hardware.
- [x] Make ebook input parameter accept a folder containing ebook files to auto-run through.
- [ ] OCR for PDF files (as a Parameter) [Talked about here](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/21)
- [x] Add a force use device (cpu or GPU) (This will force set the device at the top of the script) [Talked about here](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/17#issuecomment-2400976734)(currently being added by @ROBERT-MCDOWELL) [ref here](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/17#issuecomment-2412577620)
- [x] Use [Deepfilternet2](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/33#issuecomment-2412883354) to de-noise any reference audio for voice cloning, [demo huggingfacespace using it](https://huggingface.co/spaces/drewThomasson/DeepFilterNet2_no_limit), Talked about [here](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/33#issuecomment-2412883354)
- [x] Custom model dir input for pointing to a folder containing all of the custom model files if available instead of having to point to each model file individually
- [ ] Change voices per chapter parameter [Talked about here](https://github.com/DrewThomasson/ebook2audiobookXTTS/issues/32#issuecomment-2413998689)

## My Other Repos I Want to Integrate into the App for Extra Options :)
- [ ] Add app parameter that launches the [ebook2audiobookpiper-tts GUI](https://github.com/DrewThomasson/ebook2audiobookpiper-tts).(Piper tts appears to have issues working in ARM (apple silicon)MAC  But runs fine in the docker on ARM)
- [ ] Add app parameter that launches the [ebook2audiobookstyletts2 GUI](https://github.com/DrewThomasson/ebook2audiobookSTYLETTS2).
- [ ] Add app parameter that launches the [ebook2audiobook-espeak GUI](https://github.com/DrewThomasson/ebook2audiobookEspeak).
- [ ] Add app parameter that launches the [FineTune XTTS GUI](https://github.com/DrewThomasson/finetuneXtts_apple_silicone). 
- [ ] Add app parameter for using Barktts [documentation from coqui tts](https://docs.coqui.ai/en/latest/models/bark.html)


## Create a standard function for load_model() and inference_model() for:
- [x] XTTSv2
- [ ] Style-TTS2
- [x] Bark
- [x] Fairseq
- [x] VITS
- [x] YourTTS
- [ ] Tortoise
- [ ] Glow-TTS
- [x] Tacotron2
- [ ] Speedy-Speech
- [ ] Align-TTS
- [ ] FastPitch
- [ ] FastSpeech2
- [ ] Delightful-TTS
- [ ] Zonos
- [ ] F5-TTS
- [ ] Cozy-Voice
- [ ] Spark-TTS
- [ ] Orpheus-TTS
- [ ] Kokoro
- [ ] GPT-SoVITS


## Create Readme in these languages

- [x] Arabic (ara)
- [x] Chinese (zho)
- [x] English (eng)
- [x] Spanish (spa)
- [x] French (fra)
- [x] German (deu)
- [x] Italian (ita)
- [x] Portuguese (por)
- [x] Polish (pol)
- [x] Turkish (tur)
- [x] Russian (rus)
- [x] Dutch (nld)
- [x] Czech (ces)
- [x] Japanese (jpn)
- [x] Hindi (hin)
- [x] Bengali (ben)
- [x] Hungarian (hun)
- [x] Korean (kor)
- [x] Vietnamese (vie)
- [x] Swedish (swe)
- [x] Persian (fas)
- [x] Yoruba (yor)
- [x] Swahili (swa)
- [x] Indonesian (ind)
- [x] Slovak (slk)
- [x] Croatian (hrv)   

## Binary builds Working pyinstaller script for:

- [x] &#127822; Mac Intel x86
- [x] &#129695; Windows x86
- [x] &#128039; Linux x86
- [x] &#128421;&#65039;&#127823; Apple Silicon Mac
- [x] &#129695;&#128170; ARM Windows
- [x] &#128039;&#128170; ARM Linux

## &#128013; Single pip command install that works for:
- being overseen by @ROBERT-MCDOWELL
- [x] &#127822; Mac Intel x86
- [x] &#129695; Windows x86
- [x] &#128039; Linux x86
- [x] &#128421;&#65039;&#127823; Apple Silicon Mac
- [x] &#129695;&#128170; ARM Windows
- [x] &#128039;&#128170; ARM Linux

## Extra Overkill for training models and such (All supported Coqui-tts models and piper-tts in one easy command) 
- For info about this @DrewThomasson, he is currently working on the development of this, [work-in-progress-repo here](https://github.com/DrewThomasson/Universal_TTS_Finetune)
- [ ] Make a easy to use training gui for all coqui-tts models in the ljspeech format training recipes [here from coqui tts](https://github.com/coqui-ai/TTS/tree/dev/recipes/ljspeech)

## For higher level developers:
- [ ] Integrate VoxNovel experimental functionality into this &#129335; eventually. . .

## Wanted Auto-testing scripts for development

- [x] Standard model headless run through every language sample [Samples located here](https://github.com/DrewThomasson/ebook2audiobookXTTS/tree/main)


@DrewThomasson if you want to help out at all! &#128515;

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

📚 [Contribution] ebook2audiobook roadmap #32

All Features open to public Contributions ⭐

Wanted Extra Parameters

My Other Repos I Want to Integrate into the App for Extra Options :)

Create a standard function for load_model() and inference_model() for:

Create Readme in these languages

Binary builds Working pyinstaller script for:

🐍 Single pip command install that works for:

Extra Overkill for training models and such (All supported Coqui-tts models and piper-tts in one easy command)

For higher level developers:

Wanted Auto-testing scripts for development

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

📚 [Contribution] ebook2audiobook roadmap #32

Description

All Features open to public Contributions ⭐

Wanted Extra Parameters

My Other Repos I Want to Integrate into the App for Extra Options :)

Create a standard function for load_model() and inference_model() for:

Create Readme in these languages

Binary builds Working pyinstaller script for:

🐍 Single pip command install that works for:

Extra Overkill for training models and such (All supported Coqui-tts models and piper-tts in one easy command)

For higher level developers:

Wanted Auto-testing scripts for development

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions