Changes support and some tts model details

This commit is contained in:
Thorsten Mueller 2024-02-18 10:38:20 +01:00
parent 15936365b0
commit affbc8a906

View File

@ -11,10 +11,6 @@
- [Thorsten-Voice Dataset 2023.09 (Hessisch)](#thorsten-voice-dataset-202309-hessisch)
- [**Thorsten** TTS-Models](#tts-models)
- [Thorsten-21.04-Tacotron2-DCA](#thorsten-2104-tacotron2-dca)
- [Thorsten-22.05-VITS](#thorsten-2205-vits)
- [Thorsten-22.08-Tacotron2-DDC](#thorsten-2208-tacotron2-ddc)
- [Other models](#other-models)
- [Public talks](#public-talks)
@ -167,14 +163,7 @@ All emotional recordings where recorded by myself and i tried to feel and pronou
```
# TTS Models
## Thorsten-21.04-Tacotron2-DCA
This [TTS-model](https://drive.google.com/drive/folders/1m4RuffbvdOmQWnmy_Hmw0cZ_q0hj2o8B?usp=sharing) has been trained on [**Thorsten-21.02-neutral**](#thorsten-2102-neutral) dataset. The recommended trained Fullband-MelGAN Vocoder can be downloaded [here](https://drive.google.com/drive/folders/1hsfaconm4Yd9wPVyOtrXjWQs4ZAPoouY?usp=sharing).
Run the model:
* pip install TTS==0.5.0
* tts-server --model_name tts_models/de/thorsten/tacotron2-DCA
Based on these opensource voice datasets several TTS (text to speech) models have been trained using AI / machine learning technology.
## Thorsten-22.05-VITS
Trained on dataset **Thorsten-22.05-neutral**.
@ -195,17 +184,31 @@ To run TTS server just follow these steps:
* Open browser on http://localhost:5002 and enjoy playing
## Other models
### Silero
## Silero
You can use a free A-GPL licensed models trained on **Thorsten-21.02-neutral** dataset via the [silero-models](https://github.com/snakers4/silero-models/blob/master/models.yml) project.
* [Thorsten 16kHz](https://drive.google.com/drive/folders/1tR6w4kgRS2JJ1TWZhwoFuU04Xkgo6YAs?usp=sharing)
* [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/snakers4/silero-models/blob/master/examples_tts.ipynb)
### ZDisket
## ZDisket
[ZDisket](https://github.com/ZDisket/TensorVox) made a tool called TensorVox for setting up an TTS environment on Windows and included a german TTS model trained by [monatis](https://github.com/monatis/german-tts). Thanks for sharing that :thumbsup:. See it in action on [Youtube](https://youtu.be/tY6_xZnkv-A).
# Support & Thanks
If you like my voice contribution and would like to support my effort for an opensource voice technology future, you can support me, if you like:
* Subscribe to my "[https://youtube.com/@ThorstenMueller/](Thorsten-Voice)" youtube channel and follow me on my social media profiles
* Buy me a tea using [Ko-Fi](https://ko-fi.com/thorstenvoice) or [Github sponsor](https://github.com/sponsors/thorstenMueller)
I want to say thank you to great people who supported me on this journey with nice words, support and compute power:
Thanks [El-Tocino](https://github.com/el-tocino/), [Eren Gölge](https://github.com/erogol/), [Gras64](https://github.com/gras64/), [Kris Gesling](https://github.com/krisgesling/), [Nmstoker](https://github.com/nmstoker), [Othiele](https://discourse.mozilla.org/u/othiele/summary), [Repodiac](https://github.com/repodiac), [SanjaESC](https://github.com/SanjaESC), [Synesthesiam](https://github.com/synesthesiam/).
Special thanks to my dear colleague, Sebastian Kraus, for supporting me with audio recording equipment and for being the creative mastermind behind the logo design and of course to the dear **Dominik** ([@domcross](https://github.com/domcross/)) for him being so close by my side on this amazing journey.
# Public talks
I really want to bring the topic "**Open Voice For An Open Future**" to a bigger public attention.
@ -216,36 +219,3 @@ I really want to bring the topic "**Open Voice For An Open Future**" to a bigger
# Youtube channel
In summer 2021 i've started to share my lessons learned and experiences on open voice tech, in special **TTS** on my little [Youtube channel](https://www.youtube.com/c/ThorstenMueller). If you check out and like my videos i'd happy to welcome you as subscriber and member of my little Youtube community.
# Feel free to file an issue if you ...
* Use my TTS voice in your project(s)
* Want to share your trained "Thorsten" model
* Get to know about any abuse usage of my voice
# Thanks section
## Cool projects
* https://commonvoice.mozilla.org/
* https://coqui.ai/
* https://mycroft.ai/
* https://github.com/rhasspy/
## Cool people
* [El-Tocino](https://github.com/el-tocino/)
* [Eren Gölge](https://github.com/erogol/)
* [Gras64](https://github.com/gras64/)
* [Kris Gesling](https://github.com/krisgesling/)
* [Nmstoker](https://github.com/nmstoker)
* [Othiele](https://discourse.mozilla.org/u/othiele/summary)
* [Repodiac](https://github.com/repodiac)
* [SanjaESC](https://github.com/SanjaESC)
* [Synesthesiam](https://github.com/synesthesiam/)
## Even more special people
Additionally, a really nice thanks for my dear colleague, Sebastian Kraus, for supporting me with audio recording equipment and for being the creative mastermind behind the logo design.
And last but not least i want to say a **huge, huge thank you** to a special guy who supported me on this journey as a partner right from the beginning. Not just with nice words, but with his time, audio optimization knowhow and finally GPU power.
**Thank you so much, dear **Dominik** ([@domcross](https://github.com/domcross/)) for being my partner on this journey.**
Thorsten (*Twitter: @ThorstenVoice*)