From affbc8a906575f7a439190020a44cb57c0a92b62 Mon Sep 17 00:00:00 2001
From: Thorsten Mueller
Date: Sun, 18 Feb 2024 10:38:20 +0100
Subject: [PATCH] Changes support and some tts model details

---
 README.md | 66 +++++++++++++++----------------------------------------
 1 file changed, 18 insertions(+), 48 deletions(-)

diff --git a/README.md b/README.md
index 981cf10..526463d 100644
--- a/README.md
+++ b/README.md
@@ -11,10 +11,6 @@
   - [Thorsten-Voice Dataset 2023.09 (Hessisch)](#thorsten-voice-dataset-202309-hessisch)
 
 - [**Thorsten** TTS-Models](#tts-models)
-  - [Thorsten-21.04-Tacotron2-DCA](#thorsten-2104-tacotron2-dca)
-  - [Thorsten-22.05-VITS](#thorsten-2205-vits)
-  - [Thorsten-22.08-Tacotron2-DDC](#thorsten-2208-tacotron2-ddc)
-  - [Other models](#other-models)
 
 - [Public talks](#public-talks)
 
@@ -167,14 +163,7 @@ All emotional recordings where recorded by myself and i tried to feel and pronou
 ```
 
 # TTS Models
-
-## Thorsten-21.04-Tacotron2-DCA
-This [TTS-model](https://drive.google.com/drive/folders/1m4RuffbvdOmQWnmy_Hmw0cZ_q0hj2o8B?usp=sharing) has been trained on [**Thorsten-21.02-neutral**](#thorsten-2102-neutral) dataset. The recommended trained Fullband-MelGAN Vocoder can be downloaded [here](https://drive.google.com/drive/folders/1hsfaconm4Yd9wPVyOtrXjWQs4ZAPoouY?usp=sharing).
-
-Run the model:
-* pip install TTS==0.5.0
-* tts-server --model_name tts_models/de/thorsten/tacotron2-DCA
-
+Based on these opensource voice datasets several TTS (text to speech) models have been trained using AI / machine learning technology.
 
 ## Thorsten-22.05-VITS
 Trained on dataset **Thorsten-22.05-neutral**.
@@ -195,17 +184,31 @@ To run TTS server just follow these steps:
 * Open browser on http://localhost:5002 and enjoy playing
 
 
-## Other models
-### Silero
+## Silero
 You can use a free A-GPL licensed models trained on **Thorsten-21.02-neutral** dataset via the [silero-models](https://github.com/snakers4/silero-models/blob/master/models.yml) project.
 * [Thorsten 16kHz](https://drive.google.com/drive/folders/1tR6w4kgRS2JJ1TWZhwoFuU04Xkgo6YAs?usp=sharing)
 * [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/snakers4/silero-models/blob/master/examples_tts.ipynb)
 
 
-### ZDisket
+## ZDisket
 [ZDisket](https://github.com/ZDisket/TensorVox) made a tool called TensorVox for setting up an TTS environment on Windows and included a german TTS model trained by [monatis](https://github.com/monatis/german-tts). Thanks for sharing that :thumbsup:. See it in action on [Youtube](https://youtu.be/tY6_xZnkv-A).
 
 
+# Support & Thanks
+If you like my voice contribution and would like to support my effort for an opensource voice technology future, you can support me, if you like:
+
+* Subscribe to my "[https://youtube.com/@ThorstenMueller/](Thorsten-Voice)" youtube channel and follow me on my social media profiles
+* Buy me a tea using [Ko-Fi](https://ko-fi.com/thorstenvoice) or [Github sponsor](https://github.com/sponsors/thorstenMueller)
+
+I want to say thank you to great people who supported me on this journey with nice words, support and compute power:
+Thanks [El-Tocino](https://github.com/el-tocino/), [Eren Gölge](https://github.com/erogol/), [Gras64](https://github.com/gras64/), [Kris Gesling](https://github.com/krisgesling/), [Nmstoker](https://github.com/nmstoker), [Othiele](https://discourse.mozilla.org/u/othiele/summary), [Repodiac](https://github.com/repodiac), [SanjaESC](https://github.com/SanjaESC), [Synesthesiam](https://github.com/synesthesiam/).
+
+Special thanks to my dear colleague, Sebastian Kraus, for supporting me with audio recording equipment and for being the creative mastermind behind the logo design and of course to the dear **Dominik** ([@domcross](https://github.com/domcross/)) for him being so close by my side on this amazing journey.
+
+
+
+
+
 # Public talks
 I really want to bring the topic "**Open Voice For An Open Future**" to a bigger public attention.
 
@@ -216,36 +219,3 @@ I really want to bring the topic "**Open Voice For An Open Future**" to a bigger
 
 # Youtube channel
 In summer 2021 i've started to share my lessons learned and experiences on open voice tech, in special **TTS** on my little [Youtube channel](https://www.youtube.com/c/ThorstenMueller). If you check out and like my videos i'd happy to welcome you as subscriber and member of my little Youtube community.
-
-
-# Feel free to file an issue if you ...
-* Use my TTS voice in your project(s)
-* Want to share your trained "Thorsten" model
-* Get to know about any abuse usage of my voice
-
-# Thanks section
-## Cool projects
-* https://commonvoice.mozilla.org/
-* https://coqui.ai/
-* https://mycroft.ai/
-* https://github.com/rhasspy/
-
-## Cool people
-* [El-Tocino](https://github.com/el-tocino/)
-* [Eren Gölge](https://github.com/erogol/)
-* [Gras64](https://github.com/gras64/)
-* [Kris Gesling](https://github.com/krisgesling/)
-* [Nmstoker](https://github.com/nmstoker)
-* [Othiele](https://discourse.mozilla.org/u/othiele/summary)
-* [Repodiac](https://github.com/repodiac)
-* [SanjaESC](https://github.com/SanjaESC)
-* [Synesthesiam](https://github.com/synesthesiam/)
-
-## Even more special people
-Additionally, a really nice thanks for my dear colleague, Sebastian Kraus, for supporting me with audio recording equipment and for being the creative mastermind behind the logo design.
-
-And last but not least i want to say a **huge, huge thank you** to a special guy who supported me on this journey as a partner right from the beginning. Not just with nice words, but with his time, audio optimization knowhow and finally GPU power.
-
-**Thank you so much, dear **Dominik** ([@domcross](https://github.com/domcross/)) for being my partner on this journey.**
-
-Thorsten (*Twitter: @ThorstenVoice*)