fixed typo

add link to sample page on index page.
changed index page to short version
2020-11-16 09:10:27 +01:00 · 2020-11-16 09:06:11 +01:00 · 2020-11-16 09:02:10 +01:00 · 2020-11-16 09:00:54 +01:00 · 2020-11-16 08:18:38 +01:00 · 2020-11-16 08:16:02 +01:00
38 changed files with 292 additions and 23079 deletions
--- a/EvolutionOfThorstenDataset.pdf
+++ b/EvolutionOfThorstenDataset.pdf
--- a/README.md
+++ b/README.md
@ -1,135 +0,0 @@
-# Introduction
-Many smart voice assistants like Amazon Alexa, Google Home, Apple Siri and Microsoft Cortana use cloud services to offer their (base) functionality.
-
-As some people have privacy concerns using these services there are some (open source) projects trying to build offline and/or privacy aware alternatives.
-
-But speech recognition and text synthesis still requires cloud services for providing these in a decent quality.
-
-# MyCroft AI
-> https://mycroft.ai/
-
-MyCroft is a company developing an opensource voice assistant with a very nice and active community. But the stt/tts parts are still cloud based (eg. google services), even if requests are anonymized by a mycroft proxy in between. But integration with locally hosted services such as deepspeech (stt) or mimic/tacotron (tts) is possible.
-
-# Mozilla
-Mozilla works on these really important aspects for free and open human machine voice interaction.
-
-## STT - speech to text
-> https://commonvoice.mozilla.org/
-
-"STT" needs lots of audio training data by many speakers (women/men/kids) of all ages, dialects and in various audio quality levels. So any voice contribution for common voice project is highly welcome.
-
-## TTS - text to speech
-> https://github.com/mozilla/tts
-
-"TTS" needs lots of clean recordings by one speaker to train a model. Mozilla is developing a software stack for proper model training based on tacotron2 papers.
-
-# And?!
-I want to make the most personal contribution i can give and contribute my personal voice (**german**) for TTS training to the community for free usage.
-
-## Please read some personal words before downloading the dataset
-I contribute my voice as a person believing in a world where all people are equal. No matter of gender, sexual orientation, religion, skin color and geocoordinates of birth location. A global world where everybody is warmly welcome on any place on this planet and open and free knowledge and education is available to everyone.
-
-So hopefully my voice is used in this manner to make this world a better place for all of us :-).
-
-**tl;dr** Please don't use for evil!
-
-# Dataset "thorsten"
-## Samples of my voice
-To get an impression what my voice sounds to decide if it fits to your project i published some sample recordings, so no need to download complete dataset first.
-
-* [Das Teilen eines Benutzerkontos ist strengstens untersagt.](./samples/original_recording/recorded_sample_01.wav )
-* [Der Prophet spricht stets in Gleichnissen.](./samples/original_recording/recorded_sample_02.wav )
-* [Bitte schmeißt euren Müll nicht einfach in die Walachei.](./samples/original_recording/recorded_sample_03.wav )
-* [So etwas würde mir nie in den Sinn kommen.](./samples/original_recording/recorded_sample_04.wav )
-* [Sie klettert auf einen Stein und nimmt eine Denkerpose ein.](./samples/original_recording/recorded_sample_05.wav )
-* [Jede gute Küchenwaage hat eine Tara-Funktion.](./samples/original_recording/recorded_sample_06.wav )
-* [Jeden Gedanken kannst du hier loswerden.](./samples/original_recording/recorded_sample_07.wav )
-
-
-## Dataset information
-
-* ljspeech-1.1 structure
-* 22.668 recorded phrases (wav files)
-* more than 23 hours of pure audio
-* samplerate 22.050Hz
-* mono
-* normalized to -24dB
-* phrase length (min/avg/max): 2 / 52 / 180 chars
-* no silence at beginning/ending
-* avg spoken chars per second: 14
-* sentences with question mark: 2.780
-* sentences with exclamation mark: 1.840
-
-
-![text length vs. mean audio duration](./img/thorsten-de---datasetAnalysis1.png)
-![text length vs. median audio duration](./img/thorsten-de---datasetAnalysis2.png)
-![text length vs. STD](./img/thorsten-de---datasetAnalysis3.png)
-![text length vs. number instances](./img/thorsten-de---datasetAnalysis4.png)
-![signal noise ratio](./img/thorsten-de---datasetAnalysis5.png)
-![bokeh](./img/thorsten-de---datasetAnalysis6.png)
-
-## Dataset evolution
-As decribed in the pdf document ([evolution of thorsten dataset](./EvolutionOfThorstenDataset.pdf)) this dataset consists of three recording phases.
-
-* phase1: Recorded with a cheap usb microphone
-* phase2: Recorded with a good microphone
-* phase3: Recorded with same good microphone but longer phrases (> 100 chars)
-
-If you wanna use just a dataset subset (phase1 and/or phase2 and/or phase3) you can see which files belong to which recording phase in [recording quality](./RecordingQuality.csv) csv file.
-
-
-## Download information
-> Download size: 2,7GB
-
-Version | Description | Date | Link
------------ | ------------- | ------------- | -------------
-thorsten-de-v01 | Initial version | 2020-06-28 | [Google Drive Download v01](https://drive.google.com/file/d/1yKJM1LAOQpRVojKunD9r8WN_p5KzBxjc/view?usp=sharing)
-thorsten-de-v02 | normalized to -24dB and split metadata.csv into shuffeled metadata_train.csv and metadata_val.csv | 2020-08-22 | [Google Drive Download v02](https://drive.google.com/file/d/1mGWfG0s2V2TEg-AI2m85tze1m4pyeM7b/view?usp=sharing)
-
-
-# Trained tacotron2 model "thorsten"
-If you trained a model on "thorsten" dataset please file an issue with some information on it. Sharing a trained model is highly appreciated. 
-
-## Trained models (TODO)
-
-Folder | Date | Link | Description
------------ | ------------- | ------------- | -------------
-thorsten-taco2-ddc-v0.1 | to do | to do | to do
-
-# Feel free to file an issue if you ...
-* have improvements on dataset
-* use my TTS voice in your project(s)
-* want to share your trained "thorsten" model
-* get to know about any abuse usage of my voice
-
-# Special thanks
-I want to thank all open source communities for providing great projects.
-
-Just to name some nice guys who joined me on this tts-roadtrip:
-
-* eltocino (https://github.com/el-tocino/)
-* erogol (https://github.com/erogol/)
-* gras64 (https://github.com/gras64/)
-* krisgesling (https://github.com/krisgesling/)
-* nmstoker (https://github.com/nmstoker)
-* othiele (https://discourse.mozilla.org/u/othiele/summary)
-* repodiac (https://github.com/repodiac)
-
-And last but not least i want to say a huge thank you to a special guy who supported me on this journey right from the beginning. Not just with nice words, but with his time, audio optimization knowhow and finally his gpu computing power. 
-
-Without his amazing support this dataset (in it's current way) would not exists.
-
-Thank you Dominik (@domcross / https://github.com/domcross/)
-
-# Links
-* https://discourse.mozilla.org/t/contributing-my-german-voice-for-tts/48150
-* https://community.mycroft.ai/
-* https://github.com/MycroftAI/mimic-recording-studio
-* https://voice.mozilla.org/
-* https://github.com/mozilla/TTS
-(https://github.com/repodiac/tit-for-tat/tree/master/thorsten-TTS)
-* https://raw.githubusercontent.com/mozilla/voice-web/master/server/data/de/sentence-collector.txt
-
-We'll hear us in future :-)
-
-Thorsten
--- a/RecordingQuality.csv
+++ b/RecordingQuality.csv
--- a/docs/_config.yml
+++ b/docs/_config.yml
@ -0,0 +1 @@
+theme: jekyll-theme-cayman
--- a/docs/audio_compare.md
+++ b/docs/audio_compare.md
@ -0,0 +1,184 @@
+# Vocoder Vergleich auf Basis des "thorsten" Tacotron 2 Modells
+Hier sind Hörproben mit unterschiedlichen Vocodern. Alle gesprochenen Texte (*Sample 1 - 4*) basieren auf Aufnahmen im Dataset, jedoch nicht auf dem Spektogramm von "ground truth", sondern auf Basis des trainierten Tacotron 2 Modells. Sample 5 ist der Beginn des Märchens "Der Froschkönig" und wurde nicht für das Dataset aufgezeichnet.
+
+## Sätze
+* **Sample #01**: Eure Schoko-Bonbons sind sagenhaft lecker!
+* **Sample #02**: Eure Tröte nervt.
+* **Sample #03**: Europa und Asien zusammengenommen wird auch als Eurasien bezeichnet.
+* **Sample #04**: Euer Plan hat ja toll geklappt.
+* *Sample #05: "In den alten Zeiten, wo das Wünschen noch geholfen hat, lebte ein König, dessen Töchter waren alle schön ..." (Anfang vom "Froschkönig")*
+
+# Ground truth
+Originalaufnahmen aus dem "thorsten" Dataset.
+
+<dl>
+
+<table>
+<thead>
+  <tr>
+    <th>Sample</th>
+    <th>Text</th>
+    <th>Audio</th>
+  </tr>
+</thead>
+<tbody>
+  <tr>
+    <td>01</td>
+    <td>Eure Schoko-Bonbons sind sagenhaft lecker</td>
+    <td><audio controls="" preload="none"><source src="samples/sample01-gt.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>02</td>
+    <td>Eure Tröte nervt</td>
+    <td><audio controls="" preload="none"><source src="samples/sample02-gt.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>03</td>
+    <td>Europa und Asien zusammengenommen wird auch als Eurasien bezeichnet</td>
+    <td><audio controls="" preload="none"><source src="samples/sample03-gt.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>04</td>
+    <td>Euer Plan hat ja toll geklappt.</td>
+    <td><audio controls="" preload="none"><source src="samples/sample04-gt.wav"></audio></td>
+  </tr>
+</tbody>
+</table>
+
+</dl>
+
+
+# Griffin Lim
+> Details zum Model: (todo: link)
+> Tacotron2 + DDC: 460k Schritte trainiert
+
+# ParallelWaveGAN
+> Tacotron2 + DDC: 360k Schritte trainiert, PWGAN Vocoder: 925k Schritte trainiert
+
+> Details: [Notebook von Olaf](https://colab.research.google.com/drive/15kJHTDTVxyIjxiZgqD1G_s5gUeVNLkfy?usp=sharing)
+<dl>
+
+<table>
+<thead>
+  <tr>
+    <th>Sample</th>
+    <th>Text</th>
+    <th>Audio</th>
+  </tr>
+</thead>
+<tbody>
+  <tr>
+    <td>01</td>
+    <td>Eure Schoko-Bonbons sind sagenhaft lecker</td>
+    <td><audio controls="" preload="none"><source src="samples/sample01-pwgan.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>02</td>
+    <td>Eure Tröte nervt</td>
+    <td><audio controls="" preload="none"><source src="samples/sample02-pwgan.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>03</td>
+    <td>Europa und Asien zusammengenommen wird auch als Eurasien bezeichnet</td>
+    <td><audio controls="" preload="none"><source src="samples/sample03-pwgan.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>04</td>
+    <td>Euer Plan hat ja toll geklappt.</td>
+    <td><audio controls="" preload="none"><source src="samples/sample04-pwgan.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>05</td>
+    <td>Anfang vom Froschkönig</td>
+    <td><audio controls="" preload="none"><source src="samples/sample05-pwgan.wav"></audio></td>
+  </tr>
+</tbody>
+</table>
+
+</dl>
+
+
+# WaveGrad
+> todo
+
+# HifiGAN
+> todo
+
+# VocGAN
+> **Diese Beispiele basieren auf "ground truth" und nicht auf dem Tacotron 2 Modell**
+> 200 Epochen / 284k Trainingsschritte
+
+<dl>
+
+<table>
+<thead>
+  <tr>
+    <th>Sample</th>
+    <th>Text</th>
+    <th>Audio</th>
+  </tr>
+</thead>
+<tbody>
+  <tr>
+    <td>01</td>
+    <td>Eure Schoko-Bonbons sind sagenhaft lecker</td>
+    <td><audio controls="" preload="none"><source src="samples/sample01-vocgan.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>02</td>
+    <td>Eure Tröte nervt</td>
+    <td><audio controls="" preload="none"><source src="samples/sample02-vocgan.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>03</td>
+    <td>Europa und Asien zusammengenommen wird auch als Eurasien bezeichnet</td>
+    <td><audio controls="" preload="none"><source src="samples/sample03-vocgan.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>04</td>
+    <td>Euer Plan hat ja toll geklappt.</td>
+    <td><audio controls="" preload="none"><source src="samples/sample04-vocgan.wav"></audio></td>
+  </tr>
+</tbody>
+</table>
+
+</dl>
+
+# GlowTTS / Waveglow
+> Details: [Github von Synesthesiam](https://github.com/rhasspy/de_larynx-thorsten)
+
+<dl>
+
+<table>
+<thead>
+  <tr>
+    <th>Sample</th>
+    <th>Text</th>
+    <th>Audio</th>
+  </tr>
+</thead>
+<tbody>
+  <tr>
+    <td>01</td>
+    <td>Eure Schoko-Bonbons sind sagenhaft lecker</td>
+    <td><audio controls="" preload="none"><source src="samples/sample01-waveglow.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>02</td>
+    <td>Eure Tröte nervt</td>
+    <td><audio controls="" preload="none"><source src="samples/sample02-waveglow.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>03</td>
+    <td>Europa und Asien zusammengenommen wird auch als Eurasien bezeichnet</td>
+    <td><audio controls="" preload="none"><source src="samples/sample03-waveglow.wav"></audio></td>
+  </tr>
+  <tr>
+    <td>04</td>
+    <td>Euer Plan hat ja toll geklappt.</td>
+    <td><audio controls="" preload="none"><source src="samples/sample04-waveglow.wav"></audio></td>
+  </tr>
+</tbody>
+</table>
+
+</dl>
--- a/docs/index.md
+++ b/docs/index.md
@ -0,0 +1,47 @@
+# Motivation
+
+<span style="font-size:1.5em;font-weight:bold">
+Eine kostenfreie, qualitativ hochwertige, deutsche TTS Stimme, die offline erzeugt werden kann sollte jedem Projekt ohne Lizenzrechtliche Probleme zur Verfügung stehen.
+</span>
+
+
+# Egal aus welchem Bereich du kommst:
+* Privates Bastelprojekt
+* OpenSource/Community Projekt
+* Bildung/Forschung/Wissenschaft
+* Kommerzielles Unternehmen
+* ...
+
+# Egal welcher Bereich dich interessiert:
+* Smarte Sprachassistenten
+* Navigationssysteme
+* Smart Homes
+* Sprechende Kühlschränke
+* Vorlesen von Bildschirmtexten (Barrierefreiheit)
+* Interaktive Robotik
+* ...
+
+# Wer wir sind
+Wir sind eine kleine motivierte Gruppe hobbymäßiger TTS-Enthusiasten die sich nach einem abgewandelten "Herr der Ringe Zitat" benannt hat - "**Fellowership of free german tts**"
+
+# Wo wir aktuell stehen
+Wir arbeiten weiterhin daran qualitativ noch bessere Modell zu trainieren, aber den aktuellen "stable" Stand kannst Du hier anhören:
+* [Es ist im Moment klarer Himmel bei 18 Grad.](https://drive.google.com/file/d/1cDIq4QG6i60WjUYNT6fr2cpEjFQIi8w5/view?usp=sharing)
+* [Ich verstehe das nicht, aber ich lerne jeden Tag neue Dinge.](https://drive.google.com/file/d/1kja_2RsFt6EmC33HTB4ozJyFlvh_DTFQ/view?usp=sharing)
+* [Ich bin jetzt bereit.](https://drive.google.com/file/d/1GkplGH7LMJcPDpgFJocXHCjRln_ccVFs/view?usp=sharing)
+* [Bitte warte einen Moment, bis ich fertig mit dem Booten bin.](https://drive.google.com/file/d/19Td-F14n_05F-squ3bNlt2BDE-NMFaq1/view?usp=sharing)
+* [Mein Name ist Mycroft und ich bin funky.](https://drive.google.com/file/d/1dbyOyE7Oy8YdAsYqQ4vz4VJjiWIyc8oV/view?usp=sharing)
+
+## Vergleich einiger Vocoder
+Wir experimentieren aktuell mit unterschiedlichen Konfigurationen um das beste Modell zu ermitteln. Ein Vergleich der bisherigen Ergebnisse findest Du auf dieser Seite. 
+> [Vergleich der unterschiedlichen Modell](./audio_compare)
+
+# Interessiert?
+[Weitere Details, Downloads und Danksagungen findet ihr hier.](https://github.com/thorstenMueller/deep-learning-german-tts "Dataset Details und Thorsten-Modell Download")
+
+
+---
+
+<span style="font-size:1.5em;font-weight:bold">
+Wir wünschen euch viel Spaß und Erfolg bei der Umsetzung eurer Projekte :-)
+</span>
--- a/docs/index_longDesc.md
+++ b/docs/index_longDesc.md
@ -0,0 +1,60 @@
+# tl;dr
+---
+
+<span style="font-family:Papyrus; font-size:3em;color:green"> Eine kostenfreie qualitativ hochwertige deutsche Stimme, die lokal erzeugt werden kann!</span> 
+
+---
+
+
+# Eine freie Deutsche Stimme
+Auch wenn die Überschrift stark nach einem politischen Statement klingt, geht es hier um ein völlig anderes Thema.
+
+Derzeit gewinnt die sprachbasierte Bedienung von Maschinen rasant an Bedeutung. Viele kennen diese Kommunikation bereits aus ihrem Alltag mit Smartphones oder so genannten smarten Assistenten wie Apple Siri, Google Home oder Amazon Alexa.
+
+Die Systeme der großen Hersteller bringen, neben sehr vielen Vorteilen, auch einige durchaus schwerwiegende Nachteile im Datenschutzbereich mit sich (Cloudzwang, mangelnde Hoheit über die eigenen Daten, Bedenken über "Mithörer", ...). Daher gibt es durchaus Menschen, die zwar die Vorteile solcher Systeme gerne nutzen möchten, aber aufgrund von den genannten Datenschutzbedenken von deren Nutzung absehen.
+
+# Alternativen zu (_online Spracherzeugung_) von Amazon, Google, Apple, ...
+Glücklicherweise bilden sich auch Alternativen (u.a. OpenSource) heraus um der Marktmarkt der "Big Player" etwas entgegenzutreten. Einige davon sind:
+
+* Mozilla Voice Projekte
+* MyCroft AI
+
+Diese (und andere) Communities arbeiten daran entsprechende Alternativen anzubieten. Jedoch steht hier oft die englische Sprache im Vordergrund. Dies ist gerade bei der Interaktion mit deutschsprachigen Anwendern natürlich nicht hilfreich.
+
+# Freies deutsches TTS - was ist das?
+Die meisten haben sicherlich schon einmal einen persönlichen smarten Assistenten (oder Smartphone) nach dem Wetter, Terminen, oder ähnlichem gefragt.
+Falls dem so ist und das Gerät eine gut verständliche deutsche Antwort geliefert hat wurden in diesem Fall "Cloud Resourcen" genutzt.
+
+Natürlich wissen Amazon, Google und Apple um die gute Qualität ihrer künstlichen Stimmen und sind u.a. daher nicht bereit, diese für eine private- und kostenfreie Offlinenutzung zur Verfügung zu stellen.
+Und genau da liegt eines der großen Probleme in (quelloffenen) Alternativen. Selbst wenn große Anteile kostenfrei und offline zu betreiben sind, spätestens bei der Sprachausgabe sind sie auf die "Big Player" angewiesen, sofern sie einen gewissen Qualitätsanspruch haben.
+
+# Wie und wem hilft dieses Projekt
+Das freie deutsche Dataset beinhaltet über 23 aufgezeichneter Stunden auf Basis freier Texte. Darauf basieren die mit machine learning trainierten TTS Modelle.
+Die Nutzung ist **ohne Lizenzrechtliche Bedenken** möglich und steht somit allen Interessierten offen. Zum Beispiel:
+
+* OpenSource Projekte/Communities
+* Bildung/Forschung/Wissenschaft
+* kommerzielle Einsatzzwecke
+
+Gerade den kleinen Communities soll hiermit die Möglichkeit geboten zu werden offline TTS Funktion in ihren Projekten mit auszuliefern.
+
+# Beispiele
+* [Es ist im Moment klarer Himmel bei 18 Grad.](https://drive.google.com/file/d/1cDIq4QG6i60WjUYNT6fr2cpEjFQIi8w5/view?usp=sharing)
+* [Ich verstehe das nicht, aber ich lerne jeden Tag neue Dinge.](https://drive.google.com/file/d/1kja_2RsFt6EmC33HTB4ozJyFlvh_DTFQ/view?usp=sharing)
+* [Ich bin jetzt bereit.](https://drive.google.com/file/d/1GkplGH7LMJcPDpgFJocXHCjRln_ccVFs/view?usp=sharing)
+* [Bitte warte einen Moment, bis ich fertig mit dem Booten bin.](https://drive.google.com/file/d/19Td-F14n_05F-squ3bNlt2BDE-NMFaq1/view?usp=sharing)
+* [Mein Name ist MyCroft und ich bin funky.](https://drive.google.com/file/d/1dbyOyE7Oy8YdAsYqQ4vz4VJjiWIyc8oV/view?usp=sharing)
+
+# Aktueller Stand
+Wir (eine Gruppe von netten TTS Enthusiasten) wissen, dass das aktuelle Modell qualitativ noch viel Luft nach oben hat. Aber wir sind weiterhin motiviert in Zukunft hoffentlich noch bessere Modelle zur Verfügung stellen zu können.
+
+# Zu guter Letzt
+Da ich keinen großen Einfluss habe, welche Aussagen mit meiner Stimme zukünftig gemacht werden, möchte ich doch einige Punkte sagen, die mir persönlich wichtig sind:
+
+Ich teile meine Stimme als Person, die daran glaubt, dass alle Menschen gleichberechtigt sind, unabhängig von Geschlecht, sexueller Orientierung, Religion, Hautfarbe oder den Geokoordinaten der Geburt. An eine Welt wo jeder Mensch zu jeder Zeit herzlich Willkommen ist und wo Bildung und Wissen für jeden  frei verfügbar ist.
+
+# Links
+* https://github.com/thorstenMueller/deep-learning-german-tts/
+* https://medium.com/@thorsten_Mueller/why-ive-chosen-to-donate-my-german-voice-for-mankind-177beeb91675
+* TODO Github Links der Mitstreiter
+* TODO Modell (TTS Server Package) veröffentlichen
--- a/docs/samples/sample01-gt.wav
+++ b/docs/samples/sample01-gt.wav
--- a/docs/samples/sample01-pwgan.wav
+++ b/docs/samples/sample01-pwgan.wav
--- a/docs/samples/sample01-vocgan.wav
+++ b/docs/samples/sample01-vocgan.wav
--- a/docs/samples/sample01-waveglow.wav
+++ b/docs/samples/sample01-waveglow.wav
--- a/docs/samples/sample02-gt.wav
+++ b/docs/samples/sample02-gt.wav
--- a/docs/samples/sample02-pwgan.wav
+++ b/docs/samples/sample02-pwgan.wav
--- a/docs/samples/sample02-vocgan.wav
+++ b/docs/samples/sample02-vocgan.wav
--- a/docs/samples/sample02-waveglow.wav
+++ b/docs/samples/sample02-waveglow.wav
--- a/docs/samples/sample03-gt.wav
+++ b/docs/samples/sample03-gt.wav
--- a/docs/samples/sample03-pwgan.wav
+++ b/docs/samples/sample03-pwgan.wav
--- a/docs/samples/sample03-vocgan.wav
+++ b/docs/samples/sample03-vocgan.wav
--- a/docs/samples/sample03-waveglow.wav
+++ b/docs/samples/sample03-waveglow.wav
--- a/docs/samples/sample04-gt.wav
+++ b/docs/samples/sample04-gt.wav
--- a/docs/samples/sample04-pwgan.wav
+++ b/docs/samples/sample04-pwgan.wav
--- a/docs/samples/sample04-vocgan.wav
+++ b/docs/samples/sample04-vocgan.wav
--- a/docs/samples/sample04-waveglow.wav
+++ b/docs/samples/sample04-waveglow.wav
--- a/docs/samples/sample05-pwgan.wav
+++ b/docs/samples/sample05-pwgan.wav
--- a/docs/samples/sample05-waveglow.wav
+++ b/docs/samples/sample05-waveglow.wav
--- a/img/thorsten-de---datasetAnalysis1.png
+++ b/img/thorsten-de---datasetAnalysis1.png
--- a/img/thorsten-de---datasetAnalysis2.png
+++ b/img/thorsten-de---datasetAnalysis2.png
--- a/img/thorsten-de---datasetAnalysis3.png
+++ b/img/thorsten-de---datasetAnalysis3.png
--- a/img/thorsten-de---datasetAnalysis4.png
+++ b/img/thorsten-de---datasetAnalysis4.png
--- a/img/thorsten-de---datasetAnalysis5.png
+++ b/img/thorsten-de---datasetAnalysis5.png
--- a/img/thorsten-de---datasetAnalysis6.png
+++ b/img/thorsten-de---datasetAnalysis6.png
--- a/samples/original_recording/recorded_sample_01.wav
+++ b/samples/original_recording/recorded_sample_01.wav
--- a/samples/original_recording/recorded_sample_02.wav
+++ b/samples/original_recording/recorded_sample_02.wav
--- a/samples/original_recording/recorded_sample_03.wav
+++ b/samples/original_recording/recorded_sample_03.wav
--- a/samples/original_recording/recorded_sample_04.wav
+++ b/samples/original_recording/recorded_sample_04.wav
--- a/samples/original_recording/recorded_sample_05.wav
+++ b/samples/original_recording/recorded_sample_05.wav
--- a/samples/original_recording/recorded_sample_06.wav
+++ b/samples/original_recording/recorded_sample_06.wav
--- a/samples/original_recording/recorded_sample_07.wav
+++ b/samples/original_recording/recorded_sample_07.wav
Author	SHA1	Message	Date
Thorsten Mueller	d4ec050d5d	fixed typo	2020-11-16 09:10:27 +01:00
Thorsten Mueller	908c6da144	add link to sample page on index page.	2020-11-16 09:06:11 +01:00
Thorsten Mueller	6c34cf0915	changed index page to short version	2020-11-16 09:02:10 +01:00
Thorsten Mueller	5e7318fbb1	add notebook link in pwgan section	2020-11-16 09:00:54 +01:00
Thorsten Mueller	14c924be46	update main page	2020-11-16 08:18:38 +01:00
Thorsten Mueller	988051f2ba	add samples to doc subfolder.	2020-11-16 08:16:02 +01:00
Thorsten Mueller	40e9e41b8e	link test	2020-11-15 22:51:07 +01:00
Thorsten Mueller	548d773ac3	link test	2020-11-15 22:48:03 +01:00
Thorsten Mueller	c3d0ad4036	Added GlowTTS to overview	2020-11-15 22:35:04 +01:00
Thorsten Mueller	aa1f517a17	Moved sample files and added GlowTTS	2020-11-15 22:30:20 +01:00
Thorsten Mueller	24c7b84feb	Added vocgan samples	2020-11-13 16:01:38 +01:00
Thorsten Mueller	3cb2d978de	first draft	2020-11-12 23:23:18 +01:00
Thorsten Mueller	618e3e01fc	fixed missing audio tag	2020-11-12 23:17:26 +01:00
Thorsten Mueller	1fe8e9c0e2	test	2020-11-12 23:05:10 +01:00
Thorsten Mueller	afb14c5c51	test	2020-11-12 22:59:00 +01:00
Thorsten Mueller	fc23689c54	test fix style issue	2020-11-12 22:49:46 +01:00
Thorsten Mueller	995b477d35	Fix audio location and formatting issue	2020-11-12 22:40:01 +01:00
Thorsten Mueller	84f791f106	First samples	2020-11-12 22:21:35 +01:00
Thorsten Mueller	a77b919eaf	test	2020-11-12 17:39:22 +01:00
Thorsten Mueller	ff368906c7	next style check	2020-11-12 17:36:35 +01:00
Thorsten Mueller	b6f7b5f9cd	Embedded audio in md file test	2020-11-12 17:31:43 +01:00
Thorsten Mueller	eda8f3580a	Audio test	2020-11-12 17:25:13 +01:00
Thorsten Mueller	3f49044c8b	test with embedded audio	2020-11-12 17:15:38 +01:00
Thorsten Mueller	5d07086291	Test for vocoder comparison github page	2020-11-12 12:32:47 +01:00
Thorsten Müller	6808036e3e	Update kurz.md	2020-09-29 19:47:59 +02:00
Thorsten Müller	aba036ad8d	Update kurz.md	2020-09-29 09:25:27 +02:00
Thorsten Müller	7a1e3908f4	Update kurz.md	2020-09-29 08:58:30 +02:00
Thorsten Müller	b2e36a830a	Update kurz.md	2020-09-29 08:54:56 +02:00
Thorsten Müller	19007448cf	Update kurz.md	2020-09-29 08:52:21 +02:00
Thorsten Müller	da11f3588b	Add files via upload	2020-09-29 08:40:59 +02:00
Thorsten Müller	0de86c2806	Update index.md	2020-09-29 06:47:57 +02:00
Thorsten Müller	cb8dd251e4	Update index.md	2020-09-29 06:44:17 +02:00
Thorsten Müller	b55c076d6f	Update index.md	2020-09-29 06:42:59 +02:00
Thorsten Müller	9ee6a2578a	Update index.md	2020-09-28 22:07:56 +02:00
Thorsten Müller	5f859c17cd	Update index.md	2020-09-28 21:54:25 +02:00
Thorsten Müller	0b110b1700	Update index.md	2020-09-28 21:50:28 +02:00
Thorsten Mueller	9fdd55d4b4	Remove readme file	2020-09-28 19:09:01 +02:00
Thorsten Mueller	9394108737	Merge branch 'githubPages' of https://github.com/thorstenMueller/deep-learning-german-tts into githubPages	2020-09-28 18:47:19 +02:00
Thorsten Mueller	a5cdf08504	Structure adjustment	2020-09-28 18:46:30 +02:00
Thorsten Müller	46d01d9652	Set theme jekyll-theme-cayman	2020-09-28 18:15:15 +02:00
Thorsten Mueller	5cc6f0e913	Initial test commit for github pages	2020-09-28 17:52:40 +02:00