det.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mastodon server of Unterhaltungsfernsehen Ehrenfeld for decentralized discourse.

#text2speech

Jordi Cabot<p><a href="https://fediscience.org/tags/Multilingual" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Multilingual</span></a> <a href="https://fediscience.org/tags/Speech2Speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Speech2Speech</span></a> <a href="https://fediscience.org/tags/Agents" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Agents</span></a> are here!</p><p>Supporting the latest <a href="https://fediscience.org/tags/OpenAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenAI</span></a> Speech Models and more. Also works for <a href="https://fediscience.org/tags/Luxembourgish" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Luxembourgish</span></a>! </p><p>⚙️<a href="https://besser-agentic-framework.readthedocs.io/latest/release_notes/v4.0.0.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">besser-agentic-framework.readt</span><span class="invisible">hedocs.io/latest/release_notes/v4.0.0.html</span></a></p><p><a href="https://fediscience.org/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://fediscience.org/tags/python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>python</span></a> <a href="https://fediscience.org/tags/text2speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>text2speech</span></a> <a href="https://fediscience.org/tags/speech2text" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speech2text</span></a> <a href="https://fediscience.org/tags/languagedetection" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>languagedetection</span></a> <a href="https://fediscience.org/tags/nlp" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nlp</span></a> 
<a href="https://fediscience.org/tags/lowcode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lowcode</span></a> <a href="https://fediscience.org/tags/llm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>llm</span></a> <a href="https://fediscience.org/tags/rag" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>rag</span></a></p>
Harald Schirmer<p><a href="https://colearn.social/tags/BarCamp" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BarCamp</span></a> rules briefly explained, in verse ... small experiments with various AI tools and <a href="https://colearn.social/tags/Text2Speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Text2Speech</span></a> Murf.AI in preparation for <a href="https://colearn.social/tags/LOSCON25" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LOSCON25</span></a></p><p>Here is the result to listen to: <a href="https://murf.ai/share/mcho8m9y" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">murf.ai/share/mcho8m9y</span><span class="invisible"></span></a></p><p>... playing around a bit with <a href="https://colearn.social/tags/KI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KI</span></a> </p><p><span class="h-card" translate="no"><a href="https://colearn.social/@simondueckert" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>simondueckert</span></a></span> </p><p>PS: I tried the verse idea in 5 different AIs. It was helpful, but none of them actually offered "creative and immediately usable" suggestions</p>
Verfassungklage@troet.cafe<p><a href="https://troet.cafe/tags/Speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Speech</span></a> <a href="https://troet.cafe/tags/Note" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Note</span></a> – <a href="https://troet.cafe/tags/Notizen" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Notizen</span></a> and more - </p><p>While researching an article about <a href="https://troet.cafe/tags/Text2Speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Text2Speech</span></a> and <a href="https://troet.cafe/tags/Speech2Text" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Speech2Text</span></a> on <a href="https://troet.cafe/tags/Linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Linux</span></a>, I came across the small app Speech Note, not to be confused with the proprietary SpeechNotes. In that respect, the name is not really a clever choice. What is clever, however, is the concept of this still-young application. </p><p>Speech Note is a versatile note-taking application that stands out for its features and its focus on privacy. </p><p><a href="https://linuxnews.de/speech-note-notizen-und-mehr/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">linuxnews.de/speech-note-notiz</span><span class="invisible">en-und-mehr/</span></a></p>
Florian 'floe' Echtler<p>OK, this is probably a rather long shot, but does anyone know of a voice synthesis model that is open-weights or ideally even open-source, and capable of producing intonation (specifically, something like rap lyrics)? <a href="https://hci.social/tags/text2speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>text2speech</span></a> <a href="https://hci.social/tags/voicesynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>voicesynthesis</span></a> <a href="https://hci.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a></p>
KOM-IN-Netzwerk<p>So-called <a href="https://bildung.social/tags/KI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KI</span></a> (AI) is increasingly being used for the <a href="https://bildung.social/tags/Vertonung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Vertonung</span></a> (audio narration) of texts.</p><p>However, the <a href="https://bildung.social/tags/H%C3%B6rerInnen" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HörerInnen</span></a> (listeners) of our <a href="https://bildung.social/tags/Blindenh%C3%B6rzeitschriften" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Blindenhörzeitschriften</span></a> (audio magazines for the blind) value natural human speakers.</p><p><span class="h-card" translate="no"><a href="https://bildung.social/@Julia_Sophie" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>Julia_Sophie</span></a></span> kindly helps us with this as a <a href="https://bildung.social/tags/Stimm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Stimm</span></a>- und <a href="https://bildung.social/tags/SprechTrainerin" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SprechTrainerin</span></a> (voice and speech trainer)</p><p><span class="h-card" translate="no"><a href="https://social.tchncs.de/@klausgesprochen" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>klausgesprochen</span></a></span> wrote about this in his blog:<br><a href="https://klausgesprochen.de/blog/unterricht-hilft-immer/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">klausgesprochen.de/blog/unterr</span><span class="invisible">icht-hilft-immer/</span></a></p><p><a href="https://bildung.social/tags/Text2Speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Text2Speech</span></a> <a href="https://bildung.social/tags/blind" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>blind</span></a> <a href="https://bildung.social/tags/H%C3%B6rzeitschrift" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Hörzeitschrift</span></a></p>
FediVerseExplorer<p><span>I am using the </span><a href="https://procial.tchncs.de/tags/Sprache" rel="nofollow noopener" target="_blank">#Sprache</a><span> zu </span><a href="https://procial.tchncs.de/tags/Text" rel="nofollow noopener" target="_blank">#Text</a><span> </span><a href="https://procial.tchncs.de/tags/App" rel="nofollow noopener" target="_blank">#App</a><span> </span><a href="https://procial.tchncs.de/tags/Futo" rel="nofollow noopener" target="_blank">#Futo</a><span> (speech-to-text app) more and more often.<br>It works well. It is said not to send any </span><a href="https://procial.tchncs.de/tags/Daten" rel="nofollow noopener" target="_blank">#Daten</a><span> (data) to external servers.<br></span><a href="https://voiceinput.futo.org/" rel="nofollow noopener" target="_blank">https://voiceinput.futo.org/</a><span><br><br></span><a href="https://procial.tchncs.de/tags/Text2Speech" rel="nofollow noopener" target="_blank">#Text2Speech</a><span> </span><a href="https://procial.tchncs.de/tags/selbstbestimmtDigital" rel="nofollow noopener" target="_blank">#selbstbestimmtDigital</a></p>
Karsten Eger<p>Why would anyone do this? <a href="https://social.tchncs.de/tags/Text2Speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Text2Speech</span></a>, and on top of that a robot lip-syncing to that very T2S as an overlay in the promo video? WTF</p><p><a href="https://www.youtube.com/watch?v=k7OxFp4_WW0" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=k7OxFp4_WW</span><span class="invisible">0</span></a></p><p><a href="https://social.tchncs.de/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a></p>
Tobias Zeumer<p>A while ago I asked ChatGPT: "Write a song that explains the concept and differences of instances, holdings and item records in the FOLIO library system." See lyrics at <a href="https://openbiblio.social/@vform/110436159930086222" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">openbiblio.social/@vform/11043</span><span class="invisible">6159930086222</span></a></p><p>Now I have it read by <a href="https://elevenlabs.io" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">elevenlabs.io</span><span class="invisible"></span></a> (without repeating the chorus, only saying "chorus" where it would repeat). Pretty nice even though it is not exactly intended for songs or poems</p><p><a href="https://openbiblio.social/tags/Folio" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Folio</span></a> <a href="https://openbiblio.social/tags/ChatGPT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ChatGPT</span></a> <a href="https://openbiblio.social/tags/fun" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>fun</span></a> <a href="https://openbiblio.social/tags/music" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>music</span></a> <a href="https://openbiblio.social/tags/ElevenLabs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ElevenLabs</span></a> <a href="https://openbiblio.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://openbiblio.social/tags/text2speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>text2speech</span></a></p>
aaron ~# :blinkingcursor:<p>I just tried <a href="https://voiceinput.futo.org/" rel="nofollow noopener" target="_blank">Futo</a>. Definitely worth checking out. After downloading the models, the app works completely offline, and with high precision too! Such a <a href="https://infosec.exchange/tags/privacyrespecting" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>privacyrespecting</span></a> <a href="https://infosec.exchange/tags/qualityoflife" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>qualityoflife</span></a> feature.<br><a href="https://infosec.exchange/tags/futo" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>futo</span></a> <a href="https://infosec.exchange/tags/text2speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>text2speech</span></a> <a href="https://infosec.exchange/tags/android" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>android</span></a> <a href="https://infosec.exchange/tags/foss" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>foss</span></a> <a href="https://infosec.exchange/tags/privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>privacy</span></a></p>
Sindre Wimberger<p>Facebook's Massively Multilingual Speech (MMS) <a href="https://mastodon.social/tags/KI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KI</span></a> model for 1100 languages 🚀</p><p>🤖 Can perform <a href="https://mastodon.social/tags/speech2text" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speech2text</span></a> and <a href="https://mastodon.social/tags/text2speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>text2speech</span></a> in 1100 languages.<br>🤖 Can identify 4000 spoken languages.<br>🤖 Code and models available under the CC-BY-NC 4.0 license.<br>🤖 Half the word error rate of OpenAI Whisper.<br>🤖 Trained on the New Testament of the Bible</p><p>Blog: <a href="https://ai.facebook.com/blog/multilingual-model-speech-recognition/" rel="nofollow noopener" target="_blank"><span class="invisible">https://</span><span class="ellipsis">ai.facebook.com/blog/multiling</span><span class="invisible">ual-model-speech-recognition/</span></a></p>
Kilian Evang<p>Do you know the space company Špačeks <a href="https://mastodon.social/tags/text2speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>text2speech</span></a></p>
abcxyz<p>And for the sake of completeness, combined with a third <a href="https://mastodontech.de/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a>, we also get the story read out nicely!</p><p><a href="https://mastodontech.de/tags/AIart" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AIart</span></a> <a href="https://mastodontech.de/tags/speechelo" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechelo</span></a> <a href="https://mastodontech.de/tags/text2speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>text2speech</span></a></p>