Speech-to-text/speech recognition programs (STT)

19 Replies, 4425 Views

Does anybody know if opensource standalone speech-to-text programs ever exist? The idea is to use speech recognition to create text files from interviews, videos (e.g. youtube) , films, hypnosis files, etc. I tried to use Google Voice input on my smartphone, but it's atrocious...
(This post was last modified: 08 Sep 2025, 18:36 by Like Ra.)
A program called Dragon Software allows one to talk into a mic and the computer converts it to text. The text is then stored in a file for storage or or loaded in a file to be sent by email.
Dragon speech to text.
https://en.wikipedia.org/wiki/Dragon_NaturallySpeaking

Thanks! But not free, not for Linux, and adapted for your voice only (at least, this is what I understand). That led me to this article:

https://en.wikipedia.org/wiki/List_of_sp...n_software

A lot to learn again... 😁
(This post was last modified: 03 Oct 2017, 21:06 by Like Ra.)
Well, it is an old program, dated back to the late 90s.
But somewhere in the back of my mind, there is another speech to text program that was free. For the life of me, I just can't remember.
The best speech recognition program so far is youtube!
I'm testing this with Kdenlive: https://alphacephei.com/vosk/models
I had a general text to speech program a few years ago but I forgot the name of it. Will update once I do

Tensorflow is one way to do it but programming is a pain.

Edit: I am an idiot. I have dragon pro. It does work nice though.
(This post was last modified: 30 May 2021, 05:42 by Lancer.)
(30 May 2021, 05:36 )Lancer Wrote: dragon pro.
From your work? Is it any better than youtube?
Here's an interesting option for speech to text recognition, with the windows dictation tool, and vb audio cable to select the input:

Source: https://www.youtube.com/watch?v=1DsrniDGOJQ

Possibly Related Threads…
Thread Author Replies Views Last Post
  Text to speech programs (TTS) Like Ra 56 13,545 24 Oct 2025, 00:59
Last Post: dustymoon1
  Elevenlabs now lets you create A.I. voices from text prompts AI_addict_hypnofan 0 181 26 Jun 2025, 10:47
Last Post: AI_addict_hypnofan