Questions tagged [speech-recognition]

Converting human voice into text

Speech recognition, also known as speech-to-text, is the conversion of spoken language into text.

20 questions
119
votes
13 answers

Is there any decent speech recognition software for Linux?

The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. Any license and price is fine. It should not be restricted to voice commands, as I want to be able to dictate…
Franck Dernoncourt
  • 4,749
  • 15
  • 48
  • 79
7
votes
3 answers

Comfortable offline speech recognition software for Linux?

I'm looking for an offline speech recognition software for Linux which can handle also German language and which is easy to use and configure. I already tried CMU Sphinx and a few more others, but all of them had one in common: they have been way…
6
votes
1 answer

Convert audio to text

I heard about existence of some speech recognition systems, and it seems I need one of those. Basically, I have an audio file with speech (only one person is speaking most of the time), and I want to get a transcript of the speech. Is something like…
Rogach
  • 6,150
  • 11
  • 38
  • 41
4
votes
1 answer

Detect simple voice commands

I would like to detect simple words or phrases from my microphone and perform actions based on those phrases. I've looked into Python libraries and Google text-to-speech but these seem like extreme overkill1. I don't need something that is capable…
Display name
  • 1,267
  • 2
  • 14
  • 22
3
votes
2 answers

Redirect Output of Pocketsphinx_continuous to a file

I have an ugly command: pocketsphinx_continuous -samprate 48000 -nfft 2048 -hmm /usr/local/share/pocketsphinx/model/en-us/en-us -lm 9745.lm -dict 9745.dic -inmic yes Breakdown: It listens for any noise and when it detects some, it listens to it,…
Patrick Cook
  • 251
  • 3
  • 11
3
votes
3 answers

Running a C++ compiled program in the background and sending input whenever needed

I have a compiled program written in C++ for a UNIX environment which has this kind of structure: int main(){ ... LoadEngine() ... while(1){ std::cin >> buffer; ... ExecuteFunction(buffer); } } Loading the engine takes quite a while, so…
3
votes
1 answer

Logging in using voice commands with GDM

Is there a program capable of doing such thing? Something that would wait for me to either supply a username and a password, or select my username and a voice command that does the same thing.
Mahmoud Hossam
  • 475
  • 1
  • 5
  • 13
2
votes
3 answers

What are some current transcription or dictation software packages for Linux?

The Mozilla deepspeech project is interesting, but perhaps not sufficiently sophisticated. My results, at least, were underwhelming. Online transcription or dictation services are fine, but an offline software package would be preferred. Is this…
2
votes
1 answer

How can I run speech to text and save the result in a variable?

I would like to speak into my computer's microphone, have what I say converted to text and then have that available as a shell variable. Is this possible? I thought I might do it using Google's speech input feature:
2
votes
0 answers

arecord until sound level drops low enough

I am trying to implement constant voice recognition on my Pi at the moment, I am achieving this by having two threads running, one constantly recording (with arecord in a bash script) for X amount of seconds, saving that information to a WAV and…
Aphire
  • 131
  • 1
  • 5
1
vote
1 answer

Installing Simon Listens on Linux Mint

Recently I've heard about the Simon Listens package which enables you to create a speech recognition engine on Linux as well as windows. I have Linux Mind 14 - cinnamon installed on my laptop. I wanted to install Simon Listens on this system, I…
Jakub
  • 711
  • 1
  • 8
  • 7
1
vote
0 answers

speech recording and translate

I have a problem with converting from "wav" to "flac". Command: arecord -D plughw:0,0 -f cd -t wav -d 0 -q -r 16000 | flac - -s -f --best --sample-rate 16000 -o daveconroy.flac Always gives: ERROR: raw format options (--endian, --sign, --channels,…
1
vote
1 answer

How to manually begin/end speech recognition with X11?

Having found speech recognition software that works well (see this question), I'm still left with needing integration, in my case, an easy way to activate it. The outcome I'm looking for is: Press a shortcut to begin dictation. Press a shortcut to…
ideasman42
  • 1,091
  • 1
  • 10
  • 23
1
vote
0 answers

Feedback on voice-recognition software for Linux

I'd like to get some feedback on any of the voice recognition software that is available for Linux. Free or paid.. That can type on any program and have enter key and right mouse click. I'm currently using NaturallySpeaking.. Would like to switch…
Daniel.B
  • 11
  • 1
1
vote
0 answers

Error while trying to run a python program

Error occurs when Trying to run a python with speech recognition and py audio.. ALSA lib pcm_dsnoop.c:641:(snd_pcm_dsnoop_open) unable to open slave ALSA lib pcm_dmix.c:1089:(snd_pcm_dmix_open) unable to open slave ALSA lib…
abduzaabi
  • 33
  • 5
1
2