4

I don't want my laptop or mobile to just dictate/type down what I say, but instead write what the caller on the other end of the line says. Why doesn't Google or Nuance enable that functionality? Is it even possible? What would be needed for that to work?

ospeaks
  • 41

2 Answers2

1

When you are speaking your voice is being recorded through your computer's microphone and audio hardware. This audio signal is available to any applications on your computer. When you are speaking to someone over Skype the voice signal is encoded in a bunch of packets. You would need to reverse engineer Skype's protocol in order to decode the voice signal.

If you're talking on a cell phone in speakerphone mode the quality might not be good enough for voice recognition. Also the voice recognition software would need to be trained on both your voice and the other person's.

I'm not sure what relevance Google has here.

0

Dragon NaturallySpeaking (unlike Google) needs to be trained before being able to transcribe speech into words.

Google does speech to text transcriptions for the caller on the other end of the line when someone leaves a message on your autoresponder on Google voice.

Note that the voice quality is often altered quite heavily during a call, which makes voice recognition even harder.

Franck Dernoncourt
  • 24,246
  • 64
  • 231
  • 400