Search Unity

  1. Welcome to the Unity Forums! Please take the time to read our Code of Conduct to familiarize yourself with the forum rules and how to post constructively.

Audio Unity Automatic Speech Recognition

Discussion in 'Audio & Video' started by voxelltech, Jul 31, 2021.

  1. voxelltech

    voxelltech

    Joined:
    Oct 8, 2019
    Posts:
    44
    DevDunk likes this.
  2. kbabilinski

    kbabilinski

    Joined:
    Jul 12, 2012
    Posts:
    19
    Thank you! I got the AutomaticSpeechRecognition to work and will be pushing up the fixed example later today !
     
    DevDunk likes this.
  3. voxelltech

    voxelltech

    Joined:
    Oct 8, 2019
    Posts:
    44
    oh wow thank you so much! On the side note, for the previous version, did it crash Unity on your side? Cuz for me, everytime I go out from play mode, unity crashes for some mysterious reasons.. XD
     
  4. kbabilinski

    kbabilinski

    Joined:
    Jul 12, 2012
    Posts:
    19
    I decided to start from scratch and created a lightweight implementation. It processes the audio on a separate thread. Here is the link if you want to check it out - https://github.com/Babilinski/deep-speech-unity

    The reason why Unity crashes is because the thread is still writing to deepspeech when you exit Unity. Make sure that you check if you are writing to Deepspeech before trying to dispose it.
     
    Elecman likes this.
  5. voxelltech

    voxelltech

    Joined:
    Oct 8, 2019
    Posts:
    44
    This is absolutely gold! Thank you so much!
     
  6. Elecman

    Elecman

    Joined:
    May 5, 2011
    Posts:
    1,369
    Very interesting!

    It would be even better if you can make it work wih Coqui STT (a fork of DeepSpeech) because DeepSpeech development has been slowed down. The core developers of DeepSpeech moved over to Coqui STT. More information here:

    https://discourse.mozilla.org/t/a-new-in-speech-tech-town/77026
    https://discourse.mozilla.org/t/future-of-deepspeech-stt-after-recent-changes-at-mozilla/66191

    Edit:

    I managed to port Coqui STT to Unity using the DeepSpeech project from @kbabilinski as a reference. I made A tutorial with source files. It is available here:

    https://bitbarrelmedia.wordpress.com/2021/09/19/coqui-stt-in-unity/

    I slightly modified the ContinuousVoiceRecorder and SpeechTextToText scripts to make it work with CoquiSTT along with a few other minor imporvements like not requiring to enter a hard coded path to the model files.
     
    Last edited: Sep 19, 2021
    voxelltech likes this.
  7. voxelltech

    voxelltech

    Joined:
    Oct 8, 2019
    Posts:
    44
    This is really awesome! Will check it out! Thanks!
     
  8. AmmarSalim

    AmmarSalim

    Joined:
    Sep 2, 2016
    Posts:
    22
    good but
    why This error appears to me :
    FileNotFoundException: Cannot find the model file: C:\Users\kbabi\Documents\GitHub\Deep-Speech\Assets\StreamingAssets\deepspeech-0.9.3-models.pbmm
    DeepSpeechClient.DeepSpeech.CreateModel (System.String aModelPath)
     
  9. voxelltech

    voxelltech

    Joined:
    Oct 8, 2019
    Posts:
    44
    You need to download the deepspeech model (if you are using my repo, here is the model: https://drive.google.com/file/d/1RA9MDwconsoPexjngivo2jthiAG6ZQNa/view?usp=sharing)

    I have not updated my repo though, will try to implement kbabilinski's solution ASAP.

    Edit: the model should work with kbabilinski's repo too.