[RELEASED] Google Cloud Streaming Speech Recognition [VR\AR\Mobile\Desktop]

itra · Sep 23, 2020

Straafe said: ↑

@itra
Streaming recognition, here's a short video clip of a test app I wrote last year using the system. It detected the spoken language and provided a live transcription of spoken words using Google's streaming speech recognition.

That project is not on my machine right now and lives in the cloud, but if it would help I can pull it down and share some scripts with you. It is a Unity client connected to a Java server over a TCP socket. The Unity client processes and streams the live audio to the Java server, which in turn uses Google's streaming speech recognition to get the transcript and related information and streams it back to the client live. If I remember right, the biggest issues I had were processing and transferring the audio bytes properly between the client and server and maintaining proper endianness.

Yeah, any code you can share would be great. What you've got working in the video is exactly what I'm trying to achieve! I've been playing around with the idea for the last few hours, and so far I've got Unity communicating with my C# .NET server via TcpListener. My plan is to pass the byte array from Unity to the server and then feed it straight into Google's StreamingRecognizeRequest. I'm not sure if this is the right approach. Now I've hit a brick wall at the same place it sounds like you did: I am having trouble capturing the audio on the client and then, once it is on the server, getting Google's API to recognise it. This is the error I keep getting back from Google's API:

Status(StatusCode=OutOfRange, Detail="Audio Timeout Error: Long duration elapsed without audio. Audio should be sent close to real time.")

Thanks for the help!
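
One reading of the error message above is that the stream expects audio to keep arriving at roughly the rate it was recorded. A minimal sketch of that pacing, written in Python for brevity: the 16 kHz, 16-bit mono format and the helper names `chunk_size_bytes` and `paced_chunks` are assumptions for illustration, not part of Google's API or of either poster's code.

```python
import time
from typing import Callable, Iterator

SAMPLE_RATE_HZ = 16_000  # assumption: 16 kHz, single channel
BYTES_PER_SAMPLE = 2     # assumption: 16-bit linear PCM


def chunk_size_bytes(chunk_ms: int) -> int:
    """Return the number of bytes that hold chunk_ms milliseconds of audio."""
    return SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * chunk_ms // 1000


def paced_chunks(
    pcm: bytes,
    chunk_ms: int = 100,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[bytes]:
    """Yield fixed-size chunks, pausing one chunk duration after each.

    Each yielded chunk would be written to the socket or wrapped in a
    streaming request by the caller (hypothetical; not shown here).
    """
    size = chunk_size_bytes(chunk_ms)
    for start in range(0, len(pcm), size):
        yield pcm[start:start + size]
        sleep(chunk_ms / 1000)  # keep the send rate close to real time
```

With a live microphone the capture callback already provides this pacing; the sleep only matters when the audio has been buffered first.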

Straafe · Sep 24, 2020

@itra
So in the client, it looks like I used an asset called NatMic to get the mic's sample buffer as a byte array, and then I sent those bytes directly to the server over the TCP socket connection. I was recording the audio at a sample rate of 16,000 Hz in a single channel (I can't remember if that mattered for Google's specs, but I also matched that format on the server side when setting up the speech recognition configuration).
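
A minimal sketch of the byte handling described above, in Python for brevity. It assumes the microphone delivers float samples in the range -1.0 to 1.0 and that the recognizer is configured for 16-bit signed little-endian PCM; the function names are hypothetical and this is not the NatMic or server code from the thread.

```python
import struct
from typing import List, Sequence


def floats_to_pcm16le(samples: Sequence[float]) -> bytes:
    """Convert float samples in [-1.0, 1.0] to 16-bit little-endian PCM."""
    # Clip first so out-of-range input cannot overflow the 16-bit range.
    clipped = [max(-1.0, min(1.0, s)) for s in samples]
    ints = [int(round(s * 32767)) for s in clipped]
    # "<" forces little-endian regardless of the host CPU; "h" is int16.
    return struct.pack(f"<{len(ints)}h", *ints)


def pcm16le_to_ints(data: bytes) -> List[int]:
    """Decode 16-bit little-endian PCM back to integers (server-side check)."""
    count = len(data) // 2
    return list(struct.unpack(f"<{count}h", data[:count * 2]))
```

Stating the byte order explicitly on both ends is the point of the sketch: if the client writes big-endian samples and the server reads little-endian ones, the audio still arrives but decodes as noise.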

I PM'd you the main server script. It's Java, but it might give you some insight into dealing with the audio bytes on your server and passing them over to Google. It was based on this.

itra · Sep 24, 2020

@Straafe I've got it working! Thanks for the help and for sharing this workaround in the first place. I never would have thought of it otherwise.

itra · Sep 24, 2020

For anyone else interested, I found this for use with Windows:

https://github.com/oshoham/UnityGoogleStreamingSpeechToText

FrostweepGames · Jul 19, 2021

Hello,

Our version of Google Cloud Streaming Speech Recognition is now live at http://u3d.as/18RU

Best Regards

lmachado · Sep 16, 2021

Hi,
Is it possible to extend the time after which it detects the end of the recording? I need to record a long speech of 3 paragraphs.

FrostweepGames · Sep 21, 2021

lmachado said: ↑

Hi,
Is it possible to extend the time after which it detects the end of the recording? I need to record a long speech of 3 paragraphs.

Hello,

I would say it mostly depends on the Google service.

It's possible to change the size of the chunks that are transferred to the Google service, which will help with it a bit.

Best Regards

ruxrux · Sep 27, 2021

hi!

Just bought this add-on and I'm getting some errors off the bat:

1. I had to import the add-on without newtonsoft-json, as it creates conflicts.
2. Running the example doesn't work. It seems to recognize the microphone but doesn't access it. Error message:
"Start Record Failed. Please check microphone and try again."

would love any help to run this plugin on Android / Oculus!

FrostweepGames · Sep 30, 2021

ruxrux said: ↑

hi!

Just bought this add-on and I'm getting some errors off the bat:

1. I had to import the add-on without newtonsoft-json, as it creates conflicts.
2. Running the example doesn't work. It seems to recognize the microphone but doesn't access it. Error message:
"Start Record Failed. Please check microphone and try again."

would love any help to run this plugin on Android / Oculus!

Hello,

Already replied in Discord.

To fix that issue, you should delete all Newtonsoft.Json libraries that are not part of the asset, because they will conflict. However, you cannot use another version of the Json library, because it is a dependency of other libraries, which will throw errors without it.

Best Regards

OACB · Nov 17, 2021

Hello,
Can this store the voice recording and upload it to Google Drive?

FrostweepGames · Nov 29, 2021

Opida said: ↑

Hello,
Can this store the voice recording and upload it to Google Drive?

Hello,

Our asset doesn't have a way to upload files anywhere. But you could export the AudioClip as a WAV file and upload it anywhere by implementing your own uploading solution.
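
A minimal sketch of the WAV export step, in Python for brevity and using only the standard library. It assumes float samples in the range -1.0 to 1.0, 16-bit mono output at 16 kHz, and a hypothetical function name; it is not the asset's exporter, and in Unity the samples would first have to be read out of the AudioClip.

```python
import io
import struct
import wave
from typing import Sequence


def samples_to_wav_bytes(
    samples: Sequence[float],
    sample_rate: int = 16_000,  # assumption: 16 kHz
    channels: int = 1,          # assumption: mono
) -> bytes:
    """Encode float samples as a 16-bit PCM WAV file held in memory."""
    ints = [int(max(-1.0, min(1.0, s)) * 32767) for s in samples]
    pcm = struct.pack(f"<{len(ints)}h", *ints)  # WAV PCM is little-endian

    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buffer.getvalue()
```

The returned bytes can be written to disk or handed to whatever upload code the project uses.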

Best

DeLeT3D · Sep 6, 2022

I keep getting an error saying "Start record Failed. Please check microphone device and try again." I've tried multiple microphones and even tried refreshing the Microphones and restarting the machine. I'm able to use the microphones with other applications without any problems though. I've imported the Package in Unity 2020.3.27. Any assistance would be appreciated.

JPhilipp · Oct 18, 2022

A question: when using the Streaming version of the asset to listen for the user's voice throughout the whole session, will this cause calls to the Google Cloud service even when nothing is spoken, or will it only call the API when something is said?

A related question: I already bought the Streaming asset, but I don't need the interim recognition results for half-sentences. I only need the final result. Do I now need to also buy the non-Streaming asset to achieve this?

Thanks!

fire_crystal · May 12, 2023

Thank you for providing the wonderful asset "Streaming Speech Recognition using Google Cloud [VR\AR\Mobile\Desktop] Pro" (Version 1.0.3).

Now, when building with Unity 2021.3.25f1 with Scripting Backend set to IL2CPP, the following error occurs.

Building Library\Bee\artifacts\WinPlayerBuildProgram\u1oik\GameAssembly.dll failed with output:
Creating library Library/Bee/artifacts/WinPlayerBuildProgram/u1oik/GameAssembly.dll.lib and object Library/Bee/artifacts/WinPlayerBuildProgram/u1oik/GameAssembly.dll.exp
3ii0_pc.Core__1.obj : error LNK2019: unresolved external symbol dlopen referenced in function Mono_dlopen_m28A6FCFD6D4175345383F596F0DAA79E26C34070
3ii0_pc.Core__1.obj : error LNK2019: unresolved external symbol dlerror referenced in function Mono_dlerror_mCE4B2AE1A919E371751AEEAE600318E2470B3E88
3ii0_pc.Core__1.obj : error LNK2019: unresolved external symbol dlsym referenced in function Mono_dlsym_m7B83E4542E62BE8A07581ABFE015F499C692682E
Library\Bee\artifacts\WinPlayerBuildProgram\u1oik\GameAssembly.dll : fatal error LNK1120: 3 unresolved externals
UnityEngine.GUIUtility:ProcessEvent (int,intptr,bool&)

If you change the Scripting Backend to Mono, the build will succeed, but I would like to use IL2CPP for performance and obfuscation.

By the way, if I delete "Streaming Speech Recognition using Google Cloud [VR\AR\Mobile\Desktop] Pro" (Version 1.0.3) from the project, no error occurs when building with Scripting Backend set to IL2CPP.

I would appreciate it if you could tell me the solution.

Fangh · Nov 16, 2023

A warning is making the build impossible:

Assets/FrostweepGames/StreamingSpeechRecognition/Scripts/GCStreamingSpeechRecognition.cs(459,6): warning CS0162: Unreachable code detected

using Unity 2021.3.32 on iOS

Fangh · Nov 16, 2023

There is also an error with IL2CPP
using Unity 2021.3.32 and iOS

Exception: Unity.IL2CPP.Building.BuilderFailedException: Build failed with 0 successful nodes and 0 failed ones
Error: Internal build system error. Backend exited with code 2.
tundra: error: Failed to open file "/Users/morgan/Documents/GitHub/Test/Library/Il2cppBuildCache/iOS/buildstate/tundra.log.json" for structured logging
at il2cpp.Program.DoRun(String[] args, RuntimePlatform platform, Il2CppCommandLineArguments il2CppCommandLineArguments, BuildingOptions buildingOptions, Boolean throwExceptions) in /Users/bokken/build/output/unity/il2cpp/il2cpp/Program.cs:line 339
UnityEditorInternal.Runner.RunProgram (UnityEditor.Utils.Program p, System.String exe, System.String args, System.String workingDirectory, UnityEditor.Scripting.Compilers.CompilerOutputParserBase parser) (at /Users/bokken/build/output/unity/unity/Editor/Mono/BuildPipeline/BuildUtils.cs:129)
UnityEditorInternal.Runner.RunNetCoreProgram (System.String exe, System.String args, System.String workingDirectory, UnityEditor.Scripting.Compilers.CompilerOutputParserBase parser, System.Action`1[T] setupStartInfo) (at /Users/bokken/build/output/unity/unity/Editor/Mono/BuildPipeline/BuildUtils.cs:91)
UnityEditorInternal.IL2CPPBuilder.RunIl2CppWithArguments (System.Collections.Generic.List`1[T] arguments, System.Action`1[T] setupStartInfo) (at /Users/bokken/build/output/unity/unity/Editor/Mono/BuildPipeline/Il2Cpp/IL2CPPUtils.cs:817)
UnityEditorInternal.IL2CPPBuilder.ConvertPlayerDlltoCpp (UnityEditor.Il2Cpp.Il2CppBuildPipelineData data) (at /Users/bokken/build/output/unity/unity/Editor/Mono/BuildPipeline/Il2Cpp/IL2CPPUtils.cs:801)
UnityEditorInternal.IL2CPPBuilder.Run () (at /Users/bokken/build/output/unity/unity/Editor/Mono/BuildPipeline/Il2Cpp/IL2CPPUtils.cs:639)
UnityEditorInternal.IL2CPPUtils.RunIl2Cpp (System.String tempFolder, System.String stagingAreaData, UnityEditorInternal.IIl2CppPlatformProvider platformProvider, System.Action`1[T] modifyOutputBeforeCompile, UnityEditor.RuntimeClassRegistry runtimeClassRegistry) (at /Users/bokken/build/output/unity/unity/Editor/Mono/BuildPipeline/Il2Cpp/IL2CPPUtils.cs:279)
UnityEditor.iOS.PostProcessiPhonePlayer.CrossCompileManagedDlls (UnityEditor.iOS.PostProcessiPhonePlayer+BuildSettings bs, UnityEditor.iOS.PostProcessiPhonePlayer+ProjectPaths paths, UnityEditor.AssemblyReferenceChecker checker, UnityEditor.RuntimeClassRegistry usedClassRegistry, UnityEditor.Build.Reporting.BuildReport buildReport) (at /Users/bokken/build/output/unity/unity/PlatformDependent/iPhonePlayer/Extensions/Common/BuildPostProcessor.cs:943)
UnityEditor.iOS.PostProcessiPhonePlayer.PostProcess (UnityEditor.iOS.PostProcessiPhonePlayer+BuildSettings bs, UnityEditor.iOS.PostProcessiPhonePlayer+ProjectPaths paths, UnityEditor.RuntimeClassRegistry usedClassRegistry, UnityEditor.Build.Reporting.BuildReport buildReport) (at /Users/bokken/build/output/unity/unity/PlatformDependent/iPhonePlayer/Extensions/Common/BuildPostProcessor.cs:759)
UnityEditor.iOS.PostProcessiPhonePlayer.PostProcess (UnityEditor.iOS.PostProcessorSettings postProcessorSettings, UnityEditor.Modules.BuildPostProcessArgs args) (at /Users/bokken/build/output/unity/unity/PlatformDependent/iPhonePlayer/Extensions/Common/BuildPostProcessor.cs:699)
UnityEditor.iOS.iOSBuildPostprocessor.PostProcess (UnityEditor.Modules.BuildPostProcessArgs args) (at /Users/bokken/build/output/unity/unity/PlatformDependent/iPhonePlayer/Extensions/Common/ExtensionModule.cs:45)
Rethrow as BuildFailedException: Exception of type 'UnityEditor.Build.BuildFailedException' was thrown.
UnityEditor.iOS.iOSBuildPostprocessor.PostProcess (UnityEditor.Modules.BuildPostProcessArgs args) (at /Users/bokken/build/output/unity/unity/PlatformDependent/iPhonePlayer/Extensions/Common/ExtensionModule.cs:49)
UnityEditor.Modules.DefaultBuildPostprocessor.PostProcess (UnityEditor.Modules.BuildPostProcessArgs args, UnityEditor.BuildProperties& outProperties) (at /Users/bokken/build/output/unity/unity/Editor/Mono/Modules/DefaultBuildPostprocessor.cs:28)
UnityEditor.PostprocessBuildPlayer.Postprocess (UnityEditor.BuildTargetGroup targetGroup, UnityEditor.BuildTarget target, System.Int32 subtarget, System.String installPath, System.String companyName, System.String productName, System.Int32 width, System.Int32 height, UnityEditor.BuildOptions options, UnityEditor.RuntimeClassRegistry usedClassRegistry, UnityEditor.Build.Reporting.BuildReport report) (at /Users/bokken/build/output/unity/unity/Editor/Mono/BuildPipeline/PostprocessBuildPlayer.cs:370)
UnityEngine.GUIUtility:ProcessEvent(Int32, IntPtr, Boolean&) (at /Users/bokken/build/output/unity/unity/Modules/IMGUI/GUIUtility.cs:189)

Fangh · Nov 16, 2023

Fangh said: ↑

There is also an error with IL2CPP
using Unity 2021.3.32 and iOS

View attachment 1331974

I found the fix: https://stackoverflow.com/questions...-unity-il2cpp-building-builderfailedexception
