Search Unity

Search

[RELEASED] OpenCV for Unity

Discussion in 'Assets and Asset Store' started by EnoxSoftware, Oct 30, 2014.

Page 44 of 64

ina

Joined:

Nov 15, 2010

Posts:

1,085

Does the MaskRCNN example work with inceptionv3 or others?

ina, Sep 29, 2019

#2151
ina

Joined:

Nov 15, 2010

Posts:

1,085

redagrandrei said: ↑

@EnoxSoftware
Hello. Planing to buy this asset. Though as im not rly familiar with this plugin and opencv i have a question.
Is it posible to train a model in open cv (i assume its in pure .Net or maybe Python) and then import the trained model into unity using your plugin? So basicaly what environment i need and is the best for training a new network and how do i add the new network into the project?
Correct me on these steps if im wrong. Thank you.
Click to expand...

Yes they provide dnn support to load trained machine learning models. Training a model involves a lot of knowledge separate from OpenCVForUnity - maybe take a machine learning course for free on kaggle

ina, Sep 30, 2019

#2152
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

redagrandrei said: ↑

@EnoxSoftware
Hello. Planing to buy this asset. Though as im not rly familiar with this plugin and opencv i have a question.
Is it posible to train a model in open cv (i assume its in pure .Net or maybe Python) and then import the trained model into unity using your plugin? So basicaly what environment i need and is the best for training a new network and how do i add the new network into the project?
Correct me on these steps if im wrong. Thank you.
Click to expand...

OpenCVForUnity supports models trained by Python and other methods. Models that work with OpenCV4.1.0 will also work with OpenCVForUnity2.3.6.
https://github.com/opencv/opencv/wiki/TensorFlow-Object-Detection-API

EnoxSoftware, Oct 1, 2019

#2153
ZerotheLone

Joined:

Nov 9, 2017

Posts:

16

I tried building for IOS using xcode and I got a lot of "undefined symbol" errors, all of them have to do with OpenCV functions it seems (e.x. cv::Mat::eye(cv::Size_<int>,int)). Am I missing something? I have the latest version of opencvforunity.

ZerotheLone, Oct 1, 2019

#2154
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

ina said: ↑

Does the MaskRCNN example work with inceptionv3 or others?
Click to expand...

Unfortunately, I don't have an example of using inceptionv3.

EnoxSoftware, Oct 2, 2019

#2155
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

ZerotheLone said: ↑

I tried building for IOS using xcode and I got a lot of "undefined symbol" errors, all of them have to do with OpenCV functions it seems (e.x. cv::Mat::eye(cv::Size_<int>,int)). Am I missing something? I have the latest version of opencvforunity.
Click to expand...

Thank you very much for reporting.
Could you tell me about your test environment?
Unity version :
Xcode version :

EnoxSoftware, Oct 2, 2019

#2156
ZerotheLone

Joined:

Nov 9, 2017

Posts:

16

EnoxSoftware said: ↑

Thank you very much for reporting.
Could you tell me about your test environment?
Unity version :
Xcode version :
Click to expand...

Xode is version 11 and Unity is 2018.3.0f
Thank you.

ZerotheLone, Oct 3, 2019

#2157
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

ZerotheLone said: ↑

Xode is version 11 and Unity is 2018.3.0f
Thank you.
Click to expand...

From Unity2018.3.0f1 to Unity2018.3.3f1, it is necessary to turn off "Enable Bitcode" on Xcode.

EnoxSoftware, Oct 5, 2019

#2158
ZerotheLone

Joined:

Nov 9, 2017

Posts:

16

EnoxSoftware said: ↑

From Unity2018.3.0f1 to Unity2018.3.3f1, it is necessary to turn off "Enable Bitcode" on Xcode.
View attachment 493361
Click to expand...

Thank you for the fast reply. I disabled Bitcode but I'm still getting the 100 error. Here are pictures showing the errors I have. Could it be a problem with my version of Xcode? I have xcode 11 but I haven't updated to latest version since I'm low on space.

ZerotheLone, Oct 7, 2019

#2159
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

ZerotheLone said: ↑

Thank you for the fast reply. I disabled Bitcode but I'm still getting the 100 error. Here are pictures showing the errors I have. Could it be a problem with my version of Xcode? I have xcode 11 but I haven't updated to latest version since I'm low on space.
View attachment 494333 View attachment 494336 View attachment 494339
Click to expand...

Is ImportSettings properly set?

Also, It is recommended to use Xcode10.2 when using Unity2018.3.0f1.

EnoxSoftware, Oct 10, 2019

#2160
simar88

Joined:

Jul 7, 2012

Posts:

1
Dear Enox Software, building for iOS i'm reeveing this error:
Library not loaded: @rpath/opencv2.framework/opencv2
You can find details attached.
Thanks in advance,
Simone
Attached Files:
- Schermata 2019-10-10 alle 12.54.41.png
  
  File size:
  
  52.8 KB
  
  Views:
  
  504
simar88, Oct 10, 2019

#2161
milamila

Joined:

Nov 21, 2013

Posts:

16

Dear Enox Software, before purchasing the asset could you please help me to understand
will it be possible to add an "OpenPose deep learning library" to the project (https://github.com/CMU-Perceptual-Computing-Lab/openpose)
and then get something like this on ios device https://github.com/sarweshshah/gait_analysis/blob/master/results/pose trail.gif
Thank you in advance

milamila, Oct 10, 2019

#2162
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

simar88 said: ↑

Dear Enox Software, building for iOS i'm reeveing this error:
Library not loaded: @rpath/opencv2.framework/opencv2
You can find details attached.
Thanks in advance,
Simone
Click to expand...

Thank you very much for reporting.
Could you tell me about your test environment?
Unity version :
Xcode version :

Is ImportSettings properly set?

EnoxSoftware, Oct 11, 2019

#2163
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

milamila said: ↑

Dear Enox Software, before purchasing the asset could you please help me to understand
will it be possible to add an "OpenPose deep learning library" to the project (https://github.com/CMU-Perceptual-Computing-Lab/openpose)
and then get something like this on ios device https://github.com/sarweshshah/gait_analysis/blob/master/results/pose trail.gif
Thank you in advance
Click to expand...

OpenPoseExample is included in OpenCVForUnity.
https://github.com/EnoxSoftware/Ope...inModules/dnn/CaffeExample/OpenPoseExample.cs
for now, Since OpenCV's Dnn module uses a CPU, OpenPoseExample takes more than 900 ms to estimate human pose. Perhaps real-time processing is difficult.
But, this repository using OpenCVForUnity seems to work in real-time.
https://twitter.com/yukihiko_a/status/1131174274708910080
https://github.com/yukihiko/ThreeDPoseUnityForiOS

EnoxSoftware, Oct 11, 2019

#2164
milamila

Joined:

Nov 21, 2013

Posts:

16

EnoxSoftware said: ↑

OpenPoseExample is included in OpenCVForUnity.
https://github.com/EnoxSoftware/Ope...inModules/dnn/CaffeExample/OpenPoseExample.cs
for now, Since OpenCV's Dnn module uses a CPU, OpenPoseExample takes more than 900 ms to estimate human pose. Perhaps real-time processing is difficult.
But, this repository using OpenCVForUnity seems to work in real-time.
https://twitter.com/yukihiko_a/status/1131174274708910080
https://github.com/yukihiko/ThreeDPoseUnityForiOS
Click to expand...

Thanks a lot for your reply!

milamila, Oct 11, 2019

#2165
ZerotheLone

Joined:

Nov 9, 2017

Posts:

16

EnoxSoftware said: ↑

Is ImportSettings properly set?
View attachment 495560
View attachment 495563

Also, It is recommended to use Xcode10.2 when using Unity2018.3.0f1.
Click to expand...

I think I have this set up correctly, both files in iOS are set to iOS in the platform. I'll look into using an older Xcode.

ZerotheLone, Oct 13, 2019

#2166
Aske_S_K

Joined:

Mar 9, 2014

Posts:

13

EnoxSoftware said: ↑

I think that it is probably possible.But I do not have an implementation example.
Since this asset is a clone of OpenCV Java, you are able to use the same API as OpenCV Java.
If there is implementation example using "OpenCV Java", I think that it can be implemented even using "OpenCV for Unity".
Click to expand...

Hi.
Is there any update to whether one can achieve the same as with Vuforia Image Targets? Meaning, can I predefine a sample of photos (at least 2, hopefully up to 100) and then have the camera recognise at least one photo at a time?

Aske_S_K, Oct 15, 2019

#2167
ZerotheLone

Joined:

Nov 9, 2017

Posts:

16

EnoxSoftware said: ↑

Thank you very much for reporting.
Could you tell me about your test environment?
Unity version :
Xcode version :

Is ImportSettings properly set?
View attachment 496142
View attachment 496145
Click to expand...

Do build settings matter when trying to with OpenCVForUnity? I have my minimum iOS version at version 9.

ZerotheLone, Oct 17, 2019

#2168
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

Aske_S_K said: ↑

Hi.
Is there any update to whether one can achieve the same as with Vuforia Image Targets? Meaning, can I predefine a sample of photos (at least 2, hopefully up to 100) and then have the camera recognise at least one photo at a time?
Click to expand...

Unfortunately, I don't have an example of detecting multiple MarkerTargets.
The PatternDetector class supports only one pattern marker image. If you want to use multiple different markers, you need to use multiple PatternDetector classes.

EnoxSoftware, Oct 17, 2019

#2169
dimib

Joined:

Apr 16, 2017

Posts:

50

Hey together,

we purchased the OpenCV package and would like to run the SFM module that is included in OpenCV 3.0+ . Can this be included in an upcoming patch of this asset?

Any suggestions how we can use SFM with Unity right now?

Best regards

dimib, Oct 18, 2019

#2170
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

ZerotheLone said: ↑

Do build settings matter when trying to with OpenCVForUnity? I have my minimum iOS version at version 9.
Click to expand...

OpenCVForUnity requires iOS version 8 or higher.

EnoxSoftware, Oct 18, 2019

#2171
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

dimib said: ↑

Hey together,

we purchased the OpenCV package and would like to run the SFM module that is included in OpenCV 3.0+ . Can this be included in an upcoming patch of this asset?

Any suggestions how we can use SFM with Unity right now?

Best regards
Click to expand...

Since this asset is a clone of OpenCV Java 4.1.0, you are able to use the same API as OpenCV Java 4.1.0.
For now, the sfm module is not planned to be implemented.

EnoxSoftware, Oct 19, 2019

#2172
ZerotheLone

Joined:

Nov 9, 2017

Posts:

16

EnoxSoftware said: ↑

OpenCVForUnity requires iOS version 8 or higher.
Click to expand...

I updated my project from Unity 2018 to Unity 2019.2. My file is saved on GitHub but when I try to clone it and access it from my Mac I get this error in the editor when it tries to run OpenCV.

ZerotheLone, Oct 26, 2019

#2173
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

ZerotheLone said: ↑

I updated my project from Unity 2018 to Unity 2019.2. My file is saved on GitHub but when I try to clone it and access it from my Mac I get this error in the editor when it tries to run OpenCV.
View attachment 504242
Click to expand...

Is ImportSettings of "OpenCVForUnity / Assets / OpenCVForUnity / Plugins / macOS / opencvforunity.bundle" set correctly?

EnoxSoftware, Oct 27, 2019

#2174
ZerotheLone

Joined:

Nov 9, 2017

Posts:

16

EnoxSoftware said: ↑

Is ImportSettings of "OpenCVForUnity / Assets / OpenCVForUnity / Plugins / macOS / opencvforunity.bundle" set correctly?
View attachment 504536
Click to expand...

I don't have that folder. I have a version of OpenCVForUnity from 2018, do I need to get the latest version in order for it to work?

ZerotheLone, Oct 27, 2019

#2175
wcchoe

Joined:

Nov 22, 2013

Posts:

5
Thanks for the awesome plugin.
But recently with Unity 2019.2.9f1, running any of example using camera lead to crash.

I got error:

Code (CSharp):

E/Camera3-OutputStream: getBufferLocked: Stream 0: Can't dequeue next output buffer: Broken pipe (-32)

E/Camera3-Device: RequestThread: Can't get output buffer, skipping request: Broken pipe (-32)

Previous version of Unity I worked with (2018.4.6f1) has no such problem. I tried on galaxy S6 and S8+. It would be nice if you fix this problem.
wcchoe, Oct 28, 2019

#2176
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

ZerotheLone said: ↑

I don't have that folder. I have a version of OpenCVForUnity from 2018, do I need to get the latest version in order for it to work?
Click to expand...

Could you tell me about your test environment?
Unity version :
OpenCVForUnity version :
macOS version :

EnoxSoftware, Oct 28, 2019

#2177
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566
wcchoe said: ↑

Thanks for the awesome plugin.
But recently with Unity 2019.2.9f1, running any of example using camera lead to crash.

I got error:

Code (CSharp):

E/Camera3-OutputStream: getBufferLocked: Stream 0: Can't dequeue next output buffer: Broken pipe (-32)

E/Camera3-Device: RequestThread: Can't get output buffer, skipping request: Broken pipe (-32)

Previous version of Unity I worked with (2018.4.6f1) has no such problem. I tried on galaxy S6 and S8+. It would be nice if you fix this problem.
Click to expand...

Thank you very much for reporting.
Does this problem only occur when using OpenCVForUnity?
Also,
Does the following simple WebCamTexture code work without problems?
https://docs.unity3d.com/ScriptReference/WebCamTexture.Play.html

Code (CSharp):

using UnityEngine;

using System.Collections;

public class ExampleClass : MonoBehaviour

{

void Start()

{

WebCamTexture webcamTexture = new WebCamTexture();

Renderer renderer = GetComponent<Renderer>();

renderer.material.mainTexture = webcamTexture;

webcamTexture.Play();

}

}
EnoxSoftware, Oct 28, 2019

#2178
wcchoe

Joined:

Nov 22, 2013

Posts:

5
EnoxSoftware said: ↑

Thank you very much for reporting.
Does this problem only occur when using OpenCVForUnity?
Also,
Does the following simple WebCamTexture code work without problems?
https://docs.unity3d.com/ScriptReference/WebCamTexture.Play.html

Code (CSharp):

using UnityEngine;

using System.Collections;

public class ExampleClass : MonoBehaviour

{

void Start()

{

WebCamTexture webcamTexture = new WebCamTexture();

Renderer renderer = GetComponent<Renderer>();

renderer.material.mainTexture = webcamTexture;

webcamTexture.Play();

}

}

Click to expand...

Ok it was problem of WebCamTexture. Thanks for answering!
wcchoe, Oct 29, 2019

#2179

EnoxSoftware likes this.
UlyanovItTest

Joined:

Aug 6, 2019

Posts:

4

Good afternoon! I have a problem with Android:
1) cannot get the number of frames in the video. Always returns 0
2) I can not set the desired frame. Always returns false
Although I for example can take from all cadres on waiting lists.
Under Windows 10 everything works fine
I have:
Unity: 2018.2. 21f1
Watch honor of the 20 (of the Top 9)
Samsung Galaxy S6 Edge (Android 7.0)
Thanks in advance for the answer

UlyanovItTest, Oct 29, 2019

#2180
Jeff_Blumenthal

Joined:

Apr 7, 2017

Posts:

8
Hi,

I'm building a pupil detector using Keras model using Open CV For Unity that I purchased. I am having trouble converting the OpenCVForUnity.CoreModule.Mat object to a Numpy.NDarray that is required for the model predict method. I searched many sites and docs but cannot figure it out. Can someone please help me? Thank you.

Code (CSharp):

var model = Keras.Models.Model.ModelFromJson(jsonAIModel);

model.LoadWeight(aiModelWeights);

// error on next line: cannot convert from 'OpenCVForUnity.CoreModule.Mat' to 'Numpy.NDarray'

var result = model.Predict(image);
Jeff_Blumenthal, Oct 29, 2019

#2181
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

UlyanovItTest said: ↑

Good afternoon! I have a problem with Android:
1) cannot get the number of frames in the video. Always returns 0
2) I can not set the desired frame. Always returns false
Although I for example can take from all cadres on waiting lists.
Under Windows 10 everything works fine
I have:
Unity: 2018.2. 21f1
Watch honor of the 20 (of the Top 9)
Samsung Galaxy S6 Edge (Android 7.0)
Thanks in advance for the answer
Click to expand...

Thank you very much for reporting.
Is the example you tested a VideoCaptureExample ("768x576_mjpeg.mjpeg") ?
Also, It seems that whether the "Videoio.CAP_PROP_POS_FRAMES" property can be acquired depends on the video format.

EnoxSoftware, Oct 30, 2019

#2182
UlyanovItTest

Joined:

Aug 6, 2019

Posts:

4
EnoxSoftware said: ↑

Thank you very much for reporting.
Is the example you tested a VideoCaptureExample ("768x576_mjpeg.mjpeg") ?
Also, It seems that whether the "Videoio.CAP_PROP_POS_FRAMES" property can be acquired depends on the video format.
Click to expand...

I took the code from the example as a basis. Modified under its task. Now, tested on ios (Iphone 6). Everything works correctly. On Android:

Code (CSharp):

_video Capture.set (Videoio.CAP_PROP_POS_FRAMES, index);

return false
The same code on ios returns true
Video formats I use: avi, mp4, mov
UlyanovItTest, Oct 30, 2019

#2183
UlyanovItTest

Joined:

Aug 6, 2019

Posts:

4
Sorry, on Ios, too, not correctly fulfills. Although the ios code returns true, the end result is still incorrect.
Editor:
Mac
frameCount: 734

Code (CSharp):

videoCapture.set (Videoio.CAP_PROP_POS_FRAMES, UpdateFrame.First().Number) - return true

Fulfills completely correctly.

Win10:
frameCount: 734

Code (CSharp):

videoCapture.set (Videoio.CAP_PROP_POS_FRAMES, UpdateFrame.First().Number) - return true

Fulfills completely correctly.

Mobile:
Android:
frameCount: 0

Code (CSharp):

videoCapture.set (Videoio.CAP_PROP_POS_FRAMES, UpdateFrame.First().Number) - return false

Fulfills incorrectly

Ios:
frameCount: 1

Code (CSharp):

videoCapture.set (Videoio.CAP_PROP_POS_FRAMES, UpdateFrame.First().Number) - return true

Fulfills incorrectly

My code is frame extraction

Code (CSharp):

if (UpdateFrame.First().Number >= 0)

_videoCapture.set(Videoio.CAP_PROP_POS_FRAMES,

UpdateFrame.First().Number);

_videoCapture.grab();

_videoCapture.retrieve(_imgMat, 0);

Imgproc.resize(_imgMat, _imgMat, new Size(512, 512));

//Creating a texture from a new frame

#if UNITY_EDITOR || UNITY_IOS

Imgproc.cvtColor(_imgMat, _imgMat, Imgproc.COLOR_BGR2RGB);

#endif

if (UpdateFrame.First().texture == null)

{

var texture = new Texture2D(_imgMat.cols(),

_imgMat.rows(), TextureFormat.RGB24, false);

Utils.matToTexture2D(_imgMat, texture);

UpdateFrame.First().texture = texture;

}

else

{

Utils.fastMatToTexture2D(_imgMat,

(Texture2D)UpdateFrame.First().texture);

}

UpdateFrame.Remove(UpdateFrame.First());

Maybe I'm doing something wrong? Thanks in advance for the answer^^
UlyanovItTest, Oct 30, 2019

#2184
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566
UlyanovItTest said: ↑

Sorry, on Ios, too, not correctly fulfills. Although the ios code returns true, the end result is still incorrect.
Editor:
Mac
frameCount: 734

Code (CSharp):

videoCapture.set (Videoio.CAP_PROP_POS_FRAMES, UpdateFrame.First().Number) - return true

Fulfills completely correctly.

Win10:
frameCount: 734

Code (CSharp):

videoCapture.set (Videoio.CAP_PROP_POS_FRAMES, UpdateFrame.First().Number) - return true

Fulfills completely correctly.

Mobile:
Android:
frameCount: 0

Code (CSharp):

videoCapture.set (Videoio.CAP_PROP_POS_FRAMES, UpdateFrame.First().Number) - return false

Fulfills incorrectly

Ios:
frameCount: 1

Code (CSharp):

videoCapture.set (Videoio.CAP_PROP_POS_FRAMES, UpdateFrame.First().Number) - return true

Fulfills incorrectly

My code is frame extraction

Code (CSharp):

if (UpdateFrame.First().Number >= 0)

_videoCapture.set(Videoio.CAP_PROP_POS_FRAMES,

UpdateFrame.First().Number);

_videoCapture.grab();

_videoCapture.retrieve(_imgMat, 0);

Imgproc.resize(_imgMat, _imgMat, new Size(512, 512));

//Creating a texture from a new frame

#if UNITY_EDITOR || UNITY_IOS

Imgproc.cvtColor(_imgMat, _imgMat, Imgproc.COLOR_BGR2RGB);

#endif

if (UpdateFrame.First().texture == null)

{

var texture = new Texture2D(_imgMat.cols(),

_imgMat.rows(), TextureFormat.RGB24, false);

Utils.matToTexture2D(_imgMat, texture);

UpdateFrame.First().texture = texture;

}

else

{

Utils.fastMatToTexture2D(_imgMat,

(Texture2D)UpdateFrame.First().texture);

}

UpdateFrame.Remove(UpdateFrame.First());

Maybe I'm doing something wrong? Thanks in advance for the answer^^
Click to expand...

When using the mjpeg codec, we confirmed that the following example works on all platforms.
https://www.dropbox.com/s/vxeutihxekb4nhq/New_VideoCaptureExample.unitypackage?dl=0
It seems that it depends on the platform whether video files other than mjpeg support "Videoio.CAP_PROP_POS_FRAMES".
EnoxSoftware, Nov 1, 2019

#2185
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566
Jeff_Blumenthal said: ↑

Hi,

I'm building a pupil detector using Keras model using Open CV For Unity that I purchased. I am having trouble converting the OpenCVForUnity.CoreModule.Mat object to a Numpy.NDarray that is required for the model predict method. I searched many sites and docs but cannot figure it out. Can someone please help me? Thank you.

Code (CSharp):

var model = Keras.Models.Model.ModelFromJson(jsonAIModel);

model.LoadWeight(aiModelWeights);

// error on next line: cannot convert from 'OpenCVForUnity.CoreModule.Mat' to 'Numpy.NDarray'

var result = model.Predict(image);

Click to expand...

To convert from the Mat class to the Numpy.NDarray class, you need to set Mat data to the Numpy.NDarray class in some way.
Utils.copyFromMat () can copy byte array data from Mat.
https://enoxsoftware.github.io/Open..._utils.html#ab14e60409dd3fa505da151d92e4870c0
Does Numpy.NDarray class have a method to initialize by specifying byte array data?
EnoxSoftware, Nov 4, 2019

#2186
Luke58

Joined:

Sep 3, 2019

Posts:

2

Dear EnoxSoftware,

I am working in licence car plate recognition. I have an issue with loading classification.xml file from android and put into KNN recognition. Here is C++ version, I can't take it to work in C# in unity with opencvforunity. Is there function to read xml file and put into Mat? Thank you for your answer.

Best regards,
Luke.

cv::Mat matClassificationInts;

cv::FileStorage fsClassifications("classifications.xml", cv::FileStorage::READ); // open the classifications file

if (fsClassifications.isOpened() == false) { // if the file was not opened successfully
std::cout << "error, unable to open training classifications file, exiting program\n\n"; // show error message
return(false); // and exit program
}

fsClassifications["classifications"] >> matClassificationInts; // read classifications section into Mat classifications variable
fsClassifications.release(); // close the classifications file
// read in training classifications

Luke58, Nov 6, 2019

#2187
pouria77

Joined:

Aug 13, 2014

Posts:

36
Hello

I've been struggling with this for a couple of days now, so I was wondering if anyone can give me some guidance.
I'm following an example from the book "OpenCV by example" for text recognition which is in c++, and so far have successfully converted every part to C# and unity except this function:

Code (CSharp):

Mat drawER(const vector<Mat> &channels, const vector<vector<ERStat> > &regions, const vector<Vec2i>& group, const Rect& rect)

{

Mat out = Mat::zeros(channels[0].rows+2, channels[0].cols+2, CV_8UC1);

int flags = 4 //4 neighbors

+ (255 << 8) //paint mask in white (255)

+ FLOODFILL_FIXED_RANGE //fixed range

+ FLOODFILL_MASK_ONLY; //Paint just the mask

for (int g=0; g < group.size(); g++)

{

int idx = group[g][0];

ERStat er = regions[idx][group[g][1]];

//Ignore root region

if (er.parent == NULL)

continue;

//Transform the linear pixel value to row and col

int px = er.pixel % channels[idx].cols;

int py = er.pixel / channels[idx].cols;

//Create the point and adds it to the list.

Point p(px, py);

//Draw the extremal region

floodFill(

channels[idx], out, //Image and mask

p, Scalar(255), //Seed and color

nullptr, //No rect

Scalar(er.level),Scalar(0), //LoDiff and upDiff

flags //Flags

);

}

//Crop just the text area and find it's points

out = out(rect);

vector<Point> points;

findNonZero(out, points);

//Use deskew and crop to crop it perfectly

return deskewAndCrop(out, minAreaRect(points));

}

What it does basically is to use the floodfill algorithm (which is available in openCV for unity) to clean and then crop the rotated text areas (photos attached).

This is the explanation of that part from the book:

Fortunately, the ERFilter provides us with an object called ERStat, which contains pixels inside each extremal region. With these pixels, we can use the OpenCV floodFill function to reconstruct each letter. This function is capable of painting similar colored pixels based in a seed point, just like the bucket tool of most drawing applications.

The ERStat object is not available in "openCV for unity" so I have a hard time figuring out how to get that information from the region objects which are MatOfPoints.

I'd appreciate any help.

Thanks.
Attached Files:
- ocr.png
  
  File size:
  
  587.1 KB
  
  Views:
  
  488
pouria77, Nov 6, 2019

#2188
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

Luke58 said: ↑

Dear EnoxSoftware,

I am working in licence car plate recognition. I have an issue with loading classification.xml file from android and put into KNN recognition. Here is C++ version, I can't take it to work in C# in unity with opencvforunity. Is there function to read xml file and put into Mat? Thank you for your answer.

Best regards,
Luke.

cv::Mat matClassificationInts;

cv::FileStorage fsClassifications("classifications.xml", cv::FileStorage::READ); // open the classifications file

if (fsClassifications.isOpened() == false) { // if the file was not opened successfully
std::cout << "error, unable to open training classifications file, exiting program\n\n"; // show error message
return(false); // and exit program
}

fsClassifications["classifications"] >> matClassificationInts; // read classifications section into Mat classifications variable
fsClassifications.release(); // close the classifications file
// read in training classifications
Click to expand...

Since OpenCVForUnity is a clone of OpenCV Java, you are able to use the same API as OpenCV Java 4.1.0(https://docs.opencv.org/master/javadoc/). FileStorage class has not been implemented.

EnoxSoftware, Nov 7, 2019

#2189
Luke58

Joined:

Sep 3, 2019

Posts:

2
EnoxSoftware said: ↑

Since OpenCVForUnity is a clone of OpenCV Java, you are able to use the same API as OpenCV Java 4.1.0(https://docs.opencv.org/master/javadoc/). FileStorage class has not been implemented.
Click to expand...

I found solution, sometimes is better to go sleep and wake up with fresh mind. Here is solution to read xml file in opencv for unity. File path I get from Application.persistentDataPath where I put xml files.

Code (CSharp):

Mat classificationMat = new Mat (62, 62, CvType.CV_64FC1);

// string filename = "OCRHMM_transitions_table.xml";

// FileStorage fs(filename, FileStorage::READ);

// fs["transition_probabilities"] >> transition_p;

// fs.release();

//Load ClassificationData.

classificationMat.put (0, 0, GetClassificationsData(filepath));

///....

//function to load data from xml

double[] GetClassificationsData(string filePath)

{

XmlDocument xmlDoc = new XmlDocument();

xmlDoc.Load(filePath);

XmlNode dataNode = xmlDoc.GetElementsByTagName("data").Item(0);

// Debug.Log ("dataNode.InnerText " + dataNode.InnerText);

string[] dataString = dataNode.InnerText.Split(new string[] {

" ",

"\r\n", "\n"

}, StringSplitOptions.RemoveEmptyEntries);

// Debug.Log ("dataString.Length " + dataString.Length);

double[] data = new double[dataString.Length];

for (int i = 0; i < data.Length; i++)

{

try

{

data[i] = Convert.ToDouble(dataString[i]);

}

catch (FormatException)

{

Debug.Log("Unable to convert '{" + dataString[i] + "}' to a Double.");

}

catch (OverflowException)

{

Debug.Log("'{" + dataString[i] + "}' is outside the range of a Double.");

}

}

return data;

}
Luke58, Nov 7, 2019

#2190

EnoxSoftware likes this.
pouria77

Joined:

Aug 13, 2014

Posts:

36
Hi again

So I'll give more context of what I'm trying to do. I'll appreciate if anyone can tell me what I'm doing wrong.
I'm using OpenCV for Unity and am trying to scan book covers and get their title/author as strings.
This is my code so far (sorry it's a little long. I thought I'll be helpful post the whole thing):

Code (CSharp):

void Start()

{

Mat transition_p = new Mat(62, 62, CvType.CV_64FC1);

transition_p.put(0, 0, GetTransitionProbabilitiesData(OCRHMM_transitions_table_filepath));

Mat emission_p = Mat.eye(62, 62, CvType.CV_64FC1);

string voc = "abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789";

_decoder = OCRHMMDecoder.create(OCRHMM_knn_model_data_filepath, voc, transition_p, emission_p);

}

List<Mat> SeparateChannnels(Mat src)

{

List<Mat> channels = new List<Mat>();

//Grayscale images

if (src.type() == CvType.CV_8U || src.type() == CvType.CV_8UC1)

{

channels.Add(src);

channels.Add(new Scalar(255) - src);

}

//Colored images

else if (src.type() == CvType.CV_8UC3)

{

Text.computeNMChannels(src, channels);//, Text.ERFILTER_NM_IHSGrad);

int size = channels.Count;

for (int c = 0; c < size; c++)

channels.Add(new Scalar(255) - channels[c]);

}

return channels;

}

public void DetectText()

{

Mat frame = GetMatFromWebcam();

Mat original = new Mat(frame.size(), frame.type());

frame.copyTo(original);

_output.DisplayMat(frame);

List<Mat> channels = SeparateChannnels(frame);

foreach (Mat channel in channels)

_output.DisplayMat(channel);

ERFilter er_filter1 = Text.createERFilterNM1(trained_classifierNM1_filepath, 16, 0.00015f, 0.13f, 0.2f, true, 0.1f);

ERFilter er_filter2 = Text.createERFilterNM2(trained_classifierNM2_filepath, 0.5f);

string text = DetectTextRegions(original, channels, er_filter1, er_filter2);

UnityEngine.Debug.Log(text);

}

public string DetectTextRegions(Mat original, List<Mat> channels, ERFilter filter1, ERFilter filter2)

{

List<MatOfPoint> regions = new List<MatOfPoint>();

string outputText = "";

foreach (Mat channel in channels)

{

Text.detectRegions(channel, filter1, filter2, regions);

string groupingFilename = "text/trained_classifier_erGrouping.xml";

string filePath = Utils.getFilePath(groupingFilename);

MatOfRect groupRects = new MatOfRect();

Text.erGrouping(original, channel, regions, groupRects, Text.ERGROUPING_ORIENTATION_HORIZ, groupingFilename);

foreach(Rect rect in groupRects.toList())

{

try

{

Mat m = channel.submat(rect);

m = binarize(m);

_output.DisplayMat(m);

outputText += _decoder.run(m, 0) + "\n";

}

catch (Exception e)

{

}

}

}

return outputText;

}

Mat binarize(Mat mat)

{

//Uses otsu to threshold the input image

Mat binaryImage = mat;

Imgproc.cvtColor(mat, mat, Imgproc.COLOR_BGR2GRAY);

Imgproc.threshold(mat, binaryImage, 0, 255, Imgproc.THRESH_OTSU);

int white = Core.countNonZero(binaryImage);

int black = (int)binaryImage.size().area() - white;

return binaryImage;

}

Basically I'm getting the image of the book cover from the webcam, then separate channels, and for each channel, I separate the text into its own mat, and then pass that mat to the decoder function, which returns a text.
I call the binarize function on the Mat before passing it to the decoder, which uses threshold to simplify the image into two colors only.
I've attached the images for a sample harry potter book and of the channels and the separated text mats.
The problem is, the text I'm getting is not close to the text at all. For the attached book, the text I'm getting is this:

Wo
MPU
FU
IEWFI

I'm not sure what I am doing wrong. Do I need to process the mats further before passing on to the decoder function? or do I need to not use channels? I've tried a couple of solutions, but have not been successfull at all.

Here are the images.
These are the original image and the channels:

And these are the separated text mats:

As you see, the texts are separated nicely. I have separated POTTER and Cursed Child, which I pass to the decoder, but the text I get back is not even close. What am I missing?
Attached Files:
- img3.png
  
  File size:
  
  237.7 KB
  
  Views:
  
  485
Last edited: Nov 7, 2019

pouria77, Nov 7, 2019

#2191
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566
hellhammer said: ↑

Hi again

So I'll give more context of what I'm trying to do. I'll appreciate if anyone can tell me what I'm doing wrong.
I'm using OpenCV for Unity and am trying to scan book covers and get their title/author as strings.
This is my code so far (sorry it's a little long. I thought I'll be helpful post the whole thing):

Code (CSharp):

void Start()

{

Mat transition_p = new Mat(62, 62, CvType.CV_64FC1);

transition_p.put(0, 0, GetTransitionProbabilitiesData(OCRHMM_transitions_table_filepath));

Mat emission_p = Mat.eye(62, 62, CvType.CV_64FC1);

string voc = "abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789";

_decoder = OCRHMMDecoder.create(OCRHMM_knn_model_data_filepath, voc, transition_p, emission_p);

}

List<Mat> SeparateChannnels(Mat src)

{

List<Mat> channels = new List<Mat>();

//Grayscale images

if (src.type() == CvType.CV_8U || src.type() == CvType.CV_8UC1)

{

channels.Add(src);

channels.Add(new Scalar(255) - src);

}

//Colored images

else if (src.type() == CvType.CV_8UC3)

{

Text.computeNMChannels(src, channels);//, Text.ERFILTER_NM_IHSGrad);

int size = channels.Count;

for (int c = 0; c < size; c++)

channels.Add(new Scalar(255) - channels[c]);

}

return channels;

}

public void DetectText()

{

Mat frame = GetMatFromWebcam();

Mat original = new Mat(frame.size(), frame.type());

frame.copyTo(original);

_output.DisplayMat(frame);

List<Mat> channels = SeparateChannnels(frame);

foreach (Mat channel in channels)

_output.DisplayMat(channel);

ERFilter er_filter1 = Text.createERFilterNM1(trained_classifierNM1_filepath, 16, 0.00015f, 0.13f, 0.2f, true, 0.1f);

ERFilter er_filter2 = Text.createERFilterNM2(trained_classifierNM2_filepath, 0.5f);

string text = DetectTextRegions(original, channels, er_filter1, er_filter2);

UnityEngine.Debug.Log(text);

}

public string DetectTextRegions(Mat original, List<Mat> channels, ERFilter filter1, ERFilter filter2)

{

List<MatOfPoint> regions = new List<MatOfPoint>();

string outputText = "";

foreach (Mat channel in channels)

{

Text.detectRegions(channel, filter1, filter2, regions);

string groupingFilename = "text/trained_classifier_erGrouping.xml";

string filePath = Utils.getFilePath(groupingFilename);

MatOfRect groupRects = new MatOfRect();

Text.erGrouping(original, channel, regions, groupRects, Text.ERGROUPING_ORIENTATION_HORIZ, groupingFilename);

foreach(Rect rect in groupRects.toList())

{

try

{

Mat m = channel.submat(rect);

m = binarize(m);

_output.DisplayMat(m);

outputText += _decoder.run(m, 0) + "\n";

}

catch (Exception e)

{

}

}

}

return outputText;

}

Mat binarize(Mat mat)

{

//Uses otsu to threshold the input image

Mat binaryImage = mat;

Imgproc.cvtColor(mat, mat, Imgproc.COLOR_BGR2GRAY);

Imgproc.threshold(mat, binaryImage, 0, 255, Imgproc.THRESH_OTSU);

int white = Core.countNonZero(binaryImage);

int black = (int)binaryImage.size().area() - white;

return binaryImage;

}

Basically I'm getting the image of the book cover from the webcam, then separate channels, and for each channel, I separate the text into its own mat, and then pass that mat to the decoder function, which returns a text.
I call the binarize function on the Mat before passing it to the decoder, which uses threshold to simplify the image into two colors only.
I've attached the images for a sample harry potter book and of the channels and the separated text mats.
The problem is, the text I'm getting is not close to the text at all. For the attached book, the text I'm getting is this:

Wo
MPU
FU
IEWFI

I'm not sure what I am doing wrong. Do I need to process the mats further before passing on to the decoder function? or do I need to not use channels? I've tried a couple of solutions, but have not been successfull at all.

Here are the images.
These are the original image and the channels:

View attachment 510140

And these are the separated text mats:

View attachment 510143

As you see, the texts are separated nicely. I have separated POTTER and Cursed Child, which I pass to the decoder, but the text I get back is not even close. What am I missing?
Click to expand...

What recognition results can your code get when using the test images below?
Differences in text fonts may significantly affect recognition results.

Results of TextRecognitionExample
EnoxSoftware, Nov 9, 2019

#2192
pouria77

Joined:

Aug 13, 2014

Posts:

36
Hii

Thanks a lot for getting back to me.
On that image the results are near perfect. It gets almost all of it correctly.
That's the thing though. Every book has a different font so it becomes challenging.

I think if I get to clean the text area from the whole cover, I'm almost there. Do you know how I can imitate the ERStat code I posted using what we have in openCV for unity?

I'm talking about this part:

Code (CSharp):

for (int g=0; g < group.size(); g++)

{

int idx = group[g][0];

ERStat er = regions[idx][group[g][1]];

//Ignore root region

if (er.parent == NULL)

continue;

//Transform the linear pixel value to row and col

int px = er.pixel % channels[idx].cols;

int py = er.pixel / channels[idx].cols;

//Create the point and adds it to the list.

Point p(px, py);

//Draw the extremal region

floodFill(

channels[idx], out, //Image and mask

p, Scalar(255), //Seed and color

nullptr, //No rect

Scalar(er.level),Scalar(0), //LoDiff and upDiff

flags //Flags

);

}

Is there an object like ERStat in OpenCV for unity or any way to get that info?
Thanks again.
pouria77, Nov 11, 2019

#2193
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566
hellhammer said: ↑

Hii

Thanks a lot for getting back to me.
On that image the results are near perfect. It gets almost all of it correctly.
That's the thing though. Every book has a different font so it becomes challenging.

I think if I get to clean the text area from the whole cover, I'm almost there. Do you know how I can imitate the ERStat code I posted using what we have in openCV for unity?

I'm talking about this part:

Code (CSharp):

for (int g=0; g < group.size(); g++)

{

int idx = group[g][0];

ERStat er = regions[idx][group[g][1]];

//Ignore root region

if (er.parent == NULL)

continue;

//Transform the linear pixel value to row and col

int px = er.pixel % channels[idx].cols;

int py = er.pixel / channels[idx].cols;

//Create the point and adds it to the list.

Point p(px, py);

//Draw the extremal region

floodFill(

channels[idx], out, //Image and mask

p, Scalar(255), //Seed and color

nullptr, //No rect

Scalar(er.level),Scalar(0), //LoDiff and upDiff

flags //Flags

);

}

Is there an object like ERStat in OpenCV for unity or any way to get that info?
Thanks again.
Click to expand...

The "void erGrouping (InputArray image, InputArray channel, vector <vector <Point>> contours, CV_OUT std :: vector <Rect> & groups_rects, int method, const String & filename, float minProbability)" method is implemented in OpenCVForUnity. However, the "void erGrouping(InputArray image, InputArrayOfArrays channels, vector<vector<ERStat> > &regions, vector<vector<Vec2i> > &groups, vector<Rect> &groups_rects, int method, const string& filename, float minProbability)" method is not implemented.
In OpenCV C ++, "vector <vector <Point>>" is converted to "vector <vector <ERStat >>" with the following code.
https://github.com/opencv/opencv_co...b4b0e5971/modules/text/src/erfilter.cpp#L3958
https://github.com/opencv/opencv_co...971/modules/text/src/erfilter.cpp#L3965-L4036
EnoxSoftware, Nov 12, 2019

#2194
pouria77

Joined:

Aug 13, 2014

Posts:

36

Thank you very much. I'll keep that in mind.
For now, I'm actually back to using features to compare images. I think I have a better chance with that instead of OCR, although I'm not sure about the speed of searching features in thousands or millions of covers yet. I'll report back as soon as I test it.

pouria77, Nov 12, 2019

#2195
wwcher

Joined:

Nov 18, 2019

Posts:

1
Hi there

I'm customizing the marker based example, scene GyroSensorMarkerBasedARExample. Is there a way to stretch the webcam's quad to fit the width of the landscape screen while keeping its aspect ratio? Looking into GyroSensorMarkerBasedARExample.cs line 119:

Code (CSharp):

if (widthScale < heightScale) {

Camera.main.orthographicSize = (width * (float)Screen.height / (float)Screen.width) / 2;

imageSizeScale = (float)Screen.height / (float)Screen.width;

} else {

Camera.main.orthographicSize = height / 2;

}

I tried forcing the first block and that makes it fit but the position of ARObjects is wrong, there is something missing.
wwcher, Nov 18, 2019

#2196
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566
wwcher said: ↑

Hi there

I'm customizing the marker based example, scene GyroSensorMarkerBasedARExample. Is there a way to stretch the webcam's quad to fit the width of the landscape screen while keeping its aspect ratio? Looking into GyroSensorMarkerBasedARExample.cs line 119:

Code (CSharp):

if (widthScale < heightScale) {

Camera.main.orthographicSize = (width * (float)Screen.height / (float)Screen.width) / 2;

imageSizeScale = (float)Screen.height / (float)Screen.width;

} else {

Camera.main.orthographicSize = height / 2;

}

I tried forcing the first block and that makes it fit but the position of ARObjects is wrong, there is something missing.
Click to expand...

You will probably need to customize the ARCamera.fieldOfView value.
https://forum.unity.com/threads/released-opencv-for-unity.277080/page-23#post-3086194
EnoxSoftware, Nov 19, 2019

#2197
LevonRavel

Joined:

Feb 26, 2014

Posts:

179
(ENOX IGNORE I have fixed the issue) If anyone comes across this and wants to use it go for it. Only thing is I don't have any pretty things going on ie filters, so you'll have to add those in..

Code (CSharp):

private static void FaceSwap(Texture2D destination, Texture2D origin)

{

var faceLandmarkDetector = new FaceLandmarkDetector(DlibFaceLandmarkDetector.UnityUtils.Utils.getFilePath("sp_human_face_68.dat"));

List<UnityEngine.Rect> dRects = new List<UnityEngine.Rect>();

List<UnityEngine.Rect> oRects = new List<UnityEngine.Rect>();

List<List<Vector2>> landmarkPoints = new List<List<Vector2>>();

Mat dMat = new Mat(destination.height, destination.width, CvType.CV_8UC4);

Mat oMat = new Mat(origin.height, origin.width, CvType.CV_8UC4);

OpenCVForUnity.UnityUtils.Utils.texture2DToMat(destination, dMat);

OpenCVForUnity.UnityUtils.Utils.texture2DToMat(origin, oMat);

//Destination Landmarks

OpenCVForUnityUtils.SetImage(faceLandmarkDetector, dMat);

dRects = faceLandmarkDetector.Detect();

landmarkPoints.Add(faceLandmarkDetector.DetectLandmark(dRects[0]));

//Origin LandMarks

OpenCVForUnityUtils.SetImage(faceLandmarkDetector, oMat); //<---THIS IS THE FIX

oRects = faceLandmarkDetector.Detect();

landmarkPoints.Add(faceLandmarkDetector.DetectLandmark(oRects[0]));

DlibFaceChanger faceSwapper = new DlibFaceChanger();

faceSwapper.SetTargetImage(dMat);

faceSwapper.AddFaceChangeData(oMat, landmarkPoints[1], landmarkPoints[0], 1);

faceSwapper.ChangeFace();

OpenCVForUnity.UnityUtils.Utils.matToTexture2D(dMat, destination);

faceSwapper.Dispose();

}
Last edited: Nov 20, 2019

LevonRavel, Nov 20, 2019

#2198

EnoxSoftware likes this.
LevonRavel

Joined:

Feb 26, 2014

Posts:

179

Hey EnoxSoftware,

I thought I had fixed the issue, but for some reason I cannot get the above code to work on mobile, it seems like its working but the face swap does not happen. any tips please thank you levon.

Update.. the face swap wont work if the image is flipped a different direction from the camera, Is there a way to detect the face no matter what way the image is rotated?

Last edited: Nov 22, 2019

LevonRavel, Nov 22, 2019

#2199
EnoxSoftware

Joined:

Oct 29, 2014

Posts:

1,566

LevonRavel said: ↑

Hey EnoxSoftware,

I thought I had fixed the issue, but for some reason I cannot get the above code to work on mobile, it seems like its working but the face swap does not happen. any tips please thank you levon.

Update.. the face swap wont work if the image is flipped a different direction from the camera, Is there a way to detect the face no matter what way the image is rotated?

Click to expand...

You need to rotate the input Mat.
static void OpenCVForUnity.CoreModule.Core.rotate ( Mat src,,Mat dst, int rotateCode )

https://enoxsoftware.github.io/Open...1_core.html#a8d11b0f392585a665722be8e1e7e428c

EnoxSoftware, Nov 23, 2019

#2200

LevonRavel likes this.

(You must log in or sign up to reply here.)

Page 44 of 64