Search Unity

Search

BenchmarkNet (Stress test for ENet, UNet, LiteNetLib, Lidgren, MiniUDP, Hazel, Photon and others)

Discussion in 'UNet' started by nxrighthere, Jan 13, 2018.

Thread Status:: Not open for further replies.

Page 4 of 7

JesseLord

Joined:

Jan 5, 2015

Posts:

3

Anymore updates on uNet?

JesseLord, Mar 23, 2018

#151
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

I think they are busy due to GDC, but I'll ask them how things are going.

I several times offered Alex my help, but he said that he'll fix it himself.

On my side it's hard to find where the problem is, because the library is working outside of .NET environment and I can't reverse-engineer it. I dig into it with WinDbg, but UNet assembly is like a black box in which you are blindly trying to find something.

Last edited: Mar 27, 2018

nxrighthere, Mar 24, 2018

#152
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Work on the UNet is still in-progress, and there's some good news. Alex changed a lot of code in the library and, now on his machine he successfully runs the test with 8000 simultaneous connections using only one server thread. There are still some places for improvements, but he very busy due to new top priority work. The library will be updated as soon as he has time for it (most likely next week if everything goes smoothly).

Last edited: Mar 24, 2018

nxrighthere, Mar 24, 2018

#153

hottabych, TheBrizleOne, mons00n and 6 others like this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

@Zuntatos Valve is about to open their low-level transport layer. When they upload the source code, I'm going to write a C# bindings and try to integrate it with the application. So yea, no need to bother with the Steamworks API.

Last edited: Apr 3, 2018

nxrighthere, Mar 27, 2018

#154

TheBrizleOne, Zuntatos, Deleted User and 2 others like this.
hjupter

Joined:

Dec 23, 2011

Posts:

628

I just came across this thread and its really interesting. I was wondering if would make any sense to add GameSparks Realtime to this stress test, would be interesting to see how it performs compared to other solutions.
https://docs.gamesparks.com/tutorials/real-time-services/

hjupter, Mar 31, 2018

#155

Munchy2007 likes this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Well, I'm adding support only for low-level libraries and standalone servers, because of the idea behind this test. When you are dealing with solutions that work locally on your end, you can debug everything and gather any data that you want. You can fix the bugs or try to improve something, then just recompile your stuff and test it again. Cloud services it's a very different thing. This tool is not the appropriate solution for testings them.

nxrighthere, Mar 31, 2018

#156

Deleted User likes this.
Zuntatos

Joined:

Nov 18, 2012

Posts:

612

Valves' github on the GameNetworkingSockets has been filled out now; ( https://github.com/ValveSoftware/GameNetworkingSockets )

Zuntatos, Mar 31, 2018

#157
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

They are still working on it. I can't compile it for now due to this issue.

nxrighthere, Apr 1, 2018

#158

DMeville and Deleted User like this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

I think it might be interesting for some people here, so take a look at HumbleNet. It's lightweight, reliable P2P networking library that allows connecting peers between browsers and standalone platforms using the signaling server. HumbleNet supports Unity, pretty easy to integrate with, has an example project and it works well. The source code is available on GitHub as well as the Quake 3 demo.

Last edited: Apr 1, 2018

nxrighthere, Apr 1, 2018

#159
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

That's what I call a rapid issue resolution. I built it successfully. Road to C# bindings.

nxrighthere, Apr 2, 2018

#160

mischa2k and Deleted User like this.
mischa2k

Joined:

Sep 4, 2015

Posts:

4,347

nxrighthere said: ↑

That's what I call a rapid issue resolution. I built it successfully. Road to C# bindings.
Click to expand...

Interesting. Is a Valve networking benchmark planned?

mischa2k, Apr 3, 2018

#161
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Yea, I'm working on it right now.

Last edited: Apr 7, 2018

nxrighthere, Apr 3, 2018

#162

akuno, Kirsche, Deleted User and 2 others like this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Everything almost done, but there is another issue.

nxrighthere, Apr 4, 2018

#163

Deleted User and Kirsche like this.
Deleted User

Guest

nxrighthere said: ↑

Everything almost done, but there is another issue.
Click to expand...

Keep it up! Thank you for your effort!

Deleted User, Apr 4, 2018

#164
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Unfortunately, fellas, the library doesn't work for me. It's initializing fine, but everything else is not working. I spent almost the whole day trying to debug this, but no success. I'm tired.

Last edited: Apr 11, 2018

nxrighthere, Apr 4, 2018

#165

DMeville and Deleted User like this.
PrimeDerektive

Joined:

Dec 13, 2009

Posts:

3,090

With regards to the unet memory leak, how long does BenchmarkNet run, eg how fast does it accumulate? Does this make unet effectively useless?

PrimeDerektive, Apr 4, 2018

#166
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

~12,5 megabytes for initialization of each client and ~700 kilobytes per second for 64 clients during the process, until they stop sending messages. Also, CPU usage is about ~84% where the .NET library does the same job for ~13% with the same amount of logical threads per client.

The source code is closed, and all what we can do is wait the updated version for months. The verdict is up to you.

Last edited: Apr 7, 2018

nxrighthere, Apr 4, 2018

#167
PrimeDerektive

Joined:

Dec 13, 2009

Posts:

3,090

nxrighthere said: ↑

~12,5 megabytes for initialization of each client and ~700 kilobytes per second for 64 clients during the process, until they stop sending messages. Also, CPU usage is about ~84% where the .NET library does the same job for ~13% with the same amount of logical threads per client.

The source code is closed, and all we can do is wait the updated version for months. The verdict is up to you.
Click to expand...

Thanks for the response... is it only on the relay server or am I missing something? When I host a server and connect a client to myself and leave it running for an hour my memory allocation is the same as when I start.

PrimeDerektive, Apr 4, 2018

#168
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

The answer is here.

nxrighthere, Apr 4, 2018

#169
PrimeDerektive

Joined:

Dec 13, 2009

Posts:

3,090

nxrighthere said: ↑

The answer is here.
Click to expand...

I see (I think, I've never even heard of that product). So I assume if i'm just hosting with the basic HLAPI and headless unity instances on EC2 the leak doesn't really apply to me?

PrimeDerektive, Apr 5, 2018

#170
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Nah, LLAPI/HLAPI is unaffected and as far as I know, some improvements already shipped with the latest updates for Unity.

nxrighthere, Apr 5, 2018

#171
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Well, I finally found where the problem is...

It looks like Valve doesn't test well different building toolchains, but they are fixing everything pretty fast.

Last edited: Apr 6, 2018

nxrighthere, Apr 5, 2018

#172

DMeville likes this.
goldbug

Joined:

Oct 12, 2011

Posts:

767

@nxrighthere so you have not tested HLAPI/LLAPI?

goldbug, Apr 5, 2018

#173
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

No, I didn't. Other people tested it.

nxrighthere, Apr 5, 2018

#174
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

I'm stuck with another problem... Debugging such things is a long process, so I don't know when (and if) it will be resolved. If any C# interop guru reads this thread, I'd be glad for any help.

Last edited: Apr 7, 2018

nxrighthere, Apr 6, 2018

#175
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

All right, fellas, we wait when Valve add a flat interface.

Last edited: Apr 7, 2018

nxrighthere, Apr 6, 2018

#176

TheBrizleOne, DMeville and Deleted User like this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567
BenchmarkNet 1.08 has been released.

Added new debugging options

Updated DarkRift to the latest version

Improved performance of ENet wrapper

Improved measurement of length for transmitted data

Improved overall functionality

Fixed clients drop for Neutrino

Fixed string allocations during the process

As always the results were updated for all libraries, and here you can find information about new debugging options.

Man, updating all this stuff manually, makes me tired every time, you know?

Last week I read Writing High-Performance .NET Code written by Ben Watson and found an interesting thing about
P/Invoke:

I tried to use it with the ENet wrapper, and it works. Not a huge impact in this case, but still a bit better. This attribute can be applied to a whole class where you interop native functions and they all will be affected.

Neutrino now passed the test with 500 and 1000 simulated clients. After debugging, I found that the problem was sitting in peers connection timeout. The time interval was too short, and that was causing clients drop. Now it's fixed.

Also, I eliminated almost all in-process memory allocations in the application's functions, except those that caused by TPL:

I have an idea how to solve this but I need to read a couple more things before doing it.

In general, the application is now more optimized but yea, there is still some work to do.

By the way, we recently talked with Alex about how things are going with UNet, and they still have a few problems that must be solved. I'm also impressed that UNet has a Replay Protector and now it processing packets much faster than before due to some improvements. Can't wait for an updated version to see how it works after all changes that they made.
Last edited: Jul 16, 2018

nxrighthere, Apr 14, 2018

#177

moco2k and Deleted User like this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

UNet Server 1.0.0.9 is out. Now I just need to change some code in the application before running the test.

Last edited: Apr 14, 2018

nxrighthere, Apr 14, 2018

#178
SimpuKR

Joined:

Apr 13, 2018

Posts:

1

Hope to see big improvements! ☺

SimpuKR, Apr 14, 2018

#179

nxrighthere likes this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567
BenchmarkNet 1.09 has been released.

Added parameters calculation for UNet

Added timeout calculation for Neutrino

Updated UNet to the latest version

Updated LiteNetLib to the latest version

Updated DarkRift to the latest version

Rebuilt Lidgren with optimized functionality

Improved detection of initial failure

Improved detection of server thread failure

The results of the UNet were updated.

Yes, it finally happened! I've updated the UNet library, tuned new parameters a lot, and now we have a completely different picture. Moderate CPU usage, no more memory leaks, faster processing time in high load scenarios, and lower bandwidth usage. However, there are still a few problems remain. First one, a high memory consumption compared to other networking libraries. Memory is not growing insanely like before, but it's a memory allocation for each client due to initialization. And second, latency is too high when more than 800 simulated clients connected to the server. @aabramychev knows about these problems, and he will try to solve them as soon as possible. We can expect an even better performance most likely this month.
Last edited: Aug 21, 2018

nxrighthere, Apr 18, 2018

#180

mons00n, goldbug, TwoTen and 1 other person like this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

By the way, I implemented a pre-allocation mechanism for tasks, but the funny thing is that it didn't affect the results. The cost of memory allocation is very cheap, so yea, I didn't add this feature to the application. It's just a waste of time, and lines of source code.

Last edited: Jul 17, 2018

nxrighthere, Apr 18, 2018

#181

-chris likes this.
snacktime

Joined:

Apr 15, 2013

Posts:

3,356

Sorry it took so long to get this out. A bit rough I'll probably polish it up some over the next week or so.

FYI this is basically the collection of techniques for optimizing data for realtime games that I've worked out over the years.

https://github.com/gamemachine/MultiplayerSpaceEfficiency

snacktime, Apr 18, 2018

#182

Deleted User and nxrighthere like this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Ah, that's nice, Chris. Personally, I've never used Protobuf for reasons. I just in love with MessagePack.

By the way, I would add an integer encoding to these techniques. In some cases, this thing really helps.

Last edited: Apr 19, 2018

nxrighthere, Apr 18, 2018

#183

unlikelysurvival likes this.
TwoTen

Joined:

May 25, 2016

Posts:

1,168

The MLAPI has some sweet BitWriters & BitReaders that write everything as VarInts with ZigZag encoding similar to Protobuf. Also writes bools as bits not bytes etc.

https://github.com/TwoTenPvP/MLAPI/wiki/BitWriter-&-BitReader Here you can read about it

And here is source

https://github.com/TwoTenPvP/MLAPI/...tworkingManagerComponents/Binary/BitWriter.cs

https://github.com/TwoTenPvP/MLAPI/...tworkingManagerComponents/Binary/BitReader.cs

TwoTen, Apr 18, 2018

#184

nxrighthere likes this.
snacktime

Joined:

Apr 15, 2013

Posts:

3,356

nxrighthere said: ↑

Ah, that's nice, Chris. Personally, I've never used Protobuf for reasons, I just in love with MessagePack.

By the way, I would add an integer encoding to these techniques. In some cases, this thing really helps.
Click to expand...

That's what varint encoding does. Hence why protobuf. You get it all in one package.

snacktime, Apr 18, 2018

#185
TwoTen

Joined:

May 25, 2016

Posts:

1,168

snacktime said: ↑

That's what varint encoding does. Hence why protobuf. You get it all in one package.
Click to expand...

Well protobuf isn't that nice on your performance, especially not for realtime games. Mainly due to heap allocation. The BitWriter we have essentially has a list pool where you can stack objects. Thus it doesn't expand. And you don't have to allocate when writing. You can write to a pre allocated buffer. So internally, the MLAPI uses this and it results in almost no allocations when writing the headers.

Flatbuffers is also very intresting tho, allows random read access etc.

TwoTen, Apr 18, 2018

#186
snacktime

Joined:

Apr 15, 2013

Posts:

3,356

varint encoding variants have a lot of research being done on them, like this here:

https://lemire.me/blog/2017/09/27/stream-vbyte-breaking-new-speed-records-for-integer-compression/

id compression is a big deal in stuff like search engines.

FYI heap allocation is not really a protocol buffer issue per say. I have no per message allocation in my setup using protobuf-net combined with DotNetty. Combination of using ArrayPool and ByteBuffers.

Flatbuffers is good on memory bad on space. Not really suited for realtime games.

snacktime, Apr 18, 2018

#187

TwoTen likes this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

@snacktime Oh, now I see.

Great stuff guys!

The thing that I don't like in those serialization libraries is a schema pre-compilation. This is one of the reasons why I prefer MessagePack where a class/struct itself is the schema.

Yea, the buffer pooling is used everywhere typically. I've backported the System.Buffers from .NET Core to Unity with some changes to keep it thread-safe with .NET 3.5.

Last edited: Apr 20, 2018

nxrighthere, Apr 18, 2018

#188
Deleted User

Guest

snacktime said: ↑

Sorry it took so long to get this out. A bit rough I'll probably polish it up some over the next week or so.

FYI this is basically the collection of techniques for optimizing data for realtime games that I've worked out over the years.

https://github.com/gamemachine/MultiplayerSpaceEfficiency
Click to expand...

Hi there. Don't you know how to specify ZigZag encoding in .proto file, instead of runtime serialization?

Deleted User, Apr 18, 2018

#189
snacktime

Joined:

Apr 15, 2013

Posts:

3,356

.proto files aren't generally used with protobuf-net, in preference of the more idiomatic approach with attributes

snacktime, Apr 18, 2018

#190
snacktime

Joined:

Apr 15, 2013

Posts:

3,356

Added a section on the basic approach to zero GC serialization/deserialization. Uses protobuf-net as the example but should work for any library that provides a Merge functionality for deserialization. Also uses System.Buffers, although creating your own byte[] pool isn't hard.

snacktime, Apr 20, 2018

#191

nxrighthere likes this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Well, since you shared this, people will need the buffers library itself. So, [link removed] backported version for Unity.

By the way, I would like to know how you handle scope/area of interest.

Last edited: Oct 10, 2018

nxrighthere, Apr 20, 2018

#192

snacktime likes this.
snacktime

Joined:

Apr 15, 2013

Posts:

3,356

nxrighthere said: ↑

Well, since you shared this, people will need the buffers library itself. So here's it.

By the way, I would like to read how you handle scoping/area of interests.
Click to expand...

As in tracking stuff in range of a point?

snacktime, Apr 20, 2018

#193
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Yep. I know that you are using a concurrent fixed array for this, so I think it would be nice if you will add more information about it.

nxrighthere, Apr 20, 2018

#194
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Great news guys. Valve added a flat interface, so one more attempt to integrate it with the application.

nxrighthere, Apr 20, 2018

#195

Deleted User likes this.
snacktime

Joined:

Apr 15, 2013

Posts:

3,356

So I actually use a couple of different approaches depending on the context.

Originally I started out using spatial hashing.

Spatial hashing is popular but it has a few downsides.

- It doesn't give you exact precision on distance.
- You need to create a separate hash with different cell sizes for each distance range you want to query with.

The good thing about it is it scales well. Doesn't matter whether you have 200 or 200,000 entities the performance is the same. The base cost for updating and querying is higher though. Query results have to be read from cells and written to an array. A non alloc api for it is easy enough, but it is considerably slower then just iterating over a single array. It's how it scales is where it shines.

Just a note on spatial hashing vs quad trees. Quad trees give more precision. But most require regenerating the entire tree when you update anything. Generally these work best for static data, where you are not adding/removing entitie and the entities don't move.

The thing is I think the norm is that you care about the precision, and are working with a relatively small number of entities, few hundred at most. And linear iteration plus Vector2 distance checking is really quite cheap in that case.

The concurrent array thing was to find something that worked well for the linear search pattern. .Net concurrent structures that were appropriate like ConcurrentDictionary, allocate on iterating the values because everything is in buckets. There is no single backing array it has to allocate a new list on every call to Values.

So the concurrent array has a single backing array. A concurrent queue and concurrent dictionary to manage entity id's and map those to backing array indexes. It uses an optimistic lock when writing to the backing array. You can access entities by id as well as iterating over the backing array directly.

So it it's guaranteed to write a complete entity safely or not at all to the backing array. But it's not guaranteed that the write itself won't fail. This is done via Interlocked.CompareExchange and we just ignore the result. But we don't really care about that because the only cases where you might have two threads writing the same entity are for stuff like when you remove the entity.

Last edited: Apr 20, 2018

snacktime, Apr 20, 2018

#196

nxrighthere likes this.
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

Yea, hashing is what I'm currently using, and I'm still looking for a better approach. Thank you.

Last edited: Apr 20, 2018

nxrighthere, Apr 20, 2018

#197
buFFalo94

Joined:

Sep 14, 2015

Posts:

273

@nxrighthere sorry I don't want to annoy you but @arcturgray suggest here

- Make changes to ENetCS library that nxrighthere showed in #134. Build library for Any CPU.
Click to expand...

So i'm a bit lost what changes is referring to?

buFFalo94, Apr 23, 2018

#198
nxrighthere

Joined:

Mar 2, 2014

Posts:

567

This and this one. By the way, the original wrapper is not ideal and requires a lot of changes. I would like to share my private repository, but it's no longer compatible with original ENet, unfortunately.

nxrighthere, Apr 23, 2018

#199
buFFalo94

Joined:

Sep 14, 2015

Posts:

273

Thanks. I'll try to make it work

buFFalo94, Apr 23, 2018

#200

Page 4 of 7

Thread Status:: Not open for further replies.