You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
when configuring the engine_threads to any value > 1 I get an error when connecting from a client to a server.
For the configuration with engine_threads=2, I get the error occasionally e.g. not for every port combination. The configuration engine_threads=16 fails more often e.g. on every port combination I tried.
The error message on the client is
ubuntu@ip-172-31-32-21:~$ ${MSG_GEN} --local_ip 172.31.32.121 --remote_ip 172.31.32.120 --msg_size 64 --msg_window 32
I20241103 18:50:38.682282 1 main.cc:332] Starting in client mode, request size 64
Checking for file descriptor...
Got a file descriptor!
ERROR: Failed to dequeue response from control queue.
F20241103 18:50:49.975369 1 main.cc:346] Check failed: ret == 0 Failed to connect to remote host. machnet_connect() error: Unknown error -1
*** Check failure stack trace: ***
@ 0x7fa3d8ce3f03 google::LogMessage::Fail()
@ 0x7fa3d8ce793c google::LogMessage::SendToLog()
@ 0x7fa3d8ce39e7 google::LogMessage::Flush()
@ 0x7fa3d8ce509f google::LogMessageFatal::~LogMessageFatal()
@ 0x562d0c932a28 main
@ 0x7fa3d8866d90 (unknown)
I have a server running on another EC2 instance with this command
As @sarsanaee pointed out we have this experimental branch to achieve connectivity when using multiple engines. We have not yet tried that in AWS; if you could give it a spin and let us know that would be helpful!
Hello everyone,
when configuring the engine_threads to any value > 1 I get an error when connecting from a client to a server.
For the configuration with engine_threads=2, I get the error occasionally e.g. not for every port combination. The configuration engine_threads=16 fails more often e.g. on every port combination I tried.
The error message on the client is
I have a server running on another EC2 instance with this command
On the other hand, if I use engine_threads=1, the execution succeeds
MSG_GEN="docker run -v /var/run/machnet:/var/run/machnet ghcr.io/microsoft/machnet/machnet:latest release_build/src/apps/msg_gen/msg_gen"
Setup: Two EC2 instances of type c5n.18xlarge running Kernel 6.5.0-1014-aws on Ubuntu 23.10.
The text was updated successfully, but these errors were encountered: