Error
unknown file
C++ exception with description "The server socket has failed to listen on any local network address. The server socket has failed to bind to [::]:29500 (errno: 98 - Address already in use). The server could not be initialized on any address for port=29500, family=10 The server socket has failed to bind to 0.0.0.0:29500 (errno: 98 - Address already in use). The server could not be initialized on any address for port=29500, family=2
Exception raised from run at /var/lib/jenkins/workspace/torch/csrc/distributed/c10d/socket.cpp:558 (most recent call first):
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string
frame #1:
frame #2:
frame #3:
frame #4:
frame #5: c10d::TCPStore::TCPStore(std::__cxx11::basic_string
frame #6: testMultiTenantStores(bool) + 0xd3 (0x5618300c69c3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/TCPStoreTest)
frame #7: void testing::internal::HandleExceptionsInMethodIfSupported
frame #8:
frame #9: testing::TestInfo::Run() + 0x40a (0x5618300f9faa in /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/TCPStoreTest)
frame #10:
frame #11: testing::internal::UnitTestImpl::RunAllTests() + 0xf28 (0x5618300ff4d8 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/TCPStoreTest)
frame #12: testing::UnitTest::Run() + 0x93 (0x5618300ffca3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/TCPStoreTest)
frame #13: main + 0x44 (0x5618300c4874 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/TCPStoreTest)
frame #14: __libc_start_main + 0xf3 (0x7fc96c604083 in /lib/x86_64-linux-gnu/libc.so.6)
frame #15: _start + 0x2e (0x5618300c4cde in /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/TCPStoreTest)
" thrown in the test body.
unknown file:0: C++ failure
Solution
Please check that the job name looks correct.