Details
Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 3.1.0
Fix Version/s: None
Component/s: None
Description
I encountered a YARN bug: running the command "yarn application -status ats-hbase" leaks connections between the ResourceManager and the DataNode. The ResourceManager does not close its connections to the DataNode, so many TCP connections to the DataNode accumulate in the CLOSE_WAIT state on the ResourceManager node.
The relevant logs and screenshots are as follows, including a tcpdump capture of port 1019.
This is the ResourceManager log.
This is the ResourceManager process.
This is the tcpdump packet capture between the ResourceManager and DataNode port 1019.
These are the TCP connections between the ResourceManager and the DataNode. After the ResourceManager has been running for a while, many connections are left in the CLOSE_WAIT state.
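To make the leak easier to quantify, here is a minimal sketch that counts CLOSE_WAIT connections to DataNode port 1019 from the text output of `ss -tan`. The function name `count_close_wait` and the sample addresses are hypothetical and used only for illustration; the sketch assumes the usual `ss` column order (State, Recv-Q, Send-Q, Local Address:Port, Peer Address:Port).

```python
from collections import Counter

# Hypothetical sample in the column layout printed by `ss -tan`.
# The addresses are made up for illustration only.
SAMPLE = """State Recv-Q Send-Q Local Address:Port Peer Address:Port
CLOSE-WAIT 1 0 10.0.0.1:40001 10.0.0.2:1019
CLOSE-WAIT 1 0 10.0.0.1:40002 10.0.0.2:1019
ESTAB 0 0 10.0.0.1:40003 10.0.0.3:1019
CLOSE-WAIT 1 0 10.0.0.1:40004 10.0.0.3:8020
"""


def count_close_wait(ss_output: str, peer_port: int = 1019) -> Counter:
    """Count CLOSE-WAIT sockets per peer host for the given peer port."""
    counts = Counter()
    for line in ss_output.splitlines():
        fields = line.split()
        # Skip the header, blank lines, and anything without a peer column.
        if len(fields) < 5 or fields[0] != "CLOSE-WAIT":
            continue
        # Split "host:port" on the last colon so the host part stays intact.
        host, _, port = fields[4].rpartition(":")
        if port == str(peer_port):
            counts[host] += 1
    return counts


if __name__ == "__main__":
    for host, n in count_close_wait(SAMPLE).items():
        print(f"{host}: {n} CLOSE-WAIT connection(s) to port 1019")
```

On the ResourceManager node, the same function could be fed the real output of `ss -tan` before and after running the status command, to check whether the count grows with each invocation.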
My service versions are:
Ambari: 3.1.1.3.1.0.0-78
HDFS: 3.1.1.3.1
YARN: 3.1.0