Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Not A Bug
-
Impala 2.3.0
-
None
Description
In DataStreamSender::Channel::CloseInternal(), if the EOS RPC fails, the receiver side of the RPC will remain open indefinitely causing the cluster to hang. If the sending of the last row-batch fails during CloseInternal(), then the query can succeed but return incorrect results. Only an error would be logged via LogError() in this case.
This is an incremental task after IMPALA-2592 which was a temporary fix to reduce the window of vulnerability. However, this needs a more well thought of permanent fix.