Apache Tez / TEZ-3156

Tez client keeps trying to talk to RM even if RM does not know about the application


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.7.1, 0.8.3
    • Component/s: None
    • Labels: None

    Description

      Scenario :

      • Set RM/NM recovery to false.
         <property>
           <name>yarn.resourcemanager.recovery.enabled</name>
           <value>false</value>
         </property>
         <property>
           <name>yarn.nodemanager.recovery.enabled</name>
           <value>false</value>
         </property>
        
      • Start the mrrsleep application (application_1456883132071_0001)
        hadoop jar tez-tests-*.jar mrrsleep -m 1 -r 1 -mt 1000000 -rt 1000
        
      • While the application is running, restart the RM

      Because recovery is disabled, the restarted RM no longer knows about the mrrsleep application. From this point on, the application's Tez client keeps trying to communicate with the RM and loads it with repeated getApplicationReport calls, each of which fails with the exception below.

      2016-03-02 02:01:24,708 INFO  ipc.Server (Server.java:run(2172)) - IPC Server handler 18 on 8050, call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplicationReport from xx.xx.xx.xxx:36191 Call#500250 Retry#0
      org.apache.hadoop.yarn.exceptions.ApplicationNotFoundException: Application with id 'application_1456883132071_0001' doesn't exist in RM.
      	at org.apache.hadoop.yarn.server.resourcemanager.ClientRMService.getApplicationReport(ClientRMService.java:328)
      	at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationClientProtocolPBServiceImpl.getApplicationReport(ApplicationClientProtocolPBServiceImpl.java:175)
      	at org.apache.hadoop.yarn.proto.ApplicationClientProtocol$ApplicationClientProtocolService$2.callBlockingMethod(ApplicationClientProtocol.java:417)
      	at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616)
      	at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969)
      	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2151)
      	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2147)
      	at java.security.AccessController.doPrivileged(Native Method)
      	at javax.security.auth.Subject.doAs(Subject.java:415)
      	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
      	at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2145)
      2016-03-02 02:01:24,709 INFO  ipc.Server (Server.java:run(2172)) - IPC Server handler 27 on 8050, call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplicationReport from xx.xx.xx.xxx:36191 Call#500251 Retry#0
      org.apache.hadoop.yarn.exceptions.ApplicationNotFoundException: Application with id 'application_1456883132071_0001' doesn't exist in RM.
      	at org.apache.hadoop.yarn.server.resourcemanager.ClientRMService.getApplicationReport(ClientRMService.java:328)
      	at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationClientProtocolPBServiceImpl.getApplicationReport(ApplicationClientProtocolPBServiceImpl.java:175)
      	at org.apache.hadoop.yarn.proto.ApplicationClientProtocol$ApplicationClientProtocolService$2.callBlockingMethod(ApplicationClientProtocol.java:417)
      	at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616)
      	at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969)
      	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2151)
      	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2147)
      	at java.security.AccessController.doPrivileged(Native Method)
      	at javax.security.auth.Subject.doAs(Subject.java:415)
      	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
      	at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2145)
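
      The behaviour one would expect from the client can be sketched as follows. This is a minimal, self-contained illustration and not the TEZ-3156 patch: `StatusSource`, `AppNotFoundException`, `waitForCompletion`, and the `UNKNOWN_TO_RM` result are hypothetical stand-ins for the real YARN/Tez client types, chosen so the example runs without Hadoop on the classpath. The point is only that a "not found" answer from the RM is treated as terminal rather than retried.

```java
public class PollUntilUnknown {

    /** Hypothetical stand-in for YARN's ApplicationNotFoundException. */
    static class AppNotFoundException extends Exception {
        AppNotFoundException(String message) {
            super(message);
        }
    }

    /** Hypothetical stand-in for the client's status call to the RM. */
    interface StatusSource {
        String getStatus(String appId) throws AppNotFoundException;
    }

    /**
     * Polls the RM until the application reaches a terminal state.
     * If the RM reports that it does not know the application, polling
     * stops immediately instead of retrying.
     */
    static String waitForCompletion(StatusSource rm, String appId,
                                    int maxPolls, long sleepMillis) {
        for (int i = 0; i < maxPolls; i++) {
            String status;
            try {
                status = rm.getStatus(appId);
            } catch (AppNotFoundException e) {
                // The RM has no record of the application: give up.
                return "UNKNOWN_TO_RM";
            }
            if (status.equals("SUCCEEDED") || status.equals("FAILED")
                    || status.equals("KILLED")) {
                return status;
            }
            try {
                Thread.sleep(sleepMillis);
            } catch (InterruptedException e) {
                // Preserve the interrupt flag and stop polling.
                Thread.currentThread().interrupt();
                return "INTERRUPTED";
            }
        }
        return "TIMED_OUT";
    }

    public static void main(String[] args) {
        // Simulated RM: answers RUNNING twice, then "forgets" the application,
        // as happens after a restart with recovery disabled.
        int[] calls = {0};
        StatusSource rm = appId -> {
            calls[0]++;
            if (calls[0] > 2) {
                throw new AppNotFoundException(
                    "Application with id '" + appId + "' doesn't exist in RM.");
            }
            return "RUNNING";
        };
        String result = waitForCompletion(
            rm, "application_1456883132071_0001", 100, 10L);
        System.out.println(result + " after " + calls[0] + " calls");
    }
}
```

      In this sketch the loop ends on the first "not found" response, so the RM sees one failed call rather than an unbounded stream of them.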
      

      Attachments

        1. TEZ-3156.2.patch
          14 kB
          Hitesh Shah
        2. TEZ-3156.1.patch
          9 kB
          Hitesh Shah

            People

              Assignee: Hitesh Shah (hitesh)
              Reporter: Yesha Vora (yeshavora)
              Votes: 0
              Watchers: 4
