[YARN-3021] YARN's delegation-token handling disallows certain trust setups to operate properly over DistCp - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.3.0
Fix Version/s: 2.8.0, 3.0.0-alpha1
Component/s: security
Labels:
None

Target Version/s:

2.8.0
Hadoop Flags:

Reviewed
Release Note:

Hide
ResourceManager renews delegation tokens for applications. This behavior has been changed to renew tokens only if the token's renewer is a non-empty string. MapReduce jobs can instruct ResourceManager to skip renewal of tokens obtained from certain hosts by specifying the hosts with configuration mapreduce.job.hdfs-servers.token-renewal.exclude=<host1>,<host2>,..,<hostN>.

Show
ResourceManager renews delegation tokens for applications. This behavior has been changed to renew tokens only if the token's renewer is a non-empty string. MapReduce jobs can instruct ResourceManager to skip renewal of tokens obtained from certain hosts by specifying the hosts with configuration mapreduce.job.hdfs-servers.token-renewal.exclude=<host1>,<host2>,..,<hostN>.

Description

Consider this scenario of 3 realms: A, B and COMMON, where A trusts COMMON, and B trusts COMMON (one way trusts both), and both A and B run HDFS + YARN clusters.

Now if one logs in with a COMMON credential, and runs a job on A's YARN that needs to access B's HDFS (such as a DistCp), the operation fails in the RM, as it attempts a renewDelegationToken(…) synchronously during application submission (to validate the managed token before it adds it to a scheduler for automatic renewal). The call obviously fails cause B realm will not trust A's credentials (here, the RM's principal is the renewer).

In the 1.x JobTracker the same call is present, but it is done asynchronously and once the renewal attempt failed we simply ceased to schedule any further attempts of renewals, rather than fail the job immediately.

We should change the logic such that we attempt the renewal but go easy on the failure and skip the scheduling alone, rather than bubble back an error to the client, failing the app submission. This way the old behaviour is retained.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

YARN-3021.007.patch
16/Apr/15 16:36
11 kB
Yongjun Zhang
YARN-3021.007.patch
07/Apr/15 14:30
11 kB
Yongjun Zhang
YARN-3021.007.patch
07/Apr/15 05:39
11 kB
Yongjun Zhang
YARN-3021.006.patch
24/Mar/15 01:39
11 kB
Yongjun Zhang
YARN-3021.005.patch
20/Mar/15 06:07
13 kB
Yongjun Zhang
YARN-3021.004.patch
19/Mar/15 16:32
9 kB
Yongjun Zhang
YARN-3021.003.patch
06/Feb/15 00:51
20 kB
Yongjun Zhang
YARN-3021.002.patch
01/Feb/15 06:34
17 kB
Yongjun Zhang
YARN-3021.001.patch
23/Jan/15 15:54
13 kB
Yongjun Zhang
YARN-3021.patch
09/Jan/15 17:28
4 kB
Harsh J

Issue Links

is related to

HDFS-9525 hadoop utilities need to support provided delegation tokens

Resolved

SPARK-34295 Allow option similar to mapreduce.job.hdfs-servers.token-renewal.exclude

Resolved

relates to

YARN-2836 RM behaviour on token renewal failures is broken

Open

Activity

People

Assignee:: Yongjun Zhang

Reporter:: Harsh J

Votes:: 0 Vote for this issue

Watchers:: 18 Start watching this issue

Dates

Created:: 08/Jan/15 22:02

Updated:: 01/Feb/21 22:09

Resolved:: 17/Apr/15 02:46