Description
Intermittent failure of LocatedFileStatusFetcher.getFileStatuses(), which is using globStatus to find files.
I'd say "turn s3guard on" except this appears to be the case, and the dataset being read is
over 1h old.
Which means it is harder than I'd like to blame S3 for what would sound like an inconsistency
We're hampered by the number of debug level statements in the globber code being approximately none; there's no debugging to turn on. All we know is that globFiles returns null without any explanation.
Attachments
Issue Links
- breaks
-
HIVE-24484 Upgrade Hadoop to 3.3.1 And Tez to 0.10.2
- Closed
- causes
-
HADOOP-16626 S3A ITestRestrictedReadAccess fails
- Resolved
-
HADOOP-17105 S3AFS globStatus attempts to resolve symlinks
- Resolved
- contains
-
HADOOP-13373 Add S3A implementation of FSMainOperationsBaseTest
- Resolved
- is depended upon by
-
HADOOP-16465 Tune S3AFileSystem.listLocatedStatus
- Resolved
- relates to
-
HADOOP-15748 S3 listing inconsistency can raise NPE in globber
- Resolved
- supercedes
-
HADOOP-13371 S3A globber to use bulk listObject call over recursive directory scan
- Resolved
- links to