Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
2.8.0
-
None
-
Reviewed
Description
We're cleaning up Hive and Spark's use of FileSystem.exists, because it is often the case we see code of exists+open, exists+delete, when the exists probe is needless. Against object stores, expensive needless.
Hadoop can set an example here by stripping them out. It will also show where there are opportunities to optimise things better and/or improve reporting.
Attachments
Attachments
Issue Links
- breaks
-
HADOOP-17255 JavaKeyStoreProvider fails to create a new key if the keystore is HDFS
- Reopened
- is depended upon by
-
HADOOP-13525 Optimize uses of FS operations in the ASF analysis frameworks and libraries
- Resolved
-
HADOOP-13321 Deprecate FileSystem APIs that promote inefficient call patterns.
- Resolved
- relates to
-
TEZ-3540 Eliminate inefficient use of FileSystem API
- Patch Available
-
SPARK-16736 remove redundant FileSystem status checks calls from Spark codebase
- Resolved
-
HIVE-14323 Reduce number of FS permissions and redundant FS operations
- Closed