Details
- Type: Bug
- Status: Resolved
- Priority: Major
- Resolution: Fixed
Description
Repro: run a Scalding job that writes Parquet files to a folder. No _metadata or _common_metadata file is created.
Impact: potential performance problem when Parquet metadata is read on the client side, as is the case for Spark SQL.
Cause: the metadata-writing logic lives in the mapreduce API of Parquet but not in the mapred API.
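
A minimal sketch of the kind of fix this implies: a mapred-API committer that, after the stock commit, reuses the mapreduce-side summary writer. It assumes the static helper parquet.hadoop.ParquetOutputCommitter.writeMetaDataFile(Configuration, Path) from the mapreduce API; the class below mirrors the MapredParquetOutputCommitter named in the linked issue but is illustrative, not the committed implementation.

```java
package parquet.hadoop.mapred;

import java.io.IOException;

import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapred.FileOutputCommitter;
import org.apache.hadoop.mapred.FileOutputFormat;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.JobContext;

import parquet.hadoop.ParquetOutputCommitter;

// Illustrative mapred-API committer: after committing the job, write the
// _metadata/_common_metadata summary files, as the mapreduce-API
// ParquetOutputCommitter already does in its commitJob.
public class MapredParquetOutputCommitter extends FileOutputCommitter {

  @Override
  public void commitJob(JobContext jobContext) throws IOException {
    // Move committed task outputs into the final output folder first.
    super.commitJob(jobContext);
    JobConf conf = jobContext.getJobConf();
    Path outputPath = FileOutputFormat.getOutputPath(conf);
    if (outputPath != null) {
      // Reuse the mapreduce-side helper, which reads the per-file footers
      // and writes the summary metadata files next to the data files.
      ParquetOutputCommitter.writeMetaDataFile(conf, outputPath);
    }
  }
}
```

With such a committer set as the output committer of the mapred job (e.g. via the JobConf used by the Scalding sink), client-side readers like Spark SQL can pick up the schema from the summary files instead of opening every part file.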
Issue Links
- is a clone of PARQUET-206: MapredParquetOutputCommitter does not work in hadoop2 (Resolved)