[HBASE-20844] Duplicate rows returned while hbase snapshot reads - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Duplicate
Affects Version/s: 1.3.1
Fix Version/s: None
Component/s: mapreduce, snapshots, spark
Labels:
None
Environment:

Cluster Details

Java 1.7
Hbase 1.3.1
Spark 1.6.1

Description

We are trying to take snapshot from code and read data using MR and spark, both approaches are returning duplicate records.

On the API side, {{org.apache.hadoop.hbase.mapreduce.TableSnapshotInputFormat }} is used.

Snapshot was taken during the table was in a region split state.

We suspect it is due to data is being returned for both parent and daughter regions.

Attachments

Issue Links

duplicates

HBASE-16011 TableSnapshotScanner and TableSnapshotInputFormat can produce duplicate rows

Resolved

Activity

People

Assignee:: Unassigned

Reporter:: ShivaKumar SS

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 04/Jul/18 08:28

Updated:: 01/Aug/18 06:21

Resolved:: 16/Jul/18 06:00