Description
Currently, when running a MapReduce job against data in a Cassandra data store, it reads through all the data for a particular ColumnFamily. This could be optimized to only read through those rows that have to do with the query.
Attachments
Issue Links
- blocks
-
CASSANDRA-2246 Enable Pig to use indexed data as described in CASSANDRA-2245
- Resolved
- depends upon
-
CASSANDRA-1125 Filter out ColumnFamily rows that aren't part of the query (using a KeyRange)
- Resolved
- is blocked by
-
CASSANDRA-1600 Merge get_indexed_slices with get_range_slices
- Resolved
- is duplicated by
-
CASSANDRA-2245 Enable map reduce to use indexes for ColumnFamilyInputFormat
- Resolved