[BEAM-8934] Store&Read offset with KafkaIO - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Open
Priority: P3
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: io-java-kafka
Labels:
- KafkaIO
- apache
- beam
- offset

Flags:

Important

Description

When creating a Pipeline through a KafkaIO object, I want to be able to specify the starting offset of consumption, and when traversing the message later, I can get the offset of the current message for storage in a relational database / NoSQL.

This feature is used to implement the exactly-once semantics of spark streaming consumption.

In the "Your own data store" section of the following url content, you can find how to achieve exactly-once semantics with spark streaming:
http://spark.apache.org/docs/latest/streaming-kafka-0-10-integration.html

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: jiefeng zheng

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 10/Dec/19 02:14

Updated:: 04/Jun/22 14:47