[FLUME-708] Flume nodes drop heartbeats when reconfiguring - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Not A Problem
Affects Version/s: 0.9.5
Fix Version/s: None
Component/s: Node, Technical Debt
Labels:
None

Description

When flume receives a reconfiguration command from the master it performs the changes in the heartbeat thread. This causes the node to drop heartbeats for as long as it takes to either complete the task or for flume to forcibly kill the existing driver thread. This isn't a show stopper because there is a timeout associated with shutting down the existing driver, but it's easy to see this as a place where errors can occur.

I believe this is indicative of a larger issue in the way the node handles heartbeats and (re)configuration and we should revisit this communication as part of the master re-arch which implicitly involves the heartbeat and communication systems.

Attachments

Activity

People

Assignee:: Eric Sammer

Reporter:: Eric Sammer

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 19/Jul/11 23:49

Updated:: 20/Jul/11 01:07

Resolved:: 20/Jul/11 01:07