Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
Impala 3.2.0
-
None
-
ghx-label-4
Description
When running 64-streams concurrent TPC-DS in debug builds at commit 681b256627e5e569778d4996bbdf530e30c528eb in a 140 node cluster, hit the following DCHECK after close to 14 hours of running.
F0228 00:42:35.132514 139036 system-state-info.cc:102] Check failed: total_tics > 0 (-4294962910 vs. 0)
void SystemStateInfo::ComputeCpuRatios() { const CpuValues& cur = cpu_values_[cur_val_idx_]; const CpuValues& old = cpu_values_[1 - cur_val_idx_]; // Sum up all counters int64_t cur_sum = accumulate(cur.begin(), cur.end(), 0); int64_t old_sum = accumulate(old.begin(), old.end(), 0); int64_t total_tics = cur_sum - old_sum; // If less than 1/USER_HZ time has time has passed for any of the counters, the ratio is // zero (to avoid dividing by zero). if (total_tics == 0) { memset(&cpu_ratios_, 0, sizeof(cpu_ratios_)); return; } DCHECK_GT(total_tics, 0); <<----