Details
-
Story
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
Mesos Q3 Sprint 5, Mesos Q3 Sprint 6, Twitter Q4 Sprint 1, Twitter Mesos Q4 Sprint 2
-
5
Description
There is some known kernel issue when we freeze the whole cgroup upon OOM. Mesos probably can just use PID namespace so that we will only need to kill the "init" of the pid namespace, instead of freezing all the processes and killing them one by one. But I am not quite sure if this would break the existing code.
Attachments
Issue Links
- relates to
-
MESOS-1689 Race with kernel to kill process / destroy cgroup after OOM
- Resolved
-
MESOS-1758 Freezer failure leads to lost task during container destruction.
- Resolved