As DevOps/Ops, you maintain DB instances or RAM intensive services. You see OOM issues occasionally, do you? Yes, the scaring Out-Of-Memory issues.
Nobody enjoy OOM issues. When it does happen, what to check? More importantly, how to monitor OOM issues? And get alerted, when it’s about to happen.
Here are some of my thoughts. Take a look and discuss with me!